Power and sample size evaluation for the Cochran-Mantel-Haenszel mean score (Wilcoxon rank sum) test and the Cochran-Armitage test for trend.


The power of a chi-square test, and thus the required sample size, is a function of the noncentrality parameter, which can be obtained as the limiting expectation of the test statistic under a specified alternative hypothesis. Herein, we apply this principle to derive simple expressions for two tests that are commonly applied to discrete ordinal data. The Wilcoxon rank sum test for the equality of distributions in two groups is algebraically equivalent to the Mann-Whitney test. The Kruskal-Wallis test applies to multiple groups. These tests are equivalent to a Cochran-Mantel-Haenszel mean score test using rank scores for a set of C discrete categories. Although various authors have assessed the power function of the Wilcoxon and Mann-Whitney tests, herein it is shown that the power of these tests with discrete observations, that is, with tied ranks, is readily provided by the power function of the corresponding Cochran-Mantel-Haenszel mean score test for two and R > 2 groups. These expressions yield results virtually identical to those derived previously for rank scores and also apply to other score functions. The Cochran-Armitage test for trend assesses whether there is a monotonically increasing or decreasing trend in the proportions with a positive outcome or response over the C ordered categories of an ordinal independent variable, for example, dose. Herein, it is shown that the power of the test is a function of the slope of the response probabilities over the ordinal scores assigned to the groups, which yields simple expressions for the power of the test. Copyright © 2011 John Wiley & Sons, Ltd.
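The link between the noncentrality parameter and power can be illustrated with a minimal sketch. It does not reproduce the paper's expressions, which the abstract does not give. It only shows the generic step of turning a noncentrality parameter into power for a 1-df chi-square test, the case that covers a two-group mean score test and a trend test. The function names `power_1df` and `sample_size_1df`, and the input `ncp_per_subject`, are hypothetical, and the sketch assumes the noncentrality parameter grows linearly with the total sample size N. Tests with R > 2 groups have R - 1 degrees of freedom and would need a general noncentral chi-square distribution instead.

```python
from math import sqrt
from statistics import NormalDist

_STD_NORMAL = NormalDist()  # standard normal, mean 0 and sd 1


def power_1df(ncp: float, alpha: float = 0.05) -> float:
    """Power of a 1-df chi-square test with noncentrality parameter ncp.

    A noncentral chi-square variable with 1 df can be written as
    (Z + sqrt(ncp))**2 with Z standard normal, so the rejection
    probability is the sum of two normal tail areas.
    """
    if ncp < 0:
        raise ValueError("ncp must be non-negative")
    z_crit = _STD_NORMAL.inv_cdf(1 - alpha / 2)
    shift = sqrt(ncp)
    upper_tail = 1 - _STD_NORMAL.cdf(z_crit - shift)
    lower_tail = _STD_NORMAL.cdf(-z_crit - shift)
    return upper_tail + lower_tail


def sample_size_1df(ncp_per_subject: float,
                    target_power: float = 0.90,
                    alpha: float = 0.05) -> int:
    """Smallest N whose power reaches target_power.

    Assumption (not taken from the source): ncp = N * ncp_per_subject.
    """
    if ncp_per_subject <= 0:
        raise ValueError("ncp_per_subject must be positive")
    if not alpha < target_power < 1:
        raise ValueError("target_power must lie between alpha and 1")

    # Double N until the target is met, then bisect down to the smallest N.
    hi = 1
    while power_1df(hi * ncp_per_subject, alpha) < target_power:
        hi *= 2
    lo = hi // 2  # power at lo is below the target (or lo is 0)
    while hi - lo > 1:
        mid = (lo + hi) // 2
        if power_1df(mid * ncp_per_subject, alpha) >= target_power:
            hi = mid
        else:
            lo = mid
    return hi


if __name__ == "__main__":
    # Illustrative value only; a real ncp_per_subject would come from the
    # alternative hypothesis (group probabilities, scores, allocation).
    n = sample_size_1df(ncp_per_subject=0.1, target_power=0.90)
    print(n, round(power_1df(n * 0.1), 4))
```

With a zero noncentrality parameter the function returns the type I error rate, which is a quick sanity check on any implementation of this kind.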


The Biostatistics Center, Departments of Epidemiology and Biostatistics and Statistics, The George Washington University, 6110 Executive Boulevard, Suite 750, Rockville, MD, 20852, USA.

Journal Details

This article was published in the following journal.

Name: Statistics in Medicine
ISSN: 1097-0258
Pages: 3057-66



PubMed Articles [18642 Associated PubMed Articles listed on BioPortfolio]

An Evaluation of Increasing Sample Size Based on Conditional Power.

We evaluate properties of sample size re-estimation (SSR) designs similar to the promising zone design considered by Mehta and Pocock (2011). We evaluate these designs under the assumption of a true e...

Optimal adaptive group sequential design with flexible timing of sample size determination.

Flexible sample size designs, including group sequential and sample size re-estimation designs, have been used as alternatives to fixed sample size designs to achieve more robust statistical power and...

Power comparison of Cochran-Armitage trend test against allelic and genotypic tests in large-scale case-control genetic association studies.

The Cochran-Armitage trend test (CA) has become a standard procedure for association testing in large-scale genome-wide association studies (GWAS). However, when the disease model is unknown, there is...

Adjustment for unbalanced sample size for analytical biosimilar equivalence assessment.

Large sample size imbalance is not uncommon in biosimilar development. At the beginning of a product development, sample sizes of a biosimilar and a reference product may be limited. Thus a sample...

Power Calculations to Select Instruments for Clinical Trial Secondary Endpoints: A Case Study of Instrument Selection for Post-Traumatic Stress Symptoms in Subjects with ARDS.

After the sample size of a randomized control trial (RCT) is set by the power requirement of its primary endpoint, investigators select secondary endpoints while unable to further adjust sample size. ...

Clinical Trials [3806 Associated Clinical Trials listed on BioPortfolio]

Sample Size Definition in Cochrane Hepato-Biliary Trials

Sample size definition provides important information, allowing the groundwork for transparent reporting. The sample predefinition allows the trial to be large enough to be able to addres...

UltraShape Power for Abdominal Fat and Circumference Reduction

Prospective, one arm, baseline-controlled, clinical study for the evaluation of the UltraShape Power treatment using the U-Sculpt Power Transducer for non-invasive abdominal fat and circum...

Impact of Clinical Pharmacy Service on Patient Care and Cost Saving

Background: Pharmacists have been proven to improve patient outcomes, medication adherence, glycemic control, reduce blood pressure, low-density lipoprotein, health care costs and length o...

Erlotinib Versus Gemcitabine/Carboplatin in Chemo-naive Stage IIIB/IV Non-Small Cell Lung Cancer Patients With Epidermal Growth Factor Receptor (EGFR) Exon 19 or 21 Mutation

Epidermal growth factor receptor tyrosine kinase inhibitors (EGFR TKIs) such as erlotinib have proved effective in second or third line therapy for advanced non-small cell lung cancer. It ...

Impact of Enhanced External Counterpulsation (EECP) on VO2 MAX

The purpose of this study is to assess the effects of 35 EECP sessions on cardiopulmonary training performance in healthy volunteers. Data from this study will be used to generate sample ...

Medical and Biotech [MESH] Definitions

The number of units (persons, animals, patients, specified circumstances, etc.) in a population to be studied. The sample size should be big enough to have a high likelihood of detecting a true difference between two groups. (From Wassertheil-Smoller, Biostatistics and Epidemiology, 1990, p95)

The analysis of a chemical substance by inserting a sample into a carrier stream of reagent using a sample injection valve that propels the sample downstream where mixing occurs in a coiled tube, then passes into a flow-through detector and a recorder or other data handling device.

A type of scanning probe microscopy in which a very sharp conducting needle is swept just a few angstroms above the surface of a sample. The tiny tunneling current that flows between the sample and the needle tip is measured, and from this are produced three-dimensional topographs. Due to the poor electron conductivity of most biological samples, thin metal coatings are deposited on the sample.

The process of discovering or asserting an objective or intrinsic relation between two objects or concepts; a faculty or power that enables a person to make judgments; the process of bringing to light and asserting the implicit meaning of a concept; a critical evaluation of a person or situation.

Studies determining the effectiveness or value of processes, personnel, and equipment, or the material on conducting such studies. For drugs and devices, CLINICAL TRIALS AS TOPIC; DRUG EVALUATION; and DRUG EVALUATION, PRECLINICAL are available.
