Abstract

Background

Issues of reported study quality have not been addressed empirically with large-scale data in the cancer prognostic literature.

Methods

Eight quality measures pertaining to study design and assay methods (i.e., blinding, prospective versus retrospective design, power calculations, outcomes' definitions, time of enrollment, reporting of variables, assay description, and assay reference) were evaluated in cancer prognostic marker studies included in meta-analyses identified in Medline and EMBASE. To be eligible, meta-analyses had to include at least six studies and to examine binary outcomes. We estimated the ratios of relative risks, which compared the overall prognostic effects (summary relative risks) between poor-quality and good-quality studies for each quality item. Between-study heterogeneity was tested with the Q statistic (statistically significant at P <.10). All statistical tests were two-sided.

Results

We identified 20 meta-analyses that included 331 cancer prognostic marker studies published between 1987 and 2005. Only three (0.9%) of the 331 studies presented power calculations, 129 (39.0%) studies stated that analyses were blinded, and 73 (21.5%) stated that they were prospective. Time of enrollment was defined in 232 (70.0%), 234 (70.7%) gave lists of candidate variables, and 254 (76.7%) defined outcomes. The assay used was described in 317 (95.8%), but only 177 (53.5%) provided the assay reference. Estimates of prognostic effects from poor-quality studies varied considerably and could be larger or smaller than summary estimates derived from meta-analyses. Summary ratios of relative risks of poor- versus good-quality studies for the seven quality measures ranged from 0.95 to but 1.26, but none was statistically significantly. There was statistically significant heterogeneity ( P <.10) between the ratios of relative risk estimates across meta-analyses for blinding, defining endpoints, and stating variables and assay references.

Conclusions

Among cancer prognostic marker studies, reporting quality of design and assay information often appears suboptimal, indicating that this literature may be largely unreliable. Given the potential clinical importance of prognostic marker information, improved design and reporting of these studies are warranted.

CONTEXT AND CAVEATS
Prior knowledge

Many prognostic markers have been proposed for various diseases, including cancer, many have not been validated, and most have not changed clinical practice. Suboptimal study design (such as lack of blinding and retrospective design) and missing methodologic information have been associated with exaggerated results for other types of clinical research.

Study design

Meta-analysis of 331 studies on cancer prognostic markers from 20 meta-analyses.

Contribution

The body of literature for prognostic marker studies may be largely unreliable because the quality of the information presented about study design and assays used often appeared suboptimal.

Implications

Because of the potential clinical importance of prognostic marker information, improvements in the design and reporting of prognostic marker studies are warranted.

Limitations

Reported study quality is only a surrogate for true study quality. Individual meta-analyses, from which estimates and data were obtained, may themselves suffer from various biases and used different statistical measures to compare outcomes.

The literature on prognostic factors is rapidly expanding. Numerous prognostic markers have been proposed for diverse clinical outcomes of various diseases ( 1 , 2 ), with cancer being one of the key fields in which prognostic markers are developed. However, most proposed cancer prognostic markers have not changed clinical practice ( 35 ). The initial promising results of many proposed markers have not been validated, even when cutting-edge molecular techniques have been used to develop such markers ( 6 , 7 ). This problem may result from poor study quality and may be compounded by suboptimal reporting of prognostic studies ( 1 , 2 , 811 ). Identification of this problem has led to the development of recommendations for the reporting of prognostic marker studies (i.e., the REMARK criteria) ( 12 ). The goal of these guidelines was to encourage transparent and complete reporting to help readers judge the data and understand the context in which the conclusions apply ( 12 ). REMARK is not a quality appraisal tool per se, but many of the reporting guideline items clearly indicate which study designs and other characteristics are desirable. In particular, REMARK guidelines on reporting of study design and assay methods are directly relevant to the conduct and implementation of such research ( 12 ).

Several groups have tried to determine whether studies with good reporting quality identify weaker effects than studies with poor reporting quality or vice versa. Various empirical investigations have found that for randomized controlled trials, poor quality—in particular, lack of masking and subverted allocation concealment—may be related to exaggerated treatment effects ( 1316 ); that poor quality may be related to either stronger or weaker treatment effects ( 1719 ); and that findings from studies with retrospective nonrandomized designs deviate more from the findings of randomized studies than those from trials with prospective nonrandomized designs ( 20 ). However, the reationship, if any, between reported quality and prognostic effect has not been investigated in a quantitative manner in the prognostic literature.

The purpose of this study was to describe the quality of reporting of tumor prognostic marker studies included in meta-analyses and to examine whether various aspects of the reporting quality of the study were associated with the magnitude of the estimated prognostic effect across investigations that evaluated the same tumor markers. We focused on aspects of the study design and implementation in prognostic studies because retrospective design and lack of blinding have been associated with exaggerated results from studies with other designs ( 1316 ).

Studies and Methods

Search Strategy and Eligibility Criteria for Meta-analyses

We identified meta-analyses of prognostic markers for cancer by searching Medline and EMBASE (last update June 2005). The search algorithm used the terms prognostic marker, prognostic factor, survival, mortality, recurrence, prediction, outcome, AND meta-analysis AND (cancer OR neoplas* OR malign*). We also included a variety of the most commonly used molecular markers and clinical prognostic factors (available from the authors upon request), to increase the sensitivity of the search.

We accepted meta-analyses that were published in the English language and that analyzed studies of nongenetic prognostic markers and cancer. Eligible meta-analyses had to include at least six primary studies ( 17 ) and to address at least one prognostic marker that fit with the definition given by the REMARK consensus statement ( 12 ), i.e., a marker has to have an association with some clinical outcome, typically a time-to-event outcome such as overall survival or recurrence-free survival, that could be assessed for the clinical management of a patient. With fewer studies, it would have been difficult to examine the association between reported quality and prognostic effects. When an article contained more than one eligible meta-analysis on different prognostic markers, each meta-analysis was considered separately. When two or more meta-analyses addressed the same prognostic marker, we used only the meta-analysis with the greater number of primary studies to avoid duplication of the information.

We accepted meta-analyses regardless of the adverse clinical outcome they addressed (death, disease progression, or no treatment response). When meta-analyses of a specific marker included more than one clinical outcome, we used only the outcome that was examined by the largest number of primary studies.

Primary Studies Included in the Meta-analyses

We retrieved the full text of the primary studies included in the identified eligible meta-analyses. Primary studies were used to evaluate quality of various reported study components. To avoid making subjective choices, we captured estimates of the prognostic effect directly from the meta-analyses for data synthesis. We did not try to re-extract the prognostic effect data from the primary studies, unless these data were not readily discernible from the corresponding meta-analysis.

Data Extraction

Two investigators (PAK and DDK) extracted data from eligible studies independently, discussed the data, and reached consensus on all components. In each meta-analysis, we recorded the author name, journal name, year of publication, number of studies included, number of patients analyzed, and the malignancy and prognostic factor addressed. For each primary study included in each meta-analysis, we recorded the point estimate of the relative risk and its 95% confidence interval for the adverse clinical outcome, as provided by the meta-analysis. Relative risk was expressed typically as hazard ratio or incidence rate ratio, but we also accepted odds ratios or risk ratios if the former metrics were not available. When the effect size information was not presented numerically in a study, this information was extracted from the meta-analysis forest plot by digitizing the forest plot images with the Engauge Digitizer version 2.12 software to obtain maximal accuracy. When a published meta-analysis presented only subgroup analyses for the outcome of interest, we included data from all studies across all subgroups.

Reported Quality Measures

For all 331 primary studies that were included in the 20 meta-analyses identified in our searches, we recorded the following information on study design and assay methods: whether the study stated that the evaluation of the prognostic marker was blinded to the patient outcome, whether the study was prospective (i.e., whether markers were measured at the time of patient enrollment and whether patients were then followed prospectively for outcomes), whether the study specified the time period during which patients were enrolled, whether the study precisely defined all clinical outcomes examined in the primary study (regardless of whether they were also examined in the meta-analysis or not), whether the study provided a list of candidate variables (i.e., variables that could potentially influence in any way the prognostic effect of the postulated prognostic marker), whether the study provided a rationale for the sample size (power calculation), and whether the study adequately described and referenced the assay methods for the prognostic marker. These quality components correspond to the sections on study design and assay methods in the REMARK recommendations ( 12 ).

Statistical Methods

The extracted relative risks for each primary study were analyzed with the use of random-effects calculations ( 21 , 22 ). Between-study heterogeneity was tested with the Q statistic (which was considered to be statistically significant for P <.10) ( 23 ), and its magnitude was measured with the I2 statistic (which was considered to be large for values of ≥50%) ( 23 ). The summary estimates of relative risk and their 95% confidence intervals, which we derived from scanned and digitized figure data, always agreed with the summary estimates stated in the published meta-analysis, with all deviations being less than 4% (mean = 1.3%, and median = 2%).

All summary estimates of relative risk were designed to be at least 1.0, i.e., we used the inverse of relative risk estimates of less than 1.0 (protective markers for the adverse clinical outcome), so that the comparison was always between the group with the higher risk for the primary outcome and the group with the lower risk for that outcome. Using this notation for the overall summary effect for each reported quality measure in each meta-analysis, we first calculated the random-effects summary estimate separately in studies with good and poor reported quality for each quality measure and then calculated the ratios of relative risks for poor- versus good-quality studies. Ratio of relative risk values of greater than 1.0 indicate that poor reported quality studies estimate larger prognostic effects than good reported quality studies and vice versa. The variance of the natural logarithm of the ratio of relative risk was estimated as the square root of the sum of variances of the natural logarithms of the summary relative risks in poor- and good-quality studies. The 95% confidence intervals were estimated as the point estimate plus or minus 1.96 times the square root of the variance. A meta-analysis could be included in this analysis only if it included studies with and without the quality component of interest. We also evaluated the difference between studies with good and poor reported quality by estimating the heterogeneity between the summary relative risks in the two subgroups with the Q and I2 statistics, described above. Finally, using random-effects calculations, we calculated the summary ratio of relative risk for each quality component across all the meta-analyses. We should acknowledge that, according to our definitions, studies with poor reported quality for a given component are likely a combination of studies with good quality in which the authors simply do not state methodologic details and studies with truly poor quality. Therefore, we would expect a dilution of the true effect of poor quality on estimated prognostic effect sizes.

Analyses were conducted with SPSS software version 13.0 (SPSS, Inc, Chicago, IL), StatXact-3 software version 3.0.2 (StatXact, 1996), and the metan module of Intercooled Stata version 8.2 (Stata Corp., College Station, TX). All P values were from two-sided statistical tests.

Results

Meta-analyses and Primary Studies Included in the Study

We initially identified 24 English language meta-analyses, four of which were excluded—three ( 2426 ) because they overlapped with another meta-analysis and one ( 27 ) because it incorporated fewer than six studies. Therefore, 20 meta-analyses ( 2847 ) were eligible for the analysis ( Table 1 ). The most common malignancy addressed was non–small-cell lung cancer in eight meta-analyses ( 32 , 3436 , 38 , 40 , 42 , 45 ), followed by breast cancer in six ( 28 , 37 , 39 , 41 , 44 , 47 ); other malignancies included osteosarcoma in two meta-analyses ( 31 , 46 ), head and neck squamous cell carcinoma in two ( 29 , 30 ), glioblastoma in one ( 33 ), and various neoplasms in one ( 43 ). The number of primary studies included in each meta-analysis ranged from six to 50, and the number of patients enrolled ranged from 395 to 8029. The prognostic factor had a statistically significant effect on the risk of the main outcome in 15 (75%) of the meta-analyses ( 28 , 30 , 32 , 3545 , 47 ), as judged by the obtained summary estimates, but only four of the summary relative risks exceeded 2.00 ( 30 , 41 , 43 , 47 ).

Table 1.

Characteristics of the meta-analyses analyzed and the percentage of studies fulfilling each quality item per meta-analysis *

ReferenceMalignancyPrognostic factorNo. of studies (No. of patients)I2 , % RR (95% CI)Blinded, %Prospective, %Time period, %Outcome, %Variables, %Power, %AD, %AR, %
Uzzan et al. ( 28 ) Breast cancerMVD25 (6501)761.56 (1.30 to 1.74)563662.55091.6479.145.8
Kyzas et al. ( 29 ) HNSCCTP5342 (3388)531.23 (0.97 to 1.57)47.614.371.459.566.6092.928.6
Kyzas et al. ( 30 ) HNSCCVEGF12 (1002)302.68 (1.82 to 3.96)66.607558.383.301000
Pakos et al. ( 31 ) OsteosarcomaTP5314 (436)01.45 (0.80 to 2.63)28.5057.164.335.7092.921.4
Martin et al. ( 32 ) NSCLCKi-6717 (1863)361.52 (1.26 to 1.84)4711.764.794.282.45.810029.4
Huncharek et al. ( 33 ) GlioblastomaEGFR7 (395)351.12 (0.61 to 2.07)014.357.110057.1010085.7
Meert et al. ( 34 ) NSCLCEGFR11 (2185)581.16 (0.95 to 1.42)54.518.281.810072.7010072.7
Meert et al. ( 35 ) NSCLCMVD23 (3620)681.92 (1.54 to 2.39)78.38.786.991.391.3010026.1
Meert et al. ( 36 ) NSCLCc-erbB-222 (3365)561.53 (1.28 to 1.83)63.627.381.895.577.3510050
Deming et al. ( 37 ) Breast cancerC-myc6 (1003)01.63 (1.25 to 2.14)0083.35083.3010083.3
Martin et al. ( 38 ) NSCLCBcl-219 (2909)651.69 (1.33 to 2.17)38.85.561.188.961.1094.455.5
Ryu et al. ( 39 ) Breast cancerBMI12 (8029)641.56 (1.22 to 2.00)066.6100100100066.658.3
Mascaux et al. ( 40 ) NSCLCK-Ras30 (3620)551.32 (1.14 to 1.51)3013.3709066.6010076.6
Weinschenker et al. ( 41 ) Breast cancerBMM9 (2252)782.03 (1.46 to 2.84)33.310066.677.766.6010088.8
Steels et al. ( 42 ) NSCLCTP5350 (5924)371.57 (1.41 to 1.75)48.38.388927209872
Funke and Schraut ( 43 ) VariousBMM9 (548)682.36 (1.70 to 3.28)33.310044.455.544.4010077.7
Ferrandina et al. ( 44 ) Breast cancerCathepsin-D12 (2691)521.98 (1.42 to 2.77)2516.666.650100010058.3
Choma et al. ( 45 ) NSCLCDNA ploidy34 (3761)241.89 (1.51 to 2.32)22.241.770.485.266.6010074.1
Pakos and Ioannidis ( 46 ) OsteosarcomaPgp14 (597)521.32 (0.72 to 2.41)57.214.357.178.635.7010042.9
Pharoah et al. ( 47 ) Breast cancerTP5312 (2319)552.20 (1.56 to 3.09)18.207558.3100010083.3
ReferenceMalignancyPrognostic factorNo. of studies (No. of patients)I2 , % RR (95% CI)Blinded, %Prospective, %Time period, %Outcome, %Variables, %Power, %AD, %AR, %
Uzzan et al. ( 28 ) Breast cancerMVD25 (6501)761.56 (1.30 to 1.74)563662.55091.6479.145.8
Kyzas et al. ( 29 ) HNSCCTP5342 (3388)531.23 (0.97 to 1.57)47.614.371.459.566.6092.928.6
Kyzas et al. ( 30 ) HNSCCVEGF12 (1002)302.68 (1.82 to 3.96)66.607558.383.301000
Pakos et al. ( 31 ) OsteosarcomaTP5314 (436)01.45 (0.80 to 2.63)28.5057.164.335.7092.921.4
Martin et al. ( 32 ) NSCLCKi-6717 (1863)361.52 (1.26 to 1.84)4711.764.794.282.45.810029.4
Huncharek et al. ( 33 ) GlioblastomaEGFR7 (395)351.12 (0.61 to 2.07)014.357.110057.1010085.7
Meert et al. ( 34 ) NSCLCEGFR11 (2185)581.16 (0.95 to 1.42)54.518.281.810072.7010072.7
Meert et al. ( 35 ) NSCLCMVD23 (3620)681.92 (1.54 to 2.39)78.38.786.991.391.3010026.1
Meert et al. ( 36 ) NSCLCc-erbB-222 (3365)561.53 (1.28 to 1.83)63.627.381.895.577.3510050
Deming et al. ( 37 ) Breast cancerC-myc6 (1003)01.63 (1.25 to 2.14)0083.35083.3010083.3
Martin et al. ( 38 ) NSCLCBcl-219 (2909)651.69 (1.33 to 2.17)38.85.561.188.961.1094.455.5
Ryu et al. ( 39 ) Breast cancerBMI12 (8029)641.56 (1.22 to 2.00)066.6100100100066.658.3
Mascaux et al. ( 40 ) NSCLCK-Ras30 (3620)551.32 (1.14 to 1.51)3013.3709066.6010076.6
Weinschenker et al. ( 41 ) Breast cancerBMM9 (2252)782.03 (1.46 to 2.84)33.310066.677.766.6010088.8
Steels et al. ( 42 ) NSCLCTP5350 (5924)371.57 (1.41 to 1.75)48.38.388927209872
Funke and Schraut ( 43 ) VariousBMM9 (548)682.36 (1.70 to 3.28)33.310044.455.544.4010077.7
Ferrandina et al. ( 44 ) Breast cancerCathepsin-D12 (2691)521.98 (1.42 to 2.77)2516.666.650100010058.3
Choma et al. ( 45 ) NSCLCDNA ploidy34 (3761)241.89 (1.51 to 2.32)22.241.770.485.266.6010074.1
Pakos and Ioannidis ( 46 ) OsteosarcomaPgp14 (597)521.32 (0.72 to 2.41)57.214.357.178.635.7010042.9
Pharoah et al. ( 47 ) Breast cancerTP5312 (2319)552.20 (1.56 to 3.09)18.207558.3100010083.3
*

The main outcome in all meta-analyses was death, except for three studies ( 28 , 43 , 44 ) in which disease progression or death was presented and one study ( 46 ) in which no response to treatment was presented. The effect metric was the hazard ratio in 11 studies ( 28 , 32 , 3440 , 42 , 47 ) and the odds ratio in the others ( 2931 , 33 , 41 , 4346 ). RR = relative risk; CI = confidence interval; AD = assay described; AR = assay referenced, MVD = microvessel density; HNSCC = head and neck squamous cell carcinoma; TP53 = tumor suppressor protein 53 and its gene; VEGF = vascular endothelial growth factor; NSCLC = non–small-cell lung cancer; Ki-67 = proliferation index Ki-67, c-erbB-2: HER-2/neu receptor; EGFR = epidermal growth factor receptor; C-myc = oncogene c-myc; Bcl-2 = oncogene bcl-2; BMI = body mass index; K-Ras = oncogene K-Ras; BMM = bone marrow micrometastases; Pgp = P-glycoprotein.

Table 1.

Characteristics of the meta-analyses analyzed and the percentage of studies fulfilling each quality item per meta-analysis *

ReferenceMalignancyPrognostic factorNo. of studies (No. of patients)I2 , % RR (95% CI)Blinded, %Prospective, %Time period, %Outcome, %Variables, %Power, %AD, %AR, %
Uzzan et al. ( 28 ) Breast cancerMVD25 (6501)761.56 (1.30 to 1.74)563662.55091.6479.145.8
Kyzas et al. ( 29 ) HNSCCTP5342 (3388)531.23 (0.97 to 1.57)47.614.371.459.566.6092.928.6
Kyzas et al. ( 30 ) HNSCCVEGF12 (1002)302.68 (1.82 to 3.96)66.607558.383.301000
Pakos et al. ( 31 ) OsteosarcomaTP5314 (436)01.45 (0.80 to 2.63)28.5057.164.335.7092.921.4
Martin et al. ( 32 ) NSCLCKi-6717 (1863)361.52 (1.26 to 1.84)4711.764.794.282.45.810029.4
Huncharek et al. ( 33 ) GlioblastomaEGFR7 (395)351.12 (0.61 to 2.07)014.357.110057.1010085.7
Meert et al. ( 34 ) NSCLCEGFR11 (2185)581.16 (0.95 to 1.42)54.518.281.810072.7010072.7
Meert et al. ( 35 ) NSCLCMVD23 (3620)681.92 (1.54 to 2.39)78.38.786.991.391.3010026.1
Meert et al. ( 36 ) NSCLCc-erbB-222 (3365)561.53 (1.28 to 1.83)63.627.381.895.577.3510050
Deming et al. ( 37 ) Breast cancerC-myc6 (1003)01.63 (1.25 to 2.14)0083.35083.3010083.3
Martin et al. ( 38 ) NSCLCBcl-219 (2909)651.69 (1.33 to 2.17)38.85.561.188.961.1094.455.5
Ryu et al. ( 39 ) Breast cancerBMI12 (8029)641.56 (1.22 to 2.00)066.6100100100066.658.3
Mascaux et al. ( 40 ) NSCLCK-Ras30 (3620)551.32 (1.14 to 1.51)3013.3709066.6010076.6
Weinschenker et al. ( 41 ) Breast cancerBMM9 (2252)782.03 (1.46 to 2.84)33.310066.677.766.6010088.8
Steels et al. ( 42 ) NSCLCTP5350 (5924)371.57 (1.41 to 1.75)48.38.388927209872
Funke and Schraut ( 43 ) VariousBMM9 (548)682.36 (1.70 to 3.28)33.310044.455.544.4010077.7
Ferrandina et al. ( 44 ) Breast cancerCathepsin-D12 (2691)521.98 (1.42 to 2.77)2516.666.650100010058.3
Choma et al. ( 45 ) NSCLCDNA ploidy34 (3761)241.89 (1.51 to 2.32)22.241.770.485.266.6010074.1
Pakos and Ioannidis ( 46 ) OsteosarcomaPgp14 (597)521.32 (0.72 to 2.41)57.214.357.178.635.7010042.9
Pharoah et al. ( 47 ) Breast cancerTP5312 (2319)552.20 (1.56 to 3.09)18.207558.3100010083.3
ReferenceMalignancyPrognostic factorNo. of studies (No. of patients)I2 , % RR (95% CI)Blinded, %Prospective, %Time period, %Outcome, %Variables, %Power, %AD, %AR, %
Uzzan et al. ( 28 ) Breast cancerMVD25 (6501)761.56 (1.30 to 1.74)563662.55091.6479.145.8
Kyzas et al. ( 29 ) HNSCCTP5342 (3388)531.23 (0.97 to 1.57)47.614.371.459.566.6092.928.6
Kyzas et al. ( 30 ) HNSCCVEGF12 (1002)302.68 (1.82 to 3.96)66.607558.383.301000
Pakos et al. ( 31 ) OsteosarcomaTP5314 (436)01.45 (0.80 to 2.63)28.5057.164.335.7092.921.4
Martin et al. ( 32 ) NSCLCKi-6717 (1863)361.52 (1.26 to 1.84)4711.764.794.282.45.810029.4
Huncharek et al. ( 33 ) GlioblastomaEGFR7 (395)351.12 (0.61 to 2.07)014.357.110057.1010085.7
Meert et al. ( 34 ) NSCLCEGFR11 (2185)581.16 (0.95 to 1.42)54.518.281.810072.7010072.7
Meert et al. ( 35 ) NSCLCMVD23 (3620)681.92 (1.54 to 2.39)78.38.786.991.391.3010026.1
Meert et al. ( 36 ) NSCLCc-erbB-222 (3365)561.53 (1.28 to 1.83)63.627.381.895.577.3510050
Deming et al. ( 37 ) Breast cancerC-myc6 (1003)01.63 (1.25 to 2.14)0083.35083.3010083.3
Martin et al. ( 38 ) NSCLCBcl-219 (2909)651.69 (1.33 to 2.17)38.85.561.188.961.1094.455.5
Ryu et al. ( 39 ) Breast cancerBMI12 (8029)641.56 (1.22 to 2.00)066.6100100100066.658.3
Mascaux et al. ( 40 ) NSCLCK-Ras30 (3620)551.32 (1.14 to 1.51)3013.3709066.6010076.6
Weinschenker et al. ( 41 ) Breast cancerBMM9 (2252)782.03 (1.46 to 2.84)33.310066.677.766.6010088.8
Steels et al. ( 42 ) NSCLCTP5350 (5924)371.57 (1.41 to 1.75)48.38.388927209872
Funke and Schraut ( 43 ) VariousBMM9 (548)682.36 (1.70 to 3.28)33.310044.455.544.4010077.7
Ferrandina et al. ( 44 ) Breast cancerCathepsin-D12 (2691)521.98 (1.42 to 2.77)2516.666.650100010058.3
Choma et al. ( 45 ) NSCLCDNA ploidy34 (3761)241.89 (1.51 to 2.32)22.241.770.485.266.6010074.1
Pakos and Ioannidis ( 46 ) OsteosarcomaPgp14 (597)521.32 (0.72 to 2.41)57.214.357.178.635.7010042.9
Pharoah et al. ( 47 ) Breast cancerTP5312 (2319)552.20 (1.56 to 3.09)18.207558.3100010083.3
*

The main outcome in all meta-analyses was death, except for three studies ( 28 , 43 , 44 ) in which disease progression or death was presented and one study ( 46 ) in which no response to treatment was presented. The effect metric was the hazard ratio in 11 studies ( 28 , 32 , 3440 , 42 , 47 ) and the odds ratio in the others ( 2931 , 33 , 41 , 4346 ). RR = relative risk; CI = confidence interval; AD = assay described; AR = assay referenced, MVD = microvessel density; HNSCC = head and neck squamous cell carcinoma; TP53 = tumor suppressor protein 53 and its gene; VEGF = vascular endothelial growth factor; NSCLC = non–small-cell lung cancer; Ki-67 = proliferation index Ki-67, c-erbB-2: HER-2/neu receptor; EGFR = epidermal growth factor receptor; C-myc = oncogene c-myc; Bcl-2 = oncogene bcl-2; BMI = body mass index; K-Ras = oncogene K-Ras; BMM = bone marrow micrometastases; Pgp = P-glycoprotein.

There were 380 analyzed relative risk estimates across the 20 meta-analyses. These estimates were derived from 340 primary articles [(53–392) available online]; some articles contributed two different population estimates in a meta-analysis or provided data for more than one meta-analysis [(57,70,74,101,108,118,127,136,137,155,158,161,171,175,188,194,197,199,217,239,251,253,264, 274,282,284,346,350,352,355) available online]. Only four primary studies [(111,179,235,358) available online] had more than 1000 patients, with the largest sample size being 1238 patients [(235) available online]. We extracted data (Supplementary Table 1 and Supplementary Data, available online) from 331 of these 340 primary articles—seven [(213,249,257,321,322,365,385) available online] were in Japanese, and we could not retrieve the full text for two studies [(89,124) available online]. All articles were published between 1987 and 2005. The relative risk and standard error for each study were obtained from digitized forest plots from 12 of the meta-analyses ( 28 , 3237 , 39 , 40 , 42 , 44 , 47 ). Five meta-analyses ( 37 , 39 , 41 , 44 , 47 ) used adjusted results for all studies. Four meta-analyses ( 2931 , 46 ) used unadjusted results for all studies. The remaining 11 meta-analyses ( 28 , 3236 , 38 , 40 , 42 , 43 , 45 ) apparently used adjusted and unadjusted estimates.

Quality Measures

The percentage of studies that adequately reported each quality component is presented for each meta-analysis in Table 1 . One hundred and twenty-nine (39.0%) of the 331 studies clearly stated that the evaluation of the prognostic marker was blinded to the patient outcome (for the many uncited references here and elsewhere in the section, see Supplementary Table 1 and Supplementary Data, available online). Only four studies [(72,91,197,253) available online] stated that the evaluation of the patient outcome was blinded to the marker. The percentage of blinded studies within a single meta-analysis varied substantially across meta-analyses, from 0% to 78.3% (exact P <.001). Only 73 (21.5%) of the 331 primary studies stated that they were prospective studies. However, again, substantial variability across meta-analyses was found, from 0% to 100% (exact P <.001), and two meta-analyses analyzed only prospective studies ( 41 , 43 ). The time period during which patients were selected was precisely defined in 232 (70.0%) of the 331 studies, with less variability across meta-analyses (range = 44%–100%; exact P = .092).

Outcomes were precisely defined in 254 (76.7%) of the 331 studies (range = 50%–100% across meta-analyses; exact P <.001). This percentage may be spuriously high because we considered all mortality definitions to be appropriate regardless of whether any level of detail was provided. A list of the considered candidate variables was presented in 234 (70.7%, range = 36%–100%; exact P <.001) studies. Only three (0.9%) [(61,152,155) available online] of the 331 primary studies gave a rationale (power calculation) for the sample size used. Finally, assay methods were described in 317 (95.8%, range = 67%–100%; exact P = .003) of the 331 studies, but the methods were referenced in only about half of them (i.e., 177 [53.5%] studies), with considerable variability across meta-analyses (range = 0%–89%; exact P <.001).

Quality Measures and Magnitude of Prognostic Effects

For each quality measure, we calculated a ratio of relative risk among all the meta-analyses. The ratio of relative risk and its 95% confidence interval for each quality measure is shown in Fig. 1 . Data on power calculations could not be reliably analyzed because too few studies provided a power calculation. We found scattered signals for statistically significant prognostic associations across seven ( 28 , 30 , 33 , 35 , 40 , 43 , 47 ) of the 20 meta-analyses that one or more quality component may be statistically significantly related to the magnitude of the prognostic effect. However, the signals were not consistent across studies, and only one meta-analysis ( 43 ) obtained statistically significantly larger effect sizes in poor-quality studies than in good-quality studies for three different quality components. In this meta-analysis, bone marrow micrometastases were found to be a stronger predictor of disease progression in studies that did not define the outcomes adequately than in those with well-defined outcomes, in studies that did not list the candidate variables than in those that listed them, and in studies that did not reference the assay used than in those with well-referenced assays.

Fig. 1.

Ratios of relative risks (RRRs) and 95% confidence intervals (CIs) for each quality measure among all the meta-analyses. Summary RRRs and 95% CIs were derived from random-effects analyses. Each meta-analysis is shown by the reference number given in Table 1 . Summary RRR = dashed line ; RRR = open square ; 95% CI = line ; 95% CIs of summary RRRs = solid diamonds . Size of symbol is proportional to the inverse variance of the estimate.

Statistically significantly larger prognostic effects were found in one meta-analysis for nonblinded studies than for blinded studies ( 40 ), but the opposite result was found in another meta-analysis ( 30 ). No statistically significant differences in prognostic effects were found between prospective studies and retrospective studies. In two meta-analyses ( 33 , 35 ), studies that did not define the enrollment time period gave statistically significantly higher prognostic effects, and in one meta-analysis ( 40 ), such studies gave statistically significantly smaller effects. In addition to the meta-analysis of Funke and Schraut ( 43 ), another meta-analysis ( 47 ) found greater prognostic effects in studies that did not provide a reference for the assay, but the opposite was observed in two other meta-analyses ( 28 , 33 ).

Summary Ratios of Relative Risks

Summary ratios of relative risks for poor- versus good-quality studies for various quality components ranged from 0.95 to 1.26 ( Table 2 ). With the exception of listing of candidate variables, these summary ratios of relative risks tended to indicate that larger prognostic effects were associated with poor-quality studies than with good-quality studies; however, none of these ratios of relative risks was statistically significant. There was statistically significant variability across the combined meta-analyses for the ratio of relative risks defined by the following four of these quality components: blinding ( P = .05), defining outcome ( P = .03), listing candidate variables ( P = .09), and providing references for the assays ( P = .02). This result is consistent with the pattern described above, in which several meta-analyses individually observed larger or smaller prognostic effects in poor- versus good-quality studies ( Fig. 1 ).

Table 2.

Summary ratios of relative risks (RRRs) and 95% confidence intervals (CIs) for studies with poor- versus good-quality reporting for all the quality measures

Quality measure Meta-analyses *Summary RRR (95% CI)I2 ( P value)
Blinding171.05 (0.90 to 1.23)40 (.05)
Design 141.12 (0.94 to 1.32)17 (.27)
Outcomes51.26 (0.76 to 2.09)62 (.03)
Time period191.00 (0.86 to 1.17)25 (.15)
Variables170.95 (0.79 to 1.14)34 (.09)
Assay described61.10 (0.82 to 1.47)0 (.68)
Assay referenced191.01 (0.86 to 1.19)45 (.02)
Quality measure Meta-analyses *Summary RRR (95% CI)I2 ( P value)
Blinding171.05 (0.90 to 1.23)40 (.05)
Design 141.12 (0.94 to 1.32)17 (.27)
Outcomes51.26 (0.76 to 2.09)62 (.03)
Time period191.00 (0.86 to 1.17)25 (.15)
Variables170.95 (0.79 to 1.14)34 (.09)
Assay described61.10 (0.82 to 1.47)0 (.68)
Assay referenced191.01 (0.86 to 1.19)45 (.02)
*

The number of meta-analyses had both studies with poor quality and studies with good quality (based on reporting).

All statistical tests were two-sided.

Retrospective versus prospective design. Studies with unstated design (amounting to 10% of total) are not included in the calculations presented. Their inclusion in the poor-quality group would not change the estimates considerably (data not shown).

Table 2.

Summary ratios of relative risks (RRRs) and 95% confidence intervals (CIs) for studies with poor- versus good-quality reporting for all the quality measures

Quality measure Meta-analyses *Summary RRR (95% CI)I2 ( P value)
Blinding171.05 (0.90 to 1.23)40 (.05)
Design 141.12 (0.94 to 1.32)17 (.27)
Outcomes51.26 (0.76 to 2.09)62 (.03)
Time period191.00 (0.86 to 1.17)25 (.15)
Variables170.95 (0.79 to 1.14)34 (.09)
Assay described61.10 (0.82 to 1.47)0 (.68)
Assay referenced191.01 (0.86 to 1.19)45 (.02)
Quality measure Meta-analyses *Summary RRR (95% CI)I2 ( P value)
Blinding171.05 (0.90 to 1.23)40 (.05)
Design 141.12 (0.94 to 1.32)17 (.27)
Outcomes51.26 (0.76 to 2.09)62 (.03)
Time period191.00 (0.86 to 1.17)25 (.15)
Variables170.95 (0.79 to 1.14)34 (.09)
Assay described61.10 (0.82 to 1.47)0 (.68)
Assay referenced191.01 (0.86 to 1.19)45 (.02)
*

The number of meta-analyses had both studies with poor quality and studies with good quality (based on reporting).

All statistical tests were two-sided.

Retrospective versus prospective design. Studies with unstated design (amounting to 10% of total) are not included in the calculations presented. Their inclusion in the poor-quality group would not change the estimates considerably (data not shown).

Discussion

Our analysis indicates that the reporting of important quality information in cancer prognostic studies is often suboptimal. Power calculations for the required sample size were rarely presented, indicating that investigators may tend to conduct opportunistic studies on the basis of specimen availability rather than on thoughtful design. Most studies that we examined were small and underpowered to detect modest effects. It is also possible that considerable publication bias may be operating in this field; this bias would further confound efforts to arrive at accurate estimates of prognostic effects. Only a few studies stated that the analysis was blinded, and the blinding in these studies was limited to masking the evaluation of the prognostic marker to the outcome but not the converse. Only 20% of the studies that we analyzed were apparently prospective. Furthermore, although almost all studies described the method for the assessment of the prognostic marker, only half of them referenced the assay method to make further details readily retrievable. The lists of candidate variables and specification of the outcomes were usually adequate. However, mortality, one of the least ambiguous outcomes, was a very common outcome in the studies that we evaluated. The percentage of studies in which all clinical endpoints were precisely defined appeared to be lower for studies examining other softer endpoints than for those examining mortality. Again, we emphasize that the quality of the reporting should be distinguished from the quality of what was actually done.

For all quality components examined, we did not find strong evidence that studies with a poor-quality indicator had a larger prognostic effect than studies with a good-quality indicator. However, there was wide diversity across meta-analyses for most of the quality measures examined. In several meta-analyses, studies with poor-quality reporting provided statistically significantly different estimates, but these estimates could be either smaller or larger effects than those of good-quality studies. Because it is not possible to predict how quality problems could alter the estimated prognostic effect in a specific meta-analysis, it is not possible to correct the final estimates by simply adjusting for quality problems.

Previous studies ( 1 , 2 , 811 ) that evaluated selected aspects of reporting of specific quality features in prognostic marker studies concur that major problems exist and that there is much room for improvement. We should acknowledge, however, that poor reporting does not by itself indicate bias, unless the incomplete reporting is an attempt to avoid disclosing some adverse aspect of how the study was designed or conducted. A recent review ( 48 ) noted that evaluation of these quality measures in systematic reviews is often incomplete and inadequate. Given these deficits, the recently proposed REMARK reporting guidelines ( 12 ) are a step forward. However, even the best-quality reporting in a study will not transform a poorly designed or analyzed study into a good one. Improved design and conduct of prognostic studies are really the key components in increasing the reliability of this important literature. Improvement in this area may require the development of common standards and collaborations between investigators who focus on minimizing selective reporting and publication bias ( 49 ).

Our study has several limitations. First, reported quality is only a surrogate for the true study quality ( 16 ), and it may blur the relationship between true quality and prognostic effects. However, it is extremely difficult to evaluate the true quality because there is no protocol registration of prognostic studies that would allow comparison of what was done and what was reported ( 12 ). Second, we used data and estimates from previously published meta-analyses. It is possible that one or more of these meta-analyses suffer from various biases ( 29 ), including publication bias, selective reporting, and lack of standardization of predictors and outcomes. However, unpublished and missing or nonstandardized data are unlikely to represent large, well-powered investigations of better quality. Third, we unavoidably had to accept different effect metrics (hazard ratios or odds ratios) to compare outcomes across meta-analyses. Most of the meta-analyses examined reported hazard ratios. Odds ratios may deviate more from one than the hazard ratios. However, inflation of the effect is likely to be only modest for the values encountered in these studies ( 50 ), and the ratio of odds ratios should generally be a good approximation to the ratio of hazard ratios. Fourth, we only examined prognostic studies in the field of cancer. Similar analyses in randomized trials ( 17 , 18 ) have presented variable results for different quality measures in various medical fields. Therefore, even though cancer is one of the most prominent fields in the prognostic literature, our results should be confirmed across other fields. Analyses of the genetics literature have also found major problems that raise questions about the credibility of the results of the genetic associations ( 51 , 52 ). Even though three-quarters of the analyzed prognostic factors had statistically significant summary estimates of relative risk, these relative risks were generally modest. It is unclear whether quality-related biases may be more prominent when reported relative risks are more pronounced.

Even when we allow for these caveats, the association between quality defects and prognostic effects may be unpredictable. Because of poor-quality reporting and the overall poor quality of the design and the performance of cancer prognostic studies ( 29 ), the literature on prognostic markers in cancer may be largely unreliable. Given the potential clinical importance of this prognostic information, efforts to improve the design and reporting of these studies are warranted.

There is no funding or conflict of interest regarding this study. The authors had full responsibility for the design of the study, the collection of the data, the analysis and interpretation of the data, the decision to submit the manuscript for publication, and the writing of the manuscript.

References

(1)

Altman DG. Systematic reviews of evaluations of prognostic variables.

BMJ
2001
;
323
:
224
–8.

(2)

Simon R, Altman DG. Statistical aspects of prognostic factor studies in oncology.

Br J Cancer
1994
;
69
:
979
–85.

(3)

Schilsky RL, Taube SE. Tumor markers as clinical cancer tests—are we there yet?.

Semin Oncol
2002
;
29
:
211
–2.

(4)

Hayes DF, Bast RC, Desch CE, Fritsche H Jr, Kemeny NE, Jessup JM, et al. Tumor marker utility grading system: a framework to evaluate clinical utility of tumor markers.

J Natl Cancer Inst
1996
;
88
:
1456
–66.

(5)

Bast RC Jr, Ravdin P, Hayes DF, Bates S, Fritsche H Jr, Jessup JM, et al.

2000
update of recommendations for the use of tumor markers in breast and colorectal cancer: clinical practice guidelines of the American Society of Clinical Oncology.
J Clin Oncol
2001
;
19
:
1865
–78.

(6)

Ransohoff DF. Lessons from controversy: ovarian cancer screening and serum proteomics.

J Natl Cancer Inst
2005
;
97
:
315
–9.

(7)

Ioannidis JP. Why most published research findings are false.

PLoS Med
2005
;
2
:
e124
.

(8)

Riley RD, Abrams KR, Sutton AJ, Lambert PC, Jones DR, Heney D, et al. Reporting of prognostic markers: current problems and development of guidelines for evidence-based practice in the future.

Br J Cancer
2003
;
88
:
1191
–8.

(9)

Altman DG, Lyman GH. Methodological challenges in the evaluation of prognostic factors in breast cancer.

Breast Cancer Res Treat
1998
;
52
:
289
–303.

(10)

Burton A, Altman DG. Missing covariate data within cancer prognostic studies: a review of current reporting and proposed guidelines.

Br J Cancer
2004
;
91
:
4
–8.

(11)

Deeks JJ, Dinnes J, D'Amico R, Sowden AJ, Sakarovitch C, Song F, et al. International Stroke Trial Collaborative Group; European Carotid Surgery Trial Collaborative Group. Evaluating non-randomised intervention studies.

Health Technol Assess
2003
;
7
:
iii
–x, 1–173.

(12)

McShane LM, Altman DG, Sauerbrei W, Taube SE, Gion M, Clark GM. Statistics Subcommittee of the NCI-EORTC Working Group on Cancer Diagnostics. Reporting recommendations for tumor marker prognostic studies (REMARK).

J Natl Cancer Inst
2005
;
97
:
1180
–4.

(13)

Lijmer JG, Mol BW, Heisterkamp S, Bonsel GJ, Prins MH, van der Meulen JH, et al. Empirical evidence of design-related bias in studies of diagnostic tests.

JAMA
1999
;
282
:
1061
–6.

(14)

Juni P, Egger M. Allocation concealment in clinical trials.

JAMA
2002
;
288
:
2407
–8;author reply 2408–9.

(15)

Linde K, Scholz M, Ramirez G, Clausius N, Melchart D, Jonas WB. Impact of study quality on outcome in placebo-controlled trials of homeopathy.

J Clin Epidemiol
1999
;
52
:
631
–6.

(16)

Kjaergard LL, Villumsen J, Gluud C. Reported methodologic quality and discrepancies between large and small randomized trials in meta-analyses.

Ann Intern Med
2001
;
135
:
982
–9.

(17)

Balk EM, Bonis PA, Moskowitz H, Schmid CH, Ioannidis JP, Wang C, et al. Correlation of quality measures with estimates of treatment effect in meta-analyses of randomized controlled trials.

JAMA
2002
;
287
:
2973
–82.

(18)

Moher D, Jadad AR, Nichol G, Penman M, Tugwell P, Walsh S. Assessing the quality of randomized controlled trials: an annotated bibliography of scales and checklists.

Control Clin Trials
1995
;
16
:
62
–73.

(19)

Juni P, Witschi A, Bloch R, Egger M. The hazards of scoring the quality of clinical trials for meta-analysis.

JAMA
1999
;
282
:
1054
–60.

(20)

Ioannidis JP, Haidich AB, Pappa M, Pantazis N, Kokori SI, Tektonidou MG, et al. Comparison of evidence of treatment effects in randomized and nonrandomized studies.

JAMA
2001
;
286
:
821
–30.

(21)

Pettiti DB. Meta-analysis, decision analysis and cost-effectiveness analysis. 2nd ed. New York (NY): Oxford University Press;

1999
.

(22)

Lau J, Ioannidis JP, Schmid CH. Quantitative synthesis in systematic reviews.

Ann Intern Med
1997
;
127
:
820
–6.

(23)

Higgins JP, Thompson SG. Quantifying heterogeneity in a meta-analysis.

Stat Med
2002
;
21
:
1539
–58.

(24)

Mitsudomi T, Hamajima N, Ogawa M, Takahashi T. Prognostic significance of p53 alterations in patients with non-small cell lung cancer: a meta-analysis.

Clin Cancer Res
2000
;
6
:
4055
–63.

(25)

Huncharek M, Kupelnick B, Geschwind JF, Caubet JF. Prognostic significance of p53 mutations in non-small cell lung cancer: a meta-analysis of 829 cases from eight published studies.

Cancer Lett
2000
;
153
:
219
–26.

(26)

Huncharek M, Muscat J, Geschwind JF. K-ras oncogene mutation as a prognostic marker in non-small cell lung cancer: a combined analysis of 881 cases.

Carcinogenesis
1999
;
20
:
1507
–10.

(27)

Riley RD, Burchill SA, Abrams KR, Heney D, Sutton AJ, Jones DR, et al. A systematic review of molecular and biological markers in tumours of the Ewing's sarcoma family.

Eur J Cancer
2003
;
39
:
19
–30.

(28)

Uzzan B, Nicolas P, Cucherat M, Perret GY. Microvessel density as a prognostic factor in women with breast cancer: a systematic review of the literature and meta-analysis.

Cancer Res
2004
;
64
:
2941
–55.

(29)

Kyzas PA, Loizou KT, Ioannidis JP. Selective reporting biases in cancer prognostic factor studies.

J Natl Cancer Inst
2005
;
97
:
1043
–55.

(30)

Kyzas PA, Cunha IW, Ioannidis JP. Prognostic significance of vascular endothelial growth factor immunohistochemical expression in head and neck squamous cell carcinoma: a meta-analysis.

Clin Cancer Res
2005
;
11
:
1434
–40.

(31)

Pakos EE, Kyzas PA, Ioannidis JP. Prognostic significance of TP53 tumor suppressor gene expression and mutations in human osteosarcoma: a meta-analysis.

Clin Cancer Res
2004
;
10
:
6208
–14.

(32)

Martin B, Paesmans M, Mascaux C, Berghmans T, Lothaire P, Meert AP, et al. Ki-67 expression and patients survival in lung cancer: systematic review of the literature with meta-analysis.

Br J Cancer
2004
;
91
:
2018
–25.

(33)

Huncharek M, Kupelnick B. Epidermal growth factor receptor gene amplification as a prognostic marker in glioblastoma multiforme: results of a meta-analysis.

Oncol Res
2000
;
12
:
107
–12.

(34)

Meert AP, Martin B, Delmotte P, Berghmans T, Lafitte JJ, Mascaux C, et al. The role of EGF-R expression on patient survival in lung cancer: a systematic review with meta-analysis.

Eur Respir J
2002
;
20
:
975
–81.

(35)

Meert AP, Paesmans M, Martin B, Delmotte P, Berghmans T, Verdebout JM, et al. The role of microvessel density on the survival of patients with lung cancer: a systematic review of the literature with meta-analysis.

Br J Cancer
2002
;
87
:
694
–701.

(36)

Meert AP, Martin B, Paesmans M, Berghmans T, Mascaux C, Verdebout JM, et al. The role of HER-2/neu expression on the survival of patients with lung cancer: a systematic review of the literature.

Br J Cancer
2003
;
89
:
959
–65.

(37)

Deming SL, Nass SJ, Dickson RB, Trock BJ. C-myc amplification in breast cancer: a meta-analysis of its occurrence and prognostic relevance.

Br J Cancer
2000
;
83
:
1688
–95.

(38)

Martin B, Paesmans M, Berghmans T, Branle F, Ghisdal L, Mascaux C, et al. Role of Bcl-2 as a prognostic factor for survival in lung cancer: a systematic review of the literature with meta-analysis.

Br J Cancer
2003
;
89
:
55
–64.

(39)

Ryu SY, Kim CB, Nam CM, Park JK, Kim KS, Park J, et al. Is body mass index the prognostic factor in breast cancer?: a meta-analysis.

J Korean Med Sci
2001
;
16
:
610
–4.

(40)

Mascaux C, Iannino N, Martin B, Paesmans M, Berghmans T, Dusart M, et al. The role of RAS oncogene in survival of patients with lung cancer: a systematic review of the literature with meta-analysis.

Br J Cancer
2005
;
92
:
131
–9.

(41)

Weinschenker P, Soares HP, Clark O, Del Giglio A. Immunocytochemical detection of epithelial cells in the bone marrow of primary breast cancer patients: a meta-analysis.

Breast Cancer Res Treat
2004
;
87
:
215
–24.

(42)

Steels E, Paesmans M, Berghmans T, Branle F, Lemaitre F, Mascaux C, et al. Role of p53 as a prognostic factor for survival in lung cancer: a systematic review of the literature with a meta-analysis.

Eur Respir J
2001
;
18
:
705
–19.

(43)

Funke I, Schraut W. Meta-analyses of studies on bone marrow micrometastases: an independent prognostic impact remains to be substantiated.

J Clin Oncol
1998
;
16
:
557
–66.

(44)

Ferrandina G, Scambia G, Bardelli F, Benedetti Panici P, Mancuso S, Messori A. Relationship between cathepsin-D content and disease-free survival in node-negative breast cancer patients: a meta-analysis.

Br J Cancer
1997
;
76
:
661
–6.

(45)

Choma D, Daures JP, Quantin X, Pujol JL. Aneuploidy and prognosis of non-small-cell lung cancer: a meta-analysis of published data.

Br J Cancer
2001
;
85
:
14
–22.

(46)

Pakos EE, Ioannidis JP. The association of P-glycoprotein with response to chemotherapy and clinical outcome in patients with osteosarcoma. A meta-analysis.

Cancer
2003
;
98
:
581
–9.

(47)

Pharoah PD, Day NE, Caldas C. Somatic mutations in the p53 gene and prognosis in breast cancer: a meta-analysis.

Br J Cancer
1999
;
80
:
1968
–73.

(48)

Hayden JA, Cote P, Bombardier C. Evaluation of the quality of prognosis studies in systematic reviews.

Ann Intern Med
2006
;
144
:
427
–37.

(49)

Ioannidis JP, Gwinn M, Little J, Higgins JP, Bernstein JL, Boffetta P, et al. A road map for efficient and reliable human genome epidemiology.

Nat Genet
2006
;
38
:
3
–5.

(50)

Selvin S. Statistical analysis of epidemiologic data. New York (NY): Oxford University Press;

1996
. p. 419–21.

(51)

Bogardus ST Jr, Concato J, Feinstein AR. Clinical epidemiological quality in molecular genetic research: the need for methodological standards.

JAMA
1999
;
281
:
1919
–26.

(52)

Ioannidis JP, Trikalinos TA, Ntzani EE, Contopoulos-Ioannidis DG. Genetic associations in large versus small studies: an empirical assessment.

Lancet
2003
;
361
:
567
–71.