A New Look at P Values for Randomized Clinical Trials.

Autor: van Zwet E; Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, the Netherlands., Gelman A; Department of Statistics, Columbia University, New York.; Department of Political Science, Columbia University, New York., Greenland S; Department of Epidemiology, University of California, Los Angeles, Los Angeles.; Department of Statistics, University of California, Los Angeles, Los Angeles., Imbens G; Graduate School of Business, Department of Economics, Stanford University, Stanford, CA., Schwab S; Swisstransplant, Bern, Switzerland., Goodman SN; Department of Epidemiology and Population Health, Stanford University, Stanford, CA.
Jazyk: angličtina
Zdroj: NEJM evidence [NEJM Evid] 2024 Jan; Vol. 3 (1), pp. EVIDoa2300003. Date of Electronic Publication: 2023 Dec 22.
DOI: 10.1056/EVIDoa2300003
Abstrakt: BACKGROUND: We have examined the primary efficacy results of 23,551 randomized clinical trials from the Cochrane Database of Systematic Reviews. METHODS: We estimate that the great majority of trials have much lower statistical power for actual effects than the 80 or 90% for the stated effect sizes. Consequently, “statistically significant” estimates tend to seriously overestimate actual treatment effects, “nonsignificant” results often correspond to important effects, and efforts to replicate often fail to achieve “significance” and may even appear to contradict initial results. To address these issues, we reinterpret the P value in terms of a reference population of studies that are, or could have been, in the Cochrane Database. RESULTS: This leads to an empirical guide for the interpretation of an observed P value from a “typical” clinical trial in terms of the degree of overestimation of the reported effect, the probability of the effect’s sign being wrong, and the predictive power of the trial. CONCLUSIONS: Such an interpretation provides additional insight about the effect under study and can guard medical researchers against naive interpretations of the P value and overoptimistic effect sizes. Because many research fields suffer from low power, our results are also relevant outside the medical domain. (Funded by the U.S. Office of Naval Research.)
Databáze: MEDLINE