论文标题
调整时间:通过调整明显的选择性推断,改善实验心理学的可复制性
Time to adjust: Improving replicability in experimental psychology by adjustment for evident selective inference
论文作者
论文摘要
心理科学领域一直在努力应对可复制性危机。各种问题已被确定为该问题的潜在来源。我们揭示了一个潜在的来源,该来源在很大程度上被忽略了,并证明了其对问题的重要贡献:多次比较的实践。我们分析了心理学可重复性项目中的88篇论文,发现在一篇论文中通常报告了多个结果,范围为4至730(M = 77.7),而没有多重比较调整。我们使用层次FDR控制程序(TreeBH; Bogomolov等,2021)追溯应用了这种调整。调整后,88个结果中有21个被认为微不足道。这21个结果中有20个确实没有复制,构成了不可恢复的发现的三分之一,同时保持了97%的功率。我们建议,这应该成为提高实验心理学复制性的必要手段的共同做法。
The field of psychological sciences has been grappling with the replicability crisis. Various issues have been identified as potential sources of this problem. We bring to light a potential source that has largely been overlooked and demonstrate its significant contribution to the problem: the practice of multiple comparisons. We analyzed 88 papers from the Reproducibility Project in Psychology and found that multiple results are commonly reported in a single paper, ranging from 4 to 730 (M=77.7), without multiple comparison adjustments. We retroactively applied such an adjustment using a hierarchical FDR controlling procedure (TreeBH; Bogomolov et al., 2021). 21 of 88 results were deemed insignificant after adjustment. Twenty of these 21 results indeed failed to replicate, constituting over a third of the non-replicable findings, while maintaining 97% power. We propose that this should become a common practice as an essential means to increase replicability in experimental psychology.