The Danger of Testing by Selecting Controlled Subsets, with Applications to Spoken-Word Recognition.

IF 2.781
David Liben-Nowell, Julia Strand, Alexa Sharp, Tom Wexler, Kevin Woods
{"title":"The Danger of Testing by Selecting Controlled Subsets, with Applications to Spoken-Word Recognition.","authors":"David Liben-Nowell, Julia Strand, Alexa Sharp, Tom Wexler, Kevin Woods","doi":"10.5334/joc.51","DOIUrl":null,"url":null,"abstract":"<p><p>When examining the effects of a continuous variable <i>x</i> on an outcome <i>y</i>, a researcher might choose to dichotomize on <i>x</i>, dividing the population into two sets-low <i>x</i> and high <i>x</i>-and testing whether these two subpopulations differ with respect to <i>y</i>. Dichotomization has long been known to incur a cost in statistical power, but there remain circumstances in which it is appealing: an experimenter might use it to control for confounding covariates through subset selection, by carefully choosing a subpopulation of Low and a corresponding subpopulation of High that are balanced with respect to a list of control variables, and then comparing the subpopulations' <i>y</i> values. This \"divide, select, and test\" approach is used in many papers throughout the psycholinguistics literature, and elsewhere. Here we show that, despite the apparent innocuousness, these methodological choices can lead to erroneous results, in two ways. First, if the balanced subsets of Low and High are selected in certain ways, it is possible to conclude a relationship between <i>x</i> and <i>y</i> not present in the full population. Specifically, we show that previously published conclusions drawn from this methodology-about the effect of a particular lexical property on spoken-word recognition-do not in fact appear to hold. Second, if the balanced subsets of Low and High are selected randomly, this methodology frequently fails to show a relationship between <i>x</i> and <i>y</i> that is present in the full population. Our work uncovers a new facet of an ongoing research effort: to identify and reveal the implicit freedoms of experimental design that can lead to false conclusions.</p>","PeriodicalId":58,"journal":{"name":"The Journal of Physical Chemistry ","volume":"79 13","pages":"2"},"PeriodicalIF":2.7810,"publicationDate":"2019-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6634384/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Journal of Physical Chemistry ","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5334/joc.51","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

When examining the effects of a continuous variable x on an outcome y, a researcher might choose to dichotomize on x, dividing the population into two sets-low x and high x-and testing whether these two subpopulations differ with respect to y. Dichotomization has long been known to incur a cost in statistical power, but there remain circumstances in which it is appealing: an experimenter might use it to control for confounding covariates through subset selection, by carefully choosing a subpopulation of Low and a corresponding subpopulation of High that are balanced with respect to a list of control variables, and then comparing the subpopulations' y values. This "divide, select, and test" approach is used in many papers throughout the psycholinguistics literature, and elsewhere. Here we show that, despite the apparent innocuousness, these methodological choices can lead to erroneous results, in two ways. First, if the balanced subsets of Low and High are selected in certain ways, it is possible to conclude a relationship between x and y not present in the full population. Specifically, we show that previously published conclusions drawn from this methodology-about the effect of a particular lexical property on spoken-word recognition-do not in fact appear to hold. Second, if the balanced subsets of Low and High are selected randomly, this methodology frequently fails to show a relationship between x and y that is present in the full population. Our work uncovers a new facet of an ongoing research effort: to identify and reveal the implicit freedoms of experimental design that can lead to false conclusions.

选择受控子集测试的危险及其在口语识别中的应用
在检查连续变量x对结果y的影响时,研究人员可能会选择对x进行二分类,将总体分为两组-低x和高x -并测试这两个亚总体相对于y是否不同。二分类长期以来一直被认为会导致统计能力的成本,但仍然存在吸引人的情况:实验者可以使用它通过子集选择来控制混杂协变量,通过仔细选择相对于控制变量列表平衡的Low亚种群和相应的High亚种群,然后比较亚种群的y值。这种“划分、选择和测试”的方法在心理语言学文献和其他地方的许多论文中都有使用。在这里,我们表明,尽管表面上无害,这些方法的选择可能导致错误的结果,在两个方面。首先,如果以某种方式选择Low和High的平衡子集,则有可能得出在整个总体中不存在的x和y之间的关系。具体来说,我们表明,以前发表的结论,从这个方法得出的,关于一个特定的词汇属性对口语单词识别的影响,实际上似乎并不成立。其次,如果Low和High的平衡子集是随机选择的,那么这种方法通常无法显示整个总体中存在的x和y之间的关系。我们的工作揭示了正在进行的研究工作的一个新方面:识别和揭示可能导致错误结论的实验设计的隐性自由。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信