选择受控子集测试的危险及其在口语识别中的应用

IF 2.781

The Journal of Physical Chemistry Pub Date : 2019-01-24 DOI:10.5334/joc.51

David Liben-Nowell, Julia Strand, Alexa Sharp, Tom Wexler, Kevin Woods

{"title":"选择受控子集测试的危险及其在口语识别中的应用","authors":"David Liben-Nowell, Julia Strand, Alexa Sharp, Tom Wexler, Kevin Woods","doi":"10.5334/joc.51","DOIUrl":null,"url":null,"abstract":"When examining the effects of a continuous variable x on an outcome y, a researcher might choose to dichotomize on x, dividing the population into two sets-low x and high x-and testing whether these two subpopulations differ with respect to y. Dichotomization has long been known to incur a cost in statistical power, but there remain circumstances in which it is appealing: an experimenter might use it to control for confounding covariates through subset selection, by carefully choosing a subpopulation of Low and a corresponding subpopulation of High that are balanced with respect to a list of control variables, and then comparing the subpopulations' y values. This \"divide, select, and test\" approach is used in many papers throughout the psycholinguistics literature, and elsewhere. Here we show that, despite the apparent innocuousness, these methodological choices can lead to erroneous results, in two ways. First, if the balanced subsets of Low and High are selected in certain ways, it is possible to conclude a relationship between x and y not present in the full population. Specifically, we show that previously published conclusions drawn from this methodology-about the effect of a particular lexical property on spoken-word recognition-do not in fact appear to hold. Second, if the balanced subsets of Low and High are selected randomly, this methodology frequently fails to show a relationship between x and y that is present in the full population. Our work uncovers a new facet of an ongoing research effort: to identify and reveal the implicit freedoms of experimental design that can lead to false conclusions.","PeriodicalId":58,"journal":{"name":"The Journal of Physical Chemistry ","volume":"79 13","pages":"2"},"PeriodicalIF":2.7810,"publicationDate":"2019-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6634384/pdf/","citationCount":"0","resultStr":"{\"title\":\"The Danger of Testing by Selecting Controlled Subsets, with Applications to Spoken-Word Recognition.\",\"authors\":\"David Liben-Nowell, Julia Strand, Alexa Sharp, Tom Wexler, Kevin Woods\",\"doi\":\"10.5334/joc.51\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"When examining the effects of a continuous variable x on an outcome y, a researcher might choose to dichotomize on x, dividing the population into two sets-low x and high x-and testing whether these two subpopulations differ with respect to y. Dichotomization has long been known to incur a cost in statistical power, but there remain circumstances in which it is appealing: an experimenter might use it to control for confounding covariates through subset selection, by carefully choosing a subpopulation of Low and a corresponding subpopulation of High that are balanced with respect to a list of control variables, and then comparing the subpopulations' y values. This \\\"divide, select, and test\\\" approach is used in many papers throughout the psycholinguistics literature, and elsewhere. Here we show that, despite the apparent innocuousness, these methodological choices can lead to erroneous results, in two ways. First, if the balanced subsets of Low and High are selected in certain ways, it is possible to conclude a relationship between x and y not present in the full population. Specifically, we show that previously published conclusions drawn from this methodology-about the effect of a particular lexical property on spoken-word recognition-do not in fact appear to hold. Second, if the balanced subsets of Low and High are selected randomly, this methodology frequently fails to show a relationship between x and y that is present in the full population. Our work uncovers a new facet of an ongoing research effort: to identify and reveal the implicit freedoms of experimental design that can lead to false conclusions.\",\"PeriodicalId\":58,\"journal\":{\"name\":\"The Journal of Physical Chemistry \",\"volume\":\"79 13\",\"pages\":\"2\"},\"PeriodicalIF\":2.7810,\"publicationDate\":\"2019-01-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6634384/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The Journal of Physical Chemistry \",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5334/joc.51\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Journal of Physical Chemistry ","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5334/joc.51","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

在检查连续变量x对结果y的影响时，研究人员可能会选择对x进行二分类，将总体分为两组-低x和高x -并测试这两个亚总体相对于y是否不同。二分类长期以来一直被认为会导致统计能力的成本，但仍然存在吸引人的情况:实验者可以使用它通过子集选择来控制混杂协变量，通过仔细选择相对于控制变量列表平衡的Low亚种群和相应的High亚种群，然后比较亚种群的y值。这种“划分、选择和测试”的方法在心理语言学文献和其他地方的许多论文中都有使用。在这里，我们表明，尽管表面上无害，这些方法的选择可能导致错误的结果，在两个方面。首先，如果以某种方式选择Low和High的平衡子集，则有可能得出在整个总体中不存在的x和y之间的关系。具体来说，我们表明，以前发表的结论，从这个方法得出的，关于一个特定的词汇属性对口语单词识别的影响，实际上似乎并不成立。其次，如果Low和High的平衡子集是随机选择的，那么这种方法通常无法显示整个总体中存在的x和y之间的关系。我们的工作揭示了正在进行的研究工作的一个新方面:识别和揭示可能导致错误结论的实验设计的隐性自由。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

The Danger of Testing by Selecting Controlled Subsets, with Applications to Spoken-Word Recognition.

查看原文本刊更多论文

The Danger of Testing by Selecting Controlled Subsets, with Applications to Spoken-Word Recognition.

When examining the effects of a continuous variable x on an outcome y, a researcher might choose to dichotomize on x, dividing the population into two sets-low x and high x-and testing whether these two subpopulations differ with respect to y. Dichotomization has long been known to incur a cost in statistical power, but there remain circumstances in which it is appealing: an experimenter might use it to control for confounding covariates through subset selection, by carefully choosing a subpopulation of Low and a corresponding subpopulation of High that are balanced with respect to a list of control variables, and then comparing the subpopulations' y values. This "divide, select, and test" approach is used in many papers throughout the psycholinguistics literature, and elsewhere. Here we show that, despite the apparent innocuousness, these methodological choices can lead to erroneous results, in two ways. First, if the balanced subsets of Low and High are selected in certain ways, it is possible to conclude a relationship between x and y not present in the full population. Specifically, we show that previously published conclusions drawn from this methodology-about the effect of a particular lexical property on spoken-word recognition-do not in fact appear to hold. Second, if the balanced subsets of Low and High are selected randomly, this methodology frequently fails to show a relationship between x and y that is present in the full population. Our work uncovers a new facet of an ongoing research effort: to identify and reveal the implicit freedoms of experimental design that can lead to false conclusions.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

The Journal of Physical Chemistry

自引率

0.00%

发文量