论便利性、多样性和普遍性：评Scaff等人（2025）

IF 3.2 1区心理学 Q2 PSYCHOLOGY, DEVELOPMENTAL

Developmental Science Pub Date : 2025-07-17 DOI:10.1111/desc.70050

Evan Kidd, Rowena Garcia

{"title":"论便利性、多样性和普遍性：评Scaff等人（2025）","authors":"Evan Kidd, Rowena Garcia","doi":"10.1111/desc.70050","DOIUrl":null,"url":null,"abstract":"The Child Language Data Exchange System (CHILDES, MacWhinney 2000) is the jewel in the crown of child language research. Emerging in the 1980s to archive and facilitate the sharing and re-use of precious and laborious-to-collect-and-process corpus data (MacWhinney and Snow 1985), its forward-thinking ethos predated the modern Open Science movement by decades. Thanks to the hard work of Brian MacWhinney and others, it has continued to expand, has birthed similar repositories (AphasiaBank, Forbes et al. 2012; HomeBank, VanDam et al. 2016), and no doubt inspired others (e.g., WordBank, Frank et al. 2017). Progress in the field of child language acquisition has unquestionably accelerated because of its existence. Yet, as Scaff et al. (2025) show in their paper, the data in CHILDES are not fully representative of the languages of the world and the children who learn them. Despite containing corpora on many dozens of languages, those languages are predominantly Indo-European, with the data mostly coming from affluent urban nuclear families in wealthy countries. They conclude that, because of this skew in the data, researchers should be mindful of generalising from the data.In this commentary, we discuss two issues that the Scaff et al. (2025) paper raises, addressing (i) the importance of data coverage in studies of child language, and (ii) the extent to which demographic variables allow generalisations from existing corpus data.Research in the cognitive and psychological sciences overwhelmingly relies on convenience sampling. In studies of language, convenience sampling comes in two forms—selection of the target language(s) and selection of participants. Both contain variability. The circa 7000 languages spoken across the world vary on many different dimensions, such that it is difficult to identify substantive universals (Evans and Levinson 2009), and the boundaries of what is possible in language continue to expand as language documentation uncovers new phenomena (Seifart et al. 2018). However, we are a long way from understanding how numerous features of language are acquired. In a paper that analysed language coverage in child language journals across 45 years of publishing, we found a large skew in the literature towards English and a handful of other, mostly Indo-European languages (Kidd and Garcia 2022a). Passmore et al. (2025) juxtaposed those data against two forms of data: (i) the Expanded Graded Intergenerational Disruption Scale (EGIDS, Eberhard et al. 2021; Lewis and Simons 2010), which ranks languages according to their vitality (i.e., their vulnerability to loss, from healthy national languages like English and Spanish to ‘sleeping’ languages with no current speakers), and (ii) large language databases used to represent the design space of phonology (Phoible: Moran and McCloy 2019) and grammar (Grambank: Skirgård et al. 2023). They showed that studies of child language overwhelmingly concentrate on conveniently sampled big (i.e., widely spoken by many speakers) national languages at the expense of smaller vulnerable languages, and that the languages studied occupy a small subset of the estimated design space of language. Thus, convenience sampling has a scientific cost, limiting our capacity to understand how the vast corners of the design space for human language are acquired. There is also a humanitarian cost—ignoring vulnerable languages limits the potential to contribute to efforts to stem the continual flow of language loss (Evans 2022; Pye 2021).Scaff et al. (2025) identify limitations in language coverage and in participant selection. Skewed language coverage is not unique to child language and is indicative of a broader problem in the cognitive science of language (see Berghoff and Bylund 2025; Bylund et al. 2024). There is no easy solution, but it almost certainly involves diversifying the researchers in the field (Aravena-Bravo et al. 2023; Kidd and Garcia 2022b). In cases of vulnerable languages, widening participation to include community members to create projects that align with community goals (e.g., creating knowledge to inform educational priorities) adds social value (Passmore et al. 2025). Corpus data are an important resource in this endeavour, but the creation of corpora takes considerable time and resources. The gold standard in the field is to collect longitudinal corpora from several children at a sampling density that allows the researcher to capture the acquisition of a large range of frequent and infrequent phenomena. However, achieving such depth comes at the expense of breadth of language coverage. Initiatives like the Sketch Acquisition Project attempt to increase breadth of coverage by sacrificing depth (Hellwig et al. 2023), which will hopefully address some of the problems in language coverage. Technological development in recording will also speed up corpus development (e.g., Scaff et al. 2024), with much promise for relieving the transcription bottleneck.Increasing language coverage and the rapid pace of technological development demands a reassessment of ethical processes. The CHILDES database pioneered data sharing, the value of which has been immeasurable. That this has been possible is remarkable given the ethics of working with children, who cannot give direct consent for their participation. The standard use of video and the advent of daylong recordings, which capture families in their most unguarded moments, raise considerable issues concerning privacy, an issue that promises to become more complicated to navigate now that an individual's digital footprint presents potentially unimaginable uses as technology develops (see Léon and Cristia 2024). Governments are now protecting their citizens through important privacy laws, giving individuals power to control their data. The end result is that posting data on CHILDES and similar repositories is likely to evolve into a process of continual consent and heightened access restrictions. A related point that is relevant to Indigenous cultures is the issue of data sovereignty (Russo Carroll et al. 2021). Groups whose languages and cultures have been decimated by colonialism quite rightly demand sovereignty over any data collected in their communities (e.g., in Australia, it is considered best practice). Many groups see anything less as a form of epistemic violence that perpetuates colonialism (Woods 2022).The skew in language coverage raises clear difficulties in generalisation: if we sample from a restricted corner of the total design space of language then we cannot know how phenomena outside of the design space are acquired. However, the degree to which the other demographic biases identified by Scaff et al. (2025)—i.e., socioeconomic status (SES), urbanisation, and family structure—limit generalisation is less clear. As sources of within-language variation, their influence on acquisition will be via the way they change the language environment and other more distal variables that might have a broader influence on cognitive development (e.g., due to increased neurotoxin levels in urban environments, Shadbegian et al. 2019). Distal environmental variables will affect acquisition via complex developmental cascades, and we leave discussion of these for another time. Concerning variation in the input, the key questions are: (i) in what ways does variation in the input affect acquisition? And (ii) can we build theoretical models that account for such variability?Concerning (i), the answer is complex and, in some cases, not without controversy. Take, for example, SES, the most well-studied variable on the list. One underappreciated problem in discussions of SES and its influence on development is that, as a kind of formative measurement (i.e., it is only measurable through the measurement of other variables, e.g., Bainter and Bollen 2014), it is defined based on several many interacting indicators (e.g., parental education, income, postcode), making it difficult to pin down exactly why it may influence an outcome variable. Additionally, as a distinctly culturally-specific concept, it will be defined differently and will likely have a variable influence on development across countries (Singh et al. 2025). Ambiguity about how it influences acquisition has led to intense debate in countries like the USA (e.g., Sperry et al. 2019; Golinkoff et al. 2019). Debates on its role in acquisition have centred on attainment, but we might instead ask: does variation in the input due to SES have a significant enough effect on the process of acquisition, such that we should not expect to see the same underlying developmental mechanisms at play? If not, then we may be able to make cautious generalisations beyond individual cases. Regardless, we agree with Scaff et al. that good metadata for corpora should include this kind of background information.Concerning (ii), building a theory of how variation in the input influences acquisition is no small task, in part because it will interact with endogenous sources of variation (e.g., Eghbalzad et al. 2021). Regardless, understanding when and how variation matters has the potential to provide valuable insights into mechanisms underlying acquisition (Kidd and Donnelly 2020).The cognitive and psychological sciences are currently within an extended period of soul-searching concerning their ability to explain how all humans learn, think, and feel. The crux of this is the realisation that the modern study of human psychology has primarily been a Western (or one might say, WEIRD, Henrich et al. 2010) concern (or perhaps more accurately, dominated by Western European academic traditions). In most cases, ‘the field’, broadly construed, is poorly equipped to engage with issues of cultural influences on behaviour because of a range of barriers, including a lack of researcher diversity, a bias for inferring universals from skewed data, and a lack of meaningful interaction with adjacent disciplines like anthropology (see Núñez et al. 2019). Child language has always had a proud history of crosslinguistic research (e.g., Slobin 1985), and its location at the intersection of psychology, linguistics, anthropology, and computer science means it has the potential to act as a leader in discussions about interdisciplinarity, generalisability, and the necessity of inclusion. Raising awareness of gaps in our data coverage, as Scaff et al. (2025) have done, is an important step in this process.","PeriodicalId":48392,"journal":{"name":"Developmental Science","volume":"28 5","pages":""},"PeriodicalIF":3.2000,"publicationDate":"2025-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/desc.70050","citationCount":"0","resultStr":"{\"title\":\"On Convenience, Diversity, and Generalisability: A Commentary on Scaff et al. (2025)\",\"authors\":\"Evan Kidd, Rowena Garcia\",\"doi\":\"10.1111/desc.70050\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The Child Language Data Exchange System (CHILDES, MacWhinney 2000) is the jewel in the crown of child language research. Emerging in the 1980s to archive and facilitate the sharing and re-use of precious and laborious-to-collect-and-process corpus data (MacWhinney and Snow 1985), its forward-thinking ethos predated the modern Open Science movement by decades. Thanks to the hard work of Brian MacWhinney and others, it has continued to expand, has birthed similar repositories (AphasiaBank, Forbes et al. 2012; HomeBank, VanDam et al. 2016), and no doubt inspired others (e.g., WordBank, Frank et al. 2017). Progress in the field of child language acquisition has unquestionably accelerated because of its existence. Yet, as Scaff et al. (2025) show in their paper, the data in CHILDES are not fully representative of the languages of the world and the children who learn them. Despite containing corpora on many dozens of languages, those languages are predominantly Indo-European, with the data mostly coming from affluent urban nuclear families in wealthy countries. They conclude that, because of this skew in the data, researchers should be mindful of generalising from the data.In this commentary, we discuss two issues that the Scaff et al. (2025) paper raises, addressing (i) the importance of data coverage in studies of child language, and (ii) the extent to which demographic variables allow generalisations from existing corpus data.Research in the cognitive and psychological sciences overwhelmingly relies on convenience sampling. In studies of language, convenience sampling comes in two forms—selection of the target language(s) and selection of participants. Both contain variability. The circa 7000 languages spoken across the world vary on many different dimensions, such that it is difficult to identify substantive universals (Evans and Levinson 2009), and the boundaries of what is possible in language continue to expand as language documentation uncovers new phenomena (Seifart et al. 2018). However, we are a long way from understanding how numerous features of language are acquired. In a paper that analysed language coverage in child language journals across 45 years of publishing, we found a large skew in the literature towards English and a handful of other, mostly Indo-European languages (Kidd and Garcia 2022a). Passmore et al. (2025) juxtaposed those data against two forms of data: (i) the Expanded Graded Intergenerational Disruption Scale (EGIDS, Eberhard et al. 2021; Lewis and Simons 2010), which ranks languages according to their vitality (i.e., their vulnerability to loss, from healthy national languages like English and Spanish to ‘sleeping’ languages with no current speakers), and (ii) large language databases used to represent the design space of phonology (Phoible: Moran and McCloy 2019) and grammar (Grambank: Skirgård et al. 2023). They showed that studies of child language overwhelmingly concentrate on conveniently sampled big (i.e., widely spoken by many speakers) national languages at the expense of smaller vulnerable languages, and that the languages studied occupy a small subset of the estimated design space of language. Thus, convenience sampling has a scientific cost, limiting our capacity to understand how the vast corners of the design space for human language are acquired. There is also a humanitarian cost—ignoring vulnerable languages limits the potential to contribute to efforts to stem the continual flow of language loss (Evans 2022; Pye 2021).Scaff et al. (2025) identify limitations in language coverage and in participant selection. Skewed language coverage is not unique to child language and is indicative of a broader problem in the cognitive science of language (see Berghoff and Bylund 2025; Bylund et al. 2024). There is no easy solution, but it almost certainly involves diversifying the researchers in the field (Aravena-Bravo et al. 2023; Kidd and Garcia 2022b). In cases of vulnerable languages, widening participation to include community members to create projects that align with community goals (e.g., creating knowledge to inform educational priorities) adds social value (Passmore et al. 2025). Corpus data are an important resource in this endeavour, but the creation of corpora takes considerable time and resources. The gold standard in the field is to collect longitudinal corpora from several children at a sampling density that allows the researcher to capture the acquisition of a large range of frequent and infrequent phenomena. However, achieving such depth comes at the expense of breadth of language coverage. Initiatives like the Sketch Acquisition Project attempt to increase breadth of coverage by sacrificing depth (Hellwig et al. 2023), which will hopefully address some of the problems in language coverage. Technological development in recording will also speed up corpus development (e.g., Scaff et al. 2024), with much promise for relieving the transcription bottleneck.Increasing language coverage and the rapid pace of technological development demands a reassessment of ethical processes. The CHILDES database pioneered data sharing, the value of which has been immeasurable. That this has been possible is remarkable given the ethics of working with children, who cannot give direct consent for their participation. The standard use of video and the advent of daylong recordings, which capture families in their most unguarded moments, raise considerable issues concerning privacy, an issue that promises to become more complicated to navigate now that an individual's digital footprint presents potentially unimaginable uses as technology develops (see Léon and Cristia 2024). Governments are now protecting their citizens through important privacy laws, giving individuals power to control their data. The end result is that posting data on CHILDES and similar repositories is likely to evolve into a process of continual consent and heightened access restrictions. A related point that is relevant to Indigenous cultures is the issue of data sovereignty (Russo Carroll et al. 2021). Groups whose languages and cultures have been decimated by colonialism quite rightly demand sovereignty over any data collected in their communities (e.g., in Australia, it is considered best practice). Many groups see anything less as a form of epistemic violence that perpetuates colonialism (Woods 2022).The skew in language coverage raises clear difficulties in generalisation: if we sample from a restricted corner of the total design space of language then we cannot know how phenomena outside of the design space are acquired. However, the degree to which the other demographic biases identified by Scaff et al. (2025)—i.e., socioeconomic status (SES), urbanisation, and family structure—limit generalisation is less clear. As sources of within-language variation, their influence on acquisition will be via the way they change the language environment and other more distal variables that might have a broader influence on cognitive development (e.g., due to increased neurotoxin levels in urban environments, Shadbegian et al. 2019). Distal environmental variables will affect acquisition via complex developmental cascades, and we leave discussion of these for another time. Concerning variation in the input, the key questions are: (i) in what ways does variation in the input affect acquisition? And (ii) can we build theoretical models that account for such variability?Concerning (i), the answer is complex and, in some cases, not without controversy. Take, for example, SES, the most well-studied variable on the list. One underappreciated problem in discussions of SES and its influence on development is that, as a kind of formative measurement (i.e., it is only measurable through the measurement of other variables, e.g., Bainter and Bollen 2014), it is defined based on several many interacting indicators (e.g., parental education, income, postcode), making it difficult to pin down exactly why it may influence an outcome variable. Additionally, as a distinctly culturally-specific concept, it will be defined differently and will likely have a variable influence on development across countries (Singh et al. 2025). Ambiguity about how it influences acquisition has led to intense debate in countries like the USA (e.g., Sperry et al. 2019; Golinkoff et al. 2019). Debates on its role in acquisition have centred on attainment, but we might instead ask: does variation in the input due to SES have a significant enough effect on the process of acquisition, such that we should not expect to see the same underlying developmental mechanisms at play? If not, then we may be able to make cautious generalisations beyond individual cases. Regardless, we agree with Scaff et al. that good metadata for corpora should include this kind of background information.Concerning (ii), building a theory of how variation in the input influences acquisition is no small task, in part because it will interact with endogenous sources of variation (e.g., Eghbalzad et al. 2021). Regardless, understanding when and how variation matters has the potential to provide valuable insights into mechanisms underlying acquisition (Kidd and Donnelly 2020).The cognitive and psychological sciences are currently within an extended period of soul-searching concerning their ability to explain how all humans learn, think, and feel. The crux of this is the realisation that the modern study of human psychology has primarily been a Western (or one might say, WEIRD, Henrich et al. 2010) concern (or perhaps more accurately, dominated by Western European academic traditions). In most cases, ‘the field’, broadly construed, is poorly equipped to engage with issues of cultural influences on behaviour because of a range of barriers, including a lack of researcher diversity, a bias for inferring universals from skewed data, and a lack of meaningful interaction with adjacent disciplines like anthropology (see Núñez et al. 2019). Child language has always had a proud history of crosslinguistic research (e.g., Slobin 1985), and its location at the intersection of psychology, linguistics, anthropology, and computer science means it has the potential to act as a leader in discussions about interdisciplinarity, generalisability, and the necessity of inclusion. Raising awareness of gaps in our data coverage, as Scaff et al. (2025) have done, is an important step in this process.\",\"PeriodicalId\":48392,\"journal\":{\"name\":\"Developmental Science\",\"volume\":\"28 5\",\"pages\":\"\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2025-07-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1111/desc.70050\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Developmental Science\",\"FirstCategoryId\":\"102\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/desc.70050\",\"RegionNum\":1,\"RegionCategory\":\"心理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"PSYCHOLOGY, DEVELOPMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Developmental Science","FirstCategoryId":"102","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/desc.70050","RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PSYCHOLOGY, DEVELOPMENTAL","Score":null,"Total":0}

引用次数: 0

摘要

儿童语言数据交换系统（CHILDES, MacWhinney 2000）是儿童语言研究领域的一颗明珠。它出现于20世纪80年代，目的是存档和促进宝贵的、费力收集和处理的语料库数据的共享和再利用（MacWhinney and Snow 1985），其前瞻性思想比现代开放科学运动早了几十年。由于Brian MacWhinney等人的辛勤工作，它不断扩大，诞生了类似的存储库(AphasiaBank, Forbes et al. 2012；HomeBank, VanDam et al. 2016)，毫无疑问也启发了其他人（例如，WordBank, Frank et al. 2017）。由于它的存在，儿童语言习得领域的进展无疑加快了。然而，正如Scaff et al.（2025）在他们的论文中所显示的那样，CHILDES中的数据并不能完全代表世界上的语言和学习这些语言的儿童。尽管包含了几十种语言的语料库，但这些语言主要是印欧语，数据大多来自富裕国家富裕的城市核心家庭。他们的结论是，由于数据的这种偏差，研究人员应该注意从数据中进行概括。在这篇评论中，我们讨论了Scaff等人（2025）论文提出的两个问题，解决了(i)儿童语言研究中数据覆盖的重要性，以及（ii）人口统计学变量允许从现有语料库数据进行概括的程度。认知科学和心理科学的研究绝大多数依赖于方便的抽样。在语言研究中，方便抽样有两种形式：目的语选择和被试选择。两者都包含可变性。世界各地使用的大约7000种语言在许多不同的维度上各不相同，因此很难确定实质性的共性（Evans和Levinson 2009），并且随着语言文档发现新现象，语言可能的边界继续扩大（Seifart et al. 2018）。然而，我们距离理解语言的众多特征是如何习得的还有很长的路要走。在一篇分析了45年来儿童语言期刊上的语言覆盖范围的论文中，我们发现，文献中有很大的倾向于英语和少数其他语言，主要是印欧语言（Kidd和Garcia 2022a）。Passmore等人（2025）将这些数据与两种形式的数据并列：(i)扩展分级代际中断量表(EGIDS， Eberhard等人，2021；Lewis and Simons 2010)，根据语言的活力（即，从英语和西班牙语等健康的民族语言到没有当前使用者的“睡眠”语言）对语言进行排名，以及（ii）用于表示音韵学设计空间的大型语言数据库（Phoible: Moran and McCloy 2019）和语法（Grambank: skirg<s:1>等人，2023）。他们表明，对儿童语言的研究绝大多数集中在方便抽样的大型（即被许多人广泛使用的）民族语言上，而牺牲了较小的脆弱语言，而且所研究的语言只占语言估计设计空间的一小部分。因此，方便的抽样是有科学代价的，限制了我们理解人类语言设计空间的巨大角落是如何获得的能力。此外，还存在人道主义成本——忽视脆弱语言会限制为阻止语言持续流失做出贡献的潜力(Evans 2022；派伊2021)。Scaff等人（2025）确定了语言覆盖和参与者选择的局限性。扭曲的语言覆盖范围并不是儿童语言所独有的，它表明了语言认知科学中一个更广泛的问题(见Berghoff和Bylund 2025；Bylund et al. 2024)。没有简单的解决方案，但几乎可以肯定，这涉及到该领域研究人员的多样化(Aravena-Bravo等人，2023；基德和加西亚2022b)。在弱势语言的情况下，扩大参与，包括社区成员创建符合社区目标的项目（例如，创造知识以告知教育优先事项），可以增加社会价值（Passmore et al. 2025）。语料库数据是这项工作的重要资源，但语料库的创建需要大量的时间和资源。该领域的黄金标准是以一定的采样密度从几个孩子那里收集纵向语料库，这使得研究人员能够捕捉到大范围的频繁和不频繁现象的采集。然而，达到这样的深度是以语言覆盖的广度为代价的。像草图获取项目这样的计划试图通过牺牲深度来增加覆盖的广度（Hellwig et al. 2023），这有望解决语言覆盖中的一些问题。录音技术的发展也将加速语料库的发展（例如，Scaff等人，2024），有望缓解转录瓶颈。语言覆盖范围的扩大和技术发展的快速步伐要求对伦理过程进行重新评估。CHILDES数据库开创了数据共享的先路，其价值不可估量。考虑到与儿童一起工作的道德规范，这是可能的，因为儿童不能直接同意他们的参与。视频的标准使用和全天录像的出现，捕捉了家庭最不受保护的时刻，引发了相当大的隐私问题，随着技术的发展，个人的数字足迹可能会出现难以想象的用途，这个问题将变得更加复杂。政府现在正在通过重要的隐私法保护公民，赋予个人控制数据的权力。最终的结果是，在CHILDES和类似的存储库上发布数据可能会演变成一个持续同意和提高访问限制的过程。与土著文化相关的一个相关点是数据主权问题（Russo Carroll et al. 2021）。语言和文化被殖民主义摧毁的群体理所当然地要求对其社区收集的任何数据拥有主权（例如，在澳大利亚，这被认为是最佳实践）。许多团体认为，任何不平等都是一种延续殖民主义的认知暴力形式（Woods 2022）。语言覆盖范围的倾斜给泛化带来了明显的困难：如果我们从语言的总设计空间的一个有限角落进行采样，那么我们就无法知道设计空间之外的现象是如何获得的。然而，Scaff等人（2025）确定的其他人口统计学偏差的程度-即。例如，社会经济地位（SES）、城市化和家庭结构限制的概括不太清楚。作为语言内变异的来源，它们对习得的影响将通过它们改变语言环境的方式和其他可能对认知发展产生更广泛影响的更远的变量（例如，由于城市环境中神经毒素水平的增加，Shadbegian et al. 2019）。远端环境变量将通过复杂的发育级联影响习得，我们将把这些留到另一个时间讨论。关于输入的变化，关键问题是：(i)输入的变化以何种方式影响习得？（二）我们能否建立理论模型来解释这种可变性？关于(1)，答案是复杂的，在某些情况下，并非没有争议。以社会经济地位为例，这是榜单上研究得最充分的变量。在讨论社会经济地位及其对发展的影响时，一个被低估的问题是，作为一种形成性测量（即，它只能通过测量其他变量来测量，例如，Bainter和Bollen 2014），它是基于几个相互作用的指标（例如，父母教育，收入，邮政编码）来定义的，因此很难确切地确定它为什么会影响一个结果变量。此外，作为一个明显的文化特定概念，它将被不同地定义，并可能对各国的发展产生不同的影响（Singh et al. 2025）。关于它如何影响收购的模糊性在美国等国家引发了激烈的争论(例如，Sperry等人。2019；Golinkoff et al. 2019)。关于社会经济地位在习得中的作用的争论集中在成就上，但我们可能会问：由于社会经济地位而产生的输入变化是否对习得过程有足够大的影响，以至于我们不应该期望看到同样的潜在发展机制在起作用？如果没有，那么我们也许能够在个别案例之外做出谨慎的概括。无论如何，我们同意Scaff等人的观点，即语料库的良好元数据应该包括这种背景信息。关于（ii），建立一个关于输入的变化如何影响习得的理论不是一项小任务，部分原因是它将与内源性变化源相互作用（例如，Eghbalzad et al. 2021）。无论如何，了解变异何时以及如何起作用，有可能为获取机制提供有价值的见解（Kidd和Donnelly 2020）。认知科学和心理科学目前正处于一个长期的自我反省时期，他们是否有能力解释所有人类是如何学习、思考和感受的。问题的关键在于人们认识到，人类心理学的现代研究主要是西方（或者有人可能会说，WEIRD, Henrich et al. 2010）所关注的（或者更准确地说，是由西欧学术传统主导的）。在大多数情况下，由于一系列障碍，“该领域”在处理文化对行为的影响问题方面能力不足，包括缺乏研究人员的多样性，从扭曲的数据中推断普遍性的偏见，以及与人类学等邻近学科缺乏有意义的互动（见Núñez et al. 2019）。儿童语言在跨语言研究方面一直有着令人骄傲的历史（例如，Slobin 1985），它位于心理学、语言学、人类学和计算机科学的交叉点，这意味着它有潜力在跨学科、普遍性和包容性的必要性的讨论中发挥领导作用。正如Scaff等人（2025）所做的那样，提高对数据覆盖差距的认识是这一过程中的重要一步。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

On Convenience, Diversity, and Generalisability: A Commentary on Scaff et al. (2025)

The Child Language Data Exchange System (CHILDES, MacWhinney 2000) is the jewel in the crown of child language research. Emerging in the 1980s to archive and facilitate the sharing and re-use of precious and laborious-to-collect-and-process corpus data (MacWhinney and Snow 1985), its forward-thinking ethos predated the modern Open Science movement by decades. Thanks to the hard work of Brian MacWhinney and others, it has continued to expand, has birthed similar repositories (AphasiaBank, Forbes et al. 2012; HomeBank, VanDam et al. 2016), and no doubt inspired others (e.g., WordBank, Frank et al. 2017). Progress in the field of child language acquisition has unquestionably accelerated because of its existence. Yet, as Scaff et al. (2025) show in their paper, the data in CHILDES are not fully representative of the languages of the world and the children who learn them. Despite containing corpora on many dozens of languages, those languages are predominantly Indo-European, with the data mostly coming from affluent urban nuclear families in wealthy countries. They conclude that, because of this skew in the data, researchers should be mindful of generalising from the data.

In this commentary, we discuss two issues that the Scaff et al. (2025) paper raises, addressing (i) the importance of data coverage in studies of child language, and (ii) the extent to which demographic variables allow generalisations from existing corpus data.

Research in the cognitive and psychological sciences overwhelmingly relies on convenience sampling. In studies of language, convenience sampling comes in two forms—selection of the target language(s) and selection of participants. Both contain variability. The circa 7000 languages spoken across the world vary on many different dimensions, such that it is difficult to identify substantive universals (Evans and Levinson 2009), and the boundaries of what is possible in language continue to expand as language documentation uncovers new phenomena (Seifart et al. 2018). However, we are a long way from understanding how numerous features of language are acquired. In a paper that analysed language coverage in child language journals across 45 years of publishing, we found a large skew in the literature towards English and a handful of other, mostly Indo-European languages (Kidd and Garcia 2022a). Passmore et al. (2025) juxtaposed those data against two forms of data: (i) the Expanded Graded Intergenerational Disruption Scale (EGIDS, Eberhard et al. 2021; Lewis and Simons 2010), which ranks languages according to their vitality (i.e., their vulnerability to loss, from healthy national languages like English and Spanish to ‘sleeping’ languages with no current speakers), and (ii) large language databases used to represent the design space of phonology (Phoible: Moran and McCloy 2019) and grammar (Grambank: Skirgård et al. 2023). They showed that studies of child language overwhelmingly concentrate on conveniently sampled big (i.e., widely spoken by many speakers) national languages at the expense of smaller vulnerable languages, and that the languages studied occupy a small subset of the estimated design space of language. Thus, convenience sampling has a scientific cost, limiting our capacity to understand how the vast corners of the design space for human language are acquired. There is also a humanitarian cost—ignoring vulnerable languages limits the potential to contribute to efforts to stem the continual flow of language loss (Evans 2022; Pye 2021).

Scaff et al. (2025) identify limitations in language coverage and in participant selection. Skewed language coverage is not unique to child language and is indicative of a broader problem in the cognitive science of language (see Berghoff and Bylund 2025; Bylund et al. 2024). There is no easy solution, but it almost certainly involves diversifying the researchers in the field (Aravena-Bravo et al. 2023; Kidd and Garcia 2022b). In cases of vulnerable languages, widening participation to include community members to create projects that align with community goals (e.g., creating knowledge to inform educational priorities) adds social value (Passmore et al. 2025). Corpus data are an important resource in this endeavour, but the creation of corpora takes considerable time and resources. The gold standard in the field is to collect longitudinal corpora from several children at a sampling density that allows the researcher to capture the acquisition of a large range of frequent and infrequent phenomena. However, achieving such depth comes at the expense of breadth of language coverage. Initiatives like the Sketch Acquisition Project attempt to increase breadth of coverage by sacrificing depth (Hellwig et al. 2023), which will hopefully address some of the problems in language coverage. Technological development in recording will also speed up corpus development (e.g., Scaff et al. 2024), with much promise for relieving the transcription bottleneck.

Increasing language coverage and the rapid pace of technological development demands a reassessment of ethical processes. The CHILDES database pioneered data sharing, the value of which has been immeasurable. That this has been possible is remarkable given the ethics of working with children, who cannot give direct consent for their participation. The standard use of video and the advent of daylong recordings, which capture families in their most unguarded moments, raise considerable issues concerning privacy, an issue that promises to become more complicated to navigate now that an individual's digital footprint presents potentially unimaginable uses as technology develops (see Léon and Cristia 2024). Governments are now protecting their citizens through important privacy laws, giving individuals power to control their data. The end result is that posting data on CHILDES and similar repositories is likely to evolve into a process of continual consent and heightened access restrictions. A related point that is relevant to Indigenous cultures is the issue of data sovereignty (Russo Carroll et al. 2021). Groups whose languages and cultures have been decimated by colonialism quite rightly demand sovereignty over any data collected in their communities (e.g., in Australia, it is considered best practice). Many groups see anything less as a form of epistemic violence that perpetuates colonialism (Woods 2022).

The skew in language coverage raises clear difficulties in generalisation: if we sample from a restricted corner of the total design space of language then we cannot know how phenomena outside of the design space are acquired. However, the degree to which the other demographic biases identified by Scaff et al. (2025)—i.e., socioeconomic status (SES), urbanisation, and family structure—limit generalisation is less clear. As sources of within-language variation, their influence on acquisition will be via the way they change the language environment and other more distal variables that might have a broader influence on cognitive development (e.g., due to increased neurotoxin levels in urban environments, Shadbegian et al. 2019). Distal environmental variables will affect acquisition via complex developmental cascades, and we leave discussion of these for another time. Concerning variation in the input, the key questions are: (i) in what ways does variation in the input affect acquisition? And (ii) can we build theoretical models that account for such variability?

Concerning (i), the answer is complex and, in some cases, not without controversy. Take, for example, SES, the most well-studied variable on the list. One underappreciated problem in discussions of SES and its influence on development is that, as a kind of formative measurement (i.e., it is only measurable through the measurement of other variables, e.g., Bainter and Bollen 2014), it is defined based on several many interacting indicators (e.g., parental education, income, postcode), making it difficult to pin down exactly why it may influence an outcome variable. Additionally, as a distinctly culturally-specific concept, it will be defined differently and will likely have a variable influence on development across countries (Singh et al. 2025). Ambiguity about how it influences acquisition has led to intense debate in countries like the USA (e.g., Sperry et al. 2019; Golinkoff et al. 2019). Debates on its role in acquisition have centred on attainment, but we might instead ask: does variation in the input due to SES have a significant enough effect on the process of acquisition, such that we should not expect to see the same underlying developmental mechanisms at play? If not, then we may be able to make cautious generalisations beyond individual cases. Regardless, we agree with Scaff et al. that good metadata for corpora should include this kind of background information.

Concerning (ii), building a theory of how variation in the input influences acquisition is no small task, in part because it will interact with endogenous sources of variation (e.g., Eghbalzad et al. 2021). Regardless, understanding when and how variation matters has the potential to provide valuable insights into mechanisms underlying acquisition (Kidd and Donnelly 2020).

The cognitive and psychological sciences are currently within an extended period of soul-searching concerning their ability to explain how all humans learn, think, and feel. The crux of this is the realisation that the modern study of human psychology has primarily been a Western (or one might say, WEIRD, Henrich et al. 2010) concern (or perhaps more accurately, dominated by Western European academic traditions). In most cases, ‘the field’, broadly construed, is poorly equipped to engage with issues of cultural influences on behaviour because of a range of barriers, including a lack of researcher diversity, a bias for inferring universals from skewed data, and a lack of meaningful interaction with adjacent disciplines like anthropology (see Núñez et al. 2019). Child language has always had a proud history of crosslinguistic research (e.g., Slobin 1985), and its location at the intersection of psychology, linguistics, anthropology, and computer science means it has the potential to act as a leader in discussions about interdisciplinarity, generalisability, and the necessity of inclusion. Raising awareness of gaps in our data coverage, as Scaff et al. (2025) have done, is an important step in this process.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Developmental Science Multiple-

CiteScore

8.10

自引率

8.10%

发文量

132

期刊介绍： Developmental Science publishes cutting-edge theory and up-to-the-minute research on scientific developmental psychology from leading thinkers in the field. It is currently the only journal that specifically focuses on human developmental cognitive neuroscience. Coverage includes: - Clinical, computational and comparative approaches to development - Key advances in cognitive and social development - Developmental cognitive neuroscience - Functional neuroimaging of the developing brain