Flynn effects are biased by differential item functioning over time: A test using overlapping items in Wechsler scales

IF 4.3 | Zone 3 (Materials Science) | Q1, Engineering, Electrical & Electronic
Corentin Gonthier, Jacques Grégoire
DOI: 10.1016/j.intell.2022.101688
Publication date: 2022-11-01 (journal article)
URL: https://www.sciencedirect.com/science/article/pii/S0160289622000691
Citations: 5

Abstract

The items of intelligence tests can demonstrate differential item functioning across different groups: cross-sample differences in item difficulty or discrimination, independently of any difference in ability. This is also true of comparisons over time: as the cultural context changes, items may increase or decrease in difficulty. This phenomenon is well known, but its impact on estimates of the Flynn effect has not been systematically investigated. In the current study, we tested differential item functioning in a subset of 111 items common to consecutive versions of the French WAIS-R (1989), WAIS-III (1999) and/or WAIS-IV (2009), using the three normative samples (total N = 2979). Over half the items showed significant differential item functioning over time, generally becoming more difficult from one version to the next for the same level of ability. The magnitude of differential item functioning tended to be small for each item separately, but the cumulative effect over all items led to underestimating the Flynn effect by about 3 IQ points per decade, a bias close to the expected size of the effect itself. Here, the bias substantially affected the conclusions, even creating an ersatz negative Flynn effect for the 1999–2009 period: once differential item functioning was accounted for, ability in fact increased (1989–1999) or stagnated (1999–2009). We recommend that studies of the Flynn effect systematically investigate the possibility of differential item functioning to obtain unbiased ability estimates.
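
To make the mechanism described in the abstract concrete, the following is a minimal sketch of how a small upward drift in item difficulty can mask a real gain in ability. It is not the authors' analysis: the abstract does not state which measurement model was used, so a one-parameter (Rasch) model is assumed here, and the item difficulties, the drift size and all function names are hypothetical values chosen only for illustration.

```python
import math

def p_correct(theta, difficulty):
    """Rasch model (assumed): probability that a person with ability
    theta answers an item of the given difficulty correctly."""
    return 1.0 / (1.0 + math.exp(-(theta - difficulty)))

def expected_score(theta, difficulties):
    """Expected number of items answered correctly."""
    return sum(p_correct(theta, b) for b in difficulties)

# Hypothetical item difficulties (in logits) for the earlier cohort.
old_difficulties = [-2.0, -1.0, 0.0, 1.0, 2.0]

# Hypothetical drift: every item becomes slightly harder for the later
# cohort at the same ability level (differential item functioning).
DRIFT = 0.2
new_difficulties = [b + DRIFT for b in old_difficulties]

# Same ability, different cohorts: the raw score drops although
# ability has not changed.
score_old = expected_score(0.0, old_difficulties)
score_new_same_ability = expected_score(0.0, new_difficulties)

# A true ability gain equal to the drift is cancelled out: the later
# cohort obtains the same expected raw score as the earlier one.
score_new_higher_ability = expected_score(DRIFT, new_difficulties)

print(f"earlier cohort, theta = 0.0: {score_old:.3f}")
print(f"later cohort,   theta = 0.0: {score_new_same_ability:.3f}")
print(f"later cohort,   theta = {DRIFT}: {score_new_higher_ability:.3f}")
```

Under these assumptions, comparing raw scores across cohorts would suggest a decline when ability is constant, and no change when ability has in fact risen by the size of the drift. This is the direction of bias the abstract reports, although the real data involve item-specific rather than uniform drift.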
