{"title":"What do (most of) our dispersion measures measure (most)? Dispersion?","authors":"S. Gries","doi":"10.1075/jsls.21029.gri","DOIUrl":null,"url":null,"abstract":"\n This paper discusses the degree to which most of the most widely-used measures of dispersion in corpus linguistics\n are not particularly valid in the sense of actually measuring dispersion rather than some amalgam of a lot of frequency and a\n little dispersion. The paper demonstrates these issues on the basis of data from a variety of corpora. I then outline how to\n design a dispersion measure that only measures dispersion and show that (i) it indeed measures information that is different from\n frequency in an intuitive way and (ii) has a higher degree of predictive power of lexical decision times from the MALD database\n than nearly all other measures in nearly all corpora tested.","PeriodicalId":29903,"journal":{"name":"Journal of Second Language Studies","volume":" ","pages":""},"PeriodicalIF":1.2000,"publicationDate":"2021-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Second Language Studies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1075/jsls.21029.gri","RegionNum":4,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"EDUCATION & EDUCATIONAL RESEARCH","Score":null,"Total":0}
引用次数: 4
Abstract
This paper discusses the degree to which most of the most widely-used measures of dispersion in corpus linguistics
are not particularly valid in the sense of actually measuring dispersion rather than some amalgam of a lot of frequency and a
little dispersion. The paper demonstrates these issues on the basis of data from a variety of corpora. I then outline how to
design a dispersion measure that only measures dispersion and show that (i) it indeed measures information that is different from
frequency in an intuitive way and (ii) has a higher degree of predictive power of lexical decision times from the MALD database
than nearly all other measures in nearly all corpora tested.