On Pawley’s conjecture
Koenraad Kuiper
DOI: 10.1515/phras-2015-0008 (https://doi.org/10.1515/phras-2015-0008)
Published: 2015-01-01
Abstract
This paper confirms Pawley’s conjecture that the frequency of lexical items in text corpora is positively correlated with the number of phrasal lexical items (PLIs) that have those lexical items as heads of phrase. Data for testing the conjecture are taken from two sources: Kilgarriff’s lemmatized frequency lists from the BNC, covering the 6,318 words that appear more than 800 times (http://www.kilgarriff.co.uk), and the approximately 14,000 PLIs in the Syntactically Annotated Idiom Dictionary (Kuiper et al., 2003). Why this statistical relationship holds is a matter for further research.
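As a rough illustration of the kind of test described, the sketch below counts how many PLIs each word heads and correlates those counts with corpus frequency. The abstract does not say which statistic was used, so Spearman's rank correlation is an assumption here, and the words, frequencies, and PLI heads are invented toy values, not data from the BNC or the idiom dictionary.

```python
from collections import Counter


def ranks(values):
    """Return 1-based ranks, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / (vx * vy) ** 0.5


def spearman(xs, ys):
    """Spearman's rho: Pearson correlation computed on the ranks."""
    return pearson(ranks(xs), ranks(ys))


# Hypothetical toy data (assumption): lemma -> corpus frequency.
corpus_frequency = {"take": 5000, "hand": 3000, "kick": 900, "bucket": 850}
# Hypothetical toy data (assumption): the head word of each PLI.
pli_heads = ["take", "take", "take", "hand", "hand", "kick"]

pli_count = Counter(pli_heads)
words = sorted(corpus_frequency)
freqs = [corpus_frequency[w] for w in words]
counts = [pli_count[w] for w in words]  # Counter yields 0 for words heading no PLI

rho = spearman(freqs, counts)
print(rho)
```

In this toy data the more frequent words head more PLIs without exception, so the coefficient comes out at its maximum of 1.0. Real data would give a weaker value, and a significance test would also be needed.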