大规模英语语料库中区分大小写的字母和双字母频率计数。

Behavior research methods, instruments, & computers : a journal of the Psychonomic Society, Inc Pub Date : 2004-08-01 DOI:10.3758/bf03195586

Michael N Jones, D J K Mewhort

{"title":"大规模英语语料库中区分大小写的字母和双字母频率计数。","authors":"Michael N Jones, D J K Mewhort","doi":"10.3758/bf03195586","DOIUrl":null,"url":null,"abstract":"We tabulated upper- and lowercase letter frequency using several large-scale English corpora (approximately 183 million words in total). The results indicate that the relative frequencies for upper- and lowercase letters are not equivalent. We report a letter-naming experiment in which uppercase frequency predicted response time to uppercase letters better than did lowercase frequency. Tables of case-sensitive letter and bigram frequency are provided, including common nonalphabetic characters. Because subjects are sensitive to frequency relationships among letters, we recommend that experimenters use case-sensitive counts when constructing stimuli from letters.","PeriodicalId":79800,"journal":{"name":"Behavior research methods, instruments, & computers : a journal of the Psychonomic Society, Inc","volume":"36 3","pages":"388-96"},"PeriodicalIF":0.0000,"publicationDate":"2004-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.3758/bf03195586","citationCount":"114","resultStr":"{\"title\":\"Case-sensitive letter and bigram frequency counts from large-scale English corpora.\",\"authors\":\"Michael N Jones, D J K Mewhort\",\"doi\":\"10.3758/bf03195586\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We tabulated upper- and lowercase letter frequency using several large-scale English corpora (approximately 183 million words in total). The results indicate that the relative frequencies for upper- and lowercase letters are not equivalent. We report a letter-naming experiment in which uppercase frequency predicted response time to uppercase letters better than did lowercase frequency. Tables of case-sensitive letter and bigram frequency are provided, including common nonalphabetic characters. Because subjects are sensitive to frequency relationships among letters, we recommend that experimenters use case-sensitive counts when constructing stimuli from letters.\",\"PeriodicalId\":79800,\"journal\":{\"name\":\"Behavior research methods, instruments, & computers : a journal of the Psychonomic Society, Inc\",\"volume\":\"36 3\",\"pages\":\"388-96\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2004-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.3758/bf03195586\",\"citationCount\":\"114\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Behavior research methods, instruments, & computers : a journal of the Psychonomic Society, Inc\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3758/bf03195586\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Behavior research methods, instruments, & computers : a journal of the Psychonomic Society, Inc","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3758/bf03195586","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 114

摘要

我们使用几个大型英语语料库(总共约1.83亿个单词)将大写字母和小写字母的频率制成表格。结果表明，大写字母和小写字母的相对频率不相等。我们报告了一个字母命名实验，其中大写频率比小写频率更能预测对大写字母的响应时间。提供了区分大小写的字母和双字母频率表，包括常见的非字母字符。由于受试者对字母之间的频率关系很敏感，我们建议实验者在从字母构建刺激时使用区分大小写的计数。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Case-sensitive letter and bigram frequency counts from large-scale English corpora.

We tabulated upper- and lowercase letter frequency using several large-scale English corpora (approximately 183 million words in total). The results indicate that the relative frequencies for upper- and lowercase letters are not equivalent. We report a letter-naming experiment in which uppercase frequency predicted response time to uppercase letters better than did lowercase frequency. Tables of case-sensitive letter and bigram frequency are provided, including common nonalphabetic characters. Because subjects are sensitive to frequency relationships among letters, we recommend that experimenters use case-sensitive counts when constructing stimuli from letters.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Behavior research methods, instruments, & computers : a journal of the Psychonomic Society, Inc

自引率

0.00%

发文量