SUBTLEX-CY: A new word frequency database for Welsh.

Walter Jb van Heuven, Joshua S Payne, Manon W Jones
Quarterly Journal of Experimental Psychology, pp. 1052-1067. Published 2024-05-01 (Epub 2023-08-30). Journal Article.
DOI: https://doi.org/10.1177/17470218231190315
PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11032624/pdf/

Abstract

We present SUBTLEX-CY, a new word frequency database created from a 32-million-word corpus of Welsh television subtitles. A lexical decision experiment tested the SUBTLEX-CY frequency estimates using words whose frequencies differed between SUBTLEX-CY and a much smaller Welsh corpus often used by researchers, the Cronfa Electroneg o'r Gymraeg (CEG), as well as three other Welsh word frequency databases. Words classified as low frequency (LF) in SUBTLEX-CY but high frequency (HF) in CEG were compared with words classified as medium frequency (MF) in both SUBTLEX-CY and CEG. Reaction time analyses showed that the words rated HF in CEG were responded to more slowly than the MF words, suggesting that the SUBTLEX-CY corpus provides a more reliable estimate of Welsh word frequencies. The new database, which also includes part-of-speech, contextual diversity, and other lexical information, is freely available for research purposes on the Open Science Framework repository at https://osf.io/9gkqm/.
