Beyond Frequency: On the Dictionarization of New Words in Spanish

Q2 Arts and Humanities
Dictionaries Pub Date : 2020-05-15 DOI:10.1353/dic.2020.0008
J. Freixa, Sergi Torner
{"title":"Beyond Frequency: On the Dictionarization of New Words in Spanish","authors":"J. Freixa, Sergi Torner","doi":"10.1353/dic.2020.0008","DOIUrl":null,"url":null,"abstract":"ABSTRACT:The most recent literature on neology has discussed the criteria that must be taken into account in order to include new words in dictionaries (Metcalf 2002, Ishikawa 2006, O'Donovan and O'Neill 2008, Cook 2010, Freixa 2016, among others). Although there are other factors that must be considered, such as morphologic features and semantic transparency (Adelstein and Freixa 2013, Bernal et al. 2018), authors broadly agree that frequency plays a central role, given that high corpus frequency may be taken as evidence of the institutionalization of a lexical unit. However, it has also been pointed out that frequency is a complex criterion in itself, and, therefore, aspects such as stabilization in use (Cook 2010) or a possible longitudinal change in frequency (Metcalf 2002, Ishikawa 2006) must also be taken into account when measuring frequency in corpora.In this paper, we approach lexical frequency as a criterion to evaluate whether neologisms should be included in Spanish dictionaries from a new perspective. Specifically, we compare data concerning change in frequency of neologisms through time with speakers' perceptions about their novelty, known as \"neological feeling\" in the specialized literature (Gardin et al. 1974, Sablayrolles 2003). Data about speakers' perceptions is obtained from online questionnaires carried out within the framework of the Neómetro project (Bernal et al. in press). A set of questionnaires was launched in which 100 subjects evaluated their perceptions of 130 neologisms in Spanish according to four different criteria (correct formation, frequency, novelty and necessity of inclusion in dictionaries). Frequency data is taken from an extensive corpus of texts from the press, Factiva, which provides histograms of frequency through time.For this study, we analyze the neologisms that were perceived as the most and the least frequent in the questionnaires. We analyze their frequency curve through time in Factiva to find correlations between stabilization in time and speakers' perceptions of their institutionalization. The data allows us to improve the predictive capacity of frequency as a measure to decide which neologisms should be included in dictionaries, as it introduces factors (formal, semantic or usage) that favor or hinder institutionalization into the equation.","PeriodicalId":35106,"journal":{"name":"Dictionaries","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2020-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1353/dic.2020.0008","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Dictionaries","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1353/dic.2020.0008","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Arts and Humanities","Score":null,"Total":0}
引用次数: 5

Abstract

ABSTRACT:The most recent literature on neology has discussed the criteria that must be taken into account in order to include new words in dictionaries (Metcalf 2002, Ishikawa 2006, O'Donovan and O'Neill 2008, Cook 2010, Freixa 2016, among others). Although there are other factors that must be considered, such as morphologic features and semantic transparency (Adelstein and Freixa 2013, Bernal et al. 2018), authors broadly agree that frequency plays a central role, given that high corpus frequency may be taken as evidence of the institutionalization of a lexical unit. However, it has also been pointed out that frequency is a complex criterion in itself, and, therefore, aspects such as stabilization in use (Cook 2010) or a possible longitudinal change in frequency (Metcalf 2002, Ishikawa 2006) must also be taken into account when measuring frequency in corpora.In this paper, we approach lexical frequency as a criterion to evaluate whether neologisms should be included in Spanish dictionaries from a new perspective. Specifically, we compare data concerning change in frequency of neologisms through time with speakers' perceptions about their novelty, known as "neological feeling" in the specialized literature (Gardin et al. 1974, Sablayrolles 2003). Data about speakers' perceptions is obtained from online questionnaires carried out within the framework of the Neómetro project (Bernal et al. in press). A set of questionnaires was launched in which 100 subjects evaluated their perceptions of 130 neologisms in Spanish according to four different criteria (correct formation, frequency, novelty and necessity of inclusion in dictionaries). Frequency data is taken from an extensive corpus of texts from the press, Factiva, which provides histograms of frequency through time.For this study, we analyze the neologisms that were perceived as the most and the least frequent in the questionnaires. We analyze their frequency curve through time in Factiva to find correlations between stabilization in time and speakers' perceptions of their institutionalization. The data allows us to improve the predictive capacity of frequency as a measure to decide which neologisms should be included in dictionaries, as it introduces factors (formal, semantic or usage) that favor or hinder institutionalization into the equation.
超越频率:论西班牙语新词的词典化
摘要:最近关于新词的文献讨论了词典收录新词必须考虑的标准(Metcalf 2002, Ishikawa 2006, O'Donovan and O'Neill 2008, Cook 2010, Freixa 2016等)。尽管必须考虑其他因素,如形态特征和语义透明度(Adelstein and Freixa 2013, Bernal et al. 2018),但作者普遍认为频率起着核心作用,因为高语料库频率可能被视为词汇单位制度化的证据。然而,也有人指出,频率本身是一个复杂的标准,因此,在测量语料库中的频率时,还必须考虑诸如使用稳定性(Cook 2010)或频率可能的纵向变化(Metcalf 2002, Ishikawa 2006)等方面。本文从一个新的角度出发,探讨了词汇频次作为衡量新词是否应纳入西班牙语词典的标准。具体来说,我们比较了有关新词频率随时间变化的数据与说话者对其新颖性的感知,在专业文献中被称为“新词感觉”(Gardin et al. 1974, Sablayrolles 2003)。关于说话者感知的数据是从Neómetro项目框架内进行的在线问卷中获得的(Bernal等人)。研究人员开展了一套调查问卷,让100名受试者根据四个不同的标准(正确的构词方式、使用频率、新颖性和词典收录的必要性)来评估他们对130个西班牙语新词的看法。频率数据取自媒体Factiva的大量文本语料库,该语料库提供频率随时间的直方图。在这项研究中,我们分析了在问卷中被认为是最频繁和最不频繁的新词。我们在Factiva中分析了他们的频率随时间的曲线,以发现时间稳定与说话者对其制度化的看法之间的相关性。这些数据使我们能够提高频率的预测能力,作为决定哪些新词应该被纳入词典的一种措施,因为它引入了有利于或阻碍制度化的因素(形式、语义或用法)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Dictionaries
Dictionaries Arts and Humanities-Language and Linguistics
CiteScore
0.80
自引率
0.00%
发文量
12
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信