The pluralization palette: unveiling semantic clusters in English nominal pluralization through distributional semantics

IF 1.5 0 LANGUAGE & LINGUISTICS
Elnaz Shafaei-Bajestan, Masoumeh Moradipour-Tari, Peter Uhrig, R. Harald Baayen
{"title":"The pluralization palette: unveiling semantic clusters in English nominal pluralization through distributional semantics","authors":"Elnaz Shafaei-Bajestan, Masoumeh Moradipour-Tari, Peter Uhrig, R. Harald Baayen","doi":"10.1007/s11525-024-09428-9","DOIUrl":null,"url":null,"abstract":"<p>Using distributional semantics, we show that English nominal pluralization exhibits semantic clusters. For instance, the change in semantic space from singulars to plurals differs depending on whether a word denotes, e.g., a fruit, or an animal. Languages with extensive noun classes such as Swahili and Kiowa distinguish between these kind of words in their morphology. In English, even though not marked morphologically, plural semantics actually also varies by semantic class. A semantically informed method, CosClassAvg, is introduced that is compared to two other methods, one implementing a fixed shift from singular to plural, and one creating plural vectors from singular vectors using a linear mapping (FRACSS). Compared to FRACSS, CosClassAvg predicted plural vectors that were more similar to the corpus-extracted plural vectors in terms of vector length, but somewhat less similar in terms of orientation. Both FRACSS and CosClassAvg outperform the method using a fixed shift vector to create plural vectors, which does not do justice to the intricacies of English plural semantics. A computational modeling study revealed that the observed difference between the plural semantics generated by these three methods carries over to how well a computational model of the listener can understand previously unencountered plural forms. Among all methods, CosClassAvg provides a good balance for the trade-off between productivity (being able to understand novel plural forms) and faithfulness to corpus-extracted plural vectors (i.e., understanding the particulars of the meaning of a given plural form).</p>","PeriodicalId":51849,"journal":{"name":"Morphology","volume":"16 1","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2024-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Morphology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s11525-024-09428-9","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}
引用次数: 0

Abstract

Using distributional semantics, we show that English nominal pluralization exhibits semantic clusters. For instance, the change in semantic space from singulars to plurals differs depending on whether a word denotes, e.g., a fruit, or an animal. Languages with extensive noun classes such as Swahili and Kiowa distinguish between these kind of words in their morphology. In English, even though not marked morphologically, plural semantics actually also varies by semantic class. A semantically informed method, CosClassAvg, is introduced that is compared to two other methods, one implementing a fixed shift from singular to plural, and one creating plural vectors from singular vectors using a linear mapping (FRACSS). Compared to FRACSS, CosClassAvg predicted plural vectors that were more similar to the corpus-extracted plural vectors in terms of vector length, but somewhat less similar in terms of orientation. Both FRACSS and CosClassAvg outperform the method using a fixed shift vector to create plural vectors, which does not do justice to the intricacies of English plural semantics. A computational modeling study revealed that the observed difference between the plural semantics generated by these three methods carries over to how well a computational model of the listener can understand previously unencountered plural forms. Among all methods, CosClassAvg provides a good balance for the trade-off between productivity (being able to understand novel plural forms) and faithfulness to corpus-extracted plural vectors (i.e., understanding the particulars of the meaning of a given plural form).

Abstract Image

复数化调色板:通过分布语义揭示英语名词复数化中的语义集群
通过分布语义学,我们发现英语名词复数化呈现出语义集群。例如,从单数到复数的语义空间变化取决于一个词是表示水果还是动物。斯瓦希里语和基奥瓦语等拥有大量名词类的语言在词形上对这类词进行了区分。在英语中,复数的语义实际上也因语义类别的不同而有所区别,尽管没有在形态学上进行标记。本文介绍了一种基于语义的方法--CosClassAvg,并将其与其他两种方法进行了比较,一种是从单数到复数的固定转换,另一种是使用线性映射从单数向量创建复数向量(FRACSS)。与 FRACSS 相比,CosClassAvg 预测的复数向量在向量长度方面与语料库提取的复数向量更为相似,但在方向方面则不太相似。FRACSS 和 CosClassAvg 都优于使用固定位移向量来创建复数向量的方法,因为这种方法无法充分发挥英语复数语义的复杂性。计算建模研究表明,这三种方法生成的复数语义之间的差异可以反映出听者的计算模型对以前未遇到过的复数形式的理解程度。在所有方法中,CosClassAvg 在生产率(能够理解新的复数形式)和忠实于语料库提取的复数向量(即理解特定复数形式的特定含义)之间取得了很好的平衡。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Morphology
Morphology LANGUAGE & LINGUISTICS-
CiteScore
3.60
自引率
0.00%
发文量
11
期刊介绍: Aim The aim of Morphology is to publish high quality articles that contribute to the further articulation of morphological theory and linguistic theory in general, or present new and unexplored data. Relevant empirical evidence for the theoretical claims in the articles will be provided by in-depth analyses of specific languages or by comparative, cross-linguistic analyses of the relevant facts. The sources of data can be grammatical descriptions, corpora of data concerning language use and other naturalistic data, and experiments. Scope Morphology publishes articles on morphology proper, as well as articles on the interaction of morphology with phonology, syntax, and semantics, the acquisition and processing of morphological information, the nature of the mental lexicon, and morphological variation and change. Its main focus is on formal models of morphological knowledge, morphological typology (the range and limits of variation in natural languages), the position of morphology in the architecture of the human language faculty, and the evolution and change of language. In addition, the journal deals with the acquisition of morphological knowledge and its role in language processing. Articles on computational morphology and neurolinguistic approaches to morphology are also welcome. The first volume of Morphology appeared as Volume 16 (2006). Previous volumes were published under the title Yearbook of Morphology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信