Disambiguation of newly derived nominalizations in context: A Distributional Semantics approach

IF 1.1 0 LANGUAGE & LINGUISTICS

Word Structure Pub Date : 2018-10-25 DOI:10.3366/WORD.2018.0131

Gabriella Lapesa, Lea Kawaletz, I. Plag, M. Andreou, M. Kisselew, Sebastian Padó

{"title":"Disambiguation of newly derived nominalizations in context: A Distributional Semantics approach","authors":"Gabriella Lapesa, Lea Kawaletz, I. Plag, M. Andreou, M. Kisselew, Sebastian Padó","doi":"10.3366/WORD.2018.0131","DOIUrl":null,"url":null,"abstract":"One of the central problems in the semantics of derived words is polysemy (see, for example, the recent contributions by Lieber 2016 and Plag et al. 2018 ). In this paper, we tackle the problem of disambiguating newly derived words in context by applying Distributional Semantics ( Firth 1957 ) to deverbal -ment nominalizations (e.g. bedragglement, emplacement). We collected a dataset containing contexts of low frequency deverbal -ment nominalizations (55 types, 406 tokens, see Appendix B) extracted from large corpora such as the Corpus of Contemporary American English. We chose low frequency derivatives because high frequency formations are often lexicalized and thus tend to not exhibit the kind of polysemous readings we are interested in. Furthermore, disambiguating low-frequency words presents an especially difficult task because there is little to no prior knowledge about these words from which their semantic properties can be extrapolated. The data was manually annotated according to eventive vs. non-eventive interpretations, allowing also an ambiguous label in those cases where the context did not disambiguate. Our question then was to what extent, and under which conditions, context-derived representations such as those of Distributional Semantics can be successfully employed in the disambiguation of low-frequency derivatives. Our results show that, first, our models are able to distinguish between eventive and non-eventive readings with some success. Second, very small context windows are sufficient to find the intended interpretation in the majority of cases. Third, ambiguous instances tend to be classified as events. Fourth, the performance of the classifier differed for different subcategories of nouns, with non-eventive derivatives being harder to classify correctly. We present indirect evidence that this is due to the semantic similarity of abstract non-eventive nouns to eventive nouns. Overall, this paper demonstrates that distributional semantic models can be fruitfully employed for the disambiguation of low frequency words in spite of the scarcity of available contextual information. 1","PeriodicalId":43166,"journal":{"name":"Word Structure","volume":"1 1","pages":""},"PeriodicalIF":1.1000,"publicationDate":"2018-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Word Structure","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3366/WORD.2018.0131","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}

引用次数: 10

Abstract

One of the central problems in the semantics of derived words is polysemy (see, for example, the recent contributions by Lieber 2016 and Plag et al. 2018 ). In this paper, we tackle the problem of disambiguating newly derived words in context by applying Distributional Semantics ( Firth 1957 ) to deverbal -ment nominalizations (e.g. bedragglement, emplacement). We collected a dataset containing contexts of low frequency deverbal -ment nominalizations (55 types, 406 tokens, see Appendix B) extracted from large corpora such as the Corpus of Contemporary American English. We chose low frequency derivatives because high frequency formations are often lexicalized and thus tend to not exhibit the kind of polysemous readings we are interested in. Furthermore, disambiguating low-frequency words presents an especially difficult task because there is little to no prior knowledge about these words from which their semantic properties can be extrapolated. The data was manually annotated according to eventive vs. non-eventive interpretations, allowing also an ambiguous label in those cases where the context did not disambiguate. Our question then was to what extent, and under which conditions, context-derived representations such as those of Distributional Semantics can be successfully employed in the disambiguation of low-frequency derivatives. Our results show that, first, our models are able to distinguish between eventive and non-eventive readings with some success. Second, very small context windows are sufficient to find the intended interpretation in the majority of cases. Third, ambiguous instances tend to be classified as events. Fourth, the performance of the classifier differed for different subcategories of nouns, with non-eventive derivatives being harder to classify correctly. We present indirect evidence that this is due to the semantic similarity of abstract non-eventive nouns to eventive nouns. Overall, this paper demonstrates that distributional semantic models can be fruitfully employed for the disambiguation of low frequency words in spite of the scarcity of available contextual information. 1

查看原文本刊更多论文

语境中新派生名词化的歧义消解：分布语义学方法

派生词语义的核心问题之一是多义（例如，参见Lieber 2016和Plag等人2018的最新贡献）。在本文中，我们通过将分布语义学（Firth 1957）应用于名词化的发展（如铺位、铺位）来解决上下文中新派生词的歧义消除问题。我们收集了一个数据集，其中包含从大型语料库（如《当代美国英语语料库》）中提取的低频名词化上下文（55种类型，406个标记，见附录B）。我们选择低频导数是因为高频成分经常被词汇化，因此往往不会表现出我们感兴趣的多义词阅读。此外，消除低频词的歧义是一项特别困难的任务，因为几乎没有关于这些词的先验知识，可以从中推断出它们的语义特性。根据事件与非事件的解释手动注释数据，在上下文没有消除歧义的情况下，也允许使用歧义标签。我们当时的问题是，在什么程度上，在什么条件下，上下文派生的表示，如分布语义的表示，可以成功地用于低频导数的消歧。我们的结果表明，首先，我们的模型能够区分事件和非事件读数，并取得了一些成功。其次，在大多数情况下，非常小的上下文窗口足以找到预期的解释。第三，模棱两可的实例往往被归类为事件。第四，分类器的性能因名词的不同子类别而异，非事件派生词更难正确分类。我们提供的间接证据表明，这是由于抽象的非事件性名词与事件性名词的语义相似。总之，本文证明，尽管缺乏可用的上下文信息，但分布语义模型可以有效地用于低频词的消歧。1.

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Word Structure LANGUAGE & LINGUISTICS-

CiteScore

1.60

自引率

0.00%

发文量