{"title":"Semantic Noise and Conceptual Stagnation in Natural Language Processing","authors":"S. de Jager","doi":"10.1080/0969725X.2023.2216555","DOIUrl":null,"url":null,"abstract":"Abstract Semantic noise, the effect ensuing from the denotative and thus functional variability exhibited by different terms in different contexts, is a common concern in natural language processing (NLP). While unarguably problematic in specific applications (e.g., certain translation tasks), the main argument of this paper is that failing to observe this linguistic matter of fact as a generative effect rather than as an obstacle, leads to actual obstacles in instances where language model outputs are presented as neutral. Given that a common and long-standing challenge in NLP is the interpretation of ambiguous – i.e., semantically noisy – cases, this article focuses on an exemplar ambiguity-resolution task in NLP: the problem of anaphora in Winograd schemas. The main question considered is: to what extent is the standard approach to disambiguation in NLP subject to a stagnant “image of language”? And, can a transdisciplinary, dynamic approach combining linguistics and philosophy elucidate new perspectives on these possible conceptual shortcomings? In order to answer these questions we explore the term and concept of noise, particularly in its presentation as semantic noise. Owing to its definitional plurality, and sometimes even desirable unspecificity, the term noise is thus used as proof of concept for semantic generativity being an inherent characteristic in linguistic representation, and its concept is used to interrogate assumptions admitted in the resolution of Winograd schemas. The argument is speculative and theoretical in method, and the result is an analysis which provides an account of the fundamentally dialogical and necessarily open-ended effects of semantic noise in natural language.","PeriodicalId":45929,"journal":{"name":"ANGELAKI-JOURNAL OF THE THEORETICAL HUMANITIES","volume":"28 1","pages":"111 - 132"},"PeriodicalIF":0.2000,"publicationDate":"2023-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ANGELAKI-JOURNAL OF THE THEORETICAL HUMANITIES","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/0969725X.2023.2216555","RegionNum":4,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"HUMANITIES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Abstract Semantic noise, the effect ensuing from the denotative and thus functional variability exhibited by different terms in different contexts, is a common concern in natural language processing (NLP). While unarguably problematic in specific applications (e.g., certain translation tasks), the main argument of this paper is that failing to observe this linguistic matter of fact as a generative effect rather than as an obstacle, leads to actual obstacles in instances where language model outputs are presented as neutral. Given that a common and long-standing challenge in NLP is the interpretation of ambiguous – i.e., semantically noisy – cases, this article focuses on an exemplar ambiguity-resolution task in NLP: the problem of anaphora in Winograd schemas. The main question considered is: to what extent is the standard approach to disambiguation in NLP subject to a stagnant “image of language”? And, can a transdisciplinary, dynamic approach combining linguistics and philosophy elucidate new perspectives on these possible conceptual shortcomings? In order to answer these questions we explore the term and concept of noise, particularly in its presentation as semantic noise. Owing to its definitional plurality, and sometimes even desirable unspecificity, the term noise is thus used as proof of concept for semantic generativity being an inherent characteristic in linguistic representation, and its concept is used to interrogate assumptions admitted in the resolution of Winograd schemas. The argument is speculative and theoretical in method, and the result is an analysis which provides an account of the fundamentally dialogical and necessarily open-ended effects of semantic noise in natural language.
期刊介绍:
Angelaki: journal of the theoretical humanities was established in September 1993 to provide an international forum for vanguard work in the theoretical humanities. In itself a contentious category, "theoretical humanities" represents the productive nexus of work in the disciplinary fields of literary criticism and theory, philosophy, and cultural studies. The journal is dedicated to the refreshing of intellectual coordinates, and to the challenging and vivifying process of re-thinking. Angelaki: journal of the theoretical humanities encourages a critical engagement with theory in terms of disciplinary development and intellectual and political usefulness, the inquiry into and articulation of culture.