{"title":"Dynamics of scope ambiguities: comparative analysis of human and large language model performance in Korean","authors":"Myung Hye Yoo , Sanghoun Song","doi":"10.1016/j.lingua.2025.103998","DOIUrl":null,"url":null,"abstract":"<div><div>This study investigates how native Korean speakers and large language models (LLMs) resolve scope ambiguities and integrate them with discourse information, focusing on interactions between negation and quantificational phrases (QPs). The objectives were twofold: (i) to determine whether the general preference for surface scope interpretations and integration with discourse information persists in complex syntactic constructions in Korean, which require refined processing, and (ii) to assess how well LLMs comprehend and integrate semantic structures compared with human performance. The results showed a preference for surface scope among Korean speakers but did not rigidly hold against the inverse scope, particularly influenced by object QPs or long-form negation, even when contexts favor an inverse scope. LLMs developed by OpenAI—GPT-3.5 Turbo, GPT-4 Turbo, and GPT-4o—align with human judgments, mainly favoring surface scope interpretations when contexts favor the inverse scope. However, when the context supports an inverse scope, discrepancies in the handling of syntactic nuances are evident. This model tends to overgeneralize the inverse scope in specific configurations in which humans typically find the inverse scope more accessible. These findings highlight the challenges of mimicking human linguistic processing and the need for further refinement of language models to improve their interpretive accuracy.</div></div>","PeriodicalId":47955,"journal":{"name":"Lingua","volume":"324 ","pages":"Article 103998"},"PeriodicalIF":1.1000,"publicationDate":"2025-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Lingua","FirstCategoryId":"98","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0024384125001238","RegionNum":3,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}
引用次数: 0
Abstract
This study investigates how native Korean speakers and large language models (LLMs) resolve scope ambiguities and integrate them with discourse information, focusing on interactions between negation and quantificational phrases (QPs). The objectives were twofold: (i) to determine whether the general preference for surface scope interpretations and integration with discourse information persists in complex syntactic constructions in Korean, which require refined processing, and (ii) to assess how well LLMs comprehend and integrate semantic structures compared with human performance. The results showed a preference for surface scope among Korean speakers but did not rigidly hold against the inverse scope, particularly influenced by object QPs or long-form negation, even when contexts favor an inverse scope. LLMs developed by OpenAI—GPT-3.5 Turbo, GPT-4 Turbo, and GPT-4o—align with human judgments, mainly favoring surface scope interpretations when contexts favor the inverse scope. However, when the context supports an inverse scope, discrepancies in the handling of syntactic nuances are evident. This model tends to overgeneralize the inverse scope in specific configurations in which humans typically find the inverse scope more accessible. These findings highlight the challenges of mimicking human linguistic processing and the need for further refinement of language models to improve their interpretive accuracy.
期刊介绍:
Lingua publishes papers of any length, if justified, as well as review articles surveying developments in the various fields of linguistics, and occasional discussions. A considerable number of pages in each issue are devoted to critical book reviews. Lingua also publishes Lingua Franca articles consisting of provocative exchanges expressing strong opinions on central topics in linguistics; The Decade In articles which are educational articles offering the nonspecialist linguist an overview of a given area of study; and Taking up the Gauntlet special issues composed of a set number of papers examining one set of data and exploring whose theory offers the most insight with a minimal set of assumptions and a maximum of arguments.