Turing Jest: Distributional Semantics and One-Line Jokes

IF 2.3 2区 心理学 Q2 PSYCHOLOGY, EXPERIMENTAL
Sean Trott, Drew E. Walker, Samuel M. Taylor, Seana Coulson
{"title":"Turing Jest: Distributional Semantics and One-Line Jokes","authors":"Sean Trott,&nbsp;Drew E. Walker,&nbsp;Samuel M. Taylor,&nbsp;Seana Coulson","doi":"10.1111/cogs.70066","DOIUrl":null,"url":null,"abstract":"<p>Humor is an essential aspect of human experience, yet surprisingly, little is known about how we recognize and understand humorous utterances. Most theories of humor emphasize the role of incongruity detection and resolution (e.g., frame-shifting), as well as cognitive capacities like Theory of Mind and pragmatic reasoning. In multiple preregistered experiments, we ask whether and to what extent exposure to purely linguistic input can account for the human ability to recognize one-line jokes and identify their entailments. We find that GPT-3, a large language model (LLM) trained on only language data, exhibits above-chance performance in tasks designed to test its ability to detect, appreciate, and comprehend jokes. In exploratory work, we also find above-chance performance in humor detection and comprehension in several open-source LLMs, such as Llama-3 and Mixtral. Although all LLMs tested fall short of human performance, both humans and LLMs show a tendency to misclassify nonjokes with surprising endings as jokes. Results suggest that LLMs are remarkably adept at some tasks involving one-line jokes, but reveal key limitations of distributional approaches to meaning.</p>","PeriodicalId":48349,"journal":{"name":"Cognitive Science","volume":"49 5","pages":""},"PeriodicalIF":2.3000,"publicationDate":"2025-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/cogs.70066","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cognitive Science","FirstCategoryId":"102","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/cogs.70066","RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PSYCHOLOGY, EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0

Abstract

Humor is an essential aspect of human experience, yet surprisingly, little is known about how we recognize and understand humorous utterances. Most theories of humor emphasize the role of incongruity detection and resolution (e.g., frame-shifting), as well as cognitive capacities like Theory of Mind and pragmatic reasoning. In multiple preregistered experiments, we ask whether and to what extent exposure to purely linguistic input can account for the human ability to recognize one-line jokes and identify their entailments. We find that GPT-3, a large language model (LLM) trained on only language data, exhibits above-chance performance in tasks designed to test its ability to detect, appreciate, and comprehend jokes. In exploratory work, we also find above-chance performance in humor detection and comprehension in several open-source LLMs, such as Llama-3 and Mixtral. Although all LLMs tested fall short of human performance, both humans and LLMs show a tendency to misclassify nonjokes with surprising endings as jokes. Results suggest that LLMs are remarkably adept at some tasks involving one-line jokes, but reveal key limitations of distributional approaches to meaning.

图灵笑话:分布语义和单行笑话
幽默是人类经验的一个重要方面,然而令人惊讶的是,我们对如何识别和理解幽默的话语知之甚少。大多数幽默理论强调发现和解决不一致性的作用(例如,框架转移),以及认知能力,如心理理论和语用推理。在多个预先注册的实验中,我们询问是否以及在多大程度上暴露于纯语言输入可以解释人类识别单行笑话和识别其含义的能力。我们发现GPT-3,一个只在语言数据上训练的大型语言模型(LLM),在测试其检测、欣赏和理解笑话的能力的任务中表现出高于机会的表现。在探索性工作中,我们也发现了几个开源llm,如Llama-3和Mixtral在幽默检测和理解方面的表现高于偶然。尽管所有经过测试的法学硕士的表现都不及人类,但人类和法学硕士都倾向于将结尾令人惊讶的非笑话错误地归类为笑话。结果表明,法学硕士在一些涉及一行笑话的任务上表现得非常娴熟,但也揭示了意义分布方法的关键局限性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Cognitive Science
Cognitive Science PSYCHOLOGY, EXPERIMENTAL-
CiteScore
4.10
自引率
8.00%
发文量
139
期刊介绍: Cognitive Science publishes articles in all areas of cognitive science, covering such topics as knowledge representation, inference, memory processes, learning, problem solving, planning, perception, natural language understanding, connectionism, brain theory, motor control, intentional systems, and other areas of interdisciplinary concern. Highest priority is given to research reports that are specifically written for a multidisciplinary audience. The audience is primarily researchers in cognitive science and its associated fields, including anthropologists, education researchers, psychologists, philosophers, linguists, computer scientists, neuroscientists, and roboticists.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信