Sean Trott, Drew E. Walker, Samuel M. Taylor, Seana Coulson
{"title":"图灵笑话:分布语义和单行笑话","authors":"Sean Trott, Drew E. Walker, Samuel M. Taylor, Seana Coulson","doi":"10.1111/cogs.70066","DOIUrl":null,"url":null,"abstract":"<p>Humor is an essential aspect of human experience, yet surprisingly, little is known about how we recognize and understand humorous utterances. Most theories of humor emphasize the role of incongruity detection and resolution (e.g., frame-shifting), as well as cognitive capacities like Theory of Mind and pragmatic reasoning. In multiple preregistered experiments, we ask whether and to what extent exposure to purely linguistic input can account for the human ability to recognize one-line jokes and identify their entailments. We find that GPT-3, a large language model (LLM) trained on only language data, exhibits above-chance performance in tasks designed to test its ability to detect, appreciate, and comprehend jokes. In exploratory work, we also find above-chance performance in humor detection and comprehension in several open-source LLMs, such as Llama-3 and Mixtral. Although all LLMs tested fall short of human performance, both humans and LLMs show a tendency to misclassify nonjokes with surprising endings as jokes. Results suggest that LLMs are remarkably adept at some tasks involving one-line jokes, but reveal key limitations of distributional approaches to meaning.</p>","PeriodicalId":48349,"journal":{"name":"Cognitive Science","volume":"49 5","pages":""},"PeriodicalIF":2.3000,"publicationDate":"2025-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/cogs.70066","citationCount":"0","resultStr":"{\"title\":\"Turing Jest: Distributional Semantics and One-Line Jokes\",\"authors\":\"Sean Trott, Drew E. Walker, Samuel M. Taylor, Seana Coulson\",\"doi\":\"10.1111/cogs.70066\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Humor is an essential aspect of human experience, yet surprisingly, little is known about how we recognize and understand humorous utterances. Most theories of humor emphasize the role of incongruity detection and resolution (e.g., frame-shifting), as well as cognitive capacities like Theory of Mind and pragmatic reasoning. In multiple preregistered experiments, we ask whether and to what extent exposure to purely linguistic input can account for the human ability to recognize one-line jokes and identify their entailments. We find that GPT-3, a large language model (LLM) trained on only language data, exhibits above-chance performance in tasks designed to test its ability to detect, appreciate, and comprehend jokes. In exploratory work, we also find above-chance performance in humor detection and comprehension in several open-source LLMs, such as Llama-3 and Mixtral. Although all LLMs tested fall short of human performance, both humans and LLMs show a tendency to misclassify nonjokes with surprising endings as jokes. Results suggest that LLMs are remarkably adept at some tasks involving one-line jokes, but reveal key limitations of distributional approaches to meaning.</p>\",\"PeriodicalId\":48349,\"journal\":{\"name\":\"Cognitive Science\",\"volume\":\"49 5\",\"pages\":\"\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2025-05-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1111/cogs.70066\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Cognitive Science\",\"FirstCategoryId\":\"102\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/cogs.70066\",\"RegionNum\":2,\"RegionCategory\":\"心理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"PSYCHOLOGY, EXPERIMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cognitive Science","FirstCategoryId":"102","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/cogs.70066","RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PSYCHOLOGY, EXPERIMENTAL","Score":null,"Total":0}
Turing Jest: Distributional Semantics and One-Line Jokes
Humor is an essential aspect of human experience, yet surprisingly, little is known about how we recognize and understand humorous utterances. Most theories of humor emphasize the role of incongruity detection and resolution (e.g., frame-shifting), as well as cognitive capacities like Theory of Mind and pragmatic reasoning. In multiple preregistered experiments, we ask whether and to what extent exposure to purely linguistic input can account for the human ability to recognize one-line jokes and identify their entailments. We find that GPT-3, a large language model (LLM) trained on only language data, exhibits above-chance performance in tasks designed to test its ability to detect, appreciate, and comprehend jokes. In exploratory work, we also find above-chance performance in humor detection and comprehension in several open-source LLMs, such as Llama-3 and Mixtral. Although all LLMs tested fall short of human performance, both humans and LLMs show a tendency to misclassify nonjokes with surprising endings as jokes. Results suggest that LLMs are remarkably adept at some tasks involving one-line jokes, but reveal key limitations of distributional approaches to meaning.
期刊介绍:
Cognitive Science publishes articles in all areas of cognitive science, covering such topics as knowledge representation, inference, memory processes, learning, problem solving, planning, perception, natural language understanding, connectionism, brain theory, motor control, intentional systems, and other areas of interdisciplinary concern. Highest priority is given to research reports that are specifically written for a multidisciplinary audience. The audience is primarily researchers in cognitive science and its associated fields, including anthropologists, education researchers, psychologists, philosophers, linguists, computer scientists, neuroscientists, and roboticists.