Dongyu Zhang , Xingyuan Lu , Mulin Zhuang , Senqi Yang , Hongjun Chen
{"title":"Multimodal metaphor recognition based on chain-of-cognition prompting","authors":"Dongyu Zhang , Xingyuan Lu , Mulin Zhuang , Senqi Yang , Hongjun Chen","doi":"10.1016/j.cogsys.2025.101356","DOIUrl":null,"url":null,"abstract":"<div><div>Metaphor is a way of thinking and cognition prevalent in human language. With the development of social media and multimodal data, metaphor recognition research has expanded from the traditional unimodal scope (such as text or images) to the multimodality. However, current multimodal metaphor processing methods mainly focus on fusion techniques for multiple modalities such as text and image, but neglect the cognitive mechanism of metaphor as a way of thinking, and are deficient in utilizing pre-trained information from large language models. Therefore, this paper proposes a chain-of-cognition prompting (CoC) method to address multimodal metaphor recognition task, which makes full use of the pre-training information of the large model in order to better recognize metaphors. The method utilizes prompting words to construct inputs that guide the large language model to reason about potential metaphorical source and target domain related entities and associations between entities in the sample. At the same time, visual information is obtained through image caption extraction and a visual encoder to enable the model to reason and produce metaphor recognition results. The experimental results show that the method performs well on the metaphor recognition task, which is better than the existing baseline model, verifying the effectiveness of the method on the metaphor recognition task.</div></div>","PeriodicalId":55242,"journal":{"name":"Cognitive Systems Research","volume":"91 ","pages":"Article 101356"},"PeriodicalIF":2.1000,"publicationDate":"2025-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cognitive Systems Research","FirstCategoryId":"102","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1389041725000361","RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Metaphor is a way of thinking and cognition prevalent in human language. With the development of social media and multimodal data, metaphor recognition research has expanded from the traditional unimodal scope (such as text or images) to the multimodality. However, current multimodal metaphor processing methods mainly focus on fusion techniques for multiple modalities such as text and image, but neglect the cognitive mechanism of metaphor as a way of thinking, and are deficient in utilizing pre-trained information from large language models. Therefore, this paper proposes a chain-of-cognition prompting (CoC) method to address multimodal metaphor recognition task, which makes full use of the pre-training information of the large model in order to better recognize metaphors. The method utilizes prompting words to construct inputs that guide the large language model to reason about potential metaphorical source and target domain related entities and associations between entities in the sample. At the same time, visual information is obtained through image caption extraction and a visual encoder to enable the model to reason and produce metaphor recognition results. The experimental results show that the method performs well on the metaphor recognition task, which is better than the existing baseline model, verifying the effectiveness of the method on the metaphor recognition task.
期刊介绍:
Cognitive Systems Research is dedicated to the study of human-level cognition. As such, it welcomes papers which advance the understanding, design and applications of cognitive and intelligent systems, both natural and artificial.
The journal brings together a broad community studying cognition in its many facets in vivo and in silico, across the developmental spectrum, focusing on individual capacities or on entire architectures. It aims to foster debate and integrate ideas, concepts, constructs, theories, models and techniques from across different disciplines and different perspectives on human-level cognition. The scope of interest includes the study of cognitive capacities and architectures - both brain-inspired and non-brain-inspired - and the application of cognitive systems to real-world problems as far as it offers insights relevant for the understanding of cognition.
Cognitive Systems Research therefore welcomes mature and cutting-edge research approaching cognition from a systems-oriented perspective, both theoretical and empirically-informed, in the form of original manuscripts, short communications, opinion articles, systematic reviews, and topical survey articles from the fields of Cognitive Science (including Philosophy of Cognitive Science), Artificial Intelligence/Computer Science, Cognitive Robotics, Developmental Science, Psychology, and Neuroscience and Neuromorphic Engineering. Empirical studies will be considered if they are supplemented by theoretical analyses and contributions to theory development and/or computational modelling studies.