{"title":"多模态语篇摘要的系统文献综述","authors":"Abid Ali, Diego Molla","doi":"10.1145/3763245","DOIUrl":null,"url":null,"abstract":"The proliferation of information-sharing platforms and the ease of access to diverse resources have led to an overwhelming volume of multimodal data that is increasingly difficult to process effectively. The integration of multiple data types, including text, images, video, and audio, highlights the growing importance of Multimodal Text Summarization (MMTS). Collecting and synthesizing existing research on this topic can provide a comprehensive foundation for advancing the field. Following a Systematic Literature Review (SLR) methodology, we addressed three pivotal research questions concerning methodologies, evaluation measures, and datasets in MMTS. Through a systematic analysis of 132 papers , we examined the strategies employed to address MMTS challenges, assessed the evaluation methods used to quantify performance, and compiled a detailed list of available datasets along with their limitations. This review offers critical insights and identifies future research directions, aiming to inform and guide continued innovation in this dynamic and evolving domain.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"63 1","pages":""},"PeriodicalIF":28.0000,"publicationDate":"2025-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Systematic Literature Review on Multimodal Text Summarization\",\"authors\":\"Abid Ali, Diego Molla\",\"doi\":\"10.1145/3763245\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The proliferation of information-sharing platforms and the ease of access to diverse resources have led to an overwhelming volume of multimodal data that is increasingly difficult to process effectively. The integration of multiple data types, including text, images, video, and audio, highlights the growing importance of Multimodal Text Summarization (MMTS). Collecting and synthesizing existing research on this topic can provide a comprehensive foundation for advancing the field. Following a Systematic Literature Review (SLR) methodology, we addressed three pivotal research questions concerning methodologies, evaluation measures, and datasets in MMTS. Through a systematic analysis of 132 papers , we examined the strategies employed to address MMTS challenges, assessed the evaluation methods used to quantify performance, and compiled a detailed list of available datasets along with their limitations. This review offers critical insights and identifies future research directions, aiming to inform and guide continued innovation in this dynamic and evolving domain.\",\"PeriodicalId\":50926,\"journal\":{\"name\":\"ACM Computing Surveys\",\"volume\":\"63 1\",\"pages\":\"\"},\"PeriodicalIF\":28.0000,\"publicationDate\":\"2025-08-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACM Computing Surveys\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1145/3763245\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, THEORY & METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Computing Surveys","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1145/3763245","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
A Systematic Literature Review on Multimodal Text Summarization
The proliferation of information-sharing platforms and the ease of access to diverse resources have led to an overwhelming volume of multimodal data that is increasingly difficult to process effectively. The integration of multiple data types, including text, images, video, and audio, highlights the growing importance of Multimodal Text Summarization (MMTS). Collecting and synthesizing existing research on this topic can provide a comprehensive foundation for advancing the field. Following a Systematic Literature Review (SLR) methodology, we addressed three pivotal research questions concerning methodologies, evaluation measures, and datasets in MMTS. Through a systematic analysis of 132 papers , we examined the strategies employed to address MMTS challenges, assessed the evaluation methods used to quantify performance, and compiled a detailed list of available datasets along with their limitations. This review offers critical insights and identifies future research directions, aiming to inform and guide continued innovation in this dynamic and evolving domain.
期刊介绍:
ACM Computing Surveys is an academic journal that focuses on publishing surveys and tutorials on various areas of computing research and practice. The journal aims to provide comprehensive and easily understandable articles that guide readers through the literature and help them understand topics outside their specialties. In terms of impact, CSUR has a high reputation with a 2022 Impact Factor of 16.6. It is ranked 3rd out of 111 journals in the field of Computer Science Theory & Methods.
ACM Computing Surveys is indexed and abstracted in various services, including AI2 Semantic Scholar, Baidu, Clarivate/ISI: JCR, CNKI, DeepDyve, DTU, EBSCO: EDS/HOST, and IET Inspec, among others.