{"title":"Unlocking the potential of news: A systematic review of advantages and challenges for event detection and analysis","authors":"Klaifer Garcia, Lilian Berton","doi":"10.1016/j.cosrev.2025.100838","DOIUrl":null,"url":null,"abstract":"<div><div>Social media platforms, including both social networks and news outlets, have been widely utilized for event detection and analysis tasks. While social networks constitute the most commonly used data source due to their high volume and immediacy, news articles offer distinctive advantages such as access to well-structured historical archives and the availability of more coherent, detailed narratives, which can enhance the reliability and interpretability of event-related insights. In this study, we conduct a review and highlight key considerations that should be addressed when developing event detection applications based on news data sources. In our systematic review, we retrieved 654 papers from 2019 until 2024, covering four digital libraries (Springer Link, Science Direct from Elsevier, ACM, IEEE Explore). After applying exclusion criteria, we analyzed 79 papers qualitatively and quantitatively. We aimed to answer the following research questions: What is the motivation for using news data? What is the time span of the analyzed events? How detailed can the information be extracted? What are the most commonly used techniques and evaluation metrics? Based on the results, we identified several use cases where news is the most effective source of data in terms of the amount of information that can be retrieved, the quality of the content, and the response time, which can be as fast as social networks in some situations. Finally, we presented some challenges and opportunities in the area.</div></div>","PeriodicalId":48633,"journal":{"name":"Computer Science Review","volume":"59 ","pages":"Article 100838"},"PeriodicalIF":12.7000,"publicationDate":"2025-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Science Review","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1574013725001145","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Social media platforms, including both social networks and news outlets, have been widely utilized for event detection and analysis tasks. While social networks constitute the most commonly used data source due to their high volume and immediacy, news articles offer distinctive advantages such as access to well-structured historical archives and the availability of more coherent, detailed narratives, which can enhance the reliability and interpretability of event-related insights. In this study, we conduct a review and highlight key considerations that should be addressed when developing event detection applications based on news data sources. In our systematic review, we retrieved 654 papers from 2019 until 2024, covering four digital libraries (Springer Link, Science Direct from Elsevier, ACM, IEEE Explore). After applying exclusion criteria, we analyzed 79 papers qualitatively and quantitatively. We aimed to answer the following research questions: What is the motivation for using news data? What is the time span of the analyzed events? How detailed can the information be extracted? What are the most commonly used techniques and evaluation metrics? Based on the results, we identified several use cases where news is the most effective source of data in terms of the amount of information that can be retrieved, the quality of the content, and the response time, which can be as fast as social networks in some situations. Finally, we presented some challenges and opportunities in the area.
包括社交网络和新闻媒体在内的社交媒体平台已被广泛用于事件检测和分析任务。虽然社交网络由于其高容量和即时性而构成了最常用的数据源,但新闻文章提供了独特的优势,例如可以访问结构良好的历史档案,并且可以获得更连贯、更详细的叙述,这可以提高事件相关见解的可靠性和可解释性。在这项研究中,我们进行了回顾,并强调了在开发基于新闻数据源的事件检测应用程序时应该解决的关键问题。在我们的系统综述中,我们检索了从2019年到2024年的654篇论文,涵盖了四个数字图书馆(b施普林格Link, Science Direct from Elsevier, ACM, IEEE Explore)。应用排除标准对79篇论文进行定性和定量分析。我们旨在回答以下研究问题:使用新闻数据的动机是什么?所分析事件的时间跨度是多少?可以提取出多详细的信息?最常用的技术和评估指标是什么?根据结果,我们确定了几个用例,在这些用例中,从可检索的信息量、内容质量和响应时间来看,新闻是最有效的数据来源,在某些情况下,响应时间可以与社交网络一样快。最后,我们提出了该领域的一些挑战和机遇。
期刊介绍:
Computer Science Review, a publication dedicated to research surveys and expository overviews of open problems in computer science, targets a broad audience within the field seeking comprehensive insights into the latest developments. The journal welcomes articles from various fields as long as their content impacts the advancement of computer science. In particular, articles that review the application of well-known Computer Science methods to other areas are in scope only if these articles advance the fundamental understanding of those methods.