Incremental visual text analytics of news story development

Milos Krstajic, Mohammad Najm-Araghi, Florian Mansmann, D. Keim
{"title":"Incremental visual text analytics of news story development","authors":"Milos Krstajic, Mohammad Najm-Araghi, Florian Mansmann, D. Keim","doi":"10.1117/12.912456","DOIUrl":null,"url":null,"abstract":"Online news sources produce thousands of news articles every day, reporting on local and global real-world \nevents. New information quickly replaces the old, making it difficult for readers to put current events in the \ncontext of the past. Additionally, the stories have very complex relationships and characteristics that are difficult \nto model: they can be weakly or strongly connected, or they can merge or split over time. In this paper, we \npresent a visual analytics system for exploration of news topics in dynamic information streams, which combines \ninteractive visualization and text mining techniques to facilitate the analysis of similar topics that split and merge \nover time. We employ text clustering techniques to automatically extract stories from online news streams and \npresent a visualization that: 1) shows temporal characteristics of stories in different time frames with different \nlevel of detail; 2) allows incremental updates of the display without recalculating the visual features of the past \ndata; 3) sorts the stories by minimizing clutter and overlap from edge crossings. By using interaction, stories \ncan be filtered based on their duration and characteristics in order to be explored in full detail with details on \ndemand. To demonstrate the usefulness of our system, case studies with real news data are presented and show \nthe capabilities for detailed dynamic text stream exploration.","PeriodicalId":89305,"journal":{"name":"Visualization and data analysis","volume":"16 1","pages":"829407"},"PeriodicalIF":0.0000,"publicationDate":"2012-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"23","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Visualization and data analysis","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.912456","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 23

Abstract

Online news sources produce thousands of news articles every day, reporting on local and global real-world events. New information quickly replaces the old, making it difficult for readers to put current events in the context of the past. Additionally, the stories have very complex relationships and characteristics that are difficult to model: they can be weakly or strongly connected, or they can merge or split over time. In this paper, we present a visual analytics system for exploration of news topics in dynamic information streams, which combines interactive visualization and text mining techniques to facilitate the analysis of similar topics that split and merge over time. We employ text clustering techniques to automatically extract stories from online news streams and present a visualization that: 1) shows temporal characteristics of stories in different time frames with different level of detail; 2) allows incremental updates of the display without recalculating the visual features of the past data; 3) sorts the stories by minimizing clutter and overlap from edge crossings. By using interaction, stories can be filtered based on their duration and characteristics in order to be explored in full detail with details on demand. To demonstrate the usefulness of our system, case studies with real news data are presented and show the capabilities for detailed dynamic text stream exploration.
新闻故事发展的增量视觉文本分析
在线新闻来源每天产生数千篇新闻文章,报道当地和全球现实世界的事件。新信息迅速取代旧信息,使得读者很难将当前事件置于过去的背景中。此外,故事具有非常复杂的关系和特征,难以建模:它们可以是弱连接或强连接,或者它们可以随时间合并或分裂。在本文中,我们提出了一个可视化分析系统,用于探索动态信息流中的新闻主题,该系统结合了交互式可视化和文本挖掘技术,以促进对随时间分裂和合并的类似主题的分析。本文采用文本聚类技术从在线新闻流中自动提取故事,并呈现出一种可视化效果:1)以不同的细节水平显示不同时间框架下故事的时间特征;2)允许显示的增量更新,而无需重新计算过去数据的视觉特征;3)通过最小化边缘交叉带来的混乱和重叠来对故事进行分类。通过使用交互,故事可以根据其持续时间和特征进行过滤,以便根据需要对细节进行全面的探索。为了证明我们的系统的实用性,给出了真实新闻数据的案例研究,并展示了详细动态文本流探索的能力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信