Milos Krstajic, Mohammad Najm-Araghi, Florian Mansmann, D. Keim
{"title":"新闻故事发展的增量视觉文本分析","authors":"Milos Krstajic, Mohammad Najm-Araghi, Florian Mansmann, D. Keim","doi":"10.1117/12.912456","DOIUrl":null,"url":null,"abstract":"Online news sources produce thousands of news articles every day, reporting on local and global real-world \nevents. New information quickly replaces the old, making it difficult for readers to put current events in the \ncontext of the past. Additionally, the stories have very complex relationships and characteristics that are difficult \nto model: they can be weakly or strongly connected, or they can merge or split over time. In this paper, we \npresent a visual analytics system for exploration of news topics in dynamic information streams, which combines \ninteractive visualization and text mining techniques to facilitate the analysis of similar topics that split and merge \nover time. We employ text clustering techniques to automatically extract stories from online news streams and \npresent a visualization that: 1) shows temporal characteristics of stories in different time frames with different \nlevel of detail; 2) allows incremental updates of the display without recalculating the visual features of the past \ndata; 3) sorts the stories by minimizing clutter and overlap from edge crossings. By using interaction, stories \ncan be filtered based on their duration and characteristics in order to be explored in full detail with details on \ndemand. To demonstrate the usefulness of our system, case studies with real news data are presented and show \nthe capabilities for detailed dynamic text stream exploration.","PeriodicalId":89305,"journal":{"name":"Visualization and data analysis","volume":"16 1","pages":"829407"},"PeriodicalIF":0.0000,"publicationDate":"2012-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"23","resultStr":"{\"title\":\"Incremental visual text analytics of news story development\",\"authors\":\"Milos Krstajic, Mohammad Najm-Araghi, Florian Mansmann, D. Keim\",\"doi\":\"10.1117/12.912456\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Online news sources produce thousands of news articles every day, reporting on local and global real-world \\nevents. New information quickly replaces the old, making it difficult for readers to put current events in the \\ncontext of the past. Additionally, the stories have very complex relationships and characteristics that are difficult \\nto model: they can be weakly or strongly connected, or they can merge or split over time. In this paper, we \\npresent a visual analytics system for exploration of news topics in dynamic information streams, which combines \\ninteractive visualization and text mining techniques to facilitate the analysis of similar topics that split and merge \\nover time. We employ text clustering techniques to automatically extract stories from online news streams and \\npresent a visualization that: 1) shows temporal characteristics of stories in different time frames with different \\nlevel of detail; 2) allows incremental updates of the display without recalculating the visual features of the past \\ndata; 3) sorts the stories by minimizing clutter and overlap from edge crossings. By using interaction, stories \\ncan be filtered based on their duration and characteristics in order to be explored in full detail with details on \\ndemand. To demonstrate the usefulness of our system, case studies with real news data are presented and show \\nthe capabilities for detailed dynamic text stream exploration.\",\"PeriodicalId\":89305,\"journal\":{\"name\":\"Visualization and data analysis\",\"volume\":\"16 1\",\"pages\":\"829407\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-06-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"23\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Visualization and data analysis\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1117/12.912456\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Visualization and data analysis","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.912456","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Incremental visual text analytics of news story development
Online news sources produce thousands of news articles every day, reporting on local and global real-world
events. New information quickly replaces the old, making it difficult for readers to put current events in the
context of the past. Additionally, the stories have very complex relationships and characteristics that are difficult
to model: they can be weakly or strongly connected, or they can merge or split over time. In this paper, we
present a visual analytics system for exploration of news topics in dynamic information streams, which combines
interactive visualization and text mining techniques to facilitate the analysis of similar topics that split and merge
over time. We employ text clustering techniques to automatically extract stories from online news streams and
present a visualization that: 1) shows temporal characteristics of stories in different time frames with different
level of detail; 2) allows incremental updates of the display without recalculating the visual features of the past
data; 3) sorts the stories by minimizing clutter and overlap from edge crossings. By using interaction, stories
can be filtered based on their duration and characteristics in order to be explored in full detail with details on
demand. To demonstrate the usefulness of our system, case studies with real news data are presented and show
the capabilities for detailed dynamic text stream exploration.