FemaRepViz: Automatic Extraction and Geo-Temporal Visualization of FEMA National Situation Updates

2007 IEEE Symposium on Visual Analytics Science and Technology Pub Date : 2007-10-30 DOI:10.1109/VAST.2007.4388991

C. Pan, P. Mitra

{"title":"FemaRepViz: Automatic Extraction and Geo-Temporal Visualization of FEMA National Situation Updates","authors":"C. Pan, P. Mitra","doi":"10.1109/VAST.2007.4388991","DOIUrl":null,"url":null,"abstract":"An architecture for visualizing information extracted from text documents is proposed. In conformance with this architecture, a toolkit, FemaRepViz, has been implemented to extract and visualize temporal, geospatial, and summarized information from FEMA national update reports. Preliminary tests have shown satisfactory accuracy for FEMARepViz. A central component of the architecture is an entity extractor that extracts named entities like person names, location names, temporal references, etc. FEMARepViz is based on FactXtractor, an entity-extractor that works on text documents. The information extracted using FactXtractor is processed using GeoTagger, a geographical name disambiguation tool based on a novel clustering-based disambiguation algorithm. To extract relationships among entities, we propose a machine-learning based algorithm that uses a novel stripped dependency tree kernel. We illustrate and evaluate the usefulness of our system on the FEMA National Situation Updates. Daily reports are fetched by FEMARepViz from the FEMA website, segmented into coherent sections and each section is classified into one of several known incident types. We use concept Vista, Google maps and Google earth to visualize the events extracted from the text reports and allow the user to interactively filter the topics, locations, and time-periods of interest to create a visual analytics toolkit that is useful for rapid analysis of events reported in a large set of text documents.","PeriodicalId":227910,"journal":{"name":"2007 IEEE Symposium on Visual Analytics Science and Technology","volume":"58 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 IEEE Symposium on Visual Analytics Science and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/VAST.2007.4388991","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 19

Abstract

An architecture for visualizing information extracted from text documents is proposed. In conformance with this architecture, a toolkit, FemaRepViz, has been implemented to extract and visualize temporal, geospatial, and summarized information from FEMA national update reports. Preliminary tests have shown satisfactory accuracy for FEMARepViz. A central component of the architecture is an entity extractor that extracts named entities like person names, location names, temporal references, etc. FEMARepViz is based on FactXtractor, an entity-extractor that works on text documents. The information extracted using FactXtractor is processed using GeoTagger, a geographical name disambiguation tool based on a novel clustering-based disambiguation algorithm. To extract relationships among entities, we propose a machine-learning based algorithm that uses a novel stripped dependency tree kernel. We illustrate and evaluate the usefulness of our system on the FEMA National Situation Updates. Daily reports are fetched by FEMARepViz from the FEMA website, segmented into coherent sections and each section is classified into one of several known incident types. We use concept Vista, Google maps and Google earth to visualize the events extracted from the text reports and allow the user to interactively filter the topics, locations, and time-periods of interest to create a visual analytics toolkit that is useful for rapid analysis of events reported in a large set of text documents.

查看原文本刊更多论文

FemaRepViz: FEMA国家情况更新的自动提取和地理-时间可视化

提出了一种从文本文档中提取信息的可视化体系结构。根据这一体系结构，FemaRepViz工具包已经实现，用于从FEMA国家更新报告中提取和可视化时间、地理空间和汇总信息。初步测试表明，FEMARepViz的准确性令人满意。该体系结构的一个核心组件是实体提取器，它提取命名实体，如人名、位置名称、时间引用等。FEMARepViz基于FactXtractor，这是一种用于文本文档的实体提取器。利用FactXtractor提取的信息使用GeoTagger进行处理，GeoTagger是一种基于新型聚类消歧算法的地名消歧工具。为了提取实体之间的关系，我们提出了一种基于机器学习的算法，该算法使用了一种新的剥离依赖树内核。我们说明和评估我们的系统在联邦应急管理局国家情况更新的有用性。FEMARepViz从FEMA网站获取每日报告，将其分割成连贯的部分，每个部分被分类为几种已知事件类型中的一种。我们使用概念Vista、谷歌地图和谷歌地球来可视化从文本报告中提取的事件，并允许用户交互式地过滤主题、位置和感兴趣的时间段，以创建一个可视化分析工具包，该工具包对于快速分析大量文本文档中报告的事件非常有用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2007 IEEE Symposium on Visual Analytics Science and Technology

自引率

0.00%

发文量