FemaRepViz: Automatic Extraction and Geo-Temporal Visualization of FEMA National Situation Updates

C. Pan, P. Mitra
{"title":"FemaRepViz: Automatic Extraction and Geo-Temporal Visualization of FEMA National Situation Updates","authors":"C. Pan, P. Mitra","doi":"10.1109/VAST.2007.4388991","DOIUrl":null,"url":null,"abstract":"An architecture for visualizing information extracted from text documents is proposed. In conformance with this architecture, a toolkit, FemaRepViz, has been implemented to extract and visualize temporal, geospatial, and summarized information from FEMA national update reports. Preliminary tests have shown satisfactory accuracy for FEMARepViz. A central component of the architecture is an entity extractor that extracts named entities like person names, location names, temporal references, etc. FEMARepViz is based on FactXtractor, an entity-extractor that works on text documents. The information extracted using FactXtractor is processed using GeoTagger, a geographical name disambiguation tool based on a novel clustering-based disambiguation algorithm. To extract relationships among entities, we propose a machine-learning based algorithm that uses a novel stripped dependency tree kernel. We illustrate and evaluate the usefulness of our system on the FEMA National Situation Updates. Daily reports are fetched by FEMARepViz from the FEMA website, segmented into coherent sections and each section is classified into one of several known incident types. We use concept Vista, Google maps and Google earth to visualize the events extracted from the text reports and allow the user to interactively filter the topics, locations, and time-periods of interest to create a visual analytics toolkit that is useful for rapid analysis of events reported in a large set of text documents.","PeriodicalId":227910,"journal":{"name":"2007 IEEE Symposium on Visual Analytics Science and Technology","volume":"58 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 IEEE Symposium on Visual Analytics Science and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/VAST.2007.4388991","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 19

Abstract

An architecture for visualizing information extracted from text documents is proposed. In conformance with this architecture, a toolkit, FemaRepViz, has been implemented to extract and visualize temporal, geospatial, and summarized information from FEMA national update reports. Preliminary tests have shown satisfactory accuracy for FEMARepViz. A central component of the architecture is an entity extractor that extracts named entities like person names, location names, temporal references, etc. FEMARepViz is based on FactXtractor, an entity-extractor that works on text documents. The information extracted using FactXtractor is processed using GeoTagger, a geographical name disambiguation tool based on a novel clustering-based disambiguation algorithm. To extract relationships among entities, we propose a machine-learning based algorithm that uses a novel stripped dependency tree kernel. We illustrate and evaluate the usefulness of our system on the FEMA National Situation Updates. Daily reports are fetched by FEMARepViz from the FEMA website, segmented into coherent sections and each section is classified into one of several known incident types. We use concept Vista, Google maps and Google earth to visualize the events extracted from the text reports and allow the user to interactively filter the topics, locations, and time-periods of interest to create a visual analytics toolkit that is useful for rapid analysis of events reported in a large set of text documents.
FemaRepViz: FEMA国家情况更新的自动提取和地理-时间可视化
提出了一种从文本文档中提取信息的可视化体系结构。根据这一体系结构,FemaRepViz工具包已经实现,用于从FEMA国家更新报告中提取和可视化时间、地理空间和汇总信息。初步测试表明,FEMARepViz的准确性令人满意。该体系结构的一个核心组件是实体提取器,它提取命名实体,如人名、位置名称、时间引用等。FEMARepViz基于FactXtractor,这是一种用于文本文档的实体提取器。利用FactXtractor提取的信息使用GeoTagger进行处理,GeoTagger是一种基于新型聚类消歧算法的地名消歧工具。为了提取实体之间的关系,我们提出了一种基于机器学习的算法,该算法使用了一种新的剥离依赖树内核。我们说明和评估我们的系统在联邦应急管理局国家情况更新的有用性。FEMARepViz从FEMA网站获取每日报告,将其分割成连贯的部分,每个部分被分类为几种已知事件类型中的一种。我们使用概念Vista、谷歌地图和谷歌地球来可视化从文本报告中提取的事件,并允许用户交互式地过滤主题、位置和感兴趣的时间段,以创建一个可视化分析工具包,该工具包对于快速分析大量文本文档中报告的事件非常有用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信