Guided text analysis using adaptive visual analytics

C. Steed, Christopher T. Symons, Frank DeNap, T. Potok
{"title":"Guided text analysis using adaptive visual analytics","authors":"C. Steed, Christopher T. Symons, Frank DeNap, T. Potok","doi":"10.1117/12.904904","DOIUrl":null,"url":null,"abstract":"This paper demonstrates the promise of augmenting interactive visualizations with semi-supervised machine \nlearning techniques to improve the discovery of significant associations and insight in the search and analysis of \ntextual information. More specifically, we have developed a system-called Gryffin-that hosts a unique collection \nof techniques that facilitate individualized investigative search pertaining to an ever-changing set of analytical \nquestions over an indexed collection of open-source publications related to national infrastructure. The Gryffin \nclient hosts dynamic displays of the search results via focus+context record listings, temporal timelines, term-frequency \nviews, and multiple coordinated views. Furthermore, as the analyst interacts with the display, the \ninteractions are recorded and used to label the search records. These labeled records are then used to drive \nsemi-supervised machine learning algorithms that re-rank the unlabeled search records such that potentially \nrelevant records are moved to the top of the record listing. Gryffin is described in the context of the daily \ntasks encountered at the Department of Homeland Security's Fusion Centers, with whom we are collaborating \nin its development. The resulting system is capable of addressing the analysts information overload that can be \ndirectly attributed to the deluge of information that must be addressed in search and investigative analysis of \ntextual information.","PeriodicalId":89305,"journal":{"name":"Visualization and data analysis","volume":"459 1","pages":"829408"},"PeriodicalIF":0.0000,"publicationDate":"2012-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Visualization and data analysis","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.904904","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

This paper demonstrates the promise of augmenting interactive visualizations with semi-supervised machine learning techniques to improve the discovery of significant associations and insight in the search and analysis of textual information. More specifically, we have developed a system-called Gryffin-that hosts a unique collection of techniques that facilitate individualized investigative search pertaining to an ever-changing set of analytical questions over an indexed collection of open-source publications related to national infrastructure. The Gryffin client hosts dynamic displays of the search results via focus+context record listings, temporal timelines, term-frequency views, and multiple coordinated views. Furthermore, as the analyst interacts with the display, the interactions are recorded and used to label the search records. These labeled records are then used to drive semi-supervised machine learning algorithms that re-rank the unlabeled search records such that potentially relevant records are moved to the top of the record listing. Gryffin is described in the context of the daily tasks encountered at the Department of Homeland Security's Fusion Centers, with whom we are collaborating in its development. The resulting system is capable of addressing the analysts information overload that can be directly attributed to the deluge of information that must be addressed in search and investigative analysis of textual information.
使用自适应视觉分析的引导文本分析
本文展示了利用半监督机器学习技术增强交互式可视化的前景,以改善文本信息搜索和分析中重要关联和洞察力的发现。更具体地说,我们开发了一个叫做格兰芬多的系统,它拥有一个独特的技术集合,可以在与国家基础设施相关的开源出版物的索引集合上,促进与不断变化的分析问题相关的个性化调查搜索。格兰芬多客户端通过焦点+上下文记录列表、时间线、术语频率视图和多个协调视图托管搜索结果的动态显示。此外,当分析人员与显示器交互时,将记录交互并用于标记搜索记录。然后使用这些标记的记录来驱动半监督机器学习算法,该算法对未标记的搜索记录重新排序,以便将可能相关的记录移动到记录列表的顶部。格兰芬多是在国土安全部融合中心的日常任务背景下被描述的,我们正在与他们合作开发它。由此产生的系统能够解决分析人员的信息过载问题,这可以直接归因于必须在文本信息的搜索和调查分析中解决的信息泛滥。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信