使用自适应视觉分析的引导文本分析

Visualization and data analysis Pub Date : 2012-01-22 DOI:10.1117/12.904904

C. Steed, Christopher T. Symons, Frank DeNap, T. Potok

{"title":"使用自适应视觉分析的引导文本分析","authors":"C. Steed, Christopher T. Symons, Frank DeNap, T. Potok","doi":"10.1117/12.904904","DOIUrl":null,"url":null,"abstract":"This paper demonstrates the promise of augmenting interactive visualizations with semi-supervised machine \nlearning techniques to improve the discovery of significant associations and insight in the search and analysis of \ntextual information. More specifically, we have developed a system-called Gryffin-that hosts a unique collection \nof techniques that facilitate individualized investigative search pertaining to an ever-changing set of analytical \nquestions over an indexed collection of open-source publications related to national infrastructure. The Gryffin \nclient hosts dynamic displays of the search results via focus+context record listings, temporal timelines, term-frequency \nviews, and multiple coordinated views. Furthermore, as the analyst interacts with the display, the \ninteractions are recorded and used to label the search records. These labeled records are then used to drive \nsemi-supervised machine learning algorithms that re-rank the unlabeled search records such that potentially \nrelevant records are moved to the top of the record listing. Gryffin is described in the context of the daily \ntasks encountered at the Department of Homeland Security's Fusion Centers, with whom we are collaborating \nin its development. The resulting system is capable of addressing the analysts information overload that can be \ndirectly attributed to the deluge of information that must be addressed in search and investigative analysis of \ntextual information.","PeriodicalId":89305,"journal":{"name":"Visualization and data analysis","volume":"459 1","pages":"829408"},"PeriodicalIF":0.0000,"publicationDate":"2012-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Guided text analysis using adaptive visual analytics\",\"authors\":\"C. Steed, Christopher T. Symons, Frank DeNap, T. Potok\",\"doi\":\"10.1117/12.904904\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper demonstrates the promise of augmenting interactive visualizations with semi-supervised machine \\nlearning techniques to improve the discovery of significant associations and insight in the search and analysis of \\ntextual information. More specifically, we have developed a system-called Gryffin-that hosts a unique collection \\nof techniques that facilitate individualized investigative search pertaining to an ever-changing set of analytical \\nquestions over an indexed collection of open-source publications related to national infrastructure. The Gryffin \\nclient hosts dynamic displays of the search results via focus+context record listings, temporal timelines, term-frequency \\nviews, and multiple coordinated views. Furthermore, as the analyst interacts with the display, the \\ninteractions are recorded and used to label the search records. These labeled records are then used to drive \\nsemi-supervised machine learning algorithms that re-rank the unlabeled search records such that potentially \\nrelevant records are moved to the top of the record listing. Gryffin is described in the context of the daily \\ntasks encountered at the Department of Homeland Security's Fusion Centers, with whom we are collaborating \\nin its development. The resulting system is capable of addressing the analysts information overload that can be \\ndirectly attributed to the deluge of information that must be addressed in search and investigative analysis of \\ntextual information.\",\"PeriodicalId\":89305,\"journal\":{\"name\":\"Visualization and data analysis\",\"volume\":\"459 1\",\"pages\":\"829408\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-01-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Visualization and data analysis\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1117/12.904904\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Visualization and data analysis","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.904904","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

摘要

本文展示了利用半监督机器学习技术增强交互式可视化的前景，以改善文本信息搜索和分析中重要关联和洞察力的发现。更具体地说，我们开发了一个叫做格兰芬多的系统，它拥有一个独特的技术集合，可以在与国家基础设施相关的开源出版物的索引集合上，促进与不断变化的分析问题相关的个性化调查搜索。格兰芬多客户端通过焦点+上下文记录列表、时间线、术语频率视图和多个协调视图托管搜索结果的动态显示。此外，当分析人员与显示器交互时，将记录交互并用于标记搜索记录。然后使用这些标记的记录来驱动半监督机器学习算法，该算法对未标记的搜索记录重新排序，以便将可能相关的记录移动到记录列表的顶部。格兰芬多是在国土安全部融合中心的日常任务背景下被描述的，我们正在与他们合作开发它。由此产生的系统能够解决分析人员的信息过载问题，这可以直接归因于必须在文本信息的搜索和调查分析中解决的信息泛滥。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Guided text analysis using adaptive visual analytics

This paper demonstrates the promise of augmenting interactive visualizations with semi-supervised machine learning techniques to improve the discovery of significant associations and insight in the search and analysis of textual information. More specifically, we have developed a system-called Gryffin-that hosts a unique collection of techniques that facilitate individualized investigative search pertaining to an ever-changing set of analytical questions over an indexed collection of open-source publications related to national infrastructure. The Gryffin client hosts dynamic displays of the search results via focus+context record listings, temporal timelines, term-frequency views, and multiple coordinated views. Furthermore, as the analyst interacts with the display, the interactions are recorded and used to label the search records. These labeled records are then used to drive semi-supervised machine learning algorithms that re-rank the unlabeled search records such that potentially relevant records are moved to the top of the record listing. Gryffin is described in the context of the daily tasks encountered at the Department of Homeland Security's Fusion Centers, with whom we are collaborating in its development. The resulting system is capable of addressing the analysts information overload that can be directly attributed to the deluge of information that must be addressed in search and investigative analysis of textual information.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Visualization and data analysis

自引率

0.00%

发文量