C. Steed, Christopher T. Symons, Frank DeNap, T. Potok
{"title":"使用自适应视觉分析的引导文本分析","authors":"C. Steed, Christopher T. Symons, Frank DeNap, T. Potok","doi":"10.1117/12.904904","DOIUrl":null,"url":null,"abstract":"This paper demonstrates the promise of augmenting interactive visualizations with semi-supervised machine \nlearning techniques to improve the discovery of significant associations and insight in the search and analysis of \ntextual information. More specifically, we have developed a system-called Gryffin-that hosts a unique collection \nof techniques that facilitate individualized investigative search pertaining to an ever-changing set of analytical \nquestions over an indexed collection of open-source publications related to national infrastructure. The Gryffin \nclient hosts dynamic displays of the search results via focus+context record listings, temporal timelines, term-frequency \nviews, and multiple coordinated views. Furthermore, as the analyst interacts with the display, the \ninteractions are recorded and used to label the search records. These labeled records are then used to drive \nsemi-supervised machine learning algorithms that re-rank the unlabeled search records such that potentially \nrelevant records are moved to the top of the record listing. Gryffin is described in the context of the daily \ntasks encountered at the Department of Homeland Security's Fusion Centers, with whom we are collaborating \nin its development. The resulting system is capable of addressing the analysts information overload that can be \ndirectly attributed to the deluge of information that must be addressed in search and investigative analysis of \ntextual information.","PeriodicalId":89305,"journal":{"name":"Visualization and data analysis","volume":"459 1","pages":"829408"},"PeriodicalIF":0.0000,"publicationDate":"2012-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Guided text analysis using adaptive visual analytics\",\"authors\":\"C. Steed, Christopher T. Symons, Frank DeNap, T. Potok\",\"doi\":\"10.1117/12.904904\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper demonstrates the promise of augmenting interactive visualizations with semi-supervised machine \\nlearning techniques to improve the discovery of significant associations and insight in the search and analysis of \\ntextual information. More specifically, we have developed a system-called Gryffin-that hosts a unique collection \\nof techniques that facilitate individualized investigative search pertaining to an ever-changing set of analytical \\nquestions over an indexed collection of open-source publications related to national infrastructure. The Gryffin \\nclient hosts dynamic displays of the search results via focus+context record listings, temporal timelines, term-frequency \\nviews, and multiple coordinated views. Furthermore, as the analyst interacts with the display, the \\ninteractions are recorded and used to label the search records. These labeled records are then used to drive \\nsemi-supervised machine learning algorithms that re-rank the unlabeled search records such that potentially \\nrelevant records are moved to the top of the record listing. Gryffin is described in the context of the daily \\ntasks encountered at the Department of Homeland Security's Fusion Centers, with whom we are collaborating \\nin its development. The resulting system is capable of addressing the analysts information overload that can be \\ndirectly attributed to the deluge of information that must be addressed in search and investigative analysis of \\ntextual information.\",\"PeriodicalId\":89305,\"journal\":{\"name\":\"Visualization and data analysis\",\"volume\":\"459 1\",\"pages\":\"829408\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-01-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Visualization and data analysis\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1117/12.904904\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Visualization and data analysis","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.904904","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Guided text analysis using adaptive visual analytics
This paper demonstrates the promise of augmenting interactive visualizations with semi-supervised machine
learning techniques to improve the discovery of significant associations and insight in the search and analysis of
textual information. More specifically, we have developed a system-called Gryffin-that hosts a unique collection
of techniques that facilitate individualized investigative search pertaining to an ever-changing set of analytical
questions over an indexed collection of open-source publications related to national infrastructure. The Gryffin
client hosts dynamic displays of the search results via focus+context record listings, temporal timelines, term-frequency
views, and multiple coordinated views. Furthermore, as the analyst interacts with the display, the
interactions are recorded and used to label the search records. These labeled records are then used to drive
semi-supervised machine learning algorithms that re-rank the unlabeled search records such that potentially
relevant records are moved to the top of the record listing. Gryffin is described in the context of the daily
tasks encountered at the Department of Homeland Security's Fusion Centers, with whom we are collaborating
in its development. The resulting system is capable of addressing the analysts information overload that can be
directly attributed to the deluge of information that must be addressed in search and investigative analysis of
textual information.