Mennatallah El-Assady, F. Sperrle, R. Sevastjanova, M. Sedlmair, D. Keim
{"title":"LTMA: Layered Topic Matching for the Comparative Exploration, Evaluation, and Refinement of Topic Modeling Results","authors":"Mennatallah El-Assady, F. Sperrle, R. Sevastjanova, M. Sedlmair, D. Keim","doi":"10.1109/BDVA.2018.8534018","DOIUrl":null,"url":null,"abstract":"We present LTMA, a Layered Topic Matching approach for the unsupervised comparative analysis of topic modeling results. Due to the vast number of available modeling algorithms, an efficient and effective comparison of their results is detrimental to a data- and task-driven selection of a model. LTMA automates this comparative analysis by providing topic matching based on two layers (document-overlap and keyword-similarity), creating a novel topic-match data structure. This data structure builds a basis for model exploration and optimization, thus, allowing for an efficient evaluation of their performance in the context of a given type of text data and task. This is especially important for text types where an annotated gold standard dataset is not readily available and, therefore, quantitative evaluation methods are not applicable. We confirm the usefulness of our technique based on three use cases, namely: (1) the automatic comparative evaluation of topic models, (2) the visual exploration of topic modeling differences, and (3) the optimization of topic modeling results through combining matches.","PeriodicalId":92742,"journal":{"name":"2018 International Symposium on Big Data Visual and Immersive Analytics (BDVA) : Konstanz, Germany, October 17 -19, 2018. IEEE International Symposium on Big Data Visual and Immersive Analytics (4th : 2018 : Konstanz, Germany)","volume":"27 1","pages":"1-10"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Symposium on Big Data Visual and Immersive Analytics (BDVA) : Konstanz, Germany, October 17 -19, 2018. IEEE International Symposium on Big Data Visual and Immersive Analytics (4th : 2018 : Konstanz, Germany)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BDVA.2018.8534018","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
Abstract
We present LTMA, a Layered Topic Matching approach for the unsupervised comparative analysis of topic modeling results. Due to the vast number of available modeling algorithms, an efficient and effective comparison of their results is detrimental to a data- and task-driven selection of a model. LTMA automates this comparative analysis by providing topic matching based on two layers (document-overlap and keyword-similarity), creating a novel topic-match data structure. This data structure builds a basis for model exploration and optimization, thus, allowing for an efficient evaluation of their performance in the context of a given type of text data and task. This is especially important for text types where an annotated gold standard dataset is not readily available and, therefore, quantitative evaluation methods are not applicable. We confirm the usefulness of our technique based on three use cases, namely: (1) the automatic comparative evaluation of topic models, (2) the visual exploration of topic modeling differences, and (3) the optimization of topic modeling results through combining matches.