{"title":"sDTM:用于文本分析的监督贝叶斯深度主题模型","authors":"Yi Yang, Kunpeng Zhang, Yangyang Fan","doi":"10.2139/ssrn.3612168","DOIUrl":null,"url":null,"abstract":"This study proposes a novel supervised deep topic modeling approach for effective text analysis. This approach leverages the auxiliary data associated with text, such as ratings in consumer reviews or categories of posts in online forums, to enhance the discovery of latent topics in text. The proposed approach can effectively improve topic modeling performance in several ways. First, the learned latent topics are more meaningful and distinguishable, which helps text data exploration. Second, the latent topics discovered by the novel supervised deep topic model are more accurate, which improves the performance of downstream econometrics and predictive analytics that utilize latent topics as inputs. Given the prevalence of auxiliary data in real-world text analysis tasks and the wide adoption of topic modeling in business research and practice, the study offers an effective solution for extracting insights from text data.","PeriodicalId":13594,"journal":{"name":"Information Systems & Economics eJournal","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2020-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":"{\"title\":\"sDTM: A Supervised Bayesian Deep Topic Model for Text Analytics\",\"authors\":\"Yi Yang, Kunpeng Zhang, Yangyang Fan\",\"doi\":\"10.2139/ssrn.3612168\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This study proposes a novel supervised deep topic modeling approach for effective text analysis. This approach leverages the auxiliary data associated with text, such as ratings in consumer reviews or categories of posts in online forums, to enhance the discovery of latent topics in text. The proposed approach can effectively improve topic modeling performance in several ways. First, the learned latent topics are more meaningful and distinguishable, which helps text data exploration. Second, the latent topics discovered by the novel supervised deep topic model are more accurate, which improves the performance of downstream econometrics and predictive analytics that utilize latent topics as inputs. Given the prevalence of auxiliary data in real-world text analysis tasks and the wide adoption of topic modeling in business research and practice, the study offers an effective solution for extracting insights from text data.\",\"PeriodicalId\":13594,\"journal\":{\"name\":\"Information Systems & Economics eJournal\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-05-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"12\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Information Systems & Economics eJournal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2139/ssrn.3612168\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Systems & Economics eJournal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.3612168","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
sDTM: A Supervised Bayesian Deep Topic Model for Text Analytics
This study proposes a novel supervised deep topic modeling approach for effective text analysis. This approach leverages the auxiliary data associated with text, such as ratings in consumer reviews or categories of posts in online forums, to enhance the discovery of latent topics in text. The proposed approach can effectively improve topic modeling performance in several ways. First, the learned latent topics are more meaningful and distinguishable, which helps text data exploration. Second, the latent topics discovered by the novel supervised deep topic model are more accurate, which improves the performance of downstream econometrics and predictive analytics that utilize latent topics as inputs. Given the prevalence of auxiliary data in real-world text analysis tasks and the wide adoption of topic modeling in business research and practice, the study offers an effective solution for extracting insights from text data.