sDTM: A Supervised Bayesian Deep Topic Model for Text Analytics

Yi Yang, Kunpeng Zhang, Yangyang Fan
{"title":"sDTM: A Supervised Bayesian Deep Topic Model for Text Analytics","authors":"Yi Yang, Kunpeng Zhang, Yangyang Fan","doi":"10.2139/ssrn.3612168","DOIUrl":null,"url":null,"abstract":"This study proposes a novel supervised deep topic modeling approach for effective text analysis. This approach leverages the auxiliary data associated with text, such as ratings in consumer reviews or categories of posts in online forums, to enhance the discovery of latent topics in text. The proposed approach can effectively improve topic modeling performance in several ways. First, the learned latent topics are more meaningful and distinguishable, which helps text data exploration. Second, the latent topics discovered by the novel supervised deep topic model are more accurate, which improves the performance of downstream econometrics and predictive analytics that utilize latent topics as inputs. Given the prevalence of auxiliary data in real-world text analysis tasks and the wide adoption of topic modeling in business research and practice, the study offers an effective solution for extracting insights from text data.","PeriodicalId":13594,"journal":{"name":"Information Systems & Economics eJournal","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2020-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Systems & Economics eJournal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.3612168","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 12

Abstract

This study proposes a novel supervised deep topic modeling approach for effective text analysis. This approach leverages the auxiliary data associated with text, such as ratings in consumer reviews or categories of posts in online forums, to enhance the discovery of latent topics in text. The proposed approach can effectively improve topic modeling performance in several ways. First, the learned latent topics are more meaningful and distinguishable, which helps text data exploration. Second, the latent topics discovered by the novel supervised deep topic model are more accurate, which improves the performance of downstream econometrics and predictive analytics that utilize latent topics as inputs. Given the prevalence of auxiliary data in real-world text analysis tasks and the wide adoption of topic modeling in business research and practice, the study offers an effective solution for extracting insights from text data.
sDTM:用于文本分析的监督贝叶斯深度主题模型
本研究提出了一种新的监督深度主题建模方法,用于有效的文本分析。这种方法利用与文本相关的辅助数据,例如消费者评论中的评分或在线论坛中帖子的类别,来增强对文本中潜在主题的发现。该方法可以从多个方面有效地提高主题建模性能。首先,学习到的潜在主题更有意义和可区分,这有助于文本数据的探索。其次,新监督深度主题模型发现的潜在主题更加准确,从而提高了利用潜在主题作为输入的下游计量经济学和预测分析的性能。鉴于辅助数据在现实世界文本分析任务中的普遍存在以及主题建模在商业研究和实践中的广泛采用,本研究为从文本数据中提取见解提供了有效的解决方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信