Identifying Valuable Knowledge Topics in Innovation Communities Using Innovation-LDA

Hongting Tang Hongting Tang, Yanlin Zhang Hongting Tang, Xianyun Lin Yanlin Zhang, Lanteng Wu Xianyun Lin
{"title":"Identifying Valuable Knowledge Topics in Innovation Communities Using Innovation-LDA","authors":"Hongting Tang Hongting Tang, Yanlin Zhang Hongting Tang, Xianyun Lin Yanlin Zhang, Lanteng Wu Xianyun Lin","doi":"10.53106/160792642023112406012","DOIUrl":null,"url":null,"abstract":"Researchers and practitioners have recognized that user-generated content in the innovation community plays an important role. However, it is challenging to automatically identify valuable knowledge from these unstructured texts. Thus, in this study, we propose an efficient model for extracting innovation-oriented topics and, simultaneously, for assigning discovered topics to each post in the online innovation community. Specifically, we introduce a variant of the latent Dirichlet allocation (LDA) topic model, called the Innovation-LDA model, which comprehensively considers users’ interests (reflected by pageviews and replies) and the structure of threads (e.g., header or body) to generate the valuable topics. We access the quality of discovered information through statistical fit as well as substantive fit. Based on our experimental results, we can conclude that our proposed method exhibits better performance than that of the contrasted method and can locate more meaningful innovation topics; that is, our innovation-LDA model is capable of not only identifying more rigorous topics for each thread by utilizing the text structure but is also capable of learning more semantic and coherent themes from user interests. This investigation expands topic identification research by providing both a new theoretical perspective and useful guidance for enterprises in product innovation.","PeriodicalId":442331,"journal":{"name":"網際網路技術學刊","volume":"16 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"網際網路技術學刊","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.53106/160792642023112406012","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Researchers and practitioners have recognized that user-generated content in the innovation community plays an important role. However, it is challenging to automatically identify valuable knowledge from these unstructured texts. Thus, in this study, we propose an efficient model for extracting innovation-oriented topics and, simultaneously, for assigning discovered topics to each post in the online innovation community. Specifically, we introduce a variant of the latent Dirichlet allocation (LDA) topic model, called the Innovation-LDA model, which comprehensively considers users’ interests (reflected by pageviews and replies) and the structure of threads (e.g., header or body) to generate the valuable topics. We access the quality of discovered information through statistical fit as well as substantive fit. Based on our experimental results, we can conclude that our proposed method exhibits better performance than that of the contrasted method and can locate more meaningful innovation topics; that is, our innovation-LDA model is capable of not only identifying more rigorous topics for each thread by utilizing the text structure but is also capable of learning more semantic and coherent themes from user interests. This investigation expands topic identification research by providing both a new theoretical perspective and useful guidance for enterprises in product innovation.
利用创新-LDA 在创新社区中识别有价值的知识主题
研究人员和从业人员已经认识到,用户生成的内容在创新社区中发挥着重要作用。然而,从这些非结构化文本中自动识别有价值的知识是一项挑战。因此,在本研究中,我们提出了一种高效的模型,用于提取面向创新的主题,同时将发现的主题分配给在线创新社区中的每个帖子。具体来说,我们引入了潜在德里赫特分配(LDA)主题模型的一个变体,称为创新-LDA 模型,该模型综合考虑了用户的兴趣(通过页面浏览量和回复量反映)和主题的结构(如标题或正文)来生成有价值的主题。我们通过统计拟合和实质拟合来获取所发现信息的质量。根据实验结果,我们可以得出结论:与对比方法相比,我们提出的方法表现出更好的性能,并能找到更有意义的创新主题;也就是说,我们的创新-LDA 模型不仅能利用文本结构为每个主题识别出更严谨的主题,还能从用户兴趣中学习到更多语义连贯的主题。这项研究拓展了主题识别研究,为企业的产品创新提供了新的理论视角和有益指导。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信