Real-time topics extraction and visualization in online discussions

Yaodong Li, Xiaodong Zhou, Xia Cui, Ruoxi Dai
{"title":"Real-time topics extraction and visualization in online discussions","authors":"Yaodong Li, Xiaodong Zhou, Xia Cui, Ruoxi Dai","doi":"10.1109/ICMLC.2002.1174519","DOIUrl":null,"url":null,"abstract":"Workshop line is a newly developed online discussion system. It is built on an enhanced version of chat line. The rules and other measures adopted in the workshop line discussions make it possible to extract topics with actual meanings in real time. In this paper, a tool for real-time topics extraction and visualization in workshop line discussions-THETA (a thesaurus-based topic analysis tool) is introduced. It is mainly based on a hierarchically structured thesaurus, which incorporates expertise. By simply scanning input text, counting frequencies of terms, summing up frequencies and sorting frequencies, topics with actual meanings can be extracted rapidly. Experimental results show that THETA works well and the real-time output can help online users to understand the contents of discussions and to discuss more fluently and efficiently.","PeriodicalId":90702,"journal":{"name":"Proceedings. International Conference on Machine Learning and Cybernetics","volume":"153 1","pages":"928-933 vol.2"},"PeriodicalIF":0.0000,"publicationDate":"2002-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. International Conference on Machine Learning and Cybernetics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLC.2002.1174519","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Workshop line is a newly developed online discussion system. It is built on an enhanced version of chat line. The rules and other measures adopted in the workshop line discussions make it possible to extract topics with actual meanings in real time. In this paper, a tool for real-time topics extraction and visualization in workshop line discussions-THETA (a thesaurus-based topic analysis tool) is introduced. It is mainly based on a hierarchically structured thesaurus, which incorporates expertise. By simply scanning input text, counting frequencies of terms, summing up frequencies and sorting frequencies, topics with actual meanings can be extracted rapidly. Experimental results show that THETA works well and the real-time output can help online users to understand the contents of discussions and to discuss more fluently and efficiently.
在线讨论中的实时主题提取和可视化
Workshop line是一个新开发的在线讨论系统。它是建立在一个增强版的聊天线。车间线上讨论所采用的规则和其他措施,使得实时抽取具有实际意义的话题成为可能。本文介绍了一种用于车间在线讨论实时主题提取和可视化的工具——theta(基于同义词库的主题分析工具)。它主要基于分层结构的同义词典,其中包含专业知识。通过对输入文本进行简单的扫描,对词条的频率进行计数,对频率进行求和,对频率进行排序,就可以快速提取出具有实际意义的主题。实验结果表明,THETA效果良好,实时输出可以帮助在线用户理解讨论内容,更流畅、高效地进行讨论。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信