{"title":"Internet Public Opinion Hotspot Detection and Analysis Based on Kmeans and SVM Algorithm","authors":"Hong Liu","doi":"10.1109/ISME.2010.207","DOIUrl":null,"url":null,"abstract":"Rapid progress of network arouses much attention on Internet public opinion, it is important to grasp the internet public opinion in time and understand the trends of their opinion correctly. Text mining plays a fundamental role in categorization and monitoring of internet public opinion, but internet public opinion is much more difficult than pure-text process because of their semi-structured characteristic. To address this issue, we propose a model for internet public opinion hotspot detection and analysis. Due to the text format of internet public opinion, we introduce the traditional vector space model (VSM) to express them, and then use Kmeans algorithm to perform text clustering on a corpus collected from some news website, and use SVM classifier to perform text categorization for new text opinion analysis, the result of the experiment shows that the efficiency and effectiveness of such method.","PeriodicalId":348878,"journal":{"name":"2010 International Conference of Information Science and Management Engineering","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"22","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 International Conference of Information Science and Management Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISME.2010.207","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 22
Abstract
Rapid progress of network arouses much attention on Internet public opinion, it is important to grasp the internet public opinion in time and understand the trends of their opinion correctly. Text mining plays a fundamental role in categorization and monitoring of internet public opinion, but internet public opinion is much more difficult than pure-text process because of their semi-structured characteristic. To address this issue, we propose a model for internet public opinion hotspot detection and analysis. Due to the text format of internet public opinion, we introduce the traditional vector space model (VSM) to express them, and then use Kmeans algorithm to perform text clustering on a corpus collected from some news website, and use SVM classifier to perform text categorization for new text opinion analysis, the result of the experiment shows that the efficiency and effectiveness of such method.