Internet Public Opinion Hotspot Detection and Analysis Based on Kmeans and SVM Algorithm
Hong Liu
2010 International Conference of Information Science and Management Engineering, published 2010-08-07
DOI: 10.1109/ISME.2010.207
The rapid growth of the Internet has drawn considerable attention to Internet public opinion, and it is important to capture that opinion promptly and to interpret its trends correctly. Text mining plays a fundamental role in categorizing and monitoring Internet public opinion, but such content is harder to process than plain text because it is semi-structured. To address this issue, we propose a model for Internet public opinion hotspot detection and analysis. Because Internet public opinion is expressed as text, we represent it with the traditional vector space model (VSM). We then use the Kmeans algorithm to cluster a corpus collected from a news website, and use an SVM classifier to categorize new texts for opinion analysis. The experimental results demonstrate the efficiency and effectiveness of this method.
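
The three stages named in the abstract (VSM representation, Kmeans clustering, SVM categorization) can be sketched in standard-library Python. This is a minimal illustration under stated assumptions, not the paper's implementation: the toy corpus, the whitespace tokenizer, the smoothed TF-IDF weighting, the farthest-point initialisation, the bias-free linear SVM trained by subgradient descent, and every function name below are hypothetical choices made for this sketch, since the abstract specifies none of them.

```python
import math
from collections import Counter

def tokenize(text):
    # Assumption: lowercase whitespace tokenisation stands in for real preprocessing.
    return text.lower().split()

def fit_vsm(docs):
    """Build a vocabulary and smoothed IDF weights from a corpus."""
    tokenized = [tokenize(d) for d in docs]
    vocab = sorted({t for toks in tokenized for t in toks})
    df = Counter(t for toks in tokenized for t in set(toks))
    n = len(docs)
    idf = {t: math.log((1 + n) / (1 + df[t])) + 1.0 for t in vocab}
    return vocab, idf

def to_vector(text, vocab, idf):
    """Map a text to an L2-normalised TF-IDF vector (the VSM representation)."""
    tf = Counter(tokenize(text))
    vec = [tf[t] * idf[t] for t in vocab]  # terms outside the vocabulary are ignored
    norm = math.sqrt(sum(x * x for x in vec)) or 1.0
    return [x / norm for x in vec]

def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def kmeans(vectors, k, iters=20):
    """Lloyd's algorithm with deterministic farthest-point initialisation."""
    centroids = [vectors[0]]
    while len(centroids) < k:
        centroids.append(
            max(vectors, key=lambda v: min(sq_dist(v, c) for c in centroids))
        )
    labels = [0] * len(vectors)
    for _ in range(iters):
        # Assignment step: each vector joins its nearest centroid.
        labels = [min(range(k), key=lambda j: sq_dist(v, centroids[j]))
                  for v in vectors]
        # Update step: each centroid becomes the mean of its members.
        for j in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == j]
            if members:
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels

def train_linear_svm(vectors, targets, lr=0.1, lam=0.01, epochs=100):
    """Binary linear SVM without a bias term, trained by subgradient
    descent on the L2-regularised hinge loss. Targets must be +1 or -1."""
    w = [0.0] * len(vectors[0])
    for _ in range(epochs):
        for x, y in zip(vectors, targets):
            margin = y * sum(wi * xi for wi, xi in zip(w, x))
            w = [wi * (1.0 - lr * lam) for wi in w]   # regularisation shrinkage
            if margin < 1.0:                          # hinge loss is active
                w = [wi + lr * y * xi for wi, xi in zip(w, x)]
    return w

def predict(w, x):
    return 1 if sum(wi * xi for wi, xi in zip(w, x)) >= 0 else -1

# Hypothetical toy corpus with two obvious topics.
corpus = [
    "flood rescue river flood warning",
    "river flood damage rescue teams",
    "flood warning river levels rise",
    "stock market shares fall",
    "market shares rally stock investors",
    "investors sell stock market",
]
vocab, idf = fit_vsm(corpus)
X = [to_vector(d, vocab, idf) for d in corpus]
labels = kmeans(X, k=2)                               # discover topic clusters
targets = [1 if lab == labels[0] else -1 for lab in labels]
w = train_linear_svm(X, targets)                      # learn to categorise new text
new_label = predict(w, to_vector("river flood rescue", vocab, idf))
print(labels, new_label)
```

Here the cluster labels found by Kmeans serve as training targets for the SVM, so a new text is assigned to one of the discovered topics; whether the paper links the two stages this way is an assumption. For a real corpus, library implementations such as scikit-learn's `TfidfVectorizer`, `KMeans`, and `LinearSVC` would be the more practical choice.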