{"title":"A Text Mining Based on Refined Feature Selection to Predict Sentimental Review","authors":"Ching-Hsue Cheng","doi":"10.1145/3033288.3033307","DOIUrl":null,"url":null,"abstract":"This paper proposed an additional feature set and reduced the data dimension by SVD and PCA in order to increase accuracy and decrease executing time in text mining. The contribution of this study has: (i) proposed a preprocessing algorithm for sentiment classification, (ii) refined a feature set by adding adjective and adverb feature for sentiment classification, and (iii) utilized SVD then PCA to reduce data dimension for manager identified the sentimental labels. The experimental results show that the proposed model can obtain the better accuracy and the additional features make the better performance. Moreover, the dimension reduction can reduce the executing time effectively.","PeriodicalId":253625,"journal":{"name":"International Conference on Network, Communication and Computing","volume":"44 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Network, Communication and Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3033288.3033307","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
This paper proposed an additional feature set and reduced the data dimension by SVD and PCA in order to increase accuracy and decrease executing time in text mining. The contribution of this study has: (i) proposed a preprocessing algorithm for sentiment classification, (ii) refined a feature set by adding adjective and adverb feature for sentiment classification, and (iii) utilized SVD then PCA to reduce data dimension for manager identified the sentimental labels. The experimental results show that the proposed model can obtain the better accuracy and the additional features make the better performance. Moreover, the dimension reduction can reduce the executing time effectively.