{"title":"Stock trend prediction relying on text mining and sentiment analysis with tweets","authors":"P. Meesad, Jiajia Li","doi":"10.1109/WICT.2014.7077275","DOIUrl":null,"url":null,"abstract":"Stock trend prediction based on text has gained much attention from researchers in recent years. According to investment theories, investors' behaviors will influence the stock market, and the way people invest their money is based on the history trend and information they hold. On account of this indirectly influential relationship between information of stock and stock trend, stock trend prediction based on text has been done by many researchers. However, due to the serious feature sparse problem in tweets and unreliability of using average sentiment score to indicate one day's sentiment, this work proposed a text-sentiment based stock trend prediction model with a hybrid feature selection method. Instead of applying sentiment analysis to add sentiment related features, this paper uses SentiWordNet to give an additional weight to the selected features. Besides, this work also compares the results with those of other learning algorithms. SVM linear algorithm based on leave-one-out cross validation yields the best performance of 90.34%.","PeriodicalId":439852,"journal":{"name":"2014 4th World Congress on Information and Communication Technologies (WICT 2014)","volume":"132 3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"22","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 4th World Congress on Information and Communication Technologies (WICT 2014)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WICT.2014.7077275","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 22
Abstract
Stock trend prediction based on text has gained much attention from researchers in recent years. According to investment theories, investors' behaviors will influence the stock market, and the way people invest their money is based on the history trend and information they hold. On account of this indirectly influential relationship between information of stock and stock trend, stock trend prediction based on text has been done by many researchers. However, due to the serious feature sparse problem in tweets and unreliability of using average sentiment score to indicate one day's sentiment, this work proposed a text-sentiment based stock trend prediction model with a hybrid feature selection method. Instead of applying sentiment analysis to add sentiment related features, this paper uses SentiWordNet to give an additional weight to the selected features. Besides, this work also compares the results with those of other learning algorithms. SVM linear algorithm based on leave-one-out cross validation yields the best performance of 90.34%.