{"title":"基于Twitter和StockTwits数据的情绪分析的股票走势预测方法","authors":"Christina Nousi, Christos Tjortjis","doi":"10.1109/SEEDA-CECNSM53056.2021.9566242","DOIUrl":null,"url":null,"abstract":"Application of Machine Learning (ML) and sentiment analysis on data from microblogging services has become a common approach for stock market prediction. In this paper, we propose a methodology using sentiment analysis on Twitter and StockTwits data for Stock movement prediction. The methodology was evaluated by analyzing stock movement and sentiment data. We present a case study focusing on Microsoft stock. We collected tweets from Twitter and StockTwits along with financial data extracted from Finance Yahoo. Sentiment analysis was applied on tweets, and two ML models namely SVM and Logistic Regression were implemented. Best results were achieved when using tweets from Twitter with VADER and SVM. Top F-score was 76.3% and top Area Under Curve (AUC) was 67%. SVM also achieves the greatest accuracy equal to 65.8%, when using StockTwits with TextBlob on this imbalanced data set.","PeriodicalId":68279,"journal":{"name":"计算机工程与设计","volume":"426 1-2 1","pages":"1-7"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"A Methodology for Stock Movement Prediction Using Sentiment Analysis on Twitter and StockTwits Data\",\"authors\":\"Christina Nousi, Christos Tjortjis\",\"doi\":\"10.1109/SEEDA-CECNSM53056.2021.9566242\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Application of Machine Learning (ML) and sentiment analysis on data from microblogging services has become a common approach for stock market prediction. In this paper, we propose a methodology using sentiment analysis on Twitter and StockTwits data for Stock movement prediction. The methodology was evaluated by analyzing stock movement and sentiment data. We present a case study focusing on Microsoft stock. We collected tweets from Twitter and StockTwits along with financial data extracted from Finance Yahoo. Sentiment analysis was applied on tweets, and two ML models namely SVM and Logistic Regression were implemented. Best results were achieved when using tweets from Twitter with VADER and SVM. Top F-score was 76.3% and top Area Under Curve (AUC) was 67%. SVM also achieves the greatest accuracy equal to 65.8%, when using StockTwits with TextBlob on this imbalanced data set.\",\"PeriodicalId\":68279,\"journal\":{\"name\":\"计算机工程与设计\",\"volume\":\"426 1-2 1\",\"pages\":\"1-7\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"计算机工程与设计\",\"FirstCategoryId\":\"1093\",\"ListUrlMain\":\"https://doi.org/10.1109/SEEDA-CECNSM53056.2021.9566242\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"计算机工程与设计","FirstCategoryId":"1093","ListUrlMain":"https://doi.org/10.1109/SEEDA-CECNSM53056.2021.9566242","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Methodology for Stock Movement Prediction Using Sentiment Analysis on Twitter and StockTwits Data
Application of Machine Learning (ML) and sentiment analysis on data from microblogging services has become a common approach for stock market prediction. In this paper, we propose a methodology using sentiment analysis on Twitter and StockTwits data for Stock movement prediction. The methodology was evaluated by analyzing stock movement and sentiment data. We present a case study focusing on Microsoft stock. We collected tweets from Twitter and StockTwits along with financial data extracted from Finance Yahoo. Sentiment analysis was applied on tweets, and two ML models namely SVM and Logistic Regression were implemented. Best results were achieved when using tweets from Twitter with VADER and SVM. Top F-score was 76.3% and top Area Under Curve (AUC) was 67%. SVM also achieves the greatest accuracy equal to 65.8%, when using StockTwits with TextBlob on this imbalanced data set.
期刊介绍:
Computer Engineering and Design is supervised by China Aerospace Science and Industry Corporation and sponsored by the 706th Institute of the Second Academy of China Aerospace Science and Industry Corporation. It was founded in 1980. The purpose of the journal is to disseminate new technologies and promote academic exchanges. Since its inception, it has adhered to the principle of combining depth and breadth, theory and application, and focused on reporting cutting-edge and hot computer technologies. The journal accepts academic papers with innovative and independent academic insights, including papers on fund projects, award-winning research papers, outstanding papers at academic conferences, doctoral and master's theses, etc.