A Sentiment Analysis Based Stock Recommendation System

Jayanth Rao, V. Ramaraju, James Smith, Ajay Bansal
{"title":"A Sentiment Analysis Based Stock Recommendation System","authors":"Jayanth Rao, V. Ramaraju, James Smith, Ajay Bansal","doi":"10.1109/AIKE55402.2022.00020","DOIUrl":null,"url":null,"abstract":"There is tremendous value in the ability to predict stock market trends and outcomes. The public sentiment surrounding a stock is unquestionably a vital factor contributing to the rise or fall of a stock price. This paper aims to detail how data from public sentiment can be integrated into traditional stock analyses and how these analyses can then be used to make predictions of stock price trends. Headlines from seven news publications and conversations from Yahoo! Finance's conversations forum were processed by the Valence Aware Dictionary and sEntiment Reasoner (VADER) natural language processing package to determine numerical polarities which represent a positive, negative, or neutral public sentiment around a stock ticker. The resulting polarities were paired with popular stock-table metrics (PEG Ratio, Forward EPS, etc.) to create a dataset for a Logistic Regression machine learning model. The model was trained on approximately 4400 major stocks to determine a binary “Buy” (1) or “Not Buy” (0) recommendation for each stock. The model achieved an F1 accuracy of 82.5% and for most major stocks, the model's recommendations were aligned with the stock analysts' ratings from the NASDAQ website. The logistic regression model would improve from leveraging a historical compass of data, given the hive-mind behavior that online discussion forums exhibit.","PeriodicalId":441077,"journal":{"name":"2022 IEEE Fifth International Conference on Artificial Intelligence and Knowledge Engineering (AIKE)","volume":"68 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE Fifth International Conference on Artificial Intelligence and Knowledge Engineering (AIKE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AIKE55402.2022.00020","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

There is tremendous value in the ability to predict stock market trends and outcomes. The public sentiment surrounding a stock is unquestionably a vital factor contributing to the rise or fall of a stock price. This paper aims to detail how data from public sentiment can be integrated into traditional stock analyses and how these analyses can then be used to make predictions of stock price trends. Headlines from seven news publications and conversations from Yahoo! Finance's conversations forum were processed by the Valence Aware Dictionary and sEntiment Reasoner (VADER) natural language processing package to determine numerical polarities which represent a positive, negative, or neutral public sentiment around a stock ticker. The resulting polarities were paired with popular stock-table metrics (PEG Ratio, Forward EPS, etc.) to create a dataset for a Logistic Regression machine learning model. The model was trained on approximately 4400 major stocks to determine a binary “Buy” (1) or “Not Buy” (0) recommendation for each stock. The model achieved an F1 accuracy of 82.5% and for most major stocks, the model's recommendations were aligned with the stock analysts' ratings from the NASDAQ website. The logistic regression model would improve from leveraging a historical compass of data, given the hive-mind behavior that online discussion forums exhibit.
基于情绪分析的股票推荐系统
预测股票市场趋势和结果的能力具有巨大的价值。围绕一只股票的公众情绪无疑是影响股价涨跌的重要因素。本文旨在详细介绍如何将公众情绪数据整合到传统的股票分析中,以及如何使用这些分析来预测股票价格趋势。来自七个新闻出版物的头条新闻和来自雅虎的对话!《财经》的对话论坛由价感知词典和情绪推理器(VADER)自然语言处理包进行处理,以确定代表股票报价机周围积极、消极或中性公众情绪的数值极性。得到的极性与流行的股票表指标(PEG Ratio, Forward EPS等)配对,为逻辑回归机器学习模型创建一个数据集。该模型在大约4400只主要股票上进行了训练,以确定每种股票的二元“买入”(1)或“不买入”(0)建议。该模型的F1准确率达到了82.5%,对于大多数主要股票,该模型的推荐与纳斯达克网站上股票分析师的评级一致。考虑到在线讨论论坛所表现出的蜂群思维行为,逻辑回归模型将通过利用历史数据罗盘得到改进。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信