使用文本分类检查布拉格证券交易所的股票价格变动

International journal of new computer architectures and their applications Pub Date : 1900-01-01 DOI:10.17781/P002293

Jonás Petrovský, Frantisek Darena Pavel Netolický

{"title":"使用文本分类检查布拉格证券交易所的股票价格变动","authors":"Jonás Petrovský, Frantisek Darena Pavel Netolický","doi":"10.17781/P002293","DOIUrl":null,"url":null,"abstract":"The goal of the article was to examine the relationship between the content of text documents published on the Internet and the direction of movement of stock prices on the Prague Stock Exchange. The relationship was modeled by text classification. As data were used news articles and discussion posts on Czech websites and the value of the PX stock index and stock price of company CEZ. Document’s class (plus/minus/constant) was determined by the relative price change that happened between the publication date of a document and the next working day. We achieved a high accuracy of 75% for classification of discussion posts, however the classification accuracy for news articles was about 60%. We tried both binary (documents with constant class were discarded) and ternary classification – the former was in all cases more successful.","PeriodicalId":211757,"journal":{"name":"International journal of new computer architectures and their applications","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Examining Stock Price Movements on Prague Stock Exchange Using Text Classification\",\"authors\":\"Jonás Petrovský, Frantisek Darena Pavel Netolický\",\"doi\":\"10.17781/P002293\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The goal of the article was to examine the relationship between the content of text documents published on the Internet and the direction of movement of stock prices on the Prague Stock Exchange. The relationship was modeled by text classification. As data were used news articles and discussion posts on Czech websites and the value of the PX stock index and stock price of company CEZ. Document’s class (plus/minus/constant) was determined by the relative price change that happened between the publication date of a document and the next working day. We achieved a high accuracy of 75% for classification of discussion posts, however the classification accuracy for news articles was about 60%. We tried both binary (documents with constant class were discarded) and ternary classification – the former was in all cases more successful.\",\"PeriodicalId\":211757,\"journal\":{\"name\":\"International journal of new computer architectures and their applications\",\"volume\":\"42 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International journal of new computer architectures and their applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.17781/P002293\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of new computer architectures and their applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17781/P002293","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

这篇文章的目的是研究互联网上发布的文本文件的内容与布拉格证券交易所股票价格走势之间的关系。该关系通过文本分类建模。As数据采用捷克网站上的新闻文章和讨论帖，以及PX股票指数和CEZ公司股价的数值。文件的类别(加/减/常数)由文件发布日期和下一个工作日之间发生的相对价格变化决定。我们对讨论帖的分类准确率达到了75%，而对新闻文章的分类准确率在60%左右。我们尝试了二进制(带有常量类的文档被丢弃)和三元分类——前者在所有情况下都更成功。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Examining Stock Price Movements on Prague Stock Exchange Using Text Classification

The goal of the article was to examine the relationship between the content of text documents published on the Internet and the direction of movement of stock prices on the Prague Stock Exchange. The relationship was modeled by text classification. As data were used news articles and discussion posts on Czech websites and the value of the PX stock index and stock price of company CEZ. Document’s class (plus/minus/constant) was determined by the relative price change that happened between the publication date of a document and the next working day. We achieved a high accuracy of 75% for classification of discussion posts, however the classification accuracy for news articles was about 60%. We tried both binary (documents with constant class were discarded) and ternary classification – the former was in all cases more successful.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International journal of new computer architectures and their applications

自引率

0.00%

发文量