{"title":"Financial Time Series Forecasting Enriched with Textual Information","authors":"Lord Flaubert Steve Ataucuri Cruz, D. F. Silva","doi":"10.1109/ICMLA52953.2021.00066","DOIUrl":null,"url":null,"abstract":"The ability to extract knowledge and forecast stock trends is crucial to mitigate investors’ risks and uncertainties in the market. The stock trend is affected by non-linearity, complexity, noise, and especially the surrounding news. External factors such as daily news became one of the investors’ primary resources for buying or selling assets. However, this kind of information appears very fast. There are thousands of news generated by different web sources, taking a long time to analyze them, causing significant losses for investors due to late decisions. Although recent contextual language models have transformed the area of natural language processing, models to make predictions using news that influence stock values still face barriers such as unlabeled data and class imbalance. This paper proposes a hybrid methodology that enriches the time series forecasting considering textual knowledge extracted from sites without a widely annotated corpus. We show that the proposed method can improve forecasting using an empirical evaluation of Bitcoin prices prediction.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"43 1","pages":"385-390"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLA52953.2021.00066","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The ability to extract knowledge and forecast stock trends is crucial to mitigate investors’ risks and uncertainties in the market. The stock trend is affected by non-linearity, complexity, noise, and especially the surrounding news. External factors such as daily news became one of the investors’ primary resources for buying or selling assets. However, this kind of information appears very fast. There are thousands of news generated by different web sources, taking a long time to analyze them, causing significant losses for investors due to late decisions. Although recent contextual language models have transformed the area of natural language processing, models to make predictions using news that influence stock values still face barriers such as unlabeled data and class imbalance. This paper proposes a hybrid methodology that enriches the time series forecasting considering textual knowledge extracted from sites without a widely annotated corpus. We show that the proposed method can improve forecasting using an empirical evaluation of Bitcoin prices prediction.