{"title":"Machine learning techniques for stock market trends identification","authors":"E. Zolotareva","doi":"10.5937/spsunp1901047z","DOIUrl":null,"url":null,"abstract":"The research concentrates on recognizing stock markets long-term upward and downward trends. The key results are obtained with the use of gradient boosting algorithms, XGBoost in particular. The raw data is represented by time series with basic stock market quotes with periods labelled by experts as Trend or Flat. The features are then obtained via various data transformations, aiming to catch implicit factors resulting in change of stock direction. Modelling is done in two stages: stage one aims to detect endpoints of tendencies (i.e. \"sliding windows\"), stage two recognizes the tendency itself inside the window. The research addresses such issues as imbalanced datasets and contradicting labels, as well as the need of specific quality metrics to keep up with practical applicability. The model can be used to design an investment strategy though further research in feature engineering and fine calibration is required.","PeriodicalId":394770,"journal":{"name":"Scientific Publications of the State University of Novi Pazar Series A: Applied Mathematics, Informatics and mechanics","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific Publications of the State University of Novi Pazar Series A: Applied Mathematics, Informatics and mechanics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5937/spsunp1901047z","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The research concentrates on recognizing stock markets long-term upward and downward trends. The key results are obtained with the use of gradient boosting algorithms, XGBoost in particular. The raw data is represented by time series with basic stock market quotes with periods labelled by experts as Trend or Flat. The features are then obtained via various data transformations, aiming to catch implicit factors resulting in change of stock direction. Modelling is done in two stages: stage one aims to detect endpoints of tendencies (i.e. "sliding windows"), stage two recognizes the tendency itself inside the window. The research addresses such issues as imbalanced datasets and contradicting labels, as well as the need of specific quality metrics to keep up with practical applicability. The model can be used to design an investment strategy though further research in feature engineering and fine calibration is required.