Enrico Fernandez, Anderies, Michael Gilbert Winata, Fadly Haikal Fasya, A. A. Gunawan
{"title":"Improving IndoBERT for Sentiment Analysis on Indonesian Stock Trader Slang Language","authors":"Enrico Fernandez, Anderies, Michael Gilbert Winata, Fadly Haikal Fasya, A. A. Gunawan","doi":"10.1109/IoTaIS56727.2022.9975975","DOIUrl":null,"url":null,"abstract":"Recently, more people access mobile stock trading apps and investors send messages, comments, and posts. Interest in performing sentiment analysis of these messages to predict stock price changes requires ever-improving machine learning models, though, this requires identifying Bahasa Indonesian slang phrases in comments and posts. For developing the model to perform a sentiment analysis on stock price changes, we retrieved data from comments and posts on third-party applications. In the current paper, we presented such a model and test data acquisition using datasets manually labelled by the authors. Our sentiment analysis approach was implemented with a fine-tuned IndoBERT model and achieved 60.35% accuracy predicting the sentiment of 1289 records comments, and posts which better than previous research study. By testing the model, it can do a sentiment analysis on stock price changes and is also capable of identifying the number of slang phrases in the comments and posts by Indonesian traders.","PeriodicalId":138894,"journal":{"name":"2022 IEEE International Conference on Internet of Things and Intelligence Systems (IoTaIS)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Internet of Things and Intelligence Systems (IoTaIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IoTaIS56727.2022.9975975","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Recently, more people access mobile stock trading apps and investors send messages, comments, and posts. Interest in performing sentiment analysis of these messages to predict stock price changes requires ever-improving machine learning models, though, this requires identifying Bahasa Indonesian slang phrases in comments and posts. For developing the model to perform a sentiment analysis on stock price changes, we retrieved data from comments and posts on third-party applications. In the current paper, we presented such a model and test data acquisition using datasets manually labelled by the authors. Our sentiment analysis approach was implemented with a fine-tuned IndoBERT model and achieved 60.35% accuracy predicting the sentiment of 1289 records comments, and posts which better than previous research study. By testing the model, it can do a sentiment analysis on stock price changes and is also capable of identifying the number of slang phrases in the comments and posts by Indonesian traders.