{"title":"Machine learning and sentiment analysis: Projecting bank insolvency risk","authors":"Diego Pitta de Jesus, Cássio da Nóbrega Besarria","doi":"10.1016/j.rie.2023.03.001","DOIUrl":null,"url":null,"abstract":"<div><p>The main motivation of this paper is to use machine learning techniques to build a new insolvency risk rating metric for banks traded on Brazilian stock exchange. Then, a set of prediction models will be used to project the risk rating of these institutions. Conventionally, the literature analyzes bank insolvency risk from accounting data and macroeconomic variables<span>. In addition to these variables, this paper will construct a series of bank institution manager sentiment, via quarterly reports (ITR), and this will be used to improve the accuracy of bank risk predictions. The results indicate that the bank risk classification, via the k-means algorithm, was able to classify 17% of the sample into the highest risk group (1), while 83% of the sample was in the lowest bankruptcy risk group (0). Using the Z-score metric, we found that 65% of the sample is in the low-risk group, and 35% of the sample is in the high-risk group. Thus, the k-means algorithm is more rigorous in classifying a bank in the highest risk category. Next we used the data already described to project the risk of bank insolvency. The results of this step showed that the decision tree model performed the best for the test sample. In addition, it was found that the inclusion of the bank sentiment variable was able to improve the performance of the prediction models, especially, when bank sentiment is constructed from a time-varying dictionary.</span></p></div>","PeriodicalId":46094,"journal":{"name":"Research in Economics","volume":"77 2","pages":"Pages 226-238"},"PeriodicalIF":1.2000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Research in Economics","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1090944323000224","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ECONOMICS","Score":null,"Total":0}
引用次数: 1
Abstract
The main motivation of this paper is to use machine learning techniques to build a new insolvency risk rating metric for banks traded on Brazilian stock exchange. Then, a set of prediction models will be used to project the risk rating of these institutions. Conventionally, the literature analyzes bank insolvency risk from accounting data and macroeconomic variables. In addition to these variables, this paper will construct a series of bank institution manager sentiment, via quarterly reports (ITR), and this will be used to improve the accuracy of bank risk predictions. The results indicate that the bank risk classification, via the k-means algorithm, was able to classify 17% of the sample into the highest risk group (1), while 83% of the sample was in the lowest bankruptcy risk group (0). Using the Z-score metric, we found that 65% of the sample is in the low-risk group, and 35% of the sample is in the high-risk group. Thus, the k-means algorithm is more rigorous in classifying a bank in the highest risk category. Next we used the data already described to project the risk of bank insolvency. The results of this step showed that the decision tree model performed the best for the test sample. In addition, it was found that the inclusion of the bank sentiment variable was able to improve the performance of the prediction models, especially, when bank sentiment is constructed from a time-varying dictionary.
期刊介绍:
Established in 1947, Research in Economics is one of the oldest general-interest economics journals in the world and the main one among those based in Italy. The purpose of the journal is to select original theoretical and empirical articles that will have high impact on the debate in the social sciences; since 1947, it has published important research contributions on a wide range of topics. A summary of our editorial policy is this: the editors make a preliminary assessment of whether the results of a paper, if correct, are worth publishing. If so one of the associate editors reviews the paper: from the reviewer we expect to learn if the paper is understandable and coherent and - within reasonable bounds - the results are correct. We believe that long lags in publication and multiple demands for revision simply slow scientific progress. Our goal is to provide you a definitive answer within one month of submission. We give the editors one week to judge the overall contribution and if acceptable send your paper to an associate editor. We expect the associate editor to provide a more detailed evaluation within three weeks so that the editors can make a final decision before the month expires. In the (rare) case of a revision we allow four months and in the case of conditional acceptance we allow two months to submit the final version. In both cases we expect a cover letter explaining how you met the requirements. For conditional acceptance the editors will verify that the requirements were met. In the case of revision the original associate editor will do so. If the revision cannot be at least conditionally accepted it is rejected: there is no second revision.