Machine learning and sentiment analysis: Projecting bank insolvency risk

IF 1.2 Q3 ECONOMICS

Research in Economics Pub Date : 2023-06-01 DOI:10.1016/j.rie.2023.03.001

Diego Pitta de Jesus, Cássio da Nóbrega Besarria

{"title":"Machine learning and sentiment analysis: Projecting bank insolvency risk","authors":"Diego Pitta de Jesus, Cássio da Nóbrega Besarria","doi":"10.1016/j.rie.2023.03.001","DOIUrl":null,"url":null,"abstract":"<div><p>The main motivation of this paper is to use machine learning techniques to build a new insolvency risk rating metric for banks traded on Brazilian stock exchange. Then, a set of prediction models will be used to project the risk rating of these institutions. Conventionally, the literature analyzes bank insolvency risk from accounting data and macroeconomic variables<span>. In addition to these variables, this paper will construct a series of bank institution manager sentiment, via quarterly reports (ITR), and this will be used to improve the accuracy of bank risk predictions. The results indicate that the bank risk classification, via the k-means algorithm, was able to classify 17% of the sample into the highest risk group (1), while 83% of the sample was in the lowest bankruptcy risk group (0). Using the Z-score metric, we found that 65% of the sample is in the low-risk group, and 35% of the sample is in the high-risk group. Thus, the k-means algorithm is more rigorous in classifying a bank in the highest risk category. Next we used the data already described to project the risk of bank insolvency. The results of this step showed that the decision tree model performed the best for the test sample. In addition, it was found that the inclusion of the bank sentiment variable was able to improve the performance of the prediction models, especially, when bank sentiment is constructed from a time-varying dictionary.</span></p></div>","PeriodicalId":46094,"journal":{"name":"Research in Economics","volume":"77 2","pages":"Pages 226-238"},"PeriodicalIF":1.2000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Research in Economics","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1090944323000224","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ECONOMICS","Score":null,"Total":0}

引用次数: 1

Abstract

The main motivation of this paper is to use machine learning techniques to build a new insolvency risk rating metric for banks traded on Brazilian stock exchange. Then, a set of prediction models will be used to project the risk rating of these institutions. Conventionally, the literature analyzes bank insolvency risk from accounting data and macroeconomic variables. In addition to these variables, this paper will construct a series of bank institution manager sentiment, via quarterly reports (ITR), and this will be used to improve the accuracy of bank risk predictions. The results indicate that the bank risk classification, via the k-means algorithm, was able to classify 17% of the sample into the highest risk group (1), while 83% of the sample was in the lowest bankruptcy risk group (0). Using the Z-score metric, we found that 65% of the sample is in the low-risk group, and 35% of the sample is in the high-risk group. Thus, the k-means algorithm is more rigorous in classifying a bank in the highest risk category. Next we used the data already described to project the risk of bank insolvency. The results of this step showed that the decision tree model performed the best for the test sample. In addition, it was found that the inclusion of the bank sentiment variable was able to improve the performance of the prediction models, especially, when bank sentiment is constructed from a time-varying dictionary.

查看原文本刊更多论文

机器学习和情绪分析：预测银行破产风险

本文的主要动机是使用机器学习技术为在巴西证券交易所交易的银行建立一个新的破产风险评级指标。然后，使用一套预测模型来预测这些机构的风险评级。传统上，文献从会计数据和宏观经济变量分析银行破产风险。除了这些变量之外，本文将通过季度报告(ITR)构建一系列银行机构经理情绪，并将其用于提高银行风险预测的准确性。结果表明，通过k-means算法，银行风险分类能够将17%的样本划分为最高风险组(1)，而83%的样本处于最低破产风险组(0)。使用Z-score指标，我们发现65%的样本处于低风险组，35%的样本处于高风险组。因此，k-means算法在将银行划分为最高风险类别时更为严格。接下来，我们使用已经描述的数据来预测银行破产的风险。这一步的结果表明，决策树模型对测试样本的表现最好。此外，研究发现，纳入银行情绪变量能够提高预测模型的性能，特别是当银行情绪从时变字典中构建时。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Research in Economics ECONOMICS-

CiteScore

1.40

自引率

0.00%

发文量

审稿时长

89 days

期刊介绍： Established in 1947, Research in Economics is one of the oldest general-interest economics journals in the world and the main one among those based in Italy. The purpose of the journal is to select original theoretical and empirical articles that will have high impact on the debate in the social sciences; since 1947, it has published important research contributions on a wide range of topics. A summary of our editorial policy is this: the editors make a preliminary assessment of whether the results of a paper, if correct, are worth publishing. If so one of the associate editors reviews the paper: from the reviewer we expect to learn if the paper is understandable and coherent and - within reasonable bounds - the results are correct. We believe that long lags in publication and multiple demands for revision simply slow scientific progress. Our goal is to provide you a definitive answer within one month of submission. We give the editors one week to judge the overall contribution and if acceptable send your paper to an associate editor. We expect the associate editor to provide a more detailed evaluation within three weeks so that the editors can make a final decision before the month expires. In the (rare) case of a revision we allow four months and in the case of conditional acceptance we allow two months to submit the final version. In both cases we expect a cover letter explaining how you met the requirements. For conditional acceptance the editors will verify that the requirements were met. In the case of revision the original associate editor will do so. If the revision cannot be at least conditionally accepted it is rejected: there is no second revision.