Application of Classification Algorithms for the Assessment of Confirmation to Quality Remarks

Fabio Zambuto, S. Arcuti, R. Sabatini, D. Zambuto
{"title":"Application of Classification Algorithms for the Assessment of Confirmation to Quality Remarks","authors":"Fabio Zambuto, S. Arcuti, R. Sabatini, D. Zambuto","doi":"10.2139/ssrn.3896315","DOIUrl":null,"url":null,"abstract":"In the context of the data quality management of supervisory banking data, the Bank of Italy receives a significant number of data reports at various intervals from Italian banks. If any anomalies are found, a quality remark is sent back, questioning the data submitted. This process can lead to the bank in question confirming or revising the data it previously transmitted. We propose an innovative methodology, based on text mining and machine learning techniques, for the automatic processing of the data confirmations received from banks. A classification model is employed to predict whether these confirmations should be accepted or rejected based on the reasons provided by the reporting banks, the characteristics of the validation quality checks, and reporting behaviour across the banking system. The model was trained on past cases already labelled by data managers and its performance was assessed against a set of cross-checked cases that were used as gold standard. The empirical findings show that the methodology predicts the correct decisions on recurrent data confirmations and that the performance of the proposed model is comparable to that of data managers currently engaged in data analysis.","PeriodicalId":389704,"journal":{"name":"Bank of Italy Research Paper Series","volume":"9 22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bank of Italy Research Paper Series","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.3896315","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

In the context of the data quality management of supervisory banking data, the Bank of Italy receives a significant number of data reports at various intervals from Italian banks. If any anomalies are found, a quality remark is sent back, questioning the data submitted. This process can lead to the bank in question confirming or revising the data it previously transmitted. We propose an innovative methodology, based on text mining and machine learning techniques, for the automatic processing of the data confirmations received from banks. A classification model is employed to predict whether these confirmations should be accepted or rejected based on the reasons provided by the reporting banks, the characteristics of the validation quality checks, and reporting behaviour across the banking system. The model was trained on past cases already labelled by data managers and its performance was assessed against a set of cross-checked cases that were used as gold standard. The empirical findings show that the methodology predicts the correct decisions on recurrent data confirmations and that the performance of the proposed model is comparable to that of data managers currently engaged in data analysis.
分类算法在质量评价确认评估中的应用
在对银行监管数据进行数据质量管理的背景下,意大利银行每隔一段时间就会收到大量来自意大利银行的数据报告。如果发现任何异常,则发送质量评论,质疑提交的数据。这一过程可能导致相关银行确认或修改其先前传输的数据。我们提出了一种基于文本挖掘和机器学习技术的创新方法,用于自动处理从银行收到的数据确认。根据报告银行提供的原因、验证质量检查的特征以及整个银行系统的报告行为,采用分类模型来预测这些确认是否应该被接受或拒绝。该模型是根据数据管理人员已经标记的过去案例进行训练的,其性能是根据一组交叉核对的案例进行评估的,这些案例被用作黄金标准。实证结果表明,该方法预测了对反复数据确认的正确决策,并且所提出模型的性能可与目前从事数据分析的数据管理人员的性能相媲美。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信