Unsupervised-learning financial reconciliation: a robust, accurate approach inspired by machine translation

Peter A. Chew
{"title":"Unsupervised-learning financial reconciliation: a robust, accurate approach inspired by machine translation","authors":"Peter A. Chew","doi":"10.1145/3383455.3422517","DOIUrl":null,"url":null,"abstract":"Financial reconciliation (cross-checking independent sources of data) is a time-honored and widespread function in finance and audit. Its objectives are to ensure completeness, timeliness, and accuracy of recording of transactions. With the proliferation of data over recent decades, the ways in which reconciliation is approached have evolved. There are now a number of software products that promise to automate the process of detailed transaction matching. However, the 'state of the art' in these products is rules-based reconciliation. Rules-based systems in any domain tend to be brittle in that any change in the data streams causes the rules to have to be debugged and rebuilt, itself often a time-consuming process. And to make matters worse, this might be caused not just by data schema changes, but the contents of what is in the data fields themselves. Either of these can occur if a third party such as a bank changes its internal processes. In a sense, automated reconciliation is where machine translation was almost 70 years ago; IBM's 1954 Georgetown experiment approached Russian-to-English translation using rules, but it took another 4 decades for researchers in data-driven Artificial Intelligence (AI) to realize why machine translation did not initially live up to its promises and develop a truly robust methodology for machine translation based on unsupervised learning. It turns out that financial reconciliation can be cast as a machine translation problem based on unsupervised learning. To our knowledge, we are the first to propose this. Here, we demonstrate via experiments on real-life (albeit small-scale) financial data that this way of approaching the problem demonstrates promise in terms of accuracy, as well as solving the problem of lack of robustness inherent in rules-based approaches.","PeriodicalId":447950,"journal":{"name":"Proceedings of the First ACM International Conference on AI in Finance","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the First ACM International Conference on AI in Finance","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3383455.3422517","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Financial reconciliation (cross-checking independent sources of data) is a time-honored and widespread function in finance and audit. Its objectives are to ensure completeness, timeliness, and accuracy of recording of transactions. With the proliferation of data over recent decades, the ways in which reconciliation is approached have evolved. There are now a number of software products that promise to automate the process of detailed transaction matching. However, the 'state of the art' in these products is rules-based reconciliation. Rules-based systems in any domain tend to be brittle in that any change in the data streams causes the rules to have to be debugged and rebuilt, itself often a time-consuming process. And to make matters worse, this might be caused not just by data schema changes, but the contents of what is in the data fields themselves. Either of these can occur if a third party such as a bank changes its internal processes. In a sense, automated reconciliation is where machine translation was almost 70 years ago; IBM's 1954 Georgetown experiment approached Russian-to-English translation using rules, but it took another 4 decades for researchers in data-driven Artificial Intelligence (AI) to realize why machine translation did not initially live up to its promises and develop a truly robust methodology for machine translation based on unsupervised learning. It turns out that financial reconciliation can be cast as a machine translation problem based on unsupervised learning. To our knowledge, we are the first to propose this. Here, we demonstrate via experiments on real-life (albeit small-scale) financial data that this way of approaching the problem demonstrates promise in terms of accuracy, as well as solving the problem of lack of robustness inherent in rules-based approaches.
无监督学习财务对账:受机器翻译启发的稳健、准确的方法
财务对账(交叉核对独立数据来源)在财务和审计中是一项历史悠久且广泛的功能。其目标是确保交易记录的完整性、及时性和准确性。随着近几十年来数据的激增,和解的方式也发生了变化。现在有许多软件产品承诺将详细的交易匹配过程自动化。然而,这些产品的“最新技术”是基于规则的协调。在任何领域中,基于规则的系统往往是脆弱的,因为数据流中的任何更改都会导致必须对规则进行调试和重建,而这本身通常是一个耗时的过程。更糟糕的是,这可能不仅仅是由数据模式更改引起的,还可能是由数据字段本身的内容引起的。如果第三方(如银行)改变其内部流程,这两种情况都可能发生。从某种意义上说,自动对调相当于70年前的机器翻译;IBM在1954年的乔治城实验中使用规则进行了俄语到英语的翻译,但数据驱动的人工智能(AI)的研究人员又花了40年的时间才意识到为什么机器翻译最初没有兑现其承诺,并开发出一种真正强大的基于无监督学习的机器翻译方法。事实证明,财务对账可以作为一个基于无监督学习的机器翻译问题。据我们所知,我们是第一个提出这个建议的。在这里,我们通过对现实生活(尽管是小规模的)金融数据的实验证明,这种处理问题的方式在准确性方面表现出了希望,并解决了基于规则的方法缺乏固有的鲁棒性的问题。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信