Explainable clustering and application to wealth management compliance

Proceedings of the First ACM International Conference on AI in Finance
Pub Date: 2019-09-29
DOI: 10.1145/3383455.3422530

Enguerrand Horel, K. Giesecke, Victor Storchan, Naren Chittar


Citations: 8

Abstract

Many applications in the financial industry successfully leverage clustering algorithms to reveal meaningful patterns in vast amounts of unstructured financial data. However, these algorithms lack the interpretability that is required at both the business and regulatory levels. To overcome this issue, we propose a novel two-step method to explain clusters. A classifier is first trained to predict the cluster labels; then the Single Feature Introduction Test (SFIT) method is run on the model to identify the statistically significant features that characterize each cluster. We describe a real wealth management compliance use case that highlights the necessity of such an interpretable clustering method. We illustrate the performance of the method using simulated data and through an experiment on financial ratios of U.S. companies.
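The two-step idea in the abstract can be sketched on toy data. The code below is a minimal illustration under stated assumptions, not the authors' implementation: the tiny k-means, the nearest-centroid classifier, and the helper names (`kmeans2`, `fit_nearest_centroid`, `single_feature_gain`) are all hypothetical stand-ins. The last step is a simplified masking comparison (loss with every feature replaced by its mean versus loss with one feature restored); it does not compute the test statistic or the significance levels of the actual Single Feature Introduction Test.

```python
import random

def kmeans2(X, iters=20):
    """Tiny 2-cluster Lloyd's algorithm (stand-in for any clustering method)."""
    def d2(a, b):
        return sum((u - v) ** 2 for u, v in zip(a, b))
    c0 = X[0]
    c1 = max(X, key=lambda x: d2(x, c0))  # farthest point from the first one
    centers = [list(c0), list(c1)]
    for _ in range(iters):
        labels = [min((0, 1), key=lambda k: d2(x, centers[k])) for x in X]
        for k in (0, 1):
            members = [x for x, l in zip(X, labels) if l == k]
            if members:
                centers[k] = [sum(col) / len(members) for col in zip(*members)]
    return labels

def fit_nearest_centroid(X, y):
    """Step 1 (assumed classifier): predict the cluster label of a point."""
    centroids = {}
    for k in set(y):
        members = [x for x, l in zip(X, y) if l == k]
        centroids[k] = [sum(col) / len(members) for col in zip(*members)]
    def predict(x):
        return min(centroids,
                   key=lambda k: sum((u - v) ** 2 for u, v in zip(x, centroids[k])))
    return predict

def single_feature_gain(predict, X, y, j):
    """Step 2 (simplified): drop in 0-1 loss when only feature j is unmasked."""
    means = [sum(col) / len(X) for col in zip(*X)]
    def loss(rows):
        return sum(predict(r) != l for r, l in zip(rows, y)) / len(y)
    masked = [list(means) for _ in X]  # every feature replaced by its mean
    introduced = [means[:j] + [x[j]] + means[j + 1:] for x in X]
    return loss(masked) - loss(introduced)

# Simulated data: feature 0 separates two groups, feature 1 is pure noise.
rng = random.Random(0)
X = [[rng.gauss(0.0, 0.5), rng.gauss(0.0, 1.0)] for _ in range(100)]
X += [[rng.gauss(6.0, 0.5), rng.gauss(0.0, 1.0)] for _ in range(100)]

labels = kmeans2(X)                        # clusters to be explained
predict = fit_nearest_centroid(X, labels)  # classifier trained on cluster labels
gains = [single_feature_gain(predict, X, labels, j) for j in range(2)]
print(gains)
```

In this sketch a large gain for a feature suggests that it characterizes the clusters, while a gain near zero suggests it does not. Deciding which gains are statistically significant is the part the paper delegates to SFIT and is deliberately left out here.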
