Offline Deep Reinforcement Learning for Dynamic Pricing of Consumer Credit

Proceedings of the Third ACM International Conference on AI in Finance. Pub Date: 2022-03-06. DOI: 10.1145/3533271.3561682

Raad Khraishi, Ramin Okhrati


Citations: 2

Abstract

We introduce a method for pricing consumer credit using recent advances in offline deep reinforcement learning. The approach relies on a static dataset and, unlike commonly used pricing approaches, requires no assumptions about the functional form of demand. Using both real and synthetic data on consumer credit applications, we demonstrate that our approach, based on the conservative Q-learning algorithm, can learn an effective personalized pricing policy without any online interaction or price experimentation. In particular, using historical data on online auto loan applications, we estimate a 21% increase in expected profit with an average change in prices of less than 15% relative to the original pricing policy.
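To make the conservative Q-learning idea concrete, the following is a minimal sketch of the discrete-action CQL objective: a standard Bellman error plus a penalty that pushes down Q-values for prices not supported by the logged data. It is not the authors' implementation. The discrete price grid, the function names `logsumexp` and `cql_loss`, and the default values of `alpha` and `gamma` are all assumptions made for illustration.

```python
import math


def logsumexp(values):
    """Numerically stable log(sum(exp(v))) over a list of floats."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))


def cql_loss(q_values, actions, rewards, next_q_values, dones,
             alpha=1.0, gamma=0.99):
    """Discrete-action CQL loss on a batch of logged transitions.

    Assumptions (hypothetical, not taken from the paper):
      q_values[i][a]      Q estimate for price index a in state i
      actions[i]          price index actually offered in the logged data
      rewards[i]          observed profit for that offer
      next_q_values[i]    Q estimates in the next state
      dones[i]            True if the episode ended after this transition
    """
    n = len(actions)
    bellman = 0.0
    conservative = 0.0
    for q_row, a, r, next_row, done in zip(
        q_values, actions, rewards, next_q_values, dones
    ):
        q_taken = q_row[a]
        # One-step bootstrapped target; no bootstrap on terminal transitions.
        target = r + (0.0 if done else gamma * max(next_row))
        bellman += 0.5 * (q_taken - target) ** 2
        # CQL penalty: soft maximum over all prices minus the logged price.
        # It is always >= 0 and shrinks as the logged action dominates.
        conservative += logsumexp(q_row) - q_taken
    return bellman / n + alpha * conservative / n
```

With `alpha=0` the function reduces to an ordinary Q-learning loss; larger `alpha` makes the learned policy stay closer to the prices seen in the historical data, which is the property that allows learning without online price experimentation.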

