基于医疗注册数据的动态治疗方案的深度强化学习。

Healthcare informatics : the business magazine for information and communication systems Pub Date : 2017-08-01 DOI:10.1109/ICHI.2017.45

Ying Liu, Brent Logan, Ning Liu, Zhiyuan Xu, Jian Tang, Yanzhi Wang

{"title":"基于医疗注册数据的动态治疗方案的深度强化学习。","authors":"Ying Liu, Brent Logan, Ning Liu, Zhiyuan Xu, Jian Tang, Yanzhi Wang","doi":"10.1109/ICHI.2017.45","DOIUrl":null,"url":null,"abstract":"In this paper, we propose the first deep reinforcement learning framework to estimate the optimal Dynamic Treatment Regimes from observational medical data. This framework is more flexible and adaptive for high dimensional action and state spaces than existing reinforcement learning methods to model real life complexity in heterogeneous disease progression and treatment choices, with the goal to provide doctor and patients the data-driven personalized decision recommendations. The proposed deep reinforcement learning framework contains a supervised learning step to predict the most possible expert actions; and a deep reinforcement learning step to estimate the long term value function of Dynamic Treatment Regimes. We motivated and implemented the proposed framework on a data set from the Center for International Bone Marrow Transplant Research (CIBMTR) registry database, focusing on the sequence of prevention and treatments for acute and chronic graft versus host disease. We showed results of the initial implementation that demonstrates promising accuracy in predicting human expert decisions and initial implementation for the reinforcement learning step.","PeriodicalId":79623,"journal":{"name":"Healthcare informatics : the business magazine for information and communication systems","volume":"2017 ","pages":"380-385"},"PeriodicalIF":0.0000,"publicationDate":"2017-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/ICHI.2017.45","citationCount":"82","resultStr":"{\"title\":\"Deep Reinforcement Learning for Dynamic Treatment Regimes on Medical Registry Data.\",\"authors\":\"Ying Liu, Brent Logan, Ning Liu, Zhiyuan Xu, Jian Tang, Yanzhi Wang\",\"doi\":\"10.1109/ICHI.2017.45\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we propose the first deep reinforcement learning framework to estimate the optimal Dynamic Treatment Regimes from observational medical data. This framework is more flexible and adaptive for high dimensional action and state spaces than existing reinforcement learning methods to model real life complexity in heterogeneous disease progression and treatment choices, with the goal to provide doctor and patients the data-driven personalized decision recommendations. The proposed deep reinforcement learning framework contains a supervised learning step to predict the most possible expert actions; and a deep reinforcement learning step to estimate the long term value function of Dynamic Treatment Regimes. We motivated and implemented the proposed framework on a data set from the Center for International Bone Marrow Transplant Research (CIBMTR) registry database, focusing on the sequence of prevention and treatments for acute and chronic graft versus host disease. We showed results of the initial implementation that demonstrates promising accuracy in predicting human expert decisions and initial implementation for the reinforcement learning step.\",\"PeriodicalId\":79623,\"journal\":{\"name\":\"Healthcare informatics : the business magazine for information and communication systems\",\"volume\":\"2017 \",\"pages\":\"380-385\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1109/ICHI.2017.45\",\"citationCount\":\"82\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Healthcare informatics : the business magazine for information and communication systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICHI.2017.45\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Healthcare informatics : the business magazine for information and communication systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICHI.2017.45","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 82

摘要

在本文中，我们提出了第一个深度强化学习框架，用于从观察性医疗数据中估计最优动态治疗方案。与现有的强化学习方法相比，该框架对高维动作和状态空间更具灵活性和适应性，可以模拟异构疾病进展和治疗选择中的现实生活复杂性，目标是为医生和患者提供数据驱动的个性化决策建议。提出的深度强化学习框架包含一个监督学习步骤来预测最可能的专家行为;以及一个深度强化学习步骤来估计动态治疗方案的长期价值函数。我们在国际骨髓移植研究中心(CIBMTR)注册数据库的数据集上启动并实施了拟议的框架，重点关注急性和慢性移植物抗宿主病的预防和治疗顺序。我们展示了初步实现的结果，证明了在预测人类专家决策和强化学习步骤的初步实现方面有希望的准确性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Deep Reinforcement Learning for Dynamic Treatment Regimes on Medical Registry Data.

查看原文本刊更多论文

Deep Reinforcement Learning for Dynamic Treatment Regimes on Medical Registry Data.

In this paper, we propose the first deep reinforcement learning framework to estimate the optimal Dynamic Treatment Regimes from observational medical data. This framework is more flexible and adaptive for high dimensional action and state spaces than existing reinforcement learning methods to model real life complexity in heterogeneous disease progression and treatment choices, with the goal to provide doctor and patients the data-driven personalized decision recommendations. The proposed deep reinforcement learning framework contains a supervised learning step to predict the most possible expert actions; and a deep reinforcement learning step to estimate the long term value function of Dynamic Treatment Regimes. We motivated and implemented the proposed framework on a data set from the Center for International Bone Marrow Transplant Research (CIBMTR) registry database, focusing on the sequence of prevention and treatments for acute and chronic graft versus host disease. We showed results of the initial implementation that demonstrates promising accuracy in predicting human expert decisions and initial implementation for the reinforcement learning step.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Healthcare informatics : the business magazine for information and communication systems

自引率

0.00%

发文量