从医学数据中半自动提取预测和临床有意义的时间特征用于预测建模的路线图

Q1 Social Sciences

Global Transitions Pub Date : 2019-01-01 DOI:10.1016/j.glt.2018.11.001

Gang Luo

{"title":"从医学数据中半自动提取预测和临床有意义的时间特征用于预测建模的路线图","authors":"Gang Luo","doi":"10.1016/j.glt.2018.11.001","DOIUrl":null,"url":null,"abstract":"<div><p>Predictive modeling based on machine learning with medical data has great potential to improve healthcare and reduce costs. However, two hurdles, among others, impede its widespread adoption in healthcare. First, medical data are by nature longitudinal. Pre-processing them, particularly for feature engineering, is labor intensive and often takes 50–80% of the model building effort. Predictive temporal features are the basis of building accurate models, but are difficult to identify. This is problematic. Healthcare systems have limited resources for model building, while inaccurate models produce suboptimal outcomes and are often useless. Second, most machine learning models provide no explanation of their prediction results. However, offering such explanations is essential for a model to be used in usual clinical practice. To address these two hurdles, this paper outlines: 1) a data-driven method for semi-automatically extracting predictive and clinically meaningful temporal features from medical data for predictive modeling; and 2) a method of using these features to automatically explain machine learning prediction results and suggest tailored interventions. This provides a roadmap for future research.</p></div>","PeriodicalId":33615,"journal":{"name":"Global Transitions","volume":"1 ","pages":"Pages 61-82"},"PeriodicalIF":0.0000,"publicationDate":"2019-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.glt.2018.11.001","citationCount":"15","resultStr":"{\"title\":\"A roadmap for semi-automatically extracting predictive and clinically meaningful temporal features from medical data for predictive modeling\",\"authors\":\"Gang Luo\",\"doi\":\"10.1016/j.glt.2018.11.001\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Predictive modeling based on machine learning with medical data has great potential to improve healthcare and reduce costs. However, two hurdles, among others, impede its widespread adoption in healthcare. First, medical data are by nature longitudinal. Pre-processing them, particularly for feature engineering, is labor intensive and often takes 50–80% of the model building effort. Predictive temporal features are the basis of building accurate models, but are difficult to identify. This is problematic. Healthcare systems have limited resources for model building, while inaccurate models produce suboptimal outcomes and are often useless. Second, most machine learning models provide no explanation of their prediction results. However, offering such explanations is essential for a model to be used in usual clinical practice. To address these two hurdles, this paper outlines: 1) a data-driven method for semi-automatically extracting predictive and clinically meaningful temporal features from medical data for predictive modeling; and 2) a method of using these features to automatically explain machine learning prediction results and suggest tailored interventions. This provides a roadmap for future research.</p></div>\",\"PeriodicalId\":33615,\"journal\":{\"name\":\"Global Transitions\",\"volume\":\"1 \",\"pages\":\"Pages 61-82\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1016/j.glt.2018.11.001\",\"citationCount\":\"15\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Global Transitions\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2589791819300015\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Social Sciences\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Global Transitions","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2589791819300015","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Social Sciences","Score":null,"Total":0}

引用次数: 15

摘要

基于医疗数据的机器学习的预测建模在改善医疗保健和降低成本方面具有巨大的潜力。然而，有两个障碍阻碍了它在医疗保健领域的广泛采用。首先，医学数据本质上是纵向的。预处理它们，特别是特征工程，是劳动密集型的，通常需要50-80%的模型构建工作。预测时间特征是建立准确模型的基础，但很难识别。这是有问题的。医疗保健系统用于模型构建的资源有限，而不准确的模型会产生次优结果，而且通常是无用的。其次，大多数机器学习模型不提供对其预测结果的解释。然而，提供这样的解释对于一个模型在通常的临床实践中使用是必不可少的。为了解决这两个障碍，本文概述了:1)一种数据驱动的方法，用于从医疗数据中半自动提取预测和临床有意义的时间特征，用于预测建模;2)利用这些特征自动解释机器学习预测结果并提出量身定制的干预措施的方法。这为未来的研究提供了路线图。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

A roadmap for semi-automatically extracting predictive and clinically meaningful temporal features from medical data for predictive modeling

查看原文本刊更多论文

A roadmap for semi-automatically extracting predictive and clinically meaningful temporal features from medical data for predictive modeling

Predictive modeling based on machine learning with medical data has great potential to improve healthcare and reduce costs. However, two hurdles, among others, impede its widespread adoption in healthcare. First, medical data are by nature longitudinal. Pre-processing them, particularly for feature engineering, is labor intensive and often takes 50–80% of the model building effort. Predictive temporal features are the basis of building accurate models, but are difficult to identify. This is problematic. Healthcare systems have limited resources for model building, while inaccurate models produce suboptimal outcomes and are often useless. Second, most machine learning models provide no explanation of their prediction results. However, offering such explanations is essential for a model to be used in usual clinical practice. To address these two hurdles, this paper outlines: 1) a data-driven method for semi-automatically extracting predictive and clinically meaningful temporal features from medical data for predictive modeling; and 2) a method of using these features to automatically explain machine learning prediction results and suggest tailored interventions. This provides a roadmap for future research.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Global Transitions Social Sciences-Development

CiteScore

18.90

自引率

0.00%

发文量

审稿时长

20 weeks