Evaluation of a machine-learning model based on laboratory parameters for the prediction of acute leukaemia subtypes: a multicentre model development and validation study in France

IF 23.8 1区医学 Q1 MEDICAL INFORMATICS

Lancet Digital Health Pub Date : 2024-04-24 DOI:10.1016/S2589-7500(24)00044-X

Vincent Alcazer MD , Grégoire Le Meur MD , Marie Roccon PharmD , Sabrina Barriere MD , Baptiste Le Calvez MD , Bouchra Badaoui MD , Agathe Spaeth PharmD , Prof Olivier Kosmider PharmD , Nicolas Freynet MD , Prof Marion Eveillard PharmD , Carolyne Croizier MD , Simon Chevalier PharmD , Prof Pierre Sujobert MD

{"title":"Evaluation of a machine-learning model based on laboratory parameters for the prediction of acute leukaemia subtypes: a multicentre model development and validation study in France","authors":"Vincent Alcazer MD , Grégoire Le Meur MD , Marie Roccon PharmD , Sabrina Barriere MD , Baptiste Le Calvez MD , Bouchra Badaoui MD , Agathe Spaeth PharmD , Prof Olivier Kosmider PharmD , Nicolas Freynet MD , Prof Marion Eveillard PharmD , Carolyne Croizier MD , Simon Chevalier PharmD , Prof Pierre Sujobert MD","doi":"10.1016/S2589-7500(24)00044-X","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><p>Acute leukaemias are life-threatening haematological cancers characterised by the infiltration of transformed immature haematopoietic cells in the blood and bone marrow. Prompt and accurate diagnosis of the three main acute leukaemia subtypes (ie acute lymphocytic leukaemia [ALL], acute myeloid leukaemia [AML], and acute promyelocytic leukaemia [APL]) is of utmost importance to guide initial treatment and prevent early mortality but requires cytological expertise that is not always available. We aimed to benchmark different machine-learning strategies using a custom variable selection algorithm to propose an extreme gradient boosting model to predict leukaemia subtypes on the basis of routine laboratory parameters.</p></div><div><h3>Methods</h3><p>This multicentre model development and validation study was conducted with data from six independent French university hospital databases. Patients aged 18 years or older diagnosed with AML, APL, or ALL in any one of these six hospital databases between March 1, 2012, and Dec 31, 2021, were recruited. 22 routine parameters were collected at the time of initial disease evaluation; variables with more than 25% of missing values in two datasets were not used for model training, leading to the final inclusion of 19 parameters. The performances of the final model were evaluated on internal testing and external validation sets with area under the receiver operating characteristic curves (AUCs), and clinically relevant cutoffs were chosen to guide clinical decision making. The final tool, Artificial Intelligence Prediction of Acute Leukemia (AI-PAL), was developed from this model.</p></div><div><h3>Findings</h3><p>1410 patients diagnosed with AML, APL, or ALL were included. Data quality control showed few missing values for each cohort, with the exception of uric acid and lactate dehydrogenase for the cohort from Hôpital Cochin. 679 patients from Hôpital Lyon Sud and Centre Hospitalier Universitaire de Clermont-Ferrand were split into the training (n=477) and internal testing (n=202) sets. 731 patients from the four other cohorts were used for external validation. Overall AUCs across all validation cohorts were 0·97 (95% CI 0·95–0·99) for APL, 0·90 (0·83–0·97) for ALL, and 0·89 (0·82–0·95) for AML. Cutoffs were then established on the overall cohort of 1410 patients to guide clinical decisions. Confident cutoffs showed two (0·14%) wrong predictions for ALL, four (0·28%) wrong predictions for APL, and three (0·21%) wrong predictions for AML. Use of the overall cutoff greatly reduced the number of missing predictions; diagnosis was proposed for 1375 (97·5%) of 1410 patients for each category, with only a slight increase in wrong predictions. The final model evaluation across both the internal testing and external validation sets showed accuracy of 99·5% for ALL diagnosis, 98·8% for AML diagnosis, and 99·7% for APL diagnosis in the confident model and accuracy of 87·9% for ALL diagnosis, 86·3% for AML diagnosis, and 96·1% for APL diagnosis in the overall model.</p></div><div><h3>Interpretation</h3><p>AI-PAL allowed for accurate diagnosis of the three main acute leukaemia subtypes. Based on ten simple laboratory parameters, its broad availability could help guide initial therapies in a context where cytological expertise is lacking, such as in low-income countries.</p></div><div><h3>Funding</h3><p>None.</p></div>","PeriodicalId":48534,"journal":{"name":"Lancet Digital Health","volume":"6 5","pages":"Pages e323-e333"},"PeriodicalIF":23.8000,"publicationDate":"2024-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S258975002400044X/pdfft?md5=98d113fd5143121b2cc564451bf756e5&pid=1-s2.0-S258975002400044X-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Lancet Digital Health","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S258975002400044X","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}

引用次数: 0

Abstract

Background

Acute leukaemias are life-threatening haematological cancers characterised by the infiltration of transformed immature haematopoietic cells in the blood and bone marrow. Prompt and accurate diagnosis of the three main acute leukaemia subtypes (ie acute lymphocytic leukaemia [ALL], acute myeloid leukaemia [AML], and acute promyelocytic leukaemia [APL]) is of utmost importance to guide initial treatment and prevent early mortality but requires cytological expertise that is not always available. We aimed to benchmark different machine-learning strategies using a custom variable selection algorithm to propose an extreme gradient boosting model to predict leukaemia subtypes on the basis of routine laboratory parameters.

Methods

This multicentre model development and validation study was conducted with data from six independent French university hospital databases. Patients aged 18 years or older diagnosed with AML, APL, or ALL in any one of these six hospital databases between March 1, 2012, and Dec 31, 2021, were recruited. 22 routine parameters were collected at the time of initial disease evaluation; variables with more than 25% of missing values in two datasets were not used for model training, leading to the final inclusion of 19 parameters. The performances of the final model were evaluated on internal testing and external validation sets with area under the receiver operating characteristic curves (AUCs), and clinically relevant cutoffs were chosen to guide clinical decision making. The final tool, Artificial Intelligence Prediction of Acute Leukemia (AI-PAL), was developed from this model.

Findings

1410 patients diagnosed with AML, APL, or ALL were included. Data quality control showed few missing values for each cohort, with the exception of uric acid and lactate dehydrogenase for the cohort from Hôpital Cochin. 679 patients from Hôpital Lyon Sud and Centre Hospitalier Universitaire de Clermont-Ferrand were split into the training (n=477) and internal testing (n=202) sets. 731 patients from the four other cohorts were used for external validation. Overall AUCs across all validation cohorts were 0·97 (95% CI 0·95–0·99) for APL, 0·90 (0·83–0·97) for ALL, and 0·89 (0·82–0·95) for AML. Cutoffs were then established on the overall cohort of 1410 patients to guide clinical decisions. Confident cutoffs showed two (0·14%) wrong predictions for ALL, four (0·28%) wrong predictions for APL, and three (0·21%) wrong predictions for AML. Use of the overall cutoff greatly reduced the number of missing predictions; diagnosis was proposed for 1375 (97·5%) of 1410 patients for each category, with only a slight increase in wrong predictions. The final model evaluation across both the internal testing and external validation sets showed accuracy of 99·5% for ALL diagnosis, 98·8% for AML diagnosis, and 99·7% for APL diagnosis in the confident model and accuracy of 87·9% for ALL diagnosis, 86·3% for AML diagnosis, and 96·1% for APL diagnosis in the overall model.

Interpretation

AI-PAL allowed for accurate diagnosis of the three main acute leukaemia subtypes. Based on ten simple laboratory parameters, its broad availability could help guide initial therapies in a context where cytological expertise is lacking, such as in low-income countries.

Funding

None.

查看原文本刊更多论文

基于实验室参数的机器学习模型对急性白血病亚型预测的评估：法国多中心模型开发和验证研究

背景急性白血病是一种危及生命的血液肿瘤，其特点是血液和骨髓中的未成熟造血细胞浸润转化。及时准确地诊断三种主要的急性白血病亚型（即急性淋巴细胞白血病、急性髓性白血病和急性早幼粒细胞白血病）对于指导初始治疗和预防早期死亡至关重要，但这需要细胞学方面的专业知识，而这些专业知识并非总能获得。我们的目标是使用自定义变量选择算法对不同的机器学习策略进行基准测试，以提出一种极端梯度提升模型，根据常规实验室参数预测白血病亚型。研究人员招募了 2012 年 3 月 1 日至 2021 年 12 月 31 日期间在这六家医院数据库中任何一家医院确诊为 AML、APL 或 ALL 的 18 岁或以上患者。初始疾病评估时收集了 22 个常规参数；两个数据集中缺失值超过 25% 的变量未用于模型训练，因此最终纳入了 19 个参数。最终模型的性能在内部测试集和外部验证集上用接收器操作特征曲线下面积（AUC）进行了评估，并选择了与临床相关的临界值来指导临床决策。根据该模型开发了最终工具--急性白血病人工智能预测模型（AI-PAL）。数据质量控制显示，除科钦医院队列中的尿酸和乳酸脱氢酶外，每个队列中几乎没有缺失值。里昂南方医院和克莱蒙费朗大学中心医院的679名患者被分为训练组（477人）和内部测试组（202人）。来自其他四个队列的 731 名患者被用于外部验证。所有验证队列的总AUC分别为：APL 0-97（95% CI 0-95-0-99），ALL 0-90（0-83-0-97），AML 0-89（0-82-0-95）。然后在 1410 例患者的总体队列中确定了临界值，以指导临床决策。可靠的临界值显示，对 ALL 的错误预测有 2 次（0-14%），对 APL 的错误预测有 4 次（0-28%），对 AML 的错误预测有 3 次（0-21%）。使用总体临界值大大减少了预测缺失的数量；在每个类别的 1410 例患者中，有 1375 例（97-5%）提出了诊断，错误预测仅略有增加。对内部测试集和外部验证集进行的最终模型评估显示，自信模型对 ALL 诊断的准确率为 99-5%，对 AML 诊断的准确率为 98-8%，对 APL 诊断的准确率为 99-7%；整体模型对 ALL 诊断的准确率为 87-9%，对 AML 诊断的准确率为 86-3%，对 APL 诊断的准确率为 96-1%。基于十个简单的实验室参数，在缺乏细胞学专业知识的情况下，如在低收入国家，它的广泛应用有助于指导初始治疗。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Lancet Digital Health Multiple-

CiteScore

41.20

自引率

1.60%

发文量

232

审稿时长

13 weeks

期刊介绍： The Lancet Digital Health publishes important, innovative, and practice-changing research on any topic connected with digital technology in clinical medicine, public health, and global health. The journal’s open access content crosses subject boundaries, building bridges between health professionals and researchers.By bringing together the most important advances in this multidisciplinary field,The Lancet Digital Health is the most prominent publishing venue in digital health. We publish a range of content types including Articles,Review, Comment, and Correspondence, contributing to promoting digital technologies in health practice worldwide.