Validation of the ELN2024 and Mayo Genetic Risk Models in an External Cohort of 352 Patients With Newly Diagnosed AML Receiving Less-Intensive Therapies

IF 9.9 1区医学 Q1 HEMATOLOGY

American Journal of Hematology Pub Date : 2025-07-15 DOI:10.1002/ajh.70003

Marco Roncador, Maddalena Marconato, Jeremy Werner Deuel, Stefan Balabanov

{"title":"Validation of the ELN2024 and Mayo Genetic Risk Models in an External Cohort of 352 Patients With Newly Diagnosed AML Receiving Less-Intensive Therapies","authors":"Marco Roncador, Maddalena Marconato, Jeremy Werner Deuel, Stefan Balabanov","doi":"10.1002/ajh.70003","DOIUrl":null,"url":null,"abstract":"Risk stratification is critical for guiding therapeutic decisions and predicting outcomes in acute myeloid leukemia (AML). Current widely used risk models, such as the European LeukemiaNet (ELN) genetic risk classification, were predominantly developed based on cohorts treated with intensive chemotherapy [1]. However, when applied to patients receiving less-intensive regimens, these systems have often demonstrated limited discriminatory power, frequently skewing patients toward adverse-risk categories [2].Recently, Gangat et al. [3] proposed a new Mayo Genetic Risk Model in the American Journal of Hematology, specifically designed to predict outcomes in newly diagnosed AML patients treated with venetoclax combined with hypomethylating agents (Ven-HMA). Alongside the updated ELN2024 classification, which was adapted to better stratify patients receiving non-intensive therapies, these models represent a new generation of tools tailored for patients ineligible for intensive chemotherapy. Nevertheless, their applicability to other non-intensive treatment regimens, such as low-dose cytarabine, remains unclear.In the present study, we conducted an independent validation of the Mayo Genetic Risk Model and the ELN2024 classification outside the context of hypomethylating agent (HMA)-based treatments, using a publicly available dataset from Tazi et al. [1], and compared their prognostic performance with that of the established ELN2022 model.We analyzed a cohort of 352 AML patients treated with non-intensive regimens, enrolled across several UK-based NCRI clinical trials, including AML11, AML12, AML14, AML15, AML16, and AML LI1. Specifically, the cohort comprised 162 patients from the AML16 trial and 190 from the remaining studies, with the majority of the latter originating from AML15 and AML LI1 (exact numbers unavailable). Available data included age, sex, complete blood counts, bone marrow blast percentage, genetic and cytogenetic profiles, and WHO 2016 [4] diagnostic classifications. Treatment details at the individual level were not accessible. All patients in the AML16 trial (n = 162, 46%) and AML LI1 received low-dose cytarabine [5], either alone or in combination with investigational agents. Non-intensively treated participants in AML15 received gemtuzumab ozogamicin monotherapy. Risk scores were reconstructed using R software (v4.3.1), and survival differences between groups were assessed with Cox proportional hazards models and visualized via Kaplan–Meier plots.Table 1 summarizes the patient characteristics: the median age was 75.6 years, with a slight male predominance (57.1%) and a modestly decreased ability to perform activities of daily living (ECOG 1 in 52.6% of patients). The most frequent diagnoses (WHO 2016 classification) were AML not otherwise specified (AML NOS; 37.8%), AML with NPM1 mutation (23.0%), and AML with RUNX1 mutation (20.2%). The most mutated gene in the cohort was TET2 (111, 31.5%), followed by SRSF2 (97, 27.6%) and RUNX1 (88, 25.0%); in comparison with the cohort employed to develop the Mayo Risk Score, we observed a significantly lower frequency of TP53 mutations and increased prevalence of myelodysplasia-related mutations such as DNMT3A, TET2, and SFRSF2 (Table S1). A complex karyotype was identified in 16.4% of patients. Most patients were originally stratified as high risk according to the ELN2017 (56.5%) guidelines.We reconstructed the Mayo Genetic Risk, ELN2022, and ELN2024 scores for all patients. The Risk group assignments according to the two classifications explicitly designed for non-intensively treated patients (ELN2024 and Mayo Genetic Risk) overlapped in 166 subjects: 51 were classified as adverse risk, 87 as intermediate risk, and 28 as favorable risk. Notably, the Mayo Genetic Risk Model tended to classify patients toward higher-risk categories: 153 patients (84.5%) initially deemed favorable by ELN2024 were reclassified as intermediate or adverse by the Mayo model (McNemar's Chi-squared p-value < 0.001, degrees of freedom [df] = 3). We considered including VEN-PRS [6], a third risk score designed for non-intensively treated AML patients; however, we ultimately excluded it from the comparative analysis due to the lack of critical clinical information such as extramedullary AML involvement and prior AML treatments in our cohort.All three tested models, Mayo, ELN2022, and ELN2024 classifications significantly distinguished groups with different overall survival (OS) (Cox proportional hazards p = 0.0035 for Mayo Genetic Risk Score; p < 0.001 for ELN2024 (Figure 1); p = 0.0048 for ELN2022). However, risk groups derived from ELN2022 exhibited inconsistent behavior: patients classified as favorable had slightly shorter median OS (5.44 months) than those classified as intermediate (5.62 months), and their survival curves closely resembled those of adverse-risk patients (Figure S1). For these reasons, ELN2022 was not further investigated.Median OS estimates according to the Mayo and ELN2024 prognostic models are summarized in Table 1. Remarkably, observed OS in the current cohort was considerably shorter across all risk categories compared to the outcomes initially reported in the development studies of these models. For the Mayo model, median OS was 6.28 months for the favorable risk group (originally not reached), 4.99 months for intermediate risk (originally 19.1 months), and 2.92 months for the adverse risk group. Similarly, using the ELN2024 classification, median OS was 6.21 months versus the originally reported 23–39 months for favorable risk, 3.61 months versus 12–13 months for intermediate risk, and 2.46 months versus 5–8 months for adverse risk groups.Finally, we evaluated model performance using the area under the receiver operating characteristic curve (AUC), concordance index (C-index), the log-likelihood ratio, and corrected Akaike Information Criterion (AIC). Both models demonstrated limited discriminatory power. The Mayo model achieved an AUC of 0.544 and a C-index of 0.544. The ELN2024 classification demonstrated slightly improved predictive performance compared to the Mayo model, with a C-index of 0.572 and an AUC of 0.555. This superior predictive ability of the ELN2024 model was further supported by a significantly higher log-likelihood value (log-likelihood = 27.81, p < 0.001) compared to the Mayo model (log-likelihood = 10.56, p = 0.005). Model parsimony was comparable between the two approaches, as indicated by similar AIC values of 3397.7 (Mayo) and 3380.5 (ELN2024), suggesting that the inclusion of additional variables in the Mayo Genetic Risk Model—such as KMT2A rearrangements, IDH1 mutation status, and karyotype—did not lead to increased model complexity or overfitting.The survival outcomes observed in our cohort were notably poorer than those originally reported during the development of both the Mayo Genetic Risk Model and the ELN2024 classification. This discrepancy likely arises from the lower efficacy of treatment regimens lacking venetoclax in our patient cohort. Venetoclax, when combined with HMA, has demonstrated a strong synergistic effect, substantially improving response rates and survival outcomes compared to HMA monotherapy [7].Despite the shorter median OS observed in our cohort compared to the original studies, both the Mayo Genetic Risk Model and the ELN2024 classification consistently stratified patients into prognostic groups with decreasing OS. This risk-aligned survival gradient reinforces the internal validity of both models, even in a population not treated with HMA. Their overall discriminatory ability was, however, limited, as reflected by low C-index and AUC values. The marginally better performance of the ELN2024 classification compared to the Mayo model may be attributed to its broader validation across different non-intensive regimens, but even this advantage was modest. The ELN2022 model performed poorly, with paradoxical survival patterns among favorable and intermediate-risk groups, reaffirming that older risk systems derived from cohorts of patients treated with intensive chemotherapy are unsuitable for patients receiving low-intensity treatments.In conclusion, our findings support the clinical utility of both the Mayo Genetic Risk Model and the ELN2024 classification beyond their originally intended context of HMA and venetoclax-based therapy and suggest that these models may be preferable to the ELN2022 classification for risk stratification in patients receiving non-intensive regimens. As anticipated, survival estimates within risk categories may not be directly comparable to those derived from cohorts treated with hypomethylating agents. Further independent validation studies in bigger, well-characterized patient cohorts, stratified by specific nonintensive treatment regimens, are necessary to refine the clinical applicability of these scores and support a broader implementation in clinical practice.M.R. and S.B. conceived and designed the study. M.R. acquired the available data and performed statistical analyses. M.R., M.M., J.W.D., and S.B. interpreted the results, drafted the manuscript, and critically revised it. All authors approved the final version for submission.All the studies referred to in the present work were conducted in accordance with the Declaration of Helsinki.The authors declare no conflicts of interest.","PeriodicalId":7724,"journal":{"name":"American Journal of Hematology","volume":"100 10","pages":"1861-1864"},"PeriodicalIF":9.9000,"publicationDate":"2025-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ajh.70003","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"American Journal of Hematology","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/ajh.70003","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEMATOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Risk stratification is critical for guiding therapeutic decisions and predicting outcomes in acute myeloid leukemia (AML). Current widely used risk models, such as the European LeukemiaNet (ELN) genetic risk classification, were predominantly developed based on cohorts treated with intensive chemotherapy [1]. However, when applied to patients receiving less-intensive regimens, these systems have often demonstrated limited discriminatory power, frequently skewing patients toward adverse-risk categories [2].

Recently, Gangat et al. [3] proposed a new Mayo Genetic Risk Model in the American Journal of Hematology, specifically designed to predict outcomes in newly diagnosed AML patients treated with venetoclax combined with hypomethylating agents (Ven-HMA). Alongside the updated ELN2024 classification, which was adapted to better stratify patients receiving non-intensive therapies, these models represent a new generation of tools tailored for patients ineligible for intensive chemotherapy. Nevertheless, their applicability to other non-intensive treatment regimens, such as low-dose cytarabine, remains unclear.

In the present study, we conducted an independent validation of the Mayo Genetic Risk Model and the ELN2024 classification outside the context of hypomethylating agent (HMA)-based treatments, using a publicly available dataset from Tazi et al. [1], and compared their prognostic performance with that of the established ELN2022 model.

We analyzed a cohort of 352 AML patients treated with non-intensive regimens, enrolled across several UK-based NCRI clinical trials, including AML11, AML12, AML14, AML15, AML16, and AML LI1. Specifically, the cohort comprised 162 patients from the AML16 trial and 190 from the remaining studies, with the majority of the latter originating from AML15 and AML LI1 (exact numbers unavailable). Available data included age, sex, complete blood counts, bone marrow blast percentage, genetic and cytogenetic profiles, and WHO 2016 [4] diagnostic classifications. Treatment details at the individual level were not accessible. All patients in the AML16 trial (n = 162, 46%) and AML LI1 received low-dose cytarabine [5], either alone or in combination with investigational agents. Non-intensively treated participants in AML15 received gemtuzumab ozogamicin monotherapy. Risk scores were reconstructed using R software (v4.3.1), and survival differences between groups were assessed with Cox proportional hazards models and visualized via Kaplan–Meier plots.

Table 1 summarizes the patient characteristics: the median age was 75.6 years, with a slight male predominance (57.1%) and a modestly decreased ability to perform activities of daily living (ECOG 1 in 52.6% of patients). The most frequent diagnoses (WHO 2016 classification) were AML not otherwise specified (AML NOS; 37.8%), AML with NPM1 mutation (23.0%), and AML with RUNX1 mutation (20.2%). The most mutated gene in the cohort was TET2 (111, 31.5%), followed by SRSF2 (97, 27.6%) and RUNX1 (88, 25.0%); in comparison with the cohort employed to develop the Mayo Risk Score, we observed a significantly lower frequency of TP53 mutations and increased prevalence of myelodysplasia-related mutations such as DNMT3A, TET2, and SFRSF2 (Table S1). A complex karyotype was identified in 16.4% of patients. Most patients were originally stratified as high risk according to the ELN2017 (56.5%) guidelines.

We reconstructed the Mayo Genetic Risk, ELN2022, and ELN2024 scores for all patients. The Risk group assignments according to the two classifications explicitly designed for non-intensively treated patients (ELN2024 and Mayo Genetic Risk) overlapped in 166 subjects: 51 were classified as adverse risk, 87 as intermediate risk, and 28 as favorable risk. Notably, the Mayo Genetic Risk Model tended to classify patients toward higher-risk categories: 153 patients (84.5%) initially deemed favorable by ELN2024 were reclassified as intermediate or adverse by the Mayo model (McNemar's Chi-squared p-value < 0.001, degrees of freedom [df] = 3). We considered including VEN-PRS [6], a third risk score designed for non-intensively treated AML patients; however, we ultimately excluded it from the comparative analysis due to the lack of critical clinical information such as extramedullary AML involvement and prior AML treatments in our cohort.

All three tested models, Mayo, ELN2022, and ELN2024 classifications significantly distinguished groups with different overall survival (OS) (Cox proportional hazards p = 0.0035 for Mayo Genetic Risk Score; p < 0.001 for ELN2024 (Figure 1); p = 0.0048 for ELN2022). However, risk groups derived from ELN2022 exhibited inconsistent behavior: patients classified as favorable had slightly shorter median OS (5.44 months) than those classified as intermediate (5.62 months), and their survival curves closely resembled those of adverse-risk patients (Figure S1). For these reasons, ELN2022 was not further investigated.

Median OS estimates according to the Mayo and ELN2024 prognostic models are summarized in Table 1. Remarkably, observed OS in the current cohort was considerably shorter across all risk categories compared to the outcomes initially reported in the development studies of these models. For the Mayo model, median OS was 6.28 months for the favorable risk group (originally not reached), 4.99 months for intermediate risk (originally 19.1 months), and 2.92 months for the adverse risk group. Similarly, using the ELN2024 classification, median OS was 6.21 months versus the originally reported 23–39 months for favorable risk, 3.61 months versus 12–13 months for intermediate risk, and 2.46 months versus 5–8 months for adverse risk groups.

Finally, we evaluated model performance using the area under the receiver operating characteristic curve (AUC), concordance index (C-index), the log-likelihood ratio, and corrected Akaike Information Criterion (AIC). Both models demonstrated limited discriminatory power. The Mayo model achieved an AUC of 0.544 and a C-index of 0.544. The ELN2024 classification demonstrated slightly improved predictive performance compared to the Mayo model, with a C-index of 0.572 and an AUC of 0.555. This superior predictive ability of the ELN2024 model was further supported by a significantly higher log-likelihood value (log-likelihood = 27.81, p < 0.001) compared to the Mayo model (log-likelihood = 10.56, p = 0.005). Model parsimony was comparable between the two approaches, as indicated by similar AIC values of 3397.7 (Mayo) and 3380.5 (ELN2024), suggesting that the inclusion of additional variables in the Mayo Genetic Risk Model—such as KMT2A rearrangements, IDH1 mutation status, and karyotype—did not lead to increased model complexity or overfitting.

The survival outcomes observed in our cohort were notably poorer than those originally reported during the development of both the Mayo Genetic Risk Model and the ELN2024 classification. This discrepancy likely arises from the lower efficacy of treatment regimens lacking venetoclax in our patient cohort. Venetoclax, when combined with HMA, has demonstrated a strong synergistic effect, substantially improving response rates and survival outcomes compared to HMA monotherapy [7].

Despite the shorter median OS observed in our cohort compared to the original studies, both the Mayo Genetic Risk Model and the ELN2024 classification consistently stratified patients into prognostic groups with decreasing OS. This risk-aligned survival gradient reinforces the internal validity of both models, even in a population not treated with HMA. Their overall discriminatory ability was, however, limited, as reflected by low C-index and AUC values. The marginally better performance of the ELN2024 classification compared to the Mayo model may be attributed to its broader validation across different non-intensive regimens, but even this advantage was modest. The ELN2022 model performed poorly, with paradoxical survival patterns among favorable and intermediate-risk groups, reaffirming that older risk systems derived from cohorts of patients treated with intensive chemotherapy are unsuitable for patients receiving low-intensity treatments.

In conclusion, our findings support the clinical utility of both the Mayo Genetic Risk Model and the ELN2024 classification beyond their originally intended context of HMA and venetoclax-based therapy and suggest that these models may be preferable to the ELN2022 classification for risk stratification in patients receiving non-intensive regimens. As anticipated, survival estimates within risk categories may not be directly comparable to those derived from cohorts treated with hypomethylating agents. Further independent validation studies in bigger, well-characterized patient cohorts, stratified by specific nonintensive treatment regimens, are necessary to refine the clinical applicability of these scores and support a broader implementation in clinical practice.

M.R. and S.B. conceived and designed the study. M.R. acquired the available data and performed statistical analyses. M.R., M.M., J.W.D., and S.B. interpreted the results, drafted the manuscript, and critically revised it. All authors approved the final version for submission.

All the studies referred to in the present work were conducted in accordance with the Declaration of Helsinki.

The authors declare no conflicts of interest.

Abstract Image

查看原文本刊更多论文

ELN2024和Mayo遗传风险模型在352例接受低强度治疗的新诊断AML患者外部队列中的验证

表1总结了Mayo和ELN2024预后模型的中位OS估计值。值得注意的是，与这些模型发展研究中最初报告的结果相比，当前队列中观察到的OS在所有风险类别中都要短得多。对于Mayo模型，有利风险组（最初未达到）的中位OS为6.28个月，中等风险组（最初为19.1个月）的中位OS为4.99个月，不良风险组的中位OS为2.92个月。同样，使用ELN2024分类，有利风险组的中位OS为6.21个月，而最初报道的23-39个月，中等风险组的中位OS为3.61个月，而12-13个月，不良风险组的中位OS为2.46个月，而5-8个月。最后，我们使用受试者工作特征曲线下面积（AUC）、一致性指数（C-index）、对数似然比和修正的Akaike信息准则（AIC）来评估模型的性能。两种模式都显示出有限的歧视权力。Mayo模型的AUC为0.544，C-index为0.544。与Mayo模型相比，ELN2024分类的预测性能略有提高，c指数为0.572，AUC为0.555。与Mayo模型（log-likelihood = 10.56, p = 0.005）相比，ELN2024模型的对数似然值（log-likelihood = 27.81, p < 0.001）显著更高，进一步支持了ELN2024模型优越的预测能力。两种方法的模型简约性具有可比性，AIC值分别为3397.7 （Mayo）和3380.5 (ELN2024)，这表明在Mayo遗传风险模型中加入额外的变量（如KMT2A重排、IDH1突变状态和核型）不会导致模型复杂性增加或过拟合。在我们的队列中观察到的生存结果明显低于Mayo遗传风险模型和ELN2024分类发展过程中最初报道的结果。这种差异可能是由于在我们的患者队列中，缺乏维托clax的治疗方案的疗效较低。与HMA单药相比，Venetoclax联合HMA已显示出强大的协同效应，显着提高了缓解率和生存结果。尽管与原始研究相比，我们的队列中观察到的中位生存期较短，但Mayo遗传风险模型和ELN2024分类一致地将患者分为生存期降低的预后组。这种与风险相关的生存梯度强化了两种模型的内部有效性，即使在未接受HMA治疗的人群中也是如此。然而，它们的总体区分能力有限，c指数和AUC值较低。与Mayo模型相比，ELN2024分类的性能略好，这可能归因于它在不同非强化方案中得到了更广泛的验证，但即使是这种优势也是有限的。ELN2022模型表现不佳，在有利和中等风险组中存在矛盾的生存模式，重申了来自接受强化化疗的患者队列的旧风险系统不适合接受低强度治疗的患者。总之，我们的研究结果支持Mayo遗传风险模型和ELN2024分类的临床应用，超出了它们最初的HMA和venetoclax为基础的治疗背景，并表明这些模型可能比ELN2022分类更适合接受非强化方案的患者进行风险分层。正如预期的那样，风险类别内的生存估计可能无法与使用低甲基化药物治疗的队列的生存估计直接比较。有必要在更大的、特征明确的患者队列中进行进一步的独立验证研究，并按特定的非强化治疗方案分层，以完善这些评分的临床适用性，并支持在临床实践中更广泛的实施。和S.B.构思并设计了这项研究。磁共振获得可用数据并进行统计分析。m.r., m.m., j.w.d.和S.B.解释了结果，起草了手稿，并对其进行了严格的修改。所有作者都同意提交最终版本。本工作中提到的所有研究都是按照《赫尔辛基宣言》进行的。作者声明无利益冲突。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

American Journal of Hematology 医学-血液学

CiteScore

15.70

自引率

3.90%

发文量

363

审稿时长

3-6 weeks

期刊介绍： The American Journal of Hematology offers extensive coverage of experimental and clinical aspects of blood diseases in humans and animal models. The journal publishes original contributions in both non-malignant and malignant hematological diseases, encompassing clinical and basic studies in areas such as hemostasis, thrombosis, immunology, blood banking, and stem cell biology. Clinical translational reports highlighting innovative therapeutic approaches for the diagnosis and treatment of hematological diseases are actively encouraged.The American Journal of Hematology features regular original laboratory and clinical research articles, brief research reports, critical reviews, images in hematology, as well as letters and correspondence.