David Starostka, Richard Dolezilek, Hans Michael Kvasnicka, Milos Kudelka, Petra Miczkova, Eva Kriegova, David Kolacek, Barbora Sotkovska, Tomas Anlauf, Jarmila Juranova, Katerina Chasakova, Sona Kolarova, Michael Paprota, David Buffa, Peter Kovac, Vit Zmatlo
{"title":"The utility of automated artificial intelligence-assisted digital cytomorphology for bone marrow analysis in diagnostic haemato-oncology","authors":"David Starostka, Richard Dolezilek, Hans Michael Kvasnicka, Milos Kudelka, Petra Miczkova, Eva Kriegova, David Kolacek, Barbora Sotkovska, Tomas Anlauf, Jarmila Juranova, Katerina Chasakova, Sona Kolarova, Michael Paprota, David Buffa, Peter Kovac, Vit Zmatlo","doi":"10.1002/ctm2.70364","DOIUrl":null,"url":null,"abstract":"<p>Dear Editor,</p><p>In our research, we showcased that AI-driven automated digital morphology (ADM), with its innovative categorisation options and ability to visually represent cellular contexts, opens up unprecedented avenues for bone marrow (BM) cellular classification in haemato-oncology, supporting clinical decisions. However, despite its remarkable image quality and clinical consistency, particularly in reactive haemopoiesis and myeloproliferative and myelodysplastic neoplasms, certain neoplastic BM cells are notably challenging for ADM to classify, which can significantly affect BM cytomorphological diagnosis.</p><p>Correct cytomorphological evaluation of BM smears remains a cornerstone of diagnostics in haematology with critical clinical impact.<span><sup>1, 2</sup></span> Current expert optical microscopy is limited by substantial subjective interobserver variability and the need for highly skilled cytomorphologists. Furthermore, expert findings may not be considered definitive or the only possible correct results, especially within closely related borderline categories or when dealing with ambiguous cellular classifications. Therefore, innovative objective digital technologies for BM cytomorphology are vitally needed.<span><sup>1-4</sup></span> There is limited data on the utility of ADM in this field.<span><sup>3, 4</sup></span></p><p>Our study of a real-world cohort of 328 BM smears of European ancestry patients aims to comprehensively assess the effectiveness, reliability, and limitations of AI-assisted ADM (Morphogo system) by comparing it with expert optical microscopy (Figure 1). The cases were divided into six diagnostic groups: myelodysplastic neoplasms (MDN: 15%), multiple myeloma (MM: 14%), mature B/T-cell neoplasms (B/T-lymphoma: 13%), acute leukaemia and chronic myelomonocytic leukaemia (AL+CMML: 9%), myeloproliferative neoplasms (MPN: 8%), and reactive haemopoiesis (reactive: 41%). High-resolution digital images (magnification 1000×) of 500 BM nucleated cells were acquired. ADM's cell recognition (classification) capabilities were evaluated by asking it to correctly classify cells into one of 25 categories, compared with independent consensual expertise with double reading. The percentage of relevant correctly classified cells out of all classified cells was 95.4%. Satisfactory correlation values, as indicated by the Matthews correlation coefficient (MCC), were .400 or above for 22 out of 25 (88.0%) cell types; unsatisfactory MCC values below .400 were noted for 3 out of 25 (12%) cell types (lymphoblasts, prolymphocytes, and promonocytes) (Figure S4).</p><p>Next, we evaluated the percentage of correctly classified cells in individual patients for clinical consistency. The overall relevant clinical consistency was 97.1% (median), with 94.5% of cases showing consistency rates between 80% and 100%. In 5.5% of patients, critical misclassification was noted, with relevant clinical consistency values ranging from 36% to 79%, due to failure to recognise neoplastic cells. Errors included misclassifying neoplastic lymphocytes as blasts; small lymphoblasts/myeloblasts as lymphocytes; granular monoblasts as promyelocytes; and dysplastic monocytic lineage elements, plasmablasts, and immature plasma cells, all exhibiting atypical cytomorphology (Figure 2). Within the B/T-lymphoma group, misclassified lymphocytes were medium-sized, featuring abundant basophilic or pale cytoplasm with projections and finer chromatin, sometimes with a nucleolus or presenting a blastoid appearance, unlike correctly classified neoplastic cells. In the AL+CMML group, the misclassified lymphoblasts/myeloblasts were small, displayed a high nucleoplasmic ratio, and had coarser chromatin and nucleoli. Numerous granular monoblasts were misclassified as promyelocytes. In CMML, misclassifications involved the monocytic lineage. In MM, misclassifications were noted in the plasma cell lineage (plasmablasts and immature plasma cells) (Figure 2). The patient similarity network<span><sup>5</sup></span> was constructed to visualise differences between patients and highlight cases with misclassifications (Figure 3). Patients with critical misclassifications with misdiagnosis potential were identified only in three diagnostic categories: AL+CMML, MM, and B/T-lymphoma. None were found in the MPN, MDN, or reactive groups (Figures 2 and 3). In addition to cytomorphological overlap, misclassifications likely reflected the rarity of some diagnoses and expert inconsistency for certain cells, resulting in incorrect annotations and gaps in training data.</p><p>Furthermore, we compared the numerical results of the myelogram from conventional analysis with those obtained from expert ADM, as these are crucial for diagnostics. The most significant numerical differences in the representation of diagnostically relevant cells (blasts, monocytes, lymphocytes, and plasma cells) were observed in AL+CMML for blast counts, AL+CMML for monocyte counts, B/T-lymphoma for lymphocyte counts, and MM for plasma cell counts (Figure 4). This discovery prompts a thought-provoking inquiry into the precision of measuring essential diagnostic features and the overall legitimacy of expert assessments. The rigorously standardised procedure for automatically selecting the adaptive area for 1000× immersion-lens analysis in ADM is a significant factor contributing to numerical discrepancies. ADM can offer greater reliability than traditional light microscopy's subjective and variable area selection.</p><p>There are differences between the results of our analysis and prior studies. Unlike our cohort with 59% haematolymphoid tumours including rare diagnoses, the rate of neoplasms was under 30% in the Chinese study,<span><sup>3</sup></span> with approximately one-fifth of the BM smears categorised as ‘relatively normal’. The representation of B/T-lymphomas in that study was 2.4%, compared with 13% in our cohort. The differences are also attributable to the number of BM samples analysed and the methodology employed to assess classification consistency. Our novel clinical approach, which takes individual patients into account, alongside the use of patient similarity networks to visually evaluate the dataset, is highly relevant to BM cytomorphological diagnosis and yields different results compared with an approach focused solely on cellular classification.</p><p>Regarding the perspectives, further, AI training focused on specific cell types is essential to improving classification consistency, enabling the subclassification of lymphomas into specific diagnostic entities,<span><sup>6</sup></span> analysing elements of megakaryopoiesis, detecting metastatic cells in BM,<span><sup>7-9</sup></span> and facilitating reliable recognition of lineage dysplasia.<span><sup>10</sup></span></p><p>In summary, the data clearly shows that ADM is a highly beneficial diagnostical method for BM cytomorphology. The approach has the potential to revolutionise and improve diagnostics by minimising subjectivity and variability in evaluations. Our research is the first to identify and describe the cell misclassifications in detail, adversely affecting BM cytomorphological diagnosis and undermining the system's reliability. Consequently, this study highlights candidate cells for future AI training and testing. The results of the study also point to a possible (and ongoing) limitation of the use of AI in this case. Any negative consequences can be effectively addressed through expert supervision, underscoring the crucial role of highly trained morphologists in ensuring accurate cell classification and diagnostic interpretation in haemato-oncology and driving innovation to reach full potential.</p><p>David Starostka: Conceptualisation and design of the study, comprehensive BM diagnostics, expert review of pre-classification, data management, statistical analysis and initial drafting of the manuscript. Richard Dolezilek: Conceptualisation and design of the study, comprehensive BM diagnostics and initial drafting of the manuscript. Hans Michael Kvasnicka: Conceptualisation and design of the study and review of the manuscript draft. Milos Kudelka: Contribution to the design of the study, statistical analysis and review of the manuscript draft. Petra Miczkova: Preparation and scanning of BM slides, expert review of pre-classification, data management and review of the manuscript draft. Eva Kriegova: Contribution to the design of the study and review of the manuscript draft. David Kolacek: Comprehensive BM diagnostics, data management and review of the manuscript draft. Barbora Sotkovska and Jarmila Juranova: Preparation and scanning of BM slides, data management and review of the manuscript draft. Tomas Anlauf: Statistical analysis and review of the manuscript draft. Katerina Chasakova, Michael Paprota and Peter Kovac: Data management and review of the manuscript draft. Sona Kolarova: Review of the manuscript draft. David Buffa: Data management and review of the manuscript draft. Vit Zmatlo: Graphics and review of the manuscript draft. All authors have made a significant contribution to this study and have approved the final manuscript.</p><p>The authors declare no conflict of interest.</p><p>The study was supported by the Internal Research Grant 2023 of Hospital Havirov, and in part by the Ministry of Health of the Czech Republic (FNOl 0098892).</p><p>The study was approved by the Local Ethics Committee of the Hospital Havirov and carried out in accordance with the updated principles of the Helsinki Declaration. The study used archived material. The patients gave their written informed consent for BM examination and anonymous data collection and analysis.</p>","PeriodicalId":10189,"journal":{"name":"Clinical and Translational Medicine","volume":"15 7","pages":""},"PeriodicalIF":7.9000,"publicationDate":"2025-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ctm2.70364","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical and Translational Medicine","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/ctm2.70364","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
Dear Editor,
In our research, we showcased that AI-driven automated digital morphology (ADM), with its innovative categorisation options and ability to visually represent cellular contexts, opens up unprecedented avenues for bone marrow (BM) cellular classification in haemato-oncology, supporting clinical decisions. However, despite its remarkable image quality and clinical consistency, particularly in reactive haemopoiesis and myeloproliferative and myelodysplastic neoplasms, certain neoplastic BM cells are notably challenging for ADM to classify, which can significantly affect BM cytomorphological diagnosis.
Correct cytomorphological evaluation of BM smears remains a cornerstone of diagnostics in haematology with critical clinical impact.1, 2 Current expert optical microscopy is limited by substantial subjective interobserver variability and the need for highly skilled cytomorphologists. Furthermore, expert findings may not be considered definitive or the only possible correct results, especially within closely related borderline categories or when dealing with ambiguous cellular classifications. Therefore, innovative objective digital technologies for BM cytomorphology are vitally needed.1-4 There is limited data on the utility of ADM in this field.3, 4
Our study of a real-world cohort of 328 BM smears of European ancestry patients aims to comprehensively assess the effectiveness, reliability, and limitations of AI-assisted ADM (Morphogo system) by comparing it with expert optical microscopy (Figure 1). The cases were divided into six diagnostic groups: myelodysplastic neoplasms (MDN: 15%), multiple myeloma (MM: 14%), mature B/T-cell neoplasms (B/T-lymphoma: 13%), acute leukaemia and chronic myelomonocytic leukaemia (AL+CMML: 9%), myeloproliferative neoplasms (MPN: 8%), and reactive haemopoiesis (reactive: 41%). High-resolution digital images (magnification 1000×) of 500 BM nucleated cells were acquired. ADM's cell recognition (classification) capabilities were evaluated by asking it to correctly classify cells into one of 25 categories, compared with independent consensual expertise with double reading. The percentage of relevant correctly classified cells out of all classified cells was 95.4%. Satisfactory correlation values, as indicated by the Matthews correlation coefficient (MCC), were .400 or above for 22 out of 25 (88.0%) cell types; unsatisfactory MCC values below .400 were noted for 3 out of 25 (12%) cell types (lymphoblasts, prolymphocytes, and promonocytes) (Figure S4).
Next, we evaluated the percentage of correctly classified cells in individual patients for clinical consistency. The overall relevant clinical consistency was 97.1% (median), with 94.5% of cases showing consistency rates between 80% and 100%. In 5.5% of patients, critical misclassification was noted, with relevant clinical consistency values ranging from 36% to 79%, due to failure to recognise neoplastic cells. Errors included misclassifying neoplastic lymphocytes as blasts; small lymphoblasts/myeloblasts as lymphocytes; granular monoblasts as promyelocytes; and dysplastic monocytic lineage elements, plasmablasts, and immature plasma cells, all exhibiting atypical cytomorphology (Figure 2). Within the B/T-lymphoma group, misclassified lymphocytes were medium-sized, featuring abundant basophilic or pale cytoplasm with projections and finer chromatin, sometimes with a nucleolus or presenting a blastoid appearance, unlike correctly classified neoplastic cells. In the AL+CMML group, the misclassified lymphoblasts/myeloblasts were small, displayed a high nucleoplasmic ratio, and had coarser chromatin and nucleoli. Numerous granular monoblasts were misclassified as promyelocytes. In CMML, misclassifications involved the monocytic lineage. In MM, misclassifications were noted in the plasma cell lineage (plasmablasts and immature plasma cells) (Figure 2). The patient similarity network5 was constructed to visualise differences between patients and highlight cases with misclassifications (Figure 3). Patients with critical misclassifications with misdiagnosis potential were identified only in three diagnostic categories: AL+CMML, MM, and B/T-lymphoma. None were found in the MPN, MDN, or reactive groups (Figures 2 and 3). In addition to cytomorphological overlap, misclassifications likely reflected the rarity of some diagnoses and expert inconsistency for certain cells, resulting in incorrect annotations and gaps in training data.
Furthermore, we compared the numerical results of the myelogram from conventional analysis with those obtained from expert ADM, as these are crucial for diagnostics. The most significant numerical differences in the representation of diagnostically relevant cells (blasts, monocytes, lymphocytes, and plasma cells) were observed in AL+CMML for blast counts, AL+CMML for monocyte counts, B/T-lymphoma for lymphocyte counts, and MM for plasma cell counts (Figure 4). This discovery prompts a thought-provoking inquiry into the precision of measuring essential diagnostic features and the overall legitimacy of expert assessments. The rigorously standardised procedure for automatically selecting the adaptive area for 1000× immersion-lens analysis in ADM is a significant factor contributing to numerical discrepancies. ADM can offer greater reliability than traditional light microscopy's subjective and variable area selection.
There are differences between the results of our analysis and prior studies. Unlike our cohort with 59% haematolymphoid tumours including rare diagnoses, the rate of neoplasms was under 30% in the Chinese study,3 with approximately one-fifth of the BM smears categorised as ‘relatively normal’. The representation of B/T-lymphomas in that study was 2.4%, compared with 13% in our cohort. The differences are also attributable to the number of BM samples analysed and the methodology employed to assess classification consistency. Our novel clinical approach, which takes individual patients into account, alongside the use of patient similarity networks to visually evaluate the dataset, is highly relevant to BM cytomorphological diagnosis and yields different results compared with an approach focused solely on cellular classification.
Regarding the perspectives, further, AI training focused on specific cell types is essential to improving classification consistency, enabling the subclassification of lymphomas into specific diagnostic entities,6 analysing elements of megakaryopoiesis, detecting metastatic cells in BM,7-9 and facilitating reliable recognition of lineage dysplasia.10
In summary, the data clearly shows that ADM is a highly beneficial diagnostical method for BM cytomorphology. The approach has the potential to revolutionise and improve diagnostics by minimising subjectivity and variability in evaluations. Our research is the first to identify and describe the cell misclassifications in detail, adversely affecting BM cytomorphological diagnosis and undermining the system's reliability. Consequently, this study highlights candidate cells for future AI training and testing. The results of the study also point to a possible (and ongoing) limitation of the use of AI in this case. Any negative consequences can be effectively addressed through expert supervision, underscoring the crucial role of highly trained morphologists in ensuring accurate cell classification and diagnostic interpretation in haemato-oncology and driving innovation to reach full potential.
David Starostka: Conceptualisation and design of the study, comprehensive BM diagnostics, expert review of pre-classification, data management, statistical analysis and initial drafting of the manuscript. Richard Dolezilek: Conceptualisation and design of the study, comprehensive BM diagnostics and initial drafting of the manuscript. Hans Michael Kvasnicka: Conceptualisation and design of the study and review of the manuscript draft. Milos Kudelka: Contribution to the design of the study, statistical analysis and review of the manuscript draft. Petra Miczkova: Preparation and scanning of BM slides, expert review of pre-classification, data management and review of the manuscript draft. Eva Kriegova: Contribution to the design of the study and review of the manuscript draft. David Kolacek: Comprehensive BM diagnostics, data management and review of the manuscript draft. Barbora Sotkovska and Jarmila Juranova: Preparation and scanning of BM slides, data management and review of the manuscript draft. Tomas Anlauf: Statistical analysis and review of the manuscript draft. Katerina Chasakova, Michael Paprota and Peter Kovac: Data management and review of the manuscript draft. Sona Kolarova: Review of the manuscript draft. David Buffa: Data management and review of the manuscript draft. Vit Zmatlo: Graphics and review of the manuscript draft. All authors have made a significant contribution to this study and have approved the final manuscript.
The authors declare no conflict of interest.
The study was supported by the Internal Research Grant 2023 of Hospital Havirov, and in part by the Ministry of Health of the Czech Republic (FNOl 0098892).
The study was approved by the Local Ethics Committee of the Hospital Havirov and carried out in accordance with the updated principles of the Helsinki Declaration. The study used archived material. The patients gave their written informed consent for BM examination and anonymous data collection and analysis.
期刊介绍:
Clinical and Translational Medicine (CTM) is an international, peer-reviewed, open-access journal dedicated to accelerating the translation of preclinical research into clinical applications and fostering communication between basic and clinical scientists. It highlights the clinical potential and application of various fields including biotechnologies, biomaterials, bioengineering, biomarkers, molecular medicine, omics science, bioinformatics, immunology, molecular imaging, drug discovery, regulation, and health policy. With a focus on the bench-to-bedside approach, CTM prioritizes studies and clinical observations that generate hypotheses relevant to patients and diseases, guiding investigations in cellular and molecular medicine. The journal encourages submissions from clinicians, researchers, policymakers, and industry professionals.