Giuseppe D'Abbronzo, Antonio D'Antonio, Annarosaria De Chiara, Luigi Panico, Lucianna Sparano, Anna Diluvio, Antonello Sica, Gino Svanera, Giovanni De Chiara, Mariano Fuggi, Ferdinando Russo, Renato Franco, Andrea Ronchi
{"title":"开发一种基于人工智能的自动化工具,用于骨髓活检中网状蛋白纤维化的评估。","authors":"Giuseppe D'Abbronzo, Antonio D'Antonio, Annarosaria De Chiara, Luigi Panico, Lucianna Sparano, Anna Diluvio, Antonello Sica, Gino Svanera, Giovanni De Chiara, Mariano Fuggi, Ferdinando Russo, Renato Franco, Andrea Ronchi","doi":"10.1007/s00428-025-04122-5","DOIUrl":null,"url":null,"abstract":"<p><p>Bone marrow fibrosis plays a critical role in the diagnosis, prognosis, and management of haematological disorders, particularly myeloproliferative neoplasms like primary myelofibrosis. Accurate assessment of fibrosis, typically graded through histochemical techniques such as reticulin and trichrome staining, is essential but remains highly dependent on the pathologist's experience. To address the challenges of variability in interpretation and the increasing demand for standardized evaluations, we developed a digital pathology system for automated bone marrow reticulin fibrosis grading. This study utilized 86 bone marrow biopsy specimens from patients diagnosed with Philadelphia chromosome-negative myeloproliferative neoplasms, collected between 2018 and 2023. A fully convolutional network based on the InceptionV3 architecture was trained to assess fibrosis grades (MF0-MF3) from whole slide images of reticulin-stained sections. The model was trained using 3814 annotated images and validated using a separate set of 40 BMBs. The algorithm's performance was evaluated by comparing its fibrosis grading to expert hematopathologists' assessments, yielding a Cohen's kappa coefficient of 0.831, indicating excellent agreement. The algorithm showed strong concordance in fibrosis grading, especially for MF0 (k = 0.918) and MF3 (k = 0.886), and substantial agreement for intermediate grades (MF1 and MF2). Further validation across multiple institutions and scanning platforms confirmed the algorithm's robustness, with an overall agreement of 0.816. These results demonstrate the potential of digital pathology tools to provide standardized, reproducible fibrosis grading, thereby aiding pathologists in clinical decision-making and training.</p>","PeriodicalId":23514,"journal":{"name":"Virchows Archiv","volume":" ","pages":""},"PeriodicalIF":3.4000,"publicationDate":"2025-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Development of an automated artificial intelligence-based tool for reticulin fibrosis assessment in bone marrow biopsies.\",\"authors\":\"Giuseppe D'Abbronzo, Antonio D'Antonio, Annarosaria De Chiara, Luigi Panico, Lucianna Sparano, Anna Diluvio, Antonello Sica, Gino Svanera, Giovanni De Chiara, Mariano Fuggi, Ferdinando Russo, Renato Franco, Andrea Ronchi\",\"doi\":\"10.1007/s00428-025-04122-5\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Bone marrow fibrosis plays a critical role in the diagnosis, prognosis, and management of haematological disorders, particularly myeloproliferative neoplasms like primary myelofibrosis. Accurate assessment of fibrosis, typically graded through histochemical techniques such as reticulin and trichrome staining, is essential but remains highly dependent on the pathologist's experience. To address the challenges of variability in interpretation and the increasing demand for standardized evaluations, we developed a digital pathology system for automated bone marrow reticulin fibrosis grading. This study utilized 86 bone marrow biopsy specimens from patients diagnosed with Philadelphia chromosome-negative myeloproliferative neoplasms, collected between 2018 and 2023. A fully convolutional network based on the InceptionV3 architecture was trained to assess fibrosis grades (MF0-MF3) from whole slide images of reticulin-stained sections. The model was trained using 3814 annotated images and validated using a separate set of 40 BMBs. The algorithm's performance was evaluated by comparing its fibrosis grading to expert hematopathologists' assessments, yielding a Cohen's kappa coefficient of 0.831, indicating excellent agreement. The algorithm showed strong concordance in fibrosis grading, especially for MF0 (k = 0.918) and MF3 (k = 0.886), and substantial agreement for intermediate grades (MF1 and MF2). Further validation across multiple institutions and scanning platforms confirmed the algorithm's robustness, with an overall agreement of 0.816. These results demonstrate the potential of digital pathology tools to provide standardized, reproducible fibrosis grading, thereby aiding pathologists in clinical decision-making and training.</p>\",\"PeriodicalId\":23514,\"journal\":{\"name\":\"Virchows Archiv\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2025-05-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Virchows Archiv\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1007/s00428-025-04122-5\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"PATHOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Virchows Archiv","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00428-025-04122-5","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PATHOLOGY","Score":null,"Total":0}
Development of an automated artificial intelligence-based tool for reticulin fibrosis assessment in bone marrow biopsies.
Bone marrow fibrosis plays a critical role in the diagnosis, prognosis, and management of haematological disorders, particularly myeloproliferative neoplasms like primary myelofibrosis. Accurate assessment of fibrosis, typically graded through histochemical techniques such as reticulin and trichrome staining, is essential but remains highly dependent on the pathologist's experience. To address the challenges of variability in interpretation and the increasing demand for standardized evaluations, we developed a digital pathology system for automated bone marrow reticulin fibrosis grading. This study utilized 86 bone marrow biopsy specimens from patients diagnosed with Philadelphia chromosome-negative myeloproliferative neoplasms, collected between 2018 and 2023. A fully convolutional network based on the InceptionV3 architecture was trained to assess fibrosis grades (MF0-MF3) from whole slide images of reticulin-stained sections. The model was trained using 3814 annotated images and validated using a separate set of 40 BMBs. The algorithm's performance was evaluated by comparing its fibrosis grading to expert hematopathologists' assessments, yielding a Cohen's kappa coefficient of 0.831, indicating excellent agreement. The algorithm showed strong concordance in fibrosis grading, especially for MF0 (k = 0.918) and MF3 (k = 0.886), and substantial agreement for intermediate grades (MF1 and MF2). Further validation across multiple institutions and scanning platforms confirmed the algorithm's robustness, with an overall agreement of 0.816. These results demonstrate the potential of digital pathology tools to provide standardized, reproducible fibrosis grading, thereby aiding pathologists in clinical decision-making and training.
期刊介绍:
Manuscripts of original studies reinforcing the evidence base of modern diagnostic pathology, using immunocytochemical, molecular and ultrastructural techniques, will be welcomed. In addition, papers on critical evaluation of diagnostic criteria but also broadsheets and guidelines with a solid evidence base will be considered. Consideration will also be given to reports of work in other fields relevant to the understanding of human pathology as well as manuscripts on the application of new methods and techniques in pathology. Submission of purely experimental articles is discouraged but manuscripts on experimental work applicable to diagnostic pathology are welcomed. Biomarker studies are welcomed but need to abide by strict rules (e.g. REMARK) of adequate sample size and relevant marker choice. Single marker studies on limited patient series without validated application will as a rule not be considered. Case reports will only be considered when they provide substantial new information with an impact on understanding disease or diagnostic practice.