Han-Back Shin, Heesoon Sheen, Jang-Hoon Oh, Young Eun Choi, Kihoon Sung, Hyun Ju Kim
{"title":"Evaluating feature extraction reproducibility across image biomarker standardization initiative-compliant radiomics platforms using a digital phantom.","authors":"Han-Back Shin, Heesoon Sheen, Jang-Hoon Oh, Young Eun Choi, Kihoon Sung, Hyun Ju Kim","doi":"10.1002/acm2.70110","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The aim of this study was to thoroughly analyze the reproducibility of radiomics feature extraction across three Image Biomarker Standardization Initiative (IBSI)-compliant platforms using a digital phantom for benchmarking. It uncovers high consistency among common features while also pointing out the necessity for standardization in computational algorithms and mathematical definitions due to unique platform-specific features.</p><p><strong>Methods: </strong>We selected three widely used radiomics platforms: LIFEx, Computational Environment for Radiological Research (CERR), and PyRadiomics. Using the IBSI digital phantom, we performed a comparative analysis to extract and benchmark radiomics features. The study design included testing each platform's ability to consistently reproduce radiomics features, with statistical analyses to assess the variability and agreement among the platforms.</p><p><strong>Results: </strong>The results indicated varying levels of feature reproducibility across the platforms. Although some features showed high consistency, others varied significantly, highlighting the need for standardized computational algorithms. Specifically, LIFEx and PyRadiomics performed consistently well across many features, whereas CERR showed greater variability in certain feature categories than LIFEx and PyRadiomics.</p><p><strong>Conclusion: </strong>The study findings highlight the need for harmonized feature calculation methods to enhance the reliability and clinical usefulness of radiomics. Additionally, this study recommends incorporating clinical data and establishing benchmarking procedures in future studies to enhance the role of radiomics in personalized medicine.</p>","PeriodicalId":14989,"journal":{"name":"Journal of Applied Clinical Medical Physics","volume":" ","pages":"e70110"},"PeriodicalIF":2.0000,"publicationDate":"2025-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Applied Clinical Medical Physics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/acm2.70110","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0
Abstract
Background: The aim of this study was to thoroughly analyze the reproducibility of radiomics feature extraction across three Image Biomarker Standardization Initiative (IBSI)-compliant platforms using a digital phantom for benchmarking. It uncovers high consistency among common features while also pointing out the necessity for standardization in computational algorithms and mathematical definitions due to unique platform-specific features.
Methods: We selected three widely used radiomics platforms: LIFEx, Computational Environment for Radiological Research (CERR), and PyRadiomics. Using the IBSI digital phantom, we performed a comparative analysis to extract and benchmark radiomics features. The study design included testing each platform's ability to consistently reproduce radiomics features, with statistical analyses to assess the variability and agreement among the platforms.
Results: The results indicated varying levels of feature reproducibility across the platforms. Although some features showed high consistency, others varied significantly, highlighting the need for standardized computational algorithms. Specifically, LIFEx and PyRadiomics performed consistently well across many features, whereas CERR showed greater variability in certain feature categories than LIFEx and PyRadiomics.
Conclusion: The study findings highlight the need for harmonized feature calculation methods to enhance the reliability and clinical usefulness of radiomics. Additionally, this study recommends incorporating clinical data and establishing benchmarking procedures in future studies to enhance the role of radiomics in personalized medicine.
期刊介绍:
Journal of Applied Clinical Medical Physics is an international Open Access publication dedicated to clinical medical physics. JACMP welcomes original contributions dealing with all aspects of medical physics from scientists working in the clinical medical physics around the world. JACMP accepts only online submission.
JACMP will publish:
-Original Contributions: Peer-reviewed, investigations that represent new and significant contributions to the field. Recommended word count: up to 7500.
-Review Articles: Reviews of major areas or sub-areas in the field of clinical medical physics. These articles may be of any length and are peer reviewed.
-Technical Notes: These should be no longer than 3000 words, including key references.
-Letters to the Editor: Comments on papers published in JACMP or on any other matters of interest to clinical medical physics. These should not be more than 1250 (including the literature) and their publication is only based on the decision of the editor, who occasionally asks experts on the merit of the contents.
-Book Reviews: The editorial office solicits Book Reviews.
-Announcements of Forthcoming Meetings: The Editor may provide notice of forthcoming meetings, course offerings, and other events relevant to clinical medical physics.
-Parallel Opposed Editorial: We welcome topics relevant to clinical practice and medical physics profession. The contents can be controversial debate or opposed aspects of an issue. One author argues for the position and the other against. Each side of the debate contains an opening statement up to 800 words, followed by a rebuttal up to 500 words. Readers interested in participating in this series should contact the moderator with a proposed title and a short description of the topic