Benjamin Hers, Maria Bonta, Siyi Du, Kishore Mulpuri, Emily K Schaeffer, Antony J Hodgson, Rafeef Garbi
{"title":"SegFormer3D: Improving the Robustness of Deep Learning Model-Based Image Segmentation in Ultrasound Volumes of the Pediatric Hip.","authors":"Benjamin Hers, Maria Bonta, Siyi Du, Kishore Mulpuri, Emily K Schaeffer, Antony J Hodgson, Rafeef Garbi","doi":"10.1016/j.ultrasmedbio.2024.12.013","DOIUrl":null,"url":null,"abstract":"<p><p>Developmental dysplasia of the hip (DDH) is a painful orthopedic malformation diagnosed at birth in 1-3% of all newborns. Left untreated, DDH can lead to significant morbidity including long-term disability. Currently the condition is clinically diagnosed using 2-D ultrasound (US) imaging acquired between 0 and 6 mo of age. DDH metrics are manually extracted by highly trained radiologists through manual measurements of relevant anatomy from the 2-D US data, which remains a time-consuming and highly error-prone process. Recently, it was shown that combining 3-D US imaging with deep learning-based automated diagnostic tools may significantly improve accuracy and reduce variability in measuring DDH metrics. However, the robustness of current techniques remains insufficient for reliable deployment into real-life clinical workflows. In this work, we first present a quantitative robustness evaluation of the state of the art in bone segmentation from 3-D US and demonstrate examples of failed or implausible segmentations with convolutional neural network and vision transformer models under common data variations, e.g., small changes in image resolution or anatomical field of view from those encountered in the training data. Second, we propose a 3-D extension of SegFormer architecture, a lightweight transformer-based model with hierarchically structured encoders producing multi-scale features, which we show to concurrently improve accuracy and robustness. Quantitative results on clinical data from pediatric patients in the test set showed up to 0.9% improvement in Dice score and up to a 3% smaller Hausdorff distance 95% compared with state of the art when unseen variations in anatomical structures and data resolutions were introduced.</p>","PeriodicalId":49399,"journal":{"name":"Ultrasound in Medicine and Biology","volume":" ","pages":""},"PeriodicalIF":2.4000,"publicationDate":"2025-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ultrasound in Medicine and Biology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.ultrasmedbio.2024.12.013","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ACOUSTICS","Score":null,"Total":0}
引用次数: 0
Abstract
Developmental dysplasia of the hip (DDH) is a painful orthopedic malformation diagnosed at birth in 1-3% of all newborns. Left untreated, DDH can lead to significant morbidity including long-term disability. Currently the condition is clinically diagnosed using 2-D ultrasound (US) imaging acquired between 0 and 6 mo of age. DDH metrics are manually extracted by highly trained radiologists through manual measurements of relevant anatomy from the 2-D US data, which remains a time-consuming and highly error-prone process. Recently, it was shown that combining 3-D US imaging with deep learning-based automated diagnostic tools may significantly improve accuracy and reduce variability in measuring DDH metrics. However, the robustness of current techniques remains insufficient for reliable deployment into real-life clinical workflows. In this work, we first present a quantitative robustness evaluation of the state of the art in bone segmentation from 3-D US and demonstrate examples of failed or implausible segmentations with convolutional neural network and vision transformer models under common data variations, e.g., small changes in image resolution or anatomical field of view from those encountered in the training data. Second, we propose a 3-D extension of SegFormer architecture, a lightweight transformer-based model with hierarchically structured encoders producing multi-scale features, which we show to concurrently improve accuracy and robustness. Quantitative results on clinical data from pediatric patients in the test set showed up to 0.9% improvement in Dice score and up to a 3% smaller Hausdorff distance 95% compared with state of the art when unseen variations in anatomical structures and data resolutions were introduced.
期刊介绍:
Ultrasound in Medicine and Biology is the official journal of the World Federation for Ultrasound in Medicine and Biology. The journal publishes original contributions that demonstrate a novel application of an existing ultrasound technology in clinical diagnostic, interventional and therapeutic applications, new and improved clinical techniques, the physics, engineering and technology of ultrasound in medicine and biology, and the interactions between ultrasound and biological systems, including bioeffects. Papers that simply utilize standard diagnostic ultrasound as a measuring tool will be considered out of scope. Extended critical reviews of subjects of contemporary interest in the field are also published, in addition to occasional editorial articles, clinical and technical notes, book reviews, letters to the editor and a calendar of forthcoming meetings. It is the aim of the journal fully to meet the information and publication requirements of the clinicians, scientists, engineers and other professionals who constitute the biomedical ultrasonic community.