Pankaj Gupta, Ruby Siddiqui, Thakur D Yadav, Lileswar Kaman, Gaurav Prakash, Parikshaa Gupta, Uma N Saikia, Usha Dutta
{"title":"Deep learning-based classification of gallbladder lesions in patients with non-diagnostic (GB-RADS 0) ultrasound.","authors":"Pankaj Gupta, Ruby Siddiqui, Thakur D Yadav, Lileswar Kaman, Gaurav Prakash, Parikshaa Gupta, Uma N Saikia, Usha Dutta","doi":"10.5114/ceh.2024.145424","DOIUrl":null,"url":null,"abstract":"<p><strong>Aim of the study: </strong>Non-diagnostic ultrasound (US) of the gallbladder may be due to various factors. We aimed to evaluate the diagnostic performance of deep learning-based classification of gallbladder lesions on US images in patients with non-diagnostic US secondary to gallbladder factors.</p><p><strong>Material and methods: </strong>Consecutive patients with non-diagnostic US due to calculi within the gallbladder lumen, obscuring the detailed evaluation, were identified by a research fellow from a prospective database of patients with gallbladder lesions. The US reports and images were evaluated by a radiologist blinded to the final diagnosis. Patients who had the final pathological diagnosis based on fine-needle aspiration cytology, percutaneous or endoscopic biopsy, or surgical histopathology were included. Convolution neural networks (ResNet50, GBCNet), transformer models (vision transformer [ViT], RadFormer), and a hybrid model (MedViT) were trained on a public gallbladder dataset (GBCU dataset). The performance of these models for classifying gallbladder lesions into benign and malignant was tested on non-diagnostic (GB-RADS 0) US images.</p><p><strong>Results: </strong>Training and validation cohorts (GBCU dataset) comprised 1004 and 251 images, respectively. The testing data (26 patients, mean age [SD]: 57.5 ±8.07 years, 17 female) comprised 304 images. The best performance for detection of GBC was achieved with GBCNet (sensitivity 51.1%, specificity 83.3%, area under the curve [AUC] 0.709) and MedViT (sensitivity 92.8%, specificity 50%, AUC 0.714). MedViT had the best accuracy (73.1%) for detecting benign gallbladder lesions.</p><p><strong>Conclusions: </strong>These results suggest that deep learning models can potentially stratify patients with non-diagnostic US. However, further improvement in the performance is needed to render this approach relevant in clinical practice.</p>","PeriodicalId":10281,"journal":{"name":"Clinical and Experimental Hepatology","volume":"10 4","pages":"232-239"},"PeriodicalIF":1.5000,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12022622/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical and Experimental Hepatology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5114/ceh.2024.145424","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/11 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"GASTROENTEROLOGY & HEPATOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Aim of the study: Non-diagnostic ultrasound (US) of the gallbladder may be due to various factors. We aimed to evaluate the diagnostic performance of deep learning-based classification of gallbladder lesions on US images in patients with non-diagnostic US secondary to gallbladder factors.
Material and methods: Consecutive patients with non-diagnostic US due to calculi within the gallbladder lumen, obscuring the detailed evaluation, were identified by a research fellow from a prospective database of patients with gallbladder lesions. The US reports and images were evaluated by a radiologist blinded to the final diagnosis. Patients who had the final pathological diagnosis based on fine-needle aspiration cytology, percutaneous or endoscopic biopsy, or surgical histopathology were included. Convolution neural networks (ResNet50, GBCNet), transformer models (vision transformer [ViT], RadFormer), and a hybrid model (MedViT) were trained on a public gallbladder dataset (GBCU dataset). The performance of these models for classifying gallbladder lesions into benign and malignant was tested on non-diagnostic (GB-RADS 0) US images.
Results: Training and validation cohorts (GBCU dataset) comprised 1004 and 251 images, respectively. The testing data (26 patients, mean age [SD]: 57.5 ±8.07 years, 17 female) comprised 304 images. The best performance for detection of GBC was achieved with GBCNet (sensitivity 51.1%, specificity 83.3%, area under the curve [AUC] 0.709) and MedViT (sensitivity 92.8%, specificity 50%, AUC 0.714). MedViT had the best accuracy (73.1%) for detecting benign gallbladder lesions.
Conclusions: These results suggest that deep learning models can potentially stratify patients with non-diagnostic US. However, further improvement in the performance is needed to render this approach relevant in clinical practice.
期刊介绍:
Clinical and Experimental Hepatology – quarterly of the Polish Association for Study of Liver – is a scientific and educational, peer-reviewed journal publishing original and review papers describing clinical and basic investigations in the field of hepatology.