Human-machine collaborative risk assessment model for thyroid nodules based on local attention and multi-scale feature extraction: a multi-center clinical study.
Shunlan Liu, Yang Yang, Mingli Cai, Zhirong Xu, Shaozheng He, Qichen Su, Peizhong Liu, Guorong Lyu
{"title":"Human-machine collaborative risk assessment model for thyroid nodules based on local attention and multi-scale feature extraction: a multi-center clinical study.","authors":"Shunlan Liu, Yang Yang, Mingli Cai, Zhirong Xu, Shaozheng He, Qichen Su, Peizhong Liu, Guorong Lyu","doi":"10.1007/s12020-025-04278-9","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>We aimed to evaluate a human-machine collaborative risk assessment model for thyroid nodules using local attention mechanisms and multi-scale feature extraction and compare its performance with those of radiologists of varying experience levels.</p><p><strong>Methods: </strong>A multi-center diagnostic study was conducted using ultrasound image datasets from six hospitals in China. The model was trained on 8397 images from 8063 patients (training set) and validated on 253 images from 245 patients across multiple centers. The diagnostic performance of the model was compared with those of radiologists with varying levels of experience. An assistive strategy was developed where radiologists adjusted their diagnoses based on model results.</p><p><strong>Results: </strong>The model achieved recognition accuracies of 0.966, 0.809, 0.826, 0.837, and 0.861 for composition, echogenicity, margin, echogenic foci, and orientation, respectively. The area under the receiver operating characteristic curve (AUROC) for the model in diagnosing benign and malignant nodules was 0.882, significantly higher than that of the junior radiologist (0.789; P < 0.0001). The AUROC of the model was between that of the intermediate (0.837) and senior (0.892) radiologists, with no significant difference compared to either group (both P > 0.05). The assistive strategy improved the AUROC for the junior radiologist from 0.789 to 0.859 (P < 0.0001) and increased sensitivity from 66.11% to 80.00% (P < 0.05), with specificity unchanged.</p><p><strong>Conclusion: </strong>The model accurately identified thyroid nodule risk features and enhanced diagnostic performance, particularly for the junior radiologist, improving sensitivity in diagnosing thyroid nodules.</p>","PeriodicalId":11572,"journal":{"name":"Endocrine","volume":" ","pages":""},"PeriodicalIF":3.7000,"publicationDate":"2025-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Endocrine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s12020-025-04278-9","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose: We aimed to evaluate a human-machine collaborative risk assessment model for thyroid nodules using local attention mechanisms and multi-scale feature extraction and compare its performance with those of radiologists of varying experience levels.
Methods: A multi-center diagnostic study was conducted using ultrasound image datasets from six hospitals in China. The model was trained on 8397 images from 8063 patients (training set) and validated on 253 images from 245 patients across multiple centers. The diagnostic performance of the model was compared with those of radiologists with varying levels of experience. An assistive strategy was developed where radiologists adjusted their diagnoses based on model results.
Results: The model achieved recognition accuracies of 0.966, 0.809, 0.826, 0.837, and 0.861 for composition, echogenicity, margin, echogenic foci, and orientation, respectively. The area under the receiver operating characteristic curve (AUROC) for the model in diagnosing benign and malignant nodules was 0.882, significantly higher than that of the junior radiologist (0.789; P < 0.0001). The AUROC of the model was between that of the intermediate (0.837) and senior (0.892) radiologists, with no significant difference compared to either group (both P > 0.05). The assistive strategy improved the AUROC for the junior radiologist from 0.789 to 0.859 (P < 0.0001) and increased sensitivity from 66.11% to 80.00% (P < 0.05), with specificity unchanged.
Conclusion: The model accurately identified thyroid nodule risk features and enhanced diagnostic performance, particularly for the junior radiologist, improving sensitivity in diagnosing thyroid nodules.
期刊介绍:
Well-established as a major journal in today’s rapidly advancing experimental and clinical research areas, Endocrine publishes original articles devoted to basic (including molecular, cellular and physiological studies), translational and clinical research in all the different fields of endocrinology and metabolism. Articles will be accepted based on peer-reviews, priority, and editorial decision. Invited reviews, mini-reviews and viewpoints on relevant pathophysiological and clinical topics, as well as Editorials on articles appearing in the Journal, are published. Unsolicited Editorials will be evaluated by the editorial team. Outcomes of scientific meetings, as well as guidelines and position statements, may be submitted. The Journal also considers special feature articles in the field of endocrine genetics and epigenetics, as well as articles devoted to novel methods and techniques in endocrinology.
Endocrine covers controversial, clinical endocrine issues. Meta-analyses on endocrine and metabolic topics are also accepted. Descriptions of single clinical cases and/or small patients studies are not published unless of exceptional interest. However, reports of novel imaging studies and endocrine side effects in single patients may be considered. Research letters and letters to the editor related or unrelated to recently published articles can be submitted.
Endocrine covers leading topics in endocrinology such as neuroendocrinology, pituitary and hypothalamic peptides, thyroid physiological and clinical aspects, bone and mineral metabolism and osteoporosis, obesity, lipid and energy metabolism and food intake control, insulin, Type 1 and Type 2 diabetes, hormones of male and female reproduction, adrenal diseases pediatric and geriatric endocrinology, endocrine hypertension and endocrine oncology.