Shidi Miao , Yuyang Jiang , Wenjuan Huang , Yuxin Jiang , Mengzhuo Sun , Mingxuan Wang , Hongzhuo Qi , Ao Li , Zengyao Liu , Qiujun Wang , Ruitao Wang , Xuemei Ding
{"title":"SCLResNet和DSAF:基于自我监督对比学习和深度自我注意融合的多模式网络预测甲状腺乳头状癌中央淋巴结转移。","authors":"Shidi Miao , Yuyang Jiang , Wenjuan Huang , Yuxin Jiang , Mengzhuo Sun , Mingxuan Wang , Hongzhuo Qi , Ao Li , Zengyao Liu , Qiujun Wang , Ruitao Wang , Xuemei Ding","doi":"10.1016/j.artmed.2025.103280","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate prediction of central lymph node metastasis (CLNM) in papillary thyroid carcinoma (PTC) is crucial to avoid unnecessary invasive procedures, yet existing models often fall short. We constructed the SCLResNet101 model based on a contrastive learning framework to extract network features of tumor ultrasound (US). SeResnet101 was used to extract network features of peri-vascular adipose tissue (PVAT) from the computed tomography (CT) of C6 (the arterial and venous layers beneath the thyroid). Univariate and multivariate analyses were performed using binary logistic regression to select clinical features. Finally, we constructed a Deep Self-Attention Fusion (DSAF) network to integrate features from these three modalities for CLNM prediction. Univariate and multivariate analyses revealed that Gender, Age, Size of US, and Extrathyroidal Extension (ETE) were independent risk factors for CLNM. In the internal test cohort (I-T), the area under the curve (AUC) of model was 0.863 (95 % CI: 0.779–0.932). In the external test cohort (E-T), the AUC was 0.839 (95 % CI: 0.755–0.905). Compared to all radiologists, the model significantly reduced both false-positive and false-negative rates in both the I-T and E-T. This study incorporates PVAT, which significantly enhances the performance of the multimodal deep learning model and may assist surgeons in making more informed and precise surgical decisions in the treatment of PTC.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"170 ","pages":"Article 103280"},"PeriodicalIF":6.2000,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"SCLResNet and DSAF: A self-supervised contrastive learning and deep self-attention fusion-based multimodal network for predicting central lymph node metastasis in papillary thyroid carcinoma\",\"authors\":\"Shidi Miao , Yuyang Jiang , Wenjuan Huang , Yuxin Jiang , Mengzhuo Sun , Mingxuan Wang , Hongzhuo Qi , Ao Li , Zengyao Liu , Qiujun Wang , Ruitao Wang , Xuemei Ding\",\"doi\":\"10.1016/j.artmed.2025.103280\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Accurate prediction of central lymph node metastasis (CLNM) in papillary thyroid carcinoma (PTC) is crucial to avoid unnecessary invasive procedures, yet existing models often fall short. We constructed the SCLResNet101 model based on a contrastive learning framework to extract network features of tumor ultrasound (US). SeResnet101 was used to extract network features of peri-vascular adipose tissue (PVAT) from the computed tomography (CT) of C6 (the arterial and venous layers beneath the thyroid). Univariate and multivariate analyses were performed using binary logistic regression to select clinical features. Finally, we constructed a Deep Self-Attention Fusion (DSAF) network to integrate features from these three modalities for CLNM prediction. Univariate and multivariate analyses revealed that Gender, Age, Size of US, and Extrathyroidal Extension (ETE) were independent risk factors for CLNM. In the internal test cohort (I-T), the area under the curve (AUC) of model was 0.863 (95 % CI: 0.779–0.932). In the external test cohort (E-T), the AUC was 0.839 (95 % CI: 0.755–0.905). Compared to all radiologists, the model significantly reduced both false-positive and false-negative rates in both the I-T and E-T. This study incorporates PVAT, which significantly enhances the performance of the multimodal deep learning model and may assist surgeons in making more informed and precise surgical decisions in the treatment of PTC.</div></div>\",\"PeriodicalId\":55458,\"journal\":{\"name\":\"Artificial Intelligence in Medicine\",\"volume\":\"170 \",\"pages\":\"Article 103280\"},\"PeriodicalIF\":6.2000,\"publicationDate\":\"2025-10-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Artificial Intelligence in Medicine\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0933365725002155\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence in Medicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0933365725002155","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
SCLResNet and DSAF: A self-supervised contrastive learning and deep self-attention fusion-based multimodal network for predicting central lymph node metastasis in papillary thyroid carcinoma
Accurate prediction of central lymph node metastasis (CLNM) in papillary thyroid carcinoma (PTC) is crucial to avoid unnecessary invasive procedures, yet existing models often fall short. We constructed the SCLResNet101 model based on a contrastive learning framework to extract network features of tumor ultrasound (US). SeResnet101 was used to extract network features of peri-vascular adipose tissue (PVAT) from the computed tomography (CT) of C6 (the arterial and venous layers beneath the thyroid). Univariate and multivariate analyses were performed using binary logistic regression to select clinical features. Finally, we constructed a Deep Self-Attention Fusion (DSAF) network to integrate features from these three modalities for CLNM prediction. Univariate and multivariate analyses revealed that Gender, Age, Size of US, and Extrathyroidal Extension (ETE) were independent risk factors for CLNM. In the internal test cohort (I-T), the area under the curve (AUC) of model was 0.863 (95 % CI: 0.779–0.932). In the external test cohort (E-T), the AUC was 0.839 (95 % CI: 0.755–0.905). Compared to all radiologists, the model significantly reduced both false-positive and false-negative rates in both the I-T and E-T. This study incorporates PVAT, which significantly enhances the performance of the multimodal deep learning model and may assist surgeons in making more informed and precise surgical decisions in the treatment of PTC.
期刊介绍:
Artificial Intelligence in Medicine publishes original articles from a wide variety of interdisciplinary perspectives concerning the theory and practice of artificial intelligence (AI) in medicine, medically-oriented human biology, and health care.
Artificial intelligence in medicine may be characterized as the scientific discipline pertaining to research studies, projects, and applications that aim at supporting decision-based medical tasks through knowledge- and/or data-intensive computer-based solutions that ultimately support and improve the performance of a human care provider.