Quankeng Huang , Wenchao Jiang , Junhang Li , Jianxuan Wen , Ji He , Wei Song
{"title":"Hashimoto’s thyroiditis recognition from multi-modal data via global cross-attention and distance-aware training","authors":"Quankeng Huang , Wenchao Jiang , Junhang Li , Jianxuan Wen , Ji He , Wei Song","doi":"10.1016/j.media.2025.103515","DOIUrl":null,"url":null,"abstract":"<div><div>Ultrasound images and biological indicators, which reveal Hashimoto’s thyroiditis (HT) characteristics in thyroid tissue from different perspectives, play crucial roles in HT recognition. Ultrasound images of patients with HT typically present a heterogeneous background with potential decreases in echogenicity. Clinicians are prone to misdiagnosing HT by visually evaluating these characteristics. In addition, patients with HT may exhibit fluctuations in relevant biological indicators, but there are no absolute relationships between a single biological indicator and HT. To address these challenges, we propose HTR-Net, a novel HT recognition network that combines ultrasound images and biological indicators through multi-modality information embedding. Specifically, HTR-Net introduces a global cross-attention module (GCA), which enhances recognition of the heterogeneous background with potential decreases in echogenicity. A distance-aware mismatched augmentation (DMA) strategy is also designed to expand the limited biological indicator data and ensure reasonable values for the augmented biological indicators, thus enhancing the model performance. In order to address the nonabsolute relationship between HT and a single biological indicator, we propose a distance-aware loss (DL) function to constrain feature mapping for effective information extraction from indicators, thereby enhancing the model’s capability to detect anomalous sets of biological indicators. To validate the proposed method, we construct a multi-center HT dataset and conduct extensive experiments. The experimental results demonstrate that the proposed HTR-Net achieves state-of-the-art (SOTA) performance.</div></div>","PeriodicalId":18328,"journal":{"name":"Medical image analysis","volume":"102 ","pages":"Article 103515"},"PeriodicalIF":10.7000,"publicationDate":"2025-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical image analysis","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1361841525000635","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Ultrasound images and biological indicators, which reveal Hashimoto’s thyroiditis (HT) characteristics in thyroid tissue from different perspectives, play crucial roles in HT recognition. Ultrasound images of patients with HT typically present a heterogeneous background with potential decreases in echogenicity. Clinicians are prone to misdiagnosing HT by visually evaluating these characteristics. In addition, patients with HT may exhibit fluctuations in relevant biological indicators, but there are no absolute relationships between a single biological indicator and HT. To address these challenges, we propose HTR-Net, a novel HT recognition network that combines ultrasound images and biological indicators through multi-modality information embedding. Specifically, HTR-Net introduces a global cross-attention module (GCA), which enhances recognition of the heterogeneous background with potential decreases in echogenicity. A distance-aware mismatched augmentation (DMA) strategy is also designed to expand the limited biological indicator data and ensure reasonable values for the augmented biological indicators, thus enhancing the model performance. In order to address the nonabsolute relationship between HT and a single biological indicator, we propose a distance-aware loss (DL) function to constrain feature mapping for effective information extraction from indicators, thereby enhancing the model’s capability to detect anomalous sets of biological indicators. To validate the proposed method, we construct a multi-center HT dataset and conduct extensive experiments. The experimental results demonstrate that the proposed HTR-Net achieves state-of-the-art (SOTA) performance.
期刊介绍:
Medical Image Analysis serves as a platform for sharing new research findings in the realm of medical and biological image analysis, with a focus on applications of computer vision, virtual reality, and robotics to biomedical imaging challenges. The journal prioritizes the publication of high-quality, original papers contributing to the fundamental science of processing, analyzing, and utilizing medical and biological images. It welcomes approaches utilizing biomedical image datasets across all spatial scales, from molecular/cellular imaging to tissue/organ imaging.