Yubo Yuan, Lijun Liu, Xiaobing Yang, Li Liu, Qingsong Huang
{"title":"Multi-scale Lesion Feature Fusion and Location-Aware for Chest Multi-disease Detection.","authors":"Yubo Yuan, Lijun Liu, Xiaobing Yang, Li Liu, Qingsong Huang","doi":"10.1007/s10278-024-01133-7","DOIUrl":null,"url":null,"abstract":"<p><p>Accurately identifying and locating lesions in chest X-rays has the potential to significantly enhance diagnostic efficiency, quality, and interpretability. However, current methods primarily focus on detecting of specific diseases in chest X-rays, disregarding the presence of multiple diseases in a single chest X-ray scan. Moreover, the diversity in lesion locations and attributes introduces complexity in accurately discerning specific traits for each lesion, leading to diminished accuracy when detecting multiple diseases. To address these issues, we propose a novel detection framework that enhances multi-scale lesion feature extraction and fusion, improving lesion position perception and subsequently boosting chest multi-disease detection performance. Initially, we construct a multi-scale lesion feature extraction network to tackle the uniqueness of various lesion features and locations, strengthening the global semantic correlation between lesion features and their positions. Following this, we introduce an instance-aware semantic enhancement network that dynamically amalgamates instance-specific features with high-level semantic representations across various scales. This adaptive integration effectively mitigates the loss of detailed information within lesion regions. Additionally, we perform lesion region feature mapping using candidate boxes to preserve crucial positional information, enhancing the accuracy of chest disease detection across multiple scales. Experimental results on the VinDr-CXR dataset reveal a 6% increment in mean average precision (mAP) and an 8.4% improvement in mean recall (mR) when compared to state-of-the-art baselines. This demonstrates the effectiveness of the model in accurately detecting multiple chest diseases by capturing specific features and location information.</p>","PeriodicalId":516858,"journal":{"name":"Journal of imaging informatics in medicine","volume":" ","pages":"2752-2767"},"PeriodicalIF":0.0000,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of imaging informatics in medicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s10278-024-01133-7","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/5/17 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Accurately identifying and locating lesions in chest X-rays has the potential to significantly enhance diagnostic efficiency, quality, and interpretability. However, current methods primarily focus on detecting of specific diseases in chest X-rays, disregarding the presence of multiple diseases in a single chest X-ray scan. Moreover, the diversity in lesion locations and attributes introduces complexity in accurately discerning specific traits for each lesion, leading to diminished accuracy when detecting multiple diseases. To address these issues, we propose a novel detection framework that enhances multi-scale lesion feature extraction and fusion, improving lesion position perception and subsequently boosting chest multi-disease detection performance. Initially, we construct a multi-scale lesion feature extraction network to tackle the uniqueness of various lesion features and locations, strengthening the global semantic correlation between lesion features and their positions. Following this, we introduce an instance-aware semantic enhancement network that dynamically amalgamates instance-specific features with high-level semantic representations across various scales. This adaptive integration effectively mitigates the loss of detailed information within lesion regions. Additionally, we perform lesion region feature mapping using candidate boxes to preserve crucial positional information, enhancing the accuracy of chest disease detection across multiple scales. Experimental results on the VinDr-CXR dataset reveal a 6% increment in mean average precision (mAP) and an 8.4% improvement in mean recall (mR) when compared to state-of-the-art baselines. This demonstrates the effectiveness of the model in accurately detecting multiple chest diseases by capturing specific features and location information.
准确识别和定位胸部 X 光片中的病变有可能显著提高诊断效率、质量和可解释性。然而,目前的方法主要侧重于检测胸部 X 光片中的特定疾病,而忽略了在一次胸部 X 光扫描中存在多种疾病的情况。此外,病变位置和属性的多样性给准确辨别每个病变的具体特征带来了复杂性,导致检测多种疾病时的准确性降低。为了解决这些问题,我们提出了一种新型检测框架,它能增强多尺度病变特征提取和融合,改善病变位置感知,从而提高胸部多种疾病的检测性能。首先,我们构建了一个多尺度病变特征提取网络,以解决各种病变特征和位置的独特性问题,加强病变特征及其位置之间的全局语义相关性。随后,我们引入了实例感知语义增强网络,该网络可动态地将特定实例特征与不同尺度的高级语义表征相结合。这种自适应整合可有效减少病变区域内详细信息的丢失。此外,我们还使用候选框进行病变区域特征映射,以保留关键的位置信息,从而提高跨尺度胸部疾病检测的准确性。在 VinDr-CXR 数据集上的实验结果显示,与最先进的基线相比,平均精确度 (mAP) 提高了 6%,平均召回率 (mR) 提高了 8.4%。这证明了该模型通过捕捉特定特征和位置信息,在准确检测多种胸部疾病方面的有效性。