SOCR-YOLO: Small Objects Detection Algorithm in Medical Images

IF 3 4区计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC

International Journal of Imaging Systems and Technology Pub Date : 2024-06-21 DOI:10.1002/ima.23130

Yongjie Liu, Yang Li, Mingfeng Jiang, Shuchao Wang, Shitai Ye, Simon Walsh, Guang Yang

{"title":"SOCR-YOLO: Small Objects Detection Algorithm in Medical Images","authors":"Yongjie Liu, Yang Li, Mingfeng Jiang, Shuchao Wang, Shitai Ye, Simon Walsh, Guang Yang","doi":"10.1002/ima.23130","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>In the field of medical image analysis, object detection plays a crucial role by providing interpretable diagnostic information to healthcare professionals. Although current object detection models have achieved remarkable success in conventional images, their performance in detecting abnormalities in medical images has not been as satisfactory. This is primarily due to the complexity of anatomical structures in medical images, and the fact that some lesions may have subtle features, particularly in the case of early-stage, small-scale abnormalities. To address this challenge, we introduce SOCR-YOLO, a novel lesion detection model with online convolutional reparameterization based on channel shuffling. First, it employs the SOCR (Shuffled Channel with Online Convolutional Re-parameterization) module to establish a connection between feature concatenation and computational efficiency, aiming to extract more comprehensive information while reducing time consumption. Second, it incorporates the Bi-FPN structure to achieve multiscale feature fusion. Lastly, the loss function has been optimized to improve the model training process. We evaluated two datasets, chest x-ray (Vindr-CXR) and brain tumor (Br35H), provided by the Kaggle competition. Experimental results show that the proposed method has outperformed several state-of-the-art models, including YOLOv8, YOLO-NAS, and RT-DETR, in both speed and accuracy. Notably, in the context of chest x-ray anomaly detection, SOCR-YOLO exhibits a 1.8% enhancement in accuracy over YOLOv8 while simultaneously reducing floating-point operations by 26.3%. Additionally, a similar 1.8% improvement in accuracy is observed in the detection of brain tumors. The results indicate the superior ability of our model to detect multiscale variations and small lesions.</p>\n </div>","PeriodicalId":14027,"journal":{"name":"International Journal of Imaging Systems and Technology","volume":"34 4","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2024-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Imaging Systems and Technology","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/ima.23130","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

Abstract

In the field of medical image analysis, object detection plays a crucial role by providing interpretable diagnostic information to healthcare professionals. Although current object detection models have achieved remarkable success in conventional images, their performance in detecting abnormalities in medical images has not been as satisfactory. This is primarily due to the complexity of anatomical structures in medical images, and the fact that some lesions may have subtle features, particularly in the case of early-stage, small-scale abnormalities. To address this challenge, we introduce SOCR-YOLO, a novel lesion detection model with online convolutional reparameterization based on channel shuffling. First, it employs the SOCR (Shuffled Channel with Online Convolutional Re-parameterization) module to establish a connection between feature concatenation and computational efficiency, aiming to extract more comprehensive information while reducing time consumption. Second, it incorporates the Bi-FPN structure to achieve multiscale feature fusion. Lastly, the loss function has been optimized to improve the model training process. We evaluated two datasets, chest x-ray (Vindr-CXR) and brain tumor (Br35H), provided by the Kaggle competition. Experimental results show that the proposed method has outperformed several state-of-the-art models, including YOLOv8, YOLO-NAS, and RT-DETR, in both speed and accuracy. Notably, in the context of chest x-ray anomaly detection, SOCR-YOLO exhibits a 1.8% enhancement in accuracy over YOLOv8 while simultaneously reducing floating-point operations by 26.3%. Additionally, a similar 1.8% improvement in accuracy is observed in the detection of brain tumors. The results indicate the superior ability of our model to detect multiscale variations and small lesions.

查看原文本刊更多论文

SOCR-YOLO：医学图像中的小物体检测算法

在医学图像分析领域，物体检测起着至关重要的作用，它能为医护人员提供可解释的诊断信息。尽管目前的物体检测模型在常规图像中取得了显著的成功，但在医学图像中检测异常情况的表现却不尽如人意。这主要是由于医学图像中解剖结构的复杂性，以及某些病变可能具有微妙的特征，尤其是早期的小范围异常。为了应对这一挑战，我们引入了 SOCR-YOLO，这是一种基于通道洗牌的在线卷积重新参数化的新型病变检测模型。首先，它采用了 SOCR（洗牌信道与在线卷积重参数化）模块，在特征串联和计算效率之间建立了联系，旨在提取更全面的信息，同时减少时间消耗。其次，它采用了 Bi-FPN 结构来实现多尺度特征融合。最后，对损失函数进行了优化，以改进模型训练过程。我们评估了 Kaggle 竞赛提供的两个数据集：胸部 X 光（Vindr-CXR）和脑肿瘤（Br35H）。实验结果表明，所提出的方法在速度和准确性上都优于几个最先进的模型，包括 YOLOv8、YOLO-NAS 和 RT-DETR。值得注意的是，在胸部 X 射线异常检测方面，SOCR-YOLO 比 YOLOv8 的准确性提高了 1.8%，同时浮点运算减少了 26.3%。此外，在检测脑肿瘤时，准确率也提高了 1.8%。这些结果表明，我们的模型在检测多尺度变化和小病变方面具有卓越的能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Imaging Systems and Technology 工程技术-成像科学与照相技术

CiteScore

6.90

自引率

6.10%

发文量

138

审稿时长

3 months

期刊介绍： The International Journal of Imaging Systems and Technology (IMA) is a forum for the exchange of ideas and results relevant to imaging systems, including imaging physics and informatics. The journal covers all imaging modalities in humans and animals. IMA accepts technically sound and scientifically rigorous research in the interdisciplinary field of imaging, including relevant algorithmic research and hardware and software development, and their applications relevant to medical research. The journal provides a platform to publish original research in structural and functional imaging. The journal is also open to imaging studies of the human body and on animals that describe novel diagnostic imaging and analyses methods. Technical, theoretical, and clinical research in both normal and clinical populations is encouraged. Submissions describing methods, software, databases, replication studies as well as negative results are also considered. The scope of the journal includes, but is not limited to, the following in the context of biomedical research: Imaging and neuro-imaging modalities: structural MRI, functional MRI, PET, SPECT, CT, ultrasound, EEG, MEG, NIRS etc.; Neuromodulation and brain stimulation techniques such as TMS and tDCS; Software and hardware for imaging, especially related to human and animal health; Image segmentation in normal and clinical populations; Pattern analysis and classification using machine learning techniques; Computational modeling and analysis; Brain connectivity and connectomics; Systems-level characterization of brain function; Neural networks and neurorobotics; Computer vision, based on human/animal physiology; Brain-computer interface (BCI) technology; Big data, databasing and data mining.