Yongjie Liu, Yang Li, Mingfeng Jiang, Shuchao Wang, Shitai Ye, Simon Walsh, Guang Yang
{"title":"SOCR-YOLO: Small Objects Detection Algorithm in Medical Images","authors":"Yongjie Liu, Yang Li, Mingfeng Jiang, Shuchao Wang, Shitai Ye, Simon Walsh, Guang Yang","doi":"10.1002/ima.23130","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>In the field of medical image analysis, object detection plays a crucial role by providing interpretable diagnostic information to healthcare professionals. Although current object detection models have achieved remarkable success in conventional images, their performance in detecting abnormalities in medical images has not been as satisfactory. This is primarily due to the complexity of anatomical structures in medical images, and the fact that some lesions may have subtle features, particularly in the case of early-stage, small-scale abnormalities. To address this challenge, we introduce SOCR-YOLO, a novel lesion detection model with online convolutional reparameterization based on channel shuffling. First, it employs the SOCR (Shuffled Channel with Online Convolutional Re-parameterization) module to establish a connection between feature concatenation and computational efficiency, aiming to extract more comprehensive information while reducing time consumption. Second, it incorporates the Bi-FPN structure to achieve multiscale feature fusion. Lastly, the loss function has been optimized to improve the model training process. We evaluated two datasets, chest x-ray (Vindr-CXR) and brain tumor (Br35H), provided by the Kaggle competition. Experimental results show that the proposed method has outperformed several state-of-the-art models, including YOLOv8, YOLO-NAS, and RT-DETR, in both speed and accuracy. Notably, in the context of chest x-ray anomaly detection, SOCR-YOLO exhibits a 1.8% enhancement in accuracy over YOLOv8 while simultaneously reducing floating-point operations by 26.3%. Additionally, a similar 1.8% improvement in accuracy is observed in the detection of brain tumors. The results indicate the superior ability of our model to detect multiscale variations and small lesions.</p>\n </div>","PeriodicalId":14027,"journal":{"name":"International Journal of Imaging Systems and Technology","volume":"34 4","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2024-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Imaging Systems and Technology","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/ima.23130","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
In the field of medical image analysis, object detection plays a crucial role by providing interpretable diagnostic information to healthcare professionals. Although current object detection models have achieved remarkable success in conventional images, their performance in detecting abnormalities in medical images has not been as satisfactory. This is primarily due to the complexity of anatomical structures in medical images, and the fact that some lesions may have subtle features, particularly in the case of early-stage, small-scale abnormalities. To address this challenge, we introduce SOCR-YOLO, a novel lesion detection model with online convolutional reparameterization based on channel shuffling. First, it employs the SOCR (Shuffled Channel with Online Convolutional Re-parameterization) module to establish a connection between feature concatenation and computational efficiency, aiming to extract more comprehensive information while reducing time consumption. Second, it incorporates the Bi-FPN structure to achieve multiscale feature fusion. Lastly, the loss function has been optimized to improve the model training process. We evaluated two datasets, chest x-ray (Vindr-CXR) and brain tumor (Br35H), provided by the Kaggle competition. Experimental results show that the proposed method has outperformed several state-of-the-art models, including YOLOv8, YOLO-NAS, and RT-DETR, in both speed and accuracy. Notably, in the context of chest x-ray anomaly detection, SOCR-YOLO exhibits a 1.8% enhancement in accuracy over YOLOv8 while simultaneously reducing floating-point operations by 26.3%. Additionally, a similar 1.8% improvement in accuracy is observed in the detection of brain tumors. The results indicate the superior ability of our model to detect multiscale variations and small lesions.
期刊介绍:
The International Journal of Imaging Systems and Technology (IMA) is a forum for the exchange of ideas and results relevant to imaging systems, including imaging physics and informatics. The journal covers all imaging modalities in humans and animals.
IMA accepts technically sound and scientifically rigorous research in the interdisciplinary field of imaging, including relevant algorithmic research and hardware and software development, and their applications relevant to medical research. The journal provides a platform to publish original research in structural and functional imaging.
The journal is also open to imaging studies of the human body and on animals that describe novel diagnostic imaging and analyses methods. Technical, theoretical, and clinical research in both normal and clinical populations is encouraged. Submissions describing methods, software, databases, replication studies as well as negative results are also considered.
The scope of the journal includes, but is not limited to, the following in the context of biomedical research:
Imaging and neuro-imaging modalities: structural MRI, functional MRI, PET, SPECT, CT, ultrasound, EEG, MEG, NIRS etc.;
Neuromodulation and brain stimulation techniques such as TMS and tDCS;
Software and hardware for imaging, especially related to human and animal health;
Image segmentation in normal and clinical populations;
Pattern analysis and classification using machine learning techniques;
Computational modeling and analysis;
Brain connectivity and connectomics;
Systems-level characterization of brain function;
Neural networks and neurorobotics;
Computer vision, based on human/animal physiology;
Brain-computer interface (BCI) technology;
Big data, databasing and data mining.