{"title":"Ultra-Lightweight Network for Medical Image Segmentation Inspired by Bio-Visual Interaction","authors":"Zhefei Cai;Yingle Fan;Minwei Zhu;Tao Fang","doi":"10.1109/TCSVT.2024.3507383","DOIUrl":null,"url":null,"abstract":"Computer-aided medical image segmentation helps to assist physicians in locating lesion area for the subsequent diagnosis and treatment. Due to the irregular shape of the target and the uneven sample size between the target and the background area, automatic segmentation of medical images is a challenging task. Many CNN-Based, Transformer-Based models deepen the number of network layers or introduce complex modules in order to improve the segmentation accuracy. Limited by the computational resources, these types of large models are not suitable for the actual clinical environment. Inspired by the rapidity, accuracy, and low consumption characteristics of bio-visual processing, the Ultra-Lightweight Network Inspired by Bio-Visual Interaction (BVI-Net) is constructed in this paper. The Global Pathway is constructed by simulating the dorsal stream, in order to extract global features rapidly, and the Local Pathway is constructed by simulating the ventral stream, in order to process local features finely. At the same time, the skip connection module integrating Graph Convolutional Network (GCN) attention mechanism is constructed to simulate the synchronous integration ability of the visual pathway for multi-level features. The International Skin Imaging Collaboration (ISIC) dataset, the Liver Tumor Segmentation (LiTS) dataset, and the Brain Tumor Segmentation Challenge (BraTS) dataset are used for experiments. The BVI-Net proposed in this paper requires only 0.026M parameters to achieve the excellent performance in three representative medical image segmentation datasets, which has certain advantages over state-of-the-art (SOTA) methods. The biological vision mechanism and the artificial intelligence algorithm are integrated in this paper, which provides new ideas for the construction of biological vision-guided deep learning models and promotes the development of biomimetic computational vision.","PeriodicalId":13082,"journal":{"name":"IEEE Transactions on Circuits and Systems for Video Technology","volume":"35 4","pages":"3486-3497"},"PeriodicalIF":11.1000,"publicationDate":"2024-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Circuits and Systems for Video Technology","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10769421/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Computer-aided medical image segmentation helps to assist physicians in locating lesion area for the subsequent diagnosis and treatment. Due to the irregular shape of the target and the uneven sample size between the target and the background area, automatic segmentation of medical images is a challenging task. Many CNN-Based, Transformer-Based models deepen the number of network layers or introduce complex modules in order to improve the segmentation accuracy. Limited by the computational resources, these types of large models are not suitable for the actual clinical environment. Inspired by the rapidity, accuracy, and low consumption characteristics of bio-visual processing, the Ultra-Lightweight Network Inspired by Bio-Visual Interaction (BVI-Net) is constructed in this paper. The Global Pathway is constructed by simulating the dorsal stream, in order to extract global features rapidly, and the Local Pathway is constructed by simulating the ventral stream, in order to process local features finely. At the same time, the skip connection module integrating Graph Convolutional Network (GCN) attention mechanism is constructed to simulate the synchronous integration ability of the visual pathway for multi-level features. The International Skin Imaging Collaboration (ISIC) dataset, the Liver Tumor Segmentation (LiTS) dataset, and the Brain Tumor Segmentation Challenge (BraTS) dataset are used for experiments. The BVI-Net proposed in this paper requires only 0.026M parameters to achieve the excellent performance in three representative medical image segmentation datasets, which has certain advantages over state-of-the-art (SOTA) methods. The biological vision mechanism and the artificial intelligence algorithm are integrated in this paper, which provides new ideas for the construction of biological vision-guided deep learning models and promotes the development of biomimetic computational vision.
期刊介绍:
The IEEE Transactions on Circuits and Systems for Video Technology (TCSVT) is dedicated to covering all aspects of video technologies from a circuits and systems perspective. We encourage submissions of general, theoretical, and application-oriented papers related to image and video acquisition, representation, presentation, and display. Additionally, we welcome contributions in areas such as processing, filtering, and transforms; analysis and synthesis; learning and understanding; compression, transmission, communication, and networking; as well as storage, retrieval, indexing, and search. Furthermore, papers focusing on hardware and software design and implementation are highly valued. Join us in advancing the field of video technology through innovative research and insights.