{"title":"SEDNet: Real-Time Semantic Segmentation Algorithm Based on STDC","authors":"Sugang Ma, Ziyi Zhao, Zhiqiang Hou, Wangsheng Yu, Xiaobao Yang, Xiangmo Zhao","doi":"10.1155/int/8243407","DOIUrl":null,"url":null,"abstract":"<div>\n <p>Recently, deep convolutional neural networks (DCNN) have been widely used in semantic segmentation tasks and have achieved high segmentation accuracy. However, most algorithms based on DCNN have high computational complexity, making them unsuitable for real-time segmentation. To solve this problem, this paper proposes a real-time semantic segmentation algorithm based on the STDC network. The algorithm adopts an “encoder–decoder” embedded in a U-shaped architecture to realize real-time segmentation while maintaining high accuracy. Following the encoder, a mixed pooling attention module is designed to expand the receptive field, enhancing the network model’s learning ability in complex scenarios. Then, a feature fusion module is used for combining features from different stages, and channel attention based on atrous convolution is employed to expand the receptive field and avoid dimensionality reduction learning. Finally, a Tversky-based detail loss function is used to encode more spatial details. The proposed algorithm was extensively tested on the challenging Cityscapes and CamVid datasets, and the experimental results showed that the proposed algorithm obtained 76.4% and 72.8% of mIoU, respectively. Meanwhile, our algorithm achieves 105.2 FPS and 165.6 FPS inference speed with a single NVIDIA GTX 1080Ti GPU, meeting the real-time segmentation requirements. The proposed algorithm can conduct real-time segmentation while maintaining high accuracy, achieving a good balance between accuracy and speed.</p>\n </div>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2025 1","pages":""},"PeriodicalIF":5.0000,"publicationDate":"2025-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/int/8243407","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Intelligent Systems","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1155/int/8243407","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Recently, deep convolutional neural networks (DCNN) have been widely used in semantic segmentation tasks and have achieved high segmentation accuracy. However, most algorithms based on DCNN have high computational complexity, making them unsuitable for real-time segmentation. To solve this problem, this paper proposes a real-time semantic segmentation algorithm based on the STDC network. The algorithm adopts an “encoder–decoder” embedded in a U-shaped architecture to realize real-time segmentation while maintaining high accuracy. Following the encoder, a mixed pooling attention module is designed to expand the receptive field, enhancing the network model’s learning ability in complex scenarios. Then, a feature fusion module is used for combining features from different stages, and channel attention based on atrous convolution is employed to expand the receptive field and avoid dimensionality reduction learning. Finally, a Tversky-based detail loss function is used to encode more spatial details. The proposed algorithm was extensively tested on the challenging Cityscapes and CamVid datasets, and the experimental results showed that the proposed algorithm obtained 76.4% and 72.8% of mIoU, respectively. Meanwhile, our algorithm achieves 105.2 FPS and 165.6 FPS inference speed with a single NVIDIA GTX 1080Ti GPU, meeting the real-time segmentation requirements. The proposed algorithm can conduct real-time segmentation while maintaining high accuracy, achieving a good balance between accuracy and speed.
期刊介绍:
The International Journal of Intelligent Systems serves as a forum for individuals interested in tapping into the vast theories based on intelligent systems construction. With its peer-reviewed format, the journal explores several fascinating editorials written by today''s experts in the field. Because new developments are being introduced each day, there''s much to be learned — examination, analysis creation, information retrieval, man–computer interactions, and more. The International Journal of Intelligent Systems uses charts and illustrations to demonstrate these ground-breaking issues, and encourages readers to share their thoughts and experiences.