{"title":"Boundary semantic interactive aggregation network for scene segmentation","authors":"Fan Zhang, Qijun Lv, Binrong Pan, Yun Wang","doi":"10.1016/j.eswa.2025.126754","DOIUrl":null,"url":null,"abstract":"<div><div>Deep learning-based road scene segmentation has been paid significantly attention for autonomous driving. Most approaches focus on extracting semantic information of target and neglect boundary feature, resulting in difficultly improving holistically semantics modeling. To handle this challenge, we propose boundary semantic interactive aggregation network (BSIA-Net) to segment road scene, which embeds boundary feature into multi-scale semantic features and aggregates interactive information for promoting semantic understanding. The designed BSIA-Net consists of three parts including semantic flow, boundary flow, and aggregation flow. The semantic flow is used to extract multi-scale semantic information and constructs astrous stack pooling (ASP) module to capture rich context information by expanding the receptive field. The boundary flow is specifically designed to refine low-level boundary feature from edge extracting operator (EEO) by multi-scale semantic features. The aggregation flow constructs interactive aggregation module (IAM) to capture long-range dependencies between the inner objects and boundaries by interactively enhancing intra-class consistency. The BSIA-Net successively achieves 81.1 % and 45.41 % in mean intersection over union on Cityscapes and ADE20K datasets. Extensive experiments compared to some other methods demonstrates the effectiveness and advancement of our method.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"272 ","pages":"Article 126754"},"PeriodicalIF":7.5000,"publicationDate":"2025-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425003768","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Deep learning-based road scene segmentation has been paid significantly attention for autonomous driving. Most approaches focus on extracting semantic information of target and neglect boundary feature, resulting in difficultly improving holistically semantics modeling. To handle this challenge, we propose boundary semantic interactive aggregation network (BSIA-Net) to segment road scene, which embeds boundary feature into multi-scale semantic features and aggregates interactive information for promoting semantic understanding. The designed BSIA-Net consists of three parts including semantic flow, boundary flow, and aggregation flow. The semantic flow is used to extract multi-scale semantic information and constructs astrous stack pooling (ASP) module to capture rich context information by expanding the receptive field. The boundary flow is specifically designed to refine low-level boundary feature from edge extracting operator (EEO) by multi-scale semantic features. The aggregation flow constructs interactive aggregation module (IAM) to capture long-range dependencies between the inner objects and boundaries by interactively enhancing intra-class consistency. The BSIA-Net successively achieves 81.1 % and 45.41 % in mean intersection over union on Cityscapes and ADE20K datasets. Extensive experiments compared to some other methods demonstrates the effectiveness and advancement of our method.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.