{"title":"AEFusion: An Attention-Based Ensemble Learning Approach for BEV Fusion Perception in Autonomous Modular Buses","authors":"Hongyi Lin;Shouqun Ming;Yang Liu;Xiaobo Qu","doi":"10.1109/TIV.2024.3454288","DOIUrl":null,"url":null,"abstract":"Autonomous modular buses (AMB) are considered a promising solution to the challenges in public transportation, as they can reduce commute times, enhance transfer convenience, and address supply-demand imbalances in transportation systems. Nonetheless, current research mainly focuses on operational aspects, whereas the high precision required for in-transit docking remains a critical challenge for implementation. The accuracy of current autonomous driving perception systems is often limited due to errors introduced by multi-sensor fusion methods. To address this issue, this paper introduces an attention-based ensemble learning fusion method (AEfusion) which includes a supervision module that utilizes the more accurate depth information from LiDAR to guide the generation of image depth information. Additionally, the fusion module incorporates two enhanced channel attention blocks and a spatial attention block to strengthen feature learning and integration. Experiments on both the nuScenes dataset and a self-collected dataset demonstrate that our method is suited for full-range docking perception in AMBs and is superior to the existing approaches.","PeriodicalId":36532,"journal":{"name":"IEEE Transactions on Intelligent Vehicles","volume":"10 5","pages":"3468-3480"},"PeriodicalIF":14.3000,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Intelligent Vehicles","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10665996/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Autonomous modular buses (AMB) are considered a promising solution to the challenges in public transportation, as they can reduce commute times, enhance transfer convenience, and address supply-demand imbalances in transportation systems. Nonetheless, current research mainly focuses on operational aspects, whereas the high precision required for in-transit docking remains a critical challenge for implementation. The accuracy of current autonomous driving perception systems is often limited due to errors introduced by multi-sensor fusion methods. To address this issue, this paper introduces an attention-based ensemble learning fusion method (AEfusion) which includes a supervision module that utilizes the more accurate depth information from LiDAR to guide the generation of image depth information. Additionally, the fusion module incorporates two enhanced channel attention blocks and a spatial attention block to strengthen feature learning and integration. Experiments on both the nuScenes dataset and a self-collected dataset demonstrate that our method is suited for full-range docking perception in AMBs and is superior to the existing approaches.
期刊介绍:
The IEEE Transactions on Intelligent Vehicles (T-IV) is a premier platform for publishing peer-reviewed articles that present innovative research concepts, application results, significant theoretical findings, and application case studies in the field of intelligent vehicles. With a particular emphasis on automated vehicles within roadway environments, T-IV aims to raise awareness of pressing research and application challenges.
Our focus is on providing critical information to the intelligent vehicle community, serving as a dissemination vehicle for IEEE ITS Society members and others interested in learning about the state-of-the-art developments and progress in research and applications related to intelligent vehicles. Join us in advancing knowledge and innovation in this dynamic field.