Pan Wang, Chengzhi Lyu, Lei Zhang, Hong He, Fang Dai
{"title":"Mask-based anomaly segmentation in complex driving scenes","authors":"Pan Wang, Chengzhi Lyu, Lei Zhang, Hong He, Fang Dai","doi":"10.1016/j.patrec.2025.05.013","DOIUrl":null,"url":null,"abstract":"<div><div>Road anomaly segmentation plays a pivotal role in advancing the safety of autonomous driving by facilitating the detection of unknown objects in complex traffic environments. Nevertheless, traditional semantic segmentation models, limited by predefined categories, often struggle to accurately identify anomalous objects. In this study, we propose AnomaskDrive, a mask-based anomaly segmentation approach that integrates a comprehensive mask-based attention mechanism and a mask refinement strategy within an RbA framework, enhancing the detection of anomalous objects in complex scenes. The proposed mask-based attention mechanism effectively distinguishes between foreground and background regions, thereby enhancing the segmentation of anomalies in cluttered road environments. Additionally, the inclusion of a mask refinement strategy minimizes false positives and elevates overall segmentation accuracy, demonstrating the robustness and effectiveness of our method. Benchmark evaluations on Road Anomaly and Fishyscapes Lost &Found datasets demonstrate that AnomaskDrive outperforms existing methods, achieving AUC/AP/FPR@95 scores of 98.56%/90.87%/4.68% and 97.75%/74.54%/4.25%, respectively, underscoring its competitive advantage in anomaly segmentation.</div></div>","PeriodicalId":54638,"journal":{"name":"Pattern Recognition Letters","volume":"196 ","pages":"Pages 142-147"},"PeriodicalIF":3.9000,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition Letters","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167865525002089","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Road anomaly segmentation plays a pivotal role in advancing the safety of autonomous driving by facilitating the detection of unknown objects in complex traffic environments. Nevertheless, traditional semantic segmentation models, limited by predefined categories, often struggle to accurately identify anomalous objects. In this study, we propose AnomaskDrive, a mask-based anomaly segmentation approach that integrates a comprehensive mask-based attention mechanism and a mask refinement strategy within an RbA framework, enhancing the detection of anomalous objects in complex scenes. The proposed mask-based attention mechanism effectively distinguishes between foreground and background regions, thereby enhancing the segmentation of anomalies in cluttered road environments. Additionally, the inclusion of a mask refinement strategy minimizes false positives and elevates overall segmentation accuracy, demonstrating the robustness and effectiveness of our method. Benchmark evaluations on Road Anomaly and Fishyscapes Lost &Found datasets demonstrate that AnomaskDrive outperforms existing methods, achieving AUC/AP/FPR@95 scores of 98.56%/90.87%/4.68% and 97.75%/74.54%/4.25%, respectively, underscoring its competitive advantage in anomaly segmentation.
道路异常分割有助于在复杂的交通环境中发现未知物体,对提高自动驾驶的安全性具有关键作用。然而,传统的语义分割模型受到预定义类别的限制,往往难以准确识别异常对象。在本研究中,我们提出了一种基于掩模的异常分割方法AnomaskDrive,该方法在RbA框架内集成了综合的基于掩模的注意机制和掩模细化策略,增强了对复杂场景中异常物体的检测。本文提出的基于掩模的注意机制能够有效区分前景和背景区域,从而增强对混乱道路环境中异常的分割。此外,包含掩码细化策略最大限度地减少误报并提高整体分割精度,证明了我们的方法的鲁棒性和有效性。对道路异常和fishyscape Lost & Found数据集的基准评估表明,AnomaskDrive优于现有方法,AUC/AP/FPR@95得分分别为98.56%/90.87%/4.68%和97.75%/74.54%/4.25%,突出了其在异常分割方面的竞争优势。
期刊介绍:
Pattern Recognition Letters aims at rapid publication of concise articles of a broad interest in pattern recognition.
Subject areas include all the current fields of interest represented by the Technical Committees of the International Association of Pattern Recognition, and other developing themes involving learning and recognition.