M. Esat Kalfaoglu, Halil Ibrahim Ozturk, Ozsel Kilinc, Alptekin Temizel
{"title":"TopoMaskV2: Enhanced Instance-Mask-Based Formulation for the Road Topology Problem","authors":"M. Esat Kalfaoglu, Halil Ibrahim Ozturk, Ozsel Kilinc, Alptekin Temizel","doi":"arxiv-2409.11325","DOIUrl":null,"url":null,"abstract":"Recently, the centerline has become a popular representation of lanes due to\nits advantages in solving the road topology problem. To enhance centerline\nprediction, we have developed a new approach called TopoMask. Unlike previous\nmethods that rely on keypoints or parametric methods, TopoMask utilizes an\ninstance-mask-based formulation coupled with a masked-attention-based\ntransformer architecture. We introduce a quad-direction label representation to\nenrich the mask instances with flow information and design a corresponding\npost-processing technique for mask-to-centerline conversion. Additionally, we\ndemonstrate that the instance-mask formulation provides complementary\ninformation to parametric Bezier regressions, and fusing both outputs leads to\nimproved detection and topology performance. Moreover, we analyze the\nshortcomings of the pillar assumption in the Lift Splat technique and adapt a\nmulti-height bin configuration. Experimental results show that TopoMask\nachieves state-of-the-art performance in the OpenLane-V2 dataset, increasing\nfrom 44.1 to 49.4 for Subset-A and 44.7 to 51.8 for Subset-B in the V1.1 OLS\nbaseline.","PeriodicalId":501130,"journal":{"name":"arXiv - CS - Computer Vision and Pattern Recognition","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Computer Vision and Pattern Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.11325","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Recently, the centerline has become a popular representation of lanes due to
its advantages in solving the road topology problem. To enhance centerline
prediction, we have developed a new approach called TopoMask. Unlike previous
methods that rely on keypoints or parametric methods, TopoMask utilizes an
instance-mask-based formulation coupled with a masked-attention-based
transformer architecture. We introduce a quad-direction label representation to
enrich the mask instances with flow information and design a corresponding
post-processing technique for mask-to-centerline conversion. Additionally, we
demonstrate that the instance-mask formulation provides complementary
information to parametric Bezier regressions, and fusing both outputs leads to
improved detection and topology performance. Moreover, we analyze the
shortcomings of the pillar assumption in the Lift Splat technique and adapt a
multi-height bin configuration. Experimental results show that TopoMask
achieves state-of-the-art performance in the OpenLane-V2 dataset, increasing
from 44.1 to 49.4 for Subset-A and 44.7 to 51.8 for Subset-B in the V1.1 OLS
baseline.