{"title":"Weakly-supervised road condition detection via scribble annotations","authors":"Hongshuai Qin, Xiao-Diao Chen, Wen Wu, Wenya Yang","doi":"10.1016/j.jvcir.2025.104494","DOIUrl":null,"url":null,"abstract":"<div><div>Deep learning-based solutions in road condition detection rely on fully supervised learning widely which requires a large quantity of training data with pixel-wise annotation from researchers. Accurate dense labeling is challenging since the boundaries between road condition regions are ambiguous. To combat this issue, this work aims to introduce a weakly supervised road condition detection framework, which can generate high-quality pseudo masks from sparse scribble labels and train a road condition detection network using these masks. Specifically, we collect a dataset for road condition detection and annotate it with scribble. Next, we propose a graph convolutional network (GCN)-based label augmentation strategy, which considers both local and global image information, to generate pixel-level pseudo-labels by augmenting the label information from scribbles to the whole. To alleviate the label inconsistency caused by sparse annotations, we adopt the supervision strategy with joint loss of labeled and unlabeled regions during training. Extensive experiments demonstrate that the proposed method can work well on various road condition detection and is on par with the full-supervision method. The code will be made publicly available at <span><span>https://github.com/qinhs9/Scribble_road_Condition</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":54755,"journal":{"name":"Journal of Visual Communication and Image Representation","volume":"111 ","pages":"Article 104494"},"PeriodicalIF":3.1000,"publicationDate":"2025-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Visual Communication and Image Representation","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1047320325001087","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Deep learning-based solutions in road condition detection rely on fully supervised learning widely which requires a large quantity of training data with pixel-wise annotation from researchers. Accurate dense labeling is challenging since the boundaries between road condition regions are ambiguous. To combat this issue, this work aims to introduce a weakly supervised road condition detection framework, which can generate high-quality pseudo masks from sparse scribble labels and train a road condition detection network using these masks. Specifically, we collect a dataset for road condition detection and annotate it with scribble. Next, we propose a graph convolutional network (GCN)-based label augmentation strategy, which considers both local and global image information, to generate pixel-level pseudo-labels by augmenting the label information from scribbles to the whole. To alleviate the label inconsistency caused by sparse annotations, we adopt the supervision strategy with joint loss of labeled and unlabeled regions during training. Extensive experiments demonstrate that the proposed method can work well on various road condition detection and is on par with the full-supervision method. The code will be made publicly available at https://github.com/qinhs9/Scribble_road_Condition.
期刊介绍:
The Journal of Visual Communication and Image Representation publishes papers on state-of-the-art visual communication and image representation, with emphasis on novel technologies and theoretical work in this multidisciplinary area of pure and applied research. The field of visual communication and image representation is considered in its broadest sense and covers both digital and analog aspects as well as processing and communication in biological visual systems.