How to detect occluded crosswalks in overview images? Comparing three methods in a heavily occluded area

IF 4.3 Q2 TRANSPORTATION
Yuanyuan Zhang , Joseph Luttrell IV , Chaoyang Zhang
{"title":"How to detect occluded crosswalks in overview images? Comparing three methods in a heavily occluded area","authors":"Yuanyuan Zhang ,&nbsp;Joseph Luttrell IV ,&nbsp;Chaoyang Zhang","doi":"10.1016/j.ijtst.2024.04.001","DOIUrl":null,"url":null,"abstract":"<div><div>Crosswalk presence data are crucial for pedestrian safety and urban planning. However, obtaining such data at a large scale is often challenging due to the high cost associated with traditional collection methods. While automated methods based on computer vision have been explored to detect crosswalks from aerial images, a major obstacle to their application is the handling of candidate crosswalks occluded by objects or shadows in the aerial imagery. To address this challenge, this study explores different deep learning-based solutions, including the aerial-view method (AVM) and street-view method (SVM), which are commonly used, and a combination of them, i.e., the dual-perspective method (DPM). Deep learning models based on convolutional neural networks (CNNs) with the VGG16 architecture were trained using 16 815 images to automatically detect crosswalks from both aerial and street view images. To compare the performance of these methods in handling occlusions, 1 378 images from a heavily occluded area were processed separately by the three methods. The results showed that the AVM suffered the most when dealing with images from a heavily occluded area, resulting in the lowest accuracy, precision, recall, and F1 score among the three methods. On the other hand, the SVM outperformed the AVM significantly. The DPM demonstrated the highest accuracy and precision values, indicating its superiority in accurately predicting the location of a crosswalk. However, the SVM exhibited the highest recall value, highlighting its superior ability to recover an occluded crosswalk among all methods.</div></div>","PeriodicalId":52282,"journal":{"name":"International Journal of Transportation Science and Technology","volume":"17 ","pages":"Pages 148-160"},"PeriodicalIF":4.3000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Transportation Science and Technology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2046043024000364","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"TRANSPORTATION","Score":null,"Total":0}
引用次数: 0

Abstract

Crosswalk presence data are crucial for pedestrian safety and urban planning. However, obtaining such data at a large scale is often challenging due to the high cost associated with traditional collection methods. While automated methods based on computer vision have been explored to detect crosswalks from aerial images, a major obstacle to their application is the handling of candidate crosswalks occluded by objects or shadows in the aerial imagery. To address this challenge, this study explores different deep learning-based solutions, including the aerial-view method (AVM) and street-view method (SVM), which are commonly used, and a combination of them, i.e., the dual-perspective method (DPM). Deep learning models based on convolutional neural networks (CNNs) with the VGG16 architecture were trained using 16 815 images to automatically detect crosswalks from both aerial and street view images. To compare the performance of these methods in handling occlusions, 1 378 images from a heavily occluded area were processed separately by the three methods. The results showed that the AVM suffered the most when dealing with images from a heavily occluded area, resulting in the lowest accuracy, precision, recall, and F1 score among the three methods. On the other hand, the SVM outperformed the AVM significantly. The DPM demonstrated the highest accuracy and precision values, indicating its superiority in accurately predicting the location of a crosswalk. However, the SVM exhibited the highest recall value, highlighting its superior ability to recover an occluded crosswalk among all methods.
如何检测概览图像中被遮挡的人行横道?比较严重遮挡区域中的三种方法
人行横道存在数据对行人安全和城市规划至关重要。然而,由于与传统收集方法相关的高成本,大规模获取此类数据通常具有挑战性。虽然已经探索了基于计算机视觉的自动方法来检测航空图像中的人行横道,但其应用的主要障碍是处理航空图像中被物体或阴影遮挡的候选人行横道。为了应对这一挑战,本研究探索了不同的基于深度学习的解决方案,包括常用的鸟瞰图方法(AVM)和街景方法(SVM),以及它们的组合,即双视角方法(DPM)。基于VGG16架构的卷积神经网络(cnn)的深度学习模型使用16815张图像进行训练,自动检测航拍和街景图像中的人行横道。为了比较这三种方法处理遮挡的性能,分别对1 378幅重度遮挡区域的图像进行了处理。结果表明,AVM在处理严重遮挡区域的图像时受到的影响最大,导致三种方法的准确率、精密度、召回率和F1分数最低。另一方面,SVM的性能明显优于AVM。DPM具有最高的准确度和精度值,表明其在准确预测人行横道位置方面具有优势。然而,支持向量机的召回值最高,突出了其在所有方法中恢复闭塞人行横道的能力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
International Journal of Transportation Science and Technology
International Journal of Transportation Science and Technology Engineering-Civil and Structural Engineering
CiteScore
7.20
自引率
0.00%
发文量
105
审稿时长
88 days
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信