基础模型能可靠地识别空间危害吗?路缘分割的案例研究。

IF 2.5 4区 医学 Q1 REHABILITATION
Diwei Sheng, Giles Hamilton-Fletcher, Mahya Beheshti, Chen Feng, John-Ross Rizzo
{"title":"基础模型能可靠地识别空间危害吗?路缘分割的案例研究。","authors":"Diwei Sheng, Giles Hamilton-Fletcher, Mahya Beheshti, Chen Feng, John-Ross Rizzo","doi":"10.1080/10400435.2025.2490632","DOIUrl":null,"url":null,"abstract":"<p><p>Curbs serve as vital borders that delineate safe pedestrian zones from potential vehicular traffic hazards. Curbs also represent a primary spatial hazard during dynamic navigation with significant stumbling potential. Such vulnerabilities are particularly exacerbated for persons with blindness and low vision (PBLV). Accurate visual-based discrimination of curbs is paramount for assistive technologies that aid PBLV with safe navigation in urban environments. Herein, we investigate the efficacy of curb segmentation for foundation models. We introduce the largest curb segmentation dataset to date to benchmark leading foundation models. Our results show that state-of-the-art foundation models face significant challenges in curb segmentation. This is due to their low precision and recall with poor performance distinguishing curbs from curb-like objects or non-curb areas, such as sidewalks. In addition, the best-performing model averaged a 3.70-s inference time, underscoring problems in providing real-time assistance. In response, we propose solutions including filtered bounding box selections to achieve more accurate curb segmentation. Overall, despite the immediate flexibility of foundation models, their application for practical assistive technology applications still requires refinement. This research highlights the critical need for specialized datasets and tailored model training to address navigation challenges for PBLV and underscores implicit weaknesses in foundation models.</p>","PeriodicalId":51568,"journal":{"name":"Assistive Technology","volume":" ","pages":"1-9"},"PeriodicalIF":2.5000,"publicationDate":"2025-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Can foundation models reliably identify spatial hazards? A case study on curb segmentation.\",\"authors\":\"Diwei Sheng, Giles Hamilton-Fletcher, Mahya Beheshti, Chen Feng, John-Ross Rizzo\",\"doi\":\"10.1080/10400435.2025.2490632\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Curbs serve as vital borders that delineate safe pedestrian zones from potential vehicular traffic hazards. Curbs also represent a primary spatial hazard during dynamic navigation with significant stumbling potential. Such vulnerabilities are particularly exacerbated for persons with blindness and low vision (PBLV). Accurate visual-based discrimination of curbs is paramount for assistive technologies that aid PBLV with safe navigation in urban environments. Herein, we investigate the efficacy of curb segmentation for foundation models. We introduce the largest curb segmentation dataset to date to benchmark leading foundation models. Our results show that state-of-the-art foundation models face significant challenges in curb segmentation. This is due to their low precision and recall with poor performance distinguishing curbs from curb-like objects or non-curb areas, such as sidewalks. In addition, the best-performing model averaged a 3.70-s inference time, underscoring problems in providing real-time assistance. In response, we propose solutions including filtered bounding box selections to achieve more accurate curb segmentation. Overall, despite the immediate flexibility of foundation models, their application for practical assistive technology applications still requires refinement. This research highlights the critical need for specialized datasets and tailored model training to address navigation challenges for PBLV and underscores implicit weaknesses in foundation models.</p>\",\"PeriodicalId\":51568,\"journal\":{\"name\":\"Assistive Technology\",\"volume\":\" \",\"pages\":\"1-9\"},\"PeriodicalIF\":2.5000,\"publicationDate\":\"2025-04-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Assistive Technology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1080/10400435.2025.2490632\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"REHABILITATION\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Assistive Technology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1080/10400435.2025.2490632","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"REHABILITATION","Score":null,"Total":0}
引用次数: 0

摘要

路缘是划定安全步行区与潜在车辆交通危险的重要边界。在动态导航过程中,路缘也是一个主要的空间危险,有可能造成严重的绊倒。对于失明和弱视者来说,这种脆弱性尤其加剧。准确的基于视觉的路缘识别对于辅助技术至关重要,这些辅助技术可以帮助PBLV在城市环境中安全导航。在此,我们研究了基础模型的抑制分割的有效性。我们引入了迄今为止最大的抑制分割数据集来基准领先的基础模型。我们的研究结果表明,最先进的基础模型在抑制分割方面面临重大挑战。这是由于它们的精度和召回率较低,区分路缘与路边物体或非路缘区域(如人行道)的性能较差。此外,性能最好的模型平均推理时间为3.70秒,这突出了在提供实时帮助方面的问题。为此,我们提出了包括过滤边界框选择在内的解决方案,以实现更准确的抑制分割。总的来说,尽管基础模型具有直接的灵活性,但它们在实际辅助技术应用中的应用仍然需要改进。本研究强调了对专门数据集和定制模型训练的迫切需求,以解决PBLV导航挑战,并强调了基础模型的隐性弱点。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Can foundation models reliably identify spatial hazards? A case study on curb segmentation.

Curbs serve as vital borders that delineate safe pedestrian zones from potential vehicular traffic hazards. Curbs also represent a primary spatial hazard during dynamic navigation with significant stumbling potential. Such vulnerabilities are particularly exacerbated for persons with blindness and low vision (PBLV). Accurate visual-based discrimination of curbs is paramount for assistive technologies that aid PBLV with safe navigation in urban environments. Herein, we investigate the efficacy of curb segmentation for foundation models. We introduce the largest curb segmentation dataset to date to benchmark leading foundation models. Our results show that state-of-the-art foundation models face significant challenges in curb segmentation. This is due to their low precision and recall with poor performance distinguishing curbs from curb-like objects or non-curb areas, such as sidewalks. In addition, the best-performing model averaged a 3.70-s inference time, underscoring problems in providing real-time assistance. In response, we propose solutions including filtered bounding box selections to achieve more accurate curb segmentation. Overall, despite the immediate flexibility of foundation models, their application for practical assistive technology applications still requires refinement. This research highlights the critical need for specialized datasets and tailored model training to address navigation challenges for PBLV and underscores implicit weaknesses in foundation models.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Assistive Technology
Assistive Technology REHABILITATION-
CiteScore
4.00
自引率
5.60%
发文量
40
期刊介绍: Assistive Technology is an applied, scientific publication in the multi-disciplinary field of technology for people with disabilities. The journal"s purpose is to foster communication among individuals working in all aspects of the assistive technology arena including researchers, developers, clinicians, educators and consumers. The journal will consider papers from all assistive technology applications. Only original papers will be accepted. Technical notes describing preliminary techniques, procedures, or findings of original scientific research may also be submitted. Letters to the Editor are welcome. Books for review may be sent to authors or publisher.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信