Attention-Guided Lightweight CNN-Transformer Fusion for Real-Time Traffic Sign Recognition in Adverse Environments: HACTNet

IF 2.5 4区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC
Mandeep Singh Devgan, Gurvinder Singh, Purushottam Sharma, Tajinder Kumar, Xiaochun Cheng, Deepak Ahlawat
{"title":"Attention-Guided Lightweight CNN-Transformer Fusion for Real-Time Traffic Sign Recognition in Adverse Environments: HACTNet","authors":"Mandeep Singh Devgan,&nbsp;Gurvinder Singh,&nbsp;Purushottam Sharma,&nbsp;Tajinder Kumar,&nbsp;Xiaochun Cheng,&nbsp;Deepak Ahlawat","doi":"10.1049/itr2.70167","DOIUrl":null,"url":null,"abstract":"<p>Autonomous driving is also impossible without traffic sign recognition (TSR; also known as traffic sign-on-road), which limits its reliability to domain changes, unfavourable weather, obstruction and hardware capacity. This paper proposes HACTNet, a low-complexity CNN-Transformer hybrid model that pushes the state-of-art in TSR by making a noteworthy set of contributions including (i) efficient convaps to model parts of the image, (ii) transformer encoder to capture the global context and (iii) an attention-based fusion block to dynamically combine the two complementary sets of features. This synergy facilitates strong recognition in presence of blur and occlusion and in varying illumination. In addition to accuracy, HACTNet achieves high robustness (52.8%) against strong PGD adversarial attacks (8/255), but is still efficient (7.9 M parameters and 22.1 FPS) on the NVIDIA Jetson Nano. Moreover, the comparative analysis between the hybrid models (EATFormer, local-ViT) and HACTNet proves that HACTNet has a better accuracy-efficiency ratio. The extraordinary capability to counteract adverse weather conditions, fog, night, rain, snow etc., which is proven by the extensive testing of the real-world ACDC adverse conditions data set, supports the viability of the proposed solutions in the real world. It is plug and play modularity with on-going learning via elastic weight consolidation (3.3% less forgetting) and unsupervised domain adaptation via MMD loss (5.3% better on TT100K with no labels). Moreover, INT8 quantization with quantization-aware training (QAT) incurs little accuracy loss (less than 0.5 percent) and much lower energy (0.27 J/sample) usage, which forms an edge deployment preparedness. Additionally, when adjusting to new traffic signs over time, the model shows compatibility with continuous learning, achieving a low forgetting rate (3.3%), highlighting its practical viability for long-term autonomous deployment. Overall, HACTNet produces a versatile and expandable solution for next-generation intelligent transportation systems by striking a balance between accuracy, robustness and efficiency.</p>","PeriodicalId":50381,"journal":{"name":"IET Intelligent Transport Systems","volume":"20 1","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2026-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/itr2.70167","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Intelligent Transport Systems","FirstCategoryId":"5","ListUrlMain":"https://ietresearch.onlinelibrary.wiley.com/doi/10.1049/itr2.70167","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Autonomous driving is also impossible without traffic sign recognition (TSR; also known as traffic sign-on-road), which limits its reliability to domain changes, unfavourable weather, obstruction and hardware capacity. This paper proposes HACTNet, a low-complexity CNN-Transformer hybrid model that pushes the state-of-art in TSR by making a noteworthy set of contributions including (i) efficient convaps to model parts of the image, (ii) transformer encoder to capture the global context and (iii) an attention-based fusion block to dynamically combine the two complementary sets of features. This synergy facilitates strong recognition in presence of blur and occlusion and in varying illumination. In addition to accuracy, HACTNet achieves high robustness (52.8%) against strong PGD adversarial attacks (8/255), but is still efficient (7.9 M parameters and 22.1 FPS) on the NVIDIA Jetson Nano. Moreover, the comparative analysis between the hybrid models (EATFormer, local-ViT) and HACTNet proves that HACTNet has a better accuracy-efficiency ratio. The extraordinary capability to counteract adverse weather conditions, fog, night, rain, snow etc., which is proven by the extensive testing of the real-world ACDC adverse conditions data set, supports the viability of the proposed solutions in the real world. It is plug and play modularity with on-going learning via elastic weight consolidation (3.3% less forgetting) and unsupervised domain adaptation via MMD loss (5.3% better on TT100K with no labels). Moreover, INT8 quantization with quantization-aware training (QAT) incurs little accuracy loss (less than 0.5 percent) and much lower energy (0.27 J/sample) usage, which forms an edge deployment preparedness. Additionally, when adjusting to new traffic signs over time, the model shows compatibility with continuous learning, achieving a low forgetting rate (3.3%), highlighting its practical viability for long-term autonomous deployment. Overall, HACTNet produces a versatile and expandable solution for next-generation intelligent transportation systems by striking a balance between accuracy, robustness and efficiency.

Abstract Image

不利环境下关注引导的轻量级CNN-Transformer融合实时交通标志识别:HACTNet
如果没有交通标志识别(TSR,也称为道路上的交通标志),自动驾驶也不可能实现,这限制了其可靠性,包括领域变化、不利天气、障碍物和硬件容量。本文提出了HACTNet,这是一种低复杂度的CNN-Transformer混合模型,它通过做出一系列值得注意的贡献,包括(i)对图像部分建模的有效convaps, (ii)捕获全局上下文的transformer编码器和(iii)基于注意力的融合块,以动态地组合两个互补的特征集,从而推动了TSR中的最新技术。这种协同作用有助于在模糊和遮挡以及不同照明下进行强识别。除了准确性之外,HACTNet对强PGD对抗性攻击(8/255)实现了高鲁棒性(52.8%),但在NVIDIA Jetson Nano上仍然是高效的(7.9 M参数和22.1 FPS)。此外,将混合模型(EATFormer、local-ViT)与HACTNet进行对比分析,证明HACTNet具有更好的准确率-效率比。ACDC恶劣天气数据集的广泛测试证明,该系统具有抵抗恶劣天气条件(雾、夜、雨、雪等)的非凡能力,支持了该解决方案在现实世界中的可行性。它是一种即插即用的模块化,通过弹性权重巩固(减少3.3%的遗忘)和通过MMD损失(在没有标签的TT100K上提高5.3%)进行持续学习。此外,使用量化感知训练(QAT)的INT8量化产生很小的精度损失(小于0.5%)和更低的能量(0.27 J/样本)使用,这形成了边缘部署准备。此外,当随着时间的推移调整新的交通标志时,该模型显示出与持续学习的兼容性,实现了较低的遗忘率(3.3%),突出了其长期自主部署的实际可行性。总体而言,HACTNet通过在准确性,稳健性和效率之间取得平衡,为下一代智能交通系统提供了多功能和可扩展的解决方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
IET Intelligent Transport Systems
IET Intelligent Transport Systems 工程技术-运输科技
CiteScore
6.50
自引率
7.40%
发文量
159
审稿时长
3 months
期刊介绍: IET Intelligent Transport Systems is an interdisciplinary journal devoted to research into the practical applications of ITS and infrastructures. The scope of the journal includes the following: Sustainable traffic solutions Deployments with enabling technologies Pervasive monitoring Applications; demonstrations and evaluation Economic and behavioural analyses of ITS services and scenario Data Integration and analytics Information collection and processing; image processing applications in ITS ITS aspects of electric vehicles Autonomous vehicles; connected vehicle systems; In-vehicle ITS, safety and vulnerable road user aspects Mobility as a service systems Traffic management and control Public transport systems technologies Fleet and public transport logistics Emergency and incident management Demand management and electronic payment systems Traffic related air pollution management Policy and institutional issues Interoperability, standards and architectures Funding scenarios Enforcement Human machine interaction Education, training and outreach Current Special Issue Call for papers: Intelligent Transportation Systems in Smart Cities for Sustainable Environment - https://digital-library.theiet.org/files/IET_ITS_CFP_ITSSCSE.pdf Sustainably Intelligent Mobility (SIM) - https://digital-library.theiet.org/files/IET_ITS_CFP_SIM.pdf Traffic Theory and Modelling in the Era of Artificial Intelligence and Big Data (in collaboration with World Congress for Transport Research, WCTR 2019) - https://digital-library.theiet.org/files/IET_ITS_CFP_WCTR.pdf
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书