用于监控车辆检测的两阶段再参数化和样本解缠网络

IF 2.7 3区 工程技术 Q1 ENGINEERING, MULTIDISCIPLINARY
Wei Xie, Weiming Liu, Y. Dai
{"title":"用于监控车辆检测的两阶段再参数化和样本解缠网络","authors":"Wei Xie, Weiming Liu, Y. Dai","doi":"10.1088/1361-6501/ad5ea6","DOIUrl":null,"url":null,"abstract":"\n Detecting vehicles from a surveillance viewpoint is essential, as it has wide applications in community security and traffic control. However, existing methods completely overlook the high memory access costs (MAC) and low degree of parallelism inherent in multi-branch topologies, resulting in significant latency during inference. Additionally, existing methods share the same positive/negative sample set between the classification and localization branches, leading to sample misalignment, and rely solely on intersection-over-union (IoU) for sample assignment, thereby causing a decrease in detection performance. To tackle these issues, this paper introduces a two-stage re-parameterization and sample disentanglement network (TRSD-Net). It is based on two-stage depthwise to pointwise re-parameterization (RepTDP) and task-aligned sample disentanglement (TSD). RepTDP employs structural re-parameterization to decouple the multi-branch topology during training and the plain architecture during inference, thus achieving low latency. By employing different sample assignment strategies, TSD can adaptively select the most suitable positive/negative sample sets for classification and localization tasks, thereby enhancing detection performance. Additionally, TSD considers three important factors influencing sample assignment. TRSD-Net is evaluated on both the UA-DETRAC and COCO datasets. On the UA-DETRAC dataset, compared to state-of-the-art (SOTA) methods, TRSD-Net improves the detection accuracy from 58.8% to 59.7%. Additionally, it reduces the parameter count by 87%, the computational complexity by 85%, and the latency by 39%, while increasing the detection speed by 65%. Similar performance improvement trends are observed on the COCO dataset.","PeriodicalId":18526,"journal":{"name":"Measurement Science and Technology","volume":null,"pages":null},"PeriodicalIF":2.7000,"publicationDate":"2024-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Two-Stage Re-Parameterization and Sample Disentanglement Network for Surveillance Vehicle Detection\",\"authors\":\"Wei Xie, Weiming Liu, Y. Dai\",\"doi\":\"10.1088/1361-6501/ad5ea6\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n Detecting vehicles from a surveillance viewpoint is essential, as it has wide applications in community security and traffic control. However, existing methods completely overlook the high memory access costs (MAC) and low degree of parallelism inherent in multi-branch topologies, resulting in significant latency during inference. Additionally, existing methods share the same positive/negative sample set between the classification and localization branches, leading to sample misalignment, and rely solely on intersection-over-union (IoU) for sample assignment, thereby causing a decrease in detection performance. To tackle these issues, this paper introduces a two-stage re-parameterization and sample disentanglement network (TRSD-Net). It is based on two-stage depthwise to pointwise re-parameterization (RepTDP) and task-aligned sample disentanglement (TSD). RepTDP employs structural re-parameterization to decouple the multi-branch topology during training and the plain architecture during inference, thus achieving low latency. By employing different sample assignment strategies, TSD can adaptively select the most suitable positive/negative sample sets for classification and localization tasks, thereby enhancing detection performance. Additionally, TSD considers three important factors influencing sample assignment. TRSD-Net is evaluated on both the UA-DETRAC and COCO datasets. On the UA-DETRAC dataset, compared to state-of-the-art (SOTA) methods, TRSD-Net improves the detection accuracy from 58.8% to 59.7%. Additionally, it reduces the parameter count by 87%, the computational complexity by 85%, and the latency by 39%, while increasing the detection speed by 65%. Similar performance improvement trends are observed on the COCO dataset.\",\"PeriodicalId\":18526,\"journal\":{\"name\":\"Measurement Science and Technology\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.7000,\"publicationDate\":\"2024-07-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Measurement Science and Technology\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1088/1361-6501/ad5ea6\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Measurement Science and Technology","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1088/1361-6501/ad5ea6","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

从监控角度检测车辆至关重要,因为它在社区安全和交通控制方面有着广泛的应用。然而,现有方法完全忽视了多分支拓扑结构固有的高内存访问成本(MAC)和低并行性,从而导致推理过程中的显著延迟。此外,现有方法在分类分支和定位分支之间共享相同的正/负样本集,导致样本错位,并且完全依赖于交集-联合(IoU)进行样本分配,从而导致检测性能下降。为了解决这些问题,本文介绍了一种两阶段重参数化和样本分解网络(TRSD-Net)。它基于两阶段深度到点重参数化(RepTDP)和任务对齐样本解缠(TSD)。RepTDP 采用结构重参数化,将训练时的多分支拓扑和推理时的普通结构解耦,从而实现低延迟。通过采用不同的样本分配策略,TSD 可以为分类和定位任务自适应地选择最合适的正/负样本集,从而提高检测性能。此外,TSD 还考虑了影响样本分配的三个重要因素。TRSD-Net 在 UA-DETRAC 和 COCO 数据集上进行了评估。在 UA-DETRAC 数据集上,与最先进的(SOTA)方法相比,TRSD-Net 将检测准确率从 58.8% 提高到 59.7%。此外,它还将参数数量减少了 87%,计算复杂度降低了 85%,延迟降低了 39%,同时将检测速度提高了 65%。在 COCO 数据集上也观察到了类似的性能改进趋势。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Two-Stage Re-Parameterization and Sample Disentanglement Network for Surveillance Vehicle Detection
Detecting vehicles from a surveillance viewpoint is essential, as it has wide applications in community security and traffic control. However, existing methods completely overlook the high memory access costs (MAC) and low degree of parallelism inherent in multi-branch topologies, resulting in significant latency during inference. Additionally, existing methods share the same positive/negative sample set between the classification and localization branches, leading to sample misalignment, and rely solely on intersection-over-union (IoU) for sample assignment, thereby causing a decrease in detection performance. To tackle these issues, this paper introduces a two-stage re-parameterization and sample disentanglement network (TRSD-Net). It is based on two-stage depthwise to pointwise re-parameterization (RepTDP) and task-aligned sample disentanglement (TSD). RepTDP employs structural re-parameterization to decouple the multi-branch topology during training and the plain architecture during inference, thus achieving low latency. By employing different sample assignment strategies, TSD can adaptively select the most suitable positive/negative sample sets for classification and localization tasks, thereby enhancing detection performance. Additionally, TSD considers three important factors influencing sample assignment. TRSD-Net is evaluated on both the UA-DETRAC and COCO datasets. On the UA-DETRAC dataset, compared to state-of-the-art (SOTA) methods, TRSD-Net improves the detection accuracy from 58.8% to 59.7%. Additionally, it reduces the parameter count by 87%, the computational complexity by 85%, and the latency by 39%, while increasing the detection speed by 65%. Similar performance improvement trends are observed on the COCO dataset.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Measurement Science and Technology
Measurement Science and Technology 工程技术-工程:综合
CiteScore
4.30
自引率
16.70%
发文量
656
审稿时长
4.9 months
期刊介绍: Measurement Science and Technology publishes articles on new measurement techniques and associated instrumentation. Papers that describe experiments must represent an advance in measurement science or measurement technique rather than the application of established experimental technique. Bearing in mind the multidisciplinary nature of the journal, authors must provide an introduction to their work that makes clear the novelty, significance, broader relevance of their work in a measurement context and relevance to the readership of Measurement Science and Technology. All submitted articles should contain consideration of the uncertainty, precision and/or accuracy of the measurements presented. Subject coverage includes the theory, practice and application of measurement in physics, chemistry, engineering and the environmental and life sciences from inception to commercial exploitation. Publications in the journal should emphasize the novelty of reported methods, characterize them and demonstrate their performance using examples or applications.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信