A Survey on Regression-Based Crowd Counting Techniques

IF 2 4区 计算机科学 Q3 AUTOMATION & CONTROL SYSTEMS
Yu Hao, Huimin Du, Meiwen Mao, Ying Liu, Jiulun Fan
{"title":"A Survey on Regression-Based Crowd Counting Techniques","authors":"Yu Hao, Huimin Du, Meiwen Mao, Ying Liu, Jiulun Fan","doi":"10.5755/j01.itc.52.3.33701","DOIUrl":null,"url":null,"abstract":"Traditional detect and count strategy can’t well handle the extremely crowded footage in computer vision-based counting task. In recent years, deep learning approaches have been widely explored to tackle this challenge. By regressing visual features to density map, the total crowd number can be predicted while avoids the detection of their actual positions. Efforts of improving performance distribute at various phases of the detecting pipeline, such as feature extraction and eliminating deviation of regressed density map etc. In this article, we conduct a thorough review on the most representative and state-of-the-art techniques. The efforts are systematically categorized into three topics: the evolving of front-end network, the handling of unbalanced density map prediction, and the selection of loss function. After the evaluation of most significant techniques, innovations of the state-of-the-art are inspected in detail to analyze specific reasons to achieve high performances. As conclusion, possible directions of enhancement are discussed to provide insights of future research.","PeriodicalId":54982,"journal":{"name":"Information Technology and Control","volume":"50 1","pages":"0"},"PeriodicalIF":2.0000,"publicationDate":"2023-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Technology and Control","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5755/j01.itc.52.3.33701","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Traditional detect and count strategy can’t well handle the extremely crowded footage in computer vision-based counting task. In recent years, deep learning approaches have been widely explored to tackle this challenge. By regressing visual features to density map, the total crowd number can be predicted while avoids the detection of their actual positions. Efforts of improving performance distribute at various phases of the detecting pipeline, such as feature extraction and eliminating deviation of regressed density map etc. In this article, we conduct a thorough review on the most representative and state-of-the-art techniques. The efforts are systematically categorized into three topics: the evolving of front-end network, the handling of unbalanced density map prediction, and the selection of loss function. After the evaluation of most significant techniques, innovations of the state-of-the-art are inspected in detail to analyze specific reasons to achieve high performances. As conclusion, possible directions of enhancement are discussed to provide insights of future research.
基于回归的人群计数技术综述
传统的检测和计数策略不能很好地处理基于计算机视觉的计数任务中极为拥挤的镜头。近年来,深度学习方法已被广泛探索以应对这一挑战。通过将视觉特征回归到密度图中,可以预测人群的总人数,同时避免检测人群的实际位置。提高性能的努力分布在检测管道的各个阶段,如特征提取和消除回归密度图的偏差等。在本文中,我们对最具代表性和最先进的技术进行了全面的回顾。系统地分为三个方面:前端网络的演化、不平衡密度图预测的处理和损失函数的选择。在对最重要的技术进行评估后,详细检查了最先进的创新,以分析实现高性能的具体原因。最后,讨论了可能的增强方向,为今后的研究提供参考。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Information Technology and Control
Information Technology and Control 工程技术-计算机:人工智能
CiteScore
2.70
自引率
9.10%
发文量
36
审稿时长
12 months
期刊介绍: Periodical journal covers a wide field of computer science and control systems related problems including: -Software and hardware engineering; -Management systems engineering; -Information systems and databases; -Embedded systems; -Physical systems modelling and application; -Computer networks and cloud computing; -Data visualization; -Human-computer interface; -Computer graphics, visual analytics, and multimedia systems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信