{"title":"无人机图像中的车辆计数:一种具有空间注意和多尺度接受域的自适应方法","authors":"Yu Liu, Hang Shen, Tianjing Wang, Guangwei Bai","doi":"10.4218/etrij.2023-0426","DOIUrl":null,"url":null,"abstract":"<p>We propose an altitude-adaptive vehicle counting method with an attention mechanism and multiscale receptive fields that optimizes the measurement accuracy and inference latency of unmanned aerial vehicle (UAV) images. An attention mechanism is used to aggregate horizontal and vertical feature weights to enhance spatial information and suppress background noise. The UAV flight altitude and shooting depression angle are considered for scale division and image segmentation to avoid acquiring distance measurements. Based on the dilation rate, we introduce a receptive field selection strategy for the trained model to exhibit scale generalization without redundant calculations. A distribution-aware block loss is optimized via \n<span></span><math>\n <mi>k</mi></math> roots to balance the loss of sparse and crowded regions by dividing the density map. Experiments on three authoritative datasets demonstrate that compared with CSRNet, the proposed method improves the mean absolute error by 29.4%–54.0% and mean squared error by 28.6%–41.2% while reducing the inference latency. The proposed method exhibits higher counting accuracy than lightweight models including MCNN and MobileCount.</p>","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"47 1","pages":"7-19"},"PeriodicalIF":1.3000,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.4218/etrij.2023-0426","citationCount":"0","resultStr":"{\"title\":\"Vehicle counting in drone images: An adaptive method with spatial attention and multiscale receptive fields\",\"authors\":\"Yu Liu, Hang Shen, Tianjing Wang, Guangwei Bai\",\"doi\":\"10.4218/etrij.2023-0426\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>We propose an altitude-adaptive vehicle counting method with an attention mechanism and multiscale receptive fields that optimizes the measurement accuracy and inference latency of unmanned aerial vehicle (UAV) images. An attention mechanism is used to aggregate horizontal and vertical feature weights to enhance spatial information and suppress background noise. The UAV flight altitude and shooting depression angle are considered for scale division and image segmentation to avoid acquiring distance measurements. Based on the dilation rate, we introduce a receptive field selection strategy for the trained model to exhibit scale generalization without redundant calculations. A distribution-aware block loss is optimized via \\n<span></span><math>\\n <mi>k</mi></math> roots to balance the loss of sparse and crowded regions by dividing the density map. Experiments on three authoritative datasets demonstrate that compared with CSRNet, the proposed method improves the mean absolute error by 29.4%–54.0% and mean squared error by 28.6%–41.2% while reducing the inference latency. The proposed method exhibits higher counting accuracy than lightweight models including MCNN and MobileCount.</p>\",\"PeriodicalId\":11901,\"journal\":{\"name\":\"ETRI Journal\",\"volume\":\"47 1\",\"pages\":\"7-19\"},\"PeriodicalIF\":1.3000,\"publicationDate\":\"2024-05-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.4218/etrij.2023-0426\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ETRI Journal\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.4218/etrij.2023-0426\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ETRI Journal","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.4218/etrij.2023-0426","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Vehicle counting in drone images: An adaptive method with spatial attention and multiscale receptive fields
We propose an altitude-adaptive vehicle counting method with an attention mechanism and multiscale receptive fields that optimizes the measurement accuracy and inference latency of unmanned aerial vehicle (UAV) images. An attention mechanism is used to aggregate horizontal and vertical feature weights to enhance spatial information and suppress background noise. The UAV flight altitude and shooting depression angle are considered for scale division and image segmentation to avoid acquiring distance measurements. Based on the dilation rate, we introduce a receptive field selection strategy for the trained model to exhibit scale generalization without redundant calculations. A distribution-aware block loss is optimized via
roots to balance the loss of sparse and crowded regions by dividing the density map. Experiments on three authoritative datasets demonstrate that compared with CSRNet, the proposed method improves the mean absolute error by 29.4%–54.0% and mean squared error by 28.6%–41.2% while reducing the inference latency. The proposed method exhibits higher counting accuracy than lightweight models including MCNN and MobileCount.
期刊介绍:
ETRI Journal is an international, peer-reviewed multidisciplinary journal published bimonthly in English. The main focus of the journal is to provide an open forum to exchange innovative ideas and technology in the fields of information, telecommunications, and electronics.
Key topics of interest include high-performance computing, big data analytics, cloud computing, multimedia technology, communication networks and services, wireless communications and mobile computing, material and component technology, as well as security.
With an international editorial committee and experts from around the world as reviewers, ETRI Journal publishes high-quality research papers on the latest and best developments from the global community.