I-MMCCN:用于无人机图像RGB-T人群计数的改进MMCCN

2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC) Pub Date : 2021-11-17 DOI:10.1109/IC-NIDC54101.2021.9660586

Binyu Zhang, Yunhao Du, Yanyun Zhao, Jun-Jun Wan, Zhihang Tong

{"title":"I-MMCCN:用于无人机图像RGB-T人群计数的改进MMCCN","authors":"Binyu Zhang, Yunhao Du, Yanyun Zhao, Jun-Jun Wan, Zhihang Tong","doi":"10.1109/IC-NIDC54101.2021.9660586","DOIUrl":null,"url":null,"abstract":"Crowd counting is a critical technique in many artificial intelligent applications, such as security monitoring and automatic transportation management. However, due to the variations in object scales, illumination and image quality, crowd counting from drone images is full of challenges. To fully delve the information hidden in the multi-modal RGB-T images shot by drones for crowd counting, we proposed a hard examples mining module and a novel Block Mean Absolute Error loss (BMAE) to improve Multi-Modal Crowd Counting Network (MMCCN). With the local structural supervision introduced by BMAE loss, the network can incorporate local spatial correlation within each block and focus on the local pattern of people. Besides, BMAE is more similar to the evaluation metrics. By combining our proposed hard example mining module and BMAE loss with MMCCN, we obtain our Improved MMCCN, named as I-MMCCN. Experiments on the DroneRGBT dataset verify the effectiveness of our I-MMCCN. It achieves 1.01 MAE and 1.48 RMSE lower than MMCCN on DroneRGBT validation set.","PeriodicalId":264468,"journal":{"name":"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"I-MMCCN: Improved MMCCN for RGB-T Crowd Counting of Drone Images\",\"authors\":\"Binyu Zhang, Yunhao Du, Yanyun Zhao, Jun-Jun Wan, Zhihang Tong\",\"doi\":\"10.1109/IC-NIDC54101.2021.9660586\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Crowd counting is a critical technique in many artificial intelligent applications, such as security monitoring and automatic transportation management. However, due to the variations in object scales, illumination and image quality, crowd counting from drone images is full of challenges. To fully delve the information hidden in the multi-modal RGB-T images shot by drones for crowd counting, we proposed a hard examples mining module and a novel Block Mean Absolute Error loss (BMAE) to improve Multi-Modal Crowd Counting Network (MMCCN). With the local structural supervision introduced by BMAE loss, the network can incorporate local spatial correlation within each block and focus on the local pattern of people. Besides, BMAE is more similar to the evaluation metrics. By combining our proposed hard example mining module and BMAE loss with MMCCN, we obtain our Improved MMCCN, named as I-MMCCN. Experiments on the DroneRGBT dataset verify the effectiveness of our I-MMCCN. It achieves 1.01 MAE and 1.48 RMSE lower than MMCCN on DroneRGBT validation set.\",\"PeriodicalId\":264468,\"journal\":{\"name\":\"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IC-NIDC54101.2021.9660586\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IC-NIDC54101.2021.9660586","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

人群计数是许多人工智能应用的关键技术，如安全监控和自动交通管理。然而，由于物体尺度、光照和图像质量的变化，无人机图像的人群计数充满了挑战。为了充分挖掘无人机拍摄的多模态RGB-T图像中隐藏的信息进行人群计数，我们提出了一个硬样本挖掘模块和一种新的块平均绝对误差损失(BMAE)来改进多模态人群计数网络(MMCCN)。利用BMAE损失引入的局部结构监督，该网络可以将每个街区内的局部空间相关性结合起来，并关注人们的局部模式。此外，BMAE更接近于评价指标。将提出的硬例挖掘模块和BMAE损失与MMCCN相结合，得到改进的MMCCN，称为I-MMCCN。在DroneRGBT数据集上的实验验证了我们的I-MMCCN的有效性。在DroneRGBT验证集上，它比MMCCN的MAE低1.01,RMSE低1.48。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

I-MMCCN: Improved MMCCN for RGB-T Crowd Counting of Drone Images

Crowd counting is a critical technique in many artificial intelligent applications, such as security monitoring and automatic transportation management. However, due to the variations in object scales, illumination and image quality, crowd counting from drone images is full of challenges. To fully delve the information hidden in the multi-modal RGB-T images shot by drones for crowd counting, we proposed a hard examples mining module and a novel Block Mean Absolute Error loss (BMAE) to improve Multi-Modal Crowd Counting Network (MMCCN). With the local structural supervision introduced by BMAE loss, the network can incorporate local spatial correlation within each block and focus on the local pattern of people. Besides, BMAE is more similar to the evaluation metrics. By combining our proposed hard example mining module and BMAE loss with MMCCN, we obtain our Improved MMCCN, named as I-MMCCN. Experiments on the DroneRGBT dataset verify the effectiveness of our I-MMCCN. It achieves 1.01 MAE and 1.48 RMSE lower than MMCCN on DroneRGBT validation set.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC)

自引率

0.00%

发文量