Improved Knowledge Distillation for Crowd Counting on IoT Devices

2023 IEEE International Conference on Edge Computing and Communications (EDGE) Pub Date : 2023-07-01 DOI:10.1109/EDGE60047.2023.00041

Zuo Huang, R. Sinnott

{"title":"Improved Knowledge Distillation for Crowd Counting on IoT Devices","authors":"Zuo Huang, R. Sinnott","doi":"10.1109/EDGE60047.2023.00041","DOIUrl":null,"url":null,"abstract":"Manual crowd counting for real-world problems is impossible or results in wildly inaccurate estimations. Deep learning is one area that has been applied to address this issue. Crowd counting is a computationally intensive task. Therefore, many crowd counting models employ large-scale deep convolutional neural networks (CNN) to achieve higher accuracy. However, these are typically at the cost of performance and inference speed. This makes such approaches difficult to apply in real-world settings, e.g., on Internet-of-Things (IoT) devices. To tackle this problem, one method is to compress models using pruning and quantization or use of lightweight model backbones. However, such methods often result in a significant loss of accuracy. To address this, some studies have explored knowledge distillation methods to extract useful information from large state-of-the-art (teacher) models to guide/train smaller (student) models. However, knowledge distillation methods suffer from the problem of information loss caused by hint-transformers. Furthermore, teacher models may have a negative impact on student models. In this work, we propose a method based on knowledge distillation that uses self-transformed hints and loss functions that ignore outliers to tackle real-world and challenging crowd counting tasks. Based on our approach, we achieve a MAE of 77.24 and a MSE of 276.17 using the JHU-CROWD++ [1] test set. This is comparable to state-of-the-art deep crowd counting models, but at a fraction of the original model size and complexity, thus making the solution suitable for IoT devices. The source code is available at https://github.com/huangzuo/effcc_distilled.","PeriodicalId":369407,"journal":{"name":"2023 IEEE International Conference on Edge Computing and Communications (EDGE)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE International Conference on Edge Computing and Communications (EDGE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EDGE60047.2023.00041","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Manual crowd counting for real-world problems is impossible or results in wildly inaccurate estimations. Deep learning is one area that has been applied to address this issue. Crowd counting is a computationally intensive task. Therefore, many crowd counting models employ large-scale deep convolutional neural networks (CNN) to achieve higher accuracy. However, these are typically at the cost of performance and inference speed. This makes such approaches difficult to apply in real-world settings, e.g., on Internet-of-Things (IoT) devices. To tackle this problem, one method is to compress models using pruning and quantization or use of lightweight model backbones. However, such methods often result in a significant loss of accuracy. To address this, some studies have explored knowledge distillation methods to extract useful information from large state-of-the-art (teacher) models to guide/train smaller (student) models. However, knowledge distillation methods suffer from the problem of information loss caused by hint-transformers. Furthermore, teacher models may have a negative impact on student models. In this work, we propose a method based on knowledge distillation that uses self-transformed hints and loss functions that ignore outliers to tackle real-world and challenging crowd counting tasks. Based on our approach, we achieve a MAE of 77.24 and a MSE of 276.17 using the JHU-CROWD++ [1] test set. This is comparable to state-of-the-art deep crowd counting models, but at a fraction of the original model size and complexity, thus making the solution suitable for IoT devices. The source code is available at https://github.com/huangzuo/effcc_distilled.

查看原文本刊更多论文

物联网设备上人群计数的改进知识蒸馏

对现实世界的问题进行人工人群计数是不可能的，或者会导致非常不准确的估计。深度学习是解决这个问题的一个领域。人群计数是一项计算密集型任务。因此，许多人群计数模型采用大规模深度卷积神经网络(CNN)来达到更高的精度。然而，这些通常是以性能和推理速度为代价的。这使得这种方法难以应用于现实环境，例如物联网(IoT)设备。为了解决这个问题，一种方法是使用修剪和量化或使用轻量级模型主干来压缩模型。然而，这种方法往往导致准确性的显著损失。为了解决这个问题，一些研究探索了知识蒸馏方法，从大型最先进的(教师)模型中提取有用的信息，以指导/训练较小的(学生)模型。然而，知识蒸馏方法存在提示变换导致信息丢失的问题。此外，教师模式可能会对学生模式产生负面影响。在这项工作中，我们提出了一种基于知识蒸馏的方法，该方法使用忽略异常值的自转换提示和损失函数来解决现实世界和具有挑战性的人群计数任务。基于我们的方法，我们使用JHU-CROWD++[1]测试集实现了77.24的MAE和276.17的MSE。这与最先进的深度人群计数模型相当，但在原始模型的大小和复杂性的一小部分，从而使解决方案适用于物联网设备。源代码可从https://github.com/huangzuo/effcc_distilled获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 IEEE International Conference on Edge Computing and Communications (EDGE)

自引率

0.00%

发文量