Research on Knowledge Distillation Algorithm of Object Detection

2021 IEEE International Conference on Progress in Informatics and Computing (PIC) Pub Date : 2021-12-17 DOI:10.1109/PIC53636.2021.9687066

Xue-fang Wang, Wenbin Zhang, Yuchun Chu, Peishun Liu, Qilin Yin, Qi Li

{"title":"Research on Knowledge Distillation Algorithm of Object Detection","authors":"Xue-fang Wang, Wenbin Zhang, Yuchun Chu, Peishun Liu, Qilin Yin, Qi Li","doi":"10.1109/PIC53636.2021.9687066","DOIUrl":null,"url":null,"abstract":"The Algorithms of object detection are usually difficult to deploy on low-end devices due to the large amount of computation, but knowledge distillation can solve this problem by training small models to learn the already trained complex network models, realizing model compression, and effectively reducing the amount of computation. How to transfer rich knowledge from teachers to students is a key step in the knowledge distillation. To solve this problem, this paper uses the knowledge of the teacher to guide the student network training in feature extraction, target classification and frame prediction, and proposes a distillation algorithm based on multi-scale attention mechanism, which uses attention mechanism to integrate different scale features. The correlation of features between different channels is learned by assigning weights to the features of each channel. The distillation algorithm proposed in this paper is based on YOLOv4, so it can strengthen the student network to learn the key knowledge of the teacher network, and make the knowledge of the teacher network How to the student network better. Experimental analysis shows that it can effectively improve the detection accuracy of the student network. The size of the model is only 6.4% of the teacher network, but the speed is increased by 3 times, and mAP is 5.7% higher than the original student network and 2.1% lower than the teacher network.","PeriodicalId":297239,"journal":{"name":"2021 IEEE International Conference on Progress in Informatics and Computing (PIC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Progress in Informatics and Computing (PIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PIC53636.2021.9687066","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The Algorithms of object detection are usually difficult to deploy on low-end devices due to the large amount of computation, but knowledge distillation can solve this problem by training small models to learn the already trained complex network models, realizing model compression, and effectively reducing the amount of computation. How to transfer rich knowledge from teachers to students is a key step in the knowledge distillation. To solve this problem, this paper uses the knowledge of the teacher to guide the student network training in feature extraction, target classification and frame prediction, and proposes a distillation algorithm based on multi-scale attention mechanism, which uses attention mechanism to integrate different scale features. The correlation of features between different channels is learned by assigning weights to the features of each channel. The distillation algorithm proposed in this paper is based on YOLOv4, so it can strengthen the student network to learn the key knowledge of the teacher network, and make the knowledge of the teacher network How to the student network better. Experimental analysis shows that it can effectively improve the detection accuracy of the student network. The size of the model is only 6.4% of the teacher network, but the speed is increased by 3 times, and mAP is 5.7% higher than the original student network and 2.1% lower than the teacher network.

查看原文本刊更多论文

目标检测中的知识蒸馏算法研究

由于计算量大，目标检测算法通常难以在低端设备上部署，而知识蒸馏可以通过训练小模型来学习已经训练好的复杂网络模型，实现模型压缩，有效地减少了计算量，从而解决了这一问题。如何将丰富的知识从教师手中传递给学生，是知识升华的关键环节。针对这一问题，本文利用教师的知识指导学生网络训练在特征提取、目标分类、框架预测等方面，提出了一种基于多尺度注意机制的精馏算法，利用注意机制对不同尺度特征进行整合。通过对每个通道的特征分配权重来学习不同通道之间特征的相关性。本文提出的精馏算法是基于YOLOv4的，因此它可以加强学生网络对教师网络关键知识的学习，并使教师网络的知识如何更好地应用到学生网络中。实验分析表明，该方法能有效提高学生网络的检测精度。模型的规模仅为教师网络的6.4%，但速度提高了3倍，mAP比原来的学生网络高5.7%，比教师网络低2.1%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 IEEE International Conference on Progress in Informatics and Computing (PIC)

自引率

0.00%

发文量