基于特征增强和定位优化的小目标检测方法

2022 IEEE 5th International Conference on Computer and Communication Engineering Technology (CCET) Pub Date : 2022-08-19 DOI:10.1109/CCET55412.2022.9906335

Qingshu Li, Xiǎohóng Shí, Qi Xu, Wei Huang, Peng Yang

{"title":"基于特征增强和定位优化的小目标检测方法","authors":"Qingshu Li, Xiǎohóng Shí, Qi Xu, Wei Huang, Peng Yang","doi":"10.1109/CCET55412.2022.9906335","DOIUrl":null,"url":null,"abstract":"Feature Pyramid Network (FPN) is a basic but important component in target detection system. Together with target detection algorithms, such as SSD, Faster R-CNN and YOLO series, they have achieved good detection results for large targets with high resolutions, but the performance is less effective when it comes to detect small targets that contain relatively little semantic information. And small target detection is quite common in daily life, such as face recognition at long distances, traffic sign detection in automatic driving, etc. That means it is significant to break the bottleneck of target detection and get better accuracy performance. In this article, based on the FPN, we propose an improved network structure (IMFPN) from two aspects to get a better accuracy result in small target detection task. In the first aspect, we improve the feature map pyramid structure for feature enhancement, reduce the problem of information loss during feature map fusion and get the semantic information of multiscale feature maps. In the second aspect, we concentrate on the problem of information loss in the pooling process of feature maps, we propose an improved version of the PRRoI pooling method that combines RoI Pooling and RoI Align Pooling. And we also optimize the positioning of the frame through a new IoU calculation standard. Based on these above ideas and methods, we propose a small target detection method based on feature enhancement and positioning optimization.","PeriodicalId":329327,"journal":{"name":"2022 IEEE 5th International Conference on Computer and Communication Engineering Technology (CCET)","volume":"974 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Small Target Detection Method Based on Feature Enhancement and Positioning Optimization\",\"authors\":\"Qingshu Li, Xiǎohóng Shí, Qi Xu, Wei Huang, Peng Yang\",\"doi\":\"10.1109/CCET55412.2022.9906335\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Feature Pyramid Network (FPN) is a basic but important component in target detection system. Together with target detection algorithms, such as SSD, Faster R-CNN and YOLO series, they have achieved good detection results for large targets with high resolutions, but the performance is less effective when it comes to detect small targets that contain relatively little semantic information. And small target detection is quite common in daily life, such as face recognition at long distances, traffic sign detection in automatic driving, etc. That means it is significant to break the bottleneck of target detection and get better accuracy performance. In this article, based on the FPN, we propose an improved network structure (IMFPN) from two aspects to get a better accuracy result in small target detection task. In the first aspect, we improve the feature map pyramid structure for feature enhancement, reduce the problem of information loss during feature map fusion and get the semantic information of multiscale feature maps. In the second aspect, we concentrate on the problem of information loss in the pooling process of feature maps, we propose an improved version of the PRRoI pooling method that combines RoI Pooling and RoI Align Pooling. And we also optimize the positioning of the frame through a new IoU calculation standard. Based on these above ideas and methods, we propose a small target detection method based on feature enhancement and positioning optimization.\",\"PeriodicalId\":329327,\"journal\":{\"name\":\"2022 IEEE 5th International Conference on Computer and Communication Engineering Technology (CCET)\",\"volume\":\"974 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-08-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 5th International Conference on Computer and Communication Engineering Technology (CCET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CCET55412.2022.9906335\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 5th International Conference on Computer and Communication Engineering Technology (CCET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCET55412.2022.9906335","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

特征金字塔网络(FPN)是目标检测系统中一个基本而重要的组成部分。它们与SSD、Faster R-CNN、YOLO系列等目标检测算法一起，对高分辨率的大目标都取得了很好的检测效果，但对语义信息相对较少的小目标检测效果不佳。而小目标检测在日常生活中是相当普遍的，比如远距离的人脸识别，自动驾驶中的交通标志检测等。这意味着突破目标检测瓶颈，获得更好的精度性能具有重要意义。本文在FPN的基础上，从两个方面提出了一种改进的网络结构(IMFPN)，以在小目标检测任务中获得更好的精度结果。首先，改进特征图金字塔结构进行特征增强，减少特征图融合过程中的信息丢失问题，获得多尺度特征图的语义信息;第二方面，针对特征图池化过程中的信息丢失问题，提出了一种改进的PRRoI池化方法，该方法将RoI池化和RoI Align池化相结合。并通过新的IoU计算标准对车架的定位进行了优化。基于上述思想和方法，我们提出了一种基于特征增强和定位优化的小目标检测方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Small Target Detection Method Based on Feature Enhancement and Positioning Optimization

Feature Pyramid Network (FPN) is a basic but important component in target detection system. Together with target detection algorithms, such as SSD, Faster R-CNN and YOLO series, they have achieved good detection results for large targets with high resolutions, but the performance is less effective when it comes to detect small targets that contain relatively little semantic information. And small target detection is quite common in daily life, such as face recognition at long distances, traffic sign detection in automatic driving, etc. That means it is significant to break the bottleneck of target detection and get better accuracy performance. In this article, based on the FPN, we propose an improved network structure (IMFPN) from two aspects to get a better accuracy result in small target detection task. In the first aspect, we improve the feature map pyramid structure for feature enhancement, reduce the problem of information loss during feature map fusion and get the semantic information of multiscale feature maps. In the second aspect, we concentrate on the problem of information loss in the pooling process of feature maps, we propose an improved version of the PRRoI pooling method that combines RoI Pooling and RoI Align Pooling. And we also optimize the positioning of the frame through a new IoU calculation standard. Based on these above ideas and methods, we propose a small target detection method based on feature enhancement and positioning optimization.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 IEEE 5th International Conference on Computer and Communication Engineering Technology (CCET)

自引率

0.00%

发文量