Improved YOLO model with multi-feature fully convolutional network for object detection

Yanbin Chen, Huai-Mu Wang, Zhuo Han
{"title":"Improved YOLO model with multi-feature fully convolutional network for object detection","authors":"Yanbin Chen, Huai-Mu Wang, Zhuo Han","doi":"10.1117/12.2574417","DOIUrl":null,"url":null,"abstract":"The main task of object detection is to identify and locate interested objects from still images or video sequences. It is one of the key tasks in the field of computer vision. However, the object usually has variable factors in brightness, shape, occlusion and so on, and is interfered by various and complex environmental factors, which makes the research opportunities and challenges of object detection algorithm coexist. In this paper, a main frame of object detection algorithm based on convolutional neural network is studied, which is based on regression. We propose a real-time object detection algorithm based on fully convolution network, which aims to solve the problems of low detection accuracy and poor location accuracy of objects in regression method. The innovation is that the proposed fully convolution network increases the detection flexibility of the model because it is not affected by the input scale. At the same time, we propose a multi feature fusion and multi border prediction strategy, which effectively improves the detection accuracy of small objects. In order to prove the effectiveness of the proposed algorithm, we use PASCAL VOC data set to carry out object detection experiments. In this paper, the accuracy of each object category and the average accuracy of all categories are calculated. Experiments show that the performance of the multi feature fusion algorithm based on the fully convolution network is better than that based on the regression idea such as YOLO, and more than 10% higher than that of the YOLO model.","PeriodicalId":90079,"journal":{"name":"... International Workshop on Pattern Recognition in NeuroImaging. International Workshop on Pattern Recognition in NeuroImaging","volume":"21 1","pages":"1152607 - 1152607-6"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"... International Workshop on Pattern Recognition in NeuroImaging. International Workshop on Pattern Recognition in NeuroImaging","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2574417","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

The main task of object detection is to identify and locate interested objects from still images or video sequences. It is one of the key tasks in the field of computer vision. However, the object usually has variable factors in brightness, shape, occlusion and so on, and is interfered by various and complex environmental factors, which makes the research opportunities and challenges of object detection algorithm coexist. In this paper, a main frame of object detection algorithm based on convolutional neural network is studied, which is based on regression. We propose a real-time object detection algorithm based on fully convolution network, which aims to solve the problems of low detection accuracy and poor location accuracy of objects in regression method. The innovation is that the proposed fully convolution network increases the detection flexibility of the model because it is not affected by the input scale. At the same time, we propose a multi feature fusion and multi border prediction strategy, which effectively improves the detection accuracy of small objects. In order to prove the effectiveness of the proposed algorithm, we use PASCAL VOC data set to carry out object detection experiments. In this paper, the accuracy of each object category and the average accuracy of all categories are calculated. Experiments show that the performance of the multi feature fusion algorithm based on the fully convolution network is better than that based on the regression idea such as YOLO, and more than 10% higher than that of the YOLO model.
基于多特征全卷积网络的目标检测改进YOLO模型
目标检测的主要任务是从静止图像或视频序列中识别和定位感兴趣的目标。它是计算机视觉领域的关键任务之一。然而,目标通常在亮度、形状、遮挡等方面具有可变因素,并且受到各种复杂环境因素的干扰,使得目标检测算法的研究机遇与挑战并存。本文研究了一种基于回归的卷积神经网络目标检测算法的主要框架。本文提出了一种基于全卷积网络的实时目标检测算法,旨在解决回归方法检测精度低、目标定位精度差的问题。创新之处在于,所提出的全卷积网络增加了模型的检测灵活性,因为它不受输入规模的影响。同时,提出了多特征融合多边界预测策略,有效提高了小目标的检测精度。为了证明所提算法的有效性,我们使用PASCAL VOC数据集进行了目标检测实验。本文计算了每个目标类别的准确率和所有类别的平均准确率。实验表明,基于全卷积网络的多特征融合算法的性能优于基于YOLO等回归思想的多特征融合算法,比YOLO模型的性能提高10%以上。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信