A Learning-rate Optimization Technique for Object Detection Accuracy Enhancement

C. Anusha, P. S. Avadhani
{"title":"A Learning-rate Optimization Technique for Object Detection Accuracy Enhancement","authors":"C. Anusha, P. S. Avadhani","doi":"10.9734/bpi/naer/v14/5658d","DOIUrl":null,"url":null,"abstract":"Recently, Deep Learning [1] models are used primarily in Object Detection algorithms because of their specific capability for Image Recognition. These models identify items present in input images and videos [2] by extracting features from them. These models have a variety of applications, which include Image Processing, Video analysis, Speech Recognition, Biomedical Image Analysis, Biometric Recognition, Iris Recognition, National Security applications, Cyber Security, Natural Language Processing [3], Weather Forecasting applications, Renewable Energy Generation Scheduling etc. The Convolution Neural Network (CNN) [3], which comprises many artificial neuron layers, is employed for these models. The accuracy of Deep Learning models is determined by a number of factors, including the learning rate, the training batch size, the validation batch size, the activation function, and the drop-out rate. Hyper-Parameters are the name for these parameters. The accuracy of Object Detection depends on the choice of Hyper-Parameters. It is therefore a difficult task to find the best values for these parameters. Fine-Tuning is a method for selecting an effective Hyper-Parameter for improving Object Detection precision. \nSelecting an inaccurate Hyper-Parameter value, leads to Over-Fitting or Under-Fitting of data. Over-Fitting is a problem, when training data is greater than the necessary, leading to learning noise and inaccurate Object Detection [4]. Under-Fitting occurs when a model is unable to capture the data's trend, resulting in more erroneous testing or training outcomes. \nBy changing the ‘Learning rate' of various Deep Learning Models, a balance between Over-Fitting and Under-Fitting is reached in this article. For experimentation purpose, this paper considers four Deep Learning Models such as VGG-16, VGG-19, InceptionV3 and Xception. In terms of maximal Object Detection accuracy, the best zone of Learning-rate for each model is analyzed.  The prediction accuracy of a dataset of 70 object classes is investigated in this study by adjusting the ‘Learning-Rate' while keeping the rest of the Hyper-Parameters fixed.This article focuses on the impact of ‘Learning-Rate' on accuracy in Object Detection and identifies an ideal accuracy zone.  This analysis helps in reduction of computational effort in calculation of Objection Detection Accuracy.","PeriodicalId":262600,"journal":{"name":"New Approaches in Engineering Research Vol. 14","volume":"61 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"New Approaches in Engineering Research Vol. 14","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.9734/bpi/naer/v14/5658d","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Recently, Deep Learning [1] models are used primarily in Object Detection algorithms because of their specific capability for Image Recognition. These models identify items present in input images and videos [2] by extracting features from them. These models have a variety of applications, which include Image Processing, Video analysis, Speech Recognition, Biomedical Image Analysis, Biometric Recognition, Iris Recognition, National Security applications, Cyber Security, Natural Language Processing [3], Weather Forecasting applications, Renewable Energy Generation Scheduling etc. The Convolution Neural Network (CNN) [3], which comprises many artificial neuron layers, is employed for these models. The accuracy of Deep Learning models is determined by a number of factors, including the learning rate, the training batch size, the validation batch size, the activation function, and the drop-out rate. Hyper-Parameters are the name for these parameters. The accuracy of Object Detection depends on the choice of Hyper-Parameters. It is therefore a difficult task to find the best values for these parameters. Fine-Tuning is a method for selecting an effective Hyper-Parameter for improving Object Detection precision. Selecting an inaccurate Hyper-Parameter value, leads to Over-Fitting or Under-Fitting of data. Over-Fitting is a problem, when training data is greater than the necessary, leading to learning noise and inaccurate Object Detection [4]. Under-Fitting occurs when a model is unable to capture the data's trend, resulting in more erroneous testing or training outcomes. By changing the ‘Learning rate' of various Deep Learning Models, a balance between Over-Fitting and Under-Fitting is reached in this article. For experimentation purpose, this paper considers four Deep Learning Models such as VGG-16, VGG-19, InceptionV3 and Xception. In terms of maximal Object Detection accuracy, the best zone of Learning-rate for each model is analyzed.  The prediction accuracy of a dataset of 70 object classes is investigated in this study by adjusting the ‘Learning-Rate' while keeping the rest of the Hyper-Parameters fixed.This article focuses on the impact of ‘Learning-Rate' on accuracy in Object Detection and identifies an ideal accuracy zone.  This analysis helps in reduction of computational effort in calculation of Objection Detection Accuracy.
一种提高目标检测精度的学习率优化技术
最近,深度学习[1]模型主要用于目标检测算法,因为它们具有图像识别的特定能力。这些模型通过从中提取特征来识别输入图像和视频[2]中存在的项目。这些模型有各种各样的应用,包括图像处理、视频分析、语音识别、生物医学图像分析、生物特征识别、虹膜识别、国家安全应用、网络安全、自然语言处理[3]、天气预报应用、可再生能源发电调度等。这些模型采用卷积神经网络(convolutional Neural Network, CNN)[3],它包含许多人工神经元层。深度学习模型的准确性由许多因素决定,包括学习率、训练批大小、验证批大小、激活函数和退出率。超参数是这些参数的名称。目标检测的准确性取决于超参数的选择。因此,找到这些参数的最佳值是一项艰巨的任务。微调是一种选择有效的超参数以提高目标检测精度的方法。选择不准确的超参数值会导致数据的过拟合或欠拟合。过度拟合是一个问题,当训练数据大于所需时,会导致学习噪声和不准确的目标检测[4]。当模型无法捕捉数据的趋势时,就会出现欠拟合,导致更多错误的测试或训练结果。通过改变各种深度学习模型的“学习率”,本文达到了过拟合和欠拟合之间的平衡。为了实验目的,本文考虑了VGG-16、VGG-19、InceptionV3和Xception四种深度学习模型。从最大目标检测精度出发,分析了各模型的最佳学习率区域。在本研究中,通过调整“学习率”,同时保持其余超参数固定,研究了70个对象类数据集的预测精度。本文主要研究了“学习率”对目标检测准确率的影响,并确定了一个理想的准确率区域。这种分析有助于减少目标检测精度的计算量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信