A Learning-rate Optimization Technique for Object Detection Accuracy Enhancement

New Approaches in Engineering Research Vol. 14 Pub Date : 2021-08-25 DOI:10.9734/bpi/naer/v14/5658d

C. Anusha, P. S. Avadhani

{"title":"A Learning-rate Optimization Technique for Object Detection Accuracy Enhancement","authors":"C. Anusha, P. S. Avadhani","doi":"10.9734/bpi/naer/v14/5658d","DOIUrl":null,"url":null,"abstract":"Recently, Deep Learning [1] models are used primarily in Object Detection algorithms because of their specific capability for Image Recognition. These models identify items present in input images and videos [2] by extracting features from them. These models have a variety of applications, which include Image Processing, Video analysis, Speech Recognition, Biomedical Image Analysis, Biometric Recognition, Iris Recognition, National Security applications, Cyber Security, Natural Language Processing [3], Weather Forecasting applications, Renewable Energy Generation Scheduling etc. The Convolution Neural Network (CNN) [3], which comprises many artificial neuron layers, is employed for these models. The accuracy of Deep Learning models is determined by a number of factors, including the learning rate, the training batch size, the validation batch size, the activation function, and the drop-out rate. Hyper-Parameters are the name for these parameters. The accuracy of Object Detection depends on the choice of Hyper-Parameters. It is therefore a difficult task to find the best values for these parameters. Fine-Tuning is a method for selecting an effective Hyper-Parameter for improving Object Detection precision. \nSelecting an inaccurate Hyper-Parameter value, leads to Over-Fitting or Under-Fitting of data. Over-Fitting is a problem, when training data is greater than the necessary, leading to learning noise and inaccurate Object Detection [4]. Under-Fitting occurs when a model is unable to capture the data's trend, resulting in more erroneous testing or training outcomes. \nBy changing the ‘Learning rate' of various Deep Learning Models, a balance between Over-Fitting and Under-Fitting is reached in this article. For experimentation purpose, this paper considers four Deep Learning Models such as VGG-16, VGG-19, InceptionV3 and Xception. In terms of maximal Object Detection accuracy, the best zone of Learning-rate for each model is analyzed. The prediction accuracy of a dataset of 70 object classes is investigated in this study by adjusting the ‘Learning-Rate' while keeping the rest of the Hyper-Parameters fixed.This article focuses on the impact of ‘Learning-Rate' on accuracy in Object Detection and identifies an ideal accuracy zone. This analysis helps in reduction of computational effort in calculation of Objection Detection Accuracy.","PeriodicalId":262600,"journal":{"name":"New Approaches in Engineering Research Vol. 14","volume":"61 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"New Approaches in Engineering Research Vol. 14","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.9734/bpi/naer/v14/5658d","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Recently, Deep Learning [1] models are used primarily in Object Detection algorithms because of their specific capability for Image Recognition. These models identify items present in input images and videos [2] by extracting features from them. These models have a variety of applications, which include Image Processing, Video analysis, Speech Recognition, Biomedical Image Analysis, Biometric Recognition, Iris Recognition, National Security applications, Cyber Security, Natural Language Processing [3], Weather Forecasting applications, Renewable Energy Generation Scheduling etc. The Convolution Neural Network (CNN) [3], which comprises many artificial neuron layers, is employed for these models. The accuracy of Deep Learning models is determined by a number of factors, including the learning rate, the training batch size, the validation batch size, the activation function, and the drop-out rate. Hyper-Parameters are the name for these parameters. The accuracy of Object Detection depends on the choice of Hyper-Parameters. It is therefore a difficult task to find the best values for these parameters. Fine-Tuning is a method for selecting an effective Hyper-Parameter for improving Object Detection precision. Selecting an inaccurate Hyper-Parameter value, leads to Over-Fitting or Under-Fitting of data. Over-Fitting is a problem, when training data is greater than the necessary, leading to learning noise and inaccurate Object Detection [4]. Under-Fitting occurs when a model is unable to capture the data's trend, resulting in more erroneous testing or training outcomes. By changing the ‘Learning rate' of various Deep Learning Models, a balance between Over-Fitting and Under-Fitting is reached in this article. For experimentation purpose, this paper considers four Deep Learning Models such as VGG-16, VGG-19, InceptionV3 and Xception. In terms of maximal Object Detection accuracy, the best zone of Learning-rate for each model is analyzed. The prediction accuracy of a dataset of 70 object classes is investigated in this study by adjusting the ‘Learning-Rate' while keeping the rest of the Hyper-Parameters fixed.This article focuses on the impact of ‘Learning-Rate' on accuracy in Object Detection and identifies an ideal accuracy zone. This analysis helps in reduction of computational effort in calculation of Objection Detection Accuracy.

查看原文本刊更多论文

一种提高目标检测精度的学习率优化技术

最近，深度学习[1]模型主要用于目标检测算法，因为它们具有图像识别的特定能力。这些模型通过从中提取特征来识别输入图像和视频[2]中存在的项目。这些模型有各种各样的应用，包括图像处理、视频分析、语音识别、生物医学图像分析、生物特征识别、虹膜识别、国家安全应用、网络安全、自然语言处理[3]、天气预报应用、可再生能源发电调度等。这些模型采用卷积神经网络(convolutional Neural Network, CNN)[3]，它包含许多人工神经元层。深度学习模型的准确性由许多因素决定，包括学习率、训练批大小、验证批大小、激活函数和退出率。超参数是这些参数的名称。目标检测的准确性取决于超参数的选择。因此，找到这些参数的最佳值是一项艰巨的任务。微调是一种选择有效的超参数以提高目标检测精度的方法。选择不准确的超参数值会导致数据的过拟合或欠拟合。过度拟合是一个问题，当训练数据大于所需时，会导致学习噪声和不准确的目标检测[4]。当模型无法捕捉数据的趋势时，就会出现欠拟合，导致更多错误的测试或训练结果。通过改变各种深度学习模型的“学习率”，本文达到了过拟合和欠拟合之间的平衡。为了实验目的，本文考虑了VGG-16、VGG-19、InceptionV3和Xception四种深度学习模型。从最大目标检测精度出发，分析了各模型的最佳学习率区域。在本研究中，通过调整“学习率”，同时保持其余超参数固定，研究了70个对象类数据集的预测精度。本文主要研究了“学习率”对目标检测准确率的影响，并确定了一个理想的准确率区域。这种分析有助于减少目标检测精度的计算量。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

New Approaches in Engineering Research Vol. 14

自引率

0.00%

发文量