An Improved Performance of Convolutional Neural Network for Infant Pose Estimation by Evaluating Hyperparameter

E. S. Ningrum, E. M. Yuniarno, M. Purnomo
{"title":"An Improved Performance of Convolutional Neural Network for Infant Pose Estimation by Evaluating Hyperparameter","authors":"E. S. Ningrum, E. M. Yuniarno, M. Purnomo","doi":"10.1109/ISITIA59021.2023.10221125","DOIUrl":null,"url":null,"abstract":"The infant stage is crucial for human development, with fidgeting playing a key role in the development of balance and coordination. Recent studies have developed machine learning algorithms that utilize body posture estimation to detect fidgety movements in babies. However, research on optimal hyperparameters for infant posture estimation is still limited. Without a reference to the optimal configuration, research on infant-based pose estimation could be prolonged and deviate from its main goal of detecting infant growth through movement.This paper employs a computer vision approach to enhance the accuracy of predicting fidgety movements in babies. Evaluating the hyperparameters of the Convolutional Neural Network (CNN) model for Baby Pose Estimation can significantly improve its performance. The synthetic and real infant pose (SyRIP) dataset, along with the high-resolution net (HRnet) and distribution-aware coordinate representation of keypoin (DARKPose) models, is utilized for the infant pose estimation dataset. The hyperparameter values were exploited to identify the most optimal results in this research. Among the 37 scenarios, the following hyperparameter combinations yielded the best results: Batch Size combinations of 2 and 4, train epochs of 15 and 150, lambda value of 0.0001, learning rate of 0.00005, learning rate factor of 0.1, learning rate steps of 60 and 120, weight decay of 0.00005, gamma of 0.95, and momentum of 0.9. Increasing the epochs and pre-epochs has proven to enhance the model’s performance. Lambda values show a positive correlation with model performance. Conversely, values such as Learning Rate and its factor, steps, gamma, momentum, and weight decay demonstrate a negative correlation.","PeriodicalId":116682,"journal":{"name":"2023 International Seminar on Intelligent Technology and Its Applications (ISITIA)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Seminar on Intelligent Technology and Its Applications (ISITIA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISITIA59021.2023.10221125","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The infant stage is crucial for human development, with fidgeting playing a key role in the development of balance and coordination. Recent studies have developed machine learning algorithms that utilize body posture estimation to detect fidgety movements in babies. However, research on optimal hyperparameters for infant posture estimation is still limited. Without a reference to the optimal configuration, research on infant-based pose estimation could be prolonged and deviate from its main goal of detecting infant growth through movement.This paper employs a computer vision approach to enhance the accuracy of predicting fidgety movements in babies. Evaluating the hyperparameters of the Convolutional Neural Network (CNN) model for Baby Pose Estimation can significantly improve its performance. The synthetic and real infant pose (SyRIP) dataset, along with the high-resolution net (HRnet) and distribution-aware coordinate representation of keypoin (DARKPose) models, is utilized for the infant pose estimation dataset. The hyperparameter values were exploited to identify the most optimal results in this research. Among the 37 scenarios, the following hyperparameter combinations yielded the best results: Batch Size combinations of 2 and 4, train epochs of 15 and 150, lambda value of 0.0001, learning rate of 0.00005, learning rate factor of 0.1, learning rate steps of 60 and 120, weight decay of 0.00005, gamma of 0.95, and momentum of 0.9. Increasing the epochs and pre-epochs has proven to enhance the model’s performance. Lambda values show a positive correlation with model performance. Conversely, values such as Learning Rate and its factor, steps, gamma, momentum, and weight decay demonstrate a negative correlation.
基于超参数的卷积神经网络婴儿姿态估计性能改进
婴儿阶段是人类发展的关键阶段,坐立不安在平衡和协调的发展中起着关键作用。最近的研究开发了机器学习算法,利用身体姿势估计来检测婴儿的烦躁动作。然而,关于婴儿姿态估计的最优超参数的研究仍然有限。如果没有最优构型的参考,基于婴儿姿态估计的研究可能会延长,并且偏离其通过运动来检测婴儿生长的主要目标。本文采用计算机视觉方法来提高预测婴儿烦躁动作的准确性。对卷积神经网络(CNN)婴儿姿态估计模型的超参数进行评估可以显著提高其性能。采用合成真实婴儿姿态(SyRIP)数据集、高分辨率网络(HRnet)和分布感知的关键点坐标表示(DARKPose)模型进行婴儿姿态估计数据集。在本研究中,利用超参数值来确定最优结果。在37个场景中,以下超参数组合获得了最好的结果:Batch Size组合为2和4,训练epoch为15和150,lambda值为0.0001,学习率为0.00005,学习率因子为0.1,学习率步长为60和120,权衰减为0.00005,gamma为0.95,动量为0.9。增加epoch和pre-epoch已被证明可以提高模型的性能。Lambda值与模型性能呈正相关。相反,学习率及其因子、步骤、伽马、动量和权重衰减等值则表现为负相关。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信