Optimizing U-Net Architecture with Feed-Forward Neural Networks for Precise Cobb Angle Prediction in Scoliosis Diagnosis

Q3 Mathematics

Indonesian Journal of Electrical Engineering and Informatics Pub Date : 2023-09-28 DOI:10.52549/ijeei.v11i3.5009

Mohamad Iqmal Jamaludin, Teddy Surya Gunawan, Rajendra Kumar Karupiah, Suriza Ahmad Zabidi, Mira Kartiwi, Zamzuri Zakaria

{"title":"Optimizing U-Net Architecture with Feed-Forward Neural Networks for Precise Cobb Angle Prediction in Scoliosis Diagnosis","authors":"Mohamad Iqmal Jamaludin, Teddy Surya Gunawan, Rajendra Kumar Karupiah, Suriza Ahmad Zabidi, Mira Kartiwi, Zamzuri Zakaria","doi":"10.52549/ijeei.v11i3.5009","DOIUrl":null,"url":null,"abstract":"In the burgeoning field of Artificial Intelligence (AI) and its notable subsets, such as Deep Learning (DL), there is evidence of its transformative impact in assisting clinicians, particularly in diagnosing scoliosis. AI is unrivaled for its speed and precision in analyzing medical images, including X-rays and computed tomography (CT) scans. However, the path does not lack obstacles. Biases, unanticipated outcomes, and false positive and negative predictions present significant challenges. Our research employed three complex experimental sets, each focusing on adapting the U-Net architecture. Through a nuanced combination of feed-forward neural network (FFNN) configurations and hyperparameters, we endeavored to determine the most effective nonlinear regression model configuration for predicting the Cobb angle. This was done with the dual purpose of reducing AI training time without sacrificing predictive accuracy. Utilizing the capabilities of the PyTorch framework, we meticulously crafted and refined the deep learning models for each of the three experiments, focusing on an FFFN dropout rate of p=0.45. The Root Mean Square Error (RMSE), the number of epochs, and the number of nodes spanning three hidden layers in each FFFN were utilized as crucial performance metrics while a base learning rate of 0.001 was maintained. Notably, during the optimization phase, one of the experiments incorporated a learning rate scheduler to protect against potential pitfalls such as local minima and saddle points. A judiciously incorporated Early Stopping technique, triggered between the patience range of 5-10 epochs, ensured model stability as the Mean Squared Error (MSE) plateau loss approached approximately 1. Consequently, the model converged between 50 and 82 epochs. We hypothesize that our proposed architecture holds promise for future refinements, conditioned on assiduous experimentation with an array of medical deep learning paradigms.","PeriodicalId":37618,"journal":{"name":"Indonesian Journal of Electrical Engineering and Informatics","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Indonesian Journal of Electrical Engineering and Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.52549/ijeei.v11i3.5009","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Mathematics","Score":null,"Total":0}

引用次数: 0

Abstract

In the burgeoning field of Artificial Intelligence (AI) and its notable subsets, such as Deep Learning (DL), there is evidence of its transformative impact in assisting clinicians, particularly in diagnosing scoliosis. AI is unrivaled for its speed and precision in analyzing medical images, including X-rays and computed tomography (CT) scans. However, the path does not lack obstacles. Biases, unanticipated outcomes, and false positive and negative predictions present significant challenges. Our research employed three complex experimental sets, each focusing on adapting the U-Net architecture. Through a nuanced combination of feed-forward neural network (FFNN) configurations and hyperparameters, we endeavored to determine the most effective nonlinear regression model configuration for predicting the Cobb angle. This was done with the dual purpose of reducing AI training time without sacrificing predictive accuracy. Utilizing the capabilities of the PyTorch framework, we meticulously crafted and refined the deep learning models for each of the three experiments, focusing on an FFFN dropout rate of p=0.45. The Root Mean Square Error (RMSE), the number of epochs, and the number of nodes spanning three hidden layers in each FFFN were utilized as crucial performance metrics while a base learning rate of 0.001 was maintained. Notably, during the optimization phase, one of the experiments incorporated a learning rate scheduler to protect against potential pitfalls such as local minima and saddle points. A judiciously incorporated Early Stopping technique, triggered between the patience range of 5-10 epochs, ensured model stability as the Mean Squared Error (MSE) plateau loss approached approximately 1. Consequently, the model converged between 50 and 82 epochs. We hypothesize that our proposed architecture holds promise for future refinements, conditioned on assiduous experimentation with an array of medical deep learning paradigms.

查看原文本刊更多论文

基于前馈神经网络的U-Net结构优化在脊柱侧凸诊断中的精确Cobb角预测

在人工智能(AI)及其显著子集(如深度学习(DL))的新兴领域，有证据表明它在协助临床医生方面具有变革性影响，特别是在诊断脊柱侧凸方面。人工智能在分析x射线和计算机断层扫描(CT)等医学图像方面的速度和精度无与伦比。然而，这条道路并不缺少障碍。偏见、意想不到的结果、假阳性和假阴性预测都带来了重大挑战。我们的研究采用了三个复杂的实验集，每个实验集都侧重于适应U-Net架构。通过前馈神经网络(FFNN)配置和超参数的微妙组合，我们努力确定最有效的非线性回归模型配置来预测Cobb角。这样做的双重目的是在不牺牲预测准确性的情况下减少人工智能的训练时间。利用PyTorch框架的功能，我们精心制作和完善了三个实验中的每个实验的深度学习模型，重点关注FFFN辍学率p=0.45。在每个FFFN中，均方根误差(RMSE)、epoch数和跨越三个隐藏层的节点数被用作关键的性能指标，同时保持了0.001的基本学习率。值得注意的是，在优化阶段，其中一个实验包含了一个学习率调度器，以防止潜在的缺陷，如局部最小值和鞍点。当均方误差(MSE)平台损失接近约1时，在忍耐范围5-10个epoch之间触发的明智的早期停止技术确保了模型的稳定性。因此，该模型收敛于50至82个时代之间。我们假设，我们提出的架构有希望在未来进行改进，条件是对一系列医学深度学习范例进行不懈的实验。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Indonesian Journal of Electrical Engineering and Informatics Computer Science-Computer Science (miscellaneous)

CiteScore

1.50

自引率

0.00%

发文量

期刊介绍： The journal publishes original papers in the field of electrical, computer and informatics engineering which covers, but not limited to, the following scope: Electronics: Electronic Materials, Microelectronic System, Design and Implementation of Application Specific Integrated Circuits (ASIC), VLSI Design, System-on-a-Chip (SoC) and Electronic Instrumentation Using CAD Tools, digital signal & data Processing, , Biomedical Transducers and instrumentation. Electrical: Electrical Engineering Materials, Electric Power Generation, Transmission and Distribution, Power Electronics, Power Quality, Power Economic, FACTS, Renewable Energy, Electric Traction. Telecommunication: Modulation and Signal Processing for Telecommunication, Information Theory and Coding, Antenna and Wave Propagation, Wireless and Mobile Communications, Radio Communication, Communication Electronics and Microwave, Radar Imaging. Control: Optimal, Robust and Adaptive Controls, Non Linear and Stochastic Controls, Modeling and Identification, Robotics, Image Based Control, Hybrid and Switching Control, Process Optimization and Scheduling, Control and Intelligent Systems. Computer and Informatics: Computer Architecture, Parallel and Distributed Computer, Pervasive Computing, Computer Network, Embedded System, Human—Computer Interaction, Virtual/Augmented Reality, Computer Security, Software Engineering (Software: Lifecycle, Management, Engineering Process, Engineering Tools and Methods), Programming (Programming Methodology and Paradigm), Data Engineering (Data and Knowledge level Modeling, Information Management (DB) practices, Knowledge Based Management System, Knowledge Discovery in Data).