利用正则化与深度神经网络相结合的自编码器增强了糖尿病的预测方法

Q1 Engineering

Periodicals of Engineering and Natural Sciences Pub Date : 2023-01-17 DOI:10.21533/pen.v10i6.3394

H. A. Ismael, Nabeel Al-A'araji, B. K. Shukur

{"title":"利用正则化与深度神经网络相结合的自编码器增强了糖尿病的预测方法","authors":"H. A. Ismael, Nabeel Al-A'araji, B. K. Shukur","doi":"10.21533/pen.v10i6.3394","DOIUrl":null,"url":null,"abstract":"Diabetes mellitus is considered one of the foremost common and extreme diseases worldwide. A precise and early diagnosis of diabetes is essential to avoid complications and is of crucial importance to the medical care that patients get. To achieve that, we need to develop a model to predict diabetes. There are many prediction models, but they suffer from some problems such as the accuracy of prediction being poor and the time complexity. The prediction process is highly dependent on important features. So, in this paper, we proposed a new model called (CAER-DNN) that depends on an unsupervised technique for generating newly important features and a deep neural network for the prediction process. The unsupervised technique is called complete autoencoder with regularization techniques (CAER) that uses to reconstruct the original features (newly learned features). It is focused too much on training the most important learned features and misses out on less important features. Thus, improving the performance of the prediction process. These important features are used as input to the deep neural network for the prediction of diabetes. Our model is applied to two sets of data including Pima Indian and Mendeley diabetic datasets. Based on the 10-fold cross-validation technique Pima Indian dataset achieves high performance in evaluation measures (f1-score 97.38%, accuracy, recall 97.25%, specificity 97.59%, precision 97.53%,). While the Mendeley diabetes dataset achieved high performance in evaluation measures (f1-score 94.51%, accuracy 98.48, recall 91.74%, accuracy-balance 98.21%, precision 98.21%) based on the holdout technique. compared with other existing machine learning and deep learning techniques our model outperformed existing techniques.","PeriodicalId":37519,"journal":{"name":"Periodicals of Engineering and Natural Sciences","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhanced the prediction approach of diabetes using an autoencoder with regularization and deep neural network\",\"authors\":\"H. A. Ismael, Nabeel Al-A'araji, B. K. Shukur\",\"doi\":\"10.21533/pen.v10i6.3394\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Diabetes mellitus is considered one of the foremost common and extreme diseases worldwide. A precise and early diagnosis of diabetes is essential to avoid complications and is of crucial importance to the medical care that patients get. To achieve that, we need to develop a model to predict diabetes. There are many prediction models, but they suffer from some problems such as the accuracy of prediction being poor and the time complexity. The prediction process is highly dependent on important features. So, in this paper, we proposed a new model called (CAER-DNN) that depends on an unsupervised technique for generating newly important features and a deep neural network for the prediction process. The unsupervised technique is called complete autoencoder with regularization techniques (CAER) that uses to reconstruct the original features (newly learned features). It is focused too much on training the most important learned features and misses out on less important features. Thus, improving the performance of the prediction process. These important features are used as input to the deep neural network for the prediction of diabetes. Our model is applied to two sets of data including Pima Indian and Mendeley diabetic datasets. Based on the 10-fold cross-validation technique Pima Indian dataset achieves high performance in evaluation measures (f1-score 97.38%, accuracy, recall 97.25%, specificity 97.59%, precision 97.53%,). While the Mendeley diabetes dataset achieved high performance in evaluation measures (f1-score 94.51%, accuracy 98.48, recall 91.74%, accuracy-balance 98.21%, precision 98.21%) based on the holdout technique. compared with other existing machine learning and deep learning techniques our model outperformed existing techniques.\",\"PeriodicalId\":37519,\"journal\":{\"name\":\"Periodicals of Engineering and Natural Sciences\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Periodicals of Engineering and Natural Sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.21533/pen.v10i6.3394\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Engineering\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Periodicals of Engineering and Natural Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21533/pen.v10i6.3394","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Engineering","Score":null,"Total":0}

引用次数: 0

摘要

糖尿病被认为是世界上最常见和最极端的疾病之一。糖尿病的准确和早期诊断对于避免并发症至关重要，对患者获得的医疗护理至关重要。为了实现这一目标，我们需要开发一个模型来预测糖尿病。预测模型很多，但存在预测精度差、时间复杂等问题。预测过程高度依赖于重要特征。因此，在本文中，我们提出了一种新的模型(CAER-DNN)，该模型依赖于无监督技术来生成新的重要特征，并依赖于深度神经网络来进行预测过程。无监督技术称为正则化完全自编码器(CAER)，用于重建原始特征(新学习的特征)。它过于专注于训练最重要的学习特征，而忽略了不太重要的特征。从而提高了预测过程的性能。这些重要特征被用作深度神经网络预测糖尿病的输入。我们的模型应用于两组数据，包括皮马印第安人和门德利糖尿病数据集。基于10倍交叉验证技术，Pima Indian数据集在评价指标上表现优异(f1得分97.38%，正确率97.25%，召回率97.25%，特异性97.59%，精密度97.53%)。而Mendeley糖尿病数据集在基于hold - out技术的评价指标上表现优异(f1-score 94.51%，准确率98.48，召回率91.74%，准确度-平衡98.21%，精密度98.21%)。与其他现有的机器学习和深度学习技术相比，我们的模型优于现有的技术。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Enhanced the prediction approach of diabetes using an autoencoder with regularization and deep neural network

Diabetes mellitus is considered one of the foremost common and extreme diseases worldwide. A precise and early diagnosis of diabetes is essential to avoid complications and is of crucial importance to the medical care that patients get. To achieve that, we need to develop a model to predict diabetes. There are many prediction models, but they suffer from some problems such as the accuracy of prediction being poor and the time complexity. The prediction process is highly dependent on important features. So, in this paper, we proposed a new model called (CAER-DNN) that depends on an unsupervised technique for generating newly important features and a deep neural network for the prediction process. The unsupervised technique is called complete autoencoder with regularization techniques (CAER) that uses to reconstruct the original features (newly learned features). It is focused too much on training the most important learned features and misses out on less important features. Thus, improving the performance of the prediction process. These important features are used as input to the deep neural network for the prediction of diabetes. Our model is applied to two sets of data including Pima Indian and Mendeley diabetic datasets. Based on the 10-fold cross-validation technique Pima Indian dataset achieves high performance in evaluation measures (f1-score 97.38%, accuracy, recall 97.25%, specificity 97.59%, precision 97.53%,). While the Mendeley diabetes dataset achieved high performance in evaluation measures (f1-score 94.51%, accuracy 98.48, recall 91.74%, accuracy-balance 98.21%, precision 98.21%) based on the holdout technique. compared with other existing machine learning and deep learning techniques our model outperformed existing techniques.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Periodicals of Engineering and Natural Sciences Engineering-Architecture

CiteScore

1.90

自引率

0.00%

发文量

140

审稿时长

7 weeks

期刊介绍： *Industrial Engineering: 1 . Ergonomics 2 . Manufacturing 3 . TQM/quality engineering, reliability/maintenance engineering 4 . Production Planning 5 . Facility location, layout, design, materials handling 6 . Education, case studies 7 . Inventory, logistics, transportation, supply chain management 8 . Management 9 . Project/operations management, scheduling 10 . Information systems for production and management 11 . Innovation, knowledge management, organizational learning *Mechanical Engineering: 1 . Energy 2 . Machine Design 3 . Engineering Materials 4 . Manufacturing 5 . Mechatronics & Robotics 6 . Transportation 7 . Fluid Mechanics 8 . Optical Engineering 9 . Nanotechnology 10 . Maintenance & Safety *Computer Science: 1 . Computational Intelligence 2 . Computer Graphics 3 . Data Mining 4 . Human-Centered Computing 5 . Internet and Web Computing 6 . Mobile and Cloud computing 7 . Software Engineering 8 . Online Social Networks *Electrical and electronics engineering 1 . Sensor, automation and instrumentation technology 2 . Telecommunications 3 . Power systems 4 . Electronics 5 . Nanotechnology *Architecture: 1 . Advanced digital applications in architecture practice and computation within Generative processes of design 2 . Computer science, biology and ecology connected with structural engineering 3 . Technology and sustainability in architecture *Bioengineering: 1 . Medical Sciences 2 . Biological and Biomedical Sciences 3 . Agriculture and Life Sciences 4 . Biology and neuroscience 5 . Biological Sciences (Botany, Forestry, Cell Biology, Marine Biology, Zoology) [...]