Predictive modeling for the development of diabetes mellitus using key factors in various machine learning approaches

Marenao Tanaka, Yukinori Akiyama, Kazuma Mori, Itaru Hosaka, Kenichi Kato, Keisuke Endo, Toshifumi Ogawa, Tatsuya Sato, Toru Suzuki, Toshiyuki Yano, Hirofumi Ohnishi, Nagisa Hanawa, Masato Furuhashi
DOI: 10.1016/j.deman.2023.100191
Publication date: 2024-01-01
Full text: https://www.sciencedirect.com/science/article/pii/S2666970623000707
Predictive modeling for the development of diabetes mellitus using key factors in various machine learning approaches

Aims

Machine learning (ML) approaches are beneficial when automatic identification of relevant features among numerous candidates is desired. We investigated the predictive ability of several ML models for new onset of diabetes mellitus.

Methods

In 10,248 subjects who received annual health examinations, 58 candidate predictors were evaluated, including the fatty liver index (FLI), which is calculated from waist circumference, body mass index, and levels of triglycerides and γ-glutamyl transferase.
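
The abstract names the four inputs to the FLI but does not give the equation. The sketch below assumes the widely cited formulation by Bedogni et al. (2006), with triglycerides in mg/dL, γ-glutamyl transferase in U/L, and waist circumference in cm; whether the authors used exactly these coefficients and units is an assumption, and the function and parameter names are illustrative.

```python
import math


def fatty_liver_index(triglycerides_mg_dl, bmi_kg_m2, ggt_u_l, waist_cm):
    """Return the fatty liver index on a 0-100 scale.

    Assumes the Bedogni et al. (2006) coefficients; the source abstract
    does not state the equation, so treat this as a sketch.
    """
    if triglycerides_mg_dl <= 0 or ggt_u_l <= 0:
        raise ValueError("triglycerides and GGT must be positive")
    # Linear predictor: log-transformed lab values plus body measurements.
    y = (
        0.953 * math.log(triglycerides_mg_dl)
        + 0.139 * bmi_kg_m2
        + 0.718 * math.log(ggt_u_l)
        + 0.053 * waist_cm
        - 15.745
    )
    # Logistic transform of the linear predictor, scaled to 0-100.
    return 100.0 / (1.0 + math.exp(-y))


# Hypothetical subject, for illustration only (not data from the study).
example_fli = fatty_liver_index(
    triglycerides_mg_dl=150, bmi_kg_m2=25, ggt_u_l=30, waist_cm=90
)
```

Because the index is a logistic transform of a linear score, it rises monotonically with each of the four inputs, which is part of why it can serve as a single compact predictor alongside hemoglobin A1c.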

Results

During a 10-year follow-up period (mean: 6.9 years), 322 subjects (6.5%) in the training group (70%, n = 7,173) and 127 subjects (6.2%) in the test group (30%, n = 3,075) developed new-onset diabetes mellitus. Random forest feature selection with 10-fold cross-validation identified hemoglobin A1c, fasting glucose, and FLI as the top three predictors. With hemoglobin A1c and FLI as the selected features, the C-statistics (analogous to the area under the receiver operating characteristic curve) for logistic regression, naïve Bayes, extreme gradient boosting, and an artificial neural network were 0.874, 0.869, 0.856, and 0.869, respectively. Discriminatory capacity did not differ significantly among the ML models.
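
For a binary outcome, the C-statistic is the probability that a randomly chosen subject who developed diabetes received a higher predicted risk than a randomly chosen subject who did not, with ties counted as one half. The following is a minimal sketch of that definition in plain Python; the function name and the toy labels and scores are illustrative and are not taken from the study.

```python
def c_statistic(labels, scores):
    """Concordance statistic for binary labels (1 = event, 0 = no event).

    Compares every event/non-event pair; a pair is concordant when the
    event subject has the higher score, and ties contribute 0.5.
    """
    event_scores = [s for y, s in zip(labels, scores) if y == 1]
    non_event_scores = [s for y, s in zip(labels, scores) if y == 0]
    if not event_scores or not non_event_scores:
        raise ValueError("need at least one event and one non-event")

    concordant = 0.0
    for e in event_scores:
        for n in non_event_scores:
            if e > n:
                concordant += 1.0
            elif e == n:
                concordant += 0.5
    return concordant / (len(event_scores) * len(non_event_scores))


# Toy data for illustration only: two non-events followed by two events.
toy_labels = [0, 0, 1, 1]
toy_scores = [0.10, 0.40, 0.35, 0.80]
toy_c = c_statistic(toy_labels, toy_scores)
```

The pairwise loop is quadratic in the number of subjects, which remains practical for a test set of about 3,000 people; a rank-based computation would be the usual choice for much larger cohorts. A value of 0.5 corresponds to chance-level discrimination and 1.0 to perfect separation.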

Conclusions

ML models incorporating hemoglobin A1c and FLI provide an accurate and straightforward approach for predicting the development of diabetes mellitus.
