Chen Zheng , Tong Qing , Mao Li , Shujuan Liao , Biru Luo , Chenwei Tang , Jiancheng Lv
{"title":"GDM-BC:用于妊娠糖尿病智能预测的无创身体成分数据集","authors":"Chen Zheng , Tong Qing , Mao Li , Shujuan Liao , Biru Luo , Chenwei Tang , Jiancheng Lv","doi":"10.1016/j.compbiomed.2025.110176","DOIUrl":null,"url":null,"abstract":"<div><div>Gestational Diabetes Mellitus (GDM) refers to any degree of impaired glucose tolerance with onset or first recognition during pregnancy. As a high-prevalence disease, GDM damages the health of both pregnant women and fetuses in the short and long term. Accurate and cost-effective recognition of GDM is quite crucial to reduce the risk and economic pressure of this disease. However, existing datasets for the prediction of GDM primarily focus on clinical and biochemical parameters, including a mass of invasive indexes. These variables are hard to obtain and do not always perform well in the prediction of GDM. In this paper, we introduce a large-scale non-invasive body composition dataset, called GDM-BC, for intelligent risk prediction of GDM. Specifically, it contains a cohort of 39,438 pregnant women, of whom 7777 (19.7%) were subsequently diagnosed with GDM. Besides, our dataset includes a large number of body composition indexes that can be acquired non-invasively. In addition, we perform several traditional machine learning and deep learning methods on the GDM-BC dataset, among which the Residual Attention Fully Connected Network (RAFNet) performs the best, achieving an AUC (area under the ROC curve) of 0.920. The results show that our dataset is marvelous and creates a new perspective on the prediction of GDM. Our models may offer an opportunity to establish a cost-effective screening approach for identifying low-risk pregnant women based on body composition data. We believe that our proposed GDM-BC dataset will advance future research on risk prediction for GDM, as well as provide new insights for intelligent prediction of other high-incidence pregnancy-related diseases such as gestational hypertension.</div></div>","PeriodicalId":10578,"journal":{"name":"Computers in biology and medicine","volume":"192 ","pages":"Article 110176"},"PeriodicalIF":6.3000,"publicationDate":"2025-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"GDM-BC: Non-invasive body composition dataset for intelligent prediction of Gestational Diabetes Mellitus\",\"authors\":\"Chen Zheng , Tong Qing , Mao Li , Shujuan Liao , Biru Luo , Chenwei Tang , Jiancheng Lv\",\"doi\":\"10.1016/j.compbiomed.2025.110176\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Gestational Diabetes Mellitus (GDM) refers to any degree of impaired glucose tolerance with onset or first recognition during pregnancy. As a high-prevalence disease, GDM damages the health of both pregnant women and fetuses in the short and long term. Accurate and cost-effective recognition of GDM is quite crucial to reduce the risk and economic pressure of this disease. However, existing datasets for the prediction of GDM primarily focus on clinical and biochemical parameters, including a mass of invasive indexes. These variables are hard to obtain and do not always perform well in the prediction of GDM. In this paper, we introduce a large-scale non-invasive body composition dataset, called GDM-BC, for intelligent risk prediction of GDM. Specifically, it contains a cohort of 39,438 pregnant women, of whom 7777 (19.7%) were subsequently diagnosed with GDM. Besides, our dataset includes a large number of body composition indexes that can be acquired non-invasively. In addition, we perform several traditional machine learning and deep learning methods on the GDM-BC dataset, among which the Residual Attention Fully Connected Network (RAFNet) performs the best, achieving an AUC (area under the ROC curve) of 0.920. The results show that our dataset is marvelous and creates a new perspective on the prediction of GDM. Our models may offer an opportunity to establish a cost-effective screening approach for identifying low-risk pregnant women based on body composition data. We believe that our proposed GDM-BC dataset will advance future research on risk prediction for GDM, as well as provide new insights for intelligent prediction of other high-incidence pregnancy-related diseases such as gestational hypertension.</div></div>\",\"PeriodicalId\":10578,\"journal\":{\"name\":\"Computers in biology and medicine\",\"volume\":\"192 \",\"pages\":\"Article 110176\"},\"PeriodicalIF\":6.3000,\"publicationDate\":\"2025-04-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers in biology and medicine\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S001048252500527X\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in biology and medicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S001048252500527X","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
GDM-BC: Non-invasive body composition dataset for intelligent prediction of Gestational Diabetes Mellitus
Gestational Diabetes Mellitus (GDM) refers to any degree of impaired glucose tolerance with onset or first recognition during pregnancy. As a high-prevalence disease, GDM damages the health of both pregnant women and fetuses in the short and long term. Accurate and cost-effective recognition of GDM is quite crucial to reduce the risk and economic pressure of this disease. However, existing datasets for the prediction of GDM primarily focus on clinical and biochemical parameters, including a mass of invasive indexes. These variables are hard to obtain and do not always perform well in the prediction of GDM. In this paper, we introduce a large-scale non-invasive body composition dataset, called GDM-BC, for intelligent risk prediction of GDM. Specifically, it contains a cohort of 39,438 pregnant women, of whom 7777 (19.7%) were subsequently diagnosed with GDM. Besides, our dataset includes a large number of body composition indexes that can be acquired non-invasively. In addition, we perform several traditional machine learning and deep learning methods on the GDM-BC dataset, among which the Residual Attention Fully Connected Network (RAFNet) performs the best, achieving an AUC (area under the ROC curve) of 0.920. The results show that our dataset is marvelous and creates a new perspective on the prediction of GDM. Our models may offer an opportunity to establish a cost-effective screening approach for identifying low-risk pregnant women based on body composition data. We believe that our proposed GDM-BC dataset will advance future research on risk prediction for GDM, as well as provide new insights for intelligent prediction of other high-incidence pregnancy-related diseases such as gestational hypertension.
期刊介绍:
Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.