Machine Learning-Based Prediction of First Trimester Down Syndrome Risk in East Asian Populations.

IF 2.7 4区医学 Q2 HEALTH CARE SCIENCES & SERVICES

Risk Management and Healthcare Policy Pub Date : 2025-03-29 eCollection Date: 2025-01-01 DOI:10.2147/RMHP.S511035

Yen-Tin Chen, Gina Jinna Chen, Yu-Shiang Lin

{"title":"Machine Learning-Based Prediction of First Trimester Down Syndrome Risk in East Asian Populations.","authors":"Yen-Tin Chen, Gina Jinna Chen, Yu-Shiang Lin","doi":"10.2147/RMHP.S511035","DOIUrl":null,"url":null,"abstract":"Purpose: Down syndrome is the most common chromosomal abnormality in newborns, often leading to developmental delays and congenital structural anomalies. This study employed multiple machine learning models to perform risk prediction and result exploration for first-trimester Down syndrome in East Asian populations, aiming to identify an optimal risk prediction model that will enhance future predictions of Down syndrome risk and improve the efficiency of the screening process.Patients and methods: This study collected data from the Down syndrome screening database at Taipei Chang Gung Memorial Hospital from May 1, 2018, to February 29, 2024. The dataset included 3,812 cases available for analysis, comprising 165 high-risk cases and 3,647 low-risk cases. Fourteen features (including maternal age, nuchal translucency thickness, serum markers, etc.) were input into the twelve machine learning models, along with seven data-balancing algorithms, to explore the risk prediction outcomes. The performance of these models was thoroughly evaluated using AUC (Area Under the Curve), accuracy, precision, recall, and F1 scores.Results: Among the twelve machine learning models, the highest recall of 0.84 for high-risk cases was achieved by LightGBM combined with the RUS (Random Undersampling) data balancing algorithm. The highest AUC of 0.939 was attained by the ANN and LSTM models when combined with the ROS (Random Oversampling) data balancing algorithm.Conclusion: The proposed ANN machine learning model, based on deep neural networks and combined with the ROS data balancing method, achieved an impressive AUC of 0.939 for classifying first-trimester Down syndrome risk in the East Asian population. Notably, this model also achieved an outstanding classification accuracy of 0.97. These results demonstrate the potential of the proposed ANN machine learning model for the accurate prediction of first-trimester Down syndrome risk.","PeriodicalId":56009,"journal":{"name":"Risk Management and Healthcare Policy","volume":"18 ","pages":"1109-1120"},"PeriodicalIF":2.7000,"publicationDate":"2025-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11963822/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Risk Management and Healthcare Policy","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2147/RMHP.S511035","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}

引用次数: 0

Abstract

Purpose: Down syndrome is the most common chromosomal abnormality in newborns, often leading to developmental delays and congenital structural anomalies. This study employed multiple machine learning models to perform risk prediction and result exploration for first-trimester Down syndrome in East Asian populations, aiming to identify an optimal risk prediction model that will enhance future predictions of Down syndrome risk and improve the efficiency of the screening process.

Patients and methods: This study collected data from the Down syndrome screening database at Taipei Chang Gung Memorial Hospital from May 1, 2018, to February 29, 2024. The dataset included 3,812 cases available for analysis, comprising 165 high-risk cases and 3,647 low-risk cases. Fourteen features (including maternal age, nuchal translucency thickness, serum markers, etc.) were input into the twelve machine learning models, along with seven data-balancing algorithms, to explore the risk prediction outcomes. The performance of these models was thoroughly evaluated using AUC (Area Under the Curve), accuracy, precision, recall, and F1 scores.

Results: Among the twelve machine learning models, the highest recall of 0.84 for high-risk cases was achieved by LightGBM combined with the RUS (Random Undersampling) data balancing algorithm. The highest AUC of 0.939 was attained by the ANN and LSTM models when combined with the ROS (Random Oversampling) data balancing algorithm.

Conclusion: The proposed ANN machine learning model, based on deep neural networks and combined with the ROS data balancing method, achieved an impressive AUC of 0.939 for classifying first-trimester Down syndrome risk in the East Asian population. Notably, this model also achieved an outstanding classification accuracy of 0.97. These results demonstrate the potential of the proposed ANN machine learning model for the accurate prediction of first-trimester Down syndrome risk.

查看原文本刊更多论文

基于机器学习的东亚人群妊娠早期唐氏综合症风险预测。

目的：唐氏综合征是新生儿中最常见的染色体异常，常导致发育迟缓和先天性结构异常。本研究采用多种机器学习模型对东亚人群妊娠早期唐氏综合征进行风险预测和结果探索，旨在寻找一种最佳的风险预测模型，以增强对唐氏综合征风险的未来预测，提高筛查过程的效率。患者和方法：本研究收集台北长庚纪念医院2018年5月1日至2024年2月29日唐氏综合征筛查数据库的数据。该数据集包括3812例可用于分析的病例，其中包括165例高风险病例和3647例低风险病例。将14个特征（包括产妇年龄、颈部半透明厚度、血清标志物等）与7种数据平衡算法一起输入到12个机器学习模型中，探索风险预测结果。使用AUC（曲线下面积）、准确度、精密度、召回率和F1分数对这些模型的性能进行了全面评估。结果：在12个机器学习模型中，LightGBM结合RUS （Random Undersampling，随机欠采样）数据平衡算法对高危病例的召回率最高，为0.84。结合ROS （Random Oversampling）数据平衡算法，ANN和LSTM模型的AUC最高，为0.939。结论：本文提出的基于深度神经网络的ANN机器学习模型，结合ROS数据平衡方法，对东亚人群妊娠早期唐氏综合征风险进行分类的AUC为0.939，令人印象深刻。值得注意的是，该模型也取得了0.97的分类精度。这些结果证明了所提出的人工神经网络机器学习模型在准确预测妊娠早期唐氏综合症风险方面的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Risk Management and Healthcare Policy Medicine-Public Health, Environmental and Occupational Health

CiteScore

6.20

自引率

2.90%

发文量

242

审稿时长

16 weeks

期刊介绍： Risk Management and Healthcare Policy is an international, peer-reviewed, open access journal focusing on all aspects of public health, policy and preventative measures to promote good health and improve morbidity and mortality in the population. Specific topics covered in the journal include: Public and community health Policy and law Preventative and predictive healthcare Risk and hazard management Epidemiology, detection and screening Lifestyle and diet modification Vaccination and disease transmission/modification programs Health and safety and occupational health Healthcare services provision Health literacy and education Advertising and promotion of health issues Health economic evaluations and resource management Risk Management and Healthcare Policy focuses on human interventional and observational research. The journal welcomes submitted papers covering original research, clinical and epidemiological studies, reviews and evaluations, guidelines, expert opinion and commentary, and extended reports. Case reports will only be considered if they make a valuable and original contribution to the literature. The journal does not accept study protocols, animal-based or cell line-based studies.