{"title":"Machine Learning-Based Prediction of First Trimester Down Syndrome Risk in East Asian Populations.","authors":"Yen-Tin Chen, Gina Jinna Chen, Yu-Shiang Lin","doi":"10.2147/RMHP.S511035","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>Down syndrome is the most common chromosomal abnormality in newborns, often leading to developmental delays and congenital structural anomalies. This study employed multiple machine learning models to perform risk prediction and result exploration for first-trimester Down syndrome in East Asian populations, aiming to identify an optimal risk prediction model that will enhance future predictions of Down syndrome risk and improve the efficiency of the screening process.</p><p><strong>Patients and methods: </strong>This study collected data from the Down syndrome screening database at Taipei Chang Gung Memorial Hospital from May 1, 2018, to February 29, 2024. The dataset included 3,812 cases available for analysis, comprising 165 high-risk cases and 3,647 low-risk cases. Fourteen features (including maternal age, nuchal translucency thickness, serum markers, etc.) were input into the twelve machine learning models, along with seven data-balancing algorithms, to explore the risk prediction outcomes. The performance of these models was thoroughly evaluated using AUC (Area Under the Curve), accuracy, precision, recall, and F1 scores.</p><p><strong>Results: </strong>Among the twelve machine learning models, the highest recall of 0.84 for high-risk cases was achieved by LightGBM combined with the RUS (Random Undersampling) data balancing algorithm. The highest AUC of 0.939 was attained by the ANN and LSTM models when combined with the ROS (Random Oversampling) data balancing algorithm.</p><p><strong>Conclusion: </strong>The proposed ANN machine learning model, based on deep neural networks and combined with the ROS data balancing method, achieved an impressive AUC of 0.939 for classifying first-trimester Down syndrome risk in the East Asian population. Notably, this model also achieved an outstanding classification accuracy of 0.97. These results demonstrate the potential of the proposed ANN machine learning model for the accurate prediction of first-trimester Down syndrome risk.</p>","PeriodicalId":56009,"journal":{"name":"Risk Management and Healthcare Policy","volume":"18 ","pages":"1109-1120"},"PeriodicalIF":2.7000,"publicationDate":"2025-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11963822/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Risk Management and Healthcare Policy","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2147/RMHP.S511035","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose: Down syndrome is the most common chromosomal abnormality in newborns, often leading to developmental delays and congenital structural anomalies. This study employed multiple machine learning models to perform risk prediction and result exploration for first-trimester Down syndrome in East Asian populations, aiming to identify an optimal risk prediction model that will enhance future predictions of Down syndrome risk and improve the efficiency of the screening process.
Patients and methods: This study collected data from the Down syndrome screening database at Taipei Chang Gung Memorial Hospital from May 1, 2018, to February 29, 2024. The dataset included 3,812 cases available for analysis, comprising 165 high-risk cases and 3,647 low-risk cases. Fourteen features (including maternal age, nuchal translucency thickness, serum markers, etc.) were input into the twelve machine learning models, along with seven data-balancing algorithms, to explore the risk prediction outcomes. The performance of these models was thoroughly evaluated using AUC (Area Under the Curve), accuracy, precision, recall, and F1 scores.
Results: Among the twelve machine learning models, the highest recall of 0.84 for high-risk cases was achieved by LightGBM combined with the RUS (Random Undersampling) data balancing algorithm. The highest AUC of 0.939 was attained by the ANN and LSTM models when combined with the ROS (Random Oversampling) data balancing algorithm.
Conclusion: The proposed ANN machine learning model, based on deep neural networks and combined with the ROS data balancing method, achieved an impressive AUC of 0.939 for classifying first-trimester Down syndrome risk in the East Asian population. Notably, this model also achieved an outstanding classification accuracy of 0.97. These results demonstrate the potential of the proposed ANN machine learning model for the accurate prediction of first-trimester Down syndrome risk.
期刊介绍:
Risk Management and Healthcare Policy is an international, peer-reviewed, open access journal focusing on all aspects of public health, policy and preventative measures to promote good health and improve morbidity and mortality in the population. Specific topics covered in the journal include:
Public and community health
Policy and law
Preventative and predictive healthcare
Risk and hazard management
Epidemiology, detection and screening
Lifestyle and diet modification
Vaccination and disease transmission/modification programs
Health and safety and occupational health
Healthcare services provision
Health literacy and education
Advertising and promotion of health issues
Health economic evaluations and resource management
Risk Management and Healthcare Policy focuses on human interventional and observational research. The journal welcomes submitted papers covering original research, clinical and epidemiological studies, reviews and evaluations, guidelines, expert opinion and commentary, and extended reports. Case reports will only be considered if they make a valuable and original contribution to the literature. The journal does not accept study protocols, animal-based or cell line-based studies.