Arezoo Borji , Hossam Haick , Birgit Pohn , Antonia Graf , Jana Zakall , S M Ragib Shahriar Islam , Gernot Kronreif , Daniel Kovatchki , Heinz Strohmer , Sepideh Hatamikia
{"title":"An integrated optimization and deep learning pipeline for predicting live birth success in IVF using feature optimization and transformer-based models","authors":"Arezoo Borji , Hossam Haick , Birgit Pohn , Antonia Graf , Jana Zakall , S M Ragib Shahriar Islam , Gernot Kronreif , Daniel Kovatchki , Heinz Strohmer , Sepideh Hatamikia","doi":"10.1016/j.cmpb.2025.108979","DOIUrl":null,"url":null,"abstract":"<div><div>The complicated interplay of clinical, demographic, and procedural factors makes it difficult to predict the success of in vitro fertilization (IVF), a commonly used assisted reproductive technology. The goal of this research was to create an artificial intelligence (AI) pipeline that could predict live birth outcomes in IVF treatments with high accuracy.</div></div><div><h3>Design</h3><div>We evaluated prediction performance by integrating different feature selection methods, such as principal component analysis (PCA) and particle swarm optimization (PSO), with different machine learning-based classifiers, including random forest (RF) and decision tree, as well as deep learning-based classifiers, including a custom transformer-based model and a Tab_transformer model with an attention mechanism. Additionally, this study analyzes confounding factors like patient age and previous IVF cycles and explores the influence of different perturbation and preprocessing techniques and validates the model’s robustness under varied scenarios. In addition, Shapley Additive Explanations (SHAP) analysis was performed to enhance interpretability of methods.</div></div><div><h3>Results</h3><div>This research demonstrated that the best performance was achieved by combining PSO for feature selection with the Tab_transformer-based deep learning model, yielding an accuracy of 97 % and an AUC of 98.4 %, highlighting its significant performance in prediction live births. By identifying the most significant predictors of infertility and guaranteeing clinical significance, SHAP analysis significantly improved interpretability.</div></div><div><h3>Conclusion</h3><div>With the accuracy and interpretability, this study develops a strong AI pipeline for predicting live birth outcomes in IVF. This study establishes a highly accurate AI pipeline for predicting live birth outcomes in IVF, demonstrating its potential to enhance personalized fertility treatments.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"271 ","pages":"Article 108979"},"PeriodicalIF":4.8000,"publicationDate":"2025-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer methods and programs in biomedicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169260725003967","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
The complicated interplay of clinical, demographic, and procedural factors makes it difficult to predict the success of in vitro fertilization (IVF), a commonly used assisted reproductive technology. The goal of this research was to create an artificial intelligence (AI) pipeline that could predict live birth outcomes in IVF treatments with high accuracy.
Design
We evaluated prediction performance by integrating different feature selection methods, such as principal component analysis (PCA) and particle swarm optimization (PSO), with different machine learning-based classifiers, including random forest (RF) and decision tree, as well as deep learning-based classifiers, including a custom transformer-based model and a Tab_transformer model with an attention mechanism. Additionally, this study analyzes confounding factors like patient age and previous IVF cycles and explores the influence of different perturbation and preprocessing techniques and validates the model’s robustness under varied scenarios. In addition, Shapley Additive Explanations (SHAP) analysis was performed to enhance interpretability of methods.
Results
This research demonstrated that the best performance was achieved by combining PSO for feature selection with the Tab_transformer-based deep learning model, yielding an accuracy of 97 % and an AUC of 98.4 %, highlighting its significant performance in prediction live births. By identifying the most significant predictors of infertility and guaranteeing clinical significance, SHAP analysis significantly improved interpretability.
Conclusion
With the accuracy and interpretability, this study develops a strong AI pipeline for predicting live birth outcomes in IVF. This study establishes a highly accurate AI pipeline for predicting live birth outcomes in IVF, demonstrating its potential to enhance personalized fertility treatments.
期刊介绍:
To encourage the development of formal computing methods, and their application in biomedical research and medical practice, by illustration of fundamental principles in biomedical informatics research; to stimulate basic research into application software design; to report the state of research of biomedical information processing projects; to report new computer methodologies applied in biomedical areas; the eventual distribution of demonstrable software to avoid duplication of effort; to provide a forum for discussion and improvement of existing software; to optimize contact between national organizations and regional user groups by promoting an international exchange of information on formal methods, standards and software in biomedicine.
Computer Methods and Programs in Biomedicine covers computing methodology and software systems derived from computing science for implementation in all aspects of biomedical research and medical practice. It is designed to serve: biochemists; biologists; geneticists; immunologists; neuroscientists; pharmacologists; toxicologists; clinicians; epidemiologists; psychiatrists; psychologists; cardiologists; chemists; (radio)physicists; computer scientists; programmers and systems analysts; biomedical, clinical, electrical and other engineers; teachers of medical informatics and users of educational software.