Looking into the future: a machine learning powered prediction model for oocyte return rates after cryopreservation
Yuval Fouks, Pietro Bortoletto, Jeffrey Chang, Alan Penzias, Denis Vaughan, Denny Sakkas
Reproductive BioMedicine Online 50(1):104432. Published 2025-01-01 (Epub 2024-08-29).
DOI: 10.1016/j.rbmo.2024.104432 (https://doi.org/10.1016/j.rbmo.2024.104432)
Abstract
Research question: Could a predictive model, using data from all US fertility clinics reporting to the Society for Assisted Reproductive Technology, estimate the likelihood of patients using their stored oocytes?
Design: Multiple learning algorithms were used, including penalized regression, random forests, gradient boosting machines, linear discriminant analysis and bootstrap-aggregated (bagged) decision trees. Data were split into training and test datasets. The predictors analysed were patient demographics, medical and fertility diagnoses, partner information and geographic location.
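As a rough illustration of the design described above, the following Python sketch trains a soft-voting ensemble of bagged decision trees, stochastic gradient boosting and linear discriminant analysis on a held-out split. It is not the authors' pipeline: the use of scikit-learn, the synthetic imbalanced data, the 75/25 split, soft voting and every hyperparameter are assumptions made only for the example.

```python
from sklearn.datasets import make_classification
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.ensemble import (
    BaggingClassifier,
    GradientBoostingClassifier,
    VotingClassifier,
)
from sklearn.metrics import balanced_accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Synthetic, imbalanced stand-in data (assumption: NOT the registry data).
X, y = make_classification(
    n_samples=2000,
    n_features=10,
    n_informative=5,
    weights=[0.9, 0.1],  # roughly 10% positives, chosen arbitrarily
    random_state=0,
)

# Hold out a stratified test set (the 75/25 ratio is an assumption).
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Three base learners of the kinds named in the abstract.
ensemble = VotingClassifier(
    estimators=[
        # Bootstrap-aggregated (bagged) decision trees.
        ("bagged_trees", BaggingClassifier(
            DecisionTreeClassifier(), n_estimators=50, random_state=0)),
        # subsample < 1.0 makes the gradient boosting stochastic.
        ("sgb", GradientBoostingClassifier(subsample=0.8, random_state=0)),
        ("lda", LinearDiscriminantAnalysis()),
    ],
    voting="soft",  # average predicted probabilities (an assumption)
)
ensemble.fit(X_train, y_train)

score = balanced_accuracy_score(y_test, ensemble.predict(X_test))
print(f"balanced accuracy on held-out data: {score:.2f}")
```

Averaging probabilities is only one way to combine base learners; stacking with a meta-learner would be another, and the abstract does not say which combination rule was used.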
Results: A total of 77,631 oocyte-cryopreservation cycles (2014-2020) were analysed. Patient age averaged 34.5 years. Treatment indications varied: planned (35.6%), gender-related (0.1%), medically indicated (15.5%), oncologic (5.7%) and unknown (42.3%). Infertility diagnoses were less common: unexplained infertility (1.8%), age-related infertility (3.2%), diminished ovarian reserve (9.9%) and endometriosis (1.6%). An ensemble model combining bootstrap-aggregated classification and regression trees, stochastic gradient boosting and linear discriminant analysis yielded the highest predictive accuracy on the test set (balanced accuracy: 0.83, sensitivity: 0.76, specificity: 0.91), with an area under the receiver operating characteristic curve of 0.90 and an area under the precision-recall curve of 0.57. Key factors influencing the likelihood of returning for oocyte use included patient age, presence of a partner, race or ethnicity, the clinic's geographic region and oocyte cryopreservation indication.
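The reported test-set metrics are linked by simple formulas, which the short Python sketch below works through. The confusion-matrix counts are hypothetical, chosen only so that sensitivity and specificity match the reported 0.76 and 0.91; the abstract does not give the actual counts or the proportion of patients who returned.

```python
def sensitivity(tp: int, fn: int) -> float:
    """True-positive rate: share of actual returners that were flagged."""
    return tp / (tp + fn)


def specificity(tn: int, fp: int) -> float:
    """True-negative rate: share of non-returners correctly not flagged."""
    return tn / (tn + fp)


def balanced_accuracy(sens: float, spec: float) -> float:
    """Mean of sensitivity and specificity."""
    return (sens + spec) / 2


def precision(tp: int, fp: int) -> float:
    """Share of flagged patients who actually returned."""
    return tp / (tp + fp)


# Hypothetical counts (assumption): 100 returners, 1000 non-returners.
tp, fn = 76, 24
tn, fp = 910, 90

sens = sensitivity(tp, fn)
spec = specificity(tn, fp)
bal_acc = balanced_accuracy(sens, spec)
prec = precision(tp, fp)

print(f"sensitivity={sens:.2f} specificity={spec:.2f}")
print(f"balanced accuracy={bal_acc:.3f} precision={prec:.3f}")
```

With the rounded inputs the balanced accuracy comes to about 0.835, in line with the reported 0.83. Under these hypothetical counts precision is well below specificity because non-returners outnumber returners, which is one general reason an area under the precision-recall curve can sit far below the area under the receiver operating characteristic curve.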
Conclusions: This model demonstrated significant predictive accuracy, and is a valuable tool for patient counselling on oocyte cryopreservation. It helps to identify patients more likely to use stored oocytes, enhancing healthcare decision-making and the efficiency of gamete storage programmes. The model can be applied to self-financed and insurance-funded cycles.