一种新型人工智能(AI)模型的多中心外部验证,该模型可预测成熟卵母细胞的囊胚PGT-A结果

IF 6 1区 医学 Q1 OBSTETRICS & GYNECOLOGY
N Mercuri, J Fjeldstad, S Corsac, D Nayot, A Krivoi
{"title":"一种新型人工智能(AI)模型的多中心外部验证,该模型可预测成熟卵母细胞的囊胚PGT-A结果","authors":"N Mercuri, J Fjeldstad, S Corsac, D Nayot, A Krivoi","doi":"10.1093/humrep/deaf097.004","DOIUrl":null,"url":null,"abstract":"Study question Is a model developed to predict euploid blastocyst development from mature oocytes generalizable across varying geographic locations? Summary answer A non-invasive AI model predicts euploid blastocyst development from mature oocytes with an AUC of 0.68 on a large dataset from 5 clinics (4 countries). What is known already MAGENTA is an AI-based model that assesses mature oocyte images and provides a score (0-10) correlated to its likelihood of developing to a blastocyst-stage embryo. An additional model has been recently developed to further predict the likelihood of euploid blastocyst development from the same oocyte images yet also incorporates oocyte age and MAGENTA’s assessments as key features. This Ploidy-AI model provides additional insight reflective of the oocyte’s potential chromosomal complement–enhancing the clinical utility of assessments. With the development of a new AI model, external validation of its performance is necessary to ensure generalizability across different geographies and patient demographics. Study design, size, duration This is a retrospective study that included 13,307 images of mature oocytes obtained from 5 clinics in 4 countries (1603 patients, 1949 cycles) including Argentina (C1; mean age 32.6±7.0, BMI unavailable), Brazil (C2; mean age 38.1±3.6, BMI 38.3), Spain (C3; mean age 38.7±3.8, BMI 20.4 and C4; mean age 38.9±3.4, BMI 21.8), and USA (C5; mean age 37.2±4.1; BMI 27). Images were obtained immediately post-ICSI from Embryoscope or GERI Time-Lapse incubators between the years 2020-2024. Participants/materials, setting, methods 13,307 oocyte images were assessed by MAGENTA and the Ploidy-AI model to predict each oocyte’s likelihood of developing into a euploid blastocyst (0-100%). Oocytes that did not develop into a blastocyst (n = 7385) or those that developed into an aneuploid blastocyst (n = 3534) were labelled the negative outcome, whereas those that developed into euploid blastocysts (n = 2388) were labeled the positive outcome. Untested or mosaic blastocysts were excluded. Main results and the role of chance On 13,307 mature oocytes, the Ploidy-AI model achieved an AUC of 0.68, sensitivity 0.54, and specificity 0.71. Oocytes that failed blastulation or developed into an aneuploid blastocyst had significantly lower median model-predicted euploid probability (n = 10,919, 0.20) than those that developed into an euploid blastocyst (n = 2388, 0.28) by Mann-Whitney U-test (p < 0.001). Additionally, model-predicted euploid probabilities were divided into quartiles (Q) according to the distribution within this dataset—Q1 (n = 3327), Q2 (n = 3327), Q3 (n = 3326), Q4 (n = 3327). A significant, stepwise positive increase in true euploid development rate for oocytes within each quartile of model-predicted probabilities was observed by pairwise-proportions test with Bonferroni correction (all p < 0.001): Q1(6%), Q2(14%), Q3(22%), and Q4(30%). Subgroup analysis by Clinic revealed consistent performance across all 5 clinics; C1 (n = 1643) – AUC 0.66, sensitivity 0.70, specificity 0.54; C2 (n = 7239) – AUC 0.68, sensitivity 0.51, specificity 0.72; C3 (n = 2442) – AUC 0.72, sensitivity 0.49, specificity 0.80; C4 (n = 802) – AUC 0.68, sensitivity 0.46, specificity 0.76; C5 (n = 1181) – AUC 0.66, sensitivity 0.61, specificity 0.62. The Ploidy-AI model performance was significantly higher on C3 than C1 (p < 0.001), C2 (p < 0.01), C5 (p < 0.01), and the overall dataset (p < 0.01) by DeLong’s test; however, no significant differences were observed in the other clinic-to-clinic or clinic-to-overall dataset comparisons. Limitations, reasons for caution The model displayed significantly higher performance on C3 compared to three other clinics, although model performance on the remaining clinics was similar and comparable to the overall dataset AUC. Further validating the model in additional geographies may ensure greater application. This study was retrospective in nature, prospective evaluation is warranted. Wider implications of the findings External validation of newly developed AI models is critical prior to clinical utilization. Large, diverse datasets, as in this study, ensure model generalization. This study presents a robust validation of a model that predicts blastocyst ploidy development from mature oocytes and is consistent across various clinic locations in different countries. Trial registration number No","PeriodicalId":13003,"journal":{"name":"Human reproduction","volume":"53 1","pages":""},"PeriodicalIF":6.0000,"publicationDate":"2025-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"O-004 Multi-center, external validation of a novel artificial intelligence (AI) model that predicts blastocyst PGT-A results from mature oocytes\",\"authors\":\"N Mercuri, J Fjeldstad, S Corsac, D Nayot, A Krivoi\",\"doi\":\"10.1093/humrep/deaf097.004\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Study question Is a model developed to predict euploid blastocyst development from mature oocytes generalizable across varying geographic locations? Summary answer A non-invasive AI model predicts euploid blastocyst development from mature oocytes with an AUC of 0.68 on a large dataset from 5 clinics (4 countries). What is known already MAGENTA is an AI-based model that assesses mature oocyte images and provides a score (0-10) correlated to its likelihood of developing to a blastocyst-stage embryo. An additional model has been recently developed to further predict the likelihood of euploid blastocyst development from the same oocyte images yet also incorporates oocyte age and MAGENTA’s assessments as key features. This Ploidy-AI model provides additional insight reflective of the oocyte’s potential chromosomal complement–enhancing the clinical utility of assessments. With the development of a new AI model, external validation of its performance is necessary to ensure generalizability across different geographies and patient demographics. Study design, size, duration This is a retrospective study that included 13,307 images of mature oocytes obtained from 5 clinics in 4 countries (1603 patients, 1949 cycles) including Argentina (C1; mean age 32.6±7.0, BMI unavailable), Brazil (C2; mean age 38.1±3.6, BMI 38.3), Spain (C3; mean age 38.7±3.8, BMI 20.4 and C4; mean age 38.9±3.4, BMI 21.8), and USA (C5; mean age 37.2±4.1; BMI 27). Images were obtained immediately post-ICSI from Embryoscope or GERI Time-Lapse incubators between the years 2020-2024. Participants/materials, setting, methods 13,307 oocyte images were assessed by MAGENTA and the Ploidy-AI model to predict each oocyte’s likelihood of developing into a euploid blastocyst (0-100%). Oocytes that did not develop into a blastocyst (n = 7385) or those that developed into an aneuploid blastocyst (n = 3534) were labelled the negative outcome, whereas those that developed into euploid blastocysts (n = 2388) were labeled the positive outcome. Untested or mosaic blastocysts were excluded. Main results and the role of chance On 13,307 mature oocytes, the Ploidy-AI model achieved an AUC of 0.68, sensitivity 0.54, and specificity 0.71. Oocytes that failed blastulation or developed into an aneuploid blastocyst had significantly lower median model-predicted euploid probability (n = 10,919, 0.20) than those that developed into an euploid blastocyst (n = 2388, 0.28) by Mann-Whitney U-test (p < 0.001). Additionally, model-predicted euploid probabilities were divided into quartiles (Q) according to the distribution within this dataset—Q1 (n = 3327), Q2 (n = 3327), Q3 (n = 3326), Q4 (n = 3327). A significant, stepwise positive increase in true euploid development rate for oocytes within each quartile of model-predicted probabilities was observed by pairwise-proportions test with Bonferroni correction (all p < 0.001): Q1(6%), Q2(14%), Q3(22%), and Q4(30%). Subgroup analysis by Clinic revealed consistent performance across all 5 clinics; C1 (n = 1643) – AUC 0.66, sensitivity 0.70, specificity 0.54; C2 (n = 7239) – AUC 0.68, sensitivity 0.51, specificity 0.72; C3 (n = 2442) – AUC 0.72, sensitivity 0.49, specificity 0.80; C4 (n = 802) – AUC 0.68, sensitivity 0.46, specificity 0.76; C5 (n = 1181) – AUC 0.66, sensitivity 0.61, specificity 0.62. The Ploidy-AI model performance was significantly higher on C3 than C1 (p < 0.001), C2 (p < 0.01), C5 (p < 0.01), and the overall dataset (p < 0.01) by DeLong’s test; however, no significant differences were observed in the other clinic-to-clinic or clinic-to-overall dataset comparisons. Limitations, reasons for caution The model displayed significantly higher performance on C3 compared to three other clinics, although model performance on the remaining clinics was similar and comparable to the overall dataset AUC. Further validating the model in additional geographies may ensure greater application. This study was retrospective in nature, prospective evaluation is warranted. Wider implications of the findings External validation of newly developed AI models is critical prior to clinical utilization. Large, diverse datasets, as in this study, ensure model generalization. This study presents a robust validation of a model that predicts blastocyst ploidy development from mature oocytes and is consistent across various clinic locations in different countries. Trial registration number No\",\"PeriodicalId\":13003,\"journal\":{\"name\":\"Human reproduction\",\"volume\":\"53 1\",\"pages\":\"\"},\"PeriodicalIF\":6.0000,\"publicationDate\":\"2025-06-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Human reproduction\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1093/humrep/deaf097.004\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"OBSTETRICS & GYNECOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Human reproduction","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1093/humrep/deaf097.004","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OBSTETRICS & GYNECOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

研究问题:一个预测成熟卵母细胞发育成整倍体囊胚的模型是否可以在不同的地理位置推广?在来自5个诊所(4个国家)的大型数据集上,一种无创AI模型预测成熟卵母细胞发育成整倍体囊胚,AUC为0.68。目前已知的MAGENTA是一种基于人工智能的模型,它可以评估成熟的卵母细胞图像,并根据其发育成囊胚期胚胎的可能性提供评分(0-10)。最近开发了一个额外的模型来进一步预测来自相同卵母细胞图像的整倍体囊胚发育的可能性,但也将卵母细胞年龄和MAGENTA的评估作为关键特征。这种Ploidy-AI模型提供了反映卵母细胞潜在染色体补体的额外见解,增强了评估的临床效用。随着新的人工智能模型的发展,有必要对其性能进行外部验证,以确保不同地区和患者人口统计数据的通用性。这是一项回顾性研究,包括来自4个国家5个诊所的13307张成熟卵母细胞图像(1603例患者,1949个周期),包括阿根廷(C1;平均年龄32.6±7.0,BMI不详),巴西(C2;平均年龄38.1±3.6,BMI 38.3),西班牙(C3;平均年龄38.7±3.8,BMI 20.4, C4;平均年龄38.9±3.4,BMI 21.8),美国(C5;平均年龄37.2±4.1岁;BMI 27)。图像在icsi后立即从胚胎镜或GERI延时孵育器获得,时间为2020-2024年。通过MAGENTA和Ploidy-AI模型评估13307个卵母细胞图像,预测每个卵母细胞发育成整倍体囊胚的可能性(0-100%)。未发育为囊胚的卵母细胞(n = 7385)或发育为非整倍体囊胚的卵母细胞(n = 3534)被标记为阴性结果,而发育为整倍体囊胚的卵母细胞(n = 2388)被标记为阳性结果。未检测或嵌合囊胚被排除在外。在13307个成熟卵母细胞上,Ploidy-AI模型的AUC为0.68,灵敏度为0.54,特异性为0.71。通过Mann-Whitney u检验(p < p < p),未发育成囊胚或发育成非整倍体囊胚的卵母细胞在模型预测的整倍体概率中位数(n = 10,919, 0.20)显著低于发育成整倍体囊胚的卵母细胞(n = 2388, 0.28)。0.001)。此外,根据该数据集内的分布,将模型预测的整倍体概率分为四分位数(Q)——q1 (n = 3327)、Q2 (n = 3327)、Q3 (n = 3326)、Q4 (n = 3327)。在模型预测概率的每一个四分位数内,通过Bonferroni校正的成对比例检验(所有p &;lt;0.001): Q1(6%), Q2(14%), Q3(22%)和Q4(30%)。Clinic的亚组分析显示,所有5家诊所的表现一致;C1 (n = 1643) - AUC 0.66,敏感性0.70,特异性0.54;C2 (n = 7239) - AUC 0.68,敏感性0.51,特异性0.72;C3 (n = 2442) - AUC 0.72,敏感性0.49,特异性0.80;C4 (n = 802) - AUC 0.68,敏感性0.46,特异性0.76;C5 (n = 1181) - AUC 0.66,敏感性0.61,特异性0.62。plidy - ai模型在C3上的生产性能显著高于C1 (p <;0.001), C2 (p <;0.01), C5 (p <;0.01),整体数据集(p <;DeLong检验0.01);然而,在其他临床与临床或临床与总体数据集比较中未观察到显著差异。与其他三个诊所相比,该模型在C3上显示出明显更高的性能,尽管模型在其余诊所的性能与整体数据集AUC相似且可比较。在其他地区进一步验证该模型可以确保更广泛的应用。本研究为回顾性研究,有必要进行前瞻性评价。在临床应用之前,对新开发的人工智能模型进行外部验证至关重要。在本研究中,大型、多样化的数据集确保了模型的泛化。这项研究对一个预测囊胚倍性从成熟卵母细胞发育的模型进行了强有力的验证,并且在不同国家的不同临床地点是一致的。试验注册号
本文章由计算机程序翻译,如有差异,请以英文原文为准。
O-004 Multi-center, external validation of a novel artificial intelligence (AI) model that predicts blastocyst PGT-A results from mature oocytes
Study question Is a model developed to predict euploid blastocyst development from mature oocytes generalizable across varying geographic locations? Summary answer A non-invasive AI model predicts euploid blastocyst development from mature oocytes with an AUC of 0.68 on a large dataset from 5 clinics (4 countries). What is known already MAGENTA is an AI-based model that assesses mature oocyte images and provides a score (0-10) correlated to its likelihood of developing to a blastocyst-stage embryo. An additional model has been recently developed to further predict the likelihood of euploid blastocyst development from the same oocyte images yet also incorporates oocyte age and MAGENTA’s assessments as key features. This Ploidy-AI model provides additional insight reflective of the oocyte’s potential chromosomal complement–enhancing the clinical utility of assessments. With the development of a new AI model, external validation of its performance is necessary to ensure generalizability across different geographies and patient demographics. Study design, size, duration This is a retrospective study that included 13,307 images of mature oocytes obtained from 5 clinics in 4 countries (1603 patients, 1949 cycles) including Argentina (C1; mean age 32.6±7.0, BMI unavailable), Brazil (C2; mean age 38.1±3.6, BMI 38.3), Spain (C3; mean age 38.7±3.8, BMI 20.4 and C4; mean age 38.9±3.4, BMI 21.8), and USA (C5; mean age 37.2±4.1; BMI 27). Images were obtained immediately post-ICSI from Embryoscope or GERI Time-Lapse incubators between the years 2020-2024. Participants/materials, setting, methods 13,307 oocyte images were assessed by MAGENTA and the Ploidy-AI model to predict each oocyte’s likelihood of developing into a euploid blastocyst (0-100%). Oocytes that did not develop into a blastocyst (n = 7385) or those that developed into an aneuploid blastocyst (n = 3534) were labelled the negative outcome, whereas those that developed into euploid blastocysts (n = 2388) were labeled the positive outcome. Untested or mosaic blastocysts were excluded. Main results and the role of chance On 13,307 mature oocytes, the Ploidy-AI model achieved an AUC of 0.68, sensitivity 0.54, and specificity 0.71. Oocytes that failed blastulation or developed into an aneuploid blastocyst had significantly lower median model-predicted euploid probability (n = 10,919, 0.20) than those that developed into an euploid blastocyst (n = 2388, 0.28) by Mann-Whitney U-test (p &lt; 0.001). Additionally, model-predicted euploid probabilities were divided into quartiles (Q) according to the distribution within this dataset—Q1 (n = 3327), Q2 (n = 3327), Q3 (n = 3326), Q4 (n = 3327). A significant, stepwise positive increase in true euploid development rate for oocytes within each quartile of model-predicted probabilities was observed by pairwise-proportions test with Bonferroni correction (all p &lt; 0.001): Q1(6%), Q2(14%), Q3(22%), and Q4(30%). Subgroup analysis by Clinic revealed consistent performance across all 5 clinics; C1 (n = 1643) – AUC 0.66, sensitivity 0.70, specificity 0.54; C2 (n = 7239) – AUC 0.68, sensitivity 0.51, specificity 0.72; C3 (n = 2442) – AUC 0.72, sensitivity 0.49, specificity 0.80; C4 (n = 802) – AUC 0.68, sensitivity 0.46, specificity 0.76; C5 (n = 1181) – AUC 0.66, sensitivity 0.61, specificity 0.62. The Ploidy-AI model performance was significantly higher on C3 than C1 (p &lt; 0.001), C2 (p &lt; 0.01), C5 (p &lt; 0.01), and the overall dataset (p &lt; 0.01) by DeLong’s test; however, no significant differences were observed in the other clinic-to-clinic or clinic-to-overall dataset comparisons. Limitations, reasons for caution The model displayed significantly higher performance on C3 compared to three other clinics, although model performance on the remaining clinics was similar and comparable to the overall dataset AUC. Further validating the model in additional geographies may ensure greater application. This study was retrospective in nature, prospective evaluation is warranted. Wider implications of the findings External validation of newly developed AI models is critical prior to clinical utilization. Large, diverse datasets, as in this study, ensure model generalization. This study presents a robust validation of a model that predicts blastocyst ploidy development from mature oocytes and is consistent across various clinic locations in different countries. Trial registration number No
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Human reproduction
Human reproduction 医学-妇产科学
CiteScore
10.90
自引率
6.60%
发文量
1369
审稿时长
1 months
期刊介绍: Human Reproduction features full-length, peer-reviewed papers reporting original research, concise clinical case reports, as well as opinions and debates on topical issues. Papers published cover the clinical science and medical aspects of reproductive physiology, pathology and endocrinology; including andrology, gonad function, gametogenesis, fertilization, embryo development, implantation, early pregnancy, genetics, genetic diagnosis, oncology, infectious disease, surgery, contraception, infertility treatment, psychology, ethics and social issues.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信