Evaluating the concordance between AI-based and conventional embryo selection: implications for clinical decision-making

IF 3.5 2区医学 Q1 OBSTETRICS & GYNECOLOGY

Reproductive biomedicine online Pub Date : 2026-04-01 Epub Date: 2026-01-23 DOI:10.1016/j.rbmo.2026.105502

Piotr Wygocki , Tomasz Gilewicz , Paweł Pawlik , Michał Siennicki , Michał Brzozowski , Joanna Kuśmierczyk-Kubiak , Urszula Sankowska , Robert Milewski , Agnieszka Chmielowska , Marta Kordalewska , Małgorzata Różańska , Waldemar Kuczyński , Bartłomiej Wojtasik , Piotr Sankowski , Juergen Liebermann

{"title":"Evaluating the concordance between AI-based and conventional embryo selection: implications for clinical decision-making","authors":"Piotr Wygocki , Tomasz Gilewicz , Paweł Pawlik , Michał Siennicki , Michał Brzozowski , Joanna Kuśmierczyk-Kubiak , Urszula Sankowska , Robert Milewski , Agnieszka Chmielowska , Marta Kordalewska , Małgorzata Różańska , Waldemar Kuczyński , Bartłomiej Wojtasik , Piotr Sankowski , Juergen Liebermann","doi":"10.1016/j.rbmo.2026.105502","DOIUrl":null,"url":null,"abstract":"<div><h3>Research question</h3><div>Can artificial intelligence (AI) standardize embryo scoring, and help embryologists to identify embryos with the highest likelihood of pregnancy and live birth?</div></div><div><h3>Design</h3><div>Multicentre, retrospective, head-to-head analysis across six centres in five countries. An embryo selection algorithm (ESA) and 20 embryologists of varying seniority independently selected the implanting (i.e. ‘best’) embryo from 1681 pairs (1237 pairs with biochemical pregnancy; 444 pairs with live births), with each pair comprising one embryo with a positive outcome and one embryo with a negative outcome. Accuracy was computed for the ESA and for the embryologists; differences were assessed using McNemar’s test.</div></div><div><h3>Results</h3><div>The accuracy of the ESA was 70.1%. The accuracy of individual embryologists ranged from 64.2% to 68.9% (mean value for embryologists 67.7%), and the accuracy of the expert committee (i.e. majority vote across the 20 embryologists) was 69.5%. McNemar’s test indicated a significant advantage for the ESA compared with 14 of 20 embryologists, and the mean value for embryologists (<em>P</em> < 0.05), but no significant difference between the ESA and the remaining six embryologists or the expert committee.</div></div><div><h3>Conclusions</h3><div>The ESA achieved higher accuracy than most individual embryologists and the mean value for embryologists, supporting its potential as a standardized adjunct to expert judgement. Confirmation of effectiveness and generalizability requires adequately powered, prospective multicentre trials.</div></div>","PeriodicalId":21134,"journal":{"name":"Reproductive biomedicine online","volume":"52 4","pages":"Article 105502"},"PeriodicalIF":3.5000,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Reproductive biomedicine online","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S147264832600043X","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/1/23 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"OBSTETRICS & GYNECOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Research question

Can artificial intelligence (AI) standardize embryo scoring, and help embryologists to identify embryos with the highest likelihood of pregnancy and live birth?

Design

Multicentre, retrospective, head-to-head analysis across six centres in five countries. An embryo selection algorithm (ESA) and 20 embryologists of varying seniority independently selected the implanting (i.e. ‘best’) embryo from 1681 pairs (1237 pairs with biochemical pregnancy; 444 pairs with live births), with each pair comprising one embryo with a positive outcome and one embryo with a negative outcome. Accuracy was computed for the ESA and for the embryologists; differences were assessed using McNemar’s test.

Results

The accuracy of the ESA was 70.1%. The accuracy of individual embryologists ranged from 64.2% to 68.9% (mean value for embryologists 67.7%), and the accuracy of the expert committee (i.e. majority vote across the 20 embryologists) was 69.5%. McNemar’s test indicated a significant advantage for the ESA compared with 14 of 20 embryologists, and the mean value for embryologists (P < 0.05), but no significant difference between the ESA and the remaining six embryologists or the expert committee.

Conclusions

The ESA achieved higher accuracy than most individual embryologists and the mean value for embryologists, supporting its potential as a standardized adjunct to expert judgement. Confirmation of effectiveness and generalizability requires adequately powered, prospective multicentre trials.

查看原文本刊更多论文

评估人工智能与传统胚胎选择的一致性：对临床决策的影响。

研究问题：人工智能（AI）能否标准化胚胎评分，并帮助胚胎学家识别最有可能怀孕和活产的胚胎？设计：在5个国家的6个中心进行多中心、回顾性、正面分析。胚胎选择算法（ESA）和20名不同资历的胚胎学家独立选择了植入(即。“最佳”)胚胎来自1681对（1237对生化妊娠，444对活产），每对包括一个阳性结果的胚胎和一个阴性结果的胚胎。为ESA和胚胎学家计算了精度；使用McNemar试验评估差异。结果：ESA的准确率为70.1%。个体胚胎学家的准确率从64.2%到68.9%（胚胎学家的平均值为67.7%），专家委员会（即20名胚胎学家的多数投票）的准确率为69.5%。McNemar的测试表明，ESA与20名胚胎学家中的14名相比具有显著优势，与胚胎学家的平均值相比具有显著优势（P < 0.05），但ESA与其余6名胚胎学家或专家委员会之间没有显著差异。结论：ESA比大多数个体胚胎学家和胚胎学家的平均值具有更高的准确性，支持其作为专家判断的标准化辅助手段的潜力。确认有效性和普遍性需要充分有力的前瞻性多中心试验。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Reproductive biomedicine online 医学-妇产科学

CiteScore

7.20

自引率

7.50%

发文量

391

审稿时长

50 days

期刊介绍： Reproductive BioMedicine Online covers the formation, growth and differentiation of the human embryo. It is intended to bring to public attention new research on biological and clinical research on human reproduction and the human embryo including relevant studies on animals. It is published by a group of scientists and clinicians working in these fields of study. Its audience comprises researchers, clinicians, practitioners, academics and patients. Context: The period of human embryonic growth covered is between the formation of the primordial germ cells in the fetus until mid-pregnancy. High quality research on lower animals is included if it helps to clarify the human situation. Studies progressing to birth and later are published if they have a direct bearing on events in the earlier stages of pregnancy.