Piotr Wygocki , Tomasz Gilewicz , Paweł Pawlik , Michał Siennicki , Michał Brzozowski , Joanna Kuśmierczyk-Kubiak , Urszula Sankowska , Robert Milewski , Agnieszka Chmielowska , Marta Kordalewska , Małgorzata Różańska , Waldemar Kuczyński , Bartłomiej Wojtasik , Piotr Sankowski , Juergen Liebermann
{"title":"Evaluating the concordance between AI-based and conventional embryo selection: implications for clinical decision-making","authors":"Piotr Wygocki , Tomasz Gilewicz , Paweł Pawlik , Michał Siennicki , Michał Brzozowski , Joanna Kuśmierczyk-Kubiak , Urszula Sankowska , Robert Milewski , Agnieszka Chmielowska , Marta Kordalewska , Małgorzata Różańska , Waldemar Kuczyński , Bartłomiej Wojtasik , Piotr Sankowski , Juergen Liebermann","doi":"10.1016/j.rbmo.2026.105502","DOIUrl":null,"url":null,"abstract":"<div><h3>Research question</h3><div>Can artificial intelligence (AI) standardize embryo scoring, and help embryologists to identify embryos with the highest likelihood of pregnancy and live birth?</div></div><div><h3>Design</h3><div>Multicentre, retrospective, head-to-head analysis across six centres in five countries. An embryo selection algorithm (ESA) and 20 embryologists of varying seniority independently selected the implanting (i.e. ‘best’) embryo from 1681 pairs (1237 pairs with biochemical pregnancy; 444 pairs with live births), with each pair comprising one embryo with a positive outcome and one embryo with a negative outcome. Accuracy was computed for the ESA and for the embryologists; differences were assessed using McNemar’s test.</div></div><div><h3>Results</h3><div>The accuracy of the ESA was 70.1%. The accuracy of individual embryologists ranged from 64.2% to 68.9% (mean value for embryologists 67.7%), and the accuracy of the expert committee (i.e. majority vote across the 20 embryologists) was 69.5%. McNemar’s test indicated a significant advantage for the ESA compared with 14 of 20 embryologists, and the mean value for embryologists (<em>P</em> < 0.05), but no significant difference between the ESA and the remaining six embryologists or the expert committee.</div></div><div><h3>Conclusions</h3><div>The ESA achieved higher accuracy than most individual embryologists and the mean value for embryologists, supporting its potential as a standardized adjunct to expert judgement. Confirmation of effectiveness and generalizability requires adequately powered, prospective multicentre trials.</div></div>","PeriodicalId":21134,"journal":{"name":"Reproductive biomedicine online","volume":"52 4","pages":"Article 105502"},"PeriodicalIF":3.5000,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Reproductive biomedicine online","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S147264832600043X","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/1/23 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"OBSTETRICS & GYNECOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Research question
Can artificial intelligence (AI) standardize embryo scoring, and help embryologists to identify embryos with the highest likelihood of pregnancy and live birth?
Design
Multicentre, retrospective, head-to-head analysis across six centres in five countries. An embryo selection algorithm (ESA) and 20 embryologists of varying seniority independently selected the implanting (i.e. ‘best’) embryo from 1681 pairs (1237 pairs with biochemical pregnancy; 444 pairs with live births), with each pair comprising one embryo with a positive outcome and one embryo with a negative outcome. Accuracy was computed for the ESA and for the embryologists; differences were assessed using McNemar’s test.
Results
The accuracy of the ESA was 70.1%. The accuracy of individual embryologists ranged from 64.2% to 68.9% (mean value for embryologists 67.7%), and the accuracy of the expert committee (i.e. majority vote across the 20 embryologists) was 69.5%. McNemar’s test indicated a significant advantage for the ESA compared with 14 of 20 embryologists, and the mean value for embryologists (P < 0.05), but no significant difference between the ESA and the remaining six embryologists or the expert committee.
Conclusions
The ESA achieved higher accuracy than most individual embryologists and the mean value for embryologists, supporting its potential as a standardized adjunct to expert judgement. Confirmation of effectiveness and generalizability requires adequately powered, prospective multicentre trials.
期刊介绍:
Reproductive BioMedicine Online covers the formation, growth and differentiation of the human embryo. It is intended to bring to public attention new research on biological and clinical research on human reproduction and the human embryo including relevant studies on animals. It is published by a group of scientists and clinicians working in these fields of study. Its audience comprises researchers, clinicians, practitioners, academics and patients.
Context:
The period of human embryonic growth covered is between the formation of the primordial germ cells in the fetus until mid-pregnancy. High quality research on lower animals is included if it helps to clarify the human situation. Studies progressing to birth and later are published if they have a direct bearing on events in the earlier stages of pregnancy.