M. Goetz-Fu , M. Haller , T. Collins , N. Begusic , F. Jochum , Y. Keeza , J. Uwineza , J. Marescaux , A.S. Weingertner , N. Sananès , A. Hostettler
{"title":"Development and temporal validation of a deep learning model for automatic fetal biometry from ultrasound videos","authors":"M. Goetz-Fu , M. Haller , T. Collins , N. Begusic , F. Jochum , Y. Keeza , J. Uwineza , J. Marescaux , A.S. Weingertner , N. Sananès , A. Hostettler","doi":"10.1016/j.jogoh.2025.103039","DOIUrl":null,"url":null,"abstract":"<div><h3>Objectives</h3><div>The objective was to develop an artificial intelligence (AI)-based system, using deep neural network (DNN) technology, to automatically detect standard fetal planes during video capture, measure fetal biometry parameters and estimate fetal weight.</div></div><div><h3>Methods</h3><div>A standard plane recognition DNN was trained to classify ultrasound images into four categories: head circumference (HC), abdominal circumference (AC), femur length (FL) standard planes, or ‘other’. The recognized standard plane images were subsequently processed by three fetal biometry DNNs, automatically measuring HC, AC and FL. Fetal weight was then estimated with the Hadlock 3 formula. The training dataset consisted of 16,626 images. A prospective temporal validation was then conducted using an independent set of 281 ultrasound videos of healthy fetuses. Fetal weight and biometry measurements were compared against an expert sonographer. Two less experienced sonographers were used as controls.</div></div><div><h3>Results</h3><div>The AI system obtained a significantly lower absolute relative measurement error in fetal weight estimation than the controls (AI vs. medium-level: <em>p</em> = 0.032, AI vs. beginner: <em>p</em> < 1e-8), so in AC measurements (AI vs. medium-level: <em>p</em> = 1.72e-04, AI vs. beginner: <em>p</em> < 1e-06). Average absolute relative measurement errors of AI versus expert were: 0.96 % (S.D. 0.79 %) for HC, 1.56 % (S.D. 1.39 %) for AC, 1.77 % (S.D. 1.46 %) for FL and 3.10 % (S.D. 2.74 %) for fetal weight estimation.</div></div><div><h3>Conclusion</h3><div>The AI system produced similar biometry measurements and fetal weight estimation to those of the expert sonographer. It is a promising tool to enhance non-expert sonographers’ performance and reproducibility in fetal biometry measurements, and to reduce inter-operator variability.</div></div>","PeriodicalId":15871,"journal":{"name":"Journal of gynecology obstetrics and human reproduction","volume":"54 10","pages":"Article 103039"},"PeriodicalIF":1.6000,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of gynecology obstetrics and human reproduction","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2468784725001369","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"OBSTETRICS & GYNECOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Objectives
The objective was to develop an artificial intelligence (AI)-based system, using deep neural network (DNN) technology, to automatically detect standard fetal planes during video capture, measure fetal biometry parameters and estimate fetal weight.
Methods
A standard plane recognition DNN was trained to classify ultrasound images into four categories: head circumference (HC), abdominal circumference (AC), femur length (FL) standard planes, or ‘other’. The recognized standard plane images were subsequently processed by three fetal biometry DNNs, automatically measuring HC, AC and FL. Fetal weight was then estimated with the Hadlock 3 formula. The training dataset consisted of 16,626 images. A prospective temporal validation was then conducted using an independent set of 281 ultrasound videos of healthy fetuses. Fetal weight and biometry measurements were compared against an expert sonographer. Two less experienced sonographers were used as controls.
Results
The AI system obtained a significantly lower absolute relative measurement error in fetal weight estimation than the controls (AI vs. medium-level: p = 0.032, AI vs. beginner: p < 1e-8), so in AC measurements (AI vs. medium-level: p = 1.72e-04, AI vs. beginner: p < 1e-06). Average absolute relative measurement errors of AI versus expert were: 0.96 % (S.D. 0.79 %) for HC, 1.56 % (S.D. 1.39 %) for AC, 1.77 % (S.D. 1.46 %) for FL and 3.10 % (S.D. 2.74 %) for fetal weight estimation.
Conclusion
The AI system produced similar biometry measurements and fetal weight estimation to those of the expert sonographer. It is a promising tool to enhance non-expert sonographers’ performance and reproducibility in fetal biometry measurements, and to reduce inter-operator variability.
期刊介绍:
Formerly known as Journal de Gynécologie Obstétrique et Biologie de la Reproduction, Journal of Gynecology Obstetrics and Human Reproduction is the official Academic publication of the French College of Obstetricians and Gynecologists (Collège National des Gynécologues et Obstétriciens Français / CNGOF).
J Gynecol Obstet Hum Reprod publishes monthly, in English, research papers and techniques in the fields of Gynecology, Obstetrics, Neonatology and Human Reproduction: (guest) editorials, original articles, reviews, updates, technical notes, case reports, letters to the editor and guidelines.
Original works include clinical or laboratory investigations and clinical or equipment reports. Reviews include narrative reviews, systematic reviews and meta-analyses.