具有透明带结合能力的人类精子的深度学习自动识别。

IF 8.3 Q1 OBSTETRICS & GYNECOLOGY
Human reproduction open Pub Date : 2025-05-10 eCollection Date: 2025-01-01 DOI:10.1093/hropen/hoaf024
Erica T Y Leung, Xianghan Mei, Brayden K M Lee, Kevin K W Lam, Cheuk-Lun Lee, Raymond H W Li, Ernest H Y Ng, William S B Yeung, Lequan Yu, Philip C N Chiu
{"title":"具有透明带结合能力的人类精子的深度学习自动识别。","authors":"Erica T Y Leung, Xianghan Mei, Brayden K M Lee, Kevin K W Lam, Cheuk-Lun Lee, Raymond H W Li, Ernest H Y Ng, William S B Yeung, Lequan Yu, Philip C N Chiu","doi":"10.1093/hropen/hoaf024","DOIUrl":null,"url":null,"abstract":"<p><strong>Study question: </strong>Can a deep-learning algorithm, independent of World Health Organization (WHO) sperm morphology grading, be used to identify human spermatozoa with zona pellucida (ZP)-binding capability in assisted reproductive technology (ART)?</p><p><strong>Summary answer: </strong>A novel deep-learning model, irrespective of the conventional semen analysis, was established to identify human spermatozoa capable of binding to ZP for predicting their fertilization potential.</p><p><strong>What is known already: </strong>Sperm morphology evaluation is crucial in semen analysis to investigate male infertility and to determine the appropriate insemination methods in ART. The current manual assessment, which relies on microscopically examining individual spermatozoa based on WHO criteria, has shown limited predictive power for fertilization outcomes due to its highly subjective, labour-intensive nature, and high inter-/intra-assay variations. Deep learning is a rapidly evolving method for automated image analysis. Recent studies have explored its potential for automating sperm morphology analysis. However, algorithms trained on manually annotated datasets using existing WHO criteria have had little success in predicting ART outcomes. To date, no study has established an independent set of morphology evaluation standards based on sperm fertilizing ability for clinical prediction.</p><p><strong>Study design size duration: </strong>Spare semen samples were collected from men undergoing premarital check-ups at a family planning clinic. Immature oocytes at germinal vesicle/metaphase I stage, or mature metaphase II oocytes were donated from women attending the infertility clinic for assisted reproduction treatments. Acrosome-intact, ZP-bound spermatozoa were collected by our previously modified spermatozoa-ZP coincubation assay. ZP-unbound spermatozoa were collected from normozoospermic samples with defective ZP-binding ability, as evidenced by complete fertilization failure following conventional <i>in vitro</i> fertilization (IVF) and the absence of ZP-bound spermatozoa on the inseminated oocytes. A total of 1083 Diff-Quik stained images of ZP-bound and unbound spermatozoa were collected to create a training database, with an additional 220 images serving as an independent test set. Clinical data were obtained from 117 men undergoing IVF due to male factor or unexplained infertility to validate the model's ability to generalize to new data. These participants were categorized into three groups based on their fertilization rates following IVF: low (0-40%), intermediate (41-70%), and high (71-100%).</p><p><strong>Participants/materials setting methods: </strong>A pre-trained VGG13 model was fine-tuned using our database to classify individual spermatozoa as either ZP-bound or unbound based on their automatically extracted morphological features. Confusion matrix was used to assess the model's classification performance, expressed in terms of accuracy, specificity, sensitivity, and precision rates. The area under the receiver-operating characteristic (ROC) curve (AUC) was utilized to measure the model's discriminative power. A 5-fold cross-validation was conducted on the training dataset to assess the model's performance on randomized subgroups. Saliency mapping was used to analyse pixel importance localized to the morphological features of sperm images. Clinical data of spermatozoa from three fertilization groups were used for clinical validation. Logistic ROC regression analysis was performed to evaluate the differences in predicted values between high and low fertilization groups, as indicated by AUC and <i>P</i>-values. Additionally, Youden's index was applied to determine a clinical threshold for predicting IVF fertilization outcome using the model.</p><p><strong>Main results and the role of chance: </strong>A VGG13 model was fine-tuned to distinguish images of spermatozoa capable of binding to the ZP based on their morphological features with high sensitivity (97.6%), specificity (96.0%), accuracy (96.7%), and precision (95.2%). The model exhibited low learning variance (average accuracy: 97.4%; sensitivity: 96.0%; and specificity: 98.5%) across subgroups, with primary emphasis on the sperm head and mid-pieces in all images as indicated by the pixel importance. Its discriminative performance was clinically validated on over 33 000 sperm images collected from three fertilization groups. Overall, the model exhibited excellent generalization ability as reflected by the strong correlation between the predicted percentages of spermatozoa with ZP-binding per sample and their fertilization rates. A clinical threshold of 4.9% (specificity: 89.3%; sensitivity: 90.0%) was established to differentiate sperm samples with normal and defective ZP-binding ability. By conducting pairwise comparisons among 30 patients, the predicted values generated by the model outperformed conventional semen analysis assessed by our in-house embryologists in identifying patients who were likely to experience failure with conventional IVF.</p><p><strong>Large scale data: </strong>N/A.</p><p><strong>Limitations reasons for caution: </strong>The model is currently designed for high-resolution, air-dried, Diff-Quik stained sperm samples, and further research is required to validate its classification performance across different image qualities with a larger sample size.</p><p><strong>Wider implications of the findings: </strong>This newly established method can identify couples at high risk of unexpected IVF fertilization failure, enabling clinicians to offer alternative insemination methods to reduce the likelihood of suboptimal fertilization outcomes.</p><p><strong>Study funding/competing interests: </strong>This study was supported in part by two Health and Medical Research Funds, the Food and Health Bureau, The Government of the HKSAR (07182446 and 11222236), and the Sanming Project of Medicine in Shenzhen (SZSM 202211014). Two provisional patent applications related to the data presented here have been filed on behalf of The University of Hong Kong (i. application no. 63/511,375; filing date: 30 June 2023; current status: active; applicant: The University of Hong Kong; ii. application no. US 63/567,147; filing date: 19 March 2024; current status: active; applicant: The University of Hong Kong). The authors declare that they have no other competing interests.</p>","PeriodicalId":73264,"journal":{"name":"Human reproduction open","volume":"2025 3","pages":"hoaf024"},"PeriodicalIF":8.3000,"publicationDate":"2025-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12145210/pdf/","citationCount":"0","resultStr":"{\"title\":\"Automatic identification of human spermatozoa with zona pellucida-binding capability using deep learning.\",\"authors\":\"Erica T Y Leung, Xianghan Mei, Brayden K M Lee, Kevin K W Lam, Cheuk-Lun Lee, Raymond H W Li, Ernest H Y Ng, William S B Yeung, Lequan Yu, Philip C N Chiu\",\"doi\":\"10.1093/hropen/hoaf024\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Study question: </strong>Can a deep-learning algorithm, independent of World Health Organization (WHO) sperm morphology grading, be used to identify human spermatozoa with zona pellucida (ZP)-binding capability in assisted reproductive technology (ART)?</p><p><strong>Summary answer: </strong>A novel deep-learning model, irrespective of the conventional semen analysis, was established to identify human spermatozoa capable of binding to ZP for predicting their fertilization potential.</p><p><strong>What is known already: </strong>Sperm morphology evaluation is crucial in semen analysis to investigate male infertility and to determine the appropriate insemination methods in ART. The current manual assessment, which relies on microscopically examining individual spermatozoa based on WHO criteria, has shown limited predictive power for fertilization outcomes due to its highly subjective, labour-intensive nature, and high inter-/intra-assay variations. Deep learning is a rapidly evolving method for automated image analysis. Recent studies have explored its potential for automating sperm morphology analysis. However, algorithms trained on manually annotated datasets using existing WHO criteria have had little success in predicting ART outcomes. To date, no study has established an independent set of morphology evaluation standards based on sperm fertilizing ability for clinical prediction.</p><p><strong>Study design size duration: </strong>Spare semen samples were collected from men undergoing premarital check-ups at a family planning clinic. Immature oocytes at germinal vesicle/metaphase I stage, or mature metaphase II oocytes were donated from women attending the infertility clinic for assisted reproduction treatments. Acrosome-intact, ZP-bound spermatozoa were collected by our previously modified spermatozoa-ZP coincubation assay. ZP-unbound spermatozoa were collected from normozoospermic samples with defective ZP-binding ability, as evidenced by complete fertilization failure following conventional <i>in vitro</i> fertilization (IVF) and the absence of ZP-bound spermatozoa on the inseminated oocytes. A total of 1083 Diff-Quik stained images of ZP-bound and unbound spermatozoa were collected to create a training database, with an additional 220 images serving as an independent test set. Clinical data were obtained from 117 men undergoing IVF due to male factor or unexplained infertility to validate the model's ability to generalize to new data. These participants were categorized into three groups based on their fertilization rates following IVF: low (0-40%), intermediate (41-70%), and high (71-100%).</p><p><strong>Participants/materials setting methods: </strong>A pre-trained VGG13 model was fine-tuned using our database to classify individual spermatozoa as either ZP-bound or unbound based on their automatically extracted morphological features. Confusion matrix was used to assess the model's classification performance, expressed in terms of accuracy, specificity, sensitivity, and precision rates. The area under the receiver-operating characteristic (ROC) curve (AUC) was utilized to measure the model's discriminative power. A 5-fold cross-validation was conducted on the training dataset to assess the model's performance on randomized subgroups. Saliency mapping was used to analyse pixel importance localized to the morphological features of sperm images. Clinical data of spermatozoa from three fertilization groups were used for clinical validation. Logistic ROC regression analysis was performed to evaluate the differences in predicted values between high and low fertilization groups, as indicated by AUC and <i>P</i>-values. Additionally, Youden's index was applied to determine a clinical threshold for predicting IVF fertilization outcome using the model.</p><p><strong>Main results and the role of chance: </strong>A VGG13 model was fine-tuned to distinguish images of spermatozoa capable of binding to the ZP based on their morphological features with high sensitivity (97.6%), specificity (96.0%), accuracy (96.7%), and precision (95.2%). The model exhibited low learning variance (average accuracy: 97.4%; sensitivity: 96.0%; and specificity: 98.5%) across subgroups, with primary emphasis on the sperm head and mid-pieces in all images as indicated by the pixel importance. Its discriminative performance was clinically validated on over 33 000 sperm images collected from three fertilization groups. Overall, the model exhibited excellent generalization ability as reflected by the strong correlation between the predicted percentages of spermatozoa with ZP-binding per sample and their fertilization rates. A clinical threshold of 4.9% (specificity: 89.3%; sensitivity: 90.0%) was established to differentiate sperm samples with normal and defective ZP-binding ability. By conducting pairwise comparisons among 30 patients, the predicted values generated by the model outperformed conventional semen analysis assessed by our in-house embryologists in identifying patients who were likely to experience failure with conventional IVF.</p><p><strong>Large scale data: </strong>N/A.</p><p><strong>Limitations reasons for caution: </strong>The model is currently designed for high-resolution, air-dried, Diff-Quik stained sperm samples, and further research is required to validate its classification performance across different image qualities with a larger sample size.</p><p><strong>Wider implications of the findings: </strong>This newly established method can identify couples at high risk of unexpected IVF fertilization failure, enabling clinicians to offer alternative insemination methods to reduce the likelihood of suboptimal fertilization outcomes.</p><p><strong>Study funding/competing interests: </strong>This study was supported in part by two Health and Medical Research Funds, the Food and Health Bureau, The Government of the HKSAR (07182446 and 11222236), and the Sanming Project of Medicine in Shenzhen (SZSM 202211014). Two provisional patent applications related to the data presented here have been filed on behalf of The University of Hong Kong (i. application no. 63/511,375; filing date: 30 June 2023; current status: active; applicant: The University of Hong Kong; ii. application no. US 63/567,147; filing date: 19 March 2024; current status: active; applicant: The University of Hong Kong). The authors declare that they have no other competing interests.</p>\",\"PeriodicalId\":73264,\"journal\":{\"name\":\"Human reproduction open\",\"volume\":\"2025 3\",\"pages\":\"hoaf024\"},\"PeriodicalIF\":8.3000,\"publicationDate\":\"2025-05-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12145210/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Human reproduction open\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1093/hropen/hoaf024\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q1\",\"JCRName\":\"OBSTETRICS & GYNECOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Human reproduction open","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/hropen/hoaf024","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"OBSTETRICS & GYNECOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

研究问题:在辅助生殖技术(ART)中,是否可以使用独立于世界卫生组织(WHO)精子形态分级的深度学习算法来识别具有透明带(ZP)结合能力的人类精子?摘要回答:建立了一个新的深度学习模型,不考虑传统的精液分析,以识别能够与ZP结合的人类精子,并预测其受精潜力。已知情况:精子形态评估在精液分析中至关重要,以调查男性不育症并确定ART中适当的人工授精方法。目前的人工评估依赖于根据世卫组织标准对单个精子进行显微镜检查,由于其高度主观、劳动密集的性质以及测定间/测定内的高度差异,对受精结果的预测能力有限。深度学习是一种快速发展的自动图像分析方法。最近的研究已经探索了其自动化精子形态分析的潜力。然而,在使用现有世卫组织标准的人工注释数据集上训练的算法在预测抗逆转录病毒治疗结果方面几乎没有成功。迄今为止,尚无研究建立一套独立的基于精子受精能力的形态学评价标准用于临床预测。研究设计规模持续时间:从在计划生育诊所接受婚前检查的男性中收集备用精液样本。生殖囊/中期I期的未成熟卵母细胞,或中期II期的成熟卵母细胞来自于不孕不育诊所接受辅助生殖治疗的妇女。顶体完整,结合zp的精子通过我们先前修改的精子- zp共孵育试验收集。从正常精子样本中收集未结合zp的精子,这些精子的zp结合能力存在缺陷,这表明常规体外受精(IVF)后完全受精失败,受精卵细胞上没有zp结合的精子。收集1083张结合和未结合精子的Diff-Quik染色图像作为训练数据库,另外220张图像作为独立的测试集。临床数据来自117名因男性因素或不明原因不育而接受体外受精的男性,以验证该模型推广新数据的能力。这些参与者根据体外受精后的受精率分为三组:低(0-40%)、中(41-70%)和高(71-100%)。参与者/材料设置方法:使用我们的数据库对预训练的VGG13模型进行微调,根据自动提取的形态特征将单个精子分类为zp结合或未结合。混淆矩阵用于评估模型的分类性能,以准确性、特异性、敏感性和准确率表示。采用受试者工作特征曲线下面积(AUC)来衡量模型的判别能力。在训练数据集上进行5倍交叉验证,以评估模型在随机子组上的性能。使用显著性映射来分析定位于精子图像形态特征的像素重要性。使用三个受精组精子的临床资料进行临床验证。采用Logistic ROC回归分析评价高、低施肥组预测值的差异,用AUC和p值表示。此外,应用约登指数来确定使用该模型预测体外受精结果的临床阈值。主要结果及机会的作用:对VGG13模型进行了微调,以区分能够结合ZP的精子图像,其灵敏度(97.6%)、特异性(96.0%)、准确度(96.7%)和精密度(95.2%)较高。该模型具有较低的学习方差(平均准确率为97.4%;灵敏度:96.0%;特异性:98.5%),在所有图像中,主要强调精子头部和中间部分,如像素重要性所示。从三个受精组收集的33000多个精子图像中,临床验证了其鉴别性能。总体而言,该模型具有良好的泛化能力,这体现在每个样本中与zp结合的精子的预测百分比与其受精率之间存在很强的相关性。临床阈值为4.9%(特异性:89.3%;灵敏度:90.0%),用于区分正常与缺陷的精子样本。通过对30例患者进行两两比较,该模型产生的预测值优于我们内部胚胎学家评估的传统精液分析,以确定可能经历传统试管婴儿失败的患者。大规模数据:无。 该模型目前是为高分辨率、风干、Diff-Quik染色的精子样本设计的,需要进一步的研究来验证其在不同图像质量和更大样本量下的分类性能。研究结果的更广泛意义:这种新建立的方法可以识别出意外IVF受精失败的高风险夫妇,使临床医生能够提供替代的人工授精方法,以减少次优受精结果的可能性。研究经费/竞争利益:本研究由两项卫生与医学研究基金、香港特别行政区政府食品及卫生局(07182446和11222236)和深圳三明医学项目(SZSM 202211014)提供部分支持。本署已代表香港大学提交两项与本网页所载资料有关的临时专利申请(申请编号:63/511,375;申请日期:2023年6月30日;当前状态:活跃;申请人:香港大学;2。应用程序没有。我们63/567,147;申请日期:2024年3月19日;当前状态:活跃;申请人:香港大学)。作者宣称他们没有其他利益冲突。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Automatic identification of human spermatozoa with zona pellucida-binding capability using deep learning.

Study question: Can a deep-learning algorithm, independent of World Health Organization (WHO) sperm morphology grading, be used to identify human spermatozoa with zona pellucida (ZP)-binding capability in assisted reproductive technology (ART)?

Summary answer: A novel deep-learning model, irrespective of the conventional semen analysis, was established to identify human spermatozoa capable of binding to ZP for predicting their fertilization potential.

What is known already: Sperm morphology evaluation is crucial in semen analysis to investigate male infertility and to determine the appropriate insemination methods in ART. The current manual assessment, which relies on microscopically examining individual spermatozoa based on WHO criteria, has shown limited predictive power for fertilization outcomes due to its highly subjective, labour-intensive nature, and high inter-/intra-assay variations. Deep learning is a rapidly evolving method for automated image analysis. Recent studies have explored its potential for automating sperm morphology analysis. However, algorithms trained on manually annotated datasets using existing WHO criteria have had little success in predicting ART outcomes. To date, no study has established an independent set of morphology evaluation standards based on sperm fertilizing ability for clinical prediction.

Study design size duration: Spare semen samples were collected from men undergoing premarital check-ups at a family planning clinic. Immature oocytes at germinal vesicle/metaphase I stage, or mature metaphase II oocytes were donated from women attending the infertility clinic for assisted reproduction treatments. Acrosome-intact, ZP-bound spermatozoa were collected by our previously modified spermatozoa-ZP coincubation assay. ZP-unbound spermatozoa were collected from normozoospermic samples with defective ZP-binding ability, as evidenced by complete fertilization failure following conventional in vitro fertilization (IVF) and the absence of ZP-bound spermatozoa on the inseminated oocytes. A total of 1083 Diff-Quik stained images of ZP-bound and unbound spermatozoa were collected to create a training database, with an additional 220 images serving as an independent test set. Clinical data were obtained from 117 men undergoing IVF due to male factor or unexplained infertility to validate the model's ability to generalize to new data. These participants were categorized into three groups based on their fertilization rates following IVF: low (0-40%), intermediate (41-70%), and high (71-100%).

Participants/materials setting methods: A pre-trained VGG13 model was fine-tuned using our database to classify individual spermatozoa as either ZP-bound or unbound based on their automatically extracted morphological features. Confusion matrix was used to assess the model's classification performance, expressed in terms of accuracy, specificity, sensitivity, and precision rates. The area under the receiver-operating characteristic (ROC) curve (AUC) was utilized to measure the model's discriminative power. A 5-fold cross-validation was conducted on the training dataset to assess the model's performance on randomized subgroups. Saliency mapping was used to analyse pixel importance localized to the morphological features of sperm images. Clinical data of spermatozoa from three fertilization groups were used for clinical validation. Logistic ROC regression analysis was performed to evaluate the differences in predicted values between high and low fertilization groups, as indicated by AUC and P-values. Additionally, Youden's index was applied to determine a clinical threshold for predicting IVF fertilization outcome using the model.

Main results and the role of chance: A VGG13 model was fine-tuned to distinguish images of spermatozoa capable of binding to the ZP based on their morphological features with high sensitivity (97.6%), specificity (96.0%), accuracy (96.7%), and precision (95.2%). The model exhibited low learning variance (average accuracy: 97.4%; sensitivity: 96.0%; and specificity: 98.5%) across subgroups, with primary emphasis on the sperm head and mid-pieces in all images as indicated by the pixel importance. Its discriminative performance was clinically validated on over 33 000 sperm images collected from three fertilization groups. Overall, the model exhibited excellent generalization ability as reflected by the strong correlation between the predicted percentages of spermatozoa with ZP-binding per sample and their fertilization rates. A clinical threshold of 4.9% (specificity: 89.3%; sensitivity: 90.0%) was established to differentiate sperm samples with normal and defective ZP-binding ability. By conducting pairwise comparisons among 30 patients, the predicted values generated by the model outperformed conventional semen analysis assessed by our in-house embryologists in identifying patients who were likely to experience failure with conventional IVF.

Large scale data: N/A.

Limitations reasons for caution: The model is currently designed for high-resolution, air-dried, Diff-Quik stained sperm samples, and further research is required to validate its classification performance across different image qualities with a larger sample size.

Wider implications of the findings: This newly established method can identify couples at high risk of unexpected IVF fertilization failure, enabling clinicians to offer alternative insemination methods to reduce the likelihood of suboptimal fertilization outcomes.

Study funding/competing interests: This study was supported in part by two Health and Medical Research Funds, the Food and Health Bureau, The Government of the HKSAR (07182446 and 11222236), and the Sanming Project of Medicine in Shenzhen (SZSM 202211014). Two provisional patent applications related to the data presented here have been filed on behalf of The University of Hong Kong (i. application no. 63/511,375; filing date: 30 June 2023; current status: active; applicant: The University of Hong Kong; ii. application no. US 63/567,147; filing date: 19 March 2024; current status: active; applicant: The University of Hong Kong). The authors declare that they have no other competing interests.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
15.50
自引率
0.00%
发文量
0
审稿时长
12 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信