[基于机器学习的经阴道三维超声定量参数诊断t型子宫模型的开发]。

Q3 Medicine

Zhonghua yi xue za zhi Pub Date : 2025-08-12 DOI:10.3760/cma.j.cn112137-20250302-00493

S J Li, Y Wang, R Huang, L M Yang, X D Lyu, X W Huang, X B Peng, D M Song, N Ma, Y Xiao, Q Y Zhou, Y Guo, N Liang, S Liu, K Gao, Y N Yan, E L Xia

{"title":"[基于机器学习的经阴道三维超声定量参数诊断t型子宫模型的开发]。","authors":"S J Li, Y Wang, R Huang, L M Yang, X D Lyu, X W Huang, X B Peng, D M Song, N Ma, Y Xiao, Q Y Zhou, Y Guo, N Liang, S Liu, K Gao, Y N Yan, E L Xia","doi":"10.3760/cma.j.cn112137-20250302-00493","DOIUrl":null,"url":null,"abstract":"Objective: To develop a machine learning diagnostic model for T-shaped uterus based on quantitative parameters from 3D transvaginal ultrasound. Methods: A retrospective cross-sectional study was conducted, recruiting 304 patients who visited the hysteroscopy centre of Fuxing Hospital, Beijing, China, between July 2021 and June 2024 for reasons such as \"infertility or recurrent pregnancy loss\" and other adverse obstetric histories. Twelve experts, including seven clinicians and five sonographers, from Fuxing Hospital and Beijing Obstetrics and Gynecology Hospital of Capital Medical University, Peking University People's Hospital, and Beijing Hospital, independently and anonymously assessed the diagnosis of T-shaped uterus using a modified Delphi method. Based on the consensus results, 56 cases were classified into the T-shaped uterus group and 248 cases into the non-T-shaped uterus group. A total of 7 clinical features and 14 sonographic features were initially included. Features demonstrating significant diagnostic impact were selected using 10-fold cross-validated LASSO (Least Absolute Shrinkage and Selection Operator) regression. Four machine learning algorithms [logistic regression (LR), decision tree (DT), random forest (RF), and support vector machine (SVM)] were subsequently implemented to develop T-shaped uterus diagnostic models. Using the Python random module, the patient dataset was randomly divided into five subsets, each maintaining the original class distribution (T-shaped uterus: non-T-shaped uterus ≈ 1∶4) and a balanced number of samples between the two categories. Five-fold cross-validation was performed, with four subsets used for training and one for validation in each round, to enhance the reliability of model evaluation. Model performance was rigorously assessed using established metrics: area under the curve (AUC) of receiver operator characteristic (ROC) curve, sensitivity, specificity, precision, and F1-score. In the RF model, feature importance was assessed by the mean decrease in Gini impurity attributed to each variable. Results: A total of 304 patients had a mean age of (35±4) years, and the age of the T-shaped uterus group was (35±5) years; the age of the non-T-shaped uterus group was (34±4) years.. Eight features with non-zero coefficients were selected by LASSO regression, including average lateral wall indentation width, average lateral wall indentation angle, upper cavity depth, endometrial thickness, uterine cavity area, cavity width at level of lateral wall indentation, angle formed by the bilateral lateral walls, and average cornual angle (coefficient: 0.125, -0.064,-0.037,-0.030,-0.026,-0.025,-0.025 and -0.024, respectively). The RF model showed the best diagnostic performance: in training set, AUC was 0.986 (95%CI: 0.980-0.992), sensitivity was 0.978, specificity 0.946, precision 0.802, and F1-score 0.881; in testing set, AUC was 0.948 (95%CI: 0.911-0.985), sensitivity was 0.873, specificity 0.919, precision 0.716, and F1-score 0.784. RF model feature importance analysis revealed that average lateral wall indentation width, upper cavity depth, and average lateral wall indentation angle were the top three features (over 65% in total), playing a decisive role in model prediction. Conclusion: The machine learning models developed in this study, particularly the RF model, are promising for the diagnosis of T-shaped uterus, offering new perspectives and technical support for clinical practice.","PeriodicalId":24023,"journal":{"name":"Zhonghua yi xue za zhi","volume":"105 30","pages":"2551-2557"},"PeriodicalIF":0.0000,"publicationDate":"2025-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"[Development of a machine learning-based diagnostic model for T-shaped uterus using transvaginal 3D ultrasound quantitative parameters].\",\"authors\":\"S J Li, Y Wang, R Huang, L M Yang, X D Lyu, X W Huang, X B Peng, D M Song, N Ma, Y Xiao, Q Y Zhou, Y Guo, N Liang, S Liu, K Gao, Y N Yan, E L Xia\",\"doi\":\"10.3760/cma.j.cn112137-20250302-00493\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Objective: To develop a machine learning diagnostic model for T-shaped uterus based on quantitative parameters from 3D transvaginal ultrasound. Methods: A retrospective cross-sectional study was conducted, recruiting 304 patients who visited the hysteroscopy centre of Fuxing Hospital, Beijing, China, between July 2021 and June 2024 for reasons such as \\\"infertility or recurrent pregnancy loss\\\" and other adverse obstetric histories. Twelve experts, including seven clinicians and five sonographers, from Fuxing Hospital and Beijing Obstetrics and Gynecology Hospital of Capital Medical University, Peking University People's Hospital, and Beijing Hospital, independently and anonymously assessed the diagnosis of T-shaped uterus using a modified Delphi method. Based on the consensus results, 56 cases were classified into the T-shaped uterus group and 248 cases into the non-T-shaped uterus group. A total of 7 clinical features and 14 sonographic features were initially included. Features demonstrating significant diagnostic impact were selected using 10-fold cross-validated LASSO (Least Absolute Shrinkage and Selection Operator) regression. Four machine learning algorithms [logistic regression (LR), decision tree (DT), random forest (RF), and support vector machine (SVM)] were subsequently implemented to develop T-shaped uterus diagnostic models. Using the Python random module, the patient dataset was randomly divided into five subsets, each maintaining the original class distribution (T-shaped uterus: non-T-shaped uterus ≈ 1∶4) and a balanced number of samples between the two categories. Five-fold cross-validation was performed, with four subsets used for training and one for validation in each round, to enhance the reliability of model evaluation. Model performance was rigorously assessed using established metrics: area under the curve (AUC) of receiver operator characteristic (ROC) curve, sensitivity, specificity, precision, and F1-score. In the RF model, feature importance was assessed by the mean decrease in Gini impurity attributed to each variable. Results: A total of 304 patients had a mean age of (35±4) years, and the age of the T-shaped uterus group was (35±5) years; the age of the non-T-shaped uterus group was (34±4) years.. Eight features with non-zero coefficients were selected by LASSO regression, including average lateral wall indentation width, average lateral wall indentation angle, upper cavity depth, endometrial thickness, uterine cavity area, cavity width at level of lateral wall indentation, angle formed by the bilateral lateral walls, and average cornual angle (coefficient: 0.125, -0.064,-0.037,-0.030,-0.026,-0.025,-0.025 and -0.024, respectively). The RF model showed the best diagnostic performance: in training set, AUC was 0.986 (95%CI: 0.980-0.992), sensitivity was 0.978, specificity 0.946, precision 0.802, and F1-score 0.881; in testing set, AUC was 0.948 (95%CI: 0.911-0.985), sensitivity was 0.873, specificity 0.919, precision 0.716, and F1-score 0.784. RF model feature importance analysis revealed that average lateral wall indentation width, upper cavity depth, and average lateral wall indentation angle were the top three features (over 65% in total), playing a decisive role in model prediction. Conclusion: The machine learning models developed in this study, particularly the RF model, are promising for the diagnosis of T-shaped uterus, offering new perspectives and technical support for clinical practice.\",\"PeriodicalId\":24023,\"journal\":{\"name\":\"Zhonghua yi xue za zhi\",\"volume\":\"105 30\",\"pages\":\"2551-2557\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-08-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Zhonghua yi xue za zhi\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3760/cma.j.cn112137-20250302-00493\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Medicine\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Zhonghua yi xue za zhi","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3760/cma.j.cn112137-20250302-00493","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Medicine","Score":null,"Total":0}

引用次数: 0

摘要

目的：建立基于三维经阴道超声定量参数的t型子宫机器学习诊断模型。方法：采用回顾性横断面研究方法，招募2021年7月至2024年6月在中国北京复兴医院宫腔镜中心就诊的304例因“不孕症或复发性流产”等不良产科史的患者。来自首都医科大学附属复兴医院、北京妇产科医院、北京大学人民医院、北京医院的7名临床医生、5名超声医师等12名专家采用改进的德尔菲法对t型子宫的诊断进行独立匿名评估。根据一致结果，56例为t型子宫组，248例为非t型子宫组。最初共包括7个临床特征和14个超声特征。使用10倍交叉验证LASSO（最小绝对收缩和选择算子）回归选择具有显著诊断影响的特征。随后采用四种机器学习算法[逻辑回归（LR），决策树（DT），随机森林（RF）和支持向量机（SVM）]来开发t型子宫诊断模型。使用Python随机模块，将患者数据集随机分为5个子集，每个子集保持原始的类分布（t型子宫：非t型子宫≈1∶4），并且两类之间的样本数量平衡。进行五重交叉验证，每轮使用四个子集进行训练，一个子集进行验证，以提高模型评估的可靠性。使用既定指标严格评估模型的性能：受试者操作者特征（ROC）曲线下面积（AUC）、灵敏度、特异性、精度和f1评分。在RF模型中，特征重要性通过归因于每个变量的基尼杂质的平均减少来评估。结果：304例患者平均年龄（35±4）岁，t型子宫组年龄（35±5）岁；非t型子宫组年龄为（34±4）岁。采用LASSO回归选择8个非零系数特征，分别为平均侧壁压痕宽度、平均侧壁压痕角度、上腔深度、子宫内膜厚度、子宫腔面积、侧壁压痕水平腔宽、双侧侧壁形成的角度、平均子宫角（系数分别为0.125、-0.064、-0.037、-0.030、-0.026、-0.025、-0.025、-0.024）。该模型在训练集上的AUC为0.986 (95%CI: 0.980 ~ 0.992)，灵敏度为0.978，特异性为0.946，精密度为0.802，f1评分为0.881；检验集的AUC为0.948 (95%CI: 0.911 ~ 0.985)，敏感性0.873，特异性0.919，精密度0.716，f1评分0.784。RF模型特征重要性分析显示，平均侧壁压痕宽度、上腔深度和平均侧壁压痕角度是前3个特征（占比超过65%），对模型预测具有决定性作用。结论：本研究建立的机器学习模型，尤其是RF模型，在t型子宫的诊断中具有较好的应用前景，为临床实践提供了新的视角和技术支持。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

[Development of a machine learning-based diagnostic model for T-shaped uterus using transvaginal 3D ultrasound quantitative parameters].

Objective: To develop a machine learning diagnostic model for T-shaped uterus based on quantitative parameters from 3D transvaginal ultrasound. Methods: A retrospective cross-sectional study was conducted, recruiting 304 patients who visited the hysteroscopy centre of Fuxing Hospital, Beijing, China, between July 2021 and June 2024 for reasons such as "infertility or recurrent pregnancy loss" and other adverse obstetric histories. Twelve experts, including seven clinicians and five sonographers, from Fuxing Hospital and Beijing Obstetrics and Gynecology Hospital of Capital Medical University, Peking University People's Hospital, and Beijing Hospital, independently and anonymously assessed the diagnosis of T-shaped uterus using a modified Delphi method. Based on the consensus results, 56 cases were classified into the T-shaped uterus group and 248 cases into the non-T-shaped uterus group. A total of 7 clinical features and 14 sonographic features were initially included. Features demonstrating significant diagnostic impact were selected using 10-fold cross-validated LASSO (Least Absolute Shrinkage and Selection Operator) regression. Four machine learning algorithms [logistic regression (LR), decision tree (DT), random forest (RF), and support vector machine (SVM)] were subsequently implemented to develop T-shaped uterus diagnostic models. Using the Python random module, the patient dataset was randomly divided into five subsets, each maintaining the original class distribution (T-shaped uterus: non-T-shaped uterus ≈ 1∶4) and a balanced number of samples between the two categories. Five-fold cross-validation was performed, with four subsets used for training and one for validation in each round, to enhance the reliability of model evaluation. Model performance was rigorously assessed using established metrics: area under the curve (AUC) of receiver operator characteristic (ROC) curve, sensitivity, specificity, precision, and F1-score. In the RF model, feature importance was assessed by the mean decrease in Gini impurity attributed to each variable. Results: A total of 304 patients had a mean age of (35±4) years, and the age of the T-shaped uterus group was (35±5) years; the age of the non-T-shaped uterus group was (34±4) years.. Eight features with non-zero coefficients were selected by LASSO regression, including average lateral wall indentation width, average lateral wall indentation angle, upper cavity depth, endometrial thickness, uterine cavity area, cavity width at level of lateral wall indentation, angle formed by the bilateral lateral walls, and average cornual angle (coefficient: 0.125, -0.064,-0.037,-0.030,-0.026,-0.025,-0.025 and -0.024, respectively). The RF model showed the best diagnostic performance: in training set, AUC was 0.986 (95%CI: 0.980-0.992), sensitivity was 0.978, specificity 0.946, precision 0.802, and F1-score 0.881; in testing set, AUC was 0.948 (95%CI: 0.911-0.985), sensitivity was 0.873, specificity 0.919, precision 0.716, and F1-score 0.784. RF model feature importance analysis revealed that average lateral wall indentation width, upper cavity depth, and average lateral wall indentation angle were the top three features (over 65% in total), playing a decisive role in model prediction. Conclusion: The machine learning models developed in this study, particularly the RF model, are promising for the diagnosis of T-shaped uterus, offering new perspectives and technical support for clinical practice.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Zhonghua yi xue za zhi Medicine-Medicine (all)

CiteScore

0.80

自引率

0.00%

发文量

400