[A multi-feature fusion-based model for fetal orientation classification from intrapartum ultrasound videos]
Ziyu Zheng, Xiaying Yang, Shengjie Wu, Shijie Zhang, Guorong Lyu, Peizhong Liu, Jun Wang, Shaozheng He
南方医科大学学报 (Journal of Southern Medical University), 2025, 45(7): 1563-1570. doi: 10.12122/j.issn.1673-4254.2025.07.24
Objectives: To construct an intelligent analysis model based on multi-feature fusion for classifying fetal orientation from intrapartum ultrasound videos.
Methods: The proposed model consists of the Input, Backbone Network, and Classification Head modules. The Input module performs data augmentation to improve sample quality and the generalization ability of the model. The Backbone Network extracts features using YOLOv8 combined with the CBAM, ECA, and PSA attention mechanisms and the AIFI feature interaction module. The Classification Head consists of a convolutional layer and a softmax function that outputs the final probability of each class. Physicians annotated the key structures (the eyes, face, head, thalamus, and spine) with bounding boxes for model training to improve classification accuracy for the occiput anterior, occiput posterior, and occiput transverse positions.
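The paper does not include code; the following is a minimal PyTorch sketch of two of the named components, the ECA attention block and the conv + softmax Classification Head. Module names, channel sizes, and the three-class layout (occiput anterior/posterior/transverse) are assumptions for illustration, not the authors' implementation.

```python
import torch
import torch.nn as nn

class ECA(nn.Module):
    """Efficient Channel Attention: a 1D convolution over pooled channel descriptors."""
    def __init__(self, channels: int, k_size: int = 3):
        super().__init__()
        self.avg_pool = nn.AdaptiveAvgPool2d(1)
        self.conv = nn.Conv1d(1, 1, kernel_size=k_size, padding=k_size // 2, bias=False)
        self.sigmoid = nn.Sigmoid()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (B, C, H, W) -> per-channel descriptor (B, C, 1, 1)
        y = self.avg_pool(x)
        # Treat channels as a sequence for the 1D conv: (B, C, 1, 1) -> (B, 1, C)
        y = self.conv(y.squeeze(-1).transpose(-1, -2)).transpose(-1, -2).unsqueeze(-1)
        # Rescale each channel by its learned attention weight
        return x * self.sigmoid(y)

class ClassificationHead(nn.Module):
    """Convolutional layer + softmax over the three assumed fetal positions (OA, OP, OT)."""
    def __init__(self, in_channels: int, num_classes: int = 3):
        super().__init__()
        self.conv = nn.Conv2d(in_channels, num_classes, kernel_size=1)
        self.pool = nn.AdaptiveAvgPool2d(1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        logits = self.pool(self.conv(x)).flatten(1)  # (B, num_classes)
        return torch.softmax(logits, dim=1)          # per-class probabilities
```

In practice a head like this would be trained on raw logits with cross-entropy loss, with the softmax applied at inference to obtain the per-class probabilities the abstract describes.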
Results: The experimental results showed that the proposed model performed well in the fetal orientation classification task, with a classification accuracy of 0.984, an area under the PR curve (average precision) of 0.993, an area under the ROC curve of 0.984, and a kappa consistency score of 0.974. The predictions of the deep learning model were highly consistent with the actual classifications.
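The four reported metrics can be computed from model outputs with scikit-learn; the sketch below assumes integer labels and an (N, 3) array of softmax probabilities. The abstract does not specify the evaluation code or averaging scheme, so the macro averaging here is an assumption.

```python
from sklearn.metrics import (accuracy_score, average_precision_score,
                             roc_auc_score, cohen_kappa_score)
from sklearn.preprocessing import label_binarize

def report_metrics(y_true, y_prob):
    """y_true: integer labels (e.g., 0=OA, 1=OP, 2=OT); y_prob: (N, 3) softmax outputs."""
    y_pred = y_prob.argmax(axis=1)
    y_bin = label_binarize(y_true, classes=[0, 1, 2])  # one-hot for PR-curve metrics
    return {
        "accuracy": accuracy_score(y_true, y_pred),
        "auprc (average precision)": average_precision_score(y_bin, y_prob, average="macro"),
        "auroc": roc_auc_score(y_true, y_prob, multi_class="ovr", average="macro"),
        "kappa": cohen_kappa_score(y_true, y_pred),
    }
```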
Conclusions: The multi-feature fusion model proposed in this study can efficiently and accurately classify fetal orientation in intrapartum ultrasound videos.