Jing Wu , Zhenyi Ge , Helin Huang , Hairui Wang , Nan Li , Chunqiang Hu , Cuizhen Pan , Xiaomei Wu
{"title":"Attention-guided model for mitral regurgitation analysis based on multi-task learning","authors":"Jing Wu , Zhenyi Ge , Helin Huang , Hairui Wang , Nan Li , Chunqiang Hu , Cuizhen Pan , Xiaomei Wu","doi":"10.1016/j.bspc.2024.107169","DOIUrl":null,"url":null,"abstract":"<div><div>For automated analysis of mitral valve regurgitation in non-Doppler-based 2D echocardiography, there is limited work on combining quantitative tasks for cardiac targets, such as semantic segmentation and motion tracking, with qualitative detection and etiological classification of mitral regurgitation, thus overlooking the common features that exist among these tasks. Therefore, we proposed a multi-task learning model called Attention-guided ResUNet-MTL (abbreviated as ARUNet-MTL), to address the task from a holistic view. Specifically, the model was built on the U-shaped architecture and emphasized the importance of inherent correlation among tasks by allowing all three tasks to share the encoder structure. Meanwhile, an attention mechanism called MSA was incorporated to improve the temporal continuity of segmentation image sequences by leveraging the bidirectional deformation field information achieved in motion tracking tasks. Besides, during the training phase of the model, the loss function was designed to assimilate two key aspects: the fidelity of cardiac anatomical structures in segmentation and motion tracking results, and the enforcement smooth transitions and coherence between consecutive frames in the sequence. Through 5-fold cross validation, the accuracy for mitral regurgitation etiological classification was 0.8946 and 0.9179 at the video level and subject level, respectively, while the macro-F1 score was 0.8976 and 0.9176, respectively. The segmentation results for the left atrium, left ventricle, and mitral valve yielded Dice coefficients of 0.9438, 0.9157, and 0.7951. Additionally, validation experiments were performed on two public datasets to verify the robustness of the model’s segmentation and motion tracking branches.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"101 ","pages":"Article 107169"},"PeriodicalIF":4.9000,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809424012278","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
Abstract
For automated analysis of mitral valve regurgitation in non-Doppler-based 2D echocardiography, there is limited work on combining quantitative tasks for cardiac targets, such as semantic segmentation and motion tracking, with qualitative detection and etiological classification of mitral regurgitation, thus overlooking the common features that exist among these tasks. Therefore, we proposed a multi-task learning model called Attention-guided ResUNet-MTL (abbreviated as ARUNet-MTL), to address the task from a holistic view. Specifically, the model was built on the U-shaped architecture and emphasized the importance of inherent correlation among tasks by allowing all three tasks to share the encoder structure. Meanwhile, an attention mechanism called MSA was incorporated to improve the temporal continuity of segmentation image sequences by leveraging the bidirectional deformation field information achieved in motion tracking tasks. Besides, during the training phase of the model, the loss function was designed to assimilate two key aspects: the fidelity of cardiac anatomical structures in segmentation and motion tracking results, and the enforcement smooth transitions and coherence between consecutive frames in the sequence. Through 5-fold cross validation, the accuracy for mitral regurgitation etiological classification was 0.8946 and 0.9179 at the video level and subject level, respectively, while the macro-F1 score was 0.8976 and 0.9176, respectively. The segmentation results for the left atrium, left ventricle, and mitral valve yielded Dice coefficients of 0.9438, 0.9157, and 0.7951. Additionally, validation experiments were performed on two public datasets to verify the robustness of the model’s segmentation and motion tracking branches.
期刊介绍:
Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management.
Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.