PCSCN:用于大规模心音图数据清理和增强的质量评估模型

IF 4.8 2区 医学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
Yongzhuo Jiang , Hongbo Yang , Tao Guo , Hong Deng , Weilian Wang
{"title":"PCSCN:用于大规模心音图数据清理和增强的质量评估模型","authors":"Yongzhuo Jiang ,&nbsp;Hongbo Yang ,&nbsp;Tao Guo ,&nbsp;Hong Deng ,&nbsp;Weilian Wang","doi":"10.1016/j.cmpb.2025.108977","DOIUrl":null,"url":null,"abstract":"<div><h3>Background:</h3><div>The acquisition of large-scale phonocardiogram (PCG) data is crucial for clinical research and has significantly advanced the application of data-driven heart sound classification models. Owing to the complexity of clinical environments, the quality of collected PCG data cannot be guaranteed. The exclusion of low-quality signals is essential for conducting a reliable PCG analysis.</div></div><div><h3>Methods:</h3><div>This study introduces a novel quality assessment model, the Parallel Channel Sequence Convolutional Network (PCSCN). The PCSCN automatically and accurately detects and removes low-quality PCG signals, thereby improving dataset reliability and usability. Unlike previous methods that rely on manually extracting numerous statistical features, the PCSCN employs low-complexity features and a multi-channel sequence architecture, offering greater accuracy and efficiency. In addition, this study applies the PCSCN in a controlled experiment for database cleaning and enhancement to verify the impact of this process on downstream data-driven PCG classification models.</div></div><div><h3>Results:</h3><div>When tested on a public dataset, PCSCN achieved an accuracy of 95.45% and an F1-score of 95.44%. In a database cleaning task involving PCG data from 7220 subjects, PCSCN completed the task in 515.76s. Furthermore, in the controlled experiment, the PCG classification model trained with PCSCN-enhanced data demonstrated enhanced performance across multiple metrics.</div></div><div><h3>Conclusions:</h3><div>The PCSCN is both reliable and efficient, and is poised to play a pivotal role in the cleaning of large-scale PCG databases. By enhancing and improving the quality of the training data, the PCSCN significantly strengthens the clinical decision-making capabilities of PCG classification models, thereby elevating their value for clinical research and application.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"271 ","pages":"Article 108977"},"PeriodicalIF":4.8000,"publicationDate":"2025-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"PCSCN: A quality assessment model for large-scale phonocardiogram data cleaning and enhancement\",\"authors\":\"Yongzhuo Jiang ,&nbsp;Hongbo Yang ,&nbsp;Tao Guo ,&nbsp;Hong Deng ,&nbsp;Weilian Wang\",\"doi\":\"10.1016/j.cmpb.2025.108977\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background:</h3><div>The acquisition of large-scale phonocardiogram (PCG) data is crucial for clinical research and has significantly advanced the application of data-driven heart sound classification models. Owing to the complexity of clinical environments, the quality of collected PCG data cannot be guaranteed. The exclusion of low-quality signals is essential for conducting a reliable PCG analysis.</div></div><div><h3>Methods:</h3><div>This study introduces a novel quality assessment model, the Parallel Channel Sequence Convolutional Network (PCSCN). The PCSCN automatically and accurately detects and removes low-quality PCG signals, thereby improving dataset reliability and usability. Unlike previous methods that rely on manually extracting numerous statistical features, the PCSCN employs low-complexity features and a multi-channel sequence architecture, offering greater accuracy and efficiency. In addition, this study applies the PCSCN in a controlled experiment for database cleaning and enhancement to verify the impact of this process on downstream data-driven PCG classification models.</div></div><div><h3>Results:</h3><div>When tested on a public dataset, PCSCN achieved an accuracy of 95.45% and an F1-score of 95.44%. In a database cleaning task involving PCG data from 7220 subjects, PCSCN completed the task in 515.76s. Furthermore, in the controlled experiment, the PCG classification model trained with PCSCN-enhanced data demonstrated enhanced performance across multiple metrics.</div></div><div><h3>Conclusions:</h3><div>The PCSCN is both reliable and efficient, and is poised to play a pivotal role in the cleaning of large-scale PCG databases. By enhancing and improving the quality of the training data, the PCSCN significantly strengthens the clinical decision-making capabilities of PCG classification models, thereby elevating their value for clinical research and application.</div></div>\",\"PeriodicalId\":10624,\"journal\":{\"name\":\"Computer methods and programs in biomedicine\",\"volume\":\"271 \",\"pages\":\"Article 108977\"},\"PeriodicalIF\":4.8000,\"publicationDate\":\"2025-07-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer methods and programs in biomedicine\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0169260725003943\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer methods and programs in biomedicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169260725003943","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

摘要

背景:大规模心音图(PCG)数据的获取对临床研究至关重要,并显著推进了数据驱动的心音分类模型的应用。由于临床环境的复杂性,所采集的PCG数据的质量无法保证。排除低质量信号对于进行可靠的PCG分析至关重要。方法:本研究引入一种新的质量评估模型——并行通道序列卷积网络(PCSCN)。PCSCN能够自动准确地检测和去除低质量的PCG信号,从而提高数据集的可靠性和可用性。与以往依赖于手动提取大量统计特征的方法不同,PCSCN采用低复杂度特征和多通道序列架构,提供更高的准确性和效率。此外,本研究将PCSCN应用于数据库清洗和增强的对照实验中,验证该过程对下游数据驱动的PCG分类模型的影响。结果:在公共数据集上进行测试时,PCSCN的准确率为95.45%,f1得分为95.44%。在涉及7220名受试者的PCG数据的数据库清理任务中,PCSCN在515.76秒内完成任务。此外,在对照实验中,使用pcscn增强数据训练的PCG分类模型在多个指标上表现出更高的性能。结论:PCSCN可靠、高效,有望在大规模PCG数据库的清洗中发挥关键作用。PCSCN通过增强和提高训练数据的质量,显著增强了PCG分类模型的临床决策能力,从而提升了其临床研究和应用价值。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
PCSCN: A quality assessment model for large-scale phonocardiogram data cleaning and enhancement

Background:

The acquisition of large-scale phonocardiogram (PCG) data is crucial for clinical research and has significantly advanced the application of data-driven heart sound classification models. Owing to the complexity of clinical environments, the quality of collected PCG data cannot be guaranteed. The exclusion of low-quality signals is essential for conducting a reliable PCG analysis.

Methods:

This study introduces a novel quality assessment model, the Parallel Channel Sequence Convolutional Network (PCSCN). The PCSCN automatically and accurately detects and removes low-quality PCG signals, thereby improving dataset reliability and usability. Unlike previous methods that rely on manually extracting numerous statistical features, the PCSCN employs low-complexity features and a multi-channel sequence architecture, offering greater accuracy and efficiency. In addition, this study applies the PCSCN in a controlled experiment for database cleaning and enhancement to verify the impact of this process on downstream data-driven PCG classification models.

Results:

When tested on a public dataset, PCSCN achieved an accuracy of 95.45% and an F1-score of 95.44%. In a database cleaning task involving PCG data from 7220 subjects, PCSCN completed the task in 515.76s. Furthermore, in the controlled experiment, the PCG classification model trained with PCSCN-enhanced data demonstrated enhanced performance across multiple metrics.

Conclusions:

The PCSCN is both reliable and efficient, and is poised to play a pivotal role in the cleaning of large-scale PCG databases. By enhancing and improving the quality of the training data, the PCSCN significantly strengthens the clinical decision-making capabilities of PCG classification models, thereby elevating their value for clinical research and application.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Computer methods and programs in biomedicine
Computer methods and programs in biomedicine 工程技术-工程:生物医学
CiteScore
12.30
自引率
6.60%
发文量
601
审稿时长
135 days
期刊介绍: To encourage the development of formal computing methods, and their application in biomedical research and medical practice, by illustration of fundamental principles in biomedical informatics research; to stimulate basic research into application software design; to report the state of research of biomedical information processing projects; to report new computer methodologies applied in biomedical areas; the eventual distribution of demonstrable software to avoid duplication of effort; to provide a forum for discussion and improvement of existing software; to optimize contact between national organizations and regional user groups by promoting an international exchange of information on formal methods, standards and software in biomedicine. Computer Methods and Programs in Biomedicine covers computing methodology and software systems derived from computing science for implementation in all aspects of biomedical research and medical practice. It is designed to serve: biochemists; biologists; geneticists; immunologists; neuroscientists; pharmacologists; toxicologists; clinicians; epidemiologists; psychiatrists; psychologists; cardiologists; chemists; (radio)physicists; computer scientists; programmers and systems analysts; biomedical, clinical, electrical and other engineers; teachers of medical informatics and users of educational software.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信