{"title":"基于Radon变换和多帧相关的噪声语音基音检测算法","authors":"Xianwu Zhang, Chenyang Liu, Jiashen Li","doi":"10.1016/j.dsp.2025.105415","DOIUrl":null,"url":null,"abstract":"<div><div>In this article we propose a novel fundamental frequency detection algorithm for noisy speech signals. The algorithm combines Radon transform and multi-frame signals correlation to extract the fundamental frequency, that is, pitch period from voiced frames in degraded speech signals. Two publicly available datasets, the CSTR and TIMIT datasets, were used to evaluate the performance of the algorithm and other state-of-the-art pitch detection algorithms under various additive daily environmental noises conditions and multiple signal-to-noise ratios. As far as the Gross Pitch Error and Mean Absolute Error metrics are concerned, the results demonstrate that the proposed method achieves better results among all the algorithms in general.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"167 ","pages":"Article 105415"},"PeriodicalIF":2.9000,"publicationDate":"2025-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A novel pitch detection algorithm for noisy speech signal based on Radon transform and multi-frame correlation\",\"authors\":\"Xianwu Zhang, Chenyang Liu, Jiashen Li\",\"doi\":\"10.1016/j.dsp.2025.105415\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In this article we propose a novel fundamental frequency detection algorithm for noisy speech signals. The algorithm combines Radon transform and multi-frame signals correlation to extract the fundamental frequency, that is, pitch period from voiced frames in degraded speech signals. Two publicly available datasets, the CSTR and TIMIT datasets, were used to evaluate the performance of the algorithm and other state-of-the-art pitch detection algorithms under various additive daily environmental noises conditions and multiple signal-to-noise ratios. As far as the Gross Pitch Error and Mean Absolute Error metrics are concerned, the results demonstrate that the proposed method achieves better results among all the algorithms in general.</div></div>\",\"PeriodicalId\":51011,\"journal\":{\"name\":\"Digital Signal Processing\",\"volume\":\"167 \",\"pages\":\"Article 105415\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2025-06-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Digital Signal Processing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1051200425004373\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1051200425004373","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
A novel pitch detection algorithm for noisy speech signal based on Radon transform and multi-frame correlation
In this article we propose a novel fundamental frequency detection algorithm for noisy speech signals. The algorithm combines Radon transform and multi-frame signals correlation to extract the fundamental frequency, that is, pitch period from voiced frames in degraded speech signals. Two publicly available datasets, the CSTR and TIMIT datasets, were used to evaluate the performance of the algorithm and other state-of-the-art pitch detection algorithms under various additive daily environmental noises conditions and multiple signal-to-noise ratios. As far as the Gross Pitch Error and Mean Absolute Error metrics are concerned, the results demonstrate that the proposed method achieves better results among all the algorithms in general.
期刊介绍:
Digital Signal Processing: A Review Journal is one of the oldest and most established journals in the field of signal processing yet it aims to be the most innovative. The Journal invites top quality research articles at the frontiers of research in all aspects of signal processing. Our objective is to provide a platform for the publication of ground-breaking research in signal processing with both academic and industrial appeal.
The journal has a special emphasis on statistical signal processing methodology such as Bayesian signal processing, and encourages articles on emerging applications of signal processing such as:
• big data• machine learning• internet of things• information security• systems biology and computational biology,• financial time series analysis,• autonomous vehicles,• quantum computing,• neuromorphic engineering,• human-computer interaction and intelligent user interfaces,• environmental signal processing,• geophysical signal processing including seismic signal processing,• chemioinformatics and bioinformatics,• audio, visual and performance arts,• disaster management and prevention,• renewable energy,