{"title":"基于广义先验的语音信号的通用MMSE联合检测与估计","authors":"Siavash Shajari , Mahdi Nangir","doi":"10.1016/j.dsp.2025.105405","DOIUrl":null,"url":null,"abstract":"<div><div>In this paper, we present a comprehensive model for a simultaneous detection and estimation of Discrete Fourier Transform (DFT) coefficients of speech signals. Our proposed model suggests using a Generalized Gamma Probability Density Function (PDF) to represent the magnitudes of the DFT coefficients of speech signals. Classical probability density functions (PDFs), such as the Rayleigh PDF, are inadequate for accurately modeling speech signal. These models often rely on oversimplified assumptions about the statistical properties of speech signals. These assumptions limit their effectiveness in practical applications. Our study aims to derive a comprehensive simultaneous detection and estimation model based on the Generalized Gamma Distribution (GΓD). We employ the Minimum Mean Square Error (MMSE) estimator to the magnitudes of Discrete Fourier Transform (DFT) coefficients in the Short-Time Fourier Transform (STFT) domain. This approach allows us to effectively model the statistical properties of speech signals using GΓD. Our analyses demonstrate that adopting the GΓD framework can enhance the performance of speech signal detection and estimation in noisy environments, as evidenced by objective evaluation measures.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"167 ","pages":"Article 105405"},"PeriodicalIF":3.0000,"publicationDate":"2025-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A general MMSE joint detection and estimation of speech signal based on generalized gamma priors\",\"authors\":\"Siavash Shajari , Mahdi Nangir\",\"doi\":\"10.1016/j.dsp.2025.105405\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In this paper, we present a comprehensive model for a simultaneous detection and estimation of Discrete Fourier Transform (DFT) coefficients of speech signals. Our proposed model suggests using a Generalized Gamma Probability Density Function (PDF) to represent the magnitudes of the DFT coefficients of speech signals. Classical probability density functions (PDFs), such as the Rayleigh PDF, are inadequate for accurately modeling speech signal. These models often rely on oversimplified assumptions about the statistical properties of speech signals. These assumptions limit their effectiveness in practical applications. Our study aims to derive a comprehensive simultaneous detection and estimation model based on the Generalized Gamma Distribution (GΓD). We employ the Minimum Mean Square Error (MMSE) estimator to the magnitudes of Discrete Fourier Transform (DFT) coefficients in the Short-Time Fourier Transform (STFT) domain. This approach allows us to effectively model the statistical properties of speech signals using GΓD. Our analyses demonstrate that adopting the GΓD framework can enhance the performance of speech signal detection and estimation in noisy environments, as evidenced by objective evaluation measures.</div></div>\",\"PeriodicalId\":51011,\"journal\":{\"name\":\"Digital Signal Processing\",\"volume\":\"167 \",\"pages\":\"Article 105405\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2025-06-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Digital Signal Processing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1051200425004270\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1051200425004270","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
A general MMSE joint detection and estimation of speech signal based on generalized gamma priors
In this paper, we present a comprehensive model for a simultaneous detection and estimation of Discrete Fourier Transform (DFT) coefficients of speech signals. Our proposed model suggests using a Generalized Gamma Probability Density Function (PDF) to represent the magnitudes of the DFT coefficients of speech signals. Classical probability density functions (PDFs), such as the Rayleigh PDF, are inadequate for accurately modeling speech signal. These models often rely on oversimplified assumptions about the statistical properties of speech signals. These assumptions limit their effectiveness in practical applications. Our study aims to derive a comprehensive simultaneous detection and estimation model based on the Generalized Gamma Distribution (GΓD). We employ the Minimum Mean Square Error (MMSE) estimator to the magnitudes of Discrete Fourier Transform (DFT) coefficients in the Short-Time Fourier Transform (STFT) domain. This approach allows us to effectively model the statistical properties of speech signals using GΓD. Our analyses demonstrate that adopting the GΓD framework can enhance the performance of speech signal detection and estimation in noisy environments, as evidenced by objective evaluation measures.
期刊介绍:
Digital Signal Processing: A Review Journal is one of the oldest and most established journals in the field of signal processing yet it aims to be the most innovative. The Journal invites top quality research articles at the frontiers of research in all aspects of signal processing. Our objective is to provide a platform for the publication of ground-breaking research in signal processing with both academic and industrial appeal.
The journal has a special emphasis on statistical signal processing methodology such as Bayesian signal processing, and encourages articles on emerging applications of signal processing such as:
• big data• machine learning• internet of things• information security• systems biology and computational biology,• financial time series analysis,• autonomous vehicles,• quantum computing,• neuromorphic engineering,• human-computer interaction and intelligent user interfaces,• environmental signal processing,• geophysical signal processing including seismic signal processing,• chemioinformatics and bioinformatics,• audio, visual and performance arts,• disaster management and prevention,• renewable energy,