基于广义先验的语音信号的通用MMSE联合检测与估计

IF 3 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC
Siavash Shajari , Mahdi Nangir
{"title":"基于广义先验的语音信号的通用MMSE联合检测与估计","authors":"Siavash Shajari ,&nbsp;Mahdi Nangir","doi":"10.1016/j.dsp.2025.105405","DOIUrl":null,"url":null,"abstract":"<div><div>In this paper, we present a comprehensive model for a simultaneous detection and estimation of Discrete Fourier Transform (DFT) coefficients of speech signals. Our proposed model suggests using a Generalized Gamma Probability Density Function (PDF) to represent the magnitudes of the DFT coefficients of speech signals. Classical probability density functions (PDFs), such as the Rayleigh PDF, are inadequate for accurately modeling speech signal. These models often rely on oversimplified assumptions about the statistical properties of speech signals. These assumptions limit their effectiveness in practical applications. Our study aims to derive a comprehensive simultaneous detection and estimation model based on the Generalized Gamma Distribution (GΓD). We employ the Minimum Mean Square Error (MMSE) estimator to the magnitudes of Discrete Fourier Transform (DFT) coefficients in the Short-Time Fourier Transform (STFT) domain. This approach allows us to effectively model the statistical properties of speech signals using GΓD. Our analyses demonstrate that adopting the GΓD framework can enhance the performance of speech signal detection and estimation in noisy environments, as evidenced by objective evaluation measures.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"167 ","pages":"Article 105405"},"PeriodicalIF":3.0000,"publicationDate":"2025-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A general MMSE joint detection and estimation of speech signal based on generalized gamma priors\",\"authors\":\"Siavash Shajari ,&nbsp;Mahdi Nangir\",\"doi\":\"10.1016/j.dsp.2025.105405\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In this paper, we present a comprehensive model for a simultaneous detection and estimation of Discrete Fourier Transform (DFT) coefficients of speech signals. Our proposed model suggests using a Generalized Gamma Probability Density Function (PDF) to represent the magnitudes of the DFT coefficients of speech signals. Classical probability density functions (PDFs), such as the Rayleigh PDF, are inadequate for accurately modeling speech signal. These models often rely on oversimplified assumptions about the statistical properties of speech signals. These assumptions limit their effectiveness in practical applications. Our study aims to derive a comprehensive simultaneous detection and estimation model based on the Generalized Gamma Distribution (GΓD). We employ the Minimum Mean Square Error (MMSE) estimator to the magnitudes of Discrete Fourier Transform (DFT) coefficients in the Short-Time Fourier Transform (STFT) domain. This approach allows us to effectively model the statistical properties of speech signals using GΓD. Our analyses demonstrate that adopting the GΓD framework can enhance the performance of speech signal detection and estimation in noisy environments, as evidenced by objective evaluation measures.</div></div>\",\"PeriodicalId\":51011,\"journal\":{\"name\":\"Digital Signal Processing\",\"volume\":\"167 \",\"pages\":\"Article 105405\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2025-06-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Digital Signal Processing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1051200425004270\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1051200425004270","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

摘要

本文提出了一种语音信号离散傅立叶变换(DFT)系数同时检测和估计的综合模型。我们提出的模型建议使用广义伽马概率密度函数(PDF)来表示语音信号的DFT系数的大小。经典的概率密度函数(PDF),如瑞利PDF,不足以准确地模拟语音信号。这些模型通常依赖于对语音信号统计特性的过于简化的假设。这些假设限制了它们在实际应用中的有效性。我们的研究旨在推导一个基于广义伽玛分布的综合同时检测和估计模型(GΓD)。我们在短时傅里叶变换(STFT)域中对离散傅里叶变换(DFT)系数的幅度采用最小均方误差(MMSE)估计。这种方法允许我们使用GΓD有效地建模语音信号的统计特性。我们的分析表明,采用GΓD框架可以提高噪声环境下语音信号检测和估计的性能,客观评价指标证明了这一点。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A general MMSE joint detection and estimation of speech signal based on generalized gamma priors
In this paper, we present a comprehensive model for a simultaneous detection and estimation of Discrete Fourier Transform (DFT) coefficients of speech signals. Our proposed model suggests using a Generalized Gamma Probability Density Function (PDF) to represent the magnitudes of the DFT coefficients of speech signals. Classical probability density functions (PDFs), such as the Rayleigh PDF, are inadequate for accurately modeling speech signal. These models often rely on oversimplified assumptions about the statistical properties of speech signals. These assumptions limit their effectiveness in practical applications. Our study aims to derive a comprehensive simultaneous detection and estimation model based on the Generalized Gamma Distribution (GΓD). We employ the Minimum Mean Square Error (MMSE) estimator to the magnitudes of Discrete Fourier Transform (DFT) coefficients in the Short-Time Fourier Transform (STFT) domain. This approach allows us to effectively model the statistical properties of speech signals using GΓD. Our analyses demonstrate that adopting the GΓD framework can enhance the performance of speech signal detection and estimation in noisy environments, as evidenced by objective evaluation measures.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Digital Signal Processing
Digital Signal Processing 工程技术-工程:电子与电气
CiteScore
5.30
自引率
17.20%
发文量
435
审稿时长
66 days
期刊介绍: Digital Signal Processing: A Review Journal is one of the oldest and most established journals in the field of signal processing yet it aims to be the most innovative. The Journal invites top quality research articles at the frontiers of research in all aspects of signal processing. Our objective is to provide a platform for the publication of ground-breaking research in signal processing with both academic and industrial appeal. The journal has a special emphasis on statistical signal processing methodology such as Bayesian signal processing, and encourages articles on emerging applications of signal processing such as: • big data• machine learning• internet of things• information security• systems biology and computational biology,• financial time series analysis,• autonomous vehicles,• quantum computing,• neuromorphic engineering,• human-computer interaction and intelligent user interfaces,• environmental signal processing,• geophysical signal processing including seismic signal processing,• chemioinformatics and bioinformatics,• audio, visual and performance arts,• disaster management and prevention,• renewable energy,
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信