{"title":"Improved $F_{0}$ Estimation based on $\\ell_{2}$-norm Regularized TV-CAR Analysis using Bone-Conducted filter with an Adaptive Pre-Emphasis *","authors":"K. Funaki","doi":"10.1109/ISPACS57703.2022.10082816","DOIUrl":null,"url":null,"abstract":"Linear Prediction (LP) used in the CELP speech coding is the most successful speech analysis method. However, it has drawbacks, such as low estimation accuracy due to its least-squares scheme. We proposed time-varying complex AR (TV-CAR) speech analysis based on MMSE criterion, robust criterion, and LASSO analysis and evaluated it for $F_{0}$ estimation of speech. We also proposed $\\ell_{2}$-norm regularized TV-CAR analysis, Regularized LP (RLP)-based, Time-RLP (TRLP)-based and their combined methods, and evaluated them on $F_{0}$ estimation based on the IRAPT using complex residual signals. In addition, a bone-conducted (BC) pre-filter has already been introduced to improve performance. This paper proposes an improved $F_{0}$ estimation lntroducing adaptive pre-emphasis and shows its effectiveness.","PeriodicalId":410603,"journal":{"name":"2022 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISPACS57703.2022.10082816","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Linear Prediction (LP) used in the CELP speech coding is the most successful speech analysis method. However, it has drawbacks, such as low estimation accuracy due to its least-squares scheme. We proposed time-varying complex AR (TV-CAR) speech analysis based on MMSE criterion, robust criterion, and LASSO analysis and evaluated it for $F_{0}$ estimation of speech. We also proposed $\ell_{2}$-norm regularized TV-CAR analysis, Regularized LP (RLP)-based, Time-RLP (TRLP)-based and their combined methods, and evaluated them on $F_{0}$ estimation based on the IRAPT using complex residual signals. In addition, a bone-conducted (BC) pre-filter has already been introduced to improve performance. This paper proposes an improved $F_{0}$ estimation lntroducing adaptive pre-emphasis and shows its effectiveness.