基于盲模型自适应的窄带语音带宽扩展

2007 15th European Signal Processing Conference Pub Date : 2007-09-03 DOI:10.5281/ZENODO.40685

Sheng Yao, C. Chan

{"title":"基于盲模型自适应的窄带语音带宽扩展","authors":"Sheng Yao, C. Chan","doi":"10.5281/ZENODO.40685","DOIUrl":null,"url":null,"abstract":"Traditional telephone transmission network has speech frequency upper-limit below 4 kHz. The narrowband telephone speech (0-4 kHz) sounds muffled as compared with the original wideband speech (0-8 kHz). Artificial bandwidth extension is an economical way of enhancing the quality of narrowband speech without modifying the infrastructure of the network. Existing bandwidth extension methods usually include off-line learning phase and on-line enhancing phase. The performance of these systems depends largely on the consistency of wideband training data and actual narrowband input data. In real situation, input speeches usually mismatch with off-line training speeches, leading to serious model errors. To avoid the data mismatch, we propose a method based on blind adaptation of linear dynamic model. The benefit of our method is the exclusion of off-line training phase and experiment results show that our systems is comparable with those data-oriented systems in the measurements of highband spectral distortion. When data mismatch occurs, our system outperforms those systems.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"264 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Bandwidth extension of narrowband speech based on blind model adaptation\",\"authors\":\"Sheng Yao, C. Chan\",\"doi\":\"10.5281/ZENODO.40685\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Traditional telephone transmission network has speech frequency upper-limit below 4 kHz. The narrowband telephone speech (0-4 kHz) sounds muffled as compared with the original wideband speech (0-8 kHz). Artificial bandwidth extension is an economical way of enhancing the quality of narrowband speech without modifying the infrastructure of the network. Existing bandwidth extension methods usually include off-line learning phase and on-line enhancing phase. The performance of these systems depends largely on the consistency of wideband training data and actual narrowband input data. In real situation, input speeches usually mismatch with off-line training speeches, leading to serious model errors. To avoid the data mismatch, we propose a method based on blind adaptation of linear dynamic model. The benefit of our method is the exclusion of off-line training phase and experiment results show that our systems is comparable with those data-oriented systems in the measurements of highband spectral distortion. When data mismatch occurs, our system outperforms those systems.\",\"PeriodicalId\":176384,\"journal\":{\"name\":\"2007 15th European Signal Processing Conference\",\"volume\":\"264 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-09-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2007 15th European Signal Processing Conference\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5281/ZENODO.40685\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 15th European Signal Processing Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5281/ZENODO.40685","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

传统电话传输网的语音频率上限在4khz以下。与原来的宽带语音(0-8 kHz)相比，窄带电话语音(0-4 kHz)听起来很低沉。人工带宽扩展是在不改变网络基础设施的情况下提高窄带语音质量的一种经济的方法。现有的带宽扩展方法通常包括离线学习阶段和在线增强阶段。这些系统的性能在很大程度上取决于宽带训练数据和实际窄带输入数据的一致性。在实际情况下，输入演讲通常与离线训练演讲不匹配，导致严重的模型错误。为了避免数据不匹配，我们提出了一种基于线性动态模型的盲适应方法。该方法的优点是排除了离线训练阶段，实验结果表明，我们的系统在测量高频频谱失真方面与那些面向数据的系统相当。当数据不匹配发生时，我们的系统优于那些系统。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Bandwidth extension of narrowband speech based on blind model adaptation

Traditional telephone transmission network has speech frequency upper-limit below 4 kHz. The narrowband telephone speech (0-4 kHz) sounds muffled as compared with the original wideband speech (0-8 kHz). Artificial bandwidth extension is an economical way of enhancing the quality of narrowband speech without modifying the infrastructure of the network. Existing bandwidth extension methods usually include off-line learning phase and on-line enhancing phase. The performance of these systems depends largely on the consistency of wideband training data and actual narrowband input data. In real situation, input speeches usually mismatch with off-line training speeches, leading to serious model errors. To avoid the data mismatch, we propose a method based on blind adaptation of linear dynamic model. The benefit of our method is the exclusion of off-line training phase and experiment results show that our systems is comparable with those data-oriented systems in the measurements of highband spectral distortion. When data mismatch occurs, our system outperforms those systems.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2007 15th European Signal Processing Conference

自引率

0.00%

发文量