基于盲模型自适应的窄带语音带宽扩展

Sheng Yao, C. Chan
{"title":"基于盲模型自适应的窄带语音带宽扩展","authors":"Sheng Yao, C. Chan","doi":"10.5281/ZENODO.40685","DOIUrl":null,"url":null,"abstract":"Traditional telephone transmission network has speech frequency upper-limit below 4 kHz. The narrowband telephone speech (0-4 kHz) sounds muffled as compared with the original wideband speech (0-8 kHz). Artificial bandwidth extension is an economical way of enhancing the quality of narrowband speech without modifying the infrastructure of the network. Existing bandwidth extension methods usually include off-line learning phase and on-line enhancing phase. The performance of these systems depends largely on the consistency of wideband training data and actual narrowband input data. In real situation, input speeches usually mismatch with off-line training speeches, leading to serious model errors. To avoid the data mismatch, we propose a method based on blind adaptation of linear dynamic model. The benefit of our method is the exclusion of off-line training phase and experiment results show that our systems is comparable with those data-oriented systems in the measurements of highband spectral distortion. When data mismatch occurs, our system outperforms those systems.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"264 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Bandwidth extension of narrowband speech based on blind model adaptation\",\"authors\":\"Sheng Yao, C. Chan\",\"doi\":\"10.5281/ZENODO.40685\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Traditional telephone transmission network has speech frequency upper-limit below 4 kHz. The narrowband telephone speech (0-4 kHz) sounds muffled as compared with the original wideband speech (0-8 kHz). Artificial bandwidth extension is an economical way of enhancing the quality of narrowband speech without modifying the infrastructure of the network. Existing bandwidth extension methods usually include off-line learning phase and on-line enhancing phase. The performance of these systems depends largely on the consistency of wideband training data and actual narrowband input data. In real situation, input speeches usually mismatch with off-line training speeches, leading to serious model errors. To avoid the data mismatch, we propose a method based on blind adaptation of linear dynamic model. The benefit of our method is the exclusion of off-line training phase and experiment results show that our systems is comparable with those data-oriented systems in the measurements of highband spectral distortion. When data mismatch occurs, our system outperforms those systems.\",\"PeriodicalId\":176384,\"journal\":{\"name\":\"2007 15th European Signal Processing Conference\",\"volume\":\"264 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-09-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2007 15th European Signal Processing Conference\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5281/ZENODO.40685\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 15th European Signal Processing Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5281/ZENODO.40685","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

传统电话传输网的语音频率上限在4khz以下。与原来的宽带语音(0-8 kHz)相比,窄带电话语音(0-4 kHz)听起来很低沉。人工带宽扩展是在不改变网络基础设施的情况下提高窄带语音质量的一种经济的方法。现有的带宽扩展方法通常包括离线学习阶段和在线增强阶段。这些系统的性能在很大程度上取决于宽带训练数据和实际窄带输入数据的一致性。在实际情况下,输入演讲通常与离线训练演讲不匹配,导致严重的模型错误。为了避免数据不匹配,我们提出了一种基于线性动态模型的盲适应方法。该方法的优点是排除了离线训练阶段,实验结果表明,我们的系统在测量高频频谱失真方面与那些面向数据的系统相当。当数据不匹配发生时,我们的系统优于那些系统。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Bandwidth extension of narrowband speech based on blind model adaptation
Traditional telephone transmission network has speech frequency upper-limit below 4 kHz. The narrowband telephone speech (0-4 kHz) sounds muffled as compared with the original wideband speech (0-8 kHz). Artificial bandwidth extension is an economical way of enhancing the quality of narrowband speech without modifying the infrastructure of the network. Existing bandwidth extension methods usually include off-line learning phase and on-line enhancing phase. The performance of these systems depends largely on the consistency of wideband training data and actual narrowband input data. In real situation, input speeches usually mismatch with off-line training speeches, leading to serious model errors. To avoid the data mismatch, we propose a method based on blind adaptation of linear dynamic model. The benefit of our method is the exclusion of off-line training phase and experiment results show that our systems is comparable with those data-oriented systems in the measurements of highband spectral distortion. When data mismatch occurs, our system outperforms those systems.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信