A multi-band nonlinear oscillator model for speech

Conference Record of Thirty-Second Asilomar Conference on Signals, Systems and Computers (Cat. No.98CH36284) Pub Date : 1998-11-01 DOI:10.1109/ACSSC.1998.750882

H. Haas, G. Kubin


Citations: 23

Abstract

Nonlinear self-oscillating systems can model speech without an external excitation that drives a conventional filter model. However, they often do not give due consideration to perceptually important but weak signal components such as the higher formants of voiced speech. To overcome this problem, we propose two frequency-domain oscillator models: a bank of sub-band oscillators with individual oscillator states and a multi-band oscillator with a single joint state vector. Their state-transition map is approximated with compactly parameterized multivariate adaptive regression splines (MARS) and the systems are successfully tested in short-term prediction and synthesis experiments with sustained vowels.
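The sub-band idea in the abstract can be illustrated with a toy sketch. It is not the authors' implementation: the MARS fit is replaced by an ordinary least-squares fit on a fixed hinge-function basis (MARS selects its knots adaptively), the band split uses brick-wall FFT masks, and the test signal, band edges, embedding dimension, and all function names are assumptions chosen for illustration. Each band gets its own delay-embedded state and its own state-transition map, so a weak high-frequency component is fitted on its own scale instead of being swamped by the strong low band.

```python
import numpy as np

def band_split(x, fs, edges):
    """Split x into sub-bands with brick-wall FFT masks (edges in Hz)."""
    spectrum = np.fft.rfft(x)
    freqs = np.fft.rfftfreq(len(x), d=1.0 / fs)
    bands = []
    for lo, hi in zip(edges[:-1], edges[1:]):
        mask = (freqs >= lo) & (freqs < hi)
        bands.append(np.fft.irfft(spectrum * mask, n=len(x)))
    return bands

def delay_embed(x, dim, lag):
    """States [x[n], x[n-lag], ...] paired with next-sample targets x[n+1]."""
    start = (dim - 1) * lag
    n = np.arange(start, len(x) - 1)
    states = np.stack([x[n - k * lag] for k in range(dim)], axis=1)
    return states, x[n + 1]

def hinge_features(states, knots):
    """Bias plus additive hinge pairs max(0, s - t), max(0, t - s).

    Fixed knots are an assumption; real MARS chooses knots and
    interaction terms in forward and backward passes.
    """
    cols = [np.ones(len(states))]
    for j in range(states.shape[1]):
        for t in knots:
            cols.append(np.maximum(0.0, states[:, j] - t))
            cols.append(np.maximum(0.0, t - states[:, j]))
    return np.stack(cols, axis=1)

def fit_map(states, targets, knots):
    """Least-squares weights for the hinge-basis state-transition map."""
    w, *_ = np.linalg.lstsq(hinge_features(states, knots), targets, rcond=None)
    return w

def band_prediction_errors(fs=8000, n_samples=800, dim=3, lag=1):
    """Relative one-step prediction error per band on a held-out half."""
    t = np.arange(n_samples) / fs
    # Toy "vowel": strong low component plus a weak high component.
    x = np.sin(2 * np.pi * 200 * t) + 0.01 * np.sin(2 * np.pi * 2300 * t)
    errors = []
    for band in band_split(x, fs, edges=[0, 1000, 4000]):
        half = len(band) // 2
        train_s, train_y = delay_embed(band[:half], dim, lag)
        test_s, test_y = delay_embed(band[half:], dim, lag)
        # Knots follow the band's own amplitude range.
        knots = np.quantile(band[:half], [0.25, 0.5, 0.75])
        w = fit_map(train_s, train_y, knots)
        resid = hinge_features(test_s, knots) @ w - test_y
        errors.append(float(np.sqrt(np.mean(resid**2) / np.mean(test_y**2))))
    return errors

print(band_prediction_errors())
```

Because each toy band holds a single sinusoid, the next sample is a linear function of the state and the fit is essentially exact; real speech bands would need the nonlinear terms and a proper MARS fit. The sketch covers only one-step prediction, not the free-running synthesis mentioned in the abstract.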
