{"title":"语音多波段非线性振荡器模型","authors":"H. Haas, G. Kubin","doi":"10.1109/ACSSC.1998.750882","DOIUrl":null,"url":null,"abstract":"Nonlinear self-oscillating systems can model speech without an external excitation that drives a conventional filter model. However, they often do not give due consideration to perceptually important but weak signal components such as the higher formants of voiced speech. To overcome this problem, we propose two frequency-domain oscillator models: a bank of sub-band oscillators with individual oscillator states and a multi-band oscillator with a single joint state vector. Their state-transition map is approximated with compactly parameterized multivariate adaptive regression splines (MARS) and the systems are successfully tested in short-term prediction and synthesis experiments with sustained vowels.","PeriodicalId":393743,"journal":{"name":"Conference Record of Thirty-Second Asilomar Conference on Signals, Systems and Computers (Cat. No.98CH36284)","volume":"77 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1998-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"23","resultStr":"{\"title\":\"A multi-band nonlinear oscillator model for speech\",\"authors\":\"H. Haas, G. Kubin\",\"doi\":\"10.1109/ACSSC.1998.750882\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Nonlinear self-oscillating systems can model speech without an external excitation that drives a conventional filter model. However, they often do not give due consideration to perceptually important but weak signal components such as the higher formants of voiced speech. To overcome this problem, we propose two frequency-domain oscillator models: a bank of sub-band oscillators with individual oscillator states and a multi-band oscillator with a single joint state vector. Their state-transition map is approximated with compactly parameterized multivariate adaptive regression splines (MARS) and the systems are successfully tested in short-term prediction and synthesis experiments with sustained vowels.\",\"PeriodicalId\":393743,\"journal\":{\"name\":\"Conference Record of Thirty-Second Asilomar Conference on Signals, Systems and Computers (Cat. No.98CH36284)\",\"volume\":\"77 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1998-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"23\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Conference Record of Thirty-Second Asilomar Conference on Signals, Systems and Computers (Cat. No.98CH36284)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ACSSC.1998.750882\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Conference Record of Thirty-Second Asilomar Conference on Signals, Systems and Computers (Cat. No.98CH36284)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACSSC.1998.750882","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A multi-band nonlinear oscillator model for speech
Nonlinear self-oscillating systems can model speech without an external excitation that drives a conventional filter model. However, they often do not give due consideration to perceptually important but weak signal components such as the higher formants of voiced speech. To overcome this problem, we propose two frequency-domain oscillator models: a bank of sub-band oscillators with individual oscillator states and a multi-band oscillator with a single joint state vector. Their state-transition map is approximated with compactly parameterized multivariate adaptive regression splines (MARS) and the systems are successfully tested in short-term prediction and synthesis experiments with sustained vowels.