{"title":"2400 b/s的增强型多频带激励语音编码器","authors":"K. Teague","doi":"10.1109/ACSSC.2002.1197179","DOIUrl":null,"url":null,"abstract":"The design and implementation of a 2400 b/s enhanced multiband excitation (EMBE) speech coder is described. The coder uses a variation of the multiband excitation (MBE) model originally proposed by Griffin and Lim to produce natural sounding and intelligible speech. A pitch-adaptive variable band structure is used for representing voicing decisions, with perceptually weighted spectral smoothing of harmonic amplitudes, and efficient vector quantization of speech spectrum using a four-way split vector quantizer to achieve high quality speech at 2,400 b/s. Objective performance results, in the form of DAM and DRT scores, are presented for quiet and office environments, indicating this coder is well suited for applications requiring low rate communications quality speech.","PeriodicalId":284950,"journal":{"name":"Conference Record of the Thirty-Sixth Asilomar Conference on Signals, Systems and Computers, 2002.","volume":"267 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2002-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An enhanced multiband excitation speech coder at 2,400 b/s\",\"authors\":\"K. Teague\",\"doi\":\"10.1109/ACSSC.2002.1197179\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The design and implementation of a 2400 b/s enhanced multiband excitation (EMBE) speech coder is described. The coder uses a variation of the multiband excitation (MBE) model originally proposed by Griffin and Lim to produce natural sounding and intelligible speech. A pitch-adaptive variable band structure is used for representing voicing decisions, with perceptually weighted spectral smoothing of harmonic amplitudes, and efficient vector quantization of speech spectrum using a four-way split vector quantizer to achieve high quality speech at 2,400 b/s. Objective performance results, in the form of DAM and DRT scores, are presented for quiet and office environments, indicating this coder is well suited for applications requiring low rate communications quality speech.\",\"PeriodicalId\":284950,\"journal\":{\"name\":\"Conference Record of the Thirty-Sixth Asilomar Conference on Signals, Systems and Computers, 2002.\",\"volume\":\"267 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2002-11-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Conference Record of the Thirty-Sixth Asilomar Conference on Signals, Systems and Computers, 2002.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ACSSC.2002.1197179\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Conference Record of the Thirty-Sixth Asilomar Conference on Signals, Systems and Computers, 2002.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACSSC.2002.1197179","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An enhanced multiband excitation speech coder at 2,400 b/s
The design and implementation of a 2400 b/s enhanced multiband excitation (EMBE) speech coder is described. The coder uses a variation of the multiband excitation (MBE) model originally proposed by Griffin and Lim to produce natural sounding and intelligible speech. A pitch-adaptive variable band structure is used for representing voicing decisions, with perceptually weighted spectral smoothing of harmonic amplitudes, and efficient vector quantization of speech spectrum using a four-way split vector quantizer to achieve high quality speech at 2,400 b/s. Objective performance results, in the form of DAM and DRT scores, are presented for quiet and office environments, indicating this coder is well suited for applications requiring low rate communications quality speech.