{"title":"递归神经网络在低比特率语音编码中的应用","authors":"M. Kohata","doi":"10.1109/ICSLP.1996.607116","DOIUrl":null,"url":null,"abstract":"It is well known that the LSP coefficient which represents the speech spectrum envelope as one of the linear prediction coefficients, shows good performance for spectral interpolation along the time axis, but it is also known that the duration of interpolation is limited up to 20/spl sim/30 ms. This limitation makes it difficult to reduce the bit rate in very low bit rate speech coding. To resolve this problem, recurrent neural networks (RNN) were applied to interpolate LSP coefficients, and it was possible to increase the duration of interpolation to about 100 ms without so much degradation of the synthesized speech quality.","PeriodicalId":359344,"journal":{"name":"Fourth International Symposium on Signal Processing and Its Applications","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1996-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"An Application of Recurrent Neural Networks to Low Bit Rate Speech Coding\",\"authors\":\"M. Kohata\",\"doi\":\"10.1109/ICSLP.1996.607116\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"It is well known that the LSP coefficient which represents the speech spectrum envelope as one of the linear prediction coefficients, shows good performance for spectral interpolation along the time axis, but it is also known that the duration of interpolation is limited up to 20/spl sim/30 ms. This limitation makes it difficult to reduce the bit rate in very low bit rate speech coding. To resolve this problem, recurrent neural networks (RNN) were applied to interpolate LSP coefficients, and it was possible to increase the duration of interpolation to about 100 ms without so much degradation of the synthesized speech quality.\",\"PeriodicalId\":359344,\"journal\":{\"name\":\"Fourth International Symposium on Signal Processing and Its Applications\",\"volume\":\"38 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1996-10-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Fourth International Symposium on Signal Processing and Its Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSLP.1996.607116\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fourth International Symposium on Signal Processing and Its Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSLP.1996.607116","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An Application of Recurrent Neural Networks to Low Bit Rate Speech Coding
It is well known that the LSP coefficient which represents the speech spectrum envelope as one of the linear prediction coefficients, shows good performance for spectral interpolation along the time axis, but it is also known that the duration of interpolation is limited up to 20/spl sim/30 ms. This limitation makes it difficult to reduce the bit rate in very low bit rate speech coding. To resolve this problem, recurrent neural networks (RNN) were applied to interpolate LSP coefficients, and it was possible to increase the duration of interpolation to about 100 ms without so much degradation of the synthesized speech quality.