Subjective performance of spectral excitation coding of speech at 2.4 kb/s
P. Lupini, V. Cuperman
1996 8th European Signal Processing Conference (EUSIPCO 1996), 1996-09-10
DOI: 10.5281/ZENODO.36343 (https://doi.org/10.5281/ZENODO.36343)
This paper presents a low-rate (2.4 kb/s) spectral excitation coding (SEC) speech codec based on a sinusoidal model applied to the excitation signal. A frame classifier, combined with a phase dispersion algorithm, allows the same model to be used for voiced, unvoiced, and transitional sounds. The phase dispersion algorithm significantly improves the perceived quality for all frame classes, resulting in more "natural" reconstructed speech. Informal MOS testing indicates that the 2.4 kb/s SEC system achieves MOS scores close to those of the existing 4 kb/s standards (differences of up to 0.2 on the MOS scale) and significantly better than that of the existing 2.4 kb/s LPC-10 standard (a difference of 1.5 on the MOS scale).
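
To make the idea of a sinusoidal excitation model with phase dispersion more concrete, here is a minimal Python sketch. It is not the authors' algorithm: the abstract gives no implementation details, so the function names, the 8 kHz sampling rate, the 20 ms frame, the harmonic amplitudes, and the way `dispersion` scales a random phase offset are all assumptions chosen for illustration only.

```python
import math
import random


def synthesize_excitation(f0_hz, amplitudes, n_samples, sample_rate=8000,
                          dispersion=0.0, seed=0):
    """Synthesize one excitation frame as a sum of harmonics of f0_hz.

    Hypothetical illustration, not the codec described in the paper.
    `dispersion` in [0, 1] scales a random phase offset per harmonic:
    0 gives zero-phase (pulse-like) harmonics, 1 draws each phase
    uniformly from [-pi, pi].
    """
    rng = random.Random(seed)
    phases = [dispersion * rng.uniform(-math.pi, math.pi) for _ in amplitudes]
    frame = []
    for n in range(n_samples):
        t = n / sample_rate
        sample = 0.0
        # Harmonic k has frequency k * f0_hz, amplitude a and phase phi.
        for k, (a, phi) in enumerate(zip(amplitudes, phases), start=1):
            sample += a * math.cos(2.0 * math.pi * k * f0_hz * t + phi)
        frame.append(sample)
    return frame


def bits_per_frame(bit_rate_bps, frame_ms):
    """Bit budget available per frame at a given bit rate."""
    return bit_rate_bps * frame_ms / 1000.0


# Assumed values: 100 Hz pitch, three harmonics, 160 samples (20 ms at 8 kHz).
amps = [1.0, 0.5, 0.25]
pulse_like = synthesize_excitation(100.0, amps, 160, dispersion=0.0)
dispersed = synthesize_excitation(100.0, amps, 160, dispersion=1.0)
budget = bits_per_frame(2400, 20)
```

With zero dispersion all harmonics peak together at the start of the frame, which gives a pulse-like waveform; spreading the phases moves those peaks apart, which is one plausible reading of why phase dispersion could make the same model usable for unvoiced and transitional frames. Under the assumed 20 ms frame, a 2.4 kb/s codec has 48 bits per frame to spend on all parameters.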