P. Bauer, Marc-André Jung, Junge Qi, T. Fingscheidt
{"title":"提高汽车免提系统语音清晰度的研究","authors":"P. Bauer, Marc-André Jung, Junge Qi, T. Fingscheidt","doi":"10.1109/ISCE.2010.5522703","DOIUrl":null,"url":null,"abstract":"Due to its narrow acoustic bandwidth, typical telephone speech is poorly intelligible on a phoneme or syllable basis. Artificial bandwidth extension (ABWE) techniques intend to increase speech intelligibility by estimating and reconstructing higher frequency components up to 7kHz. Hands-free systems in automotive environments could especially benefit from that, since drivers are mentally stressed during telephone conversations. By improving speech intelligibility, their mental load may decrease. Thus driving safety and comfort increase. This paper presents a real-time ABWE implementation integrated into an automotive hands-free system. Its novelty is a phoneme-specific design that yields an improved performance on critical phonemes, especially /s/ and /z/. Hence, typical lisping effects of previous ABWE approaches are reduced. We performed informal listening tests on meaningless vowel-consonant-vowel (VCV) syllables demonstrating a significantly improved fricative intelligibility of ABWE speech vs. NB speech.","PeriodicalId":403652,"journal":{"name":"IEEE International Symposium on Consumer Electronics (ISCE 2010)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":"{\"title\":\"On improving speech intelligibility in automotive hands-free systems\",\"authors\":\"P. Bauer, Marc-André Jung, Junge Qi, T. Fingscheidt\",\"doi\":\"10.1109/ISCE.2010.5522703\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Due to its narrow acoustic bandwidth, typical telephone speech is poorly intelligible on a phoneme or syllable basis. Artificial bandwidth extension (ABWE) techniques intend to increase speech intelligibility by estimating and reconstructing higher frequency components up to 7kHz. Hands-free systems in automotive environments could especially benefit from that, since drivers are mentally stressed during telephone conversations. By improving speech intelligibility, their mental load may decrease. Thus driving safety and comfort increase. This paper presents a real-time ABWE implementation integrated into an automotive hands-free system. Its novelty is a phoneme-specific design that yields an improved performance on critical phonemes, especially /s/ and /z/. Hence, typical lisping effects of previous ABWE approaches are reduced. We performed informal listening tests on meaningless vowel-consonant-vowel (VCV) syllables demonstrating a significantly improved fricative intelligibility of ABWE speech vs. NB speech.\",\"PeriodicalId\":403652,\"journal\":{\"name\":\"IEEE International Symposium on Consumer Electronics (ISCE 2010)\",\"volume\":\"42 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-06-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"12\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE International Symposium on Consumer Electronics (ISCE 2010)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISCE.2010.5522703\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE International Symposium on Consumer Electronics (ISCE 2010)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCE.2010.5522703","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
On improving speech intelligibility in automotive hands-free systems
Due to its narrow acoustic bandwidth, typical telephone speech is poorly intelligible on a phoneme or syllable basis. Artificial bandwidth extension (ABWE) techniques intend to increase speech intelligibility by estimating and reconstructing higher frequency components up to 7kHz. Hands-free systems in automotive environments could especially benefit from that, since drivers are mentally stressed during telephone conversations. By improving speech intelligibility, their mental load may decrease. Thus driving safety and comfort increase. This paper presents a real-time ABWE implementation integrated into an automotive hands-free system. Its novelty is a phoneme-specific design that yields an improved performance on critical phonemes, especially /s/ and /z/. Hence, typical lisping effects of previous ABWE approaches are reduced. We performed informal listening tests on meaningless vowel-consonant-vowel (VCV) syllables demonstrating a significantly improved fricative intelligibility of ABWE speech vs. NB speech.