自动检测过渡段的强度和时间尺度修改语音可理解性

A. Jayan, P. C. Pandey, P. Lehana
{"title":"自动检测过渡段的强度和时间尺度修改语音可理解性","authors":"A. Jayan, P. C. Pandey, P. Lehana","doi":"10.1109/ICSCN.2008.4447162","DOIUrl":null,"url":null,"abstract":"Spectral transition segments serve as landmarks for the perception of consonants. In \"clear speech\" mode adopted by speakers to improve intelligibility in difficult communication environments, transition segments are of increased duration and intensity. Modification of conversational speech to have acoustic properties of clear speech has been reported to improve its intelligibility. This paper presents an automated method for locating spectral transition segments in speech, and to produce natural quality resynthesized speech with intensity and time-scale modified spectral transition segments. The boundaries of spectral transition segments are located using an index derived from the rate of variation of energy and centroid frequency in five non-overlapping spectral bands. Time-scale modification is performed using harmonic plus noise model (HNM) based analysis-synthesis. The overall speech duration is kept unaltered by appropriately compressing the steady state segments. Transition segments are intensity scaled by 6 dB. The effectiveness of the method was evaluated by conducting listening tests on normal hearing subjects using VCV syllables as the test material.","PeriodicalId":158011,"journal":{"name":"2008 International Conference on Signal Processing, Communications and Networking","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":"{\"title\":\"Automated Detection of Transition Segments for Intensity and Time-Scale Modification for Speech Intelligibility Enhancement\",\"authors\":\"A. Jayan, P. C. Pandey, P. Lehana\",\"doi\":\"10.1109/ICSCN.2008.4447162\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Spectral transition segments serve as landmarks for the perception of consonants. In \\\"clear speech\\\" mode adopted by speakers to improve intelligibility in difficult communication environments, transition segments are of increased duration and intensity. Modification of conversational speech to have acoustic properties of clear speech has been reported to improve its intelligibility. This paper presents an automated method for locating spectral transition segments in speech, and to produce natural quality resynthesized speech with intensity and time-scale modified spectral transition segments. The boundaries of spectral transition segments are located using an index derived from the rate of variation of energy and centroid frequency in five non-overlapping spectral bands. Time-scale modification is performed using harmonic plus noise model (HNM) based analysis-synthesis. The overall speech duration is kept unaltered by appropriately compressing the steady state segments. Transition segments are intensity scaled by 6 dB. The effectiveness of the method was evaluated by conducting listening tests on normal hearing subjects using VCV syllables as the test material.\",\"PeriodicalId\":158011,\"journal\":{\"name\":\"2008 International Conference on Signal Processing, Communications and Networking\",\"volume\":\"36 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-02-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"12\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2008 International Conference on Signal Processing, Communications and Networking\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSCN.2008.4447162\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 International Conference on Signal Processing, Communications and Networking","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSCN.2008.4447162","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 12

摘要

谱过渡段作为辅音感知的标志。在困难的交流环境中,说话者为提高可理解性而采用的“清晰言语”模式中,过渡段的持续时间和强度都有所增加。据报道,对会话语音进行修饰,使其具有清晰语音的声学特性,以提高其可理解性。本文提出了一种自动定位语音中频谱过渡段的方法,并对频谱过渡段的强度和时间尺度进行了修改,生成了自然质量的重合成语音。利用能量变化率和质心频率在5个不重叠光谱波段上的变化率来确定光谱过渡段的边界。采用谐波加噪声模型(HNM)进行时间尺度修正。通过适当地压缩稳态段,可以保持整体语音持续时间不变。过渡段的强度按6db进行缩放。以VCV音节为测试材料,对听力正常的受试者进行听力测试,评价该方法的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Automated Detection of Transition Segments for Intensity and Time-Scale Modification for Speech Intelligibility Enhancement
Spectral transition segments serve as landmarks for the perception of consonants. In "clear speech" mode adopted by speakers to improve intelligibility in difficult communication environments, transition segments are of increased duration and intensity. Modification of conversational speech to have acoustic properties of clear speech has been reported to improve its intelligibility. This paper presents an automated method for locating spectral transition segments in speech, and to produce natural quality resynthesized speech with intensity and time-scale modified spectral transition segments. The boundaries of spectral transition segments are located using an index derived from the rate of variation of energy and centroid frequency in five non-overlapping spectral bands. Time-scale modification is performed using harmonic plus noise model (HNM) based analysis-synthesis. The overall speech duration is kept unaltered by appropriately compressing the steady state segments. Transition segments are intensity scaled by 6 dB. The effectiveness of the method was evaluated by conducting listening tests on normal hearing subjects using VCV syllables as the test material.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信