Pole-focused linear prediction-based spectrogram for coarticulation analysis

A. Abraham, P. Vijayalakshmi, T. Nagarajan
2010 IEEE Students Technology Symposium (TechSym), published 2010-04-03
DOI: 10.1109/TECHSYM.2010.5469215 (https://doi.org/10.1109/TECHSYM.2010.5469215)
Citations: 5

Abstract

Coarticulation refers to the influence of the articulation of one sound on the articulation of another sound in the same utterance [1]. The effects of coarticulation in an utterance have to be analyzed when developing systems such as triphone-based speech recognizers and text-to-speech (TTS) systems. The conventional Fourier transform-based spectrogram fails to capture the formant transitions between two adjacent phonemes. Further, depending on the size of the analysis window, either horizontal or vertical striations are observed, which reduce the clarity of the spectrogram. In this work, to overcome these problems, a linear prediction-based (that is, model-based) spectrogram is proposed and implemented. To further improve the clarity and to make the formant trajectories more prominent, the poles of the LP analysis are focussed and a modified LP-based spectrogram is derived. The resulting pole-focussed LP-based spectrogram is found to be a better candidate for analyzing coarticulation effects. Using this technique, a MATLAB-based, user-friendly tool is developed for coarticulation analysis.
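The LP-based spectrogram mentioned in the abstract can be illustrated with a minimal sketch: each frame is modeled by an all-pole filter, and the smooth envelope of that model is plotted in place of the Fourier magnitude. The abstract does not describe the authors' implementation, so everything below is an assumption made for illustration: the function names, the Hamming window, the autocorrelation method with Levinson-Durbin recursion, and the small diagonal loading. The pole-focusing step is not specified in the abstract and is deliberately left out.

```python
import cmath
import math


def autocorrelation(frame, max_lag):
    """Autocorrelation r[0..max_lag] of one (already windowed) frame."""
    n = len(frame)
    return [sum(frame[i] * frame[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Solve the LP normal equations.

    Returns (a, err) where A(z) = 1 + a[1] z^-1 + ... + a[order] z^-order
    and err is the final prediction-error energy.
    """
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err                      # reflection coefficient
        prev = a[:]
        for j in range(1, i):
            a[j] = prev[j] + k * prev[i - j]
        a[i] = k
        err *= 1.0 - k * k
    return a, err


def lp_spectrum_db(frame, order, n_bins):
    """All-pole envelope err / |A(e^{jw})|^2 in dB, for w in [0, pi]."""
    n = len(frame)
    windowed = [x * (0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)))
                for i, x in enumerate(frame)]
    r = autocorrelation(windowed, order)
    # Assumption: slight diagonal loading so silent or purely sinusoidal
    # frames do not lead to a zero prediction error.
    r[0] = r[0] * (1.0 + 1e-6) + 1e-12
    a, err = levinson_durbin(r, order)
    spectrum = []
    for b in range(n_bins):
        w = math.pi * b / (n_bins - 1)
        a_w = sum(a[k] * cmath.exp(-1j * w * k) for k in range(order + 1))
        spectrum.append(10.0 * math.log10(err / abs(a_w) ** 2))
    return spectrum


def lp_spectrogram(signal, frame_len, hop, order, n_bins):
    """One LP envelope per frame; rows are time, columns are frequency."""
    return [lp_spectrum_db(signal[s:s + frame_len], order, n_bins)
            for s in range(0, len(signal) - frame_len + 1, hop)]
```

Because the envelope comes from a low-order model rather than from the raw short-time spectrum, it carries no pitch harmonics, so formant peaks remain visible across frames without the striations a Fourier spectrogram shows. The LP order is a free parameter here; a common rule of thumb is roughly the sampling rate in kHz plus a small constant, but the value used by the authors is not given in the abstract.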