Obstruent classification using modulation spectrogram based features

Anshu Chittora, Kewal D. Malde, H. Patil
{"title":"Obstruent classification using modulation spectrogram based features","authors":"Anshu Chittora, Kewal D. Malde, H. Patil","doi":"10.1109/ICSDA.2014.7051438","DOIUrl":null,"url":null,"abstract":"In this paper, a new feature extraction technique based on modulation spectrogram is proposed. Modulation spectrogram gives a 2-dimensional (2-D) feature set for each obstruent segment. Since the size of feature vector given by modulation spectrogram is of very large dimension, Higher Order Singular Value Decomposition (HOSVD) theorem is used to reduce the size of feature vector. The reduced feature vector is then applied to a classifier, which classify the obstruent in three broad classes, viz., stop, affricate and fricative. Four-fold cross-validation experiments have been conducted on TIMIT database to find accuracy of obstruent classification at phoneme-level and recognition of manner of articulation of obstruents. Our experimental results show 92.22 % and 94.85 % accuracies for obstruent classification at phoneme-level and recognition of manner of articulation of obstruents, respectively, using 3-nearest neighbor classifier while with same experimental setup Mel Frequency Cepstral Coefficients (MFCC) shows 87.24 % and 93.68 % average classification accuracy of phoneme-level classification and manner of articulation level classification of obstruents, respectively.","PeriodicalId":361187,"journal":{"name":"2014 17th Oriental Chapter of the International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques (COCOSDA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 17th Oriental Chapter of the International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques (COCOSDA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSDA.2014.7051438","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

In this paper, a new feature extraction technique based on modulation spectrogram is proposed. Modulation spectrogram gives a 2-dimensional (2-D) feature set for each obstruent segment. Since the size of feature vector given by modulation spectrogram is of very large dimension, Higher Order Singular Value Decomposition (HOSVD) theorem is used to reduce the size of feature vector. The reduced feature vector is then applied to a classifier, which classify the obstruent in three broad classes, viz., stop, affricate and fricative. Four-fold cross-validation experiments have been conducted on TIMIT database to find accuracy of obstruent classification at phoneme-level and recognition of manner of articulation of obstruents. Our experimental results show 92.22 % and 94.85 % accuracies for obstruent classification at phoneme-level and recognition of manner of articulation of obstruents, respectively, using 3-nearest neighbor classifier while with same experimental setup Mel Frequency Cepstral Coefficients (MFCC) shows 87.24 % and 93.68 % average classification accuracy of phoneme-level classification and manner of articulation level classification of obstruents, respectively.
基于调制谱图特征的障碍物分类
本文提出了一种新的基于调制谱图的特征提取技术。调制谱图给出了每个阻塞段的二维特征集。由于调制谱图给出的特征向量尺寸非常大,采用高阶奇异值分解(HOSVD)定理来减小特征向量的尺寸。然后将简化后的特征向量应用到分类器中,分类器将阻塞分为三大类,即顿音、消舌音和摩擦音。在TIMIT数据库上进行了四重交叉验证实验,以验证在音素水平上障碍物分类的准确性和对障碍物发音方式的识别。实验结果表明,在相同的实验设置下,使用3近邻分类器对音素水平的障碍分类和发音方式的障碍分类准确率分别为92.22%和94.85%,Mel频率频谱系数(MFCC)对音素水平的障碍分类和发音方式的障碍分类的平均准确率分别为87.24%和93.68%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信