扬声器匹配和不匹配条件下基于线性变换的VTLN中雅可比补偿的影响

S. Rath, A. K. Sarkar, S. Umesh
{"title":"扬声器匹配和不匹配条件下基于线性变换的VTLN中雅可比补偿的影响","authors":"S. Rath, A. K. Sarkar, S. Umesh","doi":"10.1109/NCC.2010.5430188","DOIUrl":null,"url":null,"abstract":"In this paper we study the effect of use of jacobian in different linear transformation (LT) based methods of VTLN. In conventional VTLN, the jacobian is highly non-linear and can not be computed and hence is ignored. In the LT based VTLN, since VTLN scaling is expressed as a matrix multiplication of un-warped MFCC features, jacobian is simply turns out as the determinant of the VTLN warp matrices. Hence in this framework of VTLN it is possible to account for jacobian. Two different methods, namely, L-VTLN and T-VTLN, are used for implementing LT based VTLN. By conducting experiments on the RM task and the TIDIGITs databases in matched and mismatched speaker conditions, the performance of using jacobian in warp-factor estimation have been evaluated. It is observed that in almost every matched and mis-matched speaker conditions jacobian improves performance in L-VTLN framework. In T-VTLN, however, jacobian does not improve the performance in any mis-matched speaker conditions. The cases in which jacobian degrades performance in L-VTLN and T-VTLN have been studied in detail.","PeriodicalId":130953,"journal":{"name":"2010 National Conference On Communications (NCC)","volume":"2014 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Effect of jacobian compensation in linear transformation based VTLN under matched and mis-matched speaker conditions\",\"authors\":\"S. Rath, A. K. Sarkar, S. Umesh\",\"doi\":\"10.1109/NCC.2010.5430188\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper we study the effect of use of jacobian in different linear transformation (LT) based methods of VTLN. In conventional VTLN, the jacobian is highly non-linear and can not be computed and hence is ignored. In the LT based VTLN, since VTLN scaling is expressed as a matrix multiplication of un-warped MFCC features, jacobian is simply turns out as the determinant of the VTLN warp matrices. Hence in this framework of VTLN it is possible to account for jacobian. Two different methods, namely, L-VTLN and T-VTLN, are used for implementing LT based VTLN. By conducting experiments on the RM task and the TIDIGITs databases in matched and mismatched speaker conditions, the performance of using jacobian in warp-factor estimation have been evaluated. It is observed that in almost every matched and mis-matched speaker conditions jacobian improves performance in L-VTLN framework. In T-VTLN, however, jacobian does not improve the performance in any mis-matched speaker conditions. The cases in which jacobian degrades performance in L-VTLN and T-VTLN have been studied in detail.\",\"PeriodicalId\":130953,\"journal\":{\"name\":\"2010 National Conference On Communications (NCC)\",\"volume\":\"2014 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-03-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 National Conference On Communications (NCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NCC.2010.5430188\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 National Conference On Communications (NCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NCC.2010.5430188","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

本文研究了雅可比矩阵在不同的线性变换方法中的应用效果。在传统的VTLN中,雅可比矩阵是高度非线性的,无法计算,因此被忽略。在基于LT的VTLN中,由于VTLN的缩放被表示为未翘曲的MFCC特征的矩阵乘法,雅可比矩阵被简单地证明为VTLN翘曲矩阵的行列式。因此,在这个矩阵的框架中,可以解释雅可比矩阵。采用L-VTLN和T-VTLN两种不同的方法实现基于LT的VTLN。通过对RM任务和TIDIGITs数据库在匹配和不匹配发言者条件下的实验,评估了雅可比矩阵在翘曲因子估计中的性能。观察到,在几乎所有匹配和不匹配的扬声器条件下,雅可比矩阵都能提高L-VTLN框架的性能。然而,在T-VTLN中,雅可比矩阵在任何不匹配的扬声器条件下都不能改善性能。详细研究了在L-VTLN和T-VTLN中雅可比矩阵降低性能的情况。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Effect of jacobian compensation in linear transformation based VTLN under matched and mis-matched speaker conditions
In this paper we study the effect of use of jacobian in different linear transformation (LT) based methods of VTLN. In conventional VTLN, the jacobian is highly non-linear and can not be computed and hence is ignored. In the LT based VTLN, since VTLN scaling is expressed as a matrix multiplication of un-warped MFCC features, jacobian is simply turns out as the determinant of the VTLN warp matrices. Hence in this framework of VTLN it is possible to account for jacobian. Two different methods, namely, L-VTLN and T-VTLN, are used for implementing LT based VTLN. By conducting experiments on the RM task and the TIDIGITs databases in matched and mismatched speaker conditions, the performance of using jacobian in warp-factor estimation have been evaluated. It is observed that in almost every matched and mis-matched speaker conditions jacobian improves performance in L-VTLN framework. In T-VTLN, however, jacobian does not improve the performance in any mis-matched speaker conditions. The cases in which jacobian degrades performance in L-VTLN and T-VTLN have been studied in detail.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信