{"title":"扬声器匹配和不匹配条件下基于线性变换的VTLN中雅可比补偿的影响","authors":"S. Rath, A. K. Sarkar, S. Umesh","doi":"10.1109/NCC.2010.5430188","DOIUrl":null,"url":null,"abstract":"In this paper we study the effect of use of jacobian in different linear transformation (LT) based methods of VTLN. In conventional VTLN, the jacobian is highly non-linear and can not be computed and hence is ignored. In the LT based VTLN, since VTLN scaling is expressed as a matrix multiplication of un-warped MFCC features, jacobian is simply turns out as the determinant of the VTLN warp matrices. Hence in this framework of VTLN it is possible to account for jacobian. Two different methods, namely, L-VTLN and T-VTLN, are used for implementing LT based VTLN. By conducting experiments on the RM task and the TIDIGITs databases in matched and mismatched speaker conditions, the performance of using jacobian in warp-factor estimation have been evaluated. It is observed that in almost every matched and mis-matched speaker conditions jacobian improves performance in L-VTLN framework. In T-VTLN, however, jacobian does not improve the performance in any mis-matched speaker conditions. The cases in which jacobian degrades performance in L-VTLN and T-VTLN have been studied in detail.","PeriodicalId":130953,"journal":{"name":"2010 National Conference On Communications (NCC)","volume":"2014 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Effect of jacobian compensation in linear transformation based VTLN under matched and mis-matched speaker conditions\",\"authors\":\"S. Rath, A. K. Sarkar, S. Umesh\",\"doi\":\"10.1109/NCC.2010.5430188\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper we study the effect of use of jacobian in different linear transformation (LT) based methods of VTLN. In conventional VTLN, the jacobian is highly non-linear and can not be computed and hence is ignored. In the LT based VTLN, since VTLN scaling is expressed as a matrix multiplication of un-warped MFCC features, jacobian is simply turns out as the determinant of the VTLN warp matrices. Hence in this framework of VTLN it is possible to account for jacobian. Two different methods, namely, L-VTLN and T-VTLN, are used for implementing LT based VTLN. By conducting experiments on the RM task and the TIDIGITs databases in matched and mismatched speaker conditions, the performance of using jacobian in warp-factor estimation have been evaluated. It is observed that in almost every matched and mis-matched speaker conditions jacobian improves performance in L-VTLN framework. In T-VTLN, however, jacobian does not improve the performance in any mis-matched speaker conditions. The cases in which jacobian degrades performance in L-VTLN and T-VTLN have been studied in detail.\",\"PeriodicalId\":130953,\"journal\":{\"name\":\"2010 National Conference On Communications (NCC)\",\"volume\":\"2014 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-03-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 National Conference On Communications (NCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NCC.2010.5430188\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 National Conference On Communications (NCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NCC.2010.5430188","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Effect of jacobian compensation in linear transformation based VTLN under matched and mis-matched speaker conditions
In this paper we study the effect of use of jacobian in different linear transformation (LT) based methods of VTLN. In conventional VTLN, the jacobian is highly non-linear and can not be computed and hence is ignored. In the LT based VTLN, since VTLN scaling is expressed as a matrix multiplication of un-warped MFCC features, jacobian is simply turns out as the determinant of the VTLN warp matrices. Hence in this framework of VTLN it is possible to account for jacobian. Two different methods, namely, L-VTLN and T-VTLN, are used for implementing LT based VTLN. By conducting experiments on the RM task and the TIDIGITs databases in matched and mismatched speaker conditions, the performance of using jacobian in warp-factor estimation have been evaluated. It is observed that in almost every matched and mis-matched speaker conditions jacobian improves performance in L-VTLN framework. In T-VTLN, however, jacobian does not improve the performance in any mis-matched speaker conditions. The cases in which jacobian degrades performance in L-VTLN and T-VTLN have been studied in detail.