{"title":"LDA集成到电话通话扬声器拨号系统中","authors":"I. Lapidot, J. Bonastre","doi":"10.1109/EEEI.2012.6376948","DOIUrl":null,"url":null,"abstract":"In this work we examine whether linear discriminant analysis (LDA) can improve the diarization performance, when used as an additional phase in a telephone conversation diarization system. We first apply a classical diarization system. Using systems output (to define the classes of interest) an LDA transformation on the mel-cepstrum features is performed. Then, the final diarization process is applied onto the transformed features. A relative improvement of 14.8% was obtained on LDC America CallHome database. The LDA seemed sensible to both segment duration and amount of data available for training, as shown by the results obtained on NIST SRE-05 database where no significative improvement was observed.","PeriodicalId":177385,"journal":{"name":"2012 IEEE 27th Convention of Electrical and Electronics Engineers in Israel","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Integration of LDA into a telephone conversation speaker diarization system\",\"authors\":\"I. Lapidot, J. Bonastre\",\"doi\":\"10.1109/EEEI.2012.6376948\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this work we examine whether linear discriminant analysis (LDA) can improve the diarization performance, when used as an additional phase in a telephone conversation diarization system. We first apply a classical diarization system. Using systems output (to define the classes of interest) an LDA transformation on the mel-cepstrum features is performed. Then, the final diarization process is applied onto the transformed features. A relative improvement of 14.8% was obtained on LDC America CallHome database. The LDA seemed sensible to both segment duration and amount of data available for training, as shown by the results obtained on NIST SRE-05 database where no significative improvement was observed.\",\"PeriodicalId\":177385,\"journal\":{\"name\":\"2012 IEEE 27th Convention of Electrical and Electronics Engineers in Israel\",\"volume\":\"33 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-12-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 IEEE 27th Convention of Electrical and Electronics Engineers in Israel\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/EEEI.2012.6376948\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE 27th Convention of Electrical and Electronics Engineers in Israel","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EEEI.2012.6376948","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
摘要
在这项工作中,我们研究了当线性判别分析(LDA)用作电话会话拨号系统中的附加相位时,是否可以提高拨号性能。我们首先应用一个经典的二次化系统。使用系统输出(定义感兴趣的类)对mel-倒谱特征执行LDA转换。然后,对变换后的特征进行最后的特征化处理。在LDC America CallHome数据库上获得了14.8%的相对改进。在NIST SRE-05数据库上获得的结果表明,LDA似乎对段持续时间和可用于训练的数据量都是合理的,没有观察到显著的改进。
Integration of LDA into a telephone conversation speaker diarization system
In this work we examine whether linear discriminant analysis (LDA) can improve the diarization performance, when used as an additional phase in a telephone conversation diarization system. We first apply a classical diarization system. Using systems output (to define the classes of interest) an LDA transformation on the mel-cepstrum features is performed. Then, the final diarization process is applied onto the transformed features. A relative improvement of 14.8% was obtained on LDC America CallHome database. The LDA seemed sensible to both segment duration and amount of data available for training, as shown by the results obtained on NIST SRE-05 database where no significative improvement was observed.