The Use of Perceptual Energy in Speaker Discrimination
Zhao Sun, Lei Wang
2014 Seventh International Symposium on Computational Intelligence and Design, 2014-12-13
DOI: 10.1109/ISCID.2014.128 (https://doi.org/10.1109/ISCID.2014.128)
Short-term energy is widely used in speaker discrimination applications such as speaker verification and speaker diarization systems. However, unlike the well-known loudness measurement, its calculation takes no account of the properties of human hearing. Since perceptual linear prediction (PLP) has been shown to be, to some extent, better than mel-frequency cepstral coefficients (MFCC) because it approximates the excitation pattern of the human hearing system well, this paper introduces a perceptual energy, calculated within the PLP framework, to replace the short-term energy. The newly introduced energy measurement is shown to exhibit more discriminating power, with less computational overhead, than the classical short-term loudness. Its effectiveness is further verified in a speaker diarization application.
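For orientation, the classical short-term energy that the abstract takes as its baseline can be sketched as the log of the sum of squared samples in each analysis frame. This is a minimal illustration only: the function name `short_term_energy`, the 25 ms window and 10 ms hop at 16 kHz, and the log floor are assumptions chosen for the example, not values from the paper, and the paper's PLP-based perceptual energy is not reproduced here.

```python
import math


def short_term_energy(samples, frame_len=400, hop=160):
    """Return the log energy of each full frame of a mono signal.

    frame_len and hop are in samples. The defaults (25 ms window,
    10 ms hop at 16 kHz) are illustrative assumptions.
    """
    energies = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        frame = samples[start:start + frame_len]
        # Plain sum of squares: no auditory weighting is applied.
        energy = sum(x * x for x in frame)
        # The floor avoids log(0) on all-zero frames.
        energies.append(math.log(max(energy, 1e-10)))
    return energies


# Usage: a 1 s, 16 kHz, 440 Hz tone whose second half is ten times quieter.
rate = 16000
signal = [
    (1.0 if n < rate // 2 else 0.1) * math.sin(2 * math.pi * 440 * n / rate)
    for n in range(rate)
]
log_e = short_term_energy(signal)
```

Because the measure depends only on sample amplitude, frames taken from the quieter half come out lower by about log(100), whatever the frequency content. That insensitivity to how the ear weights frequencies is the limitation the abstract points to.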