Spoorti J. Jainar, Pritam Limbaji Sale, B. Nagaraja
International Journal of Signal and Imaging Systems Engineering, published 2020-01-01. https://doi.org/10.1504/IJSISE.2020.10036128
VAD, feature extraction and modelling techniques for speaker recognition: a review
This paper reviews automatic speaker recognition technology, with an emphasis on state-of-the-art voice activity detection (VAD), feature extraction and speaker-modelling techniques that have emerged during the last few years. Researchers in the field have made a few attempts to recognise speakers under language-mismatch and limited-data conditions. To address these robustness issues, we also elaborate on language-mismatch and limited-data speaker recognition. Further, this paper identifies some issues with existing speaker recognition systems and investigates areas of possible improvement in the field. We conclude with a discussion of possible future directions.