Spoorti J. Jainar, Pritam Limbaji Sale, B. Nagaraja
{"title":"VAD, feature extraction and modelling techniques for speaker recognition: a review","authors":"Spoorti J. Jainar, Pritam Limbaji Sale, B. Nagaraja","doi":"10.1504/IJSISE.2020.10036128","DOIUrl":null,"url":null,"abstract":"This paper reviews an automatic speaker recognition technology, with an emphasis on state-of-the-art voice activity detection (VAD), feature extraction and speaker-modelling techniques that have emerged during the last few years. Researchers in the field of speaker recognition have made a few attempts to recognise the speaker in the language mismatch environment and limited data condition.To address robustness issues, we also elaborate language mismatch and limited data speaker recognition. Further, this paper identified some issues with the existing speaker recognition systems and also investigated areas of possible improvements in speaker recognition field. We conclude the paper with a discussion on the possible future directions.","PeriodicalId":56359,"journal":{"name":"International Journal of Signal and Imaging Systems Engineering","volume":"1 1","pages":""},"PeriodicalIF":0.6000,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Signal and Imaging Systems Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1504/IJSISE.2020.10036128","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Engineering","Score":null,"Total":0}
引用次数: 4
Abstract
This paper reviews an automatic speaker recognition technology, with an emphasis on state-of-the-art voice activity detection (VAD), feature extraction and speaker-modelling techniques that have emerged during the last few years. Researchers in the field of speaker recognition have made a few attempts to recognise the speaker in the language mismatch environment and limited data condition.To address robustness issues, we also elaborate language mismatch and limited data speaker recognition. Further, this paper identified some issues with the existing speaker recognition systems and also investigated areas of possible improvements in speaker recognition field. We conclude the paper with a discussion on the possible future directions.