Indu S, Indu Subramanian, Aishwarya Ponni P, Akilandeshwari R, Kaavya S, V. P
{"title":"基于GMM和BiLSTM的车辆环境下说话人识别","authors":"Indu S, Indu Subramanian, Aishwarya Ponni P, Akilandeshwari R, Kaavya S, V. P","doi":"10.1109/temsmet53515.2021.9768718","DOIUrl":null,"url":null,"abstract":"Speaker recognition in the vehicular environment is becoming increasingly significant with the advancement of smart and connected devices. Noisy conditions in the vehicular environment pose a pressing challenge to speaker identification tasks. Thus, a robust speaker recognition system is proposed that is not susceptible to a drop in performance when subjected to various SNR levels. This paper analyzes the performance of speaker identification using GMM and BiLSTM in noisy environments. The model is trained to identify speakers from the TIMIT speech corpus. The paper also explores data augmentation techniques, namely, time stretching and time rolling. On training and testing the proposed speaker identification system with TIMIT and custom dataset, results show that more data for both training and testing purposes can eventually improve the performance of the system.","PeriodicalId":170546,"journal":{"name":"2021 IEEE 2nd International Conference on Technology, Engineering, Management for Societal impact using Marketing, Entrepreneurship and Talent (TEMSMET)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Speaker Identification in Vehicular Environment using GMM and BiLSTM\",\"authors\":\"Indu S, Indu Subramanian, Aishwarya Ponni P, Akilandeshwari R, Kaavya S, V. P\",\"doi\":\"10.1109/temsmet53515.2021.9768718\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Speaker recognition in the vehicular environment is becoming increasingly significant with the advancement of smart and connected devices. Noisy conditions in the vehicular environment pose a pressing challenge to speaker identification tasks. Thus, a robust speaker recognition system is proposed that is not susceptible to a drop in performance when subjected to various SNR levels. This paper analyzes the performance of speaker identification using GMM and BiLSTM in noisy environments. The model is trained to identify speakers from the TIMIT speech corpus. The paper also explores data augmentation techniques, namely, time stretching and time rolling. On training and testing the proposed speaker identification system with TIMIT and custom dataset, results show that more data for both training and testing purposes can eventually improve the performance of the system.\",\"PeriodicalId\":170546,\"journal\":{\"name\":\"2021 IEEE 2nd International Conference on Technology, Engineering, Management for Societal impact using Marketing, Entrepreneurship and Talent (TEMSMET)\",\"volume\":\"2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 2nd International Conference on Technology, Engineering, Management for Societal impact using Marketing, Entrepreneurship and Talent (TEMSMET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/temsmet53515.2021.9768718\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 2nd International Conference on Technology, Engineering, Management for Societal impact using Marketing, Entrepreneurship and Talent (TEMSMET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/temsmet53515.2021.9768718","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Speaker Identification in Vehicular Environment using GMM and BiLSTM
Speaker recognition in the vehicular environment is becoming increasingly significant with the advancement of smart and connected devices. Noisy conditions in the vehicular environment pose a pressing challenge to speaker identification tasks. Thus, a robust speaker recognition system is proposed that is not susceptible to a drop in performance when subjected to various SNR levels. This paper analyzes the performance of speaker identification using GMM and BiLSTM in noisy environments. The model is trained to identify speakers from the TIMIT speech corpus. The paper also explores data augmentation techniques, namely, time stretching and time rolling. On training and testing the proposed speaker identification system with TIMIT and custom dataset, results show that more data for both training and testing purposes can eventually improve the performance of the system.