Khadidja Nesrine Boubakeur, M. Debyeche, A. Amrouche, Youssouf Bentrcia
{"title":"基于韵律模型的说话人识别","authors":"Khadidja Nesrine Boubakeur, M. Debyeche, A. Amrouche, Youssouf Bentrcia","doi":"10.1109/NTIC55069.2022.10100506","DOIUrl":null,"url":null,"abstract":"The use of prosodic characteristics, mainly pitch and intensity, for speaker identification in noisy environments is examined in this work. To make the acoustic models more resistant to the variability in the speech signal in noisy situations, these features are supplemented with cepstral parameters. As a consequence, two systems for Automatic Speaker Identification (ASI) in the independent mode of text are implemented. The first based on Hidden Markov Models (HMM), whereas Support Vector Machines (SVM) are employed in the second. The addition of prosodic features improves recognition, especially in high-noise environments. The performance of SVM-based systems is better than HMM-based systems","PeriodicalId":403927,"journal":{"name":"2022 2nd International Conference on New Technologies of Information and Communication (NTIC)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Prosodic Modelling based Speaker Identification\",\"authors\":\"Khadidja Nesrine Boubakeur, M. Debyeche, A. Amrouche, Youssouf Bentrcia\",\"doi\":\"10.1109/NTIC55069.2022.10100506\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The use of prosodic characteristics, mainly pitch and intensity, for speaker identification in noisy environments is examined in this work. To make the acoustic models more resistant to the variability in the speech signal in noisy situations, these features are supplemented with cepstral parameters. As a consequence, two systems for Automatic Speaker Identification (ASI) in the independent mode of text are implemented. The first based on Hidden Markov Models (HMM), whereas Support Vector Machines (SVM) are employed in the second. The addition of prosodic features improves recognition, especially in high-noise environments. The performance of SVM-based systems is better than HMM-based systems\",\"PeriodicalId\":403927,\"journal\":{\"name\":\"2022 2nd International Conference on New Technologies of Information and Communication (NTIC)\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 2nd International Conference on New Technologies of Information and Communication (NTIC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NTIC55069.2022.10100506\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 2nd International Conference on New Technologies of Information and Communication (NTIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NTIC55069.2022.10100506","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The use of prosodic characteristics, mainly pitch and intensity, for speaker identification in noisy environments is examined in this work. To make the acoustic models more resistant to the variability in the speech signal in noisy situations, these features are supplemented with cepstral parameters. As a consequence, two systems for Automatic Speaker Identification (ASI) in the independent mode of text are implemented. The first based on Hidden Markov Models (HMM), whereas Support Vector Machines (SVM) are employed in the second. The addition of prosodic features improves recognition, especially in high-noise environments. The performance of SVM-based systems is better than HMM-based systems