{"title":"Development of a biometric authentication platform using voice recognition","authors":"Abdelmadjid Benmachiche, Bouzata Hadjar, Ines Boutabia, Ali Abdelatif Betouil, Majda Maâtallah, Amina Makhlouf","doi":"10.1109/PAIS56586.2022.9946890","DOIUrl":null,"url":null,"abstract":"Nowadays, several areas have turned to speech recognition, as they have witnessed its efficiency in maintaining the privacy of users and facilitating access and use of numerous applications, systems even establishments. Our work revolves around recognizing a form of speech from a single word, where the main objective is to identify as accurately as possible a set of predefined words from brief audio clips. a Speech Commands dataset consisting of 65 000 one-second-long statements of 30 short term, is applied. We use a Convolutional Neural Network (CNN) to classify representatives with two-dimensional convolutions on the audio waveform. Unlike many classic techniques with critical feature engineering, we benefit from the power of deep learning to comprehend the feature representation while training. The prototype achieves an acceptable accuracy rate on the validation set. Our model has provided satisfactory results during training with high accuracy compared to its pre-descendants. Still, room for errors exists in words exceeding the range of the training data and especially noisy samples.","PeriodicalId":266229,"journal":{"name":"2022 4th International Conference on Pattern Analysis and Intelligent Systems (PAIS)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 4th International Conference on Pattern Analysis and Intelligent Systems (PAIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PAIS56586.2022.9946890","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Nowadays, several areas have turned to speech recognition, as they have witnessed its efficiency in maintaining the privacy of users and facilitating access and use of numerous applications, systems even establishments. Our work revolves around recognizing a form of speech from a single word, where the main objective is to identify as accurately as possible a set of predefined words from brief audio clips. a Speech Commands dataset consisting of 65 000 one-second-long statements of 30 short term, is applied. We use a Convolutional Neural Network (CNN) to classify representatives with two-dimensional convolutions on the audio waveform. Unlike many classic techniques with critical feature engineering, we benefit from the power of deep learning to comprehend the feature representation while training. The prototype achieves an acceptable accuracy rate on the validation set. Our model has provided satisfactory results during training with high accuracy compared to its pre-descendants. Still, room for errors exists in words exceeding the range of the training data and especially noisy samples.