Sanjay S. Tippannavar, R. Shashidhar, H. R. Sathvik, S. Varun, G. V. Punith, H. G. Nikshep
{"title":"基于KNN算法的文本独立说话人识别与分类","authors":"Sanjay S. Tippannavar, R. Shashidhar, H. R. Sathvik, S. Varun, G. V. Punith, H. G. Nikshep","doi":"10.1109/IC3I56241.2022.10072615","DOIUrl":null,"url":null,"abstract":"The method of automatically identifying the speaker using the speaker-specific data included in voice waves is known as speaker recognition. For speaker recognition, a variety of uses have been investigated. Monitoring, speech-activated secure access control, voice-activated customization of services or information for certain users, instances include using recorded voice samples in forensic and criminal investigations. The application that is now mentioned most often is access control, which also includes voice dialing, banking, telephone shopping, and database access services. Thus, it is projected that speaker recognition technology would provide new services in smart environments and enhance the comfort of daily life. Research has been done on the phenomenon known as “speaker idolization,” which occurs when speakers are automatically added to an input audio channel. It makes speech recognition easier, makes it easier to search and index audio archives, and gives machine transcriptions more depth and intelligibility. An important additional application for voice recognition technology is as a forensics tool. The speaker’s short-time spectral coefficients are described using vector quantization using a codebook. The success of these techniques is assessed from the perspective of robustness against utterance variation, such as variances in content, temporal variation, and changes in utterance pace. The voice of each individual is recorded three times. The experiment’s double distance measurement result is 96.97%, whereas the KNN technique’s single data center result is 84.85% The outcome shows that the twofold distance method increases the precision of voice recognition.","PeriodicalId":274660,"journal":{"name":"2022 5th International Conference on Contemporary Computing and Informatics (IC3I)","volume":"75 24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Text Independent Speaker Recognition and Classification using KNN Algorithm\",\"authors\":\"Sanjay S. Tippannavar, R. Shashidhar, H. R. Sathvik, S. Varun, G. V. Punith, H. G. Nikshep\",\"doi\":\"10.1109/IC3I56241.2022.10072615\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The method of automatically identifying the speaker using the speaker-specific data included in voice waves is known as speaker recognition. For speaker recognition, a variety of uses have been investigated. Monitoring, speech-activated secure access control, voice-activated customization of services or information for certain users, instances include using recorded voice samples in forensic and criminal investigations. The application that is now mentioned most often is access control, which also includes voice dialing, banking, telephone shopping, and database access services. Thus, it is projected that speaker recognition technology would provide new services in smart environments and enhance the comfort of daily life. Research has been done on the phenomenon known as “speaker idolization,” which occurs when speakers are automatically added to an input audio channel. It makes speech recognition easier, makes it easier to search and index audio archives, and gives machine transcriptions more depth and intelligibility. An important additional application for voice recognition technology is as a forensics tool. The speaker’s short-time spectral coefficients are described using vector quantization using a codebook. The success of these techniques is assessed from the perspective of robustness against utterance variation, such as variances in content, temporal variation, and changes in utterance pace. The voice of each individual is recorded three times. The experiment’s double distance measurement result is 96.97%, whereas the KNN technique’s single data center result is 84.85% The outcome shows that the twofold distance method increases the precision of voice recognition.\",\"PeriodicalId\":274660,\"journal\":{\"name\":\"2022 5th International Conference on Contemporary Computing and Informatics (IC3I)\",\"volume\":\"75 24 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 5th International Conference on Contemporary Computing and Informatics (IC3I)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IC3I56241.2022.10072615\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 5th International Conference on Contemporary Computing and Informatics (IC3I)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IC3I56241.2022.10072615","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Text Independent Speaker Recognition and Classification using KNN Algorithm
The method of automatically identifying the speaker using the speaker-specific data included in voice waves is known as speaker recognition. For speaker recognition, a variety of uses have been investigated. Monitoring, speech-activated secure access control, voice-activated customization of services or information for certain users, instances include using recorded voice samples in forensic and criminal investigations. The application that is now mentioned most often is access control, which also includes voice dialing, banking, telephone shopping, and database access services. Thus, it is projected that speaker recognition technology would provide new services in smart environments and enhance the comfort of daily life. Research has been done on the phenomenon known as “speaker idolization,” which occurs when speakers are automatically added to an input audio channel. It makes speech recognition easier, makes it easier to search and index audio archives, and gives machine transcriptions more depth and intelligibility. An important additional application for voice recognition technology is as a forensics tool. The speaker’s short-time spectral coefficients are described using vector quantization using a codebook. The success of these techniques is assessed from the perspective of robustness against utterance variation, such as variances in content, temporal variation, and changes in utterance pace. The voice of each individual is recorded three times. The experiment’s double distance measurement result is 96.97%, whereas the KNN technique’s single data center result is 84.85% The outcome shows that the twofold distance method increases the precision of voice recognition.