语音识别系统:使用深度神经网络的技术、方法和挑战报告

2021 Innovations in Power and Advanced Computing Technologies (i-PACT) Pub Date : 2021-11-27 DOI:10.1109/i-PACT52855.2021.9697005

P. Deepa, Rashmita Khilar

{"title":"语音识别系统:使用深度神经网络的技术、方法和挑战报告","authors":"P. Deepa, Rashmita Khilar","doi":"10.1109/i-PACT52855.2021.9697005","DOIUrl":null,"url":null,"abstract":"Voice recognition has been advancing at a fast rate. Many cases involving edited audio clips and incorrect identity claims are reported on a daily basis. Due to the growing importance of information processing technology, it becomes easier and easier to identify people by their voices. Voice recognition consists of detecting a user's identity based on characteristics of their voice. It is a widely applied form of biometric recognition in the world, particularly in fields where security has a high priority. The deep neural networks were used as feature extractor alongside classifiers, but they haven't been completely trained due to the success of deep learning. While such methods are extremely efficient, they still require manual attention. Especially in DNN, interactivity between people and machines is essential. This is where the art of voice recognition comes from. In addition to their application in speech recognition, deep neural networks have demonstrated their potential to be used for voice recognition as well. They provide an efficient implementation of complex nonlinear models for learning unique and invariant data structures. The main contribution of this work is to provide a brief overview of the field of deep neural networks and voice recognition, describing its system, underlying approaches, and challenges.","PeriodicalId":335956,"journal":{"name":"2021 Innovations in Power and Advanced Computing Technologies (i-PACT)","volume":"137 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Report on Voice Recognition System: Techniques, Methodologies and Challenges using Deep Neural Network\",\"authors\":\"P. Deepa, Rashmita Khilar\",\"doi\":\"10.1109/i-PACT52855.2021.9697005\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Voice recognition has been advancing at a fast rate. Many cases involving edited audio clips and incorrect identity claims are reported on a daily basis. Due to the growing importance of information processing technology, it becomes easier and easier to identify people by their voices. Voice recognition consists of detecting a user's identity based on characteristics of their voice. It is a widely applied form of biometric recognition in the world, particularly in fields where security has a high priority. The deep neural networks were used as feature extractor alongside classifiers, but they haven't been completely trained due to the success of deep learning. While such methods are extremely efficient, they still require manual attention. Especially in DNN, interactivity between people and machines is essential. This is where the art of voice recognition comes from. In addition to their application in speech recognition, deep neural networks have demonstrated their potential to be used for voice recognition as well. They provide an efficient implementation of complex nonlinear models for learning unique and invariant data structures. The main contribution of this work is to provide a brief overview of the field of deep neural networks and voice recognition, describing its system, underlying approaches, and challenges.\",\"PeriodicalId\":335956,\"journal\":{\"name\":\"2021 Innovations in Power and Advanced Computing Technologies (i-PACT)\",\"volume\":\"137 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 Innovations in Power and Advanced Computing Technologies (i-PACT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/i-PACT52855.2021.9697005\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 Innovations in Power and Advanced Computing Technologies (i-PACT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/i-PACT52855.2021.9697005","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

语音识别技术一直在快速发展。每天都有许多涉及编辑音频片段和错误身份声明的案件被报道。由于信息处理技术的日益重要，通过声音来识别人变得越来越容易。语音识别包括根据用户的语音特征来检测用户的身份。它是世界上广泛应用的一种生物特征识别形式，特别是在安全性要求很高的领域。深度神经网络与分类器一起被用作特征提取器，但由于深度学习的成功，它们还没有得到完全的训练。虽然这些方法非常有效，但它们仍然需要人工关注。特别是在深度神经网络中，人与机器之间的交互是必不可少的。这就是语音识别技术的由来。除了在语音识别方面的应用之外，深度神经网络也展示了它们在语音识别方面的潜力。它们为学习唯一和不变的数据结构提供了复杂非线性模型的有效实现。这项工作的主要贡献是提供了深度神经网络和语音识别领域的简要概述，描述了其系统，底层方法和挑战。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Report on Voice Recognition System: Techniques, Methodologies and Challenges using Deep Neural Network

Voice recognition has been advancing at a fast rate. Many cases involving edited audio clips and incorrect identity claims are reported on a daily basis. Due to the growing importance of information processing technology, it becomes easier and easier to identify people by their voices. Voice recognition consists of detecting a user's identity based on characteristics of their voice. It is a widely applied form of biometric recognition in the world, particularly in fields where security has a high priority. The deep neural networks were used as feature extractor alongside classifiers, but they haven't been completely trained due to the success of deep learning. While such methods are extremely efficient, they still require manual attention. Especially in DNN, interactivity between people and machines is essential. This is where the art of voice recognition comes from. In addition to their application in speech recognition, deep neural networks have demonstrated their potential to be used for voice recognition as well. They provide an efficient implementation of complex nonlinear models for learning unique and invariant data structures. The main contribution of this work is to provide a brief overview of the field of deep neural networks and voice recognition, describing its system, underlying approaches, and challenges.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 Innovations in Power and Advanced Computing Technologies (i-PACT)

自引率

0.00%

发文量