A Report on Voice Recognition System: Techniques, Methodologies and Challenges using Deep Neural Network

P. Deepa, Rashmita Khilar
{"title":"A Report on Voice Recognition System: Techniques, Methodologies and Challenges using Deep Neural Network","authors":"P. Deepa, Rashmita Khilar","doi":"10.1109/i-PACT52855.2021.9697005","DOIUrl":null,"url":null,"abstract":"Voice recognition has been advancing at a fast rate. Many cases involving edited audio clips and incorrect identity claims are reported on a daily basis. Due to the growing importance of information processing technology, it becomes easier and easier to identify people by their voices. Voice recognition consists of detecting a user's identity based on characteristics of their voice. It is a widely applied form of biometric recognition in the world, particularly in fields where security has a high priority. The deep neural networks were used as feature extractor alongside classifiers, but they haven't been completely trained due to the success of deep learning. While such methods are extremely efficient, they still require manual attention. Especially in DNN, interactivity between people and machines is essential. This is where the art of voice recognition comes from. In addition to their application in speech recognition, deep neural networks have demonstrated their potential to be used for voice recognition as well. They provide an efficient implementation of complex nonlinear models for learning unique and invariant data structures. The main contribution of this work is to provide a brief overview of the field of deep neural networks and voice recognition, describing its system, underlying approaches, and challenges.","PeriodicalId":335956,"journal":{"name":"2021 Innovations in Power and Advanced Computing Technologies (i-PACT)","volume":"137 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 Innovations in Power and Advanced Computing Technologies (i-PACT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/i-PACT52855.2021.9697005","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Voice recognition has been advancing at a fast rate. Many cases involving edited audio clips and incorrect identity claims are reported on a daily basis. Due to the growing importance of information processing technology, it becomes easier and easier to identify people by their voices. Voice recognition consists of detecting a user's identity based on characteristics of their voice. It is a widely applied form of biometric recognition in the world, particularly in fields where security has a high priority. The deep neural networks were used as feature extractor alongside classifiers, but they haven't been completely trained due to the success of deep learning. While such methods are extremely efficient, they still require manual attention. Especially in DNN, interactivity between people and machines is essential. This is where the art of voice recognition comes from. In addition to their application in speech recognition, deep neural networks have demonstrated their potential to be used for voice recognition as well. They provide an efficient implementation of complex nonlinear models for learning unique and invariant data structures. The main contribution of this work is to provide a brief overview of the field of deep neural networks and voice recognition, describing its system, underlying approaches, and challenges.
语音识别系统:使用深度神经网络的技术、方法和挑战报告
语音识别技术一直在快速发展。每天都有许多涉及编辑音频片段和错误身份声明的案件被报道。由于信息处理技术的日益重要,通过声音来识别人变得越来越容易。语音识别包括根据用户的语音特征来检测用户的身份。它是世界上广泛应用的一种生物特征识别形式,特别是在安全性要求很高的领域。深度神经网络与分类器一起被用作特征提取器,但由于深度学习的成功,它们还没有得到完全的训练。虽然这些方法非常有效,但它们仍然需要人工关注。特别是在深度神经网络中,人与机器之间的交互是必不可少的。这就是语音识别技术的由来。除了在语音识别方面的应用之外,深度神经网络也展示了它们在语音识别方面的潜力。它们为学习唯一和不变的数据结构提供了复杂非线性模型的有效实现。这项工作的主要贡献是提供了深度神经网络和语音识别领域的简要概述,描述了其系统,底层方法和挑战。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信