语音认证系统中接收和处理阶段信息的细节

Mykola Pastushenko, V. Pastushenko, O. Pastushenko
{"title":"语音认证系统中接收和处理阶段信息的细节","authors":"Mykola Pastushenko, V. Pastushenko, O. Pastushenko","doi":"10.1109/PICST47496.2019.9061260","DOIUrl":null,"url":null,"abstract":"The issues of improving the reliability of storing various resources, access to which is carried out using telecommunication networks, are considered. In this case, the first barrier in ensuring access reliability is the user authentication system. Lately, access systems based on biometric features of a user have been used. Initially, static biometric features of a user (facial image, finger papillary pattern and iris) were preferable, which did not meet the expectations of developers and users due to the simplicity of their falsification. Recently, the preference has been given to the dynamic (behavioral) biometric features of a user, namely voice authentication systems became more widely used. As it is known, voice authentication systems have several advantages, such as: simplicity, convenience, compactness, low cost, and a number of others. In addition, the passphrase can be quickly changed and expanded during the authentication process. However, the quality indicators of all biometric access systems do not meet the increasing requirements. The object of the study is the process of digital processing of voice signal during user authentication in access systems.In the process of voice authentication, the analysis of the amplitude-frequency spectrum of recording materials is performed. At the same time, the main research focuses on the use of estimates of formants, cepstral coefficients, mel-frequency cepstral coefficients, linear prediction coefficients, etc. as a user’s template. On the basis of user’s established patterns, admission decisions are made using Gaussian Mixture Models, Support Vector Machines, Hidden Markov Models or artificial neural networks.In the report, it is proposed to change the paradigm of digital processing of user voice signals and supplement the analysis of the amplitude-frequency spectrum with studies of phase data, which are traditionally ignored during the authentication. According to the authors, the latter is caused by the lack of effective procedures for the formation of phase data, the requirement of additional computational resources, which were not always available to researchers, and some features using the signal phase.","PeriodicalId":6764,"journal":{"name":"2019 IEEE International Scientific-Practical Conference Problems of Infocommunications, Science and Technology (PIC S&T)","volume":"11 1","pages":"621-624"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Specifics of Receiving and Processing Phase Information in Voice Authentication Systems\",\"authors\":\"Mykola Pastushenko, V. Pastushenko, O. Pastushenko\",\"doi\":\"10.1109/PICST47496.2019.9061260\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The issues of improving the reliability of storing various resources, access to which is carried out using telecommunication networks, are considered. In this case, the first barrier in ensuring access reliability is the user authentication system. Lately, access systems based on biometric features of a user have been used. Initially, static biometric features of a user (facial image, finger papillary pattern and iris) were preferable, which did not meet the expectations of developers and users due to the simplicity of their falsification. Recently, the preference has been given to the dynamic (behavioral) biometric features of a user, namely voice authentication systems became more widely used. As it is known, voice authentication systems have several advantages, such as: simplicity, convenience, compactness, low cost, and a number of others. In addition, the passphrase can be quickly changed and expanded during the authentication process. However, the quality indicators of all biometric access systems do not meet the increasing requirements. The object of the study is the process of digital processing of voice signal during user authentication in access systems.In the process of voice authentication, the analysis of the amplitude-frequency spectrum of recording materials is performed. At the same time, the main research focuses on the use of estimates of formants, cepstral coefficients, mel-frequency cepstral coefficients, linear prediction coefficients, etc. as a user’s template. On the basis of user’s established patterns, admission decisions are made using Gaussian Mixture Models, Support Vector Machines, Hidden Markov Models or artificial neural networks.In the report, it is proposed to change the paradigm of digital processing of user voice signals and supplement the analysis of the amplitude-frequency spectrum with studies of phase data, which are traditionally ignored during the authentication. According to the authors, the latter is caused by the lack of effective procedures for the formation of phase data, the requirement of additional computational resources, which were not always available to researchers, and some features using the signal phase.\",\"PeriodicalId\":6764,\"journal\":{\"name\":\"2019 IEEE International Scientific-Practical Conference Problems of Infocommunications, Science and Technology (PIC S&T)\",\"volume\":\"11 1\",\"pages\":\"621-624\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE International Scientific-Practical Conference Problems of Infocommunications, Science and Technology (PIC S&T)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/PICST47496.2019.9061260\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE International Scientific-Practical Conference Problems of Infocommunications, Science and Technology (PIC S&T)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PICST47496.2019.9061260","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

还考虑了提高存储各种资源的可靠性的问题,这些资源是通过电信网络进行访问的。在这种情况下,保证访问可靠性的第一个障碍就是用户认证系统。最近,基于用户生物特征的访问系统已经被使用。最初,用户的静态生物特征(面部图像、手指乳头状图案和虹膜)更可取,但由于伪造简单,无法满足开发人员和用户的期望。最近,人们更倾向于用户动态(行为)的生物特征,即语音认证系统得到了更广泛的应用。众所周知,语音认证系统具有几个优点,例如:简单、方便、紧凑、低成本以及许多其他优点。此外,在身份验证过程中可以快速更改和扩展密码短语。然而,所有生物识别门禁系统的质量指标都不能满足日益增长的需求。研究的对象是接入系统中用户认证过程中语音信号的数字化处理过程。在语音认证过程中,对录音材料的幅频谱进行分析。同时,重点研究了利用共振峰估计、倒谱系数、梅尔频倒谱系数、线性预测系数等作为用户模板。在用户建立模式的基础上,使用高斯混合模型、支持向量机、隐马尔可夫模型或人工神经网络进行录取决策。本文提出改变用户语音信号的数字化处理范式,在对幅频频谱分析的基础上,增加对相位数据的研究,这是传统认证过程中被忽略的。根据作者的说法,后者是由于缺乏有效的相位数据形成程序,需要额外的计算资源,而研究人员并不总是可以获得这些资源,以及使用信号相位的一些特征。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Specifics of Receiving and Processing Phase Information in Voice Authentication Systems
The issues of improving the reliability of storing various resources, access to which is carried out using telecommunication networks, are considered. In this case, the first barrier in ensuring access reliability is the user authentication system. Lately, access systems based on biometric features of a user have been used. Initially, static biometric features of a user (facial image, finger papillary pattern and iris) were preferable, which did not meet the expectations of developers and users due to the simplicity of their falsification. Recently, the preference has been given to the dynamic (behavioral) biometric features of a user, namely voice authentication systems became more widely used. As it is known, voice authentication systems have several advantages, such as: simplicity, convenience, compactness, low cost, and a number of others. In addition, the passphrase can be quickly changed and expanded during the authentication process. However, the quality indicators of all biometric access systems do not meet the increasing requirements. The object of the study is the process of digital processing of voice signal during user authentication in access systems.In the process of voice authentication, the analysis of the amplitude-frequency spectrum of recording materials is performed. At the same time, the main research focuses on the use of estimates of formants, cepstral coefficients, mel-frequency cepstral coefficients, linear prediction coefficients, etc. as a user’s template. On the basis of user’s established patterns, admission decisions are made using Gaussian Mixture Models, Support Vector Machines, Hidden Markov Models or artificial neural networks.In the report, it is proposed to change the paradigm of digital processing of user voice signals and supplement the analysis of the amplitude-frequency spectrum with studies of phase data, which are traditionally ignored during the authentication. According to the authors, the latter is caused by the lack of effective procedures for the formation of phase data, the requirement of additional computational resources, which were not always available to researchers, and some features using the signal phase.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信