Residual speech signal compression: an experiment in the practical application of neural network technology

L. Pratt, K. Cebulka, Peter Clitherow
{"title":"Residual speech signal compression: an experiment in the practical application of neural network technology","authors":"L. Pratt, K. Cebulka, Peter Clitherow","doi":"10.1145/98894.99124","DOIUrl":null,"url":null,"abstract":"Neural networks are a popular area of research today. However, neural network algorithms have only recently proven valuable to application problems. This paper seeks to aid in the process of transferring neural network technology from research to a development environment by describing our experience in applying this technology. The application studied here is Speaker Identity Verification (SIV), which is the task of verifying a speaker's identity by comparing the speaker's voice pattern to a stored template. In this paper, we describe the application of the back-propagation neural network algorithm to one aspect of the SIV problem, called Residual Compression (RC). The RC problem is to extract useful features from a part of the speech signal that was not utilized by previous SIV systems. Here, we describe a neural network architecture, pre-processing algorithm, training methodology, and empirical results for this problem. We also present a few guidelines for the use of neural networks in applied settings.","PeriodicalId":175812,"journal":{"name":"Proceedings of the 3rd international conference on Industrial and engineering applications of artificial intelligence and expert systems - Volume 2","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1990-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 3rd international conference on Industrial and engineering applications of artificial intelligence and expert systems - Volume 2","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/98894.99124","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Neural networks are a popular area of research today. However, neural network algorithms have only recently proven valuable to application problems. This paper seeks to aid in the process of transferring neural network technology from research to a development environment by describing our experience in applying this technology. The application studied here is Speaker Identity Verification (SIV), which is the task of verifying a speaker's identity by comparing the speaker's voice pattern to a stored template. In this paper, we describe the application of the back-propagation neural network algorithm to one aspect of the SIV problem, called Residual Compression (RC). The RC problem is to extract useful features from a part of the speech signal that was not utilized by previous SIV systems. Here, we describe a neural network architecture, pre-processing algorithm, training methodology, and empirical results for this problem. We also present a few guidelines for the use of neural networks in applied settings.
残差语音信号压缩:神经网络技术在实验中的实际应用
神经网络是当今研究的热门领域。然而,神经网络算法直到最近才被证明对应用问题有价值。本文试图通过描述我们在应用该技术方面的经验来帮助将神经网络技术从研究转移到开发环境的过程。这里研究的应用是说话人身份验证(SIV),它是通过将说话人的语音模式与存储的模板进行比较来验证说话人身份的任务。在本文中,我们描述了反向传播神经网络算法在SIV问题的一个方面的应用,称为残余压缩(RC)。RC问题是从以前的SIV系统未利用的部分语音信号中提取有用的特征。在这里,我们描述了一个神经网络架构,预处理算法,训练方法,并为这个问题的经验结果。我们还提出了在应用环境中使用神经网络的一些指导方针。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信