Speaker-independent voiced-stop-consonant recognition using a block-windowed neural network architecture

B.D. Bryant, J. Gowdy
{"title":"Speaker-independent voiced-stop-consonant recognition using a block-windowed neural network architecture","authors":"B.D. Bryant, J. Gowdy","doi":"10.1109/SSST.1993.522811","DOIUrl":null,"url":null,"abstract":"The authors study several of the more well-known connectionist models, and how they address the time and frequency variability of the multispeaker, voiced-stop-consonant recognition task. Among the network architectures reviewed or tested for were the self-organizing feature maps (SOFM) architecture, various derivatives of this architecture, the time-delay neural network (TDNN) architecture, various derivatives of this architecture, and two frequency-and-time-shift-invariant architectures, frequency-shift-invariant TDNN, and the block-windowed neural network (FTDNN and BWNN). Voiced-stop speech was extracted from up to four dialect regions of the TIMIT continuous speech corpus for subsequent preprocessing and training and testing of network instances. Various feature representations were tested for their robustness in representing the voiced-stop consonants.","PeriodicalId":260036,"journal":{"name":"1993 (25th) Southeastern Symposium on System Theory","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1993-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"1993 (25th) Southeastern Symposium on System Theory","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SSST.1993.522811","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

The authors study several of the more well-known connectionist models, and how they address the time and frequency variability of the multispeaker, voiced-stop-consonant recognition task. Among the network architectures reviewed or tested for were the self-organizing feature maps (SOFM) architecture, various derivatives of this architecture, the time-delay neural network (TDNN) architecture, various derivatives of this architecture, and two frequency-and-time-shift-invariant architectures, frequency-shift-invariant TDNN, and the block-windowed neural network (FTDNN and BWNN). Voiced-stop speech was extracted from up to four dialect regions of the TIMIT continuous speech corpus for subsequent preprocessing and training and testing of network instances. Various feature representations were tested for their robustness in representing the voiced-stop consonants.
基于块窗口神经网络结构的独立于说话人的语音-停顿辅音识别
作者研究了几个更著名的连接主义模型,以及它们如何解决多说话者的时间和频率变化,语音-停止-辅音识别任务。在审查或测试的网络体系结构中,有自组织特征映射(SOFM)体系结构,该体系结构的各种衍生产品,时延神经网络(TDNN)体系结构,该体系结构的各种衍生产品,以及两种频率和时移不变体系结构,频移不变TDNN和块窗神经网络(FTDNN和BWNN)。从TIMIT连续语音语料库中提取多达四个方言区域的停顿语音,进行后续预处理和网络实例的训练和测试。测试了各种特征表征在表示顿音辅音方面的稳健性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信