From Natural to Artificial Intelligence - Algorithms and Applications最新文献

Convolutional Neural Networks for Raw Speech Recognition 原始语音识别的卷积神经网络

From Natural to Artificial Intelligence - Algorithms and Applications Pub Date : 2018-12-12 DOI: 10.5772/INTECHOPEN.80026

Vishal Passricha, R. Aggarwal

{"title":"Convolutional Neural Networks for Raw Speech Recognition","authors":"Vishal Passricha, R. Aggarwal","doi":"10.5772/INTECHOPEN.80026","DOIUrl":"https://doi.org/10.5772/INTECHOPEN.80026","url":null,"abstract":"State-of-the-art automatic speech recognition (ASR) systems map the speech signal into its corresponding text. Traditional ASR systems are based on Gaussian mixture model. The emergence of deep learning drastically improved the recognition rate of ASR systems. Such systems are replacing traditional ASR systems. These systems can also be trained in end-to-end manner. End-to-end ASR systems are gaining much popularity due to simpli- fied model-building process and abilities to directly map speech into the text without any predefined alignments. Three major types of end-to-end architectures for ASR are atten- tion-based methods, connectionist temporal classification, and convolutional neural network (CNN)-based direct raw speech model. In this chapter, CNN-based acoustic model for raw speech signal is discussed. It establishes the relation between raw speech signal and phones in a data-driven manner. Relevant features and classifier both are jointly learned from the raw speech. Raw speech is processed by first convolutional layer to learn the feature representation. The output of first convolutional layer, that is, intermediate representation, is more discriminative and further processed by rest convolutional layers. This system uses only few parameters and performs better than traditional cepstral fea- ture-based systems. The performance of the system is evaluated for TIMIT and claimed similar performance as MFCC.","PeriodicalId":289041,"journal":{"name":"From Natural to Artificial Intelligence - Algorithms and Applications","volume":"166 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114732709","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 22

Hard, firm, soft … Etherealware:Computing by Temporal Order of Clocking 硬的，坚固的，软的……以太坊:按时间顺序计算

From Natural to Artificial Intelligence - Algorithms and Applications Pub Date : 2018-11-05 DOI: 10.5772/INTECHOPEN.80432

M. Vielhaber

引用次数: 0

Evaluation between Virtual Acoustic Model and Real Acoustic Scenarios for Urban Representation 城市表征的虚拟声学模型与真实声学场景的评价

From Natural to Artificial Intelligence - Algorithms and Applications Pub Date : 2018-11-05 DOI: 10.5772/INTECHOPEN.78330

Josep Llorca, Héctor Zapata, J. Alba, E. Redondo, D. Fonseca

引用次数: 2

Face Recognition Based on Texture Descriptors 基于纹理描述符的人脸识别

From Natural to Artificial Intelligence - Algorithms and Applications Pub Date : 2018-11-05 DOI: 10.5772/INTECHOPEN.76722

J. Olivares-Mercado, K. Toscano-Medina, G. Sánchez-Pérez, M. Miyatake, H. Perez-Meana, L. C. Castro-Madrid

引用次数: 0

Local Patterns for Face Recognition 人脸识别的局部模式

From Natural to Artificial Intelligence - Algorithms and Applications Pub Date : 2018-11-05 DOI: 10.5772/INTECHOPEN.76571

Chih-Wei Lin

引用次数: 1

Learning Algorithms for Fuzzy Inference Systems Using Vector Quantization 基于矢量量化的模糊推理系统学习算法

From Natural to Artificial Intelligence - Algorithms and Applications Pub Date : 2018-11-05 DOI: 10.5772/INTECHOPEN.79925

H. Miyajima, Noritaka Shigei, H. Miyajima

引用次数: 2

Cellular Automata and Randomization: A Structural Overview 元胞自动机与随机化:结构概述

From Natural to Artificial Intelligence - Algorithms and Applications Pub Date : 2018-11-05 DOI: 10.5772/INTECHOPEN.79812

M. Dascalu

引用次数: 1