2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)最新文献

Multi-resolution spectral input for convolutional neural network-based speech recognition 基于卷积神经网络的多分辨率频谱输入语音识别

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) Pub Date : 2017-07-06 DOI: 10.1109/SPED.2017.7990430

L. Tóth

引用次数: 5

Towards a continuous speech corpus for banking domain automatic speech recognition 面向银行领域语音自动识别的连续语料库

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) Pub Date : 2017-07-06 DOI: 10.1109/SPED.2017.7990436

G. Suciu, Stefan-Adrian Toma, Romulus Cheveresan

引用次数: 1

Fast method for ENF database build and search ENF数据库的快速构建和搜索方法

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) Pub Date : 2017-07-06 DOI: 10.1109/SPED.2017.7990447

Gheorghe Pop, Dragos Draghicescu, D. Burileanu, H. Cucu, C. Burileanu

引用次数: 3

Semantics driven intelligent front-end 语义驱动的智能前端

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) Pub Date : 2017-07-06 DOI: 10.1109/SPED.2017.7990429

T. Gergely, Edit Halmay, Miklós Szöts, G. Suciu, Romulus Cheveresan

引用次数: 1

Speech recognition results for voice-controlled assistive applications 语音控制辅助应用程序的语音识别结果

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) Pub Date : 2017-07-06 DOI: 10.1109/SPED.2017.7990438

Alexandru Caranica, H. Cucu, C. Burileanu, François Portet, Michel Vacher

{"title":"Speech recognition results for voice-controlled assistive applications","authors":"Alexandru Caranica, H. Cucu, C. Burileanu, François Portet, Michel Vacher","doi":"10.1109/SPED.2017.7990438","DOIUrl":"https://doi.org/10.1109/SPED.2017.7990438","url":null,"abstract":"Until recently, controlling a “smart home” consisted in setting up a series of applications and automation tools: scheduling when the air conditioning system could cool the room, turn on the lighting system at sunset, or just use ones phone to control several TV appliances or the garage door. Recent advances in speech recognition technology have made voice-controlled smart homes attainable, and many companies and communities are providing interfaces or home boxes to make this voice control available. However, they lack customization ability, and interoperability with appliances or applications is not guaranteed. Moreover, most of these systems are not focused in supporting specific voice recognition scenarios, such as assistive applications for elder or disabled people or consider a triggered close talking voice interaction. Although state of the art speech processing has achieved great performance for most widely used languages, little to no efforts were made for under-resourced languages, such as Romanian. This paper focuses on a set of experiments in building a series of acoustic and grammar models for Romanian language, to be used in distant speech recognition scenarios, for voice driven speech applications in intelligent homes or buildings, using previously acquired speech databases in Romanian language, in real life conditions, by our research group.","PeriodicalId":345314,"journal":{"name":"2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121660484","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 19

Several classifiers for intruder detection applications 用于入侵者检测应用程序的几个分类器

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) Pub Date : 2017-07-06 DOI: 10.1109/SPED.2017.7990432

Elena Roxana Buhus, L. Grama, C. Rusu

引用次数: 8

Influences of age in emotion recognition of spontaneous speech: A case of an under-resourced language 年龄对自发言语情绪识别的影响:以资源不足语言为例

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) Pub Date : 2017-07-06 DOI: 10.1109/SPED.2017.7990448

N. Jamil, F. Apandi, Raseeda Hamzah

{"title":"Influences of age in emotion recognition of spontaneous speech: A case of an under-resourced language","authors":"N. Jamil, F. Apandi, Raseeda Hamzah","doi":"10.1109/SPED.2017.7990448","DOIUrl":"https://doi.org/10.1109/SPED.2017.7990448","url":null,"abstract":"Recognizing emotions using natural or spontaneous speech are extremely difficult compared to doing the same for acted or elicited speeches. Speech emotion recognition for real conversation such as spontaneous speech requires linguistic information of the speech to be included in the speech emotion recognition component to achieve a high recognition rate. However, with the lack of digital speech resources of an under-resourced language, this requirement poses a problem. In this paper, speech emotion recognition of spontaneous speech in Malay language using prosodic features and Random Forest classifier is presented. We also investigate the influence of age categorized as children, young adults and middle-aged on emotion recognition. Ninety spontaneous speech sentences from 30 native speakers of Malay language are collected and classified into three emotions, which are happy, angry and sad. Results show that the spontaneous speech of middle-aged group achieved the highest accuracy rate followed by children age group and finally the young adults. While sad emotions are recognized satisfactorily across all age groups, confusions exist between happy and angry emotions.","PeriodicalId":345314,"journal":{"name":"2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134274029","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6

The SWARA speech corpus: A large parallel Romanian read speech dataset SWARA语音语料库:一个大型并行罗马尼亚读语音数据集

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) Pub Date : 2017-07-01 DOI: 10.1109/SPED.2017.7990428

Adriana Stan, Florina Dinescu, C. Tiple, S. Meza, B. Orza, M. Chirilă, M. Giurgiu

引用次数: 21

Speech recognition in education: Voice geometry painter application 语音识别在教育:语音几何画家的应用

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) Pub Date : 2017-07-01 DOI: 10.1109/SPED.2017.7990446

Lucian-Petru Tuca, Adrian Iftene

引用次数: 6

Old geographical corpora: A methodology for interpretative transcription 古地理语料库:解释性抄写的方法论

2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) Pub Date : 2017-07-01 DOI: 10.1109/SPED.2017.7990445

Mihaela Plamada-Onofrei, Daniela Gîfu, Cecilia Bolea

引用次数: 1