2015 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)最新文献

Palestinian Arabic regional accent recognition 巴勒斯坦阿拉伯地区口音识别

2015 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) Pub Date : 2015-12-03 DOI: 10.1109/SPED.2015.7343088

Abualsoud Hanani, H. Basha, Y. Sharaf, Stephen Eugene Taylor

引用次数: 10

Quantization effects on audio signals for detecting intruders in wild areas using TESPAR S-matrix and artificial neural networks 基于TESPAR s -矩阵和人工神经网络的野外入侵检测音频信号量化效应

2015 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) Pub Date : 2015-12-03 DOI: 10.1109/SPED.2015.7343079

L. Grama, C. Rusu, G. Oltean, L. Ivanciu

引用次数: 2

Spectrograms, sparsograms and spectral signatures for wildlife intruder detection 野生动物入侵者检测的光谱图、稀疏图和光谱特征

2015 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) Pub Date : 2015-12-03 DOI: 10.1109/SPED.2015.7343103

C. Rusu, L. Grama

引用次数: 6

Phonetic segmentation of speech using STEP and t-SNE 使用STEP和t-SNE进行语音的语音分割

2015 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) Pub Date : 2015-12-03 DOI: 10.1109/SPED.2015.7343105

Adriana Stan, Cassia Valentini-Botinhao, M. Giurgiu, Simon King

引用次数: 3

Evaluation of the generative and discriminative text-independent speaker verification approaches on handheld devices 手持设备上生成和判别文本无关说话人验证方法的评价

2015 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) Pub Date : 2015-12-03 DOI: 10.1109/SPED.2015.7343091

Florin Curelaru

{"title":"Evaluation of the generative and discriminative text-independent speaker verification approaches on handheld devices","authors":"Florin Curelaru","doi":"10.1109/SPED.2015.7343091","DOIUrl":"https://doi.org/10.1109/SPED.2015.7343091","url":null,"abstract":"This paper takes advantage of the “MIT Mobile Device Speaker Verification Corpus” (MIT-MDSVC) availability in order to evaluate the performance of three well known text-independent speaker verification approaches on handheld devices, considering the MIT-MDSVC as a representative corpus designed for robust speaker verification tasks on limited vocabulary and limited amount of training data collected on handheld devices. Several experiments with either mismatched testing conditions, or with samples collected from multiple test conditions were conducted for evaluating both text-independent approaches: generative (based on Gaussian Mixture Models) and discriminative (based on Support Vector Machines with Fisher kernel and GMM Supervector Linear kernel), without using the transcription of the utterances or knowledge about the acoustic conditions of the recordings (environment and microphone). An equal error rate less than 3% was achieved using Gaussian Mixture Models, and a slightly greater equal error rate (less than 3.5%) was achieved using Support Vector Machines with Fisher kernel and with GMM Supervector Linear kernel, against any possible acoustic conditions.","PeriodicalId":426074,"journal":{"name":"2015 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)","volume":"125 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133306720","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Speech database acquisition for assisted living environment applications 辅助生活环境应用的语音数据库采集

2015 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) Pub Date : 2015-12-03 DOI: 10.1109/SPED.2015.7343083

Mihai Dogariu, H. Cucu, Andi Buzo, D. Burileanu, O. Fratu

{"title":"Speech database acquisition for assisted living environment applications","authors":"Mihai Dogariu, H. Cucu, Andi Buzo, D. Burileanu, O. Fratu","doi":"10.1109/SPED.2015.7343083","DOIUrl":"https://doi.org/10.1109/SPED.2015.7343083","url":null,"abstract":"Home automation has become a subject of increasing interest for both industry and research as there is an increase in the awareness of such systems and their benefits can be easily seen. The new trend is to develop smart homes where commands can be given by speech. This way of communication, besides being the most natural, has the advantage of offering flexibility to the users especially when they have limited motion capabilities. As for widely used languages the state of the art has achieved an important level of performance, little efforts are made with the Romanian language. The main reason for this is the lack of an annotated speech database from real life conditions. This paper focuses on the methodology of acquiring four different speech corpora with various end-user scenarios in mind. The commands corpus is meant to be used in home automation development, the cough corpus is meant to help research in detecting distress situations, the spontaneous speech corpus will aid in distant speech recognition applications and the multi-room, multi-person, multi-language corpus can be used for research in speaker detection and identification. All these were recorded in the context of a completely automated and functional smart home. The small number of such environments available to the public makes these corpora valuable from experimental point of view.","PeriodicalId":426074,"journal":{"name":"2015 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133371144","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Estimating competing speaker count for blind speech source separation 盲语音源分离中竞争说话人数估计

2015 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) Pub Date : 2015-12-03 DOI: 10.1109/SPED.2015.7343081

Valentin Andrei, H. Cucu, Andi Buzo, C. Burileanu

引用次数: 2

Sound event recognition in smart environments 智能环境中的声音事件识别

2015 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) Pub Date : 2015-12-03 DOI: 10.1109/SPED.2015.7343087

Gheorghe Pop, Alexandru Caranica, H. Cucu, D. Burileanu

引用次数: 2

Achievements in the field of voice synthesis for Romanian 罗马尼亚语语音合成领域的成就

2015 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) Pub Date : 2015-12-03 DOI: 10.1109/SPED.2015.7343078

G. Toderean, O. Buza, J. Domokos

引用次数: 2

Methods for automatic generation of GRAALAN-based phonetic databases 基于graalan的语音数据库自动生成方法

2015 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) Pub Date : 2015-12-03 DOI: 10.1109/SPED.2015.7343082

S. Diaconescu, Monica-Mihaela Rizea, Felicia-Carmen Codirlasu, M. Ionescu, Monica Radulescu, A. Minca, Stefan Fulea

引用次数: 2