IEICE technical report. Natural language understanding and models of communication最新文献

筛选
英文 中文
CENSREC2: corpus and evaluation environments for in car continuous digit speech recognition CENSREC2:汽车连续数字语音识别的语料库和评估环境
IEICE technical report. Natural language understanding and models of communication Pub Date : 2005-12-15 DOI: 10.21437/Interspeech.2006-99
Satoshi Nakamura, M. Fujimoto, K. Takeda
{"title":"CENSREC2: corpus and evaluation environments for in car continuous digit speech recognition","authors":"Satoshi Nakamura, M. Fujimoto, K. Takeda","doi":"10.21437/Interspeech.2006-99","DOIUrl":"https://doi.org/10.21437/Interspeech.2006-99","url":null,"abstract":"Abstract This paper introduces a common database and an evaluationframework for connected digit speech recognition in real drivingcar environments, CENSREC-2, as an outcome of IPSJ-SIG SLPNoisy Speech Recognition Evaluation Working Group. Speechdata of CENSREC-2 was collected using two microphones, aclose-talking microphone and a hands-free microphone, underthree car speeds and four car conditions. CENSREC-2 providesfour evaluation environments which are designed using speechdata collected in these car conditions. Index Terms : noisy speech recognition, common evaluationframework, in-car speech corpus. 1. Introduction Recently, the progress of speech recognition technology has beenbrought about by the advent of statistical approaches and large-scale corpora. Furthermore, it is also widely known that progresshas been accelerated by the U.S. DARPA projects [1] initiated inthe late ’80s in terms of project participants competitively develop-ing speech recognition systems for the same task, using the sametraining and test corpus.However, current speech recognition performance must stillbe improved if the system is to be exposed to noisy environments,where speech recognition applications might be used in practice.Therefore, noise robustness is an emerging and crucial factor to besolved for speech recognition techniques.With regard to the noise robustness problem, there have beentwo major evaluation projects, SPINE1, 2 [2] and AURORA [3]-[9]. The SPINE (SPeech recognition In Noisy Environments)project was organized by the U.S.’s DARPA, with SPINE1 in 2000and SPINE2 in 2001. On the other hand, the AURORA was or-ganized by the European Telecommunications Standards Institute(ETSI) [10] AURORA group. To date, AURORA2 [3](a con-nected digit corpus with additive noise), AURORA3 [4]-[7] (an in-car noisy digit corpus), and AURORA4 [8, 9] (a large-vocabularycontinuous-speech recognition corpus with additive noise) havebeen distributed with HTK (HMM Took Kit) [11] scripts, whichcan be used to obtain baseline performance[12].The authors voluntarily organized a special working group inOctober 2001 under the auspices of the Information ProcessingSociety of Japan in order to assess speech recognition technol-ogy in noisy environments. The focus of the working group in-cluded the planning of comprehensive fundamental assessments ofnoisy speech recognition, standardized corpus collection, evalua-tion strategy developments, and distribution of standardized pro-cessing modules. As an outcome of working group, we havealready produced the Japanese AURORA-2, AURORA-2J [13],which comprises the English digits translated into Japanese. Wehave also produced CENSREC-3 (Corpora and Environments forNoisy Speech RECognition) [14], our original evaluation frame-work CENSREC-3 is designed as the evaluation framework ofisolated word recognition in real driving car environments. Themain target application of CENSREC-3 is human voice (hands-free) control of car navigation systems. Thus","PeriodicalId":290291,"journal":{"name":"IEICE technical report. Natural language understanding and models of communication","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134439859","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 23
Efficient Multidimensional Indexing Using One-dimensional Self-Organizing Maps 使用一维自组织映射的高效多维索引
K. Kita, M. Shishibori
{"title":"Efficient Multidimensional Indexing Using One-dimensional Self-Organizing Maps","authors":"K. Kita, M. Shishibori","doi":"10.5715/JNLP.10.5_41","DOIUrl":"https://doi.org/10.5715/JNLP.10.5_41","url":null,"abstract":"高次元空間における最近傍検索 (nearest neighbor search) は, マルチメディア・コンテンツ検索, データ・マイニング, パターン認識等の分野における重要な研究課題の1つである. 高次元空間では, ある点の最近点と最遠点との問に距離的な差が生じなくなるという現象が起こるため, 効率的な多次元インデキシング手法を設計することが極度に困難となる. 本稿では, 1次元自己組織化マップを用いた近似的最近傍検索の手法を提案し, 提案した手法の有効性を類似画像検索と文書検索の2種類の実験により評価する. 自己組織化マップを用いて, 高次元空間での近傍関係をできる限り保ちつつ, 高次元データを1次元空間へ配置し, 1次元マップから得られる情報で探索範囲を限定することにより, きわめて高速な最近傍検索が可能となる.","PeriodicalId":290291,"journal":{"name":"IEICE technical report. Natural language understanding and models of communication","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130465092","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书