一个语言识别系统

Himadri Mukherjee, Ankita Dhar, S. Phadikar, K. Roy
{"title":"一个语言识别系统","authors":"Himadri Mukherjee, Ankita Dhar, S. Phadikar, K. Roy","doi":"10.1109/CSPC.2017.8305857","DOIUrl":null,"url":null,"abstract":"Since the inception of IT, one of the primary concerns has been to build devices with easy interactivity. Speech can be considered as one of the most preferred and easiest modes of interaction. Speech Recognition is the technique of automatically identifying spoken words from voice signals. Due to the multilingual nature of our country, we are habituated in using a mixture of languages in the course of verbal interaction and so, prior to recognizing speech, it is essential to determine the respective languages to which the spoken words belong. RECAL (Record Extract Classify According to Language) is a system, aimed towards identification of languages from multilingual voice signals. To start with, Mel Scale Cepstral Coefficient (MFCC) based features have been used to model languages using 9300 uttered numerals amidst 3 languages (English, Bangla and Hindi). An accuracy of 98.39% has been obtained considering the similarity between Bangla and Hindi numerals and avoidance of noise gating to simulate real world environment.","PeriodicalId":123773,"journal":{"name":"2017 International Conference on Signal Processing and Communication (ICSPC)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":"{\"title\":\"RECAL — A language identification system\",\"authors\":\"Himadri Mukherjee, Ankita Dhar, S. Phadikar, K. Roy\",\"doi\":\"10.1109/CSPC.2017.8305857\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Since the inception of IT, one of the primary concerns has been to build devices with easy interactivity. Speech can be considered as one of the most preferred and easiest modes of interaction. Speech Recognition is the technique of automatically identifying spoken words from voice signals. Due to the multilingual nature of our country, we are habituated in using a mixture of languages in the course of verbal interaction and so, prior to recognizing speech, it is essential to determine the respective languages to which the spoken words belong. RECAL (Record Extract Classify According to Language) is a system, aimed towards identification of languages from multilingual voice signals. To start with, Mel Scale Cepstral Coefficient (MFCC) based features have been used to model languages using 9300 uttered numerals amidst 3 languages (English, Bangla and Hindi). An accuracy of 98.39% has been obtained considering the similarity between Bangla and Hindi numerals and avoidance of noise gating to simulate real world environment.\",\"PeriodicalId\":123773,\"journal\":{\"name\":\"2017 International Conference on Signal Processing and Communication (ICSPC)\",\"volume\":\"50 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"14\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 International Conference on Signal Processing and Communication (ICSPC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSPC.2017.8305857\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Signal Processing and Communication (ICSPC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSPC.2017.8305857","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 14

摘要

自IT诞生以来,主要关注的问题之一就是构建具有易于交互性的设备。语音可以被认为是最受欢迎和最简单的交互方式之一。语音识别是一种从语音信号中自动识别口语单词的技术。由于我国的多语言性质,我们习惯于在口头互动过程中使用多种语言,因此,在识别语音之前,确定口语单词所属的各自语言是至关重要的。RECAL (Record Extract classified按语言分类)是一个旨在从多语言语音信号中识别语言的系统。首先,基于Mel Scale Cepstral Coefficient (MFCC)的特征被用于在3种语言(英语、孟加拉语和印地语)中使用9300个发音数字来建模语言。考虑到孟加拉语和印地语数字之间的相似性以及避免噪声门控以模拟现实世界环境,获得了98.39%的准确率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
RECAL — A language identification system
Since the inception of IT, one of the primary concerns has been to build devices with easy interactivity. Speech can be considered as one of the most preferred and easiest modes of interaction. Speech Recognition is the technique of automatically identifying spoken words from voice signals. Due to the multilingual nature of our country, we are habituated in using a mixture of languages in the course of verbal interaction and so, prior to recognizing speech, it is essential to determine the respective languages to which the spoken words belong. RECAL (Record Extract Classify According to Language) is a system, aimed towards identification of languages from multilingual voice signals. To start with, Mel Scale Cepstral Coefficient (MFCC) based features have been used to model languages using 9300 uttered numerals amidst 3 languages (English, Bangla and Hindi). An accuracy of 98.39% has been obtained considering the similarity between Bangla and Hindi numerals and avoidance of noise gating to simulate real world environment.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信