Himadri Mukherjee, Ankita Dhar, S. Phadikar, K. Roy
{"title":"一个语言识别系统","authors":"Himadri Mukherjee, Ankita Dhar, S. Phadikar, K. Roy","doi":"10.1109/CSPC.2017.8305857","DOIUrl":null,"url":null,"abstract":"Since the inception of IT, one of the primary concerns has been to build devices with easy interactivity. Speech can be considered as one of the most preferred and easiest modes of interaction. Speech Recognition is the technique of automatically identifying spoken words from voice signals. Due to the multilingual nature of our country, we are habituated in using a mixture of languages in the course of verbal interaction and so, prior to recognizing speech, it is essential to determine the respective languages to which the spoken words belong. RECAL (Record Extract Classify According to Language) is a system, aimed towards identification of languages from multilingual voice signals. To start with, Mel Scale Cepstral Coefficient (MFCC) based features have been used to model languages using 9300 uttered numerals amidst 3 languages (English, Bangla and Hindi). An accuracy of 98.39% has been obtained considering the similarity between Bangla and Hindi numerals and avoidance of noise gating to simulate real world environment.","PeriodicalId":123773,"journal":{"name":"2017 International Conference on Signal Processing and Communication (ICSPC)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":"{\"title\":\"RECAL — A language identification system\",\"authors\":\"Himadri Mukherjee, Ankita Dhar, S. Phadikar, K. Roy\",\"doi\":\"10.1109/CSPC.2017.8305857\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Since the inception of IT, one of the primary concerns has been to build devices with easy interactivity. Speech can be considered as one of the most preferred and easiest modes of interaction. Speech Recognition is the technique of automatically identifying spoken words from voice signals. Due to the multilingual nature of our country, we are habituated in using a mixture of languages in the course of verbal interaction and so, prior to recognizing speech, it is essential to determine the respective languages to which the spoken words belong. RECAL (Record Extract Classify According to Language) is a system, aimed towards identification of languages from multilingual voice signals. To start with, Mel Scale Cepstral Coefficient (MFCC) based features have been used to model languages using 9300 uttered numerals amidst 3 languages (English, Bangla and Hindi). An accuracy of 98.39% has been obtained considering the similarity between Bangla and Hindi numerals and avoidance of noise gating to simulate real world environment.\",\"PeriodicalId\":123773,\"journal\":{\"name\":\"2017 International Conference on Signal Processing and Communication (ICSPC)\",\"volume\":\"50 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"14\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 International Conference on Signal Processing and Communication (ICSPC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSPC.2017.8305857\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Signal Processing and Communication (ICSPC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSPC.2017.8305857","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Since the inception of IT, one of the primary concerns has been to build devices with easy interactivity. Speech can be considered as one of the most preferred and easiest modes of interaction. Speech Recognition is the technique of automatically identifying spoken words from voice signals. Due to the multilingual nature of our country, we are habituated in using a mixture of languages in the course of verbal interaction and so, prior to recognizing speech, it is essential to determine the respective languages to which the spoken words belong. RECAL (Record Extract Classify According to Language) is a system, aimed towards identification of languages from multilingual voice signals. To start with, Mel Scale Cepstral Coefficient (MFCC) based features have been used to model languages using 9300 uttered numerals amidst 3 languages (English, Bangla and Hindi). An accuracy of 98.39% has been obtained considering the similarity between Bangla and Hindi numerals and avoidance of noise gating to simulate real world environment.