N. Barroso, K. L. D. Ipiña, Odei Barroso, A. Ezeiza, Unai Susperregi
2010 IEEE 15th Conference on Emerging Technologies & Factory Automation (ETFA 2010), published 2010-11-18. DOI: 10.1109/ETFA.2010.5641193 (https://doi.org/10.1109/ETFA.2010.5641193)
Language identification oriented to Multilingual Speech Recognition in the Basque context
Automatic Speech Recognition (ASR) is a broad research area that attracts considerable effort from the research community. Robust speech recognition systems can be applied to home, office, and business automation, to the monitoring of manufacturing processes, and to the automation of telephone or telecommunication services. The development of Multilingual Large Vocabulary Continuous Speech Recognition systems involves issues such as Language Identification, Acoustic Phonetic Decoding, Language Modelling, and the development of appropriate Language Resources. Interest in Multilingual Systems arises because there are three official languages in the Basque Country (Basque, Spanish, and French), and there is much linguistic interaction among them, even though Basque has very different roots from the other two languages. This paper describes the development of a Language Identification (LID) system oriented to robust Multilingual Speech Recognition for the Basque context. The work presents hybrid strategies for LID, based on the selection of system elements by Support Vector Machine and Multilayer Perceptron classifiers and on stochastic methods for speech recognition tasks (Hidden Markov Models and n-grams).
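To make the n-gram component of such an LID system concrete, the following is a minimal sketch and not the authors' implementation. It assumes that each utterance has already been turned into a symbol sequence, and it uses the characters of short text phrases as a stand-in for decoded phone sequences. One add-one-smoothed bigram model is trained per language, and the language whose model gives the highest log-likelihood is selected. The class `NGramLID`, the helper `build_toy_model`, the language codes, and the toy phrases are all hypothetical; the HMM decoding and the SVM/MLP selection stages mentioned in the abstract are not modelled.

```python
import math
from collections import Counter


def ngrams(seq, n=2):
    """Return the n-grams (as tuples) of a symbol sequence padded with boundary marks."""
    padded = ["<s>"] * (n - 1) + list(seq) + ["</s>"]
    return [tuple(padded[i:i + n]) for i in range(len(padded) - n + 1)]


class NGramLID:
    """Hypothetical per-language n-gram scorer (illustrative only)."""

    def __init__(self, n=2, alpha=1.0):
        self.n = n            # n-gram order
        self.alpha = alpha    # additive smoothing constant
        self.counts = {}      # language -> Counter of n-grams
        self.context = {}     # language -> Counter of (n-1)-gram contexts
        self.vocab = set()    # symbols seen in training, shared by all languages

    def train(self, lang, sequences):
        counts = self.counts.setdefault(lang, Counter())
        context = self.context.setdefault(lang, Counter())
        for seq in sequences:
            for gram in ngrams(seq, self.n):
                counts[gram] += 1
                context[gram[:-1]] += 1
                self.vocab.add(gram[-1])

    def log_likelihood(self, lang, seq):
        """Smoothed log-probability of seq under the model of one language."""
        counts, context = self.counts[lang], self.context[lang]
        v = len(self.vocab)
        return sum(
            math.log((counts[g] + self.alpha) / (context[g[:-1]] + self.alpha * v))
            for g in ngrams(seq, self.n)
        )

    def identify(self, seq):
        """Return the language whose model scores seq highest."""
        return max(self.counts, key=lambda lang: self.log_likelihood(lang, seq))


def build_toy_model():
    # Toy data (assumption): a few text phrases per language, not real corpora.
    model = NGramLID(n=2)
    model.train("eu", ["kaixo zer moduz", "eskerrik asko", "egun on"])
    model.train("es", ["hola que tal", "muchas gracias", "buenos dias"])
    model.train("fr", ["bonjour comment ca va", "merci beaucoup", "bonne journee"])
    return model


model = build_toy_model()
for phrase in ["eskerrik asko", "muchas gracias", "merci beaucoup"]:
    print(phrase, "->", model.identify(phrase))
```

With only three phrases per language, the scores mainly reflect which character pairs happen to appear in the toy data; a realistic system would train such models on phone sequences from large corpora and combine them with the other classifiers the abstract names.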