{"title":"Review of Features and Classification for Spoken Indian Language Recognition using Deep Learning and Machine Learning Techniques","authors":"Shreyasi Watve, M. Patil, Arun C. Shinde","doi":"10.1109/ESCI56872.2023.10099742","DOIUrl":null,"url":null,"abstract":"The most common and organic method of interpersonal communication is speech. Humanity has long aspired to creating a computer with human-like comprehension and communication abilities. To recognize language (words and phrases) using voice signals, multilingual countries like India must be taken into account. More research on speech has been demanded by specialists over the past ten years. Researchers require a specific database, or previously recorded collection of information, for that specific recognition system when they seek to construct it. There are several speech databases available for European languages, but only a small number for Indian languages. The several Speech Databases developed in various Indian languages for Text to Speech, Speaker Identification and Speech Identification systems are discussed in this article. To accurately identify the spoken language, first need to collect information from speech signal. In the initial step of the pre-processing phase, audio feature based approach were used, and then deep learning and machine learning classification methods. This survey will explore a variety of feature extraction methods as well as classification methods.","PeriodicalId":441215,"journal":{"name":"2023 International Conference on Emerging Smart Computing and Informatics (ESCI)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Emerging Smart Computing and Informatics (ESCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ESCI56872.2023.10099742","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The most common and organic method of interpersonal communication is speech. Humanity has long aspired to creating a computer with human-like comprehension and communication abilities. To recognize language (words and phrases) using voice signals, multilingual countries like India must be taken into account. More research on speech has been demanded by specialists over the past ten years. Researchers require a specific database, or previously recorded collection of information, for that specific recognition system when they seek to construct it. There are several speech databases available for European languages, but only a small number for Indian languages. The several Speech Databases developed in various Indian languages for Text to Speech, Speaker Identification and Speech Identification systems are discussed in this article. To accurately identify the spoken language, first need to collect information from speech signal. In the initial step of the pre-processing phase, audio feature based approach were used, and then deep learning and machine learning classification methods. This survey will explore a variety of feature extraction methods as well as classification methods.