{"title":"CV转换和稳定元音区域对语言识别的意义","authors":"Dipanjan Nandi, A. Dutta, K. S. Rao","doi":"10.1109/IC3.2014.6897226","DOIUrl":null,"url":null,"abstract":"The present work explores the significance of the consonant-vowel (CV) transition and steady vowel (SV) regions for language identification (LID) task. The language-specific vocal tract information represented by Mel-frequency cepstral coefficients (MFCCs), extracted from the CV transition and steady vowel regions for LID task. The duration of CV transition and steady vowel regions are varied to analyze LID performance. The evidences obtained from the CV transition and steady vowel regions are combined to investigate the existence of complementary information in these two regions. The LID study carried out on 27 Indian languages from IITKGP-MLILSC speech database. The Gaussian mixture modelling (GMM) technique has been used for developing the language models. The average LID performances obtained by processing CV transition region and steady vowel regions are 70% and 71% respectively. In contemporary works, LID system has been developed by processing whole speech utterances, which provides 72% recognition accuracy.","PeriodicalId":444918,"journal":{"name":"2014 Seventh International Conference on Contemporary Computing (IC3)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":"{\"title\":\"Significance of CV transition and steady vowel regions for language identification\",\"authors\":\"Dipanjan Nandi, A. Dutta, K. S. Rao\",\"doi\":\"10.1109/IC3.2014.6897226\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The present work explores the significance of the consonant-vowel (CV) transition and steady vowel (SV) regions for language identification (LID) task. The language-specific vocal tract information represented by Mel-frequency cepstral coefficients (MFCCs), extracted from the CV transition and steady vowel regions for LID task. The duration of CV transition and steady vowel regions are varied to analyze LID performance. The evidences obtained from the CV transition and steady vowel regions are combined to investigate the existence of complementary information in these two regions. The LID study carried out on 27 Indian languages from IITKGP-MLILSC speech database. The Gaussian mixture modelling (GMM) technique has been used for developing the language models. The average LID performances obtained by processing CV transition region and steady vowel regions are 70% and 71% respectively. In contemporary works, LID system has been developed by processing whole speech utterances, which provides 72% recognition accuracy.\",\"PeriodicalId\":444918,\"journal\":{\"name\":\"2014 Seventh International Conference on Contemporary Computing (IC3)\",\"volume\":\"35 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-09-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"11\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 Seventh International Conference on Contemporary Computing (IC3)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IC3.2014.6897226\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 Seventh International Conference on Contemporary Computing (IC3)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IC3.2014.6897226","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Significance of CV transition and steady vowel regions for language identification
The present work explores the significance of the consonant-vowel (CV) transition and steady vowel (SV) regions for language identification (LID) task. The language-specific vocal tract information represented by Mel-frequency cepstral coefficients (MFCCs), extracted from the CV transition and steady vowel regions for LID task. The duration of CV transition and steady vowel regions are varied to analyze LID performance. The evidences obtained from the CV transition and steady vowel regions are combined to investigate the existence of complementary information in these two regions. The LID study carried out on 27 Indian languages from IITKGP-MLILSC speech database. The Gaussian mixture modelling (GMM) technique has been used for developing the language models. The average LID performances obtained by processing CV transition region and steady vowel regions are 70% and 71% respectively. In contemporary works, LID system has been developed by processing whole speech utterances, which provides 72% recognition accuracy.