M. Shoaib, F. Rasheed, J. Akhtar, M. Awais, S. Masud, S. Shamail
{"title":"一种提高独立说话人阿拉伯语语音识别鲁棒性的新方法","authors":"M. Shoaib, F. Rasheed, J. Akhtar, M. Awais, S. Masud, S. Shamail","doi":"10.1109/INMIC.2003.1416753","DOIUrl":null,"url":null,"abstract":"This work presents a two-tier approach through sequential application of intensity contours and formant tracks for accurate Arabic phoneme identification. The recognition system developed is based on data sets of 40 speakers for each Arabic phonetic sound. As a first step towards recognition of phonemes, the sound is sampled and then preprocessed to get formant frequencies and intensity contours. In order to automate the intensity and formant based feature extraction, a generalized regression neural network has been implemented, trained and validated on 21 input features.","PeriodicalId":253329,"journal":{"name":"7th International Multi Topic Conference, 2003. INMIC 2003.","volume":"65 ","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2003-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":"{\"title\":\"A novel approach to increase the robustness of speaker independent Arabic speech recognition\",\"authors\":\"M. Shoaib, F. Rasheed, J. Akhtar, M. Awais, S. Masud, S. Shamail\",\"doi\":\"10.1109/INMIC.2003.1416753\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This work presents a two-tier approach through sequential application of intensity contours and formant tracks for accurate Arabic phoneme identification. The recognition system developed is based on data sets of 40 speakers for each Arabic phonetic sound. As a first step towards recognition of phonemes, the sound is sampled and then preprocessed to get formant frequencies and intensity contours. In order to automate the intensity and formant based feature extraction, a generalized regression neural network has been implemented, trained and validated on 21 input features.\",\"PeriodicalId\":253329,\"journal\":{\"name\":\"7th International Multi Topic Conference, 2003. INMIC 2003.\",\"volume\":\"65 \",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2003-12-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"11\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"7th International Multi Topic Conference, 2003. INMIC 2003.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/INMIC.2003.1416753\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"7th International Multi Topic Conference, 2003. INMIC 2003.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INMIC.2003.1416753","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A novel approach to increase the robustness of speaker independent Arabic speech recognition
This work presents a two-tier approach through sequential application of intensity contours and formant tracks for accurate Arabic phoneme identification. The recognition system developed is based on data sets of 40 speakers for each Arabic phonetic sound. As a first step towards recognition of phonemes, the sound is sampled and then preprocessed to get formant frequencies and intensity contours. In order to automate the intensity and formant based feature extraction, a generalized regression neural network has been implemented, trained and validated on 21 input features.