Analysis Of Inflectional Behaviour In Indian Languages Using Features Extraction Techniques
Bhairab Sarma, C. Nath
2023 International Conference on Advancement in Computation & Computer Technologies (InCACCT), 2023-05-05
DOI: 10.1109/InCACCT57535.2023.10141783
Citations: 0
Abstract
Compared to English, Indian languages are highly inflectional. Feature extraction is challenging for Indian languages for several reasons. First, their alphabetic units, called ‘Akshara’, are encoded in Unicode, whereas English text can be encoded in ASCII. Second, inflection changes the structure of the root word: words are modified according to the features added for tense, aspect, and modality. Third, composite characters, called ‘yuktAkshara’, are affected by added vowels or consonants. With these three aspects in view, this paper addresses some practical difficulties in stemming root words, supported by experimental reviews. Feature extraction is used in hidden information retrieval, root word stemming, text-to-speech conversion, and semantic analysis in Natural Language Processing. By analyzing the features of an inflected word in a Unicode-encoded language, one can recover the semantic and pragmatic meaning of the written text, which can in turn be used in text-to-speech conversion. This paper discusses various feature extraction techniques and their applications in some Indian languages.
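As a small illustration of the first and third points, the Python sketch below shows how a single composite character is stored as several Unicode code points. It is not taken from the paper: the choice of Devanagari, the sample conjunct (KA + VIRAMA + SSA), and the helper name `describe` are all assumptions made for the example.

```python
import unicodedata


def describe(text):
    """Return (code point, Unicode name, general category) for each character.

    Hypothetical helper for illustration; not part of the paper.
    """
    return [
        (f"U+{ord(ch):04X}", unicodedata.name(ch), unicodedata.category(ch))
        for ch in text
    ]


# Assumed sample: one Devanagari conjunct built from KA + VIRAMA + SSA.
conjunct = "\u0915\u094D\u0937"

# One visible composite character, but three code points underneath.
for row in describe(conjunct):
    print(row)

# Each of these code points takes 3 bytes in UTF-8, whereas an ASCII
# letter such as "k" takes 1 byte.
print(len(conjunct), len(conjunct.encode("utf-8")), len("k".encode("ascii")))
```

A stemmer or feature extractor that indexes such text by code point, as Python strings do, therefore has to take care not to split a conjunct in the middle when stripping an inflectional suffix.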