Charmi Jobanputra, Nihit Parikh, Vishwa Vora, S. Bharti
{"title":"Parts-of-Speech Tagger for Gujarati Language using Long-short-Term-Memory","authors":"Charmi Jobanputra, Nihit Parikh, Vishwa Vora, S. Bharti","doi":"10.1109/aimv53313.2021.9670996","DOIUrl":null,"url":null,"abstract":"Parts-of-Speech (POS) tagging is a crucial step to process the natural languages. It is a state-of-art method of providing the lexicon category such as noun, verb, adjective, etc. to each word that best suits the context of the sentence in which it is used. Being a part of pre-processing makes this task an important step in linguistics and semantics. Gujarati is an Indian language widely spoken in Asia and across the world. Part-of-Speech tagging can be used in word sense disambiguation, Information retrieval, machine translation and parsing. In this paper, we proposed Long-short-Term-Memory (LSTM) based Part-of-Speech tagger for Gujarati language. With our proposed approach, this paper envisions achieving accuracy of 95.34% and 96% precision with the help of this novel & efficient gradient based method.","PeriodicalId":135318,"journal":{"name":"2021 International Conference on Artificial Intelligence and Machine Vision (AIMV)","volume":"238 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Artificial Intelligence and Machine Vision (AIMV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/aimv53313.2021.9670996","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Parts-of-Speech (POS) tagging is a crucial step to process the natural languages. It is a state-of-art method of providing the lexicon category such as noun, verb, adjective, etc. to each word that best suits the context of the sentence in which it is used. Being a part of pre-processing makes this task an important step in linguistics and semantics. Gujarati is an Indian language widely spoken in Asia and across the world. Part-of-Speech tagging can be used in word sense disambiguation, Information retrieval, machine translation and parsing. In this paper, we proposed Long-short-Term-Memory (LSTM) based Part-of-Speech tagger for Gujarati language. With our proposed approach, this paper envisions achieving accuracy of 95.34% and 96% precision with the help of this novel & efficient gradient based method.