{"title":"A Recognizer and Parser for Basic Sentences in Telugu using CYK Algorithm","authors":"S. Varshini, Gottimukkala Sarayu Varma, S. M.","doi":"10.1109/CONIT59222.2023.10205628","DOIUrl":null,"url":null,"abstract":"The scientific and technical field of computational linguistics seeks to comprehend spoken and written language from a computational standpoint. The way of describing rules and semantics in linguistics paved the beginning of natural language processing research for various languages spoken in the world. Over 700 languages are spoken in India alone, out of an estimated 7,000 spoken worldwide. Telugu is one of the most predominantly spoken languages in the states of Andhra Pradesh and Telangana. This proposed work presents a syntactical parsing technique on some basic sentences in Telugu. The Cocke-Younger-Kasami algorithm has been implemented to parse these basic sentences and also infer their grammatical structure. At present, there are very few language processing tools for Indian languages. Hence, an effort has been made to efficiently parse a few simple sentences in Telugu. The syntactical parser that has been developed acts as a recognizer and parser which can not only recognize and parse the grammatically correct sentences but can also recognize the grammatically incorrect sentences. This recognizer cum parser is then evaluated using the performance metrics like accuracy, precision and recall.","PeriodicalId":377623,"journal":{"name":"2023 3rd International Conference on Intelligent Technologies (CONIT)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 3rd International Conference on Intelligent Technologies (CONIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CONIT59222.2023.10205628","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
The scientific and technical field of computational linguistics seeks to comprehend spoken and written language from a computational standpoint. The way of describing rules and semantics in linguistics paved the beginning of natural language processing research for various languages spoken in the world. Over 700 languages are spoken in India alone, out of an estimated 7,000 spoken worldwide. Telugu is one of the most predominantly spoken languages in the states of Andhra Pradesh and Telangana. This proposed work presents a syntactical parsing technique on some basic sentences in Telugu. The Cocke-Younger-Kasami algorithm has been implemented to parse these basic sentences and also infer their grammatical structure. At present, there are very few language processing tools for Indian languages. Hence, an effort has been made to efficiently parse a few simple sentences in Telugu. The syntactical parser that has been developed acts as a recognizer and parser which can not only recognize and parse the grammatically correct sentences but can also recognize the grammatically incorrect sentences. This recognizer cum parser is then evaluated using the performance metrics like accuracy, precision and recall.