基于CYK算法的泰卢固语基本句识别与解析器

S. Varshini, Gottimukkala Sarayu Varma, S. M.
{"title":"基于CYK算法的泰卢固语基本句识别与解析器","authors":"S. Varshini, Gottimukkala Sarayu Varma, S. M.","doi":"10.1109/CONIT59222.2023.10205628","DOIUrl":null,"url":null,"abstract":"The scientific and technical field of computational linguistics seeks to comprehend spoken and written language from a computational standpoint. The way of describing rules and semantics in linguistics paved the beginning of natural language processing research for various languages spoken in the world. Over 700 languages are spoken in India alone, out of an estimated 7,000 spoken worldwide. Telugu is one of the most predominantly spoken languages in the states of Andhra Pradesh and Telangana. This proposed work presents a syntactical parsing technique on some basic sentences in Telugu. The Cocke-Younger-Kasami algorithm has been implemented to parse these basic sentences and also infer their grammatical structure. At present, there are very few language processing tools for Indian languages. Hence, an effort has been made to efficiently parse a few simple sentences in Telugu. The syntactical parser that has been developed acts as a recognizer and parser which can not only recognize and parse the grammatically correct sentences but can also recognize the grammatically incorrect sentences. This recognizer cum parser is then evaluated using the performance metrics like accuracy, precision and recall.","PeriodicalId":377623,"journal":{"name":"2023 3rd International Conference on Intelligent Technologies (CONIT)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"A Recognizer and Parser for Basic Sentences in Telugu using CYK Algorithm\",\"authors\":\"S. Varshini, Gottimukkala Sarayu Varma, S. M.\",\"doi\":\"10.1109/CONIT59222.2023.10205628\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The scientific and technical field of computational linguistics seeks to comprehend spoken and written language from a computational standpoint. The way of describing rules and semantics in linguistics paved the beginning of natural language processing research for various languages spoken in the world. Over 700 languages are spoken in India alone, out of an estimated 7,000 spoken worldwide. Telugu is one of the most predominantly spoken languages in the states of Andhra Pradesh and Telangana. This proposed work presents a syntactical parsing technique on some basic sentences in Telugu. The Cocke-Younger-Kasami algorithm has been implemented to parse these basic sentences and also infer their grammatical structure. At present, there are very few language processing tools for Indian languages. Hence, an effort has been made to efficiently parse a few simple sentences in Telugu. The syntactical parser that has been developed acts as a recognizer and parser which can not only recognize and parse the grammatically correct sentences but can also recognize the grammatically incorrect sentences. This recognizer cum parser is then evaluated using the performance metrics like accuracy, precision and recall.\",\"PeriodicalId\":377623,\"journal\":{\"name\":\"2023 3rd International Conference on Intelligent Technologies (CONIT)\",\"volume\":\"34 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 3rd International Conference on Intelligent Technologies (CONIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CONIT59222.2023.10205628\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 3rd International Conference on Intelligent Technologies (CONIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CONIT59222.2023.10205628","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

计算语言学的科学和技术领域试图从计算的角度理解口语和书面语。语言学中描述规则和语义的方法为世界上各种语言的自然语言处理研究奠定了基础。全世界大约有7000种语言,仅印度就有700多种语言。泰卢固语是安得拉邦和特伦加纳邦最主要使用的语言之一。本文提出了一种泰卢固语基本句子的句法分析技术。我们实现了Cocke-Younger-Kasami算法来解析这些基本句子并推断它们的语法结构。目前,针对印度语言的语言处理工具非常少。因此,人们努力有效地解析泰卢固语的几个简单句子。已经开发的语法解析器作为一个识别器和解析器,既可以识别和解析语法正确的句子,也可以识别语法错误的句子。然后使用准确度、精度和召回率等性能指标评估该识别器和解析器。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Recognizer and Parser for Basic Sentences in Telugu using CYK Algorithm
The scientific and technical field of computational linguistics seeks to comprehend spoken and written language from a computational standpoint. The way of describing rules and semantics in linguistics paved the beginning of natural language processing research for various languages spoken in the world. Over 700 languages are spoken in India alone, out of an estimated 7,000 spoken worldwide. Telugu is one of the most predominantly spoken languages in the states of Andhra Pradesh and Telangana. This proposed work presents a syntactical parsing technique on some basic sentences in Telugu. The Cocke-Younger-Kasami algorithm has been implemented to parse these basic sentences and also infer their grammatical structure. At present, there are very few language processing tools for Indian languages. Hence, an effort has been made to efficiently parse a few simple sentences in Telugu. The syntactical parser that has been developed acts as a recognizer and parser which can not only recognize and parse the grammatically correct sentences but can also recognize the grammatically incorrect sentences. This recognizer cum parser is then evaluated using the performance metrics like accuracy, precision and recall.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信