使用同义词集关系的泰卢固语信息检索

Kolikipogu Ramakrishna, P. Rani, D. Subrahmanyam
{"title":"使用同义词集关系的泰卢固语信息检索","authors":"Kolikipogu Ramakrishna, P. Rani, D. Subrahmanyam","doi":"10.1109/ICACT.2013.6710540","DOIUrl":null,"url":null,"abstract":"Information Technology brought many applications of Information Retrieval as simple as possible through Web and other Digital Information Access Environment. There is a demand in building Applications of IR in Local languages, which allows common people with minimal knowledge in at least one language to avail the information services. Word mismatch is a common problem in IR System Applications. The research is started to overcome this problem in 4 decades back and still the problem is unsolved. The methods were moving from word replacement to knowledge replacement to solve word mismatch and concept mismatches. This paper mainly explored towards solving word mismatch problem in India language context particularly Telugu language. IR in Telugu Language is in inception phase and need huge research to solve many issues in information processing and retrieval. Cross Lingual IR is quite comfortable stage with parallel rule conversion techniques, where as it is difficult to process Telugu alone and build Information Retrieval System. Telugu is morphologically rich with high conflational rate. In this paper we build IR in Telugu Language and extended to solve vocabulary mismatch problem using Synset relationships. When compared to base Retrieval system, expanded search yield offered results. This paper identified many issues related to topical search and given direction to solve with base proof with implementation results. We found improvement in Recall and precision with Expanded Search with Non-Expanded Retrieval.","PeriodicalId":302640,"journal":{"name":"2013 15th International Conference on Advanced Computing Technologies (ICACT)","volume":"170 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Information retrieval in Telugu language using Synset relationships\",\"authors\":\"Kolikipogu Ramakrishna, P. Rani, D. Subrahmanyam\",\"doi\":\"10.1109/ICACT.2013.6710540\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Information Technology brought many applications of Information Retrieval as simple as possible through Web and other Digital Information Access Environment. There is a demand in building Applications of IR in Local languages, which allows common people with minimal knowledge in at least one language to avail the information services. Word mismatch is a common problem in IR System Applications. The research is started to overcome this problem in 4 decades back and still the problem is unsolved. The methods were moving from word replacement to knowledge replacement to solve word mismatch and concept mismatches. This paper mainly explored towards solving word mismatch problem in India language context particularly Telugu language. IR in Telugu Language is in inception phase and need huge research to solve many issues in information processing and retrieval. Cross Lingual IR is quite comfortable stage with parallel rule conversion techniques, where as it is difficult to process Telugu alone and build Information Retrieval System. Telugu is morphologically rich with high conflational rate. In this paper we build IR in Telugu Language and extended to solve vocabulary mismatch problem using Synset relationships. When compared to base Retrieval system, expanded search yield offered results. This paper identified many issues related to topical search and given direction to solve with base proof with implementation results. We found improvement in Recall and precision with Expanded Search with Non-Expanded Retrieval.\",\"PeriodicalId\":302640,\"journal\":{\"name\":\"2013 15th International Conference on Advanced Computing Technologies (ICACT)\",\"volume\":\"170 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 15th International Conference on Advanced Computing Technologies (ICACT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICACT.2013.6710540\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 15th International Conference on Advanced Computing Technologies (ICACT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICACT.2013.6710540","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

信息技术通过网络和其他数字信息访问环境,使信息检索尽可能简单,带来了许多应用。需要建立本地语言的IR应用程序,使对至少一种语言具有最低知识的普通人能够利用信息服务。单词不匹配是红外系统应用中常见的问题。早在40年前就开始了解决这一问题的研究,但至今仍未得到解决。解决词错配和概念错配的方法由词替换向知识替换发展。本文主要探讨了如何解决印度语言语境中的词错配问题,特别是泰卢固语。泰卢固语信息检索还处于起步阶段,需要大量的研究来解决信息处理和检索中的许多问题。使用平行规则转换技术,跨语言IR是相当舒适的阶段,因为很难单独处理泰卢固语并建立信息检索系统。泰卢固语形态丰富,通膨率高。本文建立了泰卢固语的语义关系,并将其扩展到使用同义词集关系来解决词汇不匹配问题。与基础检索系统相比,扩展后的搜索产生了提供的结果。本文确定了与主题搜索相关的许多问题,并给出了解决的方向,并给出了基础证明和实施结果。我们发现扩展搜索和非扩展检索在查全率和查准率上都有提高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Information retrieval in Telugu language using Synset relationships
Information Technology brought many applications of Information Retrieval as simple as possible through Web and other Digital Information Access Environment. There is a demand in building Applications of IR in Local languages, which allows common people with minimal knowledge in at least one language to avail the information services. Word mismatch is a common problem in IR System Applications. The research is started to overcome this problem in 4 decades back and still the problem is unsolved. The methods were moving from word replacement to knowledge replacement to solve word mismatch and concept mismatches. This paper mainly explored towards solving word mismatch problem in India language context particularly Telugu language. IR in Telugu Language is in inception phase and need huge research to solve many issues in information processing and retrieval. Cross Lingual IR is quite comfortable stage with parallel rule conversion techniques, where as it is difficult to process Telugu alone and build Information Retrieval System. Telugu is morphologically rich with high conflational rate. In this paper we build IR in Telugu Language and extended to solve vocabulary mismatch problem using Synset relationships. When compared to base Retrieval system, expanded search yield offered results. This paper identified many issues related to topical search and given direction to solve with base proof with implementation results. We found improvement in Recall and precision with Expanded Search with Non-Expanded Retrieval.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信