使用同义词集关系的泰卢固语信息检索

2013 15th International Conference on Advanced Computing Technologies (ICACT) Pub Date : 2013-09-01 DOI:10.1109/ICACT.2013.6710540

Kolikipogu Ramakrishna, P. Rani, D. Subrahmanyam

{"title":"使用同义词集关系的泰卢固语信息检索","authors":"Kolikipogu Ramakrishna, P. Rani, D. Subrahmanyam","doi":"10.1109/ICACT.2013.6710540","DOIUrl":null,"url":null,"abstract":"Information Technology brought many applications of Information Retrieval as simple as possible through Web and other Digital Information Access Environment. There is a demand in building Applications of IR in Local languages, which allows common people with minimal knowledge in at least one language to avail the information services. Word mismatch is a common problem in IR System Applications. The research is started to overcome this problem in 4 decades back and still the problem is unsolved. The methods were moving from word replacement to knowledge replacement to solve word mismatch and concept mismatches. This paper mainly explored towards solving word mismatch problem in India language context particularly Telugu language. IR in Telugu Language is in inception phase and need huge research to solve many issues in information processing and retrieval. Cross Lingual IR is quite comfortable stage with parallel rule conversion techniques, where as it is difficult to process Telugu alone and build Information Retrieval System. Telugu is morphologically rich with high conflational rate. In this paper we build IR in Telugu Language and extended to solve vocabulary mismatch problem using Synset relationships. When compared to base Retrieval system, expanded search yield offered results. This paper identified many issues related to topical search and given direction to solve with base proof with implementation results. We found improvement in Recall and precision with Expanded Search with Non-Expanded Retrieval.","PeriodicalId":302640,"journal":{"name":"2013 15th International Conference on Advanced Computing Technologies (ICACT)","volume":"170 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Information retrieval in Telugu language using Synset relationships\",\"authors\":\"Kolikipogu Ramakrishna, P. Rani, D. Subrahmanyam\",\"doi\":\"10.1109/ICACT.2013.6710540\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Information Technology brought many applications of Information Retrieval as simple as possible through Web and other Digital Information Access Environment. There is a demand in building Applications of IR in Local languages, which allows common people with minimal knowledge in at least one language to avail the information services. Word mismatch is a common problem in IR System Applications. The research is started to overcome this problem in 4 decades back and still the problem is unsolved. The methods were moving from word replacement to knowledge replacement to solve word mismatch and concept mismatches. This paper mainly explored towards solving word mismatch problem in India language context particularly Telugu language. IR in Telugu Language is in inception phase and need huge research to solve many issues in information processing and retrieval. Cross Lingual IR is quite comfortable stage with parallel rule conversion techniques, where as it is difficult to process Telugu alone and build Information Retrieval System. Telugu is morphologically rich with high conflational rate. In this paper we build IR in Telugu Language and extended to solve vocabulary mismatch problem using Synset relationships. When compared to base Retrieval system, expanded search yield offered results. This paper identified many issues related to topical search and given direction to solve with base proof with implementation results. We found improvement in Recall and precision with Expanded Search with Non-Expanded Retrieval.\",\"PeriodicalId\":302640,\"journal\":{\"name\":\"2013 15th International Conference on Advanced Computing Technologies (ICACT)\",\"volume\":\"170 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 15th International Conference on Advanced Computing Technologies (ICACT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICACT.2013.6710540\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 15th International Conference on Advanced Computing Technologies (ICACT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICACT.2013.6710540","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

信息技术通过网络和其他数字信息访问环境，使信息检索尽可能简单，带来了许多应用。需要建立本地语言的IR应用程序，使对至少一种语言具有最低知识的普通人能够利用信息服务。单词不匹配是红外系统应用中常见的问题。早在40年前就开始了解决这一问题的研究，但至今仍未得到解决。解决词错配和概念错配的方法由词替换向知识替换发展。本文主要探讨了如何解决印度语言语境中的词错配问题，特别是泰卢固语。泰卢固语信息检索还处于起步阶段，需要大量的研究来解决信息处理和检索中的许多问题。使用平行规则转换技术，跨语言IR是相当舒适的阶段，因为很难单独处理泰卢固语并建立信息检索系统。泰卢固语形态丰富，通膨率高。本文建立了泰卢固语的语义关系，并将其扩展到使用同义词集关系来解决词汇不匹配问题。与基础检索系统相比，扩展后的搜索产生了提供的结果。本文确定了与主题搜索相关的许多问题，并给出了解决的方向，并给出了基础证明和实施结果。我们发现扩展搜索和非扩展检索在查全率和查准率上都有提高。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Information retrieval in Telugu language using Synset relationships

Information Technology brought many applications of Information Retrieval as simple as possible through Web and other Digital Information Access Environment. There is a demand in building Applications of IR in Local languages, which allows common people with minimal knowledge in at least one language to avail the information services. Word mismatch is a common problem in IR System Applications. The research is started to overcome this problem in 4 decades back and still the problem is unsolved. The methods were moving from word replacement to knowledge replacement to solve word mismatch and concept mismatches. This paper mainly explored towards solving word mismatch problem in India language context particularly Telugu language. IR in Telugu Language is in inception phase and need huge research to solve many issues in information processing and retrieval. Cross Lingual IR is quite comfortable stage with parallel rule conversion techniques, where as it is difficult to process Telugu alone and build Information Retrieval System. Telugu is morphologically rich with high conflational rate. In this paper we build IR in Telugu Language and extended to solve vocabulary mismatch problem using Synset relationships. When compared to base Retrieval system, expanded search yield offered results. This paper identified many issues related to topical search and given direction to solve with base proof with implementation results. We found improvement in Recall and precision with Expanded Search with Non-Expanded Retrieval.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2013 15th International Conference on Advanced Computing Technologies (ICACT)

自引率

0.00%

发文量