{"title":"English-persian text retrieval using concept graph","authors":"Farnaz Teymoorian, M. Mohsenzadeh, M. Seyyedi","doi":"10.1109/ICCSIT.2009.5234499","DOIUrl":null,"url":null,"abstract":"Cross-language information retrieval (CLIR) is the retrieval process where the user presents queries in one language to retrieve documents in another language. In this field the resolution of lexical ambiguity in translating queries is a key challenge. In this paper, we propose a technique for calculating translation probabilities based on creating query terms' concept graphs for selecting the right translation sense of query terms for English-Persian text retrieval. We present an efficient statistical method for creating this graph. We test the effectiveness of the proposed disambiguation method on Hamshahri collection1 that is standardized according to CLEF standards. Evaluation using this data collection shows great effectiveness of the proposed method.","PeriodicalId":342396,"journal":{"name":"2009 2nd IEEE International Conference on Computer Science and Information Technology","volume":"165 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 2nd IEEE International Conference on Computer Science and Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCSIT.2009.5234499","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Cross-language information retrieval (CLIR) is the retrieval process where the user presents queries in one language to retrieve documents in another language. In this field the resolution of lexical ambiguity in translating queries is a key challenge. In this paper, we propose a technique for calculating translation probabilities based on creating query terms' concept graphs for selecting the right translation sense of query terms for English-Persian text retrieval. We present an efficient statistical method for creating this graph. We test the effectiveness of the proposed disambiguation method on Hamshahri collection1 that is standardized according to CLEF standards. Evaluation using this data collection shows great effectiveness of the proposed method.