Authors: S. Chatvichienchai, Katsumi Tanaka
Published in: 2009 Fourth International Conference on Computer Sciences and Convergence Information Technology (2009-11-24)
DOI: 10.1109/ICCIT.2009.11 (https://doi.org/10.1109/ICCIT.2009.11)
An Effective Document Search Technique by Semantic Relationship Approach
Office applications have become a major pillar of today’s organizations because they are used to edit vast numbers of digital documents. Finding the office documents in large databases that fit users’ needs is becoming increasingly important. Traditional search tools that rely solely on keyword and phrase matching between the query and the search index tend to offer high recall but low precision, so users are faced with too many irrelevant results. To solve this problem, we propose a novel search system that locates target documents using a search query defined by the document type, the search terms, and the semantic relationship between the search terms and the target documents. We present a technique that collects search terms and their semantic relationships from the documents of some office applications to generate XML-based search indices that can effectively locate the office documents. We also present our query optimization algorithm.
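
To illustrate the contrast the abstract draws between plain keyword matching and a query defined by document type, search term, and semantic relationship, here is a minimal sketch in Python. It is not the authors' index format or algorithm: the XML element and attribute names (`doc`, `term`, `type`, `rel`), the document types, and the sample data are all hypothetical, chosen only to show why the extra constraints can raise precision.

```python
import xml.etree.ElementTree as ET

# Hypothetical XML-based search index: each term is stored together with
# its assumed semantic relationship ("rel") to the document it came from.
INDEX_XML = """
<index>
  <doc id="d1" type="purchase_order">
    <term rel="customer">Acme</term>
    <term rel="item">printer</term>
  </doc>
  <doc id="d2" type="purchase_order">
    <term rel="supplier">Acme</term>
  </doc>
  <doc id="d3" type="invoice">
    <term rel="customer">Acme</term>
  </doc>
</index>
"""


def keyword_search(root, term):
    """Plain keyword matching: return every document that contains the term."""
    return [
        doc.get("id")
        for doc in root.iter("doc")
        if any(t.text == term for t in doc.iter("term"))
    ]


def semantic_search(root, doc_type, conditions):
    """Return documents of doc_type in which every (relationship, term) pair holds."""
    hits = []
    for doc in root.iter("doc"):
        if doc.get("type") != doc_type:
            continue
        pairs = {(t.get("rel"), t.text) for t in doc.iter("term")}
        if all(cond in pairs for cond in conditions):
            hits.append(doc.get("id"))
    return hits


root = ET.fromstring(INDEX_XML)
print(keyword_search(root, "Acme"))
print(semantic_search(root, "purchase_order", [("customer", "Acme")]))
```

In this toy index the keyword query matches all three documents, while the query that also fixes the document type and the relationship of "Acme" to the document matches only `d1`.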