{"title":"Searching Tourism Information by Using Vertical Search Engine Based on Nutch and Solr","authors":"Huawei Ma, Wencai Du, Simon Xu, Weijun Li","doi":"10.1109/SERA.2019.8886775","DOIUrl":null,"url":null,"abstract":"Since there exist some issues with traditional search engine in information retrieval, such as huge numbers of results, poor profession, low precision rate and other issues, in this paper, we proposed a Vertical Search Engine based on Nutch and Solr. We used forward iteration most granular segmentation algorithm based on dictionary to achieve Chinese word segmentation, employed Vector Space Model (VSM) based on keywords to implement topic relevance, extended the user search module and the tourism domain word library to collect information, filter information retrieval, and relate word various stages. Experiments were also conducted in order to evaluate the algorithm and the results show that the vertical search engine based on Nutch and Solr which is used for tourism information retrieval can improve the user retrieval precision and meet the professional demand of user retrieval.","PeriodicalId":438947,"journal":{"name":"2019 IEEE 17th International Conference on Software Engineering Research, Management and Applications (SERA)","volume":"56 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 17th International Conference on Software Engineering Research, Management and Applications (SERA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SERA.2019.8886775","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Since there exist some issues with traditional search engine in information retrieval, such as huge numbers of results, poor profession, low precision rate and other issues, in this paper, we proposed a Vertical Search Engine based on Nutch and Solr. We used forward iteration most granular segmentation algorithm based on dictionary to achieve Chinese word segmentation, employed Vector Space Model (VSM) based on keywords to implement topic relevance, extended the user search module and the tourism domain word library to collect information, filter information retrieval, and relate word various stages. Experiments were also conducted in order to evaluate the algorithm and the results show that the vertical search engine based on Nutch and Solr which is used for tourism information retrieval can improve the user retrieval precision and meet the professional demand of user retrieval.