{"title":"Design of Paper Duplicate Detection System Based on Lucene","authors":"YueHua Ding, Kui Yi, RiHua Xiang","doi":"10.1109/APWCS.2010.16","DOIUrl":null,"url":null,"abstract":"Full-text retrieval is a very popular technology in recent information search area. Lucene is an open-source full-text search engine toolkit, and has excellent system architecture and wide application foreground. Based on paper duplicate detection system research, we introduce Lucene theory and analyze two pivotal work steps of Lucene which are index creation module and index search module. We describe the paper duplicate detection system design and implementation, and discuss key technology of highlight and combination of B/S mode and C/S mode. We provide excellent solution for similar system development.","PeriodicalId":354322,"journal":{"name":"2010 Asia-Pacific Conference on Wearable Computing Systems","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 Asia-Pacific Conference on Wearable Computing Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APWCS.2010.16","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11
Abstract
Full-text retrieval is a very popular technology in recent information search area. Lucene is an open-source full-text search engine toolkit, and has excellent system architecture and wide application foreground. Based on paper duplicate detection system research, we introduce Lucene theory and analyze two pivotal work steps of Lucene which are index creation module and index search module. We describe the paper duplicate detection system design and implementation, and discuss key technology of highlight and combination of B/S mode and C/S mode. We provide excellent solution for similar system development.