{"title":"Relevance Feedback Query Refinement for PDF Medical Journal Articles","authors":"A. Christiansen, D. J. Lee","doi":"10.1109/CBMS.2006.140","DOIUrl":null,"url":null,"abstract":"This paper addresses relevance feedback as an alternative to keyword-based search engines for sifting through large PDF document collections and extracting the most relevant documents (especially for literature review purposes). Until now, relevance feedback has only been used in content-based image and video retrieval due to the inability to query those media types without keywords. Since PDF journal articles contain many valuable non-keyword features such as structure and formatting information as well as embedded figures, they would benefit from relevance feedback. Stripping a PDF into \"full-text\" for indexing purposes disregards these important features. We discuss how they can be used to our advantage and look to integrate the wealth of knowledge from relevance feedback text-based information retrieval. We argue for the benefits of placing the burden of relevance judgement on the user rather than the retrieval system and present alternative document views that quickly allow the user to deem relevance","PeriodicalId":208693,"journal":{"name":"19th IEEE Symposium on Computer-Based Medical Systems (CBMS'06)","volume":"84 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2006-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"19th IEEE Symposium on Computer-Based Medical Systems (CBMS'06)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CBMS.2006.140","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
This paper addresses relevance feedback as an alternative to keyword-based search engines for sifting through large PDF document collections and extracting the most relevant documents (especially for literature review purposes). Until now, relevance feedback has only been used in content-based image and video retrieval due to the inability to query those media types without keywords. Since PDF journal articles contain many valuable non-keyword features such as structure and formatting information as well as embedded figures, they would benefit from relevance feedback. Stripping a PDF into "full-text" for indexing purposes disregards these important features. We discuss how they can be used to our advantage and look to integrate the wealth of knowledge from relevance feedback text-based information retrieval. We argue for the benefits of placing the burden of relevance judgement on the user rather than the retrieval system and present alternative document views that quickly allow the user to deem relevance