{"title":"Literature Filtering for Systematic Reviews with Transformers","authors":"John Hawkins, David Tivey","doi":"arxiv-2405.20354","DOIUrl":null,"url":null,"abstract":"Identifying critical research within the growing body of academic work is an\nessential element of quality research. Systematic review processes, used in\nevidence-based medicine, formalise this as a procedure that must be followed in\na research program. However, it comes with an increasing burden in terms of the\ntime required to identify the important articles of research for a given topic.\nIn this work, we develop a method for building a general-purpose filtering\nsystem that matches a research question, posed as a natural language\ndescription of the required content, against a candidate set of articles\nobtained via the application of broad search terms. Our results demonstrate\nthat transformer models, pre-trained on biomedical literature then fine tuned\nfor the specific task, offer a promising solution to this problem. The model\ncan remove large volumes of irrelevant articles for most research questions.","PeriodicalId":501285,"journal":{"name":"arXiv - CS - Digital Libraries","volume":"41 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Digital Libraries","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2405.20354","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Identifying critical research within the growing body of academic work is an
essential element of quality research. Systematic review processes, used in
evidence-based medicine, formalise this as a procedure that must be followed in
a research program. However, it comes with an increasing burden in terms of the
time required to identify the important articles of research for a given topic.
In this work, we develop a method for building a general-purpose filtering
system that matches a research question, posed as a natural language
description of the required content, against a candidate set of articles
obtained via the application of broad search terms. Our results demonstrate
that transformer models, pre-trained on biomedical literature then fine tuned
for the specific task, offer a promising solution to this problem. The model
can remove large volumes of irrelevant articles for most research questions.