{"title":"用转换器过滤系统性综述文献","authors":"John Hawkins, David Tivey","doi":"arxiv-2405.20354","DOIUrl":null,"url":null,"abstract":"Identifying critical research within the growing body of academic work is an\nessential element of quality research. Systematic review processes, used in\nevidence-based medicine, formalise this as a procedure that must be followed in\na research program. However, it comes with an increasing burden in terms of the\ntime required to identify the important articles of research for a given topic.\nIn this work, we develop a method for building a general-purpose filtering\nsystem that matches a research question, posed as a natural language\ndescription of the required content, against a candidate set of articles\nobtained via the application of broad search terms. Our results demonstrate\nthat transformer models, pre-trained on biomedical literature then fine tuned\nfor the specific task, offer a promising solution to this problem. The model\ncan remove large volumes of irrelevant articles for most research questions.","PeriodicalId":501285,"journal":{"name":"arXiv - CS - Digital Libraries","volume":"41 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Literature Filtering for Systematic Reviews with Transformers\",\"authors\":\"John Hawkins, David Tivey\",\"doi\":\"arxiv-2405.20354\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Identifying critical research within the growing body of academic work is an\\nessential element of quality research. Systematic review processes, used in\\nevidence-based medicine, formalise this as a procedure that must be followed in\\na research program. However, it comes with an increasing burden in terms of the\\ntime required to identify the important articles of research for a given topic.\\nIn this work, we develop a method for building a general-purpose filtering\\nsystem that matches a research question, posed as a natural language\\ndescription of the required content, against a candidate set of articles\\nobtained via the application of broad search terms. Our results demonstrate\\nthat transformer models, pre-trained on biomedical literature then fine tuned\\nfor the specific task, offer a promising solution to this problem. The model\\ncan remove large volumes of irrelevant articles for most research questions.\",\"PeriodicalId\":501285,\"journal\":{\"name\":\"arXiv - CS - Digital Libraries\",\"volume\":\"41 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-05-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Digital Libraries\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2405.20354\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Digital Libraries","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2405.20354","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Literature Filtering for Systematic Reviews with Transformers
Identifying critical research within the growing body of academic work is an
essential element of quality research. Systematic review processes, used in
evidence-based medicine, formalise this as a procedure that must be followed in
a research program. However, it comes with an increasing burden in terms of the
time required to identify the important articles of research for a given topic.
In this work, we develop a method for building a general-purpose filtering
system that matches a research question, posed as a natural language
description of the required content, against a candidate set of articles
obtained via the application of broad search terms. Our results demonstrate
that transformer models, pre-trained on biomedical literature then fine tuned
for the specific task, offer a promising solution to this problem. The model
can remove large volumes of irrelevant articles for most research questions.