Dan Su, Yan Xu, Tiezheng Yu, Farhad Bin Siddique, Elham J. Barezi, Pascale Fung
{"title":"CAiRE-COVID: A Question Answering and Query-focused Multi-Document Summarization System for COVID-19 Scholarly Information Management","authors":"Dan Su, Yan Xu, Tiezheng Yu, Farhad Bin Siddique, Elham J. Barezi, Pascale Fung","doi":"10.18653/v1/2020.nlpcovid19-2.14","DOIUrl":null,"url":null,"abstract":"The outbreak of COVID-19 raises attention from the researchers from various communities. While many scientific articles have been published, a system that can provide reliable information to COVID-19 related questions from the latest academic resources is crucial, especially for the medical community in the current time-critical race to treat patients and to find a cure for the virus. To address the requests, we propose our CAiRE-COVID, a neural-based system that uses open-domain question answering (QA) techniques combined with summarization techniques for mining the available scientific literature. It leverages the Information Retrieval (IR) system and QA models to extract relevant snippets from existing literature given a query. Fluent summaries are also provided to help understand the content in a more efficient way. Our system has been awarded as winner for one of the tasks in CORD-19 Kaggle Challenge. We also launched our CAiRE-COVID website for broader use. The code for our system is also open-sourced to bootstrap further study.","PeriodicalId":131251,"journal":{"name":"Proceedings of the 1st Workshop on NLP for COVID-19 (Part 2) at EMNLP 2020","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"75","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 1st Workshop on NLP for COVID-19 (Part 2) at EMNLP 2020","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18653/v1/2020.nlpcovid19-2.14","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 75
Abstract
The outbreak of COVID-19 raises attention from the researchers from various communities. While many scientific articles have been published, a system that can provide reliable information to COVID-19 related questions from the latest academic resources is crucial, especially for the medical community in the current time-critical race to treat patients and to find a cure for the virus. To address the requests, we propose our CAiRE-COVID, a neural-based system that uses open-domain question answering (QA) techniques combined with summarization techniques for mining the available scientific literature. It leverages the Information Retrieval (IR) system and QA models to extract relevant snippets from existing literature given a query. Fluent summaries are also provided to help understand the content in a more efficient way. Our system has been awarded as winner for one of the tasks in CORD-19 Kaggle Challenge. We also launched our CAiRE-COVID website for broader use. The code for our system is also open-sourced to bootstrap further study.