{"title":"Automatically Summarizing Evidence from Clinical Trials: A Prototype Highlighting Current Challenges.","authors":"Sanjana Ramprasad, Iain J Marshall, Denis Jered McInerney, Byron C Wallace","doi":"","DOIUrl":null,"url":null,"abstract":"<p><p>We present <i>TrialsSummarizer</i>, a system that aims to automatically summarize evidence presented in the set of randomized controlled trials most relevant to a given query. Building on prior work (Marshall et al., 2020), the system retrieves trial publications matching a query specifying a combination of condition, intervention(s), and outcome(s), and ranks these according to sample size and estimated study quality. The top-<i>k</i> such studies are passed through a neural multi-document summarization system, yielding a synopsis of these trials. We consider two architectures: A standard sequence-to-sequence model based on BART (Lewis et al., 2019), and a multi-headed architecture intended to provide greater transparency to end-users. Both models produce fluent and relevant summaries of evidence retrieved for queries, but their tendency to introduce unsupported statements render them inappropriate for use in this domain at present. The proposed architecture may help users verify outputs allowing users to trace generated tokens back to inputs. The demonstration video is available at: https://vimeo.com/735605060 The prototype, source code, and model weights are available at: https://sanjanaramprasad.github.io/trials-summarizer/.</p>","PeriodicalId":74541,"journal":{"name":"Proceedings of the conference. Association for Computational Linguistics. Meeting","volume":"2023 ","pages":"236-247"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10361334/pdf/nihms-1912129.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the conference. Association for Computational Linguistics. Meeting","FirstCategoryId":"1085","ListUrlMain":"","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
We present TrialsSummarizer, a system that aims to automatically summarize evidence presented in the set of randomized controlled trials most relevant to a given query. Building on prior work (Marshall et al., 2020), the system retrieves trial publications matching a query specifying a combination of condition, intervention(s), and outcome(s), and ranks these according to sample size and estimated study quality. The top-k such studies are passed through a neural multi-document summarization system, yielding a synopsis of these trials. We consider two architectures: A standard sequence-to-sequence model based on BART (Lewis et al., 2019), and a multi-headed architecture intended to provide greater transparency to end-users. Both models produce fluent and relevant summaries of evidence retrieved for queries, but their tendency to introduce unsupported statements render them inappropriate for use in this domain at present. The proposed architecture may help users verify outputs allowing users to trace generated tokens back to inputs. The demonstration video is available at: https://vimeo.com/735605060 The prototype, source code, and model weights are available at: https://sanjanaramprasad.github.io/trials-summarizer/.
我们提出了TrialsSummarizer,一个旨在自动总结与给定查询最相关的随机对照试验中出现的证据的系统。在先前工作的基础上(Marshall et al., 2020),系统检索与指定条件、干预措施和结果组合的查询匹配的试验出版物,并根据样本量和估计的研究质量对这些出版物进行排名。前k个这样的研究通过神经多文件摘要系统,产生这些试验的摘要。我们考虑了两种架构:基于BART的标准序列到序列模型(Lewis et al., 2019),以及旨在为最终用户提供更大透明度的多头架构。这两种模型都会生成为查询检索的证据的流畅和相关的摘要,但是它们倾向于引入不受支持的语句,这使得它们目前不适合在这个领域中使用。所建议的体系结构可以帮助用户验证输出,允许用户跟踪生成的令牌到输入。演示视频可在:https://vimeo.com/735605060。原型、源代码和模型权重可在:https://sanjanaramprasad.github.io/trials-summarizer/。