Few-Shot Learning with Semi-Supervised Transformers for Electronic Health Records.

Proceedings of machine learning research Pub Date : 2022-08-01

Raphael Poulain, Mehak Gupta, Rahmatollah Beheshti

{"title":"Few-Shot Learning with Semi-Supervised Transformers for Electronic Health Records.","authors":"Raphael Poulain, Mehak Gupta, Rahmatollah Beheshti","doi":"","DOIUrl":null,"url":null,"abstract":"<p><p>With the growing availability of Electronic Health Records (EHRs), many deep learning methods have been developed to leverage such datasets in medical prediction tasks. Notably, transformer-based architectures have proven to be highly effective for EHRs. Transformer-based architectures are generally very effective in \"transferring\" the acquired knowledge from very large datasets to smaller target datasets through their comprehensive \"pre-training\" process. However, to work efficiently, they still rely on the target datasets for the downstream tasks, and if the target dataset is (very) small, the performance of downstream models can degrade rapidly. In biomedical applications, it is common to only have access to small datasets, for instance, when studying rare diseases, invasive procedures, or using restrictive cohort selection processes. In this study, we present CEHR-GAN-BERT, a semi-supervised transformer-based architecture that leverages both in- and out-of-cohort patients to learn better patient representations in the context of few-shot learning. The proposed method opens new learning opportunities where only a few hundred samples are available. We extensively evaluate our method on four prediction tasks and three public datasets showing the ability of our model to achieve improvements upwards of 5% on all performance metrics (including AUROC and F1 Score) on the tasks that use less than 200 annotated patients during the training process.</p>","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"182 ","pages":"853-873"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10399128/pdf/nihms-1865430.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of machine learning research","FirstCategoryId":"1085","ListUrlMain":"","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

With the growing availability of Electronic Health Records (EHRs), many deep learning methods have been developed to leverage such datasets in medical prediction tasks. Notably, transformer-based architectures have proven to be highly effective for EHRs. Transformer-based architectures are generally very effective in "transferring" the acquired knowledge from very large datasets to smaller target datasets through their comprehensive "pre-training" process. However, to work efficiently, they still rely on the target datasets for the downstream tasks, and if the target dataset is (very) small, the performance of downstream models can degrade rapidly. In biomedical applications, it is common to only have access to small datasets, for instance, when studying rare diseases, invasive procedures, or using restrictive cohort selection processes. In this study, we present CEHR-GAN-BERT, a semi-supervised transformer-based architecture that leverages both in- and out-of-cohort patients to learn better patient representations in the context of few-shot learning. The proposed method opens new learning opportunities where only a few hundred samples are available. We extensively evaluate our method on four prediction tasks and three public datasets showing the ability of our model to achieve improvements upwards of 5% on all performance metrics (including AUROC and F1 Score) on the tasks that use less than 200 annotated patients during the training process.

Abstract Image

本刊更多论文

用半监督变压器进行电子健康记录的短时间学习。

随着电子健康记录(EHRs)的日益普及，已经开发了许多深度学习方法来利用这些数据集进行医疗预测任务。值得注意的是，基于变压器的体系结构已被证明对电子病历非常有效。基于转换器的体系结构通过其全面的“预训练”过程，通常在将获得的知识从非常大的数据集“转移”到较小的目标数据集方面非常有效。然而，为了有效地工作，它们仍然依赖于目标数据集来完成下游任务，如果目标数据集(非常)小，下游模型的性能会迅速下降。在生物医学应用中，通常只能访问小数据集，例如，在研究罕见疾病、侵入性手术或使用限制性队列选择过程时。在这项研究中，我们提出了CEHR-GAN-BERT，这是一种基于半监督变压器的架构，它利用队列内和队列外的患者在少量学习的背景下学习更好的患者表征。在只有几百个样本可用的情况下，提出的方法开辟了新的学习机会。我们在四个预测任务和三个公共数据集上广泛评估了我们的方法，显示了我们的模型在训练过程中使用少于200个注释患者的任务上，在所有性能指标(包括AUROC和F1 Score)上实现5%以上改进的能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of machine learning research

自引率

0.00%

发文量