Towards Practical Few-shot Federated NLP

Proceedings of the 3rd Workshop on Machine Learning and Systems Pub Date : 2022-12-01 DOI:10.1145/3578356.3592575

Dongqi Cai, Yaozong Wu, Haitao Yuan, Shangguang Wang, F. Lin, Mengwei Xu

引用次数: 3

Abstract

Transformer-based pre-trained models have emerged as the predominant solution for natural language processing (NLP). Fine-tuning such pre-trained models for downstream tasks often requires a considerable amount of labeled private data. In practice, private data is often distributed across heterogeneous mobile devices and may be prohibited from being uploaded. Moreover, well-curated labeled data is often scarce, presenting an additional challenge. To address these challenges, we first introduce a data generator for federated few-shot learning tasks, which encompasses the quantity and skewness of scarce labeled data in a realistic setting. Subsequently, we propose AUG-FedPrompt, a prompt-based federated learning system that exploits abundant unlabeled data for data augmentation. Our experiments indicate that AUG-FedPrompt can perform on par with full-set fine-tuning with a limited amount of labeled data. However, such competitive performance comes at a significant system cost.

查看原文本刊更多论文

走向实用的少镜头联邦NLP

基于变压器的预训练模型已经成为自然语言处理(NLP)的主要解决方案。为下游任务对这种预训练模型进行微调通常需要大量标记的私有数据。在实践中，私有数据通常分布在异构移动设备上，并且可能被禁止上传。此外，精心策划的标签数据往往是稀缺的，提出了一个额外的挑战。为了解决这些挑战，我们首先引入了一个用于联邦少射学习任务的数据生成器，它包含了现实环境中稀缺标记数据的数量和偏度。随后，我们提出了AUG-FedPrompt，这是一个基于提示的联邦学习系统，利用丰富的未标记数据进行数据增强。我们的实验表明，AUG-FedPrompt可以在有限数量的标记数据上执行与全套微调相当的操作。然而，这种具有竞争力的性能是以巨大的系统成本为代价的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 3rd Workshop on Machine Learning and Systems

自引率

0.00%

发文量