Distillation of Large Language Models for Text Simplification

Modeling Control and Information Technologies Pub Date : 2023-11-22 DOI:10.31713/mcit.2023.071

Олександр Скуржанський

引用次数: 0

Abstract

This work presents a comprehensive methodology for harnessing the capabilities of Large Language Models to address specific Natural Language Processing tasks, with a focus on Text Simplification. While LLMs have demonstrated their prowess in tackling a wide range of NLP challenges, their demanding computational requirements can render them impractical for real-time online inference. In response to this limitation, we suggest the concept of text distillation, a technique aimed at effectively transferring the knowledge stored within LLMs to more compact and computationally efficient neural networks.

查看原文本刊更多论文

为文本简化提炼大型语言模型

这项研究提出了一种综合方法，利用大型语言模型的能力来解决特定的自然语言处理任务，重点是文本简化。虽然大型语言模型在应对各种 NLP 挑战方面表现出了卓越的能力，但其苛刻的计算要求可能会使其无法用于实时在线推理。针对这一限制，我们提出了文本蒸馏的概念，这一技术旨在将 LLM 中存储的知识有效地转移到更紧凑、计算效率更高的神经网络中。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Modeling Control and Information Technologies

自引率

0.00%

发文量