Byte-Level Massively Multilingual Semantic Parsing

Proceedings of the Massively Multilingual Natural Language Understanding Workshop (MMNLU-22) Pub Date : 1900-01-01 DOI:10.18653/v1/2022.mmnlu-1.3

M. Nicosia, Francesco Piccinno

引用次数: 1

Abstract

Token free approaches have been successfully applied to a series of word and span level tasks. In this work, we evaluate a byte-level sequence to sequence model (ByT5) on the 51 languages in the MASSIVE multilingual semantic parsing dataset. We examine multiple experimental settings: (i) zero-shot, (ii) full gold data and (iii) zero-shot with synthetic data. By leveraging a state-of-the-art label projection method for machine translated examples, we are able to reduce the gap in exact match to only 5 points with respect to a model trained on gold data from all the languages. We additionally provide insights on the cross-lingual transfer of ByT5 and show how the model compares with respect to mT5 across all parameter sizes.

查看原文本刊更多论文

字节级大规模多语言语义解析

无令牌方法已成功应用于一系列单词和跨级任务。在这项工作中，我们在MASSIVE多语言语义分析数据集中的51种语言上评估了字节级序列到序列模型(ByT5)。我们检查了多个实验设置:(i)零射击，(ii)全金数据和(iii)合成数据的零射击。通过利用机器翻译示例的最先进的标签投影方法，我们能够将精确匹配的差距减少到仅5分，相对于来自所有语言的黄金数据训练的模型。我们还提供了关于ByT5跨语言迁移的见解，并展示了该模型如何在所有参数大小上与mT5进行比较。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the Massively Multilingual Natural Language Understanding Workshop (MMNLU-22)

自引率

0.00%

发文量