Grounding End-to-End Pre-trained architectures for Semantic Role Labeling in multiple languages

IF 1.4 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Intelligenza Artificiale Pub Date : 2023-10-27 DOI:10.3233/ia-230012

Claudiu D. Hromei, Danilo Croce, Roberto Basili

{"title":"Grounding End-to-End Pre-trained architectures for Semantic Role Labeling in multiple languages","authors":"Claudiu D. Hromei, Danilo Croce, Roberto Basili","doi":"10.3233/ia-230012","DOIUrl":null,"url":null,"abstract":"Situated natural language interactions between humans and robots are strictly necessary for complex applications: communication here implies the reference to the environment shared between a user and the robot. This paper proposes a transformer-based architecture that supports the integration of spatial information (as logical representation) about a semantic map of the environment and the input utterances. The generated interpretation is a logical form of the command that makes references to the state of the world through a single end-to-end process, stimulated at each interaction by an explicit linguistic description of the environment. In this specific work, the end-to-end capability of the targeted transformer is studied in light of its multilingual applications where the robot can be queried in different natural languages. The obtained experimental results confirm the applicability of transformers to grounded human-robotic interaction, with benefits in terms of both portability of the approach across domains and effectiveness in terms of reachable accuracy. Moreover, language-specific processing chains are shown to be preferable to large-scale multilingual models for their better trade-off between accuracy and complexity. Overall, the proposed architecture outperforms previous approaches and paves the way for sustainable multilingual architectures.","PeriodicalId":42055,"journal":{"name":"Intelligenza Artificiale","volume":"27 1","pages":"0"},"PeriodicalIF":1.4000,"publicationDate":"2023-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Intelligenza Artificiale","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3233/ia-230012","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Situated natural language interactions between humans and robots are strictly necessary for complex applications: communication here implies the reference to the environment shared between a user and the robot. This paper proposes a transformer-based architecture that supports the integration of spatial information (as logical representation) about a semantic map of the environment and the input utterances. The generated interpretation is a logical form of the command that makes references to the state of the world through a single end-to-end process, stimulated at each interaction by an explicit linguistic description of the environment. In this specific work, the end-to-end capability of the targeted transformer is studied in light of its multilingual applications where the robot can be queried in different natural languages. The obtained experimental results confirm the applicability of transformers to grounded human-robotic interaction, with benefits in terms of both portability of the approach across domains and effectiveness in terms of reachable accuracy. Moreover, language-specific processing chains are shown to be preferable to large-scale multilingual models for their better trade-off between accuracy and complexity. Overall, the proposed architecture outperforms previous approaches and paves the way for sustainable multilingual architectures.

查看原文本刊更多论文

基于端到端预训练的多语言语义角色标注体系结构

人类和机器人之间的自然语言交互对于复杂的应用程序是非常必要的:这里的通信意味着对用户和机器人之间共享的环境的引用。本文提出了一种基于转换器的体系结构，该体系结构支持关于环境语义图和输入话语的空间信息(作为逻辑表示)的集成。生成的解释是命令的逻辑形式，它通过单个端到端过程引用世界状态，在每次交互时都通过对环境的明确语言描述来刺激。在这项具体的工作中，研究了目标变压器的端到端能力，根据其多语言应用，机器人可以用不同的自然语言进行查询。获得的实验结果证实了变压器对接地人机交互的适用性，在跨领域的可移植性和可达精度方面的有效性方面都有好处。此外，特定于语言的处理链被证明比大规模多语言模型更可取，因为它们在准确性和复杂性之间有更好的平衡。总的来说，所提出的体系结构优于以前的方法，并为可持续的多语言体系结构铺平了道路。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Intelligenza Artificiale COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-

CiteScore

3.50

自引率

6.70%

发文量