Granular Syntax Processing with Multi-Task and Curriculum Learning

IF 4.3 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Cognitive Computation Pub Date : 2024-07-08 DOI:10.1007/s12559-024-10320-1

Xulang Zhang, Rui Mao, Erik Cambria

{"title":"Granular Syntax Processing with Multi-Task and Curriculum Learning","authors":"Xulang Zhang, Rui Mao, Erik Cambria","doi":"10.1007/s12559-024-10320-1","DOIUrl":null,"url":null,"abstract":"<p>Syntactic processing techniques are the foundation of natural language processing (NLP), supporting many downstream NLP tasks. In this paper, we conduct pair-wise multi-task learning (MTL) on syntactic tasks with different granularity, namely Sentence Boundary Detection (SBD), text chunking, and Part-of-Speech (PoS) tagging, so as to investigate the extent to which they complement each other. We propose a novel soft parameter-sharing mechanism to share local and global dependency information that is learned from both target tasks. We also propose a curriculum learning (CL) mechanism to improve MTL with non-parallel labeled data. Using non-parallel labeled data in MTL is a common practice, whereas it has not received enough attention before. For example, our employed PoS tagging data do not have text chunking labels. When learning PoS tagging and text chunking together, the proposed CL mechanism aims to select complementary samples from the two tasks to update the parameters of the MTL model in the same training batch. Such a method yields better performance and learning stability. We conclude that the fine-grained tasks can provide complementary features to coarse-grained ones, while the most coarse-grained task, SBD, provides useful information for the most fine-grained one, PoS tagging. Additionally, the text chunking task achieves state-of-the-art performance when joint learning with PoS tagging. Our analytical experiments also show the effectiveness of the proposed soft parameter-sharing and CL mechanisms.</p>","PeriodicalId":51243,"journal":{"name":"Cognitive Computation","volume":"57 1","pages":""},"PeriodicalIF":4.3000,"publicationDate":"2024-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cognitive Computation","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s12559-024-10320-1","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Syntactic processing techniques are the foundation of natural language processing (NLP), supporting many downstream NLP tasks. In this paper, we conduct pair-wise multi-task learning (MTL) on syntactic tasks with different granularity, namely Sentence Boundary Detection (SBD), text chunking, and Part-of-Speech (PoS) tagging, so as to investigate the extent to which they complement each other. We propose a novel soft parameter-sharing mechanism to share local and global dependency information that is learned from both target tasks. We also propose a curriculum learning (CL) mechanism to improve MTL with non-parallel labeled data. Using non-parallel labeled data in MTL is a common practice, whereas it has not received enough attention before. For example, our employed PoS tagging data do not have text chunking labels. When learning PoS tagging and text chunking together, the proposed CL mechanism aims to select complementary samples from the two tasks to update the parameters of the MTL model in the same training batch. Such a method yields better performance and learning stability. We conclude that the fine-grained tasks can provide complementary features to coarse-grained ones, while the most coarse-grained task, SBD, provides useful information for the most fine-grained one, PoS tagging. Additionally, the text chunking task achieves state-of-the-art performance when joint learning with PoS tagging. Our analytical experiments also show the effectiveness of the proposed soft parameter-sharing and CL mechanisms.

Abstract Image

查看原文本刊更多论文

利用多任务和课程学习进行细粒度语法处理

句法处理技术是自然语言处理（NLP）的基础，为许多下游 NLP 任务提供支持。在本文中，我们对不同粒度的句法任务（即句子边界检测（SBD）、文本分块和语音部分标记（PoS））进行了成对多任务学习（MTL），以研究它们之间的互补程度。我们提出了一种新颖的软参数共享机制，以共享从两个目标任务中学习到的局部和全局依赖性信息。我们还提出了一种课程学习（CL）机制，利用非并行标记数据改进 MTL。在 MTL 中使用非并行标记数据是一种常见的做法，但以前并未引起足够的重视。例如，我们使用的 PoS 标记数据没有文本分块标记。在同时学习 PoS 标记和文本分块时，所提出的 CL 机制旨在从两个任务中选择互补样本，在同一训练批次中更新 MTL 模型的参数。这种方法能获得更好的性能和学习稳定性。我们的结论是，细粒度任务可以为粗粒度任务提供互补特征，而最粗粒度的任务 SBD 可以为最细粒度的任务 PoS 标记提供有用信息。此外，在与 PoS 标记联合学习时，文本分块任务达到了最先进的性能。我们的分析实验还显示了所提出的软参数共享和 CL 机制的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Cognitive Computation COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-NEUROSCIENCES

CiteScore

9.30

自引率

3.70%

发文量

116

审稿时长

>12 weeks

期刊介绍： Cognitive Computation is an international, peer-reviewed, interdisciplinary journal that publishes cutting-edge articles describing original basic and applied work involving biologically-inspired computational accounts of all aspects of natural and artificial cognitive systems. It provides a new platform for the dissemination of research, current practices and future trends in the emerging discipline of cognitive computation that bridges the gap between life sciences, social sciences, engineering, physical and mathematical sciences, and humanities.