Class Incremental Learning for Intent Classification with Limited or No Old Data

Debjit Paul, Daniil Sorokin, Judith Gaspers
{"title":"利用有限或无旧数据进行意图分类的类递增学习","authors":"Debjit Paul, Daniil Sorokin, Judith Gaspers","doi":"10.18653/v1/2022.evonlp-1.4","DOIUrl":null,"url":null,"abstract":"In this paper, we explore class-incremental learning for intent classification (IC) in a setting with limited old data available. IC is the task of mapping user utterances to their corresponding intents. Even though class-incremental learning without storing the old data yields high potential of reducing human and computational resources in industry NLP model releases, to the best of our knowledge, it hasn’t been studied for NLP classification tasks in the literature before. In this work, we compare several contemporary class-incremental learning methods, i.e., BERT warm start, L2, Elastic Weight Consolidation, RecAdam and Knowledge Distillation within two realistic class-incremental learning scenarios: one where only the previous model is assumed to be available, but no data corresponding to old classes, and one in which limited unlabeled data for old classes is assumed to be available. Our results indicate that among the investigated continual learning methods, Knowledge Distillation worked best for our class-incremental learning tasks, and adding limited unlabeled data helps the model in both adaptability and stability.","PeriodicalId":158578,"journal":{"name":"Proceedings of the The First Workshop on Ever Evolving NLP (EvoNLP)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Class Incremental Learning for Intent Classification with Limited or No Old Data\",\"authors\":\"Debjit Paul, Daniil Sorokin, Judith Gaspers\",\"doi\":\"10.18653/v1/2022.evonlp-1.4\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we explore class-incremental learning for intent classification (IC) in a setting with limited old data available. IC is the task of mapping user utterances to their corresponding intents. Even though class-incremental learning without storing the old data yields high potential of reducing human and computational resources in industry NLP model releases, to the best of our knowledge, it hasn’t been studied for NLP classification tasks in the literature before. In this work, we compare several contemporary class-incremental learning methods, i.e., BERT warm start, L2, Elastic Weight Consolidation, RecAdam and Knowledge Distillation within two realistic class-incremental learning scenarios: one where only the previous model is assumed to be available, but no data corresponding to old classes, and one in which limited unlabeled data for old classes is assumed to be available. 
Our results indicate that among the investigated continual learning methods, Knowledge Distillation worked best for our class-incremental learning tasks, and adding limited unlabeled data helps the model in both adaptability and stability.\",\"PeriodicalId\":158578,\"journal\":{\"name\":\"Proceedings of the The First Workshop on Ever Evolving NLP (EvoNLP)\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the The First Workshop on Ever Evolving NLP (EvoNLP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.18653/v1/2022.evonlp-1.4\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the The First Workshop on Ever Evolving NLP (EvoNLP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18653/v1/2022.evonlp-1.4","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 0

Abstract

In this paper, we explore class-incremental learning for intent classification (IC) in a setting with limited or no old data available. IC is the task of mapping user utterances to their corresponding intents. Even though class-incremental learning without storing old data has high potential to reduce human and computational resources in industrial NLP model releases, to the best of our knowledge, it has not previously been studied for NLP classification tasks. In this work, we compare several contemporary class-incremental learning methods, i.e., BERT warm start, L2 regularization, Elastic Weight Consolidation, RecAdam, and Knowledge Distillation, within two realistic class-incremental learning scenarios: one in which only the previous model is assumed to be available, but no data corresponding to old classes, and one in which limited unlabeled data for old classes is assumed to be available. Our results indicate that, among the investigated continual learning methods, Knowledge Distillation worked best for our class-incremental learning tasks, and that adding limited unlabeled data helps the model in terms of both adaptability and stability.
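
The abstract does not spell out how Knowledge Distillation is applied in this class-incremental setting, but a common recipe combines cross-entropy on the labeled new-class utterances with a distillation term that keeps the new model's distribution over old intents close to that of the frozen previous model. The sketch below is a minimal illustration of that general recipe under assumed details, not the authors' implementation; the temperature T, the weight lambda_kd, and the choice of which utterances the distillation term is computed on are assumptions.

```python
import torch
import torch.nn.functional as F

def class_incremental_kd_loss(new_logits, labels, old_logits, n_old_classes,
                              T=2.0, lambda_kd=1.0):
    """Cross-entropy on new data plus distillation against the frozen old model.

    new_logits: [batch, n_old + n_new] logits from the model being trained
    labels:     [batch] gold intent ids (may include new classes)
    old_logits: [batch, n_old_classes] logits from the frozen previous model
                on the same utterances
    """
    # Standard cross-entropy on the labeled utterances.
    ce = F.cross_entropy(new_logits, labels)

    # Distillation: match the new model's temperature-scaled distribution over
    # the *old* intents to the previous model's soft targets.
    log_p_new = F.log_softmax(new_logits[:, :n_old_classes] / T, dim=-1)
    p_old = F.softmax(old_logits / T, dim=-1)
    kd = F.kl_div(log_p_new, p_old, reduction="batchmean") * (T * T)

    return ce + lambda_kd * kd

# Toy usage: 4 old intents, 2 new intents, batch of 3 utterances.
new_logits = torch.randn(3, 6)
old_logits = torch.randn(3, 4)
labels = torch.tensor([4, 5, 1])
loss = class_incremental_kd_loss(new_logits, labels, old_logits, n_old_classes=4)
```

In the first scenario described in the abstract, both terms would have to be computed on the new-class data, since that is all that is available; in the second scenario, the distillation term could additionally be computed on the limited unlabeled old-class utterances, for which no cross-entropy term is possible.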