IBM MNLP IE at CASE 2021 Task 2: NLI Reranking for Zero-Shot Text Classification

Proceedings of the 4th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE 2021) Pub Date : 1900-01-01 DOI:10.18653/v1/2021.case-1.24

Ken Barker, Parul Awasthy, Jian Ni, Radu Florian

{"title":"IBM MNLP IE at CASE 2021 Task 2: NLI Reranking for Zero-Shot Text Classification","authors":"Ken Barker, Parul Awasthy, Jian Ni, Radu Florian","doi":"10.18653/v1/2021.case-1.24","DOIUrl":null,"url":null,"abstract":"Supervised models can achieve very high accuracy for fine-grained text classification. In practice, however, training data may be abundant for some types but scarce or even non-existent for others. We propose a hybrid architecture that uses as much labeled data as available for fine-tuning classification models, while also allowing for types with little (few-shot) or no (zero-shot) labeled data. In particular, we pair a supervised text classification model with a Natural Language Inference (NLI) reranking model. The NLI reranker uses a textual representation of target types that allows it to score the strength with which a type is implied by a text, without requiring training data for the types. Experiments show that the NLI model is very sensitive to the choice of textual representation, but can be effective for classifying unseen types. It can also improve classification accuracy for the known types of an already highly accurate supervised model.","PeriodicalId":330699,"journal":{"name":"Proceedings of the 4th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE 2021)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 4th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE 2021)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18653/v1/2021.case-1.24","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 8

Abstract

Supervised models can achieve very high accuracy for fine-grained text classification. In practice, however, training data may be abundant for some types but scarce or even non-existent for others. We propose a hybrid architecture that uses as much labeled data as available for fine-tuning classification models, while also allowing for types with little (few-shot) or no (zero-shot) labeled data. In particular, we pair a supervised text classification model with a Natural Language Inference (NLI) reranking model. The NLI reranker uses a textual representation of target types that allows it to score the strength with which a type is implied by a text, without requiring training data for the types. Experiments show that the NLI model is very sensitive to the choice of textual representation, but can be effective for classifying unseen types. It can also improve classification accuracy for the known types of an already highly accurate supervised model.

查看原文本刊更多论文

IBM MNLP IE在CASE 2021的任务2:零射击文本分类的NLI重新排序

监督模型对于细粒度的文本分类可以达到非常高的精度。然而，在实践中，某些类型的训练数据可能很丰富，而另一些类型的训练数据却很少，甚至根本不存在。我们提出了一种混合架构，它使用尽可能多的标记数据来微调分类模型，同时也允许使用少量(少量)或没有(零次)标记数据的类型。特别地，我们将监督文本分类模型与自然语言推理(NLI)重新排序模型配对。NLI重新排序器使用目标类型的文本表示，允许它对文本所暗示的类型的强度进行评分，而不需要类型的训练数据。实验表明，NLI模型对文本表示的选择非常敏感，但对未见过的类型进行分类是有效的。它还可以提高已经高度精确的监督模型的已知类型的分类精度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 4th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE 2021)

自引率

0.00%

发文量