Kaifang Dong , Baoxing Jiang , Hongye Li , Zhenfang Zhu , Peiyu Liu
{"title":"用于少量文本分类的元学习三重对比网络","authors":"Kaifang Dong , Baoxing Jiang , Hongye Li , Zhenfang Zhu , Peiyu Liu","doi":"10.1016/j.knosys.2024.112440","DOIUrl":null,"url":null,"abstract":"<div><p>Few-shot text classification (FSTC) strives to predict classes not involved in the training by learning from a few labeled examples. Currently, most tasks construct meta-tasks in a randomized manner that fails to give more priority to hard-to-identify classes and samples. Besides, some tasks incorporated a contrast strategy, but the sample could only be compared to positive or negative examples individually. In this work, we propose a Meta-learning Triplet Contrast Network (Meta-TCN) with bidirectional contrast capability to solve the above problem. Specifically, Meta-TCN uses external knowledge with labeled information as the class examples, which decouples the embedding of prototypes from the support pool. Meanwhile, the class examples combine the support samples to construct triplet pairs used for learning. Unlike previous studies, the model can learn negative and positive knowledge simultaneously, ensuring that understanding is enriched and enhances learning. Further, we improve the shortcomings of randomness in the meta-task construction process by proposing a Dynamic Rate of Change (DRC) sampling strategy. DRC enhances the model’s focus on difficult-to-classify samples. We conducted extensive experiments on six benchmark datasets such as Huffpost and RCV1. Experiments show that the average accuracy of Meta-TCN can achieve state-of-the-art performance in the vast majority of tasks.</p></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":null,"pages":null},"PeriodicalIF":7.2000,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Meta-learning triplet contrast network for few-shot text classification\",\"authors\":\"Kaifang Dong , Baoxing Jiang , Hongye Li , Zhenfang Zhu , Peiyu Liu\",\"doi\":\"10.1016/j.knosys.2024.112440\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Few-shot text classification (FSTC) strives to predict classes not involved in the training by learning from a few labeled examples. Currently, most tasks construct meta-tasks in a randomized manner that fails to give more priority to hard-to-identify classes and samples. Besides, some tasks incorporated a contrast strategy, but the sample could only be compared to positive or negative examples individually. In this work, we propose a Meta-learning Triplet Contrast Network (Meta-TCN) with bidirectional contrast capability to solve the above problem. Specifically, Meta-TCN uses external knowledge with labeled information as the class examples, which decouples the embedding of prototypes from the support pool. Meanwhile, the class examples combine the support samples to construct triplet pairs used for learning. Unlike previous studies, the model can learn negative and positive knowledge simultaneously, ensuring that understanding is enriched and enhances learning. Further, we improve the shortcomings of randomness in the meta-task construction process by proposing a Dynamic Rate of Change (DRC) sampling strategy. DRC enhances the model’s focus on difficult-to-classify samples. We conducted extensive experiments on six benchmark datasets such as Huffpost and RCV1. Experiments show that the average accuracy of Meta-TCN can achieve state-of-the-art performance in the vast majority of tasks.</p></div>\",\"PeriodicalId\":49939,\"journal\":{\"name\":\"Knowledge-Based Systems\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":7.2000,\"publicationDate\":\"2024-08-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Knowledge-Based Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0950705124010748\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705124010748","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Meta-learning triplet contrast network for few-shot text classification
Few-shot text classification (FSTC) strives to predict classes not involved in the training by learning from a few labeled examples. Currently, most tasks construct meta-tasks in a randomized manner that fails to give more priority to hard-to-identify classes and samples. Besides, some tasks incorporated a contrast strategy, but the sample could only be compared to positive or negative examples individually. In this work, we propose a Meta-learning Triplet Contrast Network (Meta-TCN) with bidirectional contrast capability to solve the above problem. Specifically, Meta-TCN uses external knowledge with labeled information as the class examples, which decouples the embedding of prototypes from the support pool. Meanwhile, the class examples combine the support samples to construct triplet pairs used for learning. Unlike previous studies, the model can learn negative and positive knowledge simultaneously, ensuring that understanding is enriched and enhances learning. Further, we improve the shortcomings of randomness in the meta-task construction process by proposing a Dynamic Rate of Change (DRC) sampling strategy. DRC enhances the model’s focus on difficult-to-classify samples. We conducted extensive experiments on six benchmark datasets such as Huffpost and RCV1. Experiments show that the average accuracy of Meta-TCN can achieve state-of-the-art performance in the vast majority of tasks.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.