{"title":"Incorporating target-aware knowledge into prompt-tuning for few-shot stance detection","authors":"Shaokang Wang , Fuhui Sun , Xiaoyan Wang , Li Pan","doi":"10.1016/j.ipm.2024.103815","DOIUrl":null,"url":null,"abstract":"<div><p>Stance detection, a fundamental task in natural language processing, identifies user stances in texts towards specific targets. The diverse targets and ever-changing expressions make it challenging to attain comprehensive knowledge from limited data. Existing methods focus on incorporating supplementary knowledge, neglecting fusion consistency during training, which is critical for preserving the rationality of the inference. In this paper, we introduce TAP, a novel approach for few-shot stance detection. TAP extends the verbalizer hierarchically, a mapping function in prompt-tuning. Constructed using a log-odds ratio of topics and targets, the verbalizer refines candidates with prior knowledge, forming the foundation for subsequent hierarchical distillation. The hierarchical distillation, a technique based on pilot experiments on the hierarchical verbalizer, ensures the fusion of diverse knowledge during prompt-tuning, maintaining consistency throughout the training process. Notably, TAP constructs verbalizers without external knowledge augmentation. The hierarchical distillation involves a joint loss function, contributing to the model’s robustness and training consistency. Extensive experiments are conducted on SemEval2016t6 and ArgMin datasets with 13 different targets. The proposed method is evaluated on various few-shot and full-data settings with F1-Macro and F1-Micro scores. On average, TAP achieves overall improvements of 4.71% and 3.76% over state-of-the-art baselines on Semeval2016t6 and ArgMin datasets, respectively, in few-shot scenarios.</p></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":null,"pages":null},"PeriodicalIF":7.4000,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Processing & Management","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0306457324001742","RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Stance detection, a fundamental task in natural language processing, identifies user stances in texts towards specific targets. The diverse targets and ever-changing expressions make it challenging to attain comprehensive knowledge from limited data. Existing methods focus on incorporating supplementary knowledge, neglecting fusion consistency during training, which is critical for preserving the rationality of the inference. In this paper, we introduce TAP, a novel approach for few-shot stance detection. TAP extends the verbalizer hierarchically, a mapping function in prompt-tuning. Constructed using a log-odds ratio of topics and targets, the verbalizer refines candidates with prior knowledge, forming the foundation for subsequent hierarchical distillation. The hierarchical distillation, a technique based on pilot experiments on the hierarchical verbalizer, ensures the fusion of diverse knowledge during prompt-tuning, maintaining consistency throughout the training process. Notably, TAP constructs verbalizers without external knowledge augmentation. The hierarchical distillation involves a joint loss function, contributing to the model’s robustness and training consistency. Extensive experiments are conducted on SemEval2016t6 and ArgMin datasets with 13 different targets. The proposed method is evaluated on various few-shot and full-data settings with F1-Macro and F1-Micro scores. On average, TAP achieves overall improvements of 4.71% and 3.76% over state-of-the-art baselines on Semeval2016t6 and ArgMin datasets, respectively, in few-shot scenarios.
期刊介绍:
Information Processing and Management is dedicated to publishing cutting-edge original research at the convergence of computing and information science. Our scope encompasses theory, methods, and applications across various domains, including advertising, business, health, information science, information technology marketing, and social computing.
We aim to cater to the interests of both primary researchers and practitioners by offering an effective platform for the timely dissemination of advanced and topical issues in this interdisciplinary field. The journal places particular emphasis on original research articles, research survey articles, research method articles, and articles addressing critical applications of research. Join us in advancing knowledge and innovation at the intersection of computing and information science.