{"title":"A Model for Zero-shot Text Multi-labeling Using Semantics-based Labels","authors":"Dan Dickinson, Ananth Raj GV, G. Fung","doi":"10.1109/TransAI51903.2021.00033","DOIUrl":null,"url":null,"abstract":"We introduce a transformer-based method to associate relevant tags to text passages or blocks such as categories to pages of a website, marking sections in an article, or social postings subject tagging. In contrast with traditional multi-label formulations, the proposed approach uses semantic definitions of the tags available during training, and the model outputs a binary prediction of whether the described category applies to a document or not. The transformer-based model learns the semantics of the definition of a tag, and therefore works for tags not seen during training. Performance on domain-specific datasets can be further improved via transfer learning after fine-tuning with relatively little additional labeled data required.","PeriodicalId":426766,"journal":{"name":"2021 Third International Conference on Transdisciplinary AI (TransAI)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 Third International Conference on Transdisciplinary AI (TransAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TransAI51903.2021.00033","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
We introduce a transformer-based method to associate relevant tags to text passages or blocks such as categories to pages of a website, marking sections in an article, or social postings subject tagging. In contrast with traditional multi-label formulations, the proposed approach uses semantic definitions of the tags available during training, and the model outputs a binary prediction of whether the described category applies to a document or not. The transformer-based model learns the semantics of the definition of a tag, and therefore works for tags not seen during training. Performance on domain-specific datasets can be further improved via transfer learning after fine-tuning with relatively little additional labeled data required.