Ivan Bondarenko, S. Berezin, Alexey Pauls, Tatiana Batura, Yuliya Rubtsova, B. Tuchinov
{"title":"Using Few-Shot Learning Techniques for Named Entity Recognition and Relation Extraction","authors":"Ivan Bondarenko, S. Berezin, Alexey Pauls, Tatiana Batura, Yuliya Rubtsova, B. Tuchinov","doi":"10.1109/S.A.I.ence50533.2020.9303192","DOIUrl":null,"url":null,"abstract":"This paper presents new methods for entity recognition and relation extraction tasks on partially labeled and unlabeled datasets. The proposed methods are based on techniques of semi-supervised, unsupervised and the transfer learning. We use the few-shot learning technique to construct specific algorithms for the new data sources without manual retraining. To compare the results with other studies, we conducted experiments on two benchmark datasets for the Russian language. The results for named entity recognition demonstrate significant improvement and outperform the state-of-the-art results. Our results for relation extraction are comparable to other research. We assume that a longer BERT fine-tuning will help to improve them, and we also plan to experiment with other few-shot learning methods in the near future.","PeriodicalId":201402,"journal":{"name":"2020 Science and Artificial Intelligence conference (S.A.I.ence)","volume":"116 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 Science and Artificial Intelligence conference (S.A.I.ence)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/S.A.I.ence50533.2020.9303192","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
This paper presents new methods for entity recognition and relation extraction tasks on partially labeled and unlabeled datasets. The proposed methods are based on techniques of semi-supervised, unsupervised and the transfer learning. We use the few-shot learning technique to construct specific algorithms for the new data sources without manual retraining. To compare the results with other studies, we conducted experiments on two benchmark datasets for the Russian language. The results for named entity recognition demonstrate significant improvement and outperform the state-of-the-art results. Our results for relation extraction are comparable to other research. We assume that a longer BERT fine-tuning will help to improve them, and we also plan to experiment with other few-shot learning methods in the near future.