Learning Adversarial Networks for Semi-Supervised Text Classification via Policy Gradient

Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining Pub Date : 2018-07-19 DOI:10.1145/3219819.3219956

Yan Li, Jieping Ye

{"title":"Learning Adversarial Networks for Semi-Supervised Text Classification via Policy Gradient","authors":"Yan Li, Jieping Ye","doi":"10.1145/3219819.3219956","DOIUrl":null,"url":null,"abstract":"Semi-supervised learning is a branch of machine learning techniques that aims to make fully use of both labeled and unlabeled instances to improve the prediction performance. The size of modern real world datasets is ever-growing so that acquiring label information for them is extraordinarily difficult and costly. Therefore, deep semi-supervised learning is becoming more and more popular. Most of the existing deep semi-supervised learning methods are built under the generative model based scheme, where the data distribution is approximated via input data reconstruction. However, this scheme does not naturally work on discrete data, e.g., text; in addition, learning a good data representation is sometimes directly opposed to the goal of learning a high performance prediction model. To address the issues of this type of methods, we reformulate the semi-supervised learning as a model-based reinforcement learning problem and propose an adversarial networks based framework. The proposed framework contains two networks: a predictor network for target estimation and a judge network for evaluation. The judge network iteratively generates proper reward to guide the training of predictor network, and the predictor network is trained via policy gradient. Based on the aforementioned framework, we propose a recurrent neural network based model for semi-supervised text classification. We conduct comprehensive experimental analysis on several real world benchmark text datasets, and the results from our evaluations show that our method outperforms other competing state-of-the-art methods.","PeriodicalId":322066,"journal":{"name":"Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"45","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3219819.3219956","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 45

Abstract

Semi-supervised learning is a branch of machine learning techniques that aims to make fully use of both labeled and unlabeled instances to improve the prediction performance. The size of modern real world datasets is ever-growing so that acquiring label information for them is extraordinarily difficult and costly. Therefore, deep semi-supervised learning is becoming more and more popular. Most of the existing deep semi-supervised learning methods are built under the generative model based scheme, where the data distribution is approximated via input data reconstruction. However, this scheme does not naturally work on discrete data, e.g., text; in addition, learning a good data representation is sometimes directly opposed to the goal of learning a high performance prediction model. To address the issues of this type of methods, we reformulate the semi-supervised learning as a model-based reinforcement learning problem and propose an adversarial networks based framework. The proposed framework contains two networks: a predictor network for target estimation and a judge network for evaluation. The judge network iteratively generates proper reward to guide the training of predictor network, and the predictor network is trained via policy gradient. Based on the aforementioned framework, we propose a recurrent neural network based model for semi-supervised text classification. We conduct comprehensive experimental analysis on several real world benchmark text datasets, and the results from our evaluations show that our method outperforms other competing state-of-the-art methods.

查看原文本刊更多论文

基于策略梯度学习半监督文本分类的对抗网络

半监督学习是机器学习技术的一个分支，旨在充分利用标记和未标记的实例来提高预测性能。现代现实世界数据集的规模不断增长，因此获取标签信息非常困难和昂贵。因此，深度半监督学习越来越受欢迎。现有的深度半监督学习方法大多建立在基于生成模型的方案下，通过输入数据重构来逼近数据分布。然而，该方案并不适用于离散数据，例如文本;此外，学习良好的数据表示有时与学习高性能预测模型的目标直接相反。为了解决这类方法的问题，我们将半监督学习重新表述为基于模型的强化学习问题，并提出了一个基于对抗网络的框架。该框架包含两个网络:用于目标估计的预测网络和用于评估的判断网络。判断网络迭代生成适当的奖励来指导预测网络的训练，预测网络通过策略梯度进行训练。基于上述框架，我们提出了一种基于递归神经网络的半监督文本分类模型。我们对几个真实世界的基准文本数据集进行了全面的实验分析，评估结果表明，我们的方法优于其他竞争的最先进的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

自引率

0.00%

发文量