{"title":"Domain Adaptation in NLP Based on Hybrid Generative and Discriminative Model","authors":"Kang Liu, Jun Zhao","doi":"10.1109/CCPR.2008.11","DOIUrl":null,"url":null,"abstract":"This study investigates the domain adaptation problem for nature language processing tasks in the distributional view. A novel method is proposed for domain adaptation based on the hybrid model which combines the discriminative model with the generative model. The advantage of the discriminative model is to have lower asymptotic error, while the advantage of the generative model can easily incorporate the unlabeled data for better generalization performance. The hybrid model can integrate their advantages. For domain transfer, the proposed method exploits the difference of the distributions in different domains to adjust the weights of the instances in the training set so that the source labeled data is more adaptive to the target domain. Experimental results on several NLP tasks in different domains indicate that our method outperforms both the traditional supervised learning and the semi-supervised method.","PeriodicalId":292956,"journal":{"name":"2008 Chinese Conference on Pattern Recognition","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 Chinese Conference on Pattern Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCPR.2008.11","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 0
Abstract
This study investigates the domain adaptation problem for natural language processing tasks from a distributional point of view. A novel domain adaptation method is proposed, based on a hybrid model that combines a discriminative model with a generative model. The discriminative model has the advantage of a lower asymptotic error, while the generative model can easily incorporate unlabeled data for better generalization; the hybrid model integrates both advantages. For domain transfer, the proposed method exploits the difference between the distributions of the two domains to adjust the weights of the instances in the training set, so that the labeled source data better matches the target domain. Experimental results on several NLP tasks in different domains indicate that the method outperforms both traditional supervised learning and a semi-supervised baseline.
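The reweighting idea in the abstract can be illustrated with a minimal sketch: weight each training instance by an estimate of the density ratio P_target(x) / P_source(x), so that instances common in the target domain count more during training. The discrete-frequency estimator and smoothing below are illustrative assumptions, not the paper's actual scheme.

```python
from collections import Counter

def density_ratio_weights(source, target, smoothing=1.0):
    """Estimate per-instance weights w(x) = P_target(x) / P_source(x)
    for discrete instances, using smoothed empirical frequencies.
    Illustrative only -- the paper's reweighting method may differ."""
    src_counts = Counter(source)
    tgt_counts = Counter(target)
    vocab = set(src_counts) | set(tgt_counts)
    # Additive smoothing keeps the ratio finite for unseen instances.
    src_total = len(source) + smoothing * len(vocab)
    tgt_total = len(target) + smoothing * len(vocab)
    weights = {}
    for x in vocab:
        p_src = (src_counts[x] + smoothing) / src_total
        p_tgt = (tgt_counts[x] + smoothing) / tgt_total
        weights[x] = p_tgt / p_src
    return weights

# Source domain over-represents "finance"; target over-represents "sports".
source = ["finance"] * 8 + ["sports"] * 2
target = ["finance"] * 2 + ["sports"] * 8
w = density_ratio_weights(source, target)
# Instances frequent in the target but rare in the source are up-weighted.
assert w["sports"] > 1.0 > w["finance"]
```

These weights would then multiply each source instance's loss (e.g. via a `sample_weight` argument in a standard learner), biasing the trained model toward the target distribution.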