The Research on Chinese Coreference Resolution Based on Support Vector Machines

2010 Fourth International Conference on Genetic and Evolutionary Computing Pub Date : 2010-12-13 DOI:10.1109/ICGEC.2010.49

Yihao Zhang, Peng Jin

引用次数: 1

Abstract

Coreference is a common linguistic phenomenon in natural language understanding, it plays an important role in simplifying the expression and linking up the context. In this paper, the algorithm of support vector machines is applied to solve the problem of Chinese coreference, we consider fully the important characteristics which related to coreference and integrate them effectively to build model. In the handling of training data, using data scaling techniques balance the range of characteristic values, and use cross validation to optimize the training parameters of the model. The experimental results show that the F-score of positive instances and negative instances reached 76.80% and 90.91% respectively on the classification model in Lancaster Corpus of Mandarin Chinese.

查看原文本刊更多论文

基于支持向量机的中文互参分辨研究

共指是自然语言理解中常见的语言现象，它在简化表达、连接语境方面起着重要作用。本文将支持向量机算法应用于中文共指问题，充分考虑与共指相关的重要特征，并对其进行有效整合，建立模型。在训练数据的处理上，利用数据缩放技术平衡特征值的范围，并利用交叉验证优化模型的训练参数。实验结果表明，在兰开斯特普通话语料库的分类模型上，积极实例和消极实例的f值分别达到76.80%和90.91%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2010 Fourth International Conference on Genetic and Evolutionary Computing

自引率

0.00%

发文量