Discovering Fine-grained Sentiment in Suicide Notes.

Biomedical informatics insights Pub Date : 2012-01-01 Epub Date: 2012-01-30 DOI:10.4137/BII.S8963

Wenbo Wang, Lu Chen, Ming Tan, Shaojun Wang, Amit P Sheth

引用次数: 24

Abstract

This paper presents our solution for the i2b2 sentiment classification challenge. Our hybrid system consists of machine learning and rule-based classifiers. For the machine learning classifier, we investigate a variety of lexical, syntactic and knowledge-based features, and show how much these features contribute to the performance of the classifier through experiments. For the rule-based classifier, we propose an algorithm to automatically extract effective syntactic and lexical patterns from training examples. The experimental results show that the rule-based classifier outperforms the baseline machine learning classifier using unigram features. By combining the machine learning classifier and the rule-based classifier, the hybrid system gains a better trade-off between precision and recall, and yields the highest micro-averaged F-measure (0.5038), which is better than the mean (0.4875) and median (0.5027) micro-average F-measures among all participating teams.

Abstract Image

查看原文本刊更多论文

在遗书中发现细粒度的情绪。

本文提出了我们对i2b2情感分类挑战的解决方案。我们的混合系统由机器学习和基于规则的分类器组成。对于机器学习分类器，我们研究了各种词汇、句法和基于知识的特征，并通过实验展示了这些特征对分类器性能的贡献。对于基于规则的分类器，我们提出了一种从训练样本中自动提取有效语法和词汇模式的算法。实验结果表明，基于规则的分类器优于使用单图特征的基线机器学习分类器。通过结合机器学习分类器和基于规则的分类器，混合系统在精度和召回率之间获得了更好的权衡，并产生了最高的微平均f -度量(0.5038)，优于所有参与团队的平均值(0.4875)和中位数(0.5027)微平均f -度量。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Biomedical informatics insights

自引率

0.00%

发文量