Insincere Question Classification on Question Answering Forum

2019 International Conference on Electrical Engineering and Informatics (ICEEI)
Pub Date: 2019-07-01
DOI: 10.1109/ICEEI47359.2019.8988798

Hendri Priyambowo, M. Adriani

Citations: 6

Abstract

Insincerity is defined as a word or action that is not genuinely felt, has no meaning, or is not based on sincere feelings. On internet forums, especially question answering forums, insincerity is a severe problem because it degrades the quality of the forum. Such forums frequently contain content (i.e., messages or posts) that violates the forum's rules and can disturb other users. We implement machine learning models to detect insincere questions posed by users on a question answering forum. We examine six baseline machine learning models with n-gram, POS tag, and word embedding features. Our experiments show that the Multilayer Perceptron model using POS tag features gives the highest F1-score, 87.81%.
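To make two of the terms in the abstract concrete, the following is a minimal sketch of word n-gram counting and of the F1-score computation. It is not the authors' implementation: the function names `word_ngrams` and `f1_score`, the whitespace tokenization, the toy labels, and the choice to score only the positive (insincere) class are all assumptions made for illustration. The abstract does not state how the reported F1-score was averaged.

```python
from collections import Counter


def word_ngrams(text, n):
    """Count the word n-grams of a question (lowercased, whitespace-tokenized)."""
    tokens = text.lower().split()
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def f1_score(y_true, y_pred, positive=1):
    """F1 = 2PR / (P + R), computed for the positive (here: insincere) class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    if tp == 0:
        # No true positives: precision and recall are zero or undefined.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical toy data, not taken from the paper.
bigrams = word_ngrams("Why are people so easily offended", 2)
y_true = [1, 0, 1, 1, 0]  # 1 = insincere, 0 = sincere
y_pred = [1, 0, 0, 1, 1]
score = f1_score(y_true, y_pred)
```

In a full pipeline, the n-gram counts (or POS tag or word embedding features) would be turned into feature vectors for a classifier such as a multilayer perceptron, and the F1-score would be computed on held-out questions.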

