问答论坛上的不真诚问题分类

Hendri Priyambowo, M. Adriani
{"title":"问答论坛上的不真诚问题分类","authors":"Hendri Priyambowo, M. Adriani","doi":"10.1109/ICEEI47359.2019.8988798","DOIUrl":null,"url":null,"abstract":"Insincerity is defined as a word or action that is genuinely not felt by humans, has no meaning, or not based on sincere feelings. In the internet forum, especially question answering forum, insincerity is one of the severe problems because it can affect the quality of internet forums. Frequently found content (i.e., a message or post) which is not appropriate with the rules of the forum in general and can interfere with other users. Machine learning model will be implemented to detect the questions posed by users in the question answering forum. We examine six baseline machine learning models with n-gram, POS tag and word embedding features. The result of our experiment shows that Multilayer Perceptron model using POS tag features give us highest F1-Score, which is 87.81%.","PeriodicalId":236517,"journal":{"name":"2019 International Conference on Electrical Engineering and Informatics (ICEEI)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Insincere Question Classification on Question Answering Forum\",\"authors\":\"Hendri Priyambowo, M. Adriani\",\"doi\":\"10.1109/ICEEI47359.2019.8988798\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Insincerity is defined as a word or action that is genuinely not felt by humans, has no meaning, or not based on sincere feelings. In the internet forum, especially question answering forum, insincerity is one of the severe problems because it can affect the quality of internet forums. Frequently found content (i.e., a message or post) which is not appropriate with the rules of the forum in general and can interfere with other users. Machine learning model will be implemented to detect the questions posed by users in the question answering forum. We examine six baseline machine learning models with n-gram, POS tag and word embedding features. The result of our experiment shows that Multilayer Perceptron model using POS tag features give us highest F1-Score, which is 87.81%.\",\"PeriodicalId\":236517,\"journal\":{\"name\":\"2019 International Conference on Electrical Engineering and Informatics (ICEEI)\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 International Conference on Electrical Engineering and Informatics (ICEEI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICEEI47359.2019.8988798\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Electrical Engineering and Informatics (ICEEI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEEI47359.2019.8988798","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

摘要

“不真诚”被定义为人类真正感觉不到的言语或行为,没有意义,或者不是基于真诚的感情。在网络论坛,特别是问答论坛中,不诚信问题是一个严重的问题,它会影响网络论坛的质量。经常发现不符合论坛规则的内容(即消息或帖子),可能会干扰其他用户。采用机器学习模型检测用户在问答论坛中提出的问题。我们研究了六种具有n-gram、POS标签和词嵌入特征的基线机器学习模型。我们的实验结果表明,使用POS标签特征的多层感知器模型给出了最高的F1-Score,为87.81%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Insincere Question Classification on Question Answering Forum
Insincerity is defined as a word or action that is genuinely not felt by humans, has no meaning, or not based on sincere feelings. In the internet forum, especially question answering forum, insincerity is one of the severe problems because it can affect the quality of internet forums. Frequently found content (i.e., a message or post) which is not appropriate with the rules of the forum in general and can interfere with other users. Machine learning model will be implemented to detect the questions posed by users in the question answering forum. We examine six baseline machine learning models with n-gram, POS tag and word embedding features. The result of our experiment shows that Multilayer Perceptron model using POS tag features give us highest F1-Score, which is 87.81%.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信