Insincere Question Classification on Question Answering Forum
Authors: Hendri Priyambowo, M. Adriani
Published in: 2019 International Conference on Electrical Engineering and Informatics (ICEEI), 2019-07-01
DOI: 10.1109/ICEEI47359.2019.8988798 (https://doi.org/10.1109/ICEEI47359.2019.8988798)
Citation count: 6
Abstract
Insincerity is defined as a word or action that is not genuinely felt, has no meaning, or is not based on sincere feelings. In internet forums, especially question answering forums, insincerity is a severe problem because it can degrade the quality of the forum. Such forums frequently contain content (i.e., a message or post) that violates the forum's rules and can disturb other users. We implement machine learning models to detect insincere questions posed by users in a question answering forum. We examine six baseline machine learning models with n-gram, POS tag, and word embedding features. Our experiments show that the Multilayer Perceptron model using POS tag features gives the highest F1-score, 87.81%.
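The abstract names n-gram features and the F1-score but does not describe how either was computed. As an illustration only, the following minimal sketch shows one common way to extract word n-grams from a question and to compute F1 for a binary task. The function names, the whitespace tokenization, the toy labels, and the choice of label 1 as the "insincere" class are all assumptions made for this example, not details taken from the paper.

```python
from collections import Counter


def word_ngrams(text, n):
    """Return a bag (Counter) of word n-grams for one question.

    Assumption: lowercase + whitespace tokenization; the paper's actual
    preprocessing is not stated in the abstract.
    """
    tokens = text.lower().split()
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def f1_score(y_true, y_pred, positive=1):
    """F1 is the harmonic mean of precision and recall for the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        # No true positives: precision or recall is zero, so F1 is zero.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy example (hypothetical data, not from the paper's dataset).
bigrams = word_ngrams("Why do people ask such questions", 2)
y_true = [1, 0, 1, 1, 0]  # 1 = insincere (assumed label convention)
y_pred = [1, 0, 0, 1, 1]
score = f1_score(y_true, y_pred)
```

In this toy run there are two true positives, one false positive, and one false negative, so precision and recall are both 2/3 and the F1-score is 2/3. In a full pipeline, the n-gram counts would be turned into feature vectors and passed to a classifier such as a multilayer perceptron.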