{"title":"Topic Classification of Islamic Consultation Question and Answer Using Supervised Learning","authors":"Farhan Arrahman, K. Lhaksmana, D. Murdiansyah","doi":"10.1109/ICADEIS52521.2021.9701944","DOIUrl":null,"url":null,"abstract":"Islamic question-and-answer (Q&A) websites are available as platforms for sharing and learning about Islam. Different Islamic Q&A websites usually shares similar Q&A topics that have been frequently asked by Islamic learners. However, due to a large number of Q&A entries in such websites, manual topic classification would be costly and time consuming. The objectives of this research are to develop a classification system for Islamic Q&A topics and analyze the vocabulary words that affect the classification results. To achieve these objectives, well-known supervised learning methods that have been previously implemented to classify Islamic texts are utilized, namely K-Nearest Neighbor (K-NN), Support Vector Machine (SVM), Multinomial Naive Bayes (MNB), and Multinomial Logistic Regression (MLR). In this research, these classifiers are evaluated in classifying Islamic Q&A entries. The evaluation finds that the SVM achieves the best accuracy and Hamming loss at 79.8 percent and 0.202, respectively. This research also finds that the relevant or specific vocabulary from a class can improve the classification system’s ability to predict correctly and vice versa.","PeriodicalId":422702,"journal":{"name":"2021 International Conference Advancement in Data Science, E-learning and Information Systems (ICADEIS)","volume":"34 6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference Advancement in Data Science, E-learning and Information Systems (ICADEIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICADEIS52521.2021.9701944","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Islamic question-and-answer (Q&A) websites are available as platforms for sharing and learning about Islam. Different Islamic Q&A websites usually shares similar Q&A topics that have been frequently asked by Islamic learners. However, due to a large number of Q&A entries in such websites, manual topic classification would be costly and time consuming. The objectives of this research are to develop a classification system for Islamic Q&A topics and analyze the vocabulary words that affect the classification results. To achieve these objectives, well-known supervised learning methods that have been previously implemented to classify Islamic texts are utilized, namely K-Nearest Neighbor (K-NN), Support Vector Machine (SVM), Multinomial Naive Bayes (MNB), and Multinomial Logistic Regression (MLR). In this research, these classifiers are evaluated in classifying Islamic Q&A entries. The evaluation finds that the SVM achieves the best accuracy and Hamming loss at 79.8 percent and 0.202, respectively. This research also finds that the relevant or specific vocabulary from a class can improve the classification system’s ability to predict correctly and vice versa.