{"title":"基于集成学习的古兰经主题分类","authors":"Bassam Arkok, A. Zeki","doi":"10.1109/ICCCE50029.2021.9467178","DOIUrl":null,"url":null,"abstract":"the real datasets in the world usually are imbalanced; the number of samples for their classes is not equal. Classifying these datasets makes the classifiers pay attention to the class with more samples than the classes with fewer samples. The Qur’anic dataset can be considered an imbalanced dataset because verses of the Qur’anic topics are not equal. Many studies have been performed to classify Qur’anic text using different classifiers. However, few studies classified the Qur’anic verses based on Imbalanced Learning (IL). So, this work aims to classify the Qur’anic text using Ensemble methods, Boosting and Bagging. The base classifiers of these methods were LibSVM, Naïve Bayes, KNN, and J48. Three techniques are conducted in this paper based on the standard classifiers. The three techniques are: implementing the base classifiers alone, implementing these classifiers with the Boosting method, and implementing the classifiers with the Bagging method. The results showed that the Quranic classification performance was improved when the ensemble methods were applied for the imbalanced Qur’anic verses in the standard classifiers.","PeriodicalId":122857,"journal":{"name":"2021 8th International Conference on Computer and Communication Engineering (ICCCE)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Classification of Quranic Topics Using Ensemble Learning\",\"authors\":\"Bassam Arkok, A. Zeki\",\"doi\":\"10.1109/ICCCE50029.2021.9467178\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"the real datasets in the world usually are imbalanced; the number of samples for their classes is not equal. Classifying these datasets makes the classifiers pay attention to the class with more samples than the classes with fewer samples. The Qur’anic dataset can be considered an imbalanced dataset because verses of the Qur’anic topics are not equal. Many studies have been performed to classify Qur’anic text using different classifiers. However, few studies classified the Qur’anic verses based on Imbalanced Learning (IL). So, this work aims to classify the Qur’anic text using Ensemble methods, Boosting and Bagging. The base classifiers of these methods were LibSVM, Naïve Bayes, KNN, and J48. Three techniques are conducted in this paper based on the standard classifiers. The three techniques are: implementing the base classifiers alone, implementing these classifiers with the Boosting method, and implementing the classifiers with the Bagging method. The results showed that the Quranic classification performance was improved when the ensemble methods were applied for the imbalanced Qur’anic verses in the standard classifiers.\",\"PeriodicalId\":122857,\"journal\":{\"name\":\"2021 8th International Conference on Computer and Communication Engineering (ICCCE)\",\"volume\":\"31 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-06-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 8th International Conference on Computer and Communication Engineering (ICCCE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCCE50029.2021.9467178\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 8th International Conference on Computer and Communication Engineering (ICCCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCE50029.2021.9467178","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Classification of Quranic Topics Using Ensemble Learning
the real datasets in the world usually are imbalanced; the number of samples for their classes is not equal. Classifying these datasets makes the classifiers pay attention to the class with more samples than the classes with fewer samples. The Qur’anic dataset can be considered an imbalanced dataset because verses of the Qur’anic topics are not equal. Many studies have been performed to classify Qur’anic text using different classifiers. However, few studies classified the Qur’anic verses based on Imbalanced Learning (IL). So, this work aims to classify the Qur’anic text using Ensemble methods, Boosting and Bagging. The base classifiers of these methods were LibSVM, Naïve Bayes, KNN, and J48. Three techniques are conducted in this paper based on the standard classifiers. The three techniques are: implementing the base classifiers alone, implementing these classifiers with the Boosting method, and implementing the classifiers with the Bagging method. The results showed that the Quranic classification performance was improved when the ensemble methods were applied for the imbalanced Qur’anic verses in the standard classifiers.