Classification of Questions Based on Difficulty Levels using Support Vector Machine and Naïve Bayes Algorithms for Imbalanced Class

2021 4th International Conference of Computer and Informatics Engineering (IC2IE) Pub Date : 2021-09-14 DOI:10.1109/ic2ie53219.2021.9649149

Danny Pratama, Oktariani Nurul Pratiwi, E. Sutoyo

{"title":"Classification of Questions Based on Difficulty Levels using Support Vector Machine and Naïve Bayes Algorithms for Imbalanced Class","authors":"Danny Pratama, Oktariani Nurul Pratiwi, E. Sutoyo","doi":"10.1109/ic2ie53219.2021.9649149","DOIUrl":null,"url":null,"abstract":"Quiz questions are crucial evaluations in measuring student learning development because they are one of the lecturers' benchmarks for providing learning materials. The accuracy of the results of measuring student competency achievement is important because it will be used as a benchmark for assessment by lecturers, therefore a question instrument that functions well is needed in distinguishing between students who have high abilities and students who have low abilities based on defined criteria. A good question, that is, when a question has a balanced level of difficulty (proportional), it is said that the question is good. However, a question should be neither too difficult nor too easy. On that basis, grouping the level of difficulty of the questions should be done to make a package of questions that fit the portion. The case study taken by the researcher is a Data Warehouse S1 Information System at Telkom University. The case study was taken because the Data Warehouse course is a compulsory subject in the Information Studies Program at Telkom University. In doing the classification, the writer compares the Naive Bayes algorithm and the Support Vector Machine. The comparison results obtained the highest accuracy with the algorithm method SVM Classification. The accuracy results were obtained from the comparison of the average scores on the algorithm Naïve Bayes (Before SMOTE) of 85.73% and the SVM algorithm (Before SMOTE) of 85.11%. then for the comparison of the average score on the algorithm Naïve Bayes (After SMOTE) of 88.9% and on the SVM algorithm (After SMOTE) of 97.82%.","PeriodicalId":178443,"journal":{"name":"2021 4th International Conference of Computer and Informatics Engineering (IC2IE)","volume":"84 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 4th International Conference of Computer and Informatics Engineering (IC2IE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ic2ie53219.2021.9649149","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Quiz questions are crucial evaluations in measuring student learning development because they are one of the lecturers' benchmarks for providing learning materials. The accuracy of the results of measuring student competency achievement is important because it will be used as a benchmark for assessment by lecturers, therefore a question instrument that functions well is needed in distinguishing between students who have high abilities and students who have low abilities based on defined criteria. A good question, that is, when a question has a balanced level of difficulty (proportional), it is said that the question is good. However, a question should be neither too difficult nor too easy. On that basis, grouping the level of difficulty of the questions should be done to make a package of questions that fit the portion. The case study taken by the researcher is a Data Warehouse S1 Information System at Telkom University. The case study was taken because the Data Warehouse course is a compulsory subject in the Information Studies Program at Telkom University. In doing the classification, the writer compares the Naive Bayes algorithm and the Support Vector Machine. The comparison results obtained the highest accuracy with the algorithm method SVM Classification. The accuracy results were obtained from the comparison of the average scores on the algorithm Naïve Bayes (Before SMOTE) of 85.73% and the SVM algorithm (Before SMOTE) of 85.11%. then for the comparison of the average score on the algorithm Naïve Bayes (After SMOTE) of 88.9% and on the SVM algorithm (After SMOTE) of 97.82%.

查看原文本刊更多论文

基于难易度的问题分类:支持向量机和Naïve贝叶斯算法在不平衡类中的应用

测验问题是衡量学生学习发展的重要评价，是教师提供学习材料的基准之一。测量学生能力成就的结果的准确性很重要，因为它将被讲师用作评估的基准，因此需要一个功能良好的问题工具，以根据确定的标准区分高能力学生和低能力学生。一个好问题，也就是当一个问题的难度水平(比例)平衡时，就说这个问题好。然而，一个问题不应该太难也不应该太容易。在此基础上，应该对问题的难度进行分组，以形成适合该部分的问题包。研究人员采用的案例研究是电信大学的数据仓库S1信息系统。之所以选择这个案例研究，是因为数据仓库课程是电信大学信息研究项目的必修课。在进行分类时，作者比较了朴素贝叶斯算法和支持向量机算法。对比结果表明，SVM分类方法的准确率最高。将Naïve算法Bayes (Before SMOTE)的平均得分85.73%与SVM算法(Before SMOTE)的平均得分85.11%进行比较，得出准确率结果。然后比较Naïve贝叶斯算法(After SMOTE)和SVM算法(After SMOTE)的平均得分分别为88.9%和97.82%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 4th International Conference of Computer and Informatics Engineering (IC2IE)

自引率

0.00%

发文量