{"title":"利用机器学习和特征选择算法预测高等教育教师的表现","authors":"Ravinder Ahuja, S. C. Sharma","doi":"10.6688/JISE.202109_37(5).0001","DOIUrl":null,"url":null,"abstract":"Machine learning has emerged as the most important and widely used tool in resolving the administrative and other educational related problems. Most of the research in the educational field centers on demonstrating the student's potential rather than focusing on faculty quality. In this paper the performance of the instructor is evaluated through feedback collected from students in the questionnaire form. The unlabelled dataset is taken from UCI machine learning repository consisting of 5820 records with 33 attributes. Firstly, the dataset is labelled(three labels) using agglomerative clustering and the k-means algorithms. Further, five feature selection techniques (Random Forest,Principal Component Analysis, Recursive Feature Selection, Univariate Feature Selection, and Genetic Algorithm) are applied to extract essential features. After feature selection, twelve classification algorithms (K Nearest Neighbor, XGBoost, Multi-Layer Perceptron, AdaBoost, Random Forest, Logistic Regression, Decision Tree, Bagging, LightGBM, Support Vector Machine, Extra Tree and Naive Bayes) are applied using Python language. Out of all algorithms applied, Support Vector Machine with PCA feature selection technique has given the highest accuracy value 99.66%, recall value 99.66%, precision value 99.67%, and f-score value 99.67%. To prove that results are statistically different, we have applied ANOVA one way test.","PeriodicalId":50177,"journal":{"name":"Journal of Information Science and Engineering","volume":"41 1","pages":"993-1009"},"PeriodicalIF":0.5000,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Exploiting Machine Learning and Feature Selection Algorithms to Predict Instructor Performance in Higher Education\",\"authors\":\"Ravinder Ahuja, S. C. Sharma\",\"doi\":\"10.6688/JISE.202109_37(5).0001\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Machine learning has emerged as the most important and widely used tool in resolving the administrative and other educational related problems. Most of the research in the educational field centers on demonstrating the student's potential rather than focusing on faculty quality. In this paper the performance of the instructor is evaluated through feedback collected from students in the questionnaire form. The unlabelled dataset is taken from UCI machine learning repository consisting of 5820 records with 33 attributes. Firstly, the dataset is labelled(three labels) using agglomerative clustering and the k-means algorithms. Further, five feature selection techniques (Random Forest,Principal Component Analysis, Recursive Feature Selection, Univariate Feature Selection, and Genetic Algorithm) are applied to extract essential features. After feature selection, twelve classification algorithms (K Nearest Neighbor, XGBoost, Multi-Layer Perceptron, AdaBoost, Random Forest, Logistic Regression, Decision Tree, Bagging, LightGBM, Support Vector Machine, Extra Tree and Naive Bayes) are applied using Python language. Out of all algorithms applied, Support Vector Machine with PCA feature selection technique has given the highest accuracy value 99.66%, recall value 99.66%, precision value 99.67%, and f-score value 99.67%. To prove that results are statistically different, we have applied ANOVA one way test.\",\"PeriodicalId\":50177,\"journal\":{\"name\":\"Journal of Information Science and Engineering\",\"volume\":\"41 1\",\"pages\":\"993-1009\"},\"PeriodicalIF\":0.5000,\"publicationDate\":\"2021-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Information Science and Engineering\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.6688/JISE.202109_37(5).0001\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Information Science and Engineering","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.6688/JISE.202109_37(5).0001","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
Exploiting Machine Learning and Feature Selection Algorithms to Predict Instructor Performance in Higher Education
Machine learning has emerged as the most important and widely used tool in resolving the administrative and other educational related problems. Most of the research in the educational field centers on demonstrating the student's potential rather than focusing on faculty quality. In this paper the performance of the instructor is evaluated through feedback collected from students in the questionnaire form. The unlabelled dataset is taken from UCI machine learning repository consisting of 5820 records with 33 attributes. Firstly, the dataset is labelled(three labels) using agglomerative clustering and the k-means algorithms. Further, five feature selection techniques (Random Forest,Principal Component Analysis, Recursive Feature Selection, Univariate Feature Selection, and Genetic Algorithm) are applied to extract essential features. After feature selection, twelve classification algorithms (K Nearest Neighbor, XGBoost, Multi-Layer Perceptron, AdaBoost, Random Forest, Logistic Regression, Decision Tree, Bagging, LightGBM, Support Vector Machine, Extra Tree and Naive Bayes) are applied using Python language. Out of all algorithms applied, Support Vector Machine with PCA feature selection technique has given the highest accuracy value 99.66%, recall value 99.66%, precision value 99.67%, and f-score value 99.67%. To prove that results are statistically different, we have applied ANOVA one way test.
期刊介绍:
The Journal of Information Science and Engineering is dedicated to the dissemination of information on computer science, computer engineering, and computer systems. This journal encourages articles on original research in the areas of computer hardware, software, man-machine interface, theory and applications. tutorial papers in the above-mentioned areas, and state-of-the-art papers on various aspects of computer systems and applications.