Predicting and Analyzing Student Absenteeism Using Machine Learning Algorithm

Q1 Social Sciences
Lindita Mukli, Amarildo Rista
{"title":"Predicting and Analyzing Student Absenteeism Using Machine Learning Algorithm","authors":"Lindita Mukli, Amarildo Rista","doi":"10.15507/1991-9468.107.026.202202.216-228","DOIUrl":null,"url":null,"abstract":"Introduction. In a developed society, the state should invest in the education of the younger generation. In less developed countries, Albania included, there are no nation-wide studies to show the factors that affect the lack of students in classrooms. The purpose of this study is to predict, analyze, and evaluate the possible causes of student absenteeism using machine learning algorithms. The attributes taken into account in this study are related to the family, demographic, social, university, and personal aspects according to academic criteria.\nMaterials and Methods. Student absenteeism covers any student that has not attended class, irrespective of the reason. The data set consists of 26 attributes and 210,000 records corresponding to the teaching hours of 500 students during an academic year at Faculty of Information Technology. The students participating in the survey range from 18 to 25 years of age of both genders. The compilation of the student questionnaire was based on reviewing the literature and analyzing 26 attributes that we categorized into 5 groups included in the questionnaire.\nResults. This paper provides knowledge in the analysis and evaluation of factors that lead students to miss lectures using machine learning. It is important to note that this study was conducted on students of this faculty, and as such, the results may not be generalized to all universities. That’s why, researchers are encouraged to test the results achieved in this paper on other clusters.\nDiscussion and Conclusion. The paper provides recommendations based on the findings by offering different problem-solving strategies. The questionnaire used only for 500 Faculty of Information Technology students can be widely applied in any educational institution in the region. However, the results of this study cannot be generalized for the student and youth population of other regions or other countries. This paper provides an original and easily usable questionnaire suitable to various study programs and universities.","PeriodicalId":53450,"journal":{"name":"Integration of Education","volume":"97 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Integration of Education","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.15507/1991-9468.107.026.202202.216-228","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Social Sciences","Score":null,"Total":0}
引用次数: 1

Abstract

Introduction. In a developed society, the state should invest in the education of the younger generation. In less developed countries, Albania included, there are no nation-wide studies to show the factors that affect the lack of students in classrooms. The purpose of this study is to predict, analyze, and evaluate the possible causes of student absenteeism using machine learning algorithms. The attributes taken into account in this study are related to the family, demographic, social, university, and personal aspects according to academic criteria. Materials and Methods. Student absenteeism covers any student that has not attended class, irrespective of the reason. The data set consists of 26 attributes and 210,000 records corresponding to the teaching hours of 500 students during an academic year at Faculty of Information Technology. The students participating in the survey range from 18 to 25 years of age of both genders. The compilation of the student questionnaire was based on reviewing the literature and analyzing 26 attributes that we categorized into 5 groups included in the questionnaire. Results. This paper provides knowledge in the analysis and evaluation of factors that lead students to miss lectures using machine learning. It is important to note that this study was conducted on students of this faculty, and as such, the results may not be generalized to all universities. That’s why, researchers are encouraged to test the results achieved in this paper on other clusters. Discussion and Conclusion. The paper provides recommendations based on the findings by offering different problem-solving strategies. The questionnaire used only for 500 Faculty of Information Technology students can be widely applied in any educational institution in the region. However, the results of this study cannot be generalized for the student and youth population of other regions or other countries. This paper provides an original and easily usable questionnaire suitable to various study programs and universities.
使用机器学习算法预测和分析学生缺勤
介绍。在一个发达的社会,国家应该投资于年轻一代的教育。在包括阿尔巴尼亚在内的欠发达国家,没有全国性的研究表明影响课堂生源不足的因素。本研究的目的是使用机器学习算法来预测、分析和评估学生旷课的可能原因。根据学术标准,本研究考虑的属性涉及家庭、人口、社会、大学和个人方面。材料与方法。学生旷课包括任何没有上课的学生,无论原因如何。该数据集由26个属性和21万条记录组成,对应于信息技术学院一学年500名学生的教学时数。参与调查的学生年龄从18岁到25岁不等,男女不限。学生问卷的编制是在查阅文献的基础上,对问卷中包含的26个属性进行分析,并将其分为5类。本文提供了使用机器学习分析和评估导致学生缺课的因素的知识。值得注意的是,这项研究是针对该学院的学生进行的,因此,结果可能不适用于所有大学。这就是为什么鼓励研究人员在其他集群上测试本文中获得的结果。讨论与结论。本文通过提供不同的解决问题的策略,根据研究结果提出了建议。仅针对500名信息技术学院学生使用的问卷可以广泛应用于该地区的任何教育机构。然而,本研究的结果不能推广到其他地区或其他国家的学生和青年人口。本文提供了一份原始的、易于使用的问卷,适用于各种研究项目和大学。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Integration of Education
Integration of Education Social Sciences-Education
CiteScore
1.30
自引率
0.00%
发文量
37
审稿时长
16 weeks
期刊介绍: The journal was established by the resolution of the Russian Federation State Committee on Higher Education, the Ministry of Education of the Russian Federation, the State Assembly and Government of Republic of Mordovia of 12 July 1995. Integration of Education publishes original researches in the field of integration of education. The names and content of the Journal’s sections correspond to the fields of science and groups of specialties of scientific workers in accordance with the Nomenclature of Scientific Specialties in which academic degrees are awarded: 19.00.00 PSYCHOLOGY 13.00.00 PEDAGOGY 22.00.00 SOCIOLOGY
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信