Huawei Zhou, Shuanghong Shen, Yu Su, Yongchun Miao, Qi Liu, Linbo Zhu, Junyu Lu, Zhenya Huang
Title: LLM-EPSP: Large language model empowered early prediction of student performance
DOI: 10.1016/j.ipm.2025.104351
Journal: Information Processing & Management, Vol. 63, Issue 1, Article 104351 (Impact Factor 6.9; JCR Q1, Computer Science, Information Systems)
Publication date: 2025-08-22
URL: https://www.sciencedirect.com/science/article/pii/S0306457325002924
Citations: 0
Abstract
Early prediction of student performance (EPSP) has garnered significant attention due to its educational value, especially its importance in academic early warning systems. State-of-the-art data mining methods have achieved remarkable success by optimizing feature selection and enhancing model design. However, these methods often face challenges, including the cold-start problem, limited exploration of the intrinsic relationships among features, and poor generalization. In this work, we explore the utilization of Large Language Models (LLMs) as information integrators to address these challenges and propose a novel model called Large Language Model Empowered Early Prediction of Student Performance (LLM-EPSP). Specifically, for the cold-start problem, LLM-EPSP benefits from the inherent advantages of LLMs, which stem from their extensive pretraining on diverse datasets. This enables the model to make informed predictions even with limited initial data. For exploring intrinsic relationships among features, LLM-EPSP employs feature fusion techniques to uncover underlying connections between various features, ensuring a comprehensive and robust analysis. To enhance the generalization capabilities of LLM-EPSP, we develop predefined templates that facilitate its adaptation to a wide range of educational contexts. We evaluate our method on two real-world datasets: (1) OULAD, which includes data on 22 courses and 32,593 students, and (2) the UCI Machine Learning Repository, which contains 23 types of features from 649 students. Extensive validation demonstrates that LLM-EPSP considerably outperforms baseline approaches across diverse scenarios. Further analysis results also demonstrate the robustness and versatility of LLM-EPSP, suggesting its enormous potential in practical applications.
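To make the template idea concrete: one common way to let an LLM consume tabular student records is to serialize each record into a natural-language prompt through a predefined template, which is the general approach the abstract describes. The sketch below illustrates this pattern only; the field names, the template wording, and the pass/fail question are hypothetical and are not taken from the paper.

```python
# Illustrative sketch of template-based feature fusion for EPSP-style
# prediction: heterogeneous tabular features are serialized into a single
# natural-language prompt that an LLM could then classify. All feature
# names and template text here are invented for illustration.

TEMPLATE = (
    "A student has made {clicks} clicks on the course platform, "
    "scored {avg_score:.1f} on average across assessments, and attended "
    "{attendance:.0%} of sessions. Will the student pass the course?"
)

def build_prompt(features: dict) -> str:
    """Fuse a record of student features into one textual prompt."""
    return TEMPLATE.format(**features)

if __name__ == "__main__":
    example = {"clicks": 1240, "avg_score": 72.5, "attendance": 0.85}
    print(build_prompt(example))
```

In a full pipeline, prompts like this would be fed to a pretrained LLM, whose broad pretraining is what the abstract credits with mitigating the cold-start problem when little course-specific interaction data is available.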
About the journal:
Information Processing and Management is dedicated to publishing cutting-edge original research at the convergence of computing and information science. Our scope encompasses theory, methods, and applications across various domains, including advertising, business, health, information science, information technology, marketing, and social computing.
We aim to cater to the interests of both primary researchers and practitioners by offering an effective platform for the timely dissemination of advanced and topical issues in this interdisciplinary field. The journal places particular emphasis on original research articles, research survey articles, research method articles, and articles addressing critical applications of research. Join us in advancing knowledge and innovation at the intersection of computing and information science.