{"title":"通过微调的大型语言模型评估练习意识高阶思维技能","authors":"Xiuling He, Xiong Xiao, Jing Fang, Yue Li, Yangyang Li, Ruijie Zhou","doi":"10.1016/j.knosys.2025.113808","DOIUrl":null,"url":null,"abstract":"<div><div>Higher-order thinking Skills (HOTS) are complex cognitive skills that go beyond the basic levels of memory and comprehension. It is critical to developing an individual’s critical thinking and problem-solving skills. Current methods for assessing HOTS typically rely on expert judgment or specially designed assessment tasks, which have become well-established and reliable paradigms but are time-consuming and difficult to transfer. Accordingly, this study aims to develop an automated HOTS assessment model that will reduce reliance on experts and enhance efficiency and accuracy. Large language models (LLM) are pre-trained models using deep learning techniques to deal with natural language processing tasks. They have excellent knowledge base and reasoning capabilities, and researchers have applied them to a range of domains. In this paper, we proposed the Exercise-Aware higher-order Thinking skills Assessment (EATA) model based on fine-tuning the LLM. The EATA comprises the Exercise Awareness (EA) and the HOTS Assessment (HA) modules. The EA module includes a pre-trained Text2Vec and a multilayer perceptron (MLP). It integrates the exercise text information with the HOTS labeling information to generate the higher-order exercise embedding matrix. The HA module employs a pre-trained LLM as the underlying network, which takes the student’s learning records with the higher-order exercise embedding as inputs and automatically assesses the student’s HOTS through a fine-tuning technique. In this way, EATA can emulate traditional assessment methods, but replace experts with LLM. It reduces the interference of human factors, thus improving efficiency. To verify the validity of EATA, we collect 43070 online exercise data from 181 undergraduate students in a university. The experiments show that EATA can effectively assess students’ HOTS, indicating the potential value of LLM in HOTS assessment tasks. The implementations are available at <span><span>https://github.com/xxiongGG/EATA</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"324 ","pages":"Article 113808"},"PeriodicalIF":7.6000,"publicationDate":"2025-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Exercise-Aware higher-order Thinking skills Assessment via fine-tuned large language model\",\"authors\":\"Xiuling He, Xiong Xiao, Jing Fang, Yue Li, Yangyang Li, Ruijie Zhou\",\"doi\":\"10.1016/j.knosys.2025.113808\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Higher-order thinking Skills (HOTS) are complex cognitive skills that go beyond the basic levels of memory and comprehension. It is critical to developing an individual’s critical thinking and problem-solving skills. Current methods for assessing HOTS typically rely on expert judgment or specially designed assessment tasks, which have become well-established and reliable paradigms but are time-consuming and difficult to transfer. Accordingly, this study aims to develop an automated HOTS assessment model that will reduce reliance on experts and enhance efficiency and accuracy. Large language models (LLM) are pre-trained models using deep learning techniques to deal with natural language processing tasks. They have excellent knowledge base and reasoning capabilities, and researchers have applied them to a range of domains. In this paper, we proposed the Exercise-Aware higher-order Thinking skills Assessment (EATA) model based on fine-tuning the LLM. The EATA comprises the Exercise Awareness (EA) and the HOTS Assessment (HA) modules. The EA module includes a pre-trained Text2Vec and a multilayer perceptron (MLP). It integrates the exercise text information with the HOTS labeling information to generate the higher-order exercise embedding matrix. The HA module employs a pre-trained LLM as the underlying network, which takes the student’s learning records with the higher-order exercise embedding as inputs and automatically assesses the student’s HOTS through a fine-tuning technique. In this way, EATA can emulate traditional assessment methods, but replace experts with LLM. It reduces the interference of human factors, thus improving efficiency. To verify the validity of EATA, we collect 43070 online exercise data from 181 undergraduate students in a university. The experiments show that EATA can effectively assess students’ HOTS, indicating the potential value of LLM in HOTS assessment tasks. The implementations are available at <span><span>https://github.com/xxiongGG/EATA</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":49939,\"journal\":{\"name\":\"Knowledge-Based Systems\",\"volume\":\"324 \",\"pages\":\"Article 113808\"},\"PeriodicalIF\":7.6000,\"publicationDate\":\"2025-06-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Knowledge-Based Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0950705125008548\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125008548","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Exercise-Aware higher-order Thinking skills Assessment via fine-tuned large language model
Higher-order thinking Skills (HOTS) are complex cognitive skills that go beyond the basic levels of memory and comprehension. It is critical to developing an individual’s critical thinking and problem-solving skills. Current methods for assessing HOTS typically rely on expert judgment or specially designed assessment tasks, which have become well-established and reliable paradigms but are time-consuming and difficult to transfer. Accordingly, this study aims to develop an automated HOTS assessment model that will reduce reliance on experts and enhance efficiency and accuracy. Large language models (LLM) are pre-trained models using deep learning techniques to deal with natural language processing tasks. They have excellent knowledge base and reasoning capabilities, and researchers have applied them to a range of domains. In this paper, we proposed the Exercise-Aware higher-order Thinking skills Assessment (EATA) model based on fine-tuning the LLM. The EATA comprises the Exercise Awareness (EA) and the HOTS Assessment (HA) modules. The EA module includes a pre-trained Text2Vec and a multilayer perceptron (MLP). It integrates the exercise text information with the HOTS labeling information to generate the higher-order exercise embedding matrix. The HA module employs a pre-trained LLM as the underlying network, which takes the student’s learning records with the higher-order exercise embedding as inputs and automatically assesses the student’s HOTS through a fine-tuning technique. In this way, EATA can emulate traditional assessment methods, but replace experts with LLM. It reduces the interference of human factors, thus improving efficiency. To verify the validity of EATA, we collect 43070 online exercise data from 181 undergraduate students in a university. The experiments show that EATA can effectively assess students’ HOTS, indicating the potential value of LLM in HOTS assessment tasks. The implementations are available at https://github.com/xxiongGG/EATA.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.