Data mining in intelligent tutoring systems using rough sets

International Conference on Electrical, Electronic and Computer Engineering, 2004. ICEEC '04. Pub Date : 2004-09-05 DOI:10.1109/ICEEC.2004.1374414

S. S. Attia, H. Mahdi, H.K. Mohammad

{"title":"Data mining in intelligent tutoring systems using rough sets","authors":"S. S. Attia, H. Mahdi, H.K. Mohammad","doi":"10.1109/ICEEC.2004.1374414","DOIUrl":null,"url":null,"abstract":"Data mining aims at searching for meaninghl injormation like patterns and rules in large volumes of data. Our objective is to mine the data of Intelligent Tutoring Systems (rrS). n e s e are tutoring systems which offer the ability to respond to individualized student ne&. An qweriment was conducted over a lesson for binary relatbns. Students’ answers to questions at the end of the lesson were collected. Data mining was implemented to extract important nrles @om the data (students’ answers) and hence the student can be directed to which parts of the lesson he should take again, thus heking to adopt the brtoring systems to each student individual needs nree approaches are applied to detect the decision nrles based on the Rough Sets and the Md$ed Rough &s. n e s e approaches provide a poweijid foundation to discover important structures in data. These approaches are unique in the sense that they onb use the injormation given by the data and do not rely on other model assumptions. f i e results obtained were in the form of rules that showed what concepts the student understood and which he did not understand depending on which questions he answered correct and which questions he answered wrong. Also some questions of the quizzes were found to be useless. It was concluded that data mining was able to extract some important patterns and rules >om the students’ answers which were hidden before and which are helpfir1 to both the students and the expert. Data mining is a set of methods used as a step in the Knowledge Discovery (KD) process to distinguish previously unknown relationships, rules and patterns within large volumes of data [l]. One of data mining tasks is &scription i.e. to describe databases in terms of patterns which human can understand and make use of In our research, we are trying to mine databases resulting from Intelligent Tutoring Systems (ITS). These are Computerbased tutoring systems which achieve their intelligence by representing pedagogical decisions about how to teach as well as information about the learner. This allows for greater versatility by altering t k system’s interactions with students. Intelligent tutoring systems have been shown to be highly effective at increasing student’s motivation and performance [2]. 0-7803-8575-6/04/$20.00 02004 IEEE The goal of data mining in ITS is to automatically assess student knowledge of the concepts underlying a tutorial topic, and use this assessment to direct remediation of knowledge. It does not require any knowledge about the subject being taught [3]. Thus our main objective is to investigate the application of data mining to provide a reliable way to determine a student knowledge status i.e. what a student does and does not know during the course of instruction. Once student knowledge can be assessed automatically without human intervention, computer4ased educational system can be individually tailored to each student’s leaming needs. This research investigates the application of rough sets as a method of data mining and knowledge discovery in ITS. Rough Sets approaches provide a powerful foundation to reveal and discover important structures in data They have been shown to be very effective for revealing relationships within imprecise data, discovering independencies among objects, removing redundancies and extmcting decision rules [4]. Three approaches were applied in this research two different approaches of traditional Rough Sets [5] and one approach of Modified Rough Sets [6]. The rules extracted were evaluated to test their quality using complexity measure [7]. The data sets on which the three approaches were applied came from a trial run of a binary relations lesson in North Carolina State University (NCSU) with a group of students taking Discrete Mathematics during the past three years. The binary relations lesson is taken over the NovaNET network which is a computer-based educational network that evolved from the first computer-based educational network, PLATO. NovaNET is a system-quality network with response times in fractions of seconds. Its programming language, TUTOR, is particularly effective for writing educational lessons, including modes for answer judging and integrated help. Lessons on NovaNET integrate text and graphics, and can also be linked to Internet resources, including audic-visual presentations The second section of this paper describes the rou& sets algorithms implemented. The third section describes the three different methods for the extraction of decision rules. The fourth section is for data analysis and the fifth section deals with the results obtained. Finally, the last section presents t he conclusions and future work. 131.","PeriodicalId":180043,"journal":{"name":"International Conference on Electrical, Electronic and Computer Engineering, 2004. ICEEC '04.","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2004-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Electrical, Electronic and Computer Engineering, 2004. ICEEC '04.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEEC.2004.1374414","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

Abstract

Data mining aims at searching for meaninghl injormation like patterns and rules in large volumes of data. Our objective is to mine the data of Intelligent Tutoring Systems (rrS). n e s e are tutoring systems which offer the ability to respond to individualized student ne&. An qweriment was conducted over a lesson for binary relatbns. Students’ answers to questions at the end of the lesson were collected. Data mining was implemented to extract important nrles @om the data (students’ answers) and hence the student can be directed to which parts of the lesson he should take again, thus heking to adopt the brtoring systems to each student individual needs nree approaches are applied to detect the decision nrles based on the Rough Sets and the Md$ed Rough &s. n e s e approaches provide a poweijid foundation to discover important structures in data. These approaches are unique in the sense that they onb use the injormation given by the data and do not rely on other model assumptions. f i e results obtained were in the form of rules that showed what concepts the student understood and which he did not understand depending on which questions he answered correct and which questions he answered wrong. Also some questions of the quizzes were found to be useless. It was concluded that data mining was able to extract some important patterns and rules >om the students’ answers which were hidden before and which are helpfir1 to both the students and the expert. Data mining is a set of methods used as a step in the Knowledge Discovery (KD) process to distinguish previously unknown relationships, rules and patterns within large volumes of data [l]. One of data mining tasks is &scription i.e. to describe databases in terms of patterns which human can understand and make use of In our research, we are trying to mine databases resulting from Intelligent Tutoring Systems (ITS). These are Computerbased tutoring systems which achieve their intelligence by representing pedagogical decisions about how to teach as well as information about the learner. This allows for greater versatility by altering t k system’s interactions with students. Intelligent tutoring systems have been shown to be highly effective at increasing student’s motivation and performance [2]. 0-7803-8575-6/04/$20.00 02004 IEEE The goal of data mining in ITS is to automatically assess student knowledge of the concepts underlying a tutorial topic, and use this assessment to direct remediation of knowledge. It does not require any knowledge about the subject being taught [3]. Thus our main objective is to investigate the application of data mining to provide a reliable way to determine a student knowledge status i.e. what a student does and does not know during the course of instruction. Once student knowledge can be assessed automatically without human intervention, computer4ased educational system can be individually tailored to each student’s leaming needs. This research investigates the application of rough sets as a method of data mining and knowledge discovery in ITS. Rough Sets approaches provide a powerful foundation to reveal and discover important structures in data They have been shown to be very effective for revealing relationships within imprecise data, discovering independencies among objects, removing redundancies and extmcting decision rules [4]. Three approaches were applied in this research two different approaches of traditional Rough Sets [5] and one approach of Modified Rough Sets [6]. The rules extracted were evaluated to test their quality using complexity measure [7]. The data sets on which the three approaches were applied came from a trial run of a binary relations lesson in North Carolina State University (NCSU) with a group of students taking Discrete Mathematics during the past three years. The binary relations lesson is taken over the NovaNET network which is a computer-based educational network that evolved from the first computer-based educational network, PLATO. NovaNET is a system-quality network with response times in fractions of seconds. Its programming language, TUTOR, is particularly effective for writing educational lessons, including modes for answer judging and integrated help. Lessons on NovaNET integrate text and graphics, and can also be linked to Internet resources, including audic-visual presentations The second section of this paper describes the rou& sets algorithms implemented. The third section describes the three different methods for the extraction of decision rules. The fourth section is for data analysis and the fifth section deals with the results obtained. Finally, the last section presents t he conclusions and future work. 131.

查看原文本刊更多论文

基于粗糙集的智能辅导系统数据挖掘

数据挖掘的目的是在大量数据中寻找有意义的信息，如模式和规则。我们的目标是挖掘智能辅导系统(rrS)的数据。这是一种辅导系统，它提供了对个性化的学生需求做出反应的能力。对二元关系的一课进行了一次问卷调查。在课程结束时收集了学生对问题的回答。数据挖掘是为了从数据(学生的答案)中提取重要的非规则规则，因此学生可以被引导到他应该再次学习的课程的哪一部分，因此他希望采用基于每个学生个人需求的导航系统。这些方法为发现数据中的重要结构提供了强大的基础。这些方法的独特之处在于，它们只使用数据提供的信息，而不依赖于其他模型假设。得到的结果以规则的形式显示学生理解和不理解的概念，这取决于他回答对了哪些问题，回答错了哪些问题。此外，测试中的一些问题被发现是无用的。结果表明，数据挖掘能够从学生的答案中提取出一些重要的模式和规则，这些模式和规则在以前是隐藏的，对学生和专家都有帮助。数据挖掘是知识发现(Knowledge Discovery, KD)过程中的一组方法，用于区分大量数据中先前未知的关系、规则和模式[1]。数据挖掘的任务之一是描述数据库，即用人类可以理解和利用的模式来描述数据库。在我们的研究中，我们试图挖掘智能辅导系统(ITS)产生的数据库。这是一种基于计算机的辅导系统，它通过表示关于如何教学的教学决策以及关于学习者的信息来实现其智能。这可以通过改变系统与学生的互动来实现更大的通用性。智能辅导系统已被证明在提高学生的学习动机和学习成绩方面非常有效[2]。ITS中数据挖掘的目标是自动评估学生对教程主题基础概念的知识，并使用此评估来指导知识的补救。它不需要任何关于所教科目的知识[3]。因此，我们的主要目标是研究数据挖掘的应用，以提供一种可靠的方法来确定学生的知识状态，即学生在教学过程中知道什么和不知道什么。一旦学生的知识可以在没有人为干预的情况下自动评估，基于计算机的教育系统就可以根据每个学生的学习需求进行个性化调整。本研究探讨了粗糙集作为一种数据挖掘和知识发现方法在智能交通系统中的应用。粗糙集方法为揭示和发现数据中的重要结构提供了强大的基础，它们已被证明在揭示不精确数据中的关系、发现对象之间的独立性、去除冗余和提取决策规则方面非常有效[4]。本研究采用了三种方法:两种不同的传统粗糙集方法[5]和一种改进粗糙集方法[6]。使用复杂性度量对提取的规则进行评估以测试其质量[7]。应用这三种方法的数据集来自北卡罗来纳州立大学(NCSU)在过去三年中对一组学习离散数学的学生进行的二元关系课程的试运行。二元关系课程在NovaNET网络上进行，NovaNET是一个基于计算机的教育网络，从第一个基于计算机的教育网络PLATO发展而来。NovaNET是一个系统质量的网络，响应时间在几秒钟内。它的编程语言TUTOR对于编写教育课程特别有效，包括答案判断模式和综合帮助。NovaNET上的课程集成了文本和图形，还可以链接到互联网资源，包括视听演示。本文的第二部分描述了实现的rou&sets算法。第三部分描述了提取决策规则的三种不同方法。第四部分为数据分析，第五部分为所得结果。最后，最后一部分提出了本文的结论和未来的工作。131.

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Conference on Electrical, Electronic and Computer Engineering, 2004. ICEEC '04.

自引率

0.00%

发文量