{"title":"利用模拟学生对自适应测试系统进行评估","authors":"Sophiana Chua Abdullah, R. Cooley","doi":"10.1109/CIE.2002.1186021","DOIUrl":null,"url":null,"abstract":"This paper describes a strategy for evaluating an assessor algorithm through the use of simulated students. The assessor is an adaptive testing strategy elicited from a human expert. The assessor claims an accurate assessment of the student's domain knowledge with a minimum number of questions presented to the student. An evaluation of the assessor is carried out to test the claim. Three types of students are simulated-one who knows all the relevant skills in the subject domain, one who knows a subset of the relevant skills, and one who has one or more malrules. The performance of the assessor in predicting the mastery of skills of each student is analysed. Future work involves the evaluation of its accuracy in the assessment of other types of students, such as students with lucky guesses or careless slips.","PeriodicalId":206223,"journal":{"name":"International Conference on Computers in Education, 2002. Proceedings.","volume":"888 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2002-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"17","resultStr":"{\"title\":\"Using simulated students to evaluate an adaptive testing system\",\"authors\":\"Sophiana Chua Abdullah, R. Cooley\",\"doi\":\"10.1109/CIE.2002.1186021\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper describes a strategy for evaluating an assessor algorithm through the use of simulated students. The assessor is an adaptive testing strategy elicited from a human expert. The assessor claims an accurate assessment of the student's domain knowledge with a minimum number of questions presented to the student. An evaluation of the assessor is carried out to test the claim. Three types of students are simulated-one who knows all the relevant skills in the subject domain, one who knows a subset of the relevant skills, and one who has one or more malrules. The performance of the assessor in predicting the mastery of skills of each student is analysed. Future work involves the evaluation of its accuracy in the assessment of other types of students, such as students with lucky guesses or careless slips.\",\"PeriodicalId\":206223,\"journal\":{\"name\":\"International Conference on Computers in Education, 2002. Proceedings.\",\"volume\":\"888 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2002-12-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"17\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference on Computers in Education, 2002. Proceedings.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CIE.2002.1186021\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Computers in Education, 2002. Proceedings.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIE.2002.1186021","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Using simulated students to evaluate an adaptive testing system
This paper describes a strategy for evaluating an assessor algorithm through the use of simulated students. The assessor is an adaptive testing strategy elicited from a human expert. The assessor claims an accurate assessment of the student's domain knowledge with a minimum number of questions presented to the student. An evaluation of the assessor is carried out to test the claim. Three types of students are simulated-one who knows all the relevant skills in the subject domain, one who knows a subset of the relevant skills, and one who has one or more malrules. The performance of the assessor in predicting the mastery of skills of each student is analysed. Future work involves the evaluation of its accuracy in the assessment of other types of students, such as students with lucky guesses or careless slips.