Ensemble-based factor analysis of survey data on computer science courses

2016 9th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI) Pub Date : 2016-10-01 DOI:10.1109/CISP-BMEI.2016.7853032

Shaohong Zhang, Jiqiao Li, Haihuang Huang, Hongqiu Wu, Haodong Lin, Wenxiao Qiu

{"title":"Ensemble-based factor analysis of survey data on computer science courses","authors":"Shaohong Zhang, Jiqiao Li, Haihuang Huang, Hongqiu Wu, Haodong Lin, Wenxiao Qiu","doi":"10.1109/CISP-BMEI.2016.7853032","DOIUrl":null,"url":null,"abstract":"Analysis of course survey data are foundationally important because it has a close relation with students' skills training and talent development. However, there are multiple categories of classification information that characterized different students from different perspectives, such as genders, classes, types or regions. Compared to the correlation between survey data with individual kind of classification, consensus between those with multiple categories is more informative. However, traditional methods can not handle this problem with multiple kinds of classification categories. In view of the problem above, in this work, we proposed to conduct a new factor analysis method in the consensus fashion. Specifically, we computed a consensus structure of different categories of classifications and the similarity matrices for each factor, respectively. Motivated by the famous clustering measure RAND, we proposed a generalized method called Rmm which is used to compute the consistency of two different similarity matrices, with the option to adopt weights. Followed the similar spirit of Rmm, we also computed pairwise consistency between different factors without category information. Experimental results show that: (i) Our Rmm method is capable to combine different categories of classification information in an effective manner;(ii) our Rmm method preserve promising linear property for the combination of classifications; and (iii) Our Rmm method can be used in the case of fuzzy similarity, which is more applicable in different scenarios. With the proposed method, this work attempted to figure out the veiled laws and covered from read data, with the hope to provide suggestions for course arrangement and personnel cultivating program in universities, and to shed light on personal development of university students who major on computer science.","PeriodicalId":275095,"journal":{"name":"2016 9th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)","volume":"20 2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 9th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CISP-BMEI.2016.7853032","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Analysis of course survey data are foundationally important because it has a close relation with students' skills training and talent development. However, there are multiple categories of classification information that characterized different students from different perspectives, such as genders, classes, types or regions. Compared to the correlation between survey data with individual kind of classification, consensus between those with multiple categories is more informative. However, traditional methods can not handle this problem with multiple kinds of classification categories. In view of the problem above, in this work, we proposed to conduct a new factor analysis method in the consensus fashion. Specifically, we computed a consensus structure of different categories of classifications and the similarity matrices for each factor, respectively. Motivated by the famous clustering measure RAND, we proposed a generalized method called Rmm which is used to compute the consistency of two different similarity matrices, with the option to adopt weights. Followed the similar spirit of Rmm, we also computed pairwise consistency between different factors without category information. Experimental results show that: (i) Our Rmm method is capable to combine different categories of classification information in an effective manner;(ii) our Rmm method preserve promising linear property for the combination of classifications; and (iii) Our Rmm method can be used in the case of fuzzy similarity, which is more applicable in different scenarios. With the proposed method, this work attempted to figure out the veiled laws and covered from read data, with the hope to provide suggestions for course arrangement and personnel cultivating program in universities, and to shed light on personal development of university students who major on computer science.

查看原文本刊更多论文

基于集成的计算机科学课程调查数据因子分析

课程调查数据的分析与学生的技能培养和人才发展密切相关，具有基础性的重要意义。然而，有多类分类信息，从不同的角度来表征不同的学生，如性别、班级、类型或地区。与单项分类调查数据之间的相关性相比，多类分类调查数据之间的一致性更具信息性。然而，传统的分类方法在分类类别繁多的情况下无法处理这一问题。针对上述问题，在本工作中，我们提出以共识方式进行新的因子分析方法。具体来说，我们分别计算了不同类别分类的共识结构和每个因素的相似性矩阵。受著名的聚类度量RAND的启发，我们提出了一种称为Rmm的广义方法，该方法用于计算两个不同相似矩阵的一致性，并可选择采用权重。遵循Rmm的类似精神，我们也在没有类别信息的情况下计算了不同因素之间的两两一致性。实验结果表明:(1)我们的Rmm方法能够有效地组合不同类别的分类信息;(2)我们的Rmm方法对分类组合保持了良好的线性特性;(iii)我们的Rmm方法可以在模糊相似的情况下使用，更适用于不同的场景。本工作试图通过提出的方法，从阅读数据中找出隐藏的规律和覆盖的规律，希望为高校的课程安排和人才培养计划提供建议，并为计算机专业大学生的个人发展提供启示。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2016 9th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)

自引率

0.00%

发文量