Ensemble-based factor analysis of survey data on computer science courses

Shaohong Zhang, Jiqiao Li, Haihuang Huang, Hongqiu Wu, Haodong Lin, Wenxiao Qiu
{"title":"Ensemble-based factor analysis of survey data on computer science courses","authors":"Shaohong Zhang, Jiqiao Li, Haihuang Huang, Hongqiu Wu, Haodong Lin, Wenxiao Qiu","doi":"10.1109/CISP-BMEI.2016.7853032","DOIUrl":null,"url":null,"abstract":"Analysis of course survey data are foundationally important because it has a close relation with students' skills training and talent development. However, there are multiple categories of classification information that characterized different students from different perspectives, such as genders, classes, types or regions. Compared to the correlation between survey data with individual kind of classification, consensus between those with multiple categories is more informative. However, traditional methods can not handle this problem with multiple kinds of classification categories. In view of the problem above, in this work, we proposed to conduct a new factor analysis method in the consensus fashion. Specifically, we computed a consensus structure of different categories of classifications and the similarity matrices for each factor, respectively. Motivated by the famous clustering measure RAND, we proposed a generalized method called Rmm which is used to compute the consistency of two different similarity matrices, with the option to adopt weights. Followed the similar spirit of Rmm, we also computed pairwise consistency between different factors without category information. Experimental results show that: (i) Our Rmm method is capable to combine different categories of classification information in an effective manner;(ii) our Rmm method preserve promising linear property for the combination of classifications; and (iii) Our Rmm method can be used in the case of fuzzy similarity, which is more applicable in different scenarios. With the proposed method, this work attempted to figure out the veiled laws and covered from read data, with the hope to provide suggestions for course arrangement and personnel cultivating program in universities, and to shed light on personal development of university students who major on computer science.","PeriodicalId":275095,"journal":{"name":"2016 9th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)","volume":"20 2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 9th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CISP-BMEI.2016.7853032","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Analysis of course survey data are foundationally important because it has a close relation with students' skills training and talent development. However, there are multiple categories of classification information that characterized different students from different perspectives, such as genders, classes, types or regions. Compared to the correlation between survey data with individual kind of classification, consensus between those with multiple categories is more informative. However, traditional methods can not handle this problem with multiple kinds of classification categories. In view of the problem above, in this work, we proposed to conduct a new factor analysis method in the consensus fashion. Specifically, we computed a consensus structure of different categories of classifications and the similarity matrices for each factor, respectively. Motivated by the famous clustering measure RAND, we proposed a generalized method called Rmm which is used to compute the consistency of two different similarity matrices, with the option to adopt weights. Followed the similar spirit of Rmm, we also computed pairwise consistency between different factors without category information. Experimental results show that: (i) Our Rmm method is capable to combine different categories of classification information in an effective manner;(ii) our Rmm method preserve promising linear property for the combination of classifications; and (iii) Our Rmm method can be used in the case of fuzzy similarity, which is more applicable in different scenarios. With the proposed method, this work attempted to figure out the veiled laws and covered from read data, with the hope to provide suggestions for course arrangement and personnel cultivating program in universities, and to shed light on personal development of university students who major on computer science.
基于集成的计算机科学课程调查数据因子分析
课程调查数据的分析与学生的技能培养和人才发展密切相关,具有基础性的重要意义。然而,有多类分类信息,从不同的角度来表征不同的学生,如性别、班级、类型或地区。与单项分类调查数据之间的相关性相比,多类分类调查数据之间的一致性更具信息性。然而,传统的分类方法在分类类别繁多的情况下无法处理这一问题。针对上述问题,在本工作中,我们提出以共识方式进行新的因子分析方法。具体来说,我们分别计算了不同类别分类的共识结构和每个因素的相似性矩阵。受著名的聚类度量RAND的启发,我们提出了一种称为Rmm的广义方法,该方法用于计算两个不同相似矩阵的一致性,并可选择采用权重。遵循Rmm的类似精神,我们也在没有类别信息的情况下计算了不同因素之间的两两一致性。实验结果表明:(1)我们的Rmm方法能够有效地组合不同类别的分类信息;(2)我们的Rmm方法对分类组合保持了良好的线性特性;(iii)我们的Rmm方法可以在模糊相似的情况下使用,更适用于不同的场景。本工作试图通过提出的方法,从阅读数据中找出隐藏的规律和覆盖的规律,希望为高校的课程安排和人才培养计划提供建议,并为计算机专业大学生的个人发展提供启示。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信