{"title":"Ensemble-based factor analysis of survey data on computer science courses","authors":"Shaohong Zhang, Jiqiao Li, Haihuang Huang, Hongqiu Wu, Haodong Lin, Wenxiao Qiu","doi":"10.1109/CISP-BMEI.2016.7853032","DOIUrl":null,"url":null,"abstract":"Analysis of course survey data are foundationally important because it has a close relation with students' skills training and talent development. However, there are multiple categories of classification information that characterized different students from different perspectives, such as genders, classes, types or regions. Compared to the correlation between survey data with individual kind of classification, consensus between those with multiple categories is more informative. However, traditional methods can not handle this problem with multiple kinds of classification categories. In view of the problem above, in this work, we proposed to conduct a new factor analysis method in the consensus fashion. Specifically, we computed a consensus structure of different categories of classifications and the similarity matrices for each factor, respectively. Motivated by the famous clustering measure RAND, we proposed a generalized method called Rmm which is used to compute the consistency of two different similarity matrices, with the option to adopt weights. Followed the similar spirit of Rmm, we also computed pairwise consistency between different factors without category information. Experimental results show that: (i) Our Rmm method is capable to combine different categories of classification information in an effective manner;(ii) our Rmm method preserve promising linear property for the combination of classifications; and (iii) Our Rmm method can be used in the case of fuzzy similarity, which is more applicable in different scenarios. With the proposed method, this work attempted to figure out the veiled laws and covered from read data, with the hope to provide suggestions for course arrangement and personnel cultivating program in universities, and to shed light on personal development of university students who major on computer science.","PeriodicalId":275095,"journal":{"name":"2016 9th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)","volume":"20 2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 9th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CISP-BMEI.2016.7853032","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Analysis of course survey data are foundationally important because it has a close relation with students' skills training and talent development. However, there are multiple categories of classification information that characterized different students from different perspectives, such as genders, classes, types or regions. Compared to the correlation between survey data with individual kind of classification, consensus between those with multiple categories is more informative. However, traditional methods can not handle this problem with multiple kinds of classification categories. In view of the problem above, in this work, we proposed to conduct a new factor analysis method in the consensus fashion. Specifically, we computed a consensus structure of different categories of classifications and the similarity matrices for each factor, respectively. Motivated by the famous clustering measure RAND, we proposed a generalized method called Rmm which is used to compute the consistency of two different similarity matrices, with the option to adopt weights. Followed the similar spirit of Rmm, we also computed pairwise consistency between different factors without category information. Experimental results show that: (i) Our Rmm method is capable to combine different categories of classification information in an effective manner;(ii) our Rmm method preserve promising linear property for the combination of classifications; and (iii) Our Rmm method can be used in the case of fuzzy similarity, which is more applicable in different scenarios. With the proposed method, this work attempted to figure out the veiled laws and covered from read data, with the hope to provide suggestions for course arrangement and personnel cultivating program in universities, and to shed light on personal development of university students who major on computer science.