{"title":"基于随机森林的博士论文影响因素及预测研究","authors":"Yang Lian, Jingying Chen, Chunyan Su","doi":"10.1145/3134847.3134855","DOIUrl":null,"url":null,"abstract":"Doctoral dissertation is one of the most important parts in the doctoral degree program. This paper analyzes the data of doctoral dissertations in a university for three consecutive years, and investigates the factors related to the quality of doctoral dissertations from the aspects of students' characteristics and cultivation methods. The results show that the enrollment age, study period, mode of study and subject categories, whether cross-specialty, etc., have significant impact on the quality of doctoral dissertations. Then, a prediction model based on the weighted Random Forest (RF) is presented to predict the quality of doctoral dissertation, which is effective for unbalanced data and improves the generalization ability of the original RF, the encouraging result of 81.29% prediction rate has been obtained, which provides objective evidence for doctoral degree program management.","PeriodicalId":269655,"journal":{"name":"Proceedings of the 1st International Conference on Digital Technology in Education","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2017-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Study on Influencing Factors and Prediction Based on Random Forest of Doctoral Dissertations\",\"authors\":\"Yang Lian, Jingying Chen, Chunyan Su\",\"doi\":\"10.1145/3134847.3134855\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Doctoral dissertation is one of the most important parts in the doctoral degree program. This paper analyzes the data of doctoral dissertations in a university for three consecutive years, and investigates the factors related to the quality of doctoral dissertations from the aspects of students' characteristics and cultivation methods. The results show that the enrollment age, study period, mode of study and subject categories, whether cross-specialty, etc., have significant impact on the quality of doctoral dissertations. Then, a prediction model based on the weighted Random Forest (RF) is presented to predict the quality of doctoral dissertation, which is effective for unbalanced data and improves the generalization ability of the original RF, the encouraging result of 81.29% prediction rate has been obtained, which provides objective evidence for doctoral degree program management.\",\"PeriodicalId\":269655,\"journal\":{\"name\":\"Proceedings of the 1st International Conference on Digital Technology in Education\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-08-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 1st International Conference on Digital Technology in Education\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3134847.3134855\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 1st International Conference on Digital Technology in Education","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3134847.3134855","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Study on Influencing Factors and Prediction Based on Random Forest of Doctoral Dissertations
Doctoral dissertation is one of the most important parts in the doctoral degree program. This paper analyzes the data of doctoral dissertations in a university for three consecutive years, and investigates the factors related to the quality of doctoral dissertations from the aspects of students' characteristics and cultivation methods. The results show that the enrollment age, study period, mode of study and subject categories, whether cross-specialty, etc., have significant impact on the quality of doctoral dissertations. Then, a prediction model based on the weighted Random Forest (RF) is presented to predict the quality of doctoral dissertation, which is effective for unbalanced data and improves the generalization ability of the original RF, the encouraging result of 81.29% prediction rate has been obtained, which provides objective evidence for doctoral degree program management.