{"title":"利用文本挖掘探索计算机领域研究生研究的跨学科性质:一个案例研究","authors":"M. Lall","doi":"10.1109/ICIIS51140.2020.9342734","DOIUrl":null,"url":null,"abstract":"The aim of this article is to determine whether a compartmentalized curriculum at undergraduate lead to similar silos appearing in postgraduate research outputs. For this, data in the form of subject contents of undergraduate studies were obtained from the prospectus of various academic departments in the faculty. Additionally, data in the form of abstracts of research articles published by the postgraduate researchers in these departments were obtained. A total of 118 articles published between January 2016 and May 2020 was extracted from Scopus database. K-means algorithm was used on the corpus consisting of abstract dataset to obtain the clusters. Topic modelling using Latent Dirichlet Allocation (LDA) was then applied to obtain the main topics of the clusters. It was observed that three clusters were adequate in explaining a high percentage of variance in the data and there exists a substantial overlap in the main topics of the three clusters.","PeriodicalId":352858,"journal":{"name":"2020 IEEE 15th International Conference on Industrial and Information Systems (ICIIS)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Exploring interdisciplinary nature of postgraduate research in the field of Computing using Text mining: a case study\",\"authors\":\"M. Lall\",\"doi\":\"10.1109/ICIIS51140.2020.9342734\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The aim of this article is to determine whether a compartmentalized curriculum at undergraduate lead to similar silos appearing in postgraduate research outputs. For this, data in the form of subject contents of undergraduate studies were obtained from the prospectus of various academic departments in the faculty. Additionally, data in the form of abstracts of research articles published by the postgraduate researchers in these departments were obtained. A total of 118 articles published between January 2016 and May 2020 was extracted from Scopus database. K-means algorithm was used on the corpus consisting of abstract dataset to obtain the clusters. Topic modelling using Latent Dirichlet Allocation (LDA) was then applied to obtain the main topics of the clusters. It was observed that three clusters were adequate in explaining a high percentage of variance in the data and there exists a substantial overlap in the main topics of the three clusters.\",\"PeriodicalId\":352858,\"journal\":{\"name\":\"2020 IEEE 15th International Conference on Industrial and Information Systems (ICIIS)\",\"volume\":\"3 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE 15th International Conference on Industrial and Information Systems (ICIIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICIIS51140.2020.9342734\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 15th International Conference on Industrial and Information Systems (ICIIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIIS51140.2020.9342734","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Exploring interdisciplinary nature of postgraduate research in the field of Computing using Text mining: a case study
The aim of this article is to determine whether a compartmentalized curriculum at undergraduate lead to similar silos appearing in postgraduate research outputs. For this, data in the form of subject contents of undergraduate studies were obtained from the prospectus of various academic departments in the faculty. Additionally, data in the form of abstracts of research articles published by the postgraduate researchers in these departments were obtained. A total of 118 articles published between January 2016 and May 2020 was extracted from Scopus database. K-means algorithm was used on the corpus consisting of abstract dataset to obtain the clusters. Topic modelling using Latent Dirichlet Allocation (LDA) was then applied to obtain the main topics of the clusters. It was observed that three clusters were adequate in explaining a high percentage of variance in the data and there exists a substantial overlap in the main topics of the three clusters.