{"title":"Categorization of Dissertation using Machine Learning Techniques","authors":"L. Kumar, Manish Jain","doi":"10.1109/ICONC345789.2020.9117485","DOIUrl":null,"url":null,"abstract":"Machine learning techniques are widely used to take intelligent decisions in industrial and educational domains. In the educational domain, when a research scholar submits a dissertation, then it has to be indexed and classified. The number of dissertations that are submitted in an educational institute is usually high and if done manually, it becomes difficult to index and classify correctly. This study applies machine learning techniques to automate the indexing and categorization of dissertations. We have focused on dissertations from the Engineering, Medical, Social Science, and General Science fields. We used the Bag of Words (BoW) method to extract features and K-means, Density-based spatial clustering of applications with noise (DBSCAN) and Expectation-Maximisation (EM) to train our model. Our experimental results reveal that the proposed K- means technique for indexing and categorization leads to higher accuracy and significant reduction in negative predictions as compared to DBSCAN and Expectation-Maximisation (EM).","PeriodicalId":155813,"journal":{"name":"2020 International Conference on Emerging Trends in Communication, Control and Computing (ICONC3)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Emerging Trends in Communication, Control and Computing (ICONC3)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICONC345789.2020.9117485","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Machine learning techniques are widely used to take intelligent decisions in industrial and educational domains. In the educational domain, when a research scholar submits a dissertation, then it has to be indexed and classified. The number of dissertations that are submitted in an educational institute is usually high and if done manually, it becomes difficult to index and classify correctly. This study applies machine learning techniques to automate the indexing and categorization of dissertations. We have focused on dissertations from the Engineering, Medical, Social Science, and General Science fields. We used the Bag of Words (BoW) method to extract features and K-means, Density-based spatial clustering of applications with noise (DBSCAN) and Expectation-Maximisation (EM) to train our model. Our experimental results reveal that the proposed K- means technique for indexing and categorization leads to higher accuracy and significant reduction in negative predictions as compared to DBSCAN and Expectation-Maximisation (EM).