{"title":"A Review on Supervised Machine Learning Text Categorization Approaches","authors":"Aayushi A. Shah, Keyur Rana","doi":"10.1109/ICCSDET.2018.8821134","DOIUrl":null,"url":null,"abstract":"The inception of abundant and hefty digitized text documents has emerged in recent years. Such texts need to be categorized technically for storage and retrieval purpose. Text Categorization Machine Learning techniques exist to manage the substantial data. These techniques can be divided into Supervised and Unsupervised Machine Learning techniques. In this paper we review Supervised Machine learning techniques that deal with generating patterns and hypothesis according to the prior knowledge. The objective of this paper is to highlight important features and methods of Supervised Machine Learning techniques. Selecting important features and then classifying the text is necessary for effective text categorization. The performance of Text Categorization and dimensionality improves by performing feature selection in the existing techniques. The importance of feature selection is considered in this paper. The general comparison table of techniques is presented for in depth parametric assessment.","PeriodicalId":157362,"journal":{"name":"2018 International Conference on Circuits and Systems in Digital Enterprise Technology (ICCSDET)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Circuits and Systems in Digital Enterprise Technology (ICCSDET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCSDET.2018.8821134","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
The inception of abundant and hefty digitized text documents has emerged in recent years. Such texts need to be categorized technically for storage and retrieval purpose. Text Categorization Machine Learning techniques exist to manage the substantial data. These techniques can be divided into Supervised and Unsupervised Machine Learning techniques. In this paper we review Supervised Machine learning techniques that deal with generating patterns and hypothesis according to the prior knowledge. The objective of this paper is to highlight important features and methods of Supervised Machine Learning techniques. Selecting important features and then classifying the text is necessary for effective text categorization. The performance of Text Categorization and dimensionality improves by performing feature selection in the existing techniques. The importance of feature selection is considered in this paper. The general comparison table of techniques is presented for in depth parametric assessment.