{"title":"基于机器学习的文本数据分类与优化方案Naïve贝叶斯分类器","authors":"Venkatesh, K. Ranjitha","doi":"10.1109/WSCE.2018.8690536","DOIUrl":null,"url":null,"abstract":"Text classification is an essential advance in characteristic dialect processing. It very well may be performed utilizing different classification algorithms. Hadoop Map Reduce is widely utilized in text classification to perform classification on colossal measure of text data. However, Map Reduce required a ton of time to perform the tasks thereby increasing latency and since the data is distributed over the cluster it builds time and thus reducing processing speed. Also Hadoop utilizes long queue of code. Motivated by this, we propose a basic yet compelling machine learning method which uses Naïve Bayes classifier for text data. In Machine Learning approach, the classifier is built automatically by learning the properties of categories from a set of pre-defined training data. Hence, it can process complex furthermore, multi assortment information in dynamic situations. Here we propose a naïve bayes classifier which scales directly with number of indicators and data points which can be used for both binary and multiclass classification problems. We implemented the presented schemes using Machine Learning tool. The experimental results demonstrate the performance improvement in the classification technique.","PeriodicalId":276876,"journal":{"name":"2018 IEEE World Symposium on Communication Engineering (WSCE)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"20","resultStr":"{\"title\":\"Classification and Optimization Scheme for Text Data using Machine Learning Naïve Bayes Classifier\",\"authors\":\"Venkatesh, K. Ranjitha\",\"doi\":\"10.1109/WSCE.2018.8690536\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Text classification is an essential advance in characteristic dialect processing. It very well may be performed utilizing different classification algorithms. Hadoop Map Reduce is widely utilized in text classification to perform classification on colossal measure of text data. However, Map Reduce required a ton of time to perform the tasks thereby increasing latency and since the data is distributed over the cluster it builds time and thus reducing processing speed. Also Hadoop utilizes long queue of code. Motivated by this, we propose a basic yet compelling machine learning method which uses Naïve Bayes classifier for text data. In Machine Learning approach, the classifier is built automatically by learning the properties of categories from a set of pre-defined training data. Hence, it can process complex furthermore, multi assortment information in dynamic situations. Here we propose a naïve bayes classifier which scales directly with number of indicators and data points which can be used for both binary and multiclass classification problems. We implemented the presented schemes using Machine Learning tool. The experimental results demonstrate the performance improvement in the classification technique.\",\"PeriodicalId\":276876,\"journal\":{\"name\":\"2018 IEEE World Symposium on Communication Engineering (WSCE)\",\"volume\":\"15 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"20\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE World Symposium on Communication Engineering (WSCE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WSCE.2018.8690536\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE World Symposium on Communication Engineering (WSCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WSCE.2018.8690536","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Classification and Optimization Scheme for Text Data using Machine Learning Naïve Bayes Classifier
Text classification is an essential advance in characteristic dialect processing. It very well may be performed utilizing different classification algorithms. Hadoop Map Reduce is widely utilized in text classification to perform classification on colossal measure of text data. However, Map Reduce required a ton of time to perform the tasks thereby increasing latency and since the data is distributed over the cluster it builds time and thus reducing processing speed. Also Hadoop utilizes long queue of code. Motivated by this, we propose a basic yet compelling machine learning method which uses Naïve Bayes classifier for text data. In Machine Learning approach, the classifier is built automatically by learning the properties of categories from a set of pre-defined training data. Hence, it can process complex furthermore, multi assortment information in dynamic situations. Here we propose a naïve bayes classifier which scales directly with number of indicators and data points which can be used for both binary and multiclass classification problems. We implemented the presented schemes using Machine Learning tool. The experimental results demonstrate the performance improvement in the classification technique.