S. Deshpande, Monika Doke, Aishwarya Deshpande, Anagha Chaudhari
{"title":"采用结合聚类的进化方法的文档检索专家系统","authors":"S. Deshpande, Monika Doke, Aishwarya Deshpande, Anagha Chaudhari","doi":"10.1109/ICECA.2017.8212847","DOIUrl":null,"url":null,"abstract":"Classification is a central problem in the fields of data mining and machine learning. Using a training set of labeled instances, the task is to build a model (classifier) that can be used to predict the class of new unlabelled instances. Data preparation is crucial to the data mining process, and its focus is to improve the fitness of the training data for the learning algorithms to produce more effective classifiers. Searching for the frequent pattern within a specific sequence has become a much needed task in various sectors. Feature selection is selecting a subset of optimal features. Feature selection is being used in high dimensional data reduction and it is being used in several applications like medical, image processing, text mining, etc. In the existing work, unsupervised feature selection methods using Artificial Bee Colony Optimization Algorithm, Bat Algorithm and Ant Colony Optimization have been introduced. We have compared these three algorithms and concluded that Bat Algorithm proves to be better in performance than the rest. The proposed system will use a novel method to select subset of features from unlabelled data using Bat algorithm with one of the clustering algorithm and develop an expert information retrieval system.","PeriodicalId":222768,"journal":{"name":"2017 International conference of Electronics, Communication and Aerospace Technology (ICECA)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Expert system for retrieval of documents using evolutionary approaches incorporating clustering\",\"authors\":\"S. Deshpande, Monika Doke, Aishwarya Deshpande, Anagha Chaudhari\",\"doi\":\"10.1109/ICECA.2017.8212847\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Classification is a central problem in the fields of data mining and machine learning. Using a training set of labeled instances, the task is to build a model (classifier) that can be used to predict the class of new unlabelled instances. Data preparation is crucial to the data mining process, and its focus is to improve the fitness of the training data for the learning algorithms to produce more effective classifiers. Searching for the frequent pattern within a specific sequence has become a much needed task in various sectors. Feature selection is selecting a subset of optimal features. Feature selection is being used in high dimensional data reduction and it is being used in several applications like medical, image processing, text mining, etc. In the existing work, unsupervised feature selection methods using Artificial Bee Colony Optimization Algorithm, Bat Algorithm and Ant Colony Optimization have been introduced. We have compared these three algorithms and concluded that Bat Algorithm proves to be better in performance than the rest. The proposed system will use a novel method to select subset of features from unlabelled data using Bat algorithm with one of the clustering algorithm and develop an expert information retrieval system.\",\"PeriodicalId\":222768,\"journal\":{\"name\":\"2017 International conference of Electronics, Communication and Aerospace Technology (ICECA)\",\"volume\":\"31 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 International conference of Electronics, Communication and Aerospace Technology (ICECA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICECA.2017.8212847\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International conference of Electronics, Communication and Aerospace Technology (ICECA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICECA.2017.8212847","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Expert system for retrieval of documents using evolutionary approaches incorporating clustering
Classification is a central problem in the fields of data mining and machine learning. Using a training set of labeled instances, the task is to build a model (classifier) that can be used to predict the class of new unlabelled instances. Data preparation is crucial to the data mining process, and its focus is to improve the fitness of the training data for the learning algorithms to produce more effective classifiers. Searching for the frequent pattern within a specific sequence has become a much needed task in various sectors. Feature selection is selecting a subset of optimal features. Feature selection is being used in high dimensional data reduction and it is being used in several applications like medical, image processing, text mining, etc. In the existing work, unsupervised feature selection methods using Artificial Bee Colony Optimization Algorithm, Bat Algorithm and Ant Colony Optimization have been introduced. We have compared these three algorithms and concluded that Bat Algorithm proves to be better in performance than the rest. The proposed system will use a novel method to select subset of features from unlabelled data using Bat algorithm with one of the clustering algorithm and develop an expert information retrieval system.