{"title":"基于分割KMean的社交网络数据集模式识别方法研究","authors":"Shilpa V. Gajbhiye, Gaurav B. Malode","doi":"10.1109/I2C2.2017.8321776","DOIUrl":null,"url":null,"abstract":"Databases today can range in size more than terabytes. Within these masses of data lies hidden information of strategic importance. So when there are lots of trees, how to find conclusions about the forest? The newest answer is mining of data, which is being used to increase revenues. Data mining is a process that uses a variety of data analysis tools to discover patterns and relationships in data that may be used to make valid predictions. This research uses social networking data set for pattern recognition, because it is one of the emerging application areas in data mining. We used Facebook 100 dataset and applied Bisecting KMeans algorithm on it, so that we would get better clustering outputs. Bisecting KMeans first bisects the data into 2 parts and selects the part with greater number of elements, then apply clustering on it again. This goes on till we have N Number of clusters. We would apply this to our dataset to get desired results. With this we are going to compare Bisecting K Mean algorithm with other data mining algorithm. And finally we are going to find out different pattern from social networking dataset.","PeriodicalId":288351,"journal":{"name":"2017 International Conference on Intelligent Computing and Control (I2C2)","volume":"37 1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhancing pattern recognition in social networking dataset by using bisecting KMean\",\"authors\":\"Shilpa V. Gajbhiye, Gaurav B. Malode\",\"doi\":\"10.1109/I2C2.2017.8321776\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Databases today can range in size more than terabytes. Within these masses of data lies hidden information of strategic importance. So when there are lots of trees, how to find conclusions about the forest? The newest answer is mining of data, which is being used to increase revenues. Data mining is a process that uses a variety of data analysis tools to discover patterns and relationships in data that may be used to make valid predictions. This research uses social networking data set for pattern recognition, because it is one of the emerging application areas in data mining. We used Facebook 100 dataset and applied Bisecting KMeans algorithm on it, so that we would get better clustering outputs. Bisecting KMeans first bisects the data into 2 parts and selects the part with greater number of elements, then apply clustering on it again. This goes on till we have N Number of clusters. We would apply this to our dataset to get desired results. With this we are going to compare Bisecting K Mean algorithm with other data mining algorithm. And finally we are going to find out different pattern from social networking dataset.\",\"PeriodicalId\":288351,\"journal\":{\"name\":\"2017 International Conference on Intelligent Computing and Control (I2C2)\",\"volume\":\"37 1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-06-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 International Conference on Intelligent Computing and Control (I2C2)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/I2C2.2017.8321776\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Intelligent Computing and Control (I2C2)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/I2C2.2017.8321776","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Enhancing pattern recognition in social networking dataset by using bisecting KMean
Databases today can range in size more than terabytes. Within these masses of data lies hidden information of strategic importance. So when there are lots of trees, how to find conclusions about the forest? The newest answer is mining of data, which is being used to increase revenues. Data mining is a process that uses a variety of data analysis tools to discover patterns and relationships in data that may be used to make valid predictions. This research uses social networking data set for pattern recognition, because it is one of the emerging application areas in data mining. We used Facebook 100 dataset and applied Bisecting KMeans algorithm on it, so that we would get better clustering outputs. Bisecting KMeans first bisects the data into 2 parts and selects the part with greater number of elements, then apply clustering on it again. This goes on till we have N Number of clusters. We would apply this to our dataset to get desired results. With this we are going to compare Bisecting K Mean algorithm with other data mining algorithm. And finally we are going to find out different pattern from social networking dataset.