Mayank Sharma, Amit Srivastava, S. Shankar, S. Khatri
{"title":"切向空间上n维数据的动态聚类","authors":"Mayank Sharma, Amit Srivastava, S. Shankar, S. Khatri","doi":"10.1109/ICRITO.2018.8748444","DOIUrl":null,"url":null,"abstract":"Clustering of n-dimensional data into classes is consistent problem of research, Large number of efficient clustering techniques are in literature and still more are in development. K-means and Spherical K-means are standard clustering methods which are frequently used. Euclidean distance and cosine distance are mainly used by clustering methods. Data distribution is always non-linear and distributed in n-dimensional hyper sphere. Euclidean distance did not take care of topology of the hyper space. Clustering of data using spherical K-means clustering is done through mapping all data points in hyper sphere to the nearest cosine angular distance, but both do not take care of geodesic distance between the points on the surface of the hyper sphere. In this paper new mathematical dynamic clustering approach has been proposed which take care of topology of the data distribution between various clusters and geodesic distance between the points with in the cluster. Theoretical and mathematical results are discussed and empirically verified on the iris data set.","PeriodicalId":439047,"journal":{"name":"2018 7th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Dynamic Clustering of n-Dimensional Data on Tangential Space\",\"authors\":\"Mayank Sharma, Amit Srivastava, S. Shankar, S. Khatri\",\"doi\":\"10.1109/ICRITO.2018.8748444\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Clustering of n-dimensional data into classes is consistent problem of research, Large number of efficient clustering techniques are in literature and still more are in development. K-means and Spherical K-means are standard clustering methods which are frequently used. Euclidean distance and cosine distance are mainly used by clustering methods. Data distribution is always non-linear and distributed in n-dimensional hyper sphere. Euclidean distance did not take care of topology of the hyper space. Clustering of data using spherical K-means clustering is done through mapping all data points in hyper sphere to the nearest cosine angular distance, but both do not take care of geodesic distance between the points on the surface of the hyper sphere. In this paper new mathematical dynamic clustering approach has been proposed which take care of topology of the data distribution between various clusters and geodesic distance between the points with in the cluster. Theoretical and mathematical results are discussed and empirically verified on the iris data set.\",\"PeriodicalId\":439047,\"journal\":{\"name\":\"2018 7th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO)\",\"volume\":\"3 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 7th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICRITO.2018.8748444\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 7th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICRITO.2018.8748444","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Dynamic Clustering of n-Dimensional Data on Tangential Space
Clustering of n-dimensional data into classes is consistent problem of research, Large number of efficient clustering techniques are in literature and still more are in development. K-means and Spherical K-means are standard clustering methods which are frequently used. Euclidean distance and cosine distance are mainly used by clustering methods. Data distribution is always non-linear and distributed in n-dimensional hyper sphere. Euclidean distance did not take care of topology of the hyper space. Clustering of data using spherical K-means clustering is done through mapping all data points in hyper sphere to the nearest cosine angular distance, but both do not take care of geodesic distance between the points on the surface of the hyper sphere. In this paper new mathematical dynamic clustering approach has been proposed which take care of topology of the data distribution between various clusters and geodesic distance between the points with in the cluster. Theoretical and mathematical results are discussed and empirically verified on the iris data set.