{"title":"Grouping of users based on user navigation behaviour using supervised association rule tree mining","authors":"R. GeethaRamani, P. Revathy, L. Balasubramanian","doi":"10.1504/IJRIS.2018.10017514","DOIUrl":null,"url":null,"abstract":"In this internet world, an increased interest of users in search of World Wide Web results in wide relevance of web mining, an application of data mining. Clustering has been widely used for web usage mining. Finding initial cluster center and specifying the number of clusters are the major challenges, which are overcome in this work by grouping of users based on the target class value. The benchmark dataset MSNBC is collected for the entire day of September 28, 1999. Supervised association rule tree mining is used to find frequent itemset for the targeted class value and thus generating 'if then rules'. Users are automatically clustered based on the rules satisfying the ground truth, resulting in 36 clusters in two iterations. The results revealed that the renowned clustering algorithms such as K-means takes 22 iterations for forming 36 clusters, wherein the proposed work generates 36 clusters in two iterations.","PeriodicalId":360794,"journal":{"name":"Int. J. Reason. based Intell. Syst.","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Reason. based Intell. Syst.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1504/IJRIS.2018.10017514","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
In this internet world, an increased interest of users in search of World Wide Web results in wide relevance of web mining, an application of data mining. Clustering has been widely used for web usage mining. Finding initial cluster center and specifying the number of clusters are the major challenges, which are overcome in this work by grouping of users based on the target class value. The benchmark dataset MSNBC is collected for the entire day of September 28, 1999. Supervised association rule tree mining is used to find frequent itemset for the targeted class value and thus generating 'if then rules'. Users are automatically clustered based on the rules satisfying the ground truth, resulting in 36 clusters in two iterations. The results revealed that the renowned clustering algorithms such as K-means takes 22 iterations for forming 36 clusters, wherein the proposed work generates 36 clusters in two iterations.