基于关键路径的Web日志数据挖掘

Proceedings. International Conference on Machine Learning and Cybernetics Pub Date : 2002-11-04 DOI:10.1109/ICMLC.2002.1176728

Aibo Song, Zuo-Peng Liang, Mao-Xian Zhao, Yi-sheng Dong

{"title":"基于关键路径的Web日志数据挖掘","authors":"Aibo Song, Zuo-Peng Liang, Mao-Xian Zhao, Yi-sheng Dong","doi":"10.1109/ICMLC.2002.1176728","DOIUrl":null,"url":null,"abstract":"A Web log mining method is presented. First, minimal key path set (MKPS) is defined and an algorithm to find the MKPS online is given. At the same time, for any key path in the MPKS, this algorithm can find out all transactions relevant to it. After scanning the transaction database only once, a relevant matrix is set up, where the key paths in MKPS are taken as columns and the transactions are taken as rows. Compared to previous methods, our method considers the three major features of users' accessing the Web: ordinal, contiguous, and duplicate. Moreover, for clustering transactions, we have lesser dimensions than the previous method. Using the clustering algorithm based on the relevant matrix, better clustering results will be obtained more precisely and quickly. Experiments show the effectiveness of the method.","PeriodicalId":90702,"journal":{"name":"Proceedings. International Conference on Machine Learning and Cybernetics","volume":"34 1","pages":"150-155 vol.1"},"PeriodicalIF":0.0000,"publicationDate":"2002-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Mining Web log data based on key path\",\"authors\":\"Aibo Song, Zuo-Peng Liang, Mao-Xian Zhao, Yi-sheng Dong\",\"doi\":\"10.1109/ICMLC.2002.1176728\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A Web log mining method is presented. First, minimal key path set (MKPS) is defined and an algorithm to find the MKPS online is given. At the same time, for any key path in the MPKS, this algorithm can find out all transactions relevant to it. After scanning the transaction database only once, a relevant matrix is set up, where the key paths in MKPS are taken as columns and the transactions are taken as rows. Compared to previous methods, our method considers the three major features of users' accessing the Web: ordinal, contiguous, and duplicate. Moreover, for clustering transactions, we have lesser dimensions than the previous method. Using the clustering algorithm based on the relevant matrix, better clustering results will be obtained more precisely and quickly. Experiments show the effectiveness of the method.\",\"PeriodicalId\":90702,\"journal\":{\"name\":\"Proceedings. International Conference on Machine Learning and Cybernetics\",\"volume\":\"34 1\",\"pages\":\"150-155 vol.1\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2002-11-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings. International Conference on Machine Learning and Cybernetics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMLC.2002.1176728\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. International Conference on Machine Learning and Cybernetics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLC.2002.1176728","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

提出了一种Web日志挖掘方法。首先，定义了最小密钥路径集(MKPS)，并给出了一种在线查找最小密钥路径集的算法。同时，对于MPKS中的任意密钥路径，该算法都能找出与之相关的所有事务。在扫描事务数据库一次之后，将建立一个相关的矩阵，其中MKPS中的关键路径作为列，事务作为行。与以前的方法相比，我们的方法考虑了用户访问Web的三个主要特征:顺序、连续和重复。此外，对于聚类事务，我们的维数比以前的方法少。采用基于相关矩阵的聚类算法，可以更准确、更快速地获得更好的聚类结果。实验证明了该方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Mining Web log data based on key path

A Web log mining method is presented. First, minimal key path set (MKPS) is defined and an algorithm to find the MKPS online is given. At the same time, for any key path in the MPKS, this algorithm can find out all transactions relevant to it. After scanning the transaction database only once, a relevant matrix is set up, where the key paths in MKPS are taken as columns and the transactions are taken as rows. Compared to previous methods, our method considers the three major features of users' accessing the Web: ordinal, contiguous, and duplicate. Moreover, for clustering transactions, we have lesser dimensions than the previous method. Using the clustering algorithm based on the relevant matrix, better clustering results will be obtained more precisely and quickly. Experiments show the effectiveness of the method.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings. International Conference on Machine Learning and Cybernetics

自引率

0.00%

发文量