{"title":"建模和计算Web社区的随机方法","authors":"G. Greco, S. Greco, E. Zumpano","doi":"10.1109/WISE.2002.1181642","DOIUrl":null,"url":null,"abstract":"In the last few years, a lot of research has been devoted to developing new techniques for improving the recall and precision of current Web search engines. Few works deal with the interesting problem of identifying the communities to which pages belong. Most previous approaches tried to cluster data by means of spectral techniques or traditional hierarchical algorithms. The main problem with these techniques is that they ignore the fact that Web communities are social networks with distinctive statistical properties. We analyze Web communities on the basis of the evolution of an initial set of hubs and authoritative pages. The evolution law captures the behaviour of page authors with respect to the popularity of existing pages for topics of interest. Assuming such a model, we have found interesting properties of Web communities and have proposed a technique for computing relevant properties for specific topics. Several experiments have confirmed the validity of both the model and the identification method.","PeriodicalId":392999,"journal":{"name":"Proceedings of the Third International Conference on Web Information Systems Engineering, 2002. WISE 2002.","volume":"42 3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2002-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"A stochastic approach for modeling and computing Web communities\",\"authors\":\"G. Greco, S. Greco, E. Zumpano\",\"doi\":\"10.1109/WISE.2002.1181642\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the last few years, a lot of research has been devoted to developing new techniques for improving the recall and precision of current Web search engines. Few works deal with the interesting problem of identifying the communities to which pages belong. Most previous approaches tried to cluster data by means of spectral techniques or traditional hierarchical algorithms. The main problem with these techniques is that they ignore the fact that Web communities are social networks with distinctive statistical properties. We analyze Web communities on the basis of the evolution of an initial set of hubs and authoritative pages. The evolution law captures the behaviour of page authors with respect to the popularity of existing pages for topics of interest. Assuming such a model, we have found interesting properties of Web communities and have proposed a technique for computing relevant properties for specific topics. Several experiments have confirmed the validity of both the model and the identification method.\",\"PeriodicalId\":392999,\"journal\":{\"name\":\"Proceedings of the Third International Conference on Web Information Systems Engineering, 2002. WISE 2002.\",\"volume\":\"42 3 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2002-12-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the Third International Conference on Web Information Systems Engineering, 2002. WISE 2002.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WISE.2002.1181642\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Third International Conference on Web Information Systems Engineering, 2002. WISE 2002.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WISE.2002.1181642","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A stochastic approach for modeling and computing Web communities
In the last few years, a lot of research has been devoted to developing new techniques for improving the recall and precision of current Web search engines. Few works deal with the interesting problem of identifying the communities to which pages belong. Most previous approaches tried to cluster data by means of spectral techniques or traditional hierarchical algorithms. The main problem with these techniques is that they ignore the fact that Web communities are social networks with distinctive statistical properties. We analyze Web communities on the basis of the evolution of an initial set of hubs and authoritative pages. The evolution law captures the behaviour of page authors with respect to the popularity of existing pages for topics of interest. Assuming such a model, we have found interesting properties of Web communities and have proposed a technique for computing relevant properties for specific topics. Several experiments have confirmed the validity of both the model and the identification method.