{"title":"集体可以使用搜索帮助吗?","authors":"D. Dicheva, Christo Dichev","doi":"10.1109/CyberC.2011.14","DOIUrl":null,"url":null,"abstract":"In this paper we propose a \"find similar\" method intended to extend the searching capabilities of digital collections targeting educational and academic domains. Given a document, the described algorithm finds similar documents that may be of interest to the user. It exploits the metadata typical for the participatory web. In the adopted model, documents are viewed as objects associated with a set of tags and a set of users who have tagged them, inducing tag-based and user-based similarity. The similarity between two documents is computed as a combination of their tag-base and, user-based cosine similarity and the document recency. We have con-ducted a series of experiments using a CiteULike dump to investigate the properties of the proposed similarity measure. The experimental results indicate that the algorithm exploiting meta-information about the documents provides a good approximation of our understanding of the contextual dependency of the notion of similarity.","PeriodicalId":227472,"journal":{"name":"2011 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Can Collective Use Help for Searching?\",\"authors\":\"D. Dicheva, Christo Dichev\",\"doi\":\"10.1109/CyberC.2011.14\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper we propose a \\\"find similar\\\" method intended to extend the searching capabilities of digital collections targeting educational and academic domains. Given a document, the described algorithm finds similar documents that may be of interest to the user. It exploits the metadata typical for the participatory web. In the adopted model, documents are viewed as objects associated with a set of tags and a set of users who have tagged them, inducing tag-based and user-based similarity. The similarity between two documents is computed as a combination of their tag-base and, user-based cosine similarity and the document recency. We have con-ducted a series of experiments using a CiteULike dump to investigate the properties of the proposed similarity measure. The experimental results indicate that the algorithm exploiting meta-information about the documents provides a good approximation of our understanding of the contextual dependency of the notion of similarity.\",\"PeriodicalId\":227472,\"journal\":{\"name\":\"2011 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery\",\"volume\":\"8 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-10-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CyberC.2011.14\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CyberC.2011.14","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
In this paper we propose a "find similar" method intended to extend the searching capabilities of digital collections targeting educational and academic domains. Given a document, the described algorithm finds similar documents that may be of interest to the user. It exploits the metadata typical for the participatory web. In the adopted model, documents are viewed as objects associated with a set of tags and a set of users who have tagged them, inducing tag-based and user-based similarity. The similarity between two documents is computed as a combination of their tag-base and, user-based cosine similarity and the document recency. We have con-ducted a series of experiments using a CiteULike dump to investigate the properties of the proposed similarity measure. The experimental results indicate that the algorithm exploiting meta-information about the documents provides a good approximation of our understanding of the contextual dependency of the notion of similarity.