SIGKDD explorations : newsletter of the Special Interest Group (SIG) on Knowledge Discovery & Data Mining最新文献_第5页

Web Content Extraction: a MetaAnalysis of its Past and Thoughts on its Future 网络内容抽取:过去的元分析与未来的思考

SIGKDD explorations : newsletter of the Special Interest Group (SIG) on Knowledge Discovery & Data Mining Pub Date : 2015-08-17 DOI: 10.1145/2897350.2897353

Tim Weninger, Rodrigo Palácios, Valter Crescenzi, Thomas Gottron, P. Merialdo

引用次数: 18

New Research Directions in Knowledge Discovery and Allied Spheres 知识发现及其相关领域的新研究方向

SIGKDD explorations : newsletter of the Special Interest Group (SIG) on Knowledge Discovery & Data Mining Pub Date : 2015-05-21 DOI: 10.1145/2783702.2783708

A. Nica, Fabian M. Suchanek, A. Varde

引用次数: 4

A Social Formalism and Survey for Recommender Systems 社会形式主义与推荐制度研究

SIGKDD explorations : newsletter of the Special Interest Group (SIG) on Knowledge Discovery & Data Mining Pub Date : 2015-05-21 DOI: 10.1145/2783702.2783705

D. F. Bernardes, M. Diaby, Raphaël Fournier-S’niehotta, F. Fogelman-Soulié, E. Viennet

引用次数: 41

The Data Problem in Data Mining 数据挖掘中的数据问题

SIGKDD explorations : newsletter of the Special Interest Group (SIG) on Knowledge Discovery & Data Mining Pub Date : 2015-05-21 DOI: 10.1145/2783702.2783706

Albrecht Zimmermann

引用次数: 14

Patent Mining: A Survey 专利挖掘:综述

SIGKDD explorations : newsletter of the Special Interest Group (SIG) on Knowledge Discovery & Data Mining Pub Date : 2015-05-21 DOI: 10.1145/2783702.2783704

Longhui Zhang, Lei Li, Tao Li

{"title":"Patent Mining: A Survey","authors":"Longhui Zhang, Lei Li, Tao Li","doi":"10.1145/2783702.2783704","DOIUrl":"https://doi.org/10.1145/2783702.2783704","url":null,"abstract":"Patent documents are important intellectual resources of protecting interests of individuals, organizations and companies. Different from general web documents, patent documents have a well-defined format including frontpage, description, nclaims, and figures. However, they are lengthy and rich in technical terms, which requires enormous human efforts for analysis. Hence, a new research area, called patent mining, emerges in recent years, aiming to assist patent analysts in investigating, processing, and analyzing patent documents. Despite the recent advances in patent mining, it is still far from being well explored in research communities. To help patent analysts and interested readers obtain a big picture of patent mining, we thus provide a systematic summary of existing research efforts along this direction. In this survey, we first present an overview of the technical trend in patent mining. We then investigate multiple research questions related to patent documents, including patent retrieval, patent classification, and patent visualization, and provide summaries and highlights for each question by delving into the corresponding research efforts.","PeriodicalId":90050,"journal":{"name":"SIGKDD explorations : newsletter of the Special Interest Group (SIG) on Knowledge Discovery & Data Mining","volume":"77 1","pages":"1-19"},"PeriodicalIF":0.0,"publicationDate":"2015-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76162868","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 74

A Survey on Truth Discovery 真理发现调查

SIGKDD explorations : newsletter of the Special Interest Group (SIG) on Knowledge Discovery & Data Mining Pub Date : 2015-05-11 DOI: 10.1145/2897350.2897352

Yaliang Li, Jing Gao, Chuishi Meng, Qi Li, Lu Su, Bo Zhao, Wei Fan, Jiawei Han

引用次数: 382

Références bibliographiques 参考书目

SIGKDD explorations : newsletter of the Special Interest Group (SIG) on Knowledge Discovery & Data Mining Pub Date : 2015-03-04 DOI: 10.3917/dunod.porno.2015.02.0295

H. Pornon

引用次数: 0

Twitter analytics: a big data management perspective Twitter分析:大数据管理视角

SIGKDD explorations : newsletter of the Special Interest Group (SIG) on Knowledge Discovery & Data Mining Pub Date : 2014-09-25 DOI: 10.1145/2674026.2674029

Oshini Goonetilleke, T. Sellis, Xiuzhen Zhang, Saket K. Sathe

引用次数: 46

On power law distributions in large-scale taxonomies 关于大规模分类法中的幂律分布

SIGKDD explorations : newsletter of the Special Interest Group (SIG) on Knowledge Discovery & Data Mining Pub Date : 2014-09-25 DOI: 10.1145/2674026.2674033

Rohit Babbar, Cornelia Metzig, Ioannis Partalas, Éric Gaussier, Massih-Reza Amini

{"title":"On power law distributions in large-scale taxonomies","authors":"Rohit Babbar, Cornelia Metzig, Ioannis Partalas, Éric Gaussier, Massih-Reza Amini","doi":"10.1145/2674026.2674033","DOIUrl":"https://doi.org/10.1145/2674026.2674033","url":null,"abstract":"In many of the large-scale physical and social complex systems phenomena fat-tailed distributions occur, for which different generating mechanisms have been proposed. In this paper, we study models of generating power law distributions in the evolution of large-scale taxonomies such as Open Directory Project, which consist of websites assigned to one of tens of thousands of categories. The categories in such taxonomies are arranged in tree or DAG structured configurations having parent-child relations among them. We first quantitatively analyse the formation process of such taxonomies, which leads to power law distribution as the stationary distributions. In the context of designing classifiers for large-scale taxonomies, which automatically assign unseen documents to leaf-level categories, we highlight how the fat-tailed nature of these distributions can be leveraged to analytically study the space complexity of such classifiers. Empirical evaluation of the space complexity on publicly available datasets demonstrates the applicability of our approach.","PeriodicalId":90050,"journal":{"name":"SIGKDD explorations : newsletter of the Special Interest Group (SIG) on Knowledge Discovery & Data Mining","volume":"30 1","pages":"47-56"},"PeriodicalIF":0.0,"publicationDate":"2014-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81099743","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 20

Change detection in streaming data in the era of big data: models and issues 大数据时代流数据的变化检测:模型与问题

SIGKDD explorations : newsletter of the Special Interest Group (SIG) on Knowledge Discovery & Data Mining Pub Date : 2014-09-25 DOI: 10.1145/2674026.2674031

Dang-Hoan Tran, Mohamed Medhat Gaber, K. Sattler

引用次数: 27