SIGKDD explorations : newsletter of the Special Interest Group (SIG) on Knowledge Discovery & Data Mining最新文献_第4页

Current and Future Challenges in Mining Large Networks: Report on the Second SDM Workshop on Mining Networks and Graphs 挖掘大型网络的当前和未来挑战:第二届SDM挖矿网络和图研讨会报告

SIGKDD explorations : newsletter of the Special Interest Group (SIG) on Knowledge Discovery & Data Mining Pub Date : 2016-08-01 DOI: 10.1145/2980765.2980770

L. Holder, R. Caceres, D. Gleich, E. J. Riedy, Maleq Khan, N. Chawla, Ravi Kumar, Yinghui Wu, Christine Klymko, Tina Eliassi-Rad, B. Prakash

引用次数: 6

The Internet of Things: Opportunities and Challenges for Distributed Data Analysis 物联网:分布式数据分析的机遇与挑战

SIGKDD explorations : newsletter of the Special Interest Group (SIG) on Knowledge Discovery & Data Mining Pub Date : 2016-08-01 DOI: 10.1145/2980765.2980768

Marco Stolpe

{"title":"The Internet of Things: Opportunities and Challenges for Distributed Data Analysis","authors":"Marco Stolpe","doi":"10.1145/2980765.2980768","DOIUrl":"https://doi.org/10.1145/2980765.2980768","url":null,"abstract":"Nowadays, data is created by humans as well as automatically collected by physical things, which embed electronics, software, sensors and network connectivity. Together, these entities constitute the Internet of Things (IoT). The automated analysis of its data can provide insights into previously unknown relationships between things, their environment and their users, facilitating an optimization of their behavior. Especially the real-time analysis of data, embedded into physical systems, can enable new forms of autonomous control. These in turn may lead to more sustainable applications, reducing waste and saving resources IoT's distributed and dynamic nature, resource constraints of sensors and embedded devices as well as the amounts of generated data are challenging even the most advanced automated data analysis methods known today. In particular, the IoT requires a new generation of distributed analysis methods. Many existing surveys have strongly focused on the centralization of data in the cloud and big data analysis, which follows the paradigm of parallel high-performance computing. However, bandwidth and energy can be too limited for the transmission of raw data, or it is prohibited due to privacy constraints. Such communication-constrained scenarios require decentralized analysis algorithms which at least partly work directly on the generating devices. After listing data-driven IoT applications, in contrast to existing surveys, we highlight the differences between cloudbased and decentralized analysis from an algorithmic perspective. We present the opportunities and challenges of research on communication-efficient decentralized analysis algorithms. Here, the focus is on the difficult scenario of vertically partitioned data, which covers common IoT use cases. The comprehensive bibliography aims at providing readers with a good starting point for their own work","PeriodicalId":90050,"journal":{"name":"SIGKDD explorations : newsletter of the Special Interest Group (SIG) on Knowledge Discovery & Data Mining","volume":"12 1","pages":"15-34"},"PeriodicalIF":0.0,"publicationDate":"2016-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72719008","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 94

MultiClust 2013: Multiple Clusterings, Multiview Data, and Multisource Knowledgedriven Clustering: [Workshop Report] MultiClust 2013:多聚类、多视图数据和多源知识驱动聚类:[研讨会报告]

SIGKDD explorations : newsletter of the Special Interest Group (SIG) on Knowledge Discovery & Data Mining Pub Date : 2016-08-01 DOI: 10.1145/2980765.2980769

I. Assent, C. Domeniconi, Francesco Gullo, Andrea Tagarelli, A. Zimek

引用次数: 0

An Interactive Data Repository with Visual Analytics 具有可视化分析的交互式数据存储库

SIGKDD explorations : newsletter of the Special Interest Group (SIG) on Knowledge Discovery & Data Mining Pub Date : 2016-02-25 DOI: 10.1145/2897350.2897355

Ryan A. Rossi, Nesreen Ahmed

引用次数: 64

Web Content Extraction Web内容提取

SIGKDD explorations : newsletter of the Special Interest Group (SIG) on Knowledge Discovery & Data Mining Pub Date : 2016-02-25 DOI: 10.1007/springerreference_66087

WeningerTim, PalaciosRodrigo, CrescenziValter, GottronThomas, MerialdoPaolo

引用次数: 0

Shedding Light on the Performance of Solar Panels: A Data-Driven View 揭示太阳能电池板的性能:一个数据驱动的观点

SIGKDD explorations : newsletter of the Special Interest Group (SIG) on Knowledge Discovery & Data Mining Pub Date : 2016-02-25 DOI: 10.1145/2897350.2897354

S. A. Chen, A. Vishwanath, Saket K. Sathe, S. Kalyanaraman

引用次数: 5

Question Quality in Community Question Answering Forums: a survey 社区问答论坛问题质量调查

SIGKDD explorations : newsletter of the Special Interest Group (SIG) on Knowledge Discovery & Data Mining Pub Date : 2015-09-29 DOI: 10.1145/2830544.2830547

Antoaneta Baltadzhieva, Grzegorz Chrupała

引用次数: 54

Theoretical Foundations and Algorithms for Outlier Ensembles 离群值集成的理论基础和算法

SIGKDD explorations : newsletter of the Special Interest Group (SIG) on Knowledge Discovery & Data Mining Pub Date : 2015-09-29 DOI: 10.1145/2830544.2830549

C. Aggarwal, Saket K. Sathe

引用次数: 212

A Framework for Collocation Error Correction in Web Pages and Text Documents 网页与文本文档搭配纠错框架

SIGKDD explorations : newsletter of the Special Interest Group (SIG) on Knowledge Discovery & Data Mining Pub Date : 2015-09-29 DOI: 10.1145/2830544.2830548

Alan Varghese, A. Varde, Jing Peng, Eileen Fitzpatrick

{"title":"A Framework for Collocation Error Correction in Web Pages and Text Documents","authors":"Alan Varghese, A. Varde, Jing Peng, Eileen Fitzpatrick","doi":"10.1145/2830544.2830548","DOIUrl":"https://doi.org/10.1145/2830544.2830548","url":null,"abstract":"Much of the English in text documents today comes from nonnative speakers. Web searches are also conducted very often by non-native speakers. Though highly qualified in their respective fields, these speakers could potentially make errors in collocation, e.g., \"dark money\" and \"stock agora\" (instead of the more appropriate English expressions \"black money\" and \"stock market\" respectively). These may arise due to literal translation from the respective speaker's native language or other factors. Such errors could cause problems in contexts such as querying over Web pages, correct understanding of text documents and more. This paper proposes a framework called CollOrder to detect such collocation errors and suggest correctly ordered collocated responses for improving the semantics. This framework integrates machine learning approaches with natural language processing techniques, proposing suitable heuristics to provide responses to collocation errors, ranked in the order of correctness. We discuss the proposed framework with algorithms and experimental evaluation in this paper. We claim that it would be useful in semantically enhancing Web querying e.g., financial news, online shopping etc. It would also help in providing automated error correction in machine translated documents and offering assistance to people using ESL tools.","PeriodicalId":90050,"journal":{"name":"SIGKDD explorations : newsletter of the Special Interest Group (SIG) on Knowledge Discovery & Data Mining","volume":"67 1","pages":"14-23"},"PeriodicalIF":0.0,"publicationDate":"2015-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79844414","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 11

Load-Balancing the Distance Computations in Record Linkage 负载均衡记录联动中的距离计算

SIGKDD explorations : newsletter of the Special Interest Group (SIG) on Knowledge Discovery & Data Mining Pub Date : 2015-09-29 DOI: 10.1145/2830544.2830546

Dimitrios Karapiperis, Vassilios S. Verykios

引用次数: 14