2014 IEEE 30th International Conference on Data Engineering Workshops最新文献_第3页

Reconciling malware labeling discrepancy via consensus learning 通过共识学习协调恶意软件标签差异

2014 IEEE 30th International Conference on Data Engineering Workshops Pub Date : 2014-05-19 DOI: 10.1109/ICDEW.2014.6818308

Ting Wang, Xin Hu, S. Meng, R. Sailer

引用次数: 2

Balloon Fusion: SPARQL rewriting based on unified co-reference information 气球融合:基于统一的共同引用信息的SPARQL重写

2014 IEEE 30th International Conference on Data Engineering Workshops Pub Date : 2014-05-19 DOI: 10.1109/ICDEW.2014.6818335

K. Schlegel, F. Stegmaier, Sebastian P. Bayerl, M. Granitzer, H. Kosch

引用次数: 19

BIIIG: Enabling business intelligence with integrated instance graphs BIIIG:通过集成的实例图实现商业智能

2014 IEEE 30th International Conference on Data Engineering Workshops Pub Date : 2014-05-19 DOI: 10.1109/ICDEW.2014.6818294

André Petermann, Martin Junghanns, R. Müller, E. Rahm

引用次数: 27

2014 IEEE 30th International Conference on Data Engineering Workshops Pub Date : 2014-05-19 DOI: 10.1109/ICDEW.2014.6818350

Zhenxing Xu

引用次数: 3

Bootstrapping Wikipedia to answer ambiguous person name queries 引导维基百科来回答模棱两可的人名查询

2014 IEEE 30th International Conference on Data Engineering Workshops Pub Date : 2014-05-19 DOI: 10.1109/ICDEW.2014.6818303

Toni Grütze, G. Kasneci, Zhe Zuo, Felix Naumann

{"title":"Bootstrapping Wikipedia to answer ambiguous person name queries","authors":"Toni Grütze, G. Kasneci, Zhe Zuo, Felix Naumann","doi":"10.1109/ICDEW.2014.6818303","DOIUrl":"https://doi.org/10.1109/ICDEW.2014.6818303","url":null,"abstract":"Some of the main ranking features of today's search engines reflect result popularity and are based on ranking models, such as PageRank, implicit feedback aggregation, and more. While such features yield satisfactory results for a wide range of queries, they aggravate the problem of search for ambiguous entities: Searching for a person yields satisfactory results only if the person in question is represented by a high-ranked Web page and all required information are contained in this page. Otherwise, the user has to either reformulate/refine the query or manually inspect low-ranked results to find the person in question. A possible approach to solve this problem is to cluster the results, so that each cluster represents one of the persons occurring in the answer set. However clustering search results has proven to be a difficult endeavor by itself, where the clusters are typically of moderate quality. A wealth of useful information about persons occurs in Web 2.0 platforms, such as Wikipedia, LinkedIn, Facebook, etc. Being human-generated, the information on these platforms is clean, focused, and already disambiguated. We show that when searching with ambiguous person names the information from Wikipedia can be bootstrapped to group the results according to the individuals occurring in them. We have evaluated our methods on a hand-labeled dataset of around 5,000 Web pages retrieved from Google queries on 50 ambiguous person names.","PeriodicalId":302600,"journal":{"name":"2014 IEEE 30th International Conference on Data Engineering Workshops","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133989306","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 8

Execution and optimization of continuous windowed aggregation queries 连续窗口聚合查询的执行和优化

2014 IEEE 30th International Conference on Data Engineering Workshops Pub Date : 2014-05-19 DOI: 10.1109/ICDEW.2014.6818345

Harold Lim, S. Babu

引用次数: 1

Analysis and detection of low quality information in social networks 社交网络中低质量信息的分析与检测

2014 IEEE 30th International Conference on Data Engineering Workshops Pub Date : 2014-05-19 DOI: 10.1109/ICDEW.2014.6818354

De Wang

{"title":"Analysis and detection of low quality information in social networks","authors":"De Wang","doi":"10.1109/ICDEW.2014.6818354","DOIUrl":"https://doi.org/10.1109/ICDEW.2014.6818354","url":null,"abstract":"With social networks like Facebook, Twitter and Google+ attracting audiences of millions of users, they have been an important communication platform in daily life. This in turn attracts malicious users to the social networks as well, causing an increase in the incidence of low quality information. Low quality information such as spam and rumors is a nuisance to people and hinders them from consuming information that is pertinent to them or that they are looking for. Although individual social networks are capable of filtering a significant amount of low quality information they receive, they usually require large amounts of resources (e.g, personnel) and incur a delay before detecting new types of low quality information. Also the evolution of various low quality information posts lots of challenges to defensive techniques. My PhD thesis work focuses on the analysis and detection of low quality information in social networks. We introduce social spam analytics and detection framework SPADE across multiple social networks showing the efficiency and flexibility of cross-domain classification and associative classification. For evolutionary study of low quality information, we present the results on large-scale study on Web spam and email spam over a long period of time. Furthermore, we provide activity-based detection approaches to filter out low quality information in social networks: click traffic analysis of short URL spam, behavior analysis of URL spam and information diffusion analysis of rumor. Our framework and detection techniques show promising results in analyzing and detecting low quality information in social networks.","PeriodicalId":302600,"journal":{"name":"2014 IEEE 30th International Conference on Data Engineering Workshops","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133367849","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 28

A hashtags dictionary from crowdsourced definitions 一个来自众包定义的标签词典

2014 IEEE 30th International Conference on Data Engineering Workshops Pub Date : 2014-05-19 DOI: 10.1109/ICDEW.2014.6818300

Mérième Ghenname, Julien Subercaze, C. Gravier, F. Laforest, Mounia Abik, R. Ajhoun

引用次数: 2

Semantic management of Enterprise Integration Patterns: A use case in Smart Grids 企业集成模式的语义管理:智能电网中的一个用例

2014 IEEE 30th International Conference on Data Engineering Workshops Pub Date : 2014-05-19 DOI: 10.1109/ICDEW.2014.6818302

O. Patri, A. Panangadan, V. Sorathia, V. Prasanna

引用次数: 6

Neighbor-base similarity matching for graphs 图的基于邻居的相似度匹配

2014 IEEE 30th International Conference on Data Engineering Workshops Pub Date : 2014-05-19 DOI: 10.1109/ICDEW.2014.6818326

Hang Zhang, Hongzhi Wang, Jianzhong Li, Hong Gao

引用次数: 0