Companion Proceedings of the The Web Conference 2018最新文献

筛选
英文 中文
The Shifting Landscape of Web Search and Mining: Past, Present, and Future 网络搜索和挖掘的变化:过去、现在和未来
Companion Proceedings of the The Web Conference 2018 Pub Date : 2018-04-23 DOI: 10.1145/3184558.3188749
Davood Rafiei, Eugene Agichtein, R. Baeza-Yates, J. Kleinberg, J. Leskovec
{"title":"The Shifting Landscape of Web Search and Mining: Past, Present, and Future","authors":"Davood Rafiei, Eugene Agichtein, R. Baeza-Yates, J. Kleinberg, J. Leskovec","doi":"10.1145/3184558.3188749","DOIUrl":"https://doi.org/10.1145/3184558.3188749","url":null,"abstract":"The Web's content has been going through major changes, triggered by multiple factors including changes in user demographic and authoring behaviour, a shift in device types that access the Web, and changes in common use cases of the Web. More specifically, the number of mobile internet users has surpassed the desktop users according to different statistics; a considerable portion of web use cases are in the form of social interactions rather than information seeking; and the authoring behaviour has transformed from compiling a page and linking resources to sharing content with like-minded followers and leaving likes and comments on posts. Those changes have influenced and are expected to shape the way the content is organized, searched, ranked and analyzed. This panel brings together researchers who have been working in different established areas related to web search and mining, web content and social network analysis, and semantics and knowledge management. The panel will draw from the experience of the panellists, dealing with changes in their respective fields. In the first (role-playing) round, each panellist will strongly take a side on where the changes are heading, arguing that one form of content will dominate in the near future. In the second round, the panellists will counter each other and will share their vision on what future holds in terms of research problems and directions. The members of the audience will participate, in a QA session with the panellists, bringing their own perspectives to the discussion.","PeriodicalId":235572,"journal":{"name":"Companion Proceedings of the The Web Conference 2018","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127922951","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
From Alt-Right to Alt-Rechts: Twitter Analysis of the 2017 German Federal Election 从另类右翼到另类右翼:2017年德国联邦选举的推特分析
Companion Proceedings of the The Web Conference 2018 Pub Date : 2018-04-23 DOI: 10.1145/3184558.3188733
Fred Morstatter, Yunqiu Shao, A. Galstyan, S. Karunasekera
{"title":"From Alt-Right to Alt-Rechts: Twitter Analysis of the 2017 German Federal Election","authors":"Fred Morstatter, Yunqiu Shao, A. Galstyan, S. Karunasekera","doi":"10.1145/3184558.3188733","DOIUrl":"https://doi.org/10.1145/3184558.3188733","url":null,"abstract":"In the 2017 German Federal elections, the \"Alternative for Deutschland'', or AfD, party was able to take control of many seats in German parliament. Their success was credited, in part, to their large online presence. Like other \"alt-right'' organizations worldwide, this party is tech savvy, generating a large social media footprint, especially on Twitter, which provides an ample opportunity to understand their online behavior. In this work we present an analysis of Twitter data related to the aforementioned election. We show how users self-organize into communities, and identify the themes that define those communities. Next we analyze the content generated by those communities, and the extent to which these communities interact. Despite these elections being held in Germany, we note a substantial impact from the English-speaking Twittersphere. Specifically, we note that many of these accounts appear to be from the American alt-right movement, and support the German alt-right movement.","PeriodicalId":235572,"journal":{"name":"Companion Proceedings of the The Web Conference 2018","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127929818","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 42
Handling Confounding for Realistic Off-Policy Evaluation 为现实的非政策评估处理混淆
Companion Proceedings of the The Web Conference 2018 Pub Date : 2018-04-23 DOI: 10.1145/3184558.3186915
Saurabh Sohoney, Nikita Prabhu, V. Chaoji
{"title":"Handling Confounding for Realistic Off-Policy Evaluation","authors":"Saurabh Sohoney, Nikita Prabhu, V. Chaoji","doi":"10.1145/3184558.3186915","DOIUrl":"https://doi.org/10.1145/3184558.3186915","url":null,"abstract":"Inverse Propensity Score estimator (IPS) is a basic, unbiased, off-policy evaluation technique to measure the impact of a user-interactive system without serving live traffic. We present our work on applying IPS to real-world settings by addressing some practical challenges, thereby enabling successful policy evaluation. In particular, we show that off-policy evaluation can be impossible in the absence of a complete context and we describe a systematic way of defining the context.","PeriodicalId":235572,"journal":{"name":"Companion Proceedings of the The Web Conference 2018","volume":"104 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132128027","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning Large Scale Ordinal Ranking Model via Divide-and-Conquer Technique 用分治法学习大规模有序排序模型
Companion Proceedings of the The Web Conference 2018 Pub Date : 2018-04-23 DOI: 10.1145/3184558.3191658
Lu Tang, Sougata Chaudhuri, A. Bagherjeiran, Lingzhi Zhou
{"title":"Learning Large Scale Ordinal Ranking Model via Divide-and-Conquer Technique","authors":"Lu Tang, Sougata Chaudhuri, A. Bagherjeiran, Lingzhi Zhou","doi":"10.1145/3184558.3191658","DOIUrl":"https://doi.org/10.1145/3184558.3191658","url":null,"abstract":"Structured prediction, where outcomes have a precedence order, lies at the heart of machine learning for information retrieval, movie recommendation, product review prediction, and digital advertising. Ordinal ranking, in particular, assumes that the structured response has a linear ranked order. Due to the extensive applicability of these models, substantial research has been devoted to understanding them, as well as developing efficient training techniques. One popular and widely cited technique of training ordinal ranking models is to exploit the linear precedence order and systematically reduce it to a binary classification problem. This facilitates the usage of readily available, powerful binary classifiers, but necessitates an expansion of the original training data, where the training data increases by $K-1$ times of its original size, with K being the number of ordinal classes. Due to prevalent nature of problems with large number of ordered classes, the reduction leads to datasets which are too large to train on single machines. While approximation methods like stochastic gradient descent are typically applied here, we investigate exact optimization solutions that can scale. In this paper, we present a divide-and-conquer (DC) algorithm, which divides large scale binary classification data into a cluster of machines and trains logistic models in parallel, and combines them at the end of the training phase to create a single binary classifier, which can then be used as an ordinal ranker. It requires no synchronization between the parallel learning algorithms during the training period, which makes training on large datasets feasible and efficient. We prove consistency and asymptotic normality property of the learned models using our proposed algorithm. We provide empirical evidence, on various ordinal datasets, of improved estimation and prediction performance of the model learnt using our algorithm, over several standard divide-and-conquer algorithms.","PeriodicalId":235572,"journal":{"name":"Companion Proceedings of the The Web Conference 2018","volume":"132 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132382495","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Selection Bias in News Coverage: Learning it, Fighting it 新闻报道中的选择偏见:了解它,对抗它
Companion Proceedings of the The Web Conference 2018 Pub Date : 2018-04-23 DOI: 10.1145/3184558.3188724
Dylan Bourgeois, Jérémie Rappaz, K. Aberer
{"title":"Selection Bias in News Coverage: Learning it, Fighting it","authors":"Dylan Bourgeois, Jérémie Rappaz, K. Aberer","doi":"10.1145/3184558.3188724","DOIUrl":"https://doi.org/10.1145/3184558.3188724","url":null,"abstract":"News entities must select and filter the coverage they broadcast through their respective channels since the set of world events is too large to be treated exhaustively. The subjective nature of this filtering induces biases due to, among other things, resource constraints, editorial guidelines, ideological affinities, or even the fragmented nature of the information at a journalist's disposal. The magnitude and direction of these biases are, however, widely unknown. The absence of ground truth, the sheer size of the event space, or the lack of an exhaustive set of absolute features to measure make it difficult to observe the bias directly, to characterize the leaning's nature and to factor it out to ensure a neutral coverage of the news. In this work, we introduce a methodology to capture the latent structure of media's decision process on a large scale. Our contribution is multi-fold. First, we show media coverage to be predictable using personalization techniques, and evaluate our approach on a large set of events collected from the GDELT database. We then show that a personalized and parametrized approach not only exhibits higher accuracy in coverage prediction, but also provides an interpretable representation of the selection bias. Last, we propose a method able to select a set of sources by leveraging the latent representation. These selected sources provide a more diverse and egalitarian coverage, all while retaining the most actively covered events.","PeriodicalId":235572,"journal":{"name":"Companion Proceedings of the The Web Conference 2018","volume":"449 1-2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132500103","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Travel Itinerary Recommendations with Must-see Points-of-Interest 旅游路线推荐,必看景点
Companion Proceedings of the The Web Conference 2018 Pub Date : 2018-04-23 DOI: 10.1145/3184558.3191558
Kendall Taylor, Kwan Hui Lim, Jeffrey Chan
{"title":"Travel Itinerary Recommendations with Must-see Points-of-Interest","authors":"Kendall Taylor, Kwan Hui Lim, Jeffrey Chan","doi":"10.1145/3184558.3191558","DOIUrl":"https://doi.org/10.1145/3184558.3191558","url":null,"abstract":"Travelling and touring are popular leisure activities enjoyed by millions of tourists around the world. However, the task of travel itinerary recommendation and planning is tedious and challenging for tourists, who are often unfamiliar with the various Points-of-Interest (POIs) in a city. Apart from identifying popular POIs, the tourist needs to construct a travel itinerary comprising a subset of these POIs, and to order these POIs as a sequence of visits that can be completed within his/her available touring time. For a more realistic itinerary, the tourist also has to account for travelling time between POIs and visiting times at individual POIs. Furthermore, this itinerary should incorporate tourist preferences such as desired starting and ending POIs (e.g., POIs that are near the tourist's hotel) and a subset of must-see POIs (e.g., popular POIs that a tourist must visit). We term this the TourMustSee problem, which is based on a variant of the Orienteering problem. Following which, we propose the LP+M algorithm for solving the TourMustSee problem as an Integer Linear Program (ILP). Using a Flickr dataset of POI visits in seven touristic cities, we compare LP+M against various ILP-based baselines, and the results show that LP+M recommends better travel itineraries in terms of POI popularity, total POIs visited, total touring time utilized and must-visit POI(s) inclusion.","PeriodicalId":235572,"journal":{"name":"Companion Proceedings of the The Web Conference 2018","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133932747","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 49
Smart Cities at Risk!: Privacy and Security Borderlines from Social Networking in Cities 智慧城市面临风险!:城市社交网络的隐私和安全边界
Companion Proceedings of the The Web Conference 2018 Pub Date : 2018-04-23 DOI: 10.1145/3184558.3191516
Vaia Moustaka, Zenonas Theodosiou, A. Vakali, A. Kounoudes
{"title":"Smart Cities at Risk!: Privacy and Security Borderlines from Social Networking in Cities","authors":"Vaia Moustaka, Zenonas Theodosiou, A. Vakali, A. Kounoudes","doi":"10.1145/3184558.3191516","DOIUrl":"https://doi.org/10.1145/3184558.3191516","url":null,"abstract":"As smart cities infrastructures mature, data becomes a valuable asset which can radically improve city services and tools. Registration, acquisition and utilization of data, which will be transformed into smart services, are becoming more necessary than ever. Online social networks with their enormous momentum are one of the main sources of urban data offering heterogeneous real-time data at a minimal cost. However, various types of attacks often appear on them, which risk users' privacy and affect their online trust. The purpose of this article is to investigate how risks on online social networks affect smart cities and study the differences between privacy and security threats with regard to smart people and smart living dimensions.","PeriodicalId":235572,"journal":{"name":"Companion Proceedings of the The Web Conference 2018","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131613427","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Machine Learning for the Peer Assessment Credibility 基于机器学习的同行评估可信度
Companion Proceedings of the The Web Conference 2018 Pub Date : 2018-04-23 DOI: 10.1145/3184558.3186957
Yingru Lin, S. Han, B. Kang
{"title":"Machine Learning for the Peer Assessment Credibility","authors":"Yingru Lin, S. Han, B. Kang","doi":"10.1145/3184558.3186957","DOIUrl":"https://doi.org/10.1145/3184558.3186957","url":null,"abstract":"The peer assessment approach is considered to be one of the best solutions for scaling both assessment and peer learning to global classrooms, such as MOOCs. However, some academic staff hesitate to use a peer assessment approach for their classes due to concerns about its credibility and reliability. The focus of our research is to detect the credibility level of each assessment performed by students during peer assessment. We found three major scopes in assessing the credibility level of evaluations, 1) Informativity, 2) Accuracy, and 3) Consistency. We collect assessments, including comments and grades provided by students during the peer assessment process and then each feedback-and-grade pair is labeled with its credibility level by Mechanical Turk evaluators. We extract relevant features from each labeled assessment and use them to build a classifier that attempts to automatically assess its level of credibility in C5.0 Decision Tree classifier. The evaluation results show that the model can be used to automatically classify peer assessments as credible or non-credible, with accuracy in the range of 88%.","PeriodicalId":235572,"journal":{"name":"Companion Proceedings of the The Web Conference 2018","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131614179","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
An Efficient Immunization Strategy Using Overlapping Nodes and Its Neighborhoods 利用重叠节点及其邻域的有效免疫策略
Companion Proceedings of the The Web Conference 2018 Pub Date : 2018-04-23 DOI: 10.1145/3184558.3191566
Manish Kumar, Anurag Singh, H. Cherifi
{"title":"An Efficient Immunization Strategy Using Overlapping Nodes and Its Neighborhoods","authors":"Manish Kumar, Anurag Singh, H. Cherifi","doi":"10.1145/3184558.3191566","DOIUrl":"https://doi.org/10.1145/3184558.3191566","url":null,"abstract":"When an epidemic occurs, it is often impossible to vaccinate the entire population due to limited amount of resources. Therefore, it is of prime interest to identify the set of influential spreaders to immunize, in order to minimize both the cost of vaccine resource and the disease spreading. While various strategies based on the network topology have been introduced, few works consider the influence of the community structure in the epidemic spreading process. Nowadays, it is clear that many real-world networks exhibit an overlapping community structure, in which nodes are allowed to belong to more than one community. Previous work shows that the numbers of communities to which a node belongs is a good measure of its epidemic influence. In this work, we address the effect of nodes in the neighborhood of the overlapping nodes on epidemics spreading. The proposed immunization strategy provides highly connected neighbors of overlapping nodes in the network to immunize. The whole process requires information only at the node level and is well suited to large-scale networks. Extensive experiments on four real-world networks of diverse nature have been performed. Comparisons with alternative local immunization strategies using the fraction of the Largest Connected Component (LCC) after immunization,show that the proposed method is much more efficient. Additionally, it compares favorably to global measures such as degree and betweenness centrality.","PeriodicalId":235572,"journal":{"name":"Companion Proceedings of the The Web Conference 2018","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132622287","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 29
The WWW (and an H) of Mobile Application Usage in the City: The What, Where, When, and How 城市中移动应用使用的WWW(和H):什么,在哪里,何时,如何
Companion Proceedings of the The Web Conference 2018 Pub Date : 2018-04-23 DOI: 10.1145/3184558.3191561
Eduardo Graells-Garrido, Diego Caro, Omar Miranda, R. Schifanella, Oscar F. Peredo
{"title":"The WWW (and an H) of Mobile Application Usage in the City: The What, Where, When, and How","authors":"Eduardo Graells-Garrido, Diego Caro, Omar Miranda, R. Schifanella, Oscar F. Peredo","doi":"10.1145/3184558.3191561","DOIUrl":"https://doi.org/10.1145/3184558.3191561","url":null,"abstract":"People fulfill their informational needs through smartphones, however, little is known regarding how the urban fabric and the activities that take place in it affect the usage of mobile applications. In this regard, starting from an anonymized dataset of Deep Packet Inspection (DPI) data from the largest telecommunications operator in Chile, we focus on the following questions: What are the most popular applications used in the city Where are they spatially clustered When does an application is more frequently used And How does the urban context and the mobility patterns relate to application usage As a result, we observed that specific applications present high spatial clustering, while the most popular services are geographically dispersed throughout the entire city. Clusters appear in places of high floating population; however, hotspots vary in space depending on the application. Interestingly, we found that commuting plays an important role, both in terms of rush hours and transportation infrastructure. We present a discussion on these results, focusing on how the physical space and the daily commuting routine affect the pattern of data consumption and represent an important aspect in mobile users behavioral studies.","PeriodicalId":235572,"journal":{"name":"Companion Proceedings of the The Web Conference 2018","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133664225","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信