{"title":"Automatic Generation of Event Timelines from Social Data","authors":"Omar Alonso, S. Tremblay, Fernando Diaz","doi":"10.1145/3091478.3091519","DOIUrl":"https://doi.org/10.1145/3091478.3091519","url":null,"abstract":"Over the past few years, social media has seen phenomenal growth and has become a very important source for getting real time updates from different parts of the world. While the notion of a trend usually reflects current events, the amount of information accumulated over a period of time can be used to provide another perspective for such events in the form of a timeline. In this paper, we present a technique that uses social information as relevance surrogates to generate an informative timeline. A core component is a variation of pseudo relevance feedback that is automatically generated using social data without external evidence. Finally, we describe the implementation of such technique and present evaluation results using a real-world data set.","PeriodicalId":165747,"journal":{"name":"Proceedings of the 2017 ACM on Web Science Conference","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128335894","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
S. Soundarajan, Tina Eliassi-Rad, Brian Gallagher, Ali Pinar
{"title":"ε - WGX: Adaptive Edge Probing for Enhancing Incomplete Networks","authors":"S. Soundarajan, Tina Eliassi-Rad, Brian Gallagher, Ali Pinar","doi":"10.1145/3091478.3091492","DOIUrl":"https://doi.org/10.1145/3091478.3091492","url":null,"abstract":"No matter how meticulously constructed, network datasets are often partially observed and incomplete. For example, most of the publicly available data from online social networking services (such as Facebook and Twitter) are collected via apps, users who make their accounts public, and/or the resources available to the researcher/practitioner. Such incompleteness can lead to inaccurate findings. We introduce the Adaptive Edge Probing problem. Suppose that one has observed a networked phenomenon via some form of sampling and has a budget to enhance the incomplete network by asking for additional information about specific nodes, with the ultimate goal of obtaining the most valuable information about the network as a whole. Which nodes should be further explored? We present ε-WGX, a network-based explore-exploit algorithm for identifying which nodes in the incomplete network to probe. Aggregated over multiple datasets and a wide range of probing budgets, we find that ε-WGX outperforms other explore-exploit strategies and baseline probing strategies. For example, for the task of adding as many nodes as possible, over incomplete networks observed via four popular sampling methods, ε-WGX outperforms the best comparison strategy by 12%-23% on average.","PeriodicalId":165747,"journal":{"name":"Proceedings of the 2017 ACM on Web Science Conference","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126102981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"ASSIST: Automatic Summarization of Significant Structural Changes in Large Temporal Graphs","authors":"C. Chelmis, R. Dani","doi":"10.1145/3091478.3091518","DOIUrl":"https://doi.org/10.1145/3091478.3091518","url":null,"abstract":"Detecting outliers and anomalies in data is vital in numerous applications in areas such as security, finance, health care, and online social media. Such dynamic systems can be modeled as graphs that change over time. Even though considerable work has been performed on finding points in time at which a network notably differs from its past, little work has been done on characterizing or explaining such changes. However, in the era of big data where networked data are getting bigger and bigger, being able to summarize such changes is key for sensemaking and root cause analysis. To address this gap, we present a novel approach to summarize significant structural changes in large temporal graphs. Specifically, we propose an efficient approach to help the user understand sharp changes in the structure of the network by presenting to her only a summary of key subgraphs that contribute most to the change. Extensive evaluation on real-world datasets with ground truth demonstrates both quantitatively and qualitatively the ability of our approach to accurately detect changes and discover \"important\" structures to succinctly describe the change.","PeriodicalId":165747,"journal":{"name":"Proceedings of the 2017 ACM on Web Science Conference","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132537227","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
N. Alrajebah, L. Carr, Markus Luczak-Rösch, T. Tiropanis
{"title":"Deconstructing Diffusion on Tumblr: Structural and Temporal Aspects","authors":"N. Alrajebah, L. Carr, Markus Luczak-Rösch, T. Tiropanis","doi":"10.1145/3091478.3091491","DOIUrl":"https://doi.org/10.1145/3091478.3091491","url":null,"abstract":"Online social networks enable collectives of users to create and share content at scale. The diffusion of content through the network, and the resulting information cascades, are phenomena that have been widely investigated on various platforms, which facilitate information diffusion using diverse technical mechanisms, user interfaces and incentives. This paper focuses on Tumblr, an online microblogging social network with a core 'reblogging' functionality that allows information to diffuse across its network by appearing on multiple user blogs. The formation of any cascade network is visible as a list of reblogging events attached as notes to each appearance of the post in the cascade. In this paper, we examine cascade networks on Tumblr, recreated from the series of diffusion events, and analyse them from structural and temporal perspectives. To achieve this, we utilise a cascade construction model that create cascade networks, overcoming problems of a lack of contextual information and missing/degraded data. Finally, we compare cascades in Tumblr with those appearing on other social network platforms. Our analysis shows that popular content on Tumblr creates 'large' cascades that are deep, branching into a large number of separate and long paths, having a consistent number of reblogs at each depth and at each given time.","PeriodicalId":165747,"journal":{"name":"Proceedings of the 2017 ACM on Web Science Conference","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131910125","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yongren Shi, Kai Mast, Ingmar Weber, Agrippa Kellum, M. Macy
{"title":"Cultural Fault Lines and Political Polarization","authors":"Yongren Shi, Kai Mast, Ingmar Weber, Agrippa Kellum, M. Macy","doi":"10.1145/3091478.3091520","DOIUrl":"https://doi.org/10.1145/3091478.3091520","url":null,"abstract":"Survey research reveals deep partisan divisions in the U.S. that extend beyond politics to include cultural tastes, lifestyle choices, and consumer preferences. We show how co-following on Twitter can be used to measure the extent to which these divisions are also evident in social media. We measure political alignment (location on the red-blue spectrum), relevance (overlap between cultural and political interests), and polarization (internal division) in music, movies, hobbies, sports, vehicles, food and drink, technology, universities, religions, and business. The results provide compelling evidence that \"Tesla liberals\"{ and \"bird hunting conservatives\" are stereotypes grounded in empirical reality.","PeriodicalId":165747,"journal":{"name":"Proceedings of the 2017 ACM on Web Science Conference","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132767355","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Closed-Loop Opinion Formation","authors":"L. Spinelli, M. Crovella","doi":"10.1145/3091478.3091483","DOIUrl":"https://doi.org/10.1145/3091478.3091483","url":null,"abstract":"When information sources are moderated by recommender systems, so-called \"filter bubbles\" may restrict the diversity of content made available to users, potentially affecting their opinions. User opinions may in turn affect the output of recommender systems. In this work we ask how the dynamical system defined by user and recommender systems behaves, as each element evolves in time. In particular, we look at whether the use of recommender system can affect user experience and user opinions in a systematic way. We define and analyze three metrics to understand those effects - intensity, simplification, and divergence - and we explore both link-based and ratings-based recommender systems. Our results suggest that previous studies of this problem have been too simplistic, and that user opinions can evolve in complex ways under the influence of personalized information sources.","PeriodicalId":165747,"journal":{"name":"Proceedings of the 2017 ACM on Web Science Conference","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127372412","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Lingzi Hong, Cheng Fu, P. Torrens, V. Frías-Martínez
{"title":"Understanding Citizens' and Local Governments' Digital Communications During Natural Disasters: The Case of Snowstorms","authors":"Lingzi Hong, Cheng Fu, P. Torrens, V. Frías-Martínez","doi":"10.1145/3091478.3091502","DOIUrl":"https://doi.org/10.1145/3091478.3091502","url":null,"abstract":"A growing number of citizens and local governments have embraced the use of Twitter to communicate during natural disasters. Studies have shown that online communications during disasters can be explained using crisis communication taxonomies. However, such taxonomies are broad and general, and offer little insight into the detailed content of the communications. In this paper, we propose a semi-automatic framework to extract and compare, in retrospect, the digital communication footprints of citizens and governments during disasters. These footprints, which characterize the topics discussed during a disaster at different spatio-temporal scales, are computed in an unsupervised manner using topic models, and manually labelled to identify specific issues affecting the population. The end objective is to offer detailed information about issues affecting citizens during natural disasters and to compare these against local governments' communications. We evaluate the framework using Twitter communications from 18 snowstorms (including two blizzards) on the US east coast.","PeriodicalId":165747,"journal":{"name":"Proceedings of the 2017 ACM on Web Science Conference","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123400536","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"When Friend Becomes Abuser: Evidence of Friend Abuse in Facebook","authors":"Sajedul Talukder, Bogdan Carbunar","doi":"10.1145/3091478.3098869","DOIUrl":"https://doi.org/10.1145/3091478.3098869","url":null,"abstract":"We show through 2 user studies (n = 80), that a high number of participants have at least 1 Facebook friend whom they believe is likely to abuse their posted photos or status updates, or post offensive, false or malicious content, or with whom they never interact in Facebook and in real life. Our results reveal the importance of developing tools that automatically detect and defend against friend abuse in social networks.","PeriodicalId":165747,"journal":{"name":"Proceedings of the 2017 ACM on Web Science Conference","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128561853","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Consolidating Identities of Authors through Egonet Structure","authors":"Janaína Gomide, Hugo Kling, D. R. Figueiredo","doi":"10.1145/3091478.3098862","DOIUrl":"https://doi.org/10.1145/3091478.3098862","url":null,"abstract":"Individuals often appear with multiple names when considering large data-sets collected from different sources, giving rise to the name ambiguities. Name ambiguity comes in two flavors: a single individual appearing with more than one name (synonym problem); a single name being used to refer to more than one individual (homonym problem). This works focuses on the synonym problem and explores structural patterns of the collaboration network. We consider a scenario where the individual has a specific profile page that lists its bibliographic records. Despite more restrictive, this scenario has become quite common among digital libraries such as DBLP1 and Google Scholar2. Such profiles often have name ambiguities as the profile owner appears with different names within the bibliographic records (e.g., different name spellings, typos, etc). This paper tackle these name ambiguities without resorting to any content information!","PeriodicalId":165747,"journal":{"name":"Proceedings of the 2017 ACM on Web Science Conference","volume":"471 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"113967160","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
B. Kale, Harish Varma Siravuri, Hamed Alhoori, M. Papka
{"title":"Predicting Research that will be Cited in Policy Documents","authors":"B. Kale, Harish Varma Siravuri, Hamed Alhoori, M. Papka","doi":"10.1145/3091478.3098865","DOIUrl":"https://doi.org/10.1145/3091478.3098865","url":null,"abstract":"Scientific publications and other genres of research output are increasingly being cited in policy documents. Citations in documents of this nature could be considered a critical indicator of the significance and societal impact of the research output. In this study, we built classification models that predict whether a particular research work is likely to be cited in a public policy document based on the attention it received online, primarily on social media platforms. We evaluated the classifiers based on their accuracy, precision, and recall values. We found that Random Forest and Multinomial Naive Bayes classifiers performed better overall.","PeriodicalId":165747,"journal":{"name":"Proceedings of the 2017 ACM on Web Science Conference","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130676029","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}