{"title":"Political Polarization and Platform Migration:: A Study of Parler and Twitter Usage by United States of America Congress Members","authors":"Jacqueline M. Otala, Gillian Kurtic, Isabella Grasso, Yu Liu, Jeanna Neefe Matthews, Golshan Madraki","doi":"10.1145/3442442.3452305","DOIUrl":"https://doi.org/10.1145/3442442.3452305","url":null,"abstract":"Growing dissatisfaction with platform governance decisions at major social media platforms like Twitter, Facebook, and Instagram has led to a number of substantial efforts, originating both on the political right and the political left, to shift to new platforms. In this paper, we examine one of the most impactful of these platform migration efforts, a recent effort primarily on the political right to shift from Twitter to Parler in response to Twitter's increased efforts to flag misinformation in the lead up to the 2020 election in the US. As a case study, we analyze the usage of Parler by all members of the United States Congress and compare that to their usage of Twitter. Even though usage of Parler, even at its peak, was only a small percentage of Twitter usage, Parler usage has been impactful. Specifically, it was linked to the planning of the January 6, 2021 attack on the United States Capitol building. Going forward, Parler itself may not have a large and lasting impact, but it offers important lessons about the relationship between political polarization, platform migration, and the real-world political impacts of platform governance decisions and the splintering of our media landscape.","PeriodicalId":129420,"journal":{"name":"Companion Proceedings of the Web Conference 2021","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123953050","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"SciBiD: Novel Scientometrics and NoSQL-enabled Scalable and domain-specific Analysis of Big Scholar Data","authors":"M. Bohlouli, Jonathan Hermann, Fabian Sunnus","doi":"10.1145/3442442.3453544","DOIUrl":"https://doi.org/10.1145/3442442.3453544","url":null,"abstract":"It is quite important how to correlate and find out relationships between how people grow up and succeed in their research field compared to their field’s grows. In many cases, people refer only to indices such as citation count, h-index, i10-index and compare scientists from different fields in a similar situation with the same variables. It is not a fair comparison since fields are different in development and being cited. In this paper, we used the acceleration concept from physics and propose a new method with new metrics to efficiently and fairly evaluate scientists according to the real-time analysis of their recent status compared to their field’s grows. This considers various inputs such as whether a person is beginner scientists or professional and applies all such key inputs in the evaluation. The evaluation is also over time. The results showed better evaluation compared to state-of-the-art metrics.","PeriodicalId":129420,"journal":{"name":"Companion Proceedings of the Web Conference 2021","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125193015","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Tweet Sentiment Analysis of the 2020 U.S. Presidential Election","authors":"Ethan Xia, Han Yue, Hongfu Liu","doi":"10.1145/3442442.3452322","DOIUrl":"https://doi.org/10.1145/3442442.3452322","url":null,"abstract":"In this paper, we conducted a tweet sentiment analysis of the 2020 U.S. Presidential Election between Donald Trump and Joe Biden. Specially, we identified the Multi-Layer Perceptron classifier as the methodology with the best performance on the Sanders Twitter benchmark dataset. We collected a sample of over 260,000 tweets related to the 2020 U.S. Presidential Election from the Twitter website via Twitter API, processed feature extraction, and applied Multi-Layer Perceptron to classify these tweets with a positive or negative sentiment. From the results, we concluded that (1) contrary to popular poll results, the candidates had a very close negative to positive sentiment ratio, (2) negative sentiment is more common and prominent than positive sentiment within the social media domain, (3) some key events can be detected by the trends of sentiment on social media, and (4) sentiment analysis can be used as a low-cost and easy alternative to gather political opinion.","PeriodicalId":129420,"journal":{"name":"Companion Proceedings of the Web Conference 2021","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130139977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Learning Spatio-Temporal Behavioural Representations for Urban Activity Forecasting","authors":"F. Salim","doi":"10.1145/3442442.3451892","DOIUrl":"https://doi.org/10.1145/3442442.3451892","url":null,"abstract":"Understanding human activity patterns in cities enables a more efficient and sustainable energy, transport, and resource planning. In this invited talk, after laying out the background on spatio-temporal representation, I will present our unsupervised approaches to handle large-scale mutivariate sensor data from heterogeneous sources, prior to modelling them further with the rich contextual signals obtained from the environment. I will also present several spatio-temporal prediction and recommendation problems, leveraging graph-based enrichment and embedding techniques, with applications in continuous trajectory prediction, visitor intent profiling, and urban flow forecasting.","PeriodicalId":129420,"journal":{"name":"Companion Proceedings of the Web Conference 2021","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129659637","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Marketing Communications and the Semantic Web: Theoretical Intersections and Practical Implications","authors":"Teodora Petkova","doi":"10.1145/3442442.3453704","DOIUrl":"https://doi.org/10.1145/3442442.3453704","url":null,"abstract":"This research explores the content of marketing communications on the Web in their dialogic and data aspects. Bringing together theories from the fields of digital marketing communications and the Semantic Web, the work investigates how web marketing content is used for building semantic relationships between data nodes (by using schema.org) and across semiotic interpretative routes (by adhering to dialogic principles for communication).","PeriodicalId":129420,"journal":{"name":"Companion Proceedings of the Web Conference 2021","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129290840","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yusuf Arslan, Kevin Allix, Lisa Veiber, Cedric Lothritz, Tegawendé F. Bissyandé, Jacques Klein, A. Goujon
{"title":"A Comparison of Pre-Trained Language Models for Multi-Class Text Classification in the Financial Domain","authors":"Yusuf Arslan, Kevin Allix, Lisa Veiber, Cedric Lothritz, Tegawendé F. Bissyandé, Jacques Klein, A. Goujon","doi":"10.1145/3442442.3451375","DOIUrl":"https://doi.org/10.1145/3442442.3451375","url":null,"abstract":"Neural networks for language modeling have been proven effective on several sub-tasks of natural language processing. Training deep language models, however, is time-consuming and computationally intensive. Pre-trained language models such as BERT are thus appealing since (1) they yielded state-of-the-art performance, and (2) they offload practitioners from the burden of preparing the adequate resources (time, hardware, and data) to train models. Nevertheless, because pre-trained models are generic, they may underperform on specific domains. In this study, we investigate the case of multi-class text classification, a task that is relatively less studied in the literature evaluating pre-trained language models. Our work is further placed under the industrial settings of the financial domain. We thus leverage generic benchmark datasets from the literature and two proprietary datasets from our partners in the financial technological industry. After highlighting a challenge for generic pre-trained models (BERT, DistilBERT, RoBERTa, XLNet, XLM) to classify a portion of the financial document dataset, we investigate the intuition that a specialized pre-trained model for financial documents, such as FinBERT, should be leveraged. Nevertheless, our experiments show that the FinBERT model, even with an adapted vocabulary, does not lead to improvements compared to the generic BERT models.","PeriodicalId":129420,"journal":{"name":"Companion Proceedings of the Web Conference 2021","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130588282","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Challenges and Opportunities in Using Data Science for Homelessness Service Provision","authors":"C. Chelmis, Wenting Qi, Wonhyung Lee","doi":"10.1145/3442442.3453454","DOIUrl":"https://doi.org/10.1145/3442442.3453454","url":null,"abstract":"Homelessness service provision, a task of great societal relevance, requires solutions to several urgent problems facing our humanity. Data science, that has recently emerged as a potential catalyst in addressing long standing problems related to human services, offers immense potential. However, homelessness service provision presents unignorable challenges (e.g., assessment methods and data bias) that are are seldom found in other domains, requiring cross-discipline collaborations and cross-pollination of ideas. This work summarizes the challenges offered by homelessness service provision tasks, as well as the problems and the opportunities that exist for advancing both data science and human services. We begin by highlighting typical goals of homelessness service provision, and subsequently describe homelessness service data along with their properties, that make it challenging to use traditional data science methods. Along the way, we discuss some of the existing efforts and promising directions for data science, and conclude by discussing the importance of a deep collaboration between data science and domain experts for synergistic advancements in both disciplines.","PeriodicalId":129420,"journal":{"name":"Companion Proceedings of the Web Conference 2021","volume":"66 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121530354","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Dmitri Goldenberg, Guy Tsype, Igor Spivak, Javier Albert, Amir Tzur
{"title":"Learning to Persist: Exploring the Tradeoff Between Model Optimization and Experience Consistency","authors":"Dmitri Goldenberg, Guy Tsype, Igor Spivak, Javier Albert, Amir Tzur","doi":"10.1145/3442442.3452051","DOIUrl":"https://doi.org/10.1145/3442442.3452051","url":null,"abstract":"Machine learning models and recommender systems play a crucial role in web applications, providing personalized experiences to each customer. Recurring visits of the same customer raise a nontrivial question about the persistence of the experience. Given a changing user context, alongside online algorithms that update over time, the optimal treatment might differ from past model decisions. However, changing customer experience may create inconsistency and harm customer satisfaction and business process completion. This paper discusses the tradeoff between providing the user with a consistent experience and suggesting an up-to-date optimal treatment. We offer preliminary approaches to tackle the persistence problem and explore the tradeoffs in a simulated study.","PeriodicalId":129420,"journal":{"name":"Companion Proceedings of the Web Conference 2021","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132636084","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Feature Analysis Tool for Batch RL Datasets","authors":"Ruiyang Xu, Zhengxing Chen","doi":"10.1145/3442442.3453147","DOIUrl":"https://doi.org/10.1145/3442442.3453147","url":null,"abstract":"Batch RL is concerned about learning a decision policy from a given dataset without interacting with the environment. Although research is actively conducted on learning-related issues (e.g., convergence speed, stability, and safety), empirical challenges before learning are largely ignored. Many RL practitioners face the challenge of determining whether a designed Markov Decision Process (MDP) is valid and meaningful. This study proposes a model-based method to check whether an MDP designed for a given dataset is well formulated through a heuristic-based feature analysis. We tested our method in constructed as well as more realistic environments. Our results show that our approach can identify potential problems of data. As far as we know, performing validity analysis on batch RL data is a novel direction, and we envision that our tool serves as a motivational example to help practitioners apply RL more easily.","PeriodicalId":129420,"journal":{"name":"Companion Proceedings of the Web Conference 2021","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132898818","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Demonstration of Faceted Search on Scholarly Knowledge Graphs","authors":"Golsa Heidari, Ahmad Ramadan, M. Stocker, S. Auer","doi":"10.1145/3442442.3458605","DOIUrl":"https://doi.org/10.1145/3442442.3458605","url":null,"abstract":"Scientists always look for the most accurate and relevant answer to their queries on the scholarly literature. Traditional scholarly search systems list documents instead of providing direct answers to the search queries. As data in knowledge graphs are not acquainted semantically, they are not machine-readable. Therefore, a search on scholarly knowledge graphs ends up in a full-text search, not a search in the content of scholarly literature. In this demo, we present a faceted search system that retrieves data from a scholarly knowledge graph, which can be compared and filtered to better satisfy user information needs. Our practice’s novelty is that we use dynamic facets, which means facets are not fixed and will change according to the content of a comparison.","PeriodicalId":129420,"journal":{"name":"Companion Proceedings of the Web Conference 2021","volume":"97 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134473978","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}