{"title":"1st International Workshop on Digital Language Archives","authors":"Oksana L. Zavalina, S. Chelliah","doi":"10.1109/JCDL52503.2021.00063","DOIUrl":"https://doi.org/10.1109/JCDL52503.2021.00063","url":null,"abstract":"This virtual workshop on digital language archives - digital libraries that preserve, curate, and provide online access to language data - seeks to address the growing need. It will explore a broad scope of issues related to digital language archives. This includes challenges and opportunities, strategies and solutions for: facilitating depositing and improving access; information organization, architecture, and retrieval; quality assurance; usability; ethical issues; ways of encouraging reuse of deposited data in research, and education. Workshop is expected to support interdisciplinary collaboration among information professionals, linguists, educators, representatives of language communities (including indigenous and other underrepresented), other interested audiences.","PeriodicalId":112400,"journal":{"name":"2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL)","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124935637","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Aoshen Wan, Yan Zhan, Sanjana Tripathi, Jiarong Yang, Mona Sachdev, Shreya Paithankar, J. Duerksen, Bin Chen, Ying Ding
{"title":"Building the COVID-19 Portal By Integrating Literature, Clinical Trials, and Knowledge Graphs","authors":"Aoshen Wan, Yan Zhan, Sanjana Tripathi, Jiarong Yang, Mona Sachdev, Shreya Paithankar, J. Duerksen, Bin Chen, Ying Ding","doi":"10.1109/JCDL52503.2021.00040","DOIUrl":"https://doi.org/10.1109/JCDL52503.2021.00040","url":null,"abstract":"The outbreak of COVID-19 has a severe impact on our families, communities, and businesses. Researchers, practitioners, and administrators need a tool to help them digest this enormous amount of knowledge to address various scientific questions related to COVID-19. With CORD-19 dataset, this paper showcases the COVID-19 portal to portray the research profiles of scientists, bio entities (e.g., gene, drug, disease), and institutions based on the integration of CORD-19 research literature, COVID-19 related clinical trials, PubMed knowledge graph, and the drug discovery knowledge graph. This portal provides the following profiles related to COVID-19: 1) the profile of a research scientist with his/her COVID-19 related publications and clinical trials with tweets amount; 2) the profile of a bio entity which could be a gene, a drug, or a disease with articles and clinical trials; and 3) the profile of an institution with papers authored by researchers from this institution.","PeriodicalId":112400,"journal":{"name":"2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132503890","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Felix Hamborg, Kim Heinser, Anastasia Zhukova, K. Donnay, Bela Gipp
{"title":"Newsalyze: Effective Communication of Person-Targeting Biases in News Articles","authors":"Felix Hamborg, Kim Heinser, Anastasia Zhukova, K. Donnay, Bela Gipp","doi":"10.1109/JCDL52503.2021.00025","DOIUrl":"https://doi.org/10.1109/JCDL52503.2021.00025","url":null,"abstract":"Media bias and its extreme form, fake news, can decisively affect public opinion. Especially when reporting on policy issues, slanted news coverage may strongly influence societal decisions, e.g., in democratic elections. Our paper makes three contributions to address this issue. First, we present a system for bias identification, which combines state-of-the-art methods from natural language understanding. Second, we devise bias-sensitive visualizations to communicate bias in news articles to non-expert news consumers. Third, our main contribution is a large-scale user study that measures bias-awareness in a setting that approximates daily news consumption, e.g., we present respondents with a news overview and individual articles. We not only measure the visualizations' effect on respondents' bias-awareness, but we can also pinpoint the effects on individual components of the visualizations by employing a conjoint design. Our bias-sensitive overviews strongly and significantly increase bias-awareness in respondents. Our study further suggests that our content-driven identification method detects groups of similarly slanted news articles due to substantial biases present in individual news articles. In contrast, the reviewed prior work rather only facilitates the visibility of biases, e.g., by distinguishing left- and right-wing outlets.","PeriodicalId":112400,"journal":{"name":"2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114814514","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Three Dimensions of Science: A Web Tool for 3D Visualization of Scientific Literature","authors":"J. Swacha","doi":"10.1109/JCDL52503.2021.00082","DOIUrl":"https://doi.org/10.1109/JCDL52503.2021.00082","url":null,"abstract":"Graphical analysis is one of the primary methods in the study of networks. While the traditional approach uses a two-dimensional (2D) visualization, once the networks become complex, obtaining anything but superficial observations from 2D graphs becomes very difficult, mainly due to the so-called hairball effect, caused by a large number of overlapping nodes and edges. This problem can be effectively addressed with three-dimensional (3D) visualization. The power of modern web browsers' scripting engines can be utilized to provide 3D visualization without a hassle of installing platform-specific software. Consequently, a number of tools serving this purpose were developed, dedicated to the analysis of various types of networks in domains such as biology, social sciences, or engineering. Quite surprisingly, till now there were no free open-source tools of this kind dedicated to the analysis of networks representing bibliographic data. This paper introduces 3dSciLi, a web tool capable of 3D visualization of five types of such networks (work citations and co-citations, author citations and co-authorship, as well as keyword co-occurrence). The tool requires only an input of a set of bibliographic database search results, freeing the researchers from using a pipeline of programs and manual processing of data for the sake of their 3D visualization.","PeriodicalId":112400,"journal":{"name":"2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115489809","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Finding the Relevance Between Publication Venues Based on Research Trend Similarity and Citation Relationships","authors":"Tomoya Nishide, Marie Katsurai","doi":"10.1109/JCDL52503.2021.00047","DOIUrl":"https://doi.org/10.1109/JCDL52503.2021.00047","url":null,"abstract":"This paper presents a novel tool that finds the relevance between publication venues to foster opportunities for collaboration development. When a user inputs a publication venue name related to the user's research field, our tool first shows several relevant publication venues using results of citation network analysis. After the user selects one of those, our tool shows the trend information for each venue as well as the common keywords between the two venues.","PeriodicalId":112400,"journal":{"name":"2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130118512","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Naman Paharia, Muhammad Syafiq Mohd Pozi, A. Jatowt
{"title":"Change Summarization of Diachronic Scholarly Paper Collections by Semantic Evolution Analysis","authors":"Naman Paharia, Muhammad Syafiq Mohd Pozi, A. Jatowt","doi":"10.1109/JCDL52503.2021.00067","DOIUrl":"https://doi.org/10.1109/JCDL52503.2021.00067","url":null,"abstract":"The amount of scholarly data has been increasing dramatically over the last years. For newcomers to a particular science domain (e.g., IR, physics, NLP) it is often difficult to spot larger trends and to position the latest research in the context of prior scientific achievements and breakthroughs. Similarly, researchers in the history of science are interested in tools that allow them to analyze and visualize changes in particular scientific domains. Temporal summarization and related methods should be then useful for making sense of large volumes of scientific discourse data aggregated over time. We demonstrate a novel approach to analyze the collections of research papers published over longer time periods to provide a high level overview of important semantic changes that occurred over the progress of time. Our approach is based on comparing word semantic representations over time and aims to support users in better understanding of large domain-focused archives of scholarly publications. As an example dataset we use the ACL Anthology Reference Corpus that spans from 1979 to 2015 and contains 22,878 scholarly articles.","PeriodicalId":112400,"journal":{"name":"2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128570517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yoshiyuki Shoji, K. Aihara, N. Kando, Yuta Nakashima, Hiroaki Ohshima, Shio Takidaira, Masaki Ueta, Takehiro Yamamoto, Yusuke Yamamoto
{"title":"Museum Experience into a Souvenir: Generating Memorable Postcards from Guide Device Behavior Log","authors":"Yoshiyuki Shoji, K. Aihara, N. Kando, Yuta Nakashima, Hiroaki Ohshima, Shio Takidaira, Masaki Ueta, Takehiro Yamamoto, Yusuke Yamamoto","doi":"10.1109/JCDL52503.2021.00024","DOIUrl":"https://doi.org/10.1109/JCDL52503.2021.00024","url":null,"abstract":"This paper proposes a method for automatically generating postcards that reflect each visitor's museum experience by analyzing the log of our original iPad app that supports and guides personalized navigation in the National Museum of Ethnology. Museum experiences have become more personalized with the evolution of guiding devices. Each visitor views the different exhibits in a different order. Souvenirs serve to remind visitors of their museum experience and cement it in their memories; thus, souvenir postcards should be tailored to each visitor's museum experience. Such tailored postcards can effectively remind visitors of their experiences, deepen their impressions when they look back at them, and promote post-learning. In this paper, we proposed a system that automatically generates a postcard for each visitor for each visit by selecting the five most relevant and impressive exhibits based on the search and navigation logs on our museum guide app. We analyzed the search logs of the guide devices based on the psychological effects on impressions and memory to estimate which exhibits had the strongest impact on the visitors. We then conducted an in laboratory controlled user experiment with 16 participants to check what exhibits had made an impression on visitors by using the implemented system. The results showed that the exhibits that were seen frequently and the exhibits that participants added to their favorites were the most memorable.","PeriodicalId":112400,"journal":{"name":"2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124447894","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Sandeep Kumar, Tirthankar Ghosal, P. Bharti, Asif Ekbal
{"title":"Sharing is Caring! Joint Multitask Learning Helps Aspect-Category Extraction and Sentiment Detection in Scientific Peer Reviews","authors":"Sandeep Kumar, Tirthankar Ghosal, P. Bharti, Asif Ekbal","doi":"10.1109/JCDL52503.2021.00081","DOIUrl":"https://doi.org/10.1109/JCDL52503.2021.00081","url":null,"abstract":"The peer-review process is the benchmark of research validation. Peer-reviewed texts are the artifacts via which the editors/chairs decide the inclusion/exclusion of a paper in a journal or conference proceedings. Hence it is important for the editors/chairs to carefully analyze the peer-review text from various aspects of the paper (e.g., novelty, substance, soundness, etc.), identify the underlying sentiment of the reviewers, and thereby validate the informativeness of the reviews before making a decision. With the rise in research paper submissions, the current peer-review system is experiencing an unprecedented information overload. Sometimes it becomes stressful for the chairs/editors to make a reasonable decision within the stringent timelines. Here in this work, we attempt an interesting problem to automatically extract the aspect and sentiment from the peer-review texts. We design an end-to-end deep multitask learning model to perform aspect extraction and sentiment classification simultaneously. We show that both these tasks help each other in the predictions. We achieve encouraging performance on a recently released dataset of peer-review texts. We make our codes available for further research11https://www.iitp.ac.in/~ai-nlp-ml/resources.html#aspect-category-sentiment.","PeriodicalId":112400,"journal":{"name":"2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL)","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127061397","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"DRESS: Data-Repository Enhancer through Semantic Sources","authors":"Angel L. Garrido, Carlos Bobed","doi":"10.1109/JCDL52503.2021.00070","DOIUrl":"https://doi.org/10.1109/JCDL52503.2021.00070","url":null,"abstract":"In recent years, there has been a huge research effort in the field of Knowledge Base Population (KBP). General approaches based on statistical techniques have been applied to popular resources on the Web (e.g., Wikipedia) with successful results. However, when it comes to small and private digital libraries, where the stored data is scarce and the existing entities might not be so popular, such approaches are not usually enough: many of the common techniques may lack specific tools to disambiguate entities that operate in a local environment. In this paper, we propose an approach to deal with private and isolated digital collections. Our proposed system (named DRESS, with the idea of “dressing” the digital library) builds a domain Knowledge Base (KB) from scratch, leveraging the available local knowledge. Then, DRESS enriches the KB integrating the local data with external knowledge obtained from the Semantic Web. Enhancing the digital repository in this way allows to build high value services for the user, recommending contents and improving the presentation of results (i.e., via infoboxes). Preliminary evaluations of the system have been carried out with promising results.","PeriodicalId":112400,"journal":{"name":"2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL)","volume":"185 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121269416","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Towards A Reliable Ground-Truth For Biased Language Detection","authors":"Timo Spinde","doi":"10.1109/JCDL52503.2021.00053","DOIUrl":"https://doi.org/10.1109/JCDL52503.2021.00053","url":null,"abstract":"Reference texts such as encyclopedias and news articles can manifest biased language when objective reporting is substituted by subjective writing. Existing methods to detect bias mostly rely on annotated data to train machine learning models. However, low annotator agreement and comparability is a substantial drawback in available media bias corpora. To evaluate data collection options, we collect and compare labels obtained from two popular crowdsourcing platforms. Our results demonstrate the existing crowdsourcing approaches' lack of data quality, underlining the need for a trained expert framework to gather a more reliable dataset. By creating such a framework and gathering a first dataset, we are able to improve Krippendorff's a = 0.144 (crowdsourcing labels) to a = 0.419 (expert labels). We conclude that detailed annotator training increases data quality, improving the performance of existing bias detection systems. We will continue to extend our dataset in the future.","PeriodicalId":112400,"journal":{"name":"2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125395056","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}