{"title":"Scientific Data Management for Interconnected Critical Infrastructure Systems","authors":"G. Weaver","doi":"10.1109/JCDL52503.2021.00031","DOIUrl":"https://doi.org/10.1109/JCDL52503.2021.00031","url":null,"abstract":"The Maritime Transportation System (MTS) is a nexus of critical infrastructure systems, combining intermodal movements along road, rail, and sea with emerging automation and supply chain management technologies. To understand risk in such an environment, a wide variety of stakeholder viewpoints must be integrated, including those from the Energy and Communications/IT infrastructure sectors. Therefore, this paper presents a data curation and management framework to support the analysis of Interconnected Critical Infrastructures (ICI) that is based on extensive fieldwork and security exercises with several shipping ports and supporting stakeholders. Our first contribution applies the CITE2 URN syntax as an approach to catalog and reference notional and multi-versioned critical infrastructure networks and flows along them. This common reference scheme supports integration of a variety of publicly-available and privately-held data sources such as the National Transportation Atlas Database (NTAD) from the Bureau of Transportation Statistics (BTS), vessel movements from individual ports via harbormaster or Automatic Identification System (AIS) data, and container movements. Our second contribution provides a theoretical framework to support analysis across multiple expressions of the same notional critical infrastructure asset. For example, geospatial grids and graph-based representations of critical infrastructure networks support complementary operations that when integrated, provide a holistic view of risk of the ICI being studied. Results based on the Jack Voltaic 3.0 exercises conducted in Charleston SC demonstrate the utility and adaptability of our data curation and analysis by integrating grid and network-based views on a regional transportation system and its geospatial dependencies on Communications/IT sectors and bulk electric system.","PeriodicalId":112400,"journal":{"name":"2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124288109","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"VeTo-web: A Recommendation Tool for the Expansion of Sets of Scholars","authors":"Serafeim Chatzopoulos, Thanasis Vergoulis, Theodore Dalamagas, Christos Tryfonopoulos","doi":"10.1109/JCDL52503.2021.00058","DOIUrl":"https://doi.org/10.1109/JCDL52503.2021.00058","url":null,"abstract":"Expanding a set of known experts with new ones that share similar expertise is a problem that emerges in various real-life applications. We demonstrate VeTo-web, an open source, publicly available tool that deals with this problem in the context of searching for academic experts. VeTo-web exploits analysis techniques for scholarly knowledge graphs to identify scholars that share similar research activities with a given expert group and offers a Web-based user interface to assist its users in expanding a set of academic experts with additional scholars with similar expertise.","PeriodicalId":112400,"journal":{"name":"2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL)","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124878667","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
R. Srinivasa, Samuel Dodson, Kyoungwon Seo, Dongwook Yoon, Sidney S. Fels
{"title":"NoteLink: A Point-and-Shoot Linking Interface between Students' Handwritten Notebooks and Instructional Videos","authors":"R. Srinivasa, Samuel Dodson, Kyoungwon Seo, Dongwook Yoon, Sidney S. Fels","doi":"10.1109/JCDL52503.2021.00026","DOIUrl":"https://doi.org/10.1109/JCDL52503.2021.00026","url":null,"abstract":"When learning from instructional videos, students frequently take handwritten notes to improve recall and comprehension. When reviewing their notes, it can be difficult to return to the corresponding part of the video. In this paper, we present NoteLink, a mobile application that allows students to take pictures of their notes to re-find and play relevant videos on their smartphone or tablet. Our study followed four phases. In Phase I, we identified the characteristics of students' notes by analyzing 10 engineering students' handwritten notes taken while watching instructional videos. We found: 1) students' notes are comprised of four content types: text, formula, drawing, and a hybrid of two or more types, 2) at least 75% of the notes, regardless of content type, manifest some degree of verbatim overlap with the corresponding video content, and 3) videos are referenced at three scales of temporal granularity: point, interval, and whole video. In Phase II, we designed a prototype mobile application, NoteLink, that retrieves instructional videos that are similar to students' notes. In Phase III, we ran a usability study with 12 engineering students to evaluate their preferences for the temporal granularity of retrieved videos and how search results are displayed. Students reported a preference for matches at the interval temporal granularity. Interviews with participants suggest that NoteLink-like tools for re-finding instructional videos are useful. In Phase IV, we evaluated the retrieval accuracy of NoteLink using the data collected in Phase I. The overall accuracy was 78%, and 98% for textual notes. We also provide design recommendations for optimizing NoteLink.","PeriodicalId":112400,"journal":{"name":"2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124936632","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Visualizing the Evolution of Information Retrieval via the ACM Computer Classification Codes","authors":"J. Buzydlowski, L. Cassel","doi":"10.1109/JCDL52503.2021.00059","DOIUrl":"https://doi.org/10.1109/JCDL52503.2021.00059","url":null,"abstract":"The Association for Computing Machinery (ACM) “provides the computing field's premier Digital Library and serves its members and the computing profession with leading-edge publications, conferences, and career resources” [1]. As part of the submission process to the digital library, each document is tagged with both controlled and uncontrolled vocabulary terms by the author to aid in that document's retrieval by other researchers.","PeriodicalId":112400,"journal":{"name":"2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126813280","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Shawn M. Jones, Michele C. Weigle, Martin Klein, Michael L. Nelson
{"title":"Hypercane: Intelligent Sampling for Web Archive Collections","authors":"Shawn M. Jones, Michele C. Weigle, Martin Klein, Michael L. Nelson","doi":"10.1109/JCDL52503.2021.00049","DOIUrl":"https://doi.org/10.1109/JCDL52503.2021.00049","url":null,"abstract":"Humans can choose individual documents from a web archive collection, but doing so is difficult if they are unfamiliar with the collection. The issue is scale. Most web archive collections consist of thousands of documents. Hypercane is a tool that automates the selection of documents from a web archive collection for summarization or collection exploration.","PeriodicalId":112400,"journal":{"name":"2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL)","volume":"101 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131597280","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Toolbox for the Nearly-Unsupervised Construction of Digital Library Knowledge Graphs","authors":"H. Kroll, Jan Pirklbauer, Wolf-Tilo Balke","doi":"10.1109/JCDL52503.2021.00014","DOIUrl":"https://doi.org/10.1109/JCDL52503.2021.00014","url":null,"abstract":"Knowledge graphs are essential for digital libraries to store entity-centric knowledge. The applications of knowledge graphs range from summarizing entity information over answering complex queries to inferring new knowledge. Yet, building knowledge graphs means either relying on manual curation or designing supervised extraction processes to harvest knowledge from unstructured text. Obviously, both approaches are cost-intensive. Yet, the question is whether we can minimize the efforts to build a knowledge graph. And indeed, we propose a toolbox that provides methods to extract knowledge from arbitrary text. Our toolkit bypasses the need for supervision nearly completely and includes a novel algorithm to close the missing gaps. As a practical demonstration, we analyze our toolbox on established biomedical benchmarks. As far as we know, we are the first who propose, analyze and share a nearly unsupervised and complete toolbox for building knowledge graphs from text.","PeriodicalId":112400,"journal":{"name":"2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129605350","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Towards Transparent Data Cleaning: The Data Cleaning Model Explorer (DCM/X)","authors":"Nikolaus Nova Parulian, Bertram Ludäscher","doi":"10.1109/JCDL52503.2021.00054","DOIUrl":"https://doi.org/10.1109/JCDL52503.2021.00054","url":null,"abstract":"To make data cleaning processes more transparent, we have developed DCM, a data cleaning model that can represent different kinds of provenance information from tools such as OpenRefine. The information in DCM captures the data cleaning history D0 ↝ Dn, i.e., how an input dataset D0 was transformed, through a number of data cleaning transformations, into a “clean” dataset Dn. Here we demonstrate a Python-based toolkit for OpenRefine that allows users to (i) harvest provenance information from previously executed data cleaning recipes and internal project files, (ii) load this information into a DCM database, and then (iii) explore the data lineage and processing history of Dn using provenance queries and visualizations. The provenance information contained in DCM, and in the views and query results over DCM, turns otherwise opaque data cleaning processes into transparent data cleaning workflows suitable for archival, sharing, and reuse.","PeriodicalId":112400,"journal":{"name":"2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123734762","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"JCDL 2021 Tutorial on Systemic Challenges and Computational Solutions on Bias and Unfairness in Peer Review","authors":"Nihar B. Shah","doi":"10.1109/JCDL52503.2021.00075","DOIUrl":"https://doi.org/10.1109/JCDL52503.2021.00075","url":null,"abstract":"Peer review is the backbone of scientific research and determines the composition of scientific digital libraries. Any systemic issues in peer review - such as biases or fraud - can systematically affect the resulting scientific digital library as well as any analyses on that library. They also affect billions of dollars in research grants made via peer review as well as entire careers of researchers. The tutorial will discuss various systemic issues in peer review via insightful experiments, several computational solutions proposed to address these issues, and a number of important open problems. A detailed writeup on the topics of this tutorial as well as a complete list of references is available in [1].","PeriodicalId":112400,"journal":{"name":"2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114242644","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}