{"title":"Run-time validation of knowledge-based systems","authors":"A. Finlayson, P. Compton","doi":"10.1145/2479832.2479860","DOIUrl":"https://doi.org/10.1145/2479832.2479860","url":null,"abstract":"As knowledge bases become more complex it is increasingly unlikely that they will have been validated against all possible data and therefore an increasing risk of making errors. Run-time validation is checking whether the output of a knowledge base for some data is likely to be correct at the time the data is processed. We have investigated various techniques for runtime validation. The most successful technique has been to constantly re-build a separate knowledge base using a different learning technique with cases labeled by the knowledge base being validated, as training data. Any new cases are processed by both knowledge bases and if the knowledge bases disagree the case is referred for manual checking as a possible outlier. If an outlier is detected the knowledge base is edited to give the correct answer and as cases are processed they are added to the training data for the machine learning knowledge base.","PeriodicalId":388497,"journal":{"name":"Proceedings of the seventh international conference on Knowledge capture","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134069847","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
V. D. Boer, J. Doornik, L. Buitinck, maarten marx, Tim Veken, Kees Ribbens
{"title":"Linking the kingdom: enriched access to a historiographical text","authors":"V. D. Boer, J. Doornik, L. Buitinck, maarten marx, Tim Veken, Kees Ribbens","doi":"10.1145/2479832.2479849","DOIUrl":"https://doi.org/10.1145/2479832.2479849","url":null,"abstract":"Digital history is a branch of digital humanities concerned using ICT to improve study of history. Linked Data provides a way of effective enriched digital access to scientific texts about history (historiographies). In this paper, we present a method for connecting a historiographical text to the Linked Data cloud. We present the method and tools that we use in each of the method's steps. We focus on one extensive case study: the enriched access of an important work of Dutch World War II historiography \"Het Koninkrijk der Nederlanden in de Tweede Wereldoorlog\". We describe the digitization and present two sources of structured knowledge that link to individual text sources, retrievable on the Web of Data. The first is the manually constructed and highly curated \"Back of the Book Index\". The second is a list of extracted Named Entities. We compare both structured sources as stepping stones to the Web of Data and present a number of use cases relevant for both historical researchers as well as for the general public.","PeriodicalId":388497,"journal":{"name":"Proceedings of the seventh international conference on Knowledge capture","volume":"112 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124767578","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Detecting common scientific workflow fragments using templates and execution provenance","authors":"D. Garijo, Óscar Corcho, Y. Gil","doi":"10.1145/2479832.2479848","DOIUrl":"https://doi.org/10.1145/2479832.2479848","url":null,"abstract":"Provenance plays a major role when understanding and reusing the methods applied in a scientific experiment, as it provides a record of inputs, the processes carried out and the use and generation of intermediate and final results. In the specific case of in-silico scientific experiments, a large variety of scientific workflow systems (e.g., Wings, Taverna, Galaxy, Vistrails) have been created to support scientists. All of these systems produce some sort of provenance about the executions of the workflows that encode scientific experiments. However, provenance is normally recorded at a very low level of detail, which complicates the understanding of what happened during execution. In this paper we propose an approach to automatically obtain abstractions from low-level provenance data by finding common workflow fragments on workflow execution provenance and relating them to templates. We have tested our approach with a dataset of workflows published by the Wings workflow system. Our results show that by using these kinds of abstractions we can highlight the most common abstract methods used in the executions of a repository, relating different runs and workflow templates with each other.","PeriodicalId":388497,"journal":{"name":"Proceedings of the seventh international conference on Knowledge capture","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129126143","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Toward semantic interoperability with linked foundationalontologies in ROMULUS","authors":"Z. C. Khan, C. Keet","doi":"10.1145/2479832.2479838","DOIUrl":"https://doi.org/10.1145/2479832.2479838","url":null,"abstract":"A purpose of a foundational ontology is to solve interoperability issues among ontologies. Many foundational ontologies have been developed, re-introducing the ontology interoperability problem. We address this with the new online foundational ontology repository ROMULUS, in which DOLCE, BFO and GFO have been aligned. We summarise the alignments, mappings, and logical inconsistencies of the foundational ontologies, and ROMULUS?s features.","PeriodicalId":388497,"journal":{"name":"Proceedings of the seventh international conference on Knowledge capture","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116061944","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Towards a framework for iteratively signing graph data","authors":"Andreas Kasten, A. Scherp","doi":"10.1145/2479832.2479833","DOIUrl":"https://doi.org/10.1145/2479832.2479833","url":null,"abstract":"When publishing graph data on the web such as vocabularies using RDF(S) or OWL, one has only limited means to verify the authenticity and integrity of the graph data. Today's approaches require a high signature overhead and do not support iterative signing of graph data. This paper describes a first step towards a framework for signing arbitrary graph data provided in RDF(S), Named Graphs, or OWL. Our framework supports signing graph data at different levels of granularity: minimum self-contained graphs (MSG), sets of MSGs, and entire graphs. It supports iteratively signing graph data, e. g., when different parties provide different parts of a common graph, and allows for signing multiple graphs. Both can be done with a constant, low overhead for the resulting signature statements, even when iteratively signing.","PeriodicalId":388497,"journal":{"name":"Proceedings of the seventh international conference on Knowledge capture","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128288099","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Knowledge capture from multiple online sources with the extensible web retrieval toolkit (eWRT)","authors":"A. Weichselbraun, A. Scharl, Heinz-Peter Lang","doi":"10.1145/2479832.2479861","DOIUrl":"https://doi.org/10.1145/2479832.2479861","url":null,"abstract":"Knowledge capture approaches in the age of massive Web data require robust and scalable mechanisms to acquire, consolidate and pre-process large amounts of heterogeneous data, both unstructured and structured. This paper addresses this requirement by introducing the Extensible Web Retrieval Toolkit (eWRT), a modular Python API for retrieving social data from Web sources such as Delicious, Flickr, Yahoo! and Wikipedia. eWRT has been released as an open source library under GNU GPLv3. It includes classes for caching and data management, and provides low-level text processing capabilities including language detection, phonetic string similarity measures, and string normalization.","PeriodicalId":388497,"journal":{"name":"Proceedings of the seventh international conference on Knowledge capture","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126913524","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Proceedings of the seventh international conference on Knowledge capture","authors":"Richard Benjamins, M. d’Aquin, A. Gordon","doi":"10.1145/2479832","DOIUrl":"https://doi.org/10.1145/2479832","url":null,"abstract":"It is our pleasure to welcome you to the Seventh International Conference on Knowledge Capture (K-CAP2013). The K-CAP conference series provides a forum that brings together members of several research communities and practitioners who are interested in efficiently capturing knowledge from a variety of sources and in creating representations that can be useful for automated reasoning, analysis, and other forms of machine processing. Therefore, research in knowledge capture is at the intersection of areas such as knowledge engineering, machine learning, big data, natural-language processing, human-computer interaction, Artificial Intelligence, and the Semantic Web. \u0000 \u0000K-CAP 2012 follows on the success of six previous conferences in 2011 (Banff, Alberta Canada), 2009 (Los Angeles, California, USA), 2007 (Whistler, British Columbia, Canada), in 2005 (Banff, Alberta, Canada), in 2003 (Sanibel Island, Florida, USA), and 2001 (Victoria, British Columbia, Canada), and of the series of Knowledge Acquisition Workshops (KAW), the first of which took place in the same location (Banff, Alberta, Canada) in 1986. The theme of this seventh edition of K-CAP is: Knowledge Capture in the Age of Massive Web Data, reflecting the times we are living in. \u0000 \u0000The call for papers attracted 60 submissions from Europe, America, Asia, and Oceania. The program committee accepted 13 full papers, 4 short papers and 3 application papers, that cover a variety of topics, including ontology engineering, ontology learning, linked open data, information extraction for knowledge capture, and knowledge capture from web data including online textual and multimedia resources. In addition, this volume includes descriptions of 8 posters and demos presented at the conference. \u0000 \u0000In this seventh edition of the conference we have tried to innovate in several ways \u0000Rather than only aiming at research papers, we introduced an applications track for practitioners to present application-oriented contributions. \u0000We issued a call for participation for the first K-CAP \"Datathon\", a \"hackaton\" based on Open Data. \u0000Acknowledging the worldwide economic down turn, and the pressure many research and company budgets are suffering, we set up a crowdfunding initiative to help K-CAP 2013 with extra funds (http://www.indiegogo.com/projects/k-cap-2013). Through this initiative as well as through a standard sponsoring program, we have tried to keep the participation costs as low as possible.","PeriodicalId":388497,"journal":{"name":"Proceedings of the seventh international conference on Knowledge capture","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117337190","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Thomas Gottron, A. Scherp, Bastian Krayer, Arne Peters
{"title":"LODatio: using a schema-level index to support users infinding relevant sources of linked data","authors":"Thomas Gottron, A. Scherp, Bastian Krayer, Arne Peters","doi":"10.1145/2479832.2479841","DOIUrl":"https://doi.org/10.1145/2479832.2479841","url":null,"abstract":"The Linked Open Data (LOD) cloud provides a vast amount of heterogeneous data, distributed over numerous data sources. This makes it difficult to find those data sources in the cloud which are relevant for a given information need. Existing search engines for the Semantic Web focus on instance-oriented information needs, i. e., searching for specific RDF instances or literals and exploring the search results. However, they do not address the question of finding linked data sources relevant to a schema-oriented information need, i. e., queries based on triple patterns relating to a specific combination of RDF types and/or properties. In this paper, we present the semantic search system LODatio leveraging a schema-level index for finding sources of Linked Data relevant to a schema-oriented information need. Beyond its capability to retrieve relevant data sources, LODatio actively supports the user in his schema-oriented search tasks. To this end, it provides ranked result lists of relevant data sources together with example snippets and an estimation of the result set size. Furthermore, LODatio provides support for novel features in semantic search such as recommending alternative queries in order to refine or broaden the result set.","PeriodicalId":388497,"journal":{"name":"Proceedings of the seventh international conference on Knowledge capture","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131474535","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Query generation for semantic datasets","authors":"Jeff Z. Pan, Y. Ren, Honghan Wu, Man Zhu","doi":"10.1145/2479832.2479859","DOIUrl":"https://doi.org/10.1145/2479832.2479859","url":null,"abstract":"Due to the increasing volume of and interconnections between semantic datasets, it becomes a challenging task for novice users to know what are included in a dataset, how they can make use of them, and particularly, what queries should be asked. In this paper we analyse several types of candidate insightful queries and propose a framework to generate such queries and identify their relations. To verify our approach, we implemented our framework and evaluated its performance with benchmark and real world datasets.","PeriodicalId":388497,"journal":{"name":"Proceedings of the seventh international conference on Knowledge capture","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133913679","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Axiomatizing εL⊥-expressible terminological knowledge from erroneous data","authors":"D. Borchmann","doi":"10.1145/2479832.2479835","DOIUrl":"https://doi.org/10.1145/2479832.2479835","url":null,"abstract":"In a recent approach, Baader and Distel proposed an algorithm to axiomatize all terminological knowledge that is valid in a given data set and is expressible in the description logic ELK. This approach is based on the mathematical theory of formal concept analysis. However, this algorithm requires the initial data set to be free of errors, an assumption that normally cannot be made for real-world data. In this work, we propose a first extension of the work of Baader and Distel to handle errors in the data set. The approach we present here is based on the notion of confidence, as it has been developed and used in the area of data mining.","PeriodicalId":388497,"journal":{"name":"Proceedings of the seventh international conference on Knowledge capture","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131865260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}