Machine Learning of SPARQL Templates for Question Answering Over LinkedSpending
Authors: Roberto Cocco, M. Atzori, C. Zaniolo
Venue: 2019 IEEE 28th International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (WETICE)
Published: 2019-06-01
DOI: 10.1109/WETICE.2019.00041 (https://doi.org/10.1109/WETICE.2019.00041)
Citations: 8
Abstract
We present a Question Answering system that answers natural language questions over the open RDF spending data provided by LinkedSpending. We propose an original machine-learning approach that learns generalized SPARQL templates from an existing training set of (NL question, SPARQL query) pairs. In our approach, the generalized SPARQL templates are fed to an instance-based classifier that associates a user-provided question with an existing pair, which is then used to answer the question. We employ an external tagger, delegating the Named-Entity Recognition (NER) task to a service developed for the domain we want to query. The problem is particularly challenging because the available training set is small, with only 100 question/SPARQL query pairs. We illustrate the results of our new approach using data provided by task 3 of the Question Answering over Linked Data challenge (QALD-6), showing that we can correctly answer 14 of the 50 test-set questions (28%). These results are then compared with existing systems, including our previous system, QA3, where templates were provided by an expert rather than generated automatically from a training set.
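To make the idea of instance-based template matching concrete, the following is a minimal sketch and not the authors' implementation. It assumes that tagged entities are replaced by a placeholder, that similarity is measured with Jaccard overlap of tokens, and that the nearest training pair supplies the SPARQL template. The training pairs, the `ex:` predicates, the `<ENT>` placeholder, and the function names are all hypothetical, and the entity dictionary stands in for the output of the external tagger.

```python
import re

# Hypothetical training pairs: (generalized question, generalized SPARQL template).
# <ENT> marks an entity slot; the ex: predicates are invented for illustration.
TRAINING_PAIRS = [
    ("what is the total amount spent by <ENT>",
     "SELECT (SUM(?amount) AS ?total) "
     "WHERE { ?obs ex:spender <ENT> ; ex:amount ?amount }"),
    ("how many payments were made to <ENT>",
     "SELECT (COUNT(?obs) AS ?n) WHERE { ?obs ex:recipient <ENT> }"),
]


def generalize(question, entities):
    """Lowercase the question and replace each tagged surface form with <ENT>."""
    text = question.lower()
    for surface in entities:
        text = text.replace(surface.lower(), "<ENT>")
    return text


def tokens(text):
    """Split into a set of word tokens, keeping <ENT> as a single token."""
    return set(re.findall(r"<ENT>|\w+", text))


def jaccard(a, b):
    """Jaccard similarity of two token sets (0.0 if both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def build_query(question, entities):
    """Pick the nearest training pair and fill its template with entity URIs.

    `entities` maps surface forms to URIs; in the paper this tagging is
    delegated to an external NER service, which is simulated here.
    """
    query_tokens = tokens(generalize(question, entities))
    _, template = max(
        TRAINING_PAIRS,
        key=lambda pair: jaccard(query_tokens, tokens(pair[0])),
    )
    for uri in entities.values():
        template = template.replace("<ENT>", f"<{uri}>", 1)
    return template


if __name__ == "__main__":
    print(build_query(
        "What is the total amount spent by the Town of Cary?",
        {"the Town of Cary": "http://example.org/cary"},
    ))
```

A nearest-neighbour lookup of this kind needs no model fitting, which is one reason instance-based methods are a plausible fit for a training set of only 100 pairs; the actual features and similarity measure used in the paper may differ.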