技术角度:查询结果的自然语言解释

SIGMOD Rec. Pub Date : 2018-09-10 DOI:10.1145/3277006.3277016

Z. Ives

{"title":"技术角度:查询结果的自然语言解释","authors":"Z. Ives","doi":"10.1145/3277006.3277016","DOIUrl":null,"url":null,"abstract":"Motivated by conversational agents such as Siri, Cortana, the Google Assistant, and Alexa — there has been a surge of interest in spoken as well as textual natural language interfaces. To this point, such systems have relied on innovations in speech recognition (such as recurrent neural networks, LSTMs, and so on) and in specially encoding specific questionanswering strategies via “skills.” A “natural” question for the SIGMOD community is how to best connect natural language interfaces systems to DBMSs, ideally in a way that generalizes to any database schema or instance. In fact, the problem of providing a natural language interface to a database system (i.e., mapping from a spoken or textual question to a structured query) dates back at least to the 1980s [4]. Such efforts had middling success due to issues of accuracy, so the problems were later revisited in the 2000’s with an eye towards restricting the space of options in order to improve precision [6]. Nonetheless, such systems did not gain much traction, again due to the challenges of ensuring accuracy for a given database when the user might ask an ambiguous question. Recent work by Li and Jagadish [5], called NaLIR, proposed an interactive communicator within the query system, which presents to the user a query tree explaining what the system was going to do — such that the user could correct any mistakes. This was helpful in improving reliability, but it required that the user understand tree structured representations of queries. In “Natural Language Explanations for Query Results,” Deutch and his co-authors suggest that a more effective means of helping the user understand and correct results might be through provenance information — i.e., giving an explanation for each answer of how and why it exists. Their approach adapts the NaLIR system and nicely leverages the recent body of work on provenance semirings [3, 2, 1]. The provenance semiring model has an important property that equivalent query plans (as produced by a query optimizer) will have equivalent provenance expressions. The innovations in this paper are in three areas. First, the authors use the structure of the natural language query itself (and the mappings to structured queries, and then later, from queries to provenance) to present the provenance in a form that matches the natural language query — and thus the user’s expectations. Second, they reduce the size (and repetition) of the provenance via factoring. Finally, they incorporate aggregate results (e.g., counts) in place of certain details. The paper does a great job of clearly identifying and articulating what makes the provenance problem different for natural language query systems, and presenting elegant technical solutions to these new challenges.","PeriodicalId":21740,"journal":{"name":"SIGMOD Rec.","volume":"31 1","pages":"41"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Technical Perspective:: Natural Language Explanations for Query Results\",\"authors\":\"Z. Ives\",\"doi\":\"10.1145/3277006.3277016\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Motivated by conversational agents such as Siri, Cortana, the Google Assistant, and Alexa — there has been a surge of interest in spoken as well as textual natural language interfaces. To this point, such systems have relied on innovations in speech recognition (such as recurrent neural networks, LSTMs, and so on) and in specially encoding specific questionanswering strategies via “skills.” A “natural” question for the SIGMOD community is how to best connect natural language interfaces systems to DBMSs, ideally in a way that generalizes to any database schema or instance. In fact, the problem of providing a natural language interface to a database system (i.e., mapping from a spoken or textual question to a structured query) dates back at least to the 1980s [4]. Such efforts had middling success due to issues of accuracy, so the problems were later revisited in the 2000’s with an eye towards restricting the space of options in order to improve precision [6]. Nonetheless, such systems did not gain much traction, again due to the challenges of ensuring accuracy for a given database when the user might ask an ambiguous question. Recent work by Li and Jagadish [5], called NaLIR, proposed an interactive communicator within the query system, which presents to the user a query tree explaining what the system was going to do — such that the user could correct any mistakes. This was helpful in improving reliability, but it required that the user understand tree structured representations of queries. In “Natural Language Explanations for Query Results,” Deutch and his co-authors suggest that a more effective means of helping the user understand and correct results might be through provenance information — i.e., giving an explanation for each answer of how and why it exists. Their approach adapts the NaLIR system and nicely leverages the recent body of work on provenance semirings [3, 2, 1]. The provenance semiring model has an important property that equivalent query plans (as produced by a query optimizer) will have equivalent provenance expressions. The innovations in this paper are in three areas. First, the authors use the structure of the natural language query itself (and the mappings to structured queries, and then later, from queries to provenance) to present the provenance in a form that matches the natural language query — and thus the user’s expectations. Second, they reduce the size (and repetition) of the provenance via factoring. Finally, they incorporate aggregate results (e.g., counts) in place of certain details. The paper does a great job of clearly identifying and articulating what makes the provenance problem different for natural language query systems, and presenting elegant technical solutions to these new challenges.\",\"PeriodicalId\":21740,\"journal\":{\"name\":\"SIGMOD Rec.\",\"volume\":\"31 1\",\"pages\":\"41\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-09-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"SIGMOD Rec.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3277006.3277016\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"SIGMOD Rec.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3277006.3277016","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

在Siri、Cortana、Google Assistant和Alexa等对话代理的推动下，人们对语音和文本自然语言界面的兴趣激增。到目前为止，这样的系统依赖于语音识别方面的创新(如循环神经网络、lstm等)，以及通过“技能”对特定的问答策略进行特殊编码。SIGMOD社区的一个“自然”问题是如何最好地将自然语言接口系统连接到dbms，理想情况下，以一种泛化到任何数据库模式或实例的方式。事实上，为数据库系统提供自然语言接口的问题(即从口语或文本问题映射到结构化查询)至少可以追溯到20世纪80年代[4]。由于准确性的问题，这些努力取得了中等程度的成功，因此这些问题后来在2000年代重新审视，着眼于限制选项的空间，以提高精度[6]。尽管如此，这样的系统并没有获得太多的关注，这也是由于当用户可能提出一个模棱两可的问题时，要确保给定数据库的准确性是一个挑战。Li和Jagadish[5]最近的工作，称为NaLIR，提出了查询系统内的交互式通信器，它向用户呈现查询树，解释系统将要做什么，这样用户就可以纠正任何错误。这有助于提高可靠性，但它要求用户理解查询的树状结构表示。在“查询结果的自然语言解释”中，Deutch和他的合著者建议，帮助用户理解和纠正结果的更有效方法可能是通过来源信息，即为每个答案提供解释，说明它是如何存在的以及为什么存在的。他们的方法适应了NaLIR系统，并很好地利用了最近关于来源半线索的工作[3,2,1]。起源半循环模型有一个重要的属性，即等价的查询计划(由查询优化器生成)将具有等价的起源表达式。本文的创新点主要体现在三个方面。首先，作者使用自然语言查询本身的结构(以及到结构化查询的映射，然后是从查询到来源的映射)，以与自然语言查询匹配的形式呈现来源——从而满足用户的期望。其次，它们通过分解减少了来源的大小(和重复)。最后，它们合并聚合结果(例如计数)来代替某些细节。这篇论文在清楚地识别和阐明自然语言查询系统的来源问题的不同之处方面做得很好，并为这些新挑战提供了优雅的技术解决方案。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Technical Perspective:: Natural Language Explanations for Query Results

Motivated by conversational agents such as Siri, Cortana, the Google Assistant, and Alexa — there has been a surge of interest in spoken as well as textual natural language interfaces. To this point, such systems have relied on innovations in speech recognition (such as recurrent neural networks, LSTMs, and so on) and in specially encoding specific questionanswering strategies via “skills.” A “natural” question for the SIGMOD community is how to best connect natural language interfaces systems to DBMSs, ideally in a way that generalizes to any database schema or instance. In fact, the problem of providing a natural language interface to a database system (i.e., mapping from a spoken or textual question to a structured query) dates back at least to the 1980s [4]. Such efforts had middling success due to issues of accuracy, so the problems were later revisited in the 2000’s with an eye towards restricting the space of options in order to improve precision [6]. Nonetheless, such systems did not gain much traction, again due to the challenges of ensuring accuracy for a given database when the user might ask an ambiguous question. Recent work by Li and Jagadish [5], called NaLIR, proposed an interactive communicator within the query system, which presents to the user a query tree explaining what the system was going to do — such that the user could correct any mistakes. This was helpful in improving reliability, but it required that the user understand tree structured representations of queries. In “Natural Language Explanations for Query Results,” Deutch and his co-authors suggest that a more effective means of helping the user understand and correct results might be through provenance information — i.e., giving an explanation for each answer of how and why it exists. Their approach adapts the NaLIR system and nicely leverages the recent body of work on provenance semirings [3, 2, 1]. The provenance semiring model has an important property that equivalent query plans (as produced by a query optimizer) will have equivalent provenance expressions. The innovations in this paper are in three areas. First, the authors use the structure of the natural language query itself (and the mappings to structured queries, and then later, from queries to provenance) to present the provenance in a form that matches the natural language query — and thus the user’s expectations. Second, they reduce the size (and repetition) of the provenance via factoring. Finally, they incorporate aggregate results (e.g., counts) in place of certain details. The paper does a great job of clearly identifying and articulating what makes the provenance problem different for natural language query systems, and presenting elegant technical solutions to these new challenges.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

SIGMOD Rec.

自引率

0.00%

发文量