SIGMOD Rec.最新文献_第5页

Technical Perspective:: Natural Language Explanations for Query Results 技术角度:查询结果的自然语言解释

SIGMOD Rec. Pub Date : 2018-09-10 DOI: 10.1145/3277006.3277016

Z. Ives

{"title":"Technical Perspective:: Natural Language Explanations for Query Results","authors":"Z. Ives","doi":"10.1145/3277006.3277016","DOIUrl":"https://doi.org/10.1145/3277006.3277016","url":null,"abstract":"Motivated by conversational agents such as Siri, Cortana, the Google Assistant, and Alexa — there has been a surge of interest in spoken as well as textual natural language interfaces. To this point, such systems have relied on innovations in speech recognition (such as recurrent neural networks, LSTMs, and so on) and in specially encoding specific questionanswering strategies via “skills.” A “natural” question for the SIGMOD community is how to best connect natural language interfaces systems to DBMSs, ideally in a way that generalizes to any database schema or instance. In fact, the problem of providing a natural language interface to a database system (i.e., mapping from a spoken or textual question to a structured query) dates back at least to the 1980s [4]. Such efforts had middling success due to issues of accuracy, so the problems were later revisited in the 2000’s with an eye towards restricting the space of options in order to improve precision [6]. Nonetheless, such systems did not gain much traction, again due to the challenges of ensuring accuracy for a given database when the user might ask an ambiguous question. Recent work by Li and Jagadish [5], called NaLIR, proposed an interactive communicator within the query system, which presents to the user a query tree explaining what the system was going to do — such that the user could correct any mistakes. This was helpful in improving reliability, but it required that the user understand tree structured representations of queries. In “Natural Language Explanations for Query Results,” Deutch and his co-authors suggest that a more effective means of helping the user understand and correct results might be through provenance information — i.e., giving an explanation for each answer of how and why it exists. Their approach adapts the NaLIR system and nicely leverages the recent body of work on provenance semirings [3, 2, 1]. The provenance semiring model has an important property that equivalent query plans (as produced by a query optimizer) will have equivalent provenance expressions. The innovations in this paper are in three areas. First, the authors use the structure of the natural language query itself (and the mappings to structured queries, and then later, from queries to provenance) to present the provenance in a form that matches the natural language query — and thus the user’s expectations. Second, they reduce the size (and repetition) of the provenance via factoring. Finally, they incorporate aggregate results (e.g., counts) in place of certain details. The paper does a great job of clearly identifying and articulating what makes the provenance problem different for natural language query systems, and presenting elegant technical solutions to these new challenges.","PeriodicalId":21740,"journal":{"name":"SIGMOD Rec.","volume":"31 1","pages":"41"},"PeriodicalIF":0.0,"publicationDate":"2018-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82459393","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Technical Perspective:: From Think Parallel to Think Sequential 技术角度:从平行思考到顺序思考

SIGMOD Rec. Pub Date : 2018-09-10 DOI: 10.1145/3277006.3277010

Z. Ives

{"title":"Technical Perspective:: From Think Parallel to Think Sequential","authors":"Z. Ives","doi":"10.1145/3277006.3277010","DOIUrl":"https://doi.org/10.1145/3277006.3277010","url":null,"abstract":"In recent years, the database and distributed systems communities have built a wide variety of runtime systems and programming models for largescale computing over graphs. Such “big graph processing systems” [1, 2, 4, 5, 7] o support highly scalable parallel execution of graph algorithms — e.g., computing shortest paths, graph centrality, connected components, or perhaps even graph clusters. As described in the excellent survey by Yan et al [6], most big graph processing systems require the programmer to adopt a vertex-centric or block-centric programming model. For the former, code only “sees” the state at one vertex, receives messages from other vertices, and can send messages to other vertices. Under the latter, code manages a set of vertices within a subgraph (“block”) and can communicate with the code managing other blocks. In “From think Parallel to Think Sequential,” Fan and colleagues argue that vertexand blockcentric programming models are not natural for programmers trained to think sequentially. Instead, they argue that a more intuitive programming model can be developed out of several very simple primitives that can be composed to do incremental computation (as has also been studied in more general “big data” systems [4, 3]). The authors propose four elegant building blocks: (1) a partial evaluation function, (2) an incremental update handling function, (3) mechanisms for updating and sharing parameters in global fashion, and (4) an aggregate function for when multiple workers are updating the same parameter. They build the GRAPE GRAPh Engine system, which implements this programming model, and they show that it provides excellent performance for a variety of graph algorithms. The paper presents a compelling case that, at least for certain classes of algorithms, the simple primitives may be both more natural and more amenable to optimization than standard vertex-centric approaches.","PeriodicalId":21740,"journal":{"name":"SIGMOD Rec.","volume":"1 1","pages":"14"},"PeriodicalIF":0.0,"publicationDate":"2018-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91235320","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

From Think Parallel to Think Sequential 从平行思考到顺序思考

SIGMOD Rec. Pub Date : 2018-09-10 DOI: 10.1145/3277006.3277011

W. Fan, Yang Cao, Jingbo Xu, Wenyuan Yu, Yinghui Wu, Chao Tian, Jiaxin Jiang, Bohan Zhang

引用次数: 2

Data Quality: The Role of Empiricism 数据质量:经验主义的作用

SIGMOD Rec. Pub Date : 2018-02-22 DOI: 10.1145/3186549.3186559

S. Sadiq, T. Dasu, X. Dong, J. Freire, I. Ilyas, S. Link, Renée J. Miller, Felix Naumann, Xiaofang Zhou, D. Srivastava

引用次数: 37

Commonsense Knowledge in Machine Intelligence 机器智能常识性知识

SIGMOD Rec. Pub Date : 2018-02-22 DOI: 10.1145/3186549.3186562

Niket Tandon, A. Varde, Gerard de Melo

{"title":"Commonsense Knowledge in Machine Intelligence","authors":"Niket Tandon, A. Varde, Gerard de Melo","doi":"10.1145/3186549.3186562","DOIUrl":"https://doi.org/10.1145/3186549.3186562","url":null,"abstract":"There is growing conviction that the future of computing depends on our ability to exploit big data on theWeb to enhance intelligent systems. This includes encyclopedic knowledge for factual details, common sense for human-like reasoning and natural language generation for smarter communication. With recent chatbots conceivably at the verge of passing the Turing Test, there are calls for more common sense oriented alternatives, e.g., the Winograd Schema Challenge. The Aristo QA system demonstrates the lack of common sense in current systems in answering fourth-grade science exam questions. On the language generation front, despite the progress in deep learning, current models are easily confused by subtle distinctions that may require linguistic common sense, e.g.quick food vs. fast food. These issues bear on tasks such as machine translation and should be addressed using common sense acquired from text. Mining common sense from massive amounts of data and applying it in intelligent systems, in several respects, appears to be the next frontier in computing. Our brief overview of the state of Commonsense Knowledge (CSK) in Machine Intelligence provides insights into CSK acquisition, CSK in natural language, applications of CSK and discussion of open issues. This paper provides a report of a tutorial at a recent conference with a brief survey of topics.","PeriodicalId":21740,"journal":{"name":"SIGMOD Rec.","volume":"3 1","pages":"49-52"},"PeriodicalIF":0.0,"publicationDate":"2018-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78735500","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 77

Digree: Building A Distributed Graph Processing Engine out of Single-node Graph Database Installations 从单节点图数据库安装中构建分布式图处理引擎

SIGMOD Rec. Pub Date : 2018-02-22 DOI: 10.1145/3186549.3186555

Vasilis Spyropoulos, Y. Kotidis

引用次数: 4

Dan Suciu Speaks Out on Research, Shyness and Being a Scientist Dan Suciu畅谈研究、害羞和成为一名科学家

SIGMOD Rec. Pub Date : 2018-02-22 DOI: 10.1145/3186549.3186557

M. Winslett, V. Braganholo

引用次数: 0

Provenance and Probabilities in Relational Databases 关系数据库中的来源和概率

SIGMOD Rec. Pub Date : 2017-12-01 DOI: 10.1145/3186549.3186551

P. Senellart