数据库查询重构的原则优化框架

Proceedings of the Second International Workshop on Exploratory Search in Databases and the Web Pub Date : 2015-05-31 DOI:10.1145/2795218.2795227

Gautam Das

{"title":"数据库查询重构的原则优化框架","authors":"Gautam Das","doi":"10.1145/2795218.2795227","DOIUrl":null,"url":null,"abstract":"Traditional databases have traditionally supported the Boolean retrieval model, where a query returns all tuples that match the selection conditions specified -- no more and no less. Such a query model is often inconvenient for naive users conducting searches that are often exploratory in nature, since the user may not have a complete idea, or a firm opinion of what she may be looking for. This is especially relevant in the context of the Deep Web, which offers a plethora of searchable data sources such as electronic products, transportation choices, apparel, investment options, etc. Users often encounter two types of problems: (a) they may under-specify the items of interest, and find too many items satisfying the given conditions (the many answers problem), or (b) they may over-specify the items of interest, and find no item in the source satisfying all the provided conditions (the empty answer problem). In this talk, I discuss our recent efforts in developing techniques for iterative \"query reformulation\" by which the system guides the user in a systematic way through several small steps, where each step suggests slight query modifications, until the query reaches a form that generates desirable answers. Our proposed approaches for suggesting query reformulations are driven by novel probabilistic frameworks based on optimizing a wide variety of application-dependent objective functions.","PeriodicalId":211132,"journal":{"name":"Proceedings of the Second International Workshop on Exploratory Search in Databases and the Web","volume":"340 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Principled Optimization Frameworks for Query Reformulation of Database Queries\",\"authors\":\"Gautam Das\",\"doi\":\"10.1145/2795218.2795227\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Traditional databases have traditionally supported the Boolean retrieval model, where a query returns all tuples that match the selection conditions specified -- no more and no less. Such a query model is often inconvenient for naive users conducting searches that are often exploratory in nature, since the user may not have a complete idea, or a firm opinion of what she may be looking for. This is especially relevant in the context of the Deep Web, which offers a plethora of searchable data sources such as electronic products, transportation choices, apparel, investment options, etc. Users often encounter two types of problems: (a) they may under-specify the items of interest, and find too many items satisfying the given conditions (the many answers problem), or (b) they may over-specify the items of interest, and find no item in the source satisfying all the provided conditions (the empty answer problem). In this talk, I discuss our recent efforts in developing techniques for iterative \\\"query reformulation\\\" by which the system guides the user in a systematic way through several small steps, where each step suggests slight query modifications, until the query reaches a form that generates desirable answers. Our proposed approaches for suggesting query reformulations are driven by novel probabilistic frameworks based on optimizing a wide variety of application-dependent objective functions.\",\"PeriodicalId\":211132,\"journal\":{\"name\":\"Proceedings of the Second International Workshop on Exploratory Search in Databases and the Web\",\"volume\":\"340 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-05-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the Second International Workshop on Exploratory Search in Databases and the Web\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2795218.2795227\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Second International Workshop on Exploratory Search in Databases and the Web","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2795218.2795227","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

传统数据库传统上支持布尔检索模型，其中查询返回与指定的选择条件匹配的所有元组——不多也不少。对于进行探索性搜索的新手用户来说，这样的查询模型通常不方便，因为用户可能没有完整的想法，或者对自己要查找的内容没有明确的意见。这在深度网络的背景下尤为重要，深度网络提供了大量可搜索的数据源，如电子产品、交通选择、服装、投资选择等。用户通常会遇到两种类型的问题:(a)他们可能没有充分指定感兴趣的项目，并且找到太多满足给定条件的项目(多答案问题)，或者(b)他们可能过度指定感兴趣的项目，并且在源中没有找到满足所有提供条件的项目(空答案问题)。在这次演讲中，我将讨论我们最近在开发迭代“查询重新表述”技术方面所做的努力，通过该技术，系统以系统的方式引导用户通过几个小步骤，其中每个步骤都建议轻微的查询修改，直到查询达到生成所需答案的形式。我们提出的建议查询重新表述的方法是由基于优化各种应用相关目标函数的新型概率框架驱动的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Principled Optimization Frameworks for Query Reformulation of Database Queries

Traditional databases have traditionally supported the Boolean retrieval model, where a query returns all tuples that match the selection conditions specified -- no more and no less. Such a query model is often inconvenient for naive users conducting searches that are often exploratory in nature, since the user may not have a complete idea, or a firm opinion of what she may be looking for. This is especially relevant in the context of the Deep Web, which offers a plethora of searchable data sources such as electronic products, transportation choices, apparel, investment options, etc. Users often encounter two types of problems: (a) they may under-specify the items of interest, and find too many items satisfying the given conditions (the many answers problem), or (b) they may over-specify the items of interest, and find no item in the source satisfying all the provided conditions (the empty answer problem). In this talk, I discuss our recent efforts in developing techniques for iterative "query reformulation" by which the system guides the user in a systematic way through several small steps, where each step suggests slight query modifications, until the query reaches a form that generates desirable answers. Our proposed approaches for suggesting query reformulations are driven by novel probabilistic frameworks based on optimizing a wide variety of application-dependent objective functions.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the Second International Workshop on Exploratory Search in Databases and the Web

自引率

0.00%

发文量