{"title":"有界树宽实例上表达查询的概率计算","authors":"Mikaël Monet","doi":"10.1145/2926693.2929905","DOIUrl":null,"url":null,"abstract":"Though data uncertainty naturally appears in many real-life situations, traditional database theory and systems tend to assume that the data is reliable and complete. The reason is that of complexity and performance: on arbitrary relational database instances annotated with probabilities, performing exact probabilistic query evaluation is hard. However, a criterion on the shape of the database has been shown in recent work to be sufficient and in some sense necessary to the tractability of this task. Databases whose treewidth is bounded by a constant k are exactly those that can be tractably queried, with respect to quantitative uncertainty estimation. But this is a data complexity result, that does not take into account the cost in terms of the query or of k -- in many cases, this cost is too high for real-world applications. The aim of our PhD research is to study in which circumstances the overall complexity of probabilistic query evaluation can become tractable, aiming at both theoretical and practical results.","PeriodicalId":123723,"journal":{"name":"Proceedings of the 2016 on SIGMOD'16 PhD Symposium","volume":"41 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Probabilistic Evaluation of Expressive Queries on Bounded-Treewidth Instances\",\"authors\":\"Mikaël Monet\",\"doi\":\"10.1145/2926693.2929905\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Though data uncertainty naturally appears in many real-life situations, traditional database theory and systems tend to assume that the data is reliable and complete. The reason is that of complexity and performance: on arbitrary relational database instances annotated with probabilities, performing exact probabilistic query evaluation is hard. However, a criterion on the shape of the database has been shown in recent work to be sufficient and in some sense necessary to the tractability of this task. Databases whose treewidth is bounded by a constant k are exactly those that can be tractably queried, with respect to quantitative uncertainty estimation. But this is a data complexity result, that does not take into account the cost in terms of the query or of k -- in many cases, this cost is too high for real-world applications. The aim of our PhD research is to study in which circumstances the overall complexity of probabilistic query evaluation can become tractable, aiming at both theoretical and practical results.\",\"PeriodicalId\":123723,\"journal\":{\"name\":\"Proceedings of the 2016 on SIGMOD'16 PhD Symposium\",\"volume\":\"41 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-06-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2016 on SIGMOD'16 PhD Symposium\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2926693.2929905\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2016 on SIGMOD'16 PhD Symposium","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2926693.2929905","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Probabilistic Evaluation of Expressive Queries on Bounded-Treewidth Instances
Though data uncertainty naturally appears in many real-life situations, traditional database theory and systems tend to assume that the data is reliable and complete. The reason is that of complexity and performance: on arbitrary relational database instances annotated with probabilities, performing exact probabilistic query evaluation is hard. However, a criterion on the shape of the database has been shown in recent work to be sufficient and in some sense necessary to the tractability of this task. Databases whose treewidth is bounded by a constant k are exactly those that can be tractably queried, with respect to quantitative uncertainty estimation. But this is a data complexity result, that does not take into account the cost in terms of the query or of k -- in many cases, this cost is too high for real-world applications. The aim of our PhD research is to study in which circumstances the overall complexity of probabilistic query evaluation can become tractable, aiming at both theoretical and practical results.