{"title":"Demonstration of ThalamusDB: Answering Complex SQL Queries with Natural Language Predicates on Multi-Modal Data","authors":"Saehan Jo, Immanuel Trummer","doi":"10.1145/3555041.3589730","DOIUrl":null,"url":null,"abstract":"ThalamusDB supports SQL queries with natural language predicates on multi-modal data. Our data model extends the relational model and integrates multi-modal data, including visual, audio, and text data, as columns. Users can write SQL queries including predicates on multi-modal data, described in natural language. In this demonstration, we show how ThalamusDB enables users to query multi-modal data. Visitors can write their own SQL queries on two real-world data sets gathered from Craigslist and YouTube. ThalamusDB has a specialized optimizer that selects execution plans that minimize the overall cost of answering such queries. Query execution involves pre-trained neural models as well as a relational database as processing engines. ThalamusDB collects a limited number of labels for selected data items to translate similarity scores into binary predicate evaluation. Our demonstration enables visitors to compare optimized plans against naive plans in terms of processing latency. ThalamusDB allows users to trade query result precision for reduced processing overheads. Our demonstration interface enables visitors to change the performance objectives and observe their effects on final result precision as well as computation time and number of labeling requests. Similar to online aggregation, our interactive interface allows users to track shrinking error bounds during query execution.","PeriodicalId":161812,"journal":{"name":"Companion of the 2023 International Conference on Management of Data","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Companion of the 2023 International Conference on Management of Data","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3555041.3589730","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
ThalamusDB supports SQL queries with natural language predicates on multi-modal data. Our data model extends the relational model and integrates multi-modal data, including visual, audio, and text data, as columns. Users can write SQL queries including predicates on multi-modal data, described in natural language. In this demonstration, we show how ThalamusDB enables users to query multi-modal data. Visitors can write their own SQL queries on two real-world data sets gathered from Craigslist and YouTube. ThalamusDB has a specialized optimizer that selects execution plans that minimize the overall cost of answering such queries. Query execution involves pre-trained neural models as well as a relational database as processing engines. ThalamusDB collects a limited number of labels for selected data items to translate similarity scores into binary predicate evaluation. Our demonstration enables visitors to compare optimized plans against naive plans in terms of processing latency. ThalamusDB allows users to trade query result precision for reduced processing overheads. Our demonstration interface enables visitors to change the performance objectives and observe their effects on final result precision as well as computation time and number of labeling requests. Similar to online aggregation, our interactive interface allows users to track shrinking error bounds during query execution.