{"title":"Defining Similarity Spaces for Large-Scale Image Retrieval Through Scientific Workflows","authors":"Luis Fernando Milano Oliveira, D. S. Kaster","doi":"10.1145/3105831.3105863","DOIUrl":null,"url":null,"abstract":"Content-Based Image Retrieval (CBIR) employs visual features from images for searching and retrieving of data. Systems based on this concept depend on a similarity space instance definition, but achieving an ideal instance is a very complex process and is dependent on domain knowledge. At the same time, domain experts are often unable to interact fully with systems because of technicalities. In this paper, we propose an architecture, based on scientific workflows, which allows users with no prior programming experience to build processes on images, creating Similarity Spaces and evaluating them when running similarity queries. Through this architecture, they can use domain expertise to improve image retrieval in a coordinated, auditable and reproducible manner, while being able to process very large image collections. We describe a prototype system and carry out experiments evaluating its performance in various scenarios. The current implementation supports both similarity space definition and querying workflows, achieving suitable speedups with the increase in the number of machines.","PeriodicalId":319729,"journal":{"name":"Proceedings of the 21st International Database Engineering & Applications Symposium","volume":"128 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 21st International Database Engineering & Applications Symposium","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3105831.3105863","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Content-Based Image Retrieval (CBIR) employs visual features from images for searching and retrieving of data. Systems based on this concept depend on a similarity space instance definition, but achieving an ideal instance is a very complex process and is dependent on domain knowledge. At the same time, domain experts are often unable to interact fully with systems because of technicalities. In this paper, we propose an architecture, based on scientific workflows, which allows users with no prior programming experience to build processes on images, creating Similarity Spaces and evaluating them when running similarity queries. Through this architecture, they can use domain expertise to improve image retrieval in a coordinated, auditable and reproducible manner, while being able to process very large image collections. We describe a prototype system and carry out experiments evaluating its performance in various scenarios. The current implementation supports both similarity space definition and querying workflows, achieving suitable speedups with the increase in the number of machines.