使用Hadoop, HDFS和c++创建分布式高维索引

2012 10th International Workshop on Content-Based Multimedia Indexing (CBMI) Pub Date : 2012-06-27 DOI:10.1109/CBMI.2012.6269848

G. Gudmundsson, L. Amsaleg, B. Jónsson

{"title":"使用Hadoop, HDFS和c++创建分布式高维索引","authors":"G. Gudmundsson, L. Amsaleg, B. Jónsson","doi":"10.1109/CBMI.2012.6269848","DOIUrl":null,"url":null,"abstract":"This paper describes an initial study where the open-source Hadoop parallel and distributed run-time environment is used to speedup the construction phase of a large high-dimensional index. This paper first discusses the typical practical problems developers may run into when porting their code to Hadoop. It then presents early experimental results showing that the performance gains are substantial when indexing large data sets.","PeriodicalId":120769,"journal":{"name":"2012 10th International Workshop on Content-Based Multimedia Indexing (CBMI)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"Distributed high-dimensional index creation using Hadoop, HDFS and C++\",\"authors\":\"G. Gudmundsson, L. Amsaleg, B. Jónsson\",\"doi\":\"10.1109/CBMI.2012.6269848\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper describes an initial study where the open-source Hadoop parallel and distributed run-time environment is used to speedup the construction phase of a large high-dimensional index. This paper first discusses the typical practical problems developers may run into when porting their code to Hadoop. It then presents early experimental results showing that the performance gains are substantial when indexing large data sets.\",\"PeriodicalId\":120769,\"journal\":{\"name\":\"2012 10th International Workshop on Content-Based Multimedia Indexing (CBMI)\",\"volume\":\"26 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-06-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 10th International Workshop on Content-Based Multimedia Indexing (CBMI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CBMI.2012.6269848\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 10th International Workshop on Content-Based Multimedia Indexing (CBMI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CBMI.2012.6269848","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 9

摘要

本文介绍了一种利用开源Hadoop并行分布式运行环境加速大型高维索引构建阶段的初步研究。本文首先讨论开发人员在将代码移植到Hadoop时可能遇到的典型实际问题。然后给出了早期的实验结果，表明在索引大型数据集时，性能获得了实质性的提高。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Distributed high-dimensional index creation using Hadoop, HDFS and C++

This paper describes an initial study where the open-source Hadoop parallel and distributed run-time environment is used to speedup the construction phase of a large high-dimensional index. This paper first discusses the typical practical problems developers may run into when porting their code to Hadoop. It then presents early experimental results showing that the performance gains are substantial when indexing large data sets.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2012 10th International Workshop on Content-Based Multimedia Indexing (CBMI)

自引率

0.00%

发文量