Research and Optimization of Massive Music Data Access Based on HDFS

Tong Ouyang, Yizhen Cao
{"title":"Research and Optimization of Massive Music Data Access Based on HDFS","authors":"Tong Ouyang, Yizhen Cao","doi":"10.1109/ICIS.2018.8466493","DOIUrl":null,"url":null,"abstract":"To build a storage management platform for music big data, we need to collect massive heterogeneous music resources from the Internet and store them on big data platforms. Therefore, it is a key problem to build a storage management system that is high performance, extensible, scalable and capable of supporting big data. Building big data platform based on HDFS is a feasible scheme. However, HDFS has good performance for accessing large files, but it is very inefficient for small files such as music source files and music metadata. In view of this, this paper proposes a geared to the needs of mass music based on HDFS small file access optimization scheme, using the format of the music data classification, build multistage merger queue, merging small files into large files in order to reduce the number of files, and create an indexing mechanism to access small files. The index file is then stored in the Phoenix + HBase storage repository and associated with the music metadata to improve the reading efficiency of music small files and music metadata. The experimental test verifies the effectiveness of the optimized scheme and meets the demand of music big data storage management platform.","PeriodicalId":447019,"journal":{"name":"2018 IEEE/ACIS 17th International Conference on Computer and Information Science (ICIS)","volume":"59 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE/ACIS 17th International Conference on Computer and Information Science (ICIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIS.2018.8466493","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

To build a storage management platform for music big data, we need to collect massive heterogeneous music resources from the Internet and store them on big data platforms. Therefore, it is a key problem to build a storage management system that is high performance, extensible, scalable and capable of supporting big data. Building big data platform based on HDFS is a feasible scheme. However, HDFS has good performance for accessing large files, but it is very inefficient for small files such as music source files and music metadata. In view of this, this paper proposes a geared to the needs of mass music based on HDFS small file access optimization scheme, using the format of the music data classification, build multistage merger queue, merging small files into large files in order to reduce the number of files, and create an indexing mechanism to access small files. The index file is then stored in the Phoenix + HBase storage repository and associated with the music metadata to improve the reading efficiency of music small files and music metadata. The experimental test verifies the effectiveness of the optimized scheme and meets the demand of music big data storage management platform.
基于HDFS的海量音乐数据访问研究与优化
构建音乐大数据存储管理平台,需要从互联网上收集海量异构音乐资源,并将其存储在大数据平台上。因此,构建高性能、可扩展、可扩展、能够支持大数据的存储管理系统是一个关键问题。构建基于HDFS的大数据平台是一个可行的方案。HDFS对于大文件的访问性能很好,但是对于音乐源文件、音乐元数据等小文件的访问效率非常低。鉴于此,本文提出了一种针对海量音乐需求的基于HDFS的小文件访问优化方案,采用音乐数据分类的格式,构建多阶段合并队列,将小文件合并为大文件以减少文件数量,并创建索引机制对小文件进行访问。索引文件存储在Phoenix + HBase存储库中,与音乐元数据关联,提高音乐小文件和音乐元数据的读取效率。实验验证了优化方案的有效性,满足了音乐大数据存储管理平台的需求。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信