GRAIL: Database Linking Music Metadata Across Artist, Release, and Track

Michael D. Barone, Kurt Dacosta, Gabriel Vigliensoni, M. Woolhouse
{"title":"GRAIL: Database Linking Music Metadata Across Artist, Release, and Track","authors":"Michael D. Barone, Kurt Dacosta, Gabriel Vigliensoni, M. Woolhouse","doi":"10.1145/3144749.3144760","DOIUrl":null,"url":null,"abstract":"Linking information from multiple music databases is important for MIR because it provides a means to determine consistency of metadata between resources/services, which can help facilitate innovative product development and research. However, as yet, no open access tools exist that persistently link and validate metadata resources at the three main entities of music data: artist, release, and track. This paper introduces an open access resource which attempts to address the issue of linking information from multiple music databases. The General Recorded Audio Identity Linker (GRAIL - api.digitalmusiclab.org) is a music metadata ID-linking API that: i) connects International Standard Recording Codes (ISRCs) to music metadata IDs from services such as MusicBrainz, Spotify, and Last.FM; ii) provides these ID linkages as a publicly available resource; iii) confirms linkage accuracy using continuous metadata crawling from music-service APIs; and iv) derives consistency values (CV) for linkages by means of a set of quantifiable criteria. To date, more than 35M tracks, 8M releases, and 900K artists from 16 services have been ingested into GRAIL. We discuss the challenges faced in past attempts to link music metadata, the methods and rationale which we adopted in order to construct GRAIL and to ensure it remains updated with validated information.","PeriodicalId":134943,"journal":{"name":"Proceedings of the 4th International Workshop on Digital Libraries for Musicology","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 4th International Workshop on Digital Libraries for Musicology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3144749.3144760","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Linking information from multiple music databases is important for MIR because it provides a means to determine consistency of metadata between resources/services, which can help facilitate innovative product development and research. However, as yet, no open access tools exist that persistently link and validate metadata resources at the three main entities of music data: artist, release, and track. This paper introduces an open access resource which attempts to address the issue of linking information from multiple music databases. The General Recorded Audio Identity Linker (GRAIL - api.digitalmusiclab.org) is a music metadata ID-linking API that: i) connects International Standard Recording Codes (ISRCs) to music metadata IDs from services such as MusicBrainz, Spotify, and Last.FM; ii) provides these ID linkages as a publicly available resource; iii) confirms linkage accuracy using continuous metadata crawling from music-service APIs; and iv) derives consistency values (CV) for linkages by means of a set of quantifiable criteria. To date, more than 35M tracks, 8M releases, and 900K artists from 16 services have been ingested into GRAIL. We discuss the challenges faced in past attempts to link music metadata, the methods and rationale which we adopted in order to construct GRAIL and to ensure it remains updated with validated information.
圣杯:数据库链接音乐元数据跨越艺术家,发行,和轨道
链接来自多个音乐数据库的信息对MIR很重要,因为它提供了一种确定资源/服务之间元数据一致性的方法,这有助于促进创新产品的开发和研究。然而,到目前为止,还没有开放访问工具能够持久地链接和验证音乐数据的三个主要实体的元数据资源:艺术家、发行和曲目。本文介绍了一个开放存取资源,它试图解决从多个音乐数据库中链接信息的问题。通用录制音频标识链接器(GRAIL - API .digitalmusiclab.org)是一个音乐元数据id链接API: i)连接国际标准录音代码(isrc)到音乐元数据id从服务,如MusicBrainz, Spotify和Last.FM;ii)将这些ID链接作为公共可用资源提供;iii)通过从音乐服务api中连续抓取元数据来确认链接的准确性;iv)通过一组可量化的标准推导出联系的一致性值(CV)。迄今为止,来自16个服务平台的3500多万首歌曲、800万张唱片和90万名艺人已被纳入GRAIL。我们讨论了在过去链接音乐元数据的尝试中所面临的挑战,我们采用的方法和基本原理,以构建GRAIL并确保它与经过验证的信息保持更新。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信