Michael D. Barone, Kurt Dacosta, Gabriel Vigliensoni, M. Woolhouse
Proceedings of the 4th International Workshop on Digital Libraries for Musicology, 2017-10-28
DOI: 10.1145/3144749.3144760 (https://doi.org/10.1145/3144749.3144760)
Citations: 0
GRAIL: Database Linking Music Metadata Across Artist, Release, and Track
Linking information from multiple music databases is important for music information retrieval (MIR) because it provides a means to determine the consistency of metadata between resources and services, which can help facilitate innovative product development and research. However, as yet, no open access tools exist that persistently link and validate metadata resources for the three main entities of music data: artist, release, and track. This paper introduces an open access resource that attempts to address this gap. The General Recorded Audio Identity Linker (GRAIL - api.digitalmusiclab.org) is a music metadata ID-linking API that: i) connects International Standard Recording Codes (ISRCs) to music metadata IDs from services such as MusicBrainz, Spotify, and Last.FM; ii) provides these ID linkages as a publicly available resource; iii) confirms linkage accuracy using continuous metadata crawling from music-service APIs; and iv) derives consistency values (CV) for linkages by means of a set of quantifiable criteria. To date, more than 35M tracks, 8M releases, and 900K artists from 16 services have been ingested into GRAIL. We discuss the challenges faced in past attempts to link music metadata, and the methods and rationale we adopted to construct GRAIL and to keep it updated with validated information.
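To make the idea of an ID linkage with a consistency value more concrete, the following is a minimal Python sketch. It is not GRAIL's implementation: the abstract does not state the criteria behind the CV, so the `TrackLink` record, the `normalize` helper, and the scoring rule (the share of services whose normalized track title matches the most common title) are all assumptions made for illustration. The ISRC and service IDs are placeholders, not real identifiers.

```python
from collections import Counter
from dataclasses import dataclass, field


@dataclass
class TrackLink:
    """Hypothetical record: one ISRC and the track IDs services assign to it."""
    isrc: str
    service_ids: dict[str, str] = field(default_factory=dict)  # service -> ID
    titles: dict[str, str] = field(default_factory=dict)       # service -> title


def normalize(title: str) -> str:
    """Lower-case the title and collapse runs of whitespace."""
    return " ".join(title.casefold().split())


def consistency_value(link: TrackLink) -> float:
    """Assumed criterion: fraction of services agreeing on the majority title."""
    if not link.titles:
        return 0.0
    counts = Counter(normalize(t) for t in link.titles.values())
    majority_count = counts.most_common(1)[0][1]
    return majority_count / len(link.titles)


# Placeholder data; none of these identifiers refer to a real recording.
link = TrackLink(
    isrc="PLACEHOLDER-ISRC",
    service_ids={"musicbrainz": "mb-0001", "spotify": "sp-0001", "lastfm": "lf-0001"},
    titles={
        "musicbrainz": "Example Song",
        "spotify": "example  song",
        "lastfm": "Example Song (Live)",
    },
)
cv = consistency_value(link)
print(round(cv, 2))  # two of three titles agree after normalization
```

A real system would presumably combine several criteria (for example durations or artist names) rather than titles alone; the single-criterion score here only shows how agreement across services could be turned into a number between 0 and 1.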