Design and Implementation of an Audio Fingerprinting System for the Identification of Audio Recordings

A. Patil, Lakshmi J Itagi, Ashika Cs, Ambika G, Mallika Ravi
{"title":"Design and Implementation of an Audio Fingerprinting System for the Identification of Audio Recordings","authors":"A. Patil, Lakshmi J Itagi, Ashika Cs, Ambika G, Mallika Ravi","doi":"10.1109/R10-HTC53172.2021.9641681","DOIUrl":null,"url":null,"abstract":"Rapid growth in the multimedia industry due to various streaming platforms has increased audio and video traffic enormously. This traffic generates a massive amount of data that requires efficient algorithms to retrieve the desired data in a short amount of time. Thus, there is a need for efficient audio retrieval methods such as audio fingerprinting. Audio fingerprinting systems mainly use audio signals after processing to obtain a representative hash called an audio fingerprint. The fingerprint holds content information of a recording that can distinguish one recording from another by extracting relevant features from the audio content. Some applications of audio fingerprinting include Music Retrieval, Copyright infringement, digital watermarking and broadcast monitoring. We propose to build a reliable audio fingerprinting system, which uses a robust audio fingerprint extraction method and an efficient search strategy, which uses only limited computing resources, with minimized search time for recognition of audio content, by considering other musical features. Accuracy, confidence, efficiency in storing the fingerprints and speed is used to measure the system's performance. The accuracy of the system depends on the confidence level of the match found. The accuracy varies with the recording time. The ideal recording time is found to be 5 seconds that recognizes the song with accuracy of 83.3%. The system performs well even in the presence of noise with reduced false positives.","PeriodicalId":117626,"journal":{"name":"2021 IEEE 9th Region 10 Humanitarian Technology Conference (R10-HTC)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 9th Region 10 Humanitarian Technology Conference (R10-HTC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/R10-HTC53172.2021.9641681","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Rapid growth in the multimedia industry due to various streaming platforms has increased audio and video traffic enormously. This traffic generates a massive amount of data that requires efficient algorithms to retrieve the desired data in a short amount of time. Thus, there is a need for efficient audio retrieval methods such as audio fingerprinting. Audio fingerprinting systems mainly use audio signals after processing to obtain a representative hash called an audio fingerprint. The fingerprint holds content information of a recording that can distinguish one recording from another by extracting relevant features from the audio content. Some applications of audio fingerprinting include Music Retrieval, Copyright infringement, digital watermarking and broadcast monitoring. We propose to build a reliable audio fingerprinting system, which uses a robust audio fingerprint extraction method and an efficient search strategy, which uses only limited computing resources, with minimized search time for recognition of audio content, by considering other musical features. Accuracy, confidence, efficiency in storing the fingerprints and speed is used to measure the system's performance. The accuracy of the system depends on the confidence level of the match found. The accuracy varies with the recording time. The ideal recording time is found to be 5 seconds that recognizes the song with accuracy of 83.3%. The system performs well even in the presence of noise with reduced false positives.
音频指纹识别系统的设计与实现
由于各种流媒体平台的出现,多媒体行业的快速发展极大地增加了音频和视频流量。这种流量会产生大量的数据,需要高效的算法在短时间内检索所需的数据。因此,需要有效的音频检索方法,如音频指纹。音频指纹系统主要利用音频信号经过处理后得到一个有代表性的哈希值,称为音频指纹。指纹保存录音的内容信息,该信息通过从音频内容中提取相关特征来区分一个录音与另一个录音。音频指纹的一些应用包括音乐检索、版权侵权、数字水印和广播监控。我们提出建立一个可靠的音频指纹识别系统,该系统采用鲁棒的音频指纹提取方法和高效的搜索策略,在考虑其他音乐特征的情况下,只使用有限的计算资源,以最小的搜索时间来识别音频内容。准确度、置信度、指纹存储效率和速度是衡量系统性能的主要指标。系统的准确性取决于找到的匹配的置信度。准确度随记录时间的不同而不同。理想的录音时间为5秒,识别歌曲的准确率为83.3%。即使在噪声存在的情况下,系统也表现良好,误报率降低。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信