{"title":"利用音乐样本上的三连音损失,通过Siamese cnn探索音乐相似性","authors":"Gibran Kasif, comGanesha Thondilege","doi":"10.1109/SCSE59836.2023.10215020","DOIUrl":null,"url":null,"abstract":"In the rapidly evolving digital music landscape, identifying similarities between musical pieces is essential to help musicians avoid unintended copyright infringement and maintain the originality of their work. However, detecting such similarities remains a complex and computationally challenging problem. A novel approach to address this issue is a song similarity detection system that utilizes a Siamese Convolutional Neural Network (CNN) with Triplet Loss for effective audio input comparison. The model is trained on a custom dataset from WhoSampled, an extensive database of information on sampled music, cover songs, and remixes. The dataset comprises pairs of audio samples and interpolations, making it suitable for the Siamese CNN approach. Incorporating Triplet Loss enhances the model’s performance by learning discriminative features for improved comparison. The performance of this system is assessed using a confidence interval-based metric, achieving a 96.86% accuracy at a 99.7% confidence level in determining the similarity between music samples. The solution provides a helpful tool for musicians to actively compare their creations with existing songs, helping to reduce the likelihood of unintentional plagiarism and possible legal issues.","PeriodicalId":429228,"journal":{"name":"2023 International Research Conference on Smart Computing and Systems Engineering (SCSE)","volume":"1085 ","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Exploring Music Similarity through Siamese CNNs using Triplet Loss on Music Samples\",\"authors\":\"Gibran Kasif, comGanesha Thondilege\",\"doi\":\"10.1109/SCSE59836.2023.10215020\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the rapidly evolving digital music landscape, identifying similarities between musical pieces is essential to help musicians avoid unintended copyright infringement and maintain the originality of their work. However, detecting such similarities remains a complex and computationally challenging problem. A novel approach to address this issue is a song similarity detection system that utilizes a Siamese Convolutional Neural Network (CNN) with Triplet Loss for effective audio input comparison. The model is trained on a custom dataset from WhoSampled, an extensive database of information on sampled music, cover songs, and remixes. The dataset comprises pairs of audio samples and interpolations, making it suitable for the Siamese CNN approach. Incorporating Triplet Loss enhances the model’s performance by learning discriminative features for improved comparison. The performance of this system is assessed using a confidence interval-based metric, achieving a 96.86% accuracy at a 99.7% confidence level in determining the similarity between music samples. The solution provides a helpful tool for musicians to actively compare their creations with existing songs, helping to reduce the likelihood of unintentional plagiarism and possible legal issues.\",\"PeriodicalId\":429228,\"journal\":{\"name\":\"2023 International Research Conference on Smart Computing and Systems Engineering (SCSE)\",\"volume\":\"1085 \",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 International Research Conference on Smart Computing and Systems Engineering (SCSE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SCSE59836.2023.10215020\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Research Conference on Smart Computing and Systems Engineering (SCSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SCSE59836.2023.10215020","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Exploring Music Similarity through Siamese CNNs using Triplet Loss on Music Samples
In the rapidly evolving digital music landscape, identifying similarities between musical pieces is essential to help musicians avoid unintended copyright infringement and maintain the originality of their work. However, detecting such similarities remains a complex and computationally challenging problem. A novel approach to address this issue is a song similarity detection system that utilizes a Siamese Convolutional Neural Network (CNN) with Triplet Loss for effective audio input comparison. The model is trained on a custom dataset from WhoSampled, an extensive database of information on sampled music, cover songs, and remixes. The dataset comprises pairs of audio samples and interpolations, making it suitable for the Siamese CNN approach. Incorporating Triplet Loss enhances the model’s performance by learning discriminative features for improved comparison. The performance of this system is assessed using a confidence interval-based metric, achieving a 96.86% accuracy at a 99.7% confidence level in determining the similarity between music samples. The solution provides a helpful tool for musicians to actively compare their creations with existing songs, helping to reduce the likelihood of unintentional plagiarism and possible legal issues.