{"title":"Audio similarity detection algorithm based on Siamese LSTM network","authors":"Zhanli Li, Pengfei Song","doi":"10.1109/ICSP51882.2021.9408942","DOIUrl":null,"url":null,"abstract":"The key technology of audio signal similarity detection lies in the selection of audio signal features and feature matching model. In order to improve the accuracy of the similarity calculation of audios, a method of using LSTM in the basic network part of the Siamese network is proposed. First of all, we extract the Filter banks features of the two audio signals. Then, two feature matrices are input into the network to calculate the audio similarity. Experiments show that the Siamese LSTM network using FBank features can accurately detect the similarity of two audio segments.","PeriodicalId":117159,"journal":{"name":"2021 6th International Conference on Intelligent Computing and Signal Processing (ICSP)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 6th International Conference on Intelligent Computing and Signal Processing (ICSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSP51882.2021.9408942","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The key technology of audio signal similarity detection lies in the selection of audio signal features and feature matching model. In order to improve the accuracy of the similarity calculation of audios, a method of using LSTM in the basic network part of the Siamese network is proposed. First of all, we extract the Filter banks features of the two audio signals. Then, two feature matrices are input into the network to calculate the audio similarity. Experiments show that the Siamese LSTM network using FBank features can accurately detect the similarity of two audio segments.