{"title":"背景音乐识别的自优化谱相关方法","authors":"M. Abe, M. Nishiguchi","doi":"10.1109/ICME.2002.1035786","DOIUrl":null,"url":null,"abstract":"This paper proposes a new method of detecting a known reference signal in an input signal highly corrupted by other sounds. One major application of the method is the identification of broadcast background music corrupted by speech. In this method, the reference signal is first decomposed into a number of small time-frequency components, and the maximum similarity between each component and the input is calculated. The similarities for all the components are then integrated by a voting method. Finally, the result is used to determine whether or not the reference exists in the input; and if it exists, to determine its position. Experiments on the identification of background music and the classification of similar TV commercials have shown that this method can identify 100% of target signals with an SNR of -10dB.","PeriodicalId":90694,"journal":{"name":"Proceedings. IEEE International Conference on Multimedia and Expo","volume":"183 1","pages":"333-336 vol.1"},"PeriodicalIF":0.0000,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"Self-optimized spectral correlation method for background music identification\",\"authors\":\"M. Abe, M. Nishiguchi\",\"doi\":\"10.1109/ICME.2002.1035786\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes a new method of detecting a known reference signal in an input signal highly corrupted by other sounds. One major application of the method is the identification of broadcast background music corrupted by speech. In this method, the reference signal is first decomposed into a number of small time-frequency components, and the maximum similarity between each component and the input is calculated. The similarities for all the components are then integrated by a voting method. Finally, the result is used to determine whether or not the reference exists in the input; and if it exists, to determine its position. Experiments on the identification of background music and the classification of similar TV commercials have shown that this method can identify 100% of target signals with an SNR of -10dB.\",\"PeriodicalId\":90694,\"journal\":{\"name\":\"Proceedings. IEEE International Conference on Multimedia and Expo\",\"volume\":\"183 1\",\"pages\":\"333-336 vol.1\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2002-11-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings. IEEE International Conference on Multimedia and Expo\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICME.2002.1035786\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. IEEE International Conference on Multimedia and Expo","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICME.2002.1035786","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Self-optimized spectral correlation method for background music identification
This paper proposes a new method of detecting a known reference signal in an input signal highly corrupted by other sounds. One major application of the method is the identification of broadcast background music corrupted by speech. In this method, the reference signal is first decomposed into a number of small time-frequency components, and the maximum similarity between each component and the input is calculated. The similarities for all the components are then integrated by a voting method. Finally, the result is used to determine whether or not the reference exists in the input; and if it exists, to determine its position. Experiments on the identification of background music and the classification of similar TV commercials have shown that this method can identify 100% of target signals with an SNR of -10dB.