A Comparison Model of Broadcast Audio Based on DTW Warping Path
Authors: Yunwei Zhao, Gang Yang
Published in: 2020 IEEE 4th Information Technology, Networking, Electronic and Automation Control Conference (ITNEC), 2020-06-01
DOI: 10.1109/ITNEC48623.2020.9084853 (https://doi.org/10.1109/ITNEC48623.2020.9084853)
To guarantee the safe broadcast of radio programs and to prevent problems such as incorrect broadcasts and intercuts, this paper proposes a comparison model for broadcast audio based on the DTW warping path. The model first resolves the time delay between the two audio signals by searching for an audio synchronization point. It then extracts mel-frequency cepstral coefficients (MFCCs) from both signals and aligns them with dynamic time warping (DTW), which yields the optimal warping path. Finally, it takes the length of the warping path that falls within the best matching range as the similarity of the two audio signals. Experimental results show that the model judges different types of audio programs with high accuracy.
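The DTW and path-scoring steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`frame_distance`, `dtw_path`, `path_similarity`), the Euclidean frame distance, and the `band` parameter are all hypothetical, and the abstract does not define the "best matching range". Here it is assumed to be a band around the diagonal of the alignment grid, with similarity taken as the fraction of path points inside that band. Feature extraction is omitted; the inputs stand in for sequences of MFCC frames.

```python
import math


def frame_distance(a, b):
    """Euclidean distance between two feature frames (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def dtw_path(ref, test):
    """Return (total_cost, path) for the optimal DTW alignment.

    path is a list of (i, j) frame-index pairs running from (0, 0)
    to (len(ref) - 1, len(test) - 1).
    """
    n, m = len(ref), len(test)
    if n == 0 or m == 0:
        raise ValueError("both sequences must contain at least one frame")
    inf = float("inf")
    # cost[i][j]: minimal accumulated cost of aligning ref[:i] with test[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = frame_distance(ref[i - 1], test[j - 1])
            cost[i][j] = d + min(cost[i - 1][j - 1],  # match
                                 cost[i - 1][j],      # advance ref only
                                 cost[i][j - 1])      # advance test only
    # Backtrack from the final cell to recover the warping path.
    i, j = n, m
    path = [(i - 1, j - 1)]
    while (i, j) != (1, 1):
        candidates = [(cost[i - 1][j - 1], i - 1, j - 1),
                      (cost[i - 1][j], i - 1, j),
                      (cost[i][j - 1], i, j - 1)]
        _, i, j = min(candidates)
        path.append((i - 1, j - 1))
    path.reverse()
    return cost[n][m], path


def path_similarity(path, band=2):
    """Fraction of path points within `band` frames of the diagonal.

    Assumption: this band stands in for the paper's "best matching range".
    """
    in_band = sum(1 for i, j in path if abs(i - j) <= band)
    return in_band / len(path)


# Toy one-dimensional "frames"; real input would be MFCC vectors.
reference = [[0.0], [1.0], [2.0], [3.0]]
delayed = [[0.0], [0.0], [0.0], [1.0], [2.0], [3.0]]

_, same_path = dtw_path(reference, reference)
_, delayed_path = dtw_path(reference, delayed)
print(path_similarity(same_path, band=0))
print(path_similarity(delayed_path, band=0))
```

In this toy case an identical signal keeps the path on the diagonal, while an uncorrected two-frame delay pushes most of the path outside a zero-width band. Under the assumptions above, this suggests why the synchronization step comes before the comparison.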