基于DTW翘曲路径的广播音频比较模型

2020 IEEE 4th Information Technology, Networking, Electronic and Automation Control Conference (ITNEC) Pub Date : 2020-06-01 DOI:10.1109/ITNEC48623.2020.9084853

Yunwei Zhao, Gang Yang

{"title":"基于DTW翘曲路径的广播音频比较模型","authors":"Yunwei Zhao, Gang Yang","doi":"10.1109/ITNEC48623.2020.9084853","DOIUrl":null,"url":null,"abstract":"As to guarantee the safe broadcast of radio programs and prevent the problems such as wrong broadcasting, intercut and so on. A comparison model of broadcast audio based on DTW warping path is proposed in this paper. It first solves time delay problem of two audios before comparison by searching for audio synchronization point. Then it extracts the MFCC coefficient to calculate the audio similarity by dynamic time warping (DTW), which obtains the optimal warping path. Finally calculating the length of the warping path within the best matching range as the similarity of two audios. The Experimental results show that the model can accurately judge different types of audio programs with high accuracy.","PeriodicalId":235524,"journal":{"name":"2020 IEEE 4th Information Technology, Networking, Electronic and Automation Control Conference (ITNEC)","volume":"54 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Comparison Model of Broadcast Audio based on DTW Warping Path\",\"authors\":\"Yunwei Zhao, Gang Yang\",\"doi\":\"10.1109/ITNEC48623.2020.9084853\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"As to guarantee the safe broadcast of radio programs and prevent the problems such as wrong broadcasting, intercut and so on. A comparison model of broadcast audio based on DTW warping path is proposed in this paper. It first solves time delay problem of two audios before comparison by searching for audio synchronization point. Then it extracts the MFCC coefficient to calculate the audio similarity by dynamic time warping (DTW), which obtains the optimal warping path. Finally calculating the length of the warping path within the best matching range as the similarity of two audios. The Experimental results show that the model can accurately judge different types of audio programs with high accuracy.\",\"PeriodicalId\":235524,\"journal\":{\"name\":\"2020 IEEE 4th Information Technology, Networking, Electronic and Automation Control Conference (ITNEC)\",\"volume\":\"54 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE 4th Information Technology, Networking, Electronic and Automation Control Conference (ITNEC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ITNEC48623.2020.9084853\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 4th Information Technology, Networking, Electronic and Automation Control Conference (ITNEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITNEC48623.2020.9084853","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

为了保证广播节目的安全播出，防止误播、插播等问题的发生。提出了一种基于DTW翘曲路径的广播音频比较模型。首先通过搜索音频同步点来解决两个音频的时间延迟问题，然后再进行比较。然后提取MFCC系数，通过动态时间翘曲(DTW)计算音频相似度，得到最优的翘曲路径。最后计算最佳匹配范围内的扭曲路径长度作为两个音频的相似度。实验结果表明，该模型能够准确判断不同类型的音频节目，具有较高的准确率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Comparison Model of Broadcast Audio based on DTW Warping Path

As to guarantee the safe broadcast of radio programs and prevent the problems such as wrong broadcasting, intercut and so on. A comparison model of broadcast audio based on DTW warping path is proposed in this paper. It first solves time delay problem of two audios before comparison by searching for audio synchronization point. Then it extracts the MFCC coefficient to calculate the audio similarity by dynamic time warping (DTW), which obtains the optimal warping path. Finally calculating the length of the warping path within the best matching range as the similarity of two audios. The Experimental results show that the model can accurately judge different types of audio programs with high accuracy.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2020 IEEE 4th Information Technology, Networking, Electronic and Automation Control Conference (ITNEC)

自引率

0.00%

发文量