{"title":"Audio Compensation Network: to Improve the Quality of Low-Energy Audio in Visual Sound Separation","authors":"Yining Gao, Pengyuan Zhang, Zejia Tian","doi":"10.1109/ICCECE51280.2021.9342383","DOIUrl":null,"url":null,"abstract":"Separating the single audio from the mixed audio has always been a task that researchers are trying to achieve. In visual audio, visual information can be used as an aid to audio separation, helping us to separate audio better. However, under the current method, the independent sound separated from the mixed video has the phenomenon that high energy audio causes serious interference to the low energy audio. We call it \"over-complete separation\" and \"incomplete separation\", which seriously affects our separation. This paper proposes a new separation network, which can compensate for the loss of low-energy audio in the separation process, improve the signal-to-noise ratio and loudness of low-energy audio, and achieve better results than state-of-the-art methods on our dataset.","PeriodicalId":229425,"journal":{"name":"2021 IEEE International Conference on Consumer Electronics and Computer Engineering (ICCECE)","volume":"55 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Consumer Electronics and Computer Engineering (ICCECE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCECE51280.2021.9342383","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Separating the single audio from the mixed audio has always been a task that researchers are trying to achieve. In visual audio, visual information can be used as an aid to audio separation, helping us to separate audio better. However, under the current method, the independent sound separated from the mixed video has the phenomenon that high energy audio causes serious interference to the low energy audio. We call it "over-complete separation" and "incomplete separation", which seriously affects our separation. This paper proposes a new separation network, which can compensate for the loss of low-energy audio in the separation process, improve the signal-to-noise ratio and loudness of low-energy audio, and achieve better results than state-of-the-art methods on our dataset.