P. Kendrick, T. Cox, Francis F. Li, B. Fazenda, Iain Jackson
{"title":"Automatic detection of microphone handling noise","authors":"P. Kendrick, T. Cox, Francis F. Li, B. Fazenda, Iain Jackson","doi":"10.1109/CIP.2014.6844501","DOIUrl":null,"url":null,"abstract":"Microphone handling noise is a common problem with user-generated content. It can occur when the operator inadvertently knocks or brushes a recording device. Handling noise may be impulsive, where a microphone is knocked, or a more sustained rubbing noise, when the microphone is brushed against something. A detector able to accurately detect handling noises caused by rubbing while recording speech, music or quotidian sounds has been developed. Ensembles of decision trees were trained to classify handling noise level over 23 ms frames; a second ensemble flags frames when the noise may be masked by foreground audio. Aggregation of the detection over 1 s yielded a Matthews correlation coefficient of 0.91.","PeriodicalId":117669,"journal":{"name":"2014 4th International Workshop on Cognitive Information Processing (CIP)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2014-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 4th International Workshop on Cognitive Information Processing (CIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIP.2014.6844501","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Microphone handling noise is a common problem with user-generated content. It can occur when the operator inadvertently knocks or brushes a recording device. Handling noise may be impulsive, where a microphone is knocked, or a more sustained rubbing noise, when the microphone is brushed against something. A detector able to accurately detect handling noises caused by rubbing while recording speech, music or quotidian sounds has been developed. Ensembles of decision trees were trained to classify handling noise level over 23 ms frames; a second ensemble flags frames when the noise may be masked by foreground audio. Aggregation of the detection over 1 s yielded a Matthews correlation coefficient of 0.91.