Ajey Saligrama, H. G. Ranjani, R. Muralishankar, H. N. Shankar
{"title":"一种利用连通时频域评价音乐噪声的客观方法","authors":"Ajey Saligrama, H. G. Ranjani, R. Muralishankar, H. N. Shankar","doi":"10.1109/NCC.2019.8732246","DOIUrl":null,"url":null,"abstract":"In this work, we propose an objective measure to assess the amount of musical noise that results from speech enhancement algorithms. The algorithms can result in nonsmooth suppression of background noise which in turn translates to isolated regions of high energy, referred to as musical noise. We propose to identify such regions by combining time-frequency (TF) bins associated through connectivity along with additional properties of these regions such as area, aspect ratio and total energy. The objective measure proposed is based on density of such regions. The effectiveness of the proposed measure is studied by correlating it with subjective assessment of listeners using enhanced speech of various algorithms.","PeriodicalId":6870,"journal":{"name":"2019 National Conference on Communications (NCC)","volume":"47 1","pages":"1-6"},"PeriodicalIF":0.0000,"publicationDate":"2019-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An Objective Measure to Assess Musical Noise using Connected Time-Frequency Regions\",\"authors\":\"Ajey Saligrama, H. G. Ranjani, R. Muralishankar, H. N. Shankar\",\"doi\":\"10.1109/NCC.2019.8732246\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this work, we propose an objective measure to assess the amount of musical noise that results from speech enhancement algorithms. The algorithms can result in nonsmooth suppression of background noise which in turn translates to isolated regions of high energy, referred to as musical noise. We propose to identify such regions by combining time-frequency (TF) bins associated through connectivity along with additional properties of these regions such as area, aspect ratio and total energy. The objective measure proposed is based on density of such regions. The effectiveness of the proposed measure is studied by correlating it with subjective assessment of listeners using enhanced speech of various algorithms.\",\"PeriodicalId\":6870,\"journal\":{\"name\":\"2019 National Conference on Communications (NCC)\",\"volume\":\"47 1\",\"pages\":\"1-6\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 National Conference on Communications (NCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NCC.2019.8732246\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 National Conference on Communications (NCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NCC.2019.8732246","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An Objective Measure to Assess Musical Noise using Connected Time-Frequency Regions
In this work, we propose an objective measure to assess the amount of musical noise that results from speech enhancement algorithms. The algorithms can result in nonsmooth suppression of background noise which in turn translates to isolated regions of high energy, referred to as musical noise. We propose to identify such regions by combining time-frequency (TF) bins associated through connectivity along with additional properties of these regions such as area, aspect ratio and total energy. The objective measure proposed is based on density of such regions. The effectiveness of the proposed measure is studied by correlating it with subjective assessment of listeners using enhanced speech of various algorithms.