{"title":"Erosion as a novel Approach for removing Semantics and Comparison of different State-of-Art-Methods","authors":"Martin Schorradt, D. Cunningham","doi":"10.1145/3548814.3551458","DOIUrl":null,"url":null,"abstract":"Through language, people convey not only pure semantics, but also information about themselves, such as age, gender, state of mind or health. The supralingual features that carry this information have been a subject of research for a long time. Various procedures have been proposed to remove unneeded semantics from speech recordings, in order to study supralingual information in natural speech. In this paper, we propose a new method for removing sematics, based on erosion, a morphological operator. We compare its effectiveness to different state-of-the-art methods. As established methods we consider two low pass filters with cut off frequencies of 450Hz and 1150Hz and Brownian noise. As a newer method we investigate a filter for spectro-temporal frequencies. To evaluate each method, appropriately processed recordings were presented to a group of participants in a perceptual experiment. The intelligibility was measured by means of the Levenshtein distance. Our results show that erosion itself performs similarly to the established methods, while a combination of erosion and low-pass filter outperforms all other methods.","PeriodicalId":376962,"journal":{"name":"ACM Symposium on Applied Perception 2022","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Symposium on Applied Perception 2022","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3548814.3551458","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Through language, people convey not only pure semantics, but also information about themselves, such as age, gender, state of mind or health. The supralingual features that carry this information have been a subject of research for a long time. Various procedures have been proposed to remove unneeded semantics from speech recordings, in order to study supralingual information in natural speech. In this paper, we propose a new method for removing sematics, based on erosion, a morphological operator. We compare its effectiveness to different state-of-the-art methods. As established methods we consider two low pass filters with cut off frequencies of 450Hz and 1150Hz and Brownian noise. As a newer method we investigate a filter for spectro-temporal frequencies. To evaluate each method, appropriately processed recordings were presented to a group of participants in a perceptual experiment. The intelligibility was measured by means of the Levenshtein distance. Our results show that erosion itself performs similarly to the established methods, while a combination of erosion and low-pass filter outperforms all other methods.