Title: Speech Enhancement—A Review of Modern Methods
Author: Douglas O'Shaughnessy
Journal: IEEE Transactions on Human-Machine Systems, vol. 54, no. 1, pp. 110-120
Published: 2024-01-05
DOI: 10.1109/THMS.2023.3339663
URL: https://ieeexplore.ieee.org/document/10382416/
Journal metrics: impact factor 3.5; JCR Q2 (Computer Science, Artificial Intelligence)
A review of techniques to improve distorted speech is presented, noting the strengths and weaknesses of common methods. Speech signals are discussed from the point of view of which features should be preserved to retain both naturalness and intelligibility. Enhancement methods range from classical spectral subtraction and Wiener filtering to recent deep neural network approaches. The difficulty of finding objective acoustic measures that approximate perceptual speech quality is discussed. Suggestions to improve these methods are made.
About the journal:
The IEEE Transactions on Human-Machine Systems covers the field of human-machine systems. Its scope spans human systems and human organizational interactions, including cognitive ergonomics, system test and evaluation, and human information processing concerns in systems and organizations.