D. O'Shaughnessy, P. Kabal, D. Bernardi, L. Barbeau, C. Chu, J. Moncet
{"title":"语音增强在音频监控中的应用","authors":"D. O'Shaughnessy, P. Kabal, D. Bernardi, L. Barbeau, C. Chu, J. Moncet","doi":"10.1109/CCST.1988.75991","DOIUrl":null,"url":null,"abstract":"The authors describe ways to cancel interference where an available reference signal is not synchronized with the surveillance recording, i.e., the reference is obtained later from a phonograph record on air check recording from a broadcast source. As a specific example, they discuss their experiences processing a wiretap recording used in an actual court case. They transformed the reference signal to reflect room and transmission effects and then subtracted the resulting secondary signal from the primary intercept signal, thus enhancing speech from the desired talkers by removing interfering sounds. The intercept signal was subject to time-scale modifications due to variable phonograph and tape recorder speeds, which adversely affected the ability to cancel interference automatically. The two methods that were most successful for enhancement were least-mean-squares adaptive cancellation and spectral subtraction.<<ETX>>","PeriodicalId":129971,"journal":{"name":"Proceedings. Institute of Electrical and Electronics Engineers 1988 International Carnahan Conference on Security Technology, Crime Countermeasures","volume":"793 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1988-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Applying speech enhancement to audio surveillance\",\"authors\":\"D. O'Shaughnessy, P. Kabal, D. Bernardi, L. Barbeau, C. Chu, J. Moncet\",\"doi\":\"10.1109/CCST.1988.75991\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The authors describe ways to cancel interference where an available reference signal is not synchronized with the surveillance recording, i.e., the reference is obtained later from a phonograph record on air check recording from a broadcast source. As a specific example, they discuss their experiences processing a wiretap recording used in an actual court case. They transformed the reference signal to reflect room and transmission effects and then subtracted the resulting secondary signal from the primary intercept signal, thus enhancing speech from the desired talkers by removing interfering sounds. The intercept signal was subject to time-scale modifications due to variable phonograph and tape recorder speeds, which adversely affected the ability to cancel interference automatically. The two methods that were most successful for enhancement were least-mean-squares adaptive cancellation and spectral subtraction.<<ETX>>\",\"PeriodicalId\":129971,\"journal\":{\"name\":\"Proceedings. Institute of Electrical and Electronics Engineers 1988 International Carnahan Conference on Security Technology, Crime Countermeasures\",\"volume\":\"793 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1988-10-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings. Institute of Electrical and Electronics Engineers 1988 International Carnahan Conference on Security Technology, Crime Countermeasures\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CCST.1988.75991\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. Institute of Electrical and Electronics Engineers 1988 International Carnahan Conference on Security Technology, Crime Countermeasures","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCST.1988.75991","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The authors describe ways to cancel interference where an available reference signal is not synchronized with the surveillance recording, i.e., the reference is obtained later from a phonograph record on air check recording from a broadcast source. As a specific example, they discuss their experiences processing a wiretap recording used in an actual court case. They transformed the reference signal to reflect room and transmission effects and then subtracted the resulting secondary signal from the primary intercept signal, thus enhancing speech from the desired talkers by removing interfering sounds. The intercept signal was subject to time-scale modifications due to variable phonograph and tape recorder speeds, which adversely affected the ability to cancel interference automatically. The two methods that were most successful for enhancement were least-mean-squares adaptive cancellation and spectral subtraction.<>