{"title":"基于支持向量机的孤立词语音活动检测方法","authors":"Cheng Dai, Linkai Luo, Hong Peng, Qingyun Sun","doi":"10.1109/ICCSE.2018.8468752","DOIUrl":null,"url":null,"abstract":"Voice activity detection (VAD) is an essential part of speech processing system, which aims at extracting active speech information from continuous speech signals. In this paper, we firstly treat VAD as a two-class problem: active speech frame and nonspeech frame, in which the active speech frame is composed of information frame and noise frame. Support vector machine is then applied to solve the two-class problem. Finally, noise frames are removed from the active speech frames by the prior knowledge of noise. An experiment on a speech dataset of isolated numbers shows the accuracy and recall rate on test set reaches 99.49% and 98.06% respectively, which indicates that the proposed method is effective for isolated-word VAD task.","PeriodicalId":228760,"journal":{"name":"2018 13th International Conference on Computer Science & Education (ICCSE)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A Method Based on Support Vector Machine for Voice Activity Detection on Isolated Words\",\"authors\":\"Cheng Dai, Linkai Luo, Hong Peng, Qingyun Sun\",\"doi\":\"10.1109/ICCSE.2018.8468752\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Voice activity detection (VAD) is an essential part of speech processing system, which aims at extracting active speech information from continuous speech signals. In this paper, we firstly treat VAD as a two-class problem: active speech frame and nonspeech frame, in which the active speech frame is composed of information frame and noise frame. Support vector machine is then applied to solve the two-class problem. Finally, noise frames are removed from the active speech frames by the prior knowledge of noise. An experiment on a speech dataset of isolated numbers shows the accuracy and recall rate on test set reaches 99.49% and 98.06% respectively, which indicates that the proposed method is effective for isolated-word VAD task.\",\"PeriodicalId\":228760,\"journal\":{\"name\":\"2018 13th International Conference on Computer Science & Education (ICCSE)\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 13th International Conference on Computer Science & Education (ICCSE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCSE.2018.8468752\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 13th International Conference on Computer Science & Education (ICCSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCSE.2018.8468752","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Method Based on Support Vector Machine for Voice Activity Detection on Isolated Words
Voice activity detection (VAD) is an essential part of speech processing system, which aims at extracting active speech information from continuous speech signals. In this paper, we firstly treat VAD as a two-class problem: active speech frame and nonspeech frame, in which the active speech frame is composed of information frame and noise frame. Support vector machine is then applied to solve the two-class problem. Finally, noise frames are removed from the active speech frames by the prior knowledge of noise. An experiment on a speech dataset of isolated numbers shows the accuracy and recall rate on test set reaches 99.49% and 98.06% respectively, which indicates that the proposed method is effective for isolated-word VAD task.