基于支持向量机和Naïve贝叶斯分类算法的语音活动检测器

N. Selvakumari, V. Radha
{"title":"基于支持向量机和Naïve贝叶斯分类算法的语音活动检测器","authors":"N. Selvakumari, V. Radha","doi":"10.1109/CSPC.2017.8305815","DOIUrl":null,"url":null,"abstract":"Voice or Speech Pathology analysis performs a significant role in the recent record of medical experts. The need for research is the recognition and classifications of tone of pathological voices are believed as a challenging work in the field of speech analysis still now. Commonly Patients are in a position to identify a change in voice parameters, such as hoarseness; however the voice pathologies can result from a wide spectral range of causes, like common cold to a malicious tumor. Medical experts like otolaryngologists were discovering a genuine quantity and range of speech pathologies from the Patients conversation. Unluckily, the current classification rate of voice pathology by the human experts is merely about 60–70%. Thus tone of voice or speech pathologies can be found by the endoscopy techniques and strategies like laryngostroboscopy or medical micro laryngoscopy, which distress the individual to a great scope which is expensive also. The primary objective of the research work is to assist this speech pathology finding process with computer structured diagnostic tools. This speech pathology diagnosis system works predicated on the support of the medical clinic based mostly professional otolaryngologists, by determining and figuring out the chance of the pathology automatically without the endoscopy which escalates the detection of speech pathology at the initial stage. In this research work, the conversation signal is examined by the acoustic guidelines and variables like transmission energy, pitch, Silence removal, Windowing, Mel consistency and occurrence Cepstrum, and Jitter. At the final end, the classification strategy i.e Support Vector Machine is employed to classify the standard and pathology speech, predicated on the features extracted in the last phase. Predicated on the results & conversation and dialogue pointed out below, thus the Speech pathology recognition system successfully categorized and labelled the normal tone of voice and the pathological speech which assists with diagnosing and examining the patient.","PeriodicalId":123773,"journal":{"name":"2017 International Conference on Signal Processing and Communication (ICSPC)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"A voice activity detector using SVM and Naïve Bayes classification algorithm\",\"authors\":\"N. Selvakumari, V. Radha\",\"doi\":\"10.1109/CSPC.2017.8305815\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Voice or Speech Pathology analysis performs a significant role in the recent record of medical experts. The need for research is the recognition and classifications of tone of pathological voices are believed as a challenging work in the field of speech analysis still now. Commonly Patients are in a position to identify a change in voice parameters, such as hoarseness; however the voice pathologies can result from a wide spectral range of causes, like common cold to a malicious tumor. Medical experts like otolaryngologists were discovering a genuine quantity and range of speech pathologies from the Patients conversation. Unluckily, the current classification rate of voice pathology by the human experts is merely about 60–70%. Thus tone of voice or speech pathologies can be found by the endoscopy techniques and strategies like laryngostroboscopy or medical micro laryngoscopy, which distress the individual to a great scope which is expensive also. The primary objective of the research work is to assist this speech pathology finding process with computer structured diagnostic tools. This speech pathology diagnosis system works predicated on the support of the medical clinic based mostly professional otolaryngologists, by determining and figuring out the chance of the pathology automatically without the endoscopy which escalates the detection of speech pathology at the initial stage. In this research work, the conversation signal is examined by the acoustic guidelines and variables like transmission energy, pitch, Silence removal, Windowing, Mel consistency and occurrence Cepstrum, and Jitter. At the final end, the classification strategy i.e Support Vector Machine is employed to classify the standard and pathology speech, predicated on the features extracted in the last phase. Predicated on the results & conversation and dialogue pointed out below, thus the Speech pathology recognition system successfully categorized and labelled the normal tone of voice and the pathological speech which assists with diagnosing and examining the patient.\",\"PeriodicalId\":123773,\"journal\":{\"name\":\"2017 International Conference on Signal Processing and Communication (ICSPC)\",\"volume\":\"2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 International Conference on Signal Processing and Communication (ICSPC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSPC.2017.8305815\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Signal Processing and Communication (ICSPC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSPC.2017.8305815","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9

摘要

语音或言语病理学分析在医学专家的最新记录中发挥着重要作用。病理语音的语调识别和分类一直是语音分析领域的一项具有挑战性的工作。通常,患者能够识别声音参数的变化,例如声音嘶哑;然而,声音病变可以由各种各样的原因引起,从普通感冒到恶性肿瘤。耳鼻喉科医生等医学专家从病人的谈话中发现了大量和广泛的语言病理。遗憾的是,目前人类专家对语音病理的分类率仅为60-70%左右。因此,声调或言语病理可以通过内窥镜技术和策略,如频闪喉镜或医学显微喉镜来发现,这些技术和策略使个体痛苦到很大的范围,也很昂贵。研究工作的主要目的是用计算机结构化诊断工具协助语言病理学发现过程。该语音病理诊断系统的工作基于以专业耳鼻喉科医生为主的医疗诊所的支持,通过自动确定和计算病理的可能性,而无需在初始阶段升级语音病理检测的内窥镜。在这项研究工作中,通过声学准则和变量,如传输能量,基音,去噪,窗口,Mel一致性和发生倒频谱和抖动来检查会话信号。最后,基于上一阶段提取的特征,采用支持向量机分类策略对标准语音和病理语音进行分类。基于下面指出的结果和对话和对话,语言病理识别系统成功地对正常语音和病理语音进行了分类和标记,帮助诊断和检查患者。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A voice activity detector using SVM and Naïve Bayes classification algorithm
Voice or Speech Pathology analysis performs a significant role in the recent record of medical experts. The need for research is the recognition and classifications of tone of pathological voices are believed as a challenging work in the field of speech analysis still now. Commonly Patients are in a position to identify a change in voice parameters, such as hoarseness; however the voice pathologies can result from a wide spectral range of causes, like common cold to a malicious tumor. Medical experts like otolaryngologists were discovering a genuine quantity and range of speech pathologies from the Patients conversation. Unluckily, the current classification rate of voice pathology by the human experts is merely about 60–70%. Thus tone of voice or speech pathologies can be found by the endoscopy techniques and strategies like laryngostroboscopy or medical micro laryngoscopy, which distress the individual to a great scope which is expensive also. The primary objective of the research work is to assist this speech pathology finding process with computer structured diagnostic tools. This speech pathology diagnosis system works predicated on the support of the medical clinic based mostly professional otolaryngologists, by determining and figuring out the chance of the pathology automatically without the endoscopy which escalates the detection of speech pathology at the initial stage. In this research work, the conversation signal is examined by the acoustic guidelines and variables like transmission energy, pitch, Silence removal, Windowing, Mel consistency and occurrence Cepstrum, and Jitter. At the final end, the classification strategy i.e Support Vector Machine is employed to classify the standard and pathology speech, predicated on the features extracted in the last phase. Predicated on the results & conversation and dialogue pointed out below, thus the Speech pathology recognition system successfully categorized and labelled the normal tone of voice and the pathological speech which assists with diagnosing and examining the patient.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信