Design of Chinese Speech Intelligibility Evaluation System Based on Machine Learning Algorithm

Shuai Pan
{"title":"Design of Chinese Speech Intelligibility Evaluation System Based on Machine Learning Algorithm","authors":"Shuai Pan","doi":"10.1109/ACEDPI58926.2023.00031","DOIUrl":null,"url":null,"abstract":"With the acceleration of globalization and the popularization of computer technique, the links between different parts of the world are getting closer and closer, and the translation between different languages has become a key research topic of domestic and foreign scholars. Speech recognition is to identify the original speech signal into the corresponding text or other forms of message that can be processed by the computer. Speech recognition technique is an important research direction in the domain of artificial intelligence, which has high research value and commercial value. The method adopted in this paper is machine learning algorithm, and the deep learning technique among machine learning algorithms is selected. In this paper, the influence of high-order harmonic distortion, high frequency and mixed elements on the quality of speech signals caused by human vocal organs and speech signal acquisition equipment is studied. Windowing and framing are performed, so that the whole speech waveform will be divided into many small speech segments with overlapping 25ms, and then appropriate acoustic feature extraction algorithm is used to extract the corresponding acoustic features from the 25ms speech segments. After research, this method ameliorates the efficiency of speech recognition, which is higher than that of traditional methods.","PeriodicalId":124469,"journal":{"name":"2023 Asia-Europe Conference on Electronics, Data Processing and Informatics (ACEDPI)","volume":"31 4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 Asia-Europe Conference on Electronics, Data Processing and Informatics (ACEDPI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACEDPI58926.2023.00031","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

With the acceleration of globalization and the popularization of computer technique, the links between different parts of the world are getting closer and closer, and the translation between different languages has become a key research topic of domestic and foreign scholars. Speech recognition is to identify the original speech signal into the corresponding text or other forms of message that can be processed by the computer. Speech recognition technique is an important research direction in the domain of artificial intelligence, which has high research value and commercial value. The method adopted in this paper is machine learning algorithm, and the deep learning technique among machine learning algorithms is selected. In this paper, the influence of high-order harmonic distortion, high frequency and mixed elements on the quality of speech signals caused by human vocal organs and speech signal acquisition equipment is studied. Windowing and framing are performed, so that the whole speech waveform will be divided into many small speech segments with overlapping 25ms, and then appropriate acoustic feature extraction algorithm is used to extract the corresponding acoustic features from the 25ms speech segments. After research, this method ameliorates the efficiency of speech recognition, which is higher than that of traditional methods.
基于机器学习算法的汉语语音可理解度评价系统设计
随着全球化进程的加快和计算机技术的普及,世界各地之间的联系越来越紧密,不同语言之间的翻译已成为国内外学者的重点研究课题。语音识别就是将原来的语音信号识别成相应的文本或其他形式的信息,并由计算机进行处理。语音识别技术是人工智能领域的一个重要研究方向,具有很高的研究价值和商业价值。本文采用的方法是机器学习算法,在机器学习算法中选择了深度学习技术。本文研究了由人体发声器官和语音信号采集设备引起的高次谐波失真、高频和混合因素对语音信号质量的影响。对整个语音波形进行加窗和分帧,将整个语音波形划分为多个25ms重叠的小语音段,然后使用合适的声学特征提取算法从25ms语音段中提取相应的声学特征。经过研究,该方法提高了语音识别的效率,比传统方法的效率要高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信