Design of Chinese Speech Intelligibility Evaluation System Based on Machine Learning Algorithm

2023 Asia-Europe Conference on Electronics, Data Processing and Informatics (ACEDPI) Pub Date : 2023-04-01 DOI:10.1109/ACEDPI58926.2023.00031

Shuai Pan

{"title":"Design of Chinese Speech Intelligibility Evaluation System Based on Machine Learning Algorithm","authors":"Shuai Pan","doi":"10.1109/ACEDPI58926.2023.00031","DOIUrl":null,"url":null,"abstract":"With the acceleration of globalization and the popularization of computer technique, the links between different parts of the world are getting closer and closer, and the translation between different languages has become a key research topic of domestic and foreign scholars. Speech recognition is to identify the original speech signal into the corresponding text or other forms of message that can be processed by the computer. Speech recognition technique is an important research direction in the domain of artificial intelligence, which has high research value and commercial value. The method adopted in this paper is machine learning algorithm, and the deep learning technique among machine learning algorithms is selected. In this paper, the influence of high-order harmonic distortion, high frequency and mixed elements on the quality of speech signals caused by human vocal organs and speech signal acquisition equipment is studied. Windowing and framing are performed, so that the whole speech waveform will be divided into many small speech segments with overlapping 25ms, and then appropriate acoustic feature extraction algorithm is used to extract the corresponding acoustic features from the 25ms speech segments. After research, this method ameliorates the efficiency of speech recognition, which is higher than that of traditional methods.","PeriodicalId":124469,"journal":{"name":"2023 Asia-Europe Conference on Electronics, Data Processing and Informatics (ACEDPI)","volume":"31 4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 Asia-Europe Conference on Electronics, Data Processing and Informatics (ACEDPI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACEDPI58926.2023.00031","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

With the acceleration of globalization and the popularization of computer technique, the links between different parts of the world are getting closer and closer, and the translation between different languages has become a key research topic of domestic and foreign scholars. Speech recognition is to identify the original speech signal into the corresponding text or other forms of message that can be processed by the computer. Speech recognition technique is an important research direction in the domain of artificial intelligence, which has high research value and commercial value. The method adopted in this paper is machine learning algorithm, and the deep learning technique among machine learning algorithms is selected. In this paper, the influence of high-order harmonic distortion, high frequency and mixed elements on the quality of speech signals caused by human vocal organs and speech signal acquisition equipment is studied. Windowing and framing are performed, so that the whole speech waveform will be divided into many small speech segments with overlapping 25ms, and then appropriate acoustic feature extraction algorithm is used to extract the corresponding acoustic features from the 25ms speech segments. After research, this method ameliorates the efficiency of speech recognition, which is higher than that of traditional methods.

查看原文本刊更多论文

基于机器学习算法的汉语语音可理解度评价系统设计

随着全球化进程的加快和计算机技术的普及，世界各地之间的联系越来越紧密，不同语言之间的翻译已成为国内外学者的重点研究课题。语音识别就是将原来的语音信号识别成相应的文本或其他形式的信息，并由计算机进行处理。语音识别技术是人工智能领域的一个重要研究方向，具有很高的研究价值和商业价值。本文采用的方法是机器学习算法，在机器学习算法中选择了深度学习技术。本文研究了由人体发声器官和语音信号采集设备引起的高次谐波失真、高频和混合因素对语音信号质量的影响。对整个语音波形进行加窗和分帧，将整个语音波形划分为多个25ms重叠的小语音段，然后使用合适的声学特征提取算法从25ms语音段中提取相应的声学特征。经过研究，该方法提高了语音识别的效率，比传统方法的效率要高。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 Asia-Europe Conference on Electronics, Data Processing and Informatics (ACEDPI)

自引率

0.00%

发文量