Design of English Speech Comprehensive Training System for Smart Phone
Xiuhui Hao
2023 IEEE International Conference on Integrated Circuits and Communication Systems (ICICACS), 2023-02-24
DOI: 10.1109/ICICACS57338.2023.10099573
Smartphone software offers abundant resources and simple operation, and it has become an important tool for foreign language learning. This paper studies the key technologies of a comprehensive English speech training system for smartphones. First, text-to-speech conversion draws on knowledge of linguistics and psychology and, supported by computer and other hardware environments, converts text information into natural speech streams. Second, neural-network-based prosody generation learns prosody rules on its own through simulated learning and establishes an association between the language and the prosody of its speech, so as to meet the requirements of prosody processing. Third, neural-network-based speech recognition builds on data preprocessing and speech feature extraction, and a bidirectional recurrent neural network is used to obtain more accurate results.
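The bidirectional recurrent network named in the third point can be illustrated with a minimal sketch. The abstract does not give the network architecture, the feature type, or the layer sizes, so everything below is an assumption made for illustration: a plain tanh recurrent layer, hypothetical untrained random weights, and four made-up three-value feature frames standing in for extracted speech features. The sketch shows only the defining idea, that each frame's output combines a state computed from earlier frames with a state computed from later frames.

```python
import math
import random


def rnn_pass(frames, w_in, w_rec, bias):
    """Run a simple tanh recurrent layer over frames and return every hidden state."""
    hidden_size = len(bias)
    h = [0.0] * hidden_size  # initial hidden state
    states = []
    for x in frames:
        new_h = []
        for j in range(hidden_size):
            total = bias[j]
            total += sum(w_in[j][i] * x[i] for i in range(len(x)))
            total += sum(w_rec[j][k] * h[k] for k in range(hidden_size))
            new_h.append(math.tanh(total))
        h = new_h
        states.append(h)
    return states


def bidirectional_pass(frames, fwd_params, bwd_params):
    """Concatenate the forward-in-time and backward-in-time states for each frame."""
    forward = rnn_pass(frames, *fwd_params)
    # Process the reversed sequence, then flip back so index t lines up with frame t.
    backward = rnn_pass(frames[::-1], *bwd_params)[::-1]
    return [f + b for f, b in zip(forward, backward)]


def random_params(input_size, hidden_size, rng):
    """Hypothetical untrained weights, used only to make the sketch runnable."""
    w_in = [[rng.uniform(-0.5, 0.5) for _ in range(input_size)]
            for _ in range(hidden_size)]
    w_rec = [[rng.uniform(-0.5, 0.5) for _ in range(hidden_size)]
             for _ in range(hidden_size)]
    bias = [0.0] * hidden_size
    return w_in, w_rec, bias


rng = random.Random(0)
# Four frames of three made-up feature values each (not real speech features).
frames = [[0.1, 0.2, 0.3], [0.0, -0.1, 0.4], [0.5, 0.5, -0.2], [-0.3, 0.1, 0.0]]
fwd = random_params(3, 2, rng)
bwd = random_params(3, 2, rng)
outputs = bidirectional_pass(frames, fwd, bwd)
print(len(outputs), len(outputs[0]))  # prints: 4 4
```

Each output vector has twice the hidden size because the two directions are concatenated. In a recognizer, a trained output layer would map these vectors to phoneme or character scores; that stage is omitted here because the abstract does not describe it.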