F0 contour prediction for the Kazakh language

A. Kaliyev, Yuri N. Matveev, E. Lyakso, S. Rybin

Proceedings of the 5th International Conference on Engineering and MIS, published 2019-06-06. DOI: 10.1145/3330431.3330436 (https://doi.org/10.1145/3330431.3330436)

This article presents work on predicting fundamental frequency (F0) values for the Kazakh language. The fundamental frequency plays one of the most important roles in speech perception, yet modelling continuous F0 is one of the most difficult tasks in developing intonational speech synthesis systems. The main difficulty is that a speaker can say the same sentence with different intonations and different tones. In this work, we used deep neural networks for accurate, high-quality prediction of F0 values that are as close as possible to natural-sounding Kazakh speech.