基于情感的文本到语音转换系统综述

International Journal of Soft Computing and Engineering Pub Date : 2021-09-30 DOI:10.35940/ijsce.a3529.0911121

Bhushan Hemant Dhimate, Manjiri Vitthal Khopade, Avadhoot Yogesh Dhere, Supriya Dhumale

{"title":"基于情感的文本到语音转换系统综述","authors":"Bhushan Hemant Dhimate, Manjiri Vitthal Khopade, Avadhoot Yogesh Dhere, Supriya Dhumale","doi":"10.35940/ijsce.a3529.0911121","DOIUrl":null,"url":null,"abstract":"Text to speech conversion is one of the applications of machine learning. It is widely used in search engines, standalone applications, web applications, chatbots and android applications. But still there is need to upgrade text to speech system so that we can get more interactive and user-friendly application. Traditional text to speech application has monotonous voice as output which does not has emotions in it and seems to be more mechanized. So, there is need to improvise the existing system by embedding the flavour of emotions in it. Existing text to speech cannot be used in story telling applications also it does not provide effective communication. Most of the Text to Speech systems are developed using algorithms such as Support Vector Machine (SVM), Naïve Bayes etc. Emotion Based Text to Speech System will help to improvise the existing Text to Speech system. With the help of machine learning and deep learning algorithm such as Recurrent Neural Network can be used for performing sentiment analysis and semantic analysis on the input text. We are going to use neural network which is more effective and help to maintain a relation between previous word and next word. Emotion based text to speech system will be able to identify four emotions ‘happy’, ‘sad’, ‘angry’ and ‘neutral’. Emotion based text to speech system will be beneficial for educational purpose like listening stories from storytelling applications for young budding children. Emotion based text to speech is going to be serviceable for visually impaired individuals.","PeriodicalId":173799,"journal":{"name":"International Journal of Soft Computing and Engineering","volume":"125 4","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Brief Survey on Emotion Based Text to Speech Conversion System\",\"authors\":\"Bhushan Hemant Dhimate, Manjiri Vitthal Khopade, Avadhoot Yogesh Dhere, Supriya Dhumale\",\"doi\":\"10.35940/ijsce.a3529.0911121\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Text to speech conversion is one of the applications of machine learning. It is widely used in search engines, standalone applications, web applications, chatbots and android applications. But still there is need to upgrade text to speech system so that we can get more interactive and user-friendly application. Traditional text to speech application has monotonous voice as output which does not has emotions in it and seems to be more mechanized. So, there is need to improvise the existing system by embedding the flavour of emotions in it. Existing text to speech cannot be used in story telling applications also it does not provide effective communication. Most of the Text to Speech systems are developed using algorithms such as Support Vector Machine (SVM), Naïve Bayes etc. Emotion Based Text to Speech System will help to improvise the existing Text to Speech system. With the help of machine learning and deep learning algorithm such as Recurrent Neural Network can be used for performing sentiment analysis and semantic analysis on the input text. We are going to use neural network which is more effective and help to maintain a relation between previous word and next word. Emotion based text to speech system will be able to identify four emotions ‘happy’, ‘sad’, ‘angry’ and ‘neutral’. Emotion based text to speech system will be beneficial for educational purpose like listening stories from storytelling applications for young budding children. Emotion based text to speech is going to be serviceable for visually impaired individuals.\",\"PeriodicalId\":173799,\"journal\":{\"name\":\"International Journal of Soft Computing and Engineering\",\"volume\":\"125 4\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Soft Computing and Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.35940/ijsce.a3529.0911121\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Soft Computing and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.35940/ijsce.a3529.0911121","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

文本到语音的转换是机器学习的应用之一。它广泛用于搜索引擎、独立应用程序、web应用程序、聊天机器人和android应用程序。但是，还需要对文本转语音系统进行升级，以获得更具交互性和用户友好性的应用程序。传统的文本转语音应用以单调的声音作为输出，没有情感，显得更加机械化。因此，有必要对现有的系统进行即兴创作，将情感的味道嵌入其中。现有的文本转语音不能用于讲故事的应用，也不能提供有效的交流。大多数文本到语音的系统都是使用支持向量机(SVM)、Naïve贝叶斯等算法开发的。基于情感的文本到语音系统有助于对现有的文本到语音系统进行改进。在机器学习和深度学习算法的帮助下，如递归神经网络，可以对输入文本进行情感分析和语义分析。我们将使用更有效的神经网络，它有助于保持前一个单词和下一个单词之间的关系。基于情感的文本到语音系统将能够识别四种情绪:“快乐”、“悲伤”、“愤怒”和“中性”。基于情感的文本到语音系统将有利于教育目的，如从讲故事的应用程序中为年幼的孩子听故事。基于情感的文本到语音将对视障人士有用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Brief Survey on Emotion Based Text to Speech Conversion System

Text to speech conversion is one of the applications of machine learning. It is widely used in search engines, standalone applications, web applications, chatbots and android applications. But still there is need to upgrade text to speech system so that we can get more interactive and user-friendly application. Traditional text to speech application has monotonous voice as output which does not has emotions in it and seems to be more mechanized. So, there is need to improvise the existing system by embedding the flavour of emotions in it. Existing text to speech cannot be used in story telling applications also it does not provide effective communication. Most of the Text to Speech systems are developed using algorithms such as Support Vector Machine (SVM), Naïve Bayes etc. Emotion Based Text to Speech System will help to improvise the existing Text to Speech system. With the help of machine learning and deep learning algorithm such as Recurrent Neural Network can be used for performing sentiment analysis and semantic analysis on the input text. We are going to use neural network which is more effective and help to maintain a relation between previous word and next word. Emotion based text to speech system will be able to identify four emotions ‘happy’, ‘sad’, ‘angry’ and ‘neutral’. Emotion based text to speech system will be beneficial for educational purpose like listening stories from storytelling applications for young budding children. Emotion based text to speech is going to be serviceable for visually impaired individuals.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Journal of Soft Computing and Engineering

自引率

0.00%

发文量