基于情感的文本到语音转换系统综述

Bhushan Hemant Dhimate, Manjiri Vitthal Khopade, Avadhoot Yogesh Dhere, Supriya Dhumale
{"title":"基于情感的文本到语音转换系统综述","authors":"Bhushan Hemant Dhimate, Manjiri Vitthal Khopade, Avadhoot Yogesh Dhere, Supriya Dhumale","doi":"10.35940/ijsce.a3529.0911121","DOIUrl":null,"url":null,"abstract":"Text to speech conversion is one of the applications of machine learning. It is widely used in search engines, standalone applications, web applications, chatbots and android applications. But still there is need to upgrade text to speech system so that we can get more interactive and user-friendly application. Traditional text to speech application has monotonous voice as output which does not has emotions in it and seems to be more mechanized. So, there is need to improvise the existing system by embedding the flavour of emotions in it. Existing text to speech cannot be used in story telling applications also it does not provide effective communication. Most of the Text to Speech systems are developed using algorithms such as Support Vector Machine (SVM), Naïve Bayes etc. Emotion Based Text to Speech System will help to improvise the existing Text to Speech system. With the help of machine learning and deep learning algorithm such as Recurrent Neural Network can be used for performing sentiment analysis and semantic analysis on the input text. We are going to use neural network which is more effective and help to maintain a relation between previous word and next word. Emotion based text to speech system will be able to identify four emotions ‘happy’, ‘sad’, ‘angry’ and ‘neutral’. Emotion based text to speech system will be beneficial for educational purpose like listening stories from storytelling applications for young budding children. Emotion based text to speech is going to be serviceable for visually impaired individuals.","PeriodicalId":173799,"journal":{"name":"International Journal of Soft Computing and Engineering","volume":"125 4","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Brief Survey on Emotion Based Text to Speech Conversion System\",\"authors\":\"Bhushan Hemant Dhimate, Manjiri Vitthal Khopade, Avadhoot Yogesh Dhere, Supriya Dhumale\",\"doi\":\"10.35940/ijsce.a3529.0911121\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Text to speech conversion is one of the applications of machine learning. It is widely used in search engines, standalone applications, web applications, chatbots and android applications. But still there is need to upgrade text to speech system so that we can get more interactive and user-friendly application. Traditional text to speech application has monotonous voice as output which does not has emotions in it and seems to be more mechanized. So, there is need to improvise the existing system by embedding the flavour of emotions in it. Existing text to speech cannot be used in story telling applications also it does not provide effective communication. Most of the Text to Speech systems are developed using algorithms such as Support Vector Machine (SVM), Naïve Bayes etc. Emotion Based Text to Speech System will help to improvise the existing Text to Speech system. With the help of machine learning and deep learning algorithm such as Recurrent Neural Network can be used for performing sentiment analysis and semantic analysis on the input text. We are going to use neural network which is more effective and help to maintain a relation between previous word and next word. Emotion based text to speech system will be able to identify four emotions ‘happy’, ‘sad’, ‘angry’ and ‘neutral’. Emotion based text to speech system will be beneficial for educational purpose like listening stories from storytelling applications for young budding children. Emotion based text to speech is going to be serviceable for visually impaired individuals.\",\"PeriodicalId\":173799,\"journal\":{\"name\":\"International Journal of Soft Computing and Engineering\",\"volume\":\"125 4\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Soft Computing and Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.35940/ijsce.a3529.0911121\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Soft Computing and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.35940/ijsce.a3529.0911121","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

文本到语音的转换是机器学习的应用之一。它广泛用于搜索引擎、独立应用程序、web应用程序、聊天机器人和android应用程序。但是,还需要对文本转语音系统进行升级,以获得更具交互性和用户友好性的应用程序。传统的文本转语音应用以单调的声音作为输出,没有情感,显得更加机械化。因此,有必要对现有的系统进行即兴创作,将情感的味道嵌入其中。现有的文本转语音不能用于讲故事的应用,也不能提供有效的交流。大多数文本到语音的系统都是使用支持向量机(SVM)、Naïve贝叶斯等算法开发的。基于情感的文本到语音系统有助于对现有的文本到语音系统进行改进。在机器学习和深度学习算法的帮助下,如递归神经网络,可以对输入文本进行情感分析和语义分析。我们将使用更有效的神经网络,它有助于保持前一个单词和下一个单词之间的关系。基于情感的文本到语音系统将能够识别四种情绪:“快乐”、“悲伤”、“愤怒”和“中性”。基于情感的文本到语音系统将有利于教育目的,如从讲故事的应用程序中为年幼的孩子听故事。基于情感的文本到语音将对视障人士有用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Brief Survey on Emotion Based Text to Speech Conversion System
Text to speech conversion is one of the applications of machine learning. It is widely used in search engines, standalone applications, web applications, chatbots and android applications. But still there is need to upgrade text to speech system so that we can get more interactive and user-friendly application. Traditional text to speech application has monotonous voice as output which does not has emotions in it and seems to be more mechanized. So, there is need to improvise the existing system by embedding the flavour of emotions in it. Existing text to speech cannot be used in story telling applications also it does not provide effective communication. Most of the Text to Speech systems are developed using algorithms such as Support Vector Machine (SVM), Naïve Bayes etc. Emotion Based Text to Speech System will help to improvise the existing Text to Speech system. With the help of machine learning and deep learning algorithm such as Recurrent Neural Network can be used for performing sentiment analysis and semantic analysis on the input text. We are going to use neural network which is more effective and help to maintain a relation between previous word and next word. Emotion based text to speech system will be able to identify four emotions ‘happy’, ‘sad’, ‘angry’ and ‘neutral’. Emotion based text to speech system will be beneficial for educational purpose like listening stories from storytelling applications for young budding children. Emotion based text to speech is going to be serviceable for visually impaired individuals.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信