{"title":"Humanising Text-to-Speech Through Emotional Expression in Online Courses","authors":"Garron Hillaire, Francisco Iniesto, B. Rienties","doi":"10.5334/JIME.519","DOIUrl":null,"url":null,"abstract":"This paper outlines an innovative approach to evaluating the emotional content of three online courses using the affective computing approach of prosody detection on two different text-to-speech (TTS) voices in conjunction with human raters judging the emotional content of the text. This work intends to establish the potential variation on the emotional delivery of online educational resources through the use of a synthetic voice, which automatically articulates text into audio. Preliminary results from this pilot research suggest that about one out of every three sentences (35%) in a Massive Open Online Course (MOOC) contained emotional text and two existing assistive technology voices had poor emotional alignment when reading this text. Synthetic voices were more likely to be overly negative when considering their expression as compared to the emotional content of the text they are reading, which was most frequently neutral. We also analysed a synthetic voice for which we configured the emotional expression to align with course text, which showed promising improvements.","PeriodicalId":45406,"journal":{"name":"Journal of Interactive Media in Education","volume":null,"pages":null},"PeriodicalIF":2.7000,"publicationDate":"2019-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Interactive Media in Education","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5334/JIME.519","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"EDUCATION & EDUCATIONAL RESEARCH","Score":null,"Total":0}
Citations: 9
Abstract
This paper outlines an innovative approach to evaluating the emotional content of three online courses by applying the affective computing approach of prosody detection to two different text-to-speech (TTS) voices, in conjunction with human raters judging the emotional content of the text. This work intends to establish the potential variation in the emotional delivery of online educational resources through the use of a synthetic voice, which automatically articulates text into audio. Preliminary results from this pilot research suggest that about one out of every three sentences (35%) in a Massive Open Online Course (MOOC) contained emotional text, and that two existing assistive technology voices had poor emotional alignment when reading this text. The synthetic voices' expression tended to be overly negative compared with the emotional content of the text they read, which was most frequently neutral. We also analysed a synthetic voice whose emotional expression we configured to align with the course text, which showed promising improvements.
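To make the alignment idea concrete, the following is a minimal, hypothetical sketch (not the authors' implementation) of how one might compare the emotion label human raters assign to a sentence with the emotion label a prosody detector assigns to the TTS rendering of that sentence. The `Sentence` structure, the label names, and the toy data are illustrative assumptions.

```python
# Hypothetical alignment check: does the emotion expressed by the synthetic
# voice (as judged by prosody detection) match the emotion human raters
# found in the text? All names and data here are illustrative.

from collections import Counter
from dataclasses import dataclass

@dataclass
class Sentence:
    text: str
    text_emotion: str    # label from human raters, e.g. "positive" / "neutral" / "negative"
    voice_emotion: str   # label from prosody detection on the TTS audio

def rate_alignment(sentences):
    """Return the share of sentences whose voice emotion matches the rated
    text emotion, plus counts of (text_emotion, voice_emotion) pairs."""
    pairs = Counter((s.text_emotion, s.voice_emotion) for s in sentences)
    matches = sum(1 for s in sentences if s.text_emotion == s.voice_emotion)
    return matches / len(sentences), pairs

# Toy example: a neutral sentence read with negative prosody counts as
# misaligned, echoing the abstract's finding that the voices skewed overly
# negative on text that was most frequently neutral.
sample = [
    Sentence("Welcome to week one of the course.", "neutral", "negative"),
    Sentence("Great work on the last assignment!", "positive", "positive"),
    Sentence("This topic can be frustrating at first.", "negative", "negative"),
]
score, confusion = rate_alignment(sample)
print(f"alignment: {score:.2f}")   # 0.67 on this toy sample
print(confusion)
```

Configuring a voice's emotional expression to align with the course text, as described for the third voice in the study, would amount to choosing the synthesis emotion from the rated text emotion before generating audio, so that the two labels agree by construction.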