The Art of Storytelling: Multi-Agent Generative AI for Dynamic Multimodal Narratives

arXiv - CS - Computation and Language Pub Date : 2024-09-17 DOI:arxiv-2409.11261

Samee Arif, Taimoor Arif, Aamina Jamal Khan, Muhammad Saad Haroon, Agha Ali Raza, Awais Athar

引用次数: 0

Abstract

This paper introduces the concept of an education tool that utilizes Generative Artificial Intelligence (GenAI) to enhance storytelling for children. The system combines GenAI-driven narrative co-creation, text-to-speech conversion, and text-to-video generation to produce an engaging experience for learners. We describe the co-creation process, the adaptation of narratives into spoken words using text-to-speech models, and the transformation of these narratives into contextually relevant visuals through text-to-video technology. Our evaluation covers the linguistics of the generated stories, the text-to-speech conversion quality, and the accuracy of the generated visuals.

查看原文本刊更多论文

讲故事的艺术：用于动态多模态叙事的多代理生成式人工智能

本文介绍了一种教育工具的概念，该工具利用生成人工智能（GenAI）来增强儿童讲故事的能力。该系统结合了 GenAI 驱动的叙事共创、文本到语音的转换以及文本到视频的生成，为学习者带来了引人入胜的体验。我们描述了共同创作过程、使用文本到语音模型将叙述改编成口语，以及通过文本到视频技术将这些叙述转换成与上下文相关的视觉效果。我们的评估包括生成故事的语言学、文本到语音的转换质量以及生成视觉效果的准确性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

arXiv - CS - Computation and Language

自引率

0.00%

发文量