AniTales: End-to-End Multimodal Story Generation Through Natural Language Prompting (Student Abstract)

42181@AAAI

Total: 1

#1 AniTales: End-to-End Multimodal Story Generation Through Natural Language Prompting (Student Abstract) [PDF] [Copy] [Kimi] [REL]

Authors: Mrigendra Agrawal, Yunze Xiao

We present AniTales, a system designed to generate multimodal visual novels from natural language prompts. Our system integrates large language models for story generation, diffusion models for character art, and text-to-speech for voice acting. This paper describes the system's architecture and presents findings from a pilot user study. We evaluated the system with general users (n=10) and domain experts (n=5), focusing on usability, coherence, and visual consistency. General users reported high usability (SUS: 84/100) and strong character-dialogue consistency (4.2/5), along with an average score of 82/100 for their intention to continue using the platform. These initial results suggest AniTales is a promising approach for bridging the gap between text-based AI storytelling and end-to-end multimedia content creation.

Subject: AAAI.2026 - Student Abstract and Poster Program