LEARN: A Story-Driven Layout-to-Image Generation Framework for STEM Instruction
LEARN: A Story-Driven Layout-to-Image Generation Framework for STEM Instruction
LEARN is a layout-aware diffusion framework designed to generate pedagogically aligned illustrations for STEM education. It leverages a curated BookCover dataset that provides narrative layouts and structured visual cues, enabling the model to depict abstract and sequential scientific concepts with strong semantic alignment. Through layout-conditioned generation, contrastive visual-semantic training, and prompt modulation, LEARN produces coherent visual sequences that support mid-to-high-level reasoning in line with Bloom's taxonomy while reducing extraneous cognitive load as emphasized by Cognitive Load Theory. By fostering spatially organized and story-driven narratives, the framework counters fragmented attention often induced by short-form media and promotes sustained conceptual focus. Beyond static diagrams, LEARN demonstrates potential for integration with multimodal systems and curriculum-linked knowledge graphs to create adaptive, exploratory educational content. As the first generative approach to unify layout-based storytelling, semantic structure learning, and cognitive scaffolding, LEARN represents a novel direction for generative AI in education. The code and dataset will be released to facilitate future research and practical deployment.
Maoquan Zhang、Bisser Raytchev、Xiujuan Sun
教育信息传播、知识传播计算技术、计算机技术
Maoquan Zhang,Bisser Raytchev,Xiujuan Sun.LEARN: A Story-Driven Layout-to-Image Generation Framework for STEM Instruction[EB/OL].(2025-08-15)[2025-08-28].https://arxiv.org/abs/2508.11153.点此复制
评论