LEARN: A Story-Driven Layout-to-Image Generation Framework for STEM Instruction

Source: arXiv

Abstract

LEARN is a layout-aware diffusion framework designed to generate pedagogically aligned illustrations for STEM education. It leverages a curated BookCover dataset that provides narrative layouts and structured visual cues, enabling the model to depict abstract and sequential scientific concepts with strong semantic alignment. Through layout-conditioned generation, contrastive visual-semantic training, and prompt modulation, LEARN produces coherent visual sequences that support mid-to-high-level reasoning in line with Bloom's taxonomy while reducing extraneous cognitive load as emphasized by Cognitive Load Theory. By fostering spatially organized and story-driven narratives, the framework counters fragmented attention often induced by short-form media and promotes sustained conceptual focus. Beyond static diagrams, LEARN demonstrates potential for integration with multimodal systems and curriculum-linked knowledge graphs to create adaptive, exploratory educational content. As the first generative approach to unify layout-based storytelling, semantic structure learning, and cognitive scaffolding, LEARN represents a novel direction for generative AI in education. The code and dataset will be released to facilitate future research and practical deployment.

Authors: Maoquan Zhang, Bisser Raytchev, Xiujuan Sun

Subject classification: Education; Information and Knowledge Dissemination; Computing and Computer Technology

Recommended citation: Maoquan Zhang, Bisser Raytchev, Xiujuan Sun. LEARN: A Story-Driven Layout-to-Image Generation Framework for STEM Instruction [EB/OL]. (2025-08-15) [2025-08-28]. https://arxiv.org/abs/2508.11153.
