|国家预印本平台
首页|Structured Task Solving via Modular Embodied Intelligence: A Case Study on Rubik's Cube

Structured Task Solving via Modular Embodied Intelligence: A Case Study on Rubik's Cube

Structured Task Solving via Modular Embodied Intelligence: A Case Study on Rubik's Cube

来源:Arxiv_logoArxiv
英文摘要

This paper presents Auto-RubikAI, a modular autonomous planning framework that integrates a symbolic Knowledge Base (KB), a vision-language model (VLM), and a large language model (LLM) to solve structured manipulation tasks exemplified by Rubik's Cube restoration. Unlike traditional robot systems based on predefined scripts, or modern approaches relying on pretrained networks and large-scale demonstration data, Auto-RubikAI enables interpretable, multi-step task execution with minimal data requirements and no prior demonstrations. The proposed system employs a KB module to solve group-theoretic restoration steps, overcoming LLMs' limitations in symbolic reasoning. A VLM parses RGB-D input to construct a semantic 3D scene representation, while the LLM generates structured robotic control code via prompt chaining. This tri-module architecture enables robust performance under spatial uncertainty. We deploy Auto-RubikAI in both simulation and real-world settings using a 7-DOF robotic arm, demonstrating effective Sim-to-Real adaptation without retraining. Experiments show a 79% end-to-end task success rate across randomized configurations. Compared to CFOP, DeepCubeA, and Two-Phase baselines, our KB-enhanced method reduces average solution steps while maintaining interpretability and safety. Auto-RubikAI provides a cost-efficient, modular foundation for embodied task planning in smart manufacturing, robotics education, and autonomous execution scenarios. Code, prompts, and hardware modules will be released upon publication.

Chongshan Fan、Shenghai Yuan

自动化技术、自动化技术设备计算技术、计算机技术

Chongshan Fan,Shenghai Yuan.Structured Task Solving via Modular Embodied Intelligence: A Case Study on Rubik's Cube[EB/OL].(2025-07-08)[2025-08-02].https://arxiv.org/abs/2507.05607.点此复制

评论