|国家预印本平台
首页|Rodrigues Network for Learning Robot Actions

Rodrigues Network for Learning Robot Actions

Rodrigues Network for Learning Robot Actions

来源:Arxiv_logoArxiv
英文摘要

Understanding and predicting articulated actions is important in robot learning. However, common architectures such as MLPs and Transformers lack inductive biases that reflect the underlying kinematic structure of articulated systems. To this end, we propose the Neural Rodrigues Operator, a learnable generalization of the classical forward kinematics operation, designed to inject kinematics-aware inductive bias into neural computation. Building on this operator, we design the Rodrigues Network (RodriNet), a novel neural architecture specialized for processing actions. We evaluate the expressivity of our network on two synthetic tasks on kinematic and motion prediction, showing significant improvements compared to standard backbones. We further demonstrate its effectiveness in two realistic applications: (i) imitation learning on robotic benchmarks with the Diffusion Policy, and (ii) single-image 3D hand reconstruction. Our results suggest that integrating structured kinematic priors into the network architecture improves action learning in various domains.

Jialiang Zhang、Haoran Geng、Yang You、Congyue Deng、Pieter Abbeel、Jitendra Malik、Leonidas Guibas

自动化基础理论计算技术、计算机技术

Jialiang Zhang,Haoran Geng,Yang You,Congyue Deng,Pieter Abbeel,Jitendra Malik,Leonidas Guibas.Rodrigues Network for Learning Robot Actions[EB/OL].(2025-06-03)[2025-08-02].https://arxiv.org/abs/2506.02618.点此复制

评论