|国家预印本平台
首页|FunHOI: Annotation-Free 3D Hand-Object Interaction Generation via Functional Text Guidanc

FunHOI: Annotation-Free 3D Hand-Object Interaction Generation via Functional Text Guidanc

FunHOI: Annotation-Free 3D Hand-Object Interaction Generation via Functional Text Guidanc

来源:Arxiv_logoArxiv
英文摘要

Hand-object interaction(HOI) is the fundamental link between human and environment, yet its dexterous and complex pose significantly challenges for gesture control. Despite significant advances in AI and robotics, enabling machines to understand and simulate hand-object interactions, capturing the semantics of functional grasping tasks remains a considerable challenge. While previous work can generate stable and correct 3D grasps, they are still far from achieving functional grasps due to unconsidered grasp semantics. To address this challenge, we propose an innovative two-stage framework, Functional Grasp Synthesis Net (FGS-Net), for generating 3D HOI driven by functional text. This framework consists of a text-guided 3D model generator, Functional Grasp Generator (FGG), and a pose optimization strategy, Functional Grasp Refiner (FGR). FGG generates 3D models of hands and objects based on text input, while FGR fine-tunes the poses using Object Pose Approximator and energy functions to ensure the relative position between the hand and object aligns with human intent and remains physically plausible. Extensive experiments demonstrate that our approach achieves precise and high-quality HOI generation without requiring additional 3D annotation data.

Linji Hao、Xueyu Sun、Yongqi Tian、Ning Ding、Haoyuan He、Caigui Jiang

计算技术、计算机技术自动化技术、自动化技术设备

Linji Hao,Xueyu Sun,Yongqi Tian,Ning Ding,Haoyuan He,Caigui Jiang.FunHOI: Annotation-Free 3D Hand-Object Interaction Generation via Functional Text Guidanc[EB/OL].(2025-07-10)[2025-07-25].https://arxiv.org/abs/2502.20805.点此复制

评论