|国家预印本平台
首页|SSD-Poser: Avatar Pose Estimation with State Space Duality from Sparse Observations

SSD-Poser: Avatar Pose Estimation with State Space Duality from Sparse Observations

SSD-Poser: Avatar Pose Estimation with State Space Duality from Sparse Observations

来源:Arxiv_logoArxiv
英文摘要

The growing applications of AR/VR increase the demand for real-time full-body pose estimation from Head-Mounted Displays (HMDs). Although HMDs provide joint signals from the head and hands, reconstructing a full-body pose remains challenging due to the unconstrained lower body. Recent advancements often rely on conventional neural networks and generative models to improve performance in this task, such as Transformers and diffusion models. However, these approaches struggle to strike a balance between achieving precise pose reconstruction and maintaining fast inference speed. To overcome these challenges, a lightweight and efficient model, SSD-Poser, is designed for robust full-body motion estimation from sparse observations. SSD-Poser incorporates a well-designed hybrid encoder, State Space Attention Encoders, to adapt the state space duality to complex motion poses and enable real-time realistic pose reconstruction. Moreover, a Frequency-Aware Decoder is introduced to mitigate jitter caused by variable-frequency motion signals, remarkably enhancing the motion smoothness. Comprehensive experiments on the AMASS dataset demonstrate that SSD-Poser achieves exceptional accuracy and computational efficiency, showing outstanding inference efficiency compared to state-of-the-art methods.

Shuting Zhao、Liangjing Shao、Ye Zhang、Linxin Bai、Xinrong Chen

计算技术、计算机技术

Shuting Zhao,Liangjing Shao,Ye Zhang,Linxin Bai,Xinrong Chen.SSD-Poser: Avatar Pose Estimation with State Space Duality from Sparse Observations[EB/OL].(2025-04-25)[2025-07-16].https://arxiv.org/abs/2504.18332.点此复制

评论