首页|Tree-NeRV: A Tree-Structured Neural Representation for Efficient Non-Uniform Video Encoding

Tree-NeRV: A Tree-Structured Neural Representation for Efficient Non-Uniform Video Encoding

来源：

英文摘要

Implicit Neural Representations for Videos (NeRV) have emerged as a powerful paradigm for video representation, enabling direct mappings from frame indices to video frames. However, existing NeRV-based methods do not fully exploit temporal redundancy, as they rely on uniform sampling along the temporal axis, leading to suboptimal rate-distortion (RD) performance. To address this limitation, we propose Tree-NeRV, a novel tree-structured feature representation for efficient and adaptive video encoding. Unlike conventional approaches, Tree-NeRV organizes feature representations within a Binary Search Tree (BST), enabling non-uniform sampling along the temporal axis. Additionally, we introduce an optimization-driven sampling strategy, dynamically allocating higher sampling density to regions with greater temporal variation. Extensive experiments demonstrate that Tree-NeRV achieves superior compression efficiency and reconstruction quality, outperforming prior uniform sampling-based methods. Code will be released.

作者：Jiancheng Zhao、Yifan Zhan、Qingtian Zhu、Mingze Ma、Muyao Niu、Zunian Wan、Xiang Ji、Yinqiang Zheng

作者单位：

学科分类：计算技术、计算机技术

推荐引用：Jiancheng Zhao,Yifan Zhan,Qingtian Zhu,Mingze Ma,Muyao Niu,Zunian Wan,Xiang Ji,Yinqiang Zheng.Tree-NeRV: A Tree-Structured Neural Representation for Efficient Non-Uniform Video Encoding[EB/OL].(2025-04-17)[2025-05-05].https://arxiv.org/abs/2504.12899.点此复制

Tree-NeRV: A Tree-Structured Neural Representation for Efficient Non-Uniform Video Encoding

Tree-NeRV: A Tree-Structured Neural Representation for Efficient Non-Uniform Video Encoding

评论