|国家预印本平台
首页|FANeRV: Frequency Separation and Augmentation based Neural Representation for Video

FANeRV: Frequency Separation and Augmentation based Neural Representation for Video

FANeRV: Frequency Separation and Augmentation based Neural Representation for Video

来源:Arxiv_logoArxiv
英文摘要

Neural representations for video (NeRV) have gained considerable attention for their strong performance across various video tasks. However, existing NeRV methods often struggle to capture fine spatial details, resulting in vague reconstructions. In this paper, we present a Frequency Separation and Augmentation based Neural Representation for video (FANeRV), which addresses these limitations with its core Wavelet Frequency Upgrade Block. This block explicitly separates input frames into high and low-frequency components using discrete wavelet transform, followed by targeted enhancement using specialized modules. Finally, a specially designed gated network effectively fuses these frequency components for optimal reconstruction. Additionally, convolutional residual enhancement blocks are integrated into the later stages of the network to balance parameter distribution and improve the restoration of high-frequency details. Experimental results demonstrate that FANeRV significantly improves reconstruction performance and excels in multiple tasks, including video compression, inpainting, and interpolation, outperforming existing NeRV methods.

Moncef Gabbouj、Li Yu、Zhihui Li、Chao Yao、Jimin Xiao

计算技术、计算机技术

Moncef Gabbouj,Li Yu,Zhihui Li,Chao Yao,Jimin Xiao.FANeRV: Frequency Separation and Augmentation based Neural Representation for Video[EB/OL].(2025-04-09)[2025-06-04].https://arxiv.org/abs/2504.06755.点此复制

评论