|国家预印本平台
| 注册
首页|LiTo: Surface Light Field Tokenization

LiTo: Surface Light Field Tokenization

Jen-Hao Rick Chang Xiaoming Zhao Dorian Chan Oncel Tuzel

Arxiv_logoArxiv

LiTo: Surface Light Field Tokenization

Jen-Hao Rick Chang Xiaoming Zhao Dorian Chan Oncel Tuzel

作者信息

Abstract

We propose a 3D latent representation that jointly models object geometry and view-dependent appearance. Most prior works focus on either reconstructing 3D geometry or predicting view-independent diffuse appearance, and thus struggle to capture realistic view-dependent effects. Our approach leverages that RGB-depth images provide samples of a surface light field. By encoding random subsamples of this surface light field into a compact set of latent vectors, our model learns to represent both geometry and appearance within a unified 3D latent space. This representation reproduces view-dependent effects such as specular highlights and Fresnel reflections under complex lighting. We further train a latent flow matching model on this representation to learn its distribution conditioned on a single input image, enabling the generation of 3D objects with appearances consistent with the lighting and materials in the input. Experiments show that our approach achieves higher visual quality and better input fidelity than existing methods.

引用本文复制引用

Jen-Hao Rick Chang,Xiaoming Zhao,Dorian Chan,Oncel Tuzel.LiTo: Surface Light Field Tokenization[EB/OL].(2026-03-11)[2026-03-13].https://arxiv.org/abs/2603.11047.

学科分类

计算技术、计算机技术

评论

首发时间 2026-03-11
下载量:0
|
点击量:4
段落导航相关论文