|国家预印本平台
首页|JOLT3D: Joint Learning of Talking Heads and 3DMM Parameters with Application to Lip-Sync

JOLT3D: Joint Learning of Talking Heads and 3DMM Parameters with Application to Lip-Sync

JOLT3D: Joint Learning of Talking Heads and 3DMM Parameters with Application to Lip-Sync

来源:Arxiv_logoArxiv
英文摘要

In this work, we revisit the effectiveness of 3DMM for talking head synthesis by jointly learning a 3D face reconstruction model and a talking head synthesis model. This enables us to obtain a FACS-based blendshape representation of facial expressions that is optimized for talking head synthesis. This contrasts with previous methods that either fit 3DMM parameters to 2D landmarks or rely on pretrained face reconstruction models. Not only does our approach increase the quality of the generated face, but it also allows us to take advantage of the blendshape representation to modify just the mouth region for the purpose of audio-based lip-sync. To this end, we propose a novel lip-sync pipeline that, unlike previous methods, decouples the original chin contour from the lip-synced chin contour, and reduces flickering near the mouth.

Sungjoon Park、Minsik Park、Haneol Lee、Jaesub Yun、Donggeon Lee

计算技术、计算机技术

Sungjoon Park,Minsik Park,Haneol Lee,Jaesub Yun,Donggeon Lee.JOLT3D: Joint Learning of Talking Heads and 3DMM Parameters with Application to Lip-Sync[EB/OL].(2025-07-28)[2025-08-10].https://arxiv.org/abs/2507.20452.点此复制

评论