|国家预印本平台
首页|Self-Attention Based Multi-Scale Graph Auto-Encoder Network of 3D Meshes

Self-Attention Based Multi-Scale Graph Auto-Encoder Network of 3D Meshes

Self-Attention Based Multi-Scale Graph Auto-Encoder Network of 3D Meshes

来源:Arxiv_logoArxiv
英文摘要

3D meshes are fundamental data representations for capturing complex geometric shapes in computer vision and graphics applications. While Convolutional Neural Networks (CNNs) have excelled in structured data like images, extending them to irregular 3D meshes is challenging due to the non-Euclidean nature of the data. Graph Convolutional Networks (GCNs) offer a solution by applying convolutions to graph-structured data, but many existing methods rely on isotropic filters or spectral decomposition, limiting their ability to capture both local and global mesh features. In this paper, we introduce 3D Geometric Mesh Network (3DGeoMeshNet), a novel GCN-based framework that uses anisotropic convolution layers to effectively learn both global and local features directly in the spatial domain. Unlike previous approaches that convert meshes into intermediate representations like voxel grids or point clouds, our method preserves the original polygonal mesh format throughout the reconstruction process, enabling more accurate shape reconstruction. Our architecture features a multi-scale encoder-decoder structure, where separate global and local pathways capture both large-scale geometric structures and fine-grained local details. Extensive experiments on the COMA dataset containing human faces demonstrate the efficiency of 3DGeoMeshNet in terms of reconstruction accuracy.

Saqib Nazir、Olivier Lézoray、Sébastien Bougleux

计算技术、计算机技术

Saqib Nazir,Olivier Lézoray,Sébastien Bougleux.Self-Attention Based Multi-Scale Graph Auto-Encoder Network of 3D Meshes[EB/OL].(2025-07-07)[2025-07-21].https://arxiv.org/abs/2507.05304.点此复制

评论