Joint Relational Database Generation via Graph-Conditional Diffusion Models

Source: arXiv
Abstract

Building generative models for relational databases (RDBs) is important for applications like privacy-preserving data release and augmenting real datasets. However, most prior work either focuses on single-table generation or relies on autoregressive factorizations that impose a fixed table order and generate tables sequentially. This approach limits parallelism, restricts flexibility in downstream applications like missing value imputation, and compounds errors due to commonly made conditional independence assumptions. We propose a fundamentally different approach: jointly modeling all tables in an RDB without imposing any order. By using a natural graph representation of RDBs, we propose the Graph-Conditional Relational Diffusion Model (GRDM). GRDM leverages a graph neural network to jointly denoise row attributes and capture complex inter-table dependencies. Extensive experiments on six real-world RDBs demonstrate that our approach substantially outperforms autoregressive baselines in modeling multi-hop inter-table correlations and achieves state-of-the-art performance on single-table fidelity metrics.

Mohamed Amine Ketata, David Lüdke, Leo Schwinn, Stephan Günnemann

Computing technology, computer technology

Mohamed Amine Ketata, David Lüdke, Leo Schwinn, Stephan Günnemann. Joint Relational Database Generation via Graph-Conditional Diffusion Models [EB/OL]. (2025-05-22) [2025-07-02]. https://arxiv.org/abs/2505.16527.