|国家预印本平台
首页|Tackling Size Generalization of Graph Neural Networks on Biological Data from a Spectral Perspective

Tackling Size Generalization of Graph Neural Networks on Biological Data from a Spectral Perspective

Tackling Size Generalization of Graph Neural Networks on Biological Data from a Spectral Perspective

来源:Arxiv_logoArxiv
英文摘要

We address the key challenge of size-induced distribution shifts in graph neural networks (GNNs) and their impact on the generalization of GNNs to larger graphs. Existing literature operates under diverse assumptions about distribution shifts, resulting in varying conclusions about the generalizability of GNNs. In contrast to prior work, we adopt a data-driven approach to identify and characterize the types of size-induced distribution shifts and explore their impact on GNN performance from a spectral standpoint, a perspective that has been largely underexplored. Leveraging the significant variance in graph sizes in real biological datasets, we analyze biological graphs and find that spectral differences, driven by subgraph patterns (e.g., average cycle length), strongly correlate with GNN performance on larger, unseen graphs. Based on these insights, we propose three model-agnostic strategies to enhance GNNs' awareness of critical subgraph patterns, identifying size-intensive attention as the most effective approach. Extensive experiments with six GNN architectures and seven model-agnostic strategies across five datasets show that our size-intensive attention strategy significantly improves graph classification on test graphs 2 to 10 times larger than the training graphs, boosting F1 scores by up to 8% over strong baselines.

Gaotang Li、Danai Koutra、Yujun Yan

生物科学研究方法、生物科学研究技术计算技术、计算机技术

Gaotang Li,Danai Koutra,Yujun Yan.Tackling Size Generalization of Graph Neural Networks on Biological Data from a Spectral Perspective[EB/OL].(2025-08-01)[2025-08-11].https://arxiv.org/abs/2305.15611.点此复制

评论