Optimizing the Optimizer for Physics-Informed Neural Networks and Kolmogorov-Arnold Networks
Physics-Informed Neural Networks (PINNs) have revolutionized the computation of PDE solutions by integrating partial differential equations (PDEs) into the neural network's training process as soft constraints, becoming an important component of the scientific machine learning (SciML) ecosystem. More recently, physics-informed Kolmogorov-Arnold networks (PIKANs) have also been shown to be effective and comparable in accuracy with PINNs. In their current implementation, both PINNs and PIKANs are mainly optimized using first-order methods like Adam, as well as quasi-Newton methods such as BFGS and its limited-memory variant, L-BFGS. However, these optimizers often struggle with highly non-linear and non-convex loss landscapes, leading to challenges such as slow convergence, entrapment in local minima, and (non)degenerate saddle points. In this study, we investigate the performance of the Self-Scaled BFGS (SSBFGS) and Self-Scaled Broyden (SSBroyden) methods and other advanced quasi-Newton schemes, including BFGS and L-BFGS with different line search strategies. These methods dynamically rescale updates based on historical gradient information, thus enhancing training efficiency and accuracy. We systematically compare these optimizers using both PINNs and PIKANs on key challenging PDEs, including the Burgers, Allen-Cahn, Kuramoto-Sivashinsky, Ginzburg-Landau, and Stokes equations. Additionally, we evaluate the performance of SSBFGS and SSBroyden for Deep Operator Network (DeepONet) architectures, demonstrating their effectiveness for data-driven operator learning. Our findings provide state-of-the-art results with orders-of-magnitude accuracy improvements without the use of adaptive weights or any other enhancements typically employed in PINNs.
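For orientation, the two ideas named in the abstract (PDEs as soft constraints, and rescaling a quasi-Newton update) can be written out in standard textbook notation. This is a minimal sketch and not the paper's own formulation: the symbols \lambda_r, \lambda_b, \mathcal{N}, \mathcal{B}, f, g and the particular scaling factor \gamma_k are assumptions chosen for illustration, and the scaling used by SSBFGS and SSBroyden in the paper may differ.

```latex
% Soft-constraint PINN loss (assumed notation): u_\theta is the network,
% \mathcal{N} the PDE operator with source f, \mathcal{B} the boundary/initial
% operator with data g, and \lambda_r, \lambda_b are fixed weights.
\mathcal{L}(\theta)
  = \frac{\lambda_r}{N_r} \sum_{i=1}^{N_r}
      \bigl| \mathcal{N}[u_\theta](x_i) - f(x_i) \bigr|^2
  + \frac{\lambda_b}{N_b} \sum_{j=1}^{N_b}
      \bigl| \mathcal{B}[u_\theta](x_j) - g(x_j) \bigr|^2

% Quasi-Newton quantities built from successive iterates and gradients.
s_k = \theta_{k+1} - \theta_k, \qquad
y_k = \nabla \mathcal{L}(\theta_{k+1}) - \nabla \mathcal{L}(\theta_k), \qquad
\rho_k = \frac{1}{y_k^{\top} s_k}

% Self-scaled BFGS update of the inverse-Hessian approximation H_k:
% H_k is rescaled by \gamma_k before the usual BFGS correction is applied.
H_{k+1}
  = \bigl( I - \rho_k s_k y_k^{\top} \bigr)
    \bigl( \gamma_k H_k \bigr)
    \bigl( I - \rho_k y_k s_k^{\top} \bigr)
  + \rho_k s_k s_k^{\top}

% One classical choice of scaling factor (assumed here for illustration).
\gamma_k = \frac{y_k^{\top} s_k}{y_k^{\top} H_k y_k}
```

Setting \gamma_k = 1 recovers the standard BFGS inverse update, which makes explicit that the self-scaled variant differs only in how the previous curvature estimate is weighted against the newest gradient difference.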
George Em Karniadakis, Khemraj Shukla, Jorge F. Urbán, Elham Kiyani, Jérôme Darbon
Physics; Computing Technology and Computer Technology
George Em Karniadakis, Khemraj Shukla, Jorge F. Urbán, Elham Kiyani, Jérôme Darbon. Optimizing the Optimizer for Physics-Informed Neural Networks and Kolmogorov-Arnold Networks [EB/OL]. (2025-08-24) [2025-09-07]. https://arxiv.org/abs/2501.16371.