Skip to content

Unveiling Multi-Regime Patterns in SciML: Diverse Failure Modes and Domain-Specific Optimization

Conference: ICML 2026
arXiv: 2605.29153
Code: https://github.com/leastima/sciml_multi_regime
Area: Scientific Computing / Neural Network Optimization / Loss Landscape Analysis
Keywords: SciML, Multi-domain Analysis, PINN, Failure Modes, Loss Landscape

TL;DR

This paper reveals three consistent failure modes in SciML models (PINNs, neural operators, etc.) through a systematic multi-domain diagnostic framework and analyzes their loss landscape specificities to guide optimization method selection.

Background & Motivation

Limitations of Prior Work: SciML methods such as PINNs, Fourier Neural Operators (FNO), and Neural ODEs encounter optimization difficulties and generalization failures in practical applications, yet a systematic framework for diagnosing these failure modes is lacking.

Key Insight: The structure of SciML loss landscapes is more complex than that of Computer Vision (CV), characterized by sharp minima, a lack of connectivity, and large Hessian eigenvalues—properties that contradict common intuitions in CV.

Key Challenge: Standard Hessian-loss correlation fails in SciML—low training loss does not necessarily correspond to low curvature, and high curvature does not necessarily imply poor training performance.

Goal: To establish a unified multi-domain diagnostic framework for understanding the structural roots of SciML failures and to provide regime-aware guidance for selecting optimization methods.

Method

Overall Architecture

The study proposes a three-dimensional diagnostic framework that systematically varies along three axes: (1) physical domain (PDE coefficients, equation types); (2) data domain (number of training samples/collocation points); and (3) optimization domain (optimizer choices, constraint handling strategies). By jointly analyzing training loss, test error, and loss landscape geometry, the framework automatically extracts regime boundaries.

Key Designs

  1. Three-Domain Regime Labeling:

    • Function: Categorizes SciML models under different physical-data-optimization configurations into Well-Trained (Regime I, low training and test error), Under-Trained (Regime II, high training and test error), and Over-Trained (Regime III, low training error but high test error).
    • Mechanism: Automatically partitions regimes using training and test error thresholds \(T_{\text{train}}\) and \(T_{\text{test}}\), with boundary robustness assessed via \(\pm 20\%\) threshold perturbations.
    • Design Motivation: Provides a task-oblivious perspective on failure modes, bypassing the limitations of solely examining task-level performance.
  2. Loss Landscape Pathological Phenomenon Detection:

    • Function: Identifies two types of non-intuitive phenomena: (a) Deceptive Sharpness: high Hessian eigenvalues corresponding to low training loss; (b) Deceptive Flatness: low Hessian eigenvalues masking high training loss.
    • Mechanism: Concurrently tracks the dynamic curves of \(\lambda_{\max}\) (maximum eigenvalue) and training loss, discovering that both move in the same direction during an "Increasing Sharpening" phase. Hessian spectral density estimation reveals that PINNs lack the zero-eigenvalue peaks typically found in CV models.
    • Design Motivation: Uncovers the fundamental reasons why standard CV-inspired landscape intuitions (e.g., "flat minima are better") fail in SciML.
  3. Regime-Aware Optimization Effectiveness Analysis:

    • Function: Systematically compares the performance of Adam, L-BFGS, NNCG, ALM, RoPINN, and CL across different regimes.
    • Mechanism: Generates 2D regime heatmaps (physical parameters × data volume) for each optimization method to calculate relative performance gains. NNCG improves test error by approximately 50% compared to L-BFGS in Regime I but remains unstable in Regimes II and III.
    • Design Motivation: Demonstrates that no single optimizer is universally optimal, necessitating targeted selection based on the current regime.

Key Experimental Results

Regime Structure Consistency Verification

Model Dataset Regime I Regime II Regime III Key Phenomenon
PINN 1D Convection \(\beta < 25\) \(25 \leq \beta < 50\) \(\beta \geq 50\) (Sparse) Increasing physical parameters shifts boundaries right
FNO 2D Advection-Diffusion Sufficient samples Moderate pressure Sparse samples Smooth transitions instead of the sharp boundaries seen in PINNs
PINODE Nonlinear Pendulum Standard config High-dimensional Low data Intermediate characteristics between PINN and FNO

Optimization Method Effectiveness Comparison

Optimization Method Regime I Regime II Regime III Best Application Scenarios
L-BFGS ✓ (Prone to failure) Basic training
ALM ✓✓ (Constraint hardening) Constraint-critical problems
CL (Curriculum Learning) ✓✓ Difficult configurations
NNCG ✓✓ (+50%) ✗ (Unstable) Regime I fine-tuning

Highlights & Insights

  • Counter-intuitive Design of Deceptive Sharpness: Reveals that high-curvature regions in SciML can actually correspond to optimal solutions, contradicting the "flat minima are better" hypothesis prevalent in CV.
  • Breakdown of Hessian-Loss Correlation: Quantitatively proves through spectral density comparison (PINNs lack zero-eigenvalue peaks compared to ResNet) that SciML landscapes are fundamentally different from CV landscapes.
  • Universal Failure Mode Framework: Although PINN, FNO, and Neural ODE architectures differ significantly, consistent three-domain regime structures emerge across all, indicating that these failure modes are systemic issues in SciML.

Limitations & Future Work

  • Experiments were primarily conducted on small-scale 1D/2D problems; generalization to large-scale 3D PDEs remains to be verified.
  • The high computational cost of Hessian matrices makes it difficult to scale the framework to very large models.
  • The location of regime boundaries varies significantly with different PDE coefficients, making it hard to define universal thresholds.
  • Future Directions: Design adaptive regime detection algorithms to identify the current regime online and automatically switch optimization strategies accordingly.
  • vs. Loss Landscape Research (Yang et al.): Previous studies focus on landscape connectivity in CV/NLP; this paper finds that SciML lacks these properties and requires specialized diagnostic tools.
  • vs. PINN Optimization Research (Krishnapriyan et al.): Prior work offers fragmented analyses of local failure phenomena in PINNs; this paper provides a unified multi-domain perspective and quantitative regime labeling.

Rating

  • Novelty: ⭐⭐⭐⭐ Extending landscape diagnostics from CV to SciML with a systematic multi-dimensional analysis is highly innovative.
  • Experimental Thoroughness: ⭐⭐⭐⭐⭐ Comprehensive coverage of 5 SciML models, 4 PDEs, and 5 optimization methods.
  • Writing Quality: ⭐⭐⭐⭐ Clear logic with rich visualizations.
  • Value: ⭐⭐⭐⭐ Provides SciML practitioners with clear guidelines for optimizer selection and a robust failure diagnostic tool.