Curly Flow Matching for Learning Non-gradient Field Dynamics¶
Conference: NeurIPS 2025
arXiv: 2510.26645
Code: GitHub
Area: Computational Biology
Keywords: Flow Matching, Schrödinger Bridge, Non-gradient Field Dynamics, Trajectory Inference, Optimal Transport
TL;DR¶
The authors propose Curly Flow Matching (Curly-FM), which designs a Schrödinger Bridge problem with a non-zero reference drift. This allows flow matching to learn non-gradient field dynamics, such as periodic and rotational behaviors, breaking the limitation of traditional methods that can only model gradient fields.
Background & Motivation¶
Modeling the transport dynamics of natural processes from population-level observations is a fundamental problem in the natural sciences. Current mainstream methods (such as Conditional Flow Matching, OT-CFM) rely on the principle of least action, assuming that systems follow gradient field dynamics, where trajectories minimize the energy functional between two probability measures.
However, many real-world systems naturally exhibit non-gradient field and periodic behaviors: - Single-cell RNA cell cycle: Cells move along periodic trajectories in the gene expression space. - Computational fluid dynamics: Vortex structures generate rotational flow fields. - Ocean currents: Complex rotational and periodic patterns are observed.
These non-gradient (curl) components are fundamentally uncapturable by existing flow matching and bridge matching methods. Trajectories generated by traditional methods are straight-line interpolations, completely ignoring the true dynamics of the underlying system.
Method¶
Overall Architecture¶
Curly-FM is a two-stage, simulation-free framework:
First Stage — Learning the Reference Drift (Algorithm 1): A neural path interpolant is utilized to learn the drift of the reference process, allowing the generated paths to curve and match the known velocity field information.
Second Stage — Solving the Transportation Plan (Algorithm 2): Based on the learned reference drift, the optimal coupling is computed via a marginal flow matching objective to train the final drift model.
Key Designs¶
1. Non-Zero Drift Reference Process
Unlike traditional Schrödinger Bridge methods that use zero-drift (Brownian motion) reference processes, Curly-FM constructs a reference process \(Q\) with a non-zero drift, which is built from velocity information inferred from the data. Specifically, the reference drift \(f_t\) is a continuous field obtained by smoothing velocities at discrete observation points using a kernel function \(\kappa_t\).
2. Neural Path Interpolant
A neural network \(\phi_{t,\eta}(x_0, x_1)\) is employed to parameterize the mean of the conditional bridge, ensuring that it is no longer a straight line between \(x_0\) and \(x_1\), but rather a curved path. The conditional distribution is formulated as:
The interpolant matches the reference drift by minimizing \(\mathbb{E}[\|\partial_t X_{t,\eta} - f_t(X_{t,\eta})\|^2]\).
3. Optimal Coupling Computation
The learned neural path interpolant is used to estimate the OT cost \(c(x_0, x_1)\), and the coupling \(\pi^*\) is obtained by solving the entropy-regularized optimal transport using a mini-batch OT approximation. The cost is approximated by Monte Carlo estimation of the path length integral.
Loss & Training¶
Stage 1 Loss (learning reference drift matching): $\(\mathcal{L}(\eta) = \mathbb{E}_{(X_0, X_1, t)}\left[\|\partial_t X_{t,\eta} - f_t(X_{t,\eta})\|^2\right]\)$
Stage 2 Loss (flow matching + score matching):
flow_loss = E[||v_t(x_t, t) - x_dot_t||^2]
score_loss = E[||(lambda_t * s_t(x_t, t) + eps)||^2]
loss = flow_loss + score_loss
Where the score model is used to support cases with stochasticity (\(\sigma > 0\)).
Inference: Forward propagation is performed via SDE/ODE integration using the learned drift model.
Key Experimental Results¶
Main Results¶
Table 1: Single-cell RNA Trajectory Inference (Cell Cycle Data)
| Method | W1 ↓ | W2 ↓ | Cosine Dist ↓ |
|---|---|---|---|
| CFM | High | High | ~1.0 (straight trajectories) |
| OT-CFM | High | High | ~1.0 |
| CurlyFM | Lowest | Lowest | ~0.03 |
Table 2: Ocean Current Trajectory Inference
| Method | W1 ↓ | Cosine Dist ↓ | Compute Time (h) |
|---|---|---|---|
| DM-SB | - | - | 15.44 |
| TrajectoryNet | - | - | 7.44 |
| SBIRR | - | - | 4.67 |
| Vanilla-SB | - | - | 0.43 |
| CurlyFM | 0.062±0.003 | 0.034±0.006 | 0.06 |
On the ocean current task, CurlyFM requires only about 4 minutes of training, which is ~78x faster than SBIRR and ~124x faster than TrajectoryNet.
Ablation Study¶
Robustness to Noisy Reference Drift (Table R2)
| Noise Ratio β | W1 ↓ | W2 ↓ | Cosine Dist ↓ |
|---|---|---|---|
| 0.00 | 0.062±0.003 | 0.143±0.010 | 0.034±0.006 |
| 0.25 | 0.057±0.033 | 0.021±0.036 | 0.051±0.030 |
| 0.50 | 0.087±0.047 | 0.301±0.085 | 0.091±0.046 |
| 0.75 | 0.261±0.123 | 0.381±0.120 | 0.145±0.062 |
| 1.00 | 0.428±0.157 | 0.445±0.121 | 0.237±0.079 |
CurlyFM maintains robust performance under a 25% noise level, demonstrating resilience to approximation errors in the reference velocity field.
Stochasticity Ablation (Impact of σ)
| σ | W1 ↓ | W2 ↓ | Cosine Dist ↓ |
|---|---|---|---|
| 0.01 | 0.061±0.003 | 0.141±0.009 | 0.028±0.066 |
| 0.10 | 0.062±0.002 | 0.145±0.011 | 0.066±0.008 |
| 1.00 | 0.145±0.009 | 0.474±0.058 | 0.871±0.048 |
The low-stochasticity setting achieves the best performance, which is consistent with the characteristic of low noise levels in practical applications.
Key Findings¶
- Fundamental limitation of traditional flow matching: OT-CFM yields a cosine distance close to 1.0, indicating that its trajectories are almost purely linear, failing completely to recover rotational dynamics.
- Significant efficiency advantages: As a simulation-free method, CurlyFM is 1-2 orders of magnitude faster than simulation-based methods such as TrajectoryNet and SBIRR.
- Multi-marginal expansion: CurlyFM naturally extends to multi-time-point setups, training alternately on adjacent marginal pairs.
- Comparison with GSBM: CurlyFM outperforms GSBM (with 15 control points) in terms of W1 and cosine distance, though with a slightly higher computational cost.
Highlights & Insights¶
- Core Insight: Incorporating a non-zero drift reference process into the Schrödinger Bridge framework overcomes the limitation that flow matching can only model gradient fields.
- Elegant Two-Stage Design: Fitting the reference drift first and then solving the transport problem avoids the convergence difficulties typical of iterative algorithms.
- High Practicality: It only requires velocity information (e.g., RNA velocity) at the data sampling points, from which a continuous reference field can be constructed through kernel smoothing.
- Simulation-free: Training efficiency is improved by dozens of times compared to methods requiring backpropagation through time or simulation.
Limitations & Future Work¶
- Dependency on Velocity Field Quality: Performance relies on the approximation accuracy of the reference velocity field. The method cannot be directly applied when velocity information is unavailable.
- Focus on Determinism: Main experiments are conducted in the deterministic limit (\(\sigma = 0\)), and performance drops significantly in highly stochastic scenarios.
- Limited Theoretical Guarantees: The method relies on practical approximations (modeling conditional bridges as Brownian bridges, mini-batch OT) and lacks strict convergence guarantees.
- Kernel Selection: The choice of kernel \(\kappa\) impacts the quality of the reference drift, but the paper lacks systematic guidance on kernel selection.
Related Work & Insights¶
- Flow Matching (Lipman et al. 2023): The foundational framework for Curly-FM.
- OT-CFM (Tong et al. 2024): Mini-batch OT + CFM, but limited to gradient fields.
- DSBM (Shi et al. 2024): Diffusion Schrödinger Bridge Matching, dealing with zero-drift references.
- GSBM (Liu et al. 2024): Generalized SBM using spline interpolation, but Curly-FM is more flexible with neural networks.
- TrajectoryNet (Tong et al. 2020): ODE-based trajectory inference requiring simulations and incurring high computational costs.
Rating¶
| Dimension | Score (1-5) |
|---|---|
| Novelty | 4 — The non-zero drift reference process is a key innovation |
| Technical Quality | 4 — The method is sound, although some theoretical details require improvement |
| Experimental Thoroughness | 4 — Validation across multiple scenarios with comprehensive ablation studies |
| Writing Quality | 3 — The method description (Section 3.1) is somewhat difficult to follow |
| Impact | 4 — Holds direct application value for fields like single-cell biology |