Spring-Gaus: Reconstruction and Simulation of Elastic Objects with Spring-Mass 3D Gaussians¶
Conference: ECCV 2024
arXiv: 2403.09434
Code: https://zlicheng.com/spring_gaus
Area: 3D Vision / Physical Simulation
Keywords: 3D Gaussians, Elastic Objects, Spring-Mass Model, System Identification, Physical Simulation
TL;DR¶
This paper proposes Spring-Gaus, which integrates a learnable 3D spring-mass model into 3D Gaussian Splatting to reconstruct the appearance, geometry, and physical dynamics parameters of elastic objects from multi-view videos, supporting future prediction and simulation under different conditions.
Background & Motivation¶
Background: 3D Gaussian Splatting and its dynamic extensions can reconstruct the temporal variations of object appearance and geometry, but they do not capture physical properties. PAC-NeRF attempts to integrate Material Point Method (MPM) physical priors into NeRF, but assumes a known material model and only handles global physical parameters.
Limitations of Prior Work: (1) Dynamic scene reconstruction methods (e.g., D-NeRF, 4D-GS) only fit motion trajectories without understanding physical laws, making them incapable of predicting the future or simulating under new conditions; (2) PAC-NeRF assumes known material types (e.g., elastomers), which is inapplicable to real-world heterogeneous objects; (3) Learning physical parameters for each particle in MPM is computationally prohibitive, and NeRF's implicit grid resolution is limited.
Key Challenge: There is a need for a physical dynamics model that is both expressive (capable of modeling complex deformations of heterogeneous elastic objects) and computationally efficient (supporting gradient-based inverse optimization).
Goal: To reconstruct the appearance, shape, and physical dynamics parameters of objects from multi-view videos, and to support simulation predictions under different initial conditions and environments.
Key Insight: The spring-mass system is a classic physical model that does not assume specific materials. It models various elastic behaviors through learnable topologies and parameters (stiffness, damping) and is naturally differentiable.
Core Idea: Use 3D Gaussians to represent appearance and geometry, and keypoint-based sparse anchors + spring systems to represent physical dynamics. Reconstruct the two in a decoupled manner and then combine them into a simulatable 3D object.
Method¶
Overall Architecture¶
The reconstruction consists of three steps: (1) Reconstructing static 3D Gaussians from the multi-view images of the first frame; (2) Generating sparse anchors via volume sampling and refining the Gaussians; (3) Establishing spring connections between anchors, and optimizing physical parameters (stiffness, damping, mass, and initial velocity) via differentiable simulation and rendering losses.
Key Designs¶
-
3D Spring-Mass Model:
- Function: Represents the physical dynamics of elastic objects without assuming specific material types.
- Mechanism: Volume sampling is performed on Gaussian centers to generate \(N_A\) anchors, each having a mass \(m_i\) and velocity \(v_i\). They are connected to \(n_k\) nearest neighbors via KNN to form a spring network. The spring force is defined as \(F_k = -\eta \cdot k_{i,j}(\|x_i - x_{i,j}\| - l_{i,j}) \cdot |\Delta l|^{p_k}\) (representing a non-linear spring when \(p_k > 0\)). Together with damping forces and gravity, the positions are updated using semi-implicit Euler integration.
- Design Motivation: Compared to MPM, which requires dense particles and known material constitutions, the spring-mass model can model complex deformations of heterogeneous objects using sparse anchors and per-spring learnable parameters, and it is fully differentiable.
-
Soft Vector:
- Function: Automatically learns the effective number of spring connections for each anchor.
- Mechanism: Introducing a decay vector \(\eta = [\eta_0, \ldots, \eta_{n_k}]\), where close-neighbor spring weights are set to 1, and far-neighbor springs decay based on a learnable parameter \(\kappa\) as \(\eta_j = \text{clamp}(2 - \exp(\text{softplus}(\kappa))^{j-n_c}, 0, 1)\).
- Design Motivation: If \(n_k\) is too large, the object becomes too rigid; if too small, it is too soft. The Soft Vector enables the model to automatically learn the appropriate stiffness, avoiding manual parameter tuning.
-
Decoupled Reconstruction Pipeline:
- Function: Decouples appearance/geometry reconstruction from physical parameter optimization to reduce optimization difficulty.
- Mechanism: First, freeze physical parameters to reconstruct static 3D Gaussians \(\rightarrow\) extract anchors and refine the Gaussians \(\rightarrow\) freeze appearance parameters and optimize physical parameters using differentiable simulation and rendering losses.
- Design Motivation: Joint optimization of both appearance and physical parameters has an excessively large and highly non-convex search space. Decoupling them defines clearer optimization targets for each stage.
Loss & Training¶
Static reconstruction: Photometric loss + SSIM + Opacity regularization. Dynamic reconstruction: Multi-view, multi-frame photometric loss backpropagated through differentiable simulation to optimize physical parameters.
Key Experimental Results¶
Main Results¶
| Scene | Spring-Gaus PSNR | PAC-NeRF PSNR | Remarks |
|---|---|---|---|
| Synthetic Elastic Ball | Better | Lower | PAC-NeRF assumes known material |
| Synthetic Heterogeneous Object | Significantly Better | N/A | PAC-NeRF cannot handle heterogeneity |
| Real Rubber Duck | Effective | Limited | Validated on real-world data |
Ablation Study¶
| Configuration | Performance | Explanation |
|---|---|---|
| Full Spring-Gaus | Best | Spring-mass + decoupled reconstruction |
| W/o Soft Vector | Drop | Fixed \(n_k\) is inflexible |
| Non-linear spring (\(p_k>0\)) | Improved | Non-linear forces better match real materials |
| Joint optimization (w/o decoupling) | Hard to converge | Decoupled strategy is critical |
Key Findings¶
- Although simple, the spring-mass model performs remarkably well on real-world elastic objects, capturing collisions, deformations, and bouncing.
- The Soft Vector effectively and automatically balances the "stiffness-softness" trade-off.
- Forward simulation under unseen conditions (e.g., different initial heights, different gravity directions) is achievable on both synthetic and real-world data.
Highlights & Insights¶
- Wisdom of Model Selection: Instead of using complex MPM/FEM, a classical spring-mass model is deployed. It is flexible enough (with per-spring learnable parameters to handle heterogeneous materials) and simple enough (fully differentiable and highly efficient).
- Decoupling appearance reconstruction from physics reconstruction is a critical engineering insight, avoiding the challenges of high-dimensional joint optimization.
- The paradigm of inversely learning physical parameters from videos (system identification) can be transferred to robotic object manipulation.
Limitations & Future Work¶
- The spring-mass model cannot accurately model materials that require continuum mechanics, such as fluids or cloth.
- It depends on multi-view synchronized videos; single-view scenes would require additional depth estimation.
- Collision handling utilizes simple boundary conditions, showing limited capability in handling complex contact scenarios (e.g., multi-object interaction).
- The choices for the number of anchors and spring connections still rely on empirical heuristics.
Related Work & Insights¶
- vs PAC-NeRF: PAC-NeRF employs MPM but assumes global material parameters, whereas Spring-Gaus uses the spring-mass model to allow per-spring heterogeneous parameters.
- vs PhysGaussian: PhysGaussian uses MPM for forward simulation but does not estimate inverse parameters, while Spring-Gaus completes the entire inverse problem from video to physical parameters.
- vs DANO: DANO handles rigid bodies, while Spring-Gaus addresses elastic bodies, which is a more challenging setting.
Rating¶
- Novelty: ⭐⭐⭐⭐ The combination of spring-mass and 3DGS is simple yet effective, and the Soft Vector design is clever.
- Experimental Thoroughness: ⭐⭐⭐ Synthesized and real-world data are used, but the variety of real-world scenes is limited.
- Writing Quality: ⭐⭐⭐⭐ The methodology is clearly described, and the pipeline diagram is intuitive.
- Value: ⭐⭐⭐⭐ The first to reconstruct simulatable elastic 3DGS objects from videos.