DashGaussian: Optimizing 3D Gaussian Splatting in 200 Seconds¶
Conference: CVPR 2025
arXiv: 2503.18402
Code: dashgaussian.github.io
Area: 3D Vision / 3DGS Acceleration
Keywords: 3D Gaussians, Training Acceleration, Frequency Scheduling, Resolution Scheduling, Adaptive Densification
TL;DR¶
DashGaussian is proposed, offering a joint framework for scheduling rendering resolution and Gaussian primitive count based on frequency analysis. It reformulates 3DGS optimization as a progressive fitting of high-frequency components, achieving an average acceleration of 45.7% without compromising rendering quality.
Background & Motivation¶
- Although 3DGS is significantly faster than NeRF (tens of minutes vs. days), further acceleration is still required on resource-constrained devices and for large-scale scene reconstruction.
- Existing acceleration methods fall into two categories: (1) engineering optimizations (e.g., efficient forward/backward implementations); (2) algorithmic optimizations (e.g., pruning redundant Gaussians), where the latter often sacrifices rendering quality.
- Three key observations: a) computational overhead is primarily determined by rendering resolution and Gaussian count; b) at the beginning of optimization, Gaussians are sparse, making high-resolution rendering wasteful; c) in the late stages of optimization, a surge in Gaussian count brings massive computation with limited quality improvement.
- Key Insight: Allocating computational resources reasonably is more efficient than simply pruning parameters.
Method¶
Overall Architecture¶
The optimization of 3DGS is reformulated as a progressive process of fitting higher-frequency components in training views. Based on this, a resolution scheduler and a primitive scheduler are proposed to jointly control the optimization complexity.
Key Designs¶
-
Frequency-Guided Resolution Scheduler:
- Function: Adaptively increases the rendering resolution step-by-step during optimization.
- Mechanism: Formulating image downsampling as removing high-frequency components, a resolution saliency function is defined as \(\mathcal{X}(\mathbf{F}) = \frac{1}{N}\sum_{n=1}^N \sum_{i,j} \|\mathbf{F}^n(i,j)\|_2\). Iteration steps for high/low resolutions are allocated using the frequency energy ratio \(f(\mathbf{F}, \mathbf{F}_r) = \frac{\mathcal{X}(\mathbf{F}) - \mathcal{X}(\mathbf{F}_r)}{\mathcal{X}(\mathbf{F})}\).
- Design Motivation: Low-frequency components contain core structural information and require less computation, so they should be fitted first. High-frequency details require more primitives to be effectively fitted and should be addressed after sufficient primitives are present.
- Switching Point: Switches from low resolution to high resolution at the \(s_r = S \cdot \mathcal{X}(\mathbf{F}_r) / \mathcal{X}(\mathbf{F})\) iteration.
-
Resolution-Guided Primitive Scheduler:
- Function: Controls the growth of Gaussian primitives in synchronization with the resolution.
- Mechanism: \(P_i = P_{\text{init}} + (P_{\text{fin}} - P_{\text{init}}) / (r^{(i)})^{2-i/S}\), where the power factor decays linearly from 2 to 1, producing a concave growth curve (suppressed in the early stage, encouraged in the middle stage).
- Design Motivation: A specific resolution should match an appropriate number of primitives. Too many primitives in low-resolution stages are wasteful and may lead to over-densification. The concave curve ensures primitives grow rapidly only when truly needed.
-
Momentum-Based Primitive Budget Estimation:
- Function: Adaptively determines the final primitive count \(P_{\text{fin}}\) without relying on dataset priors.
- Mechanism: Treating \(P_{\text{fin}}\) as momentum and the natural densification amount at each step \(P_{\text{add}}\) as force: \(P_{\text{fin}}^{(i)} = \max(P_{\text{fin}}^{(i-1)}, \gamma P_{\text{fin}}^{(i-1)} + \eta P_{\text{add}}^{(i)})\).
- Design Motivation: Bypasses artificially set upper bounds, dynamically adjusting the primitive budget based on actual scene requirements.
Loss & Training¶
- Uses anti-aliasing downsampling (frequency domain center cropping + DFT/IDFT) to avoid 2D aliasing.
- Discretizes the resolution factor using floor functions to encourage primitives to grow faster than under continuous scheduling.
- Hyperparameters: \(a=4\) (maximum downsampling corresponding to \(1/4\) of frequency energy), \(\gamma=0.98\), \(\eta=1\), \(P_{\text{fin}}^{(0)} = 5 \cdot P_{\text{init}}\).
- Maintains the original hyperparameters unchanged when incorporating various 3DGS backbones.
Key Experimental Results¶
Main Results (Comparison with Fast Optimization Methods)¶
| Method | Mip-NeRF360 PSNR | Time (min) | Deep Blending PSNR | Time (min) | T&T PSNR | Time (min) |
|---|---|---|---|---|---|---|
| 3DGS | 27.72 | 18.31 | 29.50 | 17.27 | 23.62 | 10.59 |
| Reduced-3DGS | 27.28 | 15.52 | 29.78 | 13.74 | 23.59 | 8.31 |
| Taming-3DGS | 27.61 | 5.51 | 29.69 | 4.52 | 23.62 | 4.02 |
| DashGaussian | 27.92 | 3.23 | 30.02 | 2.20 | 23.97 | 2.62 |
Speedup Effects on Various Backbones¶
| Backbone | Original Time | +DashGaussian | Speedup Ratio | PSNR Change |
|---|---|---|---|---|
| 3DGS | 18.31 min | 10.16 min | 44.5% | +0.09 |
| Mip-Splatting | 25.83 min | 12.60 min | 51.2% | +0.08 |
| Revising-3DGS | 5.73 min | 3.46 min | 39.6% | +0.20 |
| Taming-3DGS | 5.51 min | 3.23 min | 41.4% | +0.31 |
Ablation Study¶
| Configuration | Key Metrics | Description |
|---|---|---|
| Resolution scheduling only | Significant speedup but slight quality drop | Lack of primitive coordination leads to underfitting |
| Primitive scheduling only | Moderate speedup | High-resolution rendering remains the bottleneck |
| Joint scheduling (Full) | Maximum speedup with maintained quality | Optimal coordination between resolution and primitives |
| Fixed \(P_{\text{fin}}\) | Over/under-densification in some scenes | Momentum estimation yields better adaptivity |
Key Findings¶
- Accelerates optimization by 45.7% on average with widespread improvements in PSNR (speed is not gained at the cost of quality).
- Integrates seamlessly with arbitrary 3DGS backbones as a plug-and-play accelerator.
- The concave primitive growth curve (fewer in early stages, more in late stages) outperforms existing convex or front-heavy strategies.
- Compresses the optimization time of Taming-3DGS on the Mip-NeRF 360 dataset to approximately 3 minutes (~200 seconds level).
Highlights & Insights¶
- Unifies the theoretical foundation of resolution scheduling from the perspective of frequency analysis, reformulating 3DGS optimization as a progressive frequency fitting process.
- The philosophy of "allocating computational resources reasonably rather than pruning parameters" is more elegant than aggressive pruning and avoids quality loss.
- Momentum-based primitive budgeting is a truly adaptive solution, escaping the limitations of manually defined upper bounds.
- Experiments demonstrate that reasonable allocation of computational resources can even enhance quality (the low-resolution stage acts as implicit regularization).
Limitations & Future Work¶
- Discretizing the resolution factor (using floor) is an engineering approximation, leaving the theoretical optimality of continuous scheduling not fully realized.
- The frequency energy ratio \(f\) assumes perfect anti-aliased downsampling, whereas residual aliasing may exist in practice.
- The selection of hyperparameter \(a=4\) is not extensively ablated.
- Not fully validated on ultra-large-scale scenes (e.g., city-scale).
Related Work & Insights¶
- 3DGS \(\to\) Base representation; Mip-Splatting \(\to\) Multi-scale rendering and anti-aliasing.
- Taming-3DGS \(\to\) Engineering acceleration baseline; Mini-Splatting \(\to\) Algorithmic pruning baseline.
- Nyquist Shannon sampling theorem + DFT \(\to\) Theoretical foundation for resolution scheduling.
Rating¶
- Novelty: ⭐⭐⭐⭐ Frequency-guided resolution scheduling offers a novel perspective, and the joint scheduling scheme is designed cleverly.
- Experimental Thoroughness: ⭐⭐⭐⭐⭐ High thoroughness with three datasets, four backbones, and detailed ablations/comparisons.
- Writing Quality: ⭐⭐⭐⭐ Clear theoretical derivations; the frequency analysis perspective is highly educational.
- Value: ⭐⭐⭐⭐⭐ Extremely high practical value as a plug-and-play general accelerator.