Universal Beta Splatting¶
- Conference: ICLR 2026
- arXiv: 2510.03312
- Code: Project Page
- Area: 3D Vision / Neural Rendering
- Keywords: 3D Gaussian Splatting, Beta Kernel, N-Dimensional, View-Dependent, Dynamic Scene, Real-Time Rendering
TL;DR¶
The authors propose Universal Beta Splatting (UBS), which generalizes 3D Gaussian Splatting into N-dimensional anisotropic Beta kernels. By providing per-dimension shape control, it unifies the modeling of spatial geometry, view-dependent appearance, and scene dynamics within a single representation, achieving interpretable scene decomposition and SOTA rendering quality.
Background & Motivation¶
While 3D Gaussian Splatting (3DGS) enables real-time rendering through explicit primitives, the fixed bell-shaped profile of Gaussian kernels imposes fundamental limitations:
Spatial Dimension: Sharp boundaries require a large number of small primitives, leading to inefficiency.
Angular Dimension: View-dependent effects necessitate additional Spherical Harmonics (SH) encoding (48 parameters), resulting in a fragmented representation.
Temporal Dimension: Dynamic scenes require auxiliary deformation networks, increasing system complexity.
Key Insight: Different scene properties require distinct kernel behaviors—spatial geometry needs adaptive sharpness, angular appearance ranges from diffuse to specular, and temporal dynamics vary from static to rapid motion. Gaussian kernels enforce the same symmetric profile across all dimensions, whereas Beta kernels provide per-dimension shape control.
Method¶
Overall Architecture¶
UBS addresses the fundamental constraint of "fixed shapes" in 3DGS bell-shaped Gaussian kernels: where sharp boundaries require massive small primitives, view-dependent appearance needs external SH, and dynamic scenes require external deformation networks. Ours replaces this with an N-dimensional anisotropic Beta kernel with per-dimension adjustable shapes, allowing spatial geometry, view-dependent appearance, and scene dynamics to be represented by the same type of primitive. The pipeline operates as follows: each primitive carries position, covariance, and a set of Beta shape parameters \(\mathbf{b}\) within a unified \(N\)-dimensional distribution. The covariance is parameterized using a spatially orthogonal Cholesky decomposition to maintain the geometric structure of 3DGS. During rendering, given a viewpoint/time \(\mathbf{q}\), high-dimensional kernels are first projected back into 3D distributions via Beta-modulated conditional slicing, then processed through a standard 3DGS rasterization pipeline. Thus, it inherits real-time rendering while providing adaptive sharpness in every dimension; the learned \(\mathbf{b}\) parameters also yield an unsupervised scene decomposition map.
graph TD
A["Multi-view Images / Video"] --> B
subgraph PRIM["N-dimensional Beta Kernel"]
direction TB
B["Per-dimension Shape Parameter b<br/>Continuous Transition: Flat ↔ Sharp"] --> C["Spatially Orthogonal Cholesky<br/>Parameterized Covariance Σ"]
end
C -->|"Given Viewpoint / Time q"| D["Beta-modulated Conditional Slicing<br/>High-dim Kernels compressed to 3D Distribution"]
D --> E["Standard 3DGS Rasterization"]
E --> F["Rendered Image"]
C --> G["Universal Compatibility & Interpretable Decomposition<br/>b Parameters as Unsupervised Semantic Maps"]
Key Designs¶
1. N-dimensional Beta Kernel: Replacing Fixed Bell Profiles with Per-dimension Shape Parameters
Gaussian kernels enforce the same symmetric profile across all dimensions. UBS defines primitive density as \(\sigma(\mathbf{x}, \mathbf{q}) = \mathcal{B}(\mathbf{x}, \mathbf{q}; \boldsymbol{\mu}, \boldsymbol{\Sigma}, \mathbf{b}) \cdot o\), where \(\mathbf{x} \in \mathbb{R}^3\) represents spatial coordinates, \(\mathbf{q} \in \mathbb{R}^{N-3}\) encodes additional dimensions like viewpoint/time, and \(\mathbf{b} \in \mathbb{R}^{N-2}\) controls the Beta shape of each dimension. Crucially, the Beta exponent for each dimension \(\beta_i = 4\exp(b_i)\) is independently adjustable: negative \(b_i\) yields a flat profile suitable for smooth surfaces or diffuse reflections, while positive \(b_i\) yields sharp peaks fit for fine textures, fast motion, or specular highlights.
2. Spatially Orthogonal Cholesky Parameterization: Maintaining 3DGS Geometry in N-dimensions
Direct unconstrained decomposition of \(N\)-dimensional covariance would destroy the rotation-scaling semantics of spatial sub-blocks and lose compatibility with 3DGS weights. Ours parameterizes the Cholesky factor \(\mathbf{L}\) as a block lower triangular matrix:
The top-left \(\mathbf{R}_x \in SO(3)\) (maintained via first-order Taylor approximation) and \(\text{diag}(\mathbf{s}_x)\) reuse 3DGS rotation-scaling parameters, \(\mathbf{L}_{qx}\) encodes cross-dimensional correlation between spatial and extra dimensions, and \(\mathbf{L}_q\) describes the extra dimensions themselves.
3. Beta-modulated Conditional Slicing: Projecting Kernels for 3D Rendering
Given a viewpoint/time \(\mathbf{q}\), the 3D distribution is obtained via conditional slicing:
The term \(\text{diag}(\tilde{\boldsymbol{\beta}}_q)\) applies Beta shape parameters to non-spatial dimensions, allowing conditional mean and covariance to vary via Beta shapes rather than fixed linear transitions. Opacity is also modulated: \(o(\mathbf{q}) = o \prod_{i=1}^C (1 - d_i)^{4\beta_{q_i}}\), ensuring smooth and bounded decay.
4. Universal Compatibility & Interpretable Decomposition
Since Beta kernels converge to Gaussians as a limit, setting the dimension \(N\) and shape parameters \(\mathbf{b}\) can replicate existing methods:
| \(\mathbf{b}\) Setting | Equivalent Method |
|---|---|
| \(N=3\), \(b_x=0\) | ≈ 3DGS |
| \(N=3\), \(b_x \neq 0\) | ≈ DBS |
| \(N=6\), \(\mathbf{b}=\mathbf{0}\) | ≈ 6DGS |
| \(N=7\), \(\mathbf{b}=\mathbf{0}\) | ≈ 7DGS |
This unified parameterization leads to significant parameter reduction: 41% fewer parameters in static scenes (removing 48-param SH) and 73% fewer in dynamic scenes compared to 4DGS. Furthermore, negative spatial \(b_x\) corresponds to smooth surfaces, and positive angular \(b_d\) corresponds to specular reflections, enabling "free" unsupervised scene decomposition.
Loss & Training¶
Training objectives include standard reconstruction terms and two regularization terms:
The \(\ell_1\) regularization on opacity ensures MCMC densification prunes redundant primitives, while the scale penalty encourages primitive relocation to regions of higher demand.
Main Results¶
Static Scenes¶
NeRF Synthetic (UBS-6D vs 3DGS vs 6DGS):
| Scene | 3DGS PSNR | 6DGS PSNR | UBS-6D PSNR |
|---|---|---|---|
| chair | 35.60 | 35.55 | 36.72 |
| ficus | 35.49 | 34.62 | 36.90 |
| materials | 30.50 | 30.63 | 32.90 |
| lego | 36.06 | 35.22 | 36.95 |
PSNR gains reach up to +8.27 dB (on 6DGS-PBR dataset).
Dynamic Scenes¶
D-NeRF and 7DGS-PBR (UBS-7D vs 4DGS vs 7DGS): - PSNR improvements up to +2.78 dB. - Performance is notably superior in scenes with complex spatio-temporal-angular correlations (e.g., heart motion, sunlight changes, translucent deformations).
Key Findings¶
- Initializing Beta parameters at zero ensures convergence starting from the Gaussian limit.
- First-order approximation of spatially orthogonal Cholesky is comparable in accuracy to exact rotations but faster.
- MCMC optimization remains effective for Beta kernels.
- Real-time rendering performance is comparable to 3DGS.
Efficiency¶
- 30K iterations.
- Static: ~8-10 minutes on a single RTX 4090.
- Dynamic: Consistent with 4DGS/7DGS baselines on a single V100.
Highlights & Insights¶
- Unified Framework: A single Beta kernel primitive handles space/angle/time, replacing multiple specialized encodings.
- Backward Compatibility: Degenerates to Gaussian when Beta=0, guaranteeing a performance lower bound.
- Unsupervised Decomposition: Learned Beta parameters naturally separate geometry, appearance, and motion.
- Parameter Reduction: 41-73% reduction in parameters while improving performance.
- Real-time Rendering: Full CUDA acceleration support.
Limitations & Future Work¶
- Number of parameters in N-dimensional Cholesky grows with dimensionality.
- Bounded support of Beta kernels might require more primitives in extreme far-field views.
- Verification currently limited to 7 dimensions; higher-dimensional effects are unexplored.
- Training requires batch size 4 for dynamic scenes (higher VRAM consumption).
- First-order rotation approximation may lose precision under extreme rotation angles.
Related Work & Insights¶
- Kernel Design Alternatives: GES, TNT-GS, Half-GS, Disc-GS modify 3D Gaussians; DBS introduces 3D Beta kernels.
- High-dimensional Gaussians: 6DGS (space+view), 7DGS (space+view+time) use conditional distributions.
- Dynamic Scenes: D-NeRF uses deformation fields; 4DGS extends time dimension directly.
- Alternative Primitives: 3D Convex Splatting, Triangle Splatting, etc.
Rating¶
- Novelty: ⭐⭐⭐⭐⭐ — The N-dimensional Beta kernel framework is an elegant theoretical contribution.
- Value: ⭐⭐⭐⭐⭐ — Plug-and-play compatibility + real-time rendering.
- Clarity: ⭐⭐⭐⭐ — Mathematical derivations are clear, though notation is dense.
- Writing Quality: ⭐⭐⭐⭐⭐ — Establishes a general primitive framework for radiance field rendering.