This paper proposes PhyGaP, which integrates polarization cues into 2DGS optimization via a polarization deferred rendering pipeline (PolarDR), and introduces a self-occlusion-aware GridMap environment representation, enabling accurate reflection decomposition and realistic relighting of glossy objects.
Difficulty in reconstructing reflective objects: 3DGS and its variants lack explicit geometric representations, and the splatting pipeline cannot simulate secondary light transport, limiting their ability to model glossy surfaces.
Insufficient information in RGB images: Existing DR methods rely on accurate estimation of normals, reflectance, and roughness, yet ordinary RGB images do not encode these physical properties, leading to failure in decomposing albedo from specular reflection.
Poor relighting quality: Due to inaccurate reflection decomposition, existing methods frequently exhibit color shifts, unrealistic shadows, or surface discontinuities under novel illumination conditions.
Rich physical information encoded in polarization: Specular reflection produces strong linear polarization, while diffuse reflection produces weak polarization with a 90° shift in polarization angle, making polarization cues naturally suited for guiding reflectance property learning.
Self-occlusion in non-convex objects: Environment cubemaps assume light sources at infinity and cannot handle self-occlusion or indirect illumination of non-convex objects, causing relighting artifacts.
Existing polarization methods do not support relighting: PANDORA encodes environment maps implicitly, and PolGS does not decompose albedo; neither supports illumination replacement.
PhyGaP builds upon the 2DGS + Ref-Gaussian framework. Each Gaussian primitive maintains learnable attributes: albedo \(\boldsymbol{\lambda}\), index of refraction (IoR) \(\eta\), surface normal \(\mathbf{n}\), roughness \(r\), and a learnable environment cubemap mipmap \(E\). These attributes are splatted into 2D material maps via α-blending and then fed into PolarDR to compute per-pixel Stokes vectors, which are supervised against ground-truth polarization observations.
The polarization state of light is represented by the Stokes vector \(\mathbf{s}=[s_0, s_1, s_2, s_3]^\top\); light–surface interactions are modeled using Mueller matrices.
Specular polarization: The degree of polarization \(\beta_s\) is computed from Fresnel coefficients \(R^\perp, R^\parallel\) and combined with specular radiance \(L_s\) to obtain the specular Stokes component.
Diffuse polarization: The degree of polarization \(\beta_d\) is computed from transmittance Fresnel coefficients \(T^\perp, T^\parallel\) and combined with diffuse radiance \(L_d\) to obtain the diffuse Stokes component.
The sum of both components constitutes the rendered Stokes vector, which is directly compared with ground-truth polarization maps to explicitly constrain specular–diffuse decomposition.
Spherical harmonics are not used to represent color, as albedo should be view-independent.
The bounding box of the object is divided into a 3×3 grid on each face, with anchor cameras placed at grid nodes (excluding the bottom face), yielding \(N=52\) anchor cameras in total.
A single-bounce ray tracing step is performed at each anchor camera to construct a local cubemap \(\tilde{E}_i\) that blends the object's own color with the global environment.
During rendering, the Stokes contributions from all local cubemaps are fused via distance-weighted blending:
Evaluation on 9 scenes from the PANDORA/RMVP/SMVP/Mitsuba3 datasets. PhyGaP achieves an average improvement of approximately 2 dB PSNR over RGB-only methods and reduces normal cosine distance by 45.7%.
First polarization GS method with relighting capability: While PANDORA, PolGS, and other polarization-based methods do not support relighting, PhyGaP achieves explicit reflection decomposition and illumination replacement.
Physics-driven polarization rendering: PolarDR embeds the pBRDF model into GS deferred rendering and directly supervises with polarization Stokes vectors, eliminating the albedo–illumination ambiguity.
Practical and efficient GridMap: The combination of 52 anchor cameras and distance-weighted fusion addresses indirect illumination without scene-specific parameters, with manageable overhead and straightforward GPU parallelization.
Compatible with partial polarization inputs: Data acquisition requires only two standard RGB cameras equipped with linear polarizers, without reliance on dedicated polarization cameras.
Insufficient modeling of metallic surfaces: The pBRDF of metals involves complex-valued indices of refraction and phase terms that the current model may not accurately capture.
GridMap limitations for extreme geometries: Highly irregular objects or strongly specular scenes with multiple inter-reflections remain challenging.
Infinite light source assumption in environment maps: Finite-distance light sources in real scenes introduce reconstruction bias.
Multi-bounce light transport not modeled: GridMap performs only single-bounce ray tracing; complex inter-reflection scenarios leave room for improvement.
Novelty: ⭐⭐⭐⭐ — Integrating polarization pBRDF into GS deferred rendering alongside the GridMap design for indirect illumination represents a novel technical combination.
Experimental Thoroughness: ⭐⭐⭐⭐ — Covers 9 scenes, both synthetic and real data, and evaluates NVS, normals, decomposition, and relighting with complete ablations.
Writing Quality: ⭐⭐⭐⭐ — Well-structured with complete formula derivations and rich figures and tables.
Value: ⭐⭐⭐⭐ — First to achieve relighting capability in polarization-based GS, with practical application prospects in VR/AR and interactive design.