Notable other Hugging Face

Gaussian Light Field Splatting: A Physical Prior-Driven Vision Transformer for Unsupervised Low-Light Image Enhancement

Yuhan Chen, Wenxuan Yu, Guofa Li, Fuchen Li, Kunyang Huang, Yicui Shi

Published: Jun 16, 2026 — 14:37 UTC

Problem
Existing unsupervised low-light image enhancement techniques struggle with local exposure imbalances and color distortions, particularly under complex non-uniform illumination conditions. Furthermore, many Vision Transformers lack mechanisms to incorporate physical priors related to illumination degradation. This paper addresses these gaps by proposing a novel architecture that integrates physical modeling into the enhancement process. The work is presented as a preprint and has not yet undergone peer review.

Method
The proposed model, GLFS (Gaussian Light Field Splatting), employs a Vision Transformer architecture enhanced with a continuous physical illumination model derived from Gaussian splatting. In GLFS, scene illumination is represented as a superposition of anisotropic Gaussian basis functions, which allows for a more accurate representation of complex lighting conditions. The model introduces physics-guided biases into the self-attention mechanism, enabling the adaptive inference of a spatial gain field that facilitates uniform restoration across varying illumination. To mitigate color bias and structural degradation, the authors develop two novel loss functions: a color-vector angular loss that enforces hue consistency and a luminance-edge loss that enhances structural fidelity in local details. The training process and computational requirements are not explicitly detailed in the paper.

Results
GLFS demonstrates superior performance in illumination correction and detail preservation compared to several baseline methods. The authors report that GLFS achieves state-of-the-art results on standard low-light image enhancement benchmarks, although specific quantitative metrics (e.g., PSNR, SSIM) and comparisons to named baselines are not provided in the abstract. The extensive ablation studies conducted indicate that the integration of physical priors significantly contributes to the model’s effectiveness.

Limitations
The authors acknowledge that while GLFS improves upon existing methods, it may still face challenges in extreme low-light conditions where illumination is severely compromised. Additionally, the reliance on Gaussian basis functions may limit the model’s adaptability to all types of lighting scenarios. The paper does not discuss the computational efficiency or scalability of the model in real-time applications, which could be a concern for practical deployment.

Why it matters
The introduction of GLFS represents a significant advancement in the field of low-light image enhancement by effectively integrating physical priors into the Vision Transformer framework. This approach not only enhances the quality of low-light images but also opens avenues for further research into physically-informed neural networks. The implications of this work extend to various applications, including photography, surveillance, and autonomous systems, where low-light performance is critical. For further details, see the full paper available on arXiv.

By Callan Zhang · Jun 16, 2026 · Editorial standards →

Summarised from the primary source with AI assistance under human editorial oversight. Turing Wire is not a primary source — read the original for the authoritative account.

Source: arXiv cs.CV