Vision-Reasoning-Guided Occlusion Removal from Light Fields


Authors: Mohamed Youssef, Oliver Bimber

Occlusion-robust scene recovery remains a major challenge in computational imaging, particularly in natural environments where dense foreground vegetation severely limits visibility. We propose a vision-reasoning-guided light field occlusion removal framework that combines the visibility recovery capability of light field integration (LFI) with the semantic reasoning capacity of vision-language models (VLMs). Multi-view observations are first integrated via LFI to suppress foreground occlusions and produce an initial visibility-enhanced representation. A VLM is then incorporated as a conditional semantic prior to restore degraded structures and recover fine details, guided by the observed measurements. To improve recovery consistency and reduce hallucination artifacts, we introduce a multi-sample fusion strategy that aggregates multiple generated hypotheses into a unified estimate. Experimental results on synthetic and real-world datasets demonstrate state-of-the-art performance, achieving the highest average SSIM across four synthetic light field benchmark scenes (4-Syn) and strong generalization across structured and unstructured acquisition settings. These results highlight the effectiveness of combining physical imaging constraints with vision-language reasoning for robust perception under severe occlusion, with applicability to search-and-rescue and exploratory robotic navigation.
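The two non-learned steps named in the abstract, light field integration and multi-sample fusion, can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes LFI is approximated by a shift-and-average over views registered to a single focal plane with integer pixel shifts, and that fusion is a per-pixel median over the generated hypotheses; the abstract specifies neither operator. The function names `integrate_light_field` and `fuse_hypotheses` are hypothetical, and the VLM restoration step is omitted entirely.

```python
import numpy as np


def integrate_light_field(views, shifts):
    """Shift-and-average integration (an assumed stand-in for LFI).

    views  : list of 2-D arrays of identical shape, one per camera view
    shifts : list of (dy, dx) integer pixel shifts that register each view
             to the chosen focal plane

    Points on the focal plane align across views and reinforce each other,
    while occluders off that plane land on different pixels and are averaged
    down. np.roll wraps around at the borders, which a real pipeline would
    replace with cropping or padding.
    """
    acc = np.zeros_like(views[0], dtype=np.float64)
    for view, (dy, dx) in zip(views, shifts):
        acc += np.roll(view, shift=(dy, dx), axis=(0, 1))
    return acc / len(views)


def fuse_hypotheses(hypotheses):
    """Per-pixel median over several restored candidates (assumed operator).

    A median keeps content that most candidates agree on and rejects details
    that appear in only a minority of samples.
    """
    return np.median(np.stack(hypotheses, axis=0), axis=0)
```

In this sketch, the integrated image would be passed to the generative restoration stage several times, and the resulting candidates handed to `fuse_hypotheses`. A median is only one plausible choice; a mean or a confidence-weighted combination would fit the abstract's description equally well.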

Subject: Computer Vision and Pattern Recognition

Publish: 2026-06-18 09:24:55 UTC

arXiv: 2606.19985
