Smart Eyes for Silent Threats: VLMs and In-Context Learning for THz Imaging

#1 Smart Eyes for Silent Threats: VLMs and In-Context Learning for THz Imaging [PDF⁴] [Copy] [Kimi¹¹] [REL]

Authors: Nicolas Poggi, Shashank Agnihotri, Margret Keuper

Terahertz (THz) imaging enables non-invasive analysis for applications such as security screening and material classification, but effective image classification remains challenging due to limited annotations, low resolution, and visual ambiguity. We introduce In-Context Learning (ICL) with Vision-Language Models (VLMs) as a flexible, interpretable alternative that requires no fine-tuning. Using a modality-aligned prompting framework, we adapt two open-weight VLMs to the THz domain and evaluate them under zero-shot and one-shot settings. Our results show that ICL improves classification and interpretability in low-data regimes. This is the first application of ICL-enhanced VLMs to THz imaging, offering a promising direction for resource-constrained scientific domains. Code: \href{https://github.com/Nicolas-Poggi/Project_THz_Classification/tree/main}{GitHub repository}.

Subjects: Computation and Language , Computer Vision and Pattern Recognition

Publish: 2025-07-21 12:57:49 UTC

2507.15576

#1 Smart Eyes for Silent Threats: VLMs and In-Context Learning for THz Imaging [PDF4] [Copy] [Kimi11] [REL]

#1 Smart Eyes for Silent Threats: VLMs and In-Context Learning for THz Imaging [PDF⁴] [Copy] [Kimi¹¹] [REL]