2410.16028

Few-shot target-driven instance detection based on open-vocabulary object detection models

Authors: Ben Crulis ; Barthelemy Serres ; Cyril De Runz ; Gilles Venturini

Current large open vision models could be useful for one-shot and few-shot object recognition. Nevertheless, gradient-based re-training solutions are costly. On the other hand, open-vocabulary object detection models bring visual and textual concepts closer together in a shared latent space, allowing zero-shot detection via prompting at a small computational cost. We propose a lightweight method to turn such models into one-shot or few-shot object recognition models without requiring textual descriptions. Our experiments on the TEgO dataset, using the YOLO-World model as a base, show that performance increases with the model size, the number of examples, and the use of image augmentation.
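The abstract does not spell out the mechanism, so the following is only an illustrative sketch of one plausible reading: since visual and textual concepts share a latent space, a class could be represented by a prototype built from the embeddings of a few example images instead of by a text prompt embedding. The function names (`l2_normalize`, `build_prototype`, `classify`), the averaging step, and the toy 3-dimensional vectors are all assumptions made for illustration; they are not taken from the paper or from YOLO-World.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length (returned unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def build_prototype(example_embeddings):
    """Average the normalized embeddings of the few support examples of one class.

    Assumption: a mean prototype stands in for the text embedding of a prompt.
    Embeddings of augmented copies of an example could simply be appended to
    the list before averaging.
    """
    normalized = [l2_normalize(e) for e in example_embeddings]
    dim = len(normalized[0])
    mean = [sum(e[i] for e in normalized) / len(normalized) for i in range(dim)]
    return l2_normalize(mean)


def classify(region_embedding, prototypes):
    """Return (label, cosine similarity) of the best-matching prototype."""
    query = l2_normalize(region_embedding)
    scores = {
        label: sum(q * p for q, p in zip(query, proto))
        for label, proto in prototypes.items()
    }
    best = max(scores, key=scores.get)
    return best, scores[best]


# Toy 3-dimensional vectors standing in for real image embeddings (hypothetical data).
support = {
    "mug": [[1.0, 0.1, 0.0], [0.9, 0.2, 0.0]],  # two-shot class
    "keys": [[0.0, 0.1, 1.0]],                  # one-shot class
}
prototypes = {label: build_prototype(embs) for label, embs in support.items()}
label, score = classify([0.8, 0.3, 0.1], prototypes)
```

In this toy setup the query vector lies closest to the "mug" prototype. No gradient step is involved: adding a class only requires embedding its few examples, which is consistent with the low computational cost the abstract emphasizes.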

Subject: Computer Vision and Pattern Recognition

Publish: 2024-10-21 14:03:15 UTC