DINO Pre-training for Vision-based End-to-end Autonomous Driving

2407.10803

Total: 1

#1 DINO Pre-training for Vision-based End-to-end Autonomous Driving [PDF⁷] [Copy] [Kimi²] [REL]

Authors: Shubham Juneja, Povilas Daniušis, Virginijus Marcinkevičius

In this article, we focus on the pre-training of visual autonomous driving agents in the context of imitation learning. Current methods often rely on a classification-based pre-training, which we hypothesise to be holding back from extending capabilities of implicit image understanding. We propose pre-training the visual encoder of a driving agent using the self-distillation with no labels (DINO) method, which relies on a self-supervised learning paradigm.% and is trained on an unrelated task. Our experiments in CARLA environment in accordance with the Leaderboard benchmark reveal that the proposed pre-training is more efficient than classification-based pre-training, and is on par with the recently proposed pre-training based on visual place recognition (VPRPre).

Subjects: Computer Vision and Pattern Recognition , Machine Learning , Robotics

Publish: 2024-07-15 15:18:57 UTC

2407.10803

#1 DINO Pre-training for Vision-based End-to-end Autonomous Driving [PDF7] [Copy] [Kimi2] [REL]

#1 DINO Pre-training for Vision-based End-to-end Autonomous Driving [PDF⁷] [Copy] [Kimi²] [REL]