2512.11016

Total: 1

#1 SoccerMaster: A Vision Foundation Model for Soccer Understanding [PDF3] [Copy] [Kimi2] [REL]

Authors: Haolin Yang, Jiayuan Rao, Haoning Wu, Weidi Xie

Soccer understanding has recently garnered growing research interest due to its domain-specific complexity and unique challenges. Unlike prior works that typically rely on isolated, task-specific expert models, this work aims to propose a unified model to handle diverse soccer visual understanding tasks, ranging from fine-grained perception (e.g., athlete detection) to semantic reasoning (e.g., event classification). Specifically, our contributions are threefold: (i) we present SoccerMaster, the first soccer-specific vision foundation model that unifies diverse understanding tasks within a single framework via supervised multi-task pretraining; (ii) we develop an automated data curation pipeline to generate scalable spatial annotations, and integrate them with various existing soccer video datasets to construct SoccerFactory, a comprehensive pretraining data resource; and (iii) we conduct extensive evaluations demonstrating that SoccerMaster consistently outperforms task-specific expert models across diverse downstream tasks, highlighting its breadth and superiority. The data, code, and model will be publicly available.

Subjects: Computer Vision and Pattern Recognition , Artificial Intelligence

Publish: 2025-12-11 18:03:30 UTC