tK8GHR62EX@OpenReview

Total: 1

#1 SpectR: Dynamically Composing LM Experts with Spectral Routing [PDF] [Copy] [Kimi] [REL]

Authors: William Fleshman, Benjamin Van Durme

Training large, general-purpose language models poses significant challenges. The growing availability of specialized *expert* models, fine-tuned from pretrained models for specific tasks or domains, offers a promising alternative. Leveraging the potential of these existing expert models in real-world applications requires effective methods to select or merge the models best suited for a given task. This paper introduces SpectR, an approach for dynamically composing expert models at each time step during inference. Notably, our method requires no additional training and enables flexible, token- and layer-wise model combinations. Our experimental results demonstrate that SpectR improves routing accuracy over alternative training-free methods, increasing task performance across expert domains.

Subject: COLM.2025