2025.acl-long.521@ACL


#1 Cool-Fusion: Fuse Large Language Models without Training

Authors: Cong Liu, Xiaojun Quan, Yan Pan, Weigang Wu, Xu Chen, Liang Lin

We focus on the problem of fusing two or more heterogeneous large language models (LLMs) to leverage their complementary strengths. One challenge of model fusion is its high computational load, specifically for fine-tuning or aligning vocabularies. To address this, we propose Cool-Fusion, a simple yet effective approach that fuses the knowledge of source LLMs without requiring any training. Unlike ensemble methods, Cool-Fusion is applicable to any set of source LLMs, even those with different vocabularies. To overcome vocabulary discrepancies among LLMs, we ensemble them at the text level, letting the LLMs rerank each other's generated texts at different granularities. Extensive experiments have been conducted across a variety of benchmark datasets. On GSM8K, Cool-Fusion improves accuracy over three strong source LLMs by a significant margin of 17.4%.
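A minimal sketch of the text-level fusion idea the abstract describes: each source LLM proposes a candidate text segment, every LLM scores every candidate on the raw text using length-normalized log-likelihood (so differing vocabularies do not matter), and the best-scoring candidate is kept. This is an illustration under assumptions, not the paper's exact algorithm; the function names (`cool_fusion_step`, `avg_logprob`), the fixed segment length, and the averaged log-probability scoring rule are hypothetical choices for the sketch.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

def avg_logprob(model, tokenizer, prompt, continuation):
    """Mean per-token log-probability of `continuation` given `prompt`.

    Length normalization lets models with different tokenizers be
    compared on the same text. Tokenization boundaries between prompt
    and continuation may shift slightly, so this is approximate.
    """
    prompt_ids = tokenizer(prompt, return_tensors="pt").input_ids
    full_ids = tokenizer(prompt + continuation, return_tensors="pt").input_ids
    with torch.no_grad():
        logits = model(full_ids).logits
    # Log-prob of each token given its prefix (position i predicts token i+1).
    logprobs = torch.log_softmax(logits[:, :-1], dim=-1)
    targets = full_ids[:, 1:]
    token_lp = logprobs.gather(-1, targets.unsqueeze(-1)).squeeze(-1)
    n_prompt = prompt_ids.shape[1]
    return token_lp[:, n_prompt - 1:].mean().item()

def cool_fusion_step(models, tokenizers, prompt, max_new_tokens=32):
    """One fusion step: every LLM proposes a segment, all LLMs rerank.

    Returns the candidate text with the highest average score across
    all source models; appending it to the prompt and repeating yields
    a jointly generated text.
    """
    candidates = []
    for model, tok in zip(models, tokenizers):
        ids = tok(prompt, return_tensors="pt").input_ids
        out = model.generate(ids, max_new_tokens=max_new_tokens, do_sample=False)
        candidates.append(tok.decode(out[0, ids.shape[1]:], skip_special_tokens=True))
    scores = [
        sum(avg_logprob(m, t, prompt, c) for m, t in zip(models, tokenizers))
        / len(models)
        for c in candidates
    ]
    return candidates[scores.index(max(scores))]
```

Because candidates are exchanged and scored as plain text rather than token IDs, no vocabulary alignment or fine-tuning is needed, which is the property the abstract highlights.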

Subject: ACL.2025 - Long Papers