GeLaCo: An Evolutionary Approach to Layer Compression | Cool Papers

#1 GeLaCo: An Evolutionary Approach to Layer Compression [PDF²] [Copy] [Kimi⁴] [REL]

Authors: David Ponce, Thierry Etchegoyhen, Javier Del Ser

Large Language Models (LLM) have achieved remarkable performance across a large number of tasks, but face critical deployment and usage barriers due to substantial computational requirements. Model compression methods, which aim to reduce model size while preserving its capacity, are an important means to mitigate these issues. Promising approaches along these lines, such as structured pruning, typically require costly empirical search for optimal variants and may run the risk of ignoring better solutions. In this work we introduce GeLaCo, an evolutionary approach to LLM compression via layer collapse. Our approach supports an efficient exploration of the compression solution space via population-based search and a module-wise similarity fitness function capturing attention, feed-forward, and hidden state representations. GeLaCo also supports both single and multi-objective evolutionary compression search, establishing the first Pareto frontier along compression and quality axes. We evaluate GeLaCo solutions via both perplexity-based and generative evaluations over foundational and instruction-tuned models, outperforming state-of-the-art alternatives.

Subject: Computation and Language

Publish: 2025-07-14 08:44:59 UTC

2507.10059

#1 GeLaCo: An Evolutionary Approach to Layer Compression [PDF2] [Copy] [Kimi4] [REL]

#1 GeLaCo: An Evolutionary Approach to Layer Compression [PDF²] [Copy] [Kimi⁴] [REL]