2503.15573

Total: 1

#1 Neuronal Activation States as Sample Embeddings for Data Selection in Task-Specific Instruction Tuning [PDF] [Copy] [Kimi3] [REL]

Authors: Da Ma, Gonghu Shang, Zhi Chen, Libo Qin, Yijie Luo, Lei Pan, Shuai Fan, Lu Chen, Kai Yu

Task-specific instruction tuning enhances the performance of large language models (LLMs) on specialized tasks, yet efficiently selecting relevant data for this purpose remains a challenge. Inspired by neural coactivation in the human brain, we propose a novel data selection method called NAS, which leverages neuronal activation states as embeddings for samples in the feature space. Extensive experiments show that NAS outperforms classical data selection methods in terms of both effectiveness and robustness across different models, datasets, and selection ratios.

Subject: Machine Learning

Publish: 2025-03-19 11:35:57 UTC