
CCL: Causal-aware In-context Learning for Out-of-Distribution Generalization

Authors: Hoyoon Byun, Gyeongdeok Seo, Joonseong Kang, Taero Kim, Jihee Kim, Kyungwoo Song

In-context learning (ICL), a nonparametric learning method that draws on the knowledge contained in demonstration sets, has become a de facto standard for large language models (LLMs). The primary goal of ICL is to select valuable demonstration sets that enhance the performance of LLMs. Traditional ICL methods choose demonstration sets that share similar features with a given query. However, we find that the performance of these traditional ICL approaches is limited on out-of-distribution (OOD) datasets, where the demonstration set and the query originate from different distributions. To ensure robust performance on OOD datasets, it is essential to learn causal representations that remain invariant between the source and target datasets. Inspired by causal representation learning, we propose causal-aware in-context learning (CCL). CCL captures the causal representations of a given dataset and selects demonstration sets that share similar causal features with the query. To achieve this, CCL employs a novel VAE-based causal representation learning technique. We demonstrate both theoretically and empirically that CCL improves the OOD generalization performance of LLMs. Code is available at: https://github.com/MLAI-Yonsei/causal-context-learning
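To make the demonstration-selection step concrete, below is a minimal sketch (not the authors' implementation) of choosing in-context examples by similarity in a causal representation space. The `causal_encoder` function is a hypothetical stand-in for CCL's VAE-based causal representation learner, and cosine similarity is an assumed choice of metric.

```python
import numpy as np

def select_demonstrations(causal_encoder, query, pool, k=4):
    """Return the k pool examples whose causal features are closest to the query.

    causal_encoder: hypothetical function mapping a text to a 1-D numpy vector
                    of causal features (stand-in for CCL's VAE-based encoder).
    query: the query text.
    pool: list of candidate demonstrations, each a dict with a "text" field.
    """
    q = causal_encoder(query)                                   # shape (d,)
    Z = np.stack([causal_encoder(x["text"]) for x in pool])     # shape (n, d)
    # Cosine similarity between the query and every candidate demonstration.
    sims = Z @ q / (np.linalg.norm(Z, axis=1) * np.linalg.norm(q) + 1e-8)
    top = np.argsort(-sims)[:k]
    return [pool[i] for i in top]
```

The selected demonstrations would then be concatenated with the query to form the LLM prompt; the key difference from feature-similarity ICL is that the retrieval space is the learned causal representation rather than surface features.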

Subject: NeurIPS.2025 - Poster