PhantomHunter: Detecting Unseen Privately-Tuned LLM-Generated Text via Family-Aware Learning

#1 PhantomHunter: Detecting Unseen Privately-Tuned LLM-Generated Text via Family-Aware Learning [PDF⁶] [Copy] [Kimi¹⁰] [REL]

Authors: Yuhui Shi, Yehan Yang, Qiang Sheng, Hao Mi, Beizhe Hu, Chaoxi Xu, Juan Cao

With the popularity of large language models (LLMs), undesirable societal problems like misinformation production and academic misconduct have been more severe, making LLM-generated text detection now of unprecedented importance. Although existing methods have made remarkable progress, a new challenge posed by text from privately tuned LLMs remains underexplored. Users could easily possess private LLMs by fine-tuning an open-source one with private corpora, resulting in a significant performance drop of existing detectors in practice. To address this issue, we propose PhantomHunter, an LLM-generated text detector specialized for detecting text from unseen, privately-tuned LLMs. Its family-aware learning framework captures family-level traits shared across the base models and their derivatives, instead of memorizing individual characteristics. Experiments on data from LLaMA, Gemma, and Mistral families show its superiority over 7 baselines and 3 industrial services, with F1 scores of over 96%.

Subjects: Computation and Language , Computers and Society

Publish: 2025-06-18 17:59:58 UTC

2506.15683

#1 PhantomHunter: Detecting Unseen Privately-Tuned LLM-Generated Text via Family-Aware Learning [PDF6] [Copy] [Kimi10] [REL]

#1 PhantomHunter: Detecting Unseen Privately-Tuned LLM-Generated Text via Family-Aware Learning [PDF⁶] [Copy] [Kimi¹⁰] [REL]