Reinforcement Fine-Tuned Large Language Models for Next POI Recommendation

#1 Reinforcement Fine-Tuned Large Language Models for Next POI Recommendation [PDF⁷] [Copy] [Kimi³] [REL]

Authors: Peibo Li, Shuang Ao, Hao Xue, Yang Song, Maarten de Rijke, Johan Barthélemy, Tomasz Bednarz, Flora D. Salim

Large language models (LLMs) have been adopted for next point-of-interest (POI) recommendation tasks. Typical LLM-based recommenders fall into two categories: prompt-based and supervised fine-tuning (SFT)-based models. Prompt-based models generally offer greater output flexibility but deliver lower accuracy, whereas SFT-based models achieve higher performance yet face a fundamental mismatch: next POI recommendation data does not naturally suit supervised fine-tuning. In SFT, the model is trained to reproduce the exact ground truth, but each training example provides only a single target POI, so there is no ground truth for producing a top-k list. To address this, we propose Refine-POI, a reinforcement fine-tuning framework for next POI recommendation. We introduce recommendation-driven rewards that enable LLMs to learn to generate top-k recommendation lists using only one ground-truth POI per example. Experiments on real-world datasets demonstrate that Refine-POI achieves state-of-the-art top-k recommendation performance.

Subjects: Information Retrieval , Artificial Intelligence , Machine Learning

Publish: 2025-06-19 02:51:10 UTC

2506.21599

#1 Reinforcement Fine-Tuned Large Language Models for Next POI Recommendation [PDF7] [Copy] [Kimi3] [REL]

#1 Reinforcement Fine-Tuned Large Language Models for Next POI Recommendation [PDF⁷] [Copy] [Kimi³] [REL]