Beyond Offline A/B Testing: Context-Aware Agent Simulation for Recommender System Evaluation 文章

ArXiv CS.AI2026-06-02NEWSen作者: Nicolas Bougie, Gian Maria Marconi, Xiaotong Ye, Narimasa Watanabe

摘要

arXiv:2604.09549v2 Announce Type: replace-cross Abstract: Recommender systems are central to online services, enabling users to navigate through massive amounts of content across various domains. However, their evaluation remains challenging due to the disconnect between offline metrics and online performance. The emergence of Large Language Model-powered agents offers a promising solution, yet existing studies model users in isolation, neglecting the contextual factors such as time, location, and needs, which fundamentally shape human decision-making. In this paper, we introduce ContextSim, an LLM agent framework that simulates believable user proxies by anchoring interactions in daily life activities. Namely, a life simulation module generates scenarios specifying when, where, and why users engage with recommendations. To align preferences with genuine humans, we model agents' internal thoughts and enforce consistency at both the action and trajectory levels.

Beyond Offline A/B Testing: Context-Aware Agent Simulation for Recommender System Evaluation 文章

摘要

相关事件查看全部 (1)

相关公司

相关人物

相关产品查看全部 (1)

相关技术