Reinforcement Learning to Optimize Long-term User Engagement in Recommender Systems 论文

2019引用 237
Recommender Systems and TechniquesAdvanced Bandit Algorithms ResearchStochastic Gradient Optimization Techniques

Reinforcement Learning to Optimize Long-term User Engagement in Recommender Systems · 相关技术