pith. sign in

arxiv: 1311.6355 · v1 · pith:KMYZ37BOnew · submitted 2013-11-06 · 💻 cs.MM · cs.IR· cs.LG

Exploration in Interactive Personalized Music Recommendation: A Reinforcement Learning Approach

classification 💻 cs.MM cs.IRcs.LG
keywords recommendationuserapproachmusicmodelpreferencesbayesiangreedy
0
0 comments X
read the original abstract

Current music recommender systems typically act in a greedy fashion by recommending songs with the highest user ratings. Greedy recommendation, however, is suboptimal over the long term: it does not actively gather information on user preferences and fails to recommend novel songs that are potentially interesting. A successful recommender system must balance the needs to explore user preferences and to exploit this information for recommendation. This paper presents a new approach to music recommendation by formulating this exploration-exploitation trade-off as a reinforcement learning task called the multi-armed bandit. To learn user preferences, it uses a Bayesian model, which accounts for both audio content and the novelty of recommendations. A piecewise-linear approximation to the model and a variational inference algorithm are employed to speed up Bayesian inference. One additional benefit of our approach is a single unified model for both music recommendation and playlist generation. Both simulation results and a user study indicate strong potential for the new approach.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.