This document summarizes a presentation on personalized list recommendations using multi-armed bandit algorithms. It proposes a generalized linear model approach (GL-CDCM) that estimates item attraction weights based on user context and balances exploration and exploitation to maximize rewards. Theoretical analysis shows GL-CDCM achieves sub-linear regret bounds, and experiments demonstrate it outperforms existing methods on synthetic and real-world movie recommendation datasets. Future work could include tighter analysis, other click models, and more evaluation.