Finite-Sample analysis of reinforcement learning
Contraction mapping
Q-learning
2022-11-16 Rui Gao– Optimal robust policy for feature-based newsvendor
2022-11-17 Yinyu Ye– Online linear programming: applications and extensions
Multi-armed bandit problem
2022-11-14 Min Dai– bitcoin mining and electricity consumption