(Im)practical Reinforcement Learning Theory

John Langford

basic setting
direct experience
reset model
generative model
precise description
full table