(Im)practical Reinforcement Learning Theory
John Langford
basic setting
direct experience
reset model
generative model
precise description
full table