ε-Approximate Planning

Assumption: You have a box that takes an MDP as input and returns an (ε, T)-optimal policy.
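
As an illustration only, one way such a box could be realized for a small, fully known, finite MDP is exact T-step value iteration (backward induction). This sketch assumes that "T-optimal" refers to the expected undiscounted total reward over T steps, which the notes do not spell out. Under that reading the returned policy is exactly optimal, so it meets the requirement for any tolerance of zero or more. The function name plan_finite_horizon and the list-based encodings of P and R are hypothetical and do not come from the notes.

```python
def plan_finite_horizon(P, R, T):
    """Hypothetical planning box: exact T-step backward induction.

    P[a][s][s2] is the probability of moving from state s to s2 under action a.
    R[s][a] is the expected immediate reward for taking action a in state s.
    T is the horizon (number of steps).

    Returns (policy, V): policy[t][s] is the action to take at step t in
    state s, and V[s] is the optimal expected T-step total reward from s.
    """
    num_states = len(R)
    num_actions = len(R[0])
    V = [0.0] * num_states  # value with zero steps remaining
    policy = [[0] * num_states for _ in range(T)]
    for t in reversed(range(T)):
        new_V = [0.0] * num_states
        for s in range(num_states):
            best_a, best_q = 0, float("-inf")
            for a in range(num_actions):
                # One-step lookahead: reward now plus expected value afterwards.
                q = R[s][a] + sum(P[a][s][s2] * V[s2] for s2 in range(num_states))
                if q > best_q:  # ties go to the lowest-numbered action
                    best_a, best_q = a, q
            policy[t][s] = best_a
            new_V[s] = best_q
        V = new_V
    return policy, V


# Toy MDP (an assumption for illustration): two states, action 0 stays put,
# action 1 switches state, and being in state 1 pays reward 1 per step.
P = [
    [[1.0, 0.0], [0.0, 1.0]],  # action 0: stay
    [[0.0, 1.0], [1.0, 0.0]],  # action 1: switch
]
R = [
    [0.0, 0.0],  # state 0 pays nothing
    [1.0, 1.0],  # state 1 pays 1 regardless of action
]
policy, V = plan_finite_horizon(P, R, T=3)
```

In this toy MDP the plan switches out of state 0 at the first step and stays in state 1 thereafter. A box that is only approximately optimal would be allowed to return any policy whose T-step value is within the tolerance of these values.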