Introduction - If you have any usage issues, please Google them yourself
This an example presentting how to apply value-iteration algorithm,policy-iteration algorithm and reinforcement learning algorithm to MDP model, which aims to solve the multi-period newsboy problem.