Introduction
This code implements stability control of a three-dimensional inverted pendulum mounted on a vehicle, using three approaches: reinforcement-learning adaptive dynamic programming (with both value iteration and policy iteration), neural network control, and LQR (linear quadratic regulator) optimal state-feedback control. The controllers are highly robust, as shown by experiments with Gaussian white noise disturbances.
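To illustrate how value iteration and policy iteration both relate to the LQR solution, here is a minimal sketch in Python. It is not the package's code: the plant is a hypothetical planar, linearized, Euler-discretized pendulum (two states, not the three-dimensional vehicle model), and the weights Q and R, the step dt, the initial gain K0, and the noise level are all assumed values chosen only for illustration.

```python
import numpy as np

# Hypothetical stand-in plant (assumption, not from the package):
# state x = [angle, angular rate], Euler-discretized, unstable open loop.
dt = 0.05
A = np.array([[1.0, dt],
              [9.81 * dt, 1.0]])
B = np.array([[0.0],
              [dt]])
Q = np.eye(2)            # assumed state weight
R = np.array([[1.0]])    # assumed input weight


def gain_from(P):
    """Greedy LQR gain K (u = -K x) for a cost-to-go matrix P."""
    return np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)


def value_iteration(iters=20000, tol=1e-9):
    """Riccati recursion started from P = 0."""
    P = np.zeros_like(Q)
    for _ in range(iters):
        K = gain_from(P)
        P_next = Q + A.T @ P @ (A - B @ K)
        done = np.max(np.abs(P_next - P)) < tol
        P = P_next
        if done:
            break
    return P, gain_from(P)


def evaluate_policy(K):
    """Solve P = Q + K'RK + Acl' P Acl for a stabilizing gain K."""
    Acl = A - B @ K
    M = Q + K.T @ R @ K
    n = A.shape[0]
    lhs = np.eye(n * n) - np.kron(Acl.T, Acl.T)
    return np.linalg.solve(lhs, M.reshape(-1)).reshape(n, n)


def policy_iteration(K0, iters=50, tol=1e-9):
    """Alternate policy evaluation and greedy improvement.

    K0 must stabilize the plant, otherwise evaluation is meaningless.
    """
    K = K0
    P = evaluate_policy(K)
    for _ in range(iters):
        K_next = gain_from(P)
        converged = np.max(np.abs(K_next - K)) < tol
        K = K_next
        P = evaluate_policy(K)
        if converged:
            break
    return P, K


def simulate(K, steps=400, noise_std=0.02, seed=0):
    """Closed-loop rollout with additive Gaussian white noise on the state."""
    rng = np.random.default_rng(seed)
    x = np.array([0.1, 0.0])
    for _ in range(steps):
        u = -K @ x
        x = A @ x + B @ u + rng.normal(0.0, noise_std, size=2)
    return x


P_vi, K_vi = value_iteration()
P_pi, K_pi = policy_iteration(K0=np.array([[20.0, 5.0]]))  # assumed stabilizing start
rho = max(abs(np.linalg.eigvals(A - B @ K_pi)))            # closed-loop spectral radius
x_final = simulate(K_pi)
print("K (value iteration): ", K_vi)
print("K (policy iteration):", K_pi)
print("closed-loop spectral radius:", rho)
```

On a linear plant with quadratic cost, both iterations should converge to the same gain, which is the LQR gain. Policy iteration needs a stabilizing initial gain but typically converges in far fewer steps, while value iteration can start from zero.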