This project tests different reinforcement learning algorithms on stochastic optimal control (SOC) problems. Specifically, the algorithms are used to solve a stochastic linear quadratic regulator (LQR) problem in two different scenarios.
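For orientation, here is a minimal sketch of the kind of reference solution a learned controller could be compared against. It is not taken from this project. It assumes a discrete-time problem with dynamics `x_{k+1} = A x_k + B u_k + w_k`, additive Gaussian noise `w_k`, and a quadratic cost. Under that assumption the optimal feedback gains are the same as in the noise-free case and follow from a backward Riccati recursion. The function names `finite_horizon_lqr` and `simulate_cost`, and all matrices below, are hypothetical and chosen for illustration only.

```python
import numpy as np


def finite_horizon_lqr(A, B, Q, R, Q_T, horizon):
    """Backward Riccati recursion for a finite-horizon LQR problem.

    Returns the list of feedback gains K_0..K_{T-1} (so u_k = -K_k x_k)
    and the cost-to-go matrix P_0 at the initial time.
    """
    P = Q_T
    gains = []
    for _ in range(horizon):
        # K = (R + B^T P B)^{-1} B^T P A
        K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
        # P <- Q + A^T P (A - B K)
        P = Q + A.T @ P @ (A - B @ K)
        gains.append(K)
    gains.reverse()  # the recursion runs backward in time
    return gains, P


def simulate_cost(A, B, Q, R, Q_T, gains, x0, noise_std, rng):
    """Roll out the linear feedback policy once and return the realized cost."""
    x = x0.copy()
    cost = 0.0
    for K in gains:
        u = -K @ x
        cost += x @ Q @ x + u @ R @ u
        # Additive Gaussian noise (an assumption of this sketch)
        x = A @ x + B @ u + noise_std * rng.standard_normal(x.shape)
    return cost + x @ Q_T @ x


# Illustrative double-integrator-like system (assumed values, not from the project)
A = np.array([[1.0, 0.1], [0.0, 1.0]])
B = np.array([[0.0], [0.1]])
Q = np.eye(2)
R = 0.1 * np.eye(1)
Q_T = np.eye(2)

gains, P0 = finite_horizon_lqr(A, B, Q, R, Q_T, horizon=50)
rng = np.random.default_rng(0)
x0 = np.array([1.0, 0.0])
print("one noisy rollout cost:", simulate_cost(A, B, Q, R, Q_T, gains, x0, 0.05, rng))
```

With `noise_std=0` the realized cost equals `x0 @ P0 @ x0`, which gives a quick sanity check before comparing against a reinforcement learning agent on the noisy problem.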
We just solve
where
Here we try to solve the continuous SOC problem
with
We aim to minimize