Recent Posts

Week 6-7. Q learning - completed

1 minute read

The problem has been solved with the Q Learning algorithm parameterized with discrete action space of size 16 (4 bins for linear velocity and 4 for angular v...

Week 6-7. Q learning - changes

1 minute read

I am still working on how to improve the handling when the problem is multivariable (linear and angular velocity) by applying the Q Learning algorithm. The c...

Week 5-6. Simple Q learning - completed

less than 1 minute read

For today I wanted to show a video with a small demonstration of how the implemented algorithm has managed to solve the problem. It should be noted that the ...

Week 5-6. Simple Q learning - updated

less than 1 minute read

In the last post I published an implementation of the simplified Q learning algorithm for this problem in which I discretised both the space of observations ...