Toggle Menu

LogBook
- Check it out
Projects
- AI Exercises
Resources
- Usefull resources

Included random initial positions and random perturbations to cartpole problem (month 23)

less than 1 minute read

This month we focused on the two following goals:

Make cartpole learning stable enough making use of DQN, adaptative learning and replay buffers
Investigate about reward policies alternatives and some ways to mitigate catastrophic forgetting
Include monitorization to cartpole to know how solid is our solution (TODO include this monitorization as a common library in RL-Studio)
you can see more details about the projec status in cartpole project post

Twitter LinkedIn

You May Also Enjoy

Carla follow lane DDPG Vs PPO Vs SAC [July 3rd]

less than 1 minute read

Refine to excel town02 and bm metrics

Carla follow lane DDPG Vs PPO Vs SAC [July 2nd]

less than 1 minute read

Refine to excel town02 and bm metrics

Carla follow lane DDPG Vs PPO Vs SAC [July 1st]

less than 1 minute read

Refine PPO and compare in BM

Carla follow lane DDPG Vs PPO Vs SAC [June 3st]

less than 1 minute read

Refine behavior for intersections and evaluate agents