Releases: frgfm/drlnd-p1-navigation
Releases · frgfm/drlnd-p1-navigation
Fixed Q-target with experience replay
Basic implementation of DQN to solve the banana collection environment within 300 episodes.
Basic implementation of DQN to solve the banana collection environment within 300 episodes.