Control Algorithm: PMTG (CPG SAC)
Solved in 696 episodes
Average reward over 100 episodes:
Solving requirement: average reward greater than 300
RL Library: Stable Baselines 3
Environment: OpenAI BipedalWalker
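The solving requirement can be checked with a short script. The following is a minimal sketch, assuming the criterion is a rolling mean over the most recent 100 episode rewards that must be strictly greater than 300. The function name `first_solved_episode` and its parameters are hypothetical and are not taken from this project or from Stable Baselines 3.

```python
from collections import deque
from typing import Iterable, Optional


def first_solved_episode(
    rewards: Iterable[float],
    threshold: float = 300.0,  # "average reward greater than 300"
    window: int = 100,         # averaged over 100 episodes
) -> Optional[int]:
    """Return the 1-based episode at which the rolling mean first exceeds
    the threshold, or None if that never happens.

    Assumption: the mean is taken over the last `window` episodes and the
    comparison is strict (>), matching "greater than 300".
    """
    recent = deque(maxlen=window)
    running_sum = 0.0
    for episode, reward in enumerate(rewards, start=1):
        if len(recent) == window:
            # The oldest reward is about to be evicted from the window.
            running_sum -= recent[0]
        recent.append(reward)
        running_sum += reward
        # Only evaluate once a full window of episodes is available.
        if len(recent) == window and running_sum / window > threshold:
            return episode
    return None
```

Passing the list of per-episode returns logged during training gives the episode count at which the run would count as solved under these assumptions.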