Myvideo

Guest

Login

Training a CUDA TDS Ant using C++ ARS Linear policy.

Uploaded By: Myvideo
1 view
0
0 votes
0

Training the Ant, running the full simulation on CUDA at 1 Million steps per second on an NVIDIA RTX 2080, using the Tiny Differentiable Simulator and a C Augmented Random Search implementation. It is a linear policy, action dimension 8, observation dimention 28. Running 256 Ants in parallel. Source code is here: See also Laikago trained using the same tech:

Share with your friends

Link:

Embed:

Video Size:

Custom size:

x

Add to Playlist:

Favorites
My Playlist
Watch Later