Comparison snippet - Montezuma's Revenge

Uploaded By: Myvideo

Demonstration video for our paper “Playing hard exploration games by watching YouTube videos”. The sequence on the left is the expert video used for imitation; on the right is our learnt policy. While our agent follows the path taken by the expert, our method still allows the RL agent to optimize low-level skills, such as the timing of jumps.
