Dreamer v2: Mastering Atari with Discrete World Models (Machine Learning Research Paper Explained)

Uploaded By: Myvideo

Published on 19 Feb 2021


#dreamer #deeprl #reinforcementlearning Model-based reinforcement learning has lagged behind model-free RL on Atari, especially among single-GPU algorithms. This collaboration between Google AI, DeepMind, and the University of Toronto (UofT) pushes world models to the next level. The main contribution is a learned latent state consisting of one deterministic part and one stochastic part, where the stochastic part is a set of 32 categorical variables, each with 32 possible values. The world model is free to decide how to use these variables to represent the input, but it is trained to predict future observations and rewards. This procedure gives rise to an informative latent representation. In a second step, reinforcement learning (an actor-critic method, A2C) can then be done purely, and very efficiently, on the basis of the world model's latent states. No observations needed! The paper combines this with straight-through estimators, KL balancing, and many other tricks to achieve state-of-the-art s...
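
To make the shape of the stochastic latent concrete, here is a minimal sketch in plain Python of sampling 32 categorical variables with 32 classes each and encoding them as one-hot vectors. The function names, the random logits, and the seed are illustrative assumptions; this is not the paper's implementation, where the logits would come from a trained network.

```python
import math
import random

NUM_VARS = 32      # categorical variables in the stochastic latent (from the description)
NUM_CLASSES = 32   # possible values per variable (from the description)


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sample_latent(logits, rng):
    """Sample one class per variable; return (class indices, one-hot rows)."""
    indices, one_hots = [], []
    for row in logits:
        probs = softmax(row)
        k = rng.choices(range(len(row)), weights=probs, k=1)[0]
        indices.append(k)
        one_hots.append([1.0 if j == k else 0.0 for j in range(len(row))])
    return indices, one_hots


rng = random.Random(0)
# Hypothetical logits (assumption): stand-ins for the world model's output.
logits = [[rng.gauss(0.0, 1.0) for _ in range(NUM_CLASSES)]
          for _ in range(NUM_VARS)]
indices, one_hots = sample_latent(logits, rng)

# Flattened one-hot code: 32 * 32 = 1024 values fed to downstream networks.
flat = [v for row in one_hots for v in row]

# Number of distinct latent codes: 32**32, which equals 2**160.
num_configs = NUM_CLASSES ** NUM_VARS
```

Sampling a class index is not differentiable, which is where the straight-through estimator mentioned above comes in. In an autodiff framework it is commonly written as `sample + probs - stop_gradient(probs)`, so the forward pass yields the one-hot sample while gradients flow through `probs`; the plain-Python sketch above only covers the forward sampling step.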


