A Google TechTalk, presented by Neel Nanda, 2023/06/20 Google Algorithms Seminar - ABSTRACT: Mechanistic Interpretability is the study of reverse engineering the learned algorithms in a trained neural network, in the hope of applying this understanding to make powerful systems safer and more steerable. In this talk Neel will give an overview of the field, summarise some key works, and outline what he sees as the most promising areas of future work and open problems. This will touch on techniques in causal abstraction and mediation analysis, understanding superposition and distributed representations, model editing, and studying individual circuits and neurons. About the Speaker: Neel works on the mechanistic interpretability team at Google DeepMind. He previously worked with Chris Olah at Anthropic on the transformer circuits agenda, and has done independent work on reverse-engineering modular addition and using this to understand grokking.