Attention is all you need (Transformer) - Model explanation (including math), Inference and Training

Uploaded By: Myvideo
A complete explanation of all the layers of a Transformer model: Multi-Head Self-Attention, Positional Encoding, and more, including all the matrix multiplications and a full description of the training and inference process. Slides PDF:

Chapters:
00:00 - Intro
01:10 - RNNs and their problems
08:04 - Transformer Model
09:02 - Maths background and notations
12:20 - Encoder (overview)
12:31 - Input Embeddings
15:04 - Positional Encoding
20:08 - Single Head Self-Attention
28:30 - Multi-Head Attention
35:39 - Query, Key, Value
37:55 - Layer Normalization
40:13 - Decoder (overview)
42:24 - Masked Multi-Head Attention
44:59 - Training
52:09 - Inference
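The self-attention the video describes can be sketched in a few lines. This is a minimal illustrative implementation (not taken from the video's slides), assuming NumPy, a single attention head, and inputs arranged as (seq_len, d_k) matrices:

```python
import numpy as np

def scaled_dot_product_attention(q, k, v, mask=None):
    """Single-head attention: softmax(Q K^T / sqrt(d_k)) V."""
    d_k = q.shape[-1]
    scores = q @ k.T / np.sqrt(d_k)          # (seq_len, seq_len) similarity matrix
    if mask is not None:
        scores = np.where(mask, scores, -1e9)  # masked positions get ~ -inf before softmax
    # Numerically stable row-wise softmax
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights = weights / weights.sum(axis=-1, keepdims=True)
    return weights @ v, weights

# Toy example: 4 positions, d_k = 8
rng = np.random.default_rng(0)
q = rng.normal(size=(4, 8))
k = rng.normal(size=(4, 8))
v = rng.normal(size=(4, 8))
out, w = scaled_dot_product_attention(q, k, v)
print(out.shape)                              # (4, 8)
print(np.allclose(w.sum(axis=-1), 1.0))       # True: each row of weights sums to 1
```

Passing a lower-triangular boolean `mask` reproduces the masked (causal) attention used in the decoder, where each position may only attend to itself and earlier positions.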
