Parameter Prediction for Unseen Deep Architectures (w/ First Author Boris Knyazev)

Uploaded By: Myvideo

Published on

25 Nov 2021

6 views

0

0 votes

0

About Share Download Add to

#deeplearning #neuralarchitecturesearch #metalearning Deep Neural Networks are usually trained from a given parameter initialization using SGD until convergence at a local optimum. This paper goes a different route: Given a novel network architecture for a known dataset, can we predict the final network parameters without ever training them? The authors build a Graph-Hypernetwork and train on a novel dataset of various DNN-architectures to predict high-performing weights. The results show that not only can the GHN predict weights with non-trivial performance, but it can also generalize beyond the distribution of training architectures to predict weights for networks that are much larger, deeper, or wider than ever seen in training. OUTLINE: 0:00 - Intro & Overview 6:20 - DeepNets-1M Dataset 13:25 - How to train the Hypernetwork 17:30 - Recap on Graph Neural Networks 23:40 - Message Passing mirrors forward and backward propagation 25:20 - How to deal with different output shapes 28:45 - Differentiable Normal

Share with your friends

Link:

Embed:

<iframe width="640" height="360" src="//myvideo.cc/embed/YjVTRnpoVUw2U3Ftekl2bnp6WTBuQTJhZnROcklsRWYyRDFSVHVnU1hUdz0" frameborder="0" webkitallowfullscreen mozallowfullscreen allowfullscreen></iframe>

Video Size:

Custom size:

x

Autoplay video

Hide player controls

Hide resume playing

Add to Playlist:

Favorites

My Playlist

Watch Later

Самое важное для костей l Остеопороз - Лечение l Минералы и Витамины l Osteoporosis - Treatment

11 months ago

00:20:05

Самое важное для костей l Остеопороз - Лечение l Минералы и Витамины l Osteoporosis - Treatment

1 35%

How Milankovitch Cycles Are Causing Earths Climate To Change

11 months ago

00:14:18

How Milankovitch Cycles Are Causing Earths Climate To Change

0 8%

Allen Bradley MEASUREMENT MODULE 1440 SPD02 01RB

11 months ago

00:00:13

Allen Bradley MEASUREMENT MODULE 1440 SPD02 01RB

1 88%

82. Central Limit Theorem.

1 year ago

00:04:20

82. Central Limit Theorem.

0 30%

What Do Neural Networks Really Learn Exploring the Brain of an AI Model

1 year ago

00:17:35

What Do Neural Networks Really Learn Exploring the Brain of an AI Model

0 30%

Is Lottery Defeater a Scam (SCAM) LOTTERY DEFEATER Lottery Defeated Lottery Defeater Reviews

1 year ago

00:09:37

Is Lottery Defeater a Scam (SCAM) LOTTERY DEFEATER Lottery Defeated Lottery Defeater Reviews

0 9%

Digitakt 2 - Beginner's MEGA TUTORIAL

1 year ago

01:39:49

Digitakt 2 - Beginner's MEGA TUTORIAL

0 75%

How to Understand What Black Holes Look Like

1 year ago

00:09:19

How to Understand What Black Holes Look Like

2 70%

Как похудеть на интуитивном питании Что можно есть и что нельзя есть на интуитивном питании

1 year ago

00:18:28

Как похудеть на интуитивном питании Что можно есть и что нельзя есть на интуитивном питании

1 48%

Confidence Interval

1 year ago

00:03:11

Confidence Interval

0 80%

Deep Video Portraits - SIGGRAPH 2018

2 years ago

00:07:05

Deep Video Portraits - SIGGRAPH 2018

0 5%

The Science & Health Benefits of Deliberate Heat Exposure | Huberman Lab Podcast #69

2 years ago

01:53:11

The Science & Health Benefits of Deliberate Heat Exposure | Huberman Lab Podcast #69

0 36%

How to Tune a PID Controller

2 years ago

00:08:43

How to Tune a PID Controller

0 88%

Parseq tutorial 1: Fine grained control of Stable Diffusion and Deforum Keyframing & Interpolation

2 years ago

00:18:43

Parseq tutorial 1: Fine grained control of Stable Diffusion and Deforum Keyframing & Interpolation

0 82%

Comet Nishimura May Trigger a Meteor Shower at the End of 2023

2 years ago

00:03:48

Comet Nishimura May Trigger a Meteor Shower at the End of 2023

0 29%

Hands on Lumped Parameter Models with Workshop | JuliaCon 2023

2 years ago

02:52:27

Hands on Lumped Parameter Models with Workshop | JuliaCon 2023

1 44%

Cholesterol z diety Ci nie szkodzi

2 years ago

02:00:48

Cholesterol z diety Ci nie szkodzi

0 52%

Carl Zeiss Jenoptem 10x50 | Binoculars | Fernglas | бинокль

2 years ago

00:04:00

Carl Zeiss Jenoptem 10x50 | Binoculars | Fernglas | бинокль

0 50%

IC-CAP Complete Measurement + Multi-Device Platform Innovations 2022 - Overview (Part 1)

2 years ago

00:06:51

IC-CAP Complete Measurement + Multi-Device Platform Innovations 2022 - Overview (Part 1)

0 43%

The Importance of Monitor Color Calibration

2 years ago

00:09:47

The Importance of Monitor Color Calibration

0 89%

Zapomnij o cakowitym cholesterolu i LDL: 3 nowoczesne markery ryzyka chorb sercowo-naczyniowych

2 years ago

00:30:03

Zapomnij o cakowitym cholesterolu i LDL: 3 nowoczesne markery ryzyka chorb sercowo-naczyniowych

0 42%

Bayes' Rule: False Positive Paradox

2 years ago

00:06:42

Bayes' Rule: False Positive Paradox

0 77%

Transient E/T co-simulation | Celsius Thermal Solver

2 years ago

00:02:30

Transient E/T co-simulation | Celsius Thermal Solver

0 70%

Unsupervised Brain Models - How does Deep Learning inform Neuroscience (w/ Patrick Mineault)

2 years ago

01:21:28

Unsupervised Brain Models - How does Deep Learning inform Neuroscience (w/ Patrick Mineault)

6 30%

0 Comments

Guest