Vision-Language Pre-Trained Models: A Detailed Look at Flamingo, BLIP-2, LLaVA, and LLaVA-1.5

Uploaded By: Myvideo

Published on: 21 Dec 2023

1,269 views


Take a break from the pre-New Year rush and set aside one evening for learning: on 19 December at 20:00, VK Lab will host a seminar. Our intern Daniil Belopolskikh will talk about multimodal models, specifically Vision-Language Pre-Trained Models. We will take a detailed look at Flamingo, BLIP-2, LLaVA, and LLaVA-1.5. You will also learn:
- why making images and text work together is difficult;
- which datasets are needed to train such models;
- how to compare these models.
At the end of the seminar we will answer your questions. Join us!


