Myvideo

Guest

Login

Игорь Котенков - RLHF Intro: from Zero to Aligned Intelligent Systems

Uploaded By: Myvideo
1 view
0
0 votes
0

- A story about Text Summarization - What the Alignment is, and what’s the problem? - How RLHF works - Data setup, and why we’d like to follow instructions - Reward Modeling and PPO - Why RLHF works (and when it doesn’t) - ChatGPT improvements - What’s next and what to expect? Data Fest 2023: Трек “Instruct Models“: Наши соц.сети: Telegram: Вконтакте:

Share with your friends

Link:

Embed:

Video Size:

Custom size:

x

Add to Playlist:

Favorites
My Playlist
Watch Later