title: Reinforcement Learning
aliases: Reinforcement Learning
cssclasses: moc

Reinforcement Learning

RL as a subfield: MDPs, value/policy iteration, Q-learning, policy gradients, deep RL, imitation, and game-playing agents.

45 documents.

Start here

  1. MIT 6.S191 (2018): Deep Reinforcement Learning · 🎓 lecture · intro
  2. MIT 6.S191 (2019): Deep Reinforcement Learning · 🎓 lecture · intro
  3. Proximal Policy Optimization Algorithms · 📄 paper · advanced
  4. π₀: A Vision-Language-Action Flow Model for General Robot Control · 📄 paper · frontier
  5. A Comprehensive Survey of Direct Preference Optimization: Datasets, Theories, Variants, and Applications · 📄 paper · advanced
  6. Gemini Robotics: Bringing AI into the Physical World · 📄 paper · frontier

All documents

TABLE WITHOUT ID
  link(file.link, default(title, file.name)) AS Document,
  default(source, "") AS Type,
  default(published, "") AS Date
FROM #topic/reinforcement-learning and -"atlas"
SORT level ASC, published ASC

(The list above renders in Obsidian with the Dataview plugin. On GitHub, browse Start here or the full index.)

Related topics

Alignment, RLHF & Preference Tuning


← Atlas home