Reinforcement Learning

RL as a subfield: MDPs, value/policy iteration, Q-learning, policy gradients, deep RL, imitation, and game-playing agents.

45 documents.

Start here

MIT 6.S191 (2018): Deep Reinforcement Learning · 🎓 lecture · intro
MIT 6.S191 (2019): Deep Reinforcement Learning · 🎓 lecture · intro
Proximal Policy Optimization Algorithms · 📄 paper · advanced
π₀: A Vision-Language-Action Flow Model for General Robot Control · 📄 paper · frontier
A Comprehensive Survey of Direct Preference Optimization: Datasets, Theories, Variants, and Applications · 📄 paper · advanced
Gemini Robotics: Bringing AI into the Physical World · 📄 paper · frontier

All documents

TABLE WITHOUT ID
  link(file.link, default(title, file.name)) AS Document,
  default(source, "") AS Type,
  default(published, "") AS Date
FROM #topic/reinforcement-learning and -"atlas"
SORT level ASC, published ASC

(The list above renders in Obsidian with the Dataview plugin. On GitHub, browse Start here or the full index.)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reinforcement Learning

Start here

All documents

Related topics

FilesExpand file tree

reinforcement-learning.md

Latest commit

History

reinforcement-learning.md

File metadata and controls

Reinforcement Learning

Start here

All documents

Related topics