| title | Reinforcement Learning | |
|---|---|---|
| aliases |
|
|
| cssclasses |
|
RL as a subfield: MDPs, value/policy iteration, Q-learning, policy gradients, deep RL, imitation, and game-playing agents.
45 documents.
- MIT 6.S191 (2018): Deep Reinforcement Learning · 🎓 lecture · intro
- MIT 6.S191 (2019): Deep Reinforcement Learning · 🎓 lecture · intro
- Proximal Policy Optimization Algorithms · 📄 paper · advanced
- π₀: A Vision-Language-Action Flow Model for General Robot Control · 📄 paper · frontier
- A Comprehensive Survey of Direct Preference Optimization: Datasets, Theories, Variants, and Applications · 📄 paper · advanced
- Gemini Robotics: Bringing AI into the Physical World · 📄 paper · frontier
TABLE WITHOUT ID
link(file.link, default(title, file.name)) AS Document,
default(source, "") AS Type,
default(published, "") AS Date
FROM #topic/reinforcement-learning and -"atlas"
SORT level ASC, published ASC
(The list above renders in Obsidian with the Dataview plugin. On GitHub, browse Start here or the full index.)
Alignment, RLHF & Preference Tuning