docs: fix PPO title, variable typo, baselines URL, and PQN doc URL #549

Merged
pseudo-rnd-thoughts merged 1 commit into vwxyzjn:master from mturan33:docs/fix-ppo-typos-and-pqn-url
Apr 20, 2026

@mturan33
Contributor

Description

Fixes four small documentation issues:

  1. Page title typo on docs/rl-algorithms/ppo.md:1: # Proximal Policy Gradient (PPO) → # Proximal Policy Optimization (PPO). The original paper (Schulman et al., 2017, arXiv:1707.06347) is titled "Proximal Policy Optimization Algorithms", and this is how PPO is universally referred to. The same page already uses the correct name on line 11. Note that PPG (Phasic Policy Gradient) is a distinct, unrelated algorithm with its own page at docs/rl-algorithms/ppg.md. The same typo also occurs in README.md, docs/rl-algorithms/overview.md, docs/rl-algorithms/ppo-rnd.md, and docs/blog/posts/cleanrl-v1.md.

  2. Variable-name typo olad_approx_kl → old_approx_kl in three docs (ppo.md, ppg.md, ppo-trxl.md). Matches the variable name used throughout the code (e.g. cleanrl/ppo.py).

  3. Repo-name typo openai/baselies → openai/baselines (8 occurrences in docs/rl-algorithms/ppo.md).

  4. Broken URL in docs/rl-algorithms/pqn.md: link to https://iclr-blog-track.github.io/2022/03/25/pqn-implementation-details/ currently 404s. Corrected to .../ppo-implementation-details/, matching the 11+ other references in the repo to the same blog post (The 37 Implementation Details of PPO by Huang et al., 2022).
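
For reference, the four substitutions listed above can be sketched as a small string-rewriting helper. This is only an illustration, not the script used for this PR: the names REPLACEMENTS and apply_fixes are hypothetical, and the sample string is made up. Blindly replacing "Proximal Policy Gradient" is assumed to be safe here because PPG is spelled "Phasic Policy Gradient" and does not match.

```python
# Hypothetical helper, not part of this PR: typo -> correction pairs taken
# from the four items in the description above.
REPLACEMENTS = {
    "Proximal Policy Gradient": "Proximal Policy Optimization",
    "olad_approx_kl": "old_approx_kl",
    "openai/baselies": "openai/baselines",
    "pqn-implementation-details": "ppo-implementation-details",
}


def apply_fixes(text):
    """Return (corrected_text, counts), where counts maps each typo to hits."""
    counts = {}
    for wrong, right in REPLACEMENTS.items():
        counts[wrong] = text.count(wrong)  # occurrences found before replacing
        text = text.replace(wrong, right)
    return text, counts


# Made-up sample input, for illustration only.
sample = "# Proximal Policy Gradient (PPO)\nolad_approx_kl, see openai/baselies\n"
fixed, counts = apply_fixes(sample)
print(fixed)
print(counts)
```

Running the helper a second time on its own output changes nothing, since none of the corrected strings contains any of the typos.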

Types of changes

  • Documentation

Checklist:

  • I've read the CONTRIBUTION guide (required).
  • I have ensured pre-commit run --all-files passes (required).
  • No behavioral changes (docs-only, not performance-impacting).

@vercel

vercel Bot commented Apr 17, 2026

@mturan33 is attempting to deploy a commit to the Costa Huang's projects Team on Vercel.

A member of the Team first needs to authorize it.

@pseudo-rnd-thoughts pseudo-rnd-thoughts merged commit fe8d8a0 into vwxyzjn:master Apr 20, 2026
1 of 2 checks passed
@mturan33 mturan33 deleted the docs/fix-ppo-typos-and-pqn-url branch April 20, 2026 17:04