Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning
Run more RL experiments. Wait less for GPUs.
[CVPR 2026] Official Code for "ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning"
Claw-R1: Empowering OpenClaw with Advanced Agentic RL.
[ACL 2026 Findings] Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization
DART-GUI: Efficient Multi-turn RL for GUI Agents via Decoupled Training and Adaptive Data Curation
Curated, opinionated index of post-R1 LLM × Reinforcement Learning work. Deep-dive blog posts cross-linked to the papers they cover — GRPO, DAPO, DPO, PPO, RLHF, GSPO, CISPO, VAPO, Reward Modeling, MoE RL stability, Verifier-Free RL, Training-Free RL, Agentic RL, DeepSeek-R1 reproduction.
Proximity-based Multi-turn Optimization (ProxMO) - Official Implementation
SGLang model provider for Strands Agents, enabling on-policy agentic RL training.
[ACL 2026] AlphaQuanter: An End-to-End Tool-Orchestrated Agentic Reinforcement Learning Framework for Stock Trading.
Standardizing environment infrastructure with Strands Agents — step, observe, reward.
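The step-observe-reward loop named above is the standard agentic RL environment contract. A minimal, library-agnostic sketch of that contract (all names here are illustrative placeholders, not the Strands Agents API):

```python
from dataclasses import dataclass


@dataclass
class StepResult:
    """What the environment returns after each agent action."""
    observation: str
    reward: float
    done: bool


class ToyEnv:
    """Hypothetical text environment: the agent succeeds by saying 'done'."""

    def __init__(self, max_turns: int = 3):
        self.max_turns = max_turns
        self.turn = 0

    def reset(self) -> str:
        # Start a fresh episode and return the initial observation.
        self.turn = 0
        return "Say 'done' to finish."

    def step(self, action: str) -> StepResult:
        # One turn: consume the agent's action, emit observation + reward.
        self.turn += 1
        success = action.strip().lower() == "done"
        out_of_turns = self.turn >= self.max_turns
        return StepResult(
            observation="finished" if success else "try again",
            reward=1.0 if success else 0.0,
            done=success or out_of_turns,
        )


# Roll out a trivial fixed policy against the environment.
env = ToyEnv()
obs = env.reset()
result = env.step("done")
print(result.reward)  # 1.0
```

Real frameworks add batching, tool-call parsing, and trajectory logging on top of this loop, but the step/observe/reward surface stays the same.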
Official repository for our paper "Doctor-R1: Mastering Clinical Inquiry with Experiential Agentic Reinforcement Learning", accepted at ICLR 2026.
Official implementation for paper "Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe"
Official code for the paper "MolAct: An Agentic RL Framework for Molecular Editing and Property Optimization".
An open-source, hands-on curriculum bridging the gap from basic RL concepts to LLM alignment, RLVR, and advanced Agentic systems.
Train and customize OpenClaw agents using reinforcement learning with simple language feedback and fully asynchronous optimization.