Pinned Loading
Repositories
Showing 7 of 7 repositories
- strands-env Public
Standardizing environment infrastructure with Strands Agents — step, observe, reward.
horizon-rl/strands-env’s past year of commit activity - HeaPA Public
Difficulty-Aware Heap Sampling and On-Policy Query Augmentation for LLM Reinforcement Learning
horizon-rl/HeaPA’s past year of commit activity - DeepPlanner Public Forked from AlexFanw/DeepPlanner
Code and dataset for paper: DeepPlanner: Scaling Planning Capability for Deep Research Agents via Advantage Shaping
horizon-rl/DeepPlanner’s past year of commit activity - Think-RM Public
[NeurIPS 2025] Think-RM: Enabling Long-Horizon Reasoning in Generative Reward Models
horizon-rl/Think-RM’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…