Skip to content
Change the repository type filter

All

    Repositories list

    • An end-to-end open ecosystem for robot learning
      Python
      Other
      49000Updated Jun 15, 2026Jun 15, 2026
    • Learning Safety Constraints for Large Language Models (ICML2025)
      Python
      63500Updated May 25, 2026May 25, 2026
    • A collection of algorithms and experiment tools for safe sim to real transfer in robotics.
      Python
      MIT License
      82700Updated May 19, 2026May 19, 2026
    • ombrl

      Public
      Python
      Other
      31111Updated May 8, 2026May 8, 2026
    • Code for the paper "ActiveUltraFeedback: Efficient Preference Data Generation using Active Learning"
      Python
      Apache License 2.0
      11010Updated May 2, 2026May 2, 2026
    • rewarduq

      Public
      Code for "RewardUQ: A Unified Framework for Uncertainty-Aware Reward Models"
      Python
      Apache License 2.0
      11701Updated Apr 21, 2026Apr 21, 2026
    • Aligning Language Models from User Interactions via Self-Distillation
      Python
      Apache License 2.0
      42310Updated Mar 31, 2026Mar 31, 2026
    • rlhf

      Public
      JavaScript
      0000Updated Mar 25, 2026Mar 25, 2026
    • SDPO

      Public
      Reinforcement Learning via Self-Distillation (SDPO)
      Python
      Apache License 2.0
      10795481Updated Feb 18, 2026Feb 18, 2026
    • fork from https://github.com/Physical-Intelligence/openpi
      Python
      Apache License 2.0
      9000Updated Jan 28, 2026Jan 28, 2026
    • Official implementation for pi0 steering via DSRL, Steering Your Diffusion Policy with Latent Space Reinforcement Learning (CoRL 2025)
      Python
      46100Updated Jan 23, 2026Jan 23, 2026
    • Python
      0000Updated Dec 28, 2025Dec 28, 2025
    • Repository for doing model based RL
      Python
      MIT License
      1800Updated Sep 9, 2025Sep 9, 2025
    • fork
      Jupyter Notebook
      MIT License
      421000Updated Apr 29, 2025Apr 29, 2025
    • LITE

      Public
      LITE: Efficiently Estimating Gaussian Probability of Maximality
      Python
      1500Updated Feb 26, 2025Feb 26, 2025
    • opax

      Public
      Python
      41900Updated Jan 9, 2025Jan 9, 2025
    • Transferring inductive bias / prior knowledge from domain specific simulations and models
      Python
      MIT License
      1300Updated Dec 26, 2024Dec 26, 2024
    • MaxMinLCB

      Public
      Code for our paper "Bandits with Preference Feedback: A Stackelberg Game Perspective"
      Python
      Apache License 2.0
      1400Updated Dec 17, 2024Dec 17, 2024
    • Model based policy optimizers
      Python
      MIT License
      1600Updated Nov 15, 2024Nov 15, 2024
    • gosafeopt

      Public
      Globally Safe Model-free Exploration of Dynamical Systems
      Python
      GNU General Public License v3.0
      63300Updated Nov 11, 2024Nov 11, 2024
    • HPGD

      Public
      Python
      MIT License
      1400Updated Nov 1, 2024Nov 1, 2024
    • Python
      MIT License
      5500Updated Oct 30, 2024Oct 30, 2024
    • Python
      MIT License
      11200Updated Oct 15, 2024Oct 15, 2024
    • A Safety-Gym based benchmark suite for safe meta RL
      Python
      MIT License
      1100Updated Sep 26, 2024Sep 26, 2024
    • jax-cpo

      Public
      Implementation of Constrained Policy Optimization with JAX
      Python
      MIT License
      0400Updated Aug 29, 2024Aug 29, 2024
    • Implementation of adaptive constrained RL algorithms. Child repository of @lasgroup/safe-adaptation-gym
      Python
      MIT License
      1200Updated Jul 22, 2024Jul 22, 2024
    • TaCoS

      Public
      Python
      1300Updated May 22, 2024May 22, 2024
    • analysis, preparation and reporting for streptavidin design using active learning
      Jupyter Notebook
      0200Updated May 21, 2024May 21, 2024
    • cocorl

      Public
      Code for Convex Constraint Learning for RL
      Python
      1700Updated Feb 28, 2024Feb 28, 2024
    • ALEXP

      Public
      Simultaneous Online Optimization and Model Selection, based on our paper "Anytime Model Selection for Linear Bandits"
      Python
      MIT License
      0300Updated Nov 24, 2023Nov 24, 2023
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.