Skip to content
@sunblaze-ucb

sunblaze-ucb

Popular repositories Loading

  1. Intuitor Intuitor Public

    Code for the paper: "Learning to Reason without External Rewards"

    Python 369 41

  2. rl-generalization rl-generalization Public

    Modifiable OpenAI Gym environments for studying generalization in RL

    Python 87 14

  3. cybergym cybergym Public

    CyberGym is a large-scale, high-quality cybersecurity evaluation framework designed to rigorously assess the capabilities of AI agents on real-world vulnerability analysis tasks.

    Python 85 15

  4. dpml-benchmark dpml-benchmark Public

    This repository contains the codes for first large-scale investigation of Differentially Private Convex Optimization algorithms.

    Python 63 18

  5. blackbox-attacks blackbox-attacks Public

    Code used in 'Exploring the Space of Black-box Attacks on Deep Neural Networks' (https://arxiv.org/abs/1712.09491)

    Python 61 13

  6. Virgo Virgo Public

    C++ 60 17

Repositories

Showing 10 of 49 repositories
  • cybergym-page Public
    sunblaze-ucb/cybergym-page’s past year of commit activity
    HTML 0 0 0 0 Updated Oct 26, 2025
  • VMDT-page Public
    sunblaze-ucb/VMDT-page’s past year of commit activity
    JavaScript 0 0 0 0 Updated Oct 24, 2025
  • mirage-bench Public
    sunblaze-ucb/mirage-bench’s past year of commit activity
    Python 6 Apache-2.0 1 0 0 Updated Oct 21, 2025
  • rl-grok-recipe Public

    Code repository for "RL Grokking Recipe: How RL Unlocks and Transfers New Algorithms in LLMs""

    sunblaze-ucb/rl-grok-recipe’s past year of commit activity
    Python 20 0 1 0 Updated Oct 12, 2025
  • cybergym Public

    CyberGym is a large-scale, high-quality cybersecurity evaluation framework designed to rigorously assess the capabilities of AI agents on real-world vulnerability analysis tasks.

    sunblaze-ucb/cybergym’s past year of commit activity
    Python 85 Apache-2.0 15 0 1 Updated Oct 8, 2025
  • AgentSynth Public

    AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents

    sunblaze-ucb/AgentSynth’s past year of commit activity
    Python 33 Apache-2.0 2 2 0 Updated Oct 7, 2025
  • sunblaze-ucb/awesome-RLVR-boundary’s past year of commit activity
    1 0 0 1 Updated Oct 6, 2025
  • VMDT Public
    sunblaze-ucb/VMDT’s past year of commit activity
    Python 0 Apache-2.0 0 0 0 Updated Oct 2, 2025
  • verina Public

    Verina (Verifiable Code Generation Arena) is a high-quality benchmark enabling a comprehensive and modular evaluation of code, specification, and proof generation as well as their compositions.

    sunblaze-ucb/verina’s past year of commit activity
    Lean 29 Apache-2.0 5 1 0 Updated Sep 21, 2025
  • progent Public
    sunblaze-ucb/progent’s past year of commit activity
    Python 19 8 1 1 Updated Sep 11, 2025