Skip to content
@redwoodresearch

Redwood Research

Popular repositories Loading

  1. Easy-Transformer Easy-Transformer Public

    Forked from TransformerLensOrg/TransformerLens

    Python 128 18

  2. mlab mlab Public

    Machine Learning for Alignment Bootcamp

    Jupyter Notebook 79 42

  3. alignment_faking_public alignment_faking_public Public

    Forked from rgreenblatt/model_organism_public

    Python 79 16

  4. rust_circuit_public rust_circuit_public Public

    Rust 65 2

  5. Text-Steganography-Benchmark Text-Steganography-Benchmark Public

    Code for Preventing Language Models From Hiding Their Reasoning, which evaluates defenses against LLM steganography.

    Python 24 4

  6. remix_public remix_public Public

    Python 19 3

Repositories

Showing 10 of 20 repositories

Top languages

Loading…

Most used topics

Loading…