Popular repositories Loading
-
bloom
bloom Public🌸 Generate automated evaluations for LLMs to assess behaviors like bias and sycophancy, ensuring reproducibility with customizable test scenarios.
-
ashwin-deshmukh121.github.io
ashwin-deshmukh121.github.io Public🌸 Generate tailored evaluation suites for LLMs to assess behaviors like bias and self-preservation, ensuring reproducibility with diverse test scenarios.
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.