hannahxchen

Follow

Hannah Cyberey hannahxchen

Follow

Computer science PhD candidate at University of Virginia.

7 followers · 4 following

University of Virginia
Charlottesville, VA
hannahxchen.github.io
@hannah_roami

Achievements

Achievements

Highlights

Pro

Organizations

Pinned Loading

llm-censorship-steering llm-censorship-steering Public

Code for 'Steering the CensorShip: Uncovering Representation Vectors for LLM "Thought" Control'

Python 12 1
gender-bias-steering gender-bias-steering Public

Code for the paper "Unsupervised Concept Vector Extraction for Bias Control in LLMs"

Jupyter Notebook 1 1
automatic-paraphrase-dataset-augmentation automatic-paraphrase-dataset-augmentation Public

Code and data for automatic paraphrase dataset augmentation.

Jupyter Notebook 11
balanced-adversarial-training balanced-adversarial-training Public

Python 2
composed-debiasing composed-debiasing Public

Jupyter Notebook 1
allocational-harm-eval allocational-harm-eval Public

Code for evaluating allocational harms in machine learning and large language models.

Python