anti-trans project

This project uses AI models to study and detect bias and discrimination in anti-trans discourse. It curates datasets from examples of transphobia, which currently include anti-trans legislation in the US, at the federal and state level, that limits transgender rights relating to healthcare, bathroom access, sports, and more. The goal is to use language about sex, gender, sexuality, and related terms from the legislation to train Large Language Models (LLMs) for text generation and text classification.

legislation datasets

The federal legislation dataset originates from the www.congress.gov website. It includes bills, amendments, and resolutions from the House of Representatives and the Senate that contain the keyword "transgender", drawn from the 117th and 118th congressional sessions (2021-2024). Another dataset is being developed from federal bills that specifically focus on the current anti-trans movement, containing targeted [anti-trans federal bills from 2023-2024](https://github.com/gofilipa/anti-trans-legislation/blob/main/processing/bill_data/transtracker_federal_bills.csv).
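The keyword criterion can be illustrated with a short sketch. The selection may well have been done through the search on congress.gov itself, so this is only an illustration of the criterion, not the project's code. The column names (`bill_id`, `congress`, `text`) and the sample rows are hypothetical.

```python
import csv
import io

KEYWORD = "transgender"

def filter_bills(rows, keyword=KEYWORD):
    """Keep only rows whose text mentions the keyword (case-insensitive)."""
    needle = keyword.lower()
    return [row for row in rows if needle in row["text"].lower()]

# Hypothetical sample data; the real dataset's columns and contents may differ.
sample_csv = """bill_id,congress,text
H.R.0001,117,A bill concerning transgender athletes in school sports.
S.0002,118,A bill to amend the tax code.
H.Res.0003,118,Recognizing Transgender Day of Visibility.
"""

rows = list(csv.DictReader(io.StringIO(sample_csv)))
matches = filter_bills(rows)
print([row["bill_id"] for row in matches])
```

A plain substring match like this also catches bills that mention the keyword in a supportive context, which is one reason a separate, targeted dataset of anti-trans bills is useful.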

The state bills dataset originates from Erin Reed's "LGBTQ+ Legislative Tracking 2023" document, which gathers legislation that is explicitly anti-trans.

data gathering and processing

All the code for data gathering and processing is available in this repository.

To gather the federal bill data, I scraped bill text from congress.gov and from the trans legislation tracker list (see the notebooks).
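As a rough illustration of one scraping step, the sketch below turns a downloaded bill page into plain text using only the Python standard library. It assumes the HTML has already been fetched and saved; the download step, the class and function names, and the sample HTML are all hypothetical and are not taken from the notebooks.

```python
from html.parser import HTMLParser

class TextExtractor(HTMLParser):
    """Collect visible text, skipping the contents of <script> and <style>."""

    SKIP = {"script", "style"}

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep non-empty text that is not inside a skipped element.
        if not self._skip_depth and data.strip():
            self.chunks.append(data.strip())

def html_to_text(html):
    """Return the visible text of an HTML document as a single string."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)

# Made-up page standing in for a saved bill page.
sample_html = (
    "<html><head><style>p {color: red}</style></head>"
    "<body><h1>H.R. 0000</h1><p>A BILL</p><p>To do something.</p></body></html>"
)
print(html_to_text(sample_html))
```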

The processing notebook contains a matcher that extracts definitions of gender and related terms (like "sexuality", "biological sex", etc.). You can see the final dataset on my Hugging Face datasets page.
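To give a sense of what such a matcher does, here is a minimal sketch using Python's `re` module. It is not the notebook's implementation, which may use a different library or different rules. The term list, the `The term "X" means ...` pattern, and the sample sentences are assumptions made for illustration.

```python
import re

# Illustrative term list; the notebook's actual list may differ.
TERMS = ["sex", "gender", "gender identity", "biological sex", "sexuality"]

term_group = "|".join(re.escape(t) for t in TERMS)

# Matches definitions of the form: The term "X" means <definition>.
# The definition runs up to the next period or semicolon.
DEFINITION = re.compile(
    rf'the term ["“](?P<term>{term_group})["”] means (?P<definition>[^.;]+)',
    flags=re.IGNORECASE,
)

def extract_definitions(text):
    """Return (term, definition) pairs found in a bill's text."""
    return [
        (m.group("term").lower(), m.group("definition").strip())
        for m in DEFINITION.finditer(text)
    ]

# Invented sample text, not quoted from any real bill.
sample = (
    'The term "sex" means a person\'s biological sex at birth. '
    'The term "agency" means an Executive agency. '
    'The term "gender identity" means a person\'s self-identified gender.'
)

for term, definition in extract_definitions(sample):
    print(f"{term}: {definition}")
```

Because the pattern requires the closing quotation mark right after the term, `"gender identity"` is matched as a whole rather than as `"gender"`, and terms outside the list (such as "agency") are ignored.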

training

I am currently using the data to train models, which you can see on my Hugging Face profile page, gofilipa.
