Welcome! This guide will help you contribute to our datasets, even if you've never used GitHub before.
If you don't have one, sign up at github.com - it's free!
- Go to the
datasetsfolder - Click on the dataset folder you want to contribute to (e.g.,
ai-rights-opposition) - Click on
data.jsonl
- Click the pencil icon (✏️) at the top right
- GitHub will automatically create a "fork" (your own copy) if needed
Add a new line at the very end with your data. Copy this template and fill it in:
{"name": "Person or Organization Name", "social_links": [{"platform": "Twitter", "url": "https://twitter.com/their_username"}], "sources": [{"title": "Article or Source Title", "url": "https://example.com/full-url-to-source"}]}Before saving, make sure:
- ✅ Everything is on one line
- ✅ All quotes are double
"not single' - ✅ All brackets match:
{with}and[with] - ✅ URLs start with
https://orhttp://
- Scroll down to "Commit changes"
- Write a message like:
Add [Name] to dataset - Select "Create a new branch"
- Click "Propose changes"
- Click "Create pull request"
- Copy your entire line of JSON
- Go to jsonlines.org/validator
- Paste and click "Validate"
- Fix any errors shown in red
After you submit, GitHub will automatically check your entry:
- ✅ Green check = Valid! Ready for review
- ❌ Red X = Has errors - click "Details" to see what's wrong
{"name": "John Smith", "social_links": [], "sources": [{"title": "Interview on AI", "url": "https://news.com/interview"}]}{"name": "Jane Doe", "social_links": [{"platform": "Twitter", "url": "https://twitter.com/janedoe"}, {"platform": "LinkedIn", "url": "https://linkedin.com/in/janedoe"}], "sources": [{"title": "Blog Post", "url": "https://blog.com/post"}]}{"name": "Organization Name", "social_links": [{"platform": "Website", "url": "https://org.com"}], "sources": [{"title": "Press Release", "url": "https://org.com/press"}], "quote": "AI should remain under human control."}Problem: Syntax error in your JSON Fix: Check for:
- Missing quotes around text
- Single quotes instead of double quotes
- Missing commas between items
- Extra commas at the end
Problem: You forgot name, social_links, or sources
Fix: Make sure you have all three, even if social_links is empty: "social_links": []
Problem: URL doesn't start with http:// or https:// Fix: Add https:// to the beginning of the URL
Problem: Platform name not in allowed list Fix: Use one of: Twitter, LinkedIn, GitHub, Facebook, Instagram, YouTube, Website, Other
✅ Good Entry:
- Clear source showing the person's position on AI rights
- Direct quote about AI personhood/rights (if available)
- Link to primary source (their own writing/speech)
❌ Not Suitable:
- General AI safety concerns without mentioning rights/personhood
- Second-hand reports without primary sources
- Private communications or unverified claims
- Check existing entries in the file for examples
- Open an issue with your question
- Include your JSON in the issue so others can help
- Be specific about what error you're seeing
I'm trying to add [Name] to the [dataset name] dataset.
Here's my JSON:
[paste your JSON line here]
I'm getting this error:
[paste error message or describe the problem]
Can someone help me fix this?
- Watch for comments on your pull request
- Respond to feedback from reviewers
- Make requested changes by editing your file again
- Celebrate when it's merged! 🎉
If you want to test locally before submitting:
python3 -m pip install -r requirements.txtpython scripts/validate.py datasets/ai-rights-opposition# Check all datasets
for dir in datasets/*/; do
python scripts/validate.py "$dir"
doneEvery contribution helps make these datasets more complete and valuable for research. Don't be discouraged if your first attempt has issues.