§ 230 Citation Network

A computational analysis of the federal appellate citation network for 47 U.S.C. § 230 of the Communications Decency Act, 1997-2025.

Overview

This repository contains the data, code, and documentation for a citation network analysis of 70 validated federal circuit court and Supreme Court opinions applying § 230(c). The analysis applies PageRank centrality, Leiden community detection, Jensen-Shannon divergence, and temporal network analysis to characterize the structure and evolution of § 230 doctrine.

Key findings

Zeran v. AOL (4th Cir. 1997) holds the highest PageRank in every annual snapshot from 1997 through 2025 (PR = 0.202; α = 0.85)
Five structural clusters identified via Leiden community detection on the undirected citation projection (modularity Q = 0.238; mean NMI = 0.686 across 100 runs), corresponding to recognizable doctrinal traditions confirmed by legal expert review
Platform win rate under § 230 declined from 83% (2000-2004, n=6) to 41% (2020-2024, n=17); temporal trend is consistent with increasing judicial skepticism toward broad immunity claims
Mean pairwise Jensen-Shannon divergence between circuits: 0.65 bits (base 2; range 0.43-0.86)
DAG hierarchical level correlates with PageRank at Spearman ρ = -0.840 among cases with at least one inbound citation (n=41, p < 0.0001); full corpus ρ = -0.965 is inflated by 29 dangling nodes sharing identical PageRank floor values
False positive rate in automated CourtListener retrieval: 66.8% (141/211 candidates excluded after review)

Repository structure

. ├── README.md ├── requirements.txt ├── research_design.md # Inclusion criterion, exclusion patterns, CBL ├── codebook.md # Field-level documentation for all data files ├── replication_guide.md # Step-by-step reproduction instructions ├── raw_data/ │ └── s230_validated_20260411_030728.json # Validated corpus (70 cases) ├── data/ │ ├── s230_graph.gexf # Citation graph (Gephi/NetworkX) │ ├── s230_graph.graphml # Citation graph (GraphML) │ ├── s230_metrics.csv # Per-node network metrics │ ├── case_outcomes_raw.json # Outcome codings │ └── [supplementary analysis files] └── [01-23]_*.py # Analysis scripts in execution order

Reproducing the analysis

See replication_guide.md for complete instructions. Quick start:

python3 -m venv venv && source venv/bin/activate
pip install -r requirements.txt
python3 02_build_graph.py && python3 10_merge_edges.py && python3 03_compute_metrics.py

Data

The validated corpus and graph files are deposited at Zenodo: [DOI to be added]

Citation

[Citation to be added upon publication]

License

Code: MIT License Data: Creative Commons Attribution 4.0 (CC BY 4.0)

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
data		data
raw_data		raw_data
.gitignore		.gitignore
01_collect_data.py		01_collect_data.py
02_build_graph.py		02_build_graph.py
03_compute_metrics.py		03_compute_metrics.py
04_generate_legend.py		04_generate_legend.py
05_deduplicate.py		05_deduplicate.py
06_validate_corpus.py		06_validate_corpus.py
06b_check_flagged.py		06b_check_flagged.py
07_add_metadata.py		07_add_metadata.py
08_eyecite_extraction.py		08_eyecite_extraction.py
09_build_edges_from_eyecite.py		09_build_edges_from_eyecite.py
10_merge_edges.py		10_merge_edges.py
11_community_stability.py		11_community_stability.py
12_consensus_partition.py		12_consensus_partition.py
13_pagerank_robustness.py		13_pagerank_robustness.py
14_mutual_information.py		14_mutual_information.py
15_jensen_shannon.py		15_jensen_shannon.py
16_temporal_jsd.py		16_temporal_jsd.py
17_temporal_snapshots.py		17_temporal_snapshots.py
18_corpus_recall.py		18_corpus_recall.py
19_recall_estimation.py		19_recall_estimation.py
20_dual_corpus_analysis.py		20_dual_corpus_analysis.py
21_resilience_simulation.py		21_resilience_simulation.py
22_hyperbolic_embedding.py		22_hyperbolic_embedding.py
23_edge_sensitivity.py		23_edge_sensitivity.py
README.md		README.md
ZENODO_README.md		ZENODO_README.md
codebook.md		codebook.md
corpus_metadata.json		corpus_metadata.json
replication_guide.md		replication_guide.md
requirements.txt		requirements.txt
research_design.md		research_design.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

§ 230 Citation Network

Overview

Key findings

Repository structure

Reproducing the analysis

Data

Citation

License

About

Uh oh!

Releases 3

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

§ 230 Citation Network

Overview

Key findings

Repository structure

Reproducing the analysis

Data

Citation

License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages