Skip to content

PL94 GLS initial #2647

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Draft
wants to merge 3 commits into
base: main
Choose a base branch
from
Draft
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
38 changes: 38 additions & 0 deletions datasets/census-2020-pl94-gls.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,38 @@
Name: 2020 Redistricting Data File Least Squares Estimates
Description: |
The 2020 Redistricting Data File Least Squares Estimates data product provides count estimates, and their standard deviations, for each tabulation that was published as part of the persons universe of the 2020 Redistricting Data File for the US, state, county, and tract geographic levels. These estimates are computed using the generalized least squares (GLS) estimator using as input the publicly available 2020 Census persons universe noisy measurement files for both [the Redistricting Data File](https://registry.opendata.aws/census-2020-pl94-nmf/) and [the Demographic and Housing Characteristics File](https://registry.opendata.aws/census-2020-dhc-nmf/). The algorithm used to compute this estimate is described in more detail in [Least Squares Estimation For Hierarchical Data](https://arxiv.org/abs/2404.13164). As described in more detail in this paper and the README below, the primary goal of this data product is to provide the data required to compute confidence intervals for the 2020 Census Redistricting Data File published counts that account for the uncertainty in these published counts due to the disclosure avoidance methods applied to 2020 Census tabulations. In other words, these confidence intervals do not account for other sources of error, such as those due to enumeration errors.
<br/>
<br/>
Documentation: "[2020 Redistricting Data File Least Squares Estimates README File](https://uscb-2020-product-releases.s3.amazonaws.com/decennial/gls/2020/redistricting-gls-estimates/README.html)"
Contact: [email protected]
ManagedBy: "[United States Census Bureau](http://www.census.gov/)"
UpdateFrequency: Not Updated
Tags:
- aws-pds #TO DO: did we add this tag? tags cannot contain dashes
- census
- differential privacy
- disclosure avoidance
- ethnicity
- group quarters
- housing
- housing units
- noisy measurements
- population
- race
- redistricting
- voting age
- confidence intervals
- least squares
License: CC0 1.0 Universal
Citation:
U.S. Census Bureau. “2020 Redistricting Data File Least Squares Estimates” [Experimental], 2025, 2020 Redistricting Data File Least Squares Estimates [https://registry.opendata.aws/census-2020-pl94-gls/], accessed on DATE. #TOdo: update URL
Resources:
- Description: 2020 Redistricting Data File Least Squares Estimates File
ARN: arn:aws:s3:::uscb-2020-product-releases/decennial/gls/2020/redistricting-gls-estimates
Region: us-west-2
Type: S3 Bucket
- Description: "Census Open Data S3 Inventory"
ARN: arn:aws:s3:::uscb-opendata-inventory/
Region: us-west-2
Type: S3 Bucket
DataAtWork:
2 changes: 2 additions & 0 deletions tags.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -78,6 +78,7 @@
- computer forensics
- computer security
- computer vision
- confidence intervals
- conservation
- contamination
- Continuously Operating Reference Station (CORS)
Expand Down Expand Up @@ -227,6 +228,7 @@
- land use
- last mile
- latino
- least squares
- lidar
- life sciences
- light-sheet microscopy
Expand Down