Description
The time has come to build the first draft of the final data.frames and dictionary that we will include in the R data package. It makes sense to pick the low-hanging fruit first, so let's start with California. It has the twin virtues of being relatively clean and big.
For CA, write a script that:
a. Replaces each name with a random 10-character string
b. Does data integrity checks and flags or fixes issues as needed
c. Unzips and rbinds the years and tiers of government, and adds useful information, such as the level of government or the year the data are from, when that information is missing
d. Produces tidy data as the final outcome
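
The steps above could be sketched in base R roughly as follows. This is a minimal sketch under stated assumptions, not the final script: the column names (`name`, `total_pay`, `year`, `gov_level`), the helper names, and the idea that each archive holds CSV files with identical columns are all hypothetical. It also assumes the same name should map to the same random string everywhere, which the issue does not specify. The demo stacks first, then flags, then anonymizes, so that name-based checks run before names are dropped.

```r
# Step a (sketch): one random 10-character ID per distinct name.
# Assumption: identical names should share an ID across files.
anonymize_names <- function(names, n_chars = 10) {
  uniq <- unique(names[!is.na(names)])
  chars <- c(letters, LETTERS, 0:9)
  ids <- character(0)
  # Keep drawing until there is one distinct ID per distinct name.
  while (length(ids) < length(uniq)) {
    need <- length(uniq) - length(ids)
    new_ids <- vapply(
      seq_len(need),
      function(i) paste(sample(chars, n_chars, replace = TRUE), collapse = ""),
      character(1)
    )
    ids <- unique(c(ids, new_ids))
  }
  ids[match(names, uniq)]  # NA names stay NA
}

# Step b (sketch): flag rather than fix; the specific checks are hypothetical.
flag_issues <- function(df) {
  dup <- duplicated(df)
  df$flag_missing_name <- is.na(df$name) | trimws(df$name) == ""
  df$flag_bad_pay <- is.na(df$total_pay) | df$total_pay < 0
  df$flag_duplicate <- dup
  df
}

# Step c (sketch): add year and level of government only when absent.
add_missing_info <- function(df, year, level) {
  if (!"year" %in% names(df)) df$year <- rep(year, nrow(df))
  if (!"gov_level" %in% names(df)) df$gov_level <- rep(level, nrow(df))
  df
}

# Step c (sketch): unzip one archive and stack its CSVs.
# Assumption: the archive contains CSV files with identical columns.
read_archive <- function(zip_path, year, level) {
  files <- utils::unzip(zip_path, exdir = tempfile("ca_"))
  csvs <- files[grepl("\\.csv$", files, ignore.case = TRUE)]
  out <- do.call(rbind, lapply(csvs, read.csv, stringsAsFactors = FALSE))
  add_missing_info(out, year, level)
}

# Demo on made-up data frames standing in for two unzipped files.
set.seed(1)
state_2015 <- data.frame(name = c("Ann Lee", "Bo Cruz"),
                         total_pay = c(52000, -10),
                         stringsAsFactors = FALSE)
city_2016 <- data.frame(name = c("Ann Lee", ""),
                        total_pay = c(61000, 48000),
                        stringsAsFactors = FALSE)

ca <- rbind(
  add_missing_info(state_2015, 2015, "state"),
  add_missing_info(city_2016, 2016, "city")
)
ca <- flag_issues(ca)
ca$id <- anonymize_names(ca$name)
ca$name <- NULL  # Step d: one row per record, no raw names
```

Flagging instead of silently fixing keeps the decision about each problem row visible; whether a given issue should be fixed in the script is left open here, as in the issue text.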
After that, write an Rmd that presents some basic summaries of the data and a dictionary.
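
One possible starting point for the dictionary is a small data frame with one row per variable, which an Rmd chunk can then print. The sketch below is illustrative only: the example data, the column names, the `build_dictionary` helper, and the description text are all assumptions.

```r
# Made-up stand-in for the tidy California data; columns are hypothetical.
ca_tidy <- data.frame(
  id = c("a1B2c3D4e5", "Zz9Yy8Xx7W"),
  year = c(2015, 2016),
  total_pay = c(52000, NA),
  stringsAsFactors = FALSE
)

# Hypothetical helper: one dictionary row per column of `df`.
# `descriptions` is a named character vector keyed by column name;
# columns without an entry get an NA description.
build_dictionary <- function(df, descriptions) {
  data.frame(
    variable = names(df),
    type = vapply(df, function(col) class(col)[1], character(1),
                  USE.NAMES = FALSE),
    n_missing = vapply(df, function(col) sum(is.na(col)), integer(1),
                       USE.NAMES = FALSE),
    description = unname(descriptions[names(df)]),
    row.names = NULL,
    stringsAsFactors = FALSE
  )
}

dict <- build_dictionary(
  ca_tidy,
  c(id = "Random 10-character string replacing the name",
    year = "Year the data are from",
    total_pay = "Total pay (placeholder description)")
)

# Basic summaries for the Rmd could start from base R:
pay_summary <- summary(ca_tidy$total_pay)
```

Inside the Rmd, `knitr::kable(dict)` is one way to render the dictionary as a table.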
Note: if you think you can improve the description of the issue, please do. And don't let the description keep you from doing sensible things.