Description
Issue Description
This task involves writing a blog post that explains how to read an IPUMS extract into Julia. The post assumes that the user has already downloaded the extract--whether through the download API of IPUMS.jl
or through a download from the website. The user then extracts that data to a directory, then reads the DDI file using the read_ddi
function. The post should explain how the user can look at a summary of the metadata using the appropriate IPUMS.jl
function. Next, the post explains how to use the load_extract
function to load the IPUMs data into a Julia DataFrame. Once the data is in a DataFrame, the post should show some simple manipulations of the DataFrame, such as the describe
function which will show some metadata--such as the mean, minimum, maximum--for each column in the DataFrame.
Difficulty: Beginner
Time: 6 - 8 hours
Requirements
- Explain how to download the extract -- but just refer to previous blog post.
- Demonstrate how to use the
read_ddi
function to load the DDI file. - Show how to get a summary of the extract level and variable level metadata using the function in IPUMS.jl
- Show how to load the DAT file using the
load_extract
function. - Once the data is loaded, show how to see the metadata for the DataFrame and them columns of the DataFrame.
- Show how to create a simple plot of a variable from the dataset using the
Makie.jl
package. - Open and submit a PR for the post.
Expected Outcomes
The expected outcome is a markdown blog post with the appropriate code and explanation, as per the list above.
Additional Notes
Additional information and code examples are available in the docstrings/documentation for the IPUMS.jl
package. That is the best source for the Julia code.
Other Resources
documentation
channel - you should post here firsthelpdesk
channel - this would be to get more attention to your issue but maybe not as precise as you need.health-and-medicine
channel - this is where most of JuliaHealth is located these days.
Julia Discourse - I would advise posting here if you have an issue that you feel is long or requires a lot of time to explain as you might lose it within Julia Slack. Consider cross-posting your forum post to the Julia Slack in helpdesk
and/or documentation
.