👋 Hi, I’m @HankElmhurst
I am a data professional and a current M.S. in Data Science student at the University of Virginia (expected Aug 2026).
I previously spent more than a decade at the U.S. Census Bureau working on large-scale federal economic data systems.
- Applied Data Science & Analytics
  - Cleaning and structuring messy real-world data
  - Building interpretable models for policy, compliance, and business decisions
- Decision / Risk Systems
  - Public-sector and civic analytics (economic statistics, public health, investigations)
  - Operations, forecasting, and resource-allocation problems
- Current Graduate Work
  - Bayesian inference and decision theory
  - Statistical inference / Machine Learning
  - Data structures, relational databases, and ETL
  - Text / Documentation Analysis (NLP foundations)
- Programming: Python, R, Scripting, Java
- Data & modeling: pandas, NumPy, scikit-learn, tidyverse, tidymodels, ggplot2
- Databases: SQL (PL/SQL, PostgreSQL), relational schema design
- Workflow: Git/GitHub, Linux command line, Jupyter, VS Code, RStudio
- King_County_WA_Housing_Analysis – Private repository (available on request): end-to-end housing price analysis using public King County data, including ETL, feature engineering, regression models, diagnostics, and a fully documented report.
- DS5100_Final_Project – Monte Carlo simulation of simple games in Python (UVA course project).
- HW09-DSS100 – Early Python exercises from introductory programming coursework.
- Additional projects will be made available.
- GitHub: @HankElmhurst
- LinkedIn: https://www.linkedin.com/in/hangyu001/
- Email: available upon request