ML-OPT Equal Opportunities

Column mappings

If you want to see a comprehensive list of what each occupation code represents in the visualizations (e.g. MGR, BUS), take a look at the column SOCP_desc of the file data/metadata/mapping.csv

Creating data

Download zip from ACS 2019 per-person data
Extract the zip to the data/dataset folder
Run create_data.py and find the output csv in data/output

Preprocessing data

Run preprocess_data.py and find the preprocessed csv in data/output

Creating test data for debugging

Enter data/dataset folder
Run bash script ./create_test_data.sh 100 , where 100 the number of samples
Alternatively, run ./create_test_data_random.sh to get a random 0.1% subset of the rows

Running Sparse and Holistic Regression

The sparse and holistic regression frameworks are written in Sparsity.jl, but for demonstration purposes you can use Sparsity.ipynb. It was tested on Julia 1.6.3. Create and preprocess the data first.

Open Sparsity.ipynb notebook.
Run all (Will take up to 15-20 minutes)

Running the Prescriptive part

In order to run the prescriptive part of the project, first run the create_data.py and the preprocess_data.py scripts. Then, you can run the prescriptive.py file to create and visualize the prescriptions. Note that the file has a lot of python requirements, such as numpy, seaborn, XGBoost, plotly, pandas.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ML-OPT Equal Opportunities

Column mappings

Creating data

Preprocessing data

Creating test data for debugging

Running Sparse and Holistic Regression

Running the Prescriptive part

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

ML-OPT Equal Opportunities

Column mappings

Creating data

Preprocessing data

Creating test data for debugging

Running Sparse and Holistic Regression

Running the Prescriptive part