Commit 4f44916

Merge pull request #157 from best-practice-and-impact/synthetic-data-branch
Synthetic data branch
2 parents 4229f1c + 2a9b562 commit 4f44916

26 files changed

Lines changed: 580300 additions & 10 deletions

.gitignore

Lines changed: 2 additions & 1 deletion
@@ -6,4 +6,5 @@ __pycache__
 !.gitignore
 .ipynb_checkpoints*
 _build/
-test_update_links/*
+test_update_links/*
+.venv/*

Rplots.pdf

495 KB
Binary file not shown.

config.yaml

Lines changed: 2 additions & 0 deletions
@@ -10,3 +10,5 @@ rescue_with_pop_path_csv: "file:////home/cdsw/ons-spark/ons-spark/data/rescue_wi
 rescue_with_pop_path_parquet: "file:////home/cdsw/ons-spark/ons-spark/data/rescue_with_pop.parquet"
 staging_table_example: "staging_example_"
 database: "train_tmp"
+census_2011_teaching_data_path_csv: "file:////home/cdsw/ons-spark/ons-spark/data/census_2011_microdata_teaching_file.csv"
+census_relationship_file_path_csv: "file:////home/cdsw/ons-spark/ons-spark/data/census_relationship_file.csv"
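
The two keys added above are flat `key: "value"` entries, so a lookup by key name is enough to retrieve the new census paths. The following is a minimal sketch only, not the repository's actual loading code: the helper `parse_flat_config` and the inline `CONFIG_TEXT` are hypothetical, and a real project would more likely read `config.yaml` with a YAML library.

```python
# Hypothetical sketch: parse flat `key: "value"` lines like those in config.yaml.
# Standard library only; nesting and lists are deliberately not supported.

# Inline copy of three entries from the hunk above (an assumption for illustration;
# the real file lives on disk and contains more keys).
CONFIG_TEXT = '''
database: "train_tmp"
census_2011_teaching_data_path_csv: "file:////home/cdsw/ons-spark/ons-spark/data/census_2011_microdata_teaching_file.csv"
census_relationship_file_path_csv: "file:////home/cdsw/ons-spark/ons-spark/data/census_relationship_file.csv"
'''


def parse_flat_config(text):
    """Return a dict built from flat `key: "value"` lines."""
    config = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        # Split on the first colon only, so the colons inside "file:////..." survive.
        key, _, value = line.partition(":")
        config[key.strip()] = value.strip().strip('"')
    return config


config = parse_flat_config(CONFIG_TEXT)
census_path = config["census_2011_teaching_data_path_csv"]
relationship_path = config["census_relationship_file_path_csv"]
```

Splitting on the first colon matters here because the values themselves contain colons in the `file:` scheme prefix.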

mysyn_SD2001.csv

Lines changed: 5001 additions & 0 deletions
Large diffs are not rendered by default.

ons-spark/_toc.yml

Lines changed: 2 additions & 2 deletions
@@ -71,5 +71,5 @@ parts:
 - file: ancillary-topics/pandas-udfs
 - file: ancillary-topics/r-udfs
 - file: ancillary-topics/hive-partitions
-
-
+- file: ancillary-topics/synthpop_with_r
+- file: ancillary-topics/synthetic_data_python
