Comparison of different models

TFLearn based wide and deep model (code copied from https://www.tensorflow.org/tutorials/wide_and_deep and adapted to jupyter notebook format)
TFLearn wide and deep re-implemented in Keras
XGBoost based implementation

Results

TFLearn wide and deep model - similarly to results from TF tutorial has accuracy is 84.5%
XGBoost - best accuracy is 86.1%
TFLearn wide and deep re-implemented in Keras - best accuracy is 85.1%

If you want to test it by yourself, download the data using TFLearn notebook
Keras version skipped tf.contrib.layers.crossed_column features. Implementing them could further improve accuracy

This type of "tabular" based dataset is still easiest to implement using XGBoost
Keras version was implemented using one-hot encoddings and separately embeddings. Surprisingly the one-hot encoding version achieved better accuracy
Probably when given more data, with more options in categorical columns ("workclass", "education", "marital_status" etc.) both TFLearn wide and deep and Keras embedding versions would perform better than XGBoost version.

Check notebooks for details

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.ipynb_checkpoints		.ipynb_checkpoints
.gitignore		.gitignore
Keras version.ipynb		Keras version.ipynb
README.md		README.md
Tensorflow - wide and deep.ipynb		Tensorflow - wide and deep.ipynb
XGBoost version.ipynb		XGBoost version.ipynb
utils.py		utils.py