AUC Score: 0.92258
From Debian 9 scratch
System dependencies
glibc≥ 2.27 (adddeb http://ftp.debian.org/debian sid mainto /etc/apt/sources.list)CUDA≥ 11.0Python≥ 3.6
Python dependencies (recommend to manage the packages by Anaconda)
jupyterlablatestTensorFlow2.3.0Pytorch1.3.1GraphVite0.2.2 Package Link
Run command:
graphvite run my_config.yml
Here's the config file that we used for the final submission where you can tune the hyper-parameters:
The graph embedding file (in pickle format) will be generated into the directory specified in the config file.
Then you can use the code in eval/evaluate.ipynb to generate the formatted probability CSV file for Kaggle submission.
my_config.yml- Config settings for running GraphVite.adj2edge.ipynb- Transforming adjacent list form to edge list form.evaluate.ipynb- Calculating AUC scores and Kaggle competition results.split_dataset.ipynb- Randomly splitting the raw data into 8:2 dataset, while generating fake edges in the 20% dataset.eval.ipynb- Making evaluation on the embeddings.logistic_reg.ipynb- Trying Logitic Regression based on the embeddings.