AUC Score: 0.92258
From Debian 9 scratch
System dependencies
glibc
≥ 2.27 (adddeb http://ftp.debian.org/debian sid main
to /etc/apt/sources.list)CUDA
≥ 11.0Python
≥ 3.6
Python dependencies (recommend to manage the packages by Anaconda)
jupyterlab
latestTensorFlow
2.3.0Pytorch
1.3.1GraphVite
0.2.2 Package Link
Run command:
graphvite run my_config.yml
Here's the config file that we used for the final submission where you can tune the hyper-parameters:
The graph embedding file (in pickle format) will be generated into the directory specified in the config file.
Then you can use the code in eval/evaluate.ipynb
to generate the formatted probability CSV file for Kaggle submission.
my_config.yml
- Config settings for running GraphVite.adj2edge.ipynb
- Transforming adjacent list form to edge list form.evaluate.ipynb
- Calculating AUC scores and Kaggle competition results.split_dataset.ipynb
- Randomly splitting the raw data into 8:2 dataset, while generating fake edges in the 20% dataset.eval.ipynb
- Making evaluation on the embeddings.logistic_reg.ipynb
- Trying Logitic Regression based on the embeddings.