Edge Proposal Sets for Link Prediction

This directory contains code for the paper. Edge Proposal Sets for Link Prediction. https://arxiv.org/abs/2106.15810

Software Requirements

This codebase requires Python 3, PyTorch 1.6+, OGB 1.3.1, torch_geometric 1.7.0. In principle, this code can be run on CPU but we assume GPU utilization throughout the codebase.

Reproduce the Leaderboad Results

ogbl-ppa (Hits @ 100)

python submit_job.py --reproduce ppa   
python submit_job.py --show ppa

Final Test: 53.24 ± 0.00

ogbl-collab (Hits @ 50)

python submit_job.py --reproduce collab   
python submit_job.py --show collab

Final Test: 65.48 ± 0.00

ogbl-ddi (Hits @ 20)

python submit_job.py --reproduce ddi   
python submit_job.py --show ddi

Final Test: 76.32 ± 2.54

General Usage

Experiments are run in the following way: Edit the submit_job.py file to contain the desired experiments. Currently, it reproduces our best results on ppa, collab, and ddi. At the end of the file, there are a few commented samples provided for sweeping across different sizes of proposal sets. The keys of the dictionary are the datasets, and the list of tuples are the experiments. For example, ("gcn", "simple", 150000, 6, 1, 210000) represents a filtering model of GCN, a ranking model of Common Neighbors, and a sweep from 150,000 to 210,000 edges added at intervals of 10,000, and running this only 1 trial (since Common Neighbors is deterministic).

You can then run python submit_job.py. The results will be saved in logs/. Base model training results are logged in <dataset>_<model>_generation.o%local<time>. Final results after applying Filter & Rank is in <dataset>_<filter_model>_<rank_model>_<other_parameters>.o%local<time>.

Other files: models.py contains model implementations, visualize.ipynb generates figures and the LaTeX tables (and only looks at the sweep range as specified in the paper, which can be set by the radius= argument), filter.py generates scores for the broad starting set using a given filtering model, train_and_eval.py contains all training and evaluation functions, rank.py is a standard link prediction setup inspired by OGB, and submit_job.py runs the entire pipeline together.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
email_data		email_data
fb		fb
models		models
reddit		reddit
twitch		twitch
README.md		README.md
adamic_utils.py		adamic_utils.py
filter.py		filter.py
logger.py		logger.py
models.py		models.py
rank.py		rank.py
submit_job.py		submit_job.py
train_and_eval.py		train_and_eval.py
visualize.ipynb		visualize.ipynb
visualize.py		visualize.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Edge Proposal Sets for Link Prediction

Software Requirements

Reproduce the Leaderboad Results

General Usage

About

Releases

Packages

Contributors 3

Languages

CUAI/Edge-Proposal-Sets

Folders and files

Latest commit

History

Repository files navigation

Edge Proposal Sets for Link Prediction

Software Requirements

Reproduce the Leaderboad Results

General Usage

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages