Reshaping Graph Recommendation with Edge Graph Collaborative Filtering and Customer Reviews

This is the official implementation of the paper Reshaping Graph Recommendation with Edge Graph Collaborative Filtering and Customer Reviews, accepted as a full paper at DL4SR@CIKM 2022.

This repository depends heavily on the Elliot framework, so we suggest referring to its official GitHub page and documentation.

We provide the code for the baselines and for the proposed model, EGCF. Below, we indicate the specific path for each model and its reference backend (i.e., NumPy, TensorFlow, or PyTorch):

MostPop (path, backend: NumPy)
BPRMF (path, backend: NumPy)
MultiVAE (path, backend: TensorFlow)
ConvMF (path, backend: TensorFlow)
RMG (path, backend: TensorFlow)
NGCF (path, backend: PyTorch)
LightGCN (path, backend: PyTorch)
GAT (path, backend: PyTorch)
DGCF (path, backend: PyTorch)
EGCF (path, backend: PyTorch)

As for TensorFlow, we tested our models using the GPU version 2.3.2, with CUDA 10.1 and cuDNN 7.6.

As for PyTorch, we tested our models using version 1.10.2, with CUDA 10.2 and cuDNN 8.0. Additionally, graph-based models require PyTorch Geometric, which is compatible with the versions of CUDA and PyTorch indicated above.

Installation guidelines: scenario #1

If you can install two different versions of CUDA on your workstation (i.e., 10.1 and 10.2), you may create two different virtual environments with the requirements files included in the repository, as follows:

# TENSORFLOW ENVIRONMENT (CUDA 10.1, cuDNN 7.6)

$ python3 -m venv venv_tf
$ source venv_tf/bin/activate
$ pip install --upgrade pip
$ pip install -r requirements_tf.txt

# PYTORCH ENVIRONMENT (CUDA 10.2, cuDNN 8.0)

$ python3 -m venv venv_pt
$ source venv_pt/bin/activate
$ pip install --upgrade pip
$ pip install -r requirements_pt.txt
$ pip install torch-scatter torch-sparse torch-cluster torch-spline-conv torch-geometric -f https://data.pyg.org/whl/torch-1.10.0+cu102.html
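
Once one of the two environments is active, a quick sanity check can confirm that the installed framework matches the version reported above. The following is a minimal sketch, not part of the repository: it assumes Python 3.8+ (for `importlib.metadata`) and assumes the pip distribution names `tensorflow` and `torch`, which may differ from what the requirements files actually install.

```python
from importlib import metadata

# Versions reported in this README. The pip distribution names are
# assumptions (the TensorFlow GPU build may be installed under another name).
EXPECTED = {
    "tensorflow": "2.3.2",
    "torch": "1.10.2",
}


def installed_version(dist_name):
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


def check_versions(expected=EXPECTED):
    """Map each distribution name to (installed, expected, matches)."""
    report = {}
    for name, wanted in expected.items():
        found = installed_version(name)
        # Compare only the public version, ignoring local tags such as "+cu102".
        matches = found is not None and found.split("+")[0] == wanted
        report[name] = (found, wanted, matches)
    return report


if __name__ == "__main__":
    for name, (found, wanted, ok) in check_versions().items():
        print(f"{name}: installed={found} expected={wanted} match={ok}")
```

Since the two frameworks live in separate environments, one of the two entries is expected to be reported as not installed in each environment.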

Installation guidelines: scenario #2

A more convenient way of running experiments is to instantiate two Docker containers with CUDA 10.1 and CUDA 10.2 already installed, respectively. We provide the Dockerfile to build each of the two containers.

Make sure you have Docker and NVIDIA Container Toolkit installed on your machine (you may refer to this guide).

Then, you may use the following Docker images to instantiate two containers equipped with CUDA 10.1 and CUDA 10.2, respectively:

Docker container with CUDA 10.1 and cuDNN 7.6 (the environment for TensorFlow): link
Docker container with CUDA 10.2 and cuDNN 8.0 (the environment for PyTorch): link

After setting up your Docker containers, you may follow the same guidelines as in scenario #1.

Datasets

In ./data/ you will find all the tsv files for the datasets, i.e., the training, validation, and test sets, and the interactions file (for EGCF). To avoid overloading this repository, we provide Google Drive links for the review-based side information (both for EGCF and for the other baselines):

Dataset	Link EGCF	Link Review Baselines
Baby	drive	drive
Boys & Girls	drive	drive
Men	drive	drive

After downloading the six zip files (two per dataset), place them in ./data/amazon_baby/, ./data/amazon_boys_girls/, and ./data/amazon_men/, respectively.

Finally, run the bash scripts ./data/<dataset_name>/create_<dataset_name>.sh to complete the procedure. Now you are all set, and you can start training and testing the models.
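
To run this last step for all three datasets at once, a small helper along the following lines may be convenient. It is only a sketch: it assumes that the folder names listed above (amazon_baby, amazon_boys_girls, amazon_men) are also the values of <dataset_name> in the script names, and that the scripts can be launched from the repository root. The function names are hypothetical.

```python
import subprocess
from pathlib import Path

# Folder names taken from this README; using them as <dataset_name> in the
# script file name is an assumption.
DATASETS = ["amazon_baby", "amazon_boys_girls", "amazon_men"]


def creation_script(dataset_name, data_root="./data"):
    """Build the path <data_root>/<dataset_name>/create_<dataset_name>.sh."""
    return Path(data_root) / dataset_name / f"create_{dataset_name}.sh"


def run_creation_scripts(datasets=DATASETS, data_root="./data"):
    """Run each creation script that exists; return the datasets processed."""
    done = []
    for name in datasets:
        script = creation_script(name, data_root)
        if not script.is_file():
            print(f"skipping {name}: {script} not found")
            continue
        # Launched from the current directory, as in the command above.
        # check=True raises an exception if the script exits with an error.
        subprocess.run(["bash", str(script)], check=True)
        done.append(name)
    return done


if __name__ == "__main__":
    run_creation_scripts()
```

If the scripts turn out to rely on paths relative to their own folder, passing `cwd=script.parent` to `subprocess.run` (and the bare script name as argument) would be the natural adjustment.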

Training and testing models

To train and evaluate models on all considered metrics, you may run the following command:

$ python -u start_experiments.py --model <model-name> --dataset <dataset-name>

where <model-name> and <dataset-name> refer to the name of the model to be run and the dataset on which to run the experiment, respectively.
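
When several models and datasets have to be run in sequence, the command above can be wrapped in a short driver. The sketch below is not part of the repository; the model and dataset names in the usage example are hypothetical placeholders, so check ./config_files/ for the names the script actually accepts.

```python
import subprocess
import sys


def build_command(model, dataset, script="start_experiments.py"):
    """Build the argument list for one run of the command shown above."""
    # sys.executable keeps the run inside the currently active environment.
    return [sys.executable, "-u", script, "--model", model, "--dataset", dataset]


def run_all(models, datasets, dry_run=True):
    """Run (or, with dry_run=True, only print) one experiment per pair."""
    commands = [build_command(m, d) for m in models for d in datasets]
    for cmd in commands:
        print(" ".join(cmd))
        if not dry_run:
            # check=True stops the sweep at the first failing experiment.
            subprocess.run(cmd, check=True)
    return commands


if __name__ == "__main__":
    # Hypothetical names, for illustration only.
    run_all(models=["egcf", "lightgcn"], datasets=["amazon_baby"], dry_run=True)
```

Because TensorFlow and PyTorch models need different environments, a single sweep should only include models whose backend matches the environment that is currently active.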

You may find all configuration files at ./config_files/<model-name>/<dataset-name>.yml, where all hyperparameter spaces and exploration strategies are reported.

As for EGCF (i.e., our proposed model), configuration files follow the pattern ./config_files/egcf/<dataset-name>_<hop_number>.yml, which makes it possible to run the study on the hop number and select the best model configuration.
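
To see which hop settings are available for a dataset, the EGCF configuration files can be listed by following the naming pattern above. This is a minimal sketch with a hypothetical function name; it only inspects file names and makes no assumption about how start_experiments.py selects a specific hop configuration.

```python
from pathlib import Path


def egcf_configs(dataset, config_root="./config_files"):
    """Return {hop_suffix: path} for files named egcf/<dataset>_<hop_number>.yml."""
    prefix = f"{dataset}_"
    found = {}
    for path in sorted(Path(config_root, "egcf").glob(f"{prefix}*.yml")):
        # Everything after "<dataset>_" is treated as the hop number. It is
        # kept as a string because its exact format is not stated in this README.
        found[path.stem[len(prefix):]] = path
    return found


if __name__ == "__main__":
    # "amazon_baby" is an assumed dataset name, used for illustration only.
    for hop, path in egcf_configs("amazon_baby").items():
        print(hop, path)
```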

The computed metrics are saved in the folder ./results/<dataset-name>/performance/. Specifically, look for the tsv file whose name follows the pattern: rec_cutoff_<cutoff>_relthreshold_0_<datetime-experiment-end>.tsv.
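
Since every experiment writes a new timestamped file, locating the most recent one by hand can be tedious. The sketch below picks the newest file matching the pattern by modification time and loads it with the standard csv module. It assumes that the first row of the tsv file is a header; the function names, the dataset name, and the cutoff value in the usage example are hypothetical.

```python
import csv
from pathlib import Path


def latest_results(dataset, cutoff, results_root="./results"):
    """Return the most recently modified matching tsv file, or None."""
    folder = Path(results_root) / dataset / "performance"
    pattern = f"rec_cutoff_{cutoff}_relthreshold_0_*.tsv"
    files = list(folder.glob(pattern))
    return max(files, key=lambda p: p.stat().st_mtime) if files else None


def read_rows(path):
    """Read a tab-separated file into a list of dicts keyed by its header row."""
    # Assumption: the first line of the file holds the column names.
    with open(path, newline="", encoding="utf-8") as handle:
        return list(csv.DictReader(handle, delimiter="\t"))


if __name__ == "__main__":
    # Hypothetical dataset name and cutoff, for illustration only.
    path = latest_results("amazon_baby", cutoff=10)
    if path is None:
        print("no results file found")
    else:
        for row in read_rows(path):
            print(row)
```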
