Time and memory cost of sequential recommendation models

Datasets information:

Dataset	#User	#Item	#Interaction	Sparsity
ml-1m	6,041	3,707	1,000,209	0.9553
DIGINETICA	59,425	42,116	547,416	0.9998
Yelp	102,046	98,408	2,903,648	0.9997

Device information

OS:                   Linux
Python Version:       3.8.3
PyTorch Version:      1.7.0
cudatoolkit Version:  10.1
GPU:                  TITAN RTX（24GB）
Machine Specs:        32 CPU machine, 64GB RAM

1) ml-1m dataset:

Time and memory cost on ml-1m dataset:

Method	Training Time (sec/epoch)	Evaluate Time (sec/epoch)	GPU Memory (GB)
Improved GRU-Rec	7.78	0.11	1.27
SASRec	17.78	0.12	1.84
NARM	8.29	0.11	1.29
FPMC	7.51	0.11	1.18
STAMP	7.32	0.11	1.20
Caser	44.85	0.12	1.14
NextItNet	16433.27	96.31	1.86
TransRec	10.08	0.16	8.18
S3Rec	-	-	-
GRU4RecF	10.20	0.15	1.80
SASRecF	18.84	0.17	1.78
BERT4Rec	36.09	0.34	1.97
FDSA	31.86	0.19	2.32
SRGNN	327.38	2.19	1.21
GCSAN	335.27	0.02	1.58
KSR	-	-	-
GRU4RecKG	-	-	-

Config file of ml-1m dataset:

# dataset config
field_separator: "\t"
seq_separator: " "
USER_ID_FIELD: user_id
ITEM_ID_FIELD: item_id
TIME_FIELD: timestamp
NEG_PREFIX: neg_
ITEM_LIST_LENGTH_FIELD: item_length
LIST_SUFFIX: _list
MAX_ITEM_LIST_LENGTH: 20
POSITION_FIELD: position_id
load_col:
  inter: [user_id, item_id, timestamp]
min_user_inter_num: 0
min_item_inter_num: 0

# training and evaluation
epochs: 500
train_batch_size: 2048
eval_batch_size: 2048
valid_metric: MRR@10
eval_setting: TO_LS,full
training_neg_sample_num: 0

Other parameters (including model parameters) are default value.

NOTE :

For FPMC and TransRec model, training_neg_sample_num should be 1 .
For SASRecF, GRU4RecF and FDSA, load_col should as below:

load_col:
  inter: [user_id, item_id, timestamp]
  item: [item_id, genre]

2）DIGINETICA dataset:

Time and memory cost on DIGINETICA dataset:

Method	Training Time (sec/epoch)	Evaluate Time (sec/epoch)	GPU Memory (GB)
Improved GRU-Rec	4.10	1.05	4.02
SASRec	8.36	1.21	4.43
NARM	4.30	1.08	4.09
FPMC	2.98	1.08	4.08
STAMP	4.27	1.04	3.88
Caser	17.15	1.18	3.94
NextItNet	6150.49	947.66	4.54
TransRec	-	-	Out of Memory
S3Rec	-	-	-
GRU4RecF	4.79	1.17	4.83
SASRecF	8.66	1.29	5.11
BERT4Rec	16.80	3.54	7.97
FDSA	13.44	1.47	5.66
SRGNN	88.59	15.37	4.01
GCSAN	96.69	17.11	4.25
KSR	-	-	-
GRU4RecKG	-	-	-

Config file of DIGINETICA dataset:

# dataset config
field_separator: "\t"
seq_separator: " "
USER_ID_FIELD: session_id
ITEM_ID_FIELD: item_id
TIME_FIELD: timestamp
NEG_PREFIX: neg_
ITEM_LIST_LENGTH_FIELD: item_length
LIST_SUFFIX: _list
MAX_ITEM_LIST_LENGTH: 20
POSITION_FIELD: position_id
load_col:
  inter: [session_id, item_id, timestamp]
min_user_inter_num: 6
min_item_inter_num: 1

# training and evaluation
epochs: 500
train_batch_size: 2048
eval_batch_size: 2048
valid_metric: MRR@10
eval_setting: TO_LS,full
training_neg_sample_num: 0

Other parameters (including model parameters) are default value.

NOTE :

For FPMC and TransRec model, training_neg_sample_num should be 1 .
For SASRecF, GRU4RecF and FDSA, load_col should as below:

load_col:
   inter: [session_id, item_id, timestamp]
   item: [item_id, item_category]

3）Yelp dataset:

Time and memory cost on Yelp dataset:

Method	Training Time (sec/epoch)	Evaluation Time (sec/epoch)	GPU Memory (GB)
Improved GRU-Rec	44.31	2.74	7.92
SASRec	75.51	3.11	8.32
NARM	45.65	2.76	7.98
FPMC	21.05	3.05	8.22
STAMP	42.08	2.72	7.77
Caser	147.15	2.89	7.87
NextItNet	45019.38	1670.76	8.44
TransRec	-	-	Out of Memory
S3Rec	-	-	-
GRU4RecF	-	-	Out of Memory
SASRecF	-	-	Out of Memory
BERT4Rec	193.74	8.43	16.57
FDSA	-	-	Out of Memory
SRGNN	825.11	33.20	7.90
GCSAN	837.23	33.00	8.14
KSR	-	-	-
GRU4RecKG	-	-	-

Config file of DIGINETICA dataset:

# dataset config
field_separator: "\t"
seq_separator: " "
USER_ID_FIELD: session_id
ITEM_ID_FIELD: item_id
TIME_FIELD: timestamp
NEG_PREFIX: neg_
ITEM_LIST_LENGTH_FIELD: item_length
LIST_SUFFIX: _list
MAX_ITEM_LIST_LENGTH: 20
POSITION_FIELD: position_id
load_col:
  inter: [session_id, item_id, timestamp]
min_user_inter_num: 6
min_item_inter_num: 1

# training and evaluation
epochs: 500
train_batch_size: 2048
eval_batch_size: 2048
valid_metric: MRR@10
eval_setting: TO_LS,full
training_neg_sample_num: 0

Other parameters (including model parameters) are default value.

NOTE :

For FPMC and TransRec model, training_neg_sample_num should be 1 .
For SASRecF, GRU4RecF and FDSA, load_col should as below:

load_col:
    inter: [session_id, item_id, timestamp]
 	item: [item_id, item_category]

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sequential_recommendation.md

Sequential_recommendation.md

Time and memory cost of sequential recommendation models

Datasets information:

Device information

1) ml-1m dataset:

Time and memory cost on ml-1m dataset:

Config file of ml-1m dataset:

2）DIGINETICA dataset:

Time and memory cost on DIGINETICA dataset:

Config file of DIGINETICA dataset:

3）Yelp dataset:

Time and memory cost on Yelp dataset:

Config file of DIGINETICA dataset:

Files

Sequential_recommendation.md

Latest commit

History

Sequential_recommendation.md

File metadata and controls

Time and memory cost of sequential recommendation models

Datasets information:

Device information

1) ml-1m dataset:

Time and memory cost on ml-1m dataset:

Config file of ml-1m dataset:

2）DIGINETICA dataset:

Time and memory cost on DIGINETICA dataset:

Config file of DIGINETICA dataset:

3）Yelp dataset:

Time and memory cost on Yelp dataset:

Config file of DIGINETICA dataset: