Skip to content

SpMV_Struct_Benchmarks

Luc Berger edited this page Feb 24, 2020 · 15 revisions

Structured SpMV algorithms performance

This page gathers instructions and references to conduct performance tests of the spmv_struct algorithms in Kokkos-Kernels.

GPU performance

K80

The first table shows the comparison of performance on 2D problems using FD (5pt) and FE (9pt) stencils

Matrix size FD Struct FD Unstr FE Struct FD Unstr
50 0.03615 0.02654 0.03738 0.02720
100 0.04266 0.03411 0.04668 0.04069
150 0.05323 0.05426 0.06753 0.07118
225 0.07749 0.09979 0.10876 0.12682
300 0.10972 0.15491 0.16904 0.20675
400 0.16881 0.25213 0.26955 0.33847
1,000 0.88039 1.42716 1.55607 1.86464
2,000 3.30934 4.35175 5.36563 5.63246

3D problems

Matrix size FD Struct FD Unstr FE Struct FD Unstr
30 0.06985 0.08438 0.14827 0.14980
40 0.12008 0.18496 0.31668 0.33553
50 0.19112 0.30912 0.56940 0.60685
60 0.30857 0.50288 1.01198 1.03442
80 0.69584 1.09019 2.31828 2.26498
100 1.29433 2.09743 4.26691 3.95687
200 8.35539 11.51226 26.04506 29.12328

P100

The first table shows the comparison of performance on 2D problems using FD (5pt) and FE (9pt) stencils

Matrix size FD Struct FD Unstr FE Struct FD Unstr
50 0.02639 0.02654 0.02062 0.01521
100 0.02071 0.01825 0.02106 0.01716
150 0.02959 0.04015 0.02340 0.02078
225 0.02780 0.02477 0.03321 0.05163
300 0.03654 0.06123 0.04925 0.07147
400 0.04875 0.09654 0.06846 0.12177
1,000 0.20233 0.53389 0.32596 0.66484
2,000 0.74472 2.10274 1.23610 2.48697

3D problems

Matrix size FD Struct FD Unstr FE Struct FD Unstr
30 0.02234 0.02195 0.03912 0.04365
40 0.03450 0.06202 0.06745 0.07851
50 0.05008 0.12886 0.11510 0.13937
60 0.07094 0.19252 0.18333 0.22314
80 0.14351 0.42709 0.41497 0.51883
100 0.26672 0.83247 0.81861 0.98918
200 1.92387 6.64503 5.90622 7.67507

V100

Clone this wiki locally