https://sbustreamspot.github.io
This repository contains the core streaming heterogeneous graph clustering and anomaly detection code.
Before attempting execution, ensure you have the following available as two separate files:
-
Edges: A file containing one edge per line for all input graphs in the dataset. A sample is provided as
test_edges.txt
. The edge file used for the experiments in the paper is available at sbustreamspot-data. -
Bootstrap clusters: A file describing the bootstrap clusters. A sample is provided as
test_bootstrap_clusters.txt
. Bootstrap clusters used for the experiments in the paper are available here.
The output will contain a summary of the execution parameters, runtime, and the graph cluster assignments and anomaly scores every 10,000 edges. This output can be further analyzed in various dimensions with sbustreamspot-analyze.
Compilation and execution has been tested with GCC 5.2.1 on Ubuntu 15.10.
git clone https://github.com/sbustreamspot/sbustreamspot-core.git
cd sbustreamspot-core
make clean optimized
./streamspot --edges=test_edges.txt \
--bootstrap=test_bootstrap_clusters.txt \
--chunk-length=10 \
--num-parallel-graphs=10 \
--max-num-edges=10 \
--dataset=all
To use a different dataset, change test_edges.txt
and
test_bootstrap_clusters.txt
accordingly.
Most parameter settings are via the command-line; details on setting
them can be viewed by running ./streamspot --help
.
A few parameters are set at compile-time and can be found in param.h
.