Scarf: Self-Adaptive Tuning via Multi-Objective Reinforcement Learning for Apache Flink

Scarf is an automatic configuration tuning framework for Apache Flink. It consists of:

Knob selection acceleration through workload clustering
Multi-objective Reinforcement Learning (MORL)-based offline-online learning
Knowledge transfer via topology-agnostic GNN-based actor-critic network

Build and Run

Requirements

This tuner is implemented with Python 3.12. To run the tuner, install packages in requirements.txt.

This tuner is tested against Flink 2.0 running on Java 17 running YARN application mode with Hadoop 3.4.1.

The workloads are located in the flink-jobs/ directory. You need to compile the JAR file and upload it to HDFS using flink-jobs/build.sh.

The Tuning Pipeline

First, fill in the cluster address, job information and hyperparameters in config/config.yaml. The meaning of each configuration is described in utils/config.py.

Knob Selection

From Scratch

Run:

python main.py --mode selection --stage coldstart --config config/config.yaml

An output folder will be created under tuner.saveDir in the config file.

Analyze Results

Place the output directory in tuner.loadDir in the config file, and run:

python main.py --mode selection --stage analysis --config config/config.yaml

Speed Up with History

Place the output directories of historical tasks in selection/speedup.py, and run:

python main.py --mode selection --stage cluster --config config/config.yaml

Offline Training

From Scratch

Remove the value of tuner.loadDir in the config file, fill in the selected knobs in the knobs section of the config file, and run:

python main.py --mode offline --config config/config.yaml

With Transfer

Place the output directory of the task to transfer from in tuner.loadDir, and run:

python main.py --mode offline --config config/config.yaml

Online Tuning

Place the output directory of the offline trained task in tuner.loadDir, and run:

python main.py --mode online --config config/config.yaml

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
config		config
environment		environment
flink-jobs		flink-jobs
flink		flink
model		model
offline_learning		offline_learning
online_tuning		online_tuning
selection		selection
test		test
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Scarf: Self-Adaptive Tuning via Multi-Objective Reinforcement Learning for Apache Flink

Build and Run

Requirements

The Tuning Pipeline

Knob Selection

From Scratch

Analyze Results

Speed Up with History

Offline Training

From Scratch

With Transfer

Online Tuning

About

Uh oh!

Releases

Packages

Languages

License

ZJU-DAILY/Scarf

Folders and files

Latest commit

History

Repository files navigation

Scarf: Self-Adaptive Tuning via Multi-Objective Reinforcement Learning for Apache Flink

Build and Run

Requirements

The Tuning Pipeline

Knob Selection

From Scratch

Analyze Results

Speed Up with History

Offline Training

From Scratch

With Transfer

Online Tuning

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages