This repository accompanies an RL research paper on agent indicator methods. It was forked from the rl-baselines3-zoo project and modified to conduct experiments on agent indicators used in RL.
The script for hyperparameter optimization is indicator_opt.py. The following command can be used to run it:
$ python indicator_opt.py --algo {PPO or DQN} --env {env name} --n-timesteps {number of timesteps} --n-trials {number of hyperparameter sets to test} --n-evaluations {number of evaluations} --sampler {Optuna sampler type} --pruner {Optuna pruner type}
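For example, a concrete run could look like the line below. The environment name, timestep budget, trial counts, and the sampler/pruner names (Optuna's TPE sampler and median pruner) are illustrative placeholders, not the settings used for the paper; substitute the values accepted by the script in your own setup.

$ python indicator_opt.py --algo PPO --env pistonball_v4 --n-timesteps 1000000 --n-trials 100 --n-evaluations 20 --sampler tpe --pruner median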
There are four agent indicators implemented in this repo: Inversion, Inversion with Replacement, Geometric, and Binary (a minimal sketch of two of them follows the list).
- Inversion: Invert the observations of certain agent types and append the inverted copy as an extra channel to the original observation. For agent types that do not need inversion, a duplicate of the original observation is appended instead.
- Inversion with Replacement: The same as Inversion, except that the inverted observation (or the duplicate, for agent types that do not need inversion) replaces the original observation instead of being appended.
- Geometric: Add an extra channel containing an alternating checkered geometric pattern that differs per agent type.
- Binary: Add extra channels, each entirely black or entirely white depending on the agent's type.
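The sketch below illustrates how the Inversion and Binary indicators could be constructed, assuming image observations stored as uint8 NumPy arrays of shape (H, W, C). It is a simplified illustration only; the actual implementation used for the experiments lives in indicator_utils.py and may differ.

```python
# Illustrative sketches of the Inversion and Binary indicators, assuming
# uint8 image observations of shape (H, W, C). Simplified example only;
# the implementation used in the experiments is in indicator_utils.py.
import numpy as np


def add_inversion_indicator(obs: np.ndarray, invert: bool) -> np.ndarray:
    """Inversion: append an extra set of channels holding the inverted
    observation, or a plain duplicate for agents that need no inversion."""
    extra = 255 - obs if invert else obs.copy()
    return np.concatenate([obs, extra], axis=-1)


def add_binary_indicator(obs: np.ndarray, agent_type: int, num_types: int) -> np.ndarray:
    """Binary: append one channel per agent type; the channel matching
    agent_type is all white (255) and the others are all black (0)."""
    h, w, _ = obs.shape
    indicator = np.zeros((h, w, num_types), dtype=obs.dtype)
    indicator[:, :, agent_type] = 255
    return np.concatenate([obs, indicator], axis=-1)


# Example: an 84x84 grayscale observation for agent type 1 out of 2.
obs = np.zeros((84, 84, 1), dtype=np.uint8)
print(add_inversion_indicator(obs, invert=True).shape)              # (84, 84, 2)
print(add_binary_indicator(obs, agent_type=1, num_types=2).shape)   # (84, 84, 3)
```

Along the same lines, Inversion with Replacement would return the inverted copy in place of the original observation rather than concatenating it, and the Geometric indicator would append a fixed checkered pattern instead of a flat black or white channel.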
For implementation details, please refer to indicator_utils.py.
The hyperparameters that gave the best results were retrained multiple times for a fair evaluation. The retraining and evaluation script is indicator_eval_params.py.
All experimental results used for the paper are stored in the indicator_hyperparameters folder.