Learning to Incentivize Others

This repository contains the code for the experiments in the paper "Learning to Incentivize Other Learning Agents". Baselines are included.

Setup

Python 3.6
TensorFlow >= 1.12
OpenAI Gym == 0.10.9
Clone and pip install Sequential Social Dilemma (SSD), a fork of the original open-source implementation.
Clone and pip install LOLA if you wish to run this baseline.
Clone this repository and run $ pip install -e . from the root.
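
The version constraints listed above can be checked before training. The following is a minimal sketch and is not part of this repository: the names parse_version, meets_requirement, REQUIREMENTS and check are hypothetical, and it assumes plain dotted numeric version strings (a suffix such as "rc1" is not handled).

```python
import operator

# Comparison operators supported by this sketch.
_OPS = {">=": operator.ge, "==": operator.eq}

# Requirements as listed in Setup (hypothetical table for illustration).
REQUIREMENTS = {"tensorflow": (">=", "1.12"), "gym": ("==", "0.10.9")}


def parse_version(text):
    """Turn a dotted numeric version such as '0.10.9' into a tuple of ints."""
    return tuple(int(part) for part in text.split("."))


def meets_requirement(installed, op, required):
    """Return True if the installed version satisfies `op required`."""
    a, b = parse_version(installed), parse_version(required)
    # Pad the shorter tuple with zeros so that '1.12' compares like '1.12.0'.
    width = max(len(a), len(b))
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return _OPS[op](a, b)


def check(installed_versions):
    """Map each required package name to True/False for the given versions."""
    return {
        name: meets_requirement(installed_versions[name], op, required)
        for name, (op, required) in REQUIREMENTS.items()
    }
```

In a real environment the installed versions could be read from tensorflow.__version__ and gym.__version__ and passed to check as a dictionary.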

Navigation

alg/ - Implementation of LIO and the policy gradient (PG) and actor-critic (AC) baselines.
env/ - Implementation of the Escape Room game and wrappers around the SSD environment.
results/ - Training results are stored in subfolders here. Each independent training run creates a subfolder that contains the final TensorFlow model and the reward log files. For example, 5 parallel independent training runs would create results/cleanup/10x10_lio_0,...,results/cleanup/10x10_lio_4 (depending on configurable strings in config files).
utils/ - Utility methods.
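
The subfolder naming described for results/ can be sketched as follows. This is an illustration only: the function run_dirs and its parameters are hypothetical, and the actual names depend on the strings set in the config files.

```python
def run_dirs(env_name, dir_name, n_runs, root="results"):
    """List one results subfolder per independent run, suffixed with the run index."""
    return ["/".join([root, env_name, "%s_%d" % (dir_name, idx)])
            for idx in range(n_runs)]


# Example matching the Cleanup case described above.
for path in run_dirs("cleanup", "10x10_lio", 5):
    print(path)
```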

Examples

Train LIO on Escape Room

Set config values in alg/config_room_lio.py
cd into the alg folder
Execute the training script with $ python train_multiprocess.py lio er. The default settings launch 5 parallel runs with different seeds.
For a single run, execute $ python train_lio.py er.
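
The idea of several independent runs that differ only in their seed can be sketched as below. This is not the code of train_multiprocess.py: the config keys seed and dir_name, the base seed value, and the functions make_run_configs, train_one and launch are all assumptions made for illustration.

```python
import copy
from multiprocessing import Process


def make_run_configs(base_config, n_runs=5, base_seed=12340):
    """Create one config per run, each with its own seed and output folder name."""
    configs = []
    for idx in range(n_runs):
        cfg = copy.deepcopy(base_config)  # leave the base config untouched
        cfg["seed"] = base_seed + idx
        cfg["dir_name"] = "%s_%d" % (base_config["dir_name"], idx)
        configs.append(cfg)
    return configs


def train_one(cfg):
    """Hypothetical placeholder standing in for a single training run."""
    print("training with seed %d -> %s" % (cfg["seed"], cfg["dir_name"]))


def launch(configs):
    """Start one process per config and wait for all of them to finish."""
    procs = [Process(target=train_one, args=(cfg,)) for cfg in configs]
    for proc in procs:
        proc.start()
    for proc in procs:
        proc.join()
```

Calling launch(make_run_configs({"dir_name": "er_lio"})) from a script's if __name__ == "__main__": block would start five processes, one per seed.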

Train LIO on Cleanup

Set config values in alg/config_ssd_lio.py
cd into the alg folder
Execute the training script with $ python train_multiprocess.py lio ssd.
For a single run, execute $ python train_ssd.py.

Citation

@article{yang2020learning,
  title={Learning to incentivize other learning agents},
  author={Yang, Jiachen and Li, Ang and Farajtabar, Mehrdad and Sunehag, Peter and Hughes, Edward and Zha, Hongyuan},
  journal={Advances in Neural Information Processing Systems},
  volume={33},
  pages={15208--15219},
  year={2020}
}

License

See LICENSE.

SPDX-License-Identifier: MIT
