Research playground built on top of OpenAI Gym's Atari environments, prepared for implementing various Reinforcement Learning algorithms.
It can emulate any of the following games:
['Asterix', 'Asteroids', 'MsPacman', 'Kaboom', 'BankHeist', 'Kangaroo', 'Skiing', 'FishingDerby', 'Krull', 'Berzerk', 'Tutankham', 'Zaxxon', 'Venture', 'Riverraid', 'Centipede', 'Adventure', 'BeamRider', 'CrazyClimber', 'TimePilot', 'Carnival', 'Tennis', 'Seaquest', 'Bowling', 'SpaceInvaders', 'Freeway', 'YarsRevenge', 'RoadRunner', 'JourneyEscape', 'WizardOfWor', 'Gopher', 'Breakout', 'StarGunner', 'Atlantis', 'DoubleDunk', 'Hero', 'BattleZone', 'Solaris', 'UpNDown', 'Frostbite', 'KungFuMaster', 'Pooyan', 'Pitfall', 'MontezumaRevenge', 'PrivateEye', 'AirRaid', 'Amidar', 'Robotank', 'DemonAttack', 'Defender', 'NameThisGame', 'Phoenix', 'Gravitar', 'ElevatorAction', 'Pong', 'VideoPinball', 'IceHockey', 'Boxing', 'Assault', 'Alien', 'Qbert', 'Enduro', 'ChopperCommand', 'Jamesbond']
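As a sketch of how one of these environments is typically instantiated (assuming the classic, pre-0.26 Gym API, where Atari game names carry a version suffix; `SpaceInvaders-v0` below is just one example id):

```python
import gym

# Create one of the listed games (classic Gym API).
env = gym.make("SpaceInvaders-v0")

observation = env.reset()
for _ in range(1000):
    action = env.action_space.sample()  # random policy, just to exercise the loop
    observation, reward, done, info = env.step(action)
    if done:  # episode ended: reset the emulator
        observation = env.reset()
env.close()
```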
Check out the corresponding Medium article: Atari - Reinforcement Learning in depth 🤖 (Part 1: DDQN)
The ultimate goal of this project is to implement and compare various RL approaches, with Atari games as a common denominator.
- Clone the repo.
- Go to the project's root folder.
- Install the required packages:

```
pip install -r requirements.txt
```

- Launch atari.py. I recommend starting with the help command to see all available modes:

```
python atari.py --help
```
* GAMMA = 0.99
* MEMORY_SIZE = 900000
* BATCH_SIZE = 32
* TRAINING_FREQUENCY = 4
* TARGET_NETWORK_UPDATE_FREQUENCY = 40000
* MODEL_PERSISTENCE_UPDATE_FREQUENCY = 10000
* REPLAY_START_SIZE = 50000
* EXPLORATION_MAX = 1.0
* EXPLORATION_MIN = 0.1
* EXPLORATION_TEST = 0.02
* EXPLORATION_STEPS = 850000
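A minimal sketch of how these constants typically interact in a DDQN training loop (the helpers below are hypothetical illustrations, not the project's actual code): epsilon is annealed linearly from EXPLORATION_MAX to EXPLORATION_MIN over EXPLORATION_STEPS, and the Double DQN target lets the online network pick the next action while the target network (synced every TARGET_NETWORK_UPDATE_FREQUENCY steps) evaluates it.

```python
import numpy as np

GAMMA = 0.99
EXPLORATION_MAX = 1.0
EXPLORATION_MIN = 0.1
EXPLORATION_STEPS = 850_000

def epsilon(step):
    # Linear annealing: 1.0 -> 0.1 over the first 850k steps, then constant.
    slope = (EXPLORATION_MAX - EXPLORATION_MIN) / EXPLORATION_STEPS
    return max(EXPLORATION_MIN, EXPLORATION_MAX - slope * step)

def ddqn_targets(rewards, next_q_online, next_q_target, dones):
    # Double DQN: the online network picks argmax_a Q(s', a); the
    # periodically synced target network evaluates that action.
    next_actions = np.argmax(next_q_online, axis=1)
    evaluated = next_q_target[np.arange(len(next_actions)), next_actions]
    return rewards + GAMMA * (1.0 - dones) * evaluated
```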
Deep Convolutional Neural Network by DeepMind
* Conv2D (None, 32, 20, 20)
* Conv2D (None, 64, 9, 9)
* Conv2D (None, 64, 7, 7)
* Flatten (None, 3136)
* Dense (None, 512)
* Dense (None, 4)
Trainable params: 1,686,180
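The layer shapes above match the architecture from DeepMind's DQN paper (8×8/4, 4×4/2, and 3×3/1 convolutions over a stack of four 84×84 frames). A minimal Keras sketch that reproduces these channels-first shapes and the 1,686,180-parameter count, assuming a 4-action game:

```python
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Conv2D, Flatten, Dense

model = Sequential([
    # Input: stack of 4 grayscale 84x84 frames, channels first.
    Conv2D(32, 8, strides=4, activation="relu",
           data_format="channels_first", input_shape=(4, 84, 84)),  # -> (32, 20, 20)
    Conv2D(64, 4, strides=2, activation="relu",
           data_format="channels_first"),                           # -> (64, 9, 9)
    Conv2D(64, 3, strides=1, activation="relu",
           data_format="channels_first"),                           # -> (64, 7, 7)
    Flatten(),                                                      # -> (3136,)
    Dense(512, activation="relu"),
    Dense(4),  # one Q-value per action
])
model.summary()  # Trainable params: 1,686,180
```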
After 5M steps (~40h on a Tesla K80 GPU or ~90h on a 2.9 GHz Intel i7 quad-core CPU):
Training: normalized score (each reward clipped to (-1, 1))

Testing:
* Human average: ~372
* DDQN average: ~479 (128%)
Training: normalized score (each reward clipped to (-1, 1))

Testing:
* Human average: ~28
* DDQN average: ~62 (221%)
Training: normalized score (each reward clipped to (-1, 1))

Testing:
* Human average: ~29,000
* GE average: 31,000 (106%)
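The "normalized score" in the training charts refers to the standard DQN trick of clipping each raw emulator reward into (-1, 1), so gradient magnitudes stay comparable across games; a one-line sketch of that step:

```python
import numpy as np

def clip_reward(raw_reward):
    # Clip every emulator reward into (-1, 1) so score scales
    # are comparable across games during training.
    return float(np.clip(raw_reward, -1.0, 1.0))
```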
Greg (Grzegorz) Surma