Skip to content

skycarl/ml_proj_6

Folders and files

NameName
Last commit message
Last commit date

Latest commit

6b4b901 · Jan 28, 2022

History

61 Commits
Nov 22, 2020
Jan 28, 2022
Dec 13, 2020
Nov 22, 2020
Jan 28, 2022
Dec 13, 2020
Dec 13, 2020
Dec 13, 2020
Dec 13, 2020
Jan 28, 2022

Repository files navigation

Introduction to Machine Learning Project 6

Reinforcement learning: Value Iteration, SARSA, and Q-learning on the racetrack problem

Train a policy

To train a Q-learning policy:

python exp_Q.py

To train a SARSA policy:

python exp_SARSA.py

To train a value iteration policy:

python exp_VI.py

This results in an arrays directory which contains the policy at various points during training as well as a

Race with a policy

Once a policy is trained, it can be used in a race.

python race.py

This produces a race like so:

Results

Plot of learning curves for Q-Learning on different tracks and with different crash behavior. A "normal crash" means that if the car crashes into a wall, it returns to the last valid track square, whereas a "bad crash" means that it returns to the starting line upon crashing.

Example race:

About

Reinforcement learning

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages