Skip to content

Latest commit

 

History

History
14 lines (10 loc) · 1.28 KB

README.md

File metadata and controls

14 lines (10 loc) · 1.28 KB

DEEP REINFORCEMENT LEARNING FOR 2048 GAME

This project is being done as a part of Web and Coding Club, IIT Bombay's Seasons of Code 2021.
In this project, our end goal is to try to implement a Deep Reinforcement Learning Algorithm to play the game of 2048 and get the highest possible score.

  • Week 1 : Implemented Expectimax Algorithm to create an Agent to play the 2048 achiving a median score of 10K for tree depth of 3.
  • Week 2 : Implemented MDP Planning to create an TicTacToe counteragent (given the moves of the oppponents at each state.
  • Week 3 : Implemented Tabular q-learning to create a Snake Xenzia AI for an environement of size 4*4 achieving a median length of 12.2
  • Week4 : Implemented Deep q-leearning to create an Agent to play the 2048 using a convNet with 4 conv layers & a fully connected layer achieving a median score of 37K & reaching the 2048 tile in more than 90% episodes

Learning curves