RL-Atari-Skiing/README.md at main · SuperbTUM/RL-Atari-Skiing

Introduction

Atari Skiing is a classical Atari game. Developing an AI player is difficult as the reward is only returned at the end of the game. Given the DQN as the baseline, we will try to implement some optimizations and see if there is any improvement. Current methodologies include:

Introducing heuristic data as initialization of replay buffer
Implementing D3QN + Noise (optional)
Cyclic epsilon
Using prioritized replay buffer prioritizing with absolute TD error
Implementing residual DQN
Implementing unrolled LSTM (DRQN, Sequential Sampling)
Rescaling Q function
Soft update on target model
Gradient clipping (optional)

Prerequisites

numpy <= 1.19.5

Quick Start

As we are building a playground, there are several ways of starting experiments. One example could be as follows:

python .\dqn.py --double_dqn --dueling --is_rnn --learning_rate 0.001 --training --include_flag_punishment

Checkpoints

Google Drive

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Introduction

Prerequisites

Quick Start

Checkpoints

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

Introduction

Prerequisites

Quick Start

Checkpoints