Deep Q-Network with Pytorch

DQN

Experience Replay

Replay Buffer

Another Problem that replays buffer solves:

Fix it

Summary

Fixed Q-Targets

Q-Learning Update

Fixed Target

Summary

DQN — Implementation

Model Architecture

DQN Agent

Train the Agent with DQN

Watch a Smart Agent!

Deep Q-Learning PipeLine

Different methods of dqn_Agent

Deep Q-Learning Improvements

Double DQN

Trained agent Example