## Policy gradient notes

Derivation of principals underlying policy gradient methods

## Reinforcement learning an introduction

Excercies of the the book.

## Reinforcement learning notes

A collection of notes on mathematical concepts in RL

## Safely Approximating the Value Function

Function approximation of the value function is key to generalisation. However, one has to be careful!