This is an optional quiz to test your understanding of the Reinforcement Learning material from Week 7.

Describe the elements (sets and functions) that are needed to give a formal description of a reinforcement learning environment. What is the difference between a deterministic environment and a stochastic environment?
Name three different models of optimality in reinforcement learning, and give a formula for calculating each one.
What is the definition of:
1. the optimal policy
2. the value function
3. the Q-function?
Assuming a stochastic environment, discount factor γ and learning rate of η, write the equation for
1. Temporal Difference learning TD(0)
2. Q-Learning
Remember to define any symbols you use.

Make sure you attempt the questions yourself, before looking at the Sample Answers.