COMP9444 Neural Networks and Deep Learning
Quiz 7 (Reinforcement Learning)
This is an optional quiz to test your understanding of
the Reinforcement Learning material from Week 7.
- Describe the elements (sets and functions)
that are needed to give a formal description of
a reinforcement learning environment.
What is the difference between a deterministic
environment and a stochastic environment?
- Name three different models of
optimality in reinforcement learning,
and give a formula for calculating each one.
- What is the definition of:
- the optimal policy
- the value function
- the Q-function?
- Assuming a stochastic environment,
discount factor γ
and learning rate of η, write the
equation for
- Temporal Difference learning TD(0)
- Q-Learning
Remember to define any symbols you use.
Make sure you attempt the questions yourself, before looking at the
Sample Answers.