COMP9444 Neural Networks and Deep Learning
Quiz 7 (Reinforcement Learning)
This is an optional quiz to test your understanding of
the material from Week 7.
- Explain the difference between the following paradigms,
in terms of what is presented to the agent, and what the
agent aims to do:
- Supervised Learning
- Unsupervised Learning
- Reinforcment Learning
- Describe the elements (sets and functions)
that are needed to give a formal description of
a reinforcement learning environment.
What is the difference between a deterministic
environment and a stochastic environment?
- Name three different models of
optimality in reinforcement learning,
and give a formula for calculating each one.
- What is the definition of:
- the optimal policy
- the value function
- the Q-function?
- Assuming a stochastic environment,
discount factor γ
and learning rate of η, write the
equation for
- Temporal Difference learning TD(0)
- Q-Learning
Remember to define any symbols you use.