- Write out the steps in the REINFORCE algorithm,
making sure to define any symbols you use.
- In the context of Deep Q-Learning, explain the following:
- Experience Replay
- Double Q-Learning
- What is the Energy function for these architectures:
- Boltzmann Machine
- Restricted Boltzmann Machine
Remember to define any variables you use.
-
The Variational Auto-Encoder is trained to maximize
Ez ∼ qφ(z | x(i))
[log pθ(x(i) | z)]
– DKL(qφ(z | x(i)) ||
p(z))
Briefly state what each of these two terms aims to achieve.
- Generative Adversarial Networks make use of a two-player
zero-sum game between a Generator Gθ and a
Discriminator Dψ, to compute
minθ maxψ (V(Gθ, Dψ))
Give the formula for V(Gθ, Dψ).
-
In the context of GANs, briefly explain what is meant by
mode collapse,
and list three different methods for avoiding it.