Generative Adversarial Networks

Setup: Assume we have data $x_{i}$ drawn from distribution $p_{data} (x)$ . Want to sample from $p_{data}$
Idea: Introduce a latent variable z with simple prior p(z). Sample z ~ p(z) and pass to a Generator Network x = G(z).
Then x is a sample from the Generator distribution $p_{G}$ . Want $p_{G} = p_{data}$

Generator Network: Try to fool the discriminator by generating real-looking images
Discriminator Network: Try to distinguish between real and fake images

Pasted image 20241205162806.png
Jointly train Generator network and Discriminator Network
Train jointly in minimax game

Discriminator wants to $D (x) = 1$ for real data and $D (x) = 0$ for fake data
Generator wants $D (x) = 1$ for fake data

Train G and D using alternating gradient updates

Pasted image 20241205165259.png
At start of training, generator is very bad and discriminator can easily tell apart real from fake

so D(G(z)) close to 0

Vanishing gradients for G

Train G to maximize -log(D(G(Z)))

Instead of minimize log(1-D(G(z)))
G gets strong gradients at start of training

Architecture: DC-GAN

Generator is an upsampling convolutional network
Discriminator is a convolutional network
Pasted image 20241205170313.png

Vector Math

Pasted image 20241205170613.png
Pasted image 20241205170621.png