Diffusion Model Assignment: Q7 and Q8: Q7: Blade Runner 2049 - Reconstructing Memory Through Noise
Diffusion Model Assignment: Q7 and Q8: Q7: Blade Runner 2049 - Reconstructing Memory Through Noise
Let:
t
Y
αt = 1 − β t , ᾱt = αi
i=1
Then: √ √
xt = ᾱt x0 + 1 − ᾱt ϵ, ϵ ∼ N (0, I)
So: √
q(xt | x0 ) = N (xt ; ᾱt x0 , (1 − ᾱt )I)
• Linear Schedule: βt increases linearly with t. This results in fast early corruption and lower
sample quality.
• Cosine Schedule: A nonlinear schedule where ᾱt follows a cosine function. This maintains
more signal in early steps and leads to higher quality generations.
1
Q8: Conditional DDPM for Ghibli-style Portrait Stylization
(a) Conditional Diffusion Setup
We aim to learn a conditional model:
p(x | y)
where x is a Ghibli-style portrait and y is a real-world portrait.
We define a conditional denoising model:
ϵθ (xt , t | y)
where √ √
xt = ᾱt x0 + 1 − ᾱt ϵ, ϵ ∼ N (0, I)
The full training objective:
In contrast, GANs suffer from training instability, harder conditioning, and mode collapse.
DDPMs are thus preferred for tasks like Ghibli-style stylization.