The power of diffusion models

~ Pramath


Part A

Part 1: Sampling loops

For convinence, parts 1.1, 1.2 and 1.3 have been combined into 1 image.
Below is the result.
Sharpen imgage

Part 1.4:

Here you can see that the image gets gradually denoised as the time step increaseas.

Sharpen imgage Sharpen imgage

Part 1.5: Diffusion model sampling

The difussion model is sampled with the prompt "a high quality photo". Below are the images sampled. Sharpen imgage Sharpen imgage Sharpen imgage Sharpen imgage

Part 1.6: Classifier-Free Guidance

Sampled with classifier guidance. Below are the images sampled from the CFG of scale = 7.

Sharpen imgage Sharpen imgage Sharpen imgage Sharpen imgage Sharpen imgage

Part 1.7: Image-image translation

The start timestep was varied. Below are the results.

Sharpen imgage

Part 1.7.1

Here is a web based image.

Sharpen imgage

Here is a hand drawn image and below it is the generated image from it.

Sharpen imgage Sharpen imgage

Here is another hand drawn image and its respective processed image.

Sharpen imgage Sharpen imgage

Part 1.7.2

Here is an inpainting of the camponile.

Sharpen imgage

Here is an inpainting of my own image.The first image is the original image. The image after is the inpainted image.

Sharpen imgage Sharpen imgage

Here is the second inpainting of my own image.The first image is the original image. The image after is the inpainted image.

Sharpen imgage Sharpen imgage

Part 1.7.3

Image to image translation using text condition. The first image is using the camponile and a propmpt of "a rocket ship".

Sharpen imgage

Part 1.8: Visual Anagram

Here are some visual anagrams I generated.

Sharpen imgage Sharpen imgage

Here is another one of a pencil and a rocket.

Sharpen imgage Sharpen imgage

Here is another one of a monkey eating bananas and amfalfi coast.

Sharpen imgage Sharpen imgage

Part B

Part 1

After the denoiser has been trained, here are the results. The first image is the denoiser after being trained for 1 epoch and the next image is after being trained for 5 epochs.

Sharpen imgage Sharpen imgage

The model was tested out of its distribution since it was trained at sigma = 0.5 noise level. Here are the results.

Sharpen imgage Sharpen imgage Sharpen imgage Sharpen imgage Sharpen imgage Sharpen imgage Sharpen imgage

Part 2

The time based Unet was implemented. Here are the results over the epochs 1, 5, 19.

Epoch 1

Sharpen imgage

Epoch 5

Sharpen imgage

Epoch 19

Sharpen imgage