You are on page 1of 31

EnVision

HOW CAN AI
GENERATE IMAGES?
HOW CAN AI GENERATE IMAGES?

To generate images, the machine


uses two neural networks.
The first neural network The second neural network
is used to create the analyzes the generated
image based on the text image with reference
input by the user. images.
To create an image by only text prompts,
you would use an AI architecture called
Generative Adversarial Network (GAN).
What is GAN?
A generative adversarial network (GAN) is an artificial intelligence
algorithm used to generate new, realistic images from a training data set
GANs are made up of two neural networks, a generator and a
discriminator, that compete with each other in a zero-sum game.
The generator tries to generate fake images that are close to the
real images in the training data, while the discriminator tries to
distinguish between the real and fake images. As the two
networks compete, they both improve, and the generator
eventually becomes able to generate realistic images.
GANs are made up of two
neural networks, a generator
and a discriminator, that
compete with each other in
a zero-sum game. The SAMPLE(real) REAL
generator tries to generate TRAINING DATASET DATA
DISCRIMINATOR
FAKE
fake images that are close to
the real images in the
training data, while the
discriminator tries to
distinguish between the real
and fake images. As the two GENERATOR
networks compete, they
both improve, and the
generator eventually RANDOM
NOISE SAMPLE(fake)
becomes able to generate DATA
realistic images.
TRAINING DATA
Model Being trained on large
amount of data sets
TRAINING DATA DEEP LEARNING
Deep learning algorithms go through all
the training data and find variables that
improve their performance
TRAINING DEEP
DATA LEARNING

LATENT SPACE
Mathematical space way more
than 3-dimensional
DIFFUSION

TRAINING DEEP
LATENT
DATA LEARNING
SPACE

GENERATION
It translates the points in the
mathematical space into an image with
the help of generative process called
diffusion
DIFFUSION

Made with Midjourney

TRAINING
DATA
DEEP
LEARNING
LATENT SPACE GENERATION
OUTPUT
It starts with just noise and then over
the series of iterations, arranges pixels
into a composition that makes sense to
humans
DIFFUSION

Made with Midjourney

TRAINING DEEP LATENT SPACE GENERATION OUTPUT


DATA LEARNING
PROJECT GOALS

• A full-fledged web
application for image
generation using AI
• AI image generation using
trending techniques such as
stable diffusion, and neural
style transfer.
TEAM MEMBERS
TEAM MEMBERS

MOHD SAHIL SHIKHAR JAGLAN SATYAM RAJA

2021364524 2021364524 2021364524

You might also like