Deeplearning - Ai Deeplearning - Ai

Copyright Notice
These slides are distributed under the Creative Commons License.
DeepLearning.AI makes these slides available for educational purposes. You may not use or distribute
these slides for commercial purposes. You may make copies of these slides and use or distribute them for
educational purposes as long as you cite DeepLearning.AI as the source of the slides.
For the rest of the details of the license, see https://creativecommons.org/licenses/by-sa/2.0/legalcode

Face recognition
What is face
deeplearning.ai
recognition?
Face recognition
[Courtesy of Baidu] Andrew Ng

Face verification vs. face recognition
Verification
• Input image, name/ID
• Output whether the input image is that of the
claimed person
Recognition
• Has a database of K persons
• Get an input image
• Output ID if the image is any of the K persons (or
“not recognized”)
Andrew Ng
Face recognition
One-shot learning
deeplearning.ai
One-shot learning
Learning from one
example to recognize the
person again
Andrew Ng
Learning a “similarity” function
d(img1,img2) = degree of difference between images
If d(img1,img2) ≤ -
> -
Andrew Ng
Face recognition
Siamese network
deeplearning.ai
Siamese network
⋮ ⋮
" ($)
⋮ ⋮
" (&)
[Taigman et. al., 2014. DeepFace closing the gap to human level performance] Andrew Ng
Goal of learning
⋮ ⋮
f(" ($) )
Parameters of NN define an encoding ((" ) )

Learn parameters so that:
) + ) + &
If " , " are the same person, f " −f " is small.
) + ) + &
If " , " are different persons, f " −f " is large.
Andrew Ng
Face recognition
Triplet loss
deeplearning.ai
Learning Objective
Anchor Positive Anchor Negative
[Schroff et al.,2015, FaceNet: A unified embedding for face recognition and clustering] Andrew Ng
Loss function
Training set: 10k pictures of 1k persons
Choosing the triplets A,P,N
During training, if A,P,N are chosen randomly,

! ", $ + & ≤ !(", )) is easily satisfied.
Choose triplets that’re “hard” to train on.
Training set using triplet loss
Anchor Positive Negative
⋮ ⋮ ⋮
Andrew Ng
Face recognition
Face verification and

deeplearning.ai
binary classification
Learning the similarity function
⋮
$ (%) f($ (%) ) ()
$ (') f($ (') )
Face verification supervised learning
$ (
Neural Style
Transfer
What is neural style

deeplearning.ai
transfer?
Neural style transfer
Content Style Content Style
Generated image Generated image

[Images generated by Justin Johnson] Andrew Ng
Neural Style
Transfer
What are deep

deeplearning.ai
ConvNets learning?
Visualizing what a deep network is learning
⋮ ⋮ &'
26×26×256 13×13×256 13×13×384 13×13×384 6×6×256
55×55×96
FC FC
224×224×3 110×110×96 4096 4096
Pick a unit in layer 1. Find the nine

image patches that maximize the unit’s
activation.
Repeat for other units.
[Zeiler and Fergus., 2013, Visualizing and understanding convolutional networks] Andrew Ng
Visualizing deep layers
Layer 1 Layer 2 Layer 3 Layer 4 Layer 5
Andrew Ng
Visualizing deep layers: Layer 1
Andrew Ng
Andrew Ng
Andrew Ng
Andrew Ng
Andrew Ng
Andrew Ng
Neural Style
Transfer
Cost function
deeplearning.ai
Neural style transfer cost function
Content C Style S
Generated image G
[Gatys et al., 2015. A neural algorithm of artistic style. Images on slide generated by Justin Johnson] Andrew Ng
Find the generated image G
1. Initiate G randomly
G: 100×100×3
2. Use gradient descent to minimize %(')
[Gatys et al., 2015. A neural algorithm of artistic style] Andrew Ng

Neural Style
Transfer
Content cost
deeplearning.ai
function
Content cost function
" # = % "'()*+)* ,, # + / "0*12+ (4, #)
• Say you use hidden layer ! to compute content cost.
• Use pre-trained ConvNet. (E.g., VGG network)
• Let 6[2](9) and 6[2](:) be the activation of layer !
on the images
• If 6[2](9) and 6[2](:) are similar, both images have
similar content

Neural Style
Transfer
Style cost function

deeplearning.ai
Meaning of the “style” of an image
255 134 93 22
255 134 202 22
123 42
255 231 94 22
83 2
123 94 83 4
"#
34 83
123 94 44 187
2 30
34 44 187 192
34 44 187 92 124
34 76 232
34 76 232 34
67 232
346776 83 124
194 142
⋮
83 194 94
67 83 194 202
Say you are using layer $’s activation to measure “style.”

Define style as correlation between activations across channels.
How correlated are the activations

%' across different channels?
%(
%&

Intuition about style of an image
Style image Generated Image
%' %'
%( %(
%& %&

Style matrix
[/] [/] [/] [/]
Let a*,,,- = activation at 2, 3, 4 . 7 is n9 ×n9

Style cost function
/ 1 / H / J
;<=>/? (A, 7) = E F F(7-- G − 7-- G )
[/] [/] [/]
2%' %& %( - -G

Convolutional
Networks in 1D or 3D
1D and 3D
deeplearning.ai
generalizations of
models
Convolutions in 2D and 1D
∗
2D filter
5×5
2D input image
14×14
1 20 15 3 18 12 4 17 1 3 10 3 1
Andrew Ng
3D data
Andrew Ng
3D data
Andrew Ng
3D data
Andrew Ng
3D data
Andrew Ng
3D data
Andrew Ng
3D data
Andrew Ng
3D data
Andrew Ng
3D data
Andrew Ng
3D data
Andrew Ng
3D data
Andrew Ng
3D data
Andrew Ng
3D convolution
∗
3D filter
3D volume
Andrew Ng

Deeplearning - Ai Deeplearning - Ai

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Deeplearning - Ai Deeplearning - Ai

Uploaded by

Copyright:

Available Formats

Copyright Notice

These slides are distributed under the Creative Commons License.

For the rest of the details of the license, see https://creativecommons.org/licenses/by-sa/2.0/legalcode

[Courtesy of Baidu] Andrew Ng

Parameters of NN define an encoding ((" ) )

Anchor Positive Anchor Negative

Training set: 10k pictures of 1k persons

During training, if A,P,N are chosen randomly,

Choose triplets that’re “hard” to train on.

Face verification and

$ (%) f($ (%) ) ()

$ (') f($ (') )

What is neural style

Content Style Content Style

Generated image Generated image

What are deep

Pick a unit in layer 1. Find the nine

Layer 1 Layer 2 Layer 3 Layer 4 Layer 5

Layer 1 Layer 2 Layer 3 Layer 4 Layer 5

Layer 1 Layer 2 Layer 3 Layer 4 Layer 5

Layer 1 Layer 2 Layer 3 Layer 4 Layer 5

Layer 1 Layer 2 Layer 3 Layer 4 Layer 5

Layer 1 Layer 2 Layer 3 Layer 4 Layer 5

Layer 1 Layer 2 Layer 3 Layer 4 Layer 5

2. Use gradient descent to minimize %(')

[Gatys et al., 2015. A neural algorithm of artistic style] Andrew Ng

[Gatys et al., 2015. A neural algorithm of artistic style] Andrew Ng

Style cost function

Say you are using layer $’s activation to measure “style.”

How correlated are the activations

[Gatys et al., 2015. A neural algorithm of artistic style] Andrew Ng

[Gatys et al., 2015. A neural algorithm of artistic style] Andrew Ng

[Gatys et al., 2015. A neural algorithm of artistic style] Andrew Ng

[Gatys et al., 2015. A neural algorithm of artistic style] Andrew Ng

You might also like