Professional Documents
Culture Documents
❑ Convolution (CONV)
A convolution operation is an element wise matrix
multiplication operation. Where one of the matrices is the
image, and the other is the filter or kernel that turns the
image into something else. The output of this is the final
convoluted image. A several example of filter such as,
sharpen, sobel filters and etc.
Convolution (CONV)
❑ Padding
Padding is refers to the amount of pixels added to an
image when it is being processed by the kernel of a
CNN. Have two kinds of padding, such as:
- Valid
- Same
Convolution (CONV)
❑ Stride
Stride is the number of pixels shifts
over the input matrix. When the stride
is 1 then we move the filters to 1 pixel
at a time. When the stride is 2 then we
move the filters to 2 pixels at a time
and so on
Convolution in 2D
Mathematically, it’s: (2 * 1) + (0 * 0) + (1 * 1) + (0 * 0) + (1
* 0) + (0 * 0) + (0 * 0) + (0 * 1) + (1 * 0) = 3
Before After
Convolutions over volume (Convolutions on RGB images)
27 parameters
* =
3x3x3
4x4
6x6x3
Multiple filters
Vertical edges
=
*
3x3x3 4x4
Horizontal edges
= 4x4x2
*
6x6x3
3x3x3 4x4
ONE LAYER OF A CONVOLUTIONAL NETWORK
" 𝑤 [1] 𝑥 [0] "
= Relu
*
+ 𝑏1 =
*
3x3x3 4x4 4x4
𝑤 [1]
4x4x2
6x6x3 * = Relu
* + 𝑏2 =
𝑥 [0]
3x3x3 4x4 4x4
https://www.youtube.com/watch?v=hxA0wxibv8g&list=PLNgy4gid0G9cbw5OjwG2jxvFqYDqkGnpJ&index=11
https://www.coursera.org/lecture/convolutional-neural-networks/one-layer-of-a-convolutional-network-nsiuW
ONE LAYER OF A CONVOLUTIONAL NETWORK
Polling (POOL)
The pooling layer serves to progressively reduce the spatial size of the
representation, to reduce the number of parameters and amount of
computation in the network, and hence to also control overfitting. The
intuition is that the exact location of a feature is less important than its
rough location relative to other features.
Types of Pooling
➢ Mean pooling Max Pooling
➢ Max pooling
➢ Sum pooling Hyper parameter:
F=2x2
S=2
Flattening
https://poloclub.github.io/cnn-explainer/
Glossary
1.AlexNet
2.VGGNet
3.GoogLeNet
4.ResNet
5.etc,.
Computer Vision Problem: image classification, object
detection and segmentation
Session 1
(Image Classification)
Image classification
Image Classification is a
fundamental task that
attempts to comprehend an
entire image as a whole. The
goal is to classify the image by
assigning it to a specific label.
https://developers.google.com/machine-learning/practica/image-classification/convolutional-neural-networks
Practice
Click here !
https://drive.google.com/drive/folders/19STUv4SwltwhnFPjqh8wTFXn9sJT2PUp?usp=sharing
Practice
Practice (Mounting dataset)
1. Click here
2. Define the
repository
name
3. Check Box
4. Click here
Practice (Save your code in GitHub)
Go to Google colab
5. Click here
Practice (Save your code in GitHub)
6. Select
repository name
Practice (Submit your code to GitHub classroom)
*Accept the assignment following the link share by TA
https://classroom.github.com/a/pC0raJi_
Practice (Submit your code to GitHub classroom)
3. Select
repository
4. Click here
2. Import
code
Practice (Submit your code to GitHub classroom)
Click here
Session 2
(Object Detection and Instance segmentation)
To be continued…
Thank You
Contact person:
GitHub: anto112
Email = m07158031@o365.mcut.edu.tw
Line-Id = Haryanto_96