Professional Documents
Culture Documents
Vertical Edge
DNN 0.2
Changes in Value
Beak 0.7
Sharp Turns
Angular Edge
ImageNet
• 1.2 M Training Data*
• 50K Validation Data*
• 100K Test Data*
• 1000 Classes*
ImageNet
0.1
Horizontal Edge Circular Edge Eye
0.2
Vertical Edge
DNN
0.01
Changes in Value
Beak
Sharp Turns
ImageNet
Horizontal Edge Circular Edge Eye 0.1
Vertical Edge
DNN 0.2
Changes in Value
Beak 0.7
Sharp Turns
Angular Edge
ImageNet
Horizontal Edge Circular Edge Eye 0.1
Vertical Edge
DNN 0.2
Changes in Value
Beak 0.7
Sharp Turns
Angular Edge
Gradient
Calculation Off
ImageNet
Transfer Learning
DNN Output
Encoder
Transfer Learning
DNN Output
Encoder
CNN Architectures
AlexNet
- Used ReLU
- Around 60 M Param
- Used 2 GTX 580
- (VRAM 6 GB Total)
- Overlapping Pooling
- Used Dropout
- Simplified Architecture
- Consists of 3x3 Conv ,2x2 MaxPool
- Resolution down by scale of 2, Channel up by scale of 2
18
- 22 layers
- (27 Including
Pooliing)
5 x 5 x 512 1 x 1 x 512
a[2]3
a[1]n
1x1 Convolutions can estimate the information across all the input channels and represent them in
a smaller or larger number of channels, depending on the filter size specified Link
1x1 Convolutions
1x1
Conv2D
(64)
254 x 254 x 64
(256x1)x64 = 16,384
5x5
Conv2D
(64)
254 x 254 x 32
(5x5x256) x 64 = 409,600
Bottleneck
1x1 5x5
Conv2D Conv2D
(64) (64)
2017 MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications
Link
Link
MobileNetV2