You are on page 1of 4

Computer Vision Algorithms

The main problem of computer vision is that we have to


preprocess the image and turn it into numerical values

 grayscale images: each pixel typically consists of


8 bits (1 byte) for grayscale images

There are 256 possible grayscale colors

 color images: each pixel typically consists of


24 bits (3 byte) for color images (RGB components)

There are ~ 16 million colors


Computer Vision
Bootcamp
Python & OpenCV
Computer Vision Algorithms
Single-Shot Multibox Detector (SSD)
DATA AUGRMENTATION
Data augmentation is a common technique in machine learning to make sure the
model will be robust to different object sizes (and orientations etc.)

 it is also a good approach when the training dataset is too small

DATA AUGMENTATION GENERATES EXTRA TRAINING SAMPLES

 we can generate additional training examples with


patches of the original image at different IoU ratios

+ each image is randomly flipped horizontally with


50% probability to introduce translation invariance

(for example it does not matter whether the object


is on the right or the left side of th image)

You might also like