Professional Documents
Culture Documents
Rectification refers to a transformation process that re-projects both the left and right
images onto a common image plane parallel to the baseline as shown in the figure below
where image planes shown by dotted and bold lines refer to image planes before and after
rectification, respectively. In this case, the cameras are parallel, and consequently the
axes are parallel to the baseline. So, the corresponding epipolar lines would be horizontal
i.e., they have the same y-coordinate. This stereo imaging setup is called a standard or
canonical stereo setup.
The canonical stereo setup makes a stereo matching problem much easier. This is because
the search can be done along the horizontal line in the rectified images instead of
searching the entire image for finding the matching points. Rectification reduces the
dimensionality of the search space for matching points from two-dimensional space to
one-dimensional space. This can be done with the help of a 3×3 homography matrix H.
Mathematically, transforming the coordinates of the original image plane to a common
image plane can be written as follow
𝑈′ 𝑈
[ 𝑉′ ] = 𝐻 [ 𝑉 ]
𝑊′ 𝑊
Q2: Components of a Computer Vision System
Q3: Weak Perspective Projection and Orthographic Projection
In a human visual system, our eyes collapse a 3D world to a 2D retinal image, and the
brain has to reconstruct 3D information. In computer vision, this process occurs by
projection.
For this arrangement, the size of the image formed by the process of projection is given
by:
𝑦
𝑦𝑠 = d
𝑧
The optical center (center of projection) is put at the origin, and the image plane
(projection plane) is placed in front of the center of projection (COP) to avoid inverted
images. The camera looks down the negative z axis. From the similar triangles, it is
𝑥 𝑦
observed that the point (x, y, z) is mapped into (−d 𝑧 , − d 𝑧 , −d). The projection coordinate
𝑥 𝑦
on image is obtained by discarding the last coordinate, (x, y, z) → (−d , −d ). This is
𝑧 𝑧
known as perspective projection.
Let us now consider that the relative depths of points on the object are much smaller than
the average distance 𝑍𝑎𝑣 to COP. For each point on the object, the equation is given by
So, projection is reduced to uniform scaling for all the object point coordinates. This is
called weak persepective projection. In this case, the points at about the same depth are
considered, and each point is divided by the depth of its group.
Suppose, 𝑑 → ∞ in perspective projection model, Hence, for 𝑧 → −∞, the ratio −𝑑/𝑧 → 1
Therefore, the point (x, y, z) is mapped into (x, y). This is called Orthographic projection
or parallel projection.