Professional Documents
Culture Documents
(b)
Figure 1. Camera network in a hallway. camera and simultaneous estimation of point location and
calibration. They are also not concerned with external, but
only internal calibration.
1.2. Previous work Calibration of three views has also been accomplished
through the use of trilinear constraints [6, 13, 15] or quadri-
Previous calibration work has concentrated on the cali- linear constraints [6, 8, 15], but these tools are not well
bration of one camera with respect to a fixed reference cal- suited to the problem of calibrating many cameras at the
ibration pattern, such as in [4]. One notable paper is [9] same time. Spetsakis offers a solution in [14], but his al-
in which they ask if an accurate calibration pattern is re- gorithm uses only calibrated cameras, and additionally does
ally necessary. They answer their question in the negative not obtain the full projection matrices, even for calibrated
through the use of at least three or more views from the same cameras.
2.2. Input data
The fact that the cameras are synchronized and that the
lights can be turned off allows for many point correspon-
dences. A calibrator waves around a rod with an LED on
the end of it. The algorithm uses simple background sub-
traction and thresholding to extract the points. The corre-
spondences are then used in a large nonlinear eigenvalue
minimization routine, the basis of which is the epipolar con-
straint. The eigenvalue matrix encapsulates all the point cor-
respondences between every pair of cameras in a way that
minimizing the smallest eigenvalue results in the projection
matrices.
3. Mathematical preliminaries
= 4 0 A 0 5 4uv2 5
uv3
Interestingly, ground control points are not needed in order
find this H to within a conformal mapping. Only some sort 0 0 A
=[A] [uv ]s
of metric information must be specified. This metric infor- T
where [] is defined here.
mation can be supplied by a wand with lights on both ends.
The constant distance between the lights will calibrate the
entire system to within a scale factor. If the distance between Deriving another identity with v0 = Av results in:
the lights is measured, then the scale factor is specified as uv0T ]s =D[v0uT ]s
= D [A] [vu ]s
well. This technique has the added benefit of being highly T
redundant so that distances are equalized throughout the en-
= D [A] D [uv ]s
tire camera field of view. T
=[A] [uv ]s
T
3.3. Morphology operators
where [] = D[]D.
Morphology operators which operate on 3 3 matrices Given a vector v, the skew symmetric matrix associated
are necessary for the derivations. For any 3 3 matrix M , with it is defined as usual:
2 3
the stacking operator is defined as: 0 ?v3 v2
2 v = 4 v3
[ ] 0 ?v1 5
m1 3 ?v2 v1 0
[M ]s = 4m2 5
m3 Finally, note that
u
A , where A is any 3 3 matrix, results in the following 0 0 0
identity: These results will be used later in the paper.
4. Projective multiframe calibration where Aj;k = K T A0j;k K . Equation 1 just states that j ? T
T k is an eigenvector of Aj;k with zero eigenvalue.
4.1. Problem statement This condition in equation 1 will not hold in the presence
of noisy data, but the least eigenvector (eigenvector associ-
ated with the least eigenvalue) of Aj;k should be the trans-
The camera system projects N 3D points, represented by
lation between cameras j and k.
R1; R2; : : :; RN , onto M cameras, represented by projec-
tion matrices P1 ; P2; : : :; PM . A given point Ri is not nec-
This idea can be extended to be useful for many cameras.
essarily projected onto camera j , but if it is, the projection
Shown next is a technique for forming a matrix whose least
is ri;j = Pj Ri . If a particular point Ri does not project to
T
eigenvector is the concatenation of all the camera posi-
a camera j , then ri;j = [0 0 0]T for notational convenience.
T T T T
tions = [ T1 T2 : : : TM ?1 ]T , where it is assumed with-
The algorithm should output, given the points r, the matri-
T
out loss of generality that M = [0 0 0]T . To this end, form
ces Pj for all j . Since the camera system is calibrated using 2
A1;1+i
3
the LED method, it is assumed that there are many accurate 6
6 A2;2+i 7
7
point correspondences, so that minimality and robustness of 6 .. 7
6 . 7
data is not a concern. What is a concern is using all of the Ai =6
6 AM ?i;M
7
7
correspondences in an integrated minimization routine, so 6 7
that all of the cameras are calibrated at one time. 6 .. 7
4 . 5
AM;i
4.2. Solution framework
which is a block diagonal matrix containing the 3 3 epipo-
Given the set of points ri;j in cameras j , the algo- lar constraints for all the cameras which are i indices apart,
rithm searches for those matrices Bj?1 such that the ri;j =
and zeros elsewhere.
Given the 3M 3M shift matrix
B ?1 vecri;j
j
are effectively formed in cameras which are
2 3
only translations apart from each other. That is, all cameras 0 I3 0 ::: 0
have projection matrices of the form Pj = [I3 j i ]. The T 60
6 0 I3 : : : 07
7
6 .. 7
algorithm measures this with the epipolar constraint, assum- SM =6
6
..
.
..
. .77
ing that all the cameras are only translations apart from each
other. Thus the algorithm can be seen as putting all the cam-
40 : : : I35
0 0
r r
T Fj;k i;j 0 for all i. This can be factored as:
i;k where w is an unknown vector. Requiring BiT G00i to be skew
gives six linear conditions on w, which can be solved by
Fj;k = Bk?T [Tk ? Tj ] Bj?1 least squares to get Bi . The result is the B1 , B2 , and Bi ,
A single fundamental matrix is clearly not sufficient to ex- which are all consistent with our three fundamental matri-
tract any of the Bj ’s, but if it is only necessary to extract the ces. All the Bi can thus be used as initial conditions in the
Bj ’s to within a perspective transformation H , then all the nonlinear minimization.
Bj ’s can be obtained to within the same perspective trans-
formation using only fundamental matrices. 4.6. Obtaining the projection matrices
Since the B ’s only to within a perspective projection are
necessary, it is necessary to fix 12 parameters (3 more are Note that the nonlinear minimization is actually carried
fixed by setting TM = [0 0 0] and the last one is a scale out on the Bi . The Ti are extracted from the least eigenvec-
factor). Nine are fixed by setting B1 = I3 , and the three tor when the minimization is complete. The projection ma-
more parameters of B2 are fixed as follows. The fact that trices are then completed as:
B1 = I3 implies F1;2 = B2?1 [T2 ? T1 ] . It is noted here Pi = [Bi j Bi Ti ]
that the SVD of [T ] is
2 32
1 0 0 v1 T 3 4.7. Algorithm overview
T = ?v2 v1 T
[ ] 40 1 05 4v2 T 5
0 0 0 TT Input.
v v T
with 1 , 2 , mutually orthogonal. Now F1;2 can be de-
M The number of cameras
composed using the SVD as follows, N The number of points
h
0 0i v1TT r^i;j The corresponding points with 1 i M
F1;2 = U 01 2 0 v2 and 1
0 0 0 TT j N.
Output. M projection matrices Pi parameters for a rotation R are d T
= [d1d2 d3 ] , then the
g d
Soma parameters are = s[1 T ]T , with s a scale so that
Steps. g g
has length 1. Given unit , [1] shows that R can be ex-
pressed as
Find the initial matrices Bi using Section 4.5 R=
Using the Bi?1 as initial conditions, minimize the " 2
g0 + g12 ? g22 ? g32 2(g1 g2 ? g0 g3 ) 2(g1 g3 + g0 g2 )
#
smallest eigenvalue of C to obtain the optimal Bi and 2(g2 g1 + g0 g3 ) g0 ? g1 + g2 ? g3 2(g2 g3 ? g0 g1 )
2 2 2 2
References