www.elsevier.com/locate/npe
Abstract
We show that B-model topological strings on local Calabi–Yau threefolds are large-N duals of
matrix models, which in the planar limit naturally give rise to special geometry. These matrix
models directly compute F-terms in an associated N = 1 supersymmetric gauge theory, obtained by
deforming N = 2 theories by a superpotential term that can be directly identified with the potential
of the matrix model. Moreover by tuning some of the parameters of the geometry in a double scaling
limit we recover (p, q) conformal minimal models coupled to 2d gravity, thereby relating non-critical
string theories to type II superstrings on Calabi–Yau backgrounds.
© 2002 Published by Elsevier Science B.V.
1. Introduction
Large-N limits of U (N) gauge theories have been a source of inspiration in physics,
ever since ’t Hooft introduced the idea [1]. In particular the large-N limit of gauge theories
should be equivalent to some kind of closed string theory. The first contact this idea had
with string theory was in the context of non-critical bosonic strings described by $c \le 1$
conformal field theories coupled to two-dimensional gravity. It was found that by taking a
“double scaling limit” of N × N matrix models, where one sends N to infinity while at the
same time going to some critical point, one ends up with non-critical bosonic strings [2,3].
This relation between gauge systems and strings was not exactly in the sense that ’t Hooft
originally suggested for the large-N expansion, as it involved a double scaling limit. In
particular, before taking this limit the matrix model would not have a string dual, whereas
according to ’t Hooft’s general idea one would have expected it to have.
In one context this was remedied by a different kind of matrix model introduced by
Kontsevich [4], where without taking a double scaling limit one finds an equivalence
between a matrix model and non-critical string theory. In particular the amplitudes of
the topological string observables introduced in [5] are directly computed by these matrix
integrals. This duality was in the same spirit as ’t Hooft’s general idea and can be seen as a
low-dimensional example of a holographic correspondence.
More recently large-N dualities have come back in various forms. In the context of
M-theory a large-N matrix formulation was advanced [6]. Since here an unconventional
large-N limit is involved, this again was not quite in the same spirit as ’t Hooft’s idea,
much as the double scaling limit of matrix models in the context of non-critical strings is
not. However, the AdS/CFT correspondence [7] is in the same spirit as ’t Hooft’s original
proposal in that one did not have to take a particular limit to obtain an equivalence.
Another example of such a strict large-N duality is the relation between Chern–Simons
gauge theory and A-model topological strings [8] where one also does not have to take any
particular limit for the equivalence to hold.
The main aim of this paper is to develop a mirror version of this last duality [8]. We
find matrix models that are dual to B-model topological strings on Calabi–Yau threefolds.
This is again in the same spirit as ’t Hooft, as one does not take a double scaling limit. We
will show in particular that the special geometry of Calabi–Yau threefolds that solves the
B-model at tree level emerges naturally from the dynamics of the eigenvalues of the matrix
model.
However, even though it is not required, one can also consider a double scaling limit
of this setup and obtain a specific class of Calabi–Yau manifolds that are dual to
double-scaled matrix models. In this sense we are enlarging the original equivalence of double
scaling limits of matrix models with string theory to an equivalence of all matrix models
with some kind of closed string theory, without any need to take a double scaling limit.
In particular our result shows that studying strings on non-compact Calabi–Yau spaces
provides a unifying approach to all matrix model descriptions.
Furthermore, it turns out that one can embed these large-N dualities in the context of
type IIB superstrings [9], a relation that was further explored in [10–15]. In the context of
this embedding one obtains a dictionary in which the planar limit of the matrix models is
seen to compute superpotentials for certain N = 1 supersymmetric gauge theories, where
the potential of the matrix model gets mapped to the superpotential for an adjoint scalar of
an N = 1 theory.
The organization of this paper is as follows: in Section 2 we propose the large-N
conjecture for topological strings with matrix models after reviewing the various geometrical
ingredients. In Section 3 we pass this conjecture through some highly non-trivial checks,
and in particular check it at the planar limit. In Section 4 we discuss some generalizations
of this conjecture and its connections with non-critical bosonic strings coupled to gravity
and the double scaling limit. We also discuss the meaning of the double scaling limit from
the viewpoint of type IIB superstrings.
R. Dijkgraaf, C. Vafa / Nuclear Physics B 644 (2002) 3–20 5
Instead of being general we consider a special class of such transitions, studied in [10],
and discuss its topological lift. The situation considered in [10] involved a string theory
realization of N = 2 supersymmetric U (N) gauge theory, deformed to N = 1 theory by
addition of a tree-level superpotential Tr W (Φ), which we take to be a general polynomial
of degree n + 1 of the adjoint field Φ. The Calabi–Yau geometry relevant for this was
studied in [21] and corresponds to considering the blowup of the local threefold given as a
hypersurface
$$uv + y^2 + W'(x)^2 = 0. \tag{2.1}$$
The blowup takes place at the critical points of $W$, i.e., at $W'(x) = 0$. Such transitions lead
to a geometry that contains $n$ blown-up $\mathbf{P}^1$'s, which are all in the same homology class. So
in the resolved geometry we can find $n$ isolated rational curves.
More precisely, the resolved singularity can be obtained by starting with the bundle
$\mathcal{O}(0) \oplus \mathcal{O}(-2)$ over $\mathbf{P}^1$. This is the normal bundle to a rational curve in K3 × C and
corresponds to N = 2 supersymmetry. Let us denote the sections of the normal bundle
by $\phi_0$, $\phi_1$. These sections are respectively 0-forms and 1-forms on the $\mathbf{P}^1$. The field $\phi_0$
corresponds to the adjoint-valued Higgs field Φ in the N = 2 SYM theory.
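Now the inclusion of the superpotential $W$ should give $n$ isolated $\mathbf{P}^1$'s at the critical
points $W'(\phi_0) = 0$. This is achieved by the following transition function. If $z$ and $z'$ are
the coordinates on the northern and southern hemispheres of the $\mathbf{P}^1$, then the resolution is
given by relating the patches $(z, \phi_0, \phi_1)$ and $(z', \phi_0', \phi_1')$ by
$$z' = 1/z, \qquad \phi_0' = \phi_0, \qquad \phi_1' = z^2 \phi_1 + W'(\phi_0)\, z. \tag{2.2}$$
The map to the singular geometry (2.1) is given by
$$x = \phi_0, \quad u = 2\phi_1', \quad v = 2\phi_1, \quad \omega = z'\phi_1', \quad y = i\bigl(2\omega - W'(\phi_0)\bigr). \tag{2.3}$$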
Here we have introduced another variable ω in terms of which the geometry would have
been given by
Here one should be careful in regulating the periods over the non-compact B cycles [10].
The degree of the deformation f (x), which corresponds to normalizable deformations, is
such that these periods can be sensibly defined. In particular their variations with respect
to the variation of the coefficients of f (x) are cutoff independent.
As explained in [10] the A and B cycles can be represented as $S^2$ fibrations over paths
in the complex x-plane. After integrating the 3-form over these $S^2$ fibers we are left with
the integrals over these curves of the 1-form
η = y dx.
Here y is determined by the hyperelliptic curve
We now consider lifting these dualities to topological strings. As discussed in [9] the
key point is the observation in [22] that topological strings compute superpotential terms
of the corresponding gauge theory arising in type II superstrings. In particular the leading
planar diagram computes superpotential terms involving the gaugino bilinear field on the
gauge theory side, and the N = 2 prepotential on the dual gravity side (with some vev’s
for auxiliary fields, corresponding to turning on fluxes). More generally the topological
gravity theory in the presence of a Calabi–Yau threefold is the B-model theory studied in [22] and
called “Kodaira–Spencer theory of gravity”. Thus all we need to do is to specify the gauge
theory dual, which should be the gauge theory on the topological branes wrapping the $\mathbf{P}^1$'s.
This theory is the reduction of the holomorphic Chern–Simons theory studied in [23] from
complex dimension three to complex dimension one, which we now turn to.
First suppose that we have no superpotential, i.e., $W(x) = 0$. In this case we simply
get the $A_1$ geometry times the x-plane. We wrap N branes around the $\mathbf{P}^1$. In this case the
normal directions to the $\mathbf{P}^1$ correspond to the cotangent bundle and the trivial bundle C
associated to x. Let us call the Higgs fields in these two directions respectively $\Phi_1(z)$ for the
cotangent direction and $\Phi_0(z)$ for the x-direction. The topological theory on the B-brane
we obtain in this case is given by the action
$$S = \frac{1}{g_s} \int_{\mathbf{P}^1} \mathrm{Tr}\bigl(\Phi_1 \bar{D}_A \Phi_0\bigr),$$
where $\Phi_1(z)$ is a U(N) adjoint-valued (1, 0) form on $\mathbf{P}^1$, $\Phi_0(z)$ is an adjoint-valued scalar,
$A(z)$ is a U(N) holomorphic (0, 1) form connection on $\mathbf{P}^1$, and $\bar{D}_A = \bar\partial + [A, \cdot\,]$. Here $g_s$
denotes the string coupling constant. If we turn on the Higgs fields, thereby deforming the
$\mathbf{P}^1$ to a non-holomorphic curve C, this action computes the integral
$$S = \frac{1}{g_s} \int_Y \Omega$$
where $W(\Phi_0)$ corresponds to the superpotential and ω is a (1, 1) form which can be
normalized to have unit volume on $\mathbf{P}^1$.
A consistency check for this action is to note that the equations of motion following
from the above action agree with the fact that the classical solutions correspond to
holomorphic curves. In particular integrating out A gives
[Φ0 , Φ1 ] = 0,
so that Φ0 and Φ1 commute, i.e., we can assume they are simultaneously diagonal.
Variation with respect to the eigenvalues of $\Phi_1$ leads to
$$\bar\partial \Phi_0 = 0,$$
which together with compactness of $\mathbf{P}^1$ implies that $\Phi_0$ is a constant. Variation with respect
to $\Phi_0$ gives
$$\bar\partial \Phi_1 = W'(\Phi_0)\,\omega,$$
which, together with the fact that the integral of $\bar\partial \Phi_1$ over $\mathbf{P}^1$ has to be zero for non-singular
$\Phi_1$, leads to
$$W'(\Phi_0) = 0 = \Phi_1.$$
Thus the classical vacua indeed are localized at points $W'(\Phi_0) = 0$, which describe the
positions of the $n$ $\mathbf{P}^1$'s.
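The three variations quoted above can be collected in one place. The following is only a sketch: it assumes that the action (2.8) has the form shown in the first line, which is inferred from the equations of motion stated in the text, and it does not track signs coming from the ordering of forms.

```latex
\begin{aligned}
S &= \frac{1}{g_s}\int_{\mathbf{P}^1} \mathrm{Tr}\bigl(\Phi_1 \bar{D}_A \Phi_0 + W(\Phi_0)\,\omega\bigr),
   \qquad \bar{D}_A = \bar\partial + [A,\cdot\,] \quad \text{(assumed form of (2.8))} \\
\delta A:&\quad \mathrm{Tr}\bigl(\Phi_1 [\delta A, \Phi_0]\bigr) = \mathrm{Tr}\bigl(\delta A\,[\Phi_0,\Phi_1]\bigr)
   \;\Longrightarrow\; [\Phi_0,\Phi_1] = 0, \\
\delta \Phi_1:&\quad \bar\partial \Phi_0 = 0, \\
\delta \Phi_0:&\quad \mathrm{Tr}\bigl(\Phi_1 \bar\partial\,\delta\Phi_0\bigr) \simeq -\mathrm{Tr}\bigl(\bar\partial\Phi_1\,\delta\Phi_0\bigr)
   \;\Longrightarrow\; \bar\partial \Phi_1 = W'(\Phi_0)\,\omega, \\
\int_{\mathbf{P}^1}:&\quad 0 = \int_{\mathbf{P}^1} \bar\partial \Phi_1 = W'(\Phi_0) \int_{\mathbf{P}^1} \omega = W'(\Phi_0).
\end{aligned}
```

The last line uses that $\Phi_0$ is constant and that ω has unit volume, so the vacua sit at the $n$ roots of $W'$.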
In fact the action (2.8) and the resulting quantum theory are rather trivial. In particular $\Phi_1$
also appears linearly and can be integrated out exactly, leading to the constraint $\bar\partial \Phi_0 = 0$,
which leads to the statement that $\Phi_0$ is a constant N × N matrix
$$\Phi_0(z) = \Phi = \text{const}.$$
Thus the full action just reduces to its last potential term, which after integration of ω over
$\mathbf{P}^1$ leads to the matrix action
$$S_W(\Phi) = \frac{1}{g_s}\, \mathrm{Tr}\, W(\Phi). \tag{2.9}$$
Thus we see that the partition function of the gauge system is equivalent to a simple matrix
model!
We can give another derivation for the action (2.8) starting directly from the patching
functions (2.2) that determine the blown-up geometry. We split the $\mathbf{P}^1$ into two hemispheres
connected by a long cylinder. We denote the fields on these two patches as $\Phi_0$, $\Phi_1$
and $\Phi_0'$, $\Phi_1'$ respectively. In the case $W = 0$ we are simply dealing with a gauged chiral CFT
given by an adjoint-valued β–γ system of spin (1, 0). The partition function computes
the number of holomorphic blocks and this is one on the two-sphere. (Here we are ignoring
for a moment the factor coming from the volume of U(N).) In an operator formalism this
partition function is simply given by pairing the left and right vacuum
$$Z = \langle 0 | 0 \rangle = 1.$$
Now we want to implement the deformation induced by $W$. From the transition function
(2.2) we see that the fields are related in the following way (here we write the fields in
coordinates on the cylinder so that factors of $z$ and $z'$ are absorbed)
$$\Phi_1' = \Phi_1 + W'(\Phi_0).$$
Now there is an obvious operator U that implements this transformation on the Hilbert
space. If we define
$$U = \exp\Bigl(\frac{1}{g_s} \oint \mathrm{Tr}\, W\bigl(\Phi_0(z)\bigr)\, dz\Bigr), \tag{2.10}$$
(recall that the operator $\Phi_0(z)$ is a holomorphic field, so the contour does not matter as
long as it encircles the poles) then one easily verifies that
$$\Phi_1' = U\, \Phi_1\, U^{-1}.$$
Here one uses the fact that the fields $\Phi_0$ and $\Phi_1$ are canonically conjugate,
$$\Phi_0(z)\, \Phi_1(w) \sim \frac{g_s}{z - w}.$$
Therefore in a Hamiltonian formalism the partition function should be given by inserting
the transformation U between the left and right vacua
$$Z = \langle 0 | U | 0 \rangle.$$
This is the familiar way to implement changes in complex structure in the operator
formalism of conformal field theory.
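The claim that U implements the shift of $\Phi_1$ can be checked at first order from the operator product. This is a sketch under one stated assumption: the contour measure in (2.10) is taken to be normalized so that $\oint_w dz/(z-w) = 1$, i.e. a factor of $2\pi i$ is absorbed into $dz$.

```latex
\begin{aligned}
\Bigl[\oint dz\, \mathrm{Tr}\, W\bigl(\Phi_0(z)\bigr),\ \Phi_1(w)\Bigr]
  &= \oint_{w} dz\, \mathrm{Tr}\, W\bigl(\Phi_0(z)\bigr)\, \Phi_1(w)
   = \oint_{w} dz\, \frac{g_s\, W'\bigl(\Phi_0(w)\bigr)}{z - w}
   = g_s\, W'\bigl(\Phi_0(w)\bigr), \\
U\, \Phi_1\, U^{-1}
  &= \Phi_1 + \frac{1}{g_s}\Bigl[\oint \mathrm{Tr}\, W(\Phi_0),\ \Phi_1\Bigr] + \cdots
   = \Phi_1 + W'(\Phi_0).
\end{aligned}
```

The nested commutators indicated by the dots vanish, because $W'(\Phi_0)$ has no singular product with $\Phi_0$ itself.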
In Eq. (2.10) we have written the deformation of the action in terms of the contour
integration or Wilson line $\oint \mathrm{Tr}\, W(\Phi_0)$. Alternatively, this can be written as a surface
integral
$$U = \exp\Bigl(\frac{1}{g_s} \int_{\mathbf{P}^1} \mathrm{Tr}\, W(\Phi_0)\, \omega\Bigr),$$
where the volume form ω has been localized to a band along the equator of the $\mathbf{P}^1$. But, as
noted above, one can take any 2-form, as long as it integrates to 1 over the sphere.
We are now ready to state our conjecture in precise terms. Let $F_{W,f}(g_s)$ denote the
partition function of the topological B-model for the Calabi–Yau manifold given by
$$uv + y^2 + W'(x)^2 + f(x) = 0,$$
where $W$ is a fixed polynomial of degree $n + 1$ in x and $f$ is a polynomial of degree $n - 1$
in x. Let $S_i$ denote the integral of the holomorphic 3-form over the ith $S^3$ coming from
the ith critical point of $W$. The periods $S_i$ will vary as we vary the n coefficients of $f$.
Inverting this map, given the variables $S_i$ we can find the coefficients of the polynomial
$f(x)$ compatible with these periods.
On the gauge theory side we now consider the matrix model given by the action
$$S_W(\Phi) = \frac{1}{g_s}\, \mathrm{Tr}\, W(\Phi).$$
We expand this matrix model near the classical vacuum given by partitioning
$$N = N_1 + \cdots + N_n,$$
and by putting $N_i$ eigenvalues of Φ at the ith critical point of $W$. (Here we use both
stable and unstable critical points. In fact, since we work in the holomorphic context, this
difference does not really make sense.) Let $F_{W,N_i}(g_s)$ denote the free energy of this matrix
model expanded near this classical vacuum. Then the claim is
This comes from the fact that for this vacuum U (N1 ) × · · · × U (Nn ) denotes the unbroken
gauge group and we have to mod out by the corresponding volume of the constant gauge
transformations. This piece gives, as discussed in [16], the partition function of c = 1 at
self-dual radius. In particular the genus 0 answer will involve
$$F_0 = \frac{1}{2} \sum_i S_i^2 \log S_i.$$
Upon embedding in the type IIB superstring this leads to the gaugino superpotential
$W_{\mathrm{eff}} = \sum_i \bigl(N_i\, \partial F_0/\partial S_i + \alpha S_i\bigr)$. (Note that within the type IIB context the parameters $S_i$ and $N_i$
are independent.) This is a first check on our conjecture. We are now ready to test the
above large-N conjecture in more detail.
We will first show how our conjecture can be proven in the planar limit using standard
manipulations in matrix model technology. A useful reference is for example [3]. We will
see how the special geometry of Calabi–Yau’s emerges naturally.
appear from the Jacobian picked up by the diagonalization process. After exponentiating
this contribution the effective action for the eigenvalues is given by
$$S(\lambda) = \frac{1}{g_s} \sum_i W(\lambda_i) - 2 \sum_{i<j} \log(\lambda_i - \lambda_j). \tag{3.1}$$
We will now take the limit N → ∞ of this system while keeping fixed the ’t Hooft
coupling
µ = gs N.
In this standard large-N limit we will have a continuum of eigenvalues and their density
$$\rho(\lambda) = \frac{1}{N} \sum_i \delta(\lambda - \lambda_i)$$
becomes a continuous function on the real axis normalized to $\int \rho(\lambda)\, d\lambda = 1$. (In the
following λ will always denote a real variable, in contrast with the variable x that can be
complex.) The eigenvalues will fill a domain on the real axis. This domain might consist of
several disconnected components known as cuts. In the case of more than one component
one speaks of a multi-cut solution. In the present case there are at most n of these cuts. We
will denote the corresponding intervals in the complex x-plane as $A_i$, $i = 1, \ldots, n$.
To further analyze the model it will be convenient (and standard practice) to introduce
the trace of the resolvent of the matrix Φ
$$\omega(x) = \frac{1}{N}\, \mathrm{Tr}\, \frac{1}{\Phi - x} = \frac{1}{N} \sum_i \frac{1}{\lambda_i - x}, \qquad x \in \mathbf{C},\ x \neq \lambda_i.$$
This resolvent plays a crucial role in matrix model technology. It also has an interesting
physical interpretation. For example, it can be thought of as a loop operator.
By multiplying the equation of motion (3.2) by the factor $1/(\lambda_i - x)$ and summing over
i one obtains the important relation (loop equation) [3]
$$\omega^2(x) - \frac{1}{N}\, \omega'(x) + \frac{1}{\mu}\, \omega(x)\, W'(x) + \frac{1}{4\mu^2}\, f(x) = 0, \tag{3.3}$$
where the polynomial $f(x)$ is of degree $n - 1$ and is given by
$$f(x) = \frac{4\mu}{N} \sum_i \frac{W'(x) - W'(\lambda_i)}{x - \lambda_i}. \tag{3.4}$$
In some sense the function $f(x)$ determines through (3.3) the whole solution of the matrix
integral. Since it is a polynomial of degree $n - 1$ we only have to determine the n unknown
coefficients.
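The loop equation (3.3) holds exactly at finite N for any solution of the saddle-point condition $W'(\lambda_i) = 2 g_s \sum_{j \neq i} 1/(\lambda_i - \lambda_j)$ that follows from (3.1), so it can be tested numerically. The sketch below is illustrative only: the convex quartic potential, the damped Newton solver and all function names are our own assumptions, not part of the text.

```python
import numpy as np
from numpy.polynomial import polynomial as P
from numpy.polynomial.hermite import hermgauss


def gradient(lam, u, g_s):
    """W'(lam_i) - 2 g_s sum_{j != i} 1/(lam_i - lam_j); zero at a saddle of (3.1)."""
    diff = lam[:, None] - lam[None, :]
    np.fill_diagonal(diff, np.inf)  # removes the j == i term
    return P.polyval(lam, P.polyder(u)) - 2.0 * g_s * np.sum(1.0 / diff, axis=1)


def solve_saddle(u, g_s, n_eig, steps=200, tol=1e-12):
    """Damped Newton iteration; assumes W is convex, so there is a single real cut."""
    # Start from the exactly known saddle of W = x^2 (scaled Hermite zeros).
    lam = np.sqrt(g_s) * hermgauss(n_eig)[0]
    for _ in range(steps):
        g = gradient(lam, u, g_s)
        if np.max(np.abs(g)) < tol:
            break
        diff = lam[:, None] - lam[None, :]
        np.fill_diagonal(diff, np.inf)
        inv2 = 1.0 / diff**2
        jac = -2.0 * g_s * inv2  # off-diagonal entries d g_i / d lam_j
        jac[np.diag_indices(n_eig)] = (
            P.polyval(lam, P.polyder(u, 2)) + 2.0 * g_s * inv2.sum(axis=1)
        )
        step = np.linalg.solve(jac, g)
        t = 1.0
        while t > 1e-12:  # backtrack: keep the ordering and reduce the residual
            trial = lam - t * step
            if np.all(np.diff(trial) > 0) and (
                np.linalg.norm(gradient(trial, u, g_s)) < np.linalg.norm(g)
            ):
                lam = trial
                break
            t *= 0.5
    return lam


def loop_residual(x, lam, u, g_s):
    """Left-hand side of (3.3) at a complex point x, with f(x) taken from (3.4)."""
    n_eig = lam.size
    mu = g_s * n_eig
    d1 = P.polyder(u)
    omega = np.mean(1.0 / (lam - x))
    d_omega = np.mean(1.0 / (lam - x) ** 2)  # derivative of omega with respect to x
    f = 4.0 * mu * np.mean((P.polyval(x, d1) - P.polyval(lam, d1)) / (x - lam))
    return omega**2 - d_omega / n_eig + omega * P.polyval(x, d1) / mu + f / (4.0 * mu**2)


# Illustrative choice: W(x) = x^2 + x^4 / 2, given by coefficients u_k of x^k.
u_example = [0.0, 0.0, 1.0, 0.0, 0.5]
lam_example = solve_saddle(u_example, g_s=0.05, n_eig=12)
print(abs(loop_residual(0.3 + 0.7j, lam_example, u_example, 0.05)))
```

The printed residual should be at the level of rounding error. Only the $1/N$ term distinguishes (3.3) from the algebraic equation that governs the planar limit.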
In the large-N limit the second term in (3.3) can be ignored and the differential equation
for ω(x) becomes an algebraic equation
$$\omega^2(x) + \frac{1}{\mu}\, \omega(x)\, W'(x) + \frac{1}{4\mu^2}\, f(x) = 0. \tag{3.5}$$
From this we see that the resolvent ω(x) in general has a piece that can have branch cuts.
This singular part is captured by the function y(x) that we define here as
$$y(x) = 2\mu\, \omega(x) + W'(x) = 2 g_s \sum_i \frac{1}{\lambda_i - x} + W'(x). \tag{3.6}$$
In terms of the variables (x, y) the relation (3.5) associated to the matrix model now takes
the form
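The algebra behind this change of variables is short enough to spell out. The following worked step uses only the large-N relation (3.5) and the definition (3.6); the sign convention of the resulting curve is derived here rather than quoted.

```latex
\begin{aligned}
y^2 &= \bigl(2\mu\,\omega + W'\bigr)^2
     = 4\mu^2\Bigl(\omega^2 + \frac{1}{\mu}\,\omega\, W'\Bigr) + W'^2
     = -f(x) + W'(x)^2 \\
&\Longrightarrow\quad y^2 - W'(x)^2 + f(x) = 0 .
\end{aligned}
```

This is a hyperelliptic curve in the (x, y) plane, and the branch cuts of y(x) are the cuts along which the eigenvalues are distributed.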
We now want to evaluate the matrix integral around a particular stationary point where
particular fractions of the eigenvalues cluster around the different critical points. Around
such a multi-cut configuration it makes sense to make a perturbative expansion of the
matrix integral using large-N techniques. It is the contribution from one of these saddle
points that we are after.
Consider such a multi-cut solution. The filling fractions $N_i/N$, i.e., the relative number
of eigenvalues around each critical point, are given by the integrals
$$\frac{N_i}{N} = \int_{A_i} \rho(\lambda)\, d\lambda.$$
Therefore we can compute the fraction of eigenvalues in a specific cut by doing a contour
integral around the cut. In this way we find that the quantities $S_i = g_s N_i$ are given by the
period integrals around the cut, that is the periods on the Riemann surface
$$S_i = \frac{1}{2\pi i} \oint_{A_i} y(x)\, dx. \tag{3.10}$$
Here we make contact with the period integrals (2.6) as obtained in the topological
B-model computation. This is the first half of the derivation of the genus zero part of our
conjecture.
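Counting the eigenvalues in a cut by a contour integral can be illustrated directly at finite N. The sketch below uses the resolvent $\omega(x) = \frac{1}{N}\sum_i 1/(\lambda_i - x)$: for a counterclockwise contour each enclosed eigenvalue contributes $-2\pi i/N$, so $N_i = -\frac{N}{2\pi i}\oint \omega(x)\,dx$. The function name, the circular contour and the sample eigenvalues are hypothetical choices made for illustration.

```python
import numpy as np


def eigenvalues_inside(lam, center, radius, n_points=2000):
    """Count eigenvalues inside a circle via -(N / 2 pi i) * contour integral of omega."""
    lam = np.asarray(lam, dtype=float)
    theta = 2.0 * np.pi * np.arange(n_points) / n_points
    x = center + radius * np.exp(1j * theta)  # counterclockwise contour
    dx = 1j * radius * np.exp(1j * theta) * (2.0 * np.pi / n_points)
    # Resolvent omega(x) evaluated at every point of the contour.
    omega = np.mean(1.0 / (lam[None, :] - x[:, None]), axis=1)
    integral = np.sum(omega * dx)  # trapezoid rule on a closed contour
    return (-lam.size * integral / (2j * np.pi)).real


# Two hypothetical clusters of eigenvalues, standing in for two cuts.
sample = [-1.2, -1.0, -0.9, 0.8, 1.0, 1.1, 1.3]
print(eigenvalues_inside(sample, center=-1.0, radius=0.6))
print(eigenvalues_inside(sample, center=1.05, radius=0.6))
```

Since y(x) differs from $2\mu\,\omega(x)$ only by the polynomial $W'(x)$, which integrates to zero around a closed contour, the same integral of y(x) is proportional to $g_s N_i$; the overall sign and factor depend on the orientation and normalization conventions.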
In order to complete the derivation, we now need to compute the change in the free
energy $F_0(S_i)$ if we vary the filling fractions $S_i$ by adding an eigenvalue to the cuts
$$\frac{1}{g_s}\, \delta S_i = \delta N_i.$$
This change in the action is given by the work done by the force F(x) acting on an
eigenvalue if we move this eigenvalue from one branch to infinity. We have seen in (3.8)
that this force is given by
$$F(x) = \frac{1}{g_s}\, y(x).$$
So the variation of the free energy is computed in terms of the action $\int F(x)\, dx$, that is by
integrating the one-form y(x) dx along one of the B-cycles to the cut-off point x = Λ. We
therefore immediately find the special geometry relation
$$\frac{\partial F_0}{\partial S_i} = \int_{B_i} y(x)\, dx \tag{3.11}$$
expressing the B-periods in terms of the A-periods through the free energy $F_0(S)$. Together
(3.10) and (3.11) give the precise match with the Calabi–Yau geometry. This concludes
the derivation of the planar version of our conjecture relating the matrix model to the
topological B-model on the deformed Calabi–Yau.
As we have mentioned, for a given potential W(x) the function f(x) that deforms the
singular Calabi–Yau and determines the solution of the matrix model can be expressed in
terms of the periods $S_i$ and vice versa. In fact, using definition (3.4) we can give a useful
relation in terms of the prepotential $F_0$. If one parametrizes the potential as
$$W(x) = \sum_{k=0}^{n+1} u_k x^k,$$
and writes
$$f(x) = \sum_{k=0}^{n-1} 4\mu\, b_k x^k,$$
then
$$b_k = (k+2)\, u_{k+2} + \sum_{j=k+2}^{n+1} j\, u_j\, \frac{\partial F_0(u_i, S_i)}{\partial u_{j-k-2}}.$$
Note that if we introduce Virasoro operators $L_k = \sum_j j\, u_j\, \partial/\partial u_{j+k}$ this equation can be
written as [24]
$$b_k = (k+2)\, u_{k+2} + L_{-k-2} F_0, \qquad k = 0, \ldots, n-1.$$
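The polynomial structure of f(x) can be made explicit at finite N. Expanding (3.4) with $W'(x) = \sum_j j\,u_j x^{j-1}$ gives $f(x) = 4\mu \sum_k x^k \sum_{j \ge k+2} j\,u_j M_{j-k-2}$, where $M_p = \frac{1}{N}\sum_i \lambda_i^p$ are the eigenvalue moments; the $p = 0$ moment supplies the term $(k+2)u_{k+2}$. The sketch below checks this identity numerically. It does not test the relation between the moments and derivatives of $F_0$, and the function names and sample values are hypothetical.

```python
import numpy as np
from numpy.polynomial import polynomial as P


def f_direct(x, lam, u, mu):
    """f(x) from (3.4): (4 mu / N) sum_i (W'(x) - W'(lam_i)) / (x - lam_i)."""
    lam = np.asarray(lam, dtype=float)
    d1 = P.polyder(u)
    return 4.0 * mu * np.mean((P.polyval(x, d1) - P.polyval(lam, d1)) / (x - lam))


def f_coefficients(lam, u, mu):
    """Coefficients 4 mu b_k of x^k, with b_k = sum_{j >= k+2} j u_j M_{j-k-2}."""
    lam = np.asarray(lam, dtype=float)
    n = len(u) - 2  # W has degree n + 1, so f has degree n - 1
    b = np.zeros(n)
    for k in range(n):
        for j in range(k + 2, n + 2):
            b[k] += j * u[j] * np.mean(lam ** (j - k - 2))  # moment M_{j-k-2}
    return 4.0 * mu * b


# Arbitrary illustrative data: a quartic W (n = 3) and four eigenvalues.
u_demo = [0.3, -1.0, 0.5, 0.25, 0.1]
lam_demo = [-0.8, -0.1, 0.4, 1.3]
coeffs = f_coefficients(lam_demo, u_demo, mu=0.7)
print(f_direct(0.37, lam_demo, u_demo, 0.7), P.polyval(0.37, coeffs))
```

The two printed numbers agree to rounding error for any choice of eigenvalues, which shows that f(x) has exactly n coefficients for a potential of degree n + 1.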
Can one extend the derivation of our conjecture to higher genera? One possibility might
be to use the loop equations that are derived by taking the expectation value of expression
(3.3) for the resolvent ω(x)
$$\bigl\langle \omega^2(x) \bigr\rangle + \frac{1}{\mu}\, \bigl\langle \omega(x) \bigr\rangle\, W'(x) + \frac{1}{4\mu^2}\, f(x) = 0,$$
where f (x) is now given by the expectation value of expression (3.4). These loop equations
have been studied for multi-cut solutions, for example in [25], and one can try to solve them
order by order in the string coupling constant gs . In principle these equations give recursion
relations that relate the higher genus amplitudes in terms of the tree-level free energy. It
would be very interesting to see if these equations are directly related to the equations of
motion for Kodaira–Spencer string field theory [22]. This is not completely unlikely since
collective field theories for the eigenvalue densities are known to have a similar form [26],
and morally the connection between topological strings and matrix models should go along
these lines.
It might be interesting to also briefly discuss the case of the pure conifold, here given by
the quadratic superpotential $W(x) = x^2$. The large-N dual is from our point of view just
the Gaussian matrix model
$$Z = \frac{1}{\mathrm{Vol}(U(N))} \int d\Phi\; e^{-\frac{1}{g_s} \mathrm{Tr}\, \Phi^2}.$$
Since the integral is trivial, the only contribution comes from the normalization factor
$$\frac{1}{\mathrm{Vol}(U(N))} \sim \prod_{k=1}^{N-1} k!$$
which has been shown to reproduce the all genus answer for the B-model on the conifold
in the 1/N expansion [16]. In this case the spectral curve is given by
$$y^2 - x^2 + \mu = 0$$
and the eigenvalue density
$$\rho(\lambda) \sim \sqrt{\mu - \lambda^2}$$
is Wigner’s famous semi-circle distribution. In the eigenvalue basis the all genus answer is
alternatively obtained by the method of orthogonal polynomials which indeed gives [27]
$$\int \prod_i d\lambda_i\; \Delta(\lambda)^2\, e^{-\frac{1}{g_s} \sum_i \lambda_i^2} \sim \prod_{k=1}^{N-1} k!\,.$$
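For the Gaussian potential the finite-N saddle point is known in closed form, which gives a quick check of the eigenvalue picture. By the classical Stieltjes relation, the zeros $x_i$ of the Hermite polynomial $H_N$ satisfy $x_i = \sum_{j \neq i} 1/(x_i - x_j)$, so $\lambda_i = \sqrt{g_s}\, x_i$ solves $W'(\lambda_i) = 2 g_s \sum_{j \neq i} 1/(\lambda_i - \lambda_j)$ for $W(x) = x^2$. The sketch below verifies this numerically; the function names are our own.

```python
import numpy as np


def gaussian_saddle(n_eig, g_s):
    """Saddle point for W(x) = x^2: scaled zeros of the physicists' Hermite polynomial."""
    zeros, _weights = np.polynomial.hermite.hermgauss(n_eig)
    return np.sqrt(g_s) * zeros


def saddle_residual(lam, g_s):
    """W'(lam_i) - 2 g_s sum_{j != i} 1/(lam_i - lam_j) for W(x) = x^2."""
    diff = lam[:, None] - lam[None, :]
    np.fill_diagonal(diff, np.inf)  # drops the j == i term
    return 2.0 * lam - 2.0 * g_s * np.sum(1.0 / diff, axis=1)


lam_gauss = gaussian_saddle(20, g_s=0.05)
print(np.max(np.abs(saddle_residual(lam_gauss, 0.05))))
```

The residual vanishes to rounding error. For growing N at fixed $\mu = g_s N$, a histogram of these eigenvalues approaches the semi-circle shape.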
There is an interesting relation that directly connects the D-branes in the type II string
theory and the behaviour of the eigenvalues in the matrix models. In the “old days” it
was pointed out by Shenker [28] that the characteristic non-perturbative effect observed in
matrix models was the tunneling of eigenvalues and this was an effect of strength 1/gs .
This remark in some sense anticipated the importance of D-branes. Here we can connect
the two effects.
In the type IIB theory on the resolved geometry we can consider D5-branes wrapped
around an $S^3$ that interpolates between two $S^2$'s. Such an object manifests itself as a domain
wall in the four uncompactified spacetime dimensions. After the geometric transition such
a D5-brane will connect two three-cycles. It will describe a process where one unit of RR
flux is transported. That is, the RR flux in one $S^3$ is decreased by one unit and the flux in
another $S^3$ is increased by one. Since the space–time superpotential is given by
$$W_{\mathrm{eff}} = \sum_i \Bigl( N_i\, \frac{\partial F_0}{\partial S_i} + \alpha S_i \Bigr), \tag{3.12}$$
the tension of a domain wall transferring flux from the ith to the jth cycle is given by
$$T = \frac{\partial F_0}{\partial S_i} - \frac{\partial F_0}{\partial S_j} = \int_{B_{ij}} y(x)\, dx.$$
We now recognize this as the instanton action in the matrix model of an eigenvalue
tunneling from the cut $A_i$ to the cut $A_j$ along the path $B_{ij}$.
Given that we have found a natural stringy interpretation of ordinary matrix models, one
could ask what is the meaning of the double scaling limit in the context of the old matrix
model [3]. Following that limit on the gravity side for the single matrix model leads to the
Calabi–Yau geometry
$$H = uv + y^2 + x^{2m+1} + \text{deformations} = 0$$
as corresponding to the (2, 2m + 1) bosonic minimal model coupled to two-dimensional
gravity. The deformations correspond to the m observables of the (2, 2m + 1) model. In
fact for generic deformations of this geometry there are m A-cycles and m B-cycles and
we thus can choose m independent parameters to parametrize these deformations. The
infinitesimal deformations which map this geometry to the deformations of the (2, 2m + 1)
models are of the form
$$uv + y^2 + x \prod_{i=1}^{m} (x - \epsilon_i)^2 = 0, \tag{4.1}$$
where the $\epsilon_i$ are related to the deformation of the (2, 2m + 1) model with primary fields. For
example, the case of pure 2d gravity, i.e., the (2, 3) model has only one observable and
in that case we have $\epsilon = \mu^{1/2}$. In the usual matrix model this is obtained by considering
a quartic superpotential W (x). Let us take that to be an even function of x. Then there
are three critical points, including one at x = 0. Upon deformation by f (x) the critical
points can split. If we put all the N eigenvalues at the well corresponding to x = 0 then
only the x = 0 double point splits and the other two do not split. The double scaling limit
corresponds to taking the limit where one of the double points reaches one pair of doubles
and the other reaches the other pair. Taking the limit where the geometry is localized near
one set of triple zeros we obtain the (2, 3) local geometry given in (4.1). Embedding this
kind of theory in type IIB strings gives exactly the kind of N → ∞ dualities proposed
recently in [19] which relate certain limits of gauge systems with type IIB strings on
Calabi–Yau geometries without fluxes.
Note that this map to topological B-model is in perfect accord with the fact that the
bosonic strings have a hidden N = 2 superconformal symmetry [29], which in this case
we can identify with the superconformal theory on the above Calabi–Yau threefold. As a
check note that, if we only turn on the cosmological constant, then the genus g answer for
topological B-model scales as the holomorphic threeform Ω to the power of 2 − 2g [22].
For the (2, 3) model we have
$$\Omega = \frac{du\, dv\, dy\, dx}{dH} \sim \mu^{5/4}.$$
So we learn that the free energy of the B-model is given by an expansion of the form
$$F = \sum_{g \ge 0} c_g\, g_s^{2g-2}\, \mu^{\frac{5}{4}(2-2g)}$$
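The exponent 5/4 can be recovered from a scaling argument. This is a sketch under stated assumptions: the (2, 3) geometry is taken in the form $H = uv + y^2 + x(x - \epsilon)^2$ with $\mu = \epsilon^2$ (the symbol $\epsilon$ for the deformation parameter is our notation), and scaling weights $[\,\cdot\,]$ are assigned so that $H$ has weight one.

```latex
\begin{aligned}
&[u] = [v] = [y] = \tfrac12, \qquad [x] = [\epsilon] = \tfrac13, \qquad [H] = 1, \\
&[\Omega] = [u] + [v] + [y] + [x] - [H] = \tfrac56, \qquad [\mu] = 2\,[\epsilon] = \tfrac23, \\
&\Omega \sim \mu^{[\Omega]/[\mu]} = \mu^{5/4}, \qquad
F_g \sim g_s^{2g-2}\, \Omega^{2-2g} \sim g_s^{2g-2}\, \mu^{\frac54 (2-2g)}, \qquad
F_0 \sim g_s^{-2}\, \mu^{5/2}.
\end{aligned}
```

The genus-zero power $\mu^{5/2}$ is the familiar scaling of the free energy of pure two-dimensional gravity.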
observables of the (p, q) minimal model. As discussed in [13] in this case the reduction
of holomorphic 3-form leads to p − 1 1-forms ηi which naturally get identified with the
p − 1 eigenvalue densities in a (p − 1) matrix model. It would be interesting to study these
models in more detail and in particular the corresponding multi-matrix model duals.
There are many indications that topological strings on a general target space might be
described by some kind of integrable systems. This was originally shown for the $c \le 1$
topological string theories [30]. More recently in the context of A-model topological
strings on non-compact Calabi–Yau this integrability was shown by dualizing to certain
observables of Chern–Simons gauge theory on three-manifolds [19,20]. For A-models on
compact target spaces evidence has been accumulating in the mathematical literature on
Gromov–Witten invariants (see, for example, [31,32]). If we manage to formulate a matrix
model dual of the topological string this integrability is in some sense manifest. Both the
finite-N and the large-N matrix models are well known to give tau-functions of the KP and
Toda hierarchies [3,27]. This integrability of matrix models was the underlying reason that
non-critical strings with $c \le 1$ were exactly solvable. Our results indicate that for a large
class of non-compact Calabi–Yau manifolds this integrability is present.
Furthermore one would also like to be able to include gravitational descendants within
the topological string model. In terms of the B-model we expect that these are given by
non-normalizable deformations of the complex structure.
A related issue is the description of the c = 1 string, which is equivalent to topological
strings on the conifold geometry [33]. In a forthcoming paper [34] we will describe how
tachyon scattering processes are indeed reproduced in the conifold string theory and can
be described by a large-N dual gauge system, making contact with [35] and the recent
work [36].
Acknowledgements
We would like to thank F. Cachazo, V. Kazakov, G. Moore and H. Ooguri for valuable
discussions. R.D. would like to thank the Harvard Physics Department and the Institute for
Advanced Study, Princeton for kind hospitality during part of this work. The research of
R.D. is partly supported by FOM and the CMPA grant of the University of Amsterdam,
C.V. is partly supported by NSF grants PHY-9802709 and DMS-0074329.
References
[1] G. ’t Hooft, A planar diagram theory for strong interactions, Nucl. Phys. B 72 (1974) 461.
[2] P. Ginsparg, G.W. Moore, Lectures on 2D gravity and 2D string theory, hep-th/9304011.
[3] P. Di Francesco, P. Ginsparg, J. Zinn-Justin, 2D gravity and random matrices, Phys. Rep. 254 (1995) 1,
hep-th/9306153.
[4] M. Kontsevich, Intersection theory on the moduli space of curves and the matrix Airy function, Commun.
Math. Phys. 147 (1992) 1.
[5] E. Witten, On the structure of the topological phase of two-dimensional gravity, Nucl. Phys. B 340 (1990)
281.
[6] T. Banks, W. Fischler, S.H. Shenker, L. Susskind, M-theory as a matrix model: a conjecture, Phys. Rev. D 55
(1997) 5112, hep-th/9610043.
[7] O. Aharony, S.S. Gubser, J.M. Maldacena, H. Ooguri, Y. Oz, Large N field theories, string theory and
gravity, Phys. Rep. 323 (2000) 183, hep-th/9905111.
[8] R. Gopakumar, C. Vafa, On the gauge theory/geometry correspondence, Adv. Theor. Math. Phys. 3 (1999)
1415, hep-th/9811131.
[9] C. Vafa, Superstrings and topological strings at large N , J. Math. Phys. 42 (2001) 2798, hep-th/0008142.
[10] F. Cachazo, K.A. Intriligator, C. Vafa, A large N duality via a geometric transition, Nucl. Phys. B 603 (2001)
3, hep-th/0103067.
[11] J.D. Edelstein, K. Oh, R. Tatar, Orientifold, geometric transition and large N duality for SO/Sp gauge
theories, JHEP 0105 (2001) 009, hep-th/0104037.
[12] K. Dasgupta, K. Oh, R. Tatar, Geometric transition, large N dualities and MQCD dynamics, Nucl. Phys.
B 610 (2001) 331, hep-th/0105066;
K. Dasgupta, K. Oh, R. Tatar, Open/closed string dualities and Seiberg duality from geometric transitions in
M-theory, hep-th/0106040;
K. Dasgupta, K. Oh, R. Tatar, Geometric transition versus cascading solution, JHEP 0201 (2002) 031,
hep-th/0110050.
[13] F. Cachazo, K.A. Intriligator, C. Vafa, A large N duality via a geometric transition, Nucl. Phys. B 603 (2001)
3, hep-th/0103067.
[14] F. Cachazo, B. Fiol, K.A. Intriligator, S. Katz, C. Vafa, A geometric unification of dualities, Nucl. Phys.
B 628 (2002) 3, hep-th/0110028.
[15] F. Cachazo, C. Vafa, N = 1 and N = 2 geometry from fluxes, hep-th/0206017.
[16] H. Ooguri, C. Vafa, Worldsheet derivation of a large N duality, hep-th/0205297.
[17] M. Aganagic, C. Vafa, G2 manifolds, mirror symmetry, and geometric engineering, hep-th/0110171.
[18] D.E. Diaconescu, B. Florea, A. Grassi, Geometric transitions and open string instantons, hep-th/0205234.
[19] M. Aganagic, M. Marino, C. Vafa, All loop topological string amplitudes from Chern–Simons theory,
hep-th/0206164.
[20] D.E. Diaconescu, B. Florea, A. Grassi, Geometric transitions, del Pezzo surfaces and open string instantons,
hep-th/0206163.
[21] S. Kachru, S. Katz, A.E. Lawrence, J. McGreevy, Open string instantons and superpotentials, Phys. Rev.
D 62 (2000) 026001, hep-th/9912151.
[22] M. Bershadsky, S. Cecotti, H. Ooguri, C. Vafa, Kodaira–Spencer theory of gravity and exact results for
quantum string amplitudes, Commun. Math. Phys. 165 (1994) 311, hep-th/9309140.
[23] E. Witten, Chern–Simons gauge theory as a string theory, hep-th/9207094.
[24] M. Bertola, B. Eynard, J. Harnad, Partition functions for matrix models and isomonodromic tau functions,
nlin.SI/0204054.
[25] G. Akemann, Higher genus correlators for the Hermitian matrix model with multiple cuts, Nucl. Phys. B 482
(1996) 403, hep-th/9606004.
[26] S.R. Das, A. Jevicki, String field theory and physical interpretation of D = 1 strings, Mod. Phys. Lett. A 5
(1990) 1639.
[27] A. Morozov, Matrix models as integrable systems, hep-th/9502091.
[28] S.H. Shenker, The strength of nonperturbative effects in string theory, in: Proc. Random Surfaces and
Quantum Gravity, Cargese, 1990, pp. 191–200.
[29] M. Bershadsky, W. Lerche, D. Nemeschansky, N.P. Warner, Extended N = 2 superconformal structure of
gravity and W gravity coupled to matter, Nucl. Phys. B 401 (1993) 304, hep-th/9211040.
[30] R. Dijkgraaf, Intersection theory, integrable hierarchies and topological field theory, in: Cargese Summer
School on New Symmetry Principles in Quantum Field Theory, 1991, hep-th/9201003.
[31] A.B. Givental, Gromov–Witten invariants and quantization of quadratic Hamiltonians, math.AG/0108100.
[32] A. Okounkov, R. Pandharipande, Gromov–Witten theory, Hurwitz theory, and completed cycle,
math.AG/0204305.
[33] D. Ghoshal, C. Vafa, c = 1 string as the topological theory of the conifold, Nucl. Phys. B 453 (1995) 121,
hep-th/9506122.
20 R. Dijkgraaf, C. Vafa / Nuclear Physics B 644 (2002) 3–20
Abstract
We point out two extensions of the relation between matrix models, topological strings and N = 1
supersymmetric gauge theories. First, we note that by considering double scaling limits of unitary
matrix models one can obtain large-N duals of the local Calabi–Yau geometries that engineer N = 2
gauge theories. In particular, a double scaling limit of the Gross–Witten one-plaquette lattice model
gives the SU(2) Seiberg–Witten solution, including its induced gravitational corrections. Secondly,
we point out that the effective superpotential terms for N = 1 ADE quiver gauge theories are similarly
computed by large-N multi-matrix models, which have been considered in the context of ADE minimal
models on random surfaces. The associated spectral curves are multiple branched covers obtained as
Virasoro and W-constraints of the partition function.
2002 Published by Elsevier Science B.V.
1. Introduction
supergravity corrections $R^2 F^{2g-2}$ (with $R$ the Riemann curvature and $F$ the graviphoton
field strength) are similarly computed exactly by the genus $g > 0$ matrix diagrams.
This gauge theory/matrix model correspondence was a consequence of the large-N
dualities of [2–4] that relate the computation of holomorphic F-terms in the world-volume
theories of D-branes to partition functions of topological strings in local Calabi–Yau
geometries—a relation that was further explored in [5–11]. In the simplest case these local
non-compact Calabi–Yau manifolds take the form
$$vv' + y^2 - W'(x)^2 + f(x) = 0.$$
One finds that in the B-model topological string the tree-level free energy can be computed
in terms of the periods of the meromorphic differential y dx on the associated Riemann
surface
$$y^2 - W'(x)^2 + f(x) = 0.$$
As we argued in [1], this curve and the associated special geometry arise naturally from
the large-N dynamics of the matrix integral with action $W(\Phi)$. But we should stress again
that the relation with matrix models goes beyond the planar limit. The higher genus string
partition functions $F_g$ and the related gravitational couplings of the gauge theories are
exactly computed in the $1/N$ expansion of the matrix models.
Let us briefly summarize these connections; for more details see [1]. We start with the
Hermitian matrix integral
$$\int d\Phi\, e^{-S(\Phi)}$$
with action
$$S(\Phi) = \frac{1}{g_s}\,\mathrm{Tr}\, W(\Phi),$$
where $W(x)$ is a polynomial of degree $n + 1$. The matrix integral can be reduced to an integral
over the eigenvalues $x_1, \ldots, x_N$ of $\Phi$ in the potential $W(x)$. In the classical limit $g_s \to 0$,
where one ignores the interactions among the eigenvalues, the equation of motion is given
by
$$y(x) = g_s \frac{\partial S}{\partial x} = W'(x) = 0. \tag{1.1}$$
The associated classical spectral curve is
$$y^2 - W'(x)^2 = 0, \tag{1.2}$$
where $x, y$ can be considered as complex variables. Writing $W'(x) = \prod_i (x - a_i)$ we see
that this singular genus zero planar curve has $n$ double points at the critical points $x = a_i$.
Sometimes it can be helpful to think of the (x, y)-plane as a phase space, with y the
momentum conjugate to x, as given by the Hamilton–Jacobi equation (1.1). Then (1.2) has
an interpretation as the zero-energy level set of the (bosonic part of the) supersymmetric
quantum mechanics Hamiltonian associated to the superpotential $W(x)$, and $S(x)$ can be
thought of as the semi-classical WKB action of the associated quantum mechanical ground
state $\Psi(x) \sim e^{-S(x)}$.
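The statement that the classical curve (1.2) is singular exactly at the critical points can be checked numerically. The sketch below is only an illustration: it assumes a cubic potential with $W'(x) = (x + 1)(x - 2)$ (these critical points are not taken from the text) and tests whether the curve function $y^2 - W'(x)^2$ and both of its partial derivatives vanish at a point, which is the defining property of a double point.

```python
# Illustrative assumption: a cubic potential W with W'(x) = (x + 1)(x - 2),
# i.e. n = 2 critical points a_1 = -1 and a_2 = 2 (values not from the text).
CRITICAL_POINTS = (-1.0, 2.0)


def w_prime(x):
    """W'(x) = prod_i (x - a_i)."""
    result = 1.0
    for a in CRITICAL_POINTS:
        result *= x - a
    return result


def w_second(x):
    """W''(x), obtained from the product rule."""
    total = 0.0
    for skip in range(len(CRITICAL_POINTS)):
        term = 1.0
        for index, a in enumerate(CRITICAL_POINTS):
            if index != skip:
                term *= x - a
        total += term
    return total


def curve(x, y):
    """Left-hand side of the classical spectral curve y^2 - W'(x)^2 = 0."""
    return y * y - w_prime(x) ** 2


def curve_gradient(x, y):
    """Partial derivatives of the curve function with respect to x and y."""
    return (-2.0 * w_prime(x) * w_second(x), 2.0 * y)


def is_singular(x, y, tol=1e-12):
    """A point is a double point if the curve and its gradient both vanish."""
    d_x, d_y = curve_gradient(x, y)
    return abs(curve(x, y)) < tol and abs(d_x) < tol and abs(d_y) < tol


double_points = [(a, 0.0) for a in CRITICAL_POINTS if is_singular(a, 0.0)]
smooth_point = (0.0, w_prime(0.0))  # lies on the branch y = W'(x), away from a_i

print(double_points)
print(is_singular(*smooth_point))
```

The two branches $y = \pm W'(x)$ cross at each $x = a_i$, so every critical point passes the test, while a generic point on either branch does not.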
R. Dijkgraaf, C. Vafa / Nuclear Physics B 644 (2002) 21–39 23
Classically, the $N$ eigenvalues will cluster in groups of $N_i$ in the critical points $a_i$ where
they will form some meta-stable state. The relative number of eigenvalues or filling fraction
of the critical point $a_i$ we will denote as
$$\nu_i = N_i/N.$$
If $g_s$ is not zero, we have to take into account the Coulomb interaction that results from
integrating out the angular, off-diagonal components of the matrix $\Phi$. The equation of
motion of a single eigenvalue $x$ in the presence of the Dyson gas of eigenvalues $x_1, \ldots, x_N$
is now modified to
$$y = W'(x) - 2g_s \sum_{I=1}^{N} \frac{1}{x - x_I}.$$
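To make the effect of the Coulomb term concrete, here is a minimal numerical sketch. It assumes a Gaussian potential $W(x) = x^2/2$ and $N = 3$ eigenvalues (both choices are illustrative, not from the text) and looks for a configuration in which the force $W'(x_I) - 2g_s \sum_{J \neq I} 1/(x_I - x_J)$ vanishes on every eigenvalue, using plain gradient descent. For this assumed potential the equilibrium can be worked out by hand: the eigenvalues sit at $0$ and $\pm\sqrt{3 g_s}$.

```python
import math


def force(xs, g_s):
    """W'(x_I) - 2 g_s * sum_{J != I} 1/(x_I - x_J) for the assumed W(x) = x^2/2."""
    forces = []
    for i, x_i in enumerate(xs):
        coulomb = sum(1.0 / (x_i - x_j) for j, x_j in enumerate(xs) if j != i)
        forces.append(x_i - 2.0 * g_s * coulomb)
    return forces


def relax(xs, g_s, step=0.05, iterations=4000):
    """Plain gradient descent until the force on every eigenvalue vanishes."""
    xs = list(xs)
    for _ in range(iterations):
        xs = [x - step * f for x, f in zip(xs, force(xs, g_s))]
    return xs


G_S = 1.0
saddle = relax([-1.0, 0.1, 1.2], G_S)          # arbitrary starting positions
residual = max(abs(f) for f in force(saddle, G_S))
expected_outer = math.sqrt(3.0 * G_S)           # hand-derived value for N = 3

print(saddle, residual)
```

The repulsion keeps the eigenvalues from collapsing onto the single critical point $x = 0$ of the potential, which is the finite-$N$ precursor of a double point opening up into a cut.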
We will now take the large-N ’t Hooft limit keeping both $\mu = g_s N$ and the filling
fractions $\nu_i$ fixed. In this case each critical point has its own ’t Hooft coupling
$$\mu_i = g_s N_i = \mu\nu_i.$$
The collective dynamics of these eigenvalues in the large-N limit can be summarized
geometrically as follows. Each of the $n$ double points $x = a_i$ gets resolved into two
branch points $a_i^+, a_i^-$. The resulting branch cuts $A_i = [a_i^-, a_i^+]$ are filled by a continuous
density of eigenvalues that behave as fermions and spread out due to the Pauli exclusion
principle. This process of splitting up of double points is closely analogous to the transition
from the classical to the quantum moduli space in the Seiberg–Witten solution of N = 2
supersymmetric gauge theories [12], a relation that was explained in [11]. The resolution
of double points is captured by deforming the classical spectral curve (1.2) into the
quantum curve
where the cycles $B_i$ run from the branch cuts to some cut-off point at infinity.
Our strategy will be the following. The N = 2 theory can be geometrically engineered
by taking a suitable limit of type IIB string theory on a local Calabi–Yau [14–17]. This
local CY produces directly the Seiberg–Witten curve that encodes the dynamics of the
N = 2 gauge theory. More precisely the genus zero topological B-model amplitudes on the
local CY capture the Seiberg–Witten geometry. The higher genus amplitudes compute the
contributions of the gauge theory to certain gravitational terms of the form $R^2 F^{2g-2}$ [18,19].
We will now engineer a matrix model that is large-N dual to this local CY geometry.
In particular the planar limit gives the SW geometry and the 1/N corrections capture the
generation of the corresponding gravitational terms.
To be specific, let us discuss here the simplest case of pure SU(2) N = 2 Yang–Mills
theory. The SW solution is given in terms of the familiar elliptic curve
$$w^2 = \left(y^2 + u\right)^2 - \Lambda^4, \tag{2.1}$$
where $u$ is the coordinate on the moduli space, i.e., the vev of the adjoint $\tfrac{1}{2}\,\mathrm{tr}\,\Phi^2$, and $\Lambda$
the gauge theory scale, which we will sometimes conveniently set to $\Lambda = 1$.
The local CY obtained in the geometric engineering is given by the algebraic variety
[15–17]
$$vv' + \Lambda^2\left(z + \frac{1}{z}\right) + 2\left(y^2 + u\right) = 0$$
with $z \in \mathbf{C}^*$, i.e., an invertible variable. After reducing over the $(v, v')$-plane the associated
Riemann surface is
$$\Lambda^2\left(z + \frac{1}{z}\right) + 2\left(y^2 + u\right) = 0. \tag{2.2}$$
Since $z \neq 0$ we can multiply by $z$ and substitute $w = \Lambda^2 z + y^2 + u$ to bring the curve into
the form (2.1).
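The substitution that relates (2.2) to (2.1) is easy to verify numerically. The following sketch picks arbitrary complex sample values for $u$, $y$ and $\Lambda$ (illustrative assumptions only), solves (2.2) for $z$ as a quadratic equation, and checks that $w = \Lambda^2 z + y^2 + u$ then satisfies (2.1).

```python
import cmath


def z_roots(u, y, lam):
    """Both solutions z of Lambda^2 (z + 1/z) + 2 (y^2 + u) = 0, found from
    Lambda^2 z^2 + 2 (y^2 + u) z + Lambda^2 = 0 after multiplying by z."""
    p = y * y + u
    disc = cmath.sqrt(p * p - lam ** 4)
    return [(-p + disc) / lam ** 2, (-p - disc) / lam ** 2]


def residual_2_2(u, y, lam, z):
    """Left-hand side of the Riemann surface equation (2.2)."""
    return lam ** 2 * (z + 1.0 / z) + 2.0 * (y * y + u)


def residual_2_1(u, y, lam, z):
    """w^2 - ((y^2 + u)^2 - Lambda^4) with w = Lambda^2 z + y^2 + u."""
    w = lam ** 2 * z + y * y + u
    return w * w - ((y * y + u) ** 2 - lam ** 4)


# Arbitrary sample point (illustrative values, not from the text).
U, Y, LAM = 0.3 + 0.2j, 0.7 - 0.1j, 1.3

checks = [(abs(residual_2_2(U, Y, LAM, z)), abs(residual_2_1(U, Y, LAM, z)))
          for z in z_roots(U, Y, LAM)]
print(checks)
```

Both roots $z$ give the same value of $w^2$, since they correspond to the two signs $w = \pm\sqrt{(y^2+u)^2 - \Lambda^4}$.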
Furthermore, the reduction of the holomorphic three-form on the CY gives directly the
SW differential $y\,dz/z$. The prepotential $F(u)$ is then obtained by computing the periods
of this meromorphic one-form along the A-cycle and B-cycle of the elliptic curve (2.1).
Note that there are four branch points at $y = \pm\sqrt{-u \pm \Lambda^2}$. In the classical limit $\Lambda \to 0$
they coalesce pairwise in two double points $y = \pm\sqrt{-u}$. The moduli space contains two
singularities at $u = \pm 1$ where monopoles and dyons, respectively, become massless.
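The behaviour of the four branch points can be illustrated with a few lines of code. The sketch below (sample values of $u$ are arbitrary choices) computes the points $y = \pm\sqrt{-u \pm \Lambda^2}$ and the smallest distance between any two of them: this distance vanishes at $u = \pm 1$ for $\Lambda = 1$, and becomes small for any fixed $u$ as $\Lambda \to 0$.

```python
import cmath


def branch_points(u, lam=1.0):
    """The four values y = +-sqrt(-u +- Lambda^2) at which w = 0 on curve (2.1)."""
    points = []
    for sign in (+1, -1):
        root = cmath.sqrt(-u + sign * lam ** 2)
        points.extend([root, -root])
    return points


def min_separation(points):
    """Smallest distance between two distinct entries of the list."""
    return min(abs(p - q) for i, p in enumerate(points) for q in points[i + 1:])


generic = min_separation(branch_points(0.5))            # four distinct points
singular_plus = min_separation(branch_points(1.0))      # u = +Lambda^2
singular_minus = min_separation(branch_points(-1.0))    # u = -Lambda^2
classical = min_separation(branch_points(0.5, lam=1e-4))  # Lambda -> 0

print(generic, singular_plus, singular_minus, classical)
```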
Since $z$ is a $\mathbf{C}^*$ variable, it makes sense to write $z = e^{ix}$ with the variable $x$ periodic
modulo $2\pi$, and reexpress the original local CY and the resulting curve (2.2) as
$$\Lambda^2 \cos x + y^2 + u = 0,$$
with SW differential $y\,dx$. This way of writing the equation suggests a relation to a matrix
model with some suitable potential $W(x)$. In fact, since $x$ is now a periodic variable this
suggests a unitary matrix model where $z = e^{ix}$ will get interpreted as the eigenvalue of a
unitary matrix $U$.
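Unitary matrix models are defined as integrals over the group manifold $U(N)$ of the
form
$$Z = \frac{1}{\mathrm{Vol}(U(N))} \int_{U(N)} dU\, \exp\left(-\frac{1}{g_s}\,\mathrm{Tr}\, W(U)\right), \tag{2.3}$$
where $dU$ is the Haar measure. As in the Hermitian matrix models, one can diagonalize $U$
and express everything in integrals over its eigenvalues
$$U \sim \mathrm{diag}\left(e^{i\alpha_1}, \ldots, e^{i\alpha_N}\right),$$
where the $\alpha_I$ are periodic variables, giving
$$Z = \int \prod_I d\alpha_I \prod_{I<J} \sin^2\left(\frac{\alpha_I - \alpha_J}{2}\right) \exp\left(-\frac{1}{g_s}\sum_I W(\alpha_I)\right).$$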
Note that such a unitary matrix model can be viewed as a special case of a Hermitian
model by writing $U = e^{i\Phi}$ with $\Phi$ a ‘compactified’ Hermitian matrix, i.e., a matrix with a
periodic spectrum $\Phi \sim \Phi + 2\pi$. Such a periodicity is achieved by adding multiples of $2\pi$ to
the eigenvalues $\alpha_I$ of $\Phi$. This addition of multiple images is the familiar way to compactify
transverse directions for D-branes or for matrix models in M-theory [20]. For example, in
this way the Vandermonde determinants in the measure become after regularization
$$\prod_{n \in \mathbf{Z}} (\alpha_I - \alpha_J + 2\pi n) = \sin\left(\frac{\alpha_I - \alpha_J}{2}\right).$$
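The regularized product over images can be made explicit in a small numerical experiment. One possible regularization, assumed here for illustration, divides each factor with $n \neq 0$ by the $x$-independent constant $2\pi n$ and pairs the terms $\pm n$; the product then becomes $x \prod_{n \geq 1} \left(1 - x^2/(2\pi n)^2\right)$, which by the classical product formula for the sine converges to $2\sin(x/2)$. The overall constant is the normalization that the equality in the text leaves implicit.

```python
import math


def regularized_product(x, n_max=200000):
    """x * prod_{n=1}^{n_max} (1 - x^2 / (2 pi n)^2): the product over all
    images x + 2 pi n, with the x-independent divergent factor divided out."""
    result = x
    for n in range(1, n_max + 1):
        result *= 1.0 - (x / (2.0 * math.pi * n)) ** 2
    return result


samples = [0.3, 1.0, 2.5]  # arbitrary values of alpha_I - alpha_J
errors = [abs(regularized_product(x) - 2.0 * math.sin(x / 2.0)) for x in samples]
print(errors)
```

The truncation error falls off like $1/n_{\max}$, so the agreement is approximate at any finite cutoff.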
The unitary matrix model describes a collection of particles in a potential $W(\alpha)$ on the
unit circle interacting through a Coulomb potential. The equation of motion of the unitary
model is
$$W'(\alpha_I) - 2g_s \sum_{J \neq I} \cot\left(\frac{\alpha_I - \alpha_J}{2}\right) = 0.$$
The large-N solution proceeds exactly as in the uncompactified case. One introduces again
a resolvent, this time defined as
$$\omega(x) = -\frac{1}{N} \sum_I \cot\left(\frac{x - \alpha_I}{2}\right),$$
which satisfies a quadratic loop equation that can be derived in exactly the same way as in
the Hermitian case. In the limit $N \to \infty$ with $g_s N = \mu$ fixed this loop equation takes the
following familiar form, when written in terms of the variable $y = W'(x) + 2\mu\,\omega(x)$,
Our candidate unitary matrix model will be a much studied one, namely the so-called
Gross–Witten model [13] with potential $W(\alpha) \sim \cos\alpha$. This model was originally
introduced as a lattice discretization of two-dimensional (non-supersymmetric) Yang–Mills
theory. In such a lattice model to each plaquette with holonomy $U$ around the edge
one associates the Wilson action (in other contexts known as the Toda potential)
$$S(U) = \frac{\epsilon}{2g_s}\,\mathrm{Tr}\left(U + U^{-1}\right) = \frac{\epsilon}{g_s}\,\mathrm{Tr}\cos(\Phi).$$
Here $\Phi$ can be thought of as the lattice approximation to the gauge field strength $F_{\mu\nu}$, and
in the limit $\Phi \to 0$ this gives the quadratic Yang–Mills action $\mathrm{Tr}\,\Phi^2$. (We introduce the
parameter $\epsilon$ for convenience. It can of course be absorbed by rescaling $g_s = g_{YM}^2$.) In a
general lattice model one integrates over a collection of plaquettes, but in two dimensions
a single plaquette suffices to compute for instance a Wilson loop action.
The GW model has two critical points
$$W'(\alpha) = -\epsilon \sin(\alpha) = 0$$
at $a_1 = 0$ and $a_2 = \pi$. Note that for real and positive $\epsilon$ (the case relevant for Yang–Mills
theory) the second point is an unstable critical point. But that issue is irrelevant for the
holomorphic matrix models that we are considering here. The parameter $\epsilon$ can be complex,
and our eigenvalues $e^{i\alpha}$ are allowed to move off the unit circle into the punctured complex
plane $\mathbf{C}^*$. In fact, following our general philosophy, we will consider the perturbative
expansion of the matrix integral (2.3) around the saddle point where $N_1$ eigenvalues are at
the first critical point $a_1$ and $N_2 = N - N_1$ are at the second point $a_2$. So we are dealing
with a two-cut, meta-stable solution to the matrix integral. These two cuts introduce a
second parameter, besides the overall ’t Hooft coupling $\mu = g_s N$, namely the relative
filling fraction
$$\nu = (N_1 - N_2)/N.$$
If we introduce the separate ’t Hooft couplings for the two critical points
$$\mu_1 = g_s N_1, \qquad \mu_2 = g_s N_2,$$
then the difference of these couplings is related to the filling fraction
$$\tilde\mu = \mu_1 - \mu_2 = g_s (N_1 - N_2) = \mu \cdot \nu.$$
We will be interested in computing the planar limit of the free energy $F$ as a function of
the two couplings $\mu_1$ and $\mu_2$, or equivalently as a function of the ’t Hooft coupling $\mu$ and
the filling fraction $\nu$. Although $\nu$ takes values in the interval $[-1, 1]$ the final result will
turn out to be a holomorphic function of $\nu$.
The planar limit can be computed by solving the loop equation (2.4). To this end we have
to compute the quantum correction $f(x)$ defined in (2.5). For our choice of potential
$W(x) = \epsilon \cos x$ this becomes the following average over the eigenvalues
$$\begin{aligned}
f(x) &= -\frac{1}{N} \sum_I \epsilon\,(\sin x - \sin\alpha_I) \cot\left(\frac{x - \alpha_I}{2}\right) \\
&= -\frac{1}{N} \sum_I \epsilon\,(\cos x + \cos\alpha_I) \\
&= -\epsilon\,(\cos x + u).
\end{aligned} \tag{2.6}$$
Here the constant $u$ is defined as the average
$$u = \frac{1}{N} \sum_I \cos\alpha_I.$$
In the semi-classical approximation $g_s \to 0$ we have
$$u \approx \frac{N_1 - N_2}{N} = \nu,$$
since there are $N_1$ eigenvalues at the critical point $\alpha_I = 0$ and $N_2$ eigenvalues at $\alpha_I = \pi$,
which contribute respectively $+1$ and $-1$ to the average of $\cos\alpha_I$.
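The middle step in (2.6) rests on the identity $(\sin x - \sin\alpha)\cot\frac{x - \alpha}{2} = \cos x + \cos\alpha$. As a sanity check, the sketch below evaluates the eigenvalue average directly for a semi-classical configuration with $N_1 = 7$ eigenvalues at $\alpha = 0$ and $N_2 = 3$ at $\alpha = \pi$ (illustrative numbers, not from the text) and compares $f(x)/\epsilon$ with $-(\cos x + u)$, where $u$ should equal $\nu = 0.4$.

```python
import math


def summand(x, alpha):
    """(sin x - sin alpha) * cot((x - alpha) / 2)."""
    return (math.sin(x) - math.sin(alpha)) / math.tan((x - alpha) / 2.0)


def f_over_epsilon(x, alphas):
    """f(x) / epsilon as the eigenvalue average in the first line of (2.6)."""
    return -sum(summand(x, a) for a in alphas) / len(alphas)


def u_average(alphas):
    """u = (1/N) * sum_I cos(alpha_I)."""
    return sum(math.cos(a) for a in alphas) / len(alphas)


# Semi-classical configuration (illustrative filling numbers).
N1, N2 = 7, 3
alphas = [0.0] * N1 + [math.pi] * N2

u = u_average(alphas)
nu = (N1 - N2) / (N1 + N2)
x_sample = 0.9
mismatch = abs(f_over_epsilon(x_sample, alphas) + (math.cos(x_sample) + u))

print(u, nu, mismatch)
```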
Inserting our expression for f (x) into (2.4) gives the spectral curve
Now we will take a double scaling limit of the GW model to obtain the SW solution
relevant for N = 2 supersymmetric gauge theory. In this limit we will send $N \to \infty$ and at
the same time $\epsilon \to 0$ and $\nu \to 0$, keeping $\epsilon N_i$, $\nu/\epsilon$ and $g_s$ fixed; equivalently, we will
send the ’t Hooft coupling $\mu \to \infty$ keeping the difference of the two couplings $\mu_1$ and $\mu_2$
$$\tilde\mu = g_s (N_1 - N_2) = \mu\nu$$
fixed. In this limit the absolute difference in eigenvalues $N_1 - N_2$ remains finite, but $N_1$
and $N_2$ both become infinite, and therefore the relative filling fraction $\nu = (N_1 - N_2)/N$
goes to zero.
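The bookkeeping of this limit can be tabulated with exact rational arithmetic. In the sketch below the values $g_s = 1/2$ and $N_1 - N_2 = 4$ are arbitrary illustrative choices, and the name `mu_tilde` stands for the difference of the two ’t Hooft couplings. As $N$ grows, $\mu$ increases without bound, $\nu$ shrinks to zero, and their product stays fixed.

```python
from fractions import Fraction


def couplings(n1, n2, g_s):
    """Return (mu, nu, mu_tilde) for filling numbers n1, n2 at coupling g_s."""
    n = n1 + n2
    mu = g_s * n                     # overall 't Hooft coupling g_s * N
    nu = Fraction(n1 - n2, n)        # relative filling fraction
    mu_tilde = g_s * (n1 - n2)       # difference of the two couplings
    return mu, nu, mu_tilde


G_S = Fraction(1, 2)   # held fixed (illustrative value)
DIFFERENCE = 4         # N1 - N2, held fixed (illustrative value)

table = [couplings(n2 + DIFFERENCE, n2, G_S) for n2 in (10, 100, 1000, 10000)]
for mu, nu, mu_tilde in table:
    print(mu, nu, mu_tilde)
```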
After rescaling $y$ appropriately, the spectral curve reduces in this limit exactly to the
SW curve (with $\Lambda = 1$)
$$y^2 + \cos x + u = 0.$$
Note that the double scaled curve depends only on a single parameter $u$ that at weak
coupling could be identified with the filling fraction $\nu = (N_1 - N_2)/N$ of the matrix model.
Of course the limit we are taking is at a strong coupling point and the relation between $u$
and the matrix model modulus $\tilde\mu$ is more complicated, as discussed above.
As we have already mentioned, the prepotential is now computed by the periods of the
differential $y\,dx = y\,dz/z$ along the A and B cycles. This allows us to identify the SW
periods as
$$a = \oint_A y(x)\,dx = \tilde\mu, \qquad a_D = \oint_B y(x)\,dx = \frac{\partial F}{\partial\tilde\mu}. \tag{2.8}$$
From the original four branch points $a_1^\pm, a_2^\pm$ obtained in resolving the two double
points $a_1, a_2$, our double scaling limit takes the branch points $a_1^-, a_2^+$ to infinity while
keeping $a_1^+, a_2^-$ at finite distance. This leaves two homology one-cycles: the B-cycle that
runs around the cut $[a_1^+, a_2^-]$ and the dual A-cycle that is homologous to $A_1 - A_2$. This
behaviour of the branch points is exactly the behaviour in the double scaling limit one takes
in the old matrix models [21,22], as we also noted in [1]. For example, the (2, 3) critical
point of the one-matrix model was obtained starting from a curve of the form $y^2 = x^6 + \cdots$
and the double scaling limit got rid of all monomials with power higher than $x^3$, giving
an equation of the form $y^2 = x^3 + \cdots$. (Note that these branch points in the x-plane should
not be confused with the branch points in the y-plane that are relevant for the SW solution.)
The double scaling is closely analogous to the limit that was used in the A-model
topological string in [23]. In fact, when the scaling limit is embedded in type II string
theory, the resulting CY geometry based on (2.7) will have RR flux through the compact
cycles A1 and A2 . It is crucial that the remaining compact cycle B does not carry any
Ramond flux. We are thus engineering a large-N dual of a geometry without fluxes.
The GW model has a famous third order phase transition at (in our convention) $\mu/\epsilon = 2$.
This signals a transition of the eigenvalue distribution in which the single cut changes
topology and starts to cover the whole unit circle. After the GW phase transition the
eigenvalue distribution is given by ρ(α) ∼ cos α + µ/2. Geometrically speaking, in that
phase all four branch points are on top of each other.
This phase transition is however not relevant in our model. First of all, we are studying
a more general question by considering a stationary phase approximation around a meta-
stable state with two clusters of eigenvalues and consequently have to work with a two-
dimensional phase diagram (µ, ν). As we argued the GW solution puts ν = 1 and that is
very far away from our double scaling limit in which ν tends to zero. Indeed in our limit
the number of eigenvalues in the two cuts is roughly equal. Secondly, we are dealing with
a holomorphic object, and holomorphy excludes any phase transitions: one can just go
around the singularity. The GW phase transition is just a (very special) real slice of our
complex phase diagram.
Finally it would be interesting to connect this approach to the beautiful semi-classical
computation of the SW solution and its gravitational counterparts in [24]. That computation
was inspired by matrix integrals appearing in D-branes formulas.
It is not difficult to guess how the SW solution for gauge group SU(n) can be
engineered. In this case the curve associated to the local CY is of the form
$$\cos x + P_n(y) = 0,$$
with $P_n(y)$ a polynomial of degree $n$. More generally we can consider a chain of $U(n_i)$
gauge theories with bifundamental matter, for which the corresponding curve has been
obtained from the M5-brane viewpoint in [25] and from the viewpoint of geometric
where $F$ is a polynomial in $e^{\pm ix}$ and $y$. In particular if we consider the rank of all the
gauge groups to be equal to $n$, then $F$ is a polynomial in $y$ of degree $n$. Moreover the
difference between the highest and lowest powers of $e^{ix}$ is the number of $U(n)$
gauge groups plus 1. As we will discuss in greater detail in the next section in the context
of Hermitian multi-matrix models, such curves are typically produced by a multi-matrix
model consisting of $n - 1$ matrices. The choice of the coefficients in $F$ will be related to
the choice of the action and some suitable double scaling limit, as we studied in the context
of SU(2) gauge theory here.
We will now turn to a related generalization of [1] where we will connect superpotential
computations of quiver gauge theories to multi-matrix models.
We will restrict our discussion here to the ADE quivers, in particular the $A_r$ case,
although one can also include the affine quivers based on the extended Dynkin diagrams
$\hat{A}\hat{D}\hat{E}$.
Let $r$ denote the rank of the quiver $G$ and consider a partition
$$N = N_1 + \cdots + N_r.$$
In the associated N = 2 quiver gauge theory we assign to each of the $r$ vertices $v_i$ of
the Dynkin diagram of $G$ a $U(N_i)$ gauge field and to links connecting vertices $v_i$ and
$v_j$ we associate bifundamentals $Q_{ij}$ transforming in the representation $(N_i, \bar{N}_j)$ with a
Hermiticity condition $Q_{ij}^\dagger = Q_{ji}$.
with $s_{ij} = -s_{ji} = 1$ (for some ordering $i < j$), if the vertices $v_i$ and $v_j$ are linked in the
Dynkin diagram (we will write this relation also as $\langle i, j \rangle$), and $s_{ij} = 0$ otherwise. Here the
first term is the standard superpotential of the N = 2 theory with bifundamental matter. The
additional potentials $W_i(\Phi_i)$ are introduced to break the supersymmetry down to N = 1.
Within type II string theory these quiver gauge theories are obtained by wrapping
D5-branes over a particular CY geometry that is a fibration of the corresponding ADE
singularity over the complex plane [7]. This geometry contains $r$ intersecting $\mathbf{P}^1$’s.
According to [7,8] in the large-N limit the geometry undergoes a transition to a deformed
geometry where these $\mathbf{P}^1$’s are blown down and a number of $S^3$’s with RR flux are “blown
up”. The corresponding smooth CY geometry gives a dual description of the gauge theory
system.
In the context of B-model topological strings, the deformed CY geometry is dual
to a two-dimensional large-N gauge system, obtained from a collection of B-branes
wrapped on the intersecting $\mathbf{P}^1$’s. The world-volume theory of these branes consists of
open topological strings. So each $\mathbf{P}^1$ gives rise to a two-dimensional field theory with
Lagrangian [30]
$$S(\Phi) = \frac{1}{g_s} \int_{\mathbf{P}^1} \mathrm{Tr}\left(\Phi_i^1\, \bar{D}_A \Phi_i^0 + W_i(\Phi_i^0)\,\omega\right), \tag{3.2}$$
where $\omega$ is some volume form on $\mathbf{P}^1$, and $\Phi_i^0$ and $\Phi_i^1$ are adjoint fields of respectively
spin 0 and spin 1 coupled to a $U(N_i)$ holomorphic gauge field. Here we included the
effect of the superpotential $W_i(\Phi_i^0)$. The open topological strings connecting different $\mathbf{P}^1$’s
give as physical fields the bifundamentals $Q_{ij}$. Since the different $\mathbf{P}^1$’s intersect in points,
the action of these bifundamental scalar fields localizes to the intersection point $x$ and is
given by
$$S(Q) = \sum_{\langle i,j \rangle}\ \sum_{x \in \mathbf{P}^1_i \cap \mathbf{P}^1_j} \mathrm{Tr}\left(Q_{ij}(x)\,\Phi_j^0(x)\,Q_{ji}(x) - Q_{ji}(x)\,\Phi_i^0(x)\,Q_{ij}(x)\right).$$
(Compare the similar computation for the coupling of open topological strings connecting
Lagrangian A-branes intersecting along one-dimensional curves in [31].)
As in [1] one can see that in the end this two-dimensional topological field theory can
be completely reduced to the zero modes of the fields $\Phi_i^0(x) = \Phi_i$ and $Q_{ij}$. Thereby the
path-integral reduces to the “quiver matrix integral”
$$Z = \int \prod_i d\Phi_i \prod_{\langle i,j \rangle} dQ_{ij}\, \exp\left(-\frac{1}{g_s}\,\mathrm{Tr}\, W(\Phi, Q)\right), \tag{3.3}$$
The generalization of the conjecture in [1] will now identify the free energy of the
large-N quiver matrix model, for given filling fractions of the saddle points, with the closed
topological string partition function in the corresponding deformed CY geometry, and this
in turn with the effective superpotential of the quiver gauge theory.
The saddle points of the quiver superpotential have been discussed extensively in [7–9]
following the mathematical literature. The eigenvalues $x$ of the adjoints $\Phi_i$ have to satisfy
a series of equations: one for every positive root $\alpha_k$ of $G$. If that root is expressed in the
simple roots $e_i$ as
$$\alpha_k = \sum_i n_k^i e_i,$$
The saddle points can be labeled as $x_{a,k}$ with $a = 1, \ldots, d$ and $d$ the maximal degree
occurring in (3.4). If such a critical point appears with multiplicity $N_{a,k}$ then the total
number of eigenvalues of the matrix $\Phi_i$ in this saddle point is $N_{a,k} \cdot n_k^i$. A general saddle
point is therefore parametrized by the filling fractions $\nu_{a,k} = N_{a,k}/N$. We will consider the
matrix integral in the limit where both the $N_{a,k}$ and $N$ tend to infinity keeping the filling
fractions and the ’t Hooft coupling finite.
In the case of an $A_r$ quiver there is a more straightforward description of the saddle
points [7–9]. Introduce the $r + 1$ potentials
$$t_0(x) = 0, \qquad t_i(x) = \sum_{j=1}^{i} W_j'(x), \qquad i = 1, \ldots, r.$$
The curves
$$y - t_i(x) = 0$$
intersect in various double points given by
$$t_j(x) - t_i(x) = \sum_{k=i+1}^{j} W_k'(x) = 0.$$
The saddle points of the $A_r$ quiver matrix potential correspond exactly to these double
points.
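A small worked example may help to see how the double points are organized. The sketch below assumes quadratic potentials, so that each $W_k'(x) = m_k (x - c_k)$ is linear (the slopes $m_k$ and centers $c_k$ are made-up values for an $A_3$ quiver), and solves $t_j(x) = t_i(x)$ for every pair $0 \leq i < j \leq r$. Under this assumption each pair contributes one double point, and the pairs are in one-to-one correspondence with the $r(r+1)/2$ positive roots $e_{i+1} + \cdots + e_j$ of $A_r$.

```python
from fractions import Fraction
from itertools import combinations


def t(i, x, slopes, centers):
    """t_i(x) = sum_{j=1}^{i} W_j'(x), with the assumed W_j'(x) = m_j (x - c_j)."""
    return sum(slopes[j] * (x - centers[j]) for j in range(i))


def double_points(slopes, centers):
    """Solve t_j(x) = t_i(x) for every pair 0 <= i < j <= r."""
    r = len(slopes)
    points = {}
    for i, j in combinations(range(r + 1), 2):
        total_slope = sum(slopes[k] for k in range(i, j))
        if total_slope == 0:
            continue  # the two lines are parallel and never meet
        weighted = sum(slopes[k] * centers[k] for k in range(i, j))
        points[(i, j)] = weighted / total_slope
    return points


# Hypothetical A_3 data: slopes m_k and centers c_k of the linear W_k'.
SLOPES = [Fraction(1), Fraction(2), Fraction(3)]
CENTERS = [Fraction(0), Fraction(1), Fraction(-1)]

points = double_points(SLOPES, CENTERS)
for pair, x in sorted(points.items()):
    print(pair, x)
```

For potentials of higher degree each pair of curves meets in several points, so the count per positive root grows with the degree; the linear case is chosen here only because it can be solved exactly.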
In this $A_r$ case the original singular CY geometry is given by
$$uv + \prod_{i=0}^{r} \left(y - t_i(x)\right) = 0$$
which after reduction over $u, v$ gives precisely this collection of nodal curves. After the
deformation the corresponding smooth Riemann surface is given by
$$\prod_{i=0}^{r} \left(y - t_i(x)\right) + f(x, y) = 0 \tag{3.5}$$
for a suitable normalizable quantum deformation $f(x, y)$. Again every double point gets
resolved into two branch points. The resulting quantum curve is now an $(r + 1)$-fold cover
of the x-plane. By moving around in the x-plane these sheets will be exchanged through
Weyl reflections acting on the parameters $t_i$.
The analogues of the meromorphic one-form are constructed by reducing the holomorphic
three-form of the local CY over various cycles and for $A_r$ have the following description
[7,8]. Write the curve (3.5) in the factorized form
$$\prod_{i=0}^{r} \left(y - a_i(x)\right) = 0.$$
Quite remarkably it turns out that the quiver matrix integrals (3.3) (up to some minor
details) have already been studied in the context of the “old matrix models”. They have
been used to describe the coupling of ADE conformal minimal models to two-dimensional
gravity by Kostov [32], see also the reviews [33,34], and they have naturally emerged in
the study of matrix models and integrable systems in the work of the ITEP group [35]. We
will follow closely these works in presenting the main results, leaving the details to the
literature.
First of all, one can immediately integrate out the bifundamental fields Qij in the quiver
matrix integral to give an effective interaction between the adjoint fields Φi and Φj
$$\det\left(\Phi_i \otimes 1 - 1 \otimes \Phi_j\right)^{-1}.$$
Before we discuss the loop equations of the multi-matrix models, let us first rewrite the
solution of the one-matrix model as used in [1] in a more suggestive form, that is actually a
standard technique in matrix model technology. Here we found among others the reviews
[33,34,36] very helpful.
The resolvent $\omega(x)$ has a natural interpretation as a loop operator. More precisely, the
inverse Laplace transform
$$\int \frac{dx}{2\pi}\, e^{ix\ell}\, \omega(x) = \mathrm{Tr}\, e^{\ell\Phi}$$
is the zero-dimensional analogue of the Wilson loop. The non-linear all-genus loop
equation is usually written in terms of $\omega(x)$ as
$$\oint_C \frac{dz}{2\pi i}\, \frac{W'(z)}{x - z}\, \langle\omega(z)\rangle = \mu\, \langle\omega(x)^2\rangle, \tag{3.8}$$
where $\langle\cdots\rangle$ indicates an expectation value within the matrix integral. The contour $C$
encircles all the cuts but not the point $x$. This equation is supplemented with the boundary
condition $\langle\omega(x)\rangle \sim 1/x$ at infinity. The loop equation acts as a Schwinger–Dyson equation
of the matrix model. It gives a recursive relation to solve for the loop operator and the free
energy. In the planar limit we have large-N factorization $\langle\omega(x)^2\rangle = \langle\omega(x)\rangle^2$ and the loop
equation becomes algebraic.
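As an illustration of how the planar loop equation becomes algebraic, consider the Gaussian potential $W(x) = x^2/2$, an example chosen here and not taken from the text. Assuming the resolvent is normalized as $\omega(x) = \frac{1}{N}\sum_I 1/(x - \lambda_I)$, deforming the contour in (3.8) to infinity picks up the pole at $z = x$ and the $1/z$ tail of $\omega$, which gives $x\,\omega(x) - 1 = \mu\,\omega(x)^2$. The root with $\omega \sim 1/x$ corresponds to the Wigner semicircle density on $[-2\sqrt{\mu}, 2\sqrt{\mu}]$. The sketch below compares this closed form with a direct numerical integral over that density.

```python
import cmath
import math


def resolvent_closed_form(x, mu):
    """Root of mu*w^2 - x*w + 1 = 0 that behaves as 1/x at large x."""
    s = cmath.sqrt(x * x - 4.0 * mu)
    if (s / x).real < 0:  # choose the branch with sqrt(x^2 - 4 mu) ~ x
        s = -s
    return (x - s) / (2.0 * mu)


def resolvent_from_density(x, mu, n=2000):
    """Integrate rho(l) / (x - l) over the semicircle density
    rho(l) = sqrt(4 mu - l^2) / (2 pi mu), using l = 2 sqrt(mu) cos(theta)
    and the midpoint rule in theta."""
    radius = 2.0 * math.sqrt(mu)
    total = 0.0
    for k in range(n):
        theta = math.pi * (k + 0.5) / n
        total += math.sin(theta) ** 2 / (x - radius * math.cos(theta))
    return 2.0 * total / n


MU = 1.0
X = 3.0 + 0.5j  # arbitrary point away from the cut

w_exact = resolvent_closed_form(X, MU)
w_numeric = resolvent_from_density(X, MU)
loop_residual = abs(X * w_exact - 1.0 - MU * w_exact ** 2)

print(w_exact, w_numeric, loop_residual)
```

With a different normalization of $\omega$ the factors of $\mu$ move around, so the comparison should be read as a consistency check under the stated assumptions only.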
Loop operators are closely connected to collective fields. By integrating out the angular
variables the individual eigenvalues start to behave as fermions, and the collective field
is essentially constructed by bosonization of these fermion fields. In [1] we have already
speculated that this collective field should be identified with the Kodaira–Spencer field [18]
describing the closed strings moving on the local CY geometry.
For a single matrix model the collective field is defined as the chiral two-dimensional
scalar field
$$\varphi(x) = W(x) - 2g_s \sum_I \log(x - \lambda_I).$$
So in view of (3.7) we can identify the function $\varphi(x)$ with the action $S(x)$ of a single
eigenvalue as a function of its position $x$ in the complex plane in the presence of the gas
of other eigenvalues $\lambda_1, \ldots, \lambda_N$. The function $\varphi(x)$ is multi-valued in the x-plane. It has
branch cuts around which it changes sign. It is therefore only properly defined on the
double cover
On this Riemann surface $\varphi(x)$ has quantized periods around the A-cycles, given by the
filling numbers $\mu_i = g_s N_i$. Since it is a chiral field the periods around the dual B-cycles
are not independent and are expressed by the special geometry relations as $\partial F/\partial\mu_i$.
Note that if we work with a general, not necessarily polynomial, superpotential
$$W(x) = \sum_{n \geq 0} t_n x^n,$$
then the expectation value of the field $\partial\varphi(x)$ inserted in the matrix integral can be
represented by a linear differential operator in the couplings $t_n$ acting on the partition
function. For example,
$$\langle\partial\varphi(x)\rangle = \left(\sum_{n > 0} n t_n x^{n-1} - 2g_s^2 \sum_{n \geq 0} x^{-n-1} \frac{\partial}{\partial t_n}\right) Z,$$
and similarly for multi-point functions. (Here we used that the derivative $\partial/\partial t_n$ brings
down a factor $\frac{1}{g_s}\,\mathrm{Tr}\,\Phi^n$.)
With this notation there is an elegant way to write the loop equations. Introduce the
holomorphic stress-tensor
$$T(x) = (\partial\varphi)^2 = \sum_n L_n x^{-n-2}.$$
Then the all-genus loop equation (3.8) of the one-matrix model can be rewritten in the
suggestive form
$$\oint_C \frac{dz}{2\pi i}\, \frac{1}{x - z}\, \langle T(z)\rangle = 0. \tag{3.10}$$
That is, the expectation value $\langle T(x)\rangle$ has no singular terms if $x \to 0$. Therefore we can
also express (3.10) equivalently as the Virasoro constraints [37,38]
$$L_n Z = 0, \qquad n \geq -1.$$
The derivation of the constraints in the matrix model is completely standard: it
simply expresses the Ward identities following from the invariance under infinitesimal
reparametrizations $\Phi \to \Phi + \epsilon\,\Phi^{n+1}$ of the matrix variable $\Phi$.
In the planar limit we can substitute the classical values for $\partial\varphi(x) = y$ in $T(x)$ and then
Eq. (3.10) is a consequence of (3.9) that can now be written as
$$T(x) = W'(x)^2 - f(x),$$
which shows that $T(x)$ is indeed regular (even polynomial) at $x = 0$.
The large-N solution of the quiver matrix integral (3.3) now proceeds along similar
lines [32,35]. One introduces $r$ scalar fields $\varphi_i(x)$ through the one-forms (3.7) as
$$y_i(x)\,dx = \partial\varphi_i(x).$$
One can then show that the multi-valued fields $\varphi_i(x)$ are actually the values of one single-valued
field $\varphi(x)$ (essentially the full matrix model action) on an $(r + 1)$-fold branched cover of the
complex x-plane. This branched cover is the spectral curve associated to the quiver matrix
integral, and turns out to be given by (3.5) in the $A_r$ case.
The general derivation of the curve proceeds through generalized loop equations.
For these multi-matrix models we have not only the Virasoro constraints, expressing
reparametrization invariance in the matrix variables $\Phi_i$, but also higher order
relations [37,38]. The full set of loop equations is obtained by showing that the partition
function $Z$ satisfies a set of W-constraints, labeled by the Casimirs of the corresponding
ADE Lie algebra, which contains the Virasoro constraints. These constraints take the form
$$\oint_C \frac{dz}{2\pi i}\, \frac{1}{x - z}\, \langle W^{(s)}(z)\rangle = 0,$$
where $W^{(s)}(x)$ is a spin $s$ current in the W-algebra. When expressed in modes these
equations take the form
$$W_n^{(s)} \cdot Z = 0, \qquad n \geq 1 - s.$$
In the case of $A_r$ there is a leading spin $r + 1$ current that with a suitable basis of vectors
$v_0, \ldots, v_r$ can be written as
$$W^{(r+1)}(x) \sim \prod_{i=0}^{r} (v_i \cdot \partial\varphi) + \cdots.$$
We claim that in the planar limit this loop equation translates directly into the curve (3.5).
To be completely explicit let us give some more detail for the simplest case of $A_2$. Here
we have two matrices $\Phi_1, \Phi_2$ with potentials $W_1(\Phi_1)$ and $W_2(\Phi_2)$. The classical singular
curve is after a shift in $y$ given by
$$\left(y - t_1(x)\right)\left(y - t_2(x)\right)\left(y - t_3(x)\right) = 0$$
with
$$t_1 = -(2W_1' + W_2')/3, \qquad t_2 = (W_1' - W_2')/3, \qquad t_3 = (W_1' + 2W_2')/3,$$
all polynomials in $x$. To find the quantum curve we introduce the resolvents
$$\omega_1(x) = \sum_I \frac{1}{x - \lambda_{1,I}}, \qquad \omega_2(x) = \sum_I \frac{1}{x - \lambda_{2,I}},$$
and the one-forms $y_i(x)\,dx$
$$y_1 = W_1' - \mu(2\omega_1 - \omega_2), \qquad y_2 = W_2' - \mu(2\omega_2 - \omega_1).$$
We now claim that the quantum curve is given by
$$\left(y - a_1(x)\right)\left(y - a_2(x)\right)\left(y - a_3(x)\right) = 0,$$
where the functions $a_i(x)$ are no longer polynomials, but instead are defined as
$$a_1 = t_1 + \mu\omega_1, \qquad a_2 = t_2 - \mu(\omega_1 - \omega_2), \qquad a_3 = t_3 - \mu\omega_2.$$
$$a_2 - a_1 = y_1, \qquad a_3 - a_2 = y_2.$$
Now after some algebra, expanding out terms like $\omega_i(x)^3$, one verifies that indeed
$$\left(y - a_1(x)\right)\left(y - a_2(x)\right)\left(y - a_3(x)\right) = \left(y - t_1(x)\right)\left(y - t_2(x)\right)\left(y - t_3(x)\right) + f(x)\,y + g(x) = 0$$
with $f(x)$ and $g(x)$ polynomials.
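The purely algebraic part of this $A_2$ statement can be checked pointwise with exact arithmetic. In the sketch below the values of $W_1'$, $W_2'$, $\omega_1$, $\omega_2$ and $\mu$ at a fixed point $x$ are replaced by random rationals (an assumption made only for the check). It verifies that $a_2 - a_1 = y_1$ and $a_3 - a_2 = y_2$, and that the $y^2$ coefficient vanishes in both cubics, so their difference is indeed of the form $f\,y + g$. It does not test that $f(x)$ and $g(x)$ are polynomials in $x$, which relies on the loop equations.

```python
from fractions import Fraction
import random


def cubic_coefficients(r1, r2, r3):
    """(c2, c1, c0) in (y - r1)(y - r2)(y - r3) = y^3 + c2 y^2 + c1 y + c0."""
    return (-(r1 + r2 + r3), r1 * r2 + r1 * r3 + r2 * r3, -(r1 * r2 * r3))


random.seed(0)


def sample():
    """A random rational standing in for the value of a function at fixed x."""
    return Fraction(random.randint(-50, 50), random.randint(1, 20))


# Stand-ins for W_1'(x), W_2'(x), omega_1(x), omega_2(x) and mu.
w1p, w2p, om1, om2, mu = (sample() for _ in range(5))

t1, t2, t3 = -(2 * w1p + w2p) / 3, (w1p - w2p) / 3, (w1p + 2 * w2p) / 3
a1, a2, a3 = t1 + mu * om1, t2 - mu * (om1 - om2), t3 - mu * om2
y1 = w1p - mu * (2 * om1 - om2)
y2 = w2p - mu * (2 * om2 - om1)

quantum = cubic_coefficients(a1, a2, a3)
classical = cubic_coefficients(t1, t2, t3)
f_value = quantum[1] - classical[1]  # coefficient of y in the difference
g_value = quantum[2] - classical[2]  # constant term of the difference

print(a2 - a1 == y1, a3 - a2 == y2, quantum[0], classical[0])
```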
Acknowledgements
We would like to thank J. de Boer, M. Mariño, and E. Verlinde for discussions. The
research of R.D. is partly supported by FOM and the CMPA grant of the University of
Amsterdam, C.V. is partly supported by NSF grants PHY-9802709 and DMS-0074329.
References
[1] R. Dijkgraaf, C. Vafa, Matrix models, topological strings, and supersymmetric gauge theories, hep-th/0206255.
[2] R. Gopakumar, C. Vafa, On the gauge theory/geometry correspondence, Adv. Theor. Math. Phys. 3 (1999)
1415, hep-th/9811131.
[3] C. Vafa, Superstrings and topological strings at large N , J. Math. Phys. 42 (2001) 2798, hep-th/0008142.
[4] F. Cachazo, K.A. Intriligator, C. Vafa, A large N duality via a geometric transition, Nucl. Phys. B 603
(2001) 3, hep-th/0103067.
[5] J.D. Edelstein, K. Oh, R. Tatar, Orientifold, geometric transition and large N duality for SO/Sp gauge
theories, JHEP 0105 (2001) 009, hep-th/0104037.
[6] K. Dasgupta, K. Oh, R. Tatar, Geometric transition, large N dualities and MQCD dynamics, Nucl. Phys.
B 610 (2001) 331, hep-th/0105066;
K. Dasgupta, K. Oh, R. Tatar, Open/closed string dualities and Seiberg duality from geometric transitions in
M-theory, hep-th/0106040;
K. Dasgupta, K. Oh, R. Tatar, Geometric transition versus cascading solution, JHEP 0201 (2002) 031, hep-th/0110050.
[7] F. Cachazo, S. Katz, C. Vafa, Geometric transitions and N = 1 quiver theories, hep-th/0108120.
[8] F. Cachazo, B. Fiol, K.A. Intriligator, S. Katz, C. Vafa, A geometric unification of dualities, Nucl. Phys.
B 628 (2002) 3, hep-th/0110028.
[9] K.h. Oh, R. Tatar, Duality and confinement in N = 1 supersymmetric theories from geometric transitions,
hep-th/0112040.
[10] H. Fuji, Y. Ookouchi, Confining phase superpotentials for SO/Sp gauge theories via geometric transition,
hep-th/0205301.
[11] F. Cachazo, C. Vafa, N = 1 and N = 2 geometry from fluxes, hep-th/0206017.
[12] N. Seiberg, E. Witten, Electric–magnetic duality, monopole condensation, and confinement in N = 2
supersymmetric Yang–Mills theory, Nucl. Phys. B 426 (1994) 19;
N. Seiberg, E. Witten, Nucl. Phys. B 430 (1994) 485, hep-th/9407087, Erratum.
[13] D.J. Gross, E. Witten, Possible third order phase transition in the large N lattice gauge theory, Phys. Rev.
D 21 (1980) 446.
[14] S. Kachru, A. Klemm, W. Lerche, P. Mayr, C. Vafa, Nonperturbative results on the point particle limit of
N = 2 heterotic string compactifications, Nucl. Phys. B 459 (1996) 537, hep-th/9508155.
[15] A. Klemm, W. Lerche, P. Mayr, C. Vafa, N. Warner, Self-dual strings and N = 2 supersymmetric field
theory, Nucl. Phys. B 477 (1996) 746, hep-th/9604034.
[16] S. Katz, A. Klemm, C. Vafa, Geometric engineering of quantum field theories, Nucl. Phys. B 497 (1997)
173, hep-th/9609239.
[17] S. Katz, P. Mayr, C. Vafa, Mirror symmetry and exact solution of 4D N = 2 gauge theories I, Adv. Theor.
Math. Phys. 1 (1998) 53, hep-th/9706110.
[18] M. Bershadsky, S. Cecotti, H. Ooguri, C. Vafa, Kodaira–Spencer theory of gravity and exact results for
quantum string amplitudes, Commun. Math. Phys. 165 (1994) 311, hep-th/9309140.
[19] I. Antoniadis, E. Gava, K.S. Narain, T.R. Taylor, Topological amplitudes in string theory, Nucl. Phys. B 413
(1994) 162, hep-th/9307158.
[20] W.I. Taylor, D-brane field theory on compact spaces, Phys. Lett. B 394 (1997) 283, hep-th/9611042.
[21] P. Ginsparg, G.W. Moore, Lectures on 2D gravity and 2D string theory, hep-th/9304011.
[22] P. Di Francesco, P. Ginsparg, J. Zinn-Justin, 2D gravity and random matrices, Phys. Rep. 254 (1995) 1,
hep-th/9306153.
[23] M. Aganagic, M. Marino, C. Vafa, All loop topological string amplitudes from Chern–Simons theory, hep-th/0206164.
[24] N.A. Nekrasov, Seiberg–Witten prepotential from instanton counting, hep-th/0206161.
[25] E. Witten, Solutions of four-dimensional field theories via M-theory, Nucl. Phys. B 500 (1997) 3, hep-th/9703166.
[26] K. Hori, C. Vafa, Mirror symmetry, hep-th/0002222.
[27] K. Hori, A. Iqbal, C. Vafa, D-branes and mirror symmetry, hep-th/0005247.
[28] D.E. Diaconescu, B. Florea, A. Grassi, Geometric transitions, del Pezzo surfaces and open string instantons,
hep-th/0206163.
[29] M. Marino, Chern–Simons theory, matrix integrals, and perturbative three-manifold invariants, hep-th/0207096.
[30] S. Kachru, S. Katz, A.E. Lawrence, J. McGreevy, Open string instantons and superpotentials, Phys. Rev.
D 62 (2000) 026001, hep-th/9912151.
[31] H. Ooguri, C. Vafa, Knot invariants and topological strings, Nucl. Phys. B 577 (2000) 419, hep-th/9912123.
[32] I.K. Kostov, Gauge invariant matrix model for the A-D-E closed strings, Phys. Lett. B 297 (1992) 74, hep-th/9208053.
[33] I.K. Kostov, Bilinear functional equations in 2D quantum gravity, in: Razlog 1995, New trends in quantum
field theory 77–90, hep-th/9602117.
[34] I.K. Kostov, Conformal field theory techniques in random matrix models, hep-th/9907060.
[35] S. Kharchev, A. Marshakov, A. Mironov, A. Morozov, S. Pakuliak, Conformal matrix models as an
alternative to conventional multimatrix models, Nucl. Phys. B 404 (1993) 717, hep-th/9208044.
[36] A. Morozov, Integrability and matrix models, Phys. Usp. 37 (1994) 1, hep-th/9303139.
[37] R. Dijkgraaf, H. Verlinde, E. Verlinde, Loop equations and Virasoro constraints in nonperturbative 2D
quantum gravity, Nucl. Phys. B 348 (1991) 435.
[38] M. Fukuma, H. Kawai, R. Nakayama, Continuum Schwinger–Dyson equations and universal structures in
two-dimensional quantum gravity, Int. J. Mod. Phys. A 6 (1991) 1385.
Nuclear Physics B 644 (2002) 40–64
www.elsevier.com/locate/npe
Received 18 July 2002; received in revised form 7 August 2002; accepted 29 August 2002
Abstract
We present an alternative form of gauge-invariant action for the superstring in the plane wave
background with Ramond–Ramond (RR) five-form flux. The Wess–Zumino term is given explicitly
in a bilinear form of the left-invariant currents by introducing a fermionic center to define the
nondegenerate group metric. The reparametrization invariance generators, whose combinations are
conformal generators, and fermionic constraints, half of which generate κ-symmetry, are obtained.
Equations of motion are obtained in conformal-invariant and background-covariant manners.
2002 Elsevier Science B.V. All rights reserved.
1. Introduction
Recently the plane wave solution with the Ramond–Ramond (RR) 5-form flux was
found as a maximally supersymmetric type IIB supergravity solution [1] in addition to
the Minkowski flat and the AdS5 × S5 spaces, based on studies of the plane wave
solutions with the 4-form flux in 11-dimensional supergravity [2]. Penrose’s
limiting procedure [3] was applied to the AdS spaces to obtain these plane wave solutions
with fluxes (pp-waves) [4–6]. The pp-wave background is recognized as an approximation of the AdS spaces and leads
to interesting approaches to the AdS/CFT correspondence [7].
0550-3213/02/$ – see front matter 2002 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 ( 0 2 ) 0 0 7 9 0 - 3
M. Hatsuda et al. / Nuclear Physics B 644 (2002) 40–64 41
The gauge-invariant action for a superstring in the RR plane wave background was
presented [8] and it was also shown that the action in the light-cone gauge becomes simply
an action for 8 bosons and 8 fermions which are free and massive in 2 dimensions. Brane
actions in the RR pp-wave background have been widely studied [9], mostly in the light-cone
gauge. In the light-cone gauge the conformal symmetry is broken by a 2-dimensional
mass term at the gauge-fixed level, though it should be recovered in the full string theory
even in the RR pp-wave background [10]. The light-cone Hamiltonian does not commute
with the other global space–time charges; thus states in a supergravity multiplet have different
light-cone energies [11], and the light-cone energy is not minimized for BPS states.
From the point of view of symmetry, the light-cone approach is therefore not always suitable
for understanding the system. Manifestly conformal-invariant approaches have provided
elegant formulations and practical computational methods in the development of string theories.
However, the conformal-invariant treatment in the RR pp-wave background has not been
explored except in an alternative hybrid approach [12].
In this paper we will study the superstring in the RR pp-wave background in a covariant
and manifestly conformal-invariant way. The RR backgrounds are usually described by the
Green–Schwarz type actions which contain Wess–Zumino (WZ) terms. In Ref. [8] the WZ
term of the superstring in the pp-wave background is given as a one-parameter integral of
a closed three-form. Such integrals can rarely be performed explicitly for
general curved backgrounds, so it is difficult to discuss the local symmetry constraints
and the covariant equations of motion that will be needed in covariant string field theories.
In the flat background the integration in the WZ term is performed explicitly giving the
Green–Schwarz action [13]. In the pp-wave background it is performed only in the light-
cone gauge leading to the solvable action [8].
On the other hand, it was shown that an alternative form of the WZ term for the covariant
superstring in AdS spaces can be constructed in a bilinear form of the left-invariant (LI)
currents [14–17]. In contrast to the integral representation of the WZ term [18] the bilinear
form WZ term allows concrete computations of local symmetry constraints [19] and global
charges [17]. The WZ term can be written as a bilinear form of the LI currents because
a nondegenerate group metric, depending on the scale parameter, exists. For the
super-pp-wave algebra the fermionic component of the group metric is degenerate, so the WZ term cannot
be constructed in an analogous way. In this paper we find the bilinear form WZ term in the
super-pp-wave background by taking the Penrose limit [5] of that in the super-AdS background
[17]. The limit must be taken carefully, since the bilinear form WZ term
contains a divergent coefficient when the Penrose limit parameter, Ω, is brought to zero.
The divergent term is a closed form, namely a bilinear product of the leading terms of the LI
currents in the power series expansion in Ω, and it is subtracted. The finite contribution is given
by the next-to-leading term. The next-to-leading term of the LI currents cannot be obtained
from the super-pp-wave algebra itself; it is obtained from the nondegenerate super-pp-wave algebra.
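\[ L_{0,\mathrm{AdS}} = e^{\hat a}_{\mathrm{AdS}}\, e_{\mathrm{AdS},\hat a}
 = -(dy)^2 + \sin^2\!\Bigl(\frac{y}{R}\Bigr)\, d\Omega_4^2 + (dy')^2 + \sin^2\!\Bigl(\frac{y'}{R}\Bigr)\, d\Omega_4'^2 \tag{2.3} \]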
1 For the super-AdS and super-pp-wave algebras, this fermionic generator is not a center of the superalgebra,
because it does not commute with the Lorentz generators. In the flat case, the fermionic generator is a center of the
supertranslation algebra (not of the super-Poincaré algebra). We nevertheless use the word “center” throughout this paper;
this should not cause confusion.
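where $d\Omega_4^2$ and $d\Omega_4'^2$ are 4-sphere metrics. The Penrose limit is obtained as $\Omega \to 0$ after the
rescaling
\[ y^+ \to \Omega^2 y^+, \qquad y^- \to y^-, \qquad y^{\hat\imath} \to \Omega\, y^{\hat\imath}, \tag{2.4} \]
where $y^\pm = (y^9 \pm y^0)/\sqrt{2}$, corresponding to $P_+ \to \Omega^{-2} P_+$, $P_- \to P_-$, $P_{\hat\imath} \to \Omega^{-1} P_{\hat\imath}$.
Rewriting (2.1) in terms of (2.4) and taking Ω infinitesimally small, (2.1) turns out to be
a power series in Ω. The Cartan 1-forms should also be rescaled as $e^+ \to \Omega^2 e^+$, $e^- \to e^-$,
$e^{\hat\imath} \to \Omega e^{\hat\imath}$ for consistency. Taking the $\Omega \to 0$ limit, the leading terms of the expansion in Ω are
identified with the LI Cartan 1-forms in the plane wave background:
\begin{align*}
\Omega^2 e^+_{\mathrm{AdS}} &= \Omega^2 \left[ dy^+ + \left( \frac{\sin(y^-/\sqrt{2}R)}{y^-/\sqrt{2}R} - 1 \right) \frac{y^{\hat\imath}}{y^-} \left( \frac{y^{\hat\imath}}{y^-}\, dy^- - dy^{\hat\imath} \right) \right] + o(\Omega^4)
 \equiv \Omega^2 e^+_{pp} + o(\Omega^4), \\
e^-_{\mathrm{AdS}} &= dy^- + o(\Omega^4) \equiv e^-_{pp} + o(\Omega^4), \\
\Omega\, e^{\hat\imath}_{\mathrm{AdS}} &= \Omega \left[ dy^{\hat\imath} - \left( \frac{\sin(y^-/\sqrt{2}R)}{y^-/\sqrt{2}R} - 1 \right) \left( \frac{y^{\hat\imath}}{y^-}\, dy^- - dy^{\hat\imath} \right) \right] + o(\Omega^3)
 \equiv \Omega\, e^{\hat\imath}_{pp} + o(\Omega^3). \tag{2.5}
\end{align*}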
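The bosonic part of the sigma model action in the pp-wave background is written, using the
leading terms, as
\begin{align*}
L_{0,pp} = e^{\hat a}_{pp}\, e_{pp,\hat a}
 &= 2\,dy^+ dy^- + \left[ \left( \frac{\sin(y^-/\sqrt{2}R)}{y^-/\sqrt{2}R} \right)^{2} - 1 \right] \left[ \frac{(y^{\hat\imath})^2}{(y^-)^2}\,(dy^-)^2 - 2\,\frac{y^{\hat\imath}}{y^-}\, dy^- dy^{\hat\imath} \right]
 + \left( \frac{\sin(y^-/\sqrt{2}R)}{y^-/\sqrt{2}R} \right)^{2} dy^{\hat\imath} dy^{\hat\imath} \\
 &= 2\,dx^+ dx^- - \sum_{\hat\imath=1}^{8} \bigl(2\mu x^{\hat\imath}\bigr)^2 (dx^-)^2 + \sum_{\hat\imath=1}^{8} dx^{\hat\imath} dx^{\hat\imath}, \tag{2.6}
\end{align*}
where the last expression is obtained by the following field redefinition:
\[ x^{\hat\imath} = \frac{\sin 2\mu y^-}{2\mu y^-}\, y^{\hat\imath}, \qquad x^- = y^-, \qquad
 x^+ = y^+ + \sum_{\hat\imath=1}^{8} \frac{(y^{\hat\imath})^2}{2y^-} \left( 1 - \frac{\sin 4\mu y^-}{4\mu y^-} \right). \tag{2.7} \]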
For an Inönü–Wigner group contraction [21] the Cartan 1-forms are expanded with
respect to a parameter $s$, which is brought to zero, as
\[ T_A \to s^{-N_A} T_A \;\longrightarrow\; L^A(z) \to s^{N_A} L^A\bigl(s^{N_B} z^B\bigr) = \sum_{N=N_A}^{\infty} s^{N} L^A_{(N)}(z), \tag{3.1} \]
as was seen in (2.5) for the bosonic part. The Maurer–Cartan equations are also expanded as
\[ \sum_{N=N_A}^{\infty} s^{N}\, dL^A_{(N)} + \frac{1}{2} f^A_{BC} \sum_{M=N_B}^{\infty} \sum_{K=N_C}^{\infty} s^{M+K} L^B_{(M)} L^C_{(K)} = 0, \tag{3.2} \]
where $f^A_{BC}$ is the structure constant of the original Lie algebra. Usually only the leading terms
of the Cartan 1-forms are kept in the limiting procedure, as was done in the previous section.
The Maurer–Cartan equations for the leading terms,
\begin{align*}
& dL^A_{(N_A)} + \frac{1}{2} f^{A(N_A)}_{B(N_B)\,C(N_C)}\, L^B_{(N_B)} L^C_{(N_C)} = 0, \\
& f^{A(N_A)}_{B(N_B)\,C(N_C)} = f^A_{BC} \quad \text{for } N_A = N_B + N_C, \\
& f^{A(N_A)}_{B(N_B)\,C(N_C)} = 0 \quad \text{for } N_A \neq N_B + N_C, \tag{3.3}
\end{align*}
describe the resultant group structure in the $s \to 0$ limit.
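The selection rule in (3.3) can be illustrated with a standard textbook example that is not taken from this paper: the contraction of the rotation algebra to the two-dimensional Euclidean algebra. The sketch below assumes the normalization $[J_a, J_b] = \epsilon_{abc} J_c$ and the weights $N_{J_1} = N_{J_2} = 1$, $N_{J_3} = 0$; the names $P_1$, $P_2$ for the rescaled generators are introduced here only for illustration.

```latex
\begin{align*}
&\text{General rule: } T_A \to s^{-N_A} T_A
  \;\Longrightarrow\; [T_B, T_C] = s^{\,N_B + N_C - N_A} f^A_{BC}\, T_A , \\
&\text{Example: } J_{1} \to s^{-1} P_{1}, \quad J_{2} \to s^{-1} P_{2}, \quad J_3 \to J_3 : \\
&\qquad [P_1, P_2] = s^{2} J_3 \;\xrightarrow{\,s \to 0\,}\; 0
  \qquad (N_A = 0 \neq N_B + N_C = 2), \\
&\qquad [J_3, P_1] = P_2 , \quad [J_3, P_2] = -P_1
  \qquad (N_A = 1 = N_B + N_C).
\end{align*}
```

Only the brackets with $N_A = N_B + N_C$ survive the limit, which is the content of the second and third lines of (3.3); a bracket with $N_A > N_B + N_C$ would diverge, so the weights must be chosen to exclude it.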
The Penrose limit from the super-AdS group to the super-pp-wave group makes the
super-pp-wave group metric degenerate. The situation is similar to the flat limit
from the super-AdS group to the super-translation group, where the metric is also degenerate.
However, if the next-to-leading term in the fermionic Cartan 1-forms (3.1) is kept in
the limiting procedure, the nondegenerate group metric of the centrally extended super-pp-wave
group can be constructed, as we show below.
The nondegenerate group metric can be defined in the super-AdS space and the bilinear
form WZ term can be constructed as follows. In terms of light-cone indices â = (+, −, i, i )
î = (i, i ) and the light cone projection operators for spinors #±
θ ± = #± θ, ζ± = ζ #± , Q± = Q#± ,
1 1
#± ≡ Γ± Γ∓ , Γ± = √ (Γ9 ± Γ0 ), (3.4)
2 2
the Cartan 1-forms for the super-AdS5 × S5 space are written as
G−1 + − + −
AdS dGAdS = LAdS P+ + LAdS P− + LAdS Pî + LAdS Q+ + LAdS Q−
î
1 î jˆ
+ Lî∗AdS P ∗ + LAdS Mî jˆ , (3.5)
î 2
where
Pi∗ = M0i , Pi∗ = M9i . (3.6)
Here the nondegenerate metric for spinor index is ρQαα I Qββ J = Cαβ Cα β (τ1 )I J ≡ ραβ ,
and this is used for the WZ term as Lα Lβ ραβ . It gives d(Lα Lβ ραβ ) = Lα Lc Lβ fαcβ ∼ H[3]
γ
with totally antisymmetric structure constants defined as fcβ ργ α . In [17] it was shown that
(3.8) gives correct exterior derivative d B[2] = H[3] , κ-invariance of the total action and
the correct flat limit. It was also shown that the second term in (3.8) is required for the
pseudo-supersymmetry invariance giving the correct string charge in the superalgebra.
Let us discuss the Penrose limit of this bilinear form WZ term, (3.8). The Penrose limit
is taken as the following rescaling [5]
θ + → Ωθ + , θ− → θ− (3.9)
in addition to (2.4). This corresponds to s = Ω in (3.1) and the scaling dimensions NA in
(3.2) to be
This implies that the group metric is degenerate in the ‘−’ spinor direction. In order to
Q Q
make it nondegenerate fP+−Q− and fP −Q+ must be included, so additional form is required
î
whose scaling dimension is NP+ + NQ− = NPî + NQ+ = 2. It is L− −
(2) ≡ Lpp and must be
maintained in the limiting procedure, and explicit computation confirms that L − −
pp = L(2)
is the next to leading term. Under the Penrose limit the Cartan 1-forms of the super-AdS
group reduce to those of the super-pp-wave background as
NA A NA +2 ) for A = Q ,
Ω LAdS = Ω NA LA pp + o(Ω −
(3.13)
L−
AdS = L − + Ω 2
pp L − + o(Ω 4 ),
pp
and they satisfy the following MC equations for bosonic and fermionic Cartan 1-forms,
respectively,
1 î î
dL+ +
pp − √ Lpp L∗pp − i Lpp Γ L = 0,
2
dL− −
pp − i Lpp Γ Lpp = 0,
1 jˆ jˆî
dLîpp + √ L−
pp L∗pp + Lpp Lpp − i Lpp Γ Lpp = 0,
î î
2
√ √
dLi∗pp − 4 2 µ2 L− i j ji
pp Lpp + L∗pp Lpp + 2 2 iµLpp ΠΓ *Lpp = 0,
i
√ i j j i √ i
dLi∗pp − 4 2 µ2 L−
pp Lpp + L∗pp Lpp + 2 2 iµLpp Π Γ *Lpp = 0,
ij
dLpp + Lki
jk
pp Lpp + 2iµLpp ΠΓ
−ij
*Lpp = 0,
j k
ij pp Π Γ −ij *Lpp = 0,
dLpp + Lkppi Lpp + 2iµL (3.14)
1 î jˆ √ î
dL+
pp + Lpp Γî jˆ L+ −
pp + 2 L∗pp Γ+î Lpp
4
−
+ µ* 2Lpp ΠL+ − ΠLîpp Γî+ L− pp = 0,
1 î jˆ
dL− −
pp + Lpp Γî jˆ Lpp = 0,
4
− 1 î jˆ − √ î
dLpp + Lpp Γî jˆ Lpp − 2 L∗pp ΠΓî ΠΓ− L+ pp
4
+ µ* −2L+ −
pp ΠLpp − Lpp ΠΓî Γ− pp
î +
= 0, (3.15)
where Π = Γ5678 . These MC equations coincide with the ones obtained by the direct
computation [8] using
G−1 + − + −
pp dGpp = Lpp P+ + Lpp P− + Lpp Pî + Lpp Q+ + Lpp Q−
î
1 î jˆ
+ Lî∗pp P ∗ + Lpp Mî jˆ , (3.16)
î 2
except for the last equation for L−
pp . It would be obtained if one uses an extended super-pp-
wave algebra with a fermionic center. The last equation of (3.15) has the same form as the
MC equation of the AdS (3.7) then it becomes nondegenerate.
Following the above limiting procedure from the super-AdS WZ term (3.8), we propose
a superstring action in the super-pp-wave background as
S = d σ L = d 2 σ (L0 + LWZ ),
2
√
L0 = −T −h huv Lu â Lv b̂ ηâb̂ , [2],pp ,
LWZ = T B (3.17)
where the WZ term is
+I
[2],pp = i L
B Γ− (τ1 )I J ΠL+J −I −J
pp − 2Lpp Γ+ (τ1 )I J Π Lpp
4µ pp
− d θ̄ +I Γ− (τ1 )I J Π dθ +J , (3.18)
and u, v = τ, σ = 0, 1 are the world volume indices and Cartan 1-forms are LA =
dzM LM A = dσ u Lu A with zM = (x m , θ µ ).
It satisfies the following criteria:
= H[3] . (3.19)
(ii) κ-invariance
The second condition (ii) κ-invariance of the action is confirmed as follows. Let us
denote an arbitrary variation of the coset element δG as the following combination
δκ Lâ = −fBC
â
;κ LB LC ,
δκ Lα = d ;κ Lα − fBC α
;κ LB LC , (3.22)
1
+ ;κ L = 0,
δκ L+ − √ ;κ Lî Lî∗ + Lî ;κ Lî∗ − 2i LΓ
2
− ;κ L = 0,
δκ L− − 2i LΓ
1 ˆ ˆ ˆ ˆ î ;κ L = 0,
δκ Lî + √ ;κ L− Lî∗ + L− ;κ Lî∗ + ;κ Lj Lj î + Lj ;κ Lj î − 2i LΓ
2
(3.23)
δκ L+ + µ* 2 ;κ L− ΠL+ + L− Π;κ L+ − Π ;κ Lî Γî Γ+ L− + Lî Γî Γ+ ;κ L−
1 ˆ ˆ 1
+ ;κ Lî j Γî jˆ L+ + Lî j Γî jˆ ;κ L+ − √ ;κ Lî∗ Γî+ L− + Lî∗ Γî+ ;κ L− = 0,
4 2 2
1 ˆ ˆ
δκ L− + ;κ Lî j Γî jˆ L− + Lî j Γî jˆ ;κ L− = 0,
4
δκ
L− − µ* 2 ;κ L+ ΠL− + L+ Π;κ L− − Π ;κ Lî Γî Γ− L+ + Lî Γî Γ− ;κ L+
1 ˆ ˆ
+ ;κ Lî j Γî jˆ
L− + Lî j Γî jˆ ;κ
L−
4
1
− √ Π ;κ Lî∗ Γî− ΠL+ + Lî∗ Γî− Π;κ L+ = 0. (3.24)
2 2
Using these relations, the κ-variation of the total action is calculated as
√
δκ (L0 + bLWZ ) = −T δκ −g g uv Lâu Lâ,v + bT * uv B[2]uv
= −2iT ;κ L − det Guv GuvL / u 1 + b* uvL
/ u τ3 Lv
= −2iT ;κ L (−Γ(1) + b)* uvL / u τ3 Lv (3.25)
with
1
Γ(1) = √ /,
τ3L 2
Γ(1) = 1, tr Γ(1) = 0,
2 − det Guv
− − det Guv GuvL
/ v = Γ(1)τ3 * uvL
/v. (3.26)
1 ij −
− Γ θ θ̄ Γij − Γ i j θ θ̄ − Γi j Γ+
2
ij +
− Γ θ θ̄ Γij − Γ i j θ θ̄ + Γi j Γ− Π* , (3.31)
#+ Ψ 2 #+ ≡ Ψ+2 = 2µi *ΠΓ+ Γ î θ − θ̄ − Γî + Γ+ Γ î θ − θ̄ − Γî Π* ,
#− Ψ 2 #− ≡ Ψ−2 = −µi Γ ij θ − θ̄ − Γij − Γ i j θ − θ̄ − Γi j Γ+ Π*,
#+ Ψ 2 #− Ω = 2µi *Π 2θ + θ̄ − Γ+ + Γ+ Γ î θ − θ̄ + Γî − Γ+ Γ î θ − θ̄ + ΠΓî *
1
− Γ ij θ + θ̄ − Γij − Γ i j θ + θ̄ − Γi j Γ+ Π* ,
2
#− Ψ 2 #+ Ω = 2µi *Π −2θ − θ̄ + Γ− − Γ− Γ î θ + θ̄ − Γî + Γ− Γ î θ + θ̄ − ΠΓî *
1
+ Γ ij θ − θ̄ + Γij − Γ i j θ − θ̄ + Γi j Γ− Π* ,
2
#− Ψ #− Ω 2 = 2µi *ΠΓ− Γ î θ + θ̄ + Γî − Γ− Γ î θ + θ̄ + Γî Π* .
2
The Cartan 1-forms are the same as those in [8] except for $L^-$. From now on we use the
“x-coordinates” in (2.7) for simplicity, in which the bosonic Cartan 1-forms are given by
\[ e^+ = dx^+ - 2\mu^2 (x^{\hat\imath})^2\, dx^-, \qquad e^- = dx^-, \qquad e^{\hat\imath} = dx^{\hat\imath}, \qquad
 e^{*\hat\imath} = -4\sqrt{2}\,\mu^2 x^{\hat\imath}\, dx^-, \qquad \omega^{\hat\imath\hat\jmath} = 0. \tag{3.32} \]
We confirm that this action reduces to the following known ones:
1 2 −
+ Ψ− dθ + #− Ψ 2 #− Ω 2 dθ − + #− Ψ 2 #+ dθ + + o(µ2 ),
3!
L− = dθ − + o(µ2 ). (3.37)
The WZ term becomes
i
+ + − − 3
LWZ = T µ 2d θ̄ Γ− τ1 Π(L ) µ1 − 2d θ̄ Γ+ τ1 Π(L ) µ1 + o(µ )
4µ µ→0
1 1
= T i dx â θ̄ Γâ dθ + θ̄ Γ â τ3 dθ θ̄ Γ â dθ + θ̄ΠΓ î τ1 dθ θ̄ ΠΓ î * dθ
3 3
1 −
− θ̄ ΠΓ −ij τ1 dθ θ̄ ΠΓ −ij * dθ − θ̄ − ΠΓ −i j τ1 dθ θ̄ ΠΓ −i j * dθ
12
1
= T i dx â θ̄ Γâ dθ + θ̄ Γ â τ3 dθ θ̄ Γ â dθ , (3.38)
2
where the last equality is derived by using the relation in Section 2.3 of [17]. This action
is the Green–Schwarz superstring action in a flat space [13].
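\[ \ddot x^m = -\Gamma^m{}_{nl}\, \dot x^n \dot x^l \tag{4.3} \]
with the affine connection coefficients defined by $\Gamma^l{}_{nk} = \frac{1}{2} g^{lm}\bigl(-\partial_m g_{nk} + \partial_{(n} g_{k)m}\bigr)$.
For the pp-wave background the metric is given by
\[ g_{mn} = \begin{pmatrix} g_{++} & g_{+-} & g_{+\hat\jmath} \\ g_{-+} & g_{--} & g_{-\hat\jmath} \\ g_{\hat\imath +} & g_{\hat\imath -} & g_{\hat\imath\hat\jmath} \end{pmatrix}
 = \begin{pmatrix} 0 & 1 & 0 \\ 1 & -4\mu^2 x^2 & 0 \\ 0 & 0 & \delta_{\hat\imath\hat\jmath} \end{pmatrix}, \qquad
 g^{mn} = \begin{pmatrix} 4\mu^2 x^2 & 1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & \delta_{\hat\imath\hat\jmath} \end{pmatrix}, \tag{4.4} \]
the Hamiltonian in the $e = 1$ gauge is given by
\[ H_{PA} = \frac{1}{2}\Bigl( 2p_+ p_- + p^2 + (2\mu p_+)^2 x^2 \Bigr) \tag{4.5} \]
and the equations of motion are
\begin{align*}
\dot x^+ &= p_- + (2\mu)^2 x^2 p_+, & \dot x^- &= p_+, & \dot x &= p, \\
\dot p_- &= 0, & \dot p_+ &= 0, & \dot p &= -(2\mu p_+)^2 x. \tag{4.6}
\end{align*}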
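The transverse part of (4.6) describes a harmonic oscillator of frequency $2\mu p_+$, and the two matrices in (4.4) should be inverses of each other. Both statements can be checked numerically. The following is a minimal sketch, not part of the original derivation: it assumes a single transverse coordinate, illustrative parameter values, and hypothetical helper names (`metric`, `inverse_metric`, `matmul`, `integrate`, `exact`) introduced only for this example.

```python
import math


def metric(mu, x):
    """Lower-index metric g_mn of (4.4), one transverse direction, ordered (+, -, x)."""
    return [[0.0, 1.0, 0.0],
            [1.0, -4.0 * mu**2 * x**2, 0.0],
            [0.0, 0.0, 1.0]]


def inverse_metric(mu, x):
    """Upper-index metric g^mn of (4.4) in the same ordering."""
    return [[4.0 * mu**2 * x**2, 1.0, 0.0],
            [1.0, 0.0, 0.0],
            [0.0, 0.0, 1.0]]


def matmul(a, b):
    """Plain 3x3 matrix product."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def integrate(mu, p_plus, x0, p0, tau, steps=20000):
    """Leapfrog integration of the transverse equations in (4.6):
    dx/dtau = p, dp/dtau = -(2 mu p_+)^2 x, with p_+ constant."""
    omega2 = (2.0 * mu * p_plus) ** 2
    dt = tau / steps
    x, p = x0, p0
    for _ in range(steps):
        p -= 0.5 * dt * omega2 * x   # half kick
        x += dt * p                  # drift
        p -= 0.5 * dt * omega2 * x   # half kick
    return x, p


def exact(mu, p_plus, x0, p0, tau):
    """Harmonic solution with frequency omega = 2 mu p_+ (assumed nonzero)."""
    omega = 2.0 * mu * p_plus
    return x0 * math.cos(omega * tau) + (p0 / omega) * math.sin(omega * tau)
```

Calling `integrate(0.5, 2.0, 1.0, 0.3, 3.0)` and comparing the returned position with `exact(0.5, 2.0, 1.0, 0.3, 3.0)` gives agreement to within the leapfrog discretization error; the leapfrog scheme is chosen here only because it is simple and stable for oscillatory motion.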
The constants of motion are the global translation and boost charges $P_{\hat a}$ and $P^*_{\hat\imath}$, instead
of the canonical momenta $p_{\hat\imath}$. Their forms in terms of the canonical variables are calculated
from
4.2. Superparticle
L0 â = ẋ m Lm â + θ̇ µ Lµ â , Lm â ≡ em â + Θm µ Lµ â . (4.15)
ẋ + = π− + (2µ)2 x 2 p+ , ẋ − = p+ , ẋ î = πî ,
ṗ− = 0, ṗ+ = 0, ṗî = −ω2 xî + 2µ2 p+ ζ Γî Γ+ ,
θ̇ µ = p+ (Ξ− θ )µ + πî (Ξî θ )µ , ζ̇µ = −p+ (ζ Ξ− )µ − πî (ζ Ξî )µ . (4.26)
Using the facts $\dot\pi_- = 0$ and $\dot\pi = -\omega^2 x$, the second-order equations for $x^m$ take the same form as
for the bosonic particle (4.7). Furthermore, $\theta^\mu$ and $\zeta_\mu$ satisfy second-order harmonic equations
with frequency ω. This is shown by using $\partial_{\hat\imath} \Xi_- = -\Xi_- \Xi_{\hat\imath}$, $(\Xi_-)^2 = -4\mu^2 + 4\mu^2 x^{\hat\imath} \Xi_{\hat\imath}$ and
$\Xi_{\hat\imath} \Xi_{\hat\jmath} = 0$. Solutions are found in the same form as (4.8) and (4.12), with $p_-$, $p_{\hat\imath}$ replaced
by $\pi_-$, $\pi_{\hat\imath}$. The Hamiltonian is expressed as
HSUPA = EB + EF ,
1
EB = 2p+ π− + ω2 α 2 + α̃ 2 = const,
2
2
+ + sin(Ψ+ /2)
EF = 2p+ µi θ̄ Γ
2
* Πθ +
Ψ+ /2
2 2 − sin(Ψ+ 2) 2 − −
− 2p+ µ i θ̄ π/ x/ Γ θ = −EB , (4.27)
Ψ+ /2
where ‘+’ projected part of fermionic constraints (4.19) are used.
In terms of global charges the Hamiltonian can be written as the quadratic Casimir
operator of the super-pp-wave algebra which is obtained by the Penrose limit from the
quadratic Casimir operator of the super-AdS algebra:
1
csAdS = Pâ Pb̂ ηâb̂ + Mâ2b̂ − Qαα A * AB C −1αβ C −1α β Qββ B
4
â=0,...,9 â b̂=0,...,4
or 5,...,9
Penrose limit
î jˆ iη∗
−→ cspp(−2) = Pâ Pb̂ ηâb̂ + Pî∗ Pj∗ˆ η∗ − Q+ C −1 Γ+ Π* Q+
4
â=0,...,9 î=1,...,8
⇒ 2HSUPA (4.28)
î jˆ √ ˆ √
with η∗ = (2 2µ)2 δ î j and η∗ = 2 2µ. The supercharge is obtained by the Penrose limit
from the one of the super-AdS result [17]
i − −
+ + − î
sin(Ψ+ /2) 2
Q+ = ζ+ 1 − √ Γ+î θ θ̄ Γî Π* − p+ i θ̄ Γ + pî i θ̄ Γ
2 2 Ψ+ /2
Ψ+
× cos(2µx − ) − *Π sin(2µx − ) (4.29)
sin Ψ+
which is consistent with that the Hamiltonian (4.26) and the quadratic Casimir operator in
(4.28).
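It is difficult to solve the equation for $x$ except in the case where only the zero-mode term
$(p_{+,0}/T)^2$ is present. The solution is then the same as the light-cone result [18]:
\begin{align*}
x &= \alpha_0 \sin(\omega_0 \tau) + \tilde\alpha_0 \cos(\omega_0 \tau) + \frac{1}{\sqrt{2\pi}} \sum_{n \neq 0} \Bigl( \alpha_n e^{i(\omega_n \tau + 2n\sigma)} + \tilde\alpha_n e^{i(\omega_n \tau - 2n\sigma)} \Bigr), \\
\omega_n &= \mathrm{sgn}(n) \sqrt{(2\mu p_{+,0})^2 + (2n)^2}. \tag{4.40}
\end{align*}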
4.4. Superstring
Half of the above fermionic constraints generate κ-symmetry as shown in Section 3.2 for
the action (3.28).
Super-invariant (up to local Lorentz) combinations are Lâ1 and
π̃â = (e−1 )â m p̃m + ζ̃µ Ξm θ µ
T
= −√ −L0 b̂ G11 + L1 b̂ G01 ηb̂â ≡ (e−1 )â m π̃m
−G
δLWZ δLWZ β
p̃m = pm − β
Lm β , ζ̃µ = ζµ − β
Lµ .
δL0 δL0
The equations of motion for a superstring in the pp-wave background in the conformal
gauge, huv = ηuv , are obtained as
1 m
ẋ m = π̃ + (e−1 )â m Lµ â (τ3 θ )µ ,
T
ṗm = T Lm â L1â
1 n l T â −1 n b̂ −1 l
+ (∂m gnl ) π̃ π̃ − L1 (e )â L1 (e )b̂ − π̃ n Lµ â (e−1 )â l (τ3 θ )µ
2T 2
1 µ
− ζ (∂m Ξn )θ π̃ n − T x n (∂m Ξn )θ Lµ â L1â
T
δLWZ
− ∂m Lµ α (τ3 θ )µ , (4.48)
δL0 α
µ 1 m −1 m ν
θ̇ = −(τ3 θ ) + (Ξm θ )
µ µ
π̃ + (e )â Lν (τ3 θ ) ,
â
T
ζ̇µ = −(ζ τ3 )µ − T Lµ â L1â − π̃â Lµ â τ3
1 â ν
− (ζ Ξâ )µ π̃ + Lν (τ3 θ )
â
T
m ν ∂
left
+ T x (Ξm )ν µ Lν â − x (Ξm θ )ν + θ
m â
Lν L1â
∂θ µ
left
∂ left δLWZ α ν ∂
+ µ Lν (τ3 θ ) + π̃â Lν (τ3 θ )ν ,
â
(4.49)
∂θ δL0 α ∂θ µ
which are background-covariant.
In the components by using (4.4) and (4.24) the Hamiltonian in the conformal gauge is
rewritten as
H = H1 + H2 + H3 ,
1
H1 = dσ 2p+ π̃− + 4µ2 x 2 p+
2
+ π̃ 2 + p+ Lµ + (τ3 θ )µ
2T
ν
+ π̃− + 2µ2 x 2 p+ Lν − τ3 (θ − ) + π̃î Lµ î (τ3 θ )µ ,
&
T
H2 = dσ 2 (x + ) − 2µ2 x 2 (x − )
2
µ ν
+ θ + (x − ) Ξ− θ + x î Ξî θ − Lµ + (x − ) + (θ − ) Lν −
µ 2 ' δLWZ α
+ x î + θ + (x − ) Ξ− θ + x î Ξî θ − Lµ î + Lµ (τ 3 θ µ
) ,
δL0 α
H3 = dσ [−ζ τ3 θ ]. (4.50)
For a flat case $H_1$ and $H_2$ reduce simply to $\int (1/2T)\,p^2$ and $\int (T/2)\,x'^2$, respectively.
Equations of motion are written as
1 ν
ẋ + =π̃− + 4µ2 x 2 p+ + Lµ + (τ3 θ )µ + 2µ2 x 2 Lν − τ3 (θ − ) ,
T
1 ν 1
ẋ = p+ + Lν − τ3 (θ − ) ,
−
ẋ î = π̃î + Lµ î (τ3 θ )µ , (4.51)
T T
& '
ṗ− = T −2µ2 x 2 L1 − + (Ξ− θ )µ Lµ + L1 − + Lµ î L1 î
∂ δLWZ &
−
µ µ
'
+ dσ 2Lµ
α
# + Ξ − θ ẋ + Ξ θ ẋ î
− L µ
α
(τ 3 θ )
∂x − (σ ) δL0 α î
T
ν sin Ψ− sin Ψ−
ṗ+ = T (x − ) + (θ − ) Lν − + iT *Πθ − CΓ+ τ1 Π τ3 (θ − )
Ψ− Ψ−
ṗî = +T L1 î + Ξî θ − Lµ + L1 − + Lµ î L1 î
µ
%
1 −
− ν − −
− 4µ x p+
2 î
p+ + Lν τ3 (θ ) − (x ) L1
T
∂ δLWZ
− T x − (Ξ− Ξi θ − )µ Lµ + L1 − + Lµ î L1 î − dσ α
Lµ α (τ3 θ )µ
∂x î (σ ) δL0
T %
iT sin Ψ+ sin Ψ+ ˆ
− Ξî θ − CΓ− τ1 Π Ξ− θ ẋ − + Ξjˆ θ − ẋ j
µ Ψ+ Ψ+
T %
iT sin Ψ+ − sin Ψ+ ˆ
− (x ) Ξ− Ξî θ − CΓ− τ1 Π Ξ− θ ẋ − + Ξjˆ θ − ẋ j ,
µ Ψ+ Ψ+
(4.52)
+ λ
+ λ λ 1 −
− ν
(θ̇ ) = − τ3 (θ ) + (Ξ− θ ) p+ + Lν τ3 (θ )
T
1
+ (Ξî θ − )λ π̃î + Lµ î (τ3 θ )µ ,
T
θ̇ − = −τ3 (θ − ) ,
(ζ̇+ )λ = − (ζ+ ) τ3 λ
iT + sin Ψ+ 1 −
− µ
− ζ+ − L1 Γ− τ1 Π ν
(Ξ− #+ ) λ p+ + Lµ τ3 (θ )
µ Ψ+ ν T
µ
+ T − Lµ + L1 − + Lµ î L1 î (#+ )µ λ + (x − ) Ξ− + Lµ+ L1− + Lµ î L1 î
µ
− θ + (x − ) Ξ− θ + x î Ξî θ −
left left %
∂ + − ∂ jˆ jˆ
× L ν L 1 + Lν L 1
∂(θ + )λ ∂(θ + )λ
+ p+ Lν + (#+ )ν λ τ3 + π̃î Lν î (#+ )ν λ τ3
left left
∂ + ν ∂ jˆ
+ Lν (τ3 θ ) p+ + Lν (τ3 θ )ν π̃î
∂(θ + )λ ∂(θ + )λ
iT ∂ left +
+ dσ 1 Γ− τ1 Π Lµ + (τ3 θ )µ (σ )
L
2µ ∂(θ + )λ (σ ) +
%
iT sin Ψ+ T sin Ψ+ − − jˆ
+ CΓ− τ1 Π Ξ− θ ẋ + Ξjˆ θ ẋ
µ Ψ+ Ψ+
T %
iT sin Ψ+ − sin Ψ+ − − jˆ
− (x ) Ξ− CΓ− τ1 Π Ξ− θ ẋ + Ξjˆ θ ẋ , (4.53)
µ Ψ+ Ψ+
and second-order form equations become
The right-hand sides of the first and third equations are complicated functions of θ − or θ −
and θ + .
In this paper an alternative form of the gauge invariant action for the superstring in
the RR pp-wave background is proposed. It is explicit since the Wess–Zumino term is
bilinear with respect to the LI currents of the centrally extended super-pp-wave algebra. It
is obtained by the Penrose limit from the superstring action in the AdS background with the
bilinear WZ term [17]. The Penrose limit of the WZ term is given essentially as follows:
1. Rescale the coordinates in the LI 1-forms with a parameter Ω with suitable weights,
for example,
\[ L^-_{\mathrm{AdS}}\bigl(z^m \to \Omega^{N_m} z^m\bigr) = L^-_{(0)}(z^m) + \Omega^2 L^-_{(2)}(z^m) + o(\Omega^4). \tag{5.1} \]
2. Rescale the WZ term with the same weight as the Nambu–Goto term:
\[ \Omega^{-2}\, \bar L^-_{\mathrm{AdS}} \Gamma^- \Pi \tau_1 L^-_{\mathrm{AdS}}
 = \Omega^{-2} \Bigl\{ \bar L^-_{(0)} \Gamma^- \Pi \tau_1 L^-_{(0)} + \Omega^2\, 2 \bar L^-_{(0)} \Gamma^- \Pi \tau_1 L^-_{(2)} + o(\Omega^4) \Bigr\}. \]
3. Subtract the divergent term in the $\Omega \to 0$ limit, which is proportional to $1/\Omega^2$ and closed.
4. Take the $\Omega \to 0$ limit in the bilinear WZ term:
\[ L_{WZ,\mathrm{AdS}} \to L_{WZ,pp} = 2 \bar L^-_{(0)} \Gamma^- \Pi \tau_1 L^-_{(2)} + L^+\text{-dependent term}. \]
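Steps 1 to 4 amount to a simple power counting, which can be sketched as follows. The shorthand $M \equiv \Gamma^- \Pi \tau_1$ is introduced here only for brevity, and the sketch assumes that the bilinear form $\bar A M B$ is symmetric under exchange of the two 1-forms, which is what produces the factor 2 in step 2.

```latex
\begin{align*}
\Omega^{-2}\, \overline{\bigl(L^-_{(0)} + \Omega^2 L^-_{(2)}\bigr)}\, M\, \bigl(L^-_{(0)} + \Omega^2 L^-_{(2)}\bigr)
 &= \underbrace{\Omega^{-2}\, \bar L^-_{(0)} M L^-_{(0)}}_{\text{divergent, closed: subtracted}}
  + \underbrace{\bar L^-_{(0)} M L^-_{(2)} + \bar L^-_{(2)} M L^-_{(0)}}_{=\, 2 \bar L^-_{(0)} M L^-_{(2)}}
  + \Omega^{2}\, \bar L^-_{(2)} M L^-_{(2)} \\
 &\xrightarrow{\;\text{subtract},\ \Omega \to 0\;}\; 2\, \bar L^-_{(0)} M L^-_{(2)} .
\end{align*}
```

The finite part is therefore linear in the next-to-leading term $L^-_{(2)}$, which is why that term has to be retained in the limit even though it does not appear in the leading-order super-pp-wave algebra.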
1. Rescale coordinates in the LI 1-forms with a parameter Ω with suitable weights and
take the leading terms in the limit $\Omega \to 0$:
\[ L^-_{\mathrm{AdS}}\bigl(z^m \to \Omega^{N_m} z^m\bigr) \to L^-_{(0)}(z^m), \qquad
 L^+_{\mathrm{AdS}}\bigl(z^m \to \Omega^{N_m} z^m\bigr) \to \Omega^2 L^+_{(2)}(z^m). \tag{5.3} \]
2. Construct the WZ term in the conventional form given by an integral of the three-form [8]:
\[ \int d^2\sigma\, L_{WZ,\mathrm{AdS}} \to \int d^2\sigma\, L_{WZ,pp}
 = \int d^3\sigma\, \Bigl( \bar L^-_{(0)}\, \slashed{L}^+_{(2)}\, \tau_3 L^-_{(0)} + L^\pm\text{-dependent terms} \Bigr). \]
In this case the next-to-leading term $L^-_{(2)}$ need not be kept in taking the Penrose
limit of the WZ term. However, the resultant WZ term has an integral expression and does
not allow a simple treatment of the Hamiltonian and the equations of motion in covariant
gauges.
The Hamiltonians for the bosonic particle, superparticle, string and superstring in
the RR pp-wave background are obtained in the conformal gauge. The particle and the
superparticle Hamiltonians are identified with the quadratic Casimir operators of the pp-
wave and the super-pp-wave algebras respectively. Once the superparticle Hamiltonian
(4.18) is recognized as the super-pp-invariant “mass operator” of the superstring theory, all
states in a supergravity multiplet are “massless” and the supersymmetry is manifest.
The world-sheet reparametrization generators and the local fermionic constraints are
also obtained. The combinations of the reparametrization constraints (4.45) and (4.46),
H⊥ ± H , together with the first class part of FνI , where I = 1, 2 correspond to right/left (±) modes,
are expected to form a closed constraint algebra, namely the ABCD constraint system [22]. It
was shown that the local constraints of the AdS superstring satisfy the ABCD algebra
[19] as well as of the flat superstring. Since the super-pp-wave algebra is obtained by the
Penrose limit from the super-AdS algebra [5] as well as the flat algebra by the flat limit,
the local symmetry algebra is also expected to be obtained by the same limiting procedure
preserving the same structure of the ABCD algebra. The background independence of the
local symmetries is plausible.
Equations of motion in the conformal gauge are obtained and are background-covariant.
The equations of motion for x ± are obtained and will be important for taking into
account interactions. Quantization of the superstring theory in the conformal gauge may
require a suitable change of variables such as to the GL(4|4) matrix variable as was
done for the AdS5 × S5 case [16]. The covariant approach will be useful to examine
symmetry structures toward the covariant superstring field theory, S-T-U dualities which
are deeply related to the background symmetry and non-perturbative properties such as
BPS conditions.
The Cartan 1-forms in the AdS5 × S5 space are presented [17,18]. The left-invariant
Cartan one-forms of a coset SU(2, 2|4)/[SO(4, 1) × SO(5)] G = G(x, θ ) = exP eθQ are
defined by
1 1
G−1 dG = La Pa + La Pa + Lab Jab + La b Ja b + Lαα I Qαα I .
2 2
They are given by
sin(Ψ/2) 2
La = ea + iθ CC γ a Dθ,
Ψ/2
2
a a a sin(Ψ/2)
L = e − θ CC γ Dθ,
Ψ/2
sin(Ψ/2) 2
Lab = ωab − θ CC γ ab * Dθ,
Ψ/2
a b a b a b sin(Ψ/2) 2
L =ω + θ CC γ * Dθ,
Ψ/2
sin Ψ
Lα = Dθ,
Ψ
1 sinh(x/2) 2 [a b]
ω =
ab
dx x ,
2 x/2
1 sin(x /2) 2 [a b ]
ωa b = − dx x , (A.1)
2 x /2
where [ab] = ab − ba and charge conjugation matrix for AdS5 space and S5 space are C
and C , respectively, and
i a a
1 ab a b
Dθ = d − * γ ea + iγ ea + γ ωab + γ ωa b θ,
2 4
I αα I
θ CC γa ββ J − *γ a θ θ CC γa ββ J
αα
(Ψ 2 )αα I ββ J = *γ a θ
1 ab αα I 1 αα I
− γ θ θ CC γab * ββ J + γ a b θ θ CC γa b * ββ J .
2 2
(A.2)
After the Penrose limit using (2.4) and (3.9) they are written as (3.29). The relation between
the AdS variables and the Penrose variables is described in [5].
References
[1] M. Blau, J. Figueroa-O’Farrill, C. Hull, G. Papadopoulos, JHEP 0201 (2002) 047, hep-th/0110242.
[2] J. Kowalski-Glikman, Phys. Lett. B 134 (1984) 194;
C.M. Hull, Phys. Lett. B 139 (1984) 39;
P.T. Chrusciel, J. Kowalski-Glikman, Phys. Lett. B 149 (1984) 107;
J. Figueroa-O’Farrill, G. Papadopoulos, JHEP 0108 (2001) 036, hep-th/0105308.
[3] R. Penrose, Any space-time has a plane wave as a limit, in: M. Cahen, M. Flato (Eds.), Differential Geometry
and Relativity, Reidel, Dordrecht, 1976, p. 271.
[4] M. Blau, J. Figueroa-O’Farrill, C. Hull, G. Papadopoulos, Class. Quantum Grav. 19 (2002) L87, hep-th/0201081;
R. Güven, Phys. Lett. B 482 (2000) 255;
R. Güven, Phys. Lett. B 637 (1–3) (2002) 168, hep-th/0005061;
M. Blau, J. Figueroa-O’Farrill, G. Papadopoulos, Penrose limit, supergravity and brane dynamics, hep-th/0202111.
[5] M. Hatsuda, K. Kamimura, M. Sakaguchi, Nucl. Phys. B 632 (2002) 114, hep-th/0202190.
[6] M. Hatsuda, K. Kamimura, M. Sakaguchi, Nucl. Phys. B 637 (2002) 168, hep-th/0204002.
[7] D. Berenstein, J. Maldacena, H. Nastase, JHEP 0204 (2002) 013, hep-th/0202021.
[8] R.R. Metsaev, Nucl. Phys. B 625 (2002) 70, hep-th/0112044.
[9] A. Dabholkar, S. Parvizi, Dp branes in pp-wave background, Nucl. Phys. B 641 (2002) 223, hep-th/0203231;
A. Kumar, R.R. Nayak, Sanjay, D-brane solutions in pp-wave background, Phys. Lett. B 541 (2002) 183,
hep-th/0204025;
M. Alishahiha, A. Kumar, D-brane solutions from new isometries of pp-waves, Phys. Lett. B 542 (2002)
130, hep-th/0205134;
S.S. Pal, Solution to worldvolume action of D3 brane in pp-wave background, Mod. Phys. Lett. A 17 (2002)
1735, hep-th/0205303;
S. Hyun, H. Shin, Branes from matrix theory in pp-wave background, Phys. Lett. B 543 (2002) 115, hep-th/0206090;
D. Bak, Supersymmetric branes in pp wave background, hep-th/0204033;
K. Skenderis, M. Taylor, JHEP 0206 (2002) 025, hep-th/0204054;
H. Singh, M5-branes with 3/8 supersymmetry in pp-wave background, hep-th/0205020;
P. Bain, P. Meessen, M. Zamaklar, Supergravity solutions for D-branes in Hpp-wave backgrounds, hep-th/0205106;
O. Bergman, M.R. Gaberdiel, M.B. Green, D-brane interactions in type IIB plane-wave background, hep-
th/0205183;
Y. Hikida, Y. Sugawara, JHEP 0206 (2002) 037, hep-th/0205200;
64 M. Hatsuda et al. / Nuclear Physics B 644 (2002) 40–64
Received 1 July 2002; received in revised form 26 August 2002; accepted 29 August 2002
Abstract
We study supersymmetric pp-waves in M-theory, their dimensional reduction to D0-branes or
pp-waves in type IIA, and their T-dualisation to solutions in the type IIB theory. The general class of
pp-waves that we consider encompasses the Penrose limits of AdS$_p \times S^q$ with $(p, q) = (4, 7)$, $(7, 4)$,
$(3, 3)$, $(3, 2)$, $(2, 3)$, $(2, 2)$, but also includes many other examples that can again lead to exactly-solvable
massive strings, but which do not arise from Penrose limits. All the pp-waves in D = 11 have 16
“standard” Killing spinors, but in certain cases one finds additional, or “supernumerary,” Killing
spinors too. These give rise to linearly-realised supersymmetries in the string or matrix models.
A focus of our investigation is on the circumstances when the Killing spinors are independent of
particular coordinates ($x^+$ or transverse-space coordinates), since these will survive at the
field-theory level in dimensional reduction or T-dualisation.
2002 Elsevier Science B.V. All rights reserved.
1. Introduction
The Penrose limit [1] of the AdS$_5 \times S^5$ solution of type IIB theory is a pp-wave with
maximal supersymmetry [2,3]. This result is of considerable interest within the framework
of the AdS/CFT correspondence, since the pp-wave provides a background for which
the string theory action in light-cone gauge describes a massive free string, which is
0550-3213/02/$ – see front matter 2002 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 ( 0 2 ) 0 0 7 9 2 - 7
66 M. Cvetič et al. / Nuclear Physics B 644 (2002) 65–84
exactly solvable [4,5], thus allowing explicit comparisons with results in the dual gauge
theory [5]. The Penrose limit of the AdS$_4 \times S^7$ and AdS$_7 \times S^4$ solutions of M-theory also
gives rise to a maximally-supersymmetric pp-wave, obtained in [6], and this provides a
simple background for the DLCQ description of M-theory, and the corresponding matrix
model in this regime [5]. Subsequent papers have explored a variety of consequences and
generalisations of these observations [7–21].
In a previous paper [17], we studied a wider class of supersymmetric pp-waves in the
type IIB theory, generalising the maximally-supersymmetric one that arises as the Penrose
limit of AdS$_5 \times S^5$. In particular, we allowed for a more general structure of the constant
self-dual five-form field strength; these structures were motivated by the flat (orbifold)
limit of a special holonomy transverse space for the pp-waves. In fact any pp-wave within
the general class automatically has 16 Killing spinors, which we therefore denoted as
“standard” Killing spinors. In special cases one finds that there can be additional Killing
spinors, which we denoted as “supernumerary” Killing spinors. The maximum number,
16, of these is achieved for the Penrose limit of AdS$_5 \times S^5$. (In this class we also found
another example of the Penrose limit of AdS$_3 \times S^3$, arising from a D3/D3-intersection.)
The focus of our study in [17] was to determine the circumstances under which one
obtains supernumerary Killing spinors in the type IIB pp-waves. These are important when
one considers the exactly-solvable string models in the pp-wave backgrounds; we found
that it is the supernumerary Killing spinors that are in one-to-one correspondence with
the associated linearly-realised world-sheet supersymmetries of the corresponding string
action. In fact the string theory is solved by going to the light-cone gauge, with the $x^+$
coordinate in the pp-wave being set equal to the world-sheet time coordinate. In order that
the linearly-realised world-sheet supersymmetries be unbroken, it is necessary therefore
that the associated supernumerary Killing spinors be independent of the coordinate $x^+$,
which is indeed the case in all the type IIB pp-waves. For instance, all 16 supernumerary
Killing spinors in the Penrose limit of AdS$_5 \times S^5$ have this property [5,17].
A further significance of having Killing spinors in the type IIB pp-waves that are
independent of $x^+$ is that after performing a T-duality transformation on the $x^+$ coordinate
(which is always a Killing direction), the resulting type IIA solution will also be
supersymmetric. It can be lifted to M-theory, where it acquires an interpretation as a
supersymmetric deformed M2-brane, i.e., an M2-brane in which an additional 4-form
flux is turned on in the transverse space [22–28]. An intriguing feature of the deformed
M2-branes obtained by this T-dualisation procedure is that if any of the Killing spinors
originate from supernumerary Killing spinors (which are $x^+$-independent), then in the
M-theory picture they solve the Killing-spinor equations despite violating the criterion that
is usually applied [22,25,29] for testing whether a supersymmetry survives when the extra
4-form flux is turned on [17].
In this paper, we study supersymmetric pp-waves in M-theory. In particular, we allow
for rather general structures for the constant 4-form field strength of M-theory, motivated
from the flat (orbifold) limit of special holonomy transverse space for the pp-waves.
These possible structures fall into two classes. Focusing on the nature of supersymmetry,
we again find that there are always 16 “standard” Killing spinors, and that additional
“supernumerary” Killing spinors can arise in special cases. Unlike the case of pp-waves
in type IIB, however, it is no longer automatic that supernumerary Killing spinors are
\[
ds_{11}^2 = -4\, dx^+ dx^- + H\, (dx^+)^2 + dz_i^2 , \tag{1}
\]
\[
F_{(4)} = \mu\, dx^+ \wedge \Phi_{(3)} , \tag{2}
\]
where $\Phi_{(3)}$ is a harmonic 3-form in the flat nine-dimensional transverse space whose
metric is $dz_i^2$, $\mu$ is a constant, and we are taking $H$ here to depend only on $z_i$. In the
vielbein basis $e^+ = dx^+$, $e^- = -2\, dx^- + \tfrac12 H\, dx^+$, $e^i = dz_i$, for which the metric is
$ds_{11}^2 = 2 e^+ e^- + e^i e^i$, the spin connection is given by
\[
\omega_{+i} = \tfrac12\, \partial_i H\, e^+ , \qquad \omega_{-i} = \omega_{+-} = \omega_{ij} = 0 , \tag{3}
\]
and the only non-vanishing Riemann tensor components, in the vielbein basis, are
\[
R_{+i+j} = -\tfrac12\, \partial_i \partial_j H . \tag{4}
\]
This implies that the only non-vanishing Ricci-tensor component is $R_{++} = -\tfrac12 \Box H$,
where $\Box$ is the Laplacian of the flat transverse space. The
$D = 11$ supergravity equations are therefore satisfied if $H$ obeys the equation
\[
\Box H = -\tfrac16\, \mu^2\, |\Phi_{(3)}|^2 . \tag{5}
\]
In this paper we shall focus on the cases where $\Phi_{(3)}$ is a covariantly-constant 3-form. It is
sufficient for our purposes to take the solution for $H$ to be
\[
H = c_0 + \frac{Q}{r^7} - \sum_i \mu_i^2 z_i^2 , \tag{6}
\]
where $c_0$, $Q$ and $\mu_i$ are constants, and $r^2 \equiv z_i z_i$. It follows from (5) that the $\mu_i$ are subject
to the condition
\[
\sum_i \mu_i^2 = \tfrac{1}{12}\, \mu^2\, |\Phi_{(3)}|^2 . \tag{7}
\]
When $\mu = 0$ (and, hence, $\mu_i = 0$), the solution becomes a standard pp-wave in $D = 11$,
whose dimensional reduction gives a D0-brane in the type IIA theory, and the pp-wave
charge $Q$ becomes the charge of the D0-brane.
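As a rough numerical cross-check of how (6) leads to (7), the following sketch evaluates the flat nine-dimensional Laplacian of $H$ by central finite differences and compares it with $-2\sum_i \mu_i^2$, which is what (5) and (7) together require. The parameter values, the sample point and the helper names `H` and `laplacian` are illustrative assumptions, not taken from the paper.

```python
import math

def H(z, c0, Q, mu_sq):
    """Metric function (6): H = c0 + Q/r^7 - sum_i mu_i^2 z_i^2."""
    r = math.sqrt(sum(x * x for x in z))
    return c0 + Q / r**7 - sum(m * x * x for m, x in zip(mu_sq, z))

def laplacian(f, z, h=1e-3):
    """Central-difference estimate of the flat Laplacian of f at the point z."""
    total = 0.0
    for i in range(len(z)):
        z_plus = list(z)
        z_minus = list(z)
        z_plus[i] += h
        z_minus[i] -= h
        total += (f(z_plus) - 2.0 * f(z) + f(z_minus)) / h**2
    return total

# Illustrative (assumed) values of c0, Q, mu_i^2 and of the sample point.
c0, Q = 1.0, 3.0
mu_sq = [0.1 * (i + 1) for i in range(9)]
point = [1.0, -0.5, 0.7, 1.2, -0.3, 0.9, -1.1, 0.4, 0.6]

box_H = laplacian(lambda z: H(z, c0, Q, mu_sq), point)
expected = -2.0 * sum(mu_sq)   # Q/r^7 is harmonic in nine dimensions
print(box_H, expected)
```

The $Q/r^7$ term drops out because $r^{-7}$ is harmonic away from the origin in nine flat dimensions, so only the quadratic term contributes to $\Box H$.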
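The supercovariant derivative appearing in the supersymmetry transformation rule
$\delta\psi_M = D_M \epsilon$ is given by
\[
D_M = \nabla_M - \frac{1}{288} \left( \Gamma_M{}^{N_1 \cdots N_4} F_{N_1 \cdots N_4} - 8\, F_{M N_1 \cdots N_3} \Gamma^{N_1 \cdots N_3} \right) . \tag{8}
\]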
where we have defined $dz_{ijk} \equiv dz_i \wedge dz_j \wedge dz_k$. It should be noted that unless all four of
the $m_\alpha$ coefficients are non-zero in Case 1, it is in fact encompassed (after a relabelling of
coordinates) within Case 2.
It is straightforward to verify that if we construct $W$ as in (10), and write it as
\[
W = \sum_\alpha m_\alpha W_\alpha , \tag{16}
\]
where $W_\alpha$ denotes the individual $\Gamma_{ijk}$ structures (for example, $W_1 = i\,\Gamma_{129}$ is one of the
four structures in Case 1), then we shall have $[W_\alpha, W_\beta] = 0$. In consequence, for either
Case 1 or Case 2 we can choose a basis for the gamma matrices in which the $W_\alpha$ are all
diagonal. It is useful to have in mind such a diagonal choice of basis in the subsequent
discussion.
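The commutation property can be checked explicitly in a small representation. The sketch below is an illustration under stated assumptions: it uses a 16 × 16 representation of the nine transverse gamma matrices built from Pauli matrices (rather than the 32-component $D = 11$ spinors of the text), and it takes the four Case 1 structures to be $i\Gamma_{129}$, $i\Gamma_{349}$, $i\Gamma_{569}$ and $i\Gamma_{789}$, as suggested by the example $W_1 = i\Gamma_{129}$. All function names are hypothetical.

```python
from itertools import product

# Pauli matrices and the 2x2 identity as nested lists.
I2 = [[1, 0], [0, 1]]
S1 = [[0, 1], [1, 0]]
S2 = [[0, -1j], [1j, 0]]
S3 = [[1, 0], [0, -1]]

def kron(a, b):
    """Kronecker product of two matrices given as nested lists."""
    return [[x * y for x in row_a for y in row_b] for row_a in a for row_b in b]

def kron_all(factors):
    out = [[1]]
    for f in factors:
        out = kron(out, f)
    return out

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def gamma(i):
    """Hermitian 16x16 gamma matrix for transverse direction i = 1..9 (assumed basis)."""
    if i == 9:
        return kron_all([S3, S3, S3, S3])
    k, odd = divmod(i - 1, 2)  # k: tensor slot 0..3; odd selects sigma_2 over sigma_1
    return kron_all([S3] * k + [S2 if odd else S1] + [I2] * (3 - k))

def W_alpha(i, j):
    """i * Gamma_i Gamma_j Gamma_9, e.g. W_1 = i Gamma_129."""
    g = matmul(matmul(gamma(i), gamma(j)), gamma(9))
    return [[1j * x for x in row] for row in g]

Ws = [W_alpha(1, 2), W_alpha(3, 4), W_alpha(5, 6), W_alpha(7, 8)]

def commute(a, b):
    ab, ba = matmul(a, b), matmul(b, a)
    return all(abs(ab[r][c] - ba[r][c]) < 1e-12
               for r in range(16) for c in range(16))

all_commute = all(commute(a, b) for a in Ws for b in Ws)

# In this basis every W_alpha is diagonal, so the eigenvalues of
# W = sum_alpha m_alpha W_alpha can be read off the diagonal.
m = (1, 2, 4, 8)  # illustrative values
diag = sorted(round(sum(m[a] * Ws[a][r][r] for a in range(4)).real)
              for r in range(16))
signs = sorted(sum(s * x for s, x in zip(sg, m))
               for sg in product((1, -1), repeat=4))
print(all_commute, diag == signs)
```

In the 16-dimensional representation each sign combination $\pm m_1 \pm m_2 \pm m_3 \pm m_4$ appears once on the diagonal; the doubling to 32 components is not modelled here.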
For our canonical choices, one can see that if the $m_\alpha$ are taken equal then $\Phi_{(3)}$ in Case 1
can be expressed as $m\, dz_9 \wedge J$, where $J$ is the Kähler form for the eight-dimensional flat
space with metric $\sum_{i=1}^{8} dz_i^2$. Likewise, if the $m_\alpha$ are set equal in Case 2, $\Phi_{(3)}$ can be
expressed as $m\, \Psi_{(3)}$, where $\Psi_{(3)}$ is a $G_2$-invariant associative 3-form in the flat
seven-dimensional space with metric $\sum_{i=1}^{7} dz_i^2$.
The 16 standard Killing spinors correspond to taking $\chi = \chi_-$, i.e., they are defined
by $\Gamma_- \chi = 0$. It is evident from (9) and (11) that they are all independent of all of the $z_i$
coordinates. It is also evident from (12) that they will have $x^+$ dependence given by
\[
\chi = e^{\frac{i}{4} \mu x^+ W} \chi_0 , \tag{17}
\]
where $\chi_0$ is any constant spinor satisfying $\Gamma_- \chi_0 = 0$. If $W$ annihilates any of these spinors,
then the associated “standard” Killing spinor will be independent of $x^+$ (and so, in fact, it
will be independent of all the coordinates). The discussion now divides into two, according
to whether we take $\Phi_{(3)}$ to be given by (14) or (15):
Case 1:
For $\Phi_{(3)}$ given by (14), the eigenvalues of $W$ are
\[
\lambda_i = \pm m_1 \pm m_2 \pm m_3 \pm m_4 , \tag{18}
\]
where the ± choices are all independent. Each eigenvalue occurs twice, making 32 in
total. In the subspace of eigenspinors annihilated by $\Gamma_-$ one gets each eigenvalue once,
and likewise in the subspace annihilated by $\Gamma_+$. For the standard Killing spinors arising in
Case 1, it therefore follows that the possible numbers that are independent of $x^+$ can be
$N_{\rm stan} = 0$, 2, 4 or 8.
The number $N_{\rm stan} = 0$ is achieved for generic choices of the $m_\alpha$; $N_{\rm stan} = 2$ is achieved
for choices where $m_1 + m_2 + m_3 + m_4 = 0$; $N_{\rm stan} = 4$ is achieved for choices with $m_4 = 0$
and $m_1 + m_2 + m_3 = 0$ (or permutations); and $N_{\rm stan} = 8$ is achieved for choices with
$m_3 = m_4 = 0$ and $m_1 + m_2 = 0$ (or permutations).
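The counting of $N_{\rm stan}$ can be reproduced by enumerating the sixteen sign choices in (18) and counting the vanishing eigenvalues. This is only a sketch: the function name and the sample values of the $m_\alpha$ are illustrative choices, picked so that no accidental extra zeros occur.

```python
from itertools import product

def n_standard_x_plus_independent(m):
    """Count zero eigenvalues of W among the 16 sign choices in (18).

    Each sign choice occurs once in the subspace annihilated by Gamma_-,
    so this is the number of x^+-independent standard Killing spinors.
    """
    return sum(1 for signs in product((1, -1), repeat=4)
               if sum(s * x for s, x in zip(signs, m)) == 0)

examples = {
    "generic": (1, 2, 4, 8),
    "m1+m2+m3+m4=0": (1, 2, 3, -6),
    "m4=0, m1+m2+m3=0": (1, 2, -3, 0),
    "m3=m4=0, m1+m2=0": (1, -1, 0, 0),
}
counts = {label: n_standard_x_plus_independent(m) for label, m in examples.items()}
print(counts)
```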
Case 2:
When Φ(3) is given by (15), we find that again W generically has sixteen different
eigenvalues, each occurring twice. One copy of the sixteen again occurs in each of the Γ−
\[
\lambda_8 = m_1 + m_2 + m_3 - m_4 + m_5 + m_6 + m_7 , \tag{19}
\]
and $\lambda_i$ for $1 \le i \le 7$ is given by reversing the sign of each $m_\alpha$ that occurs as a coefficient of
any term containing the gamma matrix $\Gamma_i$. The numbers of standard Killing spinors that are
independent of $x^+$ that can be achieved for these Case 2 examples are therefore $N_{\rm stan} = 2n$,
where $0 \le n \le 6$ is the number of eigenvalues $\lambda_i$ that are arranged to vanish by choosing the $m_\alpha$
appropriately. Thus for Case 2 we can have $N_{\rm stan} = 0$, 2, 4, 6, 8, 10 or 12 standard Killing
spinors that are independent of $x^+$.
\[
X_i \equiv \Gamma_i W \Gamma_i + 3W \tag{21}
\]
is also diagonal, and therefore that (20) can be rewritten as
\[
\left( \mu^2 X_i^2 - 144\, \mu_i^2 \right) \Gamma_i \chi = 0 , \qquad \text{for each } i . \tag{22}
\]
From (11), it now follows that the solutions of (22) will give the supernumerary Killing
spinors, with $z_i$ dependence given by
\[
\epsilon = \left( 1 - \tfrac{i}{2}\, \mu_i z_i\, \Gamma_- \Gamma_i \right) \chi . \tag{23}
\]
In particular, this means that a supernumerary Killing spinor is independent of a given
coordinate $z_i$ if and only if the associated coefficient $\mu_i$ in (6) is zero.
The discussion of the supernumerary Killing spinors now divides into the two
possibilities for Φ(3) , given by (14) or (15).
Case 1:
In this case $\Phi_{(3)}$ is given by (14). In the direction $i = 9$ we have $X_9 = 4W$, and so
$\mu_9^2 = \tfrac19 \mu^2 \lambda^2$, where $\lambda$ is one of the eigenvalues of $W$ given in (18). Without loss of
generality, since the other eigenvalues differ only in sign permutations of the $m_\alpha$, we can
take
\[
\mu_9^2 = \tfrac19\, \mu^2 (m_1 + m_2 + m_3 + m_4)^2 . \tag{24}
\]
The remaining $\mu_i$ for $1 \le i \le 8$ are then given by
\[
\begin{aligned}
\mu_1^2 = \mu_2^2 &= \tfrac{1}{36}\, \mu^2 (-2m_1 + m_2 + m_3 + m_4)^2 , \\
\mu_3^2 = \mu_4^2 &= \tfrac{1}{36}\, \mu^2 (m_1 - 2m_2 + m_3 + m_4)^2 , \\
\mu_5^2 = \mu_6^2 &= \tfrac{1}{36}\, \mu^2 (m_1 + m_2 - 2m_3 + m_4)^2 , \\
\mu_7^2 = \mu_8^2 &= \tfrac{1}{36}\, \mu^2 (m_1 + m_2 + m_3 - 2m_4)^2 .
\end{aligned} \tag{25}
\]
To see this, we note that if $(X_9 - \kappa_9)\Gamma_9 \chi = 0$ then $4W\chi = \kappa_9 \chi$. We are taking $\kappa_9 =
4(m_1 + m_2 + m_3 + m_4)$. Substituting into $(X_1 - \kappa_1)\Gamma_1 \chi = 0$ we therefore find $\tfrac14 \kappa_9 \chi +
3W'\chi - \kappa_1 \chi = 0$, where $W' = \Gamma_1 W \Gamma_1$. From (14) and (10) it follows that the diagonal
matrix $W'$ has eigenvalues that are just those of $W$ but with $m_2$, $m_3$ and $m_4$ reversed in sign,
and so $W'\chi = (m_1 - m_2 - m_3 - m_4)\chi$. Thus we deduce that $\kappa_1 = 2(2m_1 - m_2 - m_3 - m_4)$.
Applying an analogous argument for each direction $i$, we arrive at (25).
For a generic choice of the constants mα , there are precisely two supernumerary Killing
spinors. This is because a given bosonic solution has fixed values for the coefficients µα ,
and so there are two solutions to (20) since there is a twofold degeneracy in (25). In special
cases, where the mα are chosen so that two or more of the expressions in (25) are equal,
there can therefore be more solutions of (20). It is an elementary exercise to enumerate
all the possible numbers of supernumerary supersymmetries that can be achieved for
specific choices of mα . As in the case of type IIB pp-wave solutions, the supernumerary
supersymmetries can lead to a variety of “non-standard” fractions of total supersymmetry
that exceed 1/2 [17].
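The enumeration mentioned above can be sketched as follows. The assumption made here is that each of the sixteen independent sign choices for the $m_\alpha$ labels one candidate spinor, and that a candidate is a supernumerary Killing spinor of a given bosonic solution precisely when it reproduces the same set of $\mu_i^2$ in (24) and (25) as the reference (all plus) choice. The function names and sample values are hypothetical.

```python
from fractions import Fraction
from itertools import product

def mu_squared(m, signs=(1, 1, 1, 1)):
    """mu_i^2 / mu^2 from (24)-(25), with each m_alpha multiplied by a sign.

    Returns (mu_1^2 = mu_2^2, mu_3^2 = mu_4^2, mu_5^2 = mu_6^2,
             mu_7^2 = mu_8^2, mu_9^2).
    """
    signed = [s * x for s, x in zip(signs, m)]
    total = sum(signed)
    pairs = [Fraction((total - 3 * x) ** 2, 36) for x in signed]
    return tuple(pairs) + (Fraction(total ** 2, 9),)

def n_supernumerary(m):
    """Number of sign choices giving the same mu_i^2 as the all-plus choice."""
    reference = mu_squared(m)
    return sum(1 for signs in product((1, -1), repeat=4)
               if mu_squared(m, signs) == reference)

n_generic = n_supernumerary((1, 2, 4, 8))    # generic m_alpha
n_two = n_supernumerary((1, 3, 0, 0))        # only m1, m2 non-zero
n_one = n_supernumerary((1, 0, 0, 0))        # only m1 non-zero
print(n_generic, n_two, n_one)
```

With these inputs the generic case returns two spinors, in line with the twofold degeneracy noted above, while switching off some of the $m_\alpha$ increases the count.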
It is worth remarking that for Case 1, as a consequence of the equation $X_9 = 4W$, it
follows that supernumerary Killing spinors are independent of $x^+$ if and only if they are
independent of $z_9$.
Case 2:
When the 3-form $\Phi_{(3)}$ is given by (15), it follows that we shall have $X_8 = X_9 = 2W$,
and so from (22) we shall have $\mu_8^2 = \mu_9^2 = \tfrac{1}{36} \mu^2 \lambda^2$, where $\lambda$ is one of the eigenvalues
of $W$. These are now given by $\lambda_8$ in (19), together with $\lambda_i$ for $1 \le i \le 7$ as described below
(19). Without loss of generality, since the $\mu_\alpha$ have not yet been specified, we may choose
\[
\mu_8^2 = \mu_9^2 = \tfrac{1}{36}\, \mu^2 \lambda_8^2 = \tfrac{1}{36}\, \mu^2 (m_1 + m_2 + m_3 - m_4 + m_5 + m_6 + m_7)^2 . \tag{26}
\]
Having obtained the M-theory pp-waves, we can dimensionally reduce the solutions
to $D = 10$, giving rise to D0-branes if we reduce on the $x^+$ coordinate, or to type IIA
pp-waves if we reduce instead on any of the $z_i$ coordinates. Of course a reduction on a
particular $z_i$ coordinate is possible only if it is a Killing direction, which means that the
associated coefficient $\mu_i$ in the metric function $H$ must vanish.$^4$
First let us consider a reduction on $x^+$. This is a Killing direction for all pp-wave
solutions. However, some or all of the Killing spinors in a given solution may be dependent
on the $x^+$ coordinate, in which case they will not survive in the reduction of the solution
to type IIA supergravity. As we saw from (12), the criterion for a Killing spinor to be
independent of $x^+$ is that it should be annihilated by $W$. For the standard Killing spinors,
the fraction of the 16 Killing spinors that will survive the reduction on $x^+$ depends on
the detailed structure of $W$. The 16 standard Killing spinors exist for any solution for $H$,
subject to (5). In particular, they exist when the D0-brane charge $Q$ is turned on. The
supernumerary Killing spinors, on the other hand, are all eigenvectors of $W$ with the
same eigenvalue, and hence they are all $x^+$-independent if $W\chi = 0$, but $x^+$-dependent if
$W\chi \neq 0$. The supernumerary Killing spinors exist only if $Q = 0$ and the $\mu_i$ are distributed
appropriately.
For a reduction on one of the transverse coordinates $z_i$, the corresponding constant $\mu_i$ in
the expression for $H$ in the metric (1) must vanish, in order that $\partial/\partial z_i$ be a Killing vector.
As we saw in (23), the Killing spinors are then also independent of the coordinate $z_i$, and,
hence, they will all survive in the reduction. In Section 2.2, it was observed that the 16
standard Killing spinors are all independent of $z_i$.
When $\Phi_{(3)}$ is contained within the Case 1 in (14), the direction $z_9$ is singled out. It
was observed in the discussion of Case 1 in Section 2.3 that if $\mu_9 = 0$ then we have also
$W\chi = 0$, and so this implies that if $z_9$ is a Killing direction then the supernumerary Killing
spinors will not only be independent of $z_9$, but also of $x^+$.
In Case 2, where $\Phi_{(3)}$ is given by (15), the directions $z_8$ and $z_9$ are singled out. If we
arrange for $\mu_8 = \mu_9 = 0$ (they are always equal), then as shown in Section 2.3 we also
have $W\chi = 0$, and so the supernumerary Killing spinors will be independent of $x^+$ as well
as the reduction coordinate $z_8$ or $z_9$.
4 There are, of course, many other Killing vectors in the pp-wave metric, which could be used for Kaluza–
Klein reduction, but we are not considering these here (see, however, Section 2.5). Some examples are discussed
in [20]; these typically give rise to an extra (constant) flux in the lower dimension, coming from a non-vanishing
Kaluza–Klein vector potential.
In fact these various reductions to type IIA can be related to the general type IIB
pp-waves obtained in [17], by means of T-duality. If we dimensionally reduce on $z_9$ in
Case 1, or on $z_8$ or $z_9$ in Case 2, which is possible if parameters are chosen so that the
corresponding $\mu_i$ coefficient vanishes, the resulting supernumerary Killing spinors are
all independent of $x^+$. This implies that the type IIA string action then has linearly-realised
supersymmetries.
We can also obtain large classes of type IIA pp-wave solutions in which other $\mu_i$
parameters are instead zero. (That is to say, $\mu_i$ other than $\mu_9$ in Case 1, or $\mu_8 = \mu_9$ in
Case 2.) In these circumstances, there can exist supernumerary Killing spinors that are
dependent on $x^+$. One would then obtain a type IIA solution where some, or all, of the
world-sheet supersymmetries were non-linearly realised.
Here we consider an explicit example where only m1 and m2 are non-vanishing. (This
can equally well be for either Case 1 in (14) or Case 2 in (15), since the two are then
equivalent after coordinate relabellings.) Taking m1 and m2 as given, we can then arrange
for two choices for the µi coefficients that will give rise to supernumerary Killing spinors.
These are summarised in the following table.
The second choice is nothing but the first, with one of the two $m_i$ reversed in sign.
However, if the $m_\alpha$ are given, fixed parameters, then these two choices for the $\mu_\alpha$
correspond to two independent solutions.$^5$
For generic but fixed values of $m_\alpha$, each choice in Table 1 gives rise to 8 supernumerary
Killing spinors. When $m_1 = m_2$ in the second choice, the 8 supernumerary Killing spinors
are all independent of $x^+$. This then corresponds to the Penrose limit of the M2/M5-brane
system (AdS$_3 \times S^3 \times T^4$). By contrast, in the first choice in Table 1 the 8 supernumerary
Killing spinors depend on the $x^+$ coordinate. For both choices, 8 of the 16 standard Killing
spinors are independent of $x^+$. Another special case arises when either $m_1 = \pm 2m_2$ or
$m_2 = \pm 2m_1$, which implies that $\mu_1 = \mu_2 = 0$ or $\mu_3 = \mu_4 = 0$. Interestingly enough, this
case is T-dual to the maximally supersymmetric pp-wave arising from the Penrose limit
of AdS$_5 \times S^5$. Suppose, for example, we have the choice giving $\mu_1 = \mu_2 = 0$. We can
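Table 1
Choices for $\mu_i$ with fixed $m_1$ and $m_2$ that give supernumerary Killing spinors
\[
\begin{array}{ccccc}
\mu_1^2 = \mu_2^2 & \mu_3^2 = \mu_4^2 & \mu_5^2 = \mu_6^2 & \mu_7^2 = \mu_8^2 & \mu_9^2 \\
\hline
\tfrac{1}{36}(-2m_1 + m_2)^2 & \tfrac{1}{36}(m_1 - 2m_2)^2 & \tfrac{1}{36}(m_1 + m_2)^2 & \tfrac{1}{36}(m_1 + m_2)^2 & \tfrac{1}{9}(m_1 + m_2)^2 \\
\tfrac{1}{36}(2m_1 + m_2)^2 & \tfrac{1}{36}(m_1 + 2m_2)^2 & \tfrac{1}{36}(m_1 - m_2)^2 & \tfrac{1}{36}(m_1 - m_2)^2 & \tfrac{1}{9}(m_1 - m_2)^2
\end{array}
\]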
5 In our previous discussion, we adopted an “active” viewpoint when discussing the possible occurrences of
supernumerary Killing spinors, rather than the “passive” viewpoint we are adopting here. Namely, we previously
took a fixed choice for how the eigenvalues were to be expressed in terms of the mα , and then covered the
spectrum of possibilities by allowing the mα to be chosen freely. The two viewpoints are clearly equivalent, if
appropriate care is taken.
then reduce on z1 and T-dualise on z2 . The resulting type IIB solution is the maximally-
supersymmetric pp-wave, in a slightly non-standard coordinate system that was introduced
in [20] to make certain Killing directions in the transverse space manifest. The reverse
procedure of T-dualisation and lifting was performed in [20], to give the pp-wave in
D = 11.
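The special cases of Table 1 discussed above can be tabulated with a few lines of code. This is a sketch only: the function `table1_row`, its dictionary keys and the integer sample values are illustrative, and the entries are the Table 1 expressions as printed, with the second choice obtained from the first by reversing the sign of $m_2$.

```python
from fractions import Fraction

def table1_row(m1, m2, choice):
    """Return the Table 1 entries for choice 1 or choice 2.

    Choice 2 is choice 1 with m2 reversed in sign.
    """
    if choice == 2:
        m2 = -m2
    s = m1 + m2
    return {
        "mu1^2=mu2^2": Fraction((-2 * m1 + m2) ** 2, 36),
        "mu3^2=mu4^2": Fraction((m1 - 2 * m2) ** 2, 36),
        "mu5^2=mu6^2": Fraction(s ** 2, 36),
        "mu7^2=mu8^2": Fraction(s ** 2, 36),
        "mu9^2": Fraction(s ** 2, 9),
    }

# m1 = m2 in the second choice: mu_9 vanishes, so z_9 is a Killing direction.
equal_second = table1_row(1, 1, choice=2)

# m2 = 2 m1 in the first choice: mu_1 = mu_2 = 0.
double_first = table1_row(1, 2, choice=1)

# m1 = 2 m2 in the first choice: mu_3 = mu_4 = 0.
double_first_swapped = table1_row(2, 1, choice=1)

print(equal_second["mu9^2"], double_first["mu1^2=mu2^2"],
      double_first_swapped["mu3^2=mu4^2"])
```

The vanishing of $\mu_9$ for $m_1 = m_2$ in the second choice ties in with the Case 1 statement that supernumerary Killing spinors are independent of $x^+$ if and only if they are independent of $z_9$.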
\[
\Phi_{(3)} = \mu \left( dz_{129} + dz_{349} + dz_{569} \right) . \tag{30}
\]
In this case there are the 16 standard Killing spinors plus 4 supernumerary Killing spinors,
giving a total of 20 in all. All the Killing spinors depend on $x^+$, and, hence, after reduction
on $x^+$ to type IIA the resulting D0-brane will have no supersymmetry. There are also
isometries in the $z_i$ coordinates with $i = 1, \ldots, 6$. Since none of the Killing spinors
depends on any of these coordinates, the type IIA pp-wave that results from reducing
instead on one of these will have all 20 Killing spinors.
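These statements can be checked against the Case 1 formulas. The sketch below assumes that (30) corresponds to Case 1 with $m_1 = m_2 = m_3$ and $m_4 = 0$ (in units where the common value is 1), and uses (18), (24) and (25); the variable names are illustrative.

```python
from fractions import Fraction
from itertools import product

# Coefficients of dz129, dz349, dz569, dz789 (assumed reading of (30)).
m = (1, 1, 1, 0)
total = sum(m)

# mu_i^2 / mu^2 from (24)-(25): one value per pair (z1,z2), (z3,z4),
# (z5,z6), (z7,z8), followed by the value for z9.
mu_sq_pairs = [Fraction((total - 3 * x) ** 2, 36) for x in m]
mu_sq_9 = Fraction(total ** 2, 9)

# Eigenvalues of W for the 16 sign choices in (18).
eigenvalues = [sum(s * x for s, x in zip(signs, m))
               for signs in product((1, -1), repeat=4)]

print(mu_sq_pairs, mu_sq_9, 0 in eigenvalues)
```

The first three entries vanish, which is the statement that $z_1, \ldots, z_6$ are Killing directions; no eigenvalue of $W$ is zero and $\mu_9 \neq 0$, consistent with all the Killing spinors depending on $x^+$.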
The M2/M2/M5/M5 brane intersection system gives an AdS$_2 \times S^2$ in its near-horizon
limit. The Penrose limit is given by
\[
H = c_0 - \mu^2 \left( z_1^2 + z_6^2 \right) , \qquad
\Phi_{(3)} = \mu \left( dz_{123} + dz_{145} + dz_{246} + dz_{356} \right) . \tag{31}
\]
In this case, 4 out of the 16 standard Killing spinors are independent of $x^+$. Additionally,
there are 4 supernumerary Killing spinors, which are all $x^+$-independent.
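The fermionic coordinates $\theta$ are non-chiral, and can be written as $\theta = \begin{pmatrix} \theta^1 \\ \theta^2 \end{pmatrix}$, where
$\Gamma_{11} \theta = \begin{pmatrix} \theta^1 \\ -\theta^2 \end{pmatrix}$. In a notation adapted to the passage to light-cone gauge, we can introduce
world-sheet Dirac matrices $\varrho_i$, with $\varrho_0 = -i\tau_2$, $\varrho_1 = \tau_1$ and $\varrho_2 = \tau_3$, where $\tau_i$ are the
Pauli matrices. The $\varrho_i$ act on the upper and lower 16 components $\theta^1$ and $\theta^2$ of the column
vector $\Psi = \begin{pmatrix} \theta^1 \\ \theta^2 \end{pmatrix}$, and $\varrho_2$ is the chirality operator. The conjugate spinor in this notation
is then $\bar\Psi = \Psi^\dagger \Gamma_0 \varrho_0$, and we therefore have the “dictionary” $\bar\theta O \theta \to -\bar\Psi \varrho_0 O \Psi$ and
$\bar\theta \Gamma_{11} O \theta \to \bar\Psi \varrho_1 O \Psi$, where $O$ is any matrix or operator constructed from the $\Gamma_i$ matrices.
In the light-cone gauge, where $X^+ = \tau$, $\Gamma_- \theta = 0$ and $\sqrt{-h}\, h^{ij} = \eta^{ij}$, the fermionic part
of the type IIA Green–Schwarz action (32), therefore, becomes
\[
L_F = i \bar\Psi \Gamma_+ \slashed{D} \Psi - \tfrac{i}{4}\, \bar\Psi \varrho_1 \Gamma_+ \slashed{F}_{(3)} \Psi - \tfrac{i}{4}\, e^{\phi}\, \bar\Psi \Gamma_+ \left( \varrho_1 \slashed{F}_{(2)} - \varrho_0 \slashed{F}_{(4)} \right) \Psi , \tag{35}
\]
where we have defined
\[
\slashed{F}_{(2)} \equiv \Gamma^i F_{+i} , \qquad \slashed{F}_{(3)} \equiv \tfrac12\, \Gamma^{ij} F_{+ij} , \qquad \slashed{F}_{(4)} \equiv \tfrac16\, \Gamma^{ijk} F_{+ijk} . \tag{36}
\]
In the pp-wave backgrounds we are considering here, the world-sheet Dirac operator $\slashed{D}$ just
reduces to $\slashed{\partial}$ in the light-cone gauge.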
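The algebra of the world-sheet Dirac matrices quoted above can be verified directly. The sketch below writes the three matrices as $-i\tau_2$, $\tau_1$ and $\tau_3$ (named `rho0`, `rho1`, `rho2` here) and checks that the first two generate a two-dimensional Clifford algebra with one negative and one positive square, and that the third anticommutes with both. The helper names are illustrative.

```python
# Pauli matrices as 2x2 nested lists.
tau1 = [[0, 1], [1, 0]]
tau2 = [[0, -1j], [1j, 0]]
tau3 = [[1, 0], [0, -1]]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def scale(c, a):
    return [[c * x for x in row] for row in a]

def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def anticommutator(a, b):
    return add(matmul(a, b), matmul(b, a))

rho0 = scale(-1j, tau2)   # equals [[0, -1], [1, 0]]
rho1 = tau1
rho2 = tau3

zero = [[0, 0], [0, 0]]
identity = [[1, 0], [0, 1]]

rho0_squared = matmul(rho0, rho0)          # minus the identity
rho1_squared = matmul(rho1, rho1)          # plus the identity
mixed = anticommutator(rho0, rho1)         # vanishes
chirality_product = matmul(rho1, rho0)     # reproduces rho2 = tau3
print(rho0_squared, rho1_squared, mixed, chirality_product)
```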
If the dimensional reduction from the D = 11 pp-wave to D = 10 is performed on a
Killing direction zi whose differential dzi does not appear in the expression for Φ(3) in
D = 11, then the solution in D = 10 is a pp-wave with only the RR 4-form F(4) as a
source. This situation can be achieved for any Φ(3) contained within Case 2, provided that
one reduces on the z8 or z9 coordinate. Since Case 1 is encompassed by Case 2 (after
appropriate coordinate relabellings) in all situations except where all four mα are non-
vanishing, it is only in this last circumstance that one is forced into a dimensional reduction
in which the differential dzi of the reduction coordinate is present in Φ(3) in D = 11. Of
course in other cases too, one may choose to perform the dimensional reduction on such a
coordinate zi , provided that the associated coefficient µi in the quadratic metric function
H vanishes, implying that ∂/∂zi is a Killing vector.
Let us first consider the case where the differential $dz_i$ of the reduction coordinate $z_i$
does not appear in $\Phi_{(3)}$ in $D = 11$. It then follows from (32) and (35) that after choosing
the light-cone gauge, the associated type IIA string action will be$^6$
\[
L = \sum_{i=1}^{8} \left( \tfrac12\, \dot z_i^2 - \tfrac12\, z_i'^2 - \tfrac12\, \mu_i^2 z_i^2 \right) + \bar\Psi \left( i \slashed{\partial} + \tfrac14\, \mu \varrho_0 W \right) \Gamma_+ \Psi . \tag{37}
\]
6 See also [32] for a related discussion of the type IIA Green–Schwarz action in a gravitational wave
background.
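The bosonic part of (37) describes eight free world-sheet fields with masses $\mu_i$, so each Fourier mode oscillates with frequency $\omega_n = \sqrt{n^2 + \mu_i^2}$. The sketch below assumes that the world-sheet coordinate $\sigma$ has period $2\pi$, so that $n$ is an integer, and uses the normalisation of (37) as printed; the function names are hypothetical.

```python
import math

def mode_frequency(n, mu_i):
    """omega_n = sqrt(n^2 + mu_i^2) for the n-th Fourier mode of z_i."""
    return math.sqrt(n * n + mu_i * mu_i)

def eom_residual(n, mu_i, tau, sigma):
    """Residual of d^2z/dtau^2 - d^2z/dsigma^2 + mu_i^2 z for a plane-wave mode.

    The trial solution is z = cos(omega_n * tau - n * sigma), whose second
    derivatives are known in closed form.
    """
    omega = mode_frequency(n, mu_i)
    z = math.cos(omega * tau - n * sigma)
    z_tautau = -omega**2 * z
    z_sigmasigma = -n**2 * z
    return z_tautau - z_sigmasigma + mu_i**2 * z

# Illustrative values: a field of mass 0.5 and its first few modes.
frequencies = [mode_frequency(n, 0.5) for n in range(4)]
residual = eom_residual(2, 0.5, tau=0.3, sigma=1.1)
print(frequencies, residual)
```

The zero mode has frequency $\mu_i$, so a direction with $\mu_i = 0$ behaves as in flat space.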
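in $D = 10$, where $z$ is the reduction coordinate. We now have the non-vanishing NS–NS
field $F_{(3)}$ in the type IIA background, together, possibly, with a non-vanishing RR 4-form
$F_{(4)}$. If the reduction of the 3-form $\Phi_{(3)}$ is written as $\Phi_{(3)} \to \Phi_{(3)} + dz \wedge \Phi_{(2)}$, and if we
define
\[
Y \equiv \tfrac{i}{2}\, \Phi_{ij} \Gamma^{ij} , \tag{39}
\]
then it follows from (32) and (35) that after choosing the light-cone gauge, we shall obtain
the string action
\[
L = \sum_{i=1}^{8} \left( \tfrac12\, \dot z_i^2 - \tfrac12\, z_i'^2 + \mu z_i' B_i - \tfrac12\, \mu_i^2 z_i^2 \right) + \bar\Psi \left( i \slashed{\partial} + \tfrac14\, \mu \varrho_0 W + \tfrac14\, \mu \varrho_0 Y \right) \Gamma_+ \Psi . \tag{40}
\]
Here, $B_i$ denotes the components of the 1-form $B_{(1)}$ whose exterior derivative gives
$\Phi_{(2)} = dB_{(1)}$. Since $\Phi_{(2)} = \tfrac12 \Phi_{ij}\, dz_i \wedge dz_j$ where $\Phi_{ij}$ are constants, we may, therefore,
take $B_i$ to be given by
\[
B_i = \tfrac12\, \Phi_{ji} z_j . \tag{41}
\]
Thus the string action in this case is given by
\[
\begin{aligned}
L ={}& \sum_{i=1}^{8} \left[ \tfrac12\, \dot z_i^2 - \tfrac12 \left( z_i' - \tfrac12\, \mu \Phi_{ij} z_j \right)^2 - \tfrac12\, \mu_i^2 z_i^2 \right] + \tfrac18\, \mu^2 \Phi_{ik} \Phi_{jk} z_i z_j \\
& + \bar\Psi \left( i \slashed{\partial} + \tfrac14\, \mu \varrho_0 W + \tfrac14\, \mu \varrho_0 Y \right) \Gamma_+ \Psi .
\end{aligned} \tag{42}
\]
Thus the boson masses, as well as the fermion masses, are modified by the presence of the
NS–NS 3-form field. This is a generalisation of a result obtained in [5].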
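As an example, let us consider the pp-wave in $D = 11$ resulting from taking $\Phi_{(3)}$ to be
given by Case 1, as in (14). After dimensional reduction on the coordinate $z_9$, which will
be a Killing direction provided that
\[
m_1 + m_2 + m_3 + m_4 = 0 \tag{43}
\]
(see (24)), the type IIA light-cone action will be given by
\[
\begin{aligned}
L ={}& \sum_{i=1}^{8} \left[ \tfrac12\, \dot z_i^2 - \tfrac12 \left( z_i' - \tfrac12\, \mu \Phi_{ij} z_j \right)^2 - \tfrac12\, \mu_i^2 z_i^2 \right] \\
& + \tfrac18\, \mu^2 \left[ m_1^2 \left( z_1^2 + z_2^2 \right) + \cdots + m_4^2 \left( z_7^2 + z_8^2 \right) \right] + \bar\Psi \left( i \slashed{\partial} + \tfrac14\, \mu \varrho_0 Y \right) \Gamma_+ \Psi ,
\end{aligned} \tag{44}
\]
with $Y$ given by
\[
Y = i m_1 \Gamma_{12} + i m_2 \Gamma_{34} + i m_3 \Gamma_{56} + i m_4 \Gamma_{78} . \tag{45}
\]
(The matrix $W$ is absent here, since all terms in $\Phi_{(3)}$ in $D = 11$ involved a factor $dz_9$. Thus
the $D = 10$ background is purely NS–NS in this example.)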
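Note that the choice of gauge for writing $B_{(1)} \equiv B_i\, dz_i$ is not unique. In this example
we could, for instance, choose, instead of writing it in the “symmetrical” gauge
\[
B_{(1)} = \tfrac12\, \Phi_{ij} z_i\, dz_j = \tfrac12\, m_1 (z_1\, dz_2 - z_2\, dz_1) + \cdots + \tfrac12\, m_4 (z_7\, dz_8 - z_8\, dz_7) , \tag{46}
\]
to write it in the “asymmetrical” form
\[
B_{(1)} = m_1 z_1\, dz_2 + m_2 z_3\, dz_4 + m_3 z_5\, dz_6 + m_4 z_7\, dz_8 . \tag{47}
\]
In this choice of gauge we would instead obtain the string action
\[
\begin{aligned}
L ={}& \sum_{i=1}^{8} \left[ \tfrac12\, \dot z_i^2 - \tfrac12 \left( z_i' - \tfrac12\, \mu \Phi_{ij} z_j \right)^2 - \tfrac12\, \mu_i^2 z_i^2 \right] \\
& + \tfrac12\, \mu^2 \left( m_1^2 z_1^2 + m_2^2 z_3^2 + m_3^2 z_5^2 + m_4^2 z_7^2 \right) + \bar\Psi \left( i \slashed{\partial} + \tfrac14\, \mu \varrho_0 Y \right) \Gamma_+ \Psi .
\end{aligned} \tag{48}
\]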
Of course the different gauge choices just change the action by a total derivative, and so
they are equivalent in the closed string sector.
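That the two gauges differ by an exact form can be seen concretely: the difference of (47) and (46) is the gradient of $\tfrac12 (m_1 z_1 z_2 + m_2 z_3 z_4 + m_3 z_5 z_6 + m_4 z_7 z_8)$. The sketch below checks this numerically at a sample point; the values of the $m_\alpha$, the point and the function names are illustrative assumptions.

```python
m = (1.0, 2.0, 3.0, 4.0)   # illustrative values of m_1..m_4

def B_symmetric(z):
    """Components B_1..B_8 of (46): (1/2) m_a (z_{2a-1} dz_{2a} - z_{2a} dz_{2a-1})."""
    B = [0.0] * 8
    for a in range(4):
        i, j = 2 * a, 2 * a + 1        # zero-based indices of the pair
        B[i] = -0.5 * m[a] * z[j]
        B[j] = 0.5 * m[a] * z[i]
    return B

def B_asymmetric(z):
    """Components B_1..B_8 of (47): m_a z_{2a-1} dz_{2a}."""
    B = [0.0] * 8
    for a in range(4):
        B[2 * a + 1] = m[a] * z[2 * a]
    return B

def gauge_function(z):
    """Candidate gauge parameter: (1/2) sum_a m_a z_{2a-1} z_{2a}."""
    return 0.5 * sum(m[a] * z[2 * a] * z[2 * a + 1] for a in range(4))

def gradient(f, z, h=1e-6):
    """Central-difference gradient of f at the point z."""
    grad = []
    for i in range(len(z)):
        z_plus = list(z)
        z_minus = list(z)
        z_plus[i] += h
        z_minus[i] -= h
        grad.append((f(z_plus) - f(z_minus)) / (2 * h))
    return grad

z0 = [0.3, -1.2, 0.8, 0.5, -0.7, 1.1, 0.2, -0.4]
difference = [a - s for a, s in zip(B_asymmetric(z0), B_symmetric(z0))]
max_mismatch = max(abs(d - g)
                   for d, g in zip(difference, gradient(gauge_function, z0)))
print(max_mismatch)
```

Since the two potentials differ by a gradient, they have the same exterior derivative $\Phi_{(2)}$.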
Many of our examples can be T-dualised to pp-waves in type IIB theory when there are
two $\mu_i$ that vanish. In some cases, when the type IIA pp-wave is supported only by
$F_{(3)}$, the solution is also valid in type IIB theory, supported by the NS–NS $F_{(3)}$ or the RR
$F_{(3)}$, or by both, using an S-duality rotation. Thus in this section, we consider the light-cone type
IIB string action in such a background.
In [30], the Green–Schwarz action for the type IIB string in an arbitrary bosonic
background was derived, giving all terms up to and including quadratic order in the
fermionic coordinates. In the notation of [30], the two Majorana–Weyl fermions were
denoted by $\theta^1$ and $\theta^2$. If we put these in a column vector $\theta \equiv \begin{pmatrix} \theta^1 \\ \theta^2 \end{pmatrix}$, and define world-sheet
Dirac matrices $\varrho_i$ by $\varrho_0 = -i\tau_2$, $\varrho_1 = \tau_1$ and $\varrho_2 = \tau_3$, where $\tau_i$ are the Pauli matrices, then
we find the following “dictionary” for converting the notation in [30] to the one we wish to
use here. For any matrix or operator $O$ constructed from the target-space Dirac matrices,
we shall have
\[
\begin{aligned}
\bar\theta \varrho_0 O \theta &= -\bar\theta^1 O \theta^1 - \bar\theta^2 O \theta^2 , &\qquad \bar\theta \varrho_1 O \theta &= \bar\theta^1 O \theta^1 - \bar\theta^2 O \theta^2 , \\
\bar\theta O \theta &= 2\, \bar\theta^{[1} O \theta^{2]} , &\qquad \bar\theta \varrho_2 O \theta &= -2\, \bar\theta^{(1} O \theta^{2)} ,
\end{aligned} \tag{49}
\]
where the conjugate of $\theta$ is defined by $\bar\theta = \theta^\dagger \Gamma_0 \varrho_0 = (-\bar\theta^2, \bar\theta^1)$. Substituting into
Eq. (3.29) of [30] (in the updated v2, where a minor typographical error has been corrected
and conventions adjusted), the type IIB Green–Schwarz action up to $O(\theta^2)$ is given by
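\[
\begin{aligned}
L ={}& -\tfrac12 \sqrt{-h}\, h^{ij}\, \partial_i X^\mu \partial_j X^\nu g_{\mu\nu} + \tfrac12\, \epsilon^{ij}\, \partial_i X^\mu \partial_j X^\nu B_{\mu\nu} \\
& + i\, \partial_i X^\mu\, \bar\theta \gamma^{ij} \varrho_0 \Gamma_\mu D_j \theta - \tfrac{i}{8}\, \partial_i X^\mu \partial_j X^\nu\, \bar\theta \gamma^{ij} \varrho_1 \Gamma_\mu{}^{\rho\sigma} \theta\, G_{\nu\rho\sigma} \\
& + \tfrac{i}{8}\, e^{\phi}\, \partial_i X^\mu \partial_j X^\nu\, \bar\theta \gamma^{ij} \Gamma_\mu \\
& \quad \times \left( \Gamma^\rho \partial_\rho \chi + \tfrac16\, \varrho_2 \Gamma^{\rho_1 \rho_2 \rho_3} F_{\rho_1 \rho_2 \rho_3} + \tfrac{1}{240}\, \Gamma^{\rho_1 \cdots \rho_5} F_{\rho_1 \cdots \rho_5} \right) \Gamma_\nu \theta ,
\end{aligned} \tag{50}
\]
where $G_{(3)} = dB_{(2)}$ is the NS–NS 3-form, $\phi$ and $\chi$ are the dilaton and axion, and $F_{(3)}$ and
$F_{(5)}$ are the RR 3-form and self-dual 5-form, and we have defined
\[
\gamma^{ij} \equiv \sqrt{-h}\, h^{ij} - \epsilon^{ij} \varrho_2 . \tag{51}
\]
In the light-cone gauge, $X^+ = \tau$, $\theta = \Psi$ with $\Gamma_- \Psi = 0$, and $\sqrt{-h}\, h^{ij} = \eta^{ij}$, we,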
There are many examples in our general discussion where all the coefficients µi in the
metric function H are non-vanishing, implying that the pp-wave is intrinsically eleven-
dimensional. In these cases, the system is best described by a D0-brane action. Namely,
one can perform a DLCQ compactification [33–36] along the light-cone coordinate
$x^- \equiv x^- + 2\pi R$, and consider the sector with momentum $2p^+ = -p_- = N/R$. The
dynamics of this sector is then described by a $U(N)$ matrix model with the strength of
interactions governed by $g \sim 2R$. The procedure as it applies to the case of the Penrose
limit of AdS$_4 \times S^7$ or AdS$_7 \times S^4$ was given in [5]. The form of the action for the general,
constant $W$, as studied in this paper, can be derived along the same lines, and is structurally
of the same form. This is due to the fact that the 4-form field strength enters the D0-brane
particle action in the light-cone gauge (i.e., the $U(N)$ matrix model) only through
$W = \tfrac{i}{6} \Phi_{ijk} \Gamma^{ijk}$. The form of the action is thus given by
\[
\begin{aligned}
L ={}& \sum_{i=1}^{9} \left( (\dot X^i)^2 - \mu_i^2 X_i^2 \right) + \Psi^T \dot\Psi + \tfrac{i}{4}\, \mu\, \Psi^T W \Psi - \tfrac{2}{3}\, \mu g \sum_{i,j,k=1}^{9} {\rm Tr}\, X_i X_j X_k \Phi_{ijk} \\
& + 2 g^2\, {\rm Tr}\, [X_i, X_j]^2 + 2 i g\, {\rm Tr}\, \Psi^T \Gamma^i [\Psi, X_i] .
\end{aligned} \tag{55}
\]
Note that in addition to the standard matrix-model interactions there are also the fermionic
and bosonic mass terms, and additionally the term tri-linear in Xi that is related to the
Myers effect [37].
The supersymmetry of this quantum mechanical matrix model fixes the coefficients
in front of the fermionic mass terms and the interaction terms in the same way as
it was derived for the special case of $W = \Phi_{123} \Gamma^{123}$ in [5]. Indeed, the existence of
supersymmetry is dictated by the existence of the supernumerary Killing spinors. In fact,
the supersymmetry transformation parameter is exactly the supernumerary Killing spinor:
\[
\begin{aligned}
\delta X^i &= \Psi\, \Gamma^i \epsilon , \\
\delta \Psi &= \left[ \dot X^i \Gamma_i + \mu X^i \left( -\tfrac14\, W \Gamma_i + \tfrac{1}{12}\, \Gamma_i W \right) + i g\, [X^i, X^j]\, \Gamma_{ij} \right] \epsilon , \\
\epsilon &= e^{\mu W t} \epsilon_0 .
\end{aligned} \tag{56}
\]
The case where $W = \Gamma_{123}$ was given in [5]. In that case, the system is fully supersymmetric,
and hence $\epsilon_0$ is an arbitrary constant spinor. Furthermore, since $W$ has no zero
eigenvalues in that example, all the supersymmetry parameters are time-dependent.
For the more general $W$’s that we have considered in this paper, $\epsilon_0$ is subject to further
projection constraints, in accordance with the supernumerary Killing spinors. In our more
general cases $W$ can annihilate $\epsilon_0$, implying that $\epsilon$ is then time-independent. In such an
example, the pp-wave can also be reduced to give rise to a pp-wave in type IIA, thus giving
an exactly-solvable string action. The existence of two routes, one corresponding to the
matrix model of the D0-particle action, and the other corresponding to the free massive
type IIA string action, therefore suggests that these are dual descriptions of the theory
when the background is of this particular type.
4. Conclusions
transverse space for the pp-wave. These 3-forms fall into two classes, one motivated by
the Kähler form of the eight-dimensional special holonomy transverse space, and the
other motivated by the associative 3-form of a seven-dimensional transverse space of G2
holonomy.
This general class of pp-waves encompasses the Penrose limits of AdS$_p \times S^q$ with
$(p, q) = (4, 7)$, $(7, 4)$, $(3, 3)$, $(3, 2)$, $(2, 3)$, $(2, 2)$, which are associated with the near-horizon
limits of the M2-brane, M5-brane, and M2/M5, M5/M5/M5, M2/M2/M2 and
M2/M2/M5/M5 intersections, respectively. In addition this general class contains many
additional examples of pp-waves that do not correspond to any known Penrose limit.
We focused on the study of the target space supersymmetry. In addition to 16 “standard”
Killing spinors that always arise, we determined the conditions under which additional
“supernumerary” Killing spinors appear. We also analysed the conditions under which the
Killing spinors are independent of the light-cone $x^+$ coordinate, or of one or more of the
nine transverse coordinates. These conditions determine whether the reduction of the
M-theory pp-waves to type IIA supergravity, and subsequent T-dualisation to type IIB, remain
supersymmetric.
Since x^+ is always a Killing direction, the M-theory pp-wave can always be reduced
on this coordinate, leading to a D0-brane configuration of the type IIA theory. Its
world-particle action corresponds to a DLCQ description of a matrix-theory action for
M-theory, with unbroken supersymmetry governed by the x^+-independent supernumerary
supersymmetries of the M-theory pp-wave background.
On the other hand the independence of a Killing spinor on a transverse coordinate allows
for a reduction on this coordinate down to a supersymmetric type IIA pp-wave. The light
cone string actions in these backgrounds correspond to exactly-solvable free massive string
theories, and again the supernumerary supersymmetries play a key role in determining the
supersymmetry of the string action.
Note added
We have updated the discussion of the type IIA and type IIB string actions in this version
of the paper, taking into account the corrections and improvements in v2. of [30]. There was
only one minor typographical error in the type IIB action in the earlier version of [30], but there
were various inelegant notations and conventions, all of which have now been changed, and
these changes are incorporated in this version of the present paper. A detailed discussion
of the changes is given in the Addendum section in v2. of [30]. We are grateful to Kelly
Stelle for discussions leading to these improvements.
Acknowledgements
We are grateful to Gary Gibbons, Jim Liu and Justin Vázquez-Poritz for conversations.
H.L. and C.N.P. are grateful to University of Pennsylvania for hospitality and financial
support during the course of this work.
M. Cvetič et al. / Nuclear Physics B 644 (2002) 65–84 83
References
[1] R. Penrose, Any spacetime has a plane wave as a limit, in: Differential Geometry and Relativity, Reidel,
Dordrecht, 1976.
[2] M. Blau, J. Figueroa-O’Farrill, C. Hull, G. Papadopoulos, A new maximally supersymmetric background of
IIB superstring theory, JHEP 0201 (2002) 047, hep-th/0110242.
[3] M. Blau, J. Figueroa-O’Farrill, C. Hull, G. Papadopoulos, Penrose limits and maximal supersymmetry, hep-
th/0201081.
[4] R.R. Metsaev, Type IIB Green–Schwarz superstring in plane wave Ramond–Ramond background, Nucl.
Phys. B 625 (2002) 70, hep-th/0112044.
[5] D. Berenstein, J. Maldacena, H. Nastase, Strings in flat space and pp waves from N = 4 super-Yang–Mills,
hep-th/0202021.
[6] J. Kowalski-Glikman, Vacuum states in supersymmetric Kaluza–Klein theory, Phys. Lett. B 134 (1984) 194.
[7] R.R. Metsaev, A.A. Tseytlin, Exactly solvable model of superstring in plane wave Ramond–Ramond
background, hep-th/0202109.
[8] M. Blau, J. Figueroa-O’Farrill, G. Papadopoulos, Penrose limits, supergravity and brane dynamics, hep-
th/0202111.
[9] N. Itzhaki, I.R. Klebanov, S. Mukhi, PP-wave limit and enhanced supersymmetry in gauge theories, hep-
th/0202153.
[10] J. Gomis, H. Ooguri, Penrose limit of N = 1 gauge theories, hep-th/0202157.
[11] J.G. Russo, A.A. Tseytlin, On solvable models of type IIB superstring in NS–NS and RR plane wave
backgrounds, hep-th/0202179.
[12] L.A. Pando-Zayas, J. Sonnenschein, On Penrose limits and gauge theories, hep-th/0202186.
[13] M. Alishahiha, M.M. Sheikh-Jabbari, The pp-wave limits of orbifolded AdS5 × S5, hep-th/0203018.
[14] M. Billó, I. Pesando, Boundary states for GS superstrings in an Hpp wave background, hep-th/0203028.
[15] N. Kim, A. Pankiewicz, S.-J. Rey, S. Theisen, Superstring on pp-wave orbifold from large-N quiver gauge
theory, hep-th/0203080.
[16] T. Takayanagi, S. Terashima, Strings on orbifolded pp-waves, hep-th/0203093.
[17] M. Cvetič, H. Lü, C.N. Pope, Penrose limits, pp-waves and deformed M2-branes, hep-th/0203082.
[18] U. Gursoy, C. Nunez, M. Schvellinger, RG flows from Spin(7), CY 4-fold and HK manifolds to AdS,
Penrose limits and pp waves, hep-th/0203124.
[19] E. Floratos, A. Kehagias, Penrose limits of orbifolds and orientifolds, hep-th/0203134.
[20] J. Michelson, (Twisted) toroidal compactification of pp-waves, hep-th/0203140.
[21] C.S. Chu, P.M. Ho, Noncommutative D-brane and open string in pp-wave background with B-field, hep-
th/0203186.
[22] S.W. Hawking, M.M. Taylor-Robinson, Bulk charges in eleven dimensions, Phys. Rev. D 58 (1998) 025006,
hep-th/9711042.
[23] M.J. Duff, J.M. Evans, R.R. Khuri, J.X. Lu, R. Minasian, The octonionic membrane, Phys. Lett. B 412
(1997) 281, hep-th/9706124.
[24] M. Cvetič, H. Lü, C.N. Pope, Brane resolution through transgression, Nucl. Phys. B 600 (2001) 103, hep-
th/0011023.
[25] K. Becker, A note on compactifications on Spin(7)-holonomy manifolds, JHEP 0105 (2001) 003, hep-
th/0011114.
[26] M. Cvetič, G.W. Gibbons, H. Lü, C.N. Pope, Ricci-flat metrics, harmonic forms and brane resolutions, hep-
th/0012011.
[27] M. Cvetič, G.W. Gibbons, H. Lü, C.N. Pope, Hyper-Kähler Calabi metrics, L2 harmonic forms, resolved
M2-branes, and AdS4/CFT3 correspondence, Nucl. Phys. B 617 (2001) 151, hep-th/0102185.
[28] M. Cvetič, G.W. Gibbons, H. Lü, C.N. Pope, New complete non-compact Spin(7) manifolds, Nucl. Phys.
B 620 (2002) 29, hep-th/0103155.
[29] K. Becker, M. Becker, M-theory on eight-manifolds, Nucl. Phys. B 477 (1996) 155, hep-th/9605053.
[30] M. Cvetič, H. Lü, C.N. Pope, K.S. Stelle, T-duality in the Green–Schwarz formalism, and the mass-
less/massive IIA duality map, Nucl. Phys. B 573 (2000) 149, hep-th/9907202.
[31] A.A. Tseytlin, On dilaton dependence of type II superstring action, Class. Quantum Grav. 13 (1996) L 81,
hep-th/9601109.
[32] S. Hyun, H. Shin, Supersymmetry of Green–Schwarz superstring and matrix string theory, Phys. Rev. D 64
(2001) 046008, hep-th/0012247.
[33] T. Banks, W. Fischler, S.H. Shenker, L. Susskind, M theory as a matrix model: a conjecture, Phys. Rev. D 55
(1997) 5112, hep-th/9610043.
[34] L. Susskind, Another conjecture about M(atrix) theory, hep-th/9704080.
[35] A. Sen, D0 branes on T^n and matrix theory, Adv. Theor. Math. Phys. 2 (1998) 51, hep-th/9709220.
[36] N. Seiberg, Why is the matrix model correct?, Phys. Rev. Lett. 79 (1997) 3577, hep-th/9710009.
[37] R.C. Myers, Dielectric-branes, JHEP 9912 (1999) 022, hep-th/9910053.
Nuclear Physics B 644 (2002) 85–112
Abstract
A new method of solving quantum mechanical problems is proposed. The finite, i.e., cut-off,
Hilbert space is algebraically implemented in computer code, with states represented by lists of
variable length. A complete numerical solution of a given system is then obtained automatically. The
technique is applied to Wess–Zumino quantum mechanics and to D = 2 and D = 4 supersymmetric
Yang–Mills quantum mechanics with SU(2) gauge group. Convergence with increasing cut-off was
observed in many cases well within the reach of present machines. Many old results were confirmed
and some new ones, especially for the D = 4 system, are derived. Extension to D = 10 is possible
but computationally demanding for higher gauge groups.
© 2002 Elsevier Science B.V. All rights reserved.
1. Introduction
Since the original conjecture of Banks, Fischler, Shenker and Susskind (BFSS) of the
equivalence between M-theory and the D = 10 supersymmetric Yang–Mills quantum
mechanics (SYMQM) [1], a lot of effort has been put into understanding, and ultimately solving,
the latter [4–17].
General supersymmetric quantum mechanical systems, not necessarily gauge systems, have
a much longer history [2,3]. Claudson and Halpern first considered gauge
systems well before the BFSS hypothesis [4]. In particular, a complete solution of the
D = 2, N = 2 SYMQM was given there (see also Ref. [5]). Later the specific gauge models
86 J. Wosiek / Nuclear Physics B 644 (2002) 85–112
were studied in more detail and some candidates for the ground state were constructed in
the gauge-invariant Born–Oppenheimer approximation [6,7].
Another important development was achieved by de Wit, Lüscher and Nicolai, who
showed that the spectrum of supersymmetric Yang–Mills quantum mechanics is continuous
[8,9] due to cancellations between fermions and bosons. This is different from the
pure Yang–Mills case, where the transverse fluctuations across the vacuum valleys do not
cancel and effectively block the valleys, resulting in the discrete spectrum of the 0-volume
glueballs [10]. The continuous spectrum was first regarded as a setback; however, it
turned into a virtue with the advent of the BFSS hypothesis and the new interpretation
in terms of scattering states. Still, this new connection requires the existence of a
localized state at the threshold of the spectrum, the supergraviton [11]. This question
triggered intense studies of the Witten index of SYMQM for various D and different
gauge groups [12–15]. Powerful techniques were developed to calculate analytically the
non-Abelian integrals [16–18] related to the Witten index. They were accompanied by
complementary and original numerical methods [19,20]. The emerging picture is rather
satisfactory: for D < 10 there is no threshold bound state, while for D = 10 there
is one, exactly as required by BFSS. This has been proven for N = 2, but evidence is
accumulating that it holds for higher N, as well as for other gauge groups.
The large N limit of SYMQM, which is relevant to M-theory, was studied in the
framework of the mean field approximation in Refs. [21,22]. This provided an interesting
realization of the black hole thermodynamics predicted by M-theory [11,23].
In spite of all the above results, SYMQM remains unsolved for D > 2. Therefore, it seems
natural to study this model with lattice methods, which have proved so successful in treating
more complex field theoretical systems. Such a programme was proposed in Ref. [24],
beginning with the yet simpler (quenched, D = 4, N = 2) case, which was later extended
to higher gauge groups 2 < N < 9 [25]. The asymptotic behavior in N was observed,
and indications of the onset of an interesting phase structure were found, in agreement with
Refs. [21,22]. Including dynamical fermions is possible for D = 4, and may be feasible at
D = 10 for the first few N. However, the arbitrary N case is plagued by the sign problem
caused by the generally complex fermionic determinant.
Another interesting approach, studied by many authors, follows the Eguchi–Kawai
trick to trade entirely the D-dimensional configuration space for a group space [26,27].
Proceeding in this way one is led to consider the fully reduced (to a single point in
Euclidean space) model, which nevertheless possesses a version of supersymmetry [28].
Many analytical and numerical results were obtained in this way (for a review see, e.g.,
[29]). Numerical simulations were pushed to quite high N in simplified models [30].
However, the sign problem, which is also present there, limits the Monte Carlo approach.
This may be alleviated by the new method proposed recently in Ref. [31].
It is important to remember that the subject has a lot of overlap with the small volume
study of gauge theories, where valuable expertise has been accumulating for a long time
[10,32–34]. Although the final goal there is to increase the space volume and ultimately to
match the standard large volume physics, the starting point is the pure Yang–Mills
quantum mechanics, identical to the bosonic part of our supersymmetric systems. In fact,
the classical results of Lüscher and Münster are a special case of the solution of the
D = 4 SYMQM in the gluino-free sector of the latter. This will be shown in Section 5.
Recently van Baal has used the full machinery of the small volume approach to study
the supersymmetric vacuum state of the more complicated, compact version of D = 4
supersymmetric YM quantum mechanics [35].
Summarizing, even though the above quantum mechanics is much simpler than M-theory,
it remains unsolved and thereby still poses an interesting challenge.
In this paper we propose a new approach to this problem and present a series of
quantitative results for simpler systems, not all of which have been solved up to now. To this end we
use the standard Hamiltonian formulation of quantum mechanics in the continuum,1
construct explicitly the (finite) basis of physical states and calculate algebraically matrix
representations of all relevant observables. This done, we proceed to calculate numerically
the complete spectrum, the energy eigenstates, and identify (super)symmetry multiplets.
This approach is entirely insensitive to the sign problem and is equally well applicable to
systems with and without fermions. Similarly to the lattice approach, the method has an
intrinsic cut-off: any quantitative results can be obtained only within the finite-dimensional
subspace of the whole Hilbert space. It turns out, however, that in all cases studied below
many important characteristics (in particular the low energy spectrum) can be reliably
obtained before the number of basis vectors grows out of control.
General Hamiltonian methods have been applied before to complete, space extended
field theories [38]. Recently Matsumura and collaborators [39] and Pinsky et al. have
applied this technique to study a variety of partly reduced, supersymmetric theories in
lower dimensions (see Refs. [40,41] and references therein).
In the next section we describe the present approach and its algebraic computer
implementation. In Sections 3 and 4 the spectra of Wess–Zumino quantum mechanics and D = 2
supersymmetric Yang–Mills SU(2) quantum mechanics are derived. Section 5 contains
the main results of this paper. It is devoted to D = 4 supersymmetric Yang–Mills quantum
mechanics with the SU(2) gauge group. The global picture of the whole system, with the
quantitative spectra in all fermionic channels and their supersymmetric interrelations, will
be presented for the first time. A summary and a discussion of future applications follow
in the last two sections.
2. Quantum mechanics in a PC
We begin with the simple observation that the action of any quantum mechanical
observable can be efficiently implemented (e.g., in an algebraic program) if we use the
discrete eigen basis
\[ \{|n\rangle\}, \qquad |n\rangle = \frac{1}{\sqrt{n!}}\,(a^\dagger)^n |0\rangle, \tag{1} \]
1 This work was inspired by the discussion with C.M. Bender and the methods he developed in studying
various quantum mechanical systems [36,37].
of the occupation number operator $a^\dagger a$. For example, the bosonic coordinate and
momentum operators can be written as
\[ x = \frac{1}{\sqrt{2}}\,(a + a^\dagger), \qquad p = \frac{1}{i\sqrt{2}}\,(a - a^\dagger), \tag{2} \]
where all dimensionful parameters are set to 1. Since typical quantum observables are
relatively simple functions of x and p, they can be represented as the multiple actions
of the basic creation and annihilation operators.2 Including fermionic observables is
straightforward and will be done in subsequent sections. Generalization to more degrees of
freedom is also evident and will be carried out separately for individual systems.
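As an illustration of Eq. (2), the following minimal Python sketch builds the matrices of x and p in the finite basis |0>, ..., |Ncut> and evaluates their commutator. It is not the algebraic program used in this work; the helper names (ladder_matrices, position_momentum, commutator) are hypothetical and chosen only for this example. The canonical value i appears on the diagonal everywhere except in the last basis state, which shows in the simplest way how a finite cut-off distorts operator identities.

```python
import math

def ladder_matrices(n_cut):
    """Matrices <m|a|n> and <m|a^dagger|n> in the basis |0>, ..., |n_cut>."""
    dim = n_cut + 1
    a = [[0.0] * dim for _ in range(dim)]
    for n in range(1, dim):
        a[n - 1][n] = math.sqrt(n)  # a|n> = sqrt(n)|n-1>
    a_dag = [[a[j][i] for j in range(dim)] for i in range(dim)]  # transpose
    return a, a_dag

def matmul(A, B):
    dim = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(dim)) for j in range(dim)]
            for i in range(dim)]

def position_momentum(n_cut):
    """x = (a + a^dagger)/sqrt(2), p = (a - a^dagger)/(i sqrt(2)), cf. Eq. (2)."""
    a, a_dag = ladder_matrices(n_cut)
    dim = n_cut + 1
    s = 1.0 / math.sqrt(2.0)
    x = [[s * (a[i][j] + a_dag[i][j]) for j in range(dim)] for i in range(dim)]
    p = [[s * (a[i][j] - a_dag[i][j]) / 1j for j in range(dim)] for i in range(dim)]
    return x, p

def commutator(A, B):
    AB, BA = matmul(A, B), matmul(B, A)
    dim = len(A)
    return [[AB[i][j] - BA[i][j] for j in range(dim)] for i in range(dim)]

x, p = position_momentum(5)
c = commutator(x, p)
# Diagonal of [x, p]: i for n = 0..4, and -5i for the last state n = 5
# (up to rounding), because a^dagger|5> is cut away.
diagonal = [c[n][n] for n in range(6)]
```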
The first step to quantify the above considerations is to implement the Hilbert space in
an algebraic program. Any quantum state is a superposition of an arbitrary number, $n_s$, of
elementary states $|n\rangle$,
\[ |st\rangle = \sum_{I}^{n_s} a_I\, |n^{(I)}\rangle, \tag{3} \]
Table 1
Quantum states in the Hilbert space and their computer implementation
Operation          Quantum mechanics    PC                    Mapping
Any state          |st⟩                 list
Sum                |st1⟩ + |st2⟩        add[list1, list2]     (list1, list2) → list3
Number multiply    α|st1⟩               mult[α, list1]        list1 → list2
Scalar product     ⟨st1|st2⟩            sc[list1, list2]      (list1, list2) → number
Empty state        |0⟩                  {1,{1},{0}}
Null state         0                    {0,{}}
calculate matrix representations of the Hamiltonian and other quantum operators using
the above rules. Thereby the problem is reduced to a simple question in linear algebra, namely
the spectrum of the system is given by the eigenvalues and eigenvectors of the Hamiltonian
matrix, their behaviour under rotations by the angular momentum matrix, etc.
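The operations of Table 1 can be sketched in a few lines of Python. This is only an illustration under stated assumptions: a Python dictionary {occupation number: amplitude} stands in for the variable-length lists of the algebraic program, the function names add, mult and sc follow Table 1, and the remaining names (create, annihilate, x_op, p_op, hamiltonian, matrix) are hypothetical. The harmonic oscillator H = p^2/2 + x^2/2 is used purely as a test case because its matrix is known to be diagonal with entries n + 1/2.

```python
import math

def add(s1, s2):
    """|st1> + |st2>; a state is a dict {occupation number: amplitude}."""
    out = dict(s1)
    for n, amp in s2.items():
        out[n] = out.get(n, 0) + amp
    return {n: amp for n, amp in out.items() if amp != 0}

def mult(alpha, s):
    """alpha |st>."""
    return {n: alpha * amp for n, amp in s.items()} if alpha != 0 else {}

def sc(s1, s2):
    """Scalar product <st1|st2> (antilinear in the first argument)."""
    return sum(complex(amp).conjugate() * s2[n] for n, amp in s1.items() if n in s2)

def create(s):
    """a^dagger acting on a state: a^dagger|n> = sqrt(n+1)|n+1>."""
    return {n + 1: math.sqrt(n + 1) * amp for n, amp in s.items()}

def annihilate(s):
    """a acting on a state: a|n> = sqrt(n)|n-1>, and a|0> = 0."""
    return {n - 1: math.sqrt(n) * amp for n, amp in s.items() if n > 0}

def x_op(s):
    return mult(1 / math.sqrt(2), add(annihilate(s), create(s)))

def p_op(s):
    return mult(1 / (1j * math.sqrt(2)), add(annihilate(s), mult(-1, create(s))))

def hamiltonian(s):
    """Illustrative test case only: H = p^2/2 + x^2/2."""
    return mult(0.5, add(p_op(p_op(s)), x_op(x_op(s))))

def matrix(op, n_cut):
    """Matrix <m|op|n> in the cut basis |0>, ..., |n_cut>."""
    basis = [{n: 1.0} for n in range(n_cut + 1)]
    return [[sc(bra, op(ket)) for ket in basis] for bra in basis]

H = matrix(hamiltonian, 4)  # diagonal, with entries n + 1/2 up to rounding
```

Operators act on states of arbitrary length, and the cut-off enters only when matrix elements are taken between the finitely many basis vectors.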
Of course the most important question is how much the ultimate physics at Ncut = ∞
is distorted by the finite cut-off. This can be answered quantitatively by inspecting the
dependence of our results on Ncut for practically available sizes of the bases. In all systems
studied so far (and discussed below) the answer is positive: one can extract the meaningful
(i.e., Ncut = ∞) results before the size of the basis becomes unmanageable. However, this
can be answered only a posteriori and individually for every system.
The method outlined above turns out to be a rather powerful tool, capable of solving
quantitatively various quantum mechanical problems with a finite but large number of
degrees of freedom.
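\[ y = \frac{1}{\sqrt{2}}\,(a_y + a_y^\dagger), \qquad p_y = \frac{1}{i\sqrt{2}}\,(a_y - a_y^\dagger), \tag{10} \]
\[ \psi_\alpha = f_\alpha, \qquad \psi_\alpha^\dagger = f_\alpha^\dagger, \tag{11} \]
which in turn satisfy
\[ [a_i, a_k^\dagger] = \delta_{ik}, \qquad \{f_\alpha, f_\beta^\dagger\} = \delta_{\alpha\beta}. \tag{12} \]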
Now we implement this quantum mechanical system in the computer as explained in the
previous section. The empty state will be represented by a list
\[ |(0, 0), (0, 0)\rangle \leftrightarrow \{1, \{1\}, \{0, 0\}, \{0, 0\}\}, \tag{13} \]
where the first brackets specify occupation numbers of the bosonic, and the second of the
fermionic, states. To define properly the fermionic creation/annihilation operators we
follow the original construction of Jordan and Wigner [43,44]
\[ f_\alpha = \prod_{\beta<\alpha} (-1)^{F_\beta}\, \sigma_\alpha^-, \qquad f_\alpha^\dagger = \prod_{\beta<\alpha} (-1)^{F_\beta}\, \sigma_\alpha^+, \tag{14} \]
where the fermionic number operator is $F_\alpha = f_\alpha^\dagger f_\alpha$ (no sum) and $\sigma_\alpha^\pm$ are raising and lowering
operators, commuting for different α, with $(\sigma_\alpha^\pm)^2 = 0$. The Jordan–Wigner phases ensure
uniform anticommutation rules for fermionic operators.
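The role of the Jordan–Wigner phases in Eq. (14) can be checked explicitly. The sketch below is an illustration only, with hypothetical function names: a fermionic state is a Python dictionary mapping a tuple of occupation numbers to an amplitude, the operators f and f_dag attach the phase (−1) raised to the number of occupied modes β < α, and check_algebra verifies the anticommutation rule of Eq. (12) on every basis state.

```python
from itertools import product

def f_dag(alpha, state):
    """Creation operator f_alpha^dagger with the Jordan-Wigner phase, cf. Eq. (14)."""
    out = {}
    for occ, amp in state.items():
        if occ[alpha] == 1:
            continue  # Pauli blocking: the mode is already occupied
        sign = (-1) ** sum(occ[:alpha])  # phase from occupied modes beta < alpha
        new_occ = occ[:alpha] + (1,) + occ[alpha + 1:]
        out[new_occ] = out.get(new_occ, 0) + sign * amp
    return out

def f(alpha, state):
    """Annihilation operator f_alpha with the same phase convention."""
    out = {}
    for occ, amp in state.items():
        if occ[alpha] == 0:
            continue  # nothing to annihilate
        sign = (-1) ** sum(occ[:alpha])
        new_occ = occ[:alpha] + (0,) + occ[alpha + 1:]
        out[new_occ] = out.get(new_occ, 0) + sign * amp
    return out

def anticommutator(alpha, beta, state):
    """{f_alpha, f_beta^dagger} applied to a state."""
    out = dict(f(alpha, f_dag(beta, state)))
    for occ, amp in f_dag(beta, f(alpha, state)).items():
        out[occ] = out.get(occ, 0) + amp
    return {occ: amp for occ, amp in out.items() if amp != 0}

def check_algebra(n_modes):
    """True if {f_alpha, f_beta^dagger} = delta_{alpha beta} on all basis states."""
    for occ in product((0, 1), repeat=n_modes):
        ket = {occ: 1}
        for alpha in range(n_modes):
            for beta in range(n_modes):
                expected = ket if alpha == beta else {}
                if anticommutator(alpha, beta, ket) != expected:
                    return False
    return True

ok_two_modes = check_algebra(2)    # two fermionic modes, as in this section
ok_three_modes = check_algebra(3)  # three modes
```

Without the phase factor the operators for different α would commute rather than anticommute, which is exactly what the check would detect.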
We proceed to construct the finite eigenbasis of the fermionic, Fα, and bosonic, Bi,
number operators. For more than one degree of freedom the organization of a basis and
the definition of the cut-off, Ncut, are not unique. In this case we have chosen the cut-off to
be the maximal number of bosonic quanta. That is, the basis subject to the cut-off Ncut
consists of all orthonormal elementary states with B = B1 + B2 ≤ Ncut and all allowed
fermionic quanta, i.e., F = F1 + F2 ≤ 2. In practice the basis is created by acting on
the empty state with all elementary independent monomials of bosonic creation operators
up to Ncut-th order, followed by the three independent monomials of fermionic creation
operators. Hence, the Hilbert space, subject to the cut-off Ncut, has 2(Ncut + 1)(Ncut + 2)
dimensions.
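The counting of basis states can be reproduced by direct enumeration. In the short Python sketch below (an illustration; the function name cut_basis is hypothetical) each elementary state is labeled by two bosonic and two fermionic occupation numbers, with B1 + B2 ≤ Ncut.

```python
from itertools import product

def cut_basis(n_cut):
    """All occupation labels ((B1, B2), (F1, F2)) with B1 + B2 <= n_cut."""
    bosonic = [(b1, b2) for b1 in range(n_cut + 1) for b2 in range(n_cut + 1 - b1)]
    fermionic = list(product((0, 1), repeat=2))  # F1, F2 in {0, 1}
    return [(b, fq) for b in bosonic for fq in fermionic]

basis = cut_basis(15)
dimension = len(basis)  # 2 * 16 * 17 = 544 for Ncut = 15

# States with even F = F1 + F2 are bosonic, those with odd F are fermionic.
n_bosonic = sum(1 for _, fq in basis if sum(fq) % 2 == 0)
n_fermionic = sum(1 for _, fq in basis if sum(fq) % 2 == 1)
```

The enumeration gives (Ncut + 1)(Ncut + 2)/2 bosonic configurations times four fermionic ones, and equal numbers of bosonic and fermionic states at every cut-off.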
Given the basis, the matrix representation of the Hamiltonian (and any other observable
of interest) can be readily calculated with our computer based “quantum algebra”. The
spectrum and energy eigenstates are then obtained by numerical diagonalization of the
Hamiltonian matrix.
Fig. 1 shows the spectrum of low energy states as a function of the cut-off up to
Ncut = 15 for m = g = 1. Energies of the fourth and fifth levels are shifted by 1 to avoid
confusion of levels at low Ncut. Clear convergence with Ncut is seen, well within the capacity
of a medium-class PC. The convergence is faster for lower states. This general feature of
our method can be easily understood. The basis generated by the creation operators, Eq. (12),
is nothing but the eigenbasis of some normalized harmonic oscillator. It is obvious that
the ground state can be approximated more easily by a series of harmonic oscillator wave
functions than the higher states with all their detailed structures, zeroes, etc. Quantitatively,
at Ncut = 10 the exact supersymmetric ground state energy is reproduced with 3%
accuracy, which further improves to below 1% at Ncut = 15.4 The supersymmetric pattern of
4 Since the exact value is zero, we take the first excited energy as a reference scale.
Table 2
Splittings within the first three supersymmetric multiplets for different cut-offs
Ncut M1 M2 M3
9 0.1321 0.1574 0.1835
10 0.0924 0.1213 0.1245
11 0.0595 0.0941 0.0886
12 0.0420 0.0657 0.0609
13 0.0282 0.0440 0.0407
14 0.0212 0.0332 0.0287
15 0.0149 0.0207 0.0192
the spectrum is also evident. First, the lowest bosonic state has no fermionic counterpart and its
energy tends to zero; hence we identify it with the supersymmetric vacuum of the model.5
Second, all higher states group into bosonic–fermionic multiplets with the same energy.
Splittings inside the multiplets decrease with Ncut, as shown in Table 2. We conclude that
supersymmetry at low energies is restored at the level of a few percent in the cut Hilbert
space with states containing up to 15 bosonic quanta.
Let us see how the above convergence to the supersymmetric spectrum shows up in the
Witten index, which in this approach can be calculated directly from the definition
\[ I_W(T) = \sum_i (-1)^{F_i} \exp(-T E_i). \tag{15} \]
5 The model exhibits additional two-fold degeneracy in bosonic and fermionic sectors which is not related to
supersymmetry [42].
Fig. 2 shows the Witten index for 4 < Ncut < 15. Nice convergence to the exact, time-independent
result IW(T) = 2 is observed. Two things are happening which ensure this
behaviour. First, there are two (due to the above degeneracy within bosonic and fermionic
sectors) supersymmetric ground states. At the largest cut-off, Ncut = 15, their energies are
not exactly zero but tiny, E0,0 = 0.0046 and E0,1 = 0.0071, hence giving rise to an exponential
fall-off at large T with a very small slope. Second, cancellations between non-zero energy
states within supersymmetric multiplets are not exact and still leave exponential terms,
albeit with smaller and smaller (with increasing Ncut) coefficients. This gives a residual
time dependence which is, however, much weaker than the exponential one.
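The definition in Eq. (15) translates directly into code once the spectrum is known. The Python sketch below is an illustration only: the two ground-state energies are the values quoted above for Ncut = 15 (both are assumed here to carry even fermion number, consistent with the limiting value 2), whereas the excited multiplets and the two high-lying unbalanced fermionic states are hypothetical numbers invented for the example, not results of this work.

```python
import math

def witten_index(levels, T):
    """Eq. (15): sum over levels (F_i, E_i) of (-1)^F_i exp(-T E_i)."""
    return sum((-1) ** F * math.exp(-T * E) for F, E in levels)

# Ground states at Ncut = 15 (energies quoted in the text; even F assumed).
ground = [(0, 0.0046), (0, 0.0071)]
# Hypothetical boson-fermion multiplets with small residual splittings.
multiplets = [(0, 1.00), (1, 1.015), (0, 2.00), (1, 2.021)]
# Hypothetical unbalanced fermionic states pushed to high energy by the cut-off.
unbalanced = [(1, 40.0), (1, 45.0)]

levels = ground + multiplets + unbalanced

index_at_zero = witten_index(levels, 0.0)  # equal numbers of bosons and fermions
index_at_two = witten_index(levels, 2.0)   # close to 2, slowly decaying with T
```

With these inputs the index vanishes at T = 0 because the basis contains as many fermionic as bosonic states, and it stays close to 2 at moderate T, where only the nearly massless ground states and the imperfect multiplet cancellations contribute.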
Another, rather general, feature of IW(T) can also be observed here. Exactly at T = 0
the Witten index vanishes for every Ncut, just because the bases generated with our procedure
automatically satisfy a “global” supersymmetry requirement. Namely, the number of
bosonic and fermionic states in a basis is the same for every Ncut . Therefore, we conclude
that
On the other hand exact supersymmetry requires that this complete correspondence
between fermionic and bosonic states is violated since the ground state at E = 0 does not
have its fermionic counterpart while each non-zero energy state has. This apparent paradox
is resolved by noting that exact supersymmetry emerges only in the limit of infinite
Ncut. In other words, the unbalanced fermionic state is pushed (with increasing Ncut)
to higher and higher energies and eventually “vanishes from the spectrum”. However, it
leaves a visible effect: the Witten index has a discontinuity at T = 0, which can be expressed
as the non-commutativity of the T → 0 and Ncut → ∞ limits
The right-hand side of the above equation, known as the bulk contribution, may not be
integer for the continuous spectrum [13].
The above results were obtained for m = g = 1, but the method works equally well for
any other choice of parameters. The case m = 0 has a rotational symmetry and can
be solved by other methods [45]. To our knowledge no quantitative calculation of the
spectra and wave functions for m ≠ 0 exists. As mentioned earlier, this approach provides
a complete quantum mechanics of a system. In particular, wave functions in the coordinate
representation are simply given by well defined linear combinations of the harmonic
oscillator wave functions. A more detailed discussion of the model, and a comparison with
other authors, will be presented elsewhere.
This system, although solved analytically [4], has two new features: gauge invariance
and a continuous spectrum. Therefore, we have chosen it as the next exercise. Reducing from
D = 2 to one time dimension, one is left with three real, colored, bosonic variables xa(t)
and three complex fermionic degrees of freedom ψa(t), also in the adjoint representation
of SU(2), a = 1, 2, 3.
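The Hamiltonian reads [4]
\[ H = \frac{1}{2}\, p_a p_a + i g\, \epsilon_{abc}\, \psi_a^\dagger x_b \psi_c, \tag{18} \]
where the quantum operators $x, p, \psi, \psi^\dagger$ satisfy
\[ [x_a, p_b] = i\delta_{ab}, \qquad \{\psi_a, \psi_b^\dagger\} = \delta_{ab}. \tag{19} \]
Hence they can be written in terms of the creation and annihilation operators as before,
\[ x_a = \frac{1}{\sqrt{2}}\,(a_a + a_a^\dagger), \qquad p_a = \frac{1}{i\sqrt{2}}\,(a_a - a_a^\dagger), \tag{20} \]
\[ \psi_a = f_a, \qquad \psi_a^\dagger = f_a^\dagger. \tag{21} \]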
To implement fermionic creation and annihilation operators we again used the Jordan–
Wigner construction.
The system has a local gauge invariance with the generators
\[ G_a = \epsilon_{abc} \left( x_b p_c - i \psi_b^\dagger \psi_c \right). \tag{22} \]
Therefore, the physical Hilbert space consists only of the gauge invariant states. This can
be easily accommodated in our scheme by noting that the gauge generators of SU(2) are
just the angular momentum operators acting in color space. In fact, the fermionic part of
Eq. (22) can also be interpreted in this way, since the momentum canonically conjugate to
ψ is $\pi_\psi = i\psi^\dagger$. Therefore, we construct all possible SU(2)-invariant combinations of
the creation operators (referred to for short as creators) and use them to generate a complete
gauge invariant basis of states. There are four lower order creators:
The Pauli principle implies that $(ff) = (af)^2 = (aff)^2 = (fff)^2 = 0$; therefore, the whole
basis can be conveniently organized into four towers of states, each tower beginning
with one of the following states
\[ |0_F\rangle = |0\rangle, \quad |1_F\rangle = (af)|0\rangle, \quad |2_F\rangle = (aff)|0\rangle, \quad |3_F\rangle = (fff)|0\rangle, \tag{25} \]
where we have labeled the states by the gauge invariant fermionic number $F = f_a^\dagger f_a$. The
empty state (and its Mathematica representation) reads
\[ |(0, 0, 0), (0, 0, 0)\rangle \leftrightarrow \{1, \{1\}, \{0, 0, 0\}, \{0, 0, 0\}\}, \tag{26} \]
with the obvious assignment of bosonic and fermionic occupation numbers. To obtain the
whole basis it is now sufficient to act repeatedly on the four vectors, Eq. (25), with the
bosonic creator (aa), since action with other creators either gives zero, due to the Pauli
blocking, or produces a linearly dependent state from another tower. The basis with the cut-off
Ncut is then obtained by applying (aa) up to Ncut times to each of the four “ground” states.
Of course our cut-off is gauge invariant, since it is defined in terms of the gauge invariant
creators. Moreover, since the gauge generators can be implemented into the algebraic
program as any other observables, we can directly verify gauge invariance of the basis
and any other state in question.
Once the above basis is constructed, we proceed to calculate the matrix representation
of the Hamiltonian, the spectrum and the eigenstates according to Section 2.6 Again
all computations can be done on a small PC, and the longest run, corresponding to 20
bosonic quanta, takes approximately 500 sec. It is worth mentioning that the most time
consuming are the algebraic operations on the states (cf. Table 1) in the PC-based abstract
Hilbert space. Consequently the program spends most of the time calculating the matrix
representations of various operators 7 while the numerical diagonalization is rather fast.
The Hamiltonian, Eq. (18), can be rewritten as
\[ H = \frac{1}{2}\, p_a p_a + g\, x_a G_a, \tag{27} \]
hence, it reduces to that of a free particle in the physical, gauge invariant basis. It
follows that it preserves the gauge invariant fermionic number and, consequently, can be
diagonalized independently in each sector spanned by the four towers in Eq. (25).
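For the F = 0 tower the free-particle form of the Hamiltonian can be made fully explicit. The following Python sketch is our own illustration and rests on stated assumptions: the cut-off k_max counts applications of (aa) to the empty state, the normalized states |k> ∝ (aa)^k|0> are used, and the matrix elements <k|H|k> = k + 3/4 and <k+1|H|k> = −(1/4)√((2k+2)(2k+3)) are derived here from Eq. (20) for H = p_a p_a/2 rather than quoted from the text. The lowest eigenvalue of the resulting tridiagonal matrix is found by Sturm-sequence bisection (a standard technique, swapped in for the numerical diagonalization used in this work).

```python
import math

def free_tower_hamiltonian(k_max):
    """Tridiagonal matrix of H = p_a p_a / 2 in the F = 0 tower, k = 0..k_max."""
    diag = [k + 0.75 for k in range(k_max + 1)]
    off = [-0.25 * math.sqrt((2 * k + 2) * (2 * k + 3)) for k in range(k_max)]
    return diag, off

def count_below(diag, off, lam):
    """Number of eigenvalues smaller than lam (Sturm sequence count)."""
    count = 0
    q = diag[0] - lam
    if q < 0:
        count += 1
    for i in range(1, len(diag)):
        if q == 0.0:
            q = 1e-300  # avoid division by zero
        q = diag[i] - lam - off[i - 1] ** 2 / q
        if q < 0:
            count += 1
    return count

def lowest_energy(k_max, tol=1e-12):
    """Lowest eigenvalue, bracketed between 0 and the smallest diagonal entry."""
    diag, off = free_tower_hamiltonian(k_max)
    lo, hi = 0.0, diag[0]
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if count_below(diag, off, mid) >= 1:
            hi = mid
        else:
            lo = mid
    return hi

energies = {k_max: lowest_energy(k_max) for k_max in (5, 10, 20, 40)}
```

Under these assumptions the lowest energy decreases monotonically with the cut-off and roughly halves when the cut-off is doubled, in line with a slow, 1/Ncut-like approach to the continuum threshold.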
The spectrum is doubly degenerate because of the particle-hole symmetry which relates
empty and filled fermionic states (|0F ↔ |3F ) and their 1-particle 1-hole counterparts
(|1F ↔ |2F ). This symmetry is not violated by the cut-off and indeed we see it exactly
in the spectrum. On the other hand, supersymmetry connects sectors which differ by 1 in
the fermionic number, e.g. (|0F ↔ |1F ) and in general is restored only at infinite Ncut .
A good measure of the SUSY violation is provided by the energy of the ground state, which
should be 0. We see that the energy of the lowest (doubly degenerate) state converges to
0, but rather slowly, cf. Fig. 3. This is interpreted as an indication that the spectrum is
continuous at infinite cut-off. Indeed, it is hard to approximate a non-localized state by
the harmonic oscillator states (which in fact form our basis), and consequently the convergence
of such an approximation must be slow.8 This deficiency can sometimes be turned into an
advantage, as will be seen in the D = 4 case.
Fig. 3. D = 2 supersymmetric Yang–Mills quantum mechanics. The energy of the lowest state as a function of the
cut-off and the 1/Ncut fit (dotted line).
Surprisingly, however, there exists a particular scheme of increasing the basis which
renders supersymmetry exact at every finite cut-off Ncut in this model. To see that, let us
first discuss tests of SUSY at the operator level. The SUSY generators read
\[ Q = \psi_a p_a, \qquad \bar{Q} = \bar{\psi}_a p_a, \tag{28} \]
with $\bar{\psi} = \psi^\dagger$, and satisfy the algebra
\[ \{Q, \bar{Q}\} = 2H - g\, x_a G_a, \qquad Q^2 = \bar{Q}^2 = 0. \tag{29} \]
Since the matrix elements of the SUSY generators can be easily calculated in our PC-
based Hilbert space approach, one can readily check the matrix element version of
Eqs. (29). Indeed, we find that these relations are satisfied to better and better precision
with increasing Ncut. If so, it is natural to ask what is the spectrum of the Hamiltonian
matrix $H^{Q\bar{Q}}$ defined by Eq. (29) at finite cut-off. To this end we diagonalize the matrix
\[ H^{Q\bar{Q}}_{N,N'} = \frac{1}{2} \left( \langle N|Q|M\rangle \langle M|\bar{Q}|N'\rangle + \langle N|\bar{Q}|M\rangle \langle M|Q|N'\rangle \right), \tag{30} \]
where N, N′ and M label vectors of our finite basis.9 Of course SUSY generators mix
the fermionic number, therefore, one should combine all four towers of Eq. (25) into one
big basis. One more trick is required to achieve exact SUSY at finite cut-off: we have to
increase the basis allowing from the beginning for the disparity between the fermionic and
bosonic sectors. That is, the size of the basis grows as: 2 + 4 + 4 + · · ·. Hence, dimension
8 This argument can be turned into a proof that the free spectrum converges like 1/Ncut [46], which is also
confirmed here (cf. Fig. 3).
9 The second term in Eq. (29) does not contribute in the physical basis.
Table 3
The spectrum and the degeneracy, d, of the Hamiltonian H QQ defined at finite Ncut as {Q, Q}/2
2 + 4Ncut 10 14 18 22 26 d
E0 0 0 0 0 0 2
E1 0.815 0.610 0.489 0.409 0.351 4
E2 2.685 1.904 1.495 1.236 1.056 4
E3 4.235 3.160 2.558 2.160 4
E4 0 5.856 4.522 3.743 4
E5 0 7.525 5.960 4
E6 0 0 9.230 4
of the cut Hilbert space is 2 + 4Ncut in this scheme. With this choice the spectrum of
$H^{Q\bar{Q}}$ has exact supersymmetry, as shown in Table 3.
As required, the ground state has zero energy and has no supersymmetric
image, while higher states form supersymmetric doublets with the same energy. Additional
degeneracy is caused by the particle-hole symmetry, as explained earlier. All non-zero
eigenvalues (with finite “principal” quantum number) tend to zero with increasing Ncut and
form, at Ncut = ∞, the continuum spectrum of the free Hamiltonian, Eq. (27). To reach the
solutions with non-zero energy, one should appropriately scale the index of an eigenvalue
with Ncut. The four towers of eigenstates are in direct correspondence with the four
gauge invariant plane waves constructed in Ref. [4].
Even though the existence of a supersymmetry-preserving cut-off is quite interesting, we
believe that it is related to the simple structure of the D = 2 SYM quantum mechanics and
its complete solubility. Nevertheless, it is important to keep in mind that at finite Ncut one
is free to define the Hamiltonian within the “O(1/Ncut)” tolerance, i.e., as long as various
definitions converge to the same limit at infinite Ncut [39–41]. We decided to use $H^{Q\bar{Q}}$
because it is the anticommutator of charges which is used to derive basic properties of the
SUSY spectrum. Further study of this exact realization of supersymmetry will be presented
elsewhere.
Finally, let us briefly discuss the Witten index for this model. Because the particle-hole
symmetry interchanges odd and even fermionic numbers, the Witten index vanishes
identically. This is also true for any finite cut-off since Ncut preserves particle-hole
symmetry. Nevertheless one can obtain non-trivial and interesting information by
defining the index restricted to one pair of fermion–boson sectors. For example,
$$I_W(T)^{(0,1)} = \sum_{i,\; F_i = 0,1} (-1)^{F_i} \exp(-T E_i), \tag{31}$$
where the sum is now restricted only to the F = 0 and F = 1 sectors. Since supersymmetry
balances fermionic and bosonic states between these sectors (with the usual exception
of the vacuum), the restricted index is a good and non-trivial measure of the amount
of the violation/restoration of SUSY, even when the total Witten index vanishes.
Obviously $I_W^{(2,3)} = -I_W^{(0,1)}$, and $I_W^{(0,3)} = I_W^{(1,2)} = 0$. Studying the restricted index is particularly
interesting in this model since, due to the continuum spectrum, it does not have to be
integer and provides some information about the density of states. We have calculated the
index from our spectrum of the original Hamiltonian, Eq. (18), for a range of cut-offs:
J. Wosiek / Nuclear Physics B 644 (2002) 85–112 97
5 < Ncut < 20. Fig. 4 shows the result for the 2 + 4 + 4 + · · · scheme, i.e., when the basis
is enlarged by four vectors at a time, allowing from the beginning for the two unbalanced
states from the empty and filled sectors. We see a slow approach towards a
time-independent constant which seems to be 1/2.
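As a minimal sketch of how the restricted index of Eq. (31) can be evaluated from a list of levels, assuming each state is stored as an (energy, fermion number) pair: the function name and the toy spectrum below are hypothetical and are not data from this calculation. With one unpaired F = 0 vacuum and exactly degenerate doublets, the toy index is independent of T and equal to 1; a fractional value can only arise from a continuum.

```python
import math

def restricted_witten_index(levels, T, sectors=(0, 1)):
    """Signed sum over states (E, F) whose fermion number lies in `sectors`."""
    return sum((-1) ** F * math.exp(-T * E) for E, F in levels if F in sectors)

# Toy spectrum (made-up numbers): an unpaired F = 0 vacuum, two exact
# boson-fermion doublets, and one F = 2 state that the restriction ignores.
toy_levels = [(0.0, 0), (1.3, 0), (1.3, 1), (2.7, 0), (2.7, 1), (0.9, 2)]

for T in (0.0, 0.5, 2.0):
    print(T, restricted_witten_index(toy_levels, T))
```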
The Witten index, as a sum over all states, provides an average measure of the
breaking/restoration of SUSY. Indeed, even though individual supersymmetric multiplets have
not yet clearly formed (at cut-offs used for this calculation), we see definite flattening of
the T dependence of IW (T ) which must then be a result of the average cancellation be-
tween many levels. Therefore, it is easier to see the onset of the supersymmetric behavior
in the Witten index than in the location of the individual levels. As before Fig. 4 shows
that the limiting value of the index is discontinuous at T = 0 with the same interpretation
as in Section 3. However, contrary to the discrete spectrum of the Wess–Zumino model,
the value of the index is not integer.10 The value of the index at T = 0 is the direct conse-
quence of our scheme (2 + 4 + 4 + · · ·) of increasing the basis and, of course, is scheme
dependent. As a final remark we note the intriguing existence of a “universal” point in T at
which the asymptotic value seems to be attained at all Ncut .
5.1. Preliminaries
$$H = H_B + H_F, \tag{32}$$
10 It would be an interesting exercise to calculate the restricted index from the continuous spectrum of Ref. [4].
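$$H_B = \frac{1}{2}\, p_a^i p_a^i + \frac{g^2}{4}\, \epsilon_{abc}\epsilon_{ade}\, x_b^i x_c^j x_d^i x_e^j,$$
$$H_F = \frac{ig}{2}\, \epsilon_{abc}\, \psi_a^T \Gamma^k \psi_b\, x_c^k, \tag{33}$$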
where ψ T is the transpose of the real Majorana spinor, and Γ in D = 4 are just the
standard Dirac α matrices. In all explicit calculations we are using Majorana representation
of Ref. [47].
Even though the three-dimensional space was reduced to a single point, the system
still has the internal spin(3) rotational symmetry, inherited from the original theory, and
generated by the angular momentum
$$J^i = \epsilon^{ijk} \left( x_a^j p_a^k - \frac{1}{4}\, \psi_a^T \Sigma^{jk} \psi_a \right), \tag{34}$$
with
$$\Sigma^{jk} = -\frac{i}{4} \left[ \Gamma^j, \Gamma^k \right]. \tag{35}$$
Further, the system has gauge invariance with the generators
$$G_a = \epsilon_{abc} \left( x_b^k p_c^k - \frac{i}{2}\, \psi_b^T \psi_c \right), \tag{36}$$
and is invariant under the supersymmetry transformations generated by
$$Q_\alpha = \left( \Gamma^k \psi_a \right)_\alpha p_a^k + i g\, \epsilon_{abc} \left( \Sigma^{jk} \psi_a \right)_\alpha x_b^j x_c^k. \tag{37}$$
The bosonic potential (written now in the vector notation in the color space)
$$V = \frac{g^2}{4} \sum_{jk} \left( \vec{x}^{\,j} \times \vec{x}^{\,k} \right)^2, \tag{38}$$
exhibits the famous flat directions responsible for a rich structure of the spectrum.
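The flat directions of the potential (38) can be made explicit numerically. The sketch below assumes the coordinates are stored as a 3 × 3 array `x[j]`, one color vector per spatial index j; the array layout, the function name and the sample configurations are illustrative choices. Whenever all three color vectors are parallel, every cross product vanishes and V = 0 however far the configuration is from the origin.

```python
import numpy as np

def bosonic_potential(x, g=1.0):
    """V = g^2/4 * sum_{j,k} |x^j x x^k|^2, with x[j] the colour vector for spatial index j."""
    v = 0.0
    for j in range(3):
        for k in range(3):
            c = np.cross(x[j], x[k])
            v += np.dot(c, c)
    return 0.25 * g**2 * v

n = np.array([0.0, 0.6, 0.8])              # an arbitrary fixed direction in colour space
valley = np.outer([1.0, -2.0, 5.0], n)     # all x^j parallel: a point in a flat valley
generic = np.eye(3)                        # mutually orthogonal colour vectors

print(bosonic_potential(valley))           # vanishes up to rounding
print(bosonic_potential(100.0 * valley))   # still vanishes far from the origin
print(bosonic_potential(generic))          # 1.5 for g = 1
```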
At first sight one might expect that the spectrum of the purely bosonic (hence, non-
supersymmetric) system does not have localized states because of these flat valleys (cf.
Introduction). However, the flat directions are blocked by the energy of the transverse
quantum fluctuations since valleys narrow as we move from the origin. As a consequence
the spectrum of the model is discrete. Energies of the lowest few states, known as
zero-volume glueballs, were first calculated by Lüscher and Münster [32]. On the other hand,
in the supersymmetric system, transverse fluctuations cancel among bosons and fermions,
valleys are not blocked, and the spectrum of the model is expected to be continuous [8].
The story has its interesting continuation in D = 10 model where the evidence of the
threshold bound state was accumulated [12–17]. Existence of such a state is necessary
for the M-theory interpretation of the model where it is considered as a prototype of the
graviton multiplet. Therefore, detailed study of the low energy spectra of the whole family
of Yang–Mills quantum mechanical systems, including identification of the localized and
non-localized states is an important and fascinating subject. We shall face some of these
questions also in the D = 4 system.
There are many ways to write quantum coordinates in terms of creation and annihilation
operators
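$$\left[ a_a^i, a_b^{k\dagger} \right] = \delta_{ab}\, \delta^{ik}, \qquad \left\{ f_a^\rho, f_b^{\sigma\dagger} \right\} = \delta_{ab}\, \delta^{\rho\sigma}, \qquad \rho, \sigma = 1, 2. \tag{39}$$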
The only constraint comes from the canonical (anti)commutation relations
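$$\left[ x_a^i, p_b^k \right] = i\, \delta^{ik} \delta_{ab}, \qquad \left\{ \psi_a^\alpha, \psi_b^\beta \right\} = \delta^{\alpha\beta} \delta_{ab}. \tag{40}$$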
For bosonic variables we just use the straightforward extension of Eq. (20) to more degrees
of freedom
$$x_a^i = \frac{1}{\sqrt{2}} \left( a_a^i + a_a^{i\dagger} \right), \qquad p_a^i = \frac{1}{i\sqrt{2}} \left( a_a^i - a_a^{i\dagger} \right). \tag{41}$$
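For a single degree of freedom, Eq. (41) can be turned into finite matrices in the basis of states with at most Ncut quanta. The following is a sketch under that assumption (the function name and the value Ncut = 5 are illustrative). It also shows the price of the cut-off: the canonical commutator equals i on every basis state except the last one, where the truncation is felt.

```python
import numpy as np

def truncated_xp(n_cut):
    """Matrices of x and p in the number basis |0>, ..., |n_cut>."""
    dim = n_cut + 1
    a = np.diag(np.sqrt(np.arange(1, dim)), k=1)   # a|n> = sqrt(n)|n-1>
    x = (a + a.T) / np.sqrt(2)
    p = (a - a.T) / (1j * np.sqrt(2))
    return x, p

x, p = truncated_xp(5)
comm = x @ p - p @ x
print(np.round(np.diag(comm), 12))   # i on the first five states, -5i on the last
```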
For fermionic variables we begin with the classical Majorana fermion in the Weyl
representation [48]
$$\psi_W = \left( \zeta_2^*, -\zeta_1^*, \zeta_1, \zeta_2 \right), \tag{42}$$
replace the classical Grassmann variables ζ, ζ* by fermionic creation and annihilation
operators f, f†, and transform to the Majorana representation. The final result is a quantum
hermitian Majorana spinor
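$$\psi_a = \frac{1+i}{2\sqrt{2}} \begin{pmatrix} -f_a^1 - i f_a^2 + i f_a^{1\dagger} + f_a^{2\dagger} \\ +i f_a^1 - f_a^2 - f_a^{1\dagger} + i f_a^{2\dagger} \\ -f_a^1 + i f_a^2 + i f_a^{1\dagger} - f_a^{2\dagger} \\ -i f_a^1 - f_a^2 + f_a^{1\dagger} + i f_a^{2\dagger} \end{pmatrix}, \tag{43}$$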
which satisfies Eq. (40) due to (39). Other choices of fermionic creation and annihilation
operators are also possible [6,7,15].
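That the spinor of Eq. (43) is hermitian and satisfies the anticommutation relations of Eq. (40) can be checked numerically for a single color. The sketch below assumes a 4-dimensional Fock space for the two modes f¹, f², represented by 2 × 2 spin matrices with a Jordan–Wigner phase on the second mode; the matrix representation and variable names are choices made here for illustration.

```python
import numpy as np

sm = np.array([[0, 1], [0, 0]], dtype=complex)   # lowers |1> to |0>
sz = np.diag([1.0, -1.0]).astype(complex)        # (-1)^F for one mode
id2 = np.eye(2, dtype=complex)

f1 = np.kron(sm, id2)    # first mode, no phase needed
f2 = np.kron(sz, sm)     # second mode, phase from the first mode
f1d, f2d = f1.conj().T, f2.conj().T

c = (1 + 1j) / (2 * np.sqrt(2))
psi = [                  # the four components of Eq. (43) for one colour
    c * (-f1 - 1j * f2 + 1j * f1d + f2d),
    c * (1j * f1 - f2 - f1d + 1j * f2d),
    c * (-f1 + 1j * f2 + 1j * f1d - f2d),
    c * (-1j * f1 - f2 + f1d + 1j * f2d),
]

def anticommutator(a, b):
    return a @ b + b @ a

hermitian = all(np.allclose(s, s.conj().T) for s in psi)
clifford = all(
    np.allclose(anticommutator(psi[i], psi[j]), (i == j) * np.eye(4))
    for i in range(4) for j in range(4)
)
print(hermitian, clifford)   # True True
```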
The next step is to define an empty state and its, computer based, algebraic
representation
$$|(0, 0, 0), (0, 0, 0), (0, 0, 0), (0, 0, 0), (0, 0, 0)\rangle \tag{44}$$
$$\leftrightarrow \{1, \{1\}, \{\{0, 0, 0\}, \{0, 0, 0\}, \{0, 0, 0\}, \{0, 0, 0\}, \{0, 0, 0\}\}\}, \tag{45}$$
where the first three vectors (in color) specify bosonic, and the last two fermionic,
occupation numbers.
The Jordan–Wigner transformation requires additional specification in this case. As
before, the action of fai and fai† on elementary states is defined by the spin-like raising
and lowering operators corrected by the non-local Jordan–Wigner phases
$$f_A = \prod_{B<A} (-1)^{F_B}\, \sigma_A^-, \qquad f_A^\dagger = \prod_{B<A} (-1)^{F_B}\, \sigma_A^+. \tag{46}$$
However, since individual states are now labeled by a double index A, one must define the
ordering of the two-dimensional indices. We choose the lexicographic order. If A = (a, α)
and B = (b, β) then
$$\prod_{B<A} = \prod_{\beta<\alpha,\; b\le 3}\; \prod_{\beta=\alpha,\; b<a}. \tag{47}$$
Any other unambiguous definition of the ordering is admissible.
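The ordering prescription can be sketched directly on occupation lists. The example below assumes 0-based indices, a color index a = 0, 1, 2 and a spin index α = 0, 1, and flattens the double index as α·3 + a so that modes with smaller α come first and, at equal α, modes with smaller a come first; the function names and the tuple representation are hypothetical and do not reproduce the Mathematica code.

```python
N_COLOURS, N_SPINS = 3, 2

def linear_index(a, alpha):
    """Position of mode A = (a, alpha): spin index first, then colour."""
    return alpha * N_COLOURS + a

def create(state, a, alpha):
    """Apply f_A^dagger to an occupation tuple; return (sign, new_state) or None."""
    pos = linear_index(a, alpha)
    if state[pos] == 1:
        return None                       # Pauli blocking
    sign = (-1) ** sum(state[:pos])       # Jordan-Wigner phase from occupied modes B < A
    return sign, state[:pos] + (1,) + state[pos + 1:]

empty = (0,) * (N_COLOURS * N_SPINS)

s1, st1 = create(empty, 0, 0)
s12, st12 = create(st1, 2, 1)             # f+_(2,1) f+_(0,0) |0>
s2, st2 = create(empty, 2, 1)
s21, st21 = create(st2, 0, 0)             # f+_(0,0) f+_(2,1) |0>

print(st12 == st21, s1 * s12, s2 * s21)   # same state, opposite signs
```

The opposite signs obtained for the two orders of application reproduce the anticommutation of the creation operators.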
There are many ways to construct bases in higher-dimensional systems. Apart from
theoretical requirements one must also take into account practical limitations of computer
implementation. Of course the number of states at finite cut-off is generally bigger for
higher D. However, the theory also has more symmetries and consequently the number of
states in a particular channel can be kept manageable.
The first important symmetry is the conservation of the number of Majorana fermions
$$[F, H] = 0, \qquad F = f_a^{i\dagger} f_a^i. \tag{48}$$
That is, the vector interaction HF cannot produce Majorana pairs in this model. As a
consequence the whole Hilbert space splits into seven sectors of fixed F = 0, 1, . . . , 6.
Second, the system has the particle-hole symmetry
F ↔ 6 − F, (49)
therefore the first four sectors F = 0, 1, 2, 3 contain all information about the spectrum.
Further, the gauge invariance requires that our basis is built only from the gauge
invariant creators. All this is not different from the earlier D = 2 case.
The new element is the SO(3) invariance, with the angular momentum Eq. (34), which
can be used to split the problem further into different channels of fixed J. However,
the more we specify the basis, the more complicated its vectors become, and consequently more
computer time goes into the generation of such a basis and subsequent calculation of
matrix elements. Similarly one might contemplate using the reduced version of Lorentz
invariant composite operators as possible creators of a basis. Again this would generate
more complex basis since fields contain both creation and annihilation operators. Also,
Lorentz covariance is not that relevant in our fixed frame, Hamiltonian formulation.
Taking all of the above into account, we have decided to produce the simplest basis of gauge
invariant vectors using elementary creation operators. To this end we proceed as follows.
Consider each fermionic sector separately, i.e., fix the fermionic number 0 ≤ F ≤ 3. At
given F create all independent vectors with a fixed number of bosonic quanta $B = a_a^{i\dagger} a_a^i$,
and define a cut-off as the maximal number of bosonic quanta, B ≤ Ncut, independently for
each F. Hence, Ncut can depend on F. To create all independent, gauge invariant states at
fixed F and B, consider all possible contractions of color indices in a creator of (F, B) order
$$a_{a_1}^{i_1\dagger} \cdots a_{a_B}^{i_B\dagger}\, f_{b_1}^{\sigma_1\dagger} \cdots f_{b_F}^{\sigma_F\dagger}, \tag{50}$$
for all values of the spatial indices i and σ . All color contractions fall naturally into dif-
ferent gauge invariant classes. Creators from different classes differ by color contractions
between bosonic and fermionic operators. For example,
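$$a_a^{i\dagger} a_a^{j\dagger} a_b^{k\dagger} a_b^{l\dagger} f_c^{\sigma\dagger} f_c^{\rho\dagger} \tag{51}$$
and
$$a_a^{i\dagger} a_a^{j\dagger} a_b^{k\dagger} a_c^{l\dagger} f_c^{\sigma\dagger} f_b^{\rho\dagger} \tag{52}$$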
belong to different gauge invariant classes according to our definition. Another example
involves odd number of operators where one “contraction” consists of one triple of color
Table 4
Sizes of the bases generated in each fermionic sector, F. Ns is the number of basis vectors with a given number of
bosonic quanta, B, while Σ gives the cumulative size up to B. The last column gives the difference between the
total number of the bosonic and fermionic states in all seven sectors
          F = 0        F = 1        F = 2        F = 3
B        Ns    Σ      Ns    Σ      Ns    Σ      Ns    Σ     B − F
0         1    1       –    –       1    1       4    4       0
1         –    1       6    6       9   10       6   10       0
2         6    7       6   12      21   31      42   52       0
3         1    8      36   48      63   94      56  108       0
4        21   29      36   84     111  205     192  300       0
5         6   35     126  210     240  445     240  540       0
6        56   91     126  336
7        21  112
8       126  238
jmax        8           11/2          6           11/2
11 For the SU(2) gauge group, this step can be generalized to higher SU(N ) groups.
12 By the time of printing the whole Table 4 has been filled out [51].
as well as the angular momentum content of the energy eigenstates. For example, the last
row of Table 4 gives the highest angular momentum which can be constructed in each
sector with the bases generated so far.
5.4. Results
The spectrum of the theory is shown in Fig. 5. A sample of states is labeled with
their angular momenta. It is important that our cut-off preserves the SO(3) symmetry.
Consequently, the basis described in previous section contains complete representations
of the rotation group for any Ncut . Hence, the spectrum displayed in Fig. 5 has appropriate
degeneracies for each value of the angular momentum quantum number j .
To maintain some clarity of the figure we have cut arbitrarily the upper part of the
spectrum, which extends to about Emax ∼ 35 with current Ncut . One expects that the
individual higher states have a considerable dependence on Ncut . However, they also carry
some relevant information, e.g., for quantum averages.
Apart from relating the bases in corresponding fermionic sectors, particle-hole symme-
try also implies equality of all corresponding energy levels. Since our cut-off respects this
symmetry we indeed observe, in sectors with F = 6, 5, 4, exact repetition of the eigenval-
ues from F = 0, 1, 2 subspaces. Therefore, only first four sectors are displayed in Fig. 5.
Evidently the spectrum of D = 4 theory is very rich and raises many interesting questions.
We shall discuss some of them, beginning with the relation to the already known results.
Fig. 6. First three energy levels in the F = 0 sector of D = 4 SYMQM for different cut-offs. Solid lines show the
0-volume results of Ref. [32].
Fig. 7. Cut-off dependence of the lower levels of the D = 4 SYMQM in four independent fermionic channels.
asymptotic of the wave function. We have observed this in a simple anharmonic oscillator
and other models. This regularity is also clearly confirmed in the Wess–Zumino and D = 2
SYMQM models discussed in previous sections.
Taking this into account we claim that the low energy spectrum of D = 4 SYMQM is
discrete in F = 0, 1, 5, 6 and continuous in the F = 2, 3, 4 sectors. This is an interesting
quantification of the result of Ref. [8] which was mentioned earlier. Since the fermionic
modes are crucial to provide the continuous spectrum, it is natural that it does not show up in
the sectors where they do not exist at all, or are largely frozen out by Pauli blocking.
The second result, which is evident from Fig. 7, concerns the supersymmetric vacuum in
this model. Assuming that the eigenenergies have indeed approximately converged in
F = 0, 1 sectors (none of them to zero) it follows that the SUSY vacuum cannot be in
empty and filled sectors (F = 1, 5 is also ruled out by the angular momentum).13 The
obvious candidates are the lowest states in F = 2, 4 sectors with their energy consistent
with zero at infinite Ncut , cf. Fig. 7. It follows, together with the conjectured correlation
between localizability and Ncut dependence, that the SUSY vacuum in this model is non-
normalizable. Further evidence is based on the structure of the supersymmetric multiplets,
and will be presented in the next section. Very recently van Baal studied F = 2 sector in
a more complex, non-compact, supersymmetric model of the same family [35]. Although
our results cannot be directly compared, due to the specific boundary conditions required in
[35], he finds that the energy of the lowest state in F = 2 sector is indeed very close to zero.
The deviations from exact zero are caused by the SUSY violating boundary conditions.14
It is important to note that not all states in F = 2, 3, 4 sectors have to belong to the
continuum. Supersymmetry together with the discrete spectrum in F = 0, 1 channels
implies existence of the normalizable states among the continuum of F = 2, 3, 4 states.
13 In early attempts, empty and filled sectors were considered as possible candidates for SUSY vacuum.
14 See Ref. [52] for more recent comparison.
This will be discussed later, here we only give one explicit example. Indeed the energy of
the F = 2, j = 1 state, shown in Fig. 7 (flatter curve beginning at E = 6), has definitely
weaker dependence on Ncut than the others. We interpret this as a signature that the
lowest |2_F, 1_j⟩ state is localized. This situation may be a precursor of the more complex
phenomenon expected in the D = 10 theory. There, the zero energy, localized bound state
of D0 branes should exist at the threshold of the continuous spectrum. Present example
suggests that one way to distinguish such a state from the continuum may be by the
different Ncut dependence.
5.4.3. Supersymmetry
Operator level To check the supersymmetry algebra
BN , BN Ncut − 2. (59)
This limits the number of matrix elements available for the test. Still many interesting
predictions for lower states can be verified and should be satisfied exactly at finite Ncut .
One more property of supersymmetry generators should be kept in mind. Since they
change F by 1, they move between different fermionic sectors. Hence, the generic matrix
equation (57), when restricted to a particular sector, reads (α = β, no sum)
SUSY on the level of states The next goal is to identify supersymmetric multiplets in the
spectrum and to assess the effect of the cut-off on their splittings and mixings. It is clear
from Fig. 5 that any direct search for approximately degenerate states is difficult and may
not be conclusive. However our construction provides another simple way to move within
the SUSY multiplets. Namely, it is sufficient to use the explicit matrix representation of
supersymmetry generators in bases summarized in Table 4. If supersymmetry was exact,
acting with Q on an eigenstate of H , would generate states within the same multiplet. At
finite Ncut this is no longer true; however, we expect that the resulting state, Q|Ψ⟩ say, will
be spread around the appropriate multiplet member (or members) in another fermionic
sector. This information will then be correlated with the spectrum of Fig. 5. Important
simplification comes from the rotational symmetry, which is exact at every Ncut in this
approach. Supersymmetric generators carry the angular momentum 1/2 which clearly
limits the range of possible targets Q|Ψ⟩. Indeed, we confirm that
$$\langle F, j' | Q | F \pm 1, j \rangle = 0 \quad \text{if} \quad |j' - j| \neq \frac{1}{2}. \tag{61}$$
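The procedure of acting with Q on an eigenstate and asking how the image is spread over states of the neighbouring sector can be sketched as follows. Everything in this example is a placeholder: the small matrix `Q`, the vector `psi` and the orthonormal `targets` are toy data chosen only to show the bookkeeping, not matrix elements of the model.

```python
import numpy as np

def transition_fractions(Q, psi, targets):
    """Fractions |<t|Q|psi>|^2 / ||Q psi||^2 for each column t of `targets`."""
    image = Q @ psi
    norm2 = np.vdot(image, image).real
    overlaps = targets.conj().T @ image
    return np.abs(overlaps) ** 2 / norm2

# Toy data: Q maps a 3-dimensional sector into a 4-dimensional one.
Q = np.array([[1.0, 0.0, 0.0],
              [0.0, 2.0, 0.0],
              [0.0, 0.0, 0.5],
              [1.0, 0.0, 0.0]])
psi = np.array([1.0, 0.0, 0.0])
targets = np.eye(4)        # stand-in for energy eigenvectors of the target sector

fractions = transition_fractions(Q, psi, targets)
print(fractions)           # half of the image lands on the first target, half on the last
```

With a complete orthonormal set of targets the fractions sum to one, which is the sense in which a percentage of a state "goes to" a given multiplet.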
To proceed, we denote the kth energy eigenvector in the mth fermionic sector and the
nth angular momentum channel by |m_F, n_j; k⟩. The subscript j will be omitted where
evident. A summary of various transitions we have analyzed is shown in Fig. 8. For
example, the action of Q1 on the first state in F = 0 sector gives
15 This is because we use the limited basis Eq. (59), which guarantees Eq. (57) exactly.
The next (in energy) state, |1_F, 3/2; 1⟩, goes under Q_1 in 30% to the |2_F, 1_j; 1, 2, 3⟩ triplet,
with the rest shared among higher F = 2 states. In fact, the |2_F, 1_j; 1, 2, 3⟩ states are those
with the fast Ncut convergence, which we have already pointed out earlier as good candidates
for localized states. Very little of |1_F, 3/2; 1⟩ goes back to the F = 0 sector (cf. fat head
arrows in Fig. 8)
16 All surrounding states have already converged within few percent which suggests that the above small rate
is also a O(1/Ncut ) effect.
Fig. 9. Witten index of the D = 4, supersymmetric Yang–Mills quantum mechanics for the number of bosonic
quanta bounded by Ncut = 2, . . . , 5. The bulk value 1/4 is also shown.
the lowest state ‖Q_1 |2_F, 0_j; 1⟩‖² / dim[3_F]. Indeed, it is smaller than that for other states
and falls with Ncut . However, it is too early to draw definite conclusions.
Clearly we have only begun. The aim here is just to introduce the new approach and its
potential,17 leaving specific applications for more focused articles.
Witten index The total number of bosonic and fermionic states is the same for each B,
cf. last column of Table 4. This is not a direct consequence of supersymmetry, but rather
of a combinatorics of binary fermionic systems, which is not spoiled by gauge invariance.
Total number of states gives the Witten index at T = 0. Hence, we expect
$$\lim_{N_{cut} \to \infty} I_W(0, N_{cut}) = 0. \tag{65}$$
As already discussed, the Witten index is discontinuous at T = 0. In particular IW (0)
depends on the regularization, therefore, this number is tied to our B − F symmetric
scheme of increasing the basis.
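The counting statements can be checked against the entries of Table 4. The sketch below copies the Ns columns for B = 0, ..., 5, reading a dash as zero, recomputes the cumulative sizes Σ, and forms the difference between the number of states with even and odd F in all seven sectors, using particle-hole symmetry to supply F = 4, 5, 6. The variable names are illustrative.

```python
from itertools import accumulate

# N_s from Table 4 for B = 0..5 ("–" read as 0); F = 4, 5, 6 mirror F = 2, 1, 0.
ns = {
    0: [1, 0, 6, 1, 21, 6],
    1: [0, 6, 6, 36, 36, 126],
    2: [1, 9, 21, 63, 111, 240],
    3: [4, 6, 42, 56, 192, 240],
}
sigma = {F: list(accumulate(v)) for F, v in ns.items()}

def boson_minus_fermion(B):
    """Number of even-F states minus number of odd-F states at fixed B."""
    bosonic = 2 * ns[0][B] + 2 * ns[2][B]     # F = 0, 6 and F = 2, 4
    fermionic = 2 * ns[1][B] + ns[3][B]       # F = 1, 5 and F = 3
    return bosonic - fermionic

differences = [boson_minus_fermion(B) for B in range(6)]
print(sigma[3])        # [4, 10, 52, 108, 300, 540]
print(differences)     # [0, 0, 0, 0, 0, 0]
```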
Fig. 9 shows the (Euclidean) time dependence of the Witten index calculated from our
spectrum with up to five bosonic quanta. Clearly we are far from the convergence, but some
interesting signatures can be already observed. First, one can see how the discontinuity
at T = 0 emerges even at this early stage. Second, the exponential fall-off common to
all curves at large (T > 5) times is evident. This is where the lowest state dominates.
Since at this Ncut its energy is not zero, the fall-off is exponential. Finally, and most
interestingly, we observe the flattening shoulder appearing at T ∼ 2–3. This is the signal
of the supersymmetric cancellations which occur on the average, even though the exact
SUSY correspondence between individual states does not yet appear. The behavior with
Ncut is not inconsistent with the exact bulk value 1/4 [13] obtained also from the non-
Abelian integrals [14,16,17]. Clearly higher Ncut are needed, and, what is important, they
are perfectly within the range of present computers.
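The large-T fall-off follows directly from writing the index as a signed sum over levels. As a sketch, let E_0(Ncut) > 0 be the lowest energy at a given cut-off, d_0 its degeneracy and F_0 its fermion number; d_0 and F_0 are notation introduced here for illustration, and it is assumed that all d_0 states share the same parity of F. Then

```latex
I_W(T, N_{cut}) = \sum_i (-1)^{F_i}\, e^{-T E_i}
\;\simeq\; (-1)^{F_0}\, d_0\, e^{-T E_0(N_{cut})}
\qquad (T \to \infty).
```

If E_0(Ncut) tends to zero with increasing cut-off, the region of exponential decay is pushed to larger and larger T.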
17 Only Hamiltonian methods are capable of providing such detailed and quantitative characteristics as
discussed in this section.
6. Summary
A new approach to quantum mechanical problems is proposed. The Hilbert space of
quantum states is algebraically implemented in computer code. In the particular realization
used here, states are represented as Mathematica lists. All basic operations on quantum
states are mirrored as definite operations on these lists. Any quantum observable is
represented as a well-defined function on these lists. This allows for automatic calculation
of the matrix representation of a Hamiltonian and other quantum operators of interest. To this
end we use the discrete eigenbasis of the operators of the numbers of quanta. The length
of the lists is not fixed. Like the length of an arbitrary quantum combination of
basic states, it can vary dynamically, allowing for any number of quanta.
Of course in any finite computer the maximal length of a combination must be limited.
Therefore, we impose a cut-off Ncut which bounds from above the number of allowed
quanta. The stringent and quantitative test of this approach is provided by checking the
dependence of any physical observable, e.g., the spectrum or the wave functions, on
the cut-off. This is similar to studying the infinite volume statistical systems via the
finite size scaling. We have applied the above technique to three progressively more
complicated systems: Wess–Zumino quantum mechanics, supersymmetric Yang–Mills
quantum mechanics in D = 1 + 1 dimensions, and D = 4 SYM quantum mechanics, both
for the SU(2) gauge group. In distinction from many other approaches (for example, lattice
simulations) the method is completely insensitive to the sign problem and works equally
well for bosonic and fermionic systems.
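The cut-off test described above can be illustrated on the simplest example, a single anharmonic oscillator. The sketch below is written in Python with NumPy rather than in Mathematica, and the Hamiltonian p²/2 + x²/2 + λx⁴ with λ = 1 is a choice made here for illustration. The matrices of x and p are built in the basis with at most Ncut quanta, and the lowest eigenvalue is followed as Ncut grows.

```python
import numpy as np

def ground_energy(n_cut, lam=1.0):
    """Lowest eigenvalue of p^2/2 + x^2/2 + lam*x^4 with at most n_cut quanta."""
    dim = n_cut + 1
    a = np.diag(np.sqrt(np.arange(1, dim)), k=1)   # annihilation operator
    x = (a + a.T) / np.sqrt(2)
    p = (a - a.T) / (1j * np.sqrt(2))
    h = (p @ p + x @ x) / 2 + lam * np.linalg.matrix_power(x, 4)
    return np.linalg.eigvalsh(h)[0]

energies = {n: ground_energy(n) for n in (10, 20, 40, 60)}
for n, e in energies.items():
    print(n, e)
```

For a localized state such as this one the value settles quickly as Ncut increases, which is the behaviour used in the text as a signature of a discrete spectrum.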
For Wess–Zumino quantum mechanics we can calculate the discrete spectrum for any
values of parameters. Clear restoration of the supersymmetry was observed for the cut-offs
well within the capabilities of present computers. Witten index was also obtained and its
convergence to the known result IW (T ) = 2 is clearly seen.
The next system, D = 2 SYM QM, possesses the gauge invariance which was readily
incorporated in our approach. The physical subspace of gauge invariant states was
explicitly constructed. Known structure of the solutions in terms of four fermionic sectors
was reproduced. Convergence of the method, and emergence of the supersymmetry, was
also studied in this more difficult case of the continuum spectrum. Witten index, restricted
to one supersymmetric branch of the model, was defined and computed for the first time.
Clear, albeit slow, convergence to the time-independent fractional value $I_W^R(T) = 1/2$ was
observed. Moreover, we have found a special scheme of increasing the basis such that the
supersymmetry is exact at any finite cut-off. This however, may be related to the exact
solubility of the model.
Finally, the method was applied to the hitherto unsolved D = 4 SYMQM. This
much richer system has the SO(3) rotational symmetry inherited from its space extended
predecessor. Our approach preserves this symmetry exactly at any finite cut-off. The
Hilbert space splits again into seven sectors with fixed fermionic number. We have obtained
the complete spectra in all these sectors and studied their cut-off dependence. The spectrum
in F = 0 sector agrees with the classical 0-volume calculation for pure Yang–Mills
quantum mechanics [32]. An efficient method to distinguish between the discrete and
continuous spectrum was proposed. It turns out that the asymptotics of the wave function at
large distances determines the convergence of our calculations with the number of allowed
quanta Ncut. The continuous spectrum with non-localized wave functions converges slowly
(∼ O(1/Ncut)), while discrete, localized bound states lead to faster (sometimes even
exponential) convergence. Accordingly, we have found evidence that the spectrum of
D = 4 SYM QM is discrete in F = 0, 1, 5, 6 sectors while it is continuous in the F = 2, 3, 4
sectors. This provides an explicit realization of the claim of Ref. [8] in particular fermionic
sectors. Interestingly, localized states exist also in the sectors with continuous spectrum.
This is a simple consequence of supersymmetry whose transformations move between
adjacent fermionic sectors. Our method allows one to monitor directly the action of SUSY
generators and analyse supersymmetric images of any state. In this way we have identified
some candidates for lowest supersymmetric multiplets. They do not have the same energy
at current values of the cut-off, however the splittings are small and consistent with
vanishing at infinite Ncut . In particular, the 0-volume glueballs found in F = 0 sector
are relevant to fully supersymmetric theory in that there exist gluino–gluon bound states
with the same masses. We have also found a candidate for the supersymmetric vacuum
which seems to belong to the continuum. However, identification of SUSY multiplets in
the continuum part of the spectrum requires higher Ncut . Supersymmetry was also tested
on the operator level. In particular we confirmed that the spectrum of the Hamiltonian
coincides with that of $Q_1^2$. As a final application we have calculated the Witten index for
this theory. It still depends strongly on Ncut. Nevertheless, early evidence for some of its
asymptotic properties can already be seen.
7. Future prospects
The main goal of this work is to assess the feasibility of attacking the BFSS model of
M-theory with the new approach. Certainly the method is more efficient for a smaller number
of degrees of freedom. Application to the Wess–Zumino quantum mechanics and D = 2
Yang–Mills quantum mechanics gives quite satisfactory and quantitative results, including
the new intriguing scheme which preserves exact supersymmetry for finite cut-off. For
more complex, and correspondingly richer, D = 4 SYMQM the method is able to provide
new results including detailed information about the supersymmetric structure of the
spectrum and the observables. Obviously we also see room for improvement, which is
especially needed in the continuum sector. However, and that we find most important,
further increase of the size of the Fock space is possible within the available technology.
The whole programme can be (and is being) implemented in the standard, compiler based
languages which usually improves performance by a factor 10–100. Further, the action
of the quantum operators can be optimized taking into account symmetries of the states.
Finally, one can go for more powerful computers.
Taking all above into account, one can reasonably expect substantial improvement in
the quality of the present D = 4, N = 2 results. At the same time one should be able to
study D = 4 systems with higher N . To answer the main question: yes, we think that the
quantitative study of the D = 10 theory is feasible, and reaching current quality results
for the D = 10 is realistic, beginning with the SU(2) gauge group. D = 10 Hamiltonian
and spin(9) generators do not conserve fermionic number [7,15]. The reason for this
complication should be better understood in the first place.
Apart from increasing the number of degrees of freedom, higher gauge groups pose
an interesting problem of constructing gauge invariant states. This has already been done
in the small volume approach for SU(3) [49], and should not present any fundamental
problem for higher N as well. At the same time the possibility of some large N
simplification, specifically within the present approach, should be investigated.
Last, but not least, we would like to mention a host of applications of this method to
other quantum mechanical systems. For example, one may simply extend the 0-volume
calculations to the full non-supersymmetric QCD with dynamical quarks in fundamental
representation. Apart from known glueballs, this would give us a spectrum of quark-made-
hadrons in the “femtouniverse”.
As another, rather different, application we mention that this program is already
being used as a routine for solving, to high precision, the quantum mechanics of simple
two-dimensional building blocks of a prototype quantum computer [50]. In particular,
a complete quantum evolution in time with a time-dependent Hamiltonian was
straightforward to simulate.
The D = 4, N = 2 SYMQM studied here has 15 degrees of freedom. There are many
unsolved quantum mechanical systems of this or smaller complexity. The present approach
should be applicable in some of these cases.
Obviously there are many routes which can be followed from this point and we are
looking forward to exploring some of them.
By the time this article went to print, a faster recursive method of calculating matrix
elements had been developed [53].
Acknowledgements
I would like to thank C.M. Bender for an instructive discussion which inspired this
approach. I also thank P. van Baal, P. Breitenlohner, L. Hadasz, M. Rostworowski, H. Saller
for intensive discussions. This work is supported by the Polish Committee for Scientific
Research under the grant no. PB 2P03B01917.
References
[1] T. Banks, W. Fischler, S. Shenker, L. Susskind, Phys. Rev. D 55 (1997) 6189, hep-th/9610043.
[2] E. Witten, Nucl. Phys. B 185/188 (1981) 513.
[3] F. Cooper, A. Khare, U. Sukhatme, Phys. Rep. 251 (1995) 267, hep-th/9405029.
[4] M. Claudson, M.B. Halpern, Nucl. Phys. B 250 (1985) 689.
[5] S. Samuel, Phys. Lett. B 411 (1997) 268, hep-th/9705167.
[6] U.H. Danielsson, G. Ferretti, B. Sundborg, Int. J. Mod. Phys. A 11 (1996) 5463, hep-th/9603081.
[7] M.B. Halpern, C. Schwartz, Int. J. Mod. Phys. A 13 (1998) 4367, hep-th/9712133.
[8] B. de Wit, M. Lüscher, H. Nicolai, Nucl. Phys. B 320 (1989) 135.
[9] H. Nicolai, R. Helling, hep-th/9809103, in: Trieste 1998, Non-perturbative aspects of strings, branes and
supersymmetry, pp. 29–74.
[10] M. Lüscher, Nucl. Phys. B 219 (1983) 233.
[11] J. Polchinski, String Theory, Cambridge Univ. Press, Cambridge, 1998.
[12] P. Yi, Nucl. Phys. B 505 (1997) 307, hep-th/9704098.
[13] S. Sethi, M. Stern, Commun. Math. Phys. 194 (1998) 675, hep-th/9705046.
[14] A.V. Smilga, Nucl. Phys. B 266 (1986) 45.
[15] V.G. Kac, A.V. Smilga, Nucl. Phys. B 571 (2000) 515, hep-th/9908096.
[16] G. Moore, N. Nekrasov, S. Shatashvili, Commun. Math. Phys. 209 (2000) 77, hep-th/9803265.
[17] M.B. Green, M. Gutperle, JHEP 01 (1998) 005, hep-th/9711107.
[18] F. Sugino, Int. J. Mod. Phys. A 14 (1999) 3979, hep-th/9904122.
[19] W. Krauth, H. Nicolai, M. Staudacher, Phys. Lett. B 431 (1998) 31, hep-th/9803117.
[20] W. Krauth, M. Staudacher, Nucl. Phys. B 584 (2000) 641, hep-th/0004076.
[21] D. Kabat, G. Lifschytz, Nucl. Phys. B 571 (2000) 419, hep-th/9910001.
[22] D. Kabat, G. Lifschytz, D.A. Lowe, Phys. Rev. D 64 (2001) 124015, hep-th/0105171.
[23] E. Martinec, hep-th/9909049, in: Cargèse 1999, Progress in string theory and brane theory, pp. 117–145.
[24] R.A. Janik, J. Wosiek, Acta Phys. Pol. B 32 (2001) 2143, hep-th/9903121.
[25] P. Bialas, J. Wosiek, Nucl. Phys. B (Proc. Suppl.) 106 (2002) 968, hep-lat/0111034.
[26] T. Eguchi, H. Kawai, Phys. Rev. Lett. 48 (1982) 1063.
[27] A. Gonzalez-Arroyo, J. Jurkiewicz, C.P. Korthals Altes, in: J. Honerkamp, et al. (Eds.), Proceedings of 11th
NATO Summer Institute, Plenum, New York, 1982.
[28] N. Ishibashi, H. Kawai, Y. Kitazawa, A. Tsuchiya, Nucl. Phys. B 498 (1997) 467.
[29] H. Aoki, et al., Prog. Theor. Phys. Suppl. 134 (1999) 47, hep-th/9908038.
[30] J. Ambjorn, et al., JHEP 0007 (2000) 011, hep-th/0005147.
[31] J. Ambjorn, K.N. Anagnostopoulos, A. Krasnitz, JHEP 0106 (2001) 069, hep-ph/0101309.
[32] M. Lüscher, G. Münster, Nucl. Phys. B 232 (1984) 445.
[33] P. van Baal, Acta Phys. Pol. B 20 (1989) 295.
[34] P. van Baal, in: M. Shifman (Ed.), in: At the Frontiers of Particle Physics—Handbook of QCD, Boris Ioffe
Festschrift, Vol. 2, World Scientific, Singapore, 2001, p. 683, hep-ph/0008206.
[35] P. van Baal, hep-th/0112072, in: M. Olshanetsky, A. Vainstein (Eds.), The Michael Marinov Memorial
Volume Multiple Facets of Quantization and Supersymmetry, World Scientific, in press.
[36] C.M. Bender, et al., Phys. Rev. D 32 (1985) 1476.
[37] C.M. Bender, K.A. Milton, Phys. Rev. D 34 (1986) 3149.
[38] J. Kogut, L. Susskind, Phys. Rev. D 11 (1975) 395.
[39] Y. Matsumura, N. Sakai, T. Sakai, Phys. Rev. D 52 (1995) 2446.
[40] J.R. Hiller, S. Pinsky, U. Trittmann, hep-th/0112151.
[41] J.R. Hiller, S.S. Pinsky, U. Trittmann, hep-th/0106193.
[42] M.A. Shifman, ITEP Lectures on Particle Physics and Field Theory, World Scientific, Singapore, 1999.
[43] P. Jordan, E.P. Wigner, Z. Phys. 47 (1928) 631.
[44] J.D. Bjorken, S.D. Drell, Relativistic Quantum Fields, McGraw–Hill, New York, 1965.
[45] A. Nakamura, F. Palumbo, Phys. Lett. B 135 (1984) 96.
[46] J. Trzetrzelewski, J. Wosiek, in preparation.
[47] C. Itzykson, J.-B. Zuber, Quantum Field Theory, McGraw–Hill, New York, 1980.
[48] S. Weinberg, The Quantum Theory of Fields III—Supersymmetry, Cambridge Univ. Press, Cambridge,
2000.
[49] P. Weisz, V. Ziemann, Nucl. Phys. B 284 (1987) 157.
[50] V. Corato, et al., cond-mat/0205514.
[51] J. Kotanski, J. Wosiek, hep-lat/0208067, in: Proceedings of the XX Symposium on Lattice Field Theory,
June 2002, MIT, Cambridge, MA, in press.
[52] J. Wosiek, hep-th/0204243, in: Proceedings of the NATO Workshop QCD, Stara Lesna, Slovakia, January
2002, in press.
[53] M. Campostrini, J. Wosiek, hep-th/0209140.
Nuclear Physics B 644 (2002) 113–127
www.elsevier.com/locate/npe
Received 13 June 2002; received in revised form 8 August 2002; accepted 2 September 2002
Abstract
We study the closed and open supermembranes on the maximally supersymmetric pp-wave
background. In the framework of the membrane theory, the superalgebra is calculated by using the
Dirac bracket and we obtain its central extension by surface terms. The result supports the existence
of the extended objects in the membrane theory in the pp-wave limit. When the central terms are
discarded, the associated algebra completely agrees with that of Berenstein–Maldacena–Nastase
matrix model. We also discuss the open supermembranes on the pp-wave and elaborate the possible
boundary conditions.
© 2002 Elsevier Science B.V. All rights reserved.
1. Introduction
0550-3213/02/$ – see front matter 2002 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 ( 0 2 ) 0 0 7 9 4 - 0
114 K. Sugiyama, K. Yoshida / Nuclear Physics B 644 (2002) 113–127
backgrounds. In particular, by taking a certain limit called the Penrose limit [13,14], the
maximally supersymmetric pp-wave solution can be obtained from the AdS4 × S7 or
AdS7 × S4 backgrounds [15]. A maximally supersymmetric background of type IIB
supergravity has also been found recently [16], and it has been shown that the Green–Schwarz
type IIB superstring theory on this pp-wave is exactly solvable [17–19]. The pp-wave
background used in [17–19] can also be obtained by taking the Penrose limit of
AdS5 × S5 [16].
The fact that the maximally supersymmetric pp-wave backgrounds are obtained by
taking the Penrose limit of AdS backgrounds led to the work [20], in which IIB
string theory on the pp-wave is used to investigate the AdS/CFT correspondence [21,
22] at the level of a string-theoretic analysis. This is an exactly solvable model with a nontrivial
string background, and it provides an interesting arena for studying the properties of strings with
background fluxes. Moreover, the matrix model on the maximally supersymmetric pp-wave
has been proposed from considerations of superparticles [20]. The action of the
matrix model on the pp-wave has also been derived directly from the membrane theory on
the maximally supersymmetric pp-wave [23].
In this paper we consider the closed and open supermembranes on the eleven-
dimensional maximally supersymmetric pp-wave background. We calculate the super-
charges and associated algebra. In the case of the flat space, the correspondence of the
superalgebra has been shown [24]. We will show this correspondence on the pp-wave. In
contrast with the algebra in the matrix model on the pp-wave, surface terms are included in
our membrane case. The resulting algebra is the central extension of the superalgebra on
the pp-wave and we can discuss the extended objects contained in the membrane theory on
the pp-wave.
Next we discuss the boundary conditions for the open supermembrane on the pp-wave
by calculating surface terms under the variations of the supersymmetry transformations.
In the case of flat background, the open supermembrane can end on the p-dimensional
hypersurface only for the values p = 1, 5 and 9. However, we show that some additional
surface terms arise in the pp-wave case and only the value p = 1 is allowed for the open
supermembrane on the pp-wave.
This paper is organized as follows. In Section 2, as a short review we provide an
explanation of the action of the supermembrane and supersymmetries on the maximally
supersymmetric pp-wave background. In Section 3 we will calculate the supercharges
and associated algebra by the use of the Dirac bracket procedure. In order to discuss
the extended objects, we carefully analyze the surface terms. In Section 4, the boundary
conditions for the open supermembranes on the pp-wave are considered. Section 5 is
devoted to conclusions and discussion. Our notation is summarized in the appendix.
by
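\[
ds^2 = -2\,dx^+ dx^- + G_{++}\,(dx^+)^2 + \sum_{\mu=1}^{9} (dx^\mu)^2,
\]
\[
G_{++} \equiv -\left[\left(\frac{\mu}{3}\right)^2 \left(x_1^2 + x_2^2 + x_3^2\right) + \left(\frac{\mu}{6}\right)^2 \left(x_4^2 + \cdots + x_9^2\right)\right], \tag{2.1}
\]
where the constant 4-form flux in the $+, 1, 2, 3$ directions,
\[
F_{+123} = \mu \quad (\mu \neq 0), \tag{2.2}
\]
is turned on.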
The Lagrangian of the supermembrane on the maximally supersymmetric pp-wave is given
as the sum of $L_0$ and the Wess–Zumino term $L_{WZ}$,
\[
L = L_0 + L_{WZ}, \qquad L_0 = -\sqrt{-g(X, \theta)}, \tag{2.3}
\]
where the induced metric gij is given by
and the supervielbein Π A and covariant derivative Di θ for θ are defined by using vielbein
eµ̂r̂ and spin connection ωr̂ ŝ
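\[
iM^2 = 2\left(T_{\hat r}{}^{\hat s\hat t\hat u\hat v}\theta\right) F_{\hat s\hat t\hat u\hat v}\left(\bar\theta\,\Gamma^{\hat r}\right)
 - \frac{1}{288}\left(\Gamma_{\hat r\hat s}\theta\right)\left[\bar\theta\left(\Gamma^{\hat r\hat s\hat t\hat u\hat v\hat w} F_{\hat t\hat u\hat v\hat w} + 24\,\Gamma_{\hat t\hat u} F^{\hat r\hat s\hat t\hat u}\right)\right]. \tag{2.10}
\]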
When we take the light-cone gauge in the Penrose limit, $M^2 = 0$ is satisfied. In addition,
$D\theta$ reduces to the simple form
\[
D\theta = d\theta + e^+\, T_+{}^{+123}\,\theta\, F_{+123}, \tag{2.11}
\]
and the supervielbeins simplify dramatically, although the action in the AdS background
has nontrivial interaction terms. In this gauge the Wess–Zumino term $L_{WZ}$ reads
\[
L_{WZ} = \frac{1}{6}\,\epsilon^{ijk} C_{\hat\mu\hat\nu\hat\rho}\,\partial_i X^{\hat\mu}\partial_j X^{\hat\nu}\partial_k X^{\hat\rho}
 + \frac{i}{2}\,\epsilon^{ijk}\,\bar\theta\,\Gamma_{\hat\mu\hat\nu} D_i\theta\left[\Pi_j^{\hat\mu}\Pi_k^{\hat\nu} + i\,\Pi_j^{\hat\mu}\,\bar\theta\,\Gamma^{\hat\nu} D_k\theta - \frac{1}{3}\,\bar\theta\,\Gamma^{\hat\mu} D_j\theta\;\bar\theta\,\Gamma^{\hat\nu} D_k\theta\right]. \tag{2.12}
\]
Here the supervielbeins on the maximally supersymmetric pp-wave are given by Eqs. (2.5)
and (2.11). $C_{\hat\mu\hat\nu\hat\rho}$ is the 3-form potential, whose field strength is given by Eq. (2.2).
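The above supermembrane action is difficult to analyze directly, so we rewrite the
Lagrangian (2.3), following the work [3], in the light-cone gauge in terms of an SO(9) spinor $\psi$:
\[
\begin{aligned}
w^{-1} L ={}& \frac{1}{2} D_\tau X^r D_\tau X^r - \frac{1}{4}\{X^r, X^s\}^2
 - \frac{1}{2}\left(\frac{\mu}{3}\right)^2 \sum_{I=1}^{3} X_I^2
 - \frac{1}{2}\left(\frac{\mu}{6}\right)^2 \sum_{I'=4}^{9} X_{I'}^2 \\
& - \frac{\mu}{6}\sum_{I,J,K=1}^{3} \epsilon_{IJK} X^K \{X^I, X^J\}
 + i\psi^T\gamma^r\{X_r, \psi\}
 + i\psi^T D_\tau\psi + i\frac{\mu}{4}\psi^T\gamma_{123}\psi.
\end{aligned} \tag{2.13}
\]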
We used the convention $P_0^+ = 1$. Here $\tau$ is the time coordinate on the worldvolume
and $\{\ ,\ \}$ is the Lie bracket defined with an arbitrary function $w(\sigma)$ of the worldvolume spatial
coordinates $\sigma^a$ ($a = 1, 2$),
\[
\{A, B\} \equiv \frac{1}{w}\,\epsilon^{ab}\,\partial_a A\,\partial_b B \quad (a, b = 1, 2),
\]
with $\partial_a = \partial/\partial\sigma^a$. This theory also has a large residual gauge symmetry, the area-
preserving diffeomorphism (APD), and the covariant derivative for this gauge symmetry
is defined with a gauge connection $\omega$ by
\[
D_\tau X^r \equiv \partial_\tau X^r - \{\omega, X^r\}. \tag{2.14}
\]
In this model, if we replace the variables in the Lagrangian (2.13) according to the rules
\[
X(\xi^i) \to X(\tau), \qquad
\psi(\xi^i) \to \psi(\tau), \qquad
\int d^2\sigma\, w(\sigma) \to \mathrm{Tr}, \qquad
\{\ ,\ \} \to -i[\ ,\ ],
\]
we obtain the matrix model on the pp-wave [20], starting from the Lagrangian of the
supermembrane on the maximally supersymmetric pp-wave.
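As a rough illustration of how these replacement rules act, the sketch below applies $\{\ ,\ \} \to -i[\ ,\ ]$ term by term to the Lagrangian (2.13) and the covariant derivative (2.14). It assumes the coefficients of (2.13) as quoted above; the overall normalization of the trace and the ordering conventions for the fermions are not fixed here and should be taken from [20].

```latex
\begin{align}
-\tfrac{1}{4}\{X^r, X^s\}^2 &\;\to\; +\tfrac{1}{4}[X^r, X^s]^2, \\
-\tfrac{\mu}{6}\,\epsilon_{IJK} X^K \{X^I, X^J\} &\;\to\; +\tfrac{i\mu}{6}\,\epsilon_{IJK} X^K [X^I, X^J], \\
i\psi^T\gamma^r\{X_r, \psi\} &\;\to\; \psi^T\gamma^r[X_r, \psi], \\
D_\tau X^r = \partial_\tau X^r - \{\omega, X^r\} &\;\to\; \partial_\tau X^r + i[\omega, X^r].
\end{align}
% Collecting the terms under the trace gives, schematically,
\begin{align}
L_{\text{matrix}} = \mathrm{Tr}\Big[ & \tfrac{1}{2} D_\tau X^r D_\tau X^r + \tfrac{1}{4}[X^r, X^s]^2
 - \tfrac{1}{2}\big(\tfrac{\mu}{3}\big)^2 \sum_{I=1}^{3} X_I^2
 - \tfrac{1}{2}\big(\tfrac{\mu}{6}\big)^2 \sum_{I'=4}^{9} X_{I'}^2 \nonumber\\
 & + \tfrac{i\mu}{6}\sum_{I,J,K=1}^{3}\epsilon_{IJK} X^K [X^I, X^J]
 + \psi^T\gamma^r[X_r, \psi] + i\psi^T D_\tau\psi + \tfrac{i\mu}{4}\psi^T\gamma_{123}\psi \Big].
\end{align}
```

Since the commutator of two Hermitian matrices is anti-Hermitian, the term $+\tfrac14 \mathrm{Tr}[X^r, X^s]^2$ is negative semi-definite, so the quartic potential of the matrix model stays non-negative, as in the membrane case.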
We have taken the light-cone gauge, so the original symmetries are not manifest,
but the Lagrangian (2.13) still has residual supersymmetries,
\[
\delta_\epsilon X^r = 2\psi^T\gamma^r\epsilon(\tau), \qquad \delta_\epsilon\omega = 2\psi^T\epsilon(\tau),
\]
\[
\delta_\epsilon\psi = -iD_\tau X^r\gamma_r\epsilon(\tau) + \frac{i}{2}\{X^r, X^s\}\gamma_{rs}\epsilon(\tau)
 + i\frac{\mu}{3}\sum_{I=1}^{3} X^I\gamma_I\gamma_{123}\epsilon(\tau)
 - i\frac{\mu}{6}\sum_{I'=4}^{9} X^{I'}\gamma_{I'}\gamma_{123}\epsilon(\tau),
\]
\[
\epsilon(\tau) = \exp\left(\frac{\mu}{12}\gamma_{123}\tau\right)\epsilon_0 \quad (\epsilon_0\text{: constant spinor}). \tag{2.15}
\]
These transformation rules are the 16 linearly realized supersymmetries on the maximally
supersymmetric pp-wave. In the limit $\mu \to 0$, we recover the supersymmetry
transformations in flat space. In the context of eleven-dimensional supersymmetry,
these correspond to the dynamical supersymmetry. The Lagrangian (2.13) has another 16
nonlinearly realized supersymmetries,
\[
\delta_\eta X^r = 0, \qquad \delta_\eta\omega = 0, \qquad \delta_\eta\psi = \eta(\tau),
\]
\[
\eta(\tau) = \exp\left(-\frac{\mu}{4}\gamma_{123}\tau\right)\eta_0 \quad (\eta_0\text{: constant spinor}). \tag{2.16}
\]
These correspond to the kinematical supersymmetry in the eleven-dimensional theory.
We first derive the supercharges for the supersymmetries (2.15) and (2.16), and then
study the associated superalgebra using the Dirac bracket. We discuss the extended
objects on the pp-wave from the viewpoint of the central charges of the superalgebra.
The supercharges $Q^+$ and $Q^-$ of the linearly and nonlinearly realized supersymmetries,
respectively, are obtained as Noether charges,
\[
Q^+ = \int d^2\sigma\, w\left[-2e^{-\frac{\mu}{12}\gamma_{123}\tau}\left(D_\tau X^r\gamma_r\psi + \frac{1}{2}\{X^r, X^s\}\gamma_{rs}\psi
 + \frac{\mu}{3}\sum_{I=1}^{3} X^I\gamma_I\gamma_{123}\psi + \frac{\mu}{6}\sum_{I'=4}^{9} X^{I'}\gamma_{I'}\gamma_{123}\psi\right)\right], \tag{3.1}
\]
\[
Q^- = \int d^2\sigma\, w\left[-2i\,e^{\frac{\mu}{4}\gamma_{123}\tau}\psi\right] = -2i\,e^{\frac{\mu}{4}\gamma_{123}\tau}\psi_0, \tag{3.2}
\]
where $\psi_0$ is the zero mode of $\psi$ and we have used the normalization $\int d^2\sigma\, w(\sigma) = 1$.
Next we calculate the superalgebra satisfied by (3.1) and (3.2). The supermem-
brane theory contains the fermionic field $\psi_\alpha$, and this leads to the second class constraint
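\[
\begin{aligned}
& - i\sum_{I,J=1}^{3}\int d^2\sigma\,\partial_a S^a_{IJ}\left(\gamma^{IJ} e^{-\frac{\mu}{3}\gamma_{123}\tau}\right)_{\alpha\beta}
 - i\sum_{I',J'=4}^{9}\int d^2\sigma\,\partial_a S^a_{I'J'}\left(\gamma^{I'J'} e^{-\frac{\mu}{3}\gamma_{123}\tau}\right)_{\alpha\beta} \\
& - 2i\sum_{I=1}^{3}\sum_{I'=4}^{9}\int d^2\sigma\,\partial_a S^a_{II'}\left(\gamma^{II'} e^{-\frac{\mu}{6}\gamma_{123}\tau}\right)_{\alpha\beta},
\end{aligned} \tag{3.12}
\]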
I =1 I =4
1 1 + T
i √ Q+ α , √ Q β
2 2 DB
µ IJ µ
3 9
= 2H δαβ + M0 (γI J γ123 )αβ − M0I J (γI J γ123)αβ
3 6
I,J =1 I ,J =4
3
9
µ
−2 d 2 σ ϕXI γ I αβ − 2 d 2 σ ϕXI γ I e 6 γ123 τ αβ
I =1 I =4
3
9
µ
+2 2
d σ ∂a SIa γ I
αβ
+2 d 2 σ ∂a SIa γ I e 6 γ123 τ αβ
I =1 I =4
3
9
+2 d 2 σ ∂a SIaJ I J γ I J I J αβ
I,J =1 I ,J =4
9
+2 d 2 σ ∂a SIa J K L γ I J K L αβ
I ,J ,K ,L =4
3 9
µ
+2 d 2 σ ∂a SIaJ KI γ I J KI e 6 γ123 τ αβ
I,J,K=1 I =4
3
9
µ
+2 d 2 σ ∂a SIaI J K γ I I J K e 6 γ123 τ αβ
I =1 I ,J ,K =4
3
9
+ 2µ d 2 σ ∂a UJaKI J γ J K γ I J αβ
J,K=1 I ,J =4
9
µ
+ 2µ d 2 σ ∂a UIa γ I γ123e 6 γ123 τ αβ . (3.13)
I =4
Here $M^{IJ}$ and $M^{I'J'}$ are defined by
\[
M^{IJ} \equiv X^I P^J - P^I X^J - \frac{1}{2} S\gamma^{IJ}\psi, \tag{3.14}
\]
\[
M^{I'J'} \equiv X^{I'} P^{J'} - P^{I'} X^{J'} - \frac{1}{2} S\gamma^{I'J'}\psi, \tag{3.15}
\]
and the SO(3) × SO(6) Lorentz generators $M_0^{IJ}$ and $M_0^{I'J'}$ are given by
\[
M_0^{IJ} \equiv \int d^2\sigma\, M^{IJ}, \tag{3.16}
\]
\[
M_0^{I'J'} \equiv \int d^2\sigma\, M^{I'J'}. \tag{3.17}
\]
2 We can absorb the factor $1/\sqrt{2}$ in front of the supercharges into the definition of the fermion $\psi$.
The above superalgebra also includes some central charges. These charges indicate
the existence of extended objects in the supermembrane theory on the maximally
supersymmetric pp-wave. First, the charges $S^a_{rs}$ and $S^a_r$ correspond to the transverse M2-
brane (D2-brane in type IIA string theory) and the longitudinal M2-brane (fundamental string
in type IIA string theory), respectively. Next, $S^a_{rstu}$ corresponds to the longitudinal M5-
brane charge (D4-brane in type IIA string theory). As is well known, these charges appear
in the supermembrane theory on flat eleven-dimensional Minkowski space. In addition,
in our supermembrane theory the superalgebra includes the additional central charges
$U^a_{JKI'J'}$ and $U^a_{I'}$. We have not established the physical interpretation of these extra
extended objects, which live only on the pp-wave. They might be related to the fuzzy membrane
and giant graviton discussed in [20], or to another new extended object due to a certain kind
of Myers effect on the pp-wave [27,28].
In the case of the open membrane, whose worldvolume has a boundary
in the spatial directions $\sigma^1$ and $\sigma^2$, the surface terms do not vanish automatically.
Thus we must treat the total derivative terms arising under the above
supersymmetry transformations with care, and consider the boundary conditions required for the
surface terms to vanish. Let us recall that membrane p-branes are allowed for p = 1, 5
and 9 in the flat background owing to the boundary conditions [29,30]. The M5-brane
corresponds to p = 5. The case of p = 9 is related to “the end of the world” in the works of
Hořava and Witten [31].
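In our pp-wave case, we obtain the total derivative terms for the linear supersymmetry
(2.15) explicitly,
\[
\begin{aligned}
& w\left\{X^r,\; D_\tau X^s\,\psi^T\gamma_s\gamma_r\epsilon(\tau) + \frac{1}{2}\{X^s, X^t\}\,\psi^T\gamma_{st}\gamma_r\epsilon(\tau)\right\} \\
& - \frac{\mu}{3}\sum_{I,J=1}^{3} w\left\{X^I,\; X^J\psi^T\gamma_I\gamma_J\gamma_{123}\epsilon(\tau)\right\}
 - \frac{\mu}{6}\sum_{I',J'=4}^{9} w\left\{X^{I'},\; X^{J'}\psi^T\gamma_{I'}\gamma_{J'}\gamma_{123}\epsilon(\tau)\right\} \\
& + \frac{\mu}{3}\sum_{I=1}^{3}\sum_{I'=4}^{9} w\left\{X^{I'},\; X^{I}\psi^T\gamma_{I'I}\gamma_{123}\epsilon(\tau)\right\}
 - \frac{\mu}{6}\sum_{I=1}^{3}\sum_{I'=4}^{9} w\left\{X^{I},\; X^{I'}\psi^T\gamma_{I'I}\gamma_{123}\epsilon(\tau)\right\},
\end{aligned} \tag{4.1}
\]
and for the nonlinear supersymmetry (2.16), the corresponding term is
\[
w\left\{X^r,\; i\eta\gamma_r\psi\right\}. \tag{4.2}
\]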
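This surface term for the nonlinear supersymmetry has the same form as in flat space.
However, some additional terms proportional to $\mu$ appear for the linear supersymmetry in
addition to the surface terms in the flat background. The variations of the action under the
linear and nonlinear supersymmetry transformations can be written as
\[
\delta S = \delta_\epsilon S + \delta_\epsilon^{(\mu)} S + \delta_\eta S,
\]
\[
\delta_\epsilon S = -\int d\tau\int_{\partial\Sigma} d\xi\;\partial_t X^r\cdot\left[D_\tau X^s\,\psi^T\gamma_s\gamma_r\epsilon(\tau) + \frac{1}{2}\{X^s, X^t\}\,\psi^T\gamma_{st}\gamma_r\epsilon(\tau)\right], \tag{4.3}
\]
\[
\begin{aligned}
\delta_\epsilon^{(\mu)} S = -\int d\tau\int_{\partial\Sigma} d\xi\,\Bigg[
& - \frac{\mu}{3}\sum_{I,J=1}^{3}\partial_t X^I\cdot X^J\psi^T\gamma_I\gamma_J\gamma_{123}\epsilon(\tau)
 - \frac{\mu}{6}\sum_{I',J'=4}^{9}\partial_t X^{I'}\cdot X^{J'}\psi^T\gamma_{I'}\gamma_{J'}\gamma_{123}\epsilon(\tau) \\
& + \frac{\mu}{3}\sum_{I=1}^{3}\sum_{I'=4}^{9}\partial_t X^{I'}\cdot X^{I}\psi^T\gamma_{I'I}\gamma_{123}\epsilon(\tau)
 - \frac{\mu}{6}\sum_{I=1}^{3}\sum_{I'=4}^{9}\partial_t X^{I}\cdot X^{I'}\psi^T\gamma_{I'I}\gamma_{123}\epsilon(\tau)\Bigg],
\end{aligned} \tag{4.4}
\]
\[
\delta_\eta S = -i\int d\tau\int_{\partial\Sigma} d\xi\;\partial_t X^r\cdot\eta(\tau)\gamma_r\psi, \tag{4.5}
\]
where $\partial\Sigma$ is the boundary of the open supermembrane worldvolume and $\xi$ is the
coordinate along the tangent direction of the boundary. The tangential derivative
$\partial_t$ and normal derivative $\partial_n$ on the boundary are defined by
\[
\partial_t X^r \equiv \epsilon^{ab} n_a\partial_b X^r, \tag{4.6}
\]
\[
\partial_n X^r \equiv n^a\partial_a X^r. \tag{4.7}
\]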
Here $n^a$ is the unit vector in the normal direction on the boundary. We would like to
consider the p-dimensional hypersurface (membrane p-brane) on which supermembranes
can end, and investigate the condition under which such a surface can exist. First, following the
discussion of the p-brane in string theory, the boundary conditions for our membrane are
classified as
\[
\text{Neumann:}\quad \partial_n X^m = 0 \quad (m = 0, 10 \text{ and some } p - 1 \text{ coordinates}), \tag{4.8}
\]
\[
\text{Dirichlet:}\quad \partial_t X^m = 0 \quad (m = \text{other } 10 - p \text{ coordinates}). \tag{4.9}
\]
By applying these boundary conditions to (4.3) and (4.5), we obtain the constraints
\[
\eta_0^T\gamma_m\psi = \epsilon_0^T\gamma_m\gamma_n\psi = \epsilon_0^T\gamma_m\gamma_n\gamma_n\psi = 0. \tag{4.10}
\]
These are the same conditions as in the flat case, and (4.10) leads to
the well-known results p = 1, 5 and 9. However, in the pp-wave case we also need to take
account of the constraints coming from the additional surface terms (4.4).
\[
P_-\psi = 0 \tag{4.12}
\]
is at our disposal. Then we can write $\psi$ as $\psi = P_+\psi$. First, the second equation in
(4.10) implies $P_+\epsilon_0 = 0$. Next, we can see from the third equation in (4.10) that 9 − p
should be even. As a result, p = 1, 5 and 9 are allowed in the flat case for the boundary
hypersurface. However, the story does not end here, because additional boundary terms exist
in the case of the pp-wave. We can check whether the additional surface terms (4.4)
vanish in each of the cases p = 1, 5 and 9. In the p = 1 case we can immediately see
that the additional terms (4.4) vanish. Here, it can be seen from the constraints (4.10) that
only an even number of gamma matrices with Neumann indices and an arbitrary number
of gamma matrices with Dirichlet indices are allowed to appear between $\epsilon_0^T$ and $\psi$.
Equivalently, an odd number of gamma matrices with Neumann indices cannot appear
between $\epsilon_0^T$ and $\psi$. However, the expression (4.4) shows that such a condition
cannot be satisfied in the cases p = 5 and 9, because there are inevitably several terms
including an odd number of Neumann components. In conclusion, only p = 1 is allowed for
the membrane p-brane on the pp-wave, and membrane p-branes with p = 5 and 9 cannot exist.
This result is also plausible from the viewpoint of the chirality matrix [32]:
the flux is turned on in the 1, 2, 3 directions on the pp-wave, and so the SO(4) and
SO(8) chirality, which is important for the p = 5 and p = 9 cases, cannot be respected. The
reason why the p = 1 case is allowed is unknown, since the physical meaning of p = 1 has not
been well understood.
In the above discussion, we have assumed a 1/2 BPS boundary hypersurface,
that is, a flat boundary hypersurface. However, such flat boundaries presumably
cannot exist, because the pp-wave background is curved. Possibly, a curved hypersurface
such as that discussed in [33] might become the boundary of the supermembrane. However, we do not
know how to treat such curved boundaries, and do not discuss this case here.
In this paper, we have studied the supercharges and their associated algebra. In particular,
by treating the surface terms carefully, we have derived the central extension of the
superalgebra. Apart from the central charges, the superalgebra completely agrees with that of
the matrix model on the pp-wave. The central charges obtained in our derivation reproduce
the flat space results in the limit µ → 0, and also include some additional ones. We have
not established the physical interpretation of the additional central charges. These seem to
indicate extra extended objects arising from a kind of Myers effect on the pp-wave
background.
Moreover, we have discussed the boundary conditions of the open supermembrane on
the maximally supersymmetric pp-wave background. It is well known that membrane
p-branes in flat space are allowed to exist only for p = 1, 5 and 9. In the pp-wave
case, stricter constraints on such hypersurfaces arise, and only the value p = 1
is allowed. In our discussion, we have not included the 2-form which can couple to the
boundary hypersurface. By turning on the 2-form on the boundary, it might be possible for
5- and 9-dimensional hypersurfaces to exist as boundaries of open supermembranes
on the pp-wave.
In this paper, we have used the SO(9) formulation for simplicity, but it would also be
interesting to work in the SO(10, 1) covariant formulation, where the nature of the longitudinal
components is clear and a more definite analysis would be possible. We leave this
for future work.
Acknowledgements
The work of K.S. is supported in part by the Grant-in-Aid from the Ministry of
Education, Science, Sports and Culture of Japan (# 14740115).
Appendix A
A.1. Notation
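\[
\Gamma^\mu = \gamma^\mu\otimes\sigma_3 = \begin{pmatrix}\gamma^\mu & 0\\ 0 & -\gamma^\mu\end{pmatrix} \quad \text{(real symmetric)},
\]
\[
\Gamma^0 = 1\otimes(-i\sigma_2) = \begin{pmatrix}0 & -I_{16}\\ I_{16} & 0\end{pmatrix} \quad \text{(real skew symmetric)},
\]
\[
\Gamma^{10} = 1\otimes\sigma_1 = \begin{pmatrix}0 & I_{16}\\ I_{16} & 0\end{pmatrix} \quad \text{(real symmetric)},
\]
\[
\Gamma^\pm \equiv \frac{1}{\sqrt{2}}\left(\Gamma^0 \pm \Gamma^{10}\right), \qquad \left\{\Gamma^+, \Gamma^-\right\} = -2I_{32},
\]
\[
\Gamma^+ = \sqrt{2}\begin{pmatrix}0 & 0\\ I_{16} & 0\end{pmatrix}, \qquad
\Gamma^- = \sqrt{2}\begin{pmatrix}0 & -I_{16}\\ 0 & 0\end{pmatrix}.
\]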
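The light-cone relations above can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming only the block forms of Γ0 and Γ10 quoted in this appendix; because every block is a multiple of I16, each block is replaced by a plain number, and the helper names (`light_cone_gamma`, `anticommutator`, `is_close`) are illustrative, not taken from the paper.

```python
import math

# Each entry stands for a multiple of the 16x16 identity I_16, so the
# 32x32 matrices reduce to 2x2 arrays of numbers for this check.
GAMMA_0 = [[0.0, -1.0], [1.0, 0.0]]   # 1 (x) (-i sigma_2)
GAMMA_10 = [[0.0, 1.0], [1.0, 0.0]]   # 1 (x) sigma_1


def matmul(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def light_cone_gamma(sign):
    """Gamma^+ for sign=+1 and Gamma^- for sign=-1: (Gamma^0 +/- Gamma^10)/sqrt(2)."""
    return [[(GAMMA_0[i][j] + sign * GAMMA_10[i][j]) / math.sqrt(2.0)
             for j in range(2)] for i in range(2)]


def anticommutator(a, b):
    """Return ab + ba."""
    ab, ba = matmul(a, b), matmul(b, a)
    return [[ab[i][j] + ba[i][j] for j in range(2)] for i in range(2)]


def is_close(a, b, tol=1e-12):
    """Entry-wise comparison of two 2x2 matrices."""
    return all(abs(a[i][j] - b[i][j]) < tol for i in range(2) for j in range(2))
```

For example, `anticommutator(light_cone_gamma(+1), light_cone_gamma(-1))` should agree with −2 times the identity, and `matmul` of Γ+ with itself should vanish, which is what makes the gauge condition Γ+θ = 0 project out half of the spinor components.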
We take the light-cone gauge and decompose the 32 component SO(10, 1) spinor θ in
terms of SO(9) spinor ψ with 16 components as follows:
X+ = τ, Γ +θ = 0 (θ̄ Γ + = 0),
1 0
⇒ θ = 1/4 ,
2 w ψ
1
θ̄ = θ T −Γ 0 = − 1/4 ψ T , 0 .
2 w
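In the light-cone gauge, there are several useful identities:
\[
\bar\theta\,\Gamma^{\hat r}\partial_i\theta = 0 \quad (\text{for } \hat r \neq -), \qquad
\bar\theta\,\Gamma_{rs}\partial_i\theta = 0, \qquad
\bar\theta\,\Gamma^{+r}\partial_i\theta = 0, \qquad
\bar\theta\,\Gamma^{+-}\partial_i\theta = 0.
\]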
In the pp-wave background, the vielbein is calculated as
+ − − + 1
eµ̂r̂ : e+ = e− = 1, e+ = 0,
= − G++ , eµr = δµr ,
e−
4
µ̂ + − + − 1 µ µ
er̂ : e+ = e− = 1, e− = 0, e+ = G++ , er = δr ,
4
eµ̂r̂ : e++ = e+r = e−r = eµ+ = eµ− = 0,
1
e+− = e−+ = −1, e−− = − G++ , eµr = δ µr ,
4
eµ̂r̂ : eµ+ = eµ− = e−− = e+r = e−r = 0,
1
e+− = e−+ = −1, e++ = + G++ , eµr̂ = δµr ,
4
and the spin connection is evaluated as
1
ωr̂ ŝ ≡ ωµ̂r̂ ŝ dx µ̂ ⇒ ωr− = ∂ r G++ dx + , otherwise = 0,
4
1
⇒ ω = ∂ µ G++ dx + , otherwise = 0.
µ̂
ωµ̂ν̂ ≡ er̂ eŝν̂ ωr̂ ŝ µ−
4
References
[30] B. de Wit, K. Peeters, J. Plefka, Open and closed supermembranes with winding, Nucl. Phys. (Proc.
Suppl.) 68 (1998) 206, hep-th/9710215.
[31] P. Hořava, E. Witten, Heterotic and type I string dynamics from eleven dimensions, Nucl. Phys. B 460 (1996)
506, hep-th/9510209;
P. Hořava, E. Witten, Eleven-dimensional supergravity on a manifold with boundary, Nucl. Phys. B 475
(1996) 94, hep-th/9603142.
[32] K. Becker, M. Becker, Boundaries in M-theory, Nucl. Phys. B 472 (1996) 221, hep-th/9602071.
[33] D. Bak, Supersymmetric branes in pp-wave background, hep-th/0204033.
Nuclear Physics B 644 (2002) 128–150
www.elsevier.com/locate/npe
Received 9 August 2002; received in revised form 30 August 2002; accepted 9 September 2002
Abstract
We study type IIA string theories on the pp-waves with 24 supercharges. The type IIA pp-
wave backgrounds are derived from the maximally supersymmetric pp-wave solution in eleven
dimensions through the toroidal compactification on the spatial isometry directions. The associated
actions of type IIA strings are obtained by using these metrics and other background fields of the
type IIA supergravities on the one hand. On the other hand, we derive these theories from D = 11
supermembrane on the pp-wave via double-dimensional reduction for the spatial isometry directions.
The resulting actions agree with those of type IIA strings obtained in the study of the supergravities.
Also, the action of the matrix string is written down. Moreover, the quantization of closed and open
strings is discussed. In particular, we study Dp-branes allowed in one of the type IIA theories.
© 2002 Elsevier Science B.V. All rights reserved.
Keywords: Supermembrane; Matrix theory; M-theory; pp-wave; Double-dimensional reduction; Matrix string
1. Introduction
0550-3213/02/$ – see front matter 2002 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 ( 0 2 ) 0 0 8 2 0 - 9
K. Sugiyama, K. Yoshida / Nuclear Physics B 644 (2002) 128–150 129
Table 1
Maximal and less supersymmetric pp-waves: the less supersymmetric pp-waves are obtained by compactifications
of the maximal pp-waves and T-duality. The circles indicate known solutions, ×'s indicate that no such
solutions exist, and blanks indicate that it is not yet known whether such solutions exist. The superscript “∗”
denotes that there are no supersymmetric D-branes. The column labels give the number of supercharges.
SUGRA      16   18   20   22   24   26   28   30   32
11 dim     ◦    ◦    ◦    ◦    ◦    ◦    ×    ×    unique
Type IIA   ◦    ◦    ◦    ◦    ◦    ◦∗   ×    ×
Type IIB   ◦    ◦    ◦    ◦    ◦                   unique
solvable in the Green–Schwarz (GS) formulation [5–7] with a light-cone gauge. This
pp-wave background [4] is also obtained from the AdS5 × S 5 via Penrose limit [3].
With this progress, the intensive studies of strings on the pp-waves were initiated. In
particular, this type IIB string has been combined with the AdS/CFT correspondence
and the almost BPS sector of a large N gauge theory has been studied [8]. Moreover,
the matrix model on the KG background has been proposed [8]. This model is often
referred to as the Berenstein–Maldacena–Nastase (BMN) matrix model. Just as the de Wit–
Hoppe–Nicolai (dWHN) supermembrane [9–11] is closely related to the Banks–Fischler–
Shenker–Susskind (BFSS) matrix model [12] in flat space, the BMN matrix model is
intimately related to a supermembrane on the pp-wave [13–15]. In our previous works
[14,15], we have shown that the algebra of supercharges in the supermembrane theory on
the pp-wave agrees with that of the BMN matrix model in the same manner as the flat
space [16]. We have also discussed BPS conditions in the supermembrane on the pp-wave.
BPS multiplets in the BMN matrix model have also been widely studied [13,17]. Moreover,
the classical solutions of the BMN matrix model and the supermembrane have been intensively
studied [18,19]. In particular, we have recently investigated the quantum stability of giant
gravitons [19], which are classical solutions of the BMN matrix model and exist due to the
presence of the constant 4-form flux [20].
Recently, less supersymmetric type IIB and IIA pp-wave backgrounds,
and strings on these pp-waves, have attracted much attention [21–28]. The maximal and less
supersymmetric pp-wave backgrounds of eleven-dimensional supergravity and of the type IIA
and IIB theories are listed in Table 1 as far as we know. Motivated by these attempts,
we consider the type IIA strings on the pp-waves from two viewpoints in this paper.
On the one hand, we study the type IIA pp-waves and strings from the supergravity side
through the toroidal compactification. On the other hand, we use the double-dimensional
reduction (DDR) [29] for the supermembrane action on the maximally supersymmetric
pp-wave. Both results are equivalent as expected. We show that both compactifications
are done for a spatial isometry direction, which can be found in the same way as in the
type IIB case [21]. When we compactify this spatial direction, 8 supercharges are inevitably
broken. Therefore, the resulting type IIA theory has 24 supercharges, and is not maximally
supersymmetric. The type IIA string on this pp-wave is also exactly-solvable but it is
different from the one obtained from a type IIB string theory via the T-duality [28]. This
comes from the fact that the type IIA pp-wave with 24 supercharges is not unique and
the type IIA pp-wave considered in this paper is different from the one in [28]. Moreover,
the matrix string theory is considered. We also discuss the quantization of closed and open
strings in our type IIA theory. There we study the allowed Dp-branes in the theory. The
values p = 2, 4, 6 and 8 are allowed, but the directions of the D-branes are restricted, as in the
case of the type IIB string on the pp-wave [30–32].
This paper is organized as follows. In Section 2 we consider the type IIA pp-wave
backgrounds and actions of strings from two viewpoints. One is based on the analysis in
the supergravity and the other is based on the double-dimensional reduction. We will show
both results are equivalent. In Section 3 we consider the matrix string on the pp-wave and
formally write down the action of the matrix string from the supermembrane action on
the pp-wave in eleven dimensions. In Section 4 we will discuss the mode-expansions and
quantization of closed and open strings in the type IIA theory. We also discuss Dp-branes
and investigate the allowed value p and the direction of D-branes. Section 5 is devoted to
conclusions and discussions. In Appendix A we will briefly explain the compactification
on an SO(3)-direction. The different points from the SO(6)-case considered in the text are
summarized.
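\[
ds^2 = -2\,dX^+ dX^- + G_{++}(X^I, X^{I'})\,(dX^+)^2 + \sum_{r=1}^{9}(dX^r)^2,
\]
\[
G_{++}(X^I, X^{I'}) \equiv -\left[\left(\frac{\mu}{3}\right)^2\sum_{I=1}^{3}(X^I)^2 + \left(\frac{\mu}{6}\right)^2\sum_{I'=4}^{9}(X^{I'})^2\right], \tag{2.1}
\]
where the constant 4-form flux in the $+, 1, 2, 3$ directions,
\[
F_{+123} = \mu \quad (\mu \neq 0), \tag{2.2}
\]
is turned on. It is the unique pp-wave solution with 32 supercharges in eleven dimensions.
The Killing vectors of the KG solution are constructed as follows [2]:
\[
\xi_{e_+} = -\partial_+, \qquad \xi_{e_-} = \partial_-,
\]
\[
\xi_{e_I} = -\cos\left(\frac{\mu}{3}X^+\right)\partial_I + \frac{\mu}{3}X^I\sin\left(\frac{\mu}{3}X^+\right)\partial_- \quad (I = 1, 2, 3),
\]
\[
\xi_{e_I^*} = -\frac{\mu}{3}\sin\left(\frac{\mu}{3}X^+\right)\partial_I - \left(\frac{\mu}{3}\right)^2 X^I\cos\left(\frac{\mu}{3}X^+\right)\partial_-,
\]
\[
\xi_{e_{I'}} = -\cos\left(\frac{\mu}{6}X^+\right)\partial_{I'} + \frac{\mu}{6}X^{I'}\sin\left(\frac{\mu}{6}X^+\right)\partial_- \quad (I' = 4, \ldots, 9),
\]
\[
\xi_{e_{I'}^*} = -\frac{\mu}{6}\sin\left(\frac{\mu}{6}X^+\right)\partial_{I'} - \left(\frac{\mu}{6}\right)^2 X^{I'}\cos\left(\frac{\mu}{6}X^+\right)\partial_-,
\]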
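As a consistency check, one can verify directly that a vector of this form satisfies the Killing equation for the metric (2.1). The sketch below does this for $\xi_{e_I}$ with the shorthand $a \equiv \mu/3$ (an abbreviation introduced here, not in the source); the case of $\xi_{e_I^*}$ and the primed directions with $a = \mu/6$ works the same way.

```latex
% Metric components: g_{+-} = -1, g_{++} = G_{++}, g_{rs} = \delta_{rs}.
% The only Christoffel symbols needed below are
%   \Gamma^{I}_{++} = -\tfrac12 \partial_I G_{++} = a^2 X^I ,
% and \xi_- = g_{-+}\xi^+ = 0 removes the \Gamma^{-}_{+I} contribution.
\begin{align}
\xi_I &= -\cos(aX^+), \qquad \xi_+ = g_{+-}\,\xi^- = -a X^I \sin(aX^+), \qquad \xi_- = 0, \\
\nabla_+\xi_I + \nabla_I\xi_+ &= \partial_+\xi_I + \partial_I\xi_+
 = a\sin(aX^+) - a\sin(aX^+) = 0, \\
\nabla_+\xi_+ &= \partial_+\xi_+ - \Gamma^{I}_{++}\,\xi_I
 = -a^2 X^I\cos(aX^+) + a^2 X^I\cos(aX^+) = 0 .
\end{align}
```

The remaining components of $\nabla_{(M}\xi_{N)}$ vanish trivially because $\xi_I$ depends only on $X^+$ and nothing depends on $X^-$.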
where the $\Gamma_\mu$ are 32 × 32 gamma matrices and $I \equiv \Gamma_{123}$ obeys $I^2 = -1$. The spinors $\psi_+$
and $\psi_-$ with 32 components satisfy the conditions
\[
\Gamma_+\psi_+ = 0, \qquad \Gamma_-\psi_- = 0, \tag{2.4}
\]
hence they have 16 non-vanishing components. If $X$ is a Killing vector, we can define an
associated Lie derivative $L_X$ on any spinor $\psi$ by
\[
L_X\psi = X^M\nabla_M\psi + \frac{1}{4}\nabla_{[M}X_{N]}\Gamma^{MN}\psi, \tag{2.5}
\]
where $\nabla$ is defined by
\[
\nabla_M \equiv \partial_M + \frac{1}{4}\omega_M{}^{ab}\Gamma_{ab}.
\]
This has the following properties:
1. If $X$ is a Killing vector field, $f$ is any smooth function and $\psi$ is any spinor, then
$L_X(f\psi) = (Xf)\psi + fL_X\psi$.
2. When the symbol “·” (dot) denotes the Clifford action of vector fields on spinors, then
$L_X(Y\cdot\psi) = [X, Y]\cdot\psi + Y\cdot L_X\psi$.
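The Lie derivatives for the Killing vector fields are given by [2]
\[
L_{\xi_{e_-}}(\psi_+, \psi_-) = 0, \qquad
L_{\xi_{e_+}}(\psi_+, \psi_-) = \left(-\frac{\mu}{12}I\psi_+,\; -\frac{\mu}{4}I\psi_-\right),
\]
\[
L_{\xi_{e_I}}(\psi_+, \psi_-) = \left(-\frac{\mu}{6}I\Gamma_I\Gamma_-\psi_+,\; 0\right), \qquad
L_{\xi_{e_{I'}}}(\psi_+, \psi_-) = \left(-\frac{\mu}{12}I\Gamma_{I'}\Gamma_-\psi_+,\; 0\right),
\]
\[
L_{\xi_{e_I^*}}(\psi_+, \psi_-) = \left(-\frac{\mu^2}{18}\Gamma_I\Gamma_-\psi_+,\; 0\right), \qquad
L_{\xi_{e_{I'}^*}}(\psi_+, \psi_-) = \left(-\frac{\mu^2}{72}\Gamma_{I'}\Gamma_-\psi_+,\; 0\right),
\]
\[
L_{\xi_{M_{IJ}}}(\psi_+, \psi_-) = \left(-\frac{1}{2}\Gamma_{IJ}\psi_+,\; 0\right), \qquad
L_{\xi_{M_{I'J'}}}(\psi_+, \psi_-) = \left(-\frac{1}{2}\Gamma_{I'J'}\psi_+,\; 0\right).
\]
Using the above results, we can count the remaining unbroken supersymmetries.
For example, in the case of $\xi_{e_I} + (3/\mu)\xi_{e_J^*}$, we obtain
\[
L_{\xi_{e_I} + (3/\mu)\xi_{e_J^*}}(\psi_+, \psi_-) = \left(-\frac{\mu}{3}Q_{IJ}\Gamma_J\Gamma_-\psi_+,\; 0\right), \qquad
Q_{IJ} \equiv \frac{1}{2}\left(1 + I\Gamma_I\Gamma_J\right). \tag{2.6}
\]
Clearly, 16 spinors are annihilated by $\Gamma_-$. Furthermore, the constant matrix $Q$ plays the
role of a projection operator and so annihilates an additional 8 spinors, in the same manner
as in the type IIB string [21]. As another example, in the case of $\xi_{e_{I'}} + (6/\mu)\xi_{e_{J'}^*}$, the Lie
derivative is given by
\[
L_{\xi_{e_{I'}} + (6/\mu)\xi_{e_{J'}^*}}(\psi_+, \psi_-) = \left(-\frac{\mu}{6}Q_{I'J'}\Gamma_{J'}\Gamma_-\psi_+,\; 0\right), \qquad
Q_{I'J'} \equiv \frac{1}{2}\left(1 + I\Gamma_{I'}\Gamma_{J'}\right). \tag{2.7}
\]
In the same way as in the case of $\xi_{e_I} + (3/\mu)\xi_{e_J^*}$, 24 supersymmetries are preserved.
In conclusion, the above two cases of the type IIA pp-wave backgrounds preserve 24
supersymmetries.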
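The counting above rests on Q being a projector that removes half of the 16 relevant components. A minimal numerical sketch of this statement follows. It assumes a 16 × 16 representation of nine anticommuting gamma matrices (built here in Jordan–Wigner form from Pauli matrices) as a stand-in for the action of Γ1, …, Γ9 on the 16 non-vanishing components of ψ+; the representation and all function names are illustrative choices, not the paper's conventions.

```python
from functools import reduce

S1 = [[0, 1], [1, 0]]
S2 = [[0, -1j], [1j, 0]]
S3 = [[1, 0], [0, -1]]
ID2 = [[1, 0], [0, 1]]


def kron(a, b):
    """Kronecker product of two matrices given as nested lists."""
    return [[x * y for x in ra for y in rb] for ra in a for rb in b]


def matmul(a, b):
    """Matrix product of nested lists."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]


def so9_gammas():
    """Nine mutually anticommuting 16x16 matrices, each squaring to the identity."""
    g = []
    for k in range(4):
        for s in (S1, S2):
            g.append(reduce(kron, [S3] * k + [s] + [ID2] * (3 - k)))
    g.append(reduce(matmul, g))  # ninth matrix: product of the first eight
    return g


def projector(i, j, g):
    """Q_ij = (1 + gamma_123 gamma_i gamma_j) / 2 with 1-based indices, i != j."""
    prod = reduce(matmul, [g[0], g[1], g[2], g[i - 1], g[j - 1]])
    n = len(prod)
    return [[((1 if r == c else 0) + prod[r][c]) / 2 for c in range(n)]
            for r in range(n)]


def is_projector(q, tol=1e-12):
    """Check Q^2 = Q entry by entry."""
    q2 = matmul(q, q)
    n = len(q)
    return all(abs(q2[r][c] - q[r][c]) < tol for r in range(n) for c in range(n))


def trace(q):
    """Trace of a square matrix; for a projector this equals its rank."""
    return sum(q[r][r] for r in range(len(q)))
```

With `g = so9_gammas()`, both `projector(1, 2, g)` (two SO(3) directions) and `projector(4, 5, g)` (two SO(6) directions) satisfy `is_projector` and have trace 8, so each keeps 8 of the 16 components and removes the other 8. Together with the 16 supersymmetries untouched by the compactification, this reproduces the count of 24.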
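\[
X^I = x^I \quad (I = 1, 2, 3), \qquad X^a = x^a \quad (a = 6, 7, 8, 9),
\]
\[
X^4 = x^4\cos\left(\frac{\mu}{6}x^+\right) - x^5\sin\left(\frac{\mu}{6}x^+\right),
\]
\[
X^5 = x^4\sin\left(\frac{\mu}{6}x^+\right) + x^5\cos\left(\frac{\mu}{6}x^+\right), \tag{2.8}
\]
then the metric is rewritten as
\[
ds^2 = -2\,dx^+ dx^- + G_{++}(x^I, x^a)\,(dx^+)^2 + \sum_{r=1}^{9}(dx^r)^2 - \frac{2}{3}\mu\,x^5\,dx^+ dx^4,
\]
\[
G_{++}(x^I, x^a) \equiv -\left[\left(\frac{\mu}{3}\right)^2\sum_{I=1}^{3}(x^I)^2 + \left(\frac{\mu}{6}\right)^2\sum_{a=6}^{9}(x^a)^2\right], \tag{2.9}
\]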
but the constant 4-form flux is still given by Eq. (2.2). We can easily see from the above
metric (2.9) that the $x^4$-direction is a manifest spatial isometry direction [21], and we obtain
the metric of type IIA by the standard technique of dimensional reduction from
eleven-dimensional supergravity to type IIA supergravity in ten dimensions,
\[
ds_{11}^2 = e^{-\frac{2}{3}\phi}\,g_{\mu\nu}\,dx^\mu dx^\nu + e^{\frac{4}{3}\phi}\left(dy + dx^\mu A_\mu\right)^2, \tag{2.10}
\]
where $g_{\mu\nu}$ is the ten-dimensional metric, $A_\mu$ is the Kaluza–Klein gauge field (RR 1-form) and
$\phi$ is the dilaton. The ten-dimensional metric $g_{\mu\nu}$ is given by
\[
g_{\mu\nu}\,dx^\mu dx^\nu = -2\,dx^+ dx^- + g_{++}(x^a, x^b)\,(dx^+)^2 + \sum_{a=1}^{4}(dx^a)^2 + \sum_{b=5}^{8}(dx^b)^2,
\]
\[
g_{++}(x^a, x^b) \equiv -\left[\left(\frac{\mu}{3}\right)^2\sum_{a=1}^{4}(x^a)^2 + \left(\frac{\mu}{6}\right)^2\sum_{b=5}^{8}(x^b)^2\right], \tag{2.11}
\]
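The step from (2.9) to (2.10) and (2.11) amounts to completing the square in $dx^4$. The short worked sketch below assumes a constant dilaton, $\phi = 0$, and identifies the compact coordinate $y$ with $x^4$; the relabeling of $x^5$ as the fourth of the $\mu/3$ directions in ten dimensions is inferred here from the index ranges in (2.11) rather than stated in the extracted text.

```latex
\begin{align}
(dx^4)^2 - \frac{2\mu}{3}\,x^5\,dx^+ dx^4
 &= \left(dx^4 - \frac{\mu}{3}\,x^5\,dx^+\right)^2 - \left(\frac{\mu}{3}\right)^2 (x^5)^2\,(dx^+)^2 , \\
A_+ &= -\frac{\mu}{3}\,x^5 , \\
g_{++} &= G_{++}(x^I, x^a) - \left(\frac{\mu}{3}\right)^2 (x^5)^2
 = -\left[\left(\frac{\mu}{3}\right)^2\Big(\sum_{I=1}^{3}(x^I)^2 + (x^5)^2\Big)
 + \left(\frac{\mu}{6}\right)^2\sum_{a=6}^{9}(x^a)^2\right].
\end{align}
```

The reduced metric therefore has four directions with coefficient $\mu/3$ and four with $\mu/6$, as in (2.11), and the Kaluza–Klein potential carries a constant field strength of magnitude $\mu/3$ along $dx^+ \wedge dx^5$, which becomes $F_{+4}$ after the relabeling.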
where the subscript $a$ denotes the coordinates of the string world-sheet, $\sigma^a = (\tau, \sigma)$, and
$\eta = \mathrm{diag}(-1, 1)$ is the world-sheet metric. $L$ is an arbitrary length parameter. The
convention for the antisymmetric tensor is $\epsilon^{\tau\sigma} = 1$. The ten-dimensional metric
obtained above and the light-cone gauge condition $x^+ = \tau$ lead to the bosonic action
of the type IIA string theory,
\[
S_B = \frac{1}{4\pi\alpha'}\int d\tau\int_0^{2\pi} d\sigma\left[\sum_{i=1}^{8}\left((\partial_\tau x^i)^2 - (\partial_\sigma x^i)^2\right)
 - \left(\frac{\mu}{3}\right)^2\sum_{a=1}^{4}(x^a)^2 - \left(\frac{\mu}{6}\right)^2\sum_{b=5}^{8}(x^b)^2\right], \tag{2.15}
\]
\[
\tau \to L\tau, \qquad \sigma \to L\sigma, \qquad \mu \to \frac{\mu}{L}.
\]
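The action (2.15) describes eight free massive world-sheet scalars, so each Fourier mode $e^{-i\omega\tau + in\sigma}$ on the circle $0 \le \sigma \le 2\pi$ obeys $\omega^2 = n^2 + m^2$ with $m = \mu/3$ or $\mu/6$. A minimal sketch of this mode counting, assuming the rescaled form of (2.15) quoted above (the function names are illustrative only):

```python
import math


def boson_masses(mu):
    """World-sheet masses read off from (2.15): four bosons with mu/3, four with mu/6."""
    return [mu / 3.0] * 4 + [mu / 6.0] * 4


def mode_frequency(n, mass):
    """Frequency of the n-th Fourier mode of a free scalar with the given mass."""
    return math.sqrt(n * n + mass * mass)


def spectrum(mu, n_max):
    """Frequencies for modes n = 0..n_max, one list per transverse boson."""
    return [[mode_frequency(n, m) for n in range(n_max + 1)] for m in boson_masses(mu)]
```

For instance, `spectrum(6.0, 2)` gives zero-mode frequencies 2.0 for the first four bosons and 1.0 for the remaining four, which makes the split into two groups of four visible directly.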
1 It has been reported that the numerical coefficients in the covariant derivatives in the type IIA and the type IIB
include some issues [24,28]. But it should be remarked that these are based on the difference of the convention in
Ref. [33], and not on the incorrectness. We thank C.N. Pope for the valuable comment on this point.
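Using this covariant derivative, we can obtain the quadratic fermionic action of the
type IIA string,
\[
S_F = \frac{i}{2\pi}\int d\tau\int_0^{2\pi L} d\sigma\sum_{p,q,r=1}^{2}\left(\eta^{ab}\delta_{pq} - \epsilon^{ab}(\sigma_3)_{pq}\right)\partial_a x^\mu\,\bar\theta^p\gamma_\mu(D_b)_{qr}\theta^r, \tag{2.17}
\]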
\[
x^+ = \tau, \qquad \gamma^+\theta^p = 0,
\]
then in the same way as in the type IIB case [6] the above action can be rewritten as
\[
S_F = -\frac{i}{2\pi}\int d\tau\int_0^{2\pi L} d\sigma\sum_{p,q,r=1}^{2}\bar\theta^p\gamma_+\left[\delta_{pq}(D_\tau)_{qr} + (\sigma_3)_{pq}(D_\sigma)_{qr}\right]\theta^r, \tag{2.18}
\]
where the length parameter should now be fixed as $L = \alpha'|p^+|$. The covariant derivatives
are also rewritten as
1 1
(Dτ )pq = ∂τ δpq + ωµν+ δpq − Hµν+ (σ3 )pq γ µν
4 2
1 1
+ Fµν γ µν + Fµνλδ γ µνλδ (σ1 )pq γ+ ,
2 · 2! 2 · 4!
(Dσ )pq = ∂σ δpq . (2.19)
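When we use the constant 2- and 4-form field strengths $F_{+4} = \mu/3$ and $F_{+123} = \mu$, the fermionic action can be rewritten as
\[
S_F = \frac{i}{2\pi} \int d\tau \int_0^{2\pi} d\sigma \left[ \psi^T \partial_\tau \psi + \psi^T \gamma_9 \partial_\sigma \psi + \frac{\mu}{4}\, \psi^T \left( \gamma_{123} + \frac{1}{3}\gamma_{49} \right) \psi \right], \tag{2.20}
\]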
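By following the work [11] in the light-cone gauge in terms of an SO(9) spinor $\psi$, we can write down the action of the supermembrane on the pp-wave [13,14] as
\[
S = \frac{1}{\ell_M^3} \int d\tau \int_0^{2\pi L} d\sigma \int_0^{2\pi L} d\rho\; \mathcal{L},
\]
\[
w^{-1}\mathcal{L} = \frac{1}{2}\left[ D_\tau X^r D_\tau X^r - \frac{1}{2}\{X^r, X^s\}^2 - \left(\frac{\mu}{3}\right)^2 \sum_{I=1}^{3} X_I^2 - \left(\frac{\mu}{6}\right)^2 \sum_{I=4}^{9} X_I^2 - \frac{\mu}{3} \sum_{I,J,K=1}^{3} \epsilon_{IJK} X_K \{X_I, X_J\} \right]
\]
\[
\qquad\qquad + i\psi^T \gamma^r \{X^r, \psi\} + i\psi^T D_\tau \psi + i\frac{\mu}{4}\psi^T \gamma_{123}\psi, \tag{2.22}
\]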
where $(\sigma^0, \sigma^1, \sigma^2) = (\tau, \sigma, \rho)$ is the set of world-volume coordinates on the membrane and $\{\ ,\ \}$ is a Lie bracket defined, using an arbitrary function $w(\sigma, \rho)$ of the world-volume spatial coordinates $\sigma^a$ ($a = 1, 2$), as follows:
\[
\{A, B\} \equiv \frac{\epsilon^{ab}}{w}\, \partial_a A\, \partial_b B \qquad (a, b = 1, 2),
\]
with $\partial_a = \partial/\partial\sigma^a$. This theory has a $\tau$-independent gauge symmetry called the area-preserving diffeomorphism (APD). It is a residual symmetry belonging to the reparametrization invariance of the membrane world-volume. When we use the gauge connection $\omega$, the covariant derivative for this gauge symmetry is defined by
\[
D_\tau X^r \equiv \partial_\tau X^r - \{\omega, X^r\} \qquad (r = 1, 2, \ldots, 9).
\]
We have also introduced a parameter $\ell_M$, which is the M-theory scale related to the membrane tension $T_M = 1/\ell_M^3$. It is related to the string coupling $g_s$ and the string scale $\ell_s$ in ten-dimensional string theory (up to some numerical constant) by $\ell_M = g_s^{1/3}\ell_s$. We use the normalization
\[
0 \le \sigma \le 2\pi L, \qquad 0 \le \rho \le 2\pi L, \qquad \int d\sigma\, d\rho\; w(\sigma, \rho) = L^2,
\]
with $L$ being an arbitrary length parameter. In our light-cone gauge, the time coordinate $\tau$ is related to $X^+$ as $X^+ = (\ell_M^3/(2\pi L)^2)\, P_0^+ \tau$, and the longitudinal momentum $P^+(\sigma, \rho)$ satisfies $P^+(\sigma, \rho) = (P_0^+/L^2)\, w(\sigma, \rho)$. Hereafter we shall use the convention $P_0^+ = 1$.
Here, we shall consider the double-dimensional reduction (DDR) in the SO(6)-
direction. It is considered that eleven-dimensional supermembrane theory in the flat space
should reduce to the type IIA string theory, at least classically. Based on this fact, we shall
carry out the DDR of the supermembrane on the pp-wave. We will show that the type IIA
string action on the pp-wave obtained in the previous subsection can be derived from the
supermembrane action on the pp-wave (2.22) through the double-dimensional reduction.
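so that $w = (2\pi)^{-2}$ and fix the parameter $L$ as $L = g_s \ell_s$. The resulting action is given by
\[
S_{\rm st} = \frac{1}{2\pi} \int d\tau \int_0^{2\pi} d\sigma\; \mathcal{L}_{\rm st},
\]
\[
\mathcal{L}_{\rm st} = \frac{1}{2\alpha'} \left[ \sum_{i=1}^{8} \left\{ (\partial_\tau x^i)^2 - (\partial_\sigma x^i)^2 \right\} - \left(\frac{\mu}{3}\right)^2 \sum_{a=1}^{4} (x^a)^2 - \left(\frac{\mu}{6}\right)^2 \sum_{b=5}^{8} (x^b)^2 \right]
+ i\psi^T \left[ 1_{16}\cdot\partial_\tau - \gamma_9 \cdot \partial_\sigma + \frac{\mu}{4}\left( \gamma_{123} + \frac{1}{3}\gamma_{49} \right) \right] \psi, \tag{2.25}
\]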
where we have renamed the coordinates $x^4, x^5, \ldots, x^9$ as $x^9, x^4, x^5, \ldots, x^8$. It should be understood that the mass term of $x^4$ arises from the second term in Eq. (2.10). This term describes the effect of the Kaluza–Klein 1-form. Also, the fermionic field and parameters have been appropriately rescaled as
\[
\sigma \to L\sigma, \qquad \tau \to \frac{L}{(2\pi)^2}\,\tau, \qquad \psi \to \frac{\ell_M^{3/2}}{L}\,\psi, \qquad \mu \to \frac{(2\pi)^2}{L}\,\mu.
\]
2 We have modified the contribution of the spin connection in the revised version. This contribution is initially
pointed out in Ref. [34] where the correct type IIA action is obtained.
The parameters of the resulting theory are related to those of M-theory by
\[
\frac{1}{2\pi\alpha'} = \frac{(2\pi)L}{\ell_M^3}, \qquad \ell_M = g_s^{1/3}\ell_s, \qquad L = g_s \ell_s, \qquad \ell_s = 2\pi\sqrt{\alpha'}.
\]
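The four relations above are not independent: substituting $\ell_M = g_s^{1/3}\ell_s$ and $L = g_s\ell_s$ into $2\pi L/\ell_M^3$ gives $2\pi/\ell_s^2$, which equals $1/(2\pi\alpha')$ exactly when $\ell_s = 2\pi\sqrt{\alpha'}$. A minimal numerical sketch of this consistency check follows; the function names and the sample values of $g_s$ and $\alpha'$ are illustrative assumptions, not taken from the paper.

```python
import math

def string_scale(alpha_prime):
    """String scale, using the relation ell_s = 2*pi*sqrt(alpha') quoted above."""
    return 2.0 * math.pi * math.sqrt(alpha_prime)

def tension_relation(g_s, alpha_prime):
    """Return (lhs, rhs) with lhs = 1/(2 pi alpha') and rhs = 2 pi L / ell_M^3."""
    ell_s = string_scale(alpha_prime)
    ell_M = g_s ** (1.0 / 3.0) * ell_s   # M-theory scale
    L = g_s * ell_s                      # compactification length
    lhs = 1.0 / (2.0 * math.pi * alpha_prime)
    rhs = 2.0 * math.pi * L / ell_M ** 3
    return lhs, rhs

# Hypothetical sample values, chosen only to exercise the identity.
lhs, rhs = tension_relation(g_s=0.3, alpha_prime=1.7)
print(lhs, rhs)
```

The string coupling drops out of the right-hand side, so the two numbers agree for any positive $g_s$.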
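It should be noted that the above action is identical with the type IIA action derived in the previous subsection up to the sign of $\sigma$. Hereafter, we use the following representation of $\gamma^\mu = (\gamma^i, \gamma^8, \gamma^9)$,
\[
\gamma^i = \tilde\gamma^i \otimes \sigma_2 = \begin{pmatrix} 0 & -i\tilde\gamma^i \\ i\tilde\gamma^i & 0 \end{pmatrix} \qquad (i = 1, \ldots, 7), \tag{2.26}
\]
\[
\gamma^8 = 1_8 \otimes \sigma_1 = \begin{pmatrix} 0 & 1_8 \\ 1_8 & 0 \end{pmatrix}, \qquad
\gamma^9 = 1_8 \otimes \sigma_3 = \begin{pmatrix} 1_8 & 0 \\ 0 & -1_8 \end{pmatrix}, \tag{2.27}
\]
where the $\tilde\gamma^i$'s ($i = 1, \ldots, 7$) are SO(7) gamma matrices that obey the anticommutation relations
\[
\tilde\gamma^i \tilde\gamma^j + \tilde\gamma^j \tilde\gamma^i = 2\delta^{ij}. \tag{2.28}
\]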
The 16-component fermion $\psi$ is decomposed into two 8-component fermions $\Psi^1$ and $\Psi^2$ as
\[
\psi = \begin{pmatrix} \Psi^1 \\ \Psi^2 \end{pmatrix}.
\]
Moreover, we can decompose each 8-component fermion into two eigen-spinors of the matrix $R \equiv \tilde\gamma_{1234}$ as follows:
\[
\Psi^a = \frac{1+R}{2}\Psi^a + \frac{1-R}{2}\Psi^a \equiv \Psi^{a+} + \Psi^{a-} \qquad (a = 1, 2). \tag{2.29}
\]
By definition, the spinors $\Psi^{a\pm}$ satisfy
\[
R\,\Psi^{a\pm} = \pm\Psi^{a\pm}. \tag{2.30}
\]
That is, the $\Psi^{a\pm}$ are the eigen-spinors of $R$ with eigenvalues $\pm 1$, respectively. In terms of $\Psi^{a\pm}$, the fermionic Lagrangian can be rewritten as
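\[
\mathcal{L} = i\Psi^{1+T}\partial_-\Psi^{1+} + i\Psi^{1-T}\partial_-\Psi^{1-} + i\Psi^{2+T}\partial_+\Psi^{2+} + i\Psi^{2-T}\partial_+\Psi^{2-}
\]
\[
\qquad - i\frac{\mu}{3}\Psi^{1-T}\Pi^T\Psi^{2+} - i\frac{\mu}{6}\Psi^{1+T}\Pi^T\Psi^{2-} + i\frac{\mu}{6}\Psi^{2-T}\Pi\Psi^{1+} + i\frac{\mu}{3}\Psi^{2+T}\Pi\Psi^{1-}, \tag{2.31}
\]
where $\Pi \equiv \tilde\gamma_{123}$ and $\Pi^T \equiv \tilde\gamma_{321}$ satisfy $\Pi^T\Pi = \Pi\Pi^T = 1$.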
We can also consider the matrix string theories [35] on the pp-wave3 from the
supermembrane by the use of the method in the work [36].
3 Matrix strings are also discussed in Refs. [37,38] from different viewpoints from ours.
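Let us start with the supermembrane action (2.22), and rotate the variables into $x$'s as given by (2.23). The gamma matrices are also transformed by this rotation. The resulting supermembrane action is given by
\[
S = \frac{1}{\ell_M^3} \int d\tau \int_0^{2\pi} d\sigma \int_0^{2\pi L} d\rho\; \mathcal{L},
\]
\[
\mathcal{L} = \frac{1}{2}\left[ (D_\tau x^r)^2 - \frac{1}{2}\{x^r, x^s\}^2 - \left(\frac{\mu}{3}\right)^2 \sum_{I=1}^{3} (x^I)^2 - \left(\frac{\mu}{6}\right)^2 \sum_{I=6}^{9} (x^I)^2 - \frac{\mu}{3}\sum_{I,J,K=1}^{3} \epsilon_{IJK}\, x^K \{x^I, x^J\} - \frac{2}{3}\mu\, x^5 D_\tau x^4 \right]
\]
\[
\qquad + i\psi^T\gamma^r\{x^r, \psi\} + i\psi^T D_\tau\psi + i\frac{\mu}{4}\psi^T\left( \gamma_{123} + \frac{1}{3}\gamma_{54} \right)\psi, \tag{3.1}
\]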
where we have set w = (2π)−2 and rescaled σ as σ → (2π)2 σ . Now, the Lie bracket { , }
is simply defined by
Y −→ ρ + Y.
The Y is regarded as the compactified direction. As the result, the action is rewritten as
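\[
S = \frac{L}{\ell_M^3} \int d\tau \int_0^{2\pi} d\sigma \int_0^{2\pi} d\rho\; \mathcal{L},
\]
\[
\mathcal{L} = \frac{1}{2}\left[ F_{0,\sigma}^2 + (D_\tau x^i)^2 - (D_\sigma^Y x^i)^2 - \frac{1}{2L^2}\{x^i, x^j\}^2 - \left(\frac{\mu}{3}\right)^2 \sum_{I=1}^{3} (x^I)^2 - \left(\frac{\mu}{6}\right)^2 \sum_{I=5}^{8} (x^I)^2 - \frac{\mu}{3L}\sum_{I,J,K=1}^{3}\epsilon_{IJK}\, x^K\{x^I, x^J\} - \frac{2}{3}\mu\, x^4 F_{0,\sigma} \right]
\]
\[
\qquad + i\frac{1}{L}\psi^T\gamma^i\{x^i, \psi\} + i\psi^T D_\tau\psi - i\psi^T\gamma^9 D_\sigma^Y\psi + i\frac{\mu}{4}\psi^T\left( \gamma_{123} + \frac{1}{3}\gamma_{49} \right)\psi, \tag{3.2}
\]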
where we have reassigned the variables $x^4, x^5, \ldots, x^9$ as $x^9, x^4, x^5, \ldots, x^8$ and rescaled $\rho \to L\rho$. We have also introduced the following quantities,
\[
F_{0,\sigma} \equiv \partial_\tau Y - \partial_\sigma\omega - \frac{1}{L}\{\omega, Y\}, \qquad
D_\tau x^i \equiv \partial_\tau x^i - \frac{1}{L}\{\omega, x^i\}, \qquad
D_\sigma^Y x^i \equiv \partial_\sigma x^i - \frac{1}{L}\{Y, x^i\},
\]
where $A_0 \equiv \omega$ and $A_\sigma \equiv Y$. The inverse compactification radius $1/L$ plays the role of the gauge coupling constant. The action (3.2) does not look explicitly invariant under the area-preserving diffeomorphism, but it does have this symmetry under the transformation with an infinitesimal gauge parameter $\Lambda$
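\[
S = \frac{L}{\ell_M^3}\,\frac{2\pi}{N} \int d\tau \int_0^{2\pi} d\theta\; \mathcal{L},
\]
\[
\mathcal{L} = \mathrm{Tr}\,\frac{1}{2}\left[ F_{0,\theta}^2 + (D_\tau x^i)^2 - N^2 (D_\theta x^i)^2 + \frac{1}{2}\left(\frac{N}{2\pi L}\right)^2 [x^i, x^j]^2 - \left(\frac{\mu}{3}\right)^2\sum_{I=1}^{3}(x^I)^2 - \left(\frac{\mu}{6}\right)^2\sum_{I=5}^{8}(x^I)^2 \right.
\]
\[
\qquad\qquad \left. + i\frac{\mu}{3}\,\frac{N}{2\pi L}\sum_{I,J,K=1}^{3}\epsilon_{IJK}\, x^K [x^I, x^J] - \frac{2}{3}\mu\, x^4 F_{0,\theta} \right]
\]
\[
\qquad + \mathrm{Tr}\left[ \frac{N}{2\pi L}\psi^T\gamma^i [x^i, \psi] + i\psi^T D_\tau\psi - N i\psi^T\gamma^9 D_\theta\psi + i\frac{\mu}{4}\psi^T\left( \gamma_{123} + \frac{1}{3}\gamma_{49} \right)\psi \right], \tag{3.3}
\]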
where the quantities in the action are replaced with
N N
Dθ x i = ∂θ x i + i Y, x i .
2πL
If we rescale some constants as
\[
\tau \to \frac{\tau}{N}, \qquad \psi \to \sqrt{N}\,\psi, \qquad L \to \frac{L}{2\pi}, \qquad \mu \to N\mu,
\]
then we can rewrite Eq. (3.3) as
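\[
S = \frac{1}{\ell_M^3} \int d\tau \int_0^{2\pi} d\theta\; \mathcal{L},
\]
\[
\mathcal{L} = \mathrm{Tr}\,\frac{1}{2}\left[ \left( F_{0,\theta} - \frac{\mu}{3}x^4 \right)^2 + (D_\tau x^i)^2 - (D_\theta x^i)^2 + \frac{1}{2}[x^i, x^j]^2 - \left(\frac{\mu}{3}\right)^2\sum_{I=1}^{4}(x^I)^2 - \left(\frac{\mu}{6}\right)^2\sum_{I=5}^{8}(x^I)^2 + i\frac{2}{3}\mu\sum_{I,J,K=1}^{3}\epsilon_{IJK}\, x^I x^J x^K \right]
\]
\[
\qquad + \mathrm{Tr}\left[ \psi^T\gamma^i [x^i, \psi] + i\psi^T D_\tau\psi - i\psi^T\gamma^9 D_\theta\psi + i\frac{\mu}{4}\psi^T\left( \gamma_{123} + \frac{1}{3}\gamma_{49} \right)\psi \right], \tag{3.4}
\]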
where the field strength and covariant derivatives are given by
\[
F_{0,\theta} = \partial_\tau Y - \partial_\theta\omega + i[\omega, Y], \qquad
D_\tau x^i = \partial_\tau x^i + i[\omega, x^i], \qquad
D_\theta x^i = \partial_\theta x^i + i[Y, x^i].
\]
The action (3.4) includes the 3-point interaction and several mass terms, and the field strength of the gauge connection is shifted by $x^4$. Thus it might seem that the action (3.4) is not invariant under the area-preserving diffeomorphism. However, this matrix string action is actually invariant under the gauge transformation with a matrix parameter $\Lambda$,
\[
\delta x^i = -i[\Lambda, x^i], \qquad \delta\psi = -i[\Lambda, \psi].
\]
The $\tau$-scaling leads to the $N$-dependence of the physical light-cone time $X^+$ as
\[
X^+ = \frac{\ell_M^3 P_0^+ \tau}{N(2\pi L)^2}.
\]
We should also rescale $P_0^+$ as $P_0^+ \to N P_0^+$ so that $X^+$ is independent of $N$. The diagonal elements of the matrix $x^i$ describe fundamental string bits in the large-$N$ limit, and hence the total longitudinal momentum is proportional to the number $N$ of string bits.
It is easily observed that the above action (3.4) becomes the usual matrix string action in the flat limit $\mu \to 0$. Moreover, let us consider the IR region. There, the matrix variables are restricted to the Cartan subalgebra. That is, the matrices become diagonal, and so the terms including commutators vanish. Finally, integrating out the field strength $F_{0,\theta}$ as an auxiliary field, one finds that the above action reduces to the free type IIA string theory obtained in the previous section, as expected.
Also, we should remark that the above action of the matrix string is included in the
family of the work [37] where the action of the matrix string and supersymmetry have been
more generally investigated from the viewpoint of the mass deformation of the Yang–Mills
theory.
Finally, we comment on the classical solution. As in the BMN matrix model [8], for example, this matrix string theory has the static fuzzy sphere solution described by
\[
x^I = \frac{\mu}{3}J^I \quad (I = 1, 2, 3), \qquad x^4 = \cdots = x^8 = Y = \omega = 0, \tag{3.5}
\]
where the J I ’s are generators of an SU(2) algebra. The existence of the fuzzy sphere
solution might be physically expected from the presence of the constant flux of RR
3-form [20]. It would be possible to consider other classical solutions.
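The fuzzy sphere can be checked numerically on the static bosonic part of the action. The sketch below assumes that, for $x^4 = \cdots = x^8 = Y = \omega = 0$ and constant $x^I$, the Lagrangian (3.4) reduces to $\mathrm{Tr}\big[\tfrac14 [x^I,x^J]^2 - \tfrac12 (\mu/3)^2 (x^I)^2 + i(\mu/3)\epsilon_{IJK}x^Ix^Jx^K\big]$, which is the reading of (3.4) adopted here and should be treated as an assumption. It uses spin-1/2 generators $J^I = \sigma^I/2$ with $[J^I, J^J] = i\epsilon_{IJK}J^K$; with that sign convention the vanishing-energy configuration is $x^I = -(\mu/3)J^I$, which is the same as $x^I = +(\mu/3)J'^I$ for generators $J'^I = -J^I$ obeying the opposite-sign algebra. The overall sign in (3.5) is therefore convention dependent. All function names are hypothetical.

```python
# Static potential of the three massive SO(3) directions, 2x2 matrices only.

def mm(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)] for i in range(2)]

def comm(a, b):
    ab, ba = mm(a, b), mm(b, a)
    return [[ab[i][j] - ba[i][j] for j in range(2)] for i in range(2)]

def tr(a):
    return a[0][0] + a[1][1]

def eps(i, j, k):
    """Levi-Civita symbol for indices in {0, 1, 2}."""
    return (i - j) * (j - k) * (k - i) // 2

PAULI = [[[0, 1], [1, 0]], [[0, -1j], [1j, 0]], [[1, 0], [0, -1]]]
J = [[[0.5 * z for z in row] for row in s] for s in PAULI]  # spin-1/2 generators

def static_potential(x, mu):
    """V = -L for static x^I, under the assumed reduction of (3.4)."""
    a = mu / 3.0
    idx = range(3)
    quartic = sum(tr(mm(comm(x[i], x[j]), comm(x[i], x[j]))) for i in idx for j in idx)
    mass = sum(tr(mm(x[i], x[i])) for i in idx)
    cubic = sum(eps(i, j, k) * tr(mm(mm(x[i], x[j]), x[k]))
                for i in idx for j in idx for k in idx)
    lagrangian = 0.25 * quartic - 0.5 * a * a * mass + 1j * a * cubic
    return -complex(lagrangian).real

def fuzzy_sphere(mu, sign):
    """x^I = sign * (mu/3) * J^I."""
    return [[[sign * (mu / 3.0) * z for z in row] for row in j] for j in J]

print(static_potential(fuzzy_sphere(3.0, -1), 3.0))  # vanishes up to rounding
print(static_potential(fuzzy_sphere(3.0, +1), 3.0))  # strictly positive
```

In this reading the potential is a perfect square, $V = -\tfrac14 \mathrm{Tr}\big([x^I,x^J] + i(\mu/3)\epsilon_{IJK}x^K\big)^2 \ge 0$, so the fuzzy sphere is degenerate in energy with the trivial vacuum $x^I = 0$.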
In this section we will consider the mode-expansions and quantization of closed and
open strings in the type IIA on the pp-wave. In particular, we investigate D-branes living
in the theory.
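\[
\partial_+\partial_- x^a + \frac{\mu^2}{9}x^a = 0 \qquad (a = 1, 2, 3, 4), \tag{4.1}
\]
\[
\partial_+\partial_- x^b + \frac{\mu^2}{36}x^b = 0 \qquad (b = 5, 6, 7, 8), \tag{4.2}
\]
\[
\partial_+\Psi^{2+} + \frac{\mu}{3}\Pi\Psi^{1-} = 0, \tag{4.3}
\]
\[
\partial_-\Psi^{1-} - \frac{\mu}{3}\Pi^T\Psi^{2+} = 0, \tag{4.4}
\]
\[
\partial_+\Psi^{2-} + \frac{\mu}{6}\Pi\Psi^{1+} = 0, \tag{4.5}
\]
\[
\partial_-\Psi^{1+} - \frac{\mu}{6}\Pi^T\Psi^{2-} = 0. \tag{4.6}
\]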
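The mode-expansions of bosonic variables are described by
\[
x^a(\tau, \sigma) = x_0^a \cos\left(\frac{\mu}{3}\tau\right) + \frac{3}{\mu}\,\alpha' p_0^a \sin\left(\frac{\mu}{3}\tau\right) + i\sqrt{\frac{\alpha'}{2}} \sum_{n \neq 0} \frac{1}{\omega_n^B}\left( \alpha_n^a \phi_n^B + \bar\alpha_n^a \tilde\phi_n^B \right) \qquad (a = 1, 2, 3, 4),
\]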
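\[
x^b(\tau, \sigma) = x_0^b \cos\left(\frac{\mu}{6}\tau\right) + \frac{6}{\mu}\,\alpha' p_0^b \sin\left(\frac{\mu}{6}\tau\right) + i\sqrt{\frac{\alpha'}{2}} \sum_{n \neq 0} \frac{1}{\omega_n^{B'}}\left( \alpha_n^b \phi_n^{B'} + \bar\alpha_n^b \tilde\phi_n^{B'} \right) \qquad (b = 5, 6, 7, 8).
\]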
In this subsection we shall discuss the mode-expansions of open strings in the type IIA
string by imposing boundary conditions. In particular, we would like to consider D-branes,
following Ref. [30]. (For more detailed studies, see Refs. [31,32].) It has been shown in
Ref. [30] that Dp-brane is not allowed for p = 1, 9 and there are some restrictions on
directions of allowed D-branes. First we consider the open string action described by
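\[
S_{\rm st} = \frac{1}{2\pi} \int d\tau \int_0^{\pi} d\sigma\; \mathcal{L}_{\rm st}, \tag{4.14}
\]
\[
\mathcal{L}_{\rm st} = \frac{1}{2\alpha'}\left[ \sum_{i=1}^{8} \partial_+ x^i \partial_- x^i - \left(\frac{\mu}{3}\right)^2\sum_{a=1}^{4}(x^a)^2 - \left(\frac{\mu}{6}\right)^2\sum_{b=5}^{8}(x^b)^2 \right]
\]
\[
\qquad + i\Psi^{1+T}\partial_-\Psi^{1+} + i\Psi^{1-T}\partial_-\Psi^{1-} + i\Psi^{2+T}\partial_+\Psi^{2+} + i\Psi^{2-T}\partial_+\Psi^{2-}
\]
\[
\qquad - i\frac{\mu}{3}\Psi^{1-T}\Pi^T\Psi^{2+} - i\frac{\mu}{6}\Psi^{1+T}\Pi^T\Psi^{2-} + i\frac{\mu}{6}\Psi^{2-T}\Pi\Psi^{1+} + i\frac{\mu}{3}\Psi^{2+T}\Pi\Psi^{1-}. \tag{4.15}
\]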
Similarly, we obtain equations of motion (4.1)–(4.6) from the above action (4.14). In
order to solve the above equations of motion we have to impose the following boundary
conditions on bosonic coordinates x i ’s (i = 1, 2, . . . , 8),
The $\Omega$ includes an odd number of gamma matrices, since the SO(8) chiralities of $\Psi^1$ and $\Psi^2$ must be opposite in the type IIA theory, and hence $p$ is restricted to be even.
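Under these boundary conditions we can obtain classical solutions of the equations of motion, and the mode-expansions of bosonic variables are given by
\[
x^a(\tau, \sigma) = x_0^a \cos\left(\frac{\mu}{3}\tau\right) + \frac{3}{\mu}\, 2\alpha' p_0^a \sin\left(\frac{\mu}{3}\tau\right) + i\sqrt{2\alpha'} \sum_{n \neq 0} \frac{1}{\omega_n^B}\, \alpha_n^a\, e^{-i\omega_n^B \tau} \cos(n\sigma) \quad \text{(Neumann)}, \tag{4.16}
\]
\[
x^a(\tau, \sigma) = \sqrt{2\alpha'} \sum_{n \neq 0} \frac{1}{\omega_n^B}\, \alpha_n^a\, e^{-i\omega_n^B \tau} \sin(n\sigma) \quad \text{(Dirichlet)}, \tag{4.17}
\]
for $a = 1, 2, 3, 4$, and
\[
x^b(\tau, \sigma) = x_0^b \cos\left(\frac{\mu}{6}\tau\right) + \frac{6}{\mu}\, 2\alpha' p_0^b \sin\left(\frac{\mu}{6}\tau\right) + i\sqrt{2\alpha'} \sum_{n \neq 0} \frac{1}{\omega_n^{B'}}\, \alpha_n^b\, e^{-i\omega_n^{B'} \tau} \cos(n\sigma) \quad \text{(Neumann)}, \tag{4.18}
\]
\[
x^b(\tau, \sigma) = \sqrt{2\alpha'} \sum_{n \neq 0} \frac{1}{\omega_n^{B'}}\, \alpha_n^b\, e^{-i\omega_n^{B'} \tau} \sin(n\sigma) \quad \text{(Dirichlet)}, \tag{4.19}
\]
for $b = 5, 6, 7, 8$, where $\omega_n^B$ and $\omega_n^{B'}$ have been defined by (4.11). The mode-expansions of fermionic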
variables are the same as in the closed string case. The quantization can be done in the
same way as closed strings. The commutation relations of bosonic and fermionic modes
are the same as (4.12) and (4.13). The quantum Hamiltonian and spectrum can be also
studied with the standard procedure but we will not investigate them furthermore here.
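As a quick consistency check of the open-string modes, one can verify numerically that a single Neumann mode solves the bosonic equation of motion. The sketch below rests on two assumptions that are not spelled out in this part of the text: the light-cone derivatives are taken as $\partial_\pm = \partial_\tau \pm \partial_\sigma$, so that $\partial_+\partial_- = \partial_\tau^2 - \partial_\sigma^2$ (this matches the kinetic terms of (2.25) and (4.15)), and the frequency is taken as $\omega_n = \mathrm{sgn}(n)\sqrt{n^2 + m^2}$ with $m = \mu/3$ or $\mu/6$, since Eq. (4.11) itself is not reproduced here. Function names are hypothetical.

```python
import cmath
import math

def omega(n, mass):
    """Assumed dispersion relation: sign(n) * sqrt(n^2 + mass^2)."""
    return math.copysign(math.sqrt(n * n + mass * mass), n)

def neumann_mode(n, mass, tau, sigma):
    """Single oscillator mode exp(-i omega_n tau) cos(n sigma)."""
    return cmath.exp(-1j * omega(n, mass) * tau) * math.cos(n * sigma)

def eom_residual(f, mass, tau, sigma, h=1e-4):
    """Finite-difference value of (d_tau^2 - d_sigma^2 + mass^2) f at a point."""
    d2_tau = (f(tau + h, sigma) - 2 * f(tau, sigma) + f(tau - h, sigma)) / h ** 2
    d2_sigma = (f(tau, sigma + h) - 2 * f(tau, sigma) + f(tau, sigma - h)) / h ** 2
    return d2_tau - d2_sigma + mass ** 2 * f(tau, sigma)

mu = 1.5                      # illustrative value of the mass parameter
for mass in (mu / 3.0, mu / 6.0):
    mode = lambda t, s, m=mass: neumann_mode(2, m, t, s)
    print(mass, abs(eom_residual(mode, mass, 0.7, 1.1)))
```

The residual is limited only by the finite-difference step; the Dirichlet modes work the same way with $\cos(n\sigma)$ replaced by $\sin(n\sigma)$.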
Next we will study D-branes. Though the mode-expansions of fermionic variables are
the same as in the closed string case, in the open string case fermionic boundary conditions
lead to further constraints
Π
Ψ0 = Ω Ψ
0 , Ω 0 = −ΠΨ
T Ψ 0, n = ΩΨ
Ψ n (n = 0),
Ψ0 Π
=Ω Ψ
0 , ΩT 0
Ψ 0 ,
= −ΠΨ n
Ψ n
= ΩΨ (n = 0).
Π
Ω Ω
Π = −1. (4.20)
This condition is peculiar to the pp-wave, and gives an additional constraint for the Dp-
branes in the theory. In fact, in the massive type IIB theory, D1- and D9-branes are
forbidden and D3-, D5- and D7-branes can exist but those directions are limited. In the
massive type IIA theory similar restrictions are imposed.
We shall list the possible Dp-branes below:
We have considered the type IIA string theory on the pp-wave background from the
eleven-dimensional viewpoint. To begin, we have discussed the type IIA pp-wave solution
through the toroidal compactification of the maximally supersymmetric pp-wave solution
in eleven dimensions on a spatial isometry direction. Next, we have derived the action of the
type IIA string theory from the type IIA pp-wave solution of the supergravity. Moreover,
we have derived the type IIA string action from the eleven-dimensional supermembrane
theory on the maximally supersymmetric pp-wave background by applying the double-
dimensional reduction for a spatial isometry direction. The resulting action agrees with
the one obtained from the supergravity side. In particular, the Kaluza–Klein gauge field
induces a mass term of a bosonic coordinate in the type IIA theory. Furthermore, we have
written down the action of the matrix string on the pp-wave. This action contains the
3-point interaction and mass terms. Also, the field strength of the gauge connection is
shifted. However, this action is still gauge invariant, though this theory is not maximally
supersymmetric. In particular, this theory is reduced to the matrix string theory in the flat
space by taking the limit µ → 0. We have also discussed the quantization of closed and
open strings in the type IIA string. In particular, the allowed Dp-branes in this theory have
been investigated. The values p = 2, 4, 6 and 8 are allowed but the directions of D-branes
are constrained.
We can also consider compactifications along other isometry directions. In such cases the number of the remaining supercharges is less than 24. Furthermore, it would be worthwhile to study the type IIA pp-wave background preserving 26 supercharges [27], or type IIA string theory on such a background, from the eleven-dimensional supermembrane. It would also be interesting to discuss less supersymmetric type IIA string theories from the supermembrane. Moreover, the supersymmetric D-branes in such type IIA string theories are a very interesting subject to study.
It would be worthwhile to study the matrix string theory written down here from several aspects. In particular, it would be interesting to study the relation between the matrix string theory on the pp-wave and the “string bit” [39].
Acknowledgement
The work of K.S. is supported in part by the Grant-in-Aid from the Ministry of
Education, Science, Sports and Culture of Japan (F 14740115).
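\[
ds^2 = -2\, dx^+ dx^- + G_{++}(x^3, x^I)\,(dx^+)^2 + \sum_{r=1}^{9} (dx^r)^2 - \frac{4}{3}\mu\, x^2\, dx^+ dx^1,
\]
\[
G_{++}(x^3, x^I) \equiv -\left[ \left(\frac{\mu}{3}\right)^2 (x^3)^2 + \left(\frac{\mu}{6}\right)^2 \sum_{I=4}^{9} (x^I)^2 \right], \tag{A.2}
\]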
and the constant 4-form flux is still written in Eq. (2.2). In this case the x 1 -direction is a
manifest spatial isometry direction. In the same way, we can obtain the type IIA solution
from the above expression. The ten-dimensional metric gµν is given by
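\[
g_{\mu\nu}\, dx^\mu dx^\nu = -2\, dx^+ dx^- + g_{++}(x^i)\,(dx^+)^2 + \sum_{i=1}^{8} (dx^i)^2,
\]
\[
g_{++}(x^i) \equiv -\left[ \left(\frac{2}{3}\mu\right)^2 (x^1)^2 + \left(\frac{\mu}{3}\right)^2 (x^2)^2 + \left(\frac{\mu}{6}\right)^2 \sum_{a=3}^{8} (x^a)^2 \right], \tag{A.3}
\]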
where the mass term for x 1 is induced from the Kaluza–Klein gauge field Aµ as the case of
the compactification on an SO(6)-direction. It is also an easy exercise to derive the above
action (A.6) by using the double-dimensional reduction.
Next, we shall consider the fermionic sector. In the supergravity analysis, the field strength of the RR 3-form is zero, but the NS–NS 2-form is non-zero and has a constant field strength proportional to $\mu$. This contribution induces the fermion mass term. However, there may be an issue with the numerical constant: the fermion mass term obtained in the supergravity analysis is not identical with the one derived via double-dimensional reduction if we naively use the expression of the covariant derivative given in the text.
References
[1] J. Kowalski-Glikman, Vacuum states in supersymmetric Kaluza–Klein theory, Phys. Lett. B 134 (1984) 194.
[2] J. Figueroa-O’Farrill, G. Papadopoulos, Homogeneous fluxes, branes and a maximally supersymmetric
solution of M-theory, JHEP 0108 (2001) 036, hep-th/0105308.
[3] R. Penrose, Any spacetime has a plane wave as a limit, in: Differential Geometry and Relativity, Reidel,
Dordrecht, 1976, pp. 271–275;
R. Güven, Plane wave limits and T-duality, Phys. Lett. B 482 (2000) 255, hep-th/0005061.
[4] M. Blau, J. Figueroa-O’Farrill, C. Hull, G. Papadopoulos, A new maximally supersymmetric background of IIB superstring theory, JHEP 0201 (2002) 047, hep-th/0110242;
M. Blau, J. Figueroa-O’Farrill, C. Hull, G. Papadopoulos, Penrose limits and maximal supersymmetry, hep-th/0201081.
[5] R.R. Metsaev, Type IIB Green–Schwarz superstring in plane wave Ramond–Ramond background, Nucl.
Phys. B 625 (2002) 70, hep-th/0112044.
[6] R.R. Metsaev, A.A. Tseytlin, Exactly solvable model of superstring in plane wave Ramond–Ramond
background, Phys. Rev. D 65 (2002) 126004, hep-th/0202109.
[7] J.G. Russo, A.A. Tseytlin, On solvable models of type IIB superstring in NS–NS and RR plane wave
backgrounds, JHEP 0204 (2002) 021, hep-th/0202179.
[8] D. Berenstein, J. Maldacena, H. Nastase, Strings in flat space and pp waves from N = 4 super-Yang–Mills,
JHEP 0204 (2002) 013, hep-th/0202021.
[9] E. Bergshoeff, E. Sezgin, P. Townsend, Supermembranes and eleven-dimensional supergravity, Phys. Lett. B 189 (1987) 75.
[10] E. Bergshoeff, E. Sezgin, P. Townsend, Properties of the eleven-dimensional supermembrane theory, Ann.
Phys. 185 (1988) 330.
[11] B. de Wit, J. Hoppe, H. Nicolai, On the quantum mechanics of supermembranes, Nucl. Phys. B 305 (1988)
545.
[12] T. Banks, W. Fischler, S.H. Shenker, L. Susskind, M-theory as a matrix model: conjecture, Phys. Rev. D 55
(1997) 5112, hep-th/9610043.
[13] K. Dasgupta, M.M. Sheikh-Jabbari, M. Van Raamsdonk, Matrix perturbation theory for M-theory on a pp-wave, JHEP 0205 (2002) 056, hep-th/0205185.
[14] K. Sugiyama, K. Yoshida, Supermembrane on the pp-wave background, hep-th/0206070, Nucl. Phys. B,
in press.
[15] K. Sugiyama, K. Yoshida, BPS conditions of supermembrane on the pp-wave, hep-th/0206132, Phys.
Lett. B, in press.
[16] T. Banks, N. Seiberg, S. Shenker, Branes from matrices, Nucl. Phys. B 490 (1997) 91, hep-th/9612157.
[17] N. Kim, J. Plefka, On the spectrum of pp-wave matrix theory, hep-th/0207034;
K. Dasgupta, M.M. Sheikh-Jabbari, M. Van Raamsdonk, Protected multiplets of M-theory on a plane wave,
hep-th/0207050;
N. Kim, J.-H. Park, Superalgebra for M-theory on a pp-wave, hep-th/0207061.
Received 6 August 2002; received in revised form 6 September 2002; accepted 10 September 2002
Abstract
We study one-loop effective action of Berkooz–Douglas matrix theory and obtain non-Abelian
action of D0-branes in the longitudinal 5-brane background. In this paper, we extend the analysis
of hep-th/0201248 and calculate the part of the effective action containing fermions. We show that
the effective action is manifestly invariant under the loop-corrected SUSY transformation, and give
the explicit transformation laws. The effective action consists of blocks which are closed under the
SUSY, and it includes the supersymmetric completion of the couplings to the longitudinal 5-branes
proposed by Taylor and Van Raamsdonk as a subset.
2002 Elsevier Science B.V. All rights reserved.
1. Introduction
The fact that multiple D-branes are described by the matrix valued coordinates led
to many surprising effects. Especially, Dp-branes are allowed to couple to Ramond–
Ramond forms of higher degree than p + 1 [1,2]. Due to those couplings, non-commutative
configurations are stabilized in the presence of certain background fields, which are
interpreted as the D-branes having finite extent [2]. To study such intrinsically stringy
effects, precise understanding of the effective action of multiple D-branes in background
fields is needed.
It is well known that a single D-brane in the low acceleration limit is described by the Born–Infeld action, which includes all the α′ corrections. However, it is not clear how to generalize it to the case of multiple D-branes, which have non-Abelian gauge symmetries.
0550-3213/02/$ – see front matter 2002 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 ( 0 2 ) 0 0 8 2 4 - 6
152 M. Asano, Y. Sekino / Nuclear Physics B 644 (2002) 151–169
In the flat background, the effective action of N coincident D-branes is given by the maximally supersymmetric U(N) Yang–Mills theory in the small α′ limit, but the terms of higher orders in α′ are not well understood.
Supersymmetrization of the multiple D-brane action including these higher order terms is also a hard problem. In the Abelian case, there exists a κ-symmetric formulation [3,4] which gives a supersymmetric action after gauge fixing the world-volume diffeomorphism and the κ-symmetry [4], but an attempt to define κ-symmetry with a non-Abelian parameter [5] does not seem successful [6]. Studies aimed at obtaining the α′ corrections to the SYM at the first few orders, including the fermionic part, are being carried out from various standpoints (see the references cited below for the current status): (i) from the calculation of open string disk amplitudes [6]; (ii) by constructing the invariants with respect to the α′-corrected SUSY transformation [7,8]; (iii) by requiring the existence of certain BPS states which are present in string theory [9].
When we consider multiple D-branes in non-trivial backgrounds, determining the action is even more difficult. Background fields should somehow be regarded as functions of matrix fields, but the principles for doing so are not clear. As for the supersymmetrization, there is practically no knowledge up to now.
Effective action of matrix theory [10] gives insight for the multiple D-brane action.
Taylor and Van Raamsdonk studied the effective action of BFSS matrix theory in detail
and found the terms which can be interpreted as the supergravity interactions. From those
terms, they read off the coupling of D-branes to weak background fields, and further
proposed a form of the couplings to general weak background fields [1,11–13]. Those
couplings were applied to various contexts, including the gauge-theory calculation of
the absorption cross section of dilaton higher partial waves by D3-branes, which exactly
reproduces the semiclassical supergravity results [14].
The subject of our paper is matrix theory proposed by Berkooz and Douglas [15], which
is the matrix model for M-theory in the presence of longitudinal (L) 5-branes. Berkooz–
Douglas (BD) matrix theory is defined by the 0–4 string and 0–0 string sectors of the SYM
describing D0–D4 system. In a previous paper [16], we performed one-loop integration
of 0–4 fields and obtained the bosonic part of the non-Abelian action of D0-branes. We
found that the action consists of the terms given from the general proposal of Taylor and
Van Raamsdonk by substituting the L5-brane backgrounds, plus the corrections involving
extra commutators. Since L5-branes are degrees of freedom which are not present in BFSS
matrix theory, the fact that the proposed couplings were exactly reproduced is regarded as
a non-trivial evidence for the consistency of the BFSS and BD matrix models.
In this paper, extending the analysis of Ref. [16], we calculate the one-loop effective
action of BD matrix theory and obtain non-Abelian effective action of D0-branes
including the part containing fermionic fields. Especially, we reveal the consequence of
the supersymmetry of the classical action. We point out that the one-loop effective action
is manifestly invariant under effective SUSY transformation. The transformation law is
given simply by the one-loop expectation value of the classical SUSY transformation law.
By examining the transformation rules, we decompose the terms into the blocks which
close within themselves under the transformation. Among these blocks, we identify the
supersymmetric completion of the bosonic terms given by the Taylor and Van Raamsdonk’s
proposal applied to the longitudinal 5-brane background.
This paper is organized as follows. In Section 2, we review the action and SUSY
transformation of BD matrix theory. In Section 3, after explaining the method of the
loop calculation, we present the results for the fermionic terms of the effective action.
In Section 4, we discuss the SUSY of the effective action. We conclude with remarks on
the directions for future studies in Section 5. We summarize the representation of spinors
and gamma matrices adopted in this paper in Appendix A, and derive the invariance of
one-loop effective action under the loop-corrected SUSY transformation in Appendix B.
The action of Berkooz–Douglas matrix theory is given by the 0–0 and 0–4 string sectors
of the SYM which describe the D0–D4 bound state. In the case of N D0-branes and N4
D4-branes, the theory has U(N) gauge symmetry and U(N4 ) global symmetry (which is
not gauged, for the gauge fields on D4-branes are discarded). The action is based on the
D = 6 N = 1 SYM and reads as follows.
S = S0 + S5 , (2.1)
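\[
S_0 = \frac{1}{g_s\ell_s}\int dt\; \mathrm{Tr}\Big( \frac{1}{2}D_0X_i D_0X_i + \frac{1}{4\lambda^2}[X_i, X_j]^2 - i\theta_-^\dagger D_0\theta_- - i\theta_+^\dagger D_0\theta_+
+ \frac{1}{\lambda}\theta_-^\dagger\gamma^0\gamma^a[\theta_-, X_a] + \frac{1}{\lambda}\theta_+^\dagger\gamma^0\gamma^a[\theta_+, X_a]
\]
\[
\qquad + \frac{1}{\lambda}\theta_+^\dagger\gamma^0[\theta_-, \phi_2] - \frac{1}{\lambda}\theta_-^\dagger\gamma^0[\theta_+, \bar\phi_2]
+ \frac{1}{\lambda}\theta_+ C[\theta_-, \phi_1] + \frac{1}{\lambda}\theta_+^\dagger C^*[\theta_-^\dagger, \bar\phi_1] \Big), \tag{2.2}
\]
\[
S_5 = \frac{1}{g_s\ell_s}\int dt\; \Big( (D_0v_I)^\dagger D_0v_I - \frac{1}{\lambda^2}v_I^\dagger (X_a)^2 v_I - i\chi^\dagger D_0\chi - \frac{1}{\lambda}\chi^\dagger\gamma^0\gamma^a X_a\chi
\]
\[
\qquad - \frac{1}{2\lambda^2}\big( v_1^\dagger([\phi_1, \bar\phi_1] + [\phi_2, \bar\phi_2])v_1 - v_2^\dagger([\phi_1, \bar\phi_1] + [\phi_2, \bar\phi_2])v_2 - 2v_2^\dagger[\bar\phi_1, \bar\phi_2]v_1 + 2v_1^\dagger[\phi_1, \phi_2]v_2 \big)
\]
\[
\qquad - \frac{\sqrt{2}}{\lambda}\big( \chi^\dagger\gamma^0\theta_- v_1 - \chi^\dagger C^*\theta_-^\dagger v_2 - v_1^\dagger\theta_-^\dagger\gamma^0\chi + v_2^\dagger\theta_- C\chi \big)
\]
\[
\qquad - \frac{1}{2\lambda^2}\big( (v_1^\dagger v_1)(v_1^\dagger v_1) + (v_2^\dagger v_2)(v_2^\dagger v_2) - 2(v_1^\dagger v_2)(v_2^\dagger v_1) + 4(v_1^\dagger v_1)(v_2^\dagger v_2) \big) \Big), \tag{2.3}
\]
where we use the indices $a, b = 5, \ldots, 9$ for the space transverse to the D4-branes, $m, n = 1, \ldots, 4$ for the space along the D4-branes, and $i, j = 1, \ldots, 9$ to denote both of them. The dimensionful parameter $\lambda$ is defined by $\lambda = 2\pi\ell_s^2$.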
The part S0 is the terms containing only the 0–0 sector fields (Xa , φ1 , φ2 , θ− , θ+ ),
which is nothing but the BFSS action, i.e., the dimensional reduction of D = 10, N = 1
SYM. The 0–0 sector fields are in the adjoint rep. of U(N) and are singlets of U(N4 ).
Covariant derivatives for those fields are defined as D0 Xi = ∂0 Xi + i[A, Xi ]. Note that we
have defined the complex combination of Xm by (φ1 , φ2 ) = (X1 + iX2 , X3 + iX4 ), and
that the fermions are expressed as 6D Weyl spinors, which have 4 complex components.
As in the previous paper [16], we do not use the SU(2) Majorana convention, for we
prefer unconstrained fermions for the loop calculations. The subscripts on the spinors
θ+ , θ− denote the positive and negative 6D chiralities, respectively (γ̄ θ± = ±θ± , where
γ̄ = γ 0 γ 1 · · · γ 5 ). The matrix C is the charge conjugation matrix. We also use the ‘complex
conjugation matrix’ B. See Appendix A for our conventions for spinors and gamma
matrices, the relation between 10D and 6D notations, and the definitions of C and B.
We remind the reader that in this paper the transpose of the spinor indices is not indicated explicitly. For example, $\theta_+ C[\theta_-, \phi_1]$ in Eq. (2.2) means $\theta_+^t C[\theta_-, \phi_1]$, where $t$ denotes the transpose of spinor indices (but not of gauge indices).
The additional part S5 contains the 0–4 sector fields (vI , χ). These fields are given by
the hypermultiplets of the 6D theory which consist of 2 complex bosons vI (I = 1, 2)
and a complex spinor χ with positive 6D chirality (γ̄ χ = +χ ). Both of them are in
the bi-fundamental rep. of U(N) × U(N4 ). Covariant derivatives are defined as D0 vI =
∂0 vI + iAvI . We remark here that only half of the 0–0 sector fermions (θ− ) couple to the
0–4 sector fields.
This model has half the amount of supersymmetry as the BFSS model. The SUSY
parameter η is a complex 6D spinor with negative chirality, thus the number of (real)
supercharges is 8. The SUSY transformation law is given as follows. The 0–0 sector fields
transform as
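\[
\delta A = \delta^{(0)}A = \frac{i}{\lambda}\left( \eta^\dagger\theta_- - \theta_-^\dagger\eta \right), \qquad
\delta X_a = \delta^{(0)}X_a = i\left( \eta^\dagger\gamma^0\gamma^a\theta_- - \theta_-^\dagger\gamma^0\gamma^a\eta \right),
\]
\[
\delta\phi_1 = \delta^{(0)}\phi_1 = -2i\eta^\dagger C^*\theta_+^\dagger, \qquad
\delta\phi_2 = \delta^{(0)}\phi_2 = -2i\eta^\dagger\gamma^0\theta_+,
\]
\[
\delta\theta_+ = \delta^{(0)}\theta_+ = D_0\bar\phi_1 C^*\eta^\dagger + D_0\phi_2\gamma^0\eta + \frac{i}{\lambda}[X_a, \bar\phi_1]B^*\gamma^{a*}\eta^\dagger + \frac{i}{\lambda}[X_a, \phi_2]\gamma^a\eta,
\]
\[
\delta\theta_- = \delta^{(0)}\theta_- + \delta'\theta_- \tag{2.4}
\]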
where
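\[
\delta^{(0)}\theta_- = D_0X_a\gamma^0\gamma^a\eta + \frac{i}{2\lambda}[X_a, X_b]\gamma^{ab}\eta - \frac{i}{2\lambda}\left( [\phi_1, \bar\phi_1] + [\phi_2, \bar\phi_2] \right)\eta + \frac{i}{\lambda}[\bar\phi_1, \bar\phi_2]B^*\eta^\dagger, \tag{2.5}
\]
\[
\delta'\theta_- = \frac{i}{\lambda}\left( -v_1v_1^\dagger + v_2v_2^\dagger \right)\eta - \frac{2i}{\lambda}v_2v_1^\dagger B^*\eta^\dagger. \tag{2.6}
\]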
λ λ
We have denoted by $\delta^{(0)}$ the part of the transformation laws which does not contain 0–4 sector fields on the RHS. It is the same as the transformation law for the BFSS theory, except for the fact that the parameter is now restricted to $\bar\gamma\eta = -\eta$. Only $\delta\theta_-$ has the extra contribution $\delta'\theta_-$ containing 0–4 sector fields. The transformation law for the 0–4 sector fields is as follows
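\[
\delta v_1 = i\sqrt{2}\,\eta^\dagger\gamma^0\chi, \qquad \delta v_2 = -i\sqrt{2}\,\eta C\chi,
\]
\[
\delta\chi = -\sqrt{2}\left( D_0v_2\gamma^0B^*\eta^\dagger + D_0v_1\gamma^0\eta \right) - \frac{\sqrt{2}\,i}{\lambda}X_a\gamma^a\left( v_2B^*\eta^\dagger + v_1\eta \right). \tag{2.7}
\]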
where A, B are the U(N) indices, A, B are the U(N4 ) indices and α, β, . . . (= 1, . . . , 8)
are the 6D spinor indices. We have also defined r = r/λ and /r = γ̃ a ra /λ.
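Boson–boson vertices are
\[
S_{\rm bos}^{\rm (int)} = \frac{1}{g_s\ell_s}\int d\tau\; \frac{1}{\lambda^2}v_I^\dagger V_{IJ}(\tau)v_J, \tag{3.7}
\]
where
\[
V_{11} = 2r_a\tilde X_a + \tilde X_a^2 + \tilde X_0^2 - i\lambda\partial_\tau\tilde X_0 - 2i\lambda\tilde X_0\partial_\tau + \frac{1}{2}\left( [\phi_1, \bar\phi_1] + [\phi_2, \bar\phi_2] \right),
\]
\[
V_{22} = 2r_a\tilde X_a + \tilde X_a^2 + \tilde X_0^2 - i\lambda\partial_\tau\tilde X_0 - 2i\lambda\tilde X_0\partial_\tau - \frac{1}{2}\left( [\phi_1, \bar\phi_1] + [\phi_2, \bar\phi_2] \right),
\]
\[
V_{12} = [\phi_1, \phi_2], \qquad V_{21} = -[\bar\phi_1, \bar\phi_2]. \tag{3.8}
\]
Fermion–fermion vertices are
\[
S_{\rm fermi}^{\rm (int)} = \frac{1}{g_s\ell_s}\int d\tau\; \frac{1}{\lambda}\chi^\dagger\left( \tilde\gamma^a\tilde X_a - i\tilde X_0 \right)\chi. \tag{3.9}
\]
Boson–fermion mixed vertices are
\[
S_{\rm mix}^{\rm (int)} = \frac{1}{g_s\ell_s}\int d\tau\; \frac{\sqrt{2}}{\lambda}\left( \chi_\alpha^\dagger L_{I\alpha}v_I + v_I^\dagger L_{I\alpha}^\dagger\chi_\alpha \right), \tag{3.10}
\]
where
\[
L_{1\alpha} = \left(\gamma^0\theta_-\right)_\alpha, \qquad L_{1\alpha}^\dagger = -\left(\theta_-^\dagger\gamma^0\right)_\alpha, \qquad
L_{2\alpha} = -\left(C^*\theta_-^\dagger\right)_\alpha, \qquad L_{2\alpha}^\dagger = \left(\theta_-C\right)_\alpha. \tag{3.11}
\]
The quadratic part of the action is given by $S^{\rm (quad)} = S^{\rm (free)} + S_{\rm bos}^{\rm (int)} + S_{\rm fermi}^{\rm (int)} + S_{\rm mix}^{\rm (int)}$, and the effective action is derived from the general expression (3.2). Note that $K_{\rm bos}$ (or $K_{\rm fermi}$) is read from (3.4) and (3.7) (or from (3.4) and (3.9)). Also, $K_{\rm mix}$ is read from (3.10).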
We treat the vertices as an expansion around a reference time τ and obtain the effective
action as a double expansion in the number of vertices and the number of derivatives,
following the procedure described in Ref. [16]. Explicit form of the two-fermion (two L)
terms of the effective action is given as1
∞
∞
n+m+2
1 2
Γθ 2 = (−1)(n+m) m+2n+2
Di ! λ
m,n=0 Di =0 i=2
) (Dn+1 )
(D ) (D ) †(D
× dτ Tr VI1 ,I2 VI2 ,I23 · · · VIn ,Inn+1 LIn+1n+m+1 Xa1 a(D
···X n+m ) (Dm+n+2 )
LI1 ,β
,α m
dk 1 1 Dn+m+1 / r + ik a1
× (i∂k ) D2
· · · (i∂k ) γ̃ (i∂k )Dn+1
2π k 2 + r 2 k2 + r 2 k2 + r 2
/r + ik 1
× · · · γ̃ am (i∂k )Dn+m 2 (i∂k ) Dn+m+2
. (3.12)
k + r 2 k2 + r 2 αβ
1 See Ref. [16] for the terms which do not contain fermions and for more details on the derivation.
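$$ \Gamma^{(d=0)}_{\theta^2} = \sum_{m,n=0}^{\infty} (-1)^{(n+m)} \frac{2}{\lambda^{m+2n+2}} \int d\tau\, \mathrm{Tr}\left( V_{I_1,I_2} \cdots V_{I_n,I_{n+1}} L^\dagger_{I_{n+1},\alpha} X_{a_1} \cdots X_{a_m} L_{I_1,\beta} \right) $$
$$ \qquad \times \int \frac{dk}{2\pi}\, \frac{1}{(k^2+r^2)^n} \left[ \frac{/\!\!\!r + ik}{k^2+r^2}\, \tilde\gamma^{a_1}\, \frac{/\!\!\!r + ik}{k^2+r^2} \cdots \tilde\gamma^{a_m}\, \frac{/\!\!\!r + ik}{k^2+r^2} \right]_{\alpha\beta} \frac{1}{k^2+r^2}. \tag{3.13} $$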
In the above equations, we have assumed $X_0 = 0$. The dependence of the effective action
on $X_0$ can be recovered by replacing $\partial_\tau$ with the covariant derivative $\partial_\tau + i[X_0,\ ]$, or
can be directly calculated with a slight modification of the above prescription due to the
presence of the derivative interaction $v_I^\dagger X_0 \partial_\tau v_I$.
Following the method explained above, we calculate the part of the effective action
containing fermionic backgrounds. We describe the $\theta^2$ terms in detail in Section 3.2.1,
and briefly discuss the terms with more $\theta$'s in Section 3.2.2. The results are presented in
Euclidean signature. The rule for obtaining the Minkowskian effective action from $\Gamma$ given
below is to replace $d\tau$ with $dt$, and $D_\tau$ with $-iD_0$.
The interaction starts at order $1/r^3$. Note that only $\theta_-$ (but not $\theta_+$) appears non-trivially in
the effective action, for only $\theta_-$ couples to the quantum fields $v$ or $\chi$, as we have seen in
Section 2.
3.2.1. θ^2 terms
We summarize the $\theta^2$ terms of the effective action $\Gamma_{\theta^2}(X_i^N, d)$ up to $N + d < 4$,
where $N$ and $d$ are the numbers of $X_i$'s and derivatives contained in $\Gamma$, respectively. We
present the result by assembling the terms with the same property.
There are two sets of terms which can be regarded as having similar structures as the θ−
part of the classical action:
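$$ \Gamma_{\theta^2}(d=0)_A = -\frac{N_4}{4} \int d\tau\, \frac{1}{r^3}\, \mathrm{Tr}\left( \theta_-^\dagger \tilde\gamma^a [X_a, \theta_-] - [X_a, \theta_-^\dagger] \tilde\gamma^a \theta_- \right) \tag{3.14} $$
$$ \qquad + \frac{3N_4}{4} \int d\tau\, \frac{r_b}{r^5}\, \mathrm{STr}\left( \theta_-^\dagger \tilde\gamma^a [X_a, \theta_-] - [X_a, \theta_-^\dagger] \tilde\gamma^a \theta_- \,;\, X_b \right) \tag{3.15} $$
$$ \qquad + \frac{3N_4}{8} \int d\tau\, \frac{1}{r^5}\, \mathrm{STr}\left( \theta_-^\dagger \tilde\gamma^a [X_a, \theta_-] - [X_a, \theta_-^\dagger] \tilde\gamma^a \theta_- \,;\, (X_b)^2 \right) \tag{3.16} $$
$$ \qquad - \frac{15N_4}{8} \int d\tau\, \frac{r_b r_c}{r^7}\, \mathrm{STr}\left( \theta_-^\dagger \tilde\gamma^a [X_a, \theta_-] - [X_a, \theta_-^\dagger] \tilde\gamma^a \theta_- \,;\, X_b, X_c \right), \tag{3.17} $$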
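$$ \Gamma_{\theta^2}(d=1)_A = \frac{N_4 \lambda}{4} \int d\tau\, \frac{1}{r^3}\, \mathrm{Tr}\left( \theta_-^\dagger D_\tau \theta_- - D_\tau \theta_-^\dagger\, \theta_- \right) \tag{3.18} $$
$$ \qquad - \frac{3N_4 \lambda}{4} \int d\tau\, \frac{r_b}{r^5}\, \mathrm{STr}\left( \theta_-^\dagger D_\tau \theta_- - D_\tau \theta_-^\dagger\, \theta_- \,;\, X_b \right) \tag{3.19} $$
$$ \qquad - \frac{3N_4 \lambda}{8} \int d\tau\, \frac{1}{r^5}\, \mathrm{STr}\left( \theta_-^\dagger D_\tau \theta_- - D_\tau \theta_-^\dagger\, \theta_- \,;\, (X_b)^2 \right) \tag{3.20} $$
$$ \qquad + \frac{15N_4 \lambda}{8} \int d\tau\, \frac{r_b r_c}{r^7}\, \mathrm{STr}\left( \theta_-^\dagger D_\tau \theta_- - D_\tau \theta_-^\dagger\, \theta_- \,;\, X_b, X_c \right). \tag{3.21} $$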
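Here $\mathrm{STr}(K_1 \cdots K_m; y_1, y_2, \ldots, y_n)$ means that the trace operation is taken after
symmetrizing all $K_i$'s and $y_j$'s but keeping the location of $K_1$ and the order of the $K_i$'s. Note
that each $K_i$ is $X_a$, $X_m$, $\theta_-$, or commutators (or covariant derivatives) of them. For example,
$$ \mathrm{STr}\left( \theta_-^\dagger D_\tau \theta_- ; X_b, X_c \right) = \frac{1}{6}\, \mathrm{Tr}\left( \theta_-^\dagger D_\tau \theta_- X_b X_c + \theta_-^\dagger X_b X_c D_\tau \theta_- + \theta_-^\dagger X_b D_\tau \theta_- X_c \right. $$
$$ \qquad \left. +\, \theta_-^\dagger D_\tau \theta_- X_c X_b + \theta_-^\dagger X_c X_b D_\tau \theta_- + \theta_-^\dagger X_c D_\tau \theta_- X_b \right). $$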
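The symmetrization rule can be made concrete by enumerating the orderings that enter the trace. The following is only an illustrative sketch: the function name `str_orderings` and the string labels are hypothetical, and it assumes the reading of the prescription in which $K_1$ stays first, the remaining $K_i$ keep their relative order, and the $y_j$ are placed in all possible ways, each ordering carrying equal weight.

```python
from itertools import permutations

def str_orderings(K, y):
    """List the operator orderings entering STr(K_1 ... K_m; y_1, ..., y_n).

    Assumption: K[0] stays in the first slot, the other K's keep their
    relative order, and the y's are inserted in every possible way.
    Each returned word carries weight 1 / len(result).
    """
    head, rest = K[0], K[1:]
    slots = [("K", i) for i in range(len(rest))] + [("y", j) for j in range(len(y))]
    words = []
    for perm in permutations(slots):
        k_order = [i for kind, i in perm if kind == "K"]
        if k_order != sorted(k_order):
            continue  # the K's must keep their relative order
        word = (head,) + tuple(rest[i] if kind == "K" else y[i] for kind, i in perm)
        words.append(word)
    return words

# Labels are plain strings standing in for theta_-^dagger, D_tau theta_-, X_b, X_c.
terms = str_orderings(["th+", "Dth"], ["Xb", "Xc"])
for word in terms:
    print(" ".join(word))
```

For two $K$'s and two $y$'s this produces six words, matching the six terms and the factor 1/6 in the example above.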
The terms in $\Gamma_{\theta^2}(d=0)_A$ and $\Gamma_{\theta^2}(d=1)_A$ are given by the non-Abelian Taylor expansion
of the leading terms (3.14) and (3.18)
$$ \frac{1}{r^3} \to \frac{1}{r^3} - \frac{3}{r^5}\, r_a X_a + \frac{1}{2}\left( -3\,\frac{\delta_{ab}}{r^5} + 15\,\frac{r_a r_b}{r^7} \right) X_a X_b + \cdots, $$
and following the symmetrized trace prescription [1,2], except for the fact that (3.16) and
(3.20) are of the form $\mathrm{STr}(\,\ast\,; (X_b)^2)$ rather than $\mathrm{STr}(\,\ast\,; X_b, X_b)$. Further discussion
on the ordering of matrices will be given in the next section when we consider the
supersymmetry of the effective action.
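As a sanity check on the coefficients of this expansion, one can compare it numerically with $1/|r + x|^3$ in the Abelian case, where the $X_a$ are replaced by ordinary commuting numbers $x_a$. This is only a sketch under that assumption; the helper names and the sample vectors are hypothetical and the matrix ordering questions discussed in the text are not captured.

```python
import math

def inv_r_cubed(r):
    """Exact value of 1/|r|^3 for a 3-vector r."""
    return 1.0 / math.sqrt(sum(c * c for c in r)) ** 3

def taylor_second_order(r, x):
    """1/r^3 - 3 (r.x)/r^5 + (1/2)(-3 x.x/r^5 + 15 (r.x)^2/r^7)."""
    rn = math.sqrt(sum(c * c for c in r))
    rx = sum(a * b for a, b in zip(r, x))
    xx = sum(c * c for c in x)
    return (1.0 / rn**3
            - 3.0 * rx / rn**5
            + 0.5 * (-3.0 * xx / rn**5 + 15.0 * rx**2 / rn**7))

r = (1.0, 2.0, 2.0)            # |r| = 3
x = (1e-3, 2e-3, -1.5e-3)      # small commuting displacement
shifted = tuple(a + b for a, b in zip(r, x))
print(inv_r_cubed(shifted) - taylor_second_order(r, x))  # residual is third order in x
```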
Terms with multiple number of gamma matrices are given as follows:
Γθ 2 (d = 0)B
3N4 ra
= dτ Tr θ−† γ̃ aa1 a2 X a1 , X
a2 θ− + θ−† γ̃ aa1 a2 θ− Xa1 , X
a2 (3.22)
16 r 5
3N4
+ dτ Tr θ−† γ̃ a1 a2 a3 Xa1 , X
a2 X a3 θ−
16 r 5
+ θ−† γ̃ a1 a2 a3 θ− X a2 X
a1 , X a3 (3.23)
15N4 ra rb
− dτ STr θ−† γ̃ aa1 a2 X a1 , Xa2 θ−
7
16 r
+ θ−† γ̃ aa1 a2 θ− X a1 , X b ,
a2 ; X (3.24)
Γθ 2 (d = 1)B
3N4 λ rb
=− a θ− + θ−† γ̃ ab θ− Dτ X
dτ Tr θ−† γ̃ ab Dτ X a (3.25)
8 r 5
3N4 λ 1
− b Dτ X
dτ Tr θ−† γ̃ ab X a θ− + θ−† γ̃ ab Dτ X
a X
b θ−
16 r 5
b Dτ X
+ θ−† γ̃ ab θ− X a + θ−† γ̃ ab θ− Dτ Xa X
b (3.26)
15N4 λ rb rc
+ dτ STr θ−† γ̃ ac Dτ Xa θ− + θ−† γ̃ ac θ− Dτ X b .
a ; X (3.27)
8 r 7
Γθ 2 (d = 1)C
3N4 λ
= dτ Tr θ−† X a , θ− , Dτ Xa + θ−† , X a , Dτ X a θ− (3.29)
16r 5
N4 λ
+ dτ Tr Dτ θ−† γ̃ ab X b , θ− − θ−† , X
a , X a , X
b γ̃ ab Dτ θ− .
32r 5
(3.30)
Terms containing Xm are collected as:
3N4 ra †
Γθ 2 (d = 0)φ = − dτ Tr Θ− Xm , X
n Γ 0a Γ mn Θ− (3.31)
16 r 5
3N4 †
− dτ Tr Θ− m , X
X n Γ 0a Γ mn Θ− X a (3.32)
16 r 5
15N4 ra rb †
+ dτ STr Θ− m , X
X n Γ 0a Γ mn Θ− ; X b , (3.33)
16 r 7
Γθ 2 (d = 1)φ
N4 λ † mn † mn
= 5
dτ Tr Dτ Θ− Xm , Xn Γ Θ− − Θ− Xm , Xn Γ Dτ Θ− . (3.34)
16 r
Note that we have used the notation of 10D gamma matrices and spinors to write (3.31)–(3.34).
The expressions in 6D notation are obtained by substituting the particular representation
(A.2) and (A.3) of the 10D gamma matrices and the parametrization (A.4) of the Majorana–Weyl
spinor $\Theta$, setting $\theta_+ = 0$. For example, (3.31) is written in 6D notation as
3N4 ra
5
dτ Tr θ−† γ̃ a θ− φ1 , φ̄1 + φ2 , φ̄2 + θ−† φ1 , φ̄1 + φ2 , φ̄2 γ̃ a θ−
16r
− 2θ− B φ1 , φ2 γ̃ a θ− − 2θ−† φ̄1 , φ̄2 γ̃ a B ∗ θ−† .
Terms in $\Gamma_{\theta^2}(d=0)_B$, $\Gamma_{\theta^2}(d=1)_B$ and $\Gamma_{\theta^2}(d=0)_\phi$ have the structure of the non-Abelian
Taylor expansion of $r_a/r^5$:
$$ \frac{r_a}{r^5} \to \frac{r_a}{r^5} + \left( \frac{\delta_{ab}}{r^5} - 5\,\frac{r_a r_b}{r^7} \right) X_b + \cdots. $$
They obey the symmetrized trace prescription except for the $1/r^5$ terms (3.23), (3.26) and (3.32),
as in the case of $\Gamma_{\theta^2}(d=0,1)_A$.
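There also exist terms with two derivatives:
$$ \Gamma_{\theta^2}(d=2) = \frac{N_4 \lambda^2}{8 r^5} \int d\tau\, \mathrm{Tr}\left( D_\tau \theta_-^\dagger\, \tilde\gamma^a [X_a, D_\tau \theta_-] - 2\,\theta_-^\dagger \tilde\gamma^a [D_\tau^2 X_a, \theta_-] \right). \tag{3.35} $$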
To summarize, we obtained
3.2.2. θ 2n terms
The leading terms of Γθ 2n are given by
n
2 1
Γθ 2n (X) , d = 0 = N4 2
0
dτ Tr L†In α1 LI1 β1 L†I1 α2 LI2 β2 · · · L†In−1 αn L†In βn
λ n
dk /r + ik /r + ik /r + ik
× ··· 2
2π k 2 + r 2 α1 β1 k 2 + r 2 α2 β2 k + r 2 αn βn
N4 λn−1
∝ 3n−1 dτ Tr θ−2n . (3.37)
r
For example, four-fermion terms without Xi insertions are given as
0, d = 0
Γθ 4 (X)
N4 λ 1
=− dτ Tr θ−† θ− θ−† θ− + θ− θ−† θ− θ−† − 2 θ− Bθ− θ−† B ∗ θ−†
16 r 5
5N4 λ ra rb
+ 7
dτ Tr θ−† γ̃a θ− θ−† γ̃b θ− + θ− γ̃a∗ θ−† θ− γ̃b∗ θ−†
16 r
− 2 θ− B γ̃a θ− θ−† γ̃b B ∗ θ−† . (3.38)
This does not vanish unlike the case of Γθ 2 .
In this section, we discuss the supersymmetry of the effective action. The one-loop
effective action which was obtained by integrating out the 0–4 sector fields satisfies the
Ward identity corresponding to the SUSY invariance of the classical action of BD matrix
theory. As we show in Appendix B, at the one-loop level, the Ward identity can be written
in the form
transformed and ra is kept fixed.2 The discussion in Appendix B shows that δ (1) is given
simply by the one-loop expectation value of the SUSY transformation for the classical
action. Since only δθ− has the part containing the quantum fields (δ θ− defined in (2.6)),
only δ (1) θ− is non-zero and is given by
δ (1) θ− = δ θ−
i † †
= − v1 v1 + v2 v2 η− − 2 v2 v1† B ∗ η† . (4.2)
λ
4.1. Explicit forms of the one-loop corrected SUSY transformation
2 This assignment is possible without losing generality. Note that we have not imposed the tracelessness for
a .
X
15N4 λ ra rb
b η
−i 7
Sym θ−† γ̃ a θ− − θ− γ̃ a∗ θ−† ; X
8 r
− 2 Sym θ− B γ̃ a θ− ; Xb B ∗ η† (4.9)
and
1 (1) (b)
δ θ−
gs ls 1/r 5
3N4
=i 5
[φ1 , φ2 ], φ̄1 , φ̄2 η + φ̄1 , φ̄2 , φ1 , φ̄1 + φ2 , φ̄2 B ∗ η†
16 r
(4.10)
N4 λ2 2
−i D φ1 , φ̄1 + φ2 , φ̄2 η − 2D02 φ̄1 , φ̄2 B ∗ η† (4.11)
16 r 5 0
N4 λ2 †
− 5
θ− D0 θ− − D0 θ− θ−† η − 2(θ− BD0 θ− )B ∗ η† . (4.12)
4r
Note that (4.3), (4.4), (4.6) and (4.7) correspond to the first three terms of non-Abelian
Taylor expansion of 1/r 3 around ra . Also, (4.5), (4.8) and (4.9) correspond to the
expansion of ra /r 5 .
Our corrected SUSY transformation closes to the translation plus a field dependent
gauge transformation [17]. By using the equations of motion for the effective action
S0 + S (1) , we have shown
(0)
δ8 + δ8(1) , δη(0) + δη(1) φ 1/r 4 ∼ DM φ 8̄Γ M η + (EOM for φ) + O (gs ls )2
In this subsection, we try to clarify the structure of the supersymmetric action which
we have obtained, by identifying the blocks which transform within themselves. Firstly,
since ra is kept fixed in the effective SUSY transformation, the invariance of the one-loop
effective action (4.1) holds at each order of 1/r. In addition, the effective action at order
1/r 5 is further divided into two invariant blocks.
We shall examine the θ 0 - and θ 2 -terms at each order of 1/r.3 We include the bosonic
terms obtained in the previous paper [16]. Order 1/r 3 terms of the effective action
λ 1
Γ1/r 3 = 3 dt Tr D0 X a D0 Xa + 1 X b X
a , X b − 1 [φ1 , φ2 ] φ̄1 , φ̄2
a , X
r 4 8λ2 4λ2
1 2 i † 1 † a
+ φ , φ̄ + φ , φ̄ − θ D θ − − θ γ̃ X , θ − (4.13)
2 − 2λ −
1 1 2 2 0 a
16λ2
satisfy (4.1) along with (4.3).
3 Note that Γ which are discussed in this section denote the parts of the Minkowskian effective action S (1) .
The other block $\Gamma^{(\mathrm{II})}_{1/r^5}$ consists of the rest of the terms of the $1/r^5$ effective action. They are
written with an extra number of commutators compared to (4.15). We do not present all the
terms explicitly, but give an example. The part which contains six $X_a$'s (and no derivatives
or fermions) takes the form
(II) 3N4 λ 1
a , X
c X
b , X
a , X c
b , X
Γ1/r 5 X a
6 = − 5
dt − 2
Tr X
2r 24λ
1
+ Tr X b , Xa , X
c X a , X b , X
c
12λ 2
1
− Tr a , X
X a , X
c X b , X b , X
c . (4.16)
24λ2
The criterion for discriminating $\Gamma^{(\mathrm{I})}_{1/r^5}$ and $\Gamma^{(\mathrm{II})}_{1/r^5}$ is given by the following number associated
with each term:
$$ n = d + \frac{1}{2}\, n_\theta + n_c. \tag{4.17} $$
Here, $d$ is the number of derivatives, $n_\theta$ is the number of fermions and $n_c$ is the
number of commutators in the symmetrized trace. (Symmetrization is applied regarding
the commutator as a single unit.) Under $\delta^{(0)} S^{(1)}$, the number $n$ is preserved (it uniformly
increases by 1/2), as we see from the transformation laws (2.4) and (2.5). Also, terms in $\delta^{(1)} S_0$
are classified in the same way. Thus, terms which are connected by a SUSY transformation
must have the same $n$. The first two terms on the RHS of (4.17) are called the 'order' [18]
and are used to specify the SUSY invariants in the case of Abelian v.e.v. Our discussion here
is the non-Abelian generalization of that concept. Terms in $\Gamma_{1/r^3}$, $\Gamma_{1/r^4}$ and $\Gamma^{(\mathrm{I})}_{1/r^5}$ have
$n = 2$, and terms in $\Gamma^{(\mathrm{II})}_{1/r^5}$ have $n = 4$.
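The counting in (4.17) is simple bookkeeping and can be sketched in a few lines. The function name `susy_grade` and the sample terms are hypothetical illustrations; the counts assigned to each sample term are read off from its schematic form and are assumptions of this sketch.

```python
from fractions import Fraction

def susy_grade(d, n_theta, n_c):
    """n = d + n_theta / 2 + n_c, as in (4.17)."""
    return Fraction(d) + Fraction(n_theta, 2) + Fraction(n_c)

# (derivatives, fermions, commutators) for some schematic terms
samples = {
    "D0 X D0 X":             (2, 0, 0),
    "[X, X][X, X]":          (0, 0, 2),
    "theta+ D0 theta":       (1, 2, 0),
    "theta+ gamma [X, theta]": (0, 2, 1),
    "[X,[X,X]][X,[X,X]]":    (0, 0, 4),
}
for name, counts in samples.items():
    print(name, susy_grade(*counts))
```

The first four sample terms come out with $n = 2$ and the doubly nested commutator term with $n = 4$, in line with the assignment of the two blocks stated above.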
We note here that the bosonic terms in $\Gamma_{1/r^3}$, $\Gamma_{1/r^4}$ and $\Gamma^{(\mathrm{I})}_{1/r^5}$ are the ones which result
from Taylor and Van Raamsdonk's proposal when we apply it to the L5-brane background.
We can see that the proposed currents which couple to SUGRA fields have $n = 2$, from
the explicit forms in Refs. [1,16]. Bosonic terms of the matrix expressions for multipole
moments are obtained from the currents by inserting $X_i$ with symmetric ordering, and thus also
have $n = 2$. For a detailed description of Taylor and Van Raamsdonk's proposal applied
to the present background, see our previous paper [16].
We also note that the fermionic terms in $\Gamma_{1/r^3}$, $\Gamma_{1/r^4}$ and $\Gamma^{(\mathrm{I})}_{1/r^5}$ are not of the same
forms as the ones arising from the above proposal, which is based on the analysis of the BFSS
model. Whether the two forms of the fermionic terms are physically equivalent or not is
not clear at present.
5. Discussion
Acknowledgements
In this paper, we mainly use the complex 6D Weyl spinors. We summarize here our
conventions for spinors and gamma matrices. The fermions from the 0–0 sector are given
by Majorana–Weyl spinors in 10D. We explain how the fermionic terms of $S_0$ in (2.2) are
obtained from the familiar 10D notation by choosing an explicit representation.
We consider a fermionic matrix field $\Theta$ which satisfies the 10D Majorana–Weyl condition
$$ \Gamma^{(10)} \Theta = \Theta, \qquad \Theta^\dagger = B^{(10)} \Theta, \tag{A.1} $$
where $\Gamma^{(10)} = \Gamma^0 \Gamma^1 \cdots \Gamma^9$ is the 10D chirality matrix, and $B^{(10)}$ is defined by
$B^{(10)} \Gamma^M B^{(10)-1} = -\Gamma^{M*}$ ($M = 0, \ldots, 9$).
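We make the SO(5,1) × SO(4) decomposition of the 10D gamma matrices as follows:
$$ \Gamma^\mu = \gamma^\mu \otimes \hat\gamma \quad (\mu = 0, 5, \ldots, 9), \qquad \Gamma^m = 1 \otimes \hat\gamma^m \quad (m = 1, \ldots, 4), \tag{A.2} $$
where we have defined $\hat\gamma = \hat\gamma^1 \hat\gamma^2 \hat\gamma^3 \hat\gamma^4$. The SO(5,1) gamma matrices $\gamma^\mu$ have 8×8
components and the SO(4) gamma matrices $\hat\gamma^m$ have 4×4 components. We further assume
the explicit representation for the SO(4) gamma matrices
$$ \hat\gamma^m = \begin{pmatrix} 0 & \sigma_m \\ -\bar\sigma_m & 0 \end{pmatrix}, \tag{A.3} $$
where $\sigma_i$ ($i = 1, 2, 3$) are Pauli matrices, and $\sigma_4 = i\,1$. We also define $\bar\sigma_i = -\sigma_i$ and
$\bar\sigma_4 = \sigma_4$.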
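The representation (A.3) can be checked directly: the four matrices should satisfy the Clifford algebra $\{\hat\gamma^m, \hat\gamma^n\} = 2\delta^{mn}$, and their product $\hat\gamma$ should be diagonal with entries $(1, 1, -1, -1)$. The sketch below does this with plain nested lists; all helper names are hypothetical and introduced only for this check.

```python
def matmul(A, B):
    """Product of two square matrices stored as nested lists."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def scale(c, A):
    return [[c * x for x in row] for row in A]

def block_offdiag(upper, lower):
    """4x4 matrix [[0, upper], [lower, 0]] built from 2x2 blocks."""
    zero = [0, 0]
    return [zero + upper[0], zero + upper[1], lower[0] + zero, lower[1] + zero]

sigma = {
    1: [[0, 1], [1, 0]],
    2: [[0, -1j], [1j, 0]],
    3: [[1, 0], [0, -1]],
    4: [[1j, 0], [0, 1j]],          # sigma_4 = i times the identity
}
sigma_bar = {m: scale(-1, sigma[m]) for m in (1, 2, 3)}
sigma_bar[4] = sigma[4]

# gamma_hat^m = [[0, sigma_m], [-sigma_bar_m, 0]] as in (A.3)
gamma_hat = {m: block_offdiag(sigma[m], scale(-1, sigma_bar[m])) for m in range(1, 5)}

def anticommutator(A, B):
    AB, BA = matmul(A, B), matmul(B, A)
    return [[AB[i][j] + BA[i][j] for j in range(4)] for i in range(4)]

identity4 = [[1 if i == j else 0 for j in range(4)] for i in range(4)]

# gamma_hat = gamma_hat^1 gamma_hat^2 gamma_hat^3 gamma_hat^4
chirality = identity4
for m in range(1, 5):
    chirality = matmul(chirality, gamma_hat[m])
```

The diagonal form of `chirality` is what makes the 4D chirality split the four components of a spinor into an upper and a lower pair.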
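The chirality in 10D is the product of the 6D and 4D chiralities, and in the above
representation of gamma matrices,
$$ \Gamma^{(10)} = \bar\gamma \otimes \begin{pmatrix} 1_{2\times 2} & 0 \\ 0 & -1_{2\times 2} \end{pmatrix}, $$
where $\bar\gamma = \gamma^0 \gamma^5 \cdots \gamma^9$ is the 6D chirality matrix. Consequently, the 10D Weyl spinor $\Theta$ is
decomposed into two pairs of 6D Weyl spinors
$$ \Theta = \begin{pmatrix} \theta_{+,1} \\ \theta_{+,2} \\ \theta_{-,1} \\ \theta_{-,2} \end{pmatrix}. $$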
The matrix B (10) also allows the SO(5,1)×SO(4) decomposition
B (10) = B (6) ⊗ B (4) ,
where B (6) and B (4) satisfy
B (6) γ µ B (6)−1 = −γ µ∗ , B (4) γ̂ m B (4)−1 = −γ̂ m∗ .
We note here the relation for B (6)
B (6)T = −B (6), B (6)∗ = −B (6)−1.
θ† −B θ−,1
(6)
−,2
Since we prefer using unconstrained fermions to perform the loop calculations, we
explicitly eliminate half of the components of $\Theta$ ($\theta_{+,2}$ and $\theta_{-,2}$) using the SU(2) Majorana
conditions. That is, we take
$$ \Theta = \begin{pmatrix} \theta_+ \\ -B^{(6)*} \theta_+^\dagger \\ \theta_- \\ -B^{(6)*} \theta_-^\dagger \end{pmatrix}. \tag{A.4} $$
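The fermionic part of the action in (2.2) is obtained from the 10D covariant form
$$ \frac{1}{g_s \ell_s} \int dt\, \mathrm{Tr}\left( \frac{i}{2}\, \Theta \Gamma^0 D_0 \Theta + \frac{1}{2\lambda}\, \Theta \Gamma^i [\Theta, X_i] \right) $$
by taking the above representation of gamma matrices and substituting (A.4).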
The one-loop effective action of the BD matrix theory is given by integrating out the
0–4 sector fields from the classical action $S_0[\phi] + S_5[\phi, \varphi]$:
$$ e^{-(S_0[\phi] + S^{(1)}[\phi])} = \int D\varphi\; e^{-(S_0[\phi] + S_5[\phi,\varphi])}. \tag{B.1} $$
Note that we denote the 0–0 sector fields by $\phi$ and the 0–4 sector fields by $\varphi$, and that the
integration $\int d\tau$ is implicit throughout this appendix.
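As a consequence of the SUSY invariance of the classical action, the following Ward
identity holds:
$$ 0 = \int D\varphi \left( \delta_\eta \phi\, \frac{\delta S_0[\phi]}{\delta\phi} + \delta_\eta \phi\, \frac{\delta S_5[\phi,\varphi]}{\delta\phi} + \delta_\eta \varphi\, \frac{\delta S_5[\phi,\varphi]}{\delta\varphi} \right) e^{-(S_0[\phi] + S_5[\phi,\varphi])}, \tag{B.2} $$
where $\delta_\eta \phi$ and $\delta_\eta \varphi$ are the classical SUSY transformations given in (2.4)–(2.7). We shall
denote the part of $\delta_\eta \phi$ which contains only the 0–0 sector fields by $\delta^{(0)}_\eta \phi$, and the part
containing 0–4 sector fields by $\delta'_\eta \phi$.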
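We first note that the last term of (B.2) vanishes, for it is the infinitesimal form of the
invariance of the integral under the change of variables
$$ \int D\varphi\; e^{-S_5[\phi,\varphi]} = \int D(\varphi + \delta\varphi)\; e^{-S_5[\phi,\varphi+\delta\varphi]} = \int D\varphi\; e^{-S_5[\phi,\varphi+\delta\varphi]}. $$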
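$$ \int D\varphi\; \delta_\eta \phi\, \frac{\delta S_0}{\delta\phi}\, e^{-(S_0 + S_5)} = \left( \delta^{(0)}_\eta \phi + \delta^{(1)}_\eta \phi \right) \frac{\delta S_0[\phi]}{\delta\phi}\, e^{-(S_0[\phi] + S^{(1)}[\phi])}, $$
where
$$ \delta^{(1)}_\eta \phi \equiv \frac{\int D\varphi\; \delta'_\eta \phi\; e^{-S_5}}{\int D\varphi\; e^{-S_5}}. \tag{B.3} $$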
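Finally, if we evaluate the second term of (B.2) to the one-loop order, it reduces to
$$ \int D\varphi\; \delta_\eta \phi\, \frac{\delta S_5[\phi,\varphi]}{\delta\phi}\, e^{-(S_0[\phi] + S_5[\phi,\varphi])} = \delta^{(0)}_\eta \phi \int D\varphi\; \frac{\delta S_5[\phi,\varphi]}{\delta\phi}\, e^{-(S_0[\phi] + S_5[\phi,\varphi])}. \tag{B.4} $$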
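For the present model,
$$ \int D\varphi\; \delta'_\eta \phi\, \frac{\delta S_5[\phi,\varphi]}{\delta\phi}\, e^{-(S_0[\phi] + S_5[\phi,\varphi])}, \tag{B.5} $$
which would be present in the RHS of (B.4), has at least two loops, for
$\delta'_\eta \phi\, \delta S_5[\phi,\varphi]/\delta\phi$
is a four-point vertex, as we see from the explicit forms of $\delta'\phi$ (2.6) and $S_5$ (2.3). We
further note that the RHS of (B.4) gives the variation of the one-loop effective action (times
$e^{-(S_0 + S^{(1)})}$), as we see from
$$ \frac{\delta S^{(1)}}{\delta\phi} = \frac{\int D\varphi\; \frac{\delta S_5}{\delta\phi}\, e^{-S_5}}{\int D\varphi\; e^{-S_5}}, $$
which results from the definition (B.1) of the effective action.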
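Taking these facts into account, to the one-loop order, (B.2) is rewritten as
$$ 0 = \delta^{(0)}_\eta \phi\, \frac{\delta S_0[\phi]}{\delta\phi} + \delta^{(1)}_\eta \phi\, \frac{\delta S_0[\phi]}{\delta\phi} + \delta^{(0)}_\eta \phi\, \frac{\delta S^{(1)}[\phi]}{\delta\phi}, \tag{B.6} $$
which shows that the effective action $S_0 + S^{(1)}$ is invariant under the effective SUSY
transformation $\delta^{(0)}_\eta \phi + \delta^{(1)}_\eta \phi$.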
We note that the present discussion is not directly applicable for showing the SUSY of
the effective action at 2-loop order and beyond. At these orders, (B.4) should no longer be
valid, due to the contribution from (B.5).
In addition, we remark here that our simple discussion, which involves only the Ward
identity for the SUSY, is possible because the gauge fields (which belong to the 0–0
sector) are not integrated. When they are to be integrated, as in the case of the BFSS model,
an analysis of the Ward identity for the SUSY plus the one for the gauge symmetry is required
for studying the invariance of the effective action under effective SUSY. This is essentially due
to the fact that the gauge fixing term and the ghost action break SUSY. The formalism is
developed in Ref. [20] and the explicit form of the effective SUSY transformation laws for the
one-loop effective action of the BFSS model (for Abelian v.e.v.) is given in Ref. [21].
References
[1] W. Taylor, M. Van Raamsdonk, Supergravity currents and linearized interactions for matrix theory
configurations with fermionic backgrounds, JHEP 9904 (1999) 013, hep-th/9812239.
[2] R.C. Myers, Dielectric-branes, JHEP 9912 (1999) 022, hep-th/9910053.
[3] M. Cederwall, A. von Gussich, B.E.W. Nilsson, P. Sundell, A. Westerberg, The Dirichlet super-p-branes in
ten-dimensional type IIA and IIB supergravity, Nucl. Phys. B 490 (1997) 179, hep-th/9611159.
[4] M. Aganagic, C. Popescu, J.H. Schwarz, Gauge-invariant and gauge-fixed D-brane actions, Nucl. Phys.
B 495 (1997) 99, hep-th/9612080.
[5] E.A. Bergshoeff, M. de Roo, A. Sevrin, Non-Abelian Born–Infeld and kappa-symmetry, hep-th/0011018.
[6] E.A. Bergshoeff, A. Bilal, M. de Roo, A. Sevrin, Supersymmetric non-Abelian Born–Infeld revisited,
JHEP 0107 (2001) 029, hep-th/0105274.
[7] M. Cederwall, B.E.W. Nilsson, D. Tsimpis, D = 10 super-Yang–Mills at O(α 2 ), JHEP 0107 (2001) 042,
hep-th/0104236.
[8] A. Collinucci, M. de Roo, M.G.C. Eenink, Supersymmetric Yang–Mills theory at order α 3 , JHEP 0206
(2002) 024, hep-th/0205150.
[9] M. de Roo, M.G.C. Eenink, P. Koerber, A. Sevrin, Testing the fermionic terms in the non-Abelian D-brane
effective action through order α 3 , hep-th/0207015.
[10] T. Banks, W. Fischler, S.H. Shenker, L. Susskind, M-theory as a matrix model: a conjecture, Phys. Rev. D 55
(1997) 5112, hep-th/9610043.
[11] D. Kabat, W. Taylor, Linearized supergravity from matrix theory, Phys. Lett. B 426 (1998) 297,
hep-th/9712185.
[12] W. Taylor, M. Van Raamsdonk, Multiple D0-branes in weakly curved backgrounds, Nucl. Phys. B 558
(1999) 63, hep-th/9904095.
[13] W. Taylor, M. Van Raamsdonk, Multiple Dp-branes in weak background fields, Nucl. Phys. B 573 (2000)
703, hep-th/9910052.
[14] I. Klebanov, W. Taylor, M. Van Raamsdonk, Absorption of dilaton partial waves by D3-branes, Nucl. Phys.
B 560 (1999) 207, hep-th/9905174.
[15] M. Berkooz, M.R. Douglas, Five-branes in M(atrix) theory, Phys. Lett. B 395 (1997) 196, hep-th/9610236.
[16] M. Asano, Y. Sekino, Non-Abelian action of D0-branes from matrix theory in the longitudinal 5-brane
background, Nucl. Phys. B 639 (2002) 370, hep-th/0201248.
[17] B. de Wit, D.Z. Freedman, Combined supersymmetric and gauge-invariant field theories, Phys. Rev. D 12
(1975) 2286.
[18] J.A. Harvey, Spin dependence of D0-brane interactions, Nucl. Phys. Proc. Suppl. 68 (1998) 113,
hep-th/9706039.
[19] M.R. Douglas, D. Kabat, P. Pouliot, S.H. Shenker, D-branes and short distances in string theory, Nucl. Phys.
B 485 (1997) 85, hep-th/9608024.
[20] Y. Kazama, T. Muramatsu, On the supersymmetry and gauge structure of matrix theory, Nucl. Phys. B 584
(2000) 171, hep-th/0003161.
[21] Y. Kazama, T. Muramatsu, Fully off-shell effective action and its supersymmetry in matrix theory, Class.
Quantum Grav. 18 (2001) 2277, hep-th/0103116.
Nuclear Physics B 644 (2002) 170–200
www.elsevier.com/locate/npe
Abstract
We study D-branes on smooth noncompact toric Calabi–Yau manifolds that are resolutions
of Abelian orbifold singularities. Such a space has a distinguished basis {Si } for the compactly
supported K-theory. Using local mirror symmetry we demonstrate that the Si have simple
transformation properties under monodromy; in particular, they are the objects that generate
monodromy around the principal component of the discriminant locus. One of our examples, the
toric resolution of C3 /(Z2 × Z2 ), is a three parameter model for which we are able to give an
explicit solution of the GKZ system.
2002 Elsevier Science B.V. All rights reserved.
1. Introduction
X. de la Ossa et al. / Nuclear Physics B 644 (2002) 170–200 171
is generated over R+ by exactly d lattice vectors that generate N over Z. We will only
consider this case.
Perhaps the simplest way of describing X is as follows: assume that there are k one-
dimensional cones in Σ generated by lattice vectors v1 , . . . , vk . Assign a homogeneous
variable zi to each of the vi and a multiplicative equivalence relation among the zi ,
$$ (z_1, \ldots, z_k) \sim (\lambda^{q_1} z_1, \ldots, \lambda^{q_k} z_k) \tag{2.1} $$
with λ ∈ C∗ for any linear relation q1 v1 + · · · + qk vk = 0 among the generators vi . The
qi can be normalized to be integers without common divisor; in the context of a gauged
linear sigma model they are the charges with respect to the U (1) fields. The number of
independent relations of the type (2.1) is k − d.
Define a subset Ck \ FΣ of Ck = {(z1 , . . . , zk )} as the set of all k-tuples of zi with
the following property: if zi vanishes for all i ∈ I ⊂ {1, . . . , k}, then all vi with i ∈ I
belong to the same cone. Then X is (Ck \ FΣ )/(C∗ )k−d , where the division by (C∗ )k−d
is implemented by taking equivalence classes with respect to the multiplicative relations
(2.1).
Every one-dimensional cone generated by vi corresponds in a natural way to the
divisor Di determined by zi = 0. Similarly, an l-dimensional cone spanned by vi1 , . . . , vil
determines the codimension l subspace zi1 = · · · = zil = 0 of X.
Monomials of the type $z_1^{a_1} \cdots z_k^{a_k}$ are sections of line bundles $O(a_1 D_1 + \cdots + a_k D_k)$.
If we denote by $M$ the lattice dual to $N$ and by $\langle\ ,\ \rangle$ the pairing between $N$ and $M$, it is
easily checked that monomials of the form $z_1^{\langle v_1, m\rangle} \cdots z_k^{\langle v_k, m\rangle}$ with $m \in M$ are meromorphic
functions (i.e., invariant under (2.1)) on $X$. This implies the linear equivalence relations
imply that intersection numbers between d different toric divisors are 1 or 0 depending on
whether these divisors form a cone in Σ. This implies C ·Dd = ld = 1, C ·Dd+1 = ld+1 = 1
and C · Di = 0 for i > d + 1. For calculating C · Di with i < d we have to use linear
equivalence relations of the type (2.2). To calculate $C \cdot D_1$ we may choose $m$ to fulfill
$\langle v_1, m\rangle = 1$ and $\langle v_2, m\rangle = \cdots = \langle v_d, m\rangle = 0$. Then
$$ 0 \sim \sum_i \langle v_i, m\rangle D_i = \langle v_1, m\rangle D_1 + \langle v_{d+1}, m\rangle D_{d+1} + \cdots $$
$$ \quad = D_1 + \langle -l_1 v_1 - \cdots - l_d v_d, m\rangle D_{d+1} + \cdots $$
$$ \quad = D_1 - l_1 D_{d+1} + \cdots, \tag{2.3} $$
i.e., $D_1 \sim l_1 D_{d+1} + \cdots$ where '$\cdots$' stands for $D_i$ with $i > d + 1$ which do not intersect $C$.
Thus we find that $C \cdot D_1 = l_1\, C \cdot D_{d+1} = l_1$. As our choice of $D_1$ among the $D_i$ with $i < d$
was arbitrary, we have indeed shown that $C \cdot D_i = l_i$ for any $i$.
A set of generators for the Mori cone is then given by all those curves $C^{(i)}$ whose $l^{(i)}$
cannot be written as nonnegative linear combinations of the other $l^{(j)}$. The matrix $L$ whose
rows are the $l^{(i)}$ of the Mori cone generators has the following remarkable properties:
any matrix $Q$ consisting of $k - d$ independent (linear combinations of) rows of $L$ serves
as a 'charge matrix' for the relations (2.1). If the Mori cone is simplicial, we just have
$L = Q$. This will be the case in most of our examples, so we will not distinguish between
$L$ and $Q$ in these cases. Any column of $L$ is associated with a toric divisor $D_i$. If a linear
combination $\sum_j L_{ij} a_j$ of column vectors of $Q$ vanishes, then the corresponding divisor
$\sum_j a_j D_j$ has vanishing intersection with any effective curve, i.e., it is trivial. Therefore a
diagram displaying the column vectors of $L$ or $Q$ encodes the linear equivalence relations
among the toric divisors $D_i$. We may interpret these vectors as one-dimensional cones of
a fan, the so-called 'secondary fan' of $X$. Note, however, that two distinct but linearly
equivalent toric divisors correspond to the same vector in the secondary fan. As the entries
of $L$ are the intersections between the generators of the Mori cone and the divisors, the
Kähler cone of $X$ is determined by those $\sum_j a_j D_j$ such that the corresponding linear
combinations of the columns of $L$ only have nonnegative entries.
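The two uses of the columns of $L$ described above (detecting trivial divisors and testing the Kähler condition) amount to a small linear computation. The sketch below illustrates this with the intersection numbers quoted in (2.16) for the curves $(h, f)$ and the divisors $D_1, D_2, D_4, D_5$; the divisor $D_3$ is left out because its numbers are not quoted there, and the function names are hypothetical.

```python
# Columns of L: intersection numbers (D_j . h, D_j . f) as quoted in (2.16).
INTERSECTIONS = {
    "D1": (1, 0),
    "D2": (0, 1),
    "D4": (-3, 1),
    "D5": (1, -2),
}

def curve_pairings(coeffs):
    """Pair the divisor sum_j a_j D_j with the curves h and f."""
    totals = [0, 0]
    for name, a in coeffs.items():
        for i, entry in enumerate(INTERSECTIONS[name]):
            totals[i] += a * entry
    return tuple(totals)

def in_kahler_cone(coeffs):
    """True if the combination of columns has only nonnegative entries."""
    return all(t >= 0 for t in curve_pairings(coeffs))

print(in_kahler_cone({"D1": 1, "D2": 2}))                  # nonnegative on h and f
print(in_kahler_cone({"D4": 1}))                           # negative on h
print(curve_pairings({"D4": 1, "D1": 3, "D2": -1}))        # vanishing column combination
```

A vanishing result from `curve_pairings` signals a combination of divisors that is trivial in the sense of the paragraph above.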
We should stress that our analysis was in terms of a single fixed triangulation. If we
allow several distinct triangulations, the Mori cone vectors of any of them will lead to
correct charge matrices Q but the Kähler condition will depend on which combinations
of the charge vectors correspond to the Mori cone, i.e., on the choice of triangulation. In
this way several regions of a secondary fan constructed from some charge matrix Q can
correspond to different ‘geometric phases’ in the sense of [33,34].
We will now present some of the examples that we are going to use in this paper.
Example 1. The toric resolution of C2 /Zn .
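We have toric divisors $D_0, \ldots, D_n$ corresponding to vectors
$$ v_0 = \begin{pmatrix} 0 \\ 1 \end{pmatrix}, \quad v_1 = \begin{pmatrix} 1 \\ 1 \end{pmatrix}, \quad \ldots, \quad v_n = \begin{pmatrix} n \\ 1 \end{pmatrix}. \tag{2.4} $$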
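For these vectors the linear relations entering (2.1) can be written down and checked explicitly. The sketch below assumes the relations $v_{i-1} - 2 v_i + v_{i+1} = 0$, which follow directly from (2.4); the function names are hypothetical. It verifies that each charge vector annihilates the $v_i$ and that the number of independent relations is $k - d$ with $k = n + 1$ and $d = 2$.

```python
from fractions import Fraction

def fan_vectors(n):
    """The generators v_i = (i, 1), i = 0..n, of (2.4)."""
    return [(i, 1) for i in range(n + 1)]

def charge_matrix(n):
    """Rows q with q_{i-1} = 1, q_i = -2, q_{i+1} = 1 (assumed from (2.4))."""
    rows = []
    for i in range(1, n):
        row = [0] * (n + 1)
        row[i - 1], row[i], row[i + 1] = 1, -2, 1
        rows.append(row)
    return rows

def relation_value(q, vectors):
    """Compute sum_i q_i v_i componentwise."""
    dim = len(vectors[0])
    return tuple(sum(qi * v[c] for qi, v in zip(q, vectors)) for c in range(dim))

def rank(rows):
    """Rank over the rationals by Gauss-Jordan elimination."""
    m = [[Fraction(x) for x in r] for r in rows]
    rk = 0
    for c in range(len(m[0]) if m else 0):
        piv = next((r for r in range(rk, len(m)) if m[r][c] != 0), None)
        if piv is None:
            continue
        m[rk], m[piv] = m[piv], m[rk]
        for r in range(len(m)):
            if r != rk and m[r][c] != 0:
                f = m[r][c] / m[rk][c]
                m[r] = [a - f * b for a, b in zip(m[r], m[rk])]
        rk += 1
    return rk

V, Q = fan_vectors(4), charge_matrix(4)
print([relation_value(q, V) for q in Q], rank(Q))
```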
D4 · D5 = D4 · D1 = h, D4 · D2 = 0,
D5 · D1 = f, D5 · D2 = h + 3f, (2.14)
where f is the fibre of the F3 . The self-intersections of D4 and D5 are
D4 · h = −3, D4 · f = 1, D5 · h = 1, D5 · f = −2,
D1 · h = 1, D1 · f = 0, D2 · h = 0, D2 · f = 1. (2.16)
This implies that (C1 , C2 ) = (h, f ) and (D1 , D2 ) form mutually dual bases of the Mori
cone and the Kähler cone of X. In terms of codimension one (here, two-dimensional)
cones σ and the linear relations between the rays in the two cones of maximal dimension
that contain σ , we obtain the following linear relations among the vectors v1 , . . . , v5 of the
fan:
In our study of D-brane states we will have to address issues that involve quantum
geometry. A standard tool for this problem is the use of mirror symmetry. In particular,
classical periods in the mirror geometry get mapped to quantum corrected expressions
related to the middle cohomology of the original space. In the noncompact case one has
to use local mirror symmetry. For our applications of this subject we have relied mainly
on [35] and we refer to this paper for further references. The authors of [35] consider
decompactifications of Calabi–Yau hypersurfaces in toric varieties such that the volumes
of certain cycles remain compact. They show that in the decompactification limit these
cycles lead to differential equations that are identical with the GKZ differential systems
of a lower-dimensional geometry. We will assume that this remains true even for cases
where the non-compact Calabi–Yau geometry cannot be identified with a limiting case of
a compact Calabi–Yau hypersurface.
The local mirror of a $d$-dimensional noncompact Calabi–Yau geometry is determined
by interpreting the diagram of the hyperplane containing the end points of the $v_i$ now as
a polytope $P$ in a $(d-1)$-dimensional lattice $\tilde M$. A polytope corresponds to a line bundle
$L$ over a toric variety $V$ by the following construction: fix any point in $\tilde M$ to be the origin.
Describe the facets of $P$ by equations $E_j(\tilde m) := \langle \tilde v_j, \tilde m\rangle + c_j = 0$, where $\tilde v_j \in \tilde N$, the lattice
dual to $\tilde M$, and fix the sign ambiguity about $\tilde v_j$ in such a way that $E_j(\tilde m)$ is nonnegative for
points $\tilde m$ of $P$. Choose $V$ to be a toric variety whose one-dimensional rays are the $\tilde v_j \in \tilde N$,
corresponding to a variable $x_j$ as in the previous section. To every point $\tilde m \in \tilde M$ assign the
monomial $\prod_j x_j^{E_j(\tilde m)}$. Then $L$ is the bundle whose sections are determined by polynomials
of the type
$$ P(a; x) = \sum_{i=1}^{k} a_i \prod_j x_j^{E_j(\tilde m_i)}. \tag{3.1} $$
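To see the construction (3.1) at work in the simplest case, consider the fan (2.4): the end points of the $v_i$ lie on a line, so the polytope is a segment. The sketch below assumes this segment is $0 \le \tilde m \le n$ with facet functions $E_1(\tilde m) = \tilde m$ and $E_2(\tilde m) = n - \tilde m$; these choices and all names are assumptions of the illustration, not taken from the text. Under these assumptions $V$ is $\mathbb{P}^1$ and $P(a; x)$ is a homogeneous polynomial of degree $n$ in $(x_1, x_2)$.

```python
def facet_values(m, n):
    """E_1(m) = m and E_2(m) = n - m for the segment 0 <= m <= n (assumed)."""
    return (m, n - m)

def mirror_monomials(n):
    """Exponents (E_1, E_2) of x_1, x_2 for each lattice point of the segment."""
    return [facet_values(m, n) for m in range(n + 1)]

def evaluate(a, x1, x2):
    """P(a; x) = sum_i a_i x1^{E_1(m_i)} x2^{E_2(m_i)} as in (3.1)."""
    n = len(a) - 1
    return sum(a_i * x1 ** e1 * x2 ** e2
               for a_i, (e1, e2) in zip(a, mirror_monomials(n)))

print(mirror_monomials(3))
print(evaluate([1, 2, 3, 4], 3, 5))
```

Rescaling $(x_1, x_2)$ by a common factor $t$ multiplies `evaluate` by $t^n$, which is the homogeneity expected of a section of a degree $n$ line bundle on $\mathbb{P}^1$.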
for any j . Given identifications of this type it is natural to seek a description in terms of
toric geometry. If we interpret the exponents of the λ’s as linear relations among vectors ui
in a toric diagram and notice that the ṽj generate M (at least over the rational numbers),
we find that the ui fulfill
m, v1 u1 +
m, v2 u2 + · · · +
m, vk uk = 0 (3.3)
for any m ∈ M. These are just the relations among the vectors of the secondary fan which
encodes, as we saw, the linear equivalence relations (2.2) of the divisors Di corresponding
to the vi . There are some subtleties, however: as we saw in the previous section, it is
possible that two distinct (but linearly equivalent) toric divisors lead to the same vector
in the secondary fan. We will show how to interpret this in the context of the examples.
Besides, it is possible that there are identifications in the moduli space that do not come
from rescalings of the type xj → λj xj and hence have a structure different from (3.2).
If this occurs, the toric variety associated with the secondary fan is called the ‘simplified
moduli space’ Msimp . Depending on whether we have extra identifications or not, the toric
variety corresponding to the secondary fan is a compactification of Msmooth (the moduli
space of all smooth local mirror hypersurfaces) or a covering space of a compactification
of Msmooth.
$\tilde X$ will degenerate over various loci in $M_{\mathrm{simp}}$ where $\partial P(a; x)/\partial x_j = 0$ can be solved
for all $j$ without violating the conditions on which $x_i$ are allowed to vanish simultaneously.
Some of these loci may just be toric divisors, but usually there is also at least one connected
piece given by a polynomial equation in the ai to which we will refer as the primary or
principal component of the discriminant locus.
If we want to relate the mirror geometry to the original one, we have to find a region in
the moduli space where quantum corrections are strongly suppressed. This is the case for
the deep interior of the Kähler cone, the so-called large volume limit, which is dual to the
large complex structure limit. As we saw in Section 2, the Kähler cone can be determined
by writing any divisor as a linear combination of toric divisors and demanding that the
corresponding linear combination of columns of the matrix L contain only nonnegative
entries. If the resulting generators do not belong to the secondary fan, we have to blow up
the moduli space in order to be able to change to the large complex structure variables.
In those cases where the Mori cone is simplicial we can draw the secondary fan by
displaying the columns of L and the generators of the Kähler cone will be nothing but
the unit vectors. If we then write the linear relations among the vectors in the secondary
fan in such a way that we express every vector in terms of the unit vectors and use the
corresponding rules (2.1) to set all variables except the large complex structure variables$^1$
$z_i$ to 1, we find that the $z_i$ can be expressed as
$$ z_i = \prod_{j=1}^{k} a_j^{\,l^{(i)}_j}. \tag{3.4} $$
Note that we do not include a sign here (compare with, e.g., [36]).
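A short numerical illustration of (3.4): for a charge vector $l = (1, -2, 1)$, as arises for three consecutive vectors of (2.4), the variable is $z = a_0 a_2 / a_1^2$, and it is unchanged when the $a_j$ are rescaled by $t^{\langle m, v_j\rangle}$ because $\sum_j l_j v_j = 0$. The function name, the sample values and the choice of $m$ and $t$ are hypothetical.

```python
from fractions import Fraction

def large_cs_variable(a, l):
    """z = prod_j a_j^{l_j} for one Mori cone generator l, as in (3.4)."""
    z = Fraction(1)
    for a_j, l_j in zip(a, l):
        z *= Fraction(a_j) ** l_j
    return z

l = (1, -2, 1)
a = [2, 3, 5]
print(large_cs_variable(a, l))          # 2 * 5 / 3**2

# Rescale a_j -> a_j * t^{<m, v_j>} with v_j = (j, 1), m = (1, 2), t = 7.
vectors = [(0, 1), (1, 1), (2, 1)]
m, t = (1, 2), 7
rescaled = [a_j * t ** (m[0] * v[0] + m[1] * v[1]) for a_j, v in zip(a, vectors)]
print(large_cs_variable(rescaled, l))   # same value as before
```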
If X is the resolution of an orbifold singularity of the type Cd /Zn there is another
distinguished coordinate patch in the moduli space containing the orbifold locus where all
ai except the ones corresponding to the coordinates of the Cd are set to zero. At this point
the conformal field theory is expected to acquire a quantum symmetry. We find that the
moduli space in this case always has a singularity that looks locally like CdimM /Zn .
The GKZ differential operators are calculated by using the following recipe: for every
linear relation $\sum_j l_j v_j = 0$, where $l$ corresponds to any curve in the Mori cone (see [35]),
we define a differential operator in terms of the $a_i$,
$$ D = \prod_{j:\, l_j > 0} \partial_{a_j}^{\,l_j} - \prod_{j:\, l_j < 0} \partial_{a_j}^{\,-l_j}. \tag{3.5} $$
Assume that we work in a specific coordinate patch given by some $\phi_i = \prod_j a_j^{\mu_{ij}}$. In order
to transform (3.5) to a system involving the $\phi_i$ we can rewrite it in terms of operators
$\Theta_{a_j} := a_j \partial_{a_j}$, commute all $a_j$ to the left using $\Theta_{a_j} a_j^{-1} = a_j^{-1}(\Theta_{a_j} - 1)$ and then express
the $\Theta_{a_j}$ as $\sum_i \mu_{ij} \Theta_{\phi_i}$ with $\Theta_{\phi_i} := \phi_i \partial_{\phi_i}$.
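The operator (3.5) is easy to apply mechanically. The sketch below represents a polynomial in the $a_j$ as a dictionary from exponent tuples to coefficients and applies $D$ for a given charge vector $l$. The representation and the names `diff` and `gkz_apply` are hypothetical, and polynomials are used only as toy test functions: the actual GKZ solutions discussed in the text are not polynomial.

```python
from fractions import Fraction

def diff(poly, j, order=1):
    """Differentiate `order` times with respect to a_j."""
    out = poly
    for _ in range(order):
        nxt = {}
        for exps, c in out.items():
            if exps[j] == 0:
                continue  # the derivative kills this monomial
            e = list(exps)
            coeff = c * e[j]
            e[j] -= 1
            nxt[tuple(e)] = nxt.get(tuple(e), 0) + coeff
        out = nxt
    return out

def gkz_apply(l, poly):
    """Apply D = prod_{l_j>0} d_j^{l_j} - prod_{l_j<0} d_j^{-l_j} to poly."""
    pos, neg = poly, poly
    for j, lj in enumerate(l):
        if lj > 0:
            pos = diff(pos, j, lj)
        elif lj < 0:
            neg = diff(neg, j, -lj)
    result = dict(pos)
    for e, c in neg.items():
        result[e] = result.get(e, 0) - c
    return {e: c for e, c in result.items() if c != 0}

# l = (1, -2, 1) gives D = d_{a0} d_{a2} - d_{a1}^2.
f = {(1, 0, 1): Fraction(1), (0, 2, 0): Fraction(1, 2)}   # a0 a2 + a1^2 / 2
print(gkz_apply((1, -2, 1), f))                           # empty dict: D f = 0
print(gkz_apply((1, -2, 1), {(1, 0, 1): Fraction(1)}))    # D (a0 a2) = 1
```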
We stress that the solutions of the GKZ system are not the periods on X but rather the
logarithmic integrals of the periods. While the periods are finite and nonvanishing on the
moduli space wherever X is nondegenerate, the GKZ solutions have extra singularities
at the zero loci of moduli space coordinates coming from the logarithmic integration.
1 We hope that no confusion arises from the fact that we use the same symbol $z_i$ for the coordinates of X and
the large complex structure variables.
X. de la Ossa et al. / Nuclear Physics B 644 (2002) 170–200 179
The GKZ solutions are multivalued and undergo monodromy transformations around
codimension one loci where they are not holomorphic. We will be interested mainly in
monodromies around the large complex structure divisors zi = 0 and around the principal
component of the discriminant locus. In addition, there is the possibility of a nontrivial
transformation (‘orbifold monodromy’, which, strictly speaking, is not a monodromy) if
the moduli space looks locally like CdimM /Zn .
We will now show how these concepts can be applied to our examples.
Example 1. The mirror geometry of C2 /Zn .
Here V is P1 and the polynomial is given by
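$$\partial_{a_0}\partial_{a_2} - \partial_{a_1}^2,\quad \partial_{a_1}\partial_{a_3} - \partial_{a_2}^2,\quad \ldots,\quad \partial_{a_{n-2}}\partial_{a_n} - \partial_{a_{n-1}}^2 \tag{3.7}$$
become
$$\Theta_{a_0}\Theta_{a_2} - z_1(\Theta_{a_1} - 1)\Theta_{a_1},\quad \ldots,\quad \Theta_{a_{n-2}}\Theta_{a_n} - z_{n-1}(\Theta_{a_{n-1}} - 1)\Theta_{a_{n-1}} \tag{3.8}$$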
with
Example 2.
The local mirror geometry $\tilde X$ of the resolution X of $\mathbb{C}^n/\mathbb{Z}_n$ is just the mirror geometry
of a compact Calabi–Yau manifold realised as a degree n hypersurface in $\mathbb{P}^{n-1}$, i.e., $\tilde X$ is a
degree n hypersurface in $\mathbb{P}^{n-1}/(\mathbb{Z}_n)^{n-2}$. The GKZ operator $\partial_{a_1} \cdots \partial_{a_n} - \partial_{a_{n+1}}^n$ becomes
where we have chosen the subscripts of the ai to correspond to those of the vi in Fig. 1.
The local mirror of C3 /Z5 is given by the vanishing locus of (3.15) in P2 /Z5 . The action
of Z5 on P2 has fixed points whenever two of the three xi vanish. The vanishing locus of
(3.15) passes through one of these fixed points if and only if one of a1 , a2 , a3 vanishes.
Thus the generic hypersurface misses the fixed points. A quintic polynomial in P2 defines,
by a standard calculation, a Riemann surface of Euler number χ = −10. As the Z5 acts
without fixed points on this surface, the Euler number is divided by 5, showing that the
local mirror geometry is that of a Riemann surface R with 2 − 2g = χ = −2, i.e., genus
g = 2.
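The Euler number count in this paragraph can be reproduced in a few lines. This is only a sketch: it assumes the degree-genus formula $g = (d-1)(d-2)/2$ for a smooth plane curve and the division of the Euler number by the group order for a free action; the function names are ours.

```python
def plane_curve_euler_number(degree):
    """Euler number chi = 2 - 2g of a smooth plane curve, with g = (d-1)(d-2)/2."""
    genus = (degree - 1) * (degree - 2) // 2
    return 2 - 2 * genus

def free_quotient_genus(chi, order):
    """Genus of the quotient of a Riemann surface by a freely acting group."""
    if chi % order != 0:
        raise ValueError("Euler number must be divisible by the group order")
    chi_quotient = chi // order
    return (2 - chi_quotient) // 2

chi_quintic = plane_curve_euler_number(5)
print(chi_quintic)                          # -10
print(free_quotient_genus(chi_quintic, 5))  # 2
```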
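Scalings $x_i \to \lambda_i x_i$ imply the equivalences
$$(a_1, a_2, a_3, a_4, a_5) \sim \left(\lambda_1^5 a_1,\ \lambda_2^5 a_2,\ \lambda_3^5 a_3,\ \lambda_1^2 \lambda_2 \lambda_3^2 a_4,\ \lambda_1 \lambda_2^3 \lambda_3 a_5\right). \tag{3.16}$$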
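One can check that the large complex structure variables are invariant under the rescalings (3.16). The sketch below assumes the Mori cone generators $l^{(1)} = (1, 0, 1, -3, 1)$ and $l^{(2)} = (0, 1, 0, 1, -2)$, which we read off from the GKZ operators (A.1); the data layout and names are illustrative only.

```python
# Exponents of (lambda_1, lambda_2, lambda_3) in the rescaling of a_1, ..., a_5, from (3.16).
SCALING = [(5, 0, 0), (0, 5, 0), (0, 0, 5), (2, 1, 2), (1, 3, 1)]

# Assumed Mori cone generators, read off from the operators (A.1).
MORI = [(1, 0, 1, -3, 1), (0, 1, 0, 1, -2)]

def scaling_weight(l):
    """Total power of each lambda_k picked up by prod_j a_j**l_j under (3.16)."""
    return tuple(sum(l[j] * SCALING[j][k] for j in range(5)) for k in range(3))

for l in MORI:
    print(l, scaling_weight(l))  # each weight is (0, 0, 0)
```

Vanishing weights mean that $z_1 = a_1 a_3 a_5 / a_4^3$ and $z_2 = a_2 a_4 / a_5^2$ are well defined on the equivalence classes.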
We want to find out about the D-brane vacuum states in type II string theory on X. The
mathematical structure that captures the largest number of properties of brane states is,
to present knowledge, the bounded derived category $D^b$ of coherent sheaves on X [37,38]
(but see the remarks in [39,40]). While we will make several remarks concerning
$D^b$, we will work mainly with the somewhat coarser (but easier to handle) concepts of
K-theory. Let K(X) be the Grothendieck group of coherent sheaves on X. We expect
compact brane states on a noncompact space X to correspond to classes of the compactly
supported K-theory group $K^c(X)$. Using the duality between K(X) and $K^c(X)$ we can
determine a basis for $K^c(X)$ by first finding a basis for K(X).
Let us consider the situation where X is a smooth crepant resolution of a singularity
of the type Cd /G, where G is a finite subgroup of SU(d). Since X is smooth, K(X) is
generated by vector bundles (see, e.g., [41]). Moreover, if π : X → Cd /G is a crepant
resolution of an Abelian singularity, K(X) is in fact generated by n line bundles, where n
is the order of G (at least for $d \le 3$) [25]. Thus, for finding a basis for the group $K^c(X)$
related to fractional branes it is convenient to first determine a set $\{R_i\}$ ($0 \le i < n$) of line
bundles whose K-theory classes generate K(X). Clearly there is no choice for the Ri that
should be preferred a priori. Rather, there are two distinct constructions, each of which is
related to McKay correspondence:
(1) Mathematicians’ construction [22–25]: there is a vector bundle R (the ‘tautological
vector bundle’) transforming in the regular representation of G whose decomposition into
irreducibles gives the line bundles RiM . In particular, the RiM are generated by their sections
and the action of G on the sections determines a one-to-one correspondence between the
$R_i^M$ and the characters of the irreducible representations of G. In the case of a resolution
of $\mathbb{C}^2/G$ with some finite group G the first Chern classes $c_1(R_i^M)$, $i \ge 1$, form a basis
of $H^2(X, \mathbb{Z})$ dual to the basis of $H_2(X, \mathbb{Z})$ given by the homology classes of a basis of
effective curves Ci in the resolution. In the case of a singularity of the type C3 /G with G
an Abelian subgroup of SL(3, C) in general there exist several crepant resolutions and not
for every resolution it is possible to define line bundles as above. However, it was shown
in [24] that there exists a distinguished crepant resolution, named G-Hilb, on which it is
still possible to define the tautological line bundles (see also [23,27,28]).2 The advantage
of this approach is that it is rigorously proven for d = 2 and d = 3.
(2) Physicists’ constructions: the authors of [13] suggest considering, in the style of
[42], the world-volume theory of D0-branes, which is a theory of (n − 1) U(1) gauge
fields and d (n × n) matrices. It is conjectured (and shown in several examples) that the
vacua of such a theory in the different phases corresponding to different choices of Fayet–
Iliopoulos parameters all lead to moduli spaces that are nothing but the geometric phases of
the resolutions X of Cd /G. Now repeat this construction with an extra field transforming in
a specific one-dimensional representation ρi of G. It is conjectured that, independently of
the phase, this should lead to a space that is the total space of a line bundle RiP over X, and
that repeating this for all characters ρi should give a basis {RiP } of K(X). However, this
2 We thank A. Craw for emphasizing the importance of choosing the G-Hilb resolution to us.
construction is extremely tedious to work with. A different method for determining {RiP }
based on the boundary chiral ring associated to a certain two-dimensional gauge theory
has been proposed in [17]. The implications of this approach have been worked out for the
case of a single exceptional divisor that is a weighted projective space $W = \mathbb{P}^{d-1}_{n_1,\ldots,n_d}$ with
Fermat weights [17] or a Grassmannian$^3$ [19]. In all examples we are aware of, the $R_i^P$
have no sections. The advantage of this approach is that it appears to lead to dual classes
SiP whose interpretations in terms of D-branes are very well behaved.
Roughly, the resulting Ri can be summarized in the following way. There is a set of
divisor classes {[Fi ]} containing all Kähler cone generators [Ti ] and the trivial class [0]
such that all Fi are nef, i.e., have nonnegative intersection with any curve in the Mori
cone. If we denote by Ri± the line bundles O(±Fi ), then {RiM } = {Ri+ } and {RiP } = {Ri− }.
In two dimensions the [Fi ] are just the trivial class and the Kähler cone generators. In
higher dimensions we have to add extra divisor classes which are nonnegative integer
linear combinations of the [Ti ]. For the RiM with G-Hilb and d = 3 the authors of [23,
27] have given an explicit construction. In terms of the language used in this paper this can
be summarised in the following way.
Through the sections we can assign a character to any Ti . It is also possible to assign
characters to toric curves. Such a curve C corresponds up to a sign to some m ∈ M leading
to a linear equivalence as in (2.2). By collecting expressions with the same sign this can be
written as $D \sim D'$ where $D$, $D'$ are effective divisors corresponding to the same character.
We then assign this character to C and the corresponding line segment in the diagram,
and find that all the characters obtainable in this way also occur in the list of characters
corresponding to the Ti . Then every interior point I of the toric diagram is of one of the
following types:
(1) There are three pairs of line segments with the same character meeting in I . In this
case we add nothing to the list of [Fi ] (the classes assigned by [23,27] in this case are
already among the Kähler cone generators).
(2) There are two pairs of line segments with characters χm , χn meeting in I (and
possibly an extra line segment). Then add [Tm + Tn ] to the list of [Fi ].
(3) There are three line segments with the same character χm . In this case add [2Tm ] to
the list of [Fi ].
It turns out that this procedure always leads to a one-to-one correspondence between
the Ri+ and the character table of G through the action of G on the sections.
In many cases the $[F_i]$ are the same in the mathematicians’ and physicists’ constructions,
i.e., $R_i^P = (R_i^M)^*$. However, [17] seems to suggest partial resolutions in the case
with a single interior point where the exceptional divisor is a weighted projective space.
We note that the G-Hilb resolution may be incompatible with such a resolution or any
refinement of it, as the following example shows.
In Fig. 5 we have displayed the G-Hilb resolution of $\mathbb{C}^3/\mathbb{Z}_6$ constructed according to
the rules of [23,27] and the partial resolution by an exceptional divisor $\mathbb{P}^2_{(1,2,3)}$. Clearly the
former cannot be obtained as a refinement of the latter.
In the following we always follow the mathematicians’ approach.
3 P. Mayr informs us that this approach works in more general situations as well.
The next step in our construction of D-brane states is to find a basis for $K^c(X)$ that is
dual to the basis of K(X) defined in terms of line bundles $R_i$. According to [25], there is a
pairing (R, S) between representatives R of K(X) and S of $K^c(X)$ that can be evaluated
in terms of Chern characters
$$(R, S) = \int_X \mathrm{ch}(R) \cup \mathrm{ch}^c(S)\, \mathrm{Td}(X), \tag{4.1}$$
with $\mathrm{ch}^c(S)$ the localized Chern character$^4$ of the complex S and Td(X) the Todd class
of X. There is also a closely related pairing which will become important when we study
monodromies. It is defined as
$$\langle R, S \rangle = (R^*, S) \tag{4.2}$$
with $R^*$ the line bundle (or, more generally, the complex) dual to R. If we restrict R to
$K^c(X)$, these pairings become well defined under the exchange of R and S and we find
that (R, S) is always symmetric whereas $\langle R, S \rangle$ is symmetric in even dimensions and skew
in odd dimensions, as a consequence of the fact that Td(X) is even when $c_1(X)$ is trivial.
The generally accepted way of obtaining a basis for K c (X) is to choose classes dual
to those given by the line bundles Ri with respect to ( , ). Following this convention, we
define classes of K c (X) by demanding that their representatives Sj fulfill (Ri , Sj ) = δij .
Thus we obtain $S_j^+$ dual to $R_i^+$ and $S_j^-$ dual to $R_i^-$ with respect to ( , ) and note that the $S_j^+$
are dual to the $R_i^-$ and $S_j^-$ are dual to the $R_i^+$ with respect to $\langle\ ,\ \rangle$.
So far we have not been specific about the representatives Si of the compactly supported
K-theory. In the spirit of [2] we may interpret them as bound states of X-filling branes. In
mathematical terms this amounts to specifying a complex of vector bundles on X that
is exact outside a compact locus Y . It is not hard to check in every example that we
may indeed represent every $S_i$ as a formal linear combination of line bundles of the form
$\mathcal{O}_X(\sum a_i D_i)$ and that the $\mathrm{ch}^c(S_i)$ obtained from the line bundles $R_i$ form a basis for all
Chern characters with support on the compact toric cycles.
Alternatively, one may wish to consider ‘pure’ branes defined in terms of the structure
sheaves of the independent lower-dimensional compact holomorphic cycles. Given the
structure sheaves OCi where the Ci form a basis for all compact holomorphic cycles on
If we demand that Z lv (t; SCi ) measure the complexified Kähler class at the large Kähler
limit we have to make the identification
$$t_i - 1 = \frac{\ln z_i}{2\pi i} + O(z). \tag{4.8}$$
Note that this is different from the conventions usually adopted in the literature, but we
find that this is precisely the identification that works.
5 This formula occurs implicitly in [45] and explicitly in [13]; see also the remarks in [17].
Linearity implies that the central charge corresponding to any S is given in terms of the
charge vector by
$$Z(S) = \sum_i n(C_i)\, Z(S_{C_i}). \tag{4.9}$$
Finally, we return to the subject of monodromy. In [1] it was conjectured (and pushed
further in the work of [3,4]) that the monodromies around loci in the moduli space where
the mirror $\tilde X$ of a Calabi–Yau threefold X becomes singular induce autoequivalences of
$D^b(X)$, the bounded derived category of coherent sheaves on X. Moreover, in the case of a
Fano surface embedded in a Calabi–Yau threefold, a relationship of these autoequivalences
of D b (X) with mutations of exceptional collections supported on the Fano surface was
pointed out in [4]. For our purposes we will view the various monodromies mainly as
automorphisms of K c (X). However, in some examples we will identify the monodromy
actions on the exceptional collections of coherent sheaves supported on the compact
divisors. As in the case of the local mirror geometry, we will be interested in the following
three types of transformations:
— Monodromy around large Kähler structure divisors in the moduli space;
— Monodromy around the primary component of the discriminant locus;
— ‘Orbifold monodromy’ in the case Cd /G.
Only the monodromy around a divisor zi = 0 in the moduli space where the Kähler
parameter ti (associated with the divisor class [Ti ] in X) becomes infinite allows for
a classical analysis. In this case we just take ti → ti + 1 in (4.5). Because of the
multiplicativity of Chern characters, the fact that the Chern character of a line bundle is
the exponential of its first Chern class and the form of (4.5), this transforms the Sj by
tensoring them with OX (−Ti ). By (4.1), the Ri transform by tensoring with OX (Ti ).
According to the observations in [9,46], ‘orbifold monodromy’ should cyclically
permute the Si if X is a resolution of Cd /Zn .
For the primary component of the discriminant locus we have the following picture:
in the case of a compact Calabi–Yau variety X it is conjectured (see [1,3–5,20]) that a
sheaf F is subjected to a Fourier–Mukai transform whose kernel is the structure sheaf OX ,
implying that the Chern character of F transforms as
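$$\mathrm{ch}(F) \longrightarrow \mathrm{ch}(F) - \langle \mathcal{O}_X, F \rangle\, \mathrm{ch}(\mathcal{O}_X), \tag{4.10}$$
where $\langle\ ,\ \rangle$ is the pairing (4.2). In our case of noncompact X this cannot work because it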
would violate compact support conditions, but we make the following observation.
In all of our examples we obtain expressions for chc (S0− ) that allow us to choose S0−
in such a way that its restriction S0− |Ci to any compact toric cycle Ci is equal to OCi . For
the case of a resolution π of an orbifold singularity this means that our expressions for
chc (S0− ) are consistent with taking S0− to be the push-forward of the restriction of OX to
π −1 (0).
Wherever we have the possibility of comparison with the mirror geometry, we find that
the monodromy around the primary component of the discriminant locus is given by
$$\mathrm{ch}(F) \longrightarrow \mathrm{ch}(F) - \langle S_0^-, F \rangle\, \mathrm{ch}^c(S_0^-). \tag{4.11}$$
More precisely, the following happens: for one parameter models the principal component
is pointlike. If we decompose the GKZ solutions into logarithms and holomorphic pieces
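$$\mathrm{ch}^c(S_0^\pm) = p \mp \sum_{i=1}^{n-1} C_i, \qquad \mathrm{ch}^c(S_i^\pm) = \pm C_i \ \text{ for } i > 0,$$
$$\mathrm{ch}(S_{C_i}) = p + C_i, \qquad \mathrm{ch}(S_p) = p \tag{4.12}$$
and therefore
$$\mathrm{ch}^c(S_0^-) = \sum_{i=1}^{n-1} \mathrm{ch}(S_{C_i}) - (n-2)\, \mathrm{ch}(S_p).$$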
The restriction of $\mathcal{O}_X$ to the union of the $C_i$ is the same as $\sum S_{C_i}$ except for the n − 2
points of the form $C_i \cdot C_{i+1}$ where $\sum S_{C_i}$ has rank two. Upon subtracting the n − 2
sheaves with support on these points we arrive at a class that matches $\mathrm{ch}^c(S_0^-)$. It is
easily checked that $\langle S_i^-, S_i^- \rangle = 2$ for all i. The large volume central charges are given
by $Z^{lv}(t; S_0^-) = -1 + \sum_i t_i$ and $Z^{lv}(t; S_i^-) = -t_i$.
In the case of n = 2 this implies Z(S0− ) = 01 and Z(S1− ) = −1 − 01 and we see that the
principal component and orbifold monodromies found in the mirror geometry are precisely
the ones generated by (4.11) and permutations S0− ↔ S1− , respectively.
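The statement that every $S_i^-$ has self-pairing 2 can be checked with a short computation. The sketch assumes two things that are not spelled out in the text: that the exceptional curves form a chain with $C_i \cdot C_i = -2$ and $C_i \cdot C_{i+1} = 1$, and that for classes of rank zero in two dimensions the pairing (4.2) reduces to minus the intersection number of the curve components. All names are illustrative.

```python
def exceptional_intersections(n):
    """Intersection matrix of the n-1 exceptional curves: C_i.C_i = -2, C_i.C_{i+1} = 1."""
    m = n - 1
    return [[-2 if i == j else (1 if abs(i - j) == 1 else 0) for j in range(m)]
            for i in range(m)]

def curve_parts(n):
    """Curve components of ch^c(S_i^-) from (4.12): S_0 -> sum_i C_i, S_i -> -C_i."""
    m = n - 1
    parts = [[1] * m]
    for i in range(m):
        parts.append([-1 if j == i else 0 for j in range(m)])
    return parts

def pairing_matrix(n):
    """<S_a, S_b> = -(c1(S_a) . c1(S_b)), the assumed two-dimensional form of (4.2)."""
    inter = exceptional_intersections(n)
    parts = curve_parts(n)
    m = n - 1
    return [[-sum(u[i] * inter[i][j] * v[j] for i in range(m) for j in range(m))
             for v in parts] for u in parts]

print(pairing_matrix(3))  # [[2, -1, -1], [-1, 2, -1], [-1, -1, 2]]
```

Under these assumptions the matrix is the Cartan matrix of the extended $A_{n-1}$ diagram, as one would expect from the McKay correspondence.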
Example 2.
For $\mathbb{C}^n/\mathbb{Z}_n$ with $\mathbb{Z}_n : (z_1, \ldots, z_n) \to (e^{2\pi i/n} z_1, \ldots, e^{2\pi i/n} z_n)$ the restrictions of the
$R_i^M$ to the exceptional divisor $D \cong \mathbb{P}^{n-1}$ are nothing but $\mathcal{O}, \mathcal{O}(1), \ldots, \mathcal{O}(n-1)$. The
independent holomorphic cycles are of the form $\mathbb{P}^j$ with $0 \le j \le n-1$ and the $R_i^\pm$ restrict
to $\mathcal{O}_{\mathbb{P}^j}(\pm i)$. This example has been previously considered in [17,47,48]. We include it as
further evidence that the $S_i$ have the properties stated above. Defining
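$$\chi_{kj} := \chi\big(\mathcal{O}_{\mathbb{P}^j}(k), \mathbb{P}^j\big) = \int_{\mathbb{P}^j} \mathrm{ch}\big(\mathcal{O}_{\mathbb{P}^j}(k)\big)\, \mathrm{Td}(\mathbb{P}^j) = \int_{\mathbb{P}^j} e^{kH} \left(\frac{H}{1 - e^{-H}}\right)^{j+1} \tag{4.13}$$
with H the hyperplane divisor, we find that
$$\chi_{kj} - \chi_{k-1,j} = \int_{\mathbb{P}^j} e^{kH} \left(1 - e^{-H}\right) \left(\frac{H}{1 - e^{-H}}\right)^{j+1} = \int_{\mathbb{P}^j} e^{kH} H \left(\frac{H}{1 - e^{-H}}\right)^{j} = \int_{\mathbb{P}^{j-1}} e^{kH} \left(\frac{H}{1 - e^{-H}}\right)^{j} = \chi_{k,j-1}. \tag{4.14}$$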
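With $\chi_{k0} = \chi_{0j} = 1$ this simple recursion is solved by $\chi_{kj} = \binom{k+j}{j}$ and we obtain
$(R_i^\pm, S_{\mathbb{P}^j}) = \binom{j \pm i}{j}$, implying $(R_0^-, S_{\mathbb{P}^j}) = 1$ for any j, $(R_i^-, S_{\mathbb{P}^j}) = 0$ for $1 \le i \le j$ and
$(R_i^-, S_{\mathbb{P}^j}) = (-1)^j \binom{i-1}{j}$ for $i > j$. This leads to the following expressions for the $S^-$:
$$S_0^- = S_{\mathbb{P}^{n-1}}, \qquad (-1)^k S_k^- = \binom{n-1}{k} S_{\mathbb{P}^{n-1}} - \sum_{j=k-1}^{n-2} \binom{j}{k-1} S_{\mathbb{P}^j} \quad \text{for } k \ge 1. \tag{4.15}$$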
Again the restriction of S0− to any compact toric cycle is the same as the structure sheaf of
that cycle.
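The duality $(R_i^-, S_k^-) = \delta_{ik}$ implied by (4.15) can be verified numerically for small n. In this sketch the binomial coefficient $\binom{k+j}{j}$ is evaluated as a polynomial in k so that negative k is allowed; the function names and the range of n tested are our own choices.

```python
from math import comb, factorial

def chi(k, j):
    """chi_{kj} = binom(k + j, j), evaluated as a polynomial in k (k may be negative)."""
    numerator = 1
    for m in range(1, j + 1):
        numerator *= k + m
    return numerator // factorial(j)  # exact: j consecutive integers are divisible by j!

def pairing(i, j):
    """(R_i^-, S_{P^j}) = chi_{-i, j} = binom(j - i, j)."""
    return chi(-i, j)

def s_minus(k, n):
    """Coefficients of S_k^- in the basis S_{P^0}, ..., S_{P^{n-1}}, following (4.15)."""
    vec = [0] * n
    if k == 0:
        vec[n - 1] = 1
        return vec
    vec[n - 1] = comb(n - 1, k)
    for j in range(k - 1, n - 1):
        vec[j] -= comb(j, k - 1)
    sign = (-1) ** k
    return [sign * c for c in vec]

def is_dual_basis(n):
    """Check (R_i^-, S_k^-) = delta_{ik} for all 0 <= i, k < n."""
    return all(
        sum(pairing(i, j) * s_minus(k, n)[j] for j in range(n)) == (1 if i == k else 0)
        for i in range(n) for k in range(n)
    )

print(all(is_dual_basis(n) for n in range(2, 8)))  # True
```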
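Alternatively we may determine the $S_i^-$ by the ansatz $S_i^- = \sum_k a_{ik}\, i_* \mathcal{O}_{\mathbb{P}^{n-1}}(k)$. With
$$\big(R_i^-, i_* \mathcal{O}_{\mathbb{P}^{n-1}}(k)\big) = \chi_{k-i,n-1} = \binom{n-1+k-i}{n-1} \tag{4.16}$$
we get $(a^{-1})_{ki} = \chi_{k-i,n-1}$ which leads to
$$S_i^- = \sum_{k=0}^{n} a_{ik}\, i_* \mathcal{O}_{\mathbb{P}^{n-1}}(k) = \sum_{k=0}^{i} (-1)^{i-k} \binom{n}{i-k}\, i_* \mathcal{O}_{\mathbb{P}^{n-1}}(k), \tag{4.17}$$
$$\begin{aligned}
\langle S_i^-, S_i^- \rangle &= \sum_{k,l} a_{ik} a_{il} \int_X \mathrm{ch}\big(i_* \mathcal{O}_{\mathbb{P}^{n-1}}(k)\big)^* \, \mathrm{ch}\big(i_* \mathcal{O}_{\mathbb{P}^{n-1}}(l)\big)\, \mathrm{Td}(X) \\
&= \sum_{k,l} a_{ik} a_{il} \int_{\mathbb{P}^{n-1}} \left(1 - e^{-nH}\right) e^{-kH} e^{lH}\, \mathrm{Td}(\mathbb{P}^{n-1}) \\
&= \sum_{k,l} a_{ik} a_{il} \left(\chi_{l-k,n-1} - \chi_{l-k-n,n-1}\right).
\end{aligned} \tag{4.18}$$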
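Formulas (4.16) to (4.18) lend themselves to a direct check. The sketch below takes $a_{ik} = (-1)^{i-k}\binom{n}{i-k}$ for $0 \le k \le i$ (and zero otherwise) from (4.17), confirms that it inverts the matrix $\chi_{k-i,n-1}$, and evaluates the last line of (4.18). The helper names are ours.

```python
from math import comb, factorial

def chi(k, j):
    """chi_{kj} = binom(k + j, j), evaluated as a polynomial in k (k may be negative)."""
    numerator = 1
    for m in range(1, j + 1):
        numerator *= k + m
    return numerator // factorial(j)

def a(i, k, n):
    """Coefficient a_{ik} from (4.17); zero unless 0 <= k <= i."""
    return (-1) ** (i - k) * comb(n, i - k) if 0 <= k <= i else 0

def is_inverse(n):
    """Check sum_k a_{ik} chi_{k-j,n-1} = delta_{ij}, i.e. (R_j^-, S_i^-) = delta_{ij}."""
    return all(
        sum(a(i, k, n) * chi(k - j, n - 1) for k in range(n)) == (1 if i == j else 0)
        for i in range(n) for j in range(n)
    )

def self_pairing(i, n):
    """Last line of (4.18): sum_{k,l} a_ik a_il (chi_{l-k,n-1} - chi_{l-k-n,n-1})."""
    return sum(
        a(i, k, n) * a(i, l, n) * (chi(l - k, n - 1) - chi(l - k - n, n - 1))
        for k in range(n) for l in range(n)
    )

print(all(is_inverse(n) for n in range(2, 7)))   # True
print([self_pairing(i, 2) for i in range(2)])    # [2, 2]
print([self_pairing(i, 3) for i in range(3)])    # [0, 0, 0]
```

The values for n = 2 and n = 3 agree with the self-pairing 2 quoted for $\mathbb{C}^2/\mathbb{Z}_n$ and with the skew symmetry of the pairing in odd dimensions.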
F0 = 0, F1 = D1 , F2 = 2D1 , F3 = D2 , F4 = D1 + D2 , (4.21)
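for the bases of K(X), where we have chosen the labels such that sections of $R_i^+$ transform
as $\epsilon^i$ under $(z_1, z_2, z_3) \to (\epsilon z_1, \epsilon^3 z_2, \epsilon z_3)$. Using (4.1) we find that the localized Chern
characters of the basis of $K^c(X)$ are given by
$$\begin{aligned}
\mathrm{ch}^c(S_0^\pm) &= D_4 + D_5 \mp \left(\tfrac{3}{2} h + \tfrac{5}{2} f\right) + \tfrac{11}{6}\, p, \\
\mathrm{ch}^c(S_1^\pm) &= -2D_4 - D_5 \pm \left(2h + \tfrac{3}{2} f\right) - \tfrac{4}{3}\, p, \\
\mathrm{ch}^c(S_2^\pm) &= D_4 \mp \tfrac{1}{2} h + \tfrac{1}{2}\, p, \qquad \mathrm{ch}^c(S_3^\pm) = -D_5 \pm \tfrac{5}{2} f - \tfrac{1}{3}\, p, \\
\mathrm{ch}^c(S_4^\pm) &= D_5 \mp \tfrac{3}{2} f + \tfrac{1}{3}\, p,
\end{aligned} \tag{4.22}$$
with p the class of a point.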
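Let us now consider the branes defined in terms of the structure sheaves $\mathcal{O}_p$, $\mathcal{O}_h$, $\mathcal{O}_f$,
$\mathcal{O}_{\mathbb{P}^2}$, $\mathcal{O}_{\mathbb{F}_3}$ of the independent lower-dimensional cycles. Denoting by $S_p$ the result of three
successive inclusion maps acting on $\mathcal{O}_p$, etc., we arrive with the help of the Grothendieck–
Riemann–Roch theorem (4.3) at the following result:
$$\begin{aligned}
\mathrm{ch}(S_{D_4}) &= D_4 + \tfrac{3}{2} h + \tfrac{3}{2}\, p, \qquad \mathrm{ch}(S_{D_5}) = D_5 + h + \tfrac{5}{2} f + \tfrac{4}{3}\, p, \\
\mathrm{ch}(S_p) &= p, \qquad \mathrm{ch}(S_h) = h + p, \qquad \mathrm{ch}(S_f) = f + p.
\end{aligned} \tag{4.23}$$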
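The change of basis from Chern characters to brane charges is a triangular linear problem and can be sketched with exact fractions. The coefficients below are our reading of (4.22) (lower signs) and (4.23), namely $\mathrm{ch}(S_{D_4}) = D_4 + \tfrac32 h + \tfrac32 p$ and $\mathrm{ch}(S_{D_5}) = D_5 + h + \tfrac52 f + \tfrac43 p$; they should be treated as assumptions of this example, and the names are hypothetical.

```python
from fractions import Fraction as F

# Components in the order (D4, D5, h, f, p); lower signs of (4.22) as read here.
S_MINUS = [
    (F(1), F(1), F(3, 2), F(5, 2), F(11, 6)),
    (F(-2), F(-1), F(-2), F(-3, 2), F(-4, 3)),
    (F(1), F(0), F(1, 2), F(0), F(1, 2)),
    (F(0), F(-1), F(0), F(-5, 2), F(-1, 3)),
    (F(0), F(1), F(0), F(3, 2), F(1, 3)),
]

def charges(ch):
    """Solve ch = n_D4 ch(S_D4) + n_D5 ch(S_D5) + n_h ch(S_h) + n_f ch(S_f) + n_p ch(S_p)
    using (4.23); returns (n_D4, n_D5, n_p, n_h, n_f)."""
    d4, d5, h, f, p = ch
    n_d4, n_d5 = d4, d5
    n_h = h - F(3, 2) * n_d4 - n_d5
    n_f = f - F(5, 2) * n_d5
    n_p = p - F(3, 2) * n_d4 - F(4, 3) * n_d5 - n_h - n_f
    return (n_d4, n_d5, n_p, n_h, n_f)

total = tuple(sum(s[k] for s in S_MINUS) for k in range(5))
print(total == (0, 0, 0, 0, 1))   # True: the classes add up to a point
for s in S_MINUS:
    print(charges(s))
```

With these inputs all charges come out integral, and $S_0^-$ has charges corresponding to $S_{D_4} + S_{D_5} - S_h$, consistent with its restriction to each compact cycle being the structure sheaf.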
This allows us to determine the D-brane charges ni = (nD4 , nD5 , np , nh , nf ) with np the
D0-brane charge, nh , nf D2-brane charges and nD4 , nD5 D4-brane charges of the Si− as
5. Beyond Cd /Zn
Up to now we have only considered cases of the type Cd /Zn with a single triangulation.
We now want to examine the range of validity of our statements regarding the Si− and
monodromy. We first present another example, the resolution of C3 /(Z2 × Z2 ), which
is still an orbifold but has several interesting features: it is not of the simple Zn type, it
allows for more than one triangulation, its resolution involves three new noncompact toric
divisors but no compact toric divisor, and finally it is a three parameter model whose GKZ
system can be solved explicitly. We will be able to show explicitly that the Si− vanish at
(branches of) the principal component of the discriminant locus and nowhere else. Aspects
of D-brane states on this model have been studied previously in, e.g., [50,51]. Finally we
examine the possibility of extending our results to cases not of the McKay type. We find
that they still hold in many examples but not in general.
Example 4. A toric resolution of C3 /(Z2 × Z2 ).
A singular space of the type C3 /(Z2 × Z2 ) where every nontrivial element of Z2 × Z2
acts by flipping the sign of two of the three coordinates of C3 can be resolved by
introducing three additional noncompact divisors and three compact curves. There are
several distinct possibilities for choosing the curves.
We use the G-Hilb resolution depicted in Fig. 6. The Mori cone is generated by the
following vectors:
4z1 z2 z3 − z1 − z2 − z3 + 1 = 0. (5.5)
The simplest formulation of the GKZ system can be obtained by mixing large complex
structure and orbifold coordinates. We find the operator
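$$D_1 = (\Theta_{\phi_1} + \Theta_{\phi_3})(\Theta_{\phi_1} + \Theta_{\phi_2}) - 4 z_1 z_2\, \Theta_{\phi_1}(\Theta_{\phi_1} - 1), \tag{5.8}$$
which upon using (5.4) and (5.7) implies
$$\left[\left(1 - 4\phi_1^{-2}\right) \Theta_{\phi_1} + 4\phi_1^{-2}\right] \Theta_{\phi_1} \Pi = 0. \tag{5.9}$$
The whole GKZ system has three solutions of the type $\ln\big((\phi_i + \sqrt{\phi_i^2 - 4})/2\big)$ and, as
always, a constant solution. Upon returning to large complex structure variables, we obtain
$$\Pi_0 = 1, \qquad \Pi_1 = \ln \frac{1 + \sqrt{1 - 4 z_2 z_3}}{2} - \frac{1}{2}(\ln z_2 + \ln z_3) \tag{5.10}$$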
and the corresponding index-permuted expressions for Π2 and Π3 .
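A quick numerical check supports the form of the solutions. The sketch reads (5.9) as $[(1 - 4\phi_1^{-2})\Theta_{\phi_1} + 4\phi_1^{-2}]\Theta_{\phi_1}\Pi = 0$ and assumes the identification $\phi_1^{-2} = z_2 z_3$, which is inferred by comparing (5.10) with the logarithmic solution rather than taken from the text. It uses central differences in $t = \ln\phi$, so the residual vanishes only up to discretisation error.

```python
import math

def period(phi):
    """Pi(phi) = ln((phi + sqrt(phi^2 - 4)) / 2) on the real branch phi > 2."""
    return math.log((phi + math.sqrt(phi * phi - 4.0)) / 2.0)

def gkz_residual(phi, h=1e-4):
    """[(1 - 4/phi^2) Theta + 4/phi^2] Theta Pi with Theta = phi d/dphi = d/dt, phi = e^t."""
    t = math.log(phi)
    f = lambda s: period(math.exp(s))
    theta1 = (f(t + h) - f(t - h)) / (2 * h)           # Theta Pi
    theta2 = (f(t + h) - 2 * f(t) + f(t - h)) / h**2   # Theta^2 Pi
    c = 4.0 / (phi * phi)
    return (1 - c) * theta2 + c * theta1

def period_lcs(z2, z3):
    """Pi_1 of (5.10) in large complex structure variables (needs 0 < 4 z2 z3 < 1)."""
    return math.log((1 + math.sqrt(1 - 4 * z2 * z3)) / 2) - 0.5 * (math.log(z2) + math.log(z3))

print(abs(gkz_residual(3.0)) < 1e-5)
print(abs(period_lcs(0.1, 0.5) - period(1 / math.sqrt(0.05))) < 1e-12)
```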
The divisors Fi determining the line bundles Ri are just D1 , D2 and D3 and we find
$$\mathrm{ch}^c(S_0^-) = p + C_1 + C_2 + C_3, \qquad \mathrm{ch}^c(S_1^-) = -C_1, \qquad \mathrm{ch}^c(S_2^-) = -C_2, \qquad \mathrm{ch}^c(S_3^-) = -C_3, \tag{5.11}$$
where C1 is the compact curve at the intersection of D2 and D3 , etc. In terms of structure
sheaves we can represent S0− as SC1 + SC2 + SC3 − 2Sp . Noticing that all three curves
intersect in the same point, we find that we can again view S0− as the object whose
restriction to any compact toric cycle is the structure sheaf of that cycle.
The central charges are determined by $Z^{lv}(t; S_0^-) = -1 + t_1 + t_2 + t_3$ and $Z^{lv}(t; S_i^-) = -t_i$
for $i \in \{1, 2, 3\}$, leading to
$$Z(S_0^-) = 2 - \frac{\Pi_1 + \Pi_2 + \Pi_3}{2\pi i}, \qquad Z(S_1^-) = -1 + \frac{-\Pi_1 + \Pi_2 + \Pi_3}{2\pi i}, \quad \text{etc.} \tag{5.12}$$
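The central charges (5.12) are simple enough to evaluate exactly. In the sketch below the arguments are the ratios $\Pi_i/(2\pi i)$; the expressions for $S_2^-$ and $S_3^-$ are our reading of the "etc." as index permutations of the formula for $S_1^-$. Evaluating at the common value $\Pi_i = 3\pi i/2$, i.e. at ratios equal to 3/4, gives the same central charge for all four states.

```python
from fractions import Fraction

def central_charges(p1, p2, p3):
    """Z(S_0^-), ..., Z(S_3^-) from (5.12); p_i stands for Pi_i / (2 pi i)."""
    z0 = 2 - (p1 + p2 + p3)
    z1 = -1 + (-p1 + p2 + p3)
    z2 = -1 + (p1 - p2 + p3)   # assumed index permutation of the S_1 formula
    z3 = -1 + (p1 + p2 - p3)   # assumed index permutation of the S_1 formula
    return [z0, z1, z2, z3]

q = Fraction(3, 4)
print(central_charges(q, q, q))  # four entries equal to Fraction(-1, 4)
```

The four central charges always add up to −1, independently of the periods, in accordance with the classes (5.11) adding up to the class of a point.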
At the orbifold point φ1 = φ2 = φ3 = 0 we have the following situation: the moduli
space develops a Z2 × Z2 singularity. Provided we make the right choice of sheets for the
square roots and logarithms, we find Π1 = Π2 = Π3 = 3πi/2 and thus Z(Si− ) = −1/4 for
over $\mathbb{F}_0 \cong \mathbb{P}^1 \times \mathbb{P}^1$ and $\mathbb{F}_1$, respectively) are two parameter models that we treated with
analyses similar to the ones used for Example 3 ($\mathbb{C}^3/\mathbb{Z}_5$).
As a counterexample, consider the Calabi–Yau manifold depicted in Fig. 8, whose GKZ
system is again solvable. Here we find the following: if we choose as the line bundles Ri the
ones determined by the generators of the Kähler cone, we still find that the corresponding
S0− has the same restriction to compact toric cycles as OX . However, only two of the three
generators of K c (X) become massless at the conifold locus (these statements are true for
any triangulation). In particular, for the symmetric triangulation the vanishing locus of the
central charge of S0− does not coincide with the conifold locus.
Acknowledgements
We would like to thank Philip Candelas and Duiliu Diaconescu for very useful
conversations.
The GKZ operators$^6$ corresponding to the Mori cone generators (2.17) are given by
$$D^{(1)} = \partial_{a_1} \partial_{a_3} \partial_{a_5} - \partial_{a_4}^3, \qquad D^{(2)} = \partial_{a_2} \partial_{a_4} - \partial_{a_5}^2. \tag{A.1}$$
This can be turned into a system involving $z_1$, $z_2$ by standard manipulations described
above. In this way we arrive at the following expressions in terms of $\Theta_{z_i} := z_i \frac{\partial}{\partial z_i}$:
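$$\left. \frac{\partial}{\partial\rho} \frac{\Gamma(1+\rho)}{\Gamma(1+n+\rho)} \right|_{\rho=0} = \begin{cases} 0, & n = 0, \\ -S_n/n!, & n > 0, \\ (-1)^{-n-1} (-n-1)!, & n < 0, \end{cases} \tag{A.5}$$
$$\left. \frac{\partial^2}{\partial\rho^2} \frac{\Gamma(1+\rho)}{\Gamma(1+n+\rho)} \right|_{\rho=0} = \begin{cases} 0, & n = 0, \\ \text{finite}, & n > 0, \\ 2(-1)^{-n} S_{-n-1} (-n-1)!, & n < 0, \end{cases} \tag{A.6}$$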
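where $S_n = 1 + 1/2 + \cdots + 1/n$. This yields the constant solution $\Pi(z_1, z_2; 0, 0) = 1$ and,
with $\Pi_{i_1 \cdots i_k}$ for $(\partial^k \Pi/\partial\rho_{i_1} \cdots \partial\rho_{i_k})|_{\rho=0}$,
$$\begin{aligned}
\Pi_1 &= \ln z_1 + \sum z_1^{n_1} z_2^{n_2} A_{n_1 n_2} - 3 \sum z_1^{n_1} z_2^{n_2} B_{n_1 n_2}, \\
\Pi_2 &= \ln z_2 - 2 \sum z_1^{n_1} z_2^{n_2} A_{n_1 n_2} + \sum z_1^{n_1} z_2^{n_2} B_{n_1 n_2}, \\
\Pi_{11} &= (\ln z_1)^2 + 2 \ln z_1 \Big( \sum z_1^{n_1} z_2^{n_2} A_{n_1 n_2} - 3 \sum z_1^{n_1} z_2^{n_2} B_{n_1 n_2} \Big) - 6 \sum z_1^{n_1} z_2^{n_2} C_{n_1 n_2} \\
&\quad + \sum z_1^{n_1} z_2^{n_2} A_{n_1 n_2} \left( -2 S_{2n_2 - n_1 - 1} + 6 S_{n_2 - 3n_1} - 4 S_{n_1} \right) \\
&\quad + \sum z_1^{n_1} z_2^{n_2} B_{n_1 n_2} \left( -18 S_{3n_1 - n_2 - 1} + 6 S_{n_1 - 2n_2} + 12 S_{n_1} \right), \\
\Pi_{12} &= \ln z_1 \ln z_2 + \ln z_1 \Big( -2 \sum z_1^{n_1} z_2^{n_2} A_{n_1 n_2} + \sum z_1^{n_1} z_2^{n_2} B_{n_1 n_2} \Big) \\
&\quad + \ln z_2 \Big( \sum z_1^{n_1} z_2^{n_2} A_{n_1 n_2} - 3 \sum z_1^{n_1} z_2^{n_2} B_{n_1 n_2} \Big) + 7 \sum z_1^{n_1} z_2^{n_2} C_{n_1 n_2} \\
&\quad + \sum z_1^{n_1} z_2^{n_2} A_{n_1 n_2} \left( 4 S_{2n_2 - n_1 - 1} - 7 S_{n_2 - 3n_1} + 4 S_{n_1} - S_{n_2} \right) \\
&\quad + \sum z_1^{n_1} z_2^{n_2} B_{n_1 n_2} \left( 6 S_{3n_1 - n_2 - 1} - 7 S_{n_1 - 2n_2} - 2 S_{n_1} + 3 S_{n_2} \right), \\
\Pi_{22} &= (\ln z_2)^2 + 2 \ln z_2 \Big( -2 \sum z_1^{n_1} z_2^{n_2} A_{n_1 n_2} + \sum z_1^{n_1} z_2^{n_2} B_{n_1 n_2} \Big) - 4 \sum z_1^{n_1} z_2^{n_2} C_{n_1 n_2} \\
&\quad + \sum z_1^{n_1} z_2^{n_2} A_{n_1 n_2} \left( -8 S_{2n_2 - n_1 - 1} + 4 S_{n_2 - 3n_1} + 4 S_{n_2} \right) \\
&\quad + \sum z_1^{n_1} z_2^{n_2} B_{n_1 n_2} \left( -2 S_{3n_1 - n_2 - 1} + 4 S_{n_1 - 2n_2} - 2 S_{n_2} \right),
\end{aligned}$$
where
$$\begin{aligned}
A_{n_1 n_2} &= (-1)^{2n_2 - n_1 - 1}\, \frac{(2n_2 - n_1 - 1)!}{(n_2 - 3n_1)!\, (n_1!)^2\, n_2!}, \\
B_{n_1 n_2} &= (-1)^{3n_1 - n_2 - 1}\, \frac{(3n_1 - n_2 - 1)!}{(n_1 - 2n_2)!\, (n_1!)^2\, n_2!}, \\
C_{n_1 n_2} &= (-1)^{2n_2 - n_1 - 1 + 3n_1 - n_2 - 1}\, \frac{(2n_2 - n_1 - 1)!\, (3n_1 - n_2 - 1)!}{(n_1!)^2\, n_2!}
\end{aligned}$$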
and the summations are taken over those values of n1 , n2 where the arguments of all
factorials are nonnegative. Of the three expressions obtained by taking second derivatives
only the first one and the linear combination 3Π22 + 2Π12 of the other two actually solve
the GKZ system (A.2). We note that there is also a linear combination of third derivatives
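The derivative formulas (A.5) and (A.6) that enter these expansions can be checked independently. The sketch below uses the fact that for integer n the ratio $\Gamma(1+\rho)/\Gamma(1+n+\rho)$ is a rational function of $\rho$, namely $1/\prod_{k=1}^{n}(\rho+k)$ for $n \ge 0$ and $\prod_{k=0}^{-n-1}(\rho-k)$ for $n < 0$, and compares central differences in exact rational arithmetic with the closed forms. Function names, step size and tolerance are our choices.

```python
from fractions import Fraction
from math import factorial

def ratio(n, rho):
    """Gamma(1 + rho) / Gamma(1 + n + rho) for integer n, as a rational function of rho."""
    rho = Fraction(rho)
    out = Fraction(1)
    if n >= 0:
        for k in range(1, n + 1):
            out /= rho + k
    else:
        for k in range(0, -n):
            out *= rho - k
    return out

def harmonic(n):
    """S_n = 1 + 1/2 + ... + 1/n, with S_0 = 0."""
    return sum((Fraction(1, k) for k in range(1, n + 1)), Fraction(0))

def first_derivative(n):
    """Right-hand side of (A.5)."""
    if n == 0:
        return Fraction(0)
    if n > 0:
        return -harmonic(n) / factorial(n)
    return Fraction((-1) ** (-n - 1) * factorial(-n - 1))

def second_derivative_nonpositive(n):
    """Right-hand side of (A.6) for n <= 0."""
    if n == 0:
        return Fraction(0)
    return 2 * (-1) ** (-n) * harmonic(-n - 1) * factorial(-n - 1)

H = Fraction(1, 10**6)   # finite-difference step
TOL = Fraction(1, 10**9)

def d1(n):
    return (ratio(n, H) - ratio(n, -H)) / (2 * H)

def d2(n):
    return (ratio(n, H) - 2 * ratio(n, 0) + ratio(n, -H)) / H**2

print(all(abs(d1(n) - first_derivative(n)) < TOL for n in range(-4, 5)))
print(all(abs(d2(n) - second_derivative_nonpositive(n)) < TOL for n in range(-4, 1)))
```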
S4− with (4.8). Similarly we find at the other branch with z1 < 0 which intersects z2 = 0 at
z1 = −1/27 that
$$-\frac{1}{2} \Pi_{11} + \frac{1}{2} \Pi_1 - \frac{1}{4} = 0. \tag{A.9}$$
This corresponds to a z1 -monodromy transformed version of S2− vanishing. The third
branch, with z1 > 0 and z2 < 0 is beyond the region of convergence. We have preliminary
evidence that at this branch a z2 -monodromy transformed version of S0− becomes
massless.
References
[31] D. Cox, The homogeneous coordinate ring of a toric variety, J. Alg. Geom. 4 (1995) 17, alg-geom/9210008.
[32] D. Cox, S. Katz, Mirror Symmetry and Algebraic Geometry, American Mathematical Society, Providence,
1999.
[33] E. Witten, Phases of N = 2 theories in two dimensions, Nucl. Phys. B 403 (1993) 159, hep-th/9301042.
[34] P.S. Aspinwall, B.R. Greene, D.R. Morrison, Multiple mirror manifolds and topology change in string
theory, Phys. Lett. B 303 (1993) 249, hep-th/9301043.
[35] T.-M. Chiang, A. Klemm, S.-T. Yau, E. Zaslow, Local mirror symmetry: calculations and interpretations,
Adv. Theor. Math. Phys. 3 (1999) 495, hep-th/9903053.
[36] P.S. Aspinwall, B.R. Greene, D.R. Morrison, Measuring small distances in N = 2 sigma models, Nucl. Phys.
B 420 (1994) 184, hep-th/9311042.
[37] M.R. Douglas, D-branes, categories and N = 1 supersymmetry, hep-th/0011017.
[38] P.S. Aspinwall, A. Lawrence, Derived categories and zero-brane stability, hep-th/0104147.
[39] C.I. Lazaroiu, Generalized complexes and string field theory, hep-th/0102122.
[40] D.-E. Diaconescu, Enhanced D-brane categories from string field theory, hep-th/0104200.
[41] W. Fulton, Intersection Theory, Springer, 1984.
[42] M.R. Douglas, B.R. Greene, D.R. Morrison, Orbifold resolution by D-branes, Nucl. Phys. B 506 (1997) 84,
hep-th/9704151.
[43] P. Baum, W. Fulton, R. MacPherson, Riemann–Roch and topological K-theory for singular varieties, Acta
Math. 143 (1979) 155.
[44] B. Iversen, Local Chern classes, Ann. Sci. École Norm. Sup. 9 (1976) 155.
[45] D.-E. Diaconescu, C. Römelsberger, D-branes and bundles on elliptic fibrations, Nucl. Phys. B 574 (2000)
245, hep-th/9910172.
[46] P.S. Aspinwall, Resolution of orbifold singularities in string theory, hep-th/9403123.
[47] E. Zaslow, Solitons and helices: the search for a math-physics bridge, Commun. Math. Phys. 175 (1996)
337, hep-th/9408133.
[48] K. Hori, A. Iqbal, C. Vafa, D-branes and mirror symmetry, hep-th/0005247.
[49] A.B. Kvichansky, D.Yu. Nogin, Exceptional collections on ruled surfaces, in: A.N. Rudakov, et al. (Eds.),
Helices and Vector Bundles: Seminaire Rudakov, in: London Math. Soc. Lecture Note Ser., Vol. 148,
Cambridge Univ. Press, Cambridge, 1995, p. 97.
[50] B.R. Greene, D-brane topology changing transitions, Nucl. Phys. B 525 (1998) 284, hep-th/9711124.
[51] P.S. Aspinwall, M.R. Plesser, D-branes, discrete torsion and the McKay correspondence, JHEP 0102 (2001)
009, hep-th/0009042.
[52] S. Mukhopadhyay, K. Ray, Fractional branes on a non-compact orbifold, hep-th/0102146.
Nuclear Physics B 644 (2002) 201–222
www.elsevier.com/locate/npe
Abstract
We consider a six-dimensional brane world model with asymmetric warp factors for time and both
extra spatial coordinates, y and z. We derive the set of differential equations governing the shortest
graviton path and numerically solve it for AdS–Schwarzschild and AdS–Reissner–Nordström bulks.
In both cases we derive a set of conditions for the existence of shortcuts in bulks with shielded
singularities and show some examples of shortcuts obtained under these conditions. Consequences
are discussed.
© 2002 Elsevier Science B.V. All rights reserved.
1. Introduction
The ideas of Kaluza and Klein [1], advocating the physical possibility of extra
dimensions in order to achieve the unification of different field theories, can be considered
a landmark in Quantum Field Theory.
Their importance grew especially half a century after the original works, in the
framework of supergravity and string theories. In the latter, the existence of extra
dimensions is actually enforced by consistency.
Furthermore, new ways of realizing the extra dimensions made it possible to explore new
mechanisms for explaining unified field theories. The possibility of explaining hierarchies in
such a context is especially appealing and has been confirmed in the works of Arkani-Hamed
0550-3213/02/$ – see front matter 2002 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 ( 0 2 ) 0 0 7 9 7 - 6
202 E. Abdalla et al. / Nuclear Physics B 644 (2002) 201–222
and others [2,3]. The hierarchy between the electro-weak (∼ 100 GeV) and the Planck
($10^{19}$ GeV) scales has been addressed by considering extra dimensions of
submillimeter size, which show up in a theory of two extra dimensions connecting both
scales. Such an idea replaces the usual one where extra dimensions should only show up at
the Planck scale, and the tower of massive particles thus generated is above that level and
has a wider validity, including cosmology [4].
Such constraints on the size of the extra dimensions constitute drawbacks in the
formulation of the theory.
More recently, Randall and Sundrum [5,6] proposed a model, or rather a class of
models, where there is a warp factor in the metric, such that even infinitely large extra
dimensions are allowed.
The existence of large extra dimensions with a warp factor naturally raises the question
of whether information can follow a shorter path outside the brane riding on gravitons
[7–10]. We proposed a simple calculation to establish the shortest path followed by a
graviton [11], which propagating in all dimensions in the so-called bulk, could in principle
follow a path which decouples from the brane, that is, from our universe, returning later
to another point, advanced in time with respect to a photon, which by construction must
follow a path in the brane, thus being delayed. In our previous paper we considered
a model constructed in [13] which was basically a generalized Friedmann–Robertson–Walker
universe with cosmological constant, with different scenarios in the brane (where
the Standard Model fields live) and in the bulk. The result was actually a negative
one, that is, the shortest path followed by the graviton was the same as the one followed by
the photon, namely, inside the brane.
In a model introduced by Csáki et al. [14] the speed of light along flat four-dimensional
sections varies over the extra dimensions due to different warp factors for the space and
time coordinates, a construction similar to the one of Randall and Sundrum. Thus the
authors proposed that gravitational waves might travel faster than photons, which remain
in the brane. The delay between electromagnetic and gravity waves may be experimentally
detected with the gravitational waves detectors under way [10,15].
The models are basically AdS–Schwarzschild or AdS–Reissner–Nordström black holes
in the bulk. Brane models in AdS space with Schwarzschild singularities have been used to
understand the AdS/CFT correspondence and look like promising theoretical models [16,17].
They are based on the Randall–Sundrum scheme [5,6], where a large mass hierarchy
is obtained with uncompactified dimensions from solutions of Einstein equations in higher
dimensions (i.e., in the bulk) with two separated branes. The four-dimensional part of
the metric is multiplied by a “warp” factor which is a rapidly changing function of the
additional dimension.
In this paper we consider a six-dimensional model and look for possible shortcuts
for AdS–Schwarzschild and AdS–Reissner–Nordström bulk configurations. The paper is
organized as follows. In Section 2 we describe a general six-dimensional model, derive
Einstein equations, and find the Israel conditions the metric has to satisfy due to the brane
embedding. At this point we choose a metric describing a six-dimensional black hole and
add a Z2 symmetry. In Section 3 we find the Euler–Lagrange equations which define
the graviton path in this model. Section 4 is devoted to the study of the numerical solutions
of these equations in the context of an AdS–Schwarzschild bulk, finding certain analytical
E. Abdalla et al. / Nuclear Physics B 644 (2002) 201–222 203
2. A six-dimensional model
We consider a six-dimensional model, such as the one constructed by Kanti et al. [18].
We also search for a solution of the six-dimensional Einstein equations in AdS space of the
form
$$ds^2 = -n^2(t,y,z)\,dt^2 + a^2(t,y,z)\,d\Sigma_k^2 + b^2(t,y,z)\,dy^2 + c^2(t,y,z)\,dz^2, \tag{1}$$
where $d\Sigma_k^2$ represents the metric of the three-dimensional spatial sections with $k = -1, 0, 1$
corresponding to a hyperbolic, a flat and an elliptic space, respectively.
The total energy–momentum tensor can be decomposed in two parts corresponding to
the bulk and the brane as
$$\tilde T^{M}_{N} = \breve T^{M(B)}_{N} + T^{M(b)}_{N}, \tag{2}$$
where the brane contribution can be written as
$$T^{M(b)}_{N} = \frac{\delta(z - z_0)}{bc}\,\mathrm{diag}(-\rho, p, p, p, \hat p, 0). \tag{3}$$
In order to have a well-defined geometry, the metric must be continuous across the
brane; however, its derivatives with respect to z can be discontinuous at the position of the
brane, generating a Dirac δ-function in the second derivatives of the metric with respect to
z [12]. These δ function terms must be matched with the components of the brane energy–
momentum tensor (3) in order to satisfy Einstein equations. Thus, we obtain the following
Israel conditions,
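$$\frac{[\partial_z a]}{a_0 b_0 c_0} = -\frac{\kappa_{(6)}^2}{4}\,(p - \hat p + \rho),$$
$$\frac{[\partial_z b]}{b_0^2 c_0} = -\frac{\kappa_{(6)}^2}{4}\,\bigl\{\rho - 3(p - \hat p)\bigr\},$$
$$\frac{[\partial_z n]}{b_0 c_0 n_0} = \frac{\kappa_{(6)}^2}{4}\,\bigl\{\hat p + 3(p + \rho)\bigr\}. \tag{4}$$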
A metric of the form (1) satisfying six-dimensional Einstein equations is given by
$$ds^2 = -h(z)\,dt^2 + \frac{z^2}{l^2}\,d\Sigma_k^2 + h^{-1}(z)\,dz^2, \tag{5}$$
where
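$$d\Sigma_k^2 = \frac{dr^2}{1 - kr^2} + r^2\,d\Omega_{(2)}^2 + \left(1 - kr^2\right)dy^2, \tag{6}$$
and
$$h(z) = k + \frac{z^2}{l^2} - \frac{M}{z^3}, \quad \text{for AdS–Schwarzschild bulk,} \tag{7}$$
$$h(z) = k + \frac{z^2}{l^2} - \frac{M}{z^3} + \frac{Q^2}{z^6}, \quad \text{for AdS–Reissner–Nordström bulk,} \tag{8}$$
with $l^{-2} \propto -\Lambda$ ($\Lambda$ being the cosmological constant), which describes a black hole in the
bulk, located at $z = 0$.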
Following [14], we find a further solution by means of a Z2 symmetry inverting the
space with respect to the brane position. That is, considering a metric of the form
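$$\ddot z + \left(\frac{a'}{a} - 2\frac{n'}{n} + \frac{d'}{d}\right)\dot z^2 + \frac{n n'}{d^2} - \frac{a'}{a}\frac{n^2}{d^2} = 0, \tag{15}$$
where primes denote derivatives with respect to $z$.
For $z \le z_0$, $a = z/l$, $n = \sqrt{h(z)}$, and $d = 1/\sqrt{h(z)}$. For $z \ge z_0$ we have to use the $Z_2$
symmetry shown in (10).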
Notice that this case is equivalent to considering the problem in five dimensions with the
metric shown in [14].
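As a rough illustration of how Eq. (15) can be integrated numerically, the following sketch inserts $a = z/l$, $n = \sqrt{h}$ and $d = 1/\sqrt{h}$ with the AdS–Schwarzschild $h(z)$ of Eq. (7), which gives $\ddot z = -(1/z - 3h'/2h)\,\dot z^2 - h\,(h'/2 - h/z)$, and advances the path with a fourth-order Runge–Kutta step. The function names, the step size, and the values of $M$, $z_0$ and the initial velocity are assumptions chosen only for illustration; this is not the authors' numerical code.

```python
def h(z, M, l=1.0, k=1.0):
    """Metric function of Eq. (7), AdS-Schwarzschild bulk."""
    return k + (z / l) ** 2 - M / z**3

def dh(z, M, l=1.0):
    """Derivative h'(z) of Eq. (7)."""
    return 2.0 * z / l**2 + 3.0 * M / z**4

def zddot(z, zdot, M, l=1.0, k=1.0):
    """Eq. (15) solved for the second derivative, with a=z/l, n=sqrt(h), d=1/sqrt(h)."""
    hz, dhz = h(z, M, l, k), dh(z, M, l)
    return -(1.0 / z - 1.5 * dhz / hz) * zdot**2 - hz * (0.5 * dhz - hz / z)

def graviton_path(z0, v0, M, dt=1e-4, t_max=5.0):
    """Integrate from the brane at z0 with initial velocity v0 < 0 (into the bulk).

    Returns (return_time, z_min); return_time is None if the path does not
    come back to the brane before t_max or reaches the horizon.
    """
    z, v, t, z_min = z0, v0, 0.0, z0
    while t < t_max:
        k1z, k1v = v, zddot(z, v, M)
        k2z, k2v = v + 0.5 * dt * k1v, zddot(z + 0.5 * dt * k1z, v + 0.5 * dt * k1v, M)
        k3z, k3v = v + 0.5 * dt * k2v, zddot(z + 0.5 * dt * k2z, v + 0.5 * dt * k2v, M)
        k4z, k4v = v + dt * k3v, zddot(z + dt * k3z, v + dt * k3v, M)
        z += dt * (k1z + 2.0 * k2z + 2.0 * k3z + k4z) / 6.0
        v += dt * (k1v + 2.0 * k2v + 2.0 * k3v + k4v) / 6.0
        t += dt
        z_min = min(z_min, z)
        if h(z, M) <= 1e-9:   # the path has reached the horizon
            return None, z_min
        if z >= z0:           # the path is back on the brane
            return t, z_min
    return None, z_min

# Illustrative (assumed) values: brane at z0 = 1/3, l = 1, k = 1, small mass M.
t_ret, z_min = graviton_path(1.0 / 3.0, -0.1, 0.00242)
print("return time:", t_ret, "turning point:", z_min)
```

Scanning `v0` over a range of values is one way to look for a threshold velocity beyond which the path no longer returns to the brane.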
The most general case includes a y dependence in the graviton path; however, as we
will show in what follows, this dependence turns out to be superfluous and does not affect the
z-equation, since (15) is independently satisfied. This conclusion is not surprising if we notice
that the metric (13) is y-independent.
The two Euler–Lagrange equations considering a y dependence are then given by
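$$\left(n^2 - d^2\dot z^2\right)\ddot y + \dot z\left\{\left(-\frac{a'}{a} + 2\frac{b'}{b}\right)\left(n^2 - d^2\dot z^2\right) - nn' + dd'\dot z^2 + d^2\ddot z\right\}\dot y + b^2\left(\frac{a'}{a} - \frac{b'}{b}\right)\dot z\,\dot y^3 = 0 \tag{16}$$
and
$$\left(n^2 - b^2\dot y^2\right)\ddot z + \left\{\left(\frac{a'}{a} + \frac{d'}{d}\right)\left(n^2 - b^2\dot y^2\right) - 2nn' + 2bb'\dot y^2\right\}\dot z^2 + b^2\dot y\,\ddot y\,\dot z + \left\{-\frac{a'}{a d^2}\left(n^2 - b^2\dot y^2\right) + \frac{nn' - bb'\dot y^2}{d^2}\right\}\left(n^2 - b^2\dot y^2\right) = 0. \tag{17}$$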
It is clear that the case ẏ = 0 is a solution of this set of equations when at the same time
z obeys (15).
This set of equations can be handled leading to
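$$\frac{z\dot z\dot y}{h(z)}\,F_z + \left(h(z) - \frac{\dot z^2}{h(z)}\right)F_y = 0,$$
$$\left(1 - \frac{z^2\dot y^2}{h(z)}\right)F_z + \frac{z\dot z\dot y}{h(z)}\,F_y = 0, \tag{18}$$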
where
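$$F_y = \ddot y + \dot z\left(\frac{2}{z} - \frac{h'(z)}{h(z)}\right)\dot y,$$
$$\ddot x^{\alpha} + \Gamma^{\alpha}_{\mu\nu}\,\dot x^{\mu}\dot x^{\nu} = \lambda\,\dot x^{\alpha}. \tag{20}$$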
Thus, a null curve is extreme if and only if it is a null geodesic.
Then, as (19) is the same as (15), our problem is reduced to the previous case with
constant y described by (15).
For the $k \neq 0$ cases we can also consider (15) as the shortcut equation if we assume
the existence of a y-symmetry in our problem. In this way, our model represents a
generalization of [14].
4. AdS–Schwarzschild bulk
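$$\frac{M}{z_0^5} = \frac{2}{5}\frac{k}{z_0^2} - (\omega + 1)\frac{\kappa_{(6)}^4\rho^2}{40}, \tag{23}$$
$$\frac{\kappa_{(6)}^4\rho^2}{64} = -\frac{3k}{z_0^2\,(8\omega + 3)} - \frac{5}{(8\omega + 3)\,l^2}, \tag{24}$$
where $\omega = p/\rho$.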
As we saw in the previous section, the shortcuts in six dimensions are determined
from (15). We should also remember that the brane is static at z = z0 .
If a shortcut exists, there must be a time $t = v$ in the graviton path when $\dot z(v) = 0$ and
$\ddot z(v) \ge 0$. Thus, (15) evaluated at this point gives
$$\ddot z(v) + h(z_v)\left[\frac{h'(z_v)}{2} - \frac{h(z_v)}{z_v}\right] = 0. \tag{25}$$
It is obvious that this minimum must be between the brane and the event horizon zh , if
a horizon exists. Otherwise, there is no turning point in the path since the graviton cannot
return after it goes through the event horizon. Hence, h(zv ) > 0. Thus, from (25) we require
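$$F(z_v) = \frac{h'(z_v)}{2} - \frac{h(z_v)}{z_v} \le 0 \quad \text{for } z_h < z_v < z_0. \tag{26}$$
Using (7) this implies
$$F(z_v) = \frac{5M}{2 z_v^4} - \frac{k}{z_v} \le 0. \tag{27}$$
This expression has a zero at $z = z_f \neq 0$ for $k \neq 0$:
$$z_f^3 = \frac{5}{2k}\,M.$$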
Thus, for the k = 0 or k = −1 cases there is no positive root. Since the mass, M, is positive,
F (z) > 0 everywhere preventing the coexistence of shortcuts and horizons.
On the other hand, for k = 1 there is one real and positive root, which must satisfy
$z_f < z_0$ in order to have shortcuts, that is,
$$\frac{5M}{2z_0^3} - 1 < 0.$$
Taking into account (23) and the fact that $\varepsilon^2$ must be positive in (24),¹
$$-4(\omega + 1)\,\varepsilon^2 z_0^2 < 0, \tag{28}$$
then
$$\omega + 1 > 0. \tag{29}$$
Now, let us study the conditions under which the event horizon must appear. In general,
the horizons occur at the zeros of $h(z)$, or equivalently (for $k = 1$) at the zeros of
$$z^5 + z^3 l^2 - l^2 M.$$
Meanwhile, the non-vanishing zeros of $h'(z)$ occur when
$$2z^5 + 3l^2 M = 0.$$
Since the derivative has no positive zeros for $M > 0$, there is just one event horizon. Then,
as $h(z)$ goes to $-\infty$ at the origin, the conditions
$$M > 0 \tag{30}$$
and
$$h(z_0) > 0 \tag{31}$$
are necessary and, in fact, sufficient to have a horizon and to ensure that the brane lies beyond it.
Fig. 1. h(z) in six-dimensional AdS–Schwarzschild bulk with the brane located at z = 1/3. Notice that the
singularity is shielded by a horizon.
$$\omega + \frac{3}{4} + (\omega + 1)\frac{z_0^2}{l^2} < 0.$$
If $\omega + 1 \le 0$, this condition is always satisfied, but this configuration does not produce
shortcuts as we would like. However, the condition is also satisfied with $\omega + 1 > 0$ if we
require
$$-1 < \omega < -\frac{3}{4}, \tag{32}$$
and
$$\frac{z_0^2}{l^2} < -\frac{\omega + 3/4}{\omega + 1}. \tag{33}$$
If we follow both (32) and (33) together with the fine-tuning for the energy (24), we
will have several shortcuts in AdS–Schwarzschild bulks with shielded singularity. In Figs. 1
and 2 we illustrate an example with ω = −4/5, z0 = 1/3, and l = 1. Notice in Fig. 1 that
the horizon appears before the brane.
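The conditions above can be checked numerically for the quoted example. The sketch below assumes the reading $\varepsilon^2 \equiv \kappa_{(6)}^4\rho^2/64$ for the left-hand side of Eq. (24), so that Eq. (23) becomes $M/z_0^5 = \tfrac{2}{5}k/z_0^2 - \tfrac{8}{5}(\omega+1)\varepsilon^2$; the function names and the bisection routine are illustrative choices, not part of the original analysis.

```python
def brane_parameters(omega, z0, l=1.0, k=1.0):
    """Return (eps2, M) from Eqs. (24) and (23), with eps2 = kappa^4 rho^2 / 64 (assumed)."""
    eps2 = -(3.0 * k / z0**2 + 5.0 / l**2) / (8.0 * omega + 3.0)
    M = z0**5 * (0.4 * k / z0**2 - 1.6 * (omega + 1.0) * eps2)
    return eps2, M

def h(z, M, l=1.0, k=1.0):
    """Metric function of Eq. (7)."""
    return k + (z / l) ** 2 - M / z**3

def F(z, M, k=1.0):
    """F(z) of Eq. (27); a shortcut needs F <= 0 at the turning point."""
    return 2.5 * M / z**4 - k / z

def satisfies_32_33(omega, z0, l=1.0):
    """Check the window (32) for omega and the bound (33) on the brane position."""
    return -1.0 < omega < -0.75 and (z0 / l) ** 2 < -(omega + 0.75) / (omega + 1.0)

def horizon(M, z0, l=1.0, k=1.0):
    """Locate the zero of h between the singularity and the brane by bisection."""
    lo, hi = 1e-6, z0
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if h(mid, M, l, k) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

omega, z0, l = -0.8, 1.0 / 3.0, 1.0   # example values quoted in the text
eps2, M = brane_parameters(omega, z0, l)
z_h = horizon(M, z0, l)
print("eps^2 =", eps2, " M =", M, " horizon at z =", z_h)
print("conditions (32),(33):", satisfies_32_33(omega, z0, l), " F(z0) =", F(z0, M))
```

For these inputs the horizon found by the bisection lies inside the brane position, in line with the statement that the horizon appears before the brane.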
Since this case is equivalent to considering the problem in five dimensions with $h(z)$, $M$
and $\rho$ given in [14], analogous results are obtained. In this case, the fine-tuning in the
energy is given by²
$$\varepsilon_{(5)}^2 = -\frac{1}{3\omega + 1}\left(\frac{1}{z_0^2} + \frac{2}{l^2}\right), \tag{34}$$
Fig. 2. Shortcuts for several initial velocities in six-dimensional AdS–Schwarzschild bulk. Notice that there is a
threshold initial velocity for which the graviton cannot return to the brane and falls into the event horizon.
Fig. 3. h(z) in five-dimensional AdS–Schwarzschild bulk with the brane located at z = 1/2. Notice that the
singularity is shielded by a horizon.
and $\omega$ is confined to
$$-1 < \omega < -\frac{2}{3}, \tag{35}$$
while the brane position is given by
$$\frac{z_0^2}{l^2} < -\frac{\omega + 2/3}{\omega + 1}. \tag{36}$$
An example is shown in Figs. 3 and 4 for ω = −3/4, z0 = 1/2, and l = 1.
Fig. 4. Shortcuts for several initial velocities in five-dimensional AdS–Schwarzschild bulk. As in the six
dimensional case, there is a threshold initial velocity for which the graviton cannot return to the brane and falls
into the event horizon.
5. AdS–Reissner–Nordström bulk
From the Israel conditions (11) we will have for the black hole mass and charge,
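$$\frac{M}{z_0^5} = \frac{2k}{z_0^2} + \frac{8}{3l^2} + \frac{\kappa_{(6)}^4}{24}\,\rho^2\omega,$$
$$\frac{Q^2}{z_0^8} = \frac{k}{z_0^2} + \frac{5}{3l^2} + \frac{8\omega + 3}{3}\,\frac{\kappa_{(6)}^4\rho^2}{64}. \tag{37}$$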
At this stage it is convenient to carefully study the possibility of existence of shortcuts
for every value of k.
5.2. k = 1 case
As we saw in the previous section, $F(z)$ has two real, positive and distinct roots for
$k = 1$,
$$r_1^3 = \frac{5}{4}M - \frac{1}{4}\sqrt{25M^2 - 64Q^2}, \tag{45}$$
$$r_2^3 = \frac{5}{4}M + \frac{1}{4}\sqrt{25M^2 - 64Q^2}. \tag{46}$$
This is the only situation where the shortcuts can coexist with a shielded singularity. In
fact, this situation necessarily requires the second root of F (z) being at some point before
the brane position z0 . This also implies F (z0 ) < 0.
In addition, we must have both $Q^2$ and $M$ positive.
Given the fact that we have horizons, if the brane is not between them or at a horizon
position, then $h(z_0) > 0$. Furthermore, in order to guarantee that the brane is located after
the event horizon, we also need $h'(z_0) > 0$.
From the discussion in the previous section we will have one or two horizons if and
only if $h(r_1) \le 0$.
In summary, shortcuts in bulks with shielded singularities can occur only if $k = 1$ and
if the following conditions are satisfied:
(1) $h(z_0) > 0$ and $h'(z_0) > 0$ to have both horizons before the brane;
(2) $F(z_0) < 0$ and $r_2 < z_0$ to have shortcuts with shielded singularity;
(3) $Q^2 > 0$ and $M > 0$, which ensures the positivity of the black hole mass and squared charge;
(4) $h(r_1) \le 0$ in order to have horizons.
$$\frac{h'(z_0)}{2z_0} = -(4\omega + 3)\,\varepsilon^2. \tag{48}$$
The condition (47) is automatically satisfied since ε2 > 0.
From the condition (48)
ω + 1 > 0. (52)
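$$z_0^2\varepsilon^2 > \frac{1}{\omega}\left(-\frac{9}{20} - \frac{z_0^2}{l^2}\right). \tag{56}$$
$$z_0^2\varepsilon^2 < \frac{1}{\omega}\left(-\frac{3}{4} - \frac{z_0^2}{l^2}\right). \tag{58}$$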
Since 3/4 > 9/20 this condition is certainly compatible with (56).
On the other hand, the positivity of the squared black hole charge requires
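$$\frac{Q^2}{z_0^6} > 0 \;\Rightarrow\; 1 + \frac{5z_0^2}{3l^2} + \left(\frac{8}{3}\omega + 1\right)z_0^2\varepsilon^2 > 0, \tag{59}$$
so that
$$z_0^2\varepsilon^2 < \frac{1}{8\omega + 3}\left(-3 - \frac{5z_0^2}{l^2}\right). \tag{60}$$
Although not trivially so, this condition is also compatible with (56). This requires
$$\frac{1}{\omega}\left(-\frac{9}{20} - \frac{z_0^2}{l^2}\right) < \frac{1}{8\omega + 3}\left(-3 - \frac{5z_0^2}{l^2}\right), \tag{61}$$
or
$$-\frac{1}{5}\left(\omega + \frac{9}{4}\right) - \frac{z_0^2}{l^2}(\omega + 1) < 0, \tag{62}$$
which is always true for $-1 < \omega < -3/4$.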
3 We assume that $25M^2 - 64Q^2 > 0$. We will return to this condition when we discuss the existence of
horizons, where we will impose a stronger restriction, $M^2 - 4Q^2 > 0$.
$$\frac{x^{8/3}}{l^2} + x^2 - Mx + Q^2 \le 0. \tag{63}$$
We do not need to do a complete study of this equation. For our purposes it will be enough
to require
$$x^2 - Mx + Q^2 < 0. \tag{64}$$
Using (45) this implies
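$$1 + \frac{16}{9}\frac{z_0^2}{l^2} + \left(\frac{32}{9}\omega z_0^2 - l^2\right)\varepsilon^2 + \frac{16}{9}\,l^2 z_0^2\,\omega^2\varepsilon^4 > 0, \tag{67}$$
which implies
$$z_0^2\varepsilon^2 > \frac{1}{32\omega^2 l^2}\left(-32\omega z_0^2 + 9l^2 + 3\sqrt{-64\omega z_0^2 l^2 + 9l^4 - 64 l^2 z_0^2\omega^2}\right), \tag{68}$$
or
$$z_0^2\varepsilon^2 < \frac{1}{32\omega^2 l^2}\left(-32\omega z_0^2 + 9l^2 - 3\sqrt{-64\omega z_0^2 l^2 + 9l^4 - 64 l^2 z_0^2\omega^2}\right). \tag{69}$$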
Notice that because −1 < ω < −3/4,
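$$\frac{1}{\omega}\left(-\frac{9}{20} - \frac{z_0^2}{l^2}\right) < z_0^2\varepsilon^2 < \frac{1}{\omega}\left(-\frac{3}{4} - \frac{z_0^2}{l^2}\right), \quad \text{or} \tag{71}$$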
$$\frac{1}{\omega}\left(-\frac{9}{20} - \frac{z_0^2}{l^2}\right) < z_0^2\varepsilon^2 < \frac{1}{8\omega + 3}\left(-3 - \frac{5z_0^2}{l^2}\right), \tag{72}$$
depending on which condition is more restrictive.
In addition,
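$$z_0^2\varepsilon^2 > \frac{1}{32\omega^2 l^2}\left(-32\omega z_0^2 + 9l^2 + 3\sqrt{-64\omega z_0^2 l^2 + 9l^4 - 64 l^2 z_0^2\omega^2}\right), \tag{73}$$
or
$$z_0^2\varepsilon^2 < \frac{1}{32\omega^2 l^2}\left(-32\omega z_0^2 + 9l^2 - 3\sqrt{-64\omega z_0^2 l^2 + 9l^4 - 64 l^2 z_0^2\omega^2}\right). \tag{74}$$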
Now we are going to analyze the situations in which all these conditions are compatible.
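Let us begin our analysis with Eq. (74). To be compatible with (71) and (72), we just
need
$$\frac{1}{\omega}\left(-\frac{9}{20} - \frac{z_0^2}{l^2}\right) < \frac{1}{32\omega^2 l^2}\left(-32\omega z_0^2 + 9l^2 - 3\sqrt{-64\omega z_0^2 l^2 + 9l^4 - 64 l^2 z_0^2\omega^2}\right), \tag{75}$$
that is,
$$-\frac{9}{20}\omega - \frac{9}{32} + \frac{3}{32}\sqrt{-64\omega\frac{z_0^2}{l^2} + 9 - 64\frac{z_0^2}{l^2}\omega^2} < 0. \tag{76}$$
Since $3/4 < |\omega| < 1$,
$$-\frac{9}{20}\omega - \frac{9}{32}$$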
will always be positive and thus, (76) will never be satisfied. Then, we conclude that (74)
is not compatible either with (71) or (72). This implies that z02 ε2 must satisfy (73) together
with (71) or (72).
Let us initially compare (71) with (73). We must have
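$$\frac{1}{\omega}\left(-\frac{3}{4} - \frac{z_0^2}{l^2}\right) > \frac{1}{32\omega^2 l^2}\left(-32\omega z_0^2 + 9l^2 + 3\sqrt{-64\omega z_0^2 l^2 + 9l^4 - 64 l^2 z_0^2\omega^2}\right), \tag{77}$$
that is,
$$-\frac{3}{4}\omega - \frac{9}{32} - \frac{3}{32}\sqrt{-64\omega\frac{z_0^2}{l^2} + 9 - 64\frac{z_0^2}{l^2}\omega^2} > 0. \tag{78}$$
In this case, since $\omega$ is negative and $3/4 < |\omega| < 1$,
$$-\frac{3}{4}\omega - \frac{9}{32} > 0$$
and (78) can be satisfied if
$$\left(-\frac{3}{4}\omega - \frac{9}{32}\right)^2 > \frac{9}{1024}\left(-64\omega\frac{z_0^2}{l^2} + 9 - 64\frac{z_0^2}{l^2}\omega^2\right), \tag{79}$$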
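or
$$\frac{9}{16}\,\omega(\omega + 1)\frac{z_0^2}{l^2} + \frac{9}{64}(3 + 4\omega)\,\omega > 0. \tag{80}$$
Since $\omega + 1 > 0$ and $3 + 4\omega < 0$, for positive $z_0$ the inequality is fulfilled only if
$$\frac{z_0}{l} < \frac{1}{2}\sqrt{-\frac{3 + 4\omega}{1 + \omega}}. \tag{81}$$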
So that (73) and (71) can be compatible.
Now we are going to analyze the compatibility between (73) and (72). We must have
$$\frac{1}{8\omega + 3}\left(-3 - \frac{5z_0^2}{l^2}\right) + \frac{z_0^2}{\omega l^2} - \frac{9}{32\omega^2} > \frac{3}{32\omega^2 l^2}\sqrt{-64\omega z_0^2 l^2 + 9l^4 - 64 l^2 z_0^2\omega^2} \tag{82}$$
or, using that $8\omega + 3 < 0$, simplifying, and squaring both sides, we can write (82) as
$$\frac{1}{16}\,\omega^2(3 + 4\omega)^2 + \omega^2(\omega + 1)^2\left(\frac{z_0^2}{l^2}\right)^2 + \frac{\omega^2}{2}(3 + 4\omega)(\omega + 1)\frac{z_0^2}{l^2} > 0. \tag{83}$$
This polynomial has just one root in $z_0^2/l^2$,
$$\frac{z_0^2}{l^2} = -\frac{1}{4}\,\frac{3 + 4\omega}{1 + \omega}.$$
Since the coefficient of $z_0^4/l^4$ is positive, the inequality is satisfied with the same
condition (81), so we verify that both (72) and (71) are compatible with (73) under the
same restrictions.
Furthermore, let us compare the upper limits of (71) and (72). Suppose
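$$\frac{1}{\omega}\left(-\frac{3}{4} - \frac{z_0^2}{l^2}\right) > \frac{1}{8\omega + 3}\left(-3 - \frac{5z_0^2}{l^2}\right), \tag{84}$$
which can also be written as
$$-\frac{3}{4}(4\omega + 3) - \frac{3z_0^2}{l^2}(\omega + 1) > 0. \tag{85}$$
Thus, (84) is satisfied if and only if
$$\frac{z_0^2}{l^2} < -\frac{1}{4}\,\frac{3 + 4\omega}{1 + \omega}, \tag{86}$$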
which is just the same inequality (81), that z0 must satisfy. Hence, between (71) and (72),
it is enough to take into account the latter. Nevertheless, from (83) notice that (72) would
be also compatible with (73) if
$$\frac{z_0^2}{l^2} > -\frac{1}{4}\,\frac{3 + 4\omega}{1 + \omega},$$
and (84) would be satisfied with a change of sign implying that we should consider (71)
instead of (72); however, as stated before, the compatibility of (71) and (73) requires
$$\frac{z_0^2}{l^2} < -\frac{1}{4}\,\frac{3 + 4\omega}{1 + \omega},$$
which contradicts our hypothesis. Therefore, the only possible configuration is (86).
At last, we compare the lower limits of (72) and (73). Suppose
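$$\frac{1}{\omega}\left(-\frac{9}{20} - \frac{z_0^2}{l^2}\right) < \frac{1}{32\omega^2 l^2}\left(-32\omega z_0^2 + 9l^2 + 3\sqrt{-64\omega z_0^2 l^2 + 9l^4 - 64 l^2 z_0^2\omega^2}\right), \tag{87}$$
or
$$-\frac{9}{20}\omega - \frac{9}{32} < \frac{3}{32}\sqrt{-64\omega\frac{z_0^2}{l^2} + 9 - 64\frac{z_0^2}{l^2}\omega^2}.$$
Squaring and simplifying we obtain
$$\frac{z_0^2}{l^2} > -\frac{9}{20(\omega + 1)}\left(\frac{4}{5}\omega + 1\right).$$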
For −1 < ω < −3/4 this inequality is always satisfied since the right-hand side is negative.
Hence, we conclude that (87) is valid and between the lower limits for the energy in (72)
and (73), we just need to choose the latter.
In short, by purely analytic considerations we conclude that shortcuts in bulks having
no naked singularities and a static brane embedded in them can only appear if $k = 1$ and if the
following conditions are satisfied:
$$\ldots < z_0^2\varepsilon^2 < \frac{1}{8\omega + 3}\left(-3 - \frac{5z_0^2}{l^2}\right). \tag{89}$$
In this way, it turns out to be simple to find shortcuts in bulks with shielded singularities.
As an example, let us choose ω = −9/10. From (88) we must have
$$\frac{z_0}{l} < \frac{\sqrt{6}}{2},$$
then we choose l = 1 and z0 = 1.
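To see how narrow the allowed energy window is for this example, the bounds (56), (71), (72) and (73) can be evaluated directly, and the mass and charge then follow from Eq. (37) with $k = 1$. The sketch below assumes $\varepsilon^2 \equiv \kappa_{(6)}^4\rho^2/64$, writes $E = z_0^2\varepsilon^2$ and $x = z_0^2/l^2$, and picks the midpoint of the window as a purely hypothetical value of $E$; the function names are illustrative.

```python
import math

def eps2_window(omega, x):
    """Bounds on E = z0^2 eps^2 from (56), (73) (lower) and (71), (72) (upper); x = z0^2/l^2."""
    root = math.sqrt(-64.0 * omega * x + 9.0 - 64.0 * x * omega**2)
    lower_56 = (-9.0 / 20.0 - x) / omega
    lower_73 = (-32.0 * omega * x + 9.0 + 3.0 * root) / (32.0 * omega**2)
    upper_71 = (-3.0 / 4.0 - x) / omega
    upper_72 = (-3.0 - 5.0 * x) / (8.0 * omega + 3.0)
    return max(lower_56, lower_73), min(upper_71, upper_72)

def mass_and_charge(omega, z0, l, E):
    """M and Q^2 from Eq. (37) with k = 1, written in terms of E = z0^2 eps^2."""
    x = (z0 / l) ** 2
    M = z0**3 * (2.0 + 8.0 * x / 3.0 + 8.0 * omega * E / 3.0)
    Q2 = z0**6 * (1.0 + 5.0 * x / 3.0 + (8.0 * omega + 3.0) * E / 3.0)
    return M, Q2

def h(z, M, Q2, l=1.0):
    """Metric function of Eq. (8) with k = 1."""
    return 1.0 + (z / l) ** 2 - M / z**3 + Q2 / z**6

def roots_of_F(M, Q2):
    """Roots r1 < r2 of F(z) from Eqs. (45) and (46)."""
    disc = math.sqrt(25.0 * M**2 - 64.0 * Q2)
    return ((5.0 * M - disc) / 4.0) ** (1.0 / 3.0), ((5.0 * M + disc) / 4.0) ** (1.0 / 3.0)

omega, z0, l = -0.9, 1.0, 1.0           # example values quoted in the text
lo, hi = eps2_window(omega, (z0 / l) ** 2)
E = 0.5 * (lo + hi)                     # hypothetical choice inside the window
M, Q2 = mass_and_charge(omega, z0, l, E)
r1, r2 = roots_of_F(M, Q2)
print("window:", lo, "< z0^2 eps^2 <", hi)
print("M =", M, " Q^2 =", Q2, " r1 =", r1, " r2 =", r2)
print("h(z0) =", h(z0, M, Q2, l), " h(r1) =", h(r1, M, Q2, l))
```

With these inputs the window has a relative width of roughly one part in a thousand, which illustrates the degree of fine-tuning of the brane energy density.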
Fig. 5. h(z) in six-dimensional AdS–Reissner–Nordström bulk with the brane located at z = 1. Notice that the
singularity is shielded by two horizons.
$$\frac{Q^2}{z_0^6} = \frac{1}{z_0^2} + \frac{2}{l^2} + 3\omega\,\varepsilon_{(5)}^2 + \varepsilon_{(5)}^2, \tag{92}$$
arriving to the following restrictions:
Fig. 6. Shortcuts for several initial velocities in six-dimensional AdS–Reissner–Nordström bulk. Notice that there
is a threshold initial velocity for which the graviton cannot return to the brane and falls into the event horizon.
$$\frac{z_0}{l} < \frac{\sqrt{15}}{3},$$
so we choose l = 1 and z0 = 1.
From (94) the energy must fulfill
$$\frac{632}{441} + \frac{16}{441}\sqrt{127} < \varepsilon_{(5)}^2 < \frac{24}{13},$$
then we choose $\varepsilon_{(5)} = \sqrt{461/250}$.
In Fig. 7 we can see h(z) according to the previous conditions. As in the six-dimensional
case, the singularity is protected by an event horizon, and the brane is at z = z0 = 1.
In Fig. 8 we show several graviton paths obtained under the previous conditions for
several initial velocities showing that shortcuts appear when we choose the parameters
following the analysis shown in this section analogously to the six-dimensional case.
Fig. 7. h(z) in five-dimensional AdS–Reissner–Nordström bulk with the brane located at z = 1. We see that the
singularity is protected by two horizons.
Fig. 8. Shortcuts for several initial velocities in five-dimensional AdS–Reissner–Nordström bulk. We see that
there is a threshold initial velocity for which the graviton cannot return to the brane and falls into the event horizon.
6. Conclusions
In this paper we have shown that the shortest graviton path is governed by just one
equation involving the “radial” extra coordinate. We have also seen that a symmetry in the
“angular” extra coordinate has permitted us to consider curved spatial sections.
As pointed out in Sections 4 and 5.1, the cases k = 0 and k = −1 display no shortcuts,
leading to the conclusion that k = 1 is a necessary condition for the existence of shortcuts,
which completely agrees with the observation of Chung and Freese [9].
Acknowledgements
This work has been supported by Fundação de Amparo à Pesquisa do Estado de São
Paulo (FAPESP) and Conselho Nacional de Desenvolvimento Científico e Tecnológico
(CNPq), Brazil.
References
Abstract
We construct new Ginsparg–Wilson fermions for QCD by inserting an approximately chiral Dirac
operator—which involves ingredients of a perfect action—into the overlap formula. This accelerates
the convergence of the overlap Dirac operator by a factor of 5 compared to the standard construction,
which inserts the Wilson fermion as a point of departure. Taking into account the effort for treating
the improved fermion, we are left with a total computational overhead of about a factor 3. This
remaining factor is likely to be compensated by other virtues; here we show that the level of locality
is clearly improved, so that the exponent of the correlation decay is doubled. We also show that
approximate rotation invariance is drastically improved, but a careful scaling test has to be postponed.
2002 Elsevier Science B.V. All rights reserved.
1. Introduction
Over the recent years, substantial progress has been achieved in the long-standing
problem of constructing a formulation of chiral fermions on the lattice. It turned out that
there is a particularly harmless way to break the full chirality of the lattice Dirac operator,
so that the physical properties related to chirality are still represented correctly. This
breaking term is sufficient to circumvent the Nielsen–Ninomiya No-Go theorem [1], which
refers to full chirality, in the sense that the lattice Dirac operator D anti-commutes with γ5 .
0550-3213/02/$ – see front matter 2002 Elsevier Science B.V. All rights reserved.
PII: S0550-3213(02)00789-7
224 W. Bietenholz / Nuclear Physics B 644 (2002) 223–247
1 Of course, this property holds for any fixed GW kernel R, if one uses the suitably generalized overlap
formula [5].
Dov ≈ D0 . (4)
This small alteration corresponds to a chiral correction (in the sense of the GWR).2
Thus the first issue is to construct an approximate GW fermion “by hand”. At this
point, we recall that also perfect [2] and classically perfect fermions [10] solve the GWR.
Their construction is very tedious, but in the present context we can use a relatively simple
approximation, namely, a hypercube fermion (HF). The mass renormalization is still quite
strong for simple HFs [11,12], which is a problem in their direct application [13]. However,
here we reach the chiral limit by inserting the HF into the overlap formula.
Of course, the (classically) perfect fermion has further virtues in addition to chirality.
In particular its scaling behavior is (almost) free of lattice artifacts, i.e., dimensionless
ratios of physical observables are (almost) independent of the lattice spacing. Moreover,
the observables are also rotation-invariant for perfect fermions, and rotation symmetry
is approximated very well for the truncated perfect hypercube fermions, as the pion
dispersion shows [11]. Since the overlap formula modifies the HF just a little—see relation
(4)—we expect the good scaling and rotation behavior to persist under the chiral correction.
Then we obtain an improved overlap fermion.
There is yet another virtue to be expected based on relation (4): the hypercube fermion
is short-ranged—its free couplings are restricted to a unit hypercube on the lattice—hence
we also expect a high level of locality for the overlap-HF (given by Dov if we insert
D0 = DHF ). Due to the modest alteration, the long range couplings can be turned on
just slightly, hence their exponential decay will be fast. The Wilson fermion is also short-
ranged, but it changes drastically in the transition to the Neuberger fermion, so we do
not have the above reason to expect a good locality. Indeed, even for the free Neuberger
fermion the decay of couplings (in the separation of fermion and anti-fermion) is a rather
slow exponential [5].
All these expected virtues of the overlap-HF have been tested and impressively
confirmed in the 2-flavor Schwinger model [14]. Now we carry on this program to QCD.
In Section 2, we first describe the construction of a suitable HF in d = 4, and we illustrate
its proximity to a GW fermion by evaluating typical fermionic spectra on small lattices.
In Section 3 we discuss the polynomial evaluation of the inverse square root in Eq. (3).
In Section 4 we investigate the speed of the numeric evaluation of the overlap formula.
We show that for our HF a polynomial of a low order is sufficient to approximate the
sign function or the inverse square root to a high accuracy. For the Neuberger fermion
the same accuracy requires a polynomial of a much higher degree. Section 5 gives results
for the condition numbers and their impact on the convergence rate, now proceeding to
larger lattices. Section 6 presents a comparative study of the level of locality, i.e., of the
exponential decay of the maximal correlation at large distances, and Section 7 compares
the violations of rotation invariance. Section 8 contains a summary and our conclusions.
2 Similarly, one could also insert an approximately chiral $D_0$ in the 4d space of domain wall fermions [5,9].
For free or perturbatively interacting fermions, perfect actions can be constructed an-
alytically [15]. In the case of non-perturbative interactions, this is possible only numeri-
cally and only in the classical approximation.3 The renormalization group transformation
leading to the (quantum) perfect action would involve effectively a continuum functional
integral. Still its existence is of conceptual interest. It implies, for instance, that also a
topological lattice charge without lattice artifacts exists [17], and that supersymmetry can
in principle be represented continuously on the lattice [18]. Similarly, the perturbatively
perfect action shows that, e.g., the axial anomaly is reproduced correctly [19], but only a
modest improvement persists on the non-perturbative level [13,20]. What is (in principle)
accessible and promising for simulations is an approximation to the classically perfect ac-
tion [21], which works well for fermions in two dimensions [22]. However, the difficult
construction and application in d = 4 is still under investigation [23]. In that case, there is
also a potential for an excellent scaling, but in order to obtain a sufficient chiral quality, it
seems that the group working on it (the Bern Collaboration) also depends on the concept
of Refs. [5,14] (chiral correction).
In the free case, the truncated perfect HF has still an excellent scaling behavior [11,24],
if the renormalization group parameters are optimized for locality. Hence we use the free
HF as the point of departure in our attempt to construct a HF, which is an approximate
GW fermion, and which is also promising with respect to scaling and approximate rotation
invariance. A few elements for its gauging are then added such that the GWR is violated
only modestly at the coupling strength of interest. In this section, we are going to describe
this construction step by step. A synopsis of this construction has been anticipated in Refs.
[25].
The concept formulated in Refs. [5,14] has also been adapted in Ref. [26], where some
progress is reported, although a very simple (“planar”) operator D0 was used, which is
quite far from a GW fermion even in the free case. With this “planar overlap operator” some
results for the finite size scaling of the chiral condensate of Ref. [27] were reproduced,
and the effects of instantons in the chiral symmetry breaking were reconsidered [28].
Other very simple non-standard operators D0 were used in Refs. [29,30]. In contrast, a
very complicated approximate Ginsparg–Wilson fermion was constructed in Ref. [31] by
introducing many parameters and tuning them for a minimal GWR violation at a specific
value of β. This corresponds to the first part of our program (constructing an approximate
GW fermion by hand), but since the overlap formula is not used, chirality is still not exact.
Moreover, there are no ingredients in favor of improving other properties. However, that
work reports some gain if a specific improved gauge action is used. Different improved
gauge actions were applied in Refs. [23,32]. The use of improved gauge actions is also on
our agenda, but in the present work we always use the standard plaquette gauge action.
This allows us to observe unambiguously the progress due to the improved Dirac operator.
3 For recent work on perfect actions for semi-classical effective actions, see Ref. [16].
All the considerations below are based on quenched QCD configurations on periodic
lattices of sizes $4^4$, $8^4$ (Sections 2, 3, 4) and $12^4$ (Sections 5, 6 and 7), and beyond Section 2
we always use β = 6.
As we mentioned above, we start from the couplings of the free perfect fermion. It
is optimized for locality, and then truncated to a HF by imposing periodic boundary
conditions. We write the resulting Dirac operator as
$$D(x - y) = \rho_\mu(x - y)\,\gamma_\mu + \lambda(x - y) \qquad (x, y \in \mathbb{Z}^4,\ |x_\nu - y_\nu| \le 1), \tag{5}$$
and the couplings of the vector term ρµ and the scalar term λ are given in Ref. [11], Table
1. Note that the components of x − y are restricted to −1, 0, 1, and that ρµ is odd in the
µ-direction and even otherwise, while λ is entirely even. In the free case, this HF is an
excellent approximation to a Ginsparg–Wilson fermion [5].
Our first HF for QCD, with a Dirac operator of the form
D(x, y, U ) = ρµ (x, y, U )γµ + λ(x, y, U ) (U : compact gauge field) (6)
is now obtained by “minimal gauging” of the free HF: the sites, which are coupled in the
HF formulation, are connected by the shortest lattice paths only. The gauging is done by simply
attaching the free coupling to these paths, divided into equal parts where several shortest
paths exist.
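The bookkeeping behind this prescription can be sketched as follows. For an offset $x - y$ with components in $\{-1, 0, 1\}$, the shortest lattice paths are the orderings of the unit steps along the non-zero components, so an offset with $m$ non-zero components has $m!$ shortest paths and each one carries the fraction $1/m!$ of the free coupling. The function names are hypothetical and the coupling value is a placeholder, not an entry of the table in Ref. [11].

```python
from itertools import permutations

def shortest_paths(offset):
    """All shortest lattice paths to `offset`, each given as a tuple of (direction, sign) steps."""
    steps = [(mu, s) for mu, s in enumerate(offset) if s != 0]
    return list(permutations(steps))

def path_weights(offset, coupling):
    """Attach the free coupling to the shortest paths, divided into equal parts."""
    paths = shortest_paths(offset)
    return [(path, coupling / len(paths)) for path in paths]

# Example: a planar-diagonal offset has two shortest paths (mu then nu, or nu then mu).
for path, weight in path_weights((1, 1, 0, 0), coupling=1.0):   # placeholder coupling
    print(path, weight)
```

In a gauged operator each such path would be dressed with the product of link variables along it; that step is omitted here.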
This simple prescription already provides an excellent pion dispersion relation [11].
On the other hand, with such a gauging the HF suffers from a strong additive mass
Fig. 1. The fermionic spectrum for the minimally gauged HF and for the Wilson fermion, in a typical configuration
at β = 5 on a $4^4$ lattice.
Fig. 2. Typical spectra of the minimally gauged HF at β = 5.4 (on the left) and at β = 5.6 (on the right). In the
latter case, the overlap formula seems to be applicable with mass parameter µ ≈ 1.4.
4 Note that in such a situation it could be misleading to consider only the spectrum of $A_0^\dagger A_0$, as is sometimes
done in the literature. For the HF in Fig. 1, that spectrum alone would look quite satisfactory, even though the
physically crucial left arc is missing.
5 This is a sensible definition of the topological charge on the lattice for any Ginsparg–Wilson fermion [4].
Up to moderate coupling strength, it is often close to the geometrical charge [35]. In general, it also tends to
agree with the charge identified from cooling the SU(N ) gauge configurations, especially at increasing N [36].
In the case of classically perfect fermions it corresponds to the classically perfect charge [10], and analogously
for (quantum) perfect fermions.
eigenvalues are mapped to the right-hand side, so that mass renormalization reappears.
Also further generalizations of R (beyond the restriction to one site) do not help in this
situation.
A question of interest is at which coupling strength the applicability of the overlap
formula with simple operators like $D_{\rm HF}$ sets in. Typical spectra on a $4^4$ lattice suggest
that for instance at β = 5.4 the coupling is still too strong, but at β ≃ 5.6 we are about to
reach safer ground, see Fig. 2. The same is true for the Wilson fermion (see first
Ref. in [25], Fig. 8). On larger lattices the minimal β is still likely to rise to about β ≃ 5.7.
Worried about the strong mass renormalization, we first want to move our HF towards
the chiral limit. We do so by amplifying each link variable by a factor 1/u,
$$U_\mu(x) \to \frac{1}{u}\,U_\mu(x), \qquad u \lesssim 1. \tag{7}$$
The idea is to compensate the (mean) link suppression due to the gauge field. This can be
viewed as a generalization of the critical hopping parameter used for Wilson fermions, but
it is also related to the spirit of “tadpole improvement” [37].
Fig. 3 illustrates that criticality is reached with u ≃ 0.8 at β = 6.⁶ We see that this
already provides a decent approximation to a GW fermion. This is remarkable, because we
Fig. 3. HF spectra at β = 6 on a $4^4$ lattice at critical link amplification (u = 0.8) for three configurations.
6 The absence of the arc close to 0 is due to the small size of the lattice; it is not due to the choice of the lattice
Dirac operator.
approach the chiral limit in the most economic way, by staying within the framework of
minimal gauging. We did not introduce additional lattice paths yet, hence Step 2 does not
require any computational effort (once the critical value of u is determined).
We now want to improve the chiral quality of our HF further by going beyond minimal
gauging. As a non-minimal element we introduce fat links. For a given configuration, we
substitute each link variable according to the following scheme
$$\text{link} \to (1 - \alpha)\cdot\text{link} + \frac{\alpha}{6}\sum\text{staples}, \qquad \alpha \in \mathbb{R}, \tag{8}$$
before evaluating the Dirac operator. This is still an economic tool. The substituted link
variable on the right-hand side is not mapped back onto the gauge group, in contrast to the
APE blocking [39], and we do not iterate the substitution (8).
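The substitution (8), together with the link amplification (7), can be illustrated on a toy lattice. The sketch below is only a minimal example under loud assumptions: it uses compact U(1) phases instead of the SU(3) links of QCD, a tiny periodic $2^4$ lattice, and it applies the amplification before the fattening, an ordering that is chosen here for illustration and is not specified by the text. As in the text, the fat link is not projected back onto the gauge group and the substitution is applied once.

```python
import cmath
import itertools
import random

L, D = 2, 4  # toy lattice extent and dimension (assumed for illustration)

def shift(x, mu, s):
    """Site reached from x by s steps in direction mu, with periodic boundaries."""
    y = list(x)
    y[mu] = (y[mu] + s) % L
    return tuple(y)

def staple_sum(U, x, mu):
    """Sum of the 6 staples around the link (x, mu) in four dimensions."""
    total = 0j
    for nu in range(D):
        if nu == mu:
            continue
        # staple through x+nu: x -> x+nu -> x+nu+mu -> x+mu
        up = U[x, nu] * U[shift(x, nu, 1), mu] * U[shift(x, mu, 1), nu].conjugate()
        # staple through x-nu: x -> x-nu -> x-nu+mu -> x+mu
        xm = shift(x, nu, -1)
        down = U[xm, nu].conjugate() * U[xm, mu] * U[shift(xm, mu, 1), nu]
        total += up + down
    return total

def amplify(U, u):
    """Link amplification of Eq. (7): every link is divided by u."""
    return {key: link / u for key, link in U.items()}

def fatten(U, alpha):
    """Fat-link substitution of Eq. (8), applied once and without projection."""
    return {(x, mu): (1.0 - alpha) * U[x, mu] + alpha / 6.0 * staple_sum(U, x, mu)
            for (x, mu) in U}

sites = list(itertools.product(range(L), repeat=D))
random.seed(0)
U = {(x, mu): cmath.exp(1j * random.uniform(-0.3, 0.3)) for x in sites for mu in range(D)}
V = fatten(amplify(U, 0.76), 0.3)   # u = 0.76 and alpha = 0.3 as quoted for beta = 6
print(len(V), "fat links; sample modulus:", abs(V[sites[0], 0]))
```

On a trivial configuration (all links equal to 1) the fattening leaves every link unchanged, since the six staples contribute exactly $\alpha$.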
As a first observation, we note that the mass renormalization is enhanced for
increasing α. This may seem counter-intuitive if one imagines that a positive α makes
the configurations “appear smoother”. However, a more precise picture confirms this
observation: the only coupling of range 0 is λ(0, 0, 0, 0) = 1.853 . . . The rest of the scalar
term is negative, and in the free case we have $\sum_x \lambda(x) = 0$. The gauge field now suppresses
the negative contributions, whereas λ(0, 0, 0, 0) remains unchanged, so that a positive mass
sets in (if we keep u = 1). This is already true for minimal gauging, but if we add fat
links with α > 0, this suppression of the negative part gets even stronger. A fraction of
the negative couplings is now attached to staples instead of single links, and the staple
suppression corresponds to the third power of the mean link suppression.
If we wanted to use fat links to move closer to the chiral limit, we would have to take α < 0.
Then the critical value of u rises, and for some strongly negative α (far below −1) it even
arrives at 1, so that criticality could, in principle, be realized solely by means of fat links.
Fig. 4. The effect due to the variation of staple coefficient α in the uncritical HF (on the left). On the right: the
spectrum of the critical HF with and without fat links (everything at L = 4, β = 6).
However, in this case the rest of the spectrum is very far from a GW circle—in particular,
the upper and lower arc are far outside the GW circle—so we do not recommend negative
values of α. Fig. 4 (on the left) shows this effect, and we also see that positive α are
more adequate to make the shape of the spectrum circle-like. So what is really favorable
to improve the proximity to a GW fermion is a positive α along with an adapted (i.e.,
decreased) critical value of u. A good choice is α = 0.3, which requires u = 0.76 at
β = 6. This reduces the radial fluctuation of the eigenvalues—the eigenvalues move closer
together—and the upper and lower arc move closer to the GW circle. This is shown in
Fig. 4 (on the right), which compares the spectra with and without fat links for the same
configuration.
The above picture for the mass renormalization ignores the role of the vector term
ρµ (x, y, U ). In fact, some tests confirmed that it has only a very modest influence on the
location of the arc on the left-hand side (and also on the right-hand side). This location is
essentially determined by the scalar term, but the vector term is crucial for the height of
the upper and lower arc, i.e., the term ρµ γµ is responsible for the imaginary part of the
spectrum.
This property will now be used for a further improvement, still without extra
computational effort. We introduce different link amplification factors for the vector term
and the scalar term, since they play a different role. We first keep the critical factor 1/u as
an overall link amplification, but then we multiply a link suppression factor v only in the
vector term. So now we multiply the links as follows:
$$U_\mu \to \frac{v}{u}\, U_\mu \ \ \text{in } \rho_\mu(x, y, U), \qquad U_\mu \to \frac{1}{u}\, U_\mu \ \ \text{in } \lambda(x, y, U), \tag{9}$$
where u ≲ v ≲ 1. Still the fat links are useful to suppress the radial fluctuations of the
eigenvalues, so we stay with α = 0.3, u = 0.76, and the suitable vector term suppression
amounts to v = 0.92. Now the upper and lower arc also follow the GW circle, and we
therefore obtain a very satisfactory spectrum, see Fig. 5. It shows again the spectrum on a
4^4 lattice, but this time we also include part of the spectrum of a typical configuration on
an 8^4 lattice. From there we show the 100 eigenvalues with the smallest real parts, thus we
also visualize how the spectrum “continues” around zero.
Fig. 5. The HF on a 4^4 lattice with critical link amplification, fat links and a suppression of the vector term. We
also show the “continuation” around 0 on an 8^4 lattice with the same parameters, i.e., u = 0.76, α = 0.3, v = 0.92.
An obvious candidate for a next step beyond minimal gauging is the clover term. We
performed a sequence of tests with it being added to the HF versions discussed above. We
varied the clover coefficient and also considered both signs, but from spectra on 4^4 lattices
we did not arrive at a clear further improvement of the GW approximation in this way.
A positive coefficient generally has the effect of pulling the spectrum closer to the real axis,
as was observed before for the Wilson fermion [38], hence the optimal parameter v rises
a little. A positive clover coefficient does improve the arc around zero (which appears on
the 8^4 lattice) a little bit, but it distorts the rest. As an example, we show the effect of the
clover coefficient 0.15 in Fig. 6. One could argue that it is precisely the left arc which is
physically crucial. However, we are going to insert our HF into the overlap formula, so we
end up with an exact GW fermion anyhow. In view of the convergence rate in the iterative
evaluation of this formula, the maximal deviation of the HF spectrum from the GW circle
matters, so one should not limit the attention to the left arc only.
Indeed, the systematic study of the transition to the overlap fermion on larger lattices—
which will be presented in Section 5—shows that a clover term with a small coefficient
may help a little to speed up the convergence. So we are going to include it on the 12^4
lattice (used in Sections 5, 6 and 7), since it is computationally cheap anyhow.
We first consider approximations of the sign function, which is done in one way or the
other in most of the literature. One writes the overlap formula as
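$$D_{\rm ov} = \mu \left[ 1 + \gamma_5 \frac{H}{\sqrt{H^2}} \right] = \mu \left[ 1 + \gamma_5\, \epsilon(H) \right], \qquad H := \gamma_5 (D_0 - \mu) = H^\dagger, \tag{10}$$
and approximates the sign function
$$\epsilon(x) = \begin{cases} 1, & x > 0, \\ -1, & x < 0, \end{cases} \tag{11}$$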
by a polynomial. As an alternative to the Chebyshev polynomials that we are going to use
here, also an “optimal rational approximation” has been suggested [41] and it was applied,
for instance, in Refs. [7,42].
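For orientation, formula (10) can be evaluated exactly on a small matrix by diagonalizing H, which makes it easy to check what any polynomial approximation should converge to. The sketch below is an illustration only: it takes an arbitrary Hermitian matrix in place of γ5 (D0 − µ), a diagonal γ5, and the hypothetical helper names `sign_of_hermitian` and `overlap_operator`. In this normalization the resulting operator obeys D γ5 + γ5 D = (1/µ) D γ5 D, and its spectrum lies on a circle of radius µ centered at µ.

```python
import numpy as np

def sign_of_hermitian(H):
    """epsilon(H) = H / sqrt(H^2), computed from the eigen-decomposition of H."""
    w, V = np.linalg.eigh(H)
    # Scale each eigenvector column by the sign of its eigenvalue.
    return (V * np.sign(w)) @ V.conj().T

def overlap_operator(H, gamma5, mu=1.0):
    """D_ov = mu * (1 + gamma5 * epsilon(H)), cf. Eq. (10)."""
    n = H.shape[0]
    return mu * (np.eye(n) + gamma5 @ sign_of_hermitian(H))
```

This exact route scales like the cube of the matrix dimension, so it is only a reference for tiny test cases; on realistic lattices one has to rely on approximations of the sign function, as discussed in the text.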
The eigenvalue distribution of H determines the interval which is relevant for the
approximation. Since this section is only meant to illustrate what the convergence rate
depends on, we consider µ = 1 for simplicity. (Later, when we study the convergence rate
in detail, optimized mass parameters µ will be inserted.)
In Fig. 7(a) we show the eigenvalue histograms for the cases of the Wilson fermion,
H_W = γ5 (D_W − 1), and of our preferred HF on small lattices (α = 0.3, u = 0.76, v = 0.92),
still at β = 6, on a 4^4 lattice. We see that the spectrum of H_HF is already sharply peaked at
±1, whereas for H_W the distribution is very broad.
Fig. 7. (a) The eigenvalue histograms for typical spectra of H_HF (grey) and H_W (black) at β = 6; (b) the same
after rescaling so that all eigenvalues have absolute values ≤ 1.
In order to make the polynomial approximation directly applicable for all eigenvalues,
we first have to rescale the spectra so that they are entirely confined to the interval [−1, 1],
see, for instance, Ref. [43]. The outcome of the minimal rescaling (division by the largest
absolute value of an eigenvalue) is shown in Fig. 7(b). We see that the spectrum of H_HF
is not affected too much: the peaks are moved to about ±0.7, but there is still a large
gap around zero. This gap is important, because any polynomial approximation of the sign
function is plagued by its worst errors near the discontinuity at zero (remember for instance
the notorious “Gibbs phenomenon” of the Fourier expansion). On the other hand, for H_W
there is (after rescaling) a considerable eigenvalue density around zero. This shows that it
requires much more effort to transform the Wilson fermion into an overlap fermion.
To demonstrate this prediction with an example, we use a linear combination of
Chebyshev polynomials with maximal degree 21 to approximate the sign function in the
overlap formula (10). We consider again a typical configuration at β = 6 on a 4^4 lattice.
Fig. 8 shows the resulting spectra if we start from the Wilson fermion resp. the HF, and we
see that the latter is clearly superior.
Fig. 8. The spectra of approximate overlap fermions on a 4^4 lattice, where the sign function is approximated by
a polynomial of degree 21. We use a configuration typical at β = 6 and start from D_0 = D_W (stars) resp. from
D_0 = D_HF (crosses).
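The role of the spectral gap can be made concrete with a small numerical experiment. The sketch below is not the procedure used for Fig. 8; it assumes a rescaled spectrum confined to δ ≤ |x| ≤ 1 and builds sign(x) ≈ x P(x²), where P is the Chebyshev interpolant of the inverse square root on [δ², 1] (so a degree n for P corresponds to degree 2n + 1 in x). The names `chebyshev_sign` and `max_sign_error` are hypothetical.

```python
import numpy as np
from numpy.polynomial import Chebyshev

def chebyshev_sign(degree, delta):
    """Return x -> x * P(x^2), an approximation of sign(x) for delta <= |x| <= 1.

    P interpolates 1/sqrt(t) at Chebyshev points of [delta^2, 1].
    """
    P = Chebyshev.interpolate(lambda t: 1.0 / np.sqrt(t), degree,
                              domain=[delta**2, 1.0])
    return lambda x: x * P(x**2)

def max_sign_error(degree, delta, samples=2001):
    """Largest deviation from +1 on [delta, 1] (the error is odd in x)."""
    x = np.linspace(delta, 1.0, samples)
    return np.max(np.abs(chebyshev_sign(degree, delta)(x) - 1.0))

if __name__ == "__main__":
    for delta in (0.5, 0.2, 0.05):
        print(delta, [max_sign_error(n, delta) for n in (10, 20, 40)])
```

At fixed degree the error shrinks rapidly as the gap δ widens, which is the qualitative reason why a spectrum peaked away from zero is easier to turn into an overlap fermion than a broad one.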
function. Hence, the question here is how small δ is going to be. Actually the question is
the same as in Subsection 3.1 (and also the rescaling is the same), because A_0^† A_0 = H^2. So
we can refer again to Fig. 7 where one just has to square the eigenvalues.
Also the comparison of spectra looks very similar to Fig. 8, so we do not repeat it but
turn to a systematic study of the convergence to an overlap fermion.
4. Convergence rate
7 The results of this and the following sections have been summarized before in Ref. [44].
Fig. 9. The maximal radial deviation from the normalized Ginsparg–Wilson circle, evaluated from the full
spectrum on a 4^4 lattice. We compare the HF at µ = 1 with the Wilson fermion at µ = 1.6, in both cases expanding
the inverse square root as well as the sign function.
expanded in H . Hence the former actually picks up a factor of 2 in the comparison of the
polynomial degree, which makes the sign function look favorable, especially for the HF.
Since the volume considered in Fig. 9 is quite small, we now proceed to an 8^4 lattice.
Here we cannot evaluate the full spectrum any more, but with the Arnoldi algorithm we
can identify a selected set of eigenvalues. We thus evaluated the 100 eigenvalues with the
least real part (as shown in Fig. 5 before), because they are physically most relevant, and
we measured for this subset again the maximal radial deviation. In Fig. 10 we plot this
maximal deviation, and also the mean deviation, again as a function of the polynomial
degree. For the Wilson fermion we consider two options: we take either the free hopping
parameter and optimize the mass parameter to µ = 1.6, or we take µ = 1, as in the case of
the HF, and insert the critical Wilson fermion. The goal is to make sure that we compare
the HF really to the best application of the Wilson fermion. However, as Fig. 10 shows, the
difference between the different ways to use the Wilson fermion is tiny (footnote 8).
Due to the arbitrary truncation at just 100 eigenvalues the exponential behavior is not as
clean as in the case of the full spectrum. However, we see that the behavior is very similar,
and the improvement of the HF clearly persists in the same magnitude.
To make these observations more quantitative, we discuss as an example the sign
expansion on the 4^4 lattice, which has a very precise exponential behavior. For some
configuration typical at β = 6 we measured for the Wilson fermion the maximal deviation
$d_W^{\max}(n) = \exp(-0.134\,n)$, where n is the degree of the Chebyshev polynomial, and for
the mean deviation we found $d_W^{\rm mean}(n) = 0.13 \exp(-0.134\,n)$, hence the exponential factor
is the same. For the same configuration, the following HF deviations were obtained:
$d_{\rm HF}^{\max}(n) = \exp(-0.737\,n)$, and $d_{\rm HF}^{\rm mean}(n) = 0.1 \exp(-0.737\,n)$.
Fig. 10. The maximal radial deviation and the mean radial deviation from the normalized Ginsparg–Wilson circle,
evaluated from the 100 energy eigenvalues with the lowest real parts (physical branch) on a lattice volume 8^4.
We compare the inverse square root expansion for the HF and for the Wilson fermion. In the latter case we also
compare the case of the free hopping parameter and a suitable mass parameter of µ = 1.6 with the critical Wilson
fermion at µ = 1. They behave very similarly, and the HF converges much faster.
8 For the Wilson fermion the variation of µ and the variation of the hopping parameter are equivalent, so we
observe here that optimization of the convergence leads very close to criticality.
We mention two ways to arrive at conclusions from these numbers:
• First, we could fix some degree n which we consider affordable in a simulation. The
precision of the GW approximation compares as
$$\frac{d_W^{\max}(n)}{d_{\rm HF}^{\max}(n)} \simeq e^{0.6\,n}. \tag{12}$$
For realistic degrees like n = 20, . . . , 100 this ratio of the accuracies takes a very
considerable magnitude.
• On the other hand, we could fix a certain accuracy $d^{\max}$ which we consider necessary
to trust the chiral quality of the approximated GW fermion. Then the polynomial
degrees, which are required to provide this precision, compare as
$$\frac{n_W}{n_{\rm HF}} \simeq 5.5. \tag{13}$$
This factor may be regarded as the effective gain of the HF due to the faster
convergence, since the computational effort is essentially proportional to n.
The fluctuations of these ratios over different configurations are modest, even though
$d^{\max}$ may vary significantly. For a systematic statistical study we now move on to larger
lattices.
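The arithmetic behind Eqs. (12) and (13) follows directly from the two fitted decay rates quoted above. The short sketch below only reproduces that arithmetic; it assumes the pure exponential form with unit prefactor for the maximal deviations, and the names `RATE_W`, `RATE_HF`, `accuracy_ratio` and `degree_for_accuracy` are introduced here for illustration.

```python
import math

RATE_W = 0.134   # decay rate of d_W^max(n), Wilson fermion (from the text)
RATE_HF = 0.737  # decay rate of d_HF^max(n), hypercube fermion (from the text)

def accuracy_ratio(n):
    """d_W^max(n) / d_HF^max(n) at fixed polynomial degree n, cf. Eq. (12)."""
    return math.exp((RATE_HF - RATE_W) * n)

def degree_for_accuracy(rate, d_max):
    """Real-valued degree n at which exp(-rate * n) reaches d_max."""
    return -math.log(d_max) / rate

# Ratio of required degrees at fixed accuracy, cf. Eq. (13);
# it does not depend on the chosen d_max.
degree_ratio = RATE_HF / RATE_W

if __name__ == "__main__":
    print(accuracy_ratio(20), accuracy_ratio(100))
    print(degree_ratio)
```

The exponent in Eq. (12) is the difference of the two rates, 0.737 − 0.134 ≈ 0.6, while the factor in Eq. (13) is their quotient.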
5. Condition numbers
After the explicit convergence study of the last section, we now turn our attention to the
condition numbers of the operators A_0^† A_0, which are crucial for the convergence rate. This
allows us to proceed to larger lattices of size 12^4, still at β = 6. We first show a history
for the condition numbers for the HF and the Wilson fermion in Fig. 11 (on top). Here we
adapted the parameters so that they are optimal on the larger lattice. For the HF the new
set of optimal parameters reads
Fig. 11. On top: a history of the condition numbers of A_0^† A_0 at β = 6 on a 12^4 lattice. Below: the corresponding
history of the upper and lower bound of the spectra. It shows that the improvement of the condition number of
the HF is essentially due to the upper bound.
Fig. 12. The precision of the overlap formula approximated by Chebyshev polynomials of the moderate degree
of 60. As a measure for the precision, we show the accuracy of the function fmax (r = 24) (cf. Section 6) for a set
of configurations at L = 12, β = 6. We plot this accuracy against the condition number, in order to illustrate their
monotonic relation, and the progress of the HF over the Wilson fermion.
at µ = 1.4—which is optimal with respect to locality [6]—differ only little from the result
in Fig. 11.
We see that the HF condition number is improved typically by one to two orders of
magnitude. Since it is defined by the ratio of the upper bound divided by the lower bound,
it is instructive to consider these bounds separately. Their histories are shown in Fig. 11
(below). They reveal that the improvement of the condition number is almost entirely an
effect due to the upper bound.
In Fig. 12 we illustrate explicitly how the condition number translates into an
accelerated convergence. As a (somewhat arbitrary) measure for the speed of convergence,
we consider Chebyshev polynomials at some moderate degree, n = 60, and measure the
deviation of the function fmax (r = 24) from the exact result (obtained from huge values of
n). (The quantity fmax (r = 24) represents the maximal correlation over the largest distance
on our lattice; an explicit definition will be given in Section 6.) A polynomial of degree 60
may be affordable in simulations, and we see that it typically approximates fmax (r = 24)
already to a high accuracy for the HF, but not for the Wilson fermion. For the latter we see
that µ = 1.64 is better for the condition number than the locality optimal mass parameter
of µ = 1.4.
However, it is important to note that most practical applications of overlap fermions are
performed such that the lowest few modes are projected out and treated separately. Then
the above polynomial evaluation concerns the rest of the eigenspace. This method helps a
lot, because often very few modes are responsible for a slow convergence. However, their
separate treatment is also tedious, because it requires a very accurate determination of the
corresponding eigenfunctions. Hence the number of modes to be projected out is usually
not more than about 15; beyond that the spectrum becomes quite dense, so projecting out
further eigenvalues raises the remaining lower bound only very little.
Fig. 13. The history of the “higher condition numbers” c2 , . . . , c20 (defined in Eq. (15)) for the operator A_0^† A_0,
built from the HF (at µ = 1.3) and from the Wilson fermion (at µ = 1.64).
To do justice to this situation, we introduce “higher condition numbers” ck , which are
defined by the ratio
$$c_k = \frac{\text{largest eigenvalue}}{k\text{th eigenvalue from below}}. \tag{15}$$
If one projects out 15 eigenvalues, for example, then c16 is relevant for the convergence
of the polynomial evaluation. In Fig. 13 we show the histories for the higher condition
numbers c2 , . . . , c20 , for the HF and the Wilson fermion. This plot confirms that around
k = 15 the eigenvalue density is large already, and beyond that the spectrum becomes even denser.
Hence it is hardly motivated to increase k much further. We also see that these histories
are much smoother compared to c1 (which was shown in Fig. 11), so here we are able
to take sensible mean values. The results are shown in Fig. 14, and we see that in the
relevant regime the improvement factor for the HF amounts to about 25. The convergence
rate behaves like the square root of the relevant condition number (footnote 9). Hence we gain a factor
of ≈ 5. This is amazingly consistent with the explicit result obtained on the small lattices
(but from the full spectrum) in Section 4.
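Definition (15) and the square-root rule for the gain are simple to evaluate once the low-lying and the largest eigenvalues are known. The following sketch assumes that a list of positive eigenvalues of A_0^† A_0 is already available (for instance from an Arnoldi or Lanczos run, which is not shown); the function names `higher_condition_numbers` and `convergence_gain` are hypothetical.

```python
import numpy as np

def higher_condition_numbers(eigenvalues, k_max=20):
    """Return the array (c_1, ..., c_kmax) of Eq. (15).

    Entry k-1 holds c_k = (largest eigenvalue) / (k-th eigenvalue from below).
    """
    ev = np.sort(np.asarray(eigenvalues, dtype=float))
    if ev[0] <= 0.0:
        raise ValueError("eigenvalues of a positive operator are expected")
    k_max = min(k_max, ev.size)
    return ev[-1] / ev[:k_max]

def convergence_gain(c_reference, c_improved):
    """Gain in polynomial degree, assuming the rate scales like sqrt(c_k)."""
    return np.sqrt(c_reference / c_improved)
```

With an improvement factor of about 25 in the relevant condition number, `convergence_gain(25.0, 1.0)` gives the factor 5 quoted in the text.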
9 Note that the relevant condition number corresponds to 1/δ^2 in the notation of Section 3.
Fig. 14. The expectation values for the higher condition numbers ck (k = 2, . . . , 20) (defined in Eq. (15)) for the
operator A_0^† A_0, built from the HF and from the Wilson fermion. In the most popular regime (around k = 15) the
HF gains a factor of ≈ 25 over the Wilson fermion. For the latter we also confirm that µ = 1.64 is a little better
for the condition number than the locality optimal parameter µ = 1.4.
6. Locality
Since there is occasionally some confusion about the term “locality” of a lattice action,
we refer to the definition that the lattice Dirac operator decays at least exponentially in
the distance between ψ̄ and ψ. This is the property which is crucial for providing a safe
continuum limit (since the decay width is fixed in lattice units).
It was conjectured [5]—and later proved [45]—that Ginsparg–Wilson fermions cannot
be “ultralocal”, not even in the free case. This means that their couplings cannot drop to
zero beyond a finite number of lattice spacings. However, locality in the above sense was
shown to hold for the free perfect fermion [15] and for the Neuberger fermion [4]. In Ref.
[5] it was shown that the truncated perfect free HF leads to an overlap-HF, which is more
local than the Neuberger fermion (it has a faster exponential decay).
In the presence of gauge interactions, locality can be proved for smooth configurations,
either by assuming a small upper limit for the deviation of any plaquette variable from 1
[6,46], or by assuming that the eigenvalues of A_0^† A_0 do not cluster densely in the vicinity of
zero [6]. For realistic configurations, the exponential decay was observed statistically for
the Neuberger fermion in quenched QCD at β = 6 [6]. This property may collapse at much
stronger coupling, but at some point the overlap formula is not applicable any more also
for other reasons, as we discussed in Section 2. The statistical demonstration was done by
showing that the “maximal correlation” between any two lattice sites, separated by a taxi
driver distance r, decays exponentially in r. More precisely, the expectation value of the
function
$$f_{\max}(r) := \max_{x,\,y} \Big\{ \|\psi(y)\| \ \Big|\ \sum_{\mu=1}^{4} |x_\mu - y_\mu| = r \Big\} \tag{16}$$
has to decay exponentially, if a unit source is located at the (arbitrary) site x. We probed 6
sites x for each configuration, then we went through all sites y to determine the maxima.
The 6 options for x were sufficient to stabilize the function fmax (r). (Remember that this
function was mentioned before in Section 5.)
Fig. 15. Comparison of the degree of locality for the Neuberger fermion and for the overlap HF. We see that the
latter is clearly more local; the exponent differs by more than a factor of 2.
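The measurement of fmax (r) for one source site can be sketched as follows. This is a minimal illustration, assuming that the norm of ψ(y) over the internal indices has already been computed on a periodic L^4 lattice and stored in an array `psi_norm`; the helper names `taxi_distance` and `f_max` are introduced here and are not taken from the original work.

```python
import numpy as np

def taxi_distance(L, source):
    """Periodic taxi-driver (L1) distance of every site of an L^4 lattice from `source`."""
    coords = np.indices((L,) * 4)
    dist = np.zeros((L,) * 4, dtype=int)
    for mu in range(4):
        d = np.abs(coords[mu] - source[mu])
        dist += np.minimum(d, L - d)   # shortest way around the torus
    return dist

def f_max(psi_norm, source):
    """Maximum of psi_norm over all sites y at each taxi-driver distance r."""
    L = psi_norm.shape[0]
    r = taxi_distance(L, source)
    out = np.zeros(r.max() + 1)
    # Unbuffered in-place maximum: out[r[y]] = max(out[r[y]], psi_norm[y])
    np.maximum.at(out, r.ravel(), psi_norm.ravel())
    return out
```

On a periodic 12^4 lattice the largest distance returned by `taxi_distance` is 4 × 6 = 24, in agreement with the maximal distance used in the text. Repeating the call for several source sites and taking the maximum per distance corresponds to probing several sites x.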
As we explained in the introduction, the property (4) suggests that the higher degree of
locality of the overlap-HF may also persist in the presence of gauge interactions. In fact,
in the Schwinger model a comparison of the function fmax (r) confirmed this conjecture
[14]. Here we extend this comparison to QCD on a periodic 12^4 lattice, which is the size
that was also used in Ref. [6]. The use of the taxi driver metric allows us to proceed to
maximal distance 24, and the exponential decay is clearly visible (footnote 10). Our result is shown in
Fig. 15. For the Neuberger fermion it agrees well with the result of Ref. [6] (which was also
obtained at β = 6), although we used a different mass parameter. We inserted µ = 1.64
which was optimal for the condition number of the Neuberger fermion (cf. Section 5),
whereas Ref. [6] used µ = 1.4, which is slightly better for the locality (because it was
optimized in this respect), but the difference is really small. Also for the overlap-HF we
used the parameter which is optimal for the condition number, in that case µ = 1.3.
10 Of course, the decay is also exponential with respect to the Euclidean distance. There the decay is less
smooth, but the asymptotic behavior can be extracted as 0.04 exp(−|x − y|) for the Neuberger fermion, and as
0.7 exp(−2|x − y|) for the overlap HF.
We see that the overlap-HF is by far more local. To be explicit, the asymptotic
decay of fmax (r) behaves like 0.017 exp(−0.45r) for the Wilson fermion, and like
0.017 exp(−0.93r) for the HF. This observation suggests that the range of applicability
of the overlap formula extends up to stronger coupling for the overlap-HF.
Interestingly, the very clean exponential decay sets in only in the presence of
interactions. For free fermions, the decay is faster, of course, but the exponential behavior
is not as neat as in Fig. 15.
On the technical side we mention that the convergence of fmax (r) at long distances
requires a very precise evaluation of the inverse square root. This motivated the use of
fmax (r = 24) as a measure for the precision of the approximations to the overlap formula,
see Section 5.
As a last comparison between the Neuberger fermion and the overlap HF, we consider
the extent of the violations of rotation invariance. To quantify this property, we introduce
the function fmax (|x − y|), which corresponds to definition (16) in Euclidean metrics, as
Fig. 16. Comparison of the violations of the rotation invariance for the Neuberger fermion and for the overlap-HF.
We see that the latter approximates rotation invariance much better. In both cases, the violations of rotation
invariance decay exponentially with the distance, but for the overlap HF the exponent of the decay is almost
doubled.
We put a unit source on one site x and probe all other sites y for a fixed Euclidean distance.
For each distance |x − y| that occurs, we determine the difference
fmax (|x − y|) − fmin (|x − y|)
which represents a measure for the violation of rotation invariance. Fig. 16 shows that
this difference decays exponentially as a function of the Euclidean distance. (Again we
present data from a periodic 12^4 lattice at β = 6.) For the Neuberger fermion (at µ = 1.64),
the asymptotic decay amounts to 0.065 exp(−0.11 |x − y|). The corresponding asymptotic
decay for the overlap HF (at µ = 1.3) behaves as exp(−2.10 |x − y|).
8. Conclusions
coupling. We have also shown that the overlap-HF approximates the rotation invariance
very well, in contrast to the Neuberger fermion.
However, the really crucial question is if our expectation for an improved scaling can
also be confirmed. If the overlap-HF allows for the use of a somewhat larger physical lattice
spacing, then the remaining overhead could be more than compensated. This hope is based
in particular on the elements of truncated perfect actions, which are incorporated in our
HF construction. Indeed, a strongly improved scaling was observed for free fermions and
in the 2-flavor Schwinger model [14]. We also ran some tests for the meson dispersions in
QCD, but it turned out that the dispersions are quite noisy, in particular for the overlap-HF.
Hence we postpone the delicate question of scaling for further investigation. Interestingly,
the phenomenon of more noise is also known from O(a) improved Wilson fermions with
a clover term. As a general property, GW fermions are also O(a) improved. Hence it is
conceivable that the overlap HF has an even better scaling behavior than the original HF,
since the overlap formula removes the O(a) artifacts.
As long as GW fermions can only be applied in the quenched approximation, the
connection with chiral perturbation theory [47] seems most attractive. However, the
ultimate goal is the use of dynamical Ginsparg–Wilson fermions, which appears as a
tremendous task. Hence it is worthwhile studying the optimal access carefully. We hope
that the current work contributes to this optimization.
Acknowledgements
I am very much indebted to Ivan Hip, who made important contributions to this work. I
also thank him, as well as David Adams, Norbert Eicker, Philippe de Forcrand, Karl Jansen,
Waseem Kamleh, Thomas Lippert, Martin Lüscher, Klaus Schilling, Rainer Sommer and
Urs Wenger for useful discussions. The computations for this work were performed on the
NICSE machines at the Forschungszentrum Jülich.
References
[12] W. Bietenholz, N. Eicker, A. Frommer, Th. Lippert, B. Medeke, K. Schilling, G. Weuffen, Comput. Phys.
Commun. 119 (1999) 1.
[13] K. Orginos, et al., Nucl. Phys. B (Proc. Suppl.) 63 (1998) 904.
[14] W. Bietenholz, I. Hip, Nucl. Phys. B 570 (2000) 423;
W. Bietenholz, I. Hip, Nucl. Phys. B (Proc. Suppl.) 83–84 (2000) 600.
[15] W. Bietenholz, U.-J. Wiese, Nucl. Phys. B 464 (1996) 319.
[16] S. Kato, N. Nakamura, T. Suzuki, S. Kitahara, Nucl. Phys. B 520 (1998) 323;
S. Fujimoto, S. Kato, T. Suzuki, Phys. Lett. B 476 (2000) 437;
M.N. Chernodub, S. Fujimoto, S. Kato, M. Murata, M.I. Polikarpov, T. Suzuki, Phys. Rev. D 62 (2000)
094506;
M.N. Chernodub, K. Ishiguro, T. Suzuki, hep-lat/0204003.
[17] W. Bietenholz, R. Brower, S. Chandrasekharan, U.-J. Wiese, Phys. Lett. B 407 (1997) 283.
[18] W. Bietenholz, Mod. Phys. Lett. A 14 (1999) 51.
[19] W. Bietenholz, U.-J. Wiese, Phys. Lett. B 378 (1996) 222;
W. Bietenholz, U.-J. Wiese, Nucl. Phys. B (Proc. Suppl.) 47 (1996) 575.
[20] W. Bietenholz, T. Struckmann, Int. J. Mod. Phys. C 10 (1999) 531.
[21] P. Hasenfratz, F. Niedermayer, Nucl. Phys. B 414 (1994) 785;
F. Niedermayer, P. Rüfenacht, U. Wenger, Nucl. Phys. B 597 (2001) 413;
P. Rüfenacht, U. Wenger, Nucl. Phys. B 616 (2001) 163.
[22] C.B. Lang, T.K. Pany, Nucl. Phys. B 513 (1998) 645;
F. Farchioni, V. Laliena, Phys. Rev. D 58 (1998) 054501.
[23] P. Hasenfratz, S. Hauswirth, K. Holland, T. Jörg, F. Niedermayer, U. Wenger, Nucl. Phys. B (Proc. Suppl.) 94
(2001) 627;
P. Hasenfratz, S. Hauswirth, K. Holland, T. Jörg, F. Niedermayer, Nucl. Phys. B (Proc. Suppl.) 106 (2002)
751;
P. Hasenfratz, S. Hauswirth, K. Holland, T. Jörg, F. Niedermayer, Nucl. Phys. B (Proc. Suppl.) 106 (2002)
799, hep-lat/0205010;
S. Hauswirth, Ph.D. thesis, hep-lat/0204015.
[24] W. Bietenholz, U.-J. Wiese, Phys. Lett. B 426 (1998) 114;
W. Bietenholz, Nucl. Phys. A 642 (1998) 275c.
[25] W. Bietenholz, in: X.-Q. Luo, E.B. Gregory (Eds.), Proceedings of the International Workshop on
Non-Perturbative Methods and Lattice QCD, Guangzhou (China), World Scientific, Singapore, 2001, p. 3,
hep-lat/0007017;
W. Bietenholz, N. Eicker, I. Hip, K. Schilling, Nucl. Phys. B (Proc. Suppl.) 94 (2001) 603.
[26] T. DeGrand, Phys. Rev. D 63 (2001) 034503.
[27] P. Hernández, K. Jansen, L. Lellouch, Phys. Lett. B 469 (1999) 198.
[28] T. DeGrand, A. Hasenfratz, Phys. Rev. D 64 (2001) 034512.
[29] A. Boriçi, Phys. Lett. B 453 (1999) 46.
[30] W. Kamleh, D. Adams, D. Leinweber, A. Williams, Phys. Rev. D 66 (2002) 014501.
[31] C. Gattringer, I. Hip, C.B. Lang, Nucl. Phys. B 597 (2001) 451;
For a corresponding work in d = 2, see: C. Gattringer, I. Hip, Phys. Lett. B 480 (2000) 112.
[32] S.J. Dong, F.X. Lee, K.F. Liu, J.B. Zhang, Phys. Rev. Lett. 85 (2000) 5051;
S.J. Dong, F.X. Lee, K.F. Liu, J.B. Zhang, Nucl. Phys. B (Proc. Suppl.) 94 (2001) 752;
F.D.R. Bonnet, P.O. Bowman, D.B. Leinweber, A.G. Williams, J.B. Zhang, Phys. Rev. D 65 (2002) 114503.
[33] W. Bietenholz, in: V. Mitrjushkin, G. Schierholz (Eds.), Lattice Fermions and the Structure of the Vacuum,
Kluwer Academic, 2000, p. 77, hep-lat/0001001.
[34] F. Farchioni, I. Hip, C.B. Lang, Phys. Lett. B 443 (1998) 214.
[35] F. Farchioni, I. Hip, C.B. Lang, M. Wohlgenannt, Nucl. Phys. B 544 (1999) 364.
[36] N. Cundy, M. Teper, U. Wenger, hep-lat/0203030.
[37] G.P. Lepage, P. Mackenzie, Phys. Rev. D 48 (1993) 2250.
[38] C. Gattringer, I. Hip, Nucl. Phys. B 541 (1999) 305.
[39] M. Albanese, et al., Comput. Phys. Commun. 45 (1987) 345.
[40] J. van den Eshof, A. Frommer, Th. Lippert, K. Schilling, H. van der Vorst, hep-lat/0202025;
J. van den Eshof, A. Frommer, Th. Lippert, K. Schilling, H. van der Vorst, Nucl. Phys. B (Proc. Suppl.) 106
(2002) 1070.
Abstract
We show that a non-trivial dilaton condensation alters the dimensions of orientifold planes. An
off-shell crosscap state which naturally interpolates between the usual on-shell crosscap states and
their T-duals plays an important role in the analysis. We present an explicit representation of the off-
shell crosscap state on an RP2 worldsheet in the gauge in which the worldsheet curvature in the bulk
of the fundamental region of the RP2 vanishes. We show that the non-trivial dilaton condensation
reproduces the correct descent relation among orientifold plane tensions.
© 2002 Elsevier Science B.V. All rights reserved.
1. Introduction
Orientifold planes (O-planes) as well as D-branes are important objects to reveal non-
perturbative effects in unoriented string theory. Recent studies on open-string tachyon
condensation have shown that background fields can control the dimensions of D-branes.1
The relationship between the dimensions of the D-branes and the configuration of the
background open-string fields is easily understood from the viewpoint of the worldsheet;
the background fields on the boundaries can alter the boundary conditions on the
worldsheet. On the other hand, the relationship between the properties of O-planes and
background fields has not been understood well as yet.
✩ This work is supported in part by the Grant-in-Aid for Scientific Research (14540264) from the Ministry of
Education, Science and Culture, Japan.
* Corresponding author.
E-mail addresses: itoyama@sci.osaka-cu.ac.jp (H. Itoyama), nakashin@postman.riken.go.jp
(S. Nakamura).
1 See, for example, Ref. [1] for recent reviews on the related topics.
0550-3213/02/$ – see front matter © 2002 Elsevier Science B.V. All rights reserved.
PII: S0550-3213(02)00791-5
H. Itoyama, S. Nakamura / Nuclear Physics B 644 (2002) 248–262 249
In the present work, we investigate the connection between the dimensions of O-planes
and the configuration of the background dilaton field, in unoriented bosonic string
theory. An O-plane is represented as a crosscap on an RP^2 worldsheet while a D-brane is
represented as a boundary on a disc. Therefore we consider an RP^2 worldsheet in the
presence of the background dilatons. The reason we consider dilatons is based on the
following property. Dilatons couple to the worldsheet curvature and their contribution can
be put on any part of the worldsheet in general. However, we show that the contribution
of the background dilatons localizes on the crosscap if we choose the gauge in which the
worldsheet curvature vanishes in the bulk of the fundamental region of the RP^2. This choice
of gauge is nice; the bulk part of the RP^2 becomes free and all the interactions from the
background dilatons appear only on the crosscap. Therefore the effects of the background
dilatons can be translated into the modification of the crosscap conditions.
We introduce a new crosscap state which we refer to as an off-shell crosscap state
in order to analyze the properties of the RP2 worldsheet. Of course, O-planes couple
to only closed strings and we do not have open-string modes on the RP2 worldsheet.
Useful tools to analyze the properties of worldsheets in terms of closed-string modes are
boundary states and crosscap states [2–5]. Usually, boundary states and crosscap states
belong to the closed-string sector which preserves conformal invariance on the worldsheet.
Extensions of these boundary states which do not in general maintain conformal invariance
have been proposed recently [6,7]. (See also [8,9].) We call them off-shell boundary
states in the present article. The off-shell boundary state interpolates between the usual
on-shell boundary states and their T-duals. This state is defined on a disc worldsheet
with quadratic boundary interactions. The dimension of the corresponding D-brane is
controlled by the coupling constants of these interactions. Boundary string field theory
(BSFT) [10,11] states that the coupling constants also parameterize the configuration of
open-string tachyons [12–17]. Calculating the disc partition function by using these states
enables us to obtain the descent relation among the D-brane tensions if we take appropriate
on-shell limits of the partition functions.
An attempt to apply the foregoing idea to crosscap states is given in Ref. [18], in which
the definition of the off-shell crosscap state which interpolates the usual on-shell crosscap
states and their T-duals has been proposed. We present an explicit representation of the off-
shell crosscap state in the present work. We define the off-shell crosscap state on an RP2
worldsheet in the presence of quadratic interactions on the crosscap. The dimension of the
corresponding O-plane is controlled by the coupling constants of these interactions. The
physical meaning of the quadratic interactions on the crosscap is the background dilaton
field of quadratic configuration. The behaviour of the off-shell crosscap state shows that
the background dilatons control the dimension of the O-plane.
The off-shell crosscap state is a useful tool to obtain correlation functions on the RP2
worldsheet in the presence of the quadratic dilatons on the crosscap. The partition
function of the RP2 worldsheet, which is considered to be proportional to the O-plane
tension, can be calculated exactly by using the two-point function. We show that the
condensation of the dilatons of quadratic profile reproduces the correct descent relation
among O-plane tensions by taking appropriate on-shell limits of the partition function.
Note that we have not attempted to find the dynamical origin of the dilaton condensation.
The paper is organized as follows. In Section 2, we construct the RP2 worldsheet on
a complex plane and we show that the contribution of the background dilatons survives
on the crosscap alone if we choose the gauge in which the worldsheet curvature vanishes
in the bulk of the fundamental region of the RP2 . In Section 3 we define the off-shell
crosscap state and we obtain its explicit representation. We find that the behaviour of this
state signifies that a non-trivial dilaton condensation alters the dimensions of O-planes.
We calculate the partition function of the RP2 by using the off-shell crosscap state. In
Section 4, we verify that the non-trivial dilaton condensation reproduces the correct descent
relation of O-plane tensions, by taking appropriate limits of the partition function of the
RP2 worldsheet. In the final section, we summarize the results of this study. In
Appendix A, we comment on the relationship between the RP2 worldsheet and the
supersymmetric disc worldsheet which appears in the analysis of D D̄ systems.
Let us consider an RP2 worldsheet with background dilaton field. We show, in this
section, that the contribution of the background dilatons localizes on the crosscap if we
choose the gauge in which the worldsheet curvature in the bulk of the fundamental region
of the RP2 vanishes.
RP2 is a non-orientable Riemann surface of Euler number one with no hole, no boundary
and one crosscap. We construct the RP2 worldsheet on a complex z-plane by using an
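involution where we identify $z$ and $-1/\bar z$ on the complex plane. We choose the fundamental
region $\Sigma$ to be $\{z = re^{i\sigma} \mid 0 \le r < 1,\ 0 \le \sigma < 2\pi\} \cup \{z = re^{i\sigma} \mid r = 1,\ 0 \le \sigma < \pi\}$. The
crosscap $C$, the non-trivial closed loop of the RP2 worldsheet, is represented as half of the unit
circle $\{z = re^{i\sigma} \mid r = 1,\ 0 \le \sigma < \pi\}$ in this case.
To begin with, we set the metric inside the unit circle ($|z| \le 1$) on the complex plane as
\[ h_{zz} = h_{\bar z\bar z} = 0, \qquad h_{\bar z z} = h_{z\bar z} = \frac{1}{2}. \tag{2.1} \]
The metric outside the unit circle ($|z'| \ge 1$) is obtained by the involution $z' = -1/\bar z$, $\bar z' = -1/z$
as
\[ h_{z'\bar z'} = h_{\bar z' z'} = \frac{\partial \bar z}{\partial z'}\,\frac{\partial z}{\partial \bar z'}\, h_{\bar z z} = \frac{1}{(z'\bar z')^{2}}\, h_{\bar z z} = \frac{1}{r^{4}}\, h_{\bar z z}, \tag{2.2} \]
where $r^2 = z'\bar z'$ for $r \ge 1$. Therefore the metric on the entire complex plane can be written
as
\[ h_{zz} = h_{\bar z\bar z} = 0, \qquad h_{\bar z z} = h_{z\bar z} = \frac{1}{2}\, e^{\rho}, \tag{2.3} \]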
where
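\[ {} = 4\int_0^{\pi} d\sigma \left[ -\frac{d}{dr}\bigl( r \ln r\, \Phi(r,\sigma) \bigr)\Big|_{r=1} + 2\Phi(1,\sigma) \right] = 4\int_0^{\pi} d\sigma\, \Phi(1,\sigma), \tag{2.7} \]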
which yields
\[ \frac{1}{4\pi}\int_{\Sigma} dr\, d\sigma\, \sqrt{g}\, R\, \Phi(r,\sigma) = \frac{1}{\pi}\int_0^{\pi} d\sigma\, \Phi(1,\sigma). \tag{2.8} \]
Note that (2.8) gives the correct Euler number of the RP2 (which is one) if we set Φ = 1.
Therefore the contribution of the background dilaton concentrates on the crosscap with the
above gauge choice.
In this section, we define the off-shell crosscap state and obtain its explicit represen-
tation. We apply the off-shell crosscap state to calculate the correlation functions and the
partition function of the RP2 worldsheet with quadratic background dilaton field. We find
that the behaviour of the off-shell crosscap state signifies that a non-trivial dilaton conden-
sation alters the dimensions of O-planes.
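\[ \Phi(\sigma) = a + \frac{1}{2\alpha'} \sum_{\mu=1}^{26} u_\mu \bigl( X^\mu(\sigma) \bigr)^2, \tag{3.2} \]
where the interaction $\Phi(\sigma)$ is inserted only on crosscap $C$. Note that the worldsheet action
is free in the “bulk” region $\{z = re^{i\sigma} \mid 0 \le r < 1,\ 0 \le \sigma < 2\pi\}$. A closed string propagates
freely in the “bulk” toward the crosscap, from the viewpoint of the closed-string channel.
In this sense, $X^\mu$ in the “bulk” can be expanded as
\[ X^\mu(z,\bar z) = X_0^\mu - \frac{i\alpha'}{2}\, p^\mu \ln z\bar z + i\sqrt{\frac{\alpha'}{2}} \sum_{n\neq 0} \left( \frac{\alpha_n^\mu z^{-n}}{n} + \frac{\tilde\alpha_n^\mu \bar z^{-n}}{n} \right). \tag{3.3} \]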
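The existence of the crosscap, however, makes constraints on the oscillation of the
closed string in the neighborhood of the crosscap (in the region $r \to 1$). For example,
the constraints when we have no interaction on $C$ are given in Ref. [4] as
\[ \bigl[ X^\mu(z,\bar z) - X^\mu(-1/\bar z,-1/z) \bigr]_{r\to 1} = 0, \qquad \bigl[ \dot X^\mu(z,\bar z) + \dot X^\mu(-1/\bar z,-1/z) \bigr]_{r\to 1} = 0, \tag{3.4} \]
where
\[ \dot X^\mu(z,\bar z) \equiv \bigl( w\partial_w + \bar w\bar\partial_{\bar w} \bigr) X^\mu(w,\bar w)\Big|_{w=z,\ \bar w=\bar z}, \qquad \dot X^\mu(-1/\bar z,-1/z) \equiv \bigl( w\partial_w + \bar w\bar\partial_{\bar w} \bigr) X^\mu(w,\bar w)\Big|_{w=-1/\bar z,\ \bar w=-1/z}. \tag{3.5} \]
Note that (3.4) are equivalent to the following constraints:
\[ K_0(z,\bar z)\Big|_{r\to 1} = 0, \qquad \bigl( z\partial_z + \bar z\bar\partial_{\bar z} \bigr) K_0(z,\bar z)\Big|_{r\to 1} = 0, \tag{3.6} \]
where
\[ K_0(z,\bar z) \equiv \dot X^\mu(z,\bar z) + \dot X^\mu(-1/\bar z,-1/z). \tag{3.7} \]
These conditions are rewritten in terms of closed-string modes at $r \to 1$ as
\[ \alpha_n^\mu + (-1)^n \tilde\alpha_{-n}^\mu = 0, \qquad p^\mu = 0. \tag{3.8} \]
The conditions (3.4), (3.6) or (3.8) are referred to as (on-shell) crosscap conditions.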
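As an illustrative check, not taken from the original text, the mode conditions (3.8) can be recovered from (3.4) coefficient by coefficient. The sketch below assumes the standard closed-string expansion $X^\mu = X_0^\mu - \tfrac{i\alpha'}{2} p^\mu \ln z\bar z + i\sqrt{\alpha'/2}\sum_{n\neq 0}\tfrac{1}{n}(\alpha^\mu_n z^{-n} + \tilde\alpha^\mu_n \bar z^{-n})$ and treats the conditions as acting on the crosscap state.

```latex
% Image point: w = -1/\bar z, \bar w = -1/z, so that w^{-n} = (-\bar z)^n and \bar w^{-n} = (-z)^n.
% On the crosscap r = 1 we have \bar z = z^{-1} and \ln z\bar z = 0.
\begin{align*}
\bigl[ X^\mu(z,\bar z) - X^\mu(-1/\bar z,-1/z) \bigr]_{r=1}
  &= i\sqrt{\tfrac{\alpha'}{2}} \sum_{n\neq 0} \frac{1-(-1)^n}{n}
     \bigl( \alpha^\mu_n - \tilde\alpha^\mu_{-n} \bigr) z^{-n}, \\
\bigl[ \dot X^\mu(z,\bar z) + \dot X^\mu(-1/\bar z,-1/z) \bigr]_{r=1}
  &= -2i\alpha' p^\mu - i\sqrt{\tfrac{\alpha'}{2}} \sum_{n\neq 0} \bigl[ 1+(-1)^n \bigr]
     \bigl( \alpha^\mu_n + \tilde\alpha^\mu_{-n} \bigr) z^{-n}.
\end{align*}
```

Under these assumptions the first line constrains only odd $n$ (giving $\alpha^\mu_n - \tilde\alpha^\mu_{-n} = 0$), while the second constrains even $n$ (giving $\alpha^\mu_n + \tilde\alpha^\mu_{-n} = 0$) together with $p^\mu = 0$. Both cases combine into $\alpha^\mu_n + (-1)^n \tilde\alpha^\mu_{-n} = 0$, in agreement with (3.8).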
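The aim of this subsection is to extend the on-shell crosscap conditions into the case
$u_\mu \neq 0$. We should find, in other words, the constraints on the closed-string modes in the
neighborhood of $C$ in the presence of interaction $\Phi$. We call these constraints off-shell
crosscap conditions. We assert that the off-shell crosscap conditions can be written as
\[ K(z,\bar z)\Big|_{r\to 1} = 0, \qquad \bigl( z\partial_z + \bar z\bar\partial_{\bar z} \bigr) K(z,\bar z)\Big|_{r\to 1} = 0, \tag{3.9} \]
where
\[ \begin{aligned} K(z,\bar z) &\equiv \dot X^\mu(z,\bar z) + \dot X^\mu(-1/\bar z,-1/z) + u_\mu \bigl[ X^\mu(z,\bar z) + X^\mu(-1/\bar z,-1/z) \bigr] \\ &= \bigl[ \bigl( w\partial_w + \bar w\bar\partial_{\bar w} \bigr) X^\mu(w,\bar w) + u_\mu X^\mu(w,\bar w) \bigr]_{w=z,\ \bar w=\bar z} \\ &\quad + \bigl[ \bigl( w\partial_w + \bar w\bar\partial_{\bar w} \bigr) X^\mu(w,\bar w) + u_\mu X^\mu(w,\bar w) \bigr]_{w=-1/\bar z,\ \bar w=-1/z}. \end{aligned} \tag{3.10} \]
The right-hand side of (3.10) indicates the meaning of the off-shell crosscap conditions;
(3.9) are the conditions so that the $X^\mu$ in the neighborhood of $C$, as well as its image by
the involution, connects smoothly with the $X^\mu$ on $C$ which obeys
\[ \bigl[ \bigl( z\partial_z + \bar z\bar\partial_{\bar z} \bigr) X^\mu(z,\bar z) + u_\mu X^\mu(z,\bar z) \bigr]_{C} = 0, \tag{3.11} \]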
2 Although RP2 has no boundary, $(z\partial_z + \bar z\bar\partial_{\bar z})X^\mu(z,\bar z)$, which comes from the total derivative, survives only
on the crosscap due to the involution.
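We define off-shell crosscap state $\langle C(u)|$ using the off-shell crosscap conditions as
\[ \langle C(u)| \left[ -\bigl( \alpha_n^\mu + (-1)^n \tilde\alpha_{-n}^\mu \bigr) + \frac{u_\mu}{n} \bigl( \alpha_n^\mu - (-1)^n \tilde\alpha_{-n}^\mu \bigr) \right] = 0, \qquad \langle C(u)| \bigl( -i\alpha' p^\mu + u_\mu X_0^\mu \bigr) = 0. \tag{3.15} \]
The explicit form of $\langle C(u)|$ is given as
\[ \langle C(u)| = \langle 0|\, C(u), \qquad C(u) \equiv \exp\left( -\frac{1}{2} X_0^\mu A_{\mu\nu} X_0^\nu \right) \exp\left( \sum_{m=1}^{\infty} \tilde\alpha_m^\mu C^{(m)}_{\mu\nu} \alpha_m^\nu \right), \tag{3.16} \]
where
\[ A_{\mu\nu} \equiv A(u_\mu)\delta_{\mu\nu} = \frac{1}{\alpha'}\, u_\mu \delta_{\mu\nu}, \qquad C^{(m)}_{\mu\nu} \equiv C^{(m)}(u_\mu)\delta_{\mu\nu} = -\frac{(-1)^m}{m}\, \frac{m - u_\mu}{m + u_\mu}\, \delta_{\mu\nu}. \tag{3.17} \]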
We can easily check that this off-shell crosscap state becomes (the T-dual of) the usual
on-shell crosscap state if we take the limit uµ → 0 (uµ → ∞). Therefore the off-shell
crosscap state naturally interpolates between the crosscap state for a higher-dimensional
O-plane and that for a lower-dimensional O-plane.
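The interpolation can be made concrete with a short numerical sketch. It assumes the oscillator coefficient reads $C^{(m)}(u) = -\frac{(-1)^m}{m}\frac{m-u}{m+u}$ as in (3.17); the function name `crosscap_coefficient` and the large value standing in for $u \to \infty$ are illustrative choices, not part of the original paper.

```python
def crosscap_coefficient(m: int, u: float) -> float:
    """Oscillator coefficient C^(m)(u) = -((-1)^m / m) * (m - u) / (m + u), read off from Eq. (3.17)."""
    return -((-1) ** m / m) * (m - u) / (m + u)


def limiting_values(m: int, u_large: float = 1e12):
    """Return C^(m) at u = 0 and at a large u used as a stand-in for u -> infinity."""
    on_shell = crosscap_coefficient(m, 0.0)      # equals -(-1)^m / m
    t_dual = crosscap_coefficient(m, u_large)    # approaches +(-1)^m / m
    return on_shell, t_dual


if __name__ == "__main__":
    for m in (1, 2, 3):
        print(m, limiting_values(m))
```

The two limits differ only by an overall sign of the oscillator exponent, which is the usual signature of T-duality along that direction. In the same spirit, the zero-mode factor $\exp(-u X_0^2/2\alpha')$ is flat at $u = 0$ and becomes sharply peaked at $X_0 = 0$ as $u$ grows.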
Next, we show that the off-shell crosscap state is a useful tool to evaluate the quantities
on the RP2 worldsheet. For example, we can calculate the Green’s function and the
partition function on the RP2 worldsheet in the presence of interaction Φ(σ ) on the
crosscap. Let us consider one-dimensional target space and omit the superscript µ of X
and u for simplicity. The Green’s function for this case is given as
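\[ \begin{aligned} G(z,w) &= \frac{\langle C(u)|\, X(z,\bar z)\, X(w,\bar w)\, |0\rangle}{\langle C(u)|0\rangle} \\ &= -\frac{\alpha'}{2} \ln|z-w|^2 - \frac{\alpha'}{2} \ln\bigl| 1 + z\bar w \bigr|^2 + \frac{\alpha'}{u} - \alpha' u \sum_{k=1}^{\infty} \frac{1}{k(k+u)} \left[ \bigl( -z\bar w \bigr)^k + \bigl( -\bar z w \bigr)^k \right]. \end{aligned} \tag{3.18} \]
In the case $z = e^{i\sigma}$ and $w = e^{i\sigma'}$, we obtain
\[ G\bigl( e^{i\sigma}, e^{i\sigma'} \bigr) = -\frac{\alpha'}{2} \ln\bigl| 1 - e^{i(\sigma-\sigma')} \bigr|^2 - \frac{\alpha'}{2} \ln\bigl| 1 + e^{i(\sigma-\sigma')} \bigr|^2 + \frac{\alpha'}{u} - \alpha' u \sum_{k=1}^{\infty} \frac{(-1)^k}{k(k+u)} \left[ e^{ik(\sigma-\sigma')} + e^{-ik(\sigma-\sigma')} \right]. \tag{3.19} \]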
We can next calculate the partition function of the RP2 worldsheet by using the
following relationship:
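\[ \begin{aligned} \frac{d}{du} \ln Z(u) &= -\frac{1}{2\pi\alpha'} \int_0^{\pi} d\sigma\, \bigl\langle X^2(\sigma) \bigr\rangle = -\frac{1}{2\alpha'} \bigl\langle X^2(\sigma) \bigr\rangle \\ &= -\frac{1}{2\alpha'} \left[ -\alpha' \ln(2q) - \frac{\alpha'}{u} - 2\alpha' \left( \Psi\!\left( \frac{u}{2} \right) - \Psi(u) \right) \right] \\ &= \frac{1}{2} \left[ \ln(2q) + \frac{d}{du} \ln u + 2 \left( 2\frac{d}{du} \ln\Gamma\!\left( \frac{u}{2} \right) - \frac{d}{du} \ln\Gamma(u) \right) \right] \\ &= \frac{d}{du} \left[ \ln \frac{\bigl( \sqrt{2q} \bigr)^{u} \sqrt{u}\, \Gamma\!\left( \frac{u}{2} \right)^2}{\Gamma(u)} + \text{const} \right]. \end{aligned} \tag{3.24} \]
We then obtain
\[ Z(u) = \frac{\bigl( \sqrt{2q} \bigr)^{u} \sqrt{u}\, \Gamma\!\left( \frac{u}{2} \right)^2}{\Gamma(u)}, \tag{3.25} \]
up to the overall normalization factor. In general, the partition function for 26-dimensional
target space with the interaction (3.2) on the crosscap can be written as
\[ Z(a,u) \equiv e^{-a} Z(u) \equiv e^{-a} \prod_{\mu=1}^{26} Z(u_\mu) = e^{-a} \prod_{\mu=1}^{26} \frac{\bigl( \sqrt{2q} \bigr)^{u_\mu} \sqrt{u_\mu}\, \Gamma\!\left( \frac{u_\mu}{2} \right)^2}{\Gamma(u_\mu)}, \tag{3.26} \]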
up to the overall normalization factor. We note that the partition function (3.26) on the
RP2 has an identical representation with the partition function on the supersymmetric
disc worldsheet which has been considered in the analysis of open-string tachyon
condensation in D D̄ systems [15–17]. We present some comments on the relationship
between the RP2 worldsheet and the supersymmetric disc worldsheet in Appendix A.
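The integration step leading from (3.24) to (3.25) can be checked numerically. The sketch below assumes the readings $\frac{d}{du}\ln Z = \frac12[\ln(2q) + 1/u + 2(\Psi(u/2) - \Psi(u))]$ and $Z(u) = (\sqrt{2q})^u \sqrt{u}\,\Gamma(u/2)^2/\Gamma(u)$; the helper names and the hand-rolled `digamma` (recurrence plus asymptotic series) are illustrative and not from the paper.

```python
import math


def digamma(x: float) -> float:
    """Digamma function for x > 0: shift upward by recurrence, then use the asymptotic series."""
    result = 0.0
    while x < 6.0:
        result -= 1.0 / x          # psi(x) = psi(x + 1) - 1/x
        x += 1.0
    inv2 = 1.0 / (x * x)
    # psi(x) ~ ln x - 1/(2x) - 1/(12 x^2) + 1/(120 x^4) - 1/(252 x^6)
    result += math.log(x) - 0.5 / x - inv2 * (1.0 / 12 - inv2 * (1.0 / 120 - inv2 / 252))
    return result


def log_Z(u: float, q: float) -> float:
    """ln Z(u) for Z(u) = (sqrt(2q))^u sqrt(u) Gamma(u/2)^2 / Gamma(u)."""
    return 0.5 * u * math.log(2 * q) + 0.5 * math.log(u) + 2 * math.lgamma(u / 2) - math.lgamma(u)


def dlogZ_formula(u: float, q: float) -> float:
    """Right-hand side of the third line of (3.24), written with digamma functions."""
    return 0.5 * (math.log(2 * q) + 1.0 / u + 2 * (digamma(u / 2) - digamma(u)))


def dlogZ_numeric(u: float, q: float, h: float = 1e-5) -> float:
    """Central finite difference of ln Z(u)."""
    return (log_Z(u + h, q) - log_Z(u - h, q)) / (2 * h)


if __name__ == "__main__":
    for u in (0.7, 3.0, 10.0):
        print(u, dlogZ_formula(u, 2.0), dlogZ_numeric(u, 2.0))
```

The two columns printed for each `u` should agree to within the finite-difference error, which supports the closed form for $Z(u)$ up to the overall normalization.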
In this section, we clarify the physical meaning of the partition function calculated in the
previous section. We show that the condensation of the quadratic dilaton field reproduces
the correct descent relation among the O-plane tensions.
Let us consider our work from the viewpoint of the sigma model approach. The basic
idea of the sigma model approach is that the spacetime action for string fields is essentially
the renormalized partition function of the worldsheet with corresponding background
string fields. In this sense, the spacetime action S for string field λi may be given as
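\[ S(\lambda_i) \sim \sum_{\chi} \int_{\Sigma_\chi} [dg_{ab}]\, [dX^\mu]\, e^{-I_\chi(g_{ab}, X^\mu; \lambda_i)} = Z_{\mathrm{sphere}}(\lambda_i) + Z_{\mathrm{disc}}(\lambda_i) + Z_{\mathrm{RP}^2}(\lambda_i) + \cdots, \tag{4.1} \]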
where $I_\chi$ is the action on the worldsheet $\Sigma_\chi$ of the Euler number $\chi$. Leading term
$Z_{\mathrm{sphere}}(\lambda_i)$ is the renormalized partition function on the sphere. This term is of order $g_o^{-2}$
where $g_o$ is the open-string coupling constant. Renormalized partition functions $Z_{\mathrm{disc}}(\lambda_i)$
on the disc and $Z_{\mathrm{RP}^2}(\lambda_i)$ on the RP2 are the loop correction terms of order $g_o^{-1}$. In principle,
$Z_{\mathrm{disc}}(\lambda_i)$ is proportional to the tension of the corresponding D-brane and $Z_{\mathrm{RP}^2}(\lambda_i)$ is
proportional to the tension of the corresponding O-plane.
It is known, however, that the right-hand side of (4.1) does not give the correct
spacetime action; we need modification of Zsphere(λi ) and Zdisc(λi )3 in order to obtain the
correct off-shell spacetime action. These modifications are closely related to the infinite
Möbius volume of the worldsheets; we need to subtract the divergence from the Möbius
infinity [19–21]. (See also [15,22].)
The situation is different for $Z_{\mathrm{RP}^2}(\lambda_i)$ (and for the partition functions of the worldsheets
of $\chi \le 0$). We should note that the Möbius group of RP2 is SO(3) whose volume is finite,
and we have no Möbius infinity from the RP2 worldsheet. Therefore it is natural to assume
that partition function $Z_{\mathrm{RP}^2}(\lambda_i)$ itself is the exact loop correction term from the RP2 graph.
We have calculated the partition function (3.26) on the RP2 worldsheet in the presence of
quadratic background dilaton field $\Phi = a + (2\alpha')^{-1} \sum_\mu u_\mu X_\mu^2$ on the crosscap. We have
fixed the worldsheet metric so that the “bulk” part of the fundamental region of the RP2
becomes flat and the contribution of the dilatons concentrates on the crosscap. Therefore
we have to calculate the contribution of the ghost field and the anti-ghost field on the RP2
worldsheet in order to obtain the correct overall normalization of (3.26). We write this
overall factor A and then we have the following relationship:
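\[ Z_{\mathrm{RP}^2}(\Phi) = A\, e^{-a} \prod_{\mu=1}^{26} \frac{\bigl( \sqrt{2q} \bigr)^{u_\mu} \sqrt{u_\mu}\, \Gamma\!\left( \frac{u_\mu}{2} \right)^2}{\Gamma(u_\mu)} \equiv Z_{\mathrm{RP}^2}(a,u). \tag{4.2} \]
The sign of $A$ should be minus. Note that factor $e^{-a}$ is equal to $g_o^{-1}$ when the dilaton is
constant.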
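\[ F(x) \equiv \frac{4^{x}\, x\, \Gamma(x)^2}{2\,\Gamma(2x)}. \tag{4.4} \]
$F(x)$ behaves as follows:
\[ F(x) \sim 1 + (2\ln 2)\, x + O\bigl( x^2 \bigr) \qquad (x \to 0), \tag{4.5} \]
\[ F(x) \sim \sqrt{\pi x} + O\bigl( x^{-1/2} \bigr) \qquad (x \to \infty). \tag{4.6} \]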
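The two asymptotic forms of $F(x)$ are easy to confirm numerically. The following is a minimal sketch assuming $F(x) = 4^x x\,\Gamma(x)^2 / (2\Gamma(2x))$ as in (4.4); it works with logarithms of gamma functions to avoid overflow at large $x$, and the sample points are arbitrary.

```python
import math


def F(x: float) -> float:
    """F(x) = 4^x * x * Gamma(x)^2 / (2 * Gamma(2x)), evaluated through log-gamma for stability."""
    log_f = (x * math.log(4.0) + math.log(x) + 2.0 * math.lgamma(x)
             - math.log(2.0) - math.lgamma(2.0 * x))
    return math.exp(log_f)


def small_x_approx(x: float) -> float:
    """Leading behaviour near x = 0, Eq. (4.5)."""
    return 1.0 + 2.0 * math.log(2.0) * x


def large_x_approx(x: float) -> float:
    """Leading behaviour at large x, Eq. (4.6)."""
    return math.sqrt(math.pi * x)


if __name__ == "__main__":
    print(F(1e-3), small_x_approx(1e-3))
    print(F(1e4), large_x_approx(1e4))
```

The large-$x$ form follows from the duplication formula for the gamma function, which reduces $F(x)$ to $\sqrt{\pi}\, x\, \Gamma(x)/\Gamma(x + 1/2)$.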
$Z(u)$ around $u = 0$ is then
\[ Z(u) \sim \frac{4}{\sqrt{u}} + O\bigl( \sqrt{u} \bigr). \tag{4.7} \]
Thus, Z(u) diverges when u approaches 0. This is an IR divergence which corresponds to
the volume of the spacetime.
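On the other hand $Z(u)$ around $u = \infty$ is
\[ Z(u) \sim \left( \frac{q}{2} \right)^{u/2} \left[ 4\sqrt{\frac{\pi}{2}} + O\!\left( \frac{1}{\sqrt{u}} \right) \right]. \tag{4.8} \]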
We can obtain a finite and non-zero value of Z(u) in the limit u → ∞ if and only if q = 2.
Therefore we assign the value 2 to q. In other words, we have chosen the renormalization
scheme in (3.20) so that we can obtain a finite and non-zero value of Z(u) in the limit
u → ∞. Z(u) is then
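\[ Z(u) = \frac{\bigl( \sqrt{4} \bigr)^{u} \sqrt{u}\, \Gamma\!\left( \frac{u}{2} \right)^2}{\Gamma(u)} = \frac{4}{\sqrt{u}}\, F\!\left( \frac{u}{2} \right), \tag{4.9} \]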
Therefore
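\[ \frac{T_{24}}{T_{25}} = \frac{\sqrt{2\pi\alpha'}}{4} \cdot 4\sqrt{\frac{\pi}{2}} = \pi\sqrt{\alpha'}. \tag{4.18} \]
This is precisely the ratio of the tension of an O24-plane and that of an O25-plane. In
general, we can show in a similar manner that
\[ \frac{T_p}{T_q} = \bigl( \pi\sqrt{\alpha'} \bigr)^{q-p}. \tag{4.19} \]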
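The on-shell limits used in this section can be reproduced with a few lines of arithmetic. The sketch below assumes $Z(u) = (\sqrt{2q})^u \sqrt{u}\,\Gamma(u/2)^2/\Gamma(u)$ with $q = 2$, and reads (4.18) as $\frac{\sqrt{2\pi\alpha'}}{4} \cdot 4\sqrt{\pi/2}$; the prefactor $\sqrt{2\pi\alpha'}/4$ is taken from that reading rather than derived here, and the function names are illustrative.

```python
import math


def Z(u: float, q: float = 2.0) -> float:
    """One-dimensional RP2 partition function, evaluated through log-gamma for stability."""
    log_z = (0.5 * u * math.log(2.0 * q) + 0.5 * math.log(u)
             + 2.0 * math.lgamma(u / 2.0) - math.lgamma(u))
    return math.exp(log_z)


def tension_ratio_24_25(alpha_prime: float) -> float:
    """Arithmetic of Eq. (4.18): assumed prefactor times the u -> infinity limit of Z(u)."""
    large_u_limit = 4.0 * math.sqrt(math.pi / 2.0)
    return math.sqrt(2.0 * math.pi * alpha_prime) / 4.0 * large_u_limit


if __name__ == "__main__":
    print(Z(1e-6) * math.sqrt(1e-6) / 4.0)           # tends to 1 as u -> 0
    print(Z(1e6), 4.0 * math.sqrt(math.pi / 2.0))    # finite limit for q = 2
    print(tension_ratio_24_25(1.0), math.pi)         # pi * sqrt(alpha') at alpha' = 1
```

For any $q \neq 2$ the factor $(q/2)^{u/2}$ drives $Z(u)$ to zero or infinity at large $u$, which is why the finite limit singles out $q = 2$.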
5. Conclusion
We have considered the relationship between the configuration of the background dila-
ton field and the dimensions of the O-planes. We showed that the contribution of the dila-
tons on the RP2 worldsheet localizes on the crosscap if we choose the gauge in which the
worldsheet curvature in the “bulk” vanishes. This feature enables us to treat dilatons quite
easily. We have proposed the off-shell crosscap state which naturally interpolates between
the usual crosscap states and their T-duals. The behaviour of the off-shell crosscap state
signifies that the non-trivial dilaton condensation alters the dimensions of O-planes. We
obtained the correlation functions and partition function on the RP2 worldsheet in the pres-
ence of the quadratic dilaton field on the crosscap. We showed that the non-trivial dilaton
condensation reproduces the correct descent relation among O-plane tensions, by taking
the on-shell limits of the partition function of the RP2 . We found the correspondence be-
tween the RP2 worldsheet and the supersymmetric disc which is presented in Appendix A.
We would like to make some comments. We have studied the effects of the non-trivial
dilaton condensation on O-planes. In order to describe the full dynamics of dilatons, we
would need open-closed string field theory.4 Non-perturbative analysis would be necessary
too. However, the present work can be a step to understand the relationship between the
condensation of string fields and the transmutations of O-planes. An extension of the
present work into the supersymmetric case is interesting, and studying the relationship
between our work and the properties of O-planes described by F-theory [25,26] is one
further direction to pursue. Studying the relationship between the configuration of the
dilatons and the O-planes in type I theory [27,28] is also tempting. It has been shown that
non-trivial configurations of dilatons constrain the spacetime positions of D-branes [29].
Effects of the non-trivial dilaton condensation on D-branes are also interesting subjects to
study. Recent studies have shown that the condensation of the closed-string tachyons in
the twisted sector can alter the topology of the orbifolded spacetime [30–33]. We expect
that the present work can also be a step towards understanding the relationship between the
topology of the spacetime and the configuration of the string fields, since the dimensions
of O-planes are closely related to the topology of the orientifolded spacetime.
4 An unoriented open-closed string field theory has been proposed in Ref. [24].
Acknowledgements
1 2 µ¯ µ¯ µ¯
Isuper-disc = d z ∂X ∂Xµ + ψ ∂ψµ + ψ̃ ∂ ψ̃µ
2
4π α
M
1 µ 2 2
µ
ν
ρ 1
+ dσ y X + ψ ∂µ X ψ ∂ν Xρ . (A.1)
4π α µ ∂σ
∂M
This action has been considered in the context of open-string tachyon condensation in D D̄
systems [15–17]. The partition function of the super-disc is obtained in Ref. [15] as
\[ Z(y) \propto \prod_{\mu=1}^{26} \frac{1}{\sqrt{y^\mu}}\, F\bigl( y^\mu \bigr), \tag{A.2} \]
where function F (x) has been defined in (4.4).
Therefore, the partition function (4.10) of the RP2 worldsheet has an identical form to
the partition function (A.2) of the super-disc. Comparing (4.10) and (A.2), we find the
correspondence
\[ \frac{u_\mu}{2} \leftrightarrow y^\mu. \tag{A.3} \]
At the level of integrated operators, we find the correspondence
ORP2 ↔ Odisc , (A.4)
where
\[ O_{\mathrm{RP}^2} \equiv \frac{2}{\alpha'} \int_{C} d\sigma\, X_\mu^2(\sigma), \tag{A.5} \]
is an operator on the RP2 and
2
1 ν
Odisc (σ ) ≡ dσ Xµ2 (σ ) + ψ µ ∂µ Xρ ψ ∂ν Xρ (σ ) , (A.6)
α ∂σ
∂M
is an operator on the super-disc. We should also note that the descent relation among
D-brane tensions in the D D̄ system can be obtained in the same manner as that we have
shown in Section 4. The correspondence (A.4) can be explained from the calculation of
Xµ2 for the RP2 , in which we have an extra minus sign in the contributions from odd
modes.5 These contributions correspond to those of the fermions on the super-disc, while
the even modes behave like the bosonic part on the super-disc.
References
[1] K. Ohmori, A review on tachyon condensation in open string field theories, hep-th/0102085;
I.Ya. Aref’eva, D.M. Belov, A.A. Giryavets, A.S. Koshelev, P.B. Medvedev, Noncommutative field theories
and (super)string field theories, hep-th/0111208.
[2] P. Horava, Background duality of open-string models, Phys. Lett. B 231 (1989) 251;
P. Horava, Strings on world sheet orbifolds, Nucl. Phys. B 327 (1989) 461.
[3] C.G. Callan, C. Lovelace, C.R. Nappi, S.A. Yost, String loop corrections to beta functions, Nucl. Phys. B 288
(1987) 525.
[4] C.G. Callan, C. Lovelace, C.R. Nappi, S.A. Yost, Adding holes and crosscaps to the superstring, Nucl. Phys.
B 293 (1987) 83.
[5] C.G. Callan, C. Lovelace, C.R. Nappi, S.A. Yost, Loop corrections to superstring equations of motion, Nucl.
Phys. B 308 (1988) 221.
[6] A. Fujii, H. Itoyama, Some computation on g function and disc partition function and boundary string field
theory, hep-th/0105247.
[7] T. Lee, Tachyon condensation, boundary state and noncommutative solitons, Phys. Rev. D 64 (2001) 106004,
hep-th/0105115.
[8] E.T. Akhmedov, M. Laidlaw, G.W. Semenoff, On a modification of the boundary state formalism in off-shell
string theory, hep-th/0106033.
[9] M. Laidlaw, G.W. Semenoff, The boundary state formalism and conformal invariance in off-shell string
theory, hep-th/0112203.
[10] E. Witten, On background independent open-string field theory, Phys. Rev. D 46 (1992) 5467, hep-
th/9208027.
[11] E. Witten, Some computations in background independent off-shell string theory, Phys. Rev. D 47 (1993)
3405, hep-th/9210065.
[12] J.A. Harvey, D. Kutasov, E. Martinec, On the relevance of tachyons, hep-th/0003101.
[13] A.A. Gerasimov, S.L. Shatashvili, On exact tachyon potential in open string field theory, JHEP 0010 (2000)
034, hep-th/0009103.
[14] D. Kutasov, M. Mariño, G. Moore, Some exact results on tachyon condensation in string field theory,
JHEP 0010 (2000) 045, hep-th/0009148.
[15] D. Kutasov, M. Mariño, G. Moore, Remarks on tachyon condensation in superstring field theory, hep-
th/0010108.
[16] P. Kraus, F. Larsen, Boundary string field theory of the D D̄ system, Phys. Rev. D 63 (2001) 106004, hep-
th/0012198.
[17] T. Takayanagi, S. Terashima, T. Uesugi, Brane–antibrane action from boundary string field theory,
JHEP 0103 (2001) 019, hep-th/0012210.
[18] H. Itoyama, S. Nakamura, Extension of boundary string field theory on disc and RP2 worldsheet geometries,
hep-th/0201035.
[19] O.D. Andreev, A.A. Tseytlin, Partition function representation for the open superstring effective action:
cancellation of Möbius infinities and derivative corrections to Born–Infeld Lagrangian, Nucl. Phys. B 311
(1988) 205.
5 See the third term of the first line on the right-hand side of (3.21).
[20] A.A. Tseytlin, Möbius infinity subtraction and effective action in sigma model approach to closed string
theory, Phys. Lett. B 208 (1988) 221.
[21] A.A. Tseytlin, Conditions of Weyl invariance of two-dimensional sigma model from equations of stationarity
of ‘central charge’ action, Phys. Lett. B 194 (1987) 63.
[22] A.A. Tseytlin, Sigma model approach to string theory effective actions with tachyons, J. Math. Phys. 42
(2001) 2854, hep-th/0011033.
[23] S. Nakamura, Closed-string tachyon condensation and the on-shell effective action of open-string tachyons,
Prog. Theor. Phys. 106 (2001) 989, hep-th/0105054.
[24] T. Kugo, T. Takahashi, Unoriented open-closed string field theory, Prog. Theor. Phys. 99 (1998) 649, hep-
th/9711100.
[25] A. Sen, F-theory and orientifolds, Nucl. Phys. B 475 (1996) 562, hep-th/9605150.
[26] A. Dabholkar, On condensation of closed-string tachyons, hep-th/0109019.
[27] J. Polchinski, E. Witten, Evidence for heterotic—type I string duality, Nucl. Phys. B 460 (1996) 525, hep-
th/9510169.
[28] Y. Arakane, H. Itoyama, H. Kunitomo, A. Tokura, Infinity cancellation, type I compactification and string
S-matrix functional, Nucl. Phys. B 486 (1997) 149, hep-th/9609151.
[29] S. Nakamura, Dirichlet boundary conditions in generalized Liouville theory toward a QCD string, Prog.
Theor. Phys. 104 (2000) 809, hep-th/0004172.
[30] A. Adams, J. Polchinski, E. Silverstein, Don’t panic! Closed string tachyons in ALE spacetimes, JHEP 0110
(2001) 029, hep-th/0108075.
[31] C. Vafa, Mirror symmetry and closed string tachyon condensation, hep-th/0111051.
[32] J.A. Harvey, D. Kutasov, E.J. Martinec, G. Moore, Localized tachyons and RG flows, hep-th/0111154.
[33] A. Dabholkar, C. Vafa, tt ∗ geometry and closed string tachyon potential, JHEP 0202 (2002) 008, hep-
th/0111155.
Nuclear Physics B 644 (2002) 263–289
www.elsevier.com/locate/npe
Abstract
We perform a comprehensive study of the dominant two- and higher-loop contributions to the
$^{205}$Tl, neutron and muon electric dipole moments induced by Higgs bosons, third-generation quarks
and squarks, charginos and gluinos in the Minimal Supersymmetric Standard Model (MSSM). We
find that strong correlations exist among the contributing CP-violating operators, for large stop,
gluino and chargino phases, and for a wide range of values of tan β and charged Higgs-boson masses,
giving rise to large suppressions of the $^{205}$Tl and neutron electric dipole moments below their present
experimental limits. Based on this observation, we discuss the constraints that the non-observation
of electric dipole moments imposes on the radiatively-generated CP-violating Higgs sector and
on the mechanism of electroweak baryogenesis in the MSSM. We improve previously suggested
benchmark scenarios of maximal CP violation for analyzing direct searches of CP-violating MSSM
Higgs bosons at high-energy colliders, and stress the important complementary rôle that a possible
high-sensitivity measurement of the muon electric dipole moment to the level of $10^{-24}$ e cm can
play in such analyses.
2002 Elsevier Science B.V. All rights reserved.
1. Introduction
The non-observation of electric dipole moments (EDMs) of the thallium atom and
neutron, as well as the absence of large flavour-changing neutral-current (FCNC) decays
0550-3213/02/$ – see front matter 2002 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 ( 0 2 ) 0 0 8 2 6 - X
264 A. Pilaftsis / Nuclear Physics B 644 (2002) 263–289
put severe constraints on the parameters of a theory. Especially, these constraints become
even more severe for supersymmetric theories, such as the MSSM, in which too large
FCNC and CP-violating effects are generically predicted at the one-loop level, resulting
in gross violations with experimental data. A possible resolution of such FCNC and CP
crises, often considered in the literature [1], makes use of the decoupling properties of the
heavy squarks and sleptons of the first two generations, whose masses should be larger than
∼ 10 TeV. Thus, for sufficiently heavy squarks and sleptons, the one-loop predictions for
FCNC and EDM observables can be suppressed up to levels compatible with experiment.
Also, such a solution poses no serious problem to the gauge hierarchy, as long as the
first two generations of squarks and sleptons are not much heavier than 10 TeV. In this
case, because of their suppressed Yukawa couplings, the radiative effect of the first two
generations of sfermions on the Higgs-boson mass spectrum is still negligible, with respect
to that of TeV scalar top and bottom quarks.1
Recently, it has been shown [2,3] that even third-generation squarks may lead by
themselves to observable effects on the electron and neutron EDMs through Higgs-
boson-mediated two-loop graphs of the Barr–Zee type [4]. This observation offers new
possibilities to probe the CP-violating soft-supersymmetry-breaking parameters related to
the third-generation squarks. Most interestingly, the same CP-violating parameters may
induce radiatively a CP-noninvariant Higgs-sector [5–8], leading to novel signatures at
high-energy colliders [8–11]. It is then obvious that EDM constraints do have important
implications for the phenomenological predictions within the above framework of the
MSSM with explicit CP violation. Moreover, employing upper limits on EDMs, one is,
in principle, able to derive constraints on the phase of the SU(2)$_L$ gaugino mass, $m_{\widetilde W}$,
which plays a central rôle in electroweak baryogenesis [12] in the MSSM [13,14].
On the experimental side, the current upper limit on the electron EDM de , as derived
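from the absence of a permanent atomic EDM for $^{205}$Tl, has improved by a factor of almost
2 over the last few years [15,16]. Specifically, the reported 2σ upper limit on a thallium
EDM is [16]
\[ |d_{\mathrm{Tl}}| \lesssim 1.3 \times 10^{-24}\ [e\,\mathrm{cm}]. \tag{1.1} \]
Then, the electron EDM $d_e$ may be deduced indirectly by means of the effective Lagrangian
\[ \mathcal{L}_{\mathrm{EDM}} = -\frac{1}{2}\, d_e\, \bar e \sigma_{\mu\nu} i\gamma_5 e\, F^{\mu\nu} + C_S\, \bar N N\, \bar e i\gamma_5 e + C_P\, \bar N i\gamma_5 N\, \bar e e + C_T\, \bar N \sigma_{\mu\nu} i\gamma_5 N\, \bar e \sigma^{\mu\nu} e + \cdots, \tag{1.2} \]
where $C_S$, $C_P$, $C_T$ and the ellipses denote CP-violating operators of dimension 6 and
higher. With the aid of the effective Lagrangian (1.2), the atomic EDM of $^{205}$Tl may be
computed by [17–19]
\[ d_{\mathrm{Tl}}\ [e\,\mathrm{cm}] = -585 \times d_e\ [e\,\mathrm{cm}] + 8.5 \times 10^{-19}\ [e\,\mathrm{cm}] \times \bigl( C_S\ \mathrm{TeV}^{-2} \bigr) - 8.0 \times 10^{-22}\ [e\,\mathrm{cm}] \times \bigl( C_T\ \mathrm{TeV}^{-2} \bigr) + \cdots. \tag{1.3} \]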
1 One should bear in mind that radiative effects on the neutral Higgs-boson masses are proportional to the
fourth power of Yukawa couplings. A simple estimate indicates that the contribution of the second generation of
sfermions is smaller, by a factor of at least 10−7 , than those of the third generation.
In (1.3), the dots denote CP-odd operators of dimension 7 and higher. In our analysis,
we will assume that like CT , the CP-odd operators of dimension 7 and higher give rise
generically to negligible effects on the $^{205}$Tl EDM. Moreover, although the contributions
of the neglected CP-odd operators to other heavy atoms may be comparable to that of de ,
the experimental upper limits are still much weaker than dTl , by at least one order of
magnitude. Consequently, we will only analyze predictions for the thallium EDM dTl
and consider only two operators: the electron EDM de and the CP-odd electron–nucleon
operator CS . From (1.1) and (1.3), it is then not difficult to deduce the following 2σ upper
limits on these two CP-odd operators:
\[ |d_e| \lesssim 2.2 \times 10^{-27}\ [e\,\mathrm{cm}], \qquad |C_S| \lesssim 1.5 \times 10^{-6}\ \mathrm{TeV}^{-2}. \tag{1.4} \]
In the MSSM under study, the contributions from de and CS to dTl can be of comparable
size and therefore cannot be treated independently. In fact, depending on their relative
sign, one may increase or reduce the EDM bounds on the CP-violating parameters of the
theory. Here, the proposed high-sensitivity measurement of the muon EDM dµ to the level
$10^{-24}$ e cm [20] may offer new constraints complementary to those obtained by $d_{\mathrm{Tl}}$, since
CS and all higher-dimensional CP-odd operators are absent.
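The step from (1.1) and (1.3) to the single-operator limits (1.4) is simple arithmetic, sketched below. The coefficients are the ones quoted in (1.3); the constant and function names are illustrative, and the cancellation example at the end is a hypothetical choice of values meant only to show how the relative sign of the two terms matters.

```python
# Inputs quoted in the text: the 2-sigma limit (1.1) and the coefficients of (1.3).
D_TL_LIMIT = 1.3e-24      # |d_Tl| upper limit in e cm
COEFF_DE = -585.0         # multiplies d_e (in e cm)
COEFF_CS = 8.5e-19        # multiplies C_S (in TeV^-2), result in e cm


def single_operator_bounds():
    """Upper limits on |d_e| and |C_S| when each operator is switched on alone."""
    d_e_max = D_TL_LIMIT / abs(COEFF_DE)
    c_s_max = D_TL_LIMIT / abs(COEFF_CS)
    return d_e_max, c_s_max


def d_tl(d_e: float, c_s: float) -> float:
    """Thallium EDM from the two retained operators of (1.3), neglecting C_T and higher terms."""
    return COEFF_DE * d_e + COEFF_CS * c_s


if __name__ == "__main__":
    print(single_operator_bounds())
    # Hypothetical values chosen so that the two contributions cancel.
    d_e = 2.2e-27
    print(d_tl(d_e, 585.0 * d_e / 8.5e-19))
```

When both operators are present with comparable size, the individual limits no longer apply separately, because only the combination entering $d_{\mathrm{Tl}}$ is constrained.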
Unlike the thallium EDM, the upper limit on the neutron EDM dn is less severe, i.e.,
2 Alternatively, one-loop EDM contributions can be suppressed if the CP phases of the trilinear soft-Yukawa
couplings of the first two generations and the CP phases of the Wino $\widetilde W$, Bino $\widetilde B$ and gluino $\tilde g$ are all zero, with
Let us first study the contribution of the CP-odd electron–nucleon operator CS [17–19]
to the $^{205}$Tl EDM. At the elementary particle level, $C_S$ can be induced by two types of
CP-odd operators in supersymmetric theories: $\bar e i\gamma_5 e\, \bar q q$ [40] and $\bar e i\gamma_5 e\, \tilde q^{*} \tilde q$, where $q$ and $\tilde q$
denote quark and squark fields, respectively. In the MSSM, the above two CP-odd operators
of dimensions 6 and 5 are generated by interactions involving Higgs scalar–pseudoscalar
mixing and CP-violating vertex effects, as those shown in Fig. 1.
However, not all quarks and squarks can give rise to potentially large contributions to
the $^{205}$Tl EDM. Our interest is to consider only enhanced Yukawa and trilinear couplings of
the Higgs bosons to quarks and squarks in the decoupling limit of the first two generations of
squarks. This criterion singles out the CP-odd operators related to top and bottom quarks,
and their supersymmetric partners. In fact, as is shown in Fig. 1, heavy quarks and squarks
do not contribute directly to the CP-odd operator CS , but only through the loop-induced
Bµ and µ being positive according to our CP conventions. In this case, however, if the first two generations of
sfermions are relatively light, e.g., few hundreds of GeV, then additional two-loop EDM graphs [36] exist, such
as those induced by a gluino CEDM, which give non-negligible contributions to the EDMs. Furthermore, there
are two-loop EDM effects induced by a CP-odd γ W + W − operator, which do not decouple in the limit of heavy
squarks and do not depend on Higgs-boson masses [37]. These two-loop EDM contributions are subdominant,
yielding an electron EDM term typically smaller than 10−27 e cm.
Fig. 1. Feynman graphs contributing to a non-vanishing CP-odd electron–nucleon operator CS . At the elementary
particle level, $C_S$ is predominantly induced by quantum effects involving (a) t-, b-quarks and (b) $\tilde t$-, $\tilde b$-squarks.
Blobs and heavy dots denote resummation of self-energy and vertex graphs, respectively.
Higgs-gluon–gluon couplings Hi gg, after they have been integrated out. Thus, the effective
Lagrangian responsible for generating CS is
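$$\mathcal{L}^{(C_S)}_{\rm eff} = \frac{g_w}{2M_W} \sum_{i=1}^{3} H_i \left( g_{H_i gg}\, \frac{\alpha_s}{8\pi}\, G^{a,\mu\nu} G^a_{\mu\nu} + m_e \tan\beta\, O_{3i}\, \bar{e}\, i\gamma_5 e \right), \tag{2.1}$$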
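where $M_W = g_w v/2$, $O$ is the 3 × 3 mixing matrix that relates the weak to mass eigenstates
of the CP-violating Higgs bosons [6,8], and
$$g_{H_i gg} = \sum_{q=t,b} \left\{ \frac{2}{3}\, g^S_{H_i qq} + \frac{v^2}{6\, m^2_{\tilde q_1} m^2_{\tilde q_2}} \left[ \left( m^2_{\tilde q_2} - m^2_{\tilde q_1} \right) \xi_q^{(H_i)} + \left( m^2_{\tilde q_1} + m^2_{\tilde q_2} \right) \zeta_q^{(H_i)} \right] \right\}. \tag{2.2}$$
In (2.2), the dimensionless coefficients $g^S_{H_i qq}$, $\xi_q^{(H_i)}$, $\zeta_q^{(H_i)}$ and the stop and sbottom masses
are given in Appendix A.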
The largest contribution to the coupling parameter $g_{H_i gg}$ comes from the scalar part of
the $H_i \bar{b} b$ coupling, $g^S_{H_i bb}$. More explicitly, there are two CP-violating effects that dominate
$g^S_{H_i bb}$: (i) the $\tan^2\beta$-enhanced threshold effects [40] described by the term
$$g^S_{H_i bb} \sim \mathrm{Im}\left[ \frac{(\Delta h_b/h_b) \tan^2\beta}{1 + (\delta h_b/h_b) + (\Delta h_b/h_b) \tan\beta} \right] O_{3i}, \tag{2.3}$$
and (ii) the scalar–pseudoscalar mixing effects contained in the mixing matrix elements $O_{1i}$.
The definition of the quantities $\delta h_b/h_b$ and $\Delta h_b/h_b$ may be found in
Appendix A.
At this stage, it is important to observe that if $(\Delta h_b/h_b) \tan\beta \gtrsim 1$, the $\tan^2\beta$-dependence
of the CP-violating threshold effects on $g^S_{H_i bb}$ and $g^P_{H_i bb}$ is considerably
modified. In particular, in the large $\tan\beta$ limit, $g^S_{H_i bb}$ and $g^P_{H_i bb}$ asymptotically approach a
$\tan\beta$-independent constant, i.e.,
$$g^S_{H_i bb} \to \mathrm{Im}\left[ \frac{1 + (\delta h_b/h_b)}{(\Delta h_b/h_b)} \right] O_{3i}, \tag{2.4}$$
$$g^P_{H_i bb} \to \mathrm{Im}\left[ \frac{1 + (\delta h_b/h_b)}{(\Delta h_b/h_b)} \right] O_{1i}. \tag{2.5}$$
Although the above limits may only be attainable in a very large $\tan\beta$ and quasi-nonperturbative
regime of the theory, the onset of a $\tan\beta$-independent behaviour in $g^S_{H_i bb}$
and $g^P_{H_i bb}$ may already start from moderately large values of $\tan\beta$, i.e., for $\tan\beta \gtrsim 30$.
Consequently, the limits (2.4) and (2.5) should be regarded as upper bounds on the
CP-violating threshold-enhanced parts of the coupling parameters $g^S_{H_i bb}$ and $g^P_{H_i bb}$. In
our numerical analysis in Section 4, we properly take into account the above-described
CP-violating resummation effects on $g^S_{H_i bb}$.
The computation of the CP-odd electron–nucleon operator CS can now be performed by
utilizing standard QCD techniques based on the trace anomaly of the energy-momentum
tensor [42]. In the chiral quark mass limit, we then have the simple relation
αs
Observe that the operator $C_S$ exhibits an enhanced $\tan^3\beta$ dependence [40]; it, therefore,
becomes very significant for intermediate and large values of tan β. Numerical estimates
for this contribution to a thallium EDM will be presented in Section 4.
We now turn our attention to Higgs-boson two-loop effects [2,3] on the electron EDM
analogous to those first discussed by Barr and Zee [4] in non-supersymmetric theories. As
is shown in Fig. 2, these two-loop EDM effects originate predominantly from graphs that
involve: stop and sbottom squarks (Fig. 2(a,b)) [2], top and bottom quarks (Fig. 2(c)), and
charginos (Fig. 2(d)) [41].
Strictly speaking, the original Barr–Zee graphs induced by top and bottom quarks
in Fig. 2(c) appear beyond the two-loop approximation in the MSSM. However, it is a
formidable task to analytically compute the complete set of the relevant three- and higher-
loop graphs. Therefore, we consider only a subset of higher-loop corrections, in which
the dominant CP-violating terms in the Higgs-boson propagators and the Higgs-quark-
quark vertices are resummed. Such an approach should only be viewed as an effective one,
which is expected to capture the main bulk of the higher-order effects. In the same vein, we
improve previous two-loop EDM calculations related to third-generation squarks [2] and
charginos [41] by resumming dominant CP-violating self-energy terms in the Higgs-boson
propagators.
In the context of the aforementioned resummation approach, the dominant Higgs-boson
two-loop contributions to electron EDM are individually found to be
Fig. 2. Dominant Higgs-boson two-loop contributions to EDM of a light fermion f = e, µ, d in the MSSM
with explicit CP violation (mirror-symmetric graphs are not displayed). Heavy dots indicate resummation of
self-energy and vertex graphs. Two-loop graphs generating a CEDM for a d-quark are also shown.
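$$\left( \frac{d_e}{e} \right)^{(a,b)} = \frac{3\alpha_{\rm em}}{32\pi^3}\, m_e \sum_{i=1}^{3} \frac{g^P_{H_i ee}}{M^2_{H_i}} \sum_{q=t,b} Q_q^2 \left\{ \xi_q^{(H_i)} \left[ F\!\left( \frac{m^2_{\tilde q_1}}{M^2_{H_i}} \right) - F\!\left( \frac{m^2_{\tilde q_2}}{M^2_{H_i}} \right) \right] + \zeta_q^{(H_i)} \left[ F\!\left( \frac{m^2_{\tilde q_1}}{M^2_{H_i}} \right) + F\!\left( \frac{m^2_{\tilde q_2}}{M^2_{H_i}} \right) \right] \right\}, \tag{3.1}$$
$$\left( \frac{d_e}{e} \right)^{(c)} = - \frac{3\alpha^2_{\rm em}}{8\pi^2 \sin^2\theta_w}\, \frac{m_e}{M_W^2} \sum_{i=1}^{3} \sum_{q=t,b} Q_q^2 \left[ g^P_{H_i ee}\, g^S_{H_i qq}\, f\!\left( \frac{m_q^2}{M^2_{H_i}} \right) + g^S_{H_i ee}\, g^P_{H_i qq}\, g\!\left( \frac{m_q^2}{M^2_{H_i}} \right) \right], \tag{3.2}$$
$$\left( \frac{d_e}{e} \right)^{(d)} = - \frac{\alpha^2_{\rm em}}{8\sqrt{2}\, \pi^2 \sin^2\theta_w}\, \frac{m_e}{M_W} \sum_{i=1}^{3} \sum_{j=1,2} \frac{1}{m_{\chi_j^+}} \left[ g^P_{H_i ee}\, a_{H_i \chi_j^- \chi_j^+}\, f\!\left( \frac{m^2_{\chi_j^+}}{M^2_{H_i}} \right) + g^S_{H_i ee}\, b_{H_i \chi_j^- \chi_j^+}\, g\!\left( \frac{m^2_{\chi_j^+}}{M^2_{H_i}} \right) \right], \tag{3.3}$$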
where $g^P_{H_i ee} = -\tan\beta\, O_{3i}$, $g^S_{H_i ee} = O_{1i}/\cos\beta$, and
$$F(z) = \int_0^1 dx\, \frac{x(1-x)}{z - x(1-x)}\, \ln\left[ \frac{x(1-x)}{z} \right], \tag{3.4}$$
$$f(z) = \frac{z}{2} \int_0^1 dx\, \frac{1 - 2x(1-x)}{x(1-x) - z}\, \ln\left[ \frac{x(1-x)}{z} \right], \tag{3.5}$$
$$g(z) = \frac{z}{2} \int_0^1 dx\, \frac{1}{x(1-x) - z}\, \ln\left[ \frac{x(1-x)}{z} \right] \tag{3.6}$$
are two-loop functions. The coupling coefficients $g^S_{H_i qq}$, $g^P_{H_i qq}$, $a_{H_i \chi_j^- \chi_j^+}$ and $b_{H_i \chi_j^- \chi_j^+}$ in
(3.1)–(3.3), as well as the squark and chargino masses, are given in Appendix A. Eq. (3.1)
takes on the simpler analytic form presented in [2], if only the CP-odd component $a$ of the
Higgs bosons is considered in an unresummed two-loop calculation of the EDM. In this
case, the coefficients $\zeta_q^{(H_i)}$ vanish and $\xi_q^{(H_i)}$ simplifies to
below the TeV scale. The CEDM of the d quark may be obtained from (3.1) and (3.2), if
one replaces the colour factor 3 by 1/2, and $\alpha_{\rm em} Q_q^2$ by $\alpha_s$. The computation of the neutron
EDM dn involves a number of hadronic uncertainties, when the EDMs are translated from
the quark to the hadron level [25]. For example, considering the valence-quark model and
renormalization-group running effects from the electroweak scale MZ down to the low-
energy hadronic scale Λh [3], one may be able to establish an approximate relation between
neutron and electron EDMs. Thus, taking the input values for the involved kinematic
parameters: $m_d(\Lambda_h) = 10$ MeV, $\alpha_s(M_Z) = 0.12$ and $g_s(\Lambda_h)/(4\pi) = 1/\sqrt{6}$, we find
$$d_n \approx -10\, d_e(t, \tilde t) + 1.2\, d_e(t, \tilde t, \chi^\pm) + d_n^{3G}. \tag{3.9}$$
On the RHS of (3.9), the first and second terms arise from a d-quark CEDM and EDM,
respectively, and dn3G is the contribution to dn due to the CP-odd three-gluon operator. In
obtaining (3.9), we have made two additional approximations as well. First, we neglected
the contribution of the u-quark EDM du to dn , as it is much smaller than the d-quark EDM
$d_d$ for the relevant region $\tan\beta \gtrsim 3$. Second, we ignored the b- and b̃-quantum corrections
to dd and so to dn . Formulae (3.8) and (3.9) will be used to obtain numerical predictions
for the muon and neutron EDMs in the next section.
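The two-loop functions F(z), f(z) and g(z) of (3.4)–(3.6) have no simple closed form, so a numerical evaluation may be useful when checking the formulae of this section. The following Python sketch is illustrative only and is not the code used for the numerical analysis of this paper. It assumes a plain midpoint rule with a hypothetical helper `_integrate` and a fixed number of steps `n`; the helper `_log_ratio_over_diff` uses the fact that ln(u/z)/(u − z), with u = x(1 − x), has the finite limit 1/z at u = z, so the integrands stay regular for z < 1/4.

```python
import math


def _log_ratio_over_diff(u, z):
    """Return ln(u/z) / (u - z), using the finite limit 1/z when u == z."""
    d = u - z
    if d == 0.0:
        return 1.0 / z
    # log1p(d/z) = ln(u/z); it keeps precision when u is close to z.
    return math.log1p(d / z) / d


def _integrate(weight, z, n=20000):
    """Midpoint rule on [0, 1] for weight(u) * ln(u/z)/(u - z), with u = x(1 - x).

    The step count n is an assumption chosen for illustration, not a tuned value.
    """
    h = 1.0 / n
    total = 0.0
    for k in range(n):
        x = (k + 0.5) * h
        u = x * (1.0 - x)
        total += weight(u) * _log_ratio_over_diff(u, z)
    return total * h


def F(z):
    """Eq. (3.4): the factor x(1-x)/(z - x(1-x)) equals -u/(u - z)."""
    return -_integrate(lambda u: u, z)


def f(z):
    """Eq. (3.5)."""
    return 0.5 * z * _integrate(lambda u: 1.0 - 2.0 * u, z)


def g(z):
    """Eq. (3.6)."""
    return 0.5 * z * _integrate(lambda u: 1.0, z)
```

As a consistency check, expanding the integrands of (3.4)–(3.6) in powers of 1/z gives, for z ≫ 1, g(z) → 1 + (1/2) ln z, f(z) → 13/18 + (1/3) ln z and z F(z) → −(5/18 + (1/6) ln z); the sketch should reproduce these limits up to corrections of order 1/z and the discretization error of the midpoint rule.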
In Sections 2 and 3, we computed the dominant two-loop and the resummed higher-
loop contributions to EDMs that originate from third-generation quarks and squarks, and
charginos. Based on the derived analytic expressions, we can now analyze numerically
the impact of the experimental constraints due to the non-observation of thallium and
neutron EDMs on the CP-violating parameters of the theory, and hence on electroweak
baryogenesis and direct searches for CP-violating Higgs bosons in the MSSM. Moreover,
we will present predictions for the muon EDM dµ and discuss the implications of a possible
high-sensitivity measurement of $d_\mu$ to the level $10^{-24}$ e cm for our analyses.
Based on the observation that CP-violating quantum effects on the neutral Higgs
sector get enhanced when the product $\mathrm{Im}(\mu A_t)/M^2_{\rm SUSY}$ is large [5,6], the authors in [8,9]
introduced a benchmark scenario, called CPX, in which the effects of CP violation are
maximized. In CPX, the following values for the µ- and soft-SUSY-breaking parameters
were adopted:
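$$\tilde M_Q = \tilde M_t = \tilde M_b = M_{\rm SUSY}, \qquad \mu = 4 M_{\rm SUSY},$$
$$|A_t| = |A_b| = 2 M_{\rm SUSY}, \qquad \arg(A_{t,b}) = 90^\circ,$$
$$|m_{\tilde g}| = 1\ \mathrm{TeV}, \qquad \arg(m_{\tilde g}) = 90^\circ,$$
$$m_{\tilde W} = m_{\tilde B} = 0.3\ \mathrm{TeV}. \tag{4.1}$$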
Without loss of generality, the µ-parameter is chosen to be real. The predictions of CPX
showed [9] that even a light neutral Higgs boson with a mass as low as 60 GeV could
have escaped detection at LEP2.³ A recent experimental analysis of LEP2 data confirms
³ Similar remarks were made earlier in [6], but the LEP2 data were less restrictive then.
this observation [43]. Here, we wish to investigate the compatibility of the CPX scenario
with the experimental limits on EDMs. For this purpose, we allow variations in the gluino
phase, which enters the Higgs sector at two loops, but keep the At phase in (4.1) fixed.
In addition, we will present numerical results for EDMs, where the µ-parameter is varied
from 100 GeV to $4M_{\rm SUSY}$. Finally, we leave unspecified the phases of the gaugino mass
parameters $m_{\tilde W}$ and $m_{\tilde B}$. As we will see below, the phase of $m_{\tilde W}$ is strongly
constrained by the limits on the electron EDM.
We start our numerical analysis by presenting predicted values for the $^{205}$Tl EDM
$d_{\rm Tl}$ that arise entirely due to the CP-odd electron–nucleon operator $C_S$ and are denoted
as $d_{\rm Tl}(C_S)$. In Fig. 3, we display numerical estimates for $d_{\rm Tl}(C_S)$ as functions of $\tan\beta$ for
four different versions of the CPX scenario with $M_{\rm SUSY} = 1$ TeV: (a) $M_{H^+} = 150$ GeV,
$\arg(m_{\tilde g}) = 0^\circ$; (b) $M_{H^+} = 300$ GeV, $\arg(m_{\tilde g}) = 0^\circ$; (c) $M_{H^+} = 150$ GeV, $\arg(m_{\tilde g}) = 90^\circ$;
(d) $M_{H^+} = 300$ GeV, $\arg(m_{\tilde g}) = 90^\circ$. The individual $b$, $\tilde t$, $t$, $\tilde b$ contributions to $d_{\rm Tl}(C_S)$,
along with their relative signs, are indicated by different types of lines. We observe that
the largest contribution to $d_{\rm Tl}$ comes from the b-quarks for large values of $\tan\beta$, i.e.,
for $\tan\beta \gtrsim 15$, for which the CP-violating vertex effects become important (see also
the discussion in Section 2). In particular, these CP-violating threshold effects, which
crucially depend on the term $\mathrm{Im}(\Delta h_b/h_b) \tan^2\beta$ in (2.3), become even more important
for large gluino phases. Thus, the predictions for $d_{\rm Tl}(C_S)$ in Figs. 3(c) and (d), with
$\arg(m_{\tilde g}) = 90^\circ$, are one order of magnitude larger than the ones in Figs. 3(a) and (b), with
$\arg(m_{\tilde g}) = 0^\circ$.
For intermediate and smaller values of $\tan\beta$, i.e., for $\tan\beta \lesssim 15$, CP-violating self-energy
effects are significant, especially for relatively light charged Higgs bosons with
masses in the range 150–200 GeV. In fact, these effects have generically opposite sign to
the CP-violating vertex effects, giving rise to natural cancellations among the contributing
EDM terms, and so lead to smaller values of dTl (CS ). Although our numerical results are
in qualitative agreement with those in Ref. [40], we actually find noticeable quantitative
differences, when resummed CP-violating self-energy and vertex effects are considered at
the same time.
Next, we shall investigate numerically higher-order CP-violating vertex and self-energy
effects induced by t- and b-quarks on the electron EDM de . Fig. 4 shows numerical
estimates for those resummed effects on de as functions of tan β, in variants of the CPX
scenario, with (a) MH + = 150 GeV and (b) MH + = 300 GeV. In particular, we considered
three different choices of the gluino phase: $\arg(m_{\tilde g}) = 90^\circ, 0^\circ, -90^\circ$, denoted as $t_{1,2,3}$,
respectively. We find that CP-violating threshold corrections to the Hi tt coupling as small
as 5% are sufficient to lead to observable EDM values for de . In this respect, we see that
the t-quark effects strongly depend on the gluino phase through the combination Im(µmg̃ )
that occurs in Im(+ht / ht ) [cf. (A.4), (A.5)]. Thus, the t-quark contribution to de is positive
(negative) for negative (positive) gluino phases, while it is one order of magnitude smaller
and negative for vanishing gluino phases, i.e., for arg(mg̃ ) = 0◦ . For comparison, we also
included in Fig. 4 the dependence of positive stop/sbottom contributions to de [2] (long-
dash-dotted lines) on tan β. The sum of the t-, b-quark and t˜-, b̃-squark contributions
to de is given by the solid lines 1, 2, 3 for the same values of gluino phases. As before,
we indicate negative contributions to de with a minus sign. From Figs. 4(a) and (b), it
is interesting to notice that if arg(mg̃ ) = 90◦ in CPX, a cancellation between the t-quark
Fig. 3. Numerical estimates of 205 Tl EDM dTl induced by the CP-odd electron–nucleon operator CS as functions
of tan β, in four selected CPX scenarios with MSUSY = 1 TeV. The values of the CPX parameters are given
in (4.1). The individual b, t˜, t, b̃ contributions to dTl (CS ), along with their relative signs, are also displayed.
and t˜-squark EDM contributions occurs for almost the entire range of the perturbatively
allowed tan β values and for all phenomenologically viable charged Higgs-boson masses.
As a consequence of such a cancellation, the electron EDM $d_e$ is always smaller than the
current 2σ experimental limit on $d_e$, i.e., $d_e < 2.2 \times 10^{-27}$ e cm, even for large $\tan\beta$ values
up to 30. As we will see below, this prediction may considerably change if contributions
from the CP-violating operator CS or chargino two-loop effects are considered.
In order to further gauge the importance of the t-quark two-loop EDM effects, we
present in Fig. 5 numerical values for de versus the µ-parameter for tan β = 20, and
for two charged Higgs-boson masses: (a) MH + = 150 GeV and (b) MH + = 300 GeV.
The soft-SUSY-breaking parameters are chosen as given in (4.1) for MSUSY = 1 TeV,
Fig. 4. Numerical estimates of resummed Higgs-boson two-loop effects on de , induced by t-, b-quarks and
t˜-, b̃-squarks, as functions of tan β, in two variants of the CPX scenario, with (a) MH + = 150 GeV and
(b) MH + = 300 GeV. The long-dash-dotted lines indicate the stop/sbottom contributions to de . The dotted lines
t1,2,3 correspond to top/bottom contributions, for arg(mg̃ ) = 90◦ , 0◦ , −90◦ , respectively. Likewise, the solid
lines 1, 2, 3 give the sum of all the aforementioned contributions to de for the same values of gluino phases.
Contributions to de that are denoted with a minus sign are negative.
except for the µ-parameter, which has been varied from 0.1–4 TeV. For the sake of
comparison, we also included the Higgs-boson two-loop EDM effects induced by t˜- and
b̃-squarks. The meaning of the various types of lines is exactly the same as those in Fig. 4.
Remarkably enough, we find that even µ values as low as 500 GeV may be sufficient
to lead to an electron EDM at the observable level through the original two-loop Barr–Zee
graph in Fig. 2(c). In this context, we also observe that the resummed Higgs-boson
two-loop contributions to $d_e$ from t-quarks are comparable to, and can even be larger than, those
coming from $\tilde t$-squarks for maximal gluino phases. In fact, if $A_{t,b} = 0$, the $\tilde t$-squark and
dominant CP-violating Higgs-mixing effects may be completely switched off, without
Fig. 5. Numerical values of resummed Higgs-boson two-loop effects on de , induced by t-, b-quarks and t˜-,
b̃-squarks, as functions of µ, in two variants of the CPX scenario, with tan β = 20, and (a) MH + = 150 GeV
and (b) MH + = 300 GeV. The meaning of the different line types is identical to that of Fig. 4. For At,b = 0, the
long-dash-dotted line disappears and so the solid lines collapse to the dotted ones.
much affecting the corresponding t-quark two-loop contributions to de . Note that in this
case the t-quark effects on de and the b-quark effects on the CS operator, which both
formally arise at the two-loop level, are proportional to Im(µmg̃ ). Therefore, they turn out
to be strongly correlated and their combined contribution to dTl should carefully be taken
into account (see also discussions of Figs. 7 and 8 below).
As was already pointed out in [2,3], charginos might also contribute to the electron EDM
$d_e$ at the two-loop level. Recently, a computation of those effects appeared in [41].
The authors derived strict constraints on the CP-violating parameters of a scenario in
which electroweak baryogenesis is mediated by CP-violating currents involving chargino
interactions. Here, we re-examine this issue within a scenario that favours the above
mechanism of electroweak baryogenesis and is not in conflict with LEP2 limits on the
Higgs-boson masses and couplings. Specifically, being conservative, we require that these
be $M_{H_i} \gtrsim 111$ GeV, for $g^2_{H_i ZZ} \gtrsim 0.3$, where $g_{H_i ZZ}$ is the $H_i ZZ$ coupling given in units
of the SM $h_{\rm SM} ZZ$ coupling. In addition, we demand that $M_{H_i} + M_{H_j} \gtrsim 170$ GeV. On
the other hand, in order for electroweak baryogenesis to proceed via a sufficiently strong
first-order phase transition, the right-handed stop mass parameter $\tilde M_t$ must be rather small,
and the µ and the soft gaugino parameter $m_{\tilde W}$ must not be too large, typically smaller
than 0.5 TeV [13,14]. In particular, there is a resonant enhancement up to even 10 times the
observed baryon asymmetry, if the condition $\mu = m_{\tilde W}$ is met [14]. Further requirements
for a scenario leading to successful electroweak baryogenesis are: (i) a moderate trilinear
$A_t$-parameter in the range $0.2 \lesssim A_t/\tilde M_Q \lesssim 0.65$; (ii) a not very large $\tan\beta$ value,
$\tan\beta \lesssim 20$; (iii) a soft-SUSY-breaking parameter $\tilde M_Q$ of a few TeV, for phenomenological
reasons [14]. More explicitly, the following values for the mass parameters are employed:
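$$\tilde M_Q = 3\ \mathrm{TeV}, \qquad \tilde M_t = 0, \qquad \tilde M_b = 3\ \mathrm{TeV},$$
$$|A_t| = |A_b| = 1.8\ \mathrm{TeV}, \qquad \arg(A_{t,b}) = 0^\circ, \qquad \tan\beta \lesssim 20,$$
$$|m_{\tilde g}| = 3\ \mathrm{TeV}, \qquad \arg(m_{\tilde g}) = 0^\circ,$$
$$\mu = |m_{\tilde W}| \lesssim 0.5\ \mathrm{TeV}, \qquad \arg(m_{\tilde W}) = 90^\circ. \tag{4.2}$$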
To be able to compare our predictions with those presented in Fig. 2 of Ref. [41],
we choose in Fig. 6(a) the values: $\mu = m_{\tilde W} = 0.2$ TeV and $M_{H^+} = 170$ GeV. Since
CP-violating Higgs-mixing effects in the mass spectrum are generically small for the
chosen values of the parameters in (4.2), our mass input $M_{H^+} = 170$ GeV corresponds to
$M_{\text{‘A’}} \approx 150$ GeV for the mass of the almost CP-odd Higgs scalar A. Even though, on a
qualitative level, our numerical results on the linear increase of $d_e$ with $\tan\beta$ agree
with those reported in [41], the actual functional dependences of the individual ‘h’, ‘H’,
‘A’ contributions to $d_e$ on $\tan\beta$ differ significantly. Unlike [41], we find in Fig. 6(a) that
for $\tan\beta \gtrsim 5$, the $\tan\beta$-enhanced effect on $d_e$ originates from the heavier Higgs bosons
‘H’ and ‘A’, while the EDM contribution due to the lightest Higgs boson ‘h’ is almost
negligible.⁴ Since the size of $d_e$ is set by the heavier Higgs-boson masses, i.e., by $M_{H^+}$, and
by µ and $m_{\tilde W}$, our predictions are rather robust under the different choices of the remaining
soft-SUSY-breaking parameters. Moreover, although our numerical values for the total
contribution to $d_e$ agree very well with [41] for $\tan\beta = 2$ ($d_e \approx 0.63 \times 10^{-26}$ e cm),
they are smaller by ∼20% for $\tan\beta = 6$, i.e., we find $d_e \approx 1.62 \times 10^{-26}$ e cm, which
should be compared with $d_e \approx 2 \times 10^{-26}$ e cm. Finally, the electroweak baryogenesis
scenario (4.2) in the low $\tan\beta$ region, $\tan\beta \lesssim 6$, which is studied by the authors in [41],
appears to be highly disfavoured by LEP2 data. In this respect, a phenomenologically
viable model, with $M_{H^+} = 170$ GeV, would require larger values of $\tan\beta$, i.e., $\tan\beta \gtrsim 9$.
In this case, one has to consider a factor of 10 suppression in the chargino phase, such
that the chargino two-loop EDM effects are reduced to a level close to the experimental
upper limit on $d_e$. Consequently, if no cancellations are assumed with possible one-loop
EDM terms, then a model with suppressed chargino phase of ∼5° and a relatively light
⁴ The fact that only the ‘H’ and ‘A’ contributions to $d_e$ exhibit a linearly enhanced dependence on $\tan\beta$ may also
be verified independently by a flavour-graph analysis.
Fig. 6. $d_e$ versus $\tan\beta$ in a scenario favoured by electroweak baryogenesis, with MSSM parameters
$\tilde M_Q = \tilde M_D = 3$ TeV, $\tilde M_U = 0$, $A_{t,b} = 1.8$ TeV, $m_{\tilde g} = 3$ TeV and $\arg(A_{t,b}) = \arg(m_{\tilde g}) = 0^\circ$. In (a),
$M_{H^+} = 170$ GeV is used, corresponding to $M_{\text{‘A’}} \approx 150$ GeV, and $m_{\tilde W} = \mu = 0.2$ TeV and $\arg(m_{\tilde W}) = 90^\circ$.
Also displayed are the individual ‘h’, ‘H’, ‘A’ contributions to $d_e$ and the LEP excluded region from direct
Higgs-boson searches. In (b), numerical values are shown for $M_{H^+} = 150$ GeV (solid), 200 GeV (dashed),
300 GeV (dotted), 500 GeV (dash-dotted) and 1 TeV (long-dash-dotted), in a scenario with $m_{\tilde W} = \mu = 0.4$ TeV
and $\arg(m_{\tilde W}) = 90^\circ$.
charged Higgs boson, $M_{H^+} = 150$–200 GeV, might still be able to account for the
observed baryon asymmetry in the Universe, provided the aforementioned resonant factor
of 10 is used. However, the above situation may be considerably relaxed for larger values of
$M_{H^+}$, since the chargino two-loop EDM effect on $d_e$ decreases approximately as $1/M_{H^+}$
as $M_{H^+}$ increases. This dependence of $d_e$ on $M_{H^+}$ can explicitly be seen in the lower
panel (b) of Fig. 6, for increasing charged Higgs-boson masses: $M_{H^+} = 150$ GeV (solid),
200 GeV (dashed), 300 GeV (dotted), 500 GeV (dash-dotted) and 1 TeV (long-dash-dotted),
in a scenario with $m_{\tilde W} = \mu = 0.4$ TeV and $\arg(m_{\tilde W}) = 90^\circ$.
Fig. 7. EDMs $d_{\rm Tl}$, $d_n$ and $d_\mu$ as functions of $\tan\beta$ in two versions of the CPX scenario:
(a) $\arg(m_{\tilde g}) = \arg(m_{\tilde W}) = 90^\circ$, and (b) $\arg(m_{\tilde g}) = 35^\circ$, $\arg(m_{\tilde W}) = 90^\circ$. Also shown are the different contributions,
along with their relative signs, to $d_{\rm Tl}$ from top/stop (long-dash-dotted) and chargino (dotted) Higgs-boson
two-loop graphs, as well as from the CP-odd electron–nucleon coupling $C_S$ (dashed).
In the following, we will present predictions for more realistic EDM observables, with
relatively reduced hadronic uncertainties, namely the thallium EDM dTl , the neutron EDM
dn , as well as the muon EDM dµ which was suggested to be measured with a high
sensitivity to the level of $10^{-24}$ e cm [20]. In Fig. 7, we display numerical values for $d_{\rm Tl}$, $d_n$
and $d_\mu$ as functions of $\tan\beta$ in two versions of the CPX scenario, with $M_{H^+} = 150$ GeV:
(a) $\arg(m_{\tilde g}) = \arg(m_{\tilde W}) = 90^\circ$, and (b) $\arg(m_{\tilde g}) = 35^\circ$, $\arg(m_{\tilde W}) = 90^\circ$. Fig. 7 also shows
the different contributions, along with their relative signs, to dTl from top/stop (long-dash-
dotted) and chargino (dotted) Higgs-boson two-loop graphs, as well as from the CP-odd
electron–nucleon operator CS (dashed). Note that the type of lines used to represent the
Fig. 8. Numerical values of $d_{\rm Tl}$, $d_n$ and $d_\mu$ as functions of µ for two large-$\tan\beta$ scenarios, with $\tan\beta = 40$,
$M_{\rm SUSY} = 1$ TeV, $m_{\tilde g} = 1$ TeV, $m_{\tilde W} = m_{\tilde B} = 0.3$ TeV, $\arg(m_{\tilde g}) = \arg(m_{\tilde W}) = 90^\circ$, $A_{t,b} = 2$ TeV,
$\arg(A_{t,b}) = 90^\circ$: (a) $M_{H^+} = 150$ GeV; (b) $M_{H^+} = 300$ GeV. In analogy with Fig. 7, the individual contributions
to $d_{\rm Tl}$ due to top/stop and chargino two-loop graphs and due to the $C_S$ operator are also shown.
i.e., $\arg(m_{\tilde W}) \lesssim 10^\circ$, or modest cancellations in 1 part to 10 with one-loop EDM terms
induced by the first two generations of sleptons.
5. Conclusions
To avoid the known CP and FCNC crises in the MSSM, we have considered a
framework, in which the first two generations of squarks and sleptons are heavier than
∼ 10 TeV, while the third generation is light, with masses not larger than the order of
a TeV. Within this framework of the MSSM, we have performed a systematic study of the
dominant two- and higher-loop contributions to the thallium, neutron and muon EDMs,
which are induced by b-, t-quarks, b̃-, t˜-squarks, charginos and gluinos. At present, the
most severe limits are obtained from the non-observation of a thallium EDM dTl , whereas
experimental upper limits on the neutron EDM dn are less stringent and usually constrain
large contributions from a d-quark CEDM and the CP-odd three-gluon operator. Also,
theoretical predictions for dn are plagued by a number of uncertainties while estimating
hadronic matrix elements.
The largest effects on the thallium EDM dTl result from two operators, the CP-odd
electron–nucleon operator CS and the electron EDM de . These two CP-violating op-
erators are formally induced at the two- and higher-loop levels and involve the ex-
change of CP-mixed Higgs bosons. Thus, strong constraints on the radiatively-generated
CP-violating Higgs sector of the MSSM can be derived from dTl , and hence on the analyses
for direct searches of CP-violating Higgs bosons at high-energy colliders, such as LEP2,
Tevatron and LHC [44]. In this context, we have analyzed the compatibility of an earlier
suggested benchmark scenario of maximal CP violation for LEP2 Higgs studies (CPX) [9]
with the thallium and neutron EDMs. We have observed the existence of strong correla-
tions among the different EDM terms, which enable the suppression of dTl and dn even
below the present experimental limits. Specifically, for $4 \lesssim \tan\beta \lesssim 12$ in the CPX scenario
with $M_{H^+} = 150$ GeV, the stop, gluino, and chargino phases are all allowed to take their
maximal values, i.e., $\arg(A_t) = \arg(m_{\tilde g}) = \arg(m_{\tilde W}) = 90^\circ$, without being in conflict with
EDM limits (cf. Fig. 7(a)). Most interestingly, for specific choices of the gluino phase, the
allowed range of $\tan\beta$ values compatible with EDM limits can be enlarged dramatically.
For instance, if $\arg(m_{\tilde g}) = 35^\circ$ in the aforementioned CPX scenario (see also Fig. 7(b)), the
predicted values for $25 \lesssim \tan\beta \lesssim 45$ do not contradict upper limits on thallium and neutron
EDMs. For the remaining range of $\tan\beta$ values, the obtained prediction does not exceed
the 2σ upper bound on $|d_{\rm Tl}|$ by more than a factor of ∼3. Evidently, the degree of cancellations
required between the one- and two-loop EDM terms in the CPX scenario is not excessive,
for certain choices of the gluino phase.
At this point, it is important to stress that a muon EDM $d_\mu$ measured at the $10^{-24}$ e cm
level will help to sensitively probe CP-violating regions of the MSSM parameter space
which cannot be accessed easily by measurements of the thallium and neutron EDMs.
This complementarity property is mainly a consequence of the fact that $d_\mu$ is free from
the interfering CP-odd electron–nucleon interactions described by the $C_S$ operator, which can
contribute significantly to $d_{\rm Tl}$. Unlike the neutron EDM $d_n$, $d_\mu$ does not suffer from
hadronic uncertainties. Given the absence of a signal in the measurements of |dTl | and |dn |,
one may now wonder whether a positive signal in dµ would already imply a positive signal
on g − 2 as well. This is not the case within our framework of the MSSM. If the first two
generations of sfermions are above the TeV scale, the biggest contribution to g − 2 comes
again from related two-loop Barr–Zee-type graphs. However, for phenomenologically
viable charged Higgs-boson masses $M_{H^+} \gtrsim 120$ GeV in the MSSM [43], these effects
on g − 2 are negligible [45]. Then, only post-LEP2 high-energy colliders and the proposed
BNL experiment [20] on the muon EDM dµ might be able to sensitively explore the CP-
violating parameter space of the above framework of the MSSM in a rather complementary
manner.
We have also studied the impact of EDM constraints on the mechanism of electroweak
baryogenesis induced by CP-violating chargino currents. For this purpose, we considered
a scenario in (4.2), which favours the above mechanism of electroweak baryogenesis [14].
In such a scenario, the chargino two-loop graphs of Fig. 2(d) represent the dominant
contribution to de and dTl as well. However, as we detailed in Section 4, our theoretical
predictions for de are at variance with those presented in a recent communication [41].
Moreover, we find that LEP2 direct limits on Higgs-boson masses require intermediate
and larger values of $\tan\beta$, i.e., $\tan\beta \gtrsim 6$, for a phenomenologically viable scenario of
electroweak baryogenesis. In this $\tan\beta$ regime, experimental upper limits on $|d_{\rm Tl}|$ give
rise to strict constraints, especially when no cancellations between the chargino two-loop
and one-loop EDM terms are assumed. In the latter case, the charged Higgs-boson mass
$M_{H^+}$ should be relatively large, i.e., $M_{H^+} \gtrsim 700$ GeV for $\tan\beta \gtrsim 6$ and $\arg(m_{\tilde W}) \simeq 90^\circ$.
Otherwise, for lighter charged Higgs bosons, either the chargino phase should be
suppressed by a factor of at least 10 or cancellations in 1 part to 10 with one-loop EDM
terms need to be invoked.
In our computation of the Higgs-boson loop-induced EDMs, we have considered
resummation effects of higher-order CP-conserving and CP-violating terms in Higgs-boson
self-energies and vertices. In particular, the original t-quark two-loop graph suggested by
Barr and Zee [4] occurs beyond the two-loop approximation through threshold effects in
the Hi t¯t coupling and, depending on the choice of the gluino phase, it might even compete
with the t˜-squark two-loop graph [2]. Since our resummation of higher-order terms relied
on an effective Lagrangian approach, one may worry about the relevance of other higher-
order terms present in a complete computation. At this stage, we can only offer estimates
of those possible higher-order electroweak uncertainties in the calculation of EDMs. Thus,
we have checked our results with and without resumming the Higgs-boson self-energies.
In this way, no large modifications are found in our predictions; the variation of our results
is generally less than 10% for MH + 170 GeV, and becomes even smaller, to less than
1% for MH + 200 GeV. This may be attributed to the fact that the dominant contributions
to EDMs come from the heaviest Higgs bosons, on which the relative impact of radiative
effects is less important. On the other hand, CP-violating threshold effects constitute the
main source of theoretical uncertainties in the calculation of the original Barr–Zee graph
of Fig. 2(c), as they are less controllable for low values of $\tan\beta$.⁵ In this context, we
remark that even the computation of the CP-odd three-gluon operator is haunted by relevant
higher-order electroweak uncertainties in the MSSM [30]. The Weinberg operator can be
generated in its original fashion [29] at three and higher loops which involve CP-violating
self-energy and vertex subgraphs of Higgs bosons. It then appears necessary to develop
improved techniques that would enable us to provide accurate estimates of (resummed)
higher-order terms in the calculation of EDMs. The present work is a step towards this
goal.
⁵ A crude estimate suggests that these additional higher-order effects are smaller than 20%.
Acknowledgements
The author thanks Marcela Carena and Carlos Wagner for discussions on issues related to
electroweak baryogenesis, Adam Ritz for comments on the computation of the mercury
EDM, Maxim Pospelov for clarifying remarks pertinent to [40], Darwin Chang and Wai-
Yee Keung for communications with regard to [41], and Athanasios Dedes for critical
remarks.
The couplings of the CP-mixed Higgs bosons H1,2,3 to t-, b-quarks, t˜-, b̃-squarks and
charginos χ + play a key rôle in our calculations. In this appendix we present the effective
Lagrangians describing the above interactions, after including dominant one- and two-loop
CP-even/CP-odd quantum effects on the Higgs-boson masses and their respective mixings.
Following the conventions of [8], we first write down the effective Lagrangian of the
Higgs-boson couplings to top and bottom quarks
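$$\mathcal{L}_{H \bar q q} = - \sum_{i=1}^{3} H_i \left[ \frac{g_w m_b}{2M_W}\, \bar{b} \left( g^S_{H_i bb} + i g^P_{H_i bb} \gamma_5 \right) b + \frac{g_w m_t}{2M_W}\, \bar{t} \left( g^S_{H_i tt} + i g^P_{H_i tt} \gamma_5 \right) t \right], \tag{A.1}$$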
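with [8]⁶
$$g^S_{H_i bb} = \mathrm{Re}\left[ \frac{1 + (\delta h_b/h_b)}{1 + (\delta h_b/h_b) + (\Delta h_b/h_b) \tan\beta} \right] \frac{O_{1i}}{\cos\beta} + \mathrm{Re}\left[ \frac{(\Delta h_b/h_b)}{1 + (\delta h_b/h_b) + (\Delta h_b/h_b) \tan\beta} \right] \frac{O_{2i}}{\cos\beta} + \mathrm{Im}\left[ \frac{(\Delta h_b/h_b)(\tan^2\beta + 1)}{1 + (\delta h_b/h_b) + (\Delta h_b/h_b) \tan\beta} \right] O_{3i}, \tag{A.2}$$
⁶ Here, we have also used the fact that b- and t-quark masses are positive, i.e., $\mathrm{Im}\, m_b \propto \mathrm{Im}[h_b + (\delta h_b) + (\Delta h_b) \tan\beta] = 0$
and $\mathrm{Im}\, m_t \propto \mathrm{Im}[h_t + (\delta h_t) + (\Delta h_t) \cot\beta] = 0$.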
$$ {} + {\rm Im}\left[ \frac{(\Delta h_t/h_t)(\cot^2\beta + 1)}{1 + (\delta h_t/h_t) + (\Delta h_t/h_t)\cot\beta} \right] O_{3i}, \tag{A.4} $$
where α_s = g_s²/(4π) is the SU(3)_c fine structure constant, and I(a, b, c) is the one-loop
function
$$ I(a,b,c) = \frac{ab\ln(a/b) + bc\ln(b/c) + ac\ln(c/a)}{(a-b)(b-c)(a-c)}. \tag{A.10} $$
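As a quick numerical illustration of Eq. (A.10), the following minimal sketch evaluates I(a, b, c) directly from the formula. The function name and the sample arguments are illustrative assumptions, not taken from the paper; the arguments are assumed to be distinct and positive, so the degenerate limits are not handled.

```python
import math


def one_loop_i(a: float, b: float, c: float) -> float:
    """Evaluate I(a, b, c) of Eq. (A.10) for distinct, positive arguments."""
    numerator = (a * b * math.log(a / b)
                 + b * c * math.log(b / c)
                 + a * c * math.log(c / a))
    denominator = (a - b) * (b - c) * (a - c)
    return numerator / denominator


if __name__ == "__main__":
    # Illustrative squared masses in arbitrary units (not values from the paper).
    print(one_loop_i(1.0, 2.0, 3.0))
```

The function is symmetric under any permutation of its arguments and scales as 1/λ when all three arguments are multiplied by λ, which gives two easy sanity checks on an implementation.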
Fig. 9. Effective one-loop Φ⁰_{1,2} b̄b and Φ⁰_{1,2} t̄t couplings, δh_{b,t} and Δh_{b,t}, generated by the exchange of
(a) gluinos g̃ and (b) Higgsinos h̃^±_{1,2}.
In addition, the stop and sbottom masses are given by (with q = t, b)
$$ m^2_{\tilde q_1(\tilde q_2)} = \frac12 \Big\{ \widetilde M_Q^2 + \widetilde M_q^2 + 2m_q^2 + T_z^q \cos 2\beta\, M_Z^2 \;+\!(-)\; \sqrt{ \big[ \widetilde M_Q^2 - \widetilde M_q^2 + \cos 2\beta\, M_Z^2 \big( T_z^q - 2 Q_q \sin^2\theta_w \big) \big]^2 + 4 m_q^2 |A_q - R_q \mu^*|^2 } \Big\}, \tag{A.11} $$
where Q_t (Q_b) = 2/3 (−1/3), T_z^t = −T_z^b = 1/2, R_t (R_b) = cot β (tan β), and sin²θ_w =
1 − M_W²/M_Z².
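The squark mass formula (A.11) can be sketched numerically as follows. This is only an illustration under stated assumptions: the function and argument names are hypothetical, the default values of M_Z and sin²θ_w are placeholders, and the sample soft masses, trilinear coupling and μ used in the demonstration are not taken from the paper.

```python
import math


def squark_masses_squared(mq_soft2, mr_soft2, m_q, a_q, mu, tan_beta,
                          t_z, charge, m_z=91.19, sin2_theta_w=0.2227):
    """Return (m^2 of q~_1, m^2 of q~_2) from Eq. (A.11); q~_1 is the heavier state.

    mq_soft2, mr_soft2: soft masses squared (M~_Q^2, M~_q^2); a_q and mu may be complex.
    """
    cos_2beta = (1.0 - tan_beta**2) / (1.0 + tan_beta**2)
    # R_t = cot(beta) for up-type (T_z > 0), R_b = tan(beta) for down-type.
    r_q = 1.0 / tan_beta if t_z > 0 else tan_beta
    average = mq_soft2 + mr_soft2 + 2.0 * m_q**2 + t_z * cos_2beta * m_z**2
    splitting = (mq_soft2 - mr_soft2
                 + cos_2beta * m_z**2 * (t_z - 2.0 * charge * sin2_theta_w))
    mixing = 4.0 * m_q**2 * abs(a_q - r_q * complex(mu).conjugate())**2
    root = math.sqrt(splitting**2 + mixing)
    return 0.5 * (average + root), 0.5 * (average - root)


if __name__ == "__main__":
    # Hypothetical stop-sector inputs in GeV, with a purely imaginary mu.
    heavy, light = squark_masses_squared(500.0**2, 500.0**2, 175.0, 1000.0,
                                         500j, 10.0, 0.5, 2.0 / 3.0)
    print(math.sqrt(heavy), math.sqrt(light))
```

The sum of the two eigenvalues is independent of the mixing term, which is a convenient check on any implementation.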
It is important to remark here that only the CP-violating vertex effects on $g^S_{H_i bb}$ and
$g^P_{H_i bb}$, which are proportional to Im[(Δh_b/h_b) tan²β] in (A.2) and (A.3), are enhanced for
moderately large values of tan β, i.e., 20 ≲ tan β ≲ 40. However, for very large values of
tan β, i.e., tan β ≳ 40, there is a 1/tan²β-dependent damping factor due to CP-violating
resummation effects which cancels the tan²β-enhanced factor mentioned above. As a
consequence, in the large-tan β limit, the coupling factors $g^S_{H_i bb}$ and $g^P_{H_i bb}$ approach a
tan β-independent constant. A related discussion is also given in Section 2.
Another important ingredient for our computation of two-loop EDMs is the diagonal
effective couplings of the Higgs bosons to scalar top and bottom quarks. Taking the CP-
violating Higgs-mixing effects into account, the effective Lagrangian of interest to us may
be conveniently written in the form
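$$ \mathcal{L}^{\rm diag}_{H\tilde q^*\tilde q} = \sum_{i=1}^{3} \sum_{q=t,b} H_i \left[ v\,\xi_q^{(H_i)} \left( \tilde q_1^* \tilde q_1 - \tilde q_2^* \tilde q_2 \right) + v\,\zeta_q^{(H_i)} \left( \tilde q_1^* \tilde q_1 + \tilde q_2^* \tilde q_2 \right) \right], \tag{A.12} $$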
where
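$$ \xi_t^{(H_i)} = \frac{2m_t^2}{v^2 (m^2_{\tilde t_2} - m^2_{\tilde t_1})} \left[ {\rm Im}(\mu A_t)\, \frac{O_{3i}}{\sin^2\beta} - {\rm Re}(\mu X_t)\, \frac{O_{1i}}{\sin\beta} + {\rm Re}(A_t^* X_t)\, \frac{O_{2i}}{\sin\beta} \right], \tag{A.13} $$
$$ \xi_b^{(H_i)} = \frac{2m_b^2}{v^2 (m^2_{\tilde b_2} - m^2_{\tilde b_1})} \left[ {\rm Im}(\mu A_b)\, \frac{O_{3i}}{\cos^2\beta} - {\rm Re}(\mu X_b)\, \frac{O_{2i}}{\cos\beta} + {\rm Re}(A_b^* X_b)\, \frac{O_{1i}}{\cos\beta} \right], \tag{A.14} $$
$$ \zeta_t^{(H_i)} = -\frac{2m_t^2}{v^2}\, \frac{O_{2i}}{\sin\beta} + \mathcal{O}(g_w^2, g'^2), \qquad \zeta_b^{(H_i)} = -\frac{2m_b^2}{v^2}\, \frac{O_{1i}}{\cos\beta} + \mathcal{O}(g_w^2, g'^2), \tag{A.15} $$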
with X_q = A_q − R_q μ* (q = t, b). Although we assumed $m^2_{\tilde q_1} > m^2_{\tilde q_2}$, the effective
Lagrangian (A.12) exhibits the nice feature that it is fully independent of the hierarchy
of squark masses.
Finally, we present the effective couplings of the CP-mixed Higgs bosons H1,2,3 to
charginos $\chi^+_{1,2}$ [7,47]. These may be conveniently described by the effective Lagrangian
$$ \mathcal{L}_{H\chi^+\chi^-} = -\frac{g_w}{2\sqrt2} \sum_{i=1}^{3} \sum_{j,k=1,2} H_i\, \bar\chi_j^+ \left( a_{H_i \chi_j^- \chi_k^+} + b_{H_i \chi_j^- \chi_k^+}\, i\gamma_5 \right) \chi_k^+, \tag{A.16} $$
where
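$$ \begin{aligned} a_{H_i \chi_j^- \chi_k^+} ={}& O_{1i} \left( C^{R*}_{2j} C^{L}_{1k} + C^{R}_{2k} C^{L*}_{1j} \right) + O_{2i} \left( C^{R*}_{1j} C^{L}_{2k} + C^{R}_{1k} C^{L*}_{2j} \right) \\ & - i O_{3i} \left[ \sin\beta \left( C^{R*}_{2j} C^{L}_{1k} - C^{R}_{2k} C^{L*}_{1j} \right) + \cos\beta \left( C^{R*}_{1j} C^{L}_{2k} - C^{R}_{1k} C^{L*}_{2j} \right) \right], \end{aligned} \tag{A.17} $$
$$ \begin{aligned} b_{H_i \chi_j^- \chi_k^+} ={}& i O_{1i} \left( C^{R*}_{2j} C^{L}_{1k} - C^{R}_{2k} C^{L*}_{1j} \right) + i O_{2i} \left( C^{R*}_{1j} C^{L}_{2k} - C^{R}_{1k} C^{L*}_{2j} \right) \\ & + O_{3i} \left[ \sin\beta \left( C^{R*}_{2j} C^{L}_{1k} + C^{R}_{2k} C^{L*}_{1j} \right) + \cos\beta \left( C^{R*}_{1j} C^{L}_{2k} + C^{R}_{1k} C^{L*}_{2j} \right) \right]. \end{aligned} \tag{A.18} $$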
In the above, C^R and C^L are 2 × 2 unitary matrices, which diagonalize the chargino mass
matrix
$$ M_C = \begin{pmatrix} m_{\widetilde W} & g_w \phi_2^{0*} \\ g_w \phi_1^0 & \mu \end{pmatrix}, \tag{A.19} $$
with φ₁⁰ = v₁/√2 and φ₂⁰* = v₂/√2, through the bi-unitary transformation
$$ C^{R\dagger} M_C C^L = {\rm diag}(m_{\chi_1^+}, m_{\chi_2^+}). \tag{A.20} $$
The analytic expressions for the mixing matrices C^{L,R} are quite lengthy in the
presence of CP violation and will not be presented here; they can be computed using
standard techniques [7].
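For completeness, we give the corresponding effective couplings of the would-be
Goldstone boson G⁰ to charginos $\chi^+_{1,2}$:
$$ a_{G^0 \chi_j^- \chi_k^+} = i\cos\beta \left( C^{R*}_{2j} C^{L}_{1k} - C^{R}_{2k} C^{L*}_{1j} \right) - i\sin\beta \left( C^{R*}_{1j} C^{L}_{2k} - C^{R}_{1k} C^{L*}_{2j} \right), $$
$$ b_{G^0 \chi_j^- \chi_k^+} = -\cos\beta \left( C^{R*}_{2j} C^{L}_{1k} + C^{R}_{2k} C^{L*}_{1j} \right) + \sin\beta \left( C^{R*}_{1j} C^{L}_{2k} + C^{R}_{1k} C^{L*}_{2j} \right). \tag{A.22} $$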
A non-trivial consistency check for the correctness of our analytic results is the vanishing
of the diagonal scalar couplings of the G⁰ boson to charginos, i.e., $a_{G^0 \chi_j^- \chi_j^+} = 0$.
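The consistency check quoted above can be reproduced numerically. The sketch below builds the chargino mass matrix of Eq. (A.19), obtains C^L from the eigenvectors of M_C†M_C and C^R from M_C C^L, and evaluates the diagonal scalar G⁰ couplings. It is an illustration under assumptions: the tree-level relation g_w v_{1,2}/√2 = √2 M_W (cos β, sin β) is used for the off-diagonal entries, the numerical inputs (wino mass, complex μ, tan β, M_W) are hypothetical, and the 2 × 2 diagonalization is done by hand rather than by the "standard techniques" of Ref. [7].

```python
import cmath
import math


def chargino_mixing(m_wino, mu, tan_beta, m_w=80.4):
    """Return (CL, CR, masses) such that CR^dagger M_C CL = diag(masses), cf. Eq. (A.20)."""
    beta = math.atan(tan_beta)
    m = [[complex(m_wino), complex(math.sqrt(2) * m_w * math.sin(beta))],
         [complex(math.sqrt(2) * m_w * math.cos(beta)), complex(mu)]]
    # Hermitian matrix H = M^dagger M; its eigenvectors form the columns of CL.
    h11 = abs(m[0][0])**2 + abs(m[1][0])**2
    h22 = abs(m[0][1])**2 + abs(m[1][1])**2
    h12 = m[0][0].conjugate() * m[0][1] + m[1][0].conjugate() * m[1][1]
    half_trace = 0.5 * (h11 + h22)
    disc = math.sqrt((0.5 * (h11 - h22))**2 + abs(h12)**2)
    cl = [[0j, 0j], [0j, 0j]]
    cr = [[0j, 0j], [0j, 0j]]
    masses = []
    for j, lam in enumerate((half_trace - disc, half_trace + disc)):  # lighter state first
        v = (h12, complex(lam - h11))          # eigenvector of H (requires h12 != 0)
        norm = math.sqrt(abs(v[0])**2 + abs(v[1])**2)
        v = (v[0] / norm, v[1] / norm)
        mass = math.sqrt(lam)
        u = ((m[0][0] * v[0] + m[0][1] * v[1]) / mass,
             (m[1][0] * v[0] + m[1][1] * v[1]) / mass)   # column of CR
        for a in range(2):
            cl[a][j] = v[a]
            cr[a][j] = u[a]
        masses.append(mass)
    return cl, cr, masses


def a_goldstone(cl, cr, beta, j, k):
    """Scalar G0 coupling a_{G0 chi_j chi_k}; indices are zero-based here."""
    cos_part = cr[1][j].conjugate() * cl[0][k] - cr[1][k] * cl[0][j].conjugate()
    sin_part = cr[0][j].conjugate() * cl[1][k] - cr[0][k] * cl[1][j].conjugate()
    return 1j * math.cos(beta) * cos_part - 1j * math.sin(beta) * sin_part


if __name__ == "__main__":
    tan_beta = 10.0
    cl, cr, masses = chargino_mixing(200.0, cmath.rect(300.0, math.pi / 3), tan_beta)
    beta = math.atan(tan_beta)
    print(masses, [abs(a_goldstone(cl, cr, beta, j, j)) for j in range(2)])
```

The diagonal couplings vanish up to rounding errors for any phase of μ, whereas the off-diagonal ones are in general non-zero.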
References
Abstract
A heavy triplet of leptons (Σ⁺, Σ⁰, Σ⁻)_R per family is proposed as the possible anchor of a
small seesaw neutrino mass. A new U (1) gauge symmetry is then also possible, and the associated
gauge boson X may be discovered at or below the TeV scale. We discuss the phenomenology of
this proposal, with and without possible constraints from the NuTeV and atomic parity violation
experiments, which appear to show small discrepancies from the predictions of the standard model.
© 2002 Elsevier Science B.V. All rights reserved.
1. Introduction
To obtain nonzero neutrino masses so as to explain the observed atmospheric [1] and
solar [2] neutrino oscillations, the minimal standard model of particle interactions is often
extended to include three neutral fermion singlets, usually referred to as right-handed
singlet neutrinos. If they have large Majorana masses, then the famous seesaw mechanism
[3] allows the observed neutrinos to acquire naturally small Majorana masses. On the other
hand, there are other equivalent ways [4,5] to realize this effective dimension-five operator
[6] for neutrino mass. For example, if we replace each neutral fermion singlet by a triplet
[5,7]:
$$ \Sigma = (\Sigma^+, \Sigma^0, \Sigma^-) \sim (1, 3, 0) \tag{1} $$
under SU(3)C × SU(2)L × U (1)Y , the seesaw mechanism works just as well.
If the Majorana mass of Σ is very large, then its effect at low energies is indistinguishable
from that of the canonical seesaw. On the other hand, if it is at or below the
TeV energy scale, which is a natural possibility if there exists a second Higgs doublet, as
shown recently [8], then there are interesting new experimental signatures for the origin of
neutrino mass. In Section 2, the phenomenology of this scenario is discussed.
0550-3213/02/$ – see front matter © 2002 Elsevier Science B.V. All rights reserved.
PII: S0550-3213(02)00815-5
E. Ma, D.P. Roy / Nuclear Physics B 644 (2002) 290–302 291
It is well known [9] that in the case of one additional right-handed singlet neutrino per
family of quarks and leptons, it is possible to promote B − L (baryon number – lepton
number) from being a global U(1) symmetry to a U(1) gauge symmetry. Similarly a new
U (1)X gauge symmetry [10] is also possible here. The model is described in Section 3.
Since the X gauge boson may be at or below the TeV scale, it may be responsible
for some of the possible discrepancies observed in recent experiments. In Section 4, we
use it to explain the NuTeV result [11] and explore its phenomenological implications. In
Section 5, we do the same but using atomic parity nonconservation [12] as a constraint. In
Section 6, the Higgs sector is discussed and its difference from other proposals is noted.
We then conclude in Section 7.
a residual symmetry is still conserved, i.e., the conventional multiplicative lepton number,
where ν, e, and Σ are odd, but Φ and η are even. In other words, there are no unwanted
ΔL = 1 interactions even though ⟨η⁰⟩ ≠ 0.
Whereas Σ 0 has no coupling to either the photon or the Z boson (as is the case with
NR ), Σ ± interacts with both. Hence our proposal is more easily tested experimentally than
the canonical seesaw. If the mass of Σ ± is below that of η, the former decays only via its
mixing with e± . Thus we expect the decay modes
will also determine fij . The subsequent decays of η+ and η0 occur through their small
mixings with φ + and φ 0 , so they are dominated by t b̄ and t t¯ final states and should be
easily identifiable.
Since Σ and η have distinctive signatures once they are produced, their discoveries are
primarily controlled by the size of the signal. We have estimated their pair production cross
sections at the LHC and at the Tevatron via the standard Drell–Yan mechanism. The spin
and color averaged matrix element squares are given by
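$$ M^2_{q\bar q\to\eta^+\eta^-} = \frac13 e^4 \left( ut - m_\eta^4 \right) \left[ \left( \frac{Q_q}{s} + \frac{L_q L_\eta}{s - M_Z^2} \right)^2 + \left( \frac{Q_q}{s} + \frac{R_q L_\eta}{s - M_Z^2} \right)^2 \right], \tag{8} $$
where
$$ L_\eta = \frac{\frac12 - \sin^2\theta_W}{\sin\theta_W\cos\theta_W}, \qquad L_q = \frac{I_q^3 - Q_q\sin^2\theta_W}{\sin\theta_W\cos\theta_W}, \qquad R_q = \frac{-Q_q\sin^2\theta_W}{\sin\theta_W\cos\theta_W}. \tag{9} $$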
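To make Eqs. (8) and (9) concrete, the following minimal sketch evaluates the parton-level squared matrix element for a given partonic energy and scattering angle. It is an illustration only: the function names, the values of α, sin²θ_W and M_Z, and the centre-of-mass parametrization of t and u (for massless incoming quarks) are assumptions introduced here, not inputs quoted in the text.

```python
import math

ALPHA_EM = 1.0 / 128.0    # assumed electromagnetic coupling at the weak scale
SIN2_THETA_W = 0.2227     # assumed weak mixing angle
M_Z = 91.19               # assumed Z mass in GeV


def z_couplings(i3_q, q_q, sin2=SIN2_THETA_W):
    """Return (L_eta, L_q, R_q) of Eq. (9)."""
    norm = math.sqrt(sin2 * (1.0 - sin2))   # sin(theta_W) * cos(theta_W)
    l_eta = (0.5 - sin2) / norm
    l_q = (i3_q - q_q * sin2) / norm
    r_q = -q_q * sin2 / norm
    return l_eta, l_q, r_q


def msq_eta_pair(s, cos_theta, m_eta, i3_q, q_q):
    """Squared matrix element of Eq. (8) for q qbar -> eta+ eta-."""
    l_eta, l_q, r_q = z_couplings(i3_q, q_q)
    beta = math.sqrt(1.0 - 4.0 * m_eta**2 / s)   # velocity of the produced scalars
    t = m_eta**2 - 0.5 * s * (1.0 - beta * cos_theta)
    u = m_eta**2 - 0.5 * s * (1.0 + beta * cos_theta)
    e4 = (4.0 * math.pi * ALPHA_EM)**2
    left = q_q / s + l_q * l_eta / (s - M_Z**2)
    right = q_q / s + r_q * l_eta / (s - M_Z**2)
    return e4 / 3.0 * (u * t - m_eta**4) * (left**2 + right**2)


if __name__ == "__main__":
    # Up-quark annihilation at sqrt(s) = 2 TeV into a 500 GeV scalar pair (illustrative).
    print(msq_eta_pair(2000.0**2, 0.0, 500.0, 0.5, 2.0 / 3.0))
```

With this parametrization ut − m_η⁴ = (s²/4) β² sin²θ, so the matrix element vanishes in the forward and backward directions, as expected for scalar pair production through a vector current.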
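The analogous matrix element squares for Σ± pair production are
$$ M^2_{q\bar q\to\Sigma^+\Sigma^-} = \frac13 e^4 \left[ \left( t - M_\Sigma^2 \right)^2 \left( \frac{Q_q}{s} + \frac{L_q R_\Sigma}{s - M_Z^2} \right)^2 + \left( u - M_\Sigma^2 \right)^2 \left( \frac{Q_q}{s} + \frac{R_q R_\Sigma}{s - M_Z^2} \right)^2 \right], \tag{10} $$
where
$$ R_\Sigma = \frac{1 - \sin^2\theta_W}{\sin\theta_W\cos\theta_W}. \tag{11} $$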
Fig. 1 shows the LHC and Tevatron production cross sections of the heavy scalar pair η±
and Fig. 2 shows those of the heavy lepton pair Σ ± as functions of their mass. We see
from these figures that the final luminosity of about 300 fb−1 at the LHC will correspond
to a modest discovery limit of both η± and Σ ± up to a mass of about 1 TeV.
(u, d)L ∼ (3, 2, 1/6; n1), uR ∼ (3, 1, 2/3; n2), dR ∼ (3, 1, −1/3; n3),
(ν, e)L ∼ (1, 2, −1/2; n4), eR ∼ (1, 1, −1; n5), ΣR ∼ (1, 3, 0; n6). (12)
It has been shown recently [10] that U (1)X is free of all anomalies [13–15] for the
following assignments:
$$ n_2 = \frac14 (7n_1 - 3n_4), \qquad n_3 = \frac14 (n_1 + 3n_4), \qquad n_5 = \frac14 (-9n_1 + 5n_4), \qquad n_6 = \frac14 (3n_1 + n_4). \tag{13} $$
This is a remarkable and highly nontrivial result.
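The anomaly cancellation claimed for the assignments (13) can be checked with exact rational arithmetic. The sketch below is an illustration under assumptions: the six sums are written here in the standard form for one family with the multiplets of Eq. (12) (left-handed fields counted with a plus sign, right-handed fields with a minus sign, and the SU(2) triplet entering with Dynkin index 2 against 1/2 for a doublet); the explicit conditions of Ref. [10] are not reproduced in the text, so this bookkeeping and all function names are hypothetical.

```python
from fractions import Fraction as F


def charges(n1, n4):
    """U(1)_X charges of Eq. (13) in terms of the two free parameters n1 and n4."""
    n1, n4 = F(n1), F(n4)
    return {"n1": n1, "n2": (7 * n1 - 3 * n4) / 4, "n3": (n1 + 3 * n4) / 4,
            "n4": n4, "n5": (-9 * n1 + 5 * n4) / 4, "n6": (3 * n1 + n4) / 4}


def anomalies(n):
    """Six anomaly coefficients for one family of the multiplets in Eq. (12)."""
    n1, n2, n3, n4, n5, n6 = (n[k] for k in ("n1", "n2", "n3", "n4", "n5", "n6"))
    return {
        "SU(3)^2 U(1)_X": 2 * n1 - n2 - n3,
        "SU(2)^2 U(1)_X": F(1, 2) * (3 * n1 + n4) - 2 * n6,
        "U(1)_Y^2 U(1)_X": (6 * F(1, 6)**2 * n1 + 2 * F(1, 2)**2 * n4
                            - 3 * F(2, 3)**2 * n2 - 3 * F(1, 3)**2 * n3 - n5),
        "U(1)_Y U(1)_X^2": (6 * F(1, 6) * n1**2 + 2 * F(-1, 2) * n4**2
                            - 3 * F(2, 3) * n2**2 - 3 * F(-1, 3) * n3**2
                            - F(-1) * n5**2),
        "U(1)_X^3": (6 * n1**3 + 2 * n4**3 - 3 * n2**3 - 3 * n3**3
                     - n5**3 - 3 * n6**3),
        "gravity^2 U(1)_X": 6 * n1 + 2 * n4 - 3 * n2 - 3 * n3 - n5 - 3 * n6,
    }


def is_anomaly_free(n1, n4):
    return all(value == 0 for value in anomalies(charges(n1, n4)).values())


if __name__ == "__main__":
    print(is_anomaly_free(1, F(4, 3)), is_anomaly_free(2, -5))
```

Only three of the six sums (those without SU(2) or gravity, and without the cubic term) are independent of n6, in line with the counting quoted in the text.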
As shown in Ref. [10], there are 6 conditions to be satisfied for the gauging of U (1)X .
Three of them do not involve n6 and have 2 solutions:
which means that two distinct Higgs doublets are sufficient for all possible Dirac fermion
masses in this model. If n4 = −3n1 is chosen, then again U(1)X will be proportional to
U(1)Y. However, for n4 ≠ −3n1, a new class of models is now possible with U(1)X as a
genuinely new gauge symmetry.
Consider νq and ν̄q deep inelastic scattering. It has recently been reported [11] by
the NuTeV Collaboration that their measurement of the effective sin2 θW , i.e., 0.2277 ±
0.0013 ± 0.0009, is about 3σ away from the standard-model prediction of 0.2227 ±
0.00037. In this model, the X gauge boson also contributes with
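$$ J_X^\mu = n_1\, \bar u \gamma^\mu \frac{1-\gamma_5}{2} u + n_1\, \bar d \gamma^\mu \frac{1-\gamma_5}{2} d + n_2\, \bar u \gamma^\mu \frac{1+\gamma_5}{2} u + n_3\, \bar d \gamma^\mu \frac{1+\gamma_5}{2} d + n_4\, \bar\nu \gamma^\mu \frac{1-\gamma_5}{2} \nu. \tag{21} $$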
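Assuming very small X–Z mixing (|sin θ| ≪ 1), the effective neutrino–quark interactions
are then given by
$$ H_{\rm int} = \frac{G_F}{\sqrt2}\, \bar\nu \gamma^\mu (1-\gamma_5) \nu \left[ \epsilon_L^q\, \bar q \gamma_\mu (1-\gamma_5) q + \epsilon_R^q\, \bar q \gamma_\mu (1+\gamma_5) q \right], \tag{22} $$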
where
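$$ \epsilon_L^u = (1-\xi) \left( \frac12 - \frac23 \sin^2\theta_W \right) + n_1\zeta, \tag{23} $$
$$ \epsilon_L^d = (1-\xi) \left( -\frac12 + \frac13 \sin^2\theta_W \right) + n_1\zeta, \tag{24} $$
$$ \epsilon_R^u = (1-\xi) \left( -\frac23 \sin^2\theta_W \right) + n_2\zeta, \tag{25} $$
$$ \epsilon_R^d = (1-\xi) \left( \frac13 \sin^2\theta_W \right) + n_3\zeta, \tag{26} $$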
with
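$$ \xi = 2n_4 \sin\theta \left( 1 - \frac{M_Z^2}{M_X^2} \right) \frac{g_X}{g_Z}, \tag{27} $$
$$ \zeta = -\sin\theta \left( 1 - \frac{M_Z^2}{M_X^2} \right) \frac{g_X}{g_Z} + 2n_4\, \frac{M_Z^2}{M_X^2}\, \frac{g_X^2}{g_Z^2}. \tag{28} $$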
The parameter ξ is constrained by data at the Z resonance to be very small. Using the
general analysis of Z–X mixing [16], we find
$$ \xi = \frac{2s^2c^2}{c^2-s^2}\,\epsilon_1 + \frac{(c^2-s^2)^2}{2c^2}\,\epsilon_2 + \frac{s^2(-1-2s^2+4s^4)}{c^2(c^2-s^2)}\,\epsilon_3 = 0.624\,\epsilon_1 + 0.198\,\epsilon_2 - 0.644\,\epsilon_3, \tag{29} $$
where s ≡ sin θ_W and c ≡ cos θ_W. Given that |ε_i| is of order 0.001, ξ is too small to make
much difference in the above [17]. We thus assume ξ = 0 (sin θ = 0) for our subsequent
discussion.
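The numerical coefficients in Eq. (29) can be reproduced with a few lines of arithmetic. The sketch below assumes sin²θ_W = 0.2227 (the standard-model value quoted in connection with the NuTeV measurement); the input actually used for Eq. (29) is not stated in the text, and the function name is hypothetical.

```python
def xi_coefficients(sin2_theta_w):
    """Coefficients of epsilon_1, epsilon_2, epsilon_3 in Eq. (29)."""
    s2 = sin2_theta_w
    c2 = 1.0 - s2
    k1 = 2.0 * s2 * c2 / (c2 - s2)
    k2 = (c2 - s2)**2 / (2.0 * c2)
    k3 = s2 * (-1.0 - 2.0 * s2 + 4.0 * s2**2) / (c2 * (c2 - s2))
    return k1, k2, k3


if __name__ == "__main__":
    print(xi_coefficients(0.2227))
```

With |ε_i| of order 0.001 and coefficients of order one or smaller, ξ is itself of order 0.001, which is the basis for setting ξ = 0 in what follows.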
To account for the NuTeV result, i.e.,
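$$ (g_L^{\rm eff})^2 = (\epsilon_L^u)^2 + (\epsilon_L^d)^2 = 0.3005 \pm 0.0014, \tag{30} $$
$$ (g_R^{\rm eff})^2 = (\epsilon_R^u)^2 + (\epsilon_R^d)^2 = 0.0310 \pm 0.0011, \tag{31} $$
against the standard-model prediction, i.e.,
$$ (g_L^{\rm eff})^2_{\rm SM} = 0.3042, \qquad (g_R^{\rm eff})^2_{\rm SM} = 0.0301, \tag{32} $$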
consider the following specific model as an illustration:
$$ n_1 = 1, \quad n_2 = \frac34, \quad n_3 = \frac54, \quad n_4 = \frac43, \quad n_5 = -\frac{7}{12}, \quad n_6 = \frac{13}{12}. \tag{33} $$
Then
$$ \Delta (g_L^{\rm eff})^2 = -\frac23 \sin^2\theta_W\, \zeta + 2\zeta^2, \tag{34} $$
$$ \Delta (g_R^{\rm eff})^2 = -\frac16 \sin^2\theta_W\, \zeta + \frac{17}{8} \zeta^2. \tag{35} $$
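To fit the experimental values, we need a negative Δ(g_L^eff)². From Eq. (34) we see that it
reaches its most negative value (maximum magnitude) at
$$ \zeta = \frac16 \sin^2\theta_W, \tag{36} $$
for which
$$ \Delta (g_L^{\rm eff})^2 = -\frac{1}{18} \sin^4\theta_W = -0.0028, \tag{37} $$
$$ \Delta (g_R^{\rm eff})^2 = \frac{1}{32} \sin^4\theta_W = +0.0016, \tag{38} $$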
in very good agreement with the experimental values of −0.0037 ± 0.0014 and +0.0009 ±
0.0011, respectively.
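The algebra leading from Eqs. (23)–(26) and the charges (33) to Eqs. (34)–(38) can be verified with exact fractions. In this minimal sketch ξ = 0 is assumed throughout, as in the text, and sin²θ_W = 0.2227 is an assumed input (the value used for the quoted numbers is not stated, so the last digit of the numerical results may differ); all names are hypothetical.

```python
from fractions import Fraction as F

CHARGES = {"n1": F(1), "n2": F(3, 4), "n3": F(5, 4)}   # quark charges of Eq. (33)


def epsilons(s2, zeta, n=CHARGES):
    """Effective couplings of Eqs. (23)-(26) with xi = 0; s2 is sin^2(theta_W)."""
    return {
        "uL": F(1, 2) - F(2, 3) * s2 + n["n1"] * zeta,
        "dL": -F(1, 2) + F(1, 3) * s2 + n["n1"] * zeta,
        "uR": -F(2, 3) * s2 + n["n2"] * zeta,
        "dR": F(1, 3) * s2 + n["n3"] * zeta,
    }


def shifts(s2, zeta):
    """Return (Delta gL^2, Delta gR^2) relative to the zeta = 0 values."""
    new, old = epsilons(s2, zeta), epsilons(s2, F(0))
    d_left = new["uL"]**2 + new["dL"]**2 - old["uL"]**2 - old["dL"]**2
    d_right = new["uR"]**2 + new["dR"]**2 - old["uR"]**2 - old["dR"]**2
    return d_left, d_right


if __name__ == "__main__":
    s2 = F(2227, 10000)
    d_left, d_right = shifts(s2, s2 / 6)      # zeta of Eq. (36)
    print(float(d_left), float(d_right))
```

Because Δ(g_L^eff)² is a quadratic in ζ with positive curvature, the point ζ = sin²θ_W/6 is its minimum, so no choice of ζ can produce a more negative shift for these charges.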
Using Eqs. (28), (33), and (36), we find that
$$ \frac{g_X^2}{M_X^2} = \frac{\sin^2\theta_W}{16}\, \frac{g_Z^2}{M_Z^2}. \tag{39} $$
Thus the production of the new gauge boson X may be studied as a function of the single
parameter MX in this scenario. We note first that if the U (1)X assignments of Eq. (33)
apply to electrons as well, then Eq. (39) is in serious conflict with atomic parity violation
and e+ e− cross sections. We must therefore attribute the NuTeV anomaly as being due to
the muon (and perhaps also the tau) sector [17,18] only. In the context of U (1)X , this may
be accomplished as follows. We change the electron’s assignments under U (1)X to zero so
that it does not couple to X at all. To preserve the cancellation of anomalies, we add heavy
fermions at the TeV energy scale, i.e.,
(N, F )L ∼ (1, 2, −1/2; n4), (N, F )R ∼ (1, 2, −1/2; 0), (40)
EL ∼ (1, 1, −1; 0), ER ∼ (1, 1, −1; n5). (41)
These are prevented from coupling to the known leptons by a discrete symmetry to forbid
terms such as EL eR , etc. As a result, the lightest among them is stable, in analogy to the
lightest supersymmetric particle of R-parity conserving supersymmetry.
The spin and color averaged matrix element square for the X boson signal at the
Tevatron and at the LHC is given by
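$$ M^2_{q\bar q\to X\to f\bar f} = \frac13\, \frac{g_X^4}{(s - M_X^2)^2 + M_X^2\Gamma_X^2} \left[ n_{q_L}^2 \left( u^2 n_{f_L}^2 + t^2 n_{f_R}^2 \right) + n_{q_R}^2 \left( u^2 n_{f_R}^2 + t^2 n_{f_L}^2 \right) \right], \tag{42} $$
where
$$ \Gamma_X = \frac{g_X^2}{24\pi}\, M_X \left( 18n_1^2 + 9n_2^2 + 9n_3^2 + 4n_4^2 + 2n_5^2 \right). \tag{43} $$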
Substituting the required value of gX from Eq. (39) and the X charges (ni ) from Eq. (33)
we see that for MX > 1 TeV, gX > 1 and its width becomes comparable to its mass. Fig. 3
shows the total X boson production cross sections at the LHC and at the Tevatron as
functions of its mass. We see that the predicted signal cross sections are really large if
the X boson is to account for the NuTeV anomaly. It may be noted here that there is a 95%
confidence-level upper limit of
$$ \sigma(X)\, B(X \to e^+e^-, \mu^+\mu^-) = 40~{\rm fb} \tag{44} $$
from the CDF experiment [19] at the Tevatron. The X charges of Eq. (33) correspond to a
branching fraction B(X → µ+ µ− ) = 4–5%. Thus assuming the CDF detection efficiency
to be roughly similar for the e+ e− and µ+ µ− channels, the above constraint would imply
σ (X) < 1000 fb at the Tevatron. On the other hand we see from Fig. 3 that σ (X) > 1000 fb
at the Tevatron right up to MX = 2 TeV (the finite value at the kinematic boundary is due
to the large width). Thus consistency with this limit will require MX > 2 TeV. Fig. 3 also
shows a very large signal cross section at the LHC up to M_X = 3 TeV, corresponding to
g_X ≃ 3. It remains large at larger values of M_X as well, the cutoff being provided by the
perturbation theory limit on g_X.
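The width formula (43) and the quoted dimuon branching fraction can be cross-checked with the charges of Eq. (33). The sketch below is an illustration under assumptions: the partial width into μ⁺μ⁻ is taken to be proportional to n4² + n5², which is an inference from the lepton terms 4n4² + 2n5² in Eq. (43) (two lepton families coupling to X) rather than a formula given in the text, and all names are hypothetical.

```python
import math
from fractions import Fraction as F

CHARGES = {"n1": F(1), "n2": F(3, 4), "n3": F(5, 4),
           "n4": F(4, 3), "n5": F(-7, 12)}           # Eq. (33)


def width_charge_sum(n=CHARGES):
    """Charge factor multiplying g_X^2 M_X / (24 pi) in Eq. (43)."""
    return (18 * n["n1"]**2 + 9 * n["n2"]**2 + 9 * n["n3"]**2
            + 4 * n["n4"]**2 + 2 * n["n5"]**2)


def width_over_mass(g_x, n=CHARGES):
    """Gamma_X / M_X from Eq. (43)."""
    return g_x**2 / (24.0 * math.pi) * float(width_charge_sum(n))


def branching_mu_mu(n=CHARGES):
    """B(X -> mu+ mu-), assuming a partial width proportional to n4^2 + n5^2."""
    return (n["n4"]**2 + n["n5"]**2) / width_charge_sum(n)


if __name__ == "__main__":
    print(float(branching_mu_mu()), width_over_mass(1.0))
```

For g_X of order one the ratio Γ_X/M_X is itself of order one, which illustrates why the width becomes comparable to the mass in this regime.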
those of Fig. 3 due to the different X charges. On the other hand, we expect a high
B(X → e⁺e⁻) ≃ 18% in this case. Thus the CDF limit of Eq. (44) would again imply
MX > 2 TeV. Nonetheless we expect a very large signal cross section at the LHC up to a
mass range of several TeV, till the value of gX is again cut off by the perturbation theory
limit.
Note added: After we had completed our calculations, a new report [24] appeared which
shifted the central value of QW to 0.38, but with the same uncertainties. Since our choice
of the lower limit of Eq. (48) corresponds to it being 0.33, it is also consistent with this
new result.
6. Higgs sector
As shown in Section 3, U (1)X requires two distinct Higgs doublets for fermion masses,
i.e., Φ1 = (φ1+ , φ10 ) with U (1)X charge (9n1 − n4 )/4 which couples to charged leptons, and
Φ2 = (φ2+ , φ20 ) with U (1)X charge (3n1 − 3n4 )/4 which couples to up and down quarks
as well as to Σ. [The leptonic Higgs doublet η of Section 2 may also be introduced so
that only it couples to Σ, while Φ2 couples only to quarks.] To break the U (1)X gauge
symmetry spontaneously, we add a singlet χ with U (1)X charge −2n6 , so that the Yukawa
term χΣΣ would allow Σ to acquire a large Majorana mass at the U (1)X breaking scale.
Note that the U (1)X charges allow the trilinear term χ † Φ1† Φ2 , without which V would
have 3 global U (1) symmetries, but only 2 U (1) gauge symmetries, resulting in an
unwanted Goldstone boson. We have not included η because m2η is large and positive as
discussed in Section 2. After the heavy χ [with ⟨χ⟩ ∼ 1 TeV] has been integrated out,
the reduced two-doublet Higgs potential is of the usual form. The difference from other
proposals is in their Yukawa couplings, i.e., Φ1 couples to charged leptons whereas Φ2
couples to both up and down quarks.
Let us briefly discuss the distinctive phenomenological features of this two-Higgs-
doublet model. While the vacuum expectation value ⟨φ₂⁰⟩ is required to be ∼ 100 GeV
because of m_t, ⟨φ₁⁰⟩ can be anywhere between ∼ 100 GeV and 1–2 GeV. In terms of
the ratio tan β = ⟨φ₂⁰⟩/⟨φ₁⁰⟩, they correspond to the limits tan β ≃ 1 and tan β ≫ 1. In
the former case, the phenomenological implications are similar to those of the standard
two-Higgs-doublet scenario where Φ2 couples to the up quarks and Φ1 couples to the
charged leptons as well as to the down quarks. In the latter case however, there are
distinctive differences between our proposal and the standard scenario, because Higgs
couplings proportional to mb are now multiplied by cot β instead of tan β. Let us
consider in particular the charged and the pseudoscalar neutral Higgs bosons, H ± and
A0 , which correspond to the linear combination Φ1 sin β − Φ2 cos β. Their distinctive
phenomenological features are summarized below.
(i) The H − t loop contribution to the b → sγ decay amplitude is suppressed. It has the
factor cot2 β instead of tan β cot β = 1 in the standard scenario. This means that the
charged Higgs boson in this model can be relatively light.
(ii) The H − → τ − ν̄ decay dominates over H − → t¯b for any charged Higgs mass.
(iii) The H ± production via t b̄ fusion is no longer the main production mechanism at
hadron colliders. It will instead be pair production via the Drell–Yan mechanism
discussed in Section 2. The plots of Fig. 1 apply equally to H ± pair production in
this case. Thus we expect a visible signal at the LHC up to a Higgs mass of about
1 TeV.
(iv) The A0 → τ + τ − decay dominates over A0 → b b̄ and A0 → t t¯.
(v) The A0 production is again no longer dominated by bb̄ or t t¯ fusion at hadron colliders,
but rather by associated production of A0 H 0 (H ± ) through Z(W ± ) exchange.
There are analogous distinctions for the two physical neutral scalars h0 and H 0 .
However, they depend on the additional mixing angle α which is not necessarily close
to β.
7. Conclusion
Acknowledgements
This work was supported in part by the US Department of Energy under Grant No. DE-
FG03-94ER40837.
References
[1] Super-Kamiokande Collaboration, S. Fukuda, et al., Phys. Rev. Lett. 85 (2000) 3999, and references therein.
[2] Super-Kamiokande Collaboration, S. Fukuda, et al., Phys. Rev. Lett. 86 (2001) 5656, and references therein;
See also: SNO Collaboration, Q.R. Ahmad, et al., Phys. Rev. Lett. 87 (2001) 071301;
SNO Collaboration, Q.R. Ahmad, et al., Phys. Rev. Lett. 89 (2002) 011301;
SNO Collaboration, Q.R. Ahmad, et al., Phys. Rev. Lett. 89 (2002) 011302.
[3] M. Gell-Mann, P. Ramond, R. Slansky, in: P. van Nieuwenhuizen, D.Z. Freedman (Eds.), Supergravity,
North-Holland, Amsterdam, 1979, p. 315;
T. Yanagida, in: O. Sawada, A. Sugamoto (Eds.), Proceedings of the Workshop on the Unified Theory and
the Baryon Number in the Universe, KEK, Tsukuba, Japan, 1979, p. 95;
R.N. Mohapatra, G. Senjanovic, Phys. Rev. Lett. 44 (1980) 912.
[4] E. Ma, U. Sarkar, Phys. Rev. Lett. 80 (1998) 5716.
Abstract
We treat free large N superconformal field theories as holographic duals of higher spin (HS)
gauge theories expanded around AdS spacetime with radius R. The HS gauge theories contain
massless and light massive AdS fields. The HS current correlators are written in a crossing
symmetric form including only exchange of other HS currents. This and other arguments point to
the existence of a consistent truncation to massless HS fields. A survey of massless HS theories with
32 supersymmetries in D = 4, 5, 7 (where the 7D results are new) is given and the corresponding
composite operators are discussed. In the case of AdS4 , the cubic couplings of a minimal bosonic
massless HS gauge theory are described. We examine high energy/small tension limits giving rise
to massless HS fields in the type IIB string on AdS5 × S 5 and M-theory on AdS4/7 × S 7/4 . We
discuss breaking of HS symmetries to the symmetries of ordinary supergravity, and a particularly
natural Higgs mechanism in AdS5 × S 5 and AdS4 × S 7 where the HS symmetry is broken by
finite gYM . In AdS5 × S 5 it is shown that the supermultiplets of the leading Regge trajectory cross
over into the massless HS spectrum. We propose that g²_YM = 0 corresponds to a critical string
tension of order 1/R 2 and a finite string coupling of order 1/N. In AdS7 × S 4 we give a rotating
membrane solution coupling to the massless HS currents, and describe these as limits of Wilson
surfaces in the AN−1 (2, 0) SCFT, expandable in terms of operators with anomalous dimensions
that are asymptotically small for large spin. The minimal energy configurations have semi-classical
energy E = s for all s and the geometry of infinitely stretched strings with energy and spin density
concentrated at the endpoints.
© 2002 Elsevier Science B.V. All rights reserved.
0550-3213/02/$ – see front matter © 2002 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 ( 0 2 ) 0 0 7 3 9 - 3
304 E. Sezgin, P. Sundell / Nuclear Physics B 644 (2002) 303–370
1. Introduction
The strong form of the Maldacena conjecture states that type IIB closed string theory
on AdS5 × S 5 with N units of five-form flux and string coupling gs corresponds to d = 4,
N = 4 SYM theory with SU(N) gauge group and Yang–Mills coupling g²_YM = g_s [1–3].
This conjecture has been primarily tested for N ≫ g²_YM N ≫ 1, where supergravity is a
and possibly g²_YM = 0, where the SYM theory becomes a theory of a free SU(N) valued
gauge invariant operators are composite single-trace operators which can be arranged into
‘trajectories’ according to the value of the twist E − s, where E is the conformal dimension
and s is the spin. The twist is the anomalous contribution to E, which becomes small at
weak ’t Hooft coupling and large N .
A basic observation [6] is the non-intersection principle in a CFT which states that as the
coupling varies there cannot be any mixing between operators that are not mixing already
at the free level. This applies to both the spectrum of composite operators of 4D SYM
in the limit N ≫ 1 ≫ g²_YM N and the spectrum of vertex operators of the sigma model
for N ≫ g²_YM N ≫ 1. Thus an important test of the Maldacena conjecture is to verify that the
trajectories of SYM operators with constant twist cross over into the closed string Regge
trajectories.
In this paper we shall show that this is indeed the case for the leading trajectories,
which consist of the states with minimal E for fixed s. In fact, on the SYM side the
leading trajectory, i.e., the operators with minimal twist, consists of bilinear higher spin
(HS) tensors. In the free limit, these have twist 2 and the s ≥ 1 sector coincides with the
space of conserved HS currents. General aspects of these currents have been discussed
in [7,8]. The precise spectrum of twist 2 operators and the corresponding HS symmetry
algebra extension of the conformal/AdS group was constructed in [9,10] using group
theoretic methods which shows that the twist 2 operators in fact form an irreducible ‘gauge’
multiplet of the HS algebra.
In [9,10] it was also shown how to describe the HS gauge multiplet on the bulk side
at the level of a linearized AdS field theory containing HS gauge fields as well as other
interesting HS fields generalizing the self-dual two-form of the supergravity multiplet
contained in the spectrum. This immediately raises the following questions: is it possible
to extend this picture to an interacting theory of massless HS fields in AdS5, and if so, is
this theory the result of a consistent truncation of the full closed string theory in the limit
N ≫ 1 ≫ g²_YM N?
quite some time [14] (see, [15] for a review). In testing the free CFT/HS gauge theory
correspondence ideas, it is important to exhibit the couplings of the HS gauge theory.
The D = 4, N = 8 theory has been examined in great detail in [16,17]. In D = 4 the
basic interactions are contained in a minimal bosonic model which can be embedded as
a consistent truncation into HS gauge theories with N ≥ 0. The explicit couplings of the
minimal bosonic model in D = 4 are given in a generally covariant curvature expansion
scheme in [18,19]. Here we shall summarize the results of [19] at the level of cubic
couplings. The analogous bosonic truncation in D = 5 was given in [9] and in D = 7
in [20], though the full interactions still remain to be found. In this paper we also give the
symmetry algebra and massless spectrum of the D = 7, N = 2 HS gauge theory.
The issue of consistent truncation is crucial since the subleading trajectories in the
gauge theory correspond to massive AdS fields which are light, meaning that their AdS
energies are not separated from the massless ones by a mass-gap. Here it is important to
note that regardless of the detailed structure of the bulk interactions, it is still possible
[21] to arrange the effective bulk action into a 1/N 2 expansion such that its extremum
reproduces the 1/N 2 expansion of the correlators of the composite operators of the SU(N)
invariant singleton theory. In fact, this expansion remains highly non-trivial even in the
limit g²_YM = 0 [22,23]. In particular, if one sets to zero all the massive fields on the
boundary, then the extremum of the full effective action should reproduce the correlators
of the bilinear twist 2 operators. The massive fields may still become excited in the bulk,
if massless fields act as sources for massive fields. If this is the case, then the massless HS
gauge theory cannot serve as a good approximation for studying these processes, not even
as an effective theory since it is not possible to eliminate the light massive fields while
preserving locality (the non-localities which one encounters in massless HS theory are not
that bad). Thus, for the massless HS gauge theory to be relevant, it must be possible to
consistently set the massive fields to zero in the full theory, at least in the leading non-
trivial order in the 1/N 2 expansion.
There are several ways to test this consistent truncation. Firstly, it requires consistent
interactions among massless fields, for which there are many indications as already
mentioned. Given the consistent equations of motion or action for the massless fields, one
must then compute the bulk tree amplitudes, which by definition will only contain massless
excitations in the internal lines, and check that they correspond to the correlators of bilinear
composite operators computed in the singleton theory [19,24]. This direct method is
technically rather involved, however, and in this paper we instead provide indirect evidence
for consistent truncation by examining the nature of the correlators between bilinear
operators in singleton theories with large N . We also suggest that the arguments given
in [25–27] for the consistent truncation of type IIB and eleven-dimensional supergravities
on AdS4/7 × S 7/4 to gauged supergravity carry over to the HS context.
In this paper, we also emphasize the fact that the relations between the closed string
parameters g_s and α′ in AdS5 × S⁵ and the SYM parameters g²_YM and N have so far been
tested only in the limit N ≫ g_s N ≫ 1. In this regime the relations can be derived, e.g., by
first identifying the gauge theory parameters with the closed string parameters in flat 10D
spacetime, and then using the D3-brane soliton description to interpolate from flat spacetime
down to AdS5 × S 5 . Since only 16 supersymmetries are preserved globally by the D3
brane, there may be string corrections to this computation.
It is important to note that the strong coupling tests of the AdS/CFT duality which
are based on exact calculations on the SYM side (see, for example, [28] and references
therein) are still limited on the bulk side in that they do not go beyond the leading √λ
approximation to the closed string theory in AdS background. As g²_YM N becomes small
(keeping N large), we do not know the precise relations between closed string parameters
in AdS and the gauge theory parameters. It is clear that the string coupling g_s decreases and
the sigma model coupling α′/R² increases as g²_YM decreases. We shall speculate that the
bulk parameters approach critical values at g²_YM = 0, where the bulk theory is described
by closed string theory with coupling 1/N² and a singleton worldsheet CFT based on
critical level k affine PSU(2, 2|4) algebra, and that the left- and right-moving singleton
spin fields can be used in the construction of vertex operators describing massless HS fields
in the bulk. The level k is related to the worldsheet sigma model coupling constant, i.e.,
α′/R² = l_s²/R². The corrected relations between the closed string parameters in AdS5 × S⁵
and the gauge theory parameters we propose are given by
$$ g_s = f_1(\lambda)\, g^2_{\rm YM}, \qquad l_s = f_2(\lambda)\, R, \tag{1.1} $$
flat eleven-dimensional spacetime. Apparently, these SCFTs are isolated fixed points of the
renormalization groups (RG) that do not admit any marginal deformations, with or without
preservation of supersymmetry. Consequently they do not admit any coupling constants
and Lagrangian descriptions. The main window for viewing these strongly coupled theories
is therefore through the bulk supergravity, which is a valid approximation to M-theory at
fixed energies provided N ≫ 1. This corresponds to a subset of the SCFT operators with
fixed conformal dimensions as N ≫ 1 [1,4]. Recently other limits of the correspondence
based on considering large internal spin have been proposed [30].
In analogy with type IIB closed string theory on AdS5 × S 5 , it is natural to ask whether
M-theory on AdS4/7 × S 7/4 has an unbroken phase in which M-theory corrections become
relevant at fixed energy and the effective description of the bulk theory becomes a HS
gauge theory with holographic dual given by a free SCFT in d = 3 or d = 6. In other
words, we wish to examine whether it is possible to have a ‘phase diagram’ with two fixed
points, one corresponding to the free singleton SCFT describing the unbroken HS phase
and another one corresponding to the strongly coupled SCFT describing the broken phase.
From the bulk point of view the broken phase is described by membranes interacting in
the flat eleven-dimensional center of AdS4/7 × S 7/4 , while the unbroken phase, which is
specific to AdS, is described by membranes interacting close to the boundary of AdS.
By examining the RG flows on M2/5 branes and D2/4 branes we are led to propose
that the relevant free SCFTs in d = 3, 6 are described by free SU(N) valued OSp(8|4)
singletons and free SU(N) valued d = 6, N = (2, 0) tensor singletons. These theories
have of course figured in the literature before (see, for example, [27]), and have been
used in many circumstances in order to unravel information about the strongly coupled
SCFTs.1 Our point here is that due to the salient features of the large N limit the free
SCFTs make sense on their own as holographic images of the interesting unbroken phases
of M-theory. Technically speaking, large N implies factorization and 1/N expansion of
correlators which can be matched with the expansion of the bulk amplitudes in terms of
the fundamental Planck scale.
As in the case of the type IIB theory, an important issue is whether there is a consistent
truncation down to a massless sector. The ideas for examining this are similar to those
described above for the type IIB theory. The D = 4 case is particularly tractable as in this
case we already know the full form of the interactions among massless HS fields, which
makes it possible to test directly the consistent truncation without first having to construct
the interactions.
An intriguing feature of the proposed unbroken phases of M-theory on AdS4/7 × S 7/4
is that the spectrum is discrete and that there is a finite coupling, 1/N. Thus the unbroken
phases of M-theory appear to be on the same footing as the unbroken phase of the type IIB
theory on AdS5 × S 5 . This suggests that the unbroken phases in AdS4/7 × S 7/4 are theories
of M2-branes with fixed tension.
1 Free singletons, which form N − 1 plets of the Weyl group of SU(N ), appear in various ‘trivial’ IR limits
describing stacks of separated branes sitting at certain orbifold singularities [31]. These free singletons should not
be confused with the SU(N ) valued singletons, though they are curious from the HS perspective and they should
presumably be included into the phase diagram as separate HS phases.
308 E. Sezgin, P. Sundell / Nuclear Physics B 644 (2002) 303–370
To gather further evidence for this, we examine a family of rotating membrane solutions
in AdS7 × S 4 that are curved space generalizations of those given in flat spacetime in
[32,33] and membrane analogs of the string solutions found recently in [34] (which in
fact describe the leading Regge trajectory states). The minimal energy configurations have
semi-classical energy E = s for all s, and the geometry of infinitely stretched membranes
of zero width, whose energy and spin densities are concentrated in the asymptotic region.
By examining the supersymmetry enhancement in this region we can further show that the
rotating membranes indeed couple to the bilinear HS currents in the SCFT.
There is an important difference between the membrane solitons and the string solitons
given in [34]. The string solitons couple to operators whose anomalous dimensions become
asymptotically small only for large s, (E − s)/s → 0 as sα′/R^2 → ∞. The membrane
solitons, on the other hand, couple to anomaly free operators for any value of s. This
is because they arise by taking the limit of zero width which has the dual interpretation
of shrinking a Wilson surface which means that the holographic dual flows to the free
singleton SCFT in d = 6.
We find it rather compelling that relatively simple, free SCFTs contain information
about the unbroken, and perhaps more fundamental, phases of type IIB closed string and
M-theory. Moreover, this means that the results on free SCFT which are scattered over the
literature can now be given a more direct physical interpretation.
In the AdS/CFT correspondence, it is important that both the bulk and the boundary
theories admit 1/N expansions which define the physically relevant, i.e., asymptotically
convergent, expansions. In the unbroken HS phase, the bulk side may also admit a strongly
coupled closed string/membrane sigma model description, which we propose has large,
but fixed, coupling given by a critical tension, as mentioned above. In any event, consistent
truncation makes it possible to directly test the AdS/CFT correspondence using only the
action for the massless HS fields which does not require strongly coupled sigma model
computations.
The breaking of the HS symmetries requires the inclusion of Higgs fields whose
interactions require us to go beyond the consistent truncation to massless fields. Whether
this can be done at the level of some effective field theoretical construction in the bulk or
whether it requires extracting information from the strongly coupled sigma model is not
clear at present. Here we can only speculate that the large amount of symmetry present in
the unbroken phase should make the critical string and membrane sigma models amenable
to exact methods.
This paper is organized as follows. In Section 2, the properties of HS gauge theories in
D = 4, 5, 7 are reviewed, including their underlying symmetry algebras and field contents.
The results for the HS superalgebra and spectrum in 7D are new. In Section 3, the
composite singleton operators corresponding to the massless states of HS gauge theories,
their KK towers and Higgs multiplets are discussed. In Section 4, important aspects of
the CFT/HS gauge theory correspondence, and, in particular, the 1/N expansion in the
free CFT on the boundary are described. In Section 5, the 5D HS gauge theory as the
bulk theory arising in the critical limit of type IIB string theory and a Higgs mechanism
breaking the HS gauge symmetries down to those of ordinary supergravity are discussed.
In Section 6, first the CFT3/HS gauge theory correspondence for M-theory on AdS4 × S 7
is described. Then, the minimal bosonic truncation of the theory and its cubic interactions
are described. In Section 7, first the CFT6/HS gauge theory correspondence for M-theory
on AdS7 × S 4 is discussed. Then our rotating membrane solution in AdS7 × S 4 is given
and its properties and relevance to the 7D HS gauge theory are described. Section 8 is
devoted to a summary and discussion. In Appendix A, we present several tables which
show various sectors of the massless HS gauge theory spectra in D = 5, 7. In Appendix B,
we summarize the UIRs and BPS states of the maximal AdS superalgebras in D = 4, 5, 7.
In Appendices C and D, we collect further group theoretical information that is useful for
Sections 2 and 3.
HS gauge theories are generally covariant theories which admit AdS as a vacuum and
have an infinite number of local HS supersymmetries based on HS superalgebras which
are infinite-dimensional extensions of the finite-dimensional AdS superalgebras [35]. The
fundamental UIRs of the HS superalgebras in D = d + 1 = 4, 5, 7 dimensions are ultra-short
d-dimensional conformal supermultiplets, which we will refer to as singletons.2
Gauging of such a HS superalgebra yields a D-dimensional theory based on a massless
HS supermultiplet given by the symmetric product of two singletons. In this paper we shall
focus our attention on the HS extension of the AdS superalgebras in D = 4, 5, 7 with 32
real supersymmetries because these are the most natural ones to explore from the string/M-
theory point of view. In Section 8, we shall comment on possible extensions to higher D
and higher number of supersymmetries.
The massless HS multiplet is an infinite tower of massless AdS supermultiplets with
supergravity at the lowest level. One key property is the fact that a HS gauge theory in
D > 3 cannot be consistently truncated to an AdS supergravity. Basically, this is due to the
fact that derivatives of lower spin fields serve as sources for HS fields, and it can also be
seen from the structure of the OPE of free field theory stress-energy tensors in d > 2 [7].
However, in D = 4, 5, 7 there exist minimal bosonic truncations which have remarkably
simple physical field content, namely, massless fields of spin s = 0, 2, 4, 6, . . . described
by doubly traceless, symmetric tensors φ_{µ1···µs}. The embedding of these theories in their
supersymmetric extensions is explained in Tables 1, 2 and 3.
As for the full and covariant (i.e., background independent) interactions among the
massless fields, they are known in the 4D theory [14,18,19]. A condensed account of how
to extract cubic couplings in D = 4 will be given in Section 6.2. The fully interacting
theories in D = 5, 7 have not yet been constructed, though the results obtained so far are
promising [9,10,12,20].
We next list the HS superalgebras in D = 4, 5, 7, their singleton and massless
representations, and how the latter ones are assembled into master 1-form and master
0-form fields. The results in D = 4, 5 were obtained in [10,16]. The minimal bosonic HS
algebra in 7D was obtained in [20]. The results presented here for its supersymmetric
extension are new.
2 In d = 4, 6, these are usually referred to as doubletons, due to the fact that their oscillator construction is
based on two sets of oscillators as opposed to a single set of oscillators used in d = 3.
In particular, the zeroth level of hs(8|4) is the maximal finite subalgebra OSp(8|4) whose
generators schematically take the form
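Q_{\alpha i} = y_\alpha \theta_i , \quad \bar{Q}_{\dot\alpha i} = \bar{y}_{\dot\alpha} \theta_i , \quad U_{ij} = \theta_i \theta_j ,
M_{\alpha\dot\beta} = y_\alpha \bar{y}_{\dot\beta} , \quad M_{\alpha\beta} = y_\alpha y_\beta , \quad M_{\dot\alpha\dot\beta} = \bar{y}_{\dot\alpha} \bar{y}_{\dot\beta} .    (2.4)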
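A generator P^{(ℓ)} in the ℓth level of hs(8|4) can be expanded as
P^{(\ell)}(y, \bar{y}, \theta) = \sum_{m+n+p=4\ell+2} \frac{1}{m!\,n!\,p!}\, \bar{y}^{\dot\alpha_1} \cdots \bar{y}^{\dot\alpha_m} y^{\beta_1} \cdots y^{\beta_n} \theta^{i_1} \cdots \theta^{i_p} P_{\dot\alpha_1 \cdots \dot\alpha_m \beta_1 \cdots \beta_n i_1 \cdots i_p} .    (2.5)
The spins of the components are given by s = (1/2)(m + n). The components with integer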
spin are Grassmann even and those with half-integer spin are Grassmann odd. Bosons are
in the 1, 28 and 35± irreps of SO(8) and fermions in the 8 and 56 irreps. The reality
properties follow from P † = −P .
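As an illustrative cross-check of this component counting (it is not part of the original construction), the expansion (2.5) can be enumerated by brute force. The sketch below assumes that p runs over 0, ..., 8 and that a rank-p antisymmetric product of θ's carries a binomial(8, p)-dimensional SO(8) representation; the function names are hypothetical.

```python
from math import comb

def level_components(level):
    """Yield all (m, n, p) with m + n + p = 4*level + 2 and 0 <= p <= 8."""
    total = 4 * level + 2
    for p in range(0, min(8, total) + 1):
        for m in range(0, total - p + 1):
            yield m, total - p - m, p

def summarize(level):
    """Return (m, n, p, twice_spin, so8_dim, is_boson) for each component."""
    rows = []
    for m, n, p in level_components(level):
        twice_spin = m + n              # s = (m + n) / 2
        so8_dim = comb(8, p)            # assumed: antisymmetric rank-p SO(8) tensor
        is_boson = twice_spin % 2 == 0  # integer spin <=> Grassmann even
        rows.append((m, n, p, twice_spin, so8_dim, is_boson))
    return rows

all_rows = [row for level in range(4) for row in summarize(level)]
boson_dims = {row[4] for row in all_rows if row[5]}
fermion_dims = {row[4] for row in all_rows if not row[5]}
print(sorted(boson_dims), sorted(fermion_dims))
```

Because m + n + p is even, integer spin always comes with even p, so the bosonic dimensions found are 1, 28 and 70 (the latter being 35+ + 35−) and the fermionic ones are 8 and 56. At level 0 the six components are the spin-1 singlets, the spin-1/2 8-plets and the spin-0 28-plet, in line with the generators listed in (2.4).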
A UIR of OSp(8|4) is denoted by D(E0 , s; a1 , a2 , a3 , a4 ), where the notation is
explained in Appendix B. The fundamental UIR of OSp(8|4), which is also a UIR of
hs(8|4), is the ultra-short singleton [36,37]
D(1/2, 0; 0, 0, 0, 1) ⊕ D(1, 1/2; 0, 0, 1, 0). (2.6)
3 The algebra hs(8|4) is called shsE (8|4) in [9] and shsE (8, 4|0) in [35], where it is also shown to be
isomorphic to ho(8; 8|4).
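\Phi(y, \bar{y}, \theta) = \sum_{-m+n+p = 0 \bmod 4} \frac{1}{m!\,n!\,p!}\, \bar{y}^{\dot\alpha_1} \cdots \bar{y}^{\dot\alpha_m} y^{\beta_1} \cdots y^{\beta_n} \theta^{i_1} \cdots \theta^{i_p} \Phi_{\dot\alpha_1 \cdots \dot\alpha_m \beta_1 \cdots \beta_n i_1 \cdots i_p} .    (2.7)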
The reality condition on Φ is discussed in detail in [16,17]. The gauging gives rise to
a set of field equations for physical fields (the action still remains to be found) whose
spectrum is given by the symmetric product of two singletons which is given in Table 1.
The physical spin s ≥ 1 fields are the gauge fields in Aµ (y, ȳ, θ ) that correspond to hs(8|4)
generators in (2.5) satisfying |m − n| ≤ 1. Those with m = n contain the vierbein and
its HS generalizations, while those with |m − n| = 1 contain the gravitini and their HS
generalizations. The physical fields with s ≤ 1/2 arise in Φ(y, ȳ, θ ) as the components in
(2.7) with m + n ≤ 1. The remaining fields in Aµ and Φ are auxiliary and given in terms
of derivatives of the independent fields.
So far we have discussed the free massless HS gauge theory. The general formulation of
interacting massless HS gauge theory has been given in D = 4 [14] (see [15] for a review),
and examined in detail for N = 8 [16,17]. There exists a minimal bosonic truncation of this
theory whose spectrum consists of the physical states with spin s = 0, 2, 4, . . . , each occurring
once. This theory exhibits the basics of any HS gauge theory rather well and it will be
discussed in considerable detail in Section 6.2, which is based on [19].
Table 1
The SO(3, 2) × SO(8) content of the symmetric tensor product of two d = 3, N = 8 singletons. Each entry refers
to the SO(8) content. All SO(8) representations are irreducible except 70 = 35+ + 35−, and all the states have E0 = s + 1
except the scalars in one of the 35-plets at level ℓ = 0 and one of the scalars at level ℓ = 1. The representations
have been arranged into a tower of OSp(8|4) supermultiplets labeled by a level index ℓ. The zeroth level is the
D = 4, N = 8 supergravity multiplet with 2^8 degrees of freedom. The level ℓ ≥ 1 multiplets have 2 × 2^8 degrees
of freedom. The spin s ≥ 1 fields arise in the hs(8|4) valued master gauge field and the spin s ≤ 1/2 fields arise in
the quasi-adjoint master zero-form. The minimal bosonic truncation of the spectrum is obtained by keeping the
maximum spin fields at each level and the (non-pseudo)scalar at level ℓ = 1.
ℓ\s  0     1/2   1     3/2   2     5/2   3     7/2   4     9/2   5     11/2  6     ···
0    70    56    28    8     1
1    1+1   8     28    56    70    56    28    8     1
2                            1     8     28    56    70    56    28    8     1
3                                                    1     8     28    56    70    ···
4                                                                            1     ···
⋮
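The degrees of freedom quoted in the caption of Table 1 can be checked with a few lines of arithmetic. The sketch below is only a sanity check under a stated assumption: each scalar entry is counted as one state and each massless field of spin s > 0 as two helicity states, as for on-shell massless fields in four dimensions. The dictionary layout and function names are hypothetical.

```python
# Each level maps to (twice the lowest spin, SO(8) multiplicities in steps of spin 1/2),
# read off from the first three rows of Table 1.
TABLE_1 = {
    0: (0, [70, 56, 28, 8, 1]),
    1: (0, [2, 8, 28, 56, 70, 56, 28, 8, 1]),   # "1+1" scalars counted as 2
    2: (4, [1, 8, 28, 56, 70, 56, 28, 8, 1]),
}

def helicity_states(twice_spin):
    """Assumed on-shell counting in D = 4: one state for a scalar, two otherwise."""
    return 1 if twice_spin == 0 else 2

def degrees_of_freedom(level):
    start, multiplicities = TABLE_1[level]
    return sum(mult * helicity_states(start + i)
               for i, mult in enumerate(multiplicities))

counts = {level: degrees_of_freedom(level) for level in TABLE_1}
print(counts)
```

With this counting, level 0 gives 2^8 = 256 states and levels 1 and 2 each give 2 × 2^8 = 512, in agreement with the caption.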
The 5D HS superalgebra hs(2, 2|4) [10] is realized in terms of the following oscillators4
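y_\alpha \star \bar{y}_\beta = y_\alpha \bar{y}_\beta + C_{\alpha\beta} , \quad y_\alpha \star y_\beta = y_\alpha y_\beta , \quad (y^\dagger i\Gamma^0 C)_\alpha = \bar{y}_\alpha ,    (2.8)
\theta^i \star \bar\theta_j = \theta^i \bar\theta_j + \delta^i_j , \quad \theta^i \star \theta^j = \theta^i \theta^j , \quad (\theta^i)^\dagger = \bar\theta_i ,    (2.9)
Z = \frac{1}{2} (\bar{y} y + \bar\theta \theta) ;    (2.10)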
and traceless in their spinor indices:
The tracelessness of Pi j means the removal of the outer U (1)Y automorphism generator
Y = θ̄ θ. (2.13)
The Lie bracket between P , Q ∈ hs(2, 2|4) is given by [P , Q] /I where I is the ideal
generated by elements of the form
\sum_{n=1}^{\infty} P_n(y, \bar{y}, \theta, \bar\theta) \star \underbrace{Z \star \cdots \star Z}_{n\ \mathrm{factors}} ,    (2.14)
where Pn are polynomials which are traceless in their spinor indices. The structure of the
Lie bracket is similar to (2.3).
The zeroth level of hs(2, 2|4) is the maximal finite subalgebra
where PU(2, 2|4) is the centrally extended superalgebra (with 31 bosonic generators). The
PSU(2, 2|4) generators are realized schematically as
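Q_{\alpha i} = y_\alpha \bar\theta_i , \quad \bar{Q}^i_\alpha = \bar{y}_\alpha \theta^i , \quad M_{\alpha\beta} = \bar{y}_\alpha y_\beta - \frac{1}{4} C_{\alpha\beta} (\bar{y} y) ,
U^i{}_j = \bar\theta^i \theta_j - \frac{1}{4} \delta^i_j (\bar\theta \theta) .    (2.16)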
The Lorentz spin of a generator in (2.11) is given by (jL , jR ) = (m/2, n/2) and the U (1)Y
charge by Y = p − q. The components with integer jL + jR are Grassmann even and those
with half-integer jL + jR are Grassmann odd. Bosons are in the 1_0, 15_0, 20_0, 6_2, 10_2 and
1_4 irreps of SU(4) × U (1)Y and fermions in the 4_1, 4_3 and 10_3 irreps. The reality properties
follow from the condition P † = −P which, in particular, implies that the irreps with Y = 0
are real. The generators of the algebra are summarized in Tables 4 and 5 (see Appendix A).
A UIR of SU(2, 2|4) is denoted by D(E0 , jL , jR ; a1 , a2 , a3 )Y where the notation
is explained in Appendix B. The fundamental UIRs of SU(2, 2|4) are the ultra-short
singletons given in Table 6 in Appendix A [41]. Due to the modding out of the ideal I
generated by elements of the form (2.14) the fundamental UIR of hs(2, 2|4) is the singleton
with vanishing Z charge, i.e., the Maxwell supermultiplet [41–44]
D(1, 0, 0; 0, 1, 0)_0 ⊕ D(3/2, 1/2, 0; 1, 0, 0)_{−1} ⊕ D(3/2, 0, 1/2; 0, 0, 1)_1
⊕ D(2, 1, 0; 0, 0, 0)_{−2} ⊕ D(2, 0, 1; 0, 0, 0)_2. (2.17)
By taking products of this multiplet we obtain further unitary representations of hs(2, 2|4).
In particular, the product of two singletons yields massless AdS5 fields whose energies,
which are given by E0 = 2 + jL + jR saturate the unitarity bound of a continuous series
(denoted as series A in Appendix B) [41,42].
The massless sector of the hs(2, 2|4) gauge theory is formulated in terms of an hs(2, 2|4)
valued master gauge field Aµ (y, ȳ, θ, θ̄) and a master zero-form Φ(y, ȳ, θ, θ̄ ) in a certain
quasi-adjoint representation of hs(2, 2|4) [9,10], which contains the Weyl tensors and the
extra ‘matter’ fields given in Table 7 in Appendix A. The gauging gives rise to physical
fields whose spectrum is given by the symmetric product of two singletons given in
Table 2. The fields with |Y| ≤ 1 and |Y| ≥ 2, jL + jR ≤ 1/2 carry SO(4, 1) weights
such that the analysis of the curvature constraints in this sector is analogous to that in
D = 4. In the |Y| ≥ 2, jL + jR ≥ 1 sector the fields carry SO(4, 1) weights that require
a separate analysis. One finds that [10] the physical fields arise as two-form potentials in
Φ obeying the odd-dimensional self-duality equation B2 = ⋆dB2 or higher-spin analogs
of this equation (such equations have been more recently studied in the lightcone gauge
in [45]). The gauge fields in Aµ with |Y| ≥ 2 are auxiliary fields which are related to the
independent two-forms in Φ by generalized Hodge dualization rules [10].
So far we have discussed the free massless HS gauge theory. The full interacting theory
based on hs(2, 2|4) has not been constructed yet. However, the kinematics established
in [10] and summarized above, together with the already established principles [14] that
govern the structure of the interacting HS gauge theory in D = 4, suggest that the full
5D interacting theory is perfectly within reach. Indeed certain cubic interactions of the
minimal bosonic HS theory in 5D have already been constructed by Vasiliev [12].
Table 2
The symmetric tensor product of two d = 4, N = 4 SYM singletons arranged into levels ℓ = 0, 1, 2, . . . of
PSU(2, 2|4) multiplets. The entries denote USp(8) representations: 28 = 27 + 1, 56 = 48 + 8, 70 = 42 + 27 + 1,
which branch under SU(4) × U (1)Y as follows: 1 = 1_0, 8 = 4_1 + 4̄_{−1}, 27 = 15_0 + 6_2 + 6̄_{−2}, 42 = 20_0 + 10_2 +
10_{−2} + 1_4 + 1̄_{−4} and 48 = 20_1 + 20_{−1} + 4_3 + 4̄_{−3}. Each entry also carries SO(4) ≃ SU(2)L × SU(2)R ⊂ SO(4, 2)
spins (jL , jR ). The total spin s is defined as s = jL + jR and the U (1)Y charge is given by Y = 2(jR − jL ).
The level ℓ = 0 multiplet is the D = 5, N = 8 supergravity multiplet. The level ℓ = 1 multiplet is the massless
Konishi multiplet. The level ℓ ≥ 0 multiplets have (4ℓ + 1) × 2^8 degrees of freedom. The states in the s ≤ 1/2
sector arise as the physical states in the master scalar field Φ, as shown in Table 7. For s ≥ 1, the states with
Y = 0, ±1 arise in the sector of the master gauge field Aµ corresponding to the generators of hs(2, 2|4) listed
in Table 4. Those with Y = ±2, ±3, ±4 arise in the master scalar field Φ. With the exception noted in Table 7,
these have dual gauge fields corresponding to the generators of hs(2, 2|4) listed in Table 5. The minimal bosonic
truncation of the spectrum is obtained by keeping the maximum spin fields at each level and the scalar at level
ℓ = 1.
ℓ\s  0     1/2   1     3/2   2     5/2   3     7/2   4     9/2   5     11/2  6     ···
0    42    48    27    8     1
1    1     8     28    56    70    56    28    8     1
2                            1     8     28    56    70    56    28    8     1
3                                                    1     8     28    56    70    ···
4                                                                            1     ···
⋮
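The count of (4ℓ + 1) × 2^8 degrees of freedom quoted in the caption of Table 2 can be checked in the same spirit. The sketch below assumes that each entry of spin s contributes 2s + 1 states, i.e. that states are counted as representations of an SO(3) little group times USp(8); the data layout and names are hypothetical.

```python
# Each level maps to (twice the lowest spin, USp(8) multiplicities in steps of spin 1/2),
# read off from the first three rows of Table 2.
TABLE_2 = {
    0: (0, [42, 48, 27, 8, 1]),
    1: (0, [1, 8, 28, 56, 70, 56, 28, 8, 1]),
    2: (4, [1, 8, 28, 56, 70, 56, 28, 8, 1]),
}

def so3_states(twice_spin):
    """Number of states in an SO(3) representation of spin s: 2s + 1."""
    return twice_spin + 1

def degrees_of_freedom_5d(level):
    start, multiplicities = TABLE_2[level]
    return sum(mult * so3_states(start + i)
               for i, mult in enumerate(multiplicities))

counts_5d = {level: degrees_of_freedom_5d(level) for level in TABLE_2}
expected_5d = {level: (4 * level + 1) * 2**8 for level in TABLE_2}
print(counts_5d, expected_5d)
```

Under this assumption the three rows give 256, 1280 and 2304 states, which are 1, 5 and 9 times 2^8.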
where P_n^{I1···In} (y, ȳ, θ, θ̄ ) has an expansion in terms of traceless, Weyl ordered multispinors
and the SU(2)Z indices I1 · · · In are symmetric. The structure of the Lie bracket is again
similar to (2.3). The zeroth level of hs(8∗ |4) is the maximal finite subalgebra OSp(8∗ |4)
realized schematically as
Qαi = yα θ̄i − ȳα θi , Mαβ = ȳ[α yβ] , Uij = θ(i θ̄j ) . (2.22)
An ℓth level generator P^{(ℓ)} in hs(8∗ |4) can be expanded as
where the components are traceless in their Lorentz spinor indices and belong to super-Young
tableaux with two rows of length 2ℓ + 1. A single box in the super-Young tableaux
represents the superoscillator ξ A = (y α , θ i ) or ξ̄ A = (ȳ α , θ̄ i ). An arbitrary Weyl ordered
monomial in these superoscillators corresponds to a super-Young tableaux with two rows.
The restriction m + n = p + q in (2.23) (i.e., equal number of ξ A and ξ̄ A ) follows from the
condition [Z3 , P ] = 0, while the condition [Z± , P ] = 0 rules out super-Young tableaux
with rows of unequal length. The resulting super-Young tableaux of width 2ℓ + 1 splits
into a set of Young tableaux of spinors. Each SO(6, 2) Young tableaux branches into a set
of Young tableaux of SO(6, 1) spinors. The spinorial SO(6, 1) × SO(5) Young tableaux
can be converted into tensorial ones by multiplying with appropriate Dirac matrices of
both groups. The resulting SO(5) irreps are 1_0, 5_0, 10_0, 14_0, 1_2, 5_2, 10_2, 1_4 in the bosonic
sector and 4_1, 16_1, 4_3 in the fermionic sector, where the subscripts denote the U (1)Y charge
defined as
Y = n_θ̄ − n_θ , (2.24)
with n_θ̄ = q and n_θ = p, as specified in the expansion (2.23). The SO(6, 1) highest weights
(m1 , m2 , m3 ) are given by
UIR of hs(8∗ |4). This singleton is the (2, 0) tensor multiplet [44,46–48]
D(2, 0, 0, 0; 0, 1)_0 ⊕ D(5/2, 1, 0, 0; 1, 0)_1 ⊕ D(5/2, 0, 0, 1; 1, 0)_{−1}
⊕ D(3, 0, 0, 2; 0, 0)_2 ⊕ D(3, 2, 0, 0; 0, 0)_{−2}. (2.27)
By taking products of this singleton one obtains further unitary representations of hs(8∗ |4).
In particular, the square yields massless AdS7 fields with energy E0 = 4 + s where s ≡ J1 .
These energies belong to an isolated series (denoted as series B in Appendix B) [48],
unlike in D = 4, 5 where the massless fields have energies that saturate a continuous series
(the continuous series is saturated by lowest weight spaces arising in the product of three
singletons).
The superalgebra hs(8∗ |4) has a minimal bosonic HS subalgebra hs(8∗ ) whose
representation theory and gauging was described in [20]. We shall assume that the
massless sector of the hs(8∗ |4) gauge theory is formulated in terms of an hs(8∗ |4) valued
master gauge field Aµ (y, ȳ, θ, θ̄) and a master zero-form Φ(y, ȳ, θ, θ̄ ) in a quasi-adjoint
representation5 of hs(8∗ |4) and that the gauging gives rise to physical fields whose
spectrum is given by the symmetric product of two tensor singletons listed in Table 3.
The gauge fields with Y = 0 carry SO(6, 1) weights which are similar to those in the
minimal bosonic theory [20]. The gauge fields with |Y | 1 carry SO(6, 1) weights which
are analogous to those carried by the |Y | 1 fields in the hs(2, 2|4) theory in D = 5.
Thus we expect that the physical fields with |Y| ≤ 1, s ≥ 1 arise in Aµ . The remaining
physical fields, which have s ≤ 1/2, or s ≥ 1 and |Y| ≥ 2, must arise in Φ and be those given
Table 3
The symmetric tensor product of two d = 6, N = (2, 0) tensor singletons arranged into levels ℓ = 0, 1, 2, . . .
of OSp(8∗ |4) multiplets. The entries denote SO(5) × U (1)Y representations as follows: 14 = 14_0, 16 = 16_1,
15 = 10_0 + 5_2, 4 = 4_1, 16 = 10_0 + 5_2 + 1_2, 24 = 16_1 + 4_1 + 4_3, 36 = 14_0 + 5_0 + 1_0 + 10_2 + 5_2 + 1_4. The
SO(6) ⊂ SO(6, 2) highest weights (n1 , n2 , n3 ) associated with each entry are given by n1 = s, n2 = (1/2)|Y| and
n3 = (1/2)Y. The level ℓ = 0 multiplet is the D = 7, N = 2 supergravity multiplet. The level ℓ ≥ 0 supermultiplets
contain (1/3)(ℓ + 1)(2ℓ + 1)(4ℓ + 3) × 2^8 degrees of freedom. The states with |Y| ≤ 1, s ≥ 1 are expected to arise in
the sector of the master gauge field Aµ corresponding to the generators given in Table 8. The states with s ≤ 1/2,
or |Y| ≥ 2 and s ≥ 1, which are listed in Table 11, are expected to arise in a quasi-adjoint master zero-form Φ.
With a few low-lying exceptions which are given in Table 11, these are generalized Hodge duals of the |Y| ≥ 2
sector of the master gauge field Aµ which corresponds to the hs(8∗ |4) generators listed in Table 9. The minimal
bosonic truncation of the spectrum is obtained by keeping the maximum spin fields at each level and the scalar at
level ℓ = 1.
ℓ\s  0     1/2   1     3/2   2     5/2   3     7/2   4     9/2   5     11/2  6     ···
0    14    16    15    4     1
1    1     4     16    24    36    24    16    4     1
2                            1     4     16    24    36    24    16    4     1
3                                                    1     4     16    24    36    ···
4                                                                            1     ···
⋮
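The SO(5) × U(1)Y decompositions listed in the caption of Table 3 can be sanity-checked mechanically. The sketch below verifies only two things: that the dimensions on each side agree, and that entries sitting in half-integer spin columns carry odd Y while those in integer spin columns carry even Y. The boson/fermion flags are read off from the table columns, and the data layout is hypothetical.

```python
# (total dimension, sits in a half-integer spin column?, [(SO(5) dimension, Y), ...])
DECOMPOSITIONS = [
    (14, False, [(14, 0)]),
    (16, True,  [(16, 1)]),
    (15, False, [(10, 0), (5, 2)]),
    (4,  True,  [(4, 1)]),
    (16, False, [(10, 0), (5, 2), (1, 2)]),
    (24, True,  [(16, 1), (4, 1), (4, 3)]),
    (36, False, [(14, 0), (5, 0), (1, 0), (10, 2), (5, 2), (1, 4)]),
]

def check(entry):
    """True if the dimensions add up and the Y parity matches the statistics."""
    total, is_fermion, parts = entry
    dims_ok = sum(dim for dim, _ in parts) == total
    parity_ok = all((charge % 2 == 1) == is_fermion for _, charge in parts)
    return dims_ok and parity_ok

results = [check(entry) for entry in DECOMPOSITIONS]
print(results)
```

All seven decompositions pass both checks. This confirms the arithmetic of the caption only; it does not test the group theory behind the branching.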
5 This representation was defined for hs(8∗ ) in [20]. Its generalization to hs(8∗ |4) will not be given here.
The fundamental UIR of OSp(8|4), which is also a UIR of hs(8|4), is the ultra-short
singleton specified in (2.6). This is just the d = 3, N = 8 scalar multiplet, and its superfield
realization has been known for some time. In particular, it has arisen in the superembedding
formulation of M2-branes [49]. Following [50], let us work with a realization related to the
one in [49] by triality. The singleton superfield then carries a spinor representation of
SO(8) and obeys the constraint
This means that Σ ij describes a BPS-1/8 multiplet and it satisfies the unitarity condition
of series B. In [50], a BPS short multiplet of this type is built out of four singletons using
harmonic superspace technique. In terms of ordinary superfields we write it as
Σ ij = (Γimnp )AB Γj mnp CD Φ A Φ B Φ C Φ D − trace. (3.11)
This is just the 35-plet contained in the symmetric product (8s × 8s × 8s × 8s )S . Since the
superfield Σ ij represents a BPS-1/8 multiplet, its components go up to smax = 7/2. Therefore,
it is natural to consider this superfield as a candidate for coupling to a Higgs superfield
in the bulk which can be eaten by the massless Konishi multiplet to become massive.
Turning to the candidate anomaly superfield Σ^i_{α2···α_{4ℓ−4}} given in (3.10), we observe that
it carries a semi-short UIR that saturates the unitarity bound of series A. In general, such
multiplets have been constructed as [50]
S^{[a_i]} = \Phi^2 \, \mathrm{BPS}^{[a_i]} ,    (3.12)
S^{\{\mu_1 \cdots \mu_s\}[a_i]} = J^{\{\mu_1 \cdots \mu_s\}} \, \mathrm{BPS}^{[a_i]} ,    (3.13)
where BPS^{[a_i]} is any one of the BPS short multiplets listed in (B.4)–(B.6), and J^{{µ1···µs}}
is a spin s current. Assuming that the candidate anomaly superfield Σ^i_{α2···α_{4ℓ−4}} belongs
to an irreducible representation of OSp(8|4), since it is an 8-plet of SO(8), it requires the
BPS-1/8 multiplet D(1, 0; 1, 0, 0, 0) and a spin s = 2ℓ − 1/2 current in (3.13). However, the
BPS-1/8 multiplet cannot be built out of one type of singleton field. Thus, the construction
of Σ^i_{α2···α_{4ℓ−4}}, which is important for a Higgs mechanism that can work at all levels ℓ ≥ 1,
remains an open problem.
The BPS multiplets that can be constructed from the product of one type of singletons
are all the BPS-1/2 and BPS-1/4 multiplets listed in (B.4) and (B.5), and all those BPS-1/8
multiplets listed in (B.6) with integer s [51]. These multiplets, as well as the semi-short
multiplets discussed above which make use of them, are likely to play a significant
role in the description of the full HS gauge theory based on hs(8|4). In particular, the
KK supermultiplets associated with level ℓ supermultiplets of the massless HS theory are
expected to be BPS-1/2 multiplets. For example, the level ℓ = 0 multiplet and its KK
towers are realized as [53]
D(k/2, 0; 0, 0, k, 0): Φ_{(A1} Φ_{A2} · · · Φ_{Ak)} − traces, k = 2, 3, . . . . (3.14)
Taking k = 2 gives the massless supergravity multiplet and k = 3, 4, . . . give their massive
KK descendants. Similarly, the semi-short multiplets (3.12) and (3.13) with BPS-1/2
composites carrying the irrep D(k/2, 0; 0, 0, k, 0) are candidates for KK descendants of
the level ℓ > 0 massless multiplets of the HS gauge theory based on hs(8|4).
The fundamental UIR of SU(2, 2|4), which is also a UIR of hs(2, 2|4), is the ultra-
short singleton specified in (2.17). This is the d = 4, N = 4 Maxwell multiplet realized in
terms of superfield Wij , where i = 1, . . . , 4 labels the 4-plet of SU(4) and W ij = −Wj i .6
6 This is the unique singleton multiplet of PSU(2, 2|4) and it has vanishing U (1)Z central charge. The
centrally extended PU(2, 2|4) superalgebra admits an infinite number of singleton multiplets. These have jR = 0
(their complex conjugates have jL = 0), and E0 = jL + 1. Each singleton multiplet forms a massless UIR of
the d = 4, N = 4 Poincaré superalgebra and is characterized by central charge ℓ = 2|Z| = 0, 1, 2, . . . . Viewed
as massless states of the d = 4 Poincaré group, they carry Lorentz spin (i.e., maximum SO(2) helicity) s = jL .
The first three levels of the singleton spectrum shown in Table 6 are special because they are the only singleton
multiplets which contain scalar fields. They are D(1, 0, 0; 0, 1, 0), D(1, 0, 0; 1, 0, 0) and D(1, 0, 0; 0, 0, 0) with Z
charges (0, 1/2, 1) and they can be described by superfields (Wij , W i , W ), respectively. The level ℓ singletons
D(ℓ/2, ℓ/2 − 1, 0; 0, 0, 0) have central charge Z = ℓ/2 and can be described by superfield ω_{α1···α_{ℓ−2}}. The constraints
satisfied by all singleton superfields can be found in [50].
D_\alpha^m J_{ij}^{kl} = \chi_{\alpha ij}^{mkl} + \delta_{[i}^{m} \lambda_{\alpha j]}^{kl} + \delta_{[i}^{[k} \lambda_{\alpha j]}^{l]m} ,    (3.17)
where λ and χ are both totally antisymmetric in lower and upper indices and totally
traceless. The superfield Jij,kl carries the irrep D(2, 0, 0; 0, 2, 0). It belongs to series C and
it describes a BPS-1/2 multiplet. Its components can be shown to contain the composite
operators that correspond to the level ℓ = 0 supergravity multiplet shown in Table 2, and
that the components with spin s ≥ 1 are conserved currents.
The level ℓ = 1 supercurrent is also a special one and is known as the massless Konishi
multiplet. It has the simple form [56]
J = W_{ij} W^{ij} . (3.18)
As a result of the basic singleton constraint (3.15), this current obeys the constraint
D^{ij} J = 0, D^{ij} := D^{α(i} D_α^{j)} . (3.19)
This multiplet has 5 × 2^8 components, and they precisely correspond to the level ℓ = 1 massless states
shown in Table 2. It is characterized by the irrep D(2, 0, 0; 0, 0, 0) carried by its lowest
component. It is a semi-short multiplet which saturates the unitarity bound of series A.
In the Poincaré limit, the states are labeled by the little group SO(3) × SU(4). Denoting
the irreps by R_s , where R denotes a USp(8) irrep (which should be decomposed into
SU(4) irreps) and s is the SO(3) spin, the level ℓ = 1 massless Konishi multiplet can be
obtained by tensoring the level ℓ = 0 supergravity multiplet with an SU(4) singlet spin
s = 2 state as follows:
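The tensoring just described can be illustrated with the SO(3) Clebsch–Gordan rule. The following sketch is not the decomposition displayed in the original paper; it takes the level-0 row of Table 2 as input, tensors each entry with a spin-2 singlet, and collects the total USp(8) dimension at each resulting spin. The function and variable names are hypothetical.

```python
from collections import Counter

# Level-0 supergravity multiplet from Table 2: {twice the spin: USp(8) dimension}.
SUGRA_LEVEL_0 = {0: 42, 1: 48, 2: 27, 3: 8, 4: 1}

def tensor_with_spin(multiplet, twice_j):
    """Tensor every SO(3) spin in `multiplet` with spin j (given as 2j).

    Uses s x j = |s - j| + ... + (s + j) and sums the internal dimensions
    landing on each resulting spin.
    """
    result = Counter()
    for twice_s, dim in multiplet.items():
        for twice_t in range(abs(twice_s - twice_j), twice_s + twice_j + 1, 2):
            result[twice_t] += dim
    return dict(sorted(result.items()))

konishi = tensor_with_spin(SUGRA_LEVEL_0, twice_j=4)  # spin 2 corresponds to 2j = 4
print(konishi)
```

The result has total dimensions 1, 8, 28, 56, 70, 56, 28, 8, 1 at spins 0, 1/2, ..., 4, which is the level ℓ = 1 row of Table 2. The sums that appear along the way, 28 = 27 + 1, 56 = 48 + 8 and 70 = 42 + 27 + 1, are the ones quoted in the caption of that table.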
The superfield J_{µ1 µ2···µ_{2ℓ−2}} carries the irrep D(4ℓ − 2, 2ℓ − 2, 2ℓ − 2; 0, 0, 0) and its
components have spins that range from (2ℓ − 2) to (2ℓ + 2). This superfield saturates
the unitarity bound of series A and it describes a semi-short multiplet.
The explicit construction of all the supercurrents in terms of the Maxwell singleton is a
straightforward but tedious exercise which apparently has not been carried out so far. They
are known, however, for the minimal bosonic truncation of the massless HS gauge theory
in D = 5 discussed above. They take the form [8,13]
j_{\mu_1 \cdots \mu_{2\ell-2}} = \sum_{k=0}^{2\ell-2} \frac{(-1)^k}{(k!)^2 ((s-k)!)^2} \, \partial_{\mu_1} \cdots \partial_{\mu_k} \phi^* \, \partial_{\mu_{k+1}} \cdots \partial_{\mu_{2\ell-2}} \phi - \mathrm{traces} .    (3.23)
So far we have considered free SYM singletons. Switching on the SYM interactions, the
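currents listed above for ℓ ≥ 1 will no longer be conserved. The resulting anomalies can
be characterized as follows
D^{ij} J = \sqrt{\lambda} \, \Sigma^{ij} ,
(\bar\sigma^{\mu_1})_{\dot\alpha\beta} D^{i\beta} J_{\mu_1 \mu_2 \cdots \mu_{2\ell-2}} = \sqrt{\lambda} \, \Sigma^i_{\mu_2 \cdots \mu_{2\ell-2}, \dot\alpha} ,
(\sigma^{\mu_1})_{\alpha\dot\beta} D_i^{\dot\beta} J_{\mu_1 \mu_2 \cdots \mu_{2\ell-2}} = \sqrt{\lambda} \, \Sigma_{i \mu_2 \cdots \mu_{2\ell-2}, \alpha} ,    (3.24)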
where the constant normalization factor is introduced for later convenience (see Section 5).
The superfields on the right-hand side carry the following UIRs of SU(2, 2|4)
In the interacting SYM singleton theory the anomaly superfield Σ ij takes the well known
form (see, for example, [61,65]):
\Sigma^{ij} = \frac{4}{N^{3/2}} \, \mathrm{Tr}\, W^{k(i} W^{j)\ell} W_{k\ell} ,    (3.28)
where the constant normalization factor is introduced for later convenience (see Section 5).
This superfield belongs to series B and it describes a BPS-1/8 multiplet. Consequently its
components go up to smax = 7/2 and therefore it is a candidate for coupling to Higgs
superfield in the bulk which can be eaten by the massless Konishi multiplet to become
massive. All the components of the massive Konishi multiplet of PSU(2, 2|4) have been
tabulated in [57].
The candidate anomaly superfields Σ^i_{α2···α_{2ℓ−2},α}, on the other hand, carry semi-short
UIRs that satisfy the unitarity bounds of series A or B. In general, such multiplets have been
constructed as [50]
S^{[a_i]} = \Phi^2 \, \mathrm{BPS}^{[a_i]} ,
S^{\{\mu_1 \cdots \mu_s\}[a_i]} = J^{\{\mu_1 \cdots \mu_s\}} \, \mathrm{BPS}^{[a_i]} ,    (3.29)
where BPS^{[a_i]} is any one of the BPS operators listed in (B.11)–(B.13), and J^{{µ1···µs}} is a
spin s current, to be constructed out of the free SYM singleton in our case. For the BPS-1/2
and BPS-1/4 cases, both of the above operators saturate the series A unitarity bound
(B.8), while in the case of BPS-1/8, they belong to series B. Assuming that the candidate
anomaly superfield Σ^i_{α2···α_{2ℓ−2},α} carries an irreducible representation, and given that it is in
(100) of SU(4), attempting to construct it as in (3.29) requires the use of BPS-1/8 multiplet
D(3/2, 0, 0; 1, 0, 0), as follows from (B.13). However, these BPS multiplets cannot be built
out of SYM singletons alone [50].
The BPS multiplets that can be constructed out of products of SYM singletons alone
are all the BPS-1/2 and BPS-1/4 multiplets listed in (B.11) and (B.12), and all those
BPS-1/8 multiplets listed in (B.13) with integer r [51]. These multiplets, and the semi-short
multiplets discussed above which make use of them, are likely to play a role in
finding the massive states of the full HS gauge theory based on hs(2, 2|4). In particular,
the KK supermultiplets associated with level ℓ supermultiplets of the massless HS theory
are expected to make use of the BPS-1/2 states. For example, the level ℓ = 0 multiplet and
its KK towers are realized as [3,58,59]
D(k, 0; 0, k, 0): W_{(a1} W_{a2} · · · W_{ak)} − traces, k = 2, 3, . . . . (3.30)
Setting k = 2 gives the massless supergravity multiplet and k = 3, 4, . . . their massive KK
descendants. Similarly, the semi-short multiplets (3.12) and (3.13) involving the BPS-1/2
composites carrying the irrep D(k, 0; 0, k, 0) are candidates for KK descendants of the
level ℓ > 0 massless multiplets of the HS gauge theory based on hs(2, 2|4).
The fundamental UIRs of OSp(8∗ |4) are the singletons given in Table 10 in Appendix A
[46,48]. Each row in the table denotes an irreducible singleton multiplet. The superfield
realization of the 6D singletons has been studied by several authors. Here we shall follow
[50,51] where several references to earlier literature can also be found. There exist several
papers on the construction of the composite operators out of the 6D singletons as well; see
[50,51,66,67], for example.
There exists an infinite set of singletons of OSp(8∗ |4). They are shown in Table 10 and
listed in Appendix B. The (2, 0) tensor singleton is the only one which is a singlet under an
$SU(2)_Z$ defined in Section 2.3. Here we shall focus our attention on the level $\ell = 0$ singleton
described by the superfield $W^{ij}$ which forms the tensor multiplet of d = 6, N = (2, 0)
Poincaré supersymmetry, since all the HS gauge theory states will be formed out of them.
To begin with, we shall take a single copy of the tensor multiplet. Abelian nature of
the singletons is essential for the construction of conserved currents. The superfield Wij
satisfies the following constraints and reality condition [66]
$$ J = W_{ij} W^{ij}. \tag{3.33} $$
This current obeys the constraint [50]
$$ \epsilon^{\alpha\beta\gamma\delta} D^{(i}_{\alpha} D^{j}_{\beta} D^{k)}_{\gamma} J = 0. \tag{3.34} $$
The superfield J carries the irrep D(4; 0, 0, 0; 0, 0). It has $14 \times 2^8$ components and it can
be obtained group theoretically by tensoring the level $\ell = 0$ supergravity multiplet with the
graviton state which has 14 degrees of freedom. It is a semi-short multiplet which belongs
to series B.
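The component count quoted for J can be reproduced with a few lines of arithmetic. This is only a sketch: it reads the count as 14 × 2^8 (the exponent is an assumption about the damaged typography), takes 2^8 as the number of states of the maximal supergravity multiplet, and uses the standard little-group counting for a massless graviton in D dimensions. The function and constant names are illustrative and do not come from the paper.

```python
def graviton_dof(D: int) -> int:
    """On-shell degrees of freedom of a massless graviton in D dimensions.

    The graviton is a symmetric traceless rank-2 tensor of the little
    group SO(D-2), so it has n(n+1)/2 - 1 components with n = D - 2.
    """
    n = D - 2
    return n * (n + 1) // 2 - 1


# Assumed size of the level-0 (maximal) supergravity multiplet: 128 + 128 states.
SUGRA_STATES = 2 ** 8


def supercurrent_components(D: int = 7) -> int:
    """Components obtained by tensoring the supergravity multiplet with the graviton."""
    return graviton_dof(D) * SUGRA_STATES


if __name__ == "__main__":
    print(graviton_dof(7), supercurrent_components(7))
```

For D = 7 the graviton count is 14, in agreement with the number quoted in the text.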
The massless multiplets arising at level $\ell \geq 2$ in the spectrum shown in Table 2 are
generic and the corresponding conserved currents are contained in the superfield
The superfield $J_{\alpha_1\cdots\alpha_{2\ell-2},\beta_1\cdots\beta_{2\ell-2}}$ carries the irrep D(2ℓ + 2; 0, 2ℓ − 2, 0; 0, 0). It is a semi-
short multiplet which belongs to series B.
An explicit construction of these supercurrents in terms of the (2, 0) tensor singleton
apparently has not been carried out so far. They are known, however, for the minimal
bosonic truncation of the massless HS gauge theory in D = 7 discussed earlier. They take
the form [8,13]
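$$ j_{\mu_1\cdots\mu_{2\ell-2}} = \sum_{k=0}^{2\ell-2} \frac{(-1)^k}{k!\,(k+1)!\,(s-k)!\,(s-k+1)!}\; \partial_{\mu_1}\cdots\partial_{\mu_k}\phi^{*}\; \partial_{\mu_{k+1}}\cdots\partial_{\mu_{2\ell-2}}\phi \;-\; \text{traces}. \tag{3.37} $$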
So far we have considered free (2, 0) tensor singletons. Interactions for multi-copies of
these singletons are not known and they are expected to be radically different from those
familiar from ordinary field theory. These interactions are also expected to break the HS
gauge symmetries down to those of level $\ell = 0$ supergravity. Let us characterize the
breakdown in the conservation laws of the supercurrents of level $\ell \geq 1$ as follows
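$$ \epsilon^{\alpha\beta\gamma\delta} D^{(i}_{\alpha} D^{j}_{\beta} D^{k)}_{\gamma} J = g\, \Sigma^{\delta\, ijk}, \tag{3.38} $$
$$ \epsilon^{\delta\gamma\alpha_1\beta_1} D^{i}_{\gamma} J_{\alpha_1\cdots\alpha_{2\ell-2},\beta_1\cdots\beta_{2\ell-2}} = g\, \Sigma^{\delta i}_{\alpha_2\cdots\alpha_{2\ell-2},\beta_2\cdots\beta_{2\ell-2}}, \tag{3.39} $$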
where g is some coupling constant. Unlike in the cases of d = 3, 4, here we see that the
representation content of the candidate anomaly superfields does not correspond to any BPS
short or semi-short multiplets listed in Appendix B. Of course, here we are assuming that
these anomaly superfields are irreducible. Their computation from first principles may
in principle reveal that they are reducible, and possibly derivatives of some irreducible
superfields. The nature of the anomaly superfields should also reflect the fact that there
are no local non-Abelian interactions for tensor fields that can be described by continuous
deformations of the free theory [90]. This is a qualitative difference between d = 6 and
d = 3, 4, where the free fields admit SYM deformations (after dualization of a scalar in
d = 3).
The semi-short multiplets, as in 3D and 4D cases, have also been constructed in terms
of building blocks discussed above, and they take the form [50]
As in the cases of 3D and 4D, here too, setting k = 2 gives the massless supergravity
multiplet and k = 3, 4, . . . give their massive KK descendants. Similarly, the semi-short
multiplets (3.40) and (3.41) with BPS-1/2 composites carrying the irrep D(2k, 0, 0, 0; 0, k)
are candidates for KK descendants of the level $\ell > 0$ massless multiplets of the HS gauge
theory based on hs(8∗ |4).
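$$ \langle O_1 O_2 O_3 O_4\rangle = \langle O_1 O_2\rangle \langle O_3 O_4\rangle + \langle O_1 O_2 O_3 O_4\rangle_{\rm conn}, \tag{4.1} $$
$$ \langle O_1 O_2 O_3 O_4\rangle_{\rm conn} = \sum_r \frac{\langle O_1 O_2 O_r\rangle \langle O_r O_3 O_4\rangle}{\langle O_r O_r\rangle}, \tag{4.2} $$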
where the disconnected terms are the contributions from the unit operator and the
connected terms are the contributions from the remaining operators. The factorization
means that the connected terms are suppressed by powers of 1/N :
$$ \frac{\langle O_1 O_2 O_3 O_4\rangle_{\rm conn}}{\langle O_1 O_2\rangle \langle O_3 O_4\rangle} \to 0 \quad \text{as } N \to \infty. \tag{4.3} $$
In general, there can be several parameters in addition to N in CFTd . Fortunately,
supersymmetry puts considerable constraints on these possibilities. With
application to type IIB string and M-theory in mind, we shall assume that G = SU(N) and
consider SU(N) valued singleton scalar superfields denoted by $W^I$, I = 1, . . . , n. In this
case we have $\mathcal{N} = N^2 - 1$ and the singletons transform in the fundamental representation
of the R-symmetry group SO(n). For the cases of interest, namely in d = 3, 4, 6, we have in
mind the R-symmetry groups SO(8), SO(6) and SO(5), respectively, which correspond to
16 ordinary plus 16 special supersymmetries in the CFTd . The SU(N) valued singletons in
d = 4 are adequate for discussing the tensionless limit of the type IIB theory on $AdS_5 \times S^5$.
The extent to which SU(N) valued singletons in d = 3, 6 may encode the properties of (an
unbroken phase of) M-theory on $AdS_{4/7} \times S^{7/4}$ is discussed in Sections 6 and 7.
The basic composite operators in CFTd are primary bilinear single-trace operators
O(2)r , where the index r labels collectively the set of (SO(d, 2) × R)-representations
involved [7,8]. These operators do not mix with any other operators and provide conserved
HS currents with spin $s \geq 1$, and certain composite operators of lower spin s < 1. Together
they form an HS multiplet that corresponds in a one-to-one fashion to an HS multiplet of
physical massless bulk fields,7 φ(2)r . In the supersymmetric singleton models of special
interest to type IIB/M-theory the bilinear primaries are discussed in Section 3 and the
corresponding massless spectra are listed in Tables 1, 2 and 3.
The free CFTd also contains composite operators which are pth order monomials in
the basic singleton and its derivatives. Those composites which are not normal ordered
products of other composites as N → ∞ are interpreted as massive single-particle states
in AdS. We shall denote these operators and the corresponding massive bulk fields by
O(p)r and φ(p)r , respectively, where $p \geq 3$ and r is an additional set of indices labeling the
SO(d, 2) × R weights. The massiveness means that there is no shortening of the associated
SO(d, 2) weight spaces. This implies that the massive operators are not conserved and
hence there are no gauge symmetries associated with the corresponding massive AdS
fields. However, as discussed in the previous section, some of the massive operators belong
to shortened supermultiplets, provided that the superconformal weights saturate certain
unitarity bounds or belong to discrete series. This is the case, for example, for 1/2 BPS
KK modes and the Higgs multiplets listed in the previous section.
For fixed p the space of massive operators O(p)r clearly decomposes into irreducible
HS multiplets, though the representation theory of HS algebras, such as their root structure,
has not yet been developed far enough to characterize the precise ‘lowest’ weights carried
by these multiplets (see [20] for a discussion of this point).
Composite operators which are normal ordered products of other composite operators
as N → ∞ are interpreted as many-particle states. In the case of SU(N) valued singletons,
the single-particle states, O(p)r (p = 2, 3, . . .) are given in the large N limit by single-trace
operators. The n-particle states, which we shall denote by O(p1 ,...,pn )r are given in this limit
by multi-trace operators in the form of normal ordered products of single trace operators
O(pi )ri and their derivatives, pi = 2, 3, . . . , i = 1, . . . , n.
For finite N there is mixing between the single-trace and multi-trace operators [7,23].
This is because n-particle states in the bulk couple to operators that diagonalize the two-
point function:
$$ \langle O_R O_S\rangle = \eta_{RS}, \tag{4.4} $$
where R = (p1 , . . . , pn )r and ηRS is an N -independent diagonal matrix. For example,
consider the minimal bosonic truncation based on a single SU(N) valued singleton field W .
The bilinear and tri-linear composites, which have to be single-traces, do not mix. However,
the quartic composites do mix, and they do so as follows. The diagonal scalar states of
energy ∆ = 2d − 4 are given schematically by
$$ O_{(4)} = J_{(4)} + f\, J_{(2,2)}, \qquad O_{(2,2)} = J_{(2,2)} - \frac{2f}{1+f^2}\, J_{(4)}, \tag{4.5} $$
7 In the minimal bosonic truncation this dictionary has been extended to also include local currents
corresponding to the auxiliary HS gauge fields of the bulk theory [8]. This offers an opportunity to compute
bulk amplitudes in a first order formalism.
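where $f(N) = \frac{a}{N} + \frac{b}{N^3}$, with a and b being some constants, and
$$ J_{(4)} \sim \frac{1}{N^2}\, {:}\,{\rm tr}\, W^4\,{:}\,, \qquad J_{(2,2)} \sim\, {:}\,{\rm tr}\, W^2\, {\rm tr}\, W^2\,{:}\,, \tag{4.6} $$
are assumed to be normalized such that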
the fact that a successful definition of (4.9) in principle would give rise to a consistent
bulk theory including quantum gravity8 suggests that only the special supersymmetric
singletons corresponding to limits of string/M-theory will be viable in the above sense.
Thus we shall assume that ultimately (4.9) makes sense only for free SCFTs in $d \leq 6$ with
less than or equal to 16 supersymmetries.9 We address these issues further below when
we discuss the subleading 1/N corrections to the definition of the vacuum used in the
correlator on the right-hand side of (4.9).
The generating functional makes sense only as an asymptotic expansion in 1/N in
which a given order is a formal power series expansion in φ(p)r , which has a finite radius
of convergence by the combinatorial counting rules for double line diagrams of fixed
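topology. From the normalization (4.4) and assuming that $\langle O\rangle = 0$ it follows that as far
as the 1/N counting goes the effective action has the form
$$ \Gamma_{\rm eff}[\phi] = \phi^2 + \frac{1}{N}\, f_3\!\left(\frac{1}{N^2}\right)\phi^3 + \frac{1}{N^2}\, f_4\!\left(\frac{1}{N^2}\right)\phi^4 + \cdots, \qquad f_n\!\left(\frac{1}{N^2}\right) \sim 1 + O\!\left(\frac{1}{N^2}\right). \tag{4.10} $$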
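The singleton field theory determines Γeff [Φ(p)r ] up to non-linear field redefinitions of the
type $\phi \to \phi + \frac{1}{N}\phi^2 + \cdots$. After rescaling the fields as
$$ \phi = N\Phi, \tag{4.11} $$
we define the classical action as follows
$$ \Gamma_{\rm eff}[\Phi] = \Gamma_{\rm cl}[\Phi] + O\!\left(1/N^2\right), \tag{4.12} $$
$$ \Gamma_{\rm cl}[\Phi] = \frac{N^2}{R^{d-1}} \int d^{d+1}x\; L\big(\Phi, R\partial\Phi, (R\partial)^2\Phi, \ldots\big) + \text{boundary term}, \tag{4.13} $$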
where R is the AdS radius. We can now state the properties of the HS gauge theories as
follows. They possess:
8 As the basic mechanism behind holography is general covariance, this raises the question whether
holography exhibits any new features as general covariance is extended by HS symmetries. To analyze this,
we presumably need to refine our present, mainly algebraic, understanding of HS symmetries by formulating
these in a more geometric language, perhaps by extending the set of spacetime coordinates as to realize HS gauge
transformations as extended reparametrizations [11].
9 Massless HS fields admit background independent self-interactions in D = 4, and it is most likely that
this is the case for all D (though interactions in D > 7 bring in symplectic spacetime symmetries). However,
the theories of massless HS fields in higher dimensions are presumably not consistent truncations of quantum
consistent theories.
$$ \frac{1}{l_{\rm Pl}^{\,d-1}} = \frac{\mathcal{N}}{R^{d-1}}. \tag{4.14} $$
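As a consistency check on the N-counting, and only as a sketch at the level of the schematic expansion (4.10), one can substitute the rescaling (4.11) into the effective action. Assuming that the general term scales as $N^{2-n} f_n \phi^n$ (which is how the cubic and quartic terms are read here), every term acquires the same overall factor $N^2$, which is the prefactor of the classical action (4.13) and hence sets the Planck length in (4.14).

```latex
\begin{align*}
\Gamma_{\rm eff}[\phi]\Big|_{\phi = N\Phi}
 &= (N\Phi)^2 + \frac{1}{N}\, f_3\, (N\Phi)^3 + \frac{1}{N^2}\, f_4\, (N\Phi)^4 + \cdots \\
 &= N^2 \left( \Phi^2 + f_3\, \Phi^3 + f_4\, \Phi^4 + \cdots \right),
 \qquad f_n \sim 1 + O\!\left(1/N^2\right).
\end{align*}
```

The leading parts of the $f_n$ then define the classical Lagrangian, while their subleading parts supply the corrections in (4.12).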
Given these facts we would like to determine the effective action Γeff [φ(p)r ] from a set of
bulk interactions, without any direct reference to the boundary singleton. The basic issue is
whether the interactions can be derived from a string or membrane sigma model, that can
be coupled to the HS background fields. The mass-scale of the HS spectrum is set by the
AdS radius R, which is suggestive of a sigma-model with a fixed critical tension of order
1 in units where the AdS radius R = 1, as we shall discuss further in Sections 5, 6 and 7.
Due to the absence of a mass gap it is not possible to separate the massless fields, φ(2)r ,
from the massive AdS fields, φ(p)r , p > 2, by taking a low energy limit. In a local process in
AdS with energies of the order E ∼ n/R, n 1, the massive modes with E0 < n/R behave
essentially as the KK modes which arise in an AdS compactification of string/M-theory.
Thus the only reasonable possibility in which the massless modes can be separated from the
massive modes in a HS theory is by consistent truncation to the massless sector,11 which
is similar to what happens in the (maximally supersymmetric) sphere compactifications of
type IIB and eleven-dimensional supergravities. There are examples, however, of compact
manifolds, such as T 1,1 , where the higher-dimensional supergravity theory does not admit
a consistent truncation despite the fact that there does exist a lower-dimensional gauged
supergravity.12
Thus we propose that the HS gauge theories in D = 4, 5, 7 with gauge groups hs(8|4),
hs(2, 2|4) and hs(8∗ |4) admit consistent truncation down to the corresponding massless
theories, which we described in Section 2. This consistent truncation can be directly
tested by verifying that the massless bulk theory reproduces exactly the correlators of the
corresponding bilinear operators in the singleton theory. This is a non-trivial test since
nothing is known about a higher-dimensional covariant description of the HS theory so far.
Consistent truncation of the full HS gauge theory to its massless sector requires that
there are no terms in the effective bulk action of the form $\phi_{(p)}\phi_{(2)}\cdots\phi_{(2)}$ for $p \geq 3$. Let
us show this in the case of scalar bulk fields. Then the corresponding singleton correlators
are non-zero provided that $\Delta_{(p)} \leq n\Delta_{(2)}$ where $n \geq 2$ is the number of massless fields.
The case ∆(p) = n∆(2) is called an extremal correlator. The extremality condition implies
p = 2n and in that case it is straightforward to use free field contraction rules to show that
$$ \langle O_{(p)}(x)\, O_{(2)}(x_1) \cdots O_{(2)}(x_n)\rangle = \prod_{i=1}^{n} \big(\Delta(x - x_i)\big)^2, \tag{4.15} $$
10 In the case of SU(N) valued singletons $\mathcal{N} = N^2 - 1$, which means that the Planck constant in the bulk is
given by $\hbar = 1/N^2$. The 1/N corrections to the bulk theory are therefore weighted by positive integer powers of
Planck’s constant.
11 We thank L. Rastelli for helpful discussions on this point.
12 We thank C. Pope for pointing this out to us.
where $\Delta(x) = |x|^{-d+2}$ is the singleton propagator. Consider, on the other hand, the bulk
integral
$$ I = \int \frac{d^{d+1}z}{z_0^{\,d+1}}\; K_{\Delta_{(p)}}(z, x)\, K_{\Delta_{(2)}}(z, x_1) \cdots K_{\Delta_{(2)}}(z, x_n), \tag{4.16} $$
Thus, in the extremal case this integral diverges logarithmically, and the residue of the pole,
treating ∆ as a variable, has the same structure as the extremal correlation function. By
assumption, the antiholographic dual should, however, give rise to finite amplitudes. The
resolution is that a term which diverges logarithmically is scale-invariant, which means
that it can be represented equivalently by a boundary term which is finite. Thus extremal
correlators give rise to couplings that are boundary terms and therefore they do not upset
the consistent truncation.
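The scaling behind the extremality condition can be made explicit with a short check. It assumes, as free-field counting suggests but as is not spelled out above, that a scalar composite built from p singletons without derivatives has $\Delta_{(p)} = \tfrac{p}{2}(d-2)$, and it uses the singleton propagator $\Delta(x) = |x|^{-d+2}$.

```latex
\begin{align*}
\Delta_{(2)} &= d-2, \qquad \Delta_{(p)} = \tfrac{p}{2}\,(d-2)
  \quad\Longrightarrow\quad \Delta_{(p)} = n\,\Delta_{(2)} \iff p = 2n, \\
\prod_{i=1}^{n} \big(\Delta(x-x_i)\big)^{2}
  &= \prod_{i=1}^{n} |x-x_i|^{-2(d-2)} .
\end{align*}
```

Each point $x_i$ thus carries weight $d-2 = \Delta_{(2)}$ and the point x carries the total weight $n(d-2) = \Delta_{(p)}$, which is the scaling required of the extremal correlator (4.15).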
A similar argument applies to the near-extremal case, when d − 2 < ∆ < n∆(2) .
Here the integral I is finite, but the dependence on the x’s is not of the same form
as the singleton CFT correlator. There are exchange diagrams, though, with the correct
structure of the x-dependence [5]. Thus the near-extremal correlators must be represented
antiholographically in terms of exchange diagrams, and there cannot be any contact term
in the bulk action that can upset the consistent truncation we are examining.
The above evidence for consistent truncation is similar to the one given for ordinary type
IIB supergravity on $AdS_5 \times S^5$ [25,26] and eleven-dimensional supergravity on $AdS_{4/7} \times S^{7/4}$
[27]. The main difference is that whereas the arguments in SUGRA only hold for 1/2
BPS states, the arguments given here for HS theory hold for more general operators since
the holographic dual is by assumption a singleton.
To provide further evidence for consistent truncation, we examine the correlator of four
massless scalar operators Oi = O(2) (xi ), i = 1, . . . , 4. Using free field theory contraction
rules it can be written in manifestly crossing symmetric form as
$$ O_1 O_2 = \eta_{12} + C_{12}{}^{(2)r}\, O_{(2)r}(x_2) + C_{12}{}^{(4)r}\, O_{(4)r}(x_2) + C_{12}{}^{(2,2)r}\, O_{(2,2)r}(x_2), \tag{4.21} $$
where we recall that O(2)r denotes the set of all primary bilinear single-trace operators
labeled by an index r, and O(4)r and O(2,2)r are as given in (4.5). The resulting s-channel
expansion is given by
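$$ A^{(s,t)}_{1234} \equiv \langle {:}O_1 O_3{:}\, {:}O_2 O_4{:} \rangle_{\rm conn} = \tfrac12\, C_{12}{}^{(2)r}\, C_{34,(2)r} = \tfrac12\, C_{32}{}^{(2)r}\, C_{14,(2)r}, \tag{4.26} $$
$$ A^{(t,u)}_{1324} \equiv \langle {:}O_1 O_2{:}\, {:}O_3 O_4{:} \rangle_{\rm conn} = \tfrac12\, C_{13}{}^{(2)r}\, C_{24,(2)r} = \tfrac12\, C_{14}{}^{(2)r}\, C_{23,(2)r}, \tag{4.27} $$
$$ A^{(u,s)}_{1243} \equiv \langle {:}O_1 O_4{:}\, {:}O_2 O_3{:} \rangle_{\rm conn} = \tfrac12\, C_{12}{}^{(2)r}\, C_{43,(2)r} = \tfrac12\, C_{13}{}^{(2)r}\, C_{42,(2)r}. \tag{4.28} $$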
To show the second equality in (4.26) we first use (4.21) to expand the single contraction
connecting O1 to O2 in terms of $C_{12}{}^{(2)r} O_{(2)r}$ and similarly for 3 and 4. The remaining two
contractions that contribute to the connected part give rise to $\tfrac12 \eta_{rs}$, where the factor of $\tfrac12$
arises due to the normal ordering prescription which forbids contractions connecting 1 with
2 and 3 with 4, respectively. The third equality in (4.26) follows by instead using (4.21) to
expand the single contractions connecting 1 to 4 and 3 to 2. The relations (4.27) and (4.28)
are obtained analogously. Eqs. (4.26) and (4.28) imply that the finite contribution (4.24)
can be rewritten in terms of partial wave expansions involving only exchange of bilinear
operators in the crossed channels. We also see that the crossing symmetric form of the
singular contribution $C_{12}{}^{(2)r} C_{34(2)r}$ in (4.22) is given by $\tfrac12 C_{12}{}^{(2)r} C_{34(2)r} + (1 \leftrightarrow 2)$. Thus
the complete four-point correlator can be written in a manifestly crossing symmetric form
in the 1/N expansion, i.e., the extrema of the two actions are equal provided the massive
modes Φ(p)r , $p \geq 3$, are set to zero at the boundary of AdS. The consistent truncation can
now be phrased as the stronger condition
S[φ(2)r ; V] (4.33)
for massless fields where V represents a set of arbitrary parameters. As explained in
Section 6.2, and in more detail in [19], there exists an interaction ambiguity in the 4D HS gauge
theory which involves the introduction of an odd function $V(x) = \sum_{n=1}^{\infty} b_{2n+1} x^{2n+1}$.
Already the simplest choice $V(x) = b_1 x$ gives rise to a highly non-trivial model with a
structure of the type indicated in (4.31). The nth order term in V(x) results in higher order
derivative corrections starting at order 2n + 2 in the Lagrangian. Thus, in D = 4 the
consistent truncation (4.32) implies a specific choice $V(x) = V_\Gamma(x)$ such that
at the level of the composite operators built from the singleton superfield, which contains
the Abelian field strength but not any explicit gauge potential. In fact, this requires that we
introduce gauge couplings by hand, after which $g^2_{\rm YM}$ can be shifted to any finite value by
marginal deformations.
The above arguments suggest that we modify the definition of the generating functional
in the singleton theory by working with full singleton correlators given schematically by
$$ \langle O_1 \cdots O_n\rangle_{\rm full} = \sum_k \frac{1}{k!}\, \langle R^k\, O_1 \cdots O_n\rangle, \tag{4.36} $$
where the (super)conformally invariant singleton sewing operator R is defined as the sum
over a complete set of single-trace operators describing a virtual closed string process:
$$ R = \sum_p \int \frac{d^dx\, d^dy}{|x - y|^{2d}}\; \eta^{rs}(x - y)\, O_{(p)r}(x)\, O_{(p)s}(y). \tag{4.37} $$
Since each power of R adds an extra power of 1/N 2 , the above definition does not affect
the classical limit though it yields the desired non-trivial subleading 1/N corrections to the
correlators. The insertion of R formally corresponds to taking a trace, which in turn implies
that the correlation function becomes periodic along a cycle on the conformal plane. In
string theory, Rstr acts similarly, and has the geometric effect of adding a handle to the
two-dimensional worldsheet. This suggests that R insertions describe large fluctuations
of the D3 brane worldvolume in the singleton limit. As in the closed string theory, the
consistency of the sewing operation in the free singleton theory may lead to restrictions on
the spacetime superdimension.
In summary, we propose to use HS symmetries in diverse dimensions to determine
actions (or field equations) for massless HS multiplets up to certain well-defined interaction
ambiguities and then to compare the resulting Witten amplitudes with correlators of
bilinear operators in corresponding large N singleton theories. The next step in this
program is to explain the consistent singleton/HS correspondences as limits of string
and M-theories, which, in particular, requires the identification of possible schemes for
breaking HS symmetries.
We emphasize that the tests of CFT/AdS in the HS regime involve a free CFT on the
boundary, unlike the tests in the supergravity regime where the boundary CFT is strongly
coupled. This is possible due to the proposed consistent truncation and the fact that there
still remains the expansion parameter 1/N .
It is not clear exactly how the state of affairs will change once the HS symmetries are
broken. In Section 3 we have identified candidate Higgs multiplets in d = 3, 4. Presumably
this can be done also in d = 6 provided that we develop the proper mathematical language
for describing the interactions on the M5-brane. In general, we expect that the Higgsing
upsets the consistent truncation to the massless sector alone. Moreover, it is not obvious
if there exists a generalized consistent truncation scheme that retains the massless, Higgs
and other relevant massive fields. In any event, it will be interesting to see whether HS
field theoretic methods can be used to describe the Higgsing or one has to resort to some
more basic definition of the bulk interactions, based on some sigma model. We believe it
is too early to make any conclusive remarks on this, though it seems possible to describe
couplings between massless HS fields and Higgs fields, which should form HS multiplets
fitting into master fields of the type discussed in Section 2.
In Sections 5–7 we shall discuss these issues in more detail, and case by case for the
theories described in Section 2.
the classical D3-brane solution with harmonic function $H(r) = 1 + 4\pi N g_s l_s^4 r^{-4}$. The
functions f1,2 (λ) account for possible string corrections to the interpolating region, where
only 16 supersymmetries are preserved. The type IIB string/4D SYM correspondence is
an AdS/CFT correspondence whereby the 4D SYM theory is identified as the holographic
dual of the type IIB closed string theory. The closed string theory is based on a non-linear
sigma-model with coupling constant $l_s/R$. A (dimensionless) closed string amplitude $A^{(\rm str)}$
has the doubly asymptotic expansion
$$ A^{(\rm str)} = \sum_{g=0}^{\infty} g_s^{\,2g-2}\, A^{(\rm str)}_g(l_s/R), \tag{5.2} $$
where the amplitude $A^{(\rm str)}_g(l_s/R)$, which is obtained from worldsheet perturbation theory
on a Riemann surface of fixed genus g, is given by an asymptotic expansion in $l_s/R$. The
5D Planck length is given by
$$ \frac{1}{l_{\rm Pl}^{\,3}} = \frac{N^2}{R^3}. \tag{5.3} $$
Thus the perturbative string expansion in AdS5 × S 5 makes sense provided that
$$ N \gg 1, \qquad g_s \ll 1, \qquad l_s \ll R. \tag{5.4} $$
The ’t Hooft expansion of the corresponding correlation function A(SYM) in the SYM
theory reads
$$ A^{(\rm SYM)} = \sum_{g=0}^{\infty} N^{2-2g}\, A^{(\rm SYM)}_g(\lambda), \tag{5.5} $$
where the amplitude $A^{(\rm SYM)}_g(\lambda)$ is obtained from double-line Feynman graphs with
fixed topology and is given by an analytical expansion in λ. Hence the conjectured
correspondence $A^{(\rm str)} = A^{(\rm SYM)}$ can be examined order-by-order in the string loop expansion
and the SYM 1/N expansion, leading to a set of strong/weak coupling dualities between
$A^{(\rm str)}_g(l_s/R)$ and $A^{(\rm SYM)}_g(\lambda)$.
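The genus-by-genus matching can be made explicit under the standard identifications $g^2_{\rm YM} = 4\pi g_s$ and $\lambda = g^2_{\rm YM} N$. These are assumptions here, since this excerpt does not state its normalization; they are at least consistent with the harmonic function quoted above, which gives $R^4 = 4\pi g_s N l_s^4 = \lambda\, l_s^4$.

```latex
\begin{align*}
g_s = \frac{\lambda}{4\pi N}
\quad\Longrightarrow\quad
g_s^{\,2g-2} = \left(\frac{\lambda}{4\pi}\right)^{2g-2} N^{\,2-2g},
\qquad
\frac{l_s}{R} = \lambda^{-1/4}.
\end{align*}
```

With these conventions, comparing (5.2) and (5.5) at fixed genus gives $A^{(\rm SYM)}_g(\lambda) = (\lambda/4\pi)^{2g-2} A^{(\rm str)}_g(\lambda^{-1/4})$, so the regime of small $l_s/R$ on the string side corresponds to large λ on the SYM side.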
As discussed earlier, it has been proposed that the HS gauge theory emerges in the
description of the type IIB string theory on AdS5 × S 5 in the limit [9,21–23]
$$ g_s \to 0, \qquad l_s \to \infty; \qquad N \gg 1, \qquad R \ \text{fixed}. \tag{5.6} $$
In this limit the dual free SYM theory is described by an SU(N) valued d = 4, N = 4
SYM singleton. As discussed in the previous section, the bulk physics is conjectured to
be an HS gauge theory in 5D which admits a consistent truncation to an effective action
Γcl [Φ(2)r ] for massless fields. The HS gauge group hs(2, 2|4) and its massless gauge theory
have been described in [10]. We emphasize that there should be direct agreement between
the individual terms in the 1/N expansions of massless gauge theory amplitudes and the
correlators of bilinear currents in the free CFT as described in (4.34) (without having to
first obtain strong coupling results).
There still remains the task of constructing the full interacting HS gauge theory in
5D, though cubic interactions for massless spin s = 2, 4, 6, . . . fields have already been
constructed in [12]. These form a subset of the cubic interactions of the minimal bosonic
truncation Sbos of S[φ(2) ; V] provided that it is consistent to set the scalar field φ in Sbos
equal to zero at the cubic level. This requirement means that Sbos must not have any cubic
interactions that are linear in φ and quadratic in spin $s \geq 2$ fields. On the other hand, from
the known stress-energy tensor OPEs (see, for example, Eq. (4.58) in [7]), it follows that
the effective action Γeff [φ(2)] should give rise to a non-zero cubic graviton–graviton–scalar
amplitude. Thus the scalar can only be consistently truncated at the cubic level if this
amplitude is represented by a boundary term in Γeff [φ(2)], i.e., if the correlator in question
is extremal or near-extremal. Whether or not this is the case remains to be seen.
We next discuss breaking of the HS symmetry. The level $\ell = 0$ supergravity multiplet
of the massless spectrum of the hs(2, 2|4) theory contains a dilaton ϕ, which is an SU(4)
singlet with energy ∆ = 4 and AdS mass $m^2 = 0$. Since $m^2 = 0$ it is consistent to give ϕ
a VEV in the linearized theory, and we shall assume that this is possible also in the full
HS gauge theory. This corresponds to switching on a finite $g^2_{\rm YM}$ in the 4D SYM theory. As a
result the 4D supercovariant derivative $D^i_\alpha$ becomes also gauge covariant. This does not
upset the stress-energy conservation law (3.17), as it is first order in the superderivative,
while it breaks the Konishi multiplet conservation law (3.19), which is second order in
derivatives. Using the relation
$$ D^{ij} W^{kl} = -2 g_{\rm YM} \big[ W^{k(i}, W^{j)l} \big] \tag{5.7} $$
which follows from the superspace formulation of the N = 4 SYM system in 4D, one finds
that the anomalous conservation law for the Konishi current is given by (see, for example,
[61,65]):
$$ D^{ij} J = \frac{4 g_{\rm YM}}{N}\, {\rm tr}\, W^{k(i} W^{j)l} W_{kl} \equiv \sqrt{\lambda}\, \Sigma^{ij}. \tag{5.8} $$
The operator $\Sigma^{ij}$ belongs to the massive Higgs multiplet with $s_{\max} = 7/2$ discussed in
Section 3. Thus, for finite $g^2_{\rm YM}$ the anomalous conservation law (5.8) describes how $\Sigma^{ij}$ is
‘eaten’ by the massless Konishi operator J to form a massive operator which belongs to the
long massive Konishi multiplet with $s_{\max} = 4$ containing $2^{16}$ states. The coupling between
the corresponding bulk fields, which are described on the boundary by prepotentials V and
$V_{ij}$, and the massless Konishi operator J and its Higgs descendant $\Sigma^{ij}$ is described by
$$ S_{\rm boundary} = \int d^4x\, d^{16}\theta\, \big( J V + \Sigma^{ij} V_{ij} \big). \tag{5.9} $$
For finite $g^2_{\rm YM}$, the action $S_{\rm boundary}$ is invariant under modified gauge transformations
involving a Stückelberg shift transformation of the massive Higgs field,
$$ \delta V = D^{ij} \Lambda_{ij}, \qquad \delta V_{ij} = -\sqrt{\lambda}\, \Lambda_{ij}. \tag{5.10} $$
We thus expect that for finite $\langle\varphi\rangle = g_s$ the effective action Γeff [φ(p)r ] contains kinetic terms
of the schematic form $|d\phi_{(2)}|^2 + |d\phi_{(3)} + \sqrt{\lambda}\, \phi_{(2)}|^2$, describing a single massive gauge field
with non-critical mass [21]
$$ m^2 - m^2_{\rm crit} \sim \frac{\lambda}{R^2}, \tag{5.11} $$
where $(D^2 - m^2_{\rm crit})\phi = 0$ for an AdS massless field φ.
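A sketch of why the transformations (5.10) leave the boundary coupling (5.9) invariant may be useful. It assumes that the second-order superderivative $D^{ij}$ can be integrated by parts under the full superspace measure without a sign change or boundary terms, which is not verified here, and it uses the anomalous conservation law (5.8) in the last step.

```latex
\begin{align*}
\delta S_{\rm boundary}
 &= \int d^4x\, d^{16}\theta \left( J\, D^{ij}\Lambda_{ij} - \sqrt{\lambda}\, \Sigma^{ij} \Lambda_{ij} \right) \\
 &= \int d^4x\, d^{16}\theta \left( D^{ij} J - \sqrt{\lambda}\, \Sigma^{ij} \right) \Lambda_{ij} = 0 .
\end{align*}
```

At λ = 0 the shift of $V_{ij}$ switches off and the invariance reduces to the ordinary conservation of the Konishi current, so the Stückelberg shift is what compensates the anomaly at finite coupling.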
As discussed in Section 3, the massive spectrum also contains 1/2 BPS massive states
that have the interpretation of KK modes built on the massless HS multiplets. We shall
assume that the Higgs mechanism can be described at the level of KK towers as well, and
that the remaining massive HS multiplets can be organized into massive HS multiplets and
their KK towers. This picture is suggestive of a covariant theory in D = 10 with ‘critical’
length scale $l_{10}$ and coupling constant g = 1/N which admits $AdS_5 \times S^5$ with radius
$R = l_{10}$ as a vacuum. Since HS interactions in AdS spaces blow up in the flat limit for
finite g, we do not expect the 10D HS theory to admit 10D Minkowski space as a vacuum
for finite g > 0. For g = 0 we get a quadratic Lagrangian, however, which is second order
in derivatives, and as it contains no positive powers of R, it does admit a flat space limit.
Thus, the tensionless limit of the type IIB string theory in 10D flat spacetime is trivial.
Higgsing of the critical theory leads to a non-critical theory with $l_{10} < R$ which for
$l_{10} \ll R$ should be identified with type IIB string theory in $AdS_5 \times S^5$ with $l_s \sim l_{10}$.
For small $l_s/R$ the spectrum of string states with AdS energy (measured in units of
1/R) satisfying the condition $E \ll R^2/l_s^2$, and spin $s \ll R^2/l_s^2$, can be obtained by
KK reducing the 10D Minkowski space spectrum on $S^5$ by means of group theoretical
methods (at the classical level these states are described by ‘short’ strings with energy
$E = R\,l/l_s^2$ and length $l \ll R$). In particular, for fixed SO(4) × SO(6) highest weight, the
worldsheet Hamiltonian has a ground state which is the ‘lightest’ state carrying that highest
weight. The lightest states correspond to the leading Regge trajectory in 10D Minkowski
space and form supermultiplets in both 10D Minkowski space and in $AdS_5 \times S^5$ with
$s_{\max} = 2, 4, 6, \ldots$. In 10D Minkowski space these arise at closed string level $\ell = \tfrac12 s_{\max} - 1$
(see, for example, [69]), where all multiplets are massive except for level $\ell = 0$ where the
supergravity multiplet resides. For example, the lightest $s_{\max} = 4$ multiplet is the massive
Konishi multiplet which resides at level $\ell = 1$.
As $l_s/R$ varies from $l_s/R \ll 1$ to $l_s/R \gg 1$ the different Regge trajectories do not mix
[6] even though the five-form flux and other terms of order 1 in units of R will become
comparable to the mass-term. This follows from the fact that in an exact CFT that admits
a perturbative formulation, such as the worldsheet theory and the boundary SYM theory,
there cannot be mixing between two operators that do not mix in the free theory. Note that
such an admixture would require the introduction of a mass-parameter in the perturbative
formulation, which is not compatible with conformal invariance.
Indeed, there is an exact agreement between the supermultiplet structures of the leading
Regge trajectory for large string tension and the set of massless states of the critical
hs(2, 2|4) theory, such that the level $\ell$ multiplet on the leading Regge trajectory flows,
after reversed Higgsing, to the level $\ell$ multiplet of the massless spectrum given in Table 2.
We have already argued in Sections 4 and 5 that there should exist a consistent
truncation of the full hs(2, 2|4) theory down to its massless sector. There is no analogous
truncation of the non-critical string theory down to the leading Regge trajectory because the
lightest states of level $\ell \ge 1$ consist of massless states plus Higgs states. The Higgs states
belong to the massive sector of the hs(2, 2|4) theory and therefore break the consistent
truncation.
Since the HS symmetries are broken spontaneously it would be interesting to construct
a HS field theoretic description in AdS of the couplings between the massless fields and the
Higgs fields. Clearly the master field formalism described in Section 2 should be useful in
E. Sezgin, P. Sundell / Nuclear Physics B 644 (2002) 303–370 339
doing this, though one presumably needs to invoke some additional information, perhaps
from the structure of the factorization of the SYM correlation functions for $\lambda \ll 1$. Thus we
should try to find a HS action S(Φ(2)r , Hr ; V, M) for massless fields Φ(2)r and Higgs fields
Hr , where V are the parameters describing the gauge interactions, as will be discussed in
Section 6.2, and M are the parameters describing the coupling of the gauge multiplet
to the massive Higgs fields. We can then study the issue of whether the ‘weak/weak’
version of the AdS/CFT correspondence, which is valid for the massless sector at λ = 0,
can be generalized to include the leading Regge trajectory for λ > 0.
The above identification of the leading Regge trajectory states with long strings was also
made recently by the authors of [34], who conjecture that it is possible to follow the bilinear
HS currents with large spin s from weak to strong ’t Hooft coupling, where they correspond
to long strings of length l ∼ R which describe the portion of the leading Regge trajectory
with large spins $s \gg R^2/l_s^2 \gg 1$ and AdS energies $E = s + (R^2/l_s^2) \log(s\, l_s^2/R^2)$. As the
long strings grow infinite in size they become open strings of infinite energy which couple
to bi-local, light-like Wilson lines whose operator product expansion contains the bilinear
HS currents. The long strings have a relatively large ratio of energy to spin as compared
to the short strings which have energies $E = \sqrt{s}\, R/l_s$. Thus, for finite gs and small string
length, a string scattering process at high energies is described by incoming long strings
which fall into AdS spacetime where they fragment into short strings which interact and
then recombine into outgoing long strings. Moreover, the high-energy scattering process
with leading Regge trajectory states corresponds to a CFT correlator with bilinear currents
of large spin and relatively small anomalous contribution, (E − s)/s → 0 as s → ∞. This
suggests that the extreme high energy scattering can be described by the massless HS gauge
theory.
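As a rough numerical illustration of the statement that the anomalous part of the energy becomes relatively small at large spin, the sketch below evaluates the long-string relation E = s + (R^2/l_s^2) log(s l_s^2/R^2) quoted above, with order-one coefficients dropped. The function names and the sample value of R^2/l_s^2 are illustrative assumptions and are not taken from the text.

```python
import math

def long_string_energy(s, tension):
    """AdS energy of a long string, E = s + T*log(s/T), with T = R^2/l_s^2.

    Order-one coefficients are dropped; the formula is meant for s >> T >> 1.
    """
    return s + tension * math.log(s / tension)

def relative_anomalous_part(s, tension):
    """(E - s)/s, the anomalous contribution relative to the spin."""
    return (long_string_energy(s, tension) - s) / s

tension = 100.0  # assumed sample value of R^2/l_s^2
ratios = [relative_anomalous_part(tension * 10**k, tension) for k in (1, 2, 4, 8)]
for k, r in zip((1, 2, 4, 8), ratios):
    print(f"s = T * 10^{k}:  (E - s)/s = {r:.3e}")
```

The ratio falls off like log(s)/s, which is the sense in which the bilinear currents of large spin are asymptotically free of anomalous contributions.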
The bilinear currents do not mix with other operators in the free field theory, which
means that they cannot mix at any finite order in perturbation theory either. In the world
sheet sigma-model the counterpart to this statement is that the vertex operators describing
the insertion of long string states at small sigma-model coupling ls /R should ‘flow’ without
mixing to vertex operators at large sigma-model coupling [6]. Moreover, it is expected that
the long strings, which are quantum-mechanically unstable for finite gs , become stable as
! → 0 since ls /R increases (which removes the short string states from the spectrum) and
gs becomes small (which suppresses the decay process). This suggests that as λ → 0 there
remains a non-trivial worldsheet sigma-model describing stable long string states which
correspond to the singleton single-trace operators.
From the above discussions we are led to propose that there is a cross-over from large
to small λ in the expressions for the AdS string length and string coupling in terms of the
gauge theory quantities given in (5.1), such that
$$ f_1(\lambda) \sim 1/\lambda, \qquad f_2(\lambda) \sim 1 + O(\lambda) \qquad \text{for } \lambda \ll 1. \tag{5.12} $$
Then ls /R ∼ 1 and gs ∼ 1/N as λ → 0. This suggests that the hs(2, 2|4) higher spin
gauge theory is described by a string theory which has a left-moving and right-moving
PSU(2, 2|4) KM algebra with critical level k = kcrit ∼ 1 which admits a singleton
representation and an affine hs(2, 2|4) extension. To be more precise, the critical value for
the level should be such that there exists a maximally reducible Verma module based on the
singleton which contains a maximal number of null-states. In fact, it has been shown [70]
that the affine $SO(3,2) \simeq Sp(4)$ algebra admits singleton-like representations for k = 5/2.
It would be interesting to generalize this result to SO(D − 1, 2) and supersymmetric cases.
For critical level the closed string spectrum then contains physical massless HS states
formed by multiplying a left-moving and a right-moving singleton. The algebra hs(2, 2|4)
can be identified with the following coset
$$ hs(2,2|4) = \mathrm{Env}\big(PSU(2,2|4)\big)/\mathcal{R}, \tag{5.13} $$
where R is a certain ideal generated by elements in Env(PSU(2, 2|4)) which vanish
identically when the PSU(2, 2|4) generators are realized in terms of a single super-
oscillator as described in Section 2.1. For k = kcrit this construction should lift to the
affine case. The symmetry enhancement from AdS group to HS algebra for critical level,
i.e., critical radius in units of fixed string length, would be similar in spirit to the SU(2)
enhancement occurring at the self-dual radius for string theory on a circle.
The possibility to realize massless higher spins directly in the bulk as products of
left-moving and right-moving singleton representations at critical KM level is rather
appealing. Perhaps the close resemblance between the HS gauge theories in D = 4, 5, 7
is an indication that singletons play a similar role on critical membranes in D = 4, 7.
6.1. Holography
Already in [71] it was observed that the OSp(8|4) singleton may play a role in the
description of the supermembrane on AdS4 × S 7 . In [40] the quantization of the d = 3,
N = 8 singleton theory corresponding to a single membrane was shown to yield the infinite
set of massless HS fields contained in the symmetric tensor product of two singleton weight
spaces [38]. Moreover, it was conjectured in [40] that these massless states, as well as the
massive states contained in the higher order tensor products, arise in the supermembrane
theory.13 Subsequently, the group theoretical HS/singleton connection was utilized in [35]
and the fully interacting massless HS field equations in D = 4 were constructed in [14].
In the light of [1], the 4D HS/singleton connection found in [40] was revived as an actual
AdS/CFT correspondence in [16,17].
Importantly, the role of large N discussed in Section 4 was not emphasized in these
early formulations of the correspondence. Thus we need to refine the formulation of the
correspondence by identifying the appropriate dependence on N of the free OSp(8|4)
singleton.
Let us first recall the Maldacena conjecture [1] on the correspondence between
M-theory on AdS4 × S 7 with N units of 7-form flux on S 7 and the low energy dynamics
of N parallel M2-branes in flat eleven-dimensional spacetime, which is described by a
strongly coupled d = 3, N = 8 CFT with SO(8)R symmetry [1,4]. This theory defines a
13 To describe the S 7 compactified M-theory all higher tensor products are needed. The resulting theory lives
on the double cover of AdS4 times S 7 . It is consistent to truncate the theory to only even powers of the singleton.
This corresponds to M-theory on the single cover of AdS4 times $S^7/Z_2 \simeq RP^7$.
non-trivial IR fixed point of d = 3, N = 8 SYM theory with SU(N) gauge group. The
resulting SO(7)R -invariant flow has an antiholographic description as a D2-brane near-
horizon geometry, which is reliable in the UV where the dilaton is small. In the IR the
dilaton blows up and the IIA solution lifts to the SO(8)R invariant AdS4 × S 7 near horizon
region of a stack of N coinciding M2-branes. The resulting antiholographic description of
the strongly coupled SCFT is conjectured to be M-theory on AdS4 × S 7 . For large N the
membrane tension scales like
$$ T_{M2} = \frac{1}{l_{M2}^3} \sim \frac{\sqrt{N}}{R^3}, \tag{6.1} $$
where R is the AdS radius, and the 4D Planck length is given by
$$ \frac{1}{l_{Pl}^2} = \frac{N^{3/2}}{R^2}. \tag{6.2} $$
Hence, for large N,
$$ R \gg l_{M2} \gg l_{Pl}. \tag{6.3} $$
For AdS energies E obeying $1 \ll E \ll R/l_{M2}$ the low-energy dynamics of the
antiholographic dual is conjectured [1] to be described by D = 4, N = 8 gauged
supergravity. In particular, it follows from the normalization (6.2) that the strongly coupled
SCFT has $\sim N^{3/2}$ massless degrees of freedom for large N [72,73].
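The hierarchy of scales (6.3) follows directly from (6.1) and (6.2): in units of the AdS radius the membrane length scales like N^(-1/6) and the Planck length like N^(-3/4). The short sketch below makes this explicit for a few values of N. It treats the scaling relation in (6.1) as an equality (order-one factors dropped), and the function name is a hypothetical choice for illustration only.

```python
def ads4_length_scales(n):
    """Return (l_M2/R, l_Pl/R) for M-theory on AdS4 x S7 with N = n.

    Uses 1/l_M2^3 ~ sqrt(N)/R^3 and 1/l_Pl^2 = N^(3/2)/R^2, with the '~'
    treated as an equality (an assumption made for illustration).
    """
    l_m2 = n ** (-1.0 / 6.0)
    l_pl = n ** (-3.0 / 4.0)
    return l_m2, l_pl

for n in (10, 10**3, 10**6):
    l_m2, l_pl = ads4_length_scales(n)
    # The supergravity window 1 << E << R/l_M2 widens like N^(1/6).
    print(f"N = {n:>7}: l_M2/R = {l_m2:.4f}, l_Pl/R = {l_pl:.2e}, R/l_M2 = {1 / l_m2:.2f}")
```

The upper end of the supergravity window grows only like N^(1/6), so very large N is needed before the window 1 ≪ E ≪ R/l_M2 becomes wide.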
In the UV limit of the D2-brane geometry the dilaton eφ vanishes and the 10D
gravitational curvature diverges (which one might interpret as the appearance of the new
massless HS states that we shall define below). The D2-brane field theory becomes a
SU(N) invariant theory of free 3D super-Maxwell multiplets. Here we note that the
Yang–Mills coupling in the dual SYM theory on the stack of N coinciding D2-branes,
$g_{YM}^2 = g_s/l_s$, is held fixed in taking the near-horizon limit. This coupling also coincides
with the ‘local’ Yang–Mills coupling on a stack of probe D2-branes placed at energy
scale u in the near-horizon region, $g_{YM}^2(u) \equiv e^{\phi(u)} \sqrt{-g_{00}(u)/l_s^2} = g_{YM}^2$, as required for
interpreting the stack of probe branes as describing a Higgs branch of the dual SYM theory.
Thus both the dilaton and the running string length vanish in the UV limit, which is why we
can trust the free SU(N) field theory even though the gravitational curvature diverges.
Dualizing the vector fields and using gYM 2 to rescale the fields, we obtain a free SU(N)
Conversely, assuming that this Lagrangian describes a fixed point on the membrane we can
break SO(8)R → SO(7)R by taking the M-theory to have a finite radius R11 and take Φ 8
to be periodic:
Φ 8 ∼ Φ 8 + g, (6.5)
14 The singleton consists of 8 scalars in $8_v$ and 8 spinors in $8_s$ of $SO(8)_R$. By triality one can also obtain a
singleton multiplet in which the scalars are in $8_s$ and the spinors in $8_c$.
where the radius g is a constant with dimension 1/2 which we identify as $g = R_{11}/(l_{11})^{3/2}$
and $l_{11}$ is the eleven-dimensional Planck length. We recover the free OSp(8|4) invariant
singleton in the decompactification limit $R_{11} \to \infty$. We may instead use g to dualize Φ 8
and introduce Yang–Mills interactions with $g_{YM} = g$. The effective coupling is
$g_{eff}^2 = g^2/u$, where u is the 3D energy scale, and as a result the theory now decompactifies in
the IR [1,4,25,31,74]. Thus we have two decompactification limits, the free SU(N) valued
OSp(8|4) singleton field theory which resides in the UV and the strongly coupled SO(8)R
invariant d = 3, N = 8 SCFT in the IR.
Thus it is natural to describe the low energy dynamics of M2-branes in terms of an UV
fixed point of free SU(N) valued OSp(8|4) singletons and an IR fixed point of strongly
coupled OSp(8|4) singletons. We note that the number of massless degrees of freedom
indeed decreases along the RG flow, from $N^2$ to $N^{3/2}$.
We conjecture that the free singleton theory at the UV fixed point mentioned above
is the holographic dual of the hs(4|8) gauge theory which admits the massless hs(4|8)
gauge theory described in Section 2.2 as a consistent truncation. This theory describes an
unbroken phase of M-theory with N units of M2-brane charge. The strongly coupled fixed
point is the holographic image of a broken phase, which admits an effective supergravity
description at low energies.
There are also IR fixed points containing free OSp(8|4) singletons forming (N − 1)-
dimensional representation of the Weyl group of SU(N) [31]. These are curious points
from the point of view of HS dynamics, and it may be that one should also include them as
non-trivial points in the phase diagram.
As discussed in the previous section, the unbroken phase of the type IIB theory on
AdS5 × S 5 arises either as the critical limit λ → 0 at fixed E and s, or as the high energy
limit $s \gg \sqrt{\lambda}$ at fixed λ and $N \gg 1$. Moreover, as we shall see in the next section,
the unbroken phase of M-theory on AdS7 × S 4 arises at high energies whereby certain
membrane solitons propagate close to the boundary of AdS7 . This suggests that also the
unbroken phase of M-theory on AdS4 × S 7 arises in a high energy limit in which bulk
membranes couple to HS operators in the strongly coupled SCFT 3 with asymptotically
small anomalous dimensions, (E − s)/s → 0, as s → ∞. The four-form flux in the AdS4
directions ensures that the M2-brane equations admit spherical membrane solutions in
AdS4 × S 7 [75–77]. These solutions carry internal SO(8) spin, and are hence closely related
to the matrix-model found in the pp-wave limit [30]. It is natural to expect that these
solutions can be deformed into time-dependent membrane solutions carrying also AdS
spin, in analogy with the string solutions in AdS3 with NS-fluxes [78]. We also expect the
anomalous part of the energy to be minimized and certain fractional supersymmetry to be
restored by taking large AdS radius, i.e., large bulk energies, such that the solution couples
to the conserved HS currents of the hs(8|4) theory. The fact that the holographic dual
resides at a UV fixed point should be encoded into the local geometry of the solution and
to how it minimizes the AdS energy, as in the case of the rotating membrane in AdS7 × S 4 .
It will be interesting to examine the above picture in more detail and in particular to
examine the fluctuation spectrum about this solution, where we expect to find some critical
membrane theory with fixed tension, and perhaps singletons in the worldvolume, giving
rise to the massless HS states.
We expect that the Higgsing of the massless HS fields and the resulting spontaneous
breaking of the hs(4|8) is described by a radially dependent solution to the HS theory which
is the antiholographic dual of the 3D SYM flow obtained by switching on a finite $g_{YM}^2$ as
discussed above. It will be interesting to see whether HS field theoretic methods are still
relevant for describing this solution, which would then yield a ‘weak/weak’ correspondence
between the HS theory coupled to the Higgs sector and the SYM theory with expansions in
both 1/N and $g_{YM}^2$. It may also be necessary to exhibit in more detail the nature of the
In this section we shall outline the structure of the minimal bosonic HS gauge theory
in D = 4 which is a consistent truncation of the supersymmetric HS theory discussed
in Section 2.1. The spectrum consists of massless fields with spin s = 0, 2, 4, . . . , each
occurring once. The underlying algebra, called hs(4), is an infinite-dimensional extension
of the bosonic AdS4 group. A similar truncation also exists in D = 5, 7 at the spectrum level
but only in D = 4 is a full interacting theory known, both supersymmetric and minimal
bosonic.
The 4D minimal bosonic model is of great interest because it is the simplest interacting
HS gauge theory (with propagating HS degrees of freedom), and yet it exhibits all the
essential principles that underlie such theories. It is a very good starting point for finding
ways to construct the D = 5, 7 HS gauge theories as well. Moreover, it is amenable to
calculations and it is possible to test directly in this model the consistent truncation of the
kind discussed in Section 4 which is required for the holography picture to make sense.
Here, we will not go as far as carrying out these tests [24] but we will nonetheless exhibit
the structure of the couplings to give the reader an idea of what they actually look like, as
well as to provide enough ingredients to facilitate the required holography computations.
Here we shall focus our attention on the quadratic terms in all the field equations, which,
of course, means all the cubic couplings at the action level. In an accompanying paper
[19], we shall give a more detailed treatment involving an expansion scheme where the
gravitational gauge fields are treated exactly and the gravitational curvatures and the HS
gauge fields as weak perturbations to all orders. The 4D HS/3d singleton correspondence
in the hs(4) theory at the level of quadratic field equation/cubic action will be provided
elsewhere [24].
The massless field equations (including general interaction ambiguities) have been
given in [14] and studied in more detail in [16–18] and more recently in [19]. These studies
are based on a curvature expansion scheme. The most important step in the expansion
scheme is the linearized analysis which shows that all auxiliary fields are non-propagating.
As a result it is possible to solve iteratively for the auxiliary fields and obtain the physical
field equations to any order. In fact, this scheme yields field equations in terms of only the
physical fields.
The HS spin algebra hs(4) is obtained from hs(4|8) defined in Section 2.1 by setting the
fermionic generators θ i equal to zero. To describe the field equations in 4D spacetime,
which has coordinates x µ , one introduces an auxiliary set of coordinates (zα , z̄α̇ )
which are Grassmann even spinors that are non-commutative in nature, and consider
extensions ϕ(x; z, z̄) of the basic spacetime fields ϕ(x). One then imposes an integrable
curvature constraint in the extended space, whose (x; z, z̄)-components determine the (z, z̄)
dependence of the extended fields ϕ(x; z, z̄) in terms of “initial” conditions φ(x). Setting
z = z̄ = 0 in the remaining x-components of the curvature constraint leads to reduced
curvature constraints in spacetime, which are integrable by construction and one can show
that they contain the physical field equations of the HS gauge theory. Since (z, z̄) are
non-commutative, the reduced constraints contain interactions even though the original
constraint in (x; z, z̄) space has a simple form.
The basic building blocks of the theory are a master 0-form Φ and a master 1-form
The curvature constraints giving rise to the spacetime field equations read
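$$ \hat F = d\hat A + \hat A \star \hat A = \frac{i}{4}\, dz^\alpha \wedge dz_\alpha\, \hat\Phi \star \kappa + \frac{i}{4}\, d\bar z^{\dot\alpha} \wedge d\bar z_{\dot\alpha}\, \hat\Phi \star \bar\kappa, \tag{6.10} $$
$$ \hat D \hat\Phi = d\hat\Phi + \hat A \star \hat\Phi - \hat\Phi \star \bar\pi(\hat A) = 0, \tag{6.11} $$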
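where the operators κ, κ̄ are defined as
$$ \kappa = \exp\left(i y^\alpha z_\alpha\right), \qquad \bar\kappa = \kappa^\dagger = \exp\left(-i \bar y^{\dot\alpha} \bar z_{\dot\alpha}\right), \tag{6.12} $$
the π-map, and its complex conjugate π̄, acting on an arbitrary polynomial f(y, ȳ; z, z̄)
are defined as
$$ \pi\big(f(y,\bar y; z,\bar z)\big) = f(-y,\bar y; -z,\bar z), \qquad \bar\pi\big(f(y,\bar y; z,\bar z)\big) = f(y,-\bar y; z,-\bar z), \tag{6.13} $$
and the ⋆-product between two arbitrary polynomials f(y, ȳ; z, z̄) and g(y, ȳ; z, z̄) is
defined as
$$ f \star g = f \exp\left[ i\left( \frac{\overleftarrow{\partial}}{\partial z_\alpha} + \frac{\overleftarrow{\partial}}{\partial y_\alpha} \right)\left( \frac{\overrightarrow{\partial}}{\partial z^\alpha} - \frac{\overrightarrow{\partial}}{\partial y^\alpha} \right) + i\left( \frac{\overleftarrow{\partial}}{\partial \bar z_{\dot\alpha}} - \frac{\overleftarrow{\partial}}{\partial \bar y_{\dot\alpha}} \right)\left( \frac{\overrightarrow{\partial}}{\partial \bar z^{\dot\alpha}} + \frac{\overrightarrow{\partial}}{\partial \bar y^{\dot\alpha}} \right) \right] g. \tag{6.14} $$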
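The action of the π and π̄ maps in (6.13) can be illustrated with a small sketch. Here a polynomial is stored as a dictionary from monomial degrees in (y, ȳ, z, z̄) to coefficients. Spinor indices are suppressed and only total degrees are tracked, which is a simplifying assumption of this example, and the names `pi_map` and `pi_bar_map` are hypothetical. The maps then act by a sign fixed by the parity of the degree in the reflected variables.

```python
from typing import Dict, Tuple

# Degrees of a monomial in (y, ybar, z, zbar); spinor indices are suppressed.
Monomial = Tuple[int, int, int, int]
Poly = Dict[Monomial, float]

def pi_map(f: Poly) -> Poly:
    """pi: f(y, ybar; z, zbar) -> f(-y, ybar; -z, zbar)."""
    return {m: c * (-1) ** (m[0] + m[2]) for m, c in f.items()}

def pi_bar_map(f: Poly) -> Poly:
    """pi-bar: f(y, ybar; z, zbar) -> f(y, -ybar; z, -zbar)."""
    return {m: c * (-1) ** (m[1] + m[3]) for m, c in f.items()}

# Sample polynomial: 2*y + y*ybar + 3*z^2*zbar (schematically).
f: Poly = {(1, 0, 0, 0): 2.0, (1, 1, 0, 0): 1.0, (0, 0, 2, 1): 3.0}

print(pi_map(f))      # terms odd in (y, z) change sign
print(pi_bar_map(f))  # terms odd in (ybar, zbar) change sign
```

Both maps are involutions and commute with each other, as is evident from their definition as reflections of independent variables.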
δ Â = d ˆ + [Â, ˆ ] , = ˆ φ̂ − Φ
δΦ .
ˆ (6.15)
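Given the initial conditions (6.7), the components of the constraints (6.10), (6.11) which
have at least one α or α̇ index can be solved by expanding $\hat A$ and $\hat\Phi$ in powers of Φ, which
contains curvatures and the scalar field, as follows:
$$ \hat\Phi = \sum_{n=1}^{\infty} \hat\Phi^{(n)}, \qquad \hat A_\alpha = \sum_{n=1}^{\infty} \hat A_\alpha^{(n)}, \qquad \hat A_\mu = \sum_{n=0}^{\infty} \hat A_\mu^{(n)}, \tag{6.16} $$
where
$$ \hat\Phi^{(n)}\big|_{Z=0} = \begin{cases} \Phi, & n = 1, \\ 0, & n = 2, 3, \ldots, \end{cases} \tag{6.17} $$
$$ \hat A_\mu^{(n)}\big|_{Z=0} = \begin{cases} A_\mu, & n = 0, \\ 0, & n = 1, 2, 3, \ldots, \end{cases} \tag{6.18} $$
$$ \hat A_\alpha\big|_{Z=0} = 0, \tag{6.19} $$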
Φ|Y =0 = φ. (6.23)
As for the vierbein and HS gauge fields, requiring that they transform homogeneously
under Lorentz transformation, one is led to the following expansion scheme for the master
gauge field
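$$ A_\mu = e_\mu + \omega_\mu + W_\mu + \left( i\,\omega_\mu{}^{\alpha\beta}\, \hat A_\alpha \star \hat A_\beta - \text{h.c.} \right)\Big|_{Z=0}, \tag{6.24} $$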
= 2∇[ν Wρ]α2 ···αs−1 β̇ γ̇ α̇2 ···α̇s−1 − (s − 2)(σνρ σµ )α2 δ Wµα3 ···αs−1 β̇ γ̇ δ̇ α̇2 ···α̇s−1
− s(σµ σνρ )β̇ γ Wµγ α2 α3 ···αs−1 γ̇ α̇2 ···α̇s−1 . (6.29)
The covariant derivatives in (6.26) and (6.29) are with respect to the Lorentz
connection ω. Furthermore, in (6.28) and (6.29), separate symmetrization in the dotted
and undotted indices is understood. Further definitions are
Pµ(2) = Φ π̄ (Wµ ) − Wµ Φ
(2) π̄(eµ ) − eµ Φ
+ Φ π̄ êµ(1) − êµ(1) Φ + Φ (2) , (6.30)
Z=0
(2)
Jµν =− ν (1) + Wµ , êν(1)
êµ(1) êν(1) + eµ , êν(2) + eµ , W Z=0
(1)
+ iRµν αβ Â(1) α Âβ + h.c. Z=0 + Wµ Wν + (µ ↔ ν) (6.31)
and the hatted quantities occurring in the above equations are given by [19]
$$ \hat A_\alpha^{(1)} = -\frac{i}{2}\, z_\alpha \int_0^1 t\,dt\; \Phi(-tz, \bar y)\, \kappa(tz, y), \tag{6.32} $$
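$$ \hat A_\alpha^{(2)} = z_\alpha \int_0^1 t\,dt\, \left( \hat A^{(1)\beta} \star \hat A^{(1)}_\beta \right)_{z\to tz,\,\bar z\to t\bar z} + \bar z^{\dot\beta} \int_0^1 t\,dt\, \left[ \hat A^{(1)}_\alpha, \hat A^{(1)}_{\dot\beta} \right]_{\star}\Big|_{z\to tz,\,\bar z\to t\bar z}, \tag{6.33} $$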
1
dt ∂Wµ α(1) ∂Wµ
µ(1)
W = −i , Â + Âα̇(1), ,
t ∂y α ∗ ∂ ȳ α̇ ∗ z→t z,z̄→t z̄
0
1
Φ(2) =z α
dt Φ π̄ Â(1)
α − Â(1)
α Φ t →t z,z̄→t z̄
0
1
(1) (1)
+ z̄α̇ dt Φ π Âα̇ − Âα̇ Φ t →t z,z̄→t z̄ ,
0
1
dt (1)
êµ(1) = −ieµα α̇ α ∗ + Âα̇ , yα ∗ z→t z,z̄→t z̄ ,
ȳα̇ , Â(1)
t
0
1
dt (2)
êµ(2) = −ieµα α̇ α ∗ + Âα̇ , yα ∗ z→t z,z̄→t z̄ ,
ȳα̇ , Â(2)
t
0
1 1
dt dt
− eµα α̇
t t
0 0
∂ ∂ (1)
× Â β(1)
− ȳ α̇ , Â (1)
α ∗
+ Â α̇ , y α ∗ z→t z,z̄→t z̄
∂zβ ∂y β
∂ ∂ (1)
+ Âβ̇(1) + ȳα̇ , Â(1)
α ∗
+ Â α̇ , y α ∗ z→t z,z̄→t z̄
∂ z̄β̇ ∂ ȳ β̇
∂ ∂ (1)
+ β
+ β α ∗ + Âα̇ , yα ∗
ȳα̇ , Â(1) Âβ(1)
∂z ∂y z→t z,z̄→t z̄
∂ ∂
+ −
∂ z̄β̇ ∂ ȳ β̇
(1)
× ȳα̇ , Â(1)
α ∗ + Â α̇ , y α ∗
 β̇(1)
. (6.34)
z→t z,z̄→t z̄ z→t z,z̄→t z̄
In the above formulae, the replacement of (z, z̄) by (tz, t z̄) is to be made inside the integrals
and after performing the ⋆-products. Note also that the quantity $\hat A^{(1)}_\alpha$ is a basic building block
which occurs in many of the formulae above and that it is first order in Φ.
It is important to note that not all the fields occurring in (6.8) and (6.9) are independent.
An analysis of the constraints (6.10) and (6.11) shows that (a) $\Phi_{\alpha_1\cdots\alpha_{2s}}$ (s = 2, 4, . . .) are the
Weyl tensors, which can be expressed in terms of the curvatures, (b) $\Phi_{\alpha(m)\dot\alpha(n)}$ for m + n > 2 can
be solved in terms of φ, the Weyl tensors and their derivatives, (c) $\omega_\mu{}^{\alpha\beta}$ is, of course,
the Lorentz spin connection, which can be solved in terms of the vierbein $e_{\mu\alpha\dot\alpha}$, and (d)
$W_{\mu\,\alpha(m)\dot\alpha(n)}$ for $|m - n| \ge 2$ are auxiliary gauge fields which can be solved in terms of the
physical fields $W_{\mu\,\alpha(s-1)\dot\alpha(s-1)}$ [19].
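To make the split in item (d) concrete, the following sketch enumerates the components W_{μ α(m) α̇(n)} in a given spin-s sector and separates the physical component m = n = s − 1 from the auxiliary ones with |m − n| ≥ 2. It assumes that all components in the spin-s sector carry m + n = 2s − 2 spinor indices in total, which is the index structure used for these gauge fields; the function name is a hypothetical choice.

```python
def gauge_field_components(s):
    """Split the spin-s gauge field components (m, n), m + n = 2s - 2,
    into physical (m == n) and auxiliary (|m - n| >= 2) ones."""
    total = 2 * s - 2
    physical, auxiliary = [], []
    for m in range(total + 1):
        n = total - m
        if abs(m - n) >= 2:
            auxiliary.append((m, n))
        else:
            physical.append((m, n))
    return physical, auxiliary

for s in (2, 4, 6):
    physical, auxiliary = gauge_field_components(s)
    print(f"s = {s}: physical {physical}, {len(auxiliary)} auxiliary components")
```

Since m + n is even, |m − n| is even as well, so each sector contains exactly one physical component and 2s − 2 auxiliary ones.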
ωµ ab = ωµ ab (e) + κµ ab , (6.40)
where κµ ab is the con-torsion tensor related to the torsion tensor Tµν a as
κµ ab = Tµ ab − Tµ ba + T ab µ , (6.41)
15 We have set the AdS radius R = 1 but it is straightforward to re-introduce R by dimensional analysis in
which the master 0-form and the master 1-form fields are dimensionless.
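where
$$ T_{\mu\nu}{}^{a} = \sigma^{a\,\alpha\dot\beta}\, \frac{\partial}{\partial y^\alpha} \frac{\partial}{\partial \bar y^{\dot\beta}}\, J^{(2)}_{\mu\nu}\Big|_{Y=0}. \tag{6.42} $$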
(3) The elimination of the auxiliary fields by means of Eqs. (6.35) and (6.36) gives rise
to higher derivative interactions. In particular, in a given spin sector, the auxiliary fields
are Wµα1 ···αk α̇k+1 ···α̇2s−2 with k = 0, 1, . . . , s/2 and they are related to the physical fields
Wµα(s−1)α̇(s−1) schematically as
Φα(m)α̇(m) ∼ ∂ m φ,
Φα(m)α̇(n) ∼ ∂ (m−n)/2 Φα(m−n) , m − n = 0 mod 4. (6.44)
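(4) Whether the master constraints (6.10) and (6.11) are unique is an important question.
In fact, there exists a generalization of (6.10) in which [19] we let
$$ \hat\Phi \star \kappa \to \mathcal{V}(\hat\Phi \star \kappa), \qquad \hat\Phi \star \bar\kappa \to \bar{\mathcal{V}}(\hat\Phi \star \bar\kappa), \tag{6.45} $$
where $\mathcal{V}(X)$ is a ⋆-function, with its complex conjugate $\bar{\mathcal{V}}(X^\dagger) = (\mathcal{V}(X))^\dagger$. In [19] we
argue that this function must be of the form
$$ \mathcal{V}(X) = \sum_{n=0}^{\infty} b_{2n+1} X^{2n+1}, \qquad |b_1| = 1. \tag{6.46} $$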
perturbations with this effect [82].16 This is believed to reflect the fact that open membranes
ending on coinciding M5-branes give rise to tensionless closed strings and that the proper
language for formulating the dynamics on the fivebrane is therefore not ordinary field
theory but rather some nonlocal extension of it.
However, if we are willing to give up 6D covariance, then we can use lower-dimensional
RG flows based on ordinary interacting field theories to define the AN−1 (2, 0) theory [1,
4,25,31,74]. In particular, circle reductions of the 6D theory describe RG flows of 4D
and 5D SYM theories with SU(N) gauge group. The SO(4)R invariant RG5 flow has a
type IIA supergravity dual description in terms of the near horizon region of a D4-brane
solution. In the UV limit the dilaton diverges and the solution uplifts to the AdS7 × S 4
near horizon region of the stack of M5-branes. The resulting antiholographic description
of the AN−1 (2, 0) theory is conjectured to be M-theory on AdS7 × S 4 [1]. For large N the
membrane tension scales like
$$ T_{M2} \sim \frac{N}{R^3}, \tag{7.1} $$
where R is the AdS radius, and the 7D Planck length is given by
$$ \frac{1}{l_{Pl}^5} = \frac{N^3}{R^5}. \tag{7.2} $$
For large N the Planck length is much smaller than the M2 length scale $l_{M2}$, which in
turn is much smaller than the radius. Thus, for energies E obeying $1 \ll E \ll R/l_{M2}$ the
low-energy dynamics of the antiholographic dual is described by D = 7, N = 2 gauged
supergravity. The AN−1 (2, 0) theory has been conjectured to admit an expansion in terms
of integer powers of 1/N which factorize for large N [4].17 From (7.2) it follows that the
AN−1 (2, 0) theory has $\sim N^3$ massless degrees of freedom for large N which contain the
N − 1 massless (2, 0) tensor multiplets of the ‘Higgs branch’ of the theory.
In the IR limit of the D4-brane geometry the dilaton $e^\phi$ vanishes and the gravitational
curvature diverges. As for the D2-brane discussed in Section 6.1, the dual SYM coupling
$g_{YM}^2 = g_s l_s$ is held fixed in taking the near horizon limit and equals the local Yang–Mills
coupling $g_{YM}^2(u) \equiv e^{\phi(u)} / \sqrt{-g_{00}(u)/l_s^2} = g_{YM}^2$. Hence the local string length diverges in
the IR (unlike the case of the D2-brane where the local string length disappears together
with the dilaton in the UV). Hence, naively the D4-brane field theory becomes a free
SU(N) valued d = 5, N = 2 Maxwell theory with SO(5)R symmetry and finite Yang–Mills
coupling $g_{YM}^2$. This theory can be made scale invariant by absorbing $g_{YM}^2$ into the fields,
but this symmetry is superficial since it cannot be lifted to superconformal invariance.
Instead a more natural interpretation is that superconformal invariance is restored by
uplifting to a free SU(N) valued d = 6, N = (2, 0) tensor singleton described by the
16 A single tensor multiplet admits self-interactions, such as, for example, those describing the motion of a
single M5-brane [83–85].
17 From (7.1) it follows that M-theory on AdS7 × S 4 has an expansion in terms of integer powers of $1/T_{M2}$
rather than integer powers of the 7D Planck’s constant. The same remark applies to M-theory on AdS4 × S 7 , which
has M2 tension given by (6.1) and has been conjectured to have an expansion in terms of integer powers of $1/\sqrt{N}$
[4].
superconformal action18
$$ S_6 = \int d^6x\; \mathrm{tr}\left( |d\Phi^a|^2 + |dB|^2 + \text{fermions} \right), \tag{7.3} $$
where we have set the fermions equal to zero, γαβ = ∂α XM ∂β XN gMN and C3 is the pull-
back of the M-theory three-form potential which has non-zero components only in S 4 . The
18 Tensor self-duality and supersymmetry can be restored at the level of the field equations [85].
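$$ E = 4N \int_0^{\rho_0} d\rho \int_{\theta_1}^{\theta_2} d\theta\; \frac{\cosh^2\rho\, \sinh\rho}{\sqrt{\cosh^2\rho - \omega^2 \sinh^2\rho\, \sin^2\theta}}, \tag{7.10} $$
$$ s = 4N\omega \int_0^{\rho_0} d\rho \int_{\theta_1}^{\theta_2} d\theta\; \frac{\sinh^3\rho\, \sin^2\theta}{\sqrt{\cosh^2\rho - \omega^2 \sinh^2\rho\, \sin^2\theta}}, \tag{7.11} $$
where
$$ \coth\rho_0 = \omega. \tag{7.13} $$
If we assume that $\ell$ is small then
$$ E = \frac{\ell N}{\omega^2}\; {}_2F_1\left(2, 1; 3/2; 1/\omega^2\right), \tag{7.14} $$
$$ s = \frac{2\ell N}{3\omega^3}\; {}_2F_1\left(2, 2; 5/2; 1/\omega^2\right). \tag{7.15} $$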
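The hypergeometric forms in (7.14) and (7.15) can be cross-checked numerically against the radial integrals in (7.10) and (7.11). The sketch below does this under stated assumptions: the thin-membrane limit is modelled by setting sin θ = 1, the denominators are taken to be the square root of cosh²ρ − ω² sinh²ρ, and the overall factor of 4N times the angular width is dropped, so only the ω dependence is compared. The substitution x = ω tanh ρ followed by x² = 1 − u² removes the integrable endpoint singularity at ρ0. All function names are illustrative.

```python
def hyp2f1(a, b, c, z, terms=400):
    """Gauss hypergeometric series 2F1(a, b; c; z), valid for |z| < 1."""
    total, term = 0.0, 1.0
    for k in range(terms):
        total += term
        term *= (a + k) * (b + k) / ((c + k) * (k + 1)) * z
    return total

def simpson(func, steps=4000):
    """Composite Simpson rule on [0, 1]; steps must be even."""
    h = 1.0 / steps
    acc = func(0.0) + func(1.0)
    for i in range(1, steps):
        acc += (4.0 if i % 2 else 2.0) * func(i * h)
    return acc * h / 3.0

def radial_integrals(omega):
    """Radial integrals of (7.10) and (7.11) at sin(theta) = 1, after the
    substitutions x = omega*tanh(rho) and x^2 = 1 - u^2."""
    z = 1.0 / omega**2
    weight = lambda u: (1.0 - z * (1.0 - u * u)) ** -2
    energy = z * simpson(weight)
    spin = (z / omega) * simpson(lambda u: (1.0 - u * u) * weight(u))
    return energy, spin

def closed_forms(omega):
    """The omega dependence of (7.14) and (7.15), per unit (l N)."""
    z = 1.0 / omega**2
    return z * hyp2f1(2, 1, 1.5, z), 2.0 * hyp2f1(2, 2, 2.5, z) / (3.0 * omega**3)

for omega in (1.5, 2.0, 5.0):
    print(omega, radial_integrals(omega), closed_forms(omega))
```

The two columns agree to numerical precision for ω > 1, which supports reading the denominators in (7.10) and (7.11) as square roots.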
For $\omega \gg 1$ this describes short membranes with length $\rho_0 \sim 1/\omega$ and energy and spin
given by
$$ E^3 = 8\ell N s^2, \qquad E, s \ll \ell N. \tag{7.16} $$
In flat eleven-dimensional spacetime an analogous relation holds between mass and spin
for all values of the spin (in flat space this relation follows from dimensional analysis).
Thus, in flat spacetime the mass is minimized for given spin by sending $\ell \to 0$ and ω → 0
(keeping s fixed). The flat space spectrum therefore contains massless states of arbitrary spin,
which can be thought of as infinitely long, thin string-like membranes which are virtually
at rest.
In fact, long ago a bosonic open membrane (a disk) rotating simultaneously about two
axes was considered in [32], where a relation like (7.16) was derived. Such
solutions are possible for $D \ge 5$. Later, this solution was generalized in [33] for the D = 11
supermembrane [86], by gluing two copies of the open membrane of [32] along their edges
to obtain a ‘pancake’ membrane. The zero-point energy of this membrane was studied by
these authors and later in [87]. It was conjectured in [33] that the (semi-classical) energy-
angular momentum relation of the kind (7.16) would be modified by an integral or half
integral number due to the fact that the fermionic coordinates of the supermembrane also
carry intrinsic angular momentum. See [88] for a review of this fascinating subject.
Going back to AdS7 × S 4 , for slow rotation, ω ∼ 1, ω > 1 and finite width $\ell$, the solution
(7.8) describes long membranes whose energy and spin now obey
$$ E - s = \frac{3\pi^{2/3}}{2^{1/3}}\, (\ell N)^{2/3}\, s^{1/3}, \qquad E, s \gg \ell N. \tag{7.17} $$
For ω → 1 the energy and spin diverge and the rotating membrane develops a boundary
given by a folded closed string of length $\ell$ which traces out a Wilson surface in the stack of
five-branes. Thus, the long membranes of width $\ell$ with finite energy describe operators in
the AN−1 (2, 0) theory which arise in the operator product expansion of the Wilson surface.
The shape of the Wilson surface together with (2.21) suggest that its expansion contains
bilinear higher spin operators which have asymptotically small anomalous dimensions,
$(E - s)/s \ll 1$ for high spin, $s \gg \ell N \gg 1$. In the limit s → ∞ their interactions should be
equivalent to those described by the singletons.
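The smallness of the anomalous dimensions at high spin can be read off from the scaling in (7.17): since E − s grows only like s^(1/3), the ratio (E − s)/s falls like (ℓN/s)^(2/3). The sketch below tabulates this ratio. The numerical coefficient is taken from a literal reading of (7.17) and should be regarded as an assumption; the decay with s does not depend on it. Function and variable names are illustrative.

```python
import math

# Coefficient as read from (7.17); an assumption, the scaling is what matters here.
C_LONG = 3.0 * math.pi ** (2.0 / 3.0) / 2.0 ** (1.0 / 3.0)

def relative_anomalous_energy(s, width_times_n, coeff=C_LONG):
    """(E - s)/s for long membranes, using E - s = coeff * (l N)^(2/3) * s^(1/3)."""
    return coeff * (width_times_n / s) ** (2.0 / 3.0)

ln = 50.0  # assumed sample value of l*N
ratios = [relative_anomalous_energy(ln * 10**k, ln) for k in (1, 3, 6, 9)]
for k, r in zip((1, 3, 6, 9), ratios):
    print(f"s = (l N) * 10^{k}:  (E - s)/s = {r:.3e}")
```

Each factor of 10^3 in s at fixed ℓN suppresses the ratio by a factor of 100, in line with the operators having asymptotically small anomalous dimensions for s ≫ ℓN.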
Suppose there is no boundary condition which fixes $\ell$ to a finite value. The prescription
is then to vary $\ell$ keeping s fixed so as to minimize E. The minimal energy configuration for
given spin s is obtained by taking $\ell \to 0$, ω → 1 which results in an infinitely long
string-like membrane with energy E = s (the ratio E/s is larger for short wide membranes than
for long thin ones). Note that this geometry is assumed for any value of s, unlike in the case
of the type IIB closed string which became infinitely long only as $s/\sqrt{\lambda} \to \infty$. As $\ell \to 0$
the dual Wilson surface collapses and the higher derivative corrections to the AN−1 (2, 0)
theory become suppressed, resulting in a flow down to the free tensor theory describing
the unbroken phase with hs(8∗ |4) gauge symmetry.
Let us examine the supersymmetry of this solution. The condition for worldvolume
supersymmetry is [75]
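$$ \Gamma\epsilon = \epsilon, \qquad \Gamma = \frac{1}{\sqrt{-\det\gamma}}\, \frac{1}{3!}\, \epsilon^{\alpha\beta\gamma}\, \partial_\alpha X^M \partial_\beta X^N \partial_\gamma X^P\, \Gamma_{MNP}, \tag{7.18} $$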
and that $\epsilon$ is the Killing spinor of the AdS7 × S 4 background. An important property of
these Killing spinors is that as we approach the boundary of AdS7 , i.e., as ρ → ∞, they
become an eigenstate of a constant Γ -matrix as follows [75]
Γ = , Γ = Γ012345, (7.19)
where Γa are flat Dirac matrices and a = 0, . . . , 5 are the indices tangent to the boundary
of AdS7 . We have relabeled the coordinates of AdS7 as
t, φ, θ, ψ , θ , φ , ρ → (x0 , x1 , . . . , x5 , ρ), (7.20)
where (ψ , θ , φ ) are the S 3 angles. Now, inserting the solution (7.8) into the definition of
Γ in (7.18) gives
(cΓ0 + (ωs) sin θ Γ1 )Γ62
Γ = , (7.21)
c2 − ω2 s 2 sin2 θ
where c = cosh ρ and s = sinh ρ. Next, we find that [Γ, Γ ] = 0. Therefore the
worldvolume supersymmetries can be written as
We have proposed that type IIB string theory with N units of D3-brane charge and
M-theory with N units of M2-brane or M5-brane charge have unbroken phases described
by HS gauge theories which admit consistent truncations to massless HS gauge theories
in D = 4, 5, 7 with holographic duals given by SU(N) valued scalar singleton theories in
d = 3, 4, 6 with 16 supersymmetries. The corresponding HS algebras are
which can be used to re-construct the spectrum of the type IIB string and M-theory in
appropriate limits as discussed in more detail in Section 5.
In the case of type IIB string on AdS5 × S 5 , we have conjectured that the hs(2, 2|4)
gauge theory arises in a critical limit of the type IIB theory in which
g_str ∼ 1/N, l_str ∼ R, fixed R, N ≫ 1, (8.4)
and that this limit corresponds to the free 4D, N = 4, SU(N) SYM in which gYM = 0. This
means that the relations between the closed string parameters in AdS5 × S 5 and the gauge
theory parameters for λ ≫ 1, which are read off from the D3-brane solution obtained in the
supergravity approximation, are renormalized, as discussed in Section 5, and summarized
in (1.1) and (1.2).
In the case of M-theory on AdS_{4/7} × S^{7/4}, we have conjectured the holographic
boundary theories to be an SU(N)-valued OSp(8|4) singleton field theory which resides
at a UV fixed point in 3d, and a free SU(N)-valued (2, 0) tensor singleton field theory
residing at an IR fixed point in 6d.
The spectrum of massless states in all the HS gauge theories discussed here has the
universal property that all states arise in the symmetric product of two singletons. This
motivates a worldsheet sigma model description of these theories based on an affine
extension of AdS superalgebras in D = 4, 5, 7 with critical KM levels leading to
left-moving and right-moving singleton Verma modules with a maximal number of null-states.
In this respect, the existence of singleton-like representations of affine SO(3, 2) with level
k_crit = 5/2 found in [70] is encouraging.
The idea of obtaining the massless states of a D = 4, N = 8 HS theory starting from the
free OSp(8|4) singleton theory, which in turn was obtained from the eleven-dimensional
supermembrane on AdS4 × S 7 , already appeared long ago [40]. We recall that all the
massless fields in this theory, with the exception of a pseudoscalar, satisfy the energy-spin
relation E0 = s + 1. More recently, long rotating strings that extend to the boundary of
AdS5 and couple to operators which are asymptotically anomaly free, i.e., (E − s)/s → 0
as E, s → ∞, have been studied [34].
Motivated by the above considerations, we have found rotating long membrane
solutions (7.8) to the equations which describe the M2-brane in the AdS7 × S4 background.
These membranes have width ℓ and the geometry of infinitely stretched strings with energy
and spin density concentrated at the end points. They satisfy the semi-classical energy-spin
relation E = s. A feature not present in the string case is that the energy is minimized for
fixed spin by sending the angular velocity ω → 1 and the width ℓ → 0 keeping s fixed,
resulting in infinitely long membranes with string-like geometry and semi-classical energy
E = s. In Section 7, we have interpreted these as the lowest weight states of the massless
supermultiplets of the 7D HS gauge theory discussed in Section 2 (see Table 3). Further
aspects of this picture, especially the quantization issue, remain to be studied.
It would also be interesting to study the spherical membrane in AdS4 and examine
whether it admits ‘breathing’ and ‘rotation’ modes similar to those of strings in AdS3 with
NS-fluxes [78].
As there is effectively no separation in AdS energy between the massless HS fields
and the massive HS fields, we have proposed that the massless HS theories (based on HS
extension of the 32 supercharge AdSd+2 superalgebras in d = 3, 4, 6) arise as a result of
consistent truncation of the full HS theories. This proposal can be tested explicitly since
for large N , the singleton theory and the HS gauge theory can be compared order by
order in the 1/N expansion: consistent truncation implies that the massless HS theory
action is the generating functional of correlators of bilinear operators. Indeed, a correlation
function of four bilinear operators in a singleton theory can be written in a manifestly
s–t–u symmetric form in terms of two- and three-point functions involving only bilinear
operators, as discussed in Section 4.
We have also examined mechanisms for spontaneous breaking of HS gauge symmetry
down to the symmetries underlying ordinary supergravity. In D = 4, 5 the ‘order
parameter’ for breaking of HS gauge symmetry is the holographic Yang–Mills coupling.
In 4D this is a marginal deformation which corresponds to a finite dilaton VEV in the
bulk. The broken theory has an AdS vacuum in which the broken gauge fields have
non-critical masses m² − m²_crit ∼ N g²_YM / R². Using the non-intersection principle we argue
that these cross over into the leading Regge trajectory as N g²_YM becomes large. We have also identified the
Higgs multiplets at arbitrary level in the HS spectrum, and the realization of the level-
one Higgs multiplet in terms of composite operators (i.e., anomaly multiplets) in the free
singleton SCFT.
Also in 3d, where the Yang–Mills coupling is a relevant perturbation, we have identified
the Higgs multiplets at arbitrary level in the HS spectrum, and the realization of the level-
one Higgs multiplet in terms of composite operators (i.e., anomaly multiplets) in the free
OSp(8|4) singleton field theory.
In D = 7 we do not know what the order parameter for breaking HS gauge symmetry is,
nor have we identified the Higgs multiplets. This is presumably related to the fact that the
massless gauge fields in D = 7 belong to the discrete B series (see (B.16) in Appendix B).
We believe this issue should have a simple resolution in a framework where the nature of
the mysterious interactions on the five-brane is well understood. We stress that the Higgsing
of the 7D HS gauge theory is dual to weak irrelevant perturbations of the tensor theory in
the IR, which should be describable using a field theoretic, perhaps non-local, construction
in 6d. One may also speculate that the continuous A series (see (B.15)) could play a role
in this, since the corresponding fields can be Higgsed, which signals the existence of the
corresponding anomaly multiplets. This, in turn, would provide valuable data on the details
of the interactions in 6d.
An interesting open problem is to use the HS gauging techniques described in Sections 2
and 6 to construct interactions between massless HS fields and Higgs fields. Clearly, the
issue of consistent truncation becomes moot once we include (massive) Higgs fields. It is
therefore a challenge to examine whether some generalized truncation scheme, perhaps of
the type described in [27], may temper the fluctuations in the massive sector.
In testing various aspects of the AdS/HS gauge theory correspondences discussed in this
paper, it will be very useful to develop a deeper understanding of the geometrical nature
of HS interactions, possibly formulating them in a generalized superembedding approach.
This would provide a universal tool for studying the HS dynamics [91] which would not
only simplify the task of coupling Higgs master fields to HS gauge theories but also yield
a superfield formulation [91] that would simplify the treatment of the bulk interaction and
the computations of the attendant Witten diagrams. On the boundary side, the existing
literature on the OPE computations involving free fields should be extended to cases where
subleading in 1/N contributions will arise [22]. We have described a few examples of such
correlators in Section 4.
In this paper we have focused our attention on HS gauge theories in D = 4, 5, 7. No
doubt these results can be extended to AdS6 as well. In D = 3 the HS gauge fields do
not propagate physical degrees of freedom. Nonetheless, physical matter fields of spin
s = 0, 1/2 can be coupled to massless HS gauge theory [92,93]. The advantage here is
that an action principle is known and the mathematics is much simpler than in higher
dimensions. It would be interesting to study this model in the context of massless higher
spins and holography.
At the algebraic level there is in principle no bound on the number of supersymmetries
in HS gauge theories and we expect consistent massless interactions for any N in D ≤ 7,
though certain restrictions follow from the requirement of an R symmetry neutral vierbein
[91]. As discussed in Section 4, the restrictions on the spacetime superdimension are
instead expected to be related to the consistency of the full HS quantum theory, including
both massless and massive states, which requires the full generating functional (4.9) of the
free singleton SCFT with finite sources for composite operators. Effectively, the condition
that this quantity exists is expected to be as restrictive in the free singleton SCFT as in the
(strongly) interacting singleton SCFT. This may lead to the restriction that the holographic
dual cannot have more than 16 supersymmetries in d ≤ 6. Similar restrictions should
follow from the quantum consistency of the yet to be constructed dual bulk sigma models.
In Section 4, similar effects were argued to arise in the holographic theory due to insertions
of sewing operators in the free singleton field theory required for unitarity.
Another particularly interesting class of singleton CFTs, which we have not considered
here, are the free 4d conformal HS theories constructed in [11]. Here the singleton field
is a master field comprising an infinite set of ordinary singletons which together form an
irreducible representation of a HS extension of the d-dimensional conformal group. In that
case the relevant HS symmetry algebra is an infinite-dimensional extension of Sp(8, R)
which contains the AdS group in 5D.
To conclude, we believe that the remarkable algebraic and geometric structures
underlying HS gauge symmetry are natural extensions of supergravity and will be
important guides towards the true foundations of string and M-theory. In particular, the
simplicity of their holographic duals together with the fact that the bulk physics can still
be phrased in a relatively simple language is both gratifying and compelling. Clearly much
remains to be done in this subject which may be viewed as being still in its infancy.
Acknowledgements
In this appendix, we tabulate the spectra of singletons and the generators of the super-
HS groups and the field content of the master scalar fields in AdS5 and AdS7 . The case of
Table 4
The hs(2, 2|4) generators with Y = 0, ±1 arranged into levels labeled by ℓ = ¼(n_y + n_ȳ + n_θ + n_θ̄ − 2). The
entries are SU(4) × U(1)_Y representations as follows: 15 = 15_0, 4 = 4_1, 1 = 1_0, 16 = 15_0 + 1_0, 24 = 20_1 + 4_1
and 36 = 20_0 + 15_0 + 1_0, where the U(1)_Y charge is defined by Y = n_y − n_ȳ. The SO(4, 1) content is given by
the highest weights m_1 ≥ m_2 ≥ ½|Y|, where m_1 = ½(n_y + n_ȳ). Upon gauging, these generators give rise to spin
s = m_1 + 1 gauge fields which can be used to write a canonical set of covariant curvature constraints. As a result
the gauge fields for m_2 ≥ ½|Y| + 1, s ≥ 2 are auxiliary while those for m_2 = ½|Y| contain physical degrees of
freedom
ℓ\s   1     3/2   2     5/2   3     7/2   4     9/2   5     11/2  6     ···
0     15    4     1     ·     ·     ·     ·     ·     ·     ·     ·
1     16    24    36    24    16    4     1     ·     ·     ·     ·
2     ·     ·     1     4     16    24    36    24    16    4     1
3     ·     ·     ·     ·     ·     ·     1     4     16    24    36    ···
4     ·     ·     ·     ·     ·     ·     ·     ·     ·     ·     1     ···
⋮
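The spin range occupied by each level in Table 4 can be cross-checked from the caption's two formulas, ℓ = ¼(n_y + n_ȳ + n_θ + n_θ̄ − 2) and s = m_1 + 1 with m_1 = ½(n_y + n_ȳ). The following is a minimal Python sketch of that bookkeeping, not part of the original construction. It assumes that the total fermionic oscillator number n_θ + n_θ̄ is bounded by 8 (four θ and four θ̄ oscillators); this bound and the names `N_THETA_MAX` and `spin_range` are illustrative assumptions. The sketch ignores the restriction on Y and says nothing about the SU(4) content.

```python
from fractions import Fraction

# Assumption (not stated in the table caption): four theta and four
# theta-bar oscillators, so that n_theta + n_thetabar is at most 8.
N_THETA_MAX = 8


def spin_range(level, n_theta_max=N_THETA_MAX):
    """Return (s_min, s_max) for generators at a given level.

    Uses level = (n_y + n_ybar + n_theta + n_thetabar - 2) / 4 and
    s = m1 + 1 with m1 = (n_y + n_ybar) / 2, as in the caption of Table 4.
    """
    total = 4 * level + 2  # total oscillator number at this level
    spins = []
    for n_fermionic in range(0, min(total, n_theta_max) + 1):
        n_bosonic = total - n_fermionic        # n_y + n_ybar
        spins.append(Fraction(n_bosonic, 2) + 1)
    return min(spins), max(spins)


if __name__ == "__main__":
    for level in range(5):
        s_min, s_max = spin_range(level)
        print(f"level {level}: spin from {s_min} to {s_max}")
```

Under the stated assumption the ranges agree with the left-most entries of the rows of Table 4: the level 0, 1, 2, 3 and 4 rows start at s = 1, 1, 2, 4 and 6, respectively.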
Table 5
The hs(2, 2|4) generators with Y = ±2, ±3, ±4. The entries are SU(4) × U(1)_Y representations as follows:
16 = 10_2 + 6_2, 4 = 4_3, 6 = 6_2 and 1 = 1_4. Further notation is defined in Table 4. These generators are associated
with gauge fields dual to generalized antisymmetric tensor fields contained in the scalar master field Φ; see
Table 7 for s ≥ 1
ℓ\s   2     5/2   3     7/2   4     9/2   5     11/2  6     ···
1     16    4     6     ·     ·     ·     ·     ·     ·
2     ·     ·     6     4     16+1  4     6     ·     ·
3     ·     ·     ·     ·     ·     ·     6     4     16+1  ···
⋮
Table 6
The d = 4, N = 4 singletons. The quantity Z is the SU(2, 2|4) central charge carried by the supermultiplet. The
entries in the table denote SU(4) representations. Each entry carries an SO(4) ⊂ SO(6) ⊂ SO(6, 2) representation
(j_L, 0), and their complex conjugates (0, j_R). The states for each value of |Z| form a single massless irrep of
d = 4, N = 4 Poincaré superalgebra, and the states carry spin s = j_L. For all the states E0 = s + 1, where E0 is
the lowest AdS5 energy. There exists an outer automorphism group U(1)_Y of SU(2, 2|4), and the U(1)_Y charges
of 6, 4 and 1 are 0, ±1 and ±2, respectively. The Z = 0 multiplet is the d = 4, N = 4 SYM singleton multiplet
which has 8 + 8 degrees of freedom. All the other singleton multiplets have 16 + 16 degrees of freedom. For
superfield realization of all the singletons listed in this table, see Section 3.2
|Z|\s  0     1/2   1     3/2   2     5/2   3     ···
0      6     4     1     ·     ·     ·     ·
1/2    4     6+1   4     1     ·     ·     ·
1      1     4     6     4     1     ·     ·
3/2    ·     1     4     6     4     1     ·
2      ·     ·     1     4     6     4     1
⋮
Table 7
The physical fields contained in the master scalar field Φ arising in the hs(2, 2|4) gauge theory in D = 5.
The entries are the following SU(4) × U(1)_Y representations for s < 1: 42 = 20_0 + 10_2 + 10_{−2} + 1_4 + 1̄_{−4},
48 = 20_1 + 20_{−1} + 4_3 + 4̄_{−3}, 8 = 4_1 + 4̄_{−1} and 1_0; for s ≥ 1: 6_2, 4_3, 16 = 10_2 + 6_2 and 1_4. The spin s ≥ 1
sector is realized in the field theory in terms of two-form potentials and their higher spin generalizations. These
fields obey self-duality in D = 5 and have dual one-form gauge fields corresponding to the generators given
in Table 5, with the exception of the underlined representations, which have no one-form duals. Here the form
degree refers to the number of curved indices as opposed to the tangential multi-spinor indices arising from the
(y, ȳ)-expansion
ℓ\s   0     1/2   1     3/2   2     5/2   3     7/2   4     9/2   5     11/2  6     ···
0     42    48    6     ·     ·     ·     ·     ·     ·     ·     ·     ·     ·
1     1     8     6     4     16+1  4     6     ·     ·     ·     ·     ·     ·
2     ·     ·     ·     ·     ·     ·     6     4     16+1  4     6     ·     ·
3     ·     ·     ·     ·     ·     ·     ·     ·     ·     ·     6     4     16+1  ···
⋮
Table 8
The hs(8*|4) generators with Y = 0, ±1 arranged into levels labeled by ℓ = ¼(n_y + n_ȳ + n_θ + n_θ̄ − 2).
The entries are SO(5) × U(1)_Y representations as follows: 10 = 10_0, 4 = 4_1, 1 = 1_0, 20 = 16_1 + 4_1 and
20 = 14_0 + 5_0 + 1_0, where the U(1)_Y charge is defined by Y = n_y − n_ȳ. The SO(6, 1) content is labeled
by highest weights m_1 ≥ m_2 ≥ m_3 = ½|Y|, where m_1 = ½(n_y + n_ȳ). Upon gauging, these generators give rise
to spin s = m_1 + 1 gauge fields which can be used to write a canonical set of covariant curvature constraints.
As a result the gauge fields for m_2 ≥ ½|Y| + 1, s ≥ 2 are auxiliary while those for m_2 = ½|Y| contain physical
degrees of freedom
ℓ\s   1     3/2   2     5/2   3     7/2   4     9/2   5     11/2  6     ···
0     10    4     1     ·     ·     ·     ·     ·     ·     ·     ·
1     10    20    20    20    10    4     1     ·     ·     ·     ·
2     ·     ·     1     4     10    20    20    20    10    4     1
3     ·     ·     ·     ·     ·     ·     1     4     10    20    20    ···
4     ·     ·     ·     ·     ·     ·     ·     ·     ·     ·     1     ···
⋮
Table 9
The hs(8*|4) generators with Y = ±2, ±3, ±4. The entries are SO(6) × U(1)_Y representations as follows:
15 = 5_2 + 10_2, 4 = 4_3, 6 = 5_2 + 1_2 and 16 = 10_2 + 5_2 + 1_4. These generators are associated with gauge fields
dual to generalized antisymmetric three-form tensor fields contained in the scalar master field Φ; see Table 11 for
s ≥ 1. Further notation is defined in Table 8
ℓ\s   2     5/2   3     7/2   4     9/2   5     11/2  6     ···
1     15    4     6     ·     ·     ·     ·     ·     ·
2     ·     ·     6     4     16    4     6     ·     ·
3     ·     ·     ·     ·     ·     ·     6     4     16    ···
⋮
Table 10
The d = 6, N = (2, 0) singletons. The quantity Z denotes the SU(2)_Z spin defined in Section 2.3. The entries
denote USp(4)_Y ≃ SO(5) × U(1)_Y representations, which are irreducible except 6 = 5 + 1. The U(1)_Y charges
of 1, 4, 5 are 0, ±1 and ±2, respectively. The SO(6) highest weights (n_1, n_2, n_3) associated with each entry are
given by n_1 = n_2 = n_3 = s, and the AdS7 energy by E0 = s + 2. The level ℓ = 0 (Z = 0) multiplet is the
d = 6, N = (2, 0) tensor singleton; see Section 3.3 for superfield realization of all the singletons shown in the
table, and composites formed out of the tensor singleton
|Z|\s  0     1/2   1     3/2   2     5/2   3     ···
0      5     4     1     ·     ·     ·     ·
1/2    4     6     4     1     ·     ·     ·
1      1     4     6     4     1     ·     ·
3/2    ·     1     4     6     4     1     ·
2      ·     ·     1     4     6     4     1
⋮
Table 11
The physical fields expected to arise in the master scalar field Φ in the hs(8*|4) gauge theory in D = 7. The
entries are SO(6) × U(1)_Y representations, where 6 = 5 + 1 and 15 = 10 + 5. The spin s ≥ 1 sector is expected
to be realized in Φ in terms of three-form potentials and their higher spin generalizations. These fields obey
self-duality in D = 7 and have dual one-form gauge fields corresponding to the generators given in Table 9,
with the exception of the underlined representations, which have no one-form duals. Here the form degree
refers to the number of curved indices as opposed to the tangential multi-spinor indices arising from the
(y, ȳ)-expansion
ℓ\s   0       1/2     1       3/2     2         5/2     3       7/2     4         9/2     5       11/2    6         ···
0     14_0    16_1    5_2     ·       ·         ·       ·       ·       ·         ·       ·       ·       ·
1     1_0     4_1     6_2     4_3     15_2+1_4  4_3     6_2     ·       ·         ·       ·       ·       ·
2     ·       ·       ·       ·       ·         ·       6_2     4_3     15_2+1_4  4_3     6_2     ·       ·
3     ·       ·       ·       ·       ·         ·       ·       ·       ·         ·       6_2     4_3     15_2+1_4  ···
⋮
D = 4 is relatively simpler and has been presented in Section 2.1. The spectrum of physical
states described by the master gauge fields in D = 4, 5, 7 is also given in Section 2.
D(E0 , s; a1 , a2 , a3 , a4 ), (B.1)
where E0 is the minimum eigenvalue of the AdS energy generator M05 , s denotes SO(3) ⊂
SO(3, 2) spin and (a1 , a2 , a3 , a4 ) are the Dynkin labels of the SO(8) irrep carried by the
lowest energy state. There exist two series of supermultiplets [48]:
D(E0 , jL , jR ; a1 , a2 , a3 )Y , (B.7)
where E0 is the eigenvalue of the AdS energy generator M06 and (jL, jR) label the
SO(4) ⊂ SO(4, 2) irrep, (a1, a2, a3) denote the Dynkin labels of the SO(6) ≃ SU(4)
R-symmetry irrep carried by the minimum energy states and Y denotes the outer U(1)_Y
automorphism charge, which will often be suppressed when it is vanishing. There exist
three series of supermultiplets [42]:
which implies
E0 ≥ 1 + JL + a1 + a2 + a3, 1 + JL ≤ ½(a3 − a1);
(C) E0 = 2a1 + a2, a3 = a1, JL = JR = 0. (B.10)
In the case of series B, irreps with (JL ↔ JR , a1 ↔ a3 ) must also be included. The irreps
listed above are carried by the lowest components of the supermultiplets, and the entire
PSU(2, 2|4) supermultiplets are obtained by acting with supercharges.
The lowest components of the massless supermultiplets shown in Table 2 saturate the
unitarity bound of series A as E0 = s + 2, with JL = JR = s/2, except for the level ℓ = 0
supergravity multiplet, in which case D(2, 0, 0; 0, 2, 0) belongs to series C. The discrete
series C contains the BPS multiplets. In particular the Maxwell singleton multiplet is
characterized by D(1, 0, 0; 0, 1, 0) carried by its lowest component and it belongs to
series C. It can be described by a suitably constrained superfield. Taking a properly
symmetrized and constrained product of E0 singleton superfields one can construct BPS
superfields whose lowest components carry the following irreps [50]
BPS-1/2: D(p, 0, 0; 0, p, 0), (B.11)
BPS-1/4: D(p + 2q, 0, 0; q, p, q), (B.12)
BPS-1/8: D(p + 2q + 3r, 0, 0; q, p, q + 2r). (B.13)
The BPS-1/2 and BPS-1/4 multiplets belong to series C, and the BPS-1/8 multiplets
belong to series B. The KK towers of the level ℓ = 0 supergravity are the BPS-1/2
multiplets given by D(k, 0, 0; 0, k, 0) with k = 3, 4, . . . [3,58,59].
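The series assignment of the BPS multiplets can be checked directly against the series C condition in (B.10), E0 = 2a1 + a2 with a3 = a1 and JL = JR = 0. The following minimal Python sketch does this for the lowest weights (B.11)–(B.13); the function names `bps_lowest_weight` and `in_series_c` are illustrative and do not appear in the text. Since (B.13) reduces to (B.12) at r = 0 and to (B.11) at q = r = 0, a single parametrization covers all three cases.

```python
def bps_lowest_weight(p, q=0, r=0):
    """Lowest weight D(E0, 0, 0; a1, a2, a3) of (B.11)-(B.13).

    Returns (E0, (a1, a2, a3)); JL = JR = 0 for all of these irreps.
    r = 0 gives BPS-1/4 (B.12); q = r = 0 gives BPS-1/2 (B.11).
    """
    return p + 2 * q + 3 * r, (q, p, q + 2 * r)


def in_series_c(e0, dynkin, j_l=0, j_r=0):
    """Series C condition of (B.10): E0 = 2 a1 + a2, a3 = a1, JL = JR = 0."""
    a1, a2, a3 = dynkin
    return j_l == 0 and j_r == 0 and a3 == a1 and e0 == 2 * a1 + a2


if __name__ == "__main__":
    print(in_series_c(*bps_lowest_weight(p=3)))            # BPS-1/2
    print(in_series_c(*bps_lowest_weight(p=2, q=1)))       # BPS-1/4
    print(in_series_c(*bps_lowest_weight(p=1, q=1, r=1)))  # BPS-1/8
    print(in_series_c(1, (0, 1, 0)))                       # Maxwell singleton
    print(in_series_c(2, (0, 2, 0)))                       # level-0 supergravity
```

The BPS-1/8 case with r ≥ 1 fails the condition a3 = a1, consistent with the statement that these multiplets belong to series B rather than series C.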
There exists an extensive literature on the OPEs of various BPS-1/2 operators. The
UIRs which can appear in these OPEs belong to series A with JL = JR = s/2, and series C
[51].
The index i = 1, ..., 4 labels the 4-plet of USp(4), the index α = 1, ..., 4 labels the chiral
spinor of SO(6), W^{ij} = −W^{ji} is symplectic traceless, Ω^{ij} W_{ij} = 0, and ω_{α1···α_{ℓ−2}} is
totally symmetric in its indices (see Table 10 for further details). The superspace constraints
imposed on these superfields can be found in [50]. The superfield W_{ij} represents the
well known (2, 0) tensor singleton and it is a singlet under an SU(2)_Z group defined in
Section 2.3. The singleton superfields (W^i, W, ω_{α1···α_{ℓ−2}}), on the other hand, carry SU(2)_Z
spins (1/2, 1, ℓ/2), respectively. These are the level ℓ = 1, 2 and ℓ ≥ 3 singletons shown
in Table 10.
Taking suitably symmetrized and constrained products of singleton superfields, one can
construct BPS superfields whose lowest components carry the following UIRs [50]:
Both of these belong to series D. The KK towers of the level ℓ = 0 supergravity are the
BPS-1/2 multiplets given by D(2k, 0, 0, 0; 0, k) with k = 3, 4, . . . [50].
The OPEs of BPS-1/2 operators have been studied [51,90]. The supermultiplets that
can appear in the OPE of two BPS-1/2 operators belong to series A with (J1 , J2 , J3 ) =
(0, s, 0); series B with (J1 , J2 ) = (0, s) and E0 = 4 + s + 2(a1 + a2 ); series C with J1 = 0
and E0 = 2 + 2(a1 + a2 ), and series D [51].
C.1. USp(8)
n1 = a1 + a2 + a3 + a4 , n2 = a2 + a3 + a4 ,
n3 = a3 + a4 , n4 = a4 .
The HWS labels of SU(4) irreps are (n1, n2, n3). They satisfy n1 ≥ n2 ≥ n3 and they
are related to the Dynkin labels [a1, a2, a3] as follows:
n1 = a1 + a2 + a3, n2 = a2 + a3, n3 = a3.
The SO(6) HW labels (m1, m2, m3) obey m1 ≥ m2 ≥ |m3| and they are related to the
SO(6) Dynkin labels [b1, b2, b3] as
The USp(4) irreps have the HWS labels (n1, n2) which satisfy n1 ≥ n2 and they are
related to the Dynkin labels [a1, a2] as
n1 = a1 + a2, n2 = a2.
The irreps of SO(5) have HW labels (m1, m2) which satisfy m1 ≥ m2 ≥ 0 and they are
related to the SO(5) Dynkin labels [b1, b2] as
m1 = b1 + ½ b2, m2 = ½ b2.
These are related to the USp(4) HW labels (n1, n2) and USp(4) Dynkin labels [a1, a2] as
m1 = ½(n1 + n2), m2 = ½(n1 − n2),
b1 = a2, b2 = a1.
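The label conversions listed above are simple enough to check mechanically. The following minimal Python sketch implements the partial-sum rule n_i = a_i + a_{i+1} + ... that is common to the USp(8), SU(4) and USp(4) relations, together with the two routes from USp(4) labels to SO(5) highest weights, and confirms that the routes agree. All function names are illustrative and are not taken from the text.

```python
from fractions import Fraction


def dynkin_to_hws(dynkin):
    """HWS labels n_i = a_i + a_{i+1} + ... + a_r from Dynkin labels [a_1, ..., a_r].

    This is the rule quoted for USp(8), SU(4) and USp(4).
    """
    return tuple(sum(dynkin[i:]) for i in range(len(dynkin)))


def so5_hw_from_dynkin(b1, b2):
    """SO(5) HW labels: m1 = b1 + b2/2, m2 = b2/2."""
    return (b1 + Fraction(b2, 2), Fraction(b2, 2))


def so5_hw_from_usp4_hws(n1, n2):
    """SO(5) HW labels from USp(4) HWS labels: m1 = (n1+n2)/2, m2 = (n1-n2)/2."""
    return (Fraction(n1 + n2, 2), Fraction(n1 - n2, 2))


def usp4_to_so5_dynkin(a1, a2):
    """USp(4) Dynkin labels [a1, a2] to SO(5) Dynkin labels: b1 = a2, b2 = a1."""
    return (a2, a1)


if __name__ == "__main__":
    # Both routes to the SO(5) highest weights must agree for every USp(4) irrep.
    for a1 in range(4):
        for a2 in range(4):
            via_hws = so5_hw_from_usp4_hws(*dynkin_to_hws((a1, a2)))
            via_dynkin = so5_hw_from_dynkin(*usp4_to_so5_dynkin(a1, a2))
            assert via_hws == via_dynkin
    print(dynkin_to_hws((0, 1, 0)))                         # SU(4) irrep [0, 1, 0]
    print(so5_hw_from_usp4_hws(*dynkin_to_hws((1, 0))))     # USp(4) 4-plet
```

For example, the USp(4) 4-plet [1, 0] has HWS labels (1, 0) and maps to the SO(5) spinor weights (½, ½), while [0, 1] maps to the vector weights (1, 0).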
C.4. SO(8)
where η = diag(−, +, ..., +, −). The compact basis, which is suitable for describing
physical AdS fields, consists of the AdS energy E = −M_{0,d+2}, the SO(d) generators M_{ij}
(i = 1, ..., d) and the spin-boosts L^±_i = M_{i,d+2} ∓ i M_{0i}, which shift the AdS energy by ±1.
In the compact basis the SO(d, 2) weight spaces D(E0; m1, ..., m_{[d/2]}) are obtained by acting
with L^+_i on lowest weight states, which have minimal energy E = E0 and carry SO(d)
highest weights (m1, ..., m_{[d/2]}). Note that the label m1 is the SO(3) ⊂ SO(3, 2) spin in
the case of AdS4 and the sum jL + jR of SU(2)_L × SU(2)_R ≃ SO(4) ⊂ SO(4, 2) spins
in the case of AdS5. The non-compact basis, which is suitable for describing conformal
fields, consists of the dilatation generator D = M_{d,d+2}, the SO(d − 1, 1) generators
M_{µν} (µ = 0, 1, ..., d − 1), the d-dimensional momentum P_µ = M_{µd} + M_{µ,d+2} and the
generator of special conformal transformations K_µ = M_{µd} − M_{µ,d+2}. The compact basis
(E, M_{ij}, L^±_i) and non-compact basis (D, M_{µν}, K_µ, P_µ) are related [94] by a similarity
transformation executed by the (non-unitary) operator
$$S = \exp\big(i L^+_d\big), \tag{D.2}$$
with the following properties:
$$S D S^{-1} = -iE + \tfrac{1}{2} L^-_d, \tag{D.3}$$
$$S M_{0a} S^{-1} = -i M_{a,d} - \tfrac{i}{2} L^-_a, \qquad S M_{ab} S^{-1} = M_{ab}, \tag{D.4}$$
$$S K_0 S^{-1} = -\tfrac{i}{2} L^-_d, \qquad S K_a S^{-1} = -\tfrac{1}{2} L^-_a, \tag{D.5}$$
where we have split the indices as follows:
$$i = \underbrace{1, 2, \dots, d-1}_{a},\ d, \qquad \mu = 0,\ \underbrace{1, 2, \dots, d-1}_{a}. \tag{D.6}$$
Hence (d + 1)-dimensional time-evolution and spatial rotation are equivalent to d-dimen-
sional dilatation and Lorentz rotation. Thus
∆ = E0
and Lorentz spin given by (m1 , . . . , m[d/2]).
References
[1] J.M. Maldacena, The large N limit of superconformal field theories and supergravity, JHEP 9807 (1998)
013, hep-th/9711200.
[2] S.S. Gubser, I.R. Klebanov, A.M. Polyakov, Gauge theory correlators from noncritical strings, Phys. Lett.
B 428 (1998) 105, hep-th/9802109.
[3] E. Witten, Anti-de Sitter space and holography, Adv. Theor. Math. Phys. 2 (1998) 253, hep-th/9802150.
[4] O. Aharony, S.S. Gubser, J. Maldacena, H. Ooguri, Y. Oz, Large N field theories, string theory and gravity,
Phys. Rep. 323 (2000) 183, hep-th/9905111.
[5] E. D’Hoker, D.Z. Freedman, Supersymmetric gauge theories and the AdS/CFT correspondence, TASI
Lectures, hep-th/0201253.
[6] A.M. Polyakov, Gauge fields and spacetime, hep-th/0110196.
[7] D. Anselmi, Theory of higher spin tensor currents and central charges, Nucl. Phys. B 541 (1999) 323, hep-
th/9808004.
[8] S.E. Konstein, M.A. Vasiliev, V.N. Zaikin, Conformal higher spin currents in any dimension and AdS/CFT
correspondence, JHEP 0012 (2000) 018, hep-th/0010239.
[9] E. Sezgin, P. Sundell, Doubletons and 5D higher spin gauge theory, JHEP 0109 (2001) 036, hep-th/0105001.
[10] E. Sezgin, P. Sundell, Towards massless higher spin extension of D = 5, N = 8 gauged supergravity,
JHEP 0109 (2001) 025, hep-th/0107186.
[11] M.A. Vasiliev, Conformal higher spin symmetries of 4D massless supermultiplets and osp(L, 2M) invariant
equations in generalized (super)space, hep-th/0106149.
[12] M.A. Vasiliev, Cubic interactions of bosonic higher spin gauge fields in AdS5 , Nucl. Phys. B 616 (2001)
106, hep-th/0106200.
[13] A. Mikhailov, Notes on higher spin symmetries, hep-th/0201019.
[14] M.A. Vasiliev, Consistent equations for interacting gauge fields of all spins in 3 + 1 dimensions, Phys. Lett.
B 243 (1990) 378.
[15] M.A. Vasiliev, Higher spin gauge theories: star-product and AdS space, hep-th/9910096.
[16] E. Sezgin, P. Sundell, Higher spin N = 8 supergravity, JHEP 9811 (1998) 016, hep-th/9805125.
[17] E. Sezgin, P. Sundell, Higher spin N = 8 supergravity in AdS4 , hep-th/9903020.
[18] E. Sezgin, P. Sundell, On curvature expansion of higher spin gauge theory, Class. Quantum Grav. 18 (2001)
3241, hep-th/0012168.
[19] E. Sezgin, P. Sundell, Analysis of higher spin field equations in four dimensions, JHEP 0207 (2002) 055,
hep-th/0205132.
[20] E. Sezgin, P. Sundell, 7D higher spin gauge theory: bosonic algebra and linearized constraints, hep-
th/0112100.
[21] E. Witten, Talk given at J.H. Schwarz’ 60th Birthday Conference, Cal Tech, 2–3 November 2001.
[22] P. Haggi-Mani, B. Sundborg, Free large N supersymmetric Yang–Mills theory as a string theory, JHEP 0004
(2000) 031, hep-th/0002189.
[23] B. Sundborg, Stringy gravity, interacting tensionless strings and massless higher spins, Nucl. Phys. Proc.
Suppl. 102 (2001) 113, hep-th/0103247.
[24] U. Danielsson, F. Kristiansson, P. Rajan, E. Sezgin, P. Sundell, in preparation.
[25] S. Lee, S. Minwalla, M. Rangamani, N. Seiberg, Three-point functions of chiral operators in D = 4, N = 4
SYM at large N , Adv. Theor. Math. Phys. 2 (1998) 697, hep-th/9806074.
[26] E. D’Hoker, D.Z. Freedman, S.D. Mathur, A. Matusis, L. Rastelli, Extremal correlators in the AdS/CFT
correspondence, hep-th/9908160.
[27] E. D’Hoker, B. Pioline, Near-extremal correlators and generalized consistent truncation for AdS4|7 × S 7|4 ,
JHEP 0007 (2000) 021, hep-th/0006103.
[28] G.W. Semenoff, K. Zarembo, Wilson loops in SYM theory: from weak to strong coupling, Nucl. Phys. Proc.
Suppl. 108 (2002) 106, hep-th/0202156.
[29] I.R. Klebanov, P. Ouyang, E. Witten, A gravity dual of the chiral anomaly, hep-th/0202056.
[30] D. Berenstein, J. Maldacena, H. Nastase, Strings in flat space and pp waves from N = 4 super-Yang–Mills,
JHEP 0204 (2002) 013, hep-th/0202021.
[31] N. Seiberg, Notes on theories with 16 supercharges, Nucl. Phys. Proc. Suppl. 67 (1998) 158, hep-th/9705117.
[32] K. Kikkawa, M. Yamasaki, Can the membrane be a unification model?, Prog. Theor. Phys. 76 (1986) 1379.
[33] L. Mezincescu, R.I. Nepomechie, P. van Nieuwenhuizen, Do supermembranes contain massless particles?,
Nucl. Phys. B 309 (1988) 317.
[34] S.S. Gubser, I.R. Klebanov, A.M. Polyakov, A semi-classical limit of the gauge/string correspondence, hep-
th/0204051.
[35] S.E. Konstein, M.A. Vasiliev, Extended higher-spin superalgebras and their massless representations, Nucl.
Phys. B 331 (1990) 475.
[36] M. Günaydin, Oscillator like unitary representations of non-compact gauge groups and supergroups and
extended supergravity theories, in: E. Inönü, M. Serdaroglu (Eds.), Lecture Notes in Physics, Vol. 180,
1983.
[37] H. Nicolai, E. Sezgin, Singleton representations of OSp(N, 4), Phys. Lett. B 143 (1984) 389.
[38] M. Flato, C. Fronsdal, One massless particle equals two Dirac singletons, Lett. Math. Phys. 2 (1978) 421.
[39] M. Günaydin, N.P. Warner, Unitary supermultiplets of OSp(8/4, R) and the spectrum of the S 7
compactification of eleven-dimensional supergravity, Nucl. Phys. B 272 (1986) 99.
[40] E. Bergshoeff, A. Salam, E. Sezgin, Y. Tanii, Singletons, higher spin massless states and the supermembrane,
Phys. Lett. B 205 (1988) 237.
[41] M. Günaydin, D. Minic, M. Zagermann, Doubleton conformal theories, CPT and IIB string on AdS5 × S 5 ,
Nucl. Phys. B 534 (1998) 96;
M. Günaydin, D. Minic, M. Zagermann, Nucl. Phys. B 538 (1999) 531, Erratum, hep-th/9806042.
[42] V.K. Dobrev, V.B. Petkova, All positive energy unitary irreducible representations of extended conformal
supersymmetry, Phys. Lett. B 162 (1985) 127.
[43] M. Günaydin, N. Marcus, The spectrum of the S 5 compactification of the chiral N = 2, D = 10 supergravity
and the unitary supermultiplets of U (2, 2/4), Class. Quantum Grav. 2 (1985) L11.
[44] H. Nicolai, E. Sezgin, Y. Tanii, Conformally invariant supersymmetric field theories on S p × S 1 , Nucl. Phys.
B 305 (1988) 483.
[45] R.R. Metsaev, Massless arbitrary spin fields in AdS5 , hep-th/0201226.
[46] M. Günaydin, S. Takemae, Unitary supermultiplets of OSp(8∗ |4) and the AdS7 /CFT 6 duality, Nucl. Phys.
B 578 (2000) 405, hep-th/9910110.
[47] M. Günaydin, P. van Nieuwenhuizen, N.P. Warner, General construction of anti-de Sitter superalgebras and
the spectrum of the S 4 compactification of eleven-dimensional supergravity, Nucl. Phys. B 255 (1985) 63.
[48] S. Minwalla, Restrictions imposed by superconformal invariance on quantum field theories, Adv. Theor.
Math. Phys. 2 (1998) 781, hep-th/9712074.
[49] P.S. Howe, E. Sezgin, Superbranes, Phys. Lett. B 390 (1997) 133, hep-th/9607227.
[50] S. Ferrara, E. Sokatchev, Superconformal interpretation of BPS states in AdS geometries, Int. J. Theor.
Phys. 40 (2001) 935, hep-th/0005151.
[51] S. Ferrara, E. Sokatchev, Universal properties of superconformal OPEs for 1/2 BPS operators in 3 ≤ D ≤ 6,
hep-th/0110174.
[52] S. Ferrara, E. Sokatchev, Conformal primaries of OSp(8/4, R) and BPS states in AdS4 , JHEP 0005 (2000)
038, hep-th/0003051.
[53] L. Andrianopoli, S. Ferrara, “Non-chiral” primary superfields in the AdSd+1 /CFT d correspondence, Lett.
Math. Phys. 46 (1998) 265, hep-th/9807150.
[54] S. Minwalla, Particles on AdS4/7 and primary operators on M2/5 brane worldvolumes, JHEP 9810 (1998)
002, hep-th/9803053.
[55] E. Bergshoeff, M. de Roo, B. de Wit, Extended conformal supergravity, Nucl. Phys. B 182 (1981) 173.
[56] P. Howe, K.S. Stelle, P.K. Townsend, Supercurrents, Nucl. Phys. B 192 (1981) 332.
[57] L. Andrianopoli, S. Ferrara, On short and long SU(2, 2/4) multiplets in the AdS/CFT correspondence, Lett.
Math. Phys. 48 (1999) 145, hep-th/9812067.
[58] S. Ferrara, C. Fronsdal, A. Zaffaroni, On N = 8 supergravity on AdS5 and N = 4 superconformal Yang–
Mills theory, Nucl. Phys. B 532 (1998) 153, hep-th/9802203.
[59] L. Andrianopoli, S. Ferrara, KK excitations on AdS5 × S 5 as N = 4 “primary” superfields, Phys. Lett. B 430
(1998) 248, hep-th/9803171.
[60] S. Ferrara, M.A. Lledó, A. Zaffaroni, Born–Infeld corrections to D3-brane action in AdS5 × S 5 and
N = 4, d = 4 primary superfields, Phys. Rev. D 58 (1998) 105029, hep-th/9805082.
[61] S. Ferrara, A. Zaffaroni, Bulk gauge fields in AdS supergravity and supersingletons, hep-th/9807090.
[62] S. Ferrara, A. Zaffaroni, Superconformal field theories, multiplet shortening, and the AdS5 /SCFT 4
correspondence, hep-th/9908163.
[63] S. Ferrara, E. Sokatchev, Short representations of SU(2, 2/N ) and harmonic superspace analyticity, Lett.
Math. Phys. 52 (2000) 247, hep-th/9912168.
[64] L. Andrianopoli, S. Ferrara, E. Sokatchev, B. Zupnik, Shortening of primary operators in N -extended SCFT 4
and harmonic analyticity, Adv. Theor. Math. Phys. 3 (1999) 1149, hep-th/9912007.
[65] P.J. Heslop, P.S. Howe, A note on composite operators in N = 4 SYM, Phys. Lett. B 516 (2001) 367, hep-
th/0106238.
[66] P.S. Howe, G. Sierra, P.K. Townsend, Supersymmetry in six dimensions, Nucl. Phys. B 221 (1983) 331.
[67] S. Ferrara, E. Sokatchev, Representations of (1, 0) and (2, 0) superconformal algebras in six dimensions:
massless and short superfields, Lett. Math. Phys. 51 (2000) 55, hep-th/0001178.
[68] H. Liu, Scattering in anti de Sitter space and operator product expansion, Phys. Rev. D 60 (1999) 106005,
hep-th/9811152.
[69] P. Ramond, Group theory for string states, in: M.J. Bowick, F. Gürsey (Eds.), in: High Energy Physics 1985,
Vol. 1, World Scientific, Singapore, 1986, p. 274.
[70] V.K. Dobrev, E. Sezgin, A remarkable representation of the SO(3, 2) Kac–Moody algebra, Int. J. Mod. Phys.
A 6 (1991) 4699.
[71] M.J. Duff, Supermembranes: the first fifteen weeks, Class. Quantum Grav. 5 (1988) 189.
[72] I.R. Klebanov, A.A. Tseytlin, Entropy of near-extremal black p-branes, Nucl. Phys. B 475 (1996) 164,
hep-th/9604089.
[73] M. Henningson, K. Skenderis, The holographic Weyl anomaly, JHEP 9807 (1998) 023, hep-th/9806087.
[74] K. Intriligator, Maximally supersymmetric RG flows and AdS duality, Nucl. Phys. B 580 (2000) 99.
[75] E. Bergshoeff, M.J. Duff, C.N. Pope, E. Sezgin, Supersymmetric supermembrane vacua and singletons,
Phys. Lett. B 199 (1987) 69.
[76] E. Bergshoeff, M.J. Duff, C.N. Pope, E. Sezgin, Compactifications of the eleven-dimensional supermem-
brane, Phys. Lett. B 224 (1989) 71.
[77] M.J. Duff, C.N. Pope, E. Sezgin, A stable supermembrane vacuum with a discrete spectrum, Phys. Lett.
B 225 (1989) 319.
[78] J. Maldacena, H. Ooguri, Strings in AdS3 and the SL(2, R) WZW model. Part 1: The spectrum, J. Math.
Phys. 42 (2001) 2929, hep-th/0001053.
[79] E. Witten, Some comments on string dynamics, in: STRINGS 95: Future Perspectives in String Theory,
hep-th/9507121.
[80] A. Strominger, Open p-branes, Phys. Lett. B 383 (1996) 44, hep-th/9512059.
[81] E. Witten, Five-branes and M-theory on an orbifold, Nucl. Phys. B 463 (1996) 383, hep-th/9512219.
[82] X. Bekaert, M. Henneaux, A. Sevrin, Deformations of chiral two-forms in six dimensions, Phys. Lett. B 468
(1999) 228, hep-th/9909094.
[83] P.S. Howe, E. Sezgin, D = 11, p = 5, Phys. Lett. B 394 (1997) 62, hep-th/9611008.
[84] P.S. Howe, E. Sezgin, P.C. West, Covariant field equations of the M-theory five-brane, Phys. Lett. B 399
(1997) 49, hep-th/9702008.
[85] M. Cederwall, B.E.W. Nilsson, P. Sundell, An action for the super-5-brane in D = 11 supergravity,
JHEP 9804 (1998) 007, hep-th/9712059.
[86] E. Bergshoeff, E. Sezgin, P.K. Townsend, Supermembranes and eleven-dimensional supergravity, Phys. Lett.
B 189 (1987) 75.
[87] S.K. Gandhi, K.S. Stelle, Vanishing of the supermembrane partition function, Class. Quantum Grav. 5 (1988)
L127.
[88] P.K. Townsend, Three lectures on supermembranes, in: M. Green, M. Grisaru, R. Iengo, E. Sezgin,
A. Strominger (Eds.), Superstrings’88, World Scientific, Singapore, 1989.
[89] F. Bastianelli, S. Frolov, A.A. Tseytlin, Conformal anomaly of (2, 0) tensor multiplet in six dimensions and
AdS/CFT correspondence, JHEP 0002 (2000) 013, hep-th/0001041.
[90] G. Arutyunov, E. Sokatchev, Implications of superconformal symmetry for interacting (2, 0) tensor
multiplets, hep-th/0201145.
[91] J. Engquist, P. Sundell, E. Sezgin, N = 1 superspace formulation of D = 4, N = 1, 2, 4, 8 higher spin gauge
theories, to appear.
[92] S. Prokushkin, M. Vasiliev, Higher spin gauge interactions for massive matter fields in 3D AdS spacetime,
Nucl. Phys. B 545 (1999) 385, hep-th/9806236.
370 E. Sezgin, P. Sundell / Nuclear Physics B 644 (2002) 303–370
[93] S. Prokushkin, A. Segal, M. Vasiliev, Coordinate-free action for AdS3 higher-spin–matter systems, Phys.
Lett. B 478 (2000) 333, hep-th/9912280342.
[94] M. Günaydin, AdS/CFT dualities and the unitary representations of non-compact groups and supergroups:
Wigner versus Dirac, hep-th/0005168.
Nuclear Physics B 644 (2002) 371–382
www.elsevier.com/locate/npe
Received 6 May 2002; received in revised form 20 August 2002; accepted 23 August 2002
Abstract
The Ekpyrotic scenario assumes that our visible Universe is a boundary brane in a five-dimensional
bulk and that the hot Big Bang occurs when a nearly supersymmetric five-brane travelling along
the fifth dimension collides with our visible brane. We show that the generation of isocurvature
perturbations is a generic prediction of the Ekpyrotic Universe. This is due to the interactions in the
kinetic terms between the brane modulus parametrizing the position of the five-brane in the bulk and
the dilaton and volume moduli. We show how to separate explicitly the adiabatic and isocurvature
modes by performing a rotation in field space. Our results indicate that adiabatic and isocurvature
perturbations might be cross-correlated and that curvature perturbations might be entirely seeded by
isocurvature perturbations.
© 2002 Elsevier Science B.V. All rights reserved.
1. Introduction
The Ekpyrotic cosmology [1,2] has recently received a great deal of attention because
it represents an alternative to inflationary cosmology [3]. It addresses the cosmological
horizon, flatness and monopole problems and generates a spectrum of density perturbations
without invoking any superluminal expansion. Being realized in the context of eleven-
dimensional heterotic M-theory, the Ekpyrotic scenario is also based on strong particle
physics grounds. It assumes that our visible Universe is a boundary brane in the five-
dimensional bulk obtained by compactifying six of the eleven dimensions on a Calabi–Yau
manifold. The hot Big Bang occurs when a nearly supersymmetric five-brane travelling
along the fifth dimension collides with our visible brane. The five-brane is attracted towards
the boundary brane where we live by an inter-brane potential which is exponentially
0550-3213/02/$ – see front matter 2002 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 ( 0 2 ) 0 0 7 6 5 - 4
372 A. Notari, A. Riotto / Nuclear Physics B 644 (2002) 371–382
suppressed when the two branes are far apart. The dynamics of the system is described
from the four-dimensional point of view in terms of a scalar field ϕ, the brane modulus,
parametrizing the separation between the two branes.
The branes are assumed to start widely separated almost at rest and the four-dimensional
observer experiences a contraction of the cosmological scale factor characterized by a
singularity at the time when the two branes hit each other. Since the Hubble radius contracts
faster than comoving scales, microscopic sub-horizon fluctuations during the contracting
phase may well produce curvature perturbations on cosmological scales today [1,4]. The
spectrum of these adiabatic perturbations was claimed to be scale-independent if the
brane modulus potential scales as $e^{-c\varphi/M_p}$ with $c \gg 1$. This finding has been recently
challenged in a series of papers [5–10]. The difficulty in determining the final spectrum
of perturbations arises from the fact that there is no prescription on how to match the
perturbations generated in a contracting phase across the ‘bounce’ to those in the expanding
hot Big Bang phase when radiation dominates.1 For instance, if the matching is performed
at ρ + δρ = const hypersurfaces, the Ekpyrotic scenario leads to a blue spectrum with
spectral index for scalar perturbations $n_s = 3$. However, if the matching hypersurface is
chosen differently, the authors of Ref. [12] claim that one may obtain a flat spectrum
with spectral index $n_s = 1$.
In this paper we wish to take a modest step and observe that in the Ekpyrotic scenario
both adiabatic and isocurvature perturbations may be generated during the contraction
phase when the five-brane slowly approaches our visible world. A cross-correlation
between entropy and curvature perturbations may be left imprinted. We will also show
that curvature perturbations may be entirely sourced by isocurvature perturbations, thus
providing a way to produce a scale-invariant spectrum of adiabatic perturbations, at least
before the bounce.
The paper is organized as follows. In Section 2 we briefly summarize how the effective
action for the five-brane modulus is derived emphasizing the coupling between the five-
brane modulus and the dilaton and the volume modulus in the kinetic term. This coupling
is responsible for the generation of entropy modes. In Section 3 we describe the generation
of adiabatic and isocurvature perturbations using the powerful technique of rotation in
field space and obtaining the equation for the gravitational potential in terms of properly
defined adiabatic and isocurvature fields. Finally, we end with some concluding remarks in
Section 4.
1 At present, there is no known mechanism to reverse the bounce. This problem has been circumvented by
adopting the so-called ‘cyclic’ model where the two colliding branes are the boundary branes [11].
2 In the chiral version of the theory, where the dilaton field is described by the real part Re S of the scalar
component of the chiral multiplet S and the five-brane modulus belongs to a chiral multiplet
$\hat S$, the kinetic term (2.1) can be understood as coming from the Kähler potential
$$K = -\log\left(S + \bar S - \frac{\tau}{16}\,\frac{(\hat S + \bar{\hat S})^2}{T + \bar T}\right) - 3\log(T + \bar T).$$
brane induces a nontrivial dynamics of the dilaton and volume moduli since solutions
with exactly constant β and χ do not exist. Similarly, the perturbations of the five-brane
modulus, of the dilaton and of the volume modulus might mix as soon as the five-brane
moves. This implies not only that both adiabatic and isocurvature perturbations may be
created, but also that they might be cross-correlated and that isocurvature perturbations
may seed curvature perturbations on super-horizon scales.
3. Cosmological perturbations
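Scalar field perturbations, with comoving wavenumber k = 2πa/λ for a mode with
physical wavelength λ, then obey the perturbation equations
$$\delta\ddot\varphi + 3H\,\delta\dot\varphi + \frac{k^2}{a^2}\,\delta\varphi + V_{\varphi\varphi}\,e^{\alpha\chi}\,\delta\varphi - 4\dot\varphi\,\dot\Phi - 2V_\varphi\,e^{\alpha\chi}\,\Phi + \alpha\left(\dot\chi\,\delta\dot\varphi + \dot\varphi\,\delta\dot\chi\right) - \alpha V_\varphi\,e^{\alpha\chi}\,\delta\chi = 0 \tag{3.6}$$
and
$$\delta\ddot\chi + 3H\,\delta\dot\chi + \frac{k^2}{a^2}\,\delta\chi + V_{\chi\chi}\,\delta\chi - 4\dot\chi\,\dot\Phi - 2V_\chi\,\Phi - \frac{\alpha^2}{2}\,\dot\varphi^2 e^{-\alpha\chi}\,\delta\chi + \alpha\,\dot\varphi\,\delta\dot\varphi\,e^{-\alpha\chi} = 0. \tag{3.7}$$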
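The perturbed Einstein equations read
$$\ddot\Phi + 4H\dot\Phi + \left(2\dot H + 3H^2\right)\Phi = \frac{4\pi}{M_p^2}\,\delta p, \tag{3.8}$$
$$\dot\Phi + H\Phi = -\frac{4\pi}{M_p^2}\,\delta q, \tag{3.9}$$
$$3H\dot\Phi + 3H^2\Phi + \frac{k^2}{a^2}\,\Phi = -\frac{4\pi}{M_p^2}\,\delta\rho, \tag{3.10}$$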
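where the total energy and momentum perturbations are given by [25]
$$\delta p = \delta T^i_i = \dot\chi\,\delta\dot\chi + \dot\varphi\,\delta\dot\varphi\,e^{-\alpha\chi} - \frac{\alpha}{2}\,\dot\varphi^2 e^{-\alpha\chi}\,\delta\chi - \Phi\left(\dot\chi^2 + \dot\varphi^2 e^{-\alpha\chi}\right) - V_\varphi\,\delta\varphi - V_\chi\,\delta\chi, \tag{3.11}$$
$$\delta\rho = \delta T^0_0 = \dot\chi\,\delta\dot\chi + \dot\varphi\,\delta\dot\varphi\,e^{-\alpha\chi} - \frac{\alpha}{2}\,\dot\varphi^2 e^{-\alpha\chi}\,\delta\chi - \Phi\left(\dot\chi^2 + \dot\varphi^2 e^{-\alpha\chi}\right) + V_\varphi\,\delta\varphi + V_\chi\,\delta\chi, \tag{3.12}$$
$$\delta q = \delta T^0_i = -\left(\dot\chi\,\delta\chi + \dot\varphi\,\delta\varphi\,e^{-\alpha\chi}\right). \tag{3.13}$$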
We obtain a useful relation if we multiply Eq. (3.9) by −3H and add it to Eq. (3.10)
[26]:
$$\frac{k^2}{a^2}\,\Phi = -\frac{4\pi}{M_p^2}\,\epsilon_m, \tag{3.14}$$
where
$$\epsilon_m \equiv \delta\rho - 3H\,\delta q \tag{3.15}$$
is used to represent the total matter perturbation [25].
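The step from Eqs. (3.9) and (3.10) to Eq. (3.14) is pure algebra, and it can be checked numerically. The following is a minimal sketch, not part of the original derivation: the function names and the sample values are hypothetical, and the check simply solves (3.9) and (3.10) for δq and δρ, forms the combination δρ − 3Hδq of Eq. (3.15), and evaluates the residual of Eq. (3.14).

```python
import math

def comoving_density(H, Phi, dPhi, k, a, Mp):
    """Solve Eqs. (3.9) and (3.10) for delta_q and delta_rho, then form Eq. (3.15).

    All arguments are plain numbers; the names are illustrative only.
    """
    pref = 4.0 * math.pi / Mp**2
    delta_q = -(dPhi + H * Phi) / pref                                  # Eq. (3.9)
    delta_rho = -(3 * H * dPhi + 3 * H**2 * Phi + (k / a)**2 * Phi) / pref  # Eq. (3.10)
    return delta_rho - 3 * H * delta_q                                  # Eq. (3.15)

def poisson_residual(H, Phi, dPhi, k, a, Mp):
    """Left-hand side minus right-hand side of Eq. (3.14); should vanish."""
    eps_m = comoving_density(H, Phi, dPhi, k, a, Mp)
    return (k / a)**2 * Phi + 4.0 * math.pi / Mp**2 * eps_m

if __name__ == "__main__":
    # Arbitrary sample values for a contracting background (H < 0).
    print(poisson_residual(H=-0.7, Phi=0.3, dPhi=-0.2, k=2.0, a=1.5, Mp=1.0))
```

The terms proportional to the time derivative of Φ and to H²Φ cancel between the two constraints, which is why only the gradient term survives in Eq. (3.14).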
Let us now turn to isocurvature perturbations. When two scalar fields are present
in the dynamics of the system, isocurvature perturbations are generated [25]. In our
case, inserting the expressions for the pressure and the energy density as well as their
In order to clarify the role of adiabatic and entropy perturbations, their evolution and
their interconnection, from now on we will follow Ref. [26] and define new adiabatic and
entropy fields by a rotation in field space. The “adiabatic field”, σ , represents the path
length along the classical trajectory, such that
σ̈ + 3H σ̇ + Vσ = 0, (3.19)
where
Armed with these definitions, we are now ready to compute the equation of motion of
the gravitational potential Φ. A straightforward computation leads to
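$$\ddot\Phi + \left(H - 2\,\frac{\ddot\sigma}{\dot\sigma}\right)\dot\Phi + \left(2\dot H - 2H\,\frac{\ddot\sigma}{\dot\sigma}\right)\Phi + \frac{k^2}{a^2}\,\Phi = -\frac{8\pi}{M_p^2}\,V_s\,\delta s. \tag{3.24}$$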
This equation makes manifest the neat separation of the roles played by the adiabatic and
isocurvature perturbations. Indeed, on the left-hand side only the adiabatic field σ appears
while on the right-hand side there is a source which is proportional only to the relative
entropy perturbations δs between the brane-modulus and the dilaton field.
Eq. (3.24) shows explicitly how on large scales entropy perturbations can source the
gravitational potential in the Ekpyrotic cosmology. When the five-brane slowly approaches
the boundary brane and the brane modulus decreases in time, perturbations in the brane
modulus, in the dilaton field and in the gravitational potential are generated. Furthermore,
adiabatic and isocurvature perturbations are inevitably correlated. These results are similar
to what is found in standard inflation when two or more scalar fields are present [26,28–31].
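Using now $\dot V = V_\sigma\dot\sigma$, $\delta V = V_\sigma\,\delta\sigma + V_s\,\delta s$ and
$$\dot\chi\,\delta\dot\chi + \dot\varphi\,\delta\dot\varphi\,e^{-\alpha\chi} - \frac{\alpha}{2}\,\dot\varphi^2 e^{-\alpha\chi}\,\delta\chi = \dot\sigma\,\delta\dot\sigma + V_s\,\delta s, \tag{3.25}$$
we can write the isocurvature source S in Eq. (3.16) as
$$S = 2\,\frac{(V_\sigma + 3H\dot\sigma)(V_\sigma\,\delta\sigma + V_s\,\delta s) + V_\sigma(\dot\sigma\,\delta\dot\sigma + V_s\,\delta s) - \Phi V_\sigma\dot\sigma^2}{3(2V_\sigma + 3H\dot\sigma)\dot\sigma^2}. \tag{3.26}$$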
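Note that this expression is the same as in the standard case α = 0 [26]. Using Eq. (3.19), we can rewrite the last
expression as
$$S = 2\,\frac{-\ddot\sigma\,(V_\sigma\,\delta\sigma + V_s\,\delta s) + V_\sigma(\dot\sigma\,\delta\dot\sigma + V_s\,\delta s) - \Phi V_\sigma\dot\sigma^2}{3(2V_\sigma + 3H\dot\sigma)\dot\sigma^2} = 2\,\frac{V_\sigma(\dot\sigma\,\delta\dot\sigma - \Phi\dot\sigma^2 - \ddot\sigma\,\delta\sigma) + V_s\,\delta s\,(V_\sigma - \ddot\sigma)}{3(2V_\sigma + 3H\dot\sigma)\dot\sigma^2}. \tag{3.27}$$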
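Making use of Eq. (3.14) we can express $\epsilon_m$ in terms of the new fields δσ and δs [26]
$$\epsilon_m = \dot\sigma\,(\delta\dot\sigma - \dot\sigma\Phi) - \ddot\sigma\,\delta\sigma + 2V_s\,\delta s. \tag{3.28}$$
Finally, using Eqs. (3.14) and (3.28) we get
$$S = 2\,\frac{V_\sigma\,\epsilon_m - 2V_\sigma V_s\,\delta s + V_s\,\delta s\,(2V_\sigma + 3H\dot\sigma)}{3(2V_\sigma + 3H\dot\sigma)\dot\sigma^2} = -\frac{M_p^2\,V_\sigma\,\frac{k^2}{a^2}\,\Phi}{6\pi(2V_\sigma + 3H\dot\sigma)\dot\sigma^2} + \frac{2H\,V_s\,\delta s}{(2V_\sigma + 3H\dot\sigma)\dot\sigma}. \tag{3.29}$$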
Eq. (3.24) might be rewritten as [25]
$$\dot{\mathcal R} = -3H\,\frac{\dot p}{\dot\rho}\,S. \tag{3.30}$$
The change in the curvature perturbation on large scales can therefore be directly related
to the nonadiabatic part of the pressure perturbation [25].
Clearly, there can be significant changes in the gravitational potential on large scales
and a large cross-correlation between the adiabatic and the isocurvature modes only if the
entropy perturbation is not suppressed. The next step is therefore to compute the equation
of motion of the entropy field δs. The computation is lengthy, but straightforward. One
finds
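$$\delta\ddot s + 3H\,\delta\dot s + \left(\frac{k^2}{a^2} + V_{ss} - \dot\theta^2 + \Gamma_\alpha\right)\delta s = -\frac{V_s}{\dot\sigma^2}\,\frac{M_p^2\,k^2}{2\pi a^2}\,\Phi, \tag{3.31}$$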
where we have defined
($p \ll 1$). It is easy to show that $|\dot\chi/\dot\varphi| = |\alpha M_p\,p/2| \ll 1$. The adiabatic field σ is
given by exp(−αχ/2)ϕ and the solution to the homogeneous part of the equation for
the gravitational potential (3.24) leads to $\Phi = A(k)\,H/a + B(k)$ at superhorizon scales in
the collapsing phase. The A-growing mode has a scale-independent spectrum, i.e. $|A|^2 k^3$ is
k-independent, while the constant mode has a blue spectrum. One then needs to match this
solution to the usual (approximately constant) gravitational potential in the radiation era.
If the matching from the collapsing phase to the radiation era is performed on constant
energy surfaces, the gravitational potential in the radiation era inherits the blue spectrum
from the constant mode in the collapsing phase. However, a nonzero surface tension—
provided by some high-energy theory ingredient—is needed to go through the bounce.
This implies that the transition surface need not be a constant energy surface [12].
For instance, imposing the matching on a surface where the shear vanishes, one finds that the
gravitational potential in the radiation phase inherits the flat spectrum of A [12]. On top
of that, a cross-correlation between adiabatic and isocurvature modes is generated before
the collapse with $C_{\Phi\delta s} \propto \alpha\,p\,M_p\,k^2 = O(p)\,k^2$, i.e., a blue spectrum.
If $|\dot\chi/\dot\varphi| \gg 1$ and $V(\chi) = -M^4\exp(-c\chi/M_p)$ with $M^4 \gg V_0 \simeq 0$, one obtains a
sort of modified version of the pre-Big Bang model [32] with a potential for the dilaton
field. The adiabatic field σ is identified with the dilaton field χ and the final spectrum of
curvature perturbations is flat if $c = \sqrt{3}$ and the matching through the bounce is done onto
a constant energy surface [33].3 Even in this case a cross-correlation between adiabatic and
isocurvature modes is present. Yet, it is suppressed.
A much more interesting situation is realized if the brane-modulus potential is very tiny,
$V_0 \simeq 0$, and $V(\chi) = -M^4\exp(-\beta\chi/M_p)$, with M some mass scale. Going to conformal
time dτ = dt/a and integrating Eq. (3.4), one finds $\varphi' = C\exp(\alpha\chi)/a^2$, where primes
mean derivatives with respect to conformal time τ and C is an integration constant.
The ansatz $a(\tau) = (-M_p(1-q)\tau)^{q/(1-q)}$ and $\chi(\tau) = A\ln(-M_p(1-q)\tau)$ satisfies the
equations of motion if $\alpha A = 2(3q-1)/(1-q)$, $\beta A/M_p = 2/(1-q)$ and $\alpha M_p/\beta = 3q-1$.
Suppose now that the energy density of the system is dominated by the kinetic term
of the brane-modulus. This will be the case if, for instance, $A \ll C/M_p$. After introducing
the new variable δS = aδs, Eq. (3.31) reduces to
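$$\delta S'' + \left(k^2 - \frac{a''}{a} - \frac{\alpha^2}{4}\,\varphi'^2 e^{-\alpha\chi}\right)\delta S = 0. \tag{3.37}$$
Since $C^2 \simeq 2q\,M_p^4$ and $\alpha = \sqrt{2}/M_p$, one finds
$$\delta S'' + \left(k^2 - \frac{2q^2}{(1-q)^2\,\tau^2}\right)\delta S = 0. \tag{3.38}$$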
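The reduction of the bracket in Eq. (3.37) to the 1/τ² term of Eq. (3.38) can be verified numerically from the quoted ansatz. The sketch below is an illustration under stated assumptions: it takes the prime-restored relation ϕ′ = C exp(αχ)/a², the values C² = 2qM_p⁴ and α = √2/M_p, and the condition αA = 2(3q − 1)/(1 − q) from the text, and it evaluates a″ by a central finite difference. All function names are hypothetical.

```python
import math

def scale_factor(tau, q, Mp=1.0):
    """Ansatz a(tau) = (-Mp (1 - q) tau)^(q / (1 - q)), valid for tau < 0."""
    return (-Mp * (1.0 - q) * tau) ** (q / (1.0 - q))

def effective_mass_term(tau, q, Mp=1.0, h=1e-3):
    """a''/a + (alpha^2 / 4) phi'^2 exp(-alpha chi): the k-independent part of Eq. (3.37)."""
    a = scale_factor(tau, q, Mp)
    # Central finite difference for the second conformal-time derivative of a.
    a_pp = (scale_factor(tau + h, q, Mp) - 2.0 * a + scale_factor(tau - h, q, Mp)) / h**2
    alpha = math.sqrt(2.0) / Mp
    c_squared = 2.0 * q * Mp**4
    alpha_A = 2.0 * (3.0 * q - 1.0) / (1.0 - q)
    exp_alpha_chi = (-Mp * (1.0 - q) * tau) ** alpha_A
    # phi' = C exp(alpha chi) / a^2, hence phi'^2 exp(-alpha chi) = C^2 exp(alpha chi) / a^4.
    kinetic = 0.25 * alpha**2 * c_squared * exp_alpha_chi / a**4
    return a_pp / a + kinetic

def target_term(tau, q):
    """The 1/tau^2 coefficient appearing in Eq. (3.38)."""
    return 2.0 * q**2 / ((1.0 - q)**2 * tau**2)

if __name__ == "__main__":
    for q in (0.3, 0.5, 0.7):
        print(q, effective_mass_term(-2.0, q), target_term(-2.0, q))
```

Analytically, a″/a contributes q(2q − 1)/((1 − q)²τ²) and the kinetic term contributes q/((1 − q)²τ²), which add up to the coefficient in Eq. (3.38).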
A nearly scale-invariant spectrum for the entropy perturbations is obtained if $2q^2/(1-q)^2 \simeq 2$,
or $q \simeq 1/2$. This is a desirable outcome since it means that adiabatic perturbations are entirely
sourced by entropy perturbations, inducing a flat spectrum for curvature perturbations with
maximum cross-correlation.
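The condition q ≃ 1/2 can be illustrated with the standard result for mode equations of the form u″ + (k² − (ν² − 1/4)/τ²)u = 0, whose late-time power spectrum scales as k^(3−2ν). This textbook relation is an assumption brought in here for illustration and is not spelled out in the text; the function names are hypothetical. A minimal sketch:

```python
import math

def bessel_index(q):
    """Index nu defined by nu^2 - 1/4 = 2 q^2 / (1 - q)^2, matching Eq. (3.38)."""
    return math.sqrt(0.25 + 2.0 * q**2 / (1.0 - q)**2)

def entropy_tilt(q):
    """Spectral tilt 3 - 2 nu of the entropy perturbation; zero means scale invariance."""
    return 3.0 - 2.0 * bessel_index(q)

if __name__ == "__main__":
    for q in (0.4, 0.5, 0.6):
        print(q, entropy_tilt(q))
```

In this sketch the tilt vanishes at q = 1/2, is positive (blue) for smaller q and negative (red) for larger q, in line with the statement that 2q²/(1 − q)² ≃ 2 gives a nearly flat spectrum.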
3 If the matching is done using the prescription discussed in Ref. [12], the spectrum is flat if $c \gg 1$.
4. Concluding remarks
Let us conclude with some comments. The generation of adiabatic plus isocurvature
perturbations and their cross-correlation we have described in this paper occur in the
Ekpyrotic scenario before the moving five-brane collapses onto the boundary brane.
Isocurvature perturbations might not survive after the bounce if during the subsequent
period of reheating both the brane modulus and the dilaton decay into the same species.
In order to have isocurvature perturbations deep in the radiation era after the collapse it
is necessary to have at least one nonzero isocurvature perturbation $S_{\alpha\beta} \equiv \delta_\alpha/(1+w_\alpha) - \delta_\beta/(1+w_\beta) \neq 0$,
where $\delta_\alpha = \delta\rho_\alpha/\rho_\alpha$ and $w_\alpha = p_\alpha/\rho_\alpha$ (the ratio of the pressure to the
energy density) for some components α and β of the system. This may happen if the
fields responsible for the isocurvature perturbations decay into radiation and cold dark
matter at different epochs. On the other hand, we have seen that curvature perturbations
may be entirely seeded by isocurvature perturbations, thus providing a novel mechanism
to produce a scale-independent spectrum of adiabatic perturbations in the Ekpyrotic
Universe.
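As a small illustration of the relative isocurvature perturbation S_αβ = δ_α/(1 + w_α) − δ_β/(1 + w_β) quoted above, the sketch below evaluates it for cold dark matter (w = 0) and radiation (w = 1/3). The function name and the sample density contrasts are hypothetical and chosen only to show when the quantity vanishes.

```python
def relative_isocurvature(delta_a, w_a, delta_b, w_b):
    """S_ab = delta_a / (1 + w_a) - delta_b / (1 + w_b) for two fluid components."""
    return delta_a / (1.0 + w_a) - delta_b / (1.0 + w_b)

if __name__ == "__main__":
    w_cdm, w_rad = 0.0, 1.0 / 3.0
    delta_rad = 4.0e-5
    # Adiabatic case: delta_cdm = (3/4) delta_rad, so S vanishes.
    print(relative_isocurvature(0.75 * delta_rad, w_cdm, delta_rad, w_rad))
    # Any other ratio leaves a nonzero isocurvature perturbation.
    print(relative_isocurvature(5.0e-5, w_cdm, delta_rad, w_rad))
```

The first call corresponds to a purely adiabatic mixture, the second to a case where the two species carry different perturbations, for example because their parent fields decayed at different epochs.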
All previous discussions make it clear that the features of the cosmological perturbations
after the Big Bang as well as the way reheating takes place depend strongly on the details of
the transition from the collapsing to the expanding phase when the five-brane is absorbed
by the boundary brane. In this absorption process the degree of freedom represented by the
brane modulus gets replaced by new degrees of freedom.
In eleven-dimensional M-theory, E8 × E8 gauge fields with strength $F_{z_i z_j}$ living on
the boundary of the eleventh dimension and in the six-dimensional Calabi–Yau manifold
($z_i$ with i = 1, 2, 3 are the complex coordinates of such a manifold) satisfy equations of
motion of the type $F_{zz} = F_{\bar z\bar z} = g^{z\bar z}F_{z\bar z} = 0$. Gauge field configurations satisfying these
equations are generically called instantons (for instance, if the Calabi–Yau manifold is
a two-dimensional torus times a four-dimensional variety K3, one gets the traditional
quantum field theory instanton equations $F = \pm\tilde F$). In nonstandard compactifications of
M-theory with a certain number N of five-branes, one schematically obtains the following
constraint
$$N + \int F\wedge F = \text{const}, \tag{4.1}$$
that is, the number of five-branes plus the “number” of instantons given by $\int F\wedge F$ is
conserved. From Eq. (4.1) one can infer that in M-theory it is possible to replace one five-
brane with one instanton and vice versa. An instanton with $\int F\wedge F = a$ can shrink to zero
size, becoming a so-called small instanton with $F\wedge F = a\,\delta(z_i)$, and leave the boundary
brane along the eleventh dimension under the form of a five-brane [34]. A unit of instanton
flux is replaced by a unit of five-brane flux still satisfying the constraint (4.1).
This phenomenon is similar to what happens in Type IIA string theory, where D4-branes
may get emitted or absorbed by a set of D8-branes plus an O8-orientifold. A crucial
role for describing this phenomenon is played by the strings stretched between the D4-
brane and the (D8 + O8) system. If they are massive, i.e., if their length is nonzero,
it means that the D4-brane is in the bulk away from the (D8 + O8) system. On the
contrary, if the strings are tensionless, their length is zero and the D4-brane touches the
(D8 + O8) system. The transition may be described from the four-dimensional point of
view as a Higgs mechanism. The D4-brane can be described by an N = 2 vector multiplet
{Wα , φ} while the strings stretched between the D4-brane and the (D8 + O8) system are
described in terms of some hypermultiplets Y . They appear as particles on the D4-brane.
Upon compactification to four-dimensions the Lagrangian (in the N = 1 supersymmetry
language) can be written as
$$\int d^2\theta\,\big(W_\alpha^2 + \tilde Y\phi\,Y\big) + \int d^4\theta\,\big(\bar\phi\phi + \bar Y Y\big). \tag{4.2}$$
When ⟨φ⟩ ≠ 0, the Y-field is massive. This means that the strings between the D4-brane
and the (D8 + O8) system have a nonvanishing length (or tension): the D4-brane is in the
bulk. At the transition point ⟨Y⟩ = ⟨φ⟩ = 0, tensionless (massless) strings appear in the
spectrum and the D4-brane is absorbed by the (D8 + O8) system. This transition gives
rise to gauge field configurations whose moduli space contains a number of free parameters
matching the number of the degrees of freedom before the absorption. Some of the Y -fields
are now interpreted as instanton moduli.
Going back to the case of the Ekpyrotic scenario, a crucial role is played by two-
dimensional branes, membranes, stretched between the boundary brane and the slowly
approaching five-brane [35]. These membranes are massive when the five-brane is in the
bulk and the vacuum expectation value of the brane modulus is nonzero. When the five-
brane touches the boundary brane, these membranes have zero length. Therefore, one might
hope to describe the transition from the five-brane to the small instanton in terms of a four-
dimensional Higgs mechanism as done for Type IIA string theory. The problem is that
the massless membranes appear as tensionless strings in the five-brane world-volume even
after compactifying down to four-dimensions. At the transition point the relevant degrees
of freedom of the theory are therefore tensionless strings, the anti-self-dual tensors of the
five-brane, the brane modulus and some instanton gauge field configurations. They are the
fundamental degrees of freedom which allow the description of the system during the last
moments before the collision and the subsequent Big Bang. In terms of these degrees of
freedom the theory does not admit a simple and perturbative four-dimensional description.
Nevertheless, there might be cases in which the system is tractable. One might hope to
make some progress if the transition starts when the five-brane is sufficiently far from the
boundary brane and the membranes—or better the corresponding strings—are sufficiently
massive to admit a description in terms of four-dimensional scalar fields Y . Work along
these lines is in progress [36].
Acknowledgements
References
[1] J. Khoury, B.A. Ovrut, P.J. Steinhardt, N. Turok, Phys. Rev. D 64 (2001) 123522.
Abstract
We study the axial anomaly defined on a finite-size lattice by using a Dirac operator which obeys
the Ginsparg–Wilson relation. When the gauge group is U(1), we show that the basic structure of
axial anomaly on the infinite lattice, which can be deduced by a cohomological analysis, persists even
on (sufficiently large) finite-size lattices. For non-Abelian gauge groups, we propose a conjecture on
a possible form of axial anomaly on the infinite lattice, which holds to all orders in perturbation
theory. With this conjecture, we show that a structure of the axial anomaly on finite-size lattices is
again basically identical to that on the infinite lattice. Our analysis with the Ginsparg–Wilson–Dirac
operator indicates that, in appropriate frameworks, the basic structure of axial anomaly is quite robust
and it persists even in a system with finite ultraviolet and infrared cutoffs.
© 2002 Elsevier Science B.V. All rights reserved.
Keywords: Renormalization; Regularization and renormalons; Lattice gauge field theories; Gauge symmetry;
Anomalies in field and string theories
1. Introduction
In Ref. [1], Lüscher pointed out that a cohomological analysis can be used to determine
a basic structure of the axial anomaly in Abelian gauge theories with finite lattice spacings.
This work paved the way to studying the axial anomaly in a system with a finite ultraviolet cutoff,
and the technique was then applied to various cases [2–6]. The crucial properties which
make this analysis possible are the locality, the gauge invariance and a topological property
of the axial anomaly. The axial anomaly defined by the gauge covariant Dirac operator [7,8]
which satisfies the Ginsparg–Wilson relation [9], especially the overlap-Dirac operator [8],
0550-3213/02/$ – see front matter 2002 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 ( 0 2 ) 0 0 8 1 2 - X
384 H. Igarashi et al. / Nuclear Physics B 644 (2002) 383–394
in fact possesses the required properties [10–14]. A further elaborate analysis with this
recognition finally led to a non-perturbative construction of anomaly-free Abelian chiral
gauge theories on the lattice [15].
The cohomological analysis, however, is limited to the case of a lattice with an infinite
size. A direct cohomological analysis for finite-size lattices is not feasible because:
(i) The analysis is based on the lattice Poincaré lemma [1], which is a lattice analogue of
the Poincaré lemma valid on $\mathbb{R}^d$. When the topology of the lattice is non-trivial
(as is the case for the periodic lattice), one expects a non-trivial d-cohomology on the
lattice.
(ii) The cohomology relevant to an analysis of axial anomaly is a local cohomology, for
which the concept of the locality is vital. The meaning of the locality, however, is not
clear on a lattice with a finite size because a Dirac operator which obeys the Ginsparg–
Wilson relation has to have exponentially decaying tails [16,17].
In this paper, we study the axial anomaly defined on a finite-size lattice by using the
Ginsparg–Wilson–Dirac operator. This analysis provides an approach to the axial anomaly
in a system with ultraviolet and infrared cutoffs. As already noted, a direct generalization of
the technique of Ref. [1] is not feasible. Instead, we point out that it is possible to determine
the structure of axial anomaly using an argument similar to that of Ref. [15] at least in
Abelian gauge theories. For non-Abelian theories, we propose a conjecture on a possible
form of axial anomaly on the infinite lattice, which is correct within perturbation theory.
Under this conjecture, a similar argument can be applied to non-Abelian cases too. These
results indicate that the structure of axial anomaly is quite robust even with ultraviolet
and infrared cutoffs in appropriate formulations (in the present case, a formulation based
on the Ginsparg–Wilson relation). We consider an even-dimensional lattice Γ whose size
is L, $\Gamma = \{x \in \mathbb{Z}^d \mid 0 \le x_\mu < L\}$, and the gauge field $U(x,\mu) \in G$ (G is the gauge group)
is assumed to be periodic on Γ, $U(x + L\hat\nu, \mu) = U(x,\mu)$.1 The lattice spacing a is set to
be unity, except when the classical continuum limit is considered.
2. Preliminaries
The axial anomaly for the Ginsparg–Wilson–Dirac operator is defined by (see, for
example, Refs. [11,12] for the background)
$$A(x) = \operatorname{tr}\,\gamma_{d+1}\Big[1 - \tfrac{1}{2}D(x,x)\Big]. \tag{2.1}$$
The kernel of the Dirac operator D(x, y) satisfies the Ginsparg–Wilson relation
The salient feature of A(x) is a lattice analogue of the analytic index theorem [10]
$$\sum_{x\in\Gamma} A(x) = n_+ - n_-, \tag{2.3}$$
which follows from the algebraic relation (2.2) alone; here n+ (n− ) is the number of zero-
modes of γd+1 D with the positive (negative) chirality. The index theorem (2.3) implies
that the Dirac operator cannot be a smooth function of the gauge field in general, because
the configuration space of lattice gauge field is arcwise connected and, barring a possibility
that n+ − n− is constant for all configurations, the integer n+ − n− jumps at certain points
in the configuration space. A sufficient condition for the smoothness of the overlap-Dirac
operator [8] is the admissibility [13,14]
$$\|1 - U(x,\mu,\nu)\| < \epsilon \quad \text{for all } x, \mu, \nu, \tag{2.4}$$
where U(x, µ, ν) is the plaquette variable and ε is a constant smaller than
$(2 - \sqrt{2})/d(d-1)$ [14].2 After imposing this admissibility, the space of allowed gauge field
configurations may have non-trivial topology. This condition also guarantees the locality
of the operator [13,14]
$$\|D(x,y)\| \le C\big(1 + \|x-y\|^p\big)\,e^{-\|x-y\|/\rho}, \tag{2.5}$$
where C and p are constants and ρ is a localization range of the Dirac operator. In
addition to the gauge covariance and the locality of the Dirac operator, we assume
that it has the same transformation law as the standard Wilson–Dirac operator under
discrete symmetries of the lattice (rotations, reflections, etc.). In particular, we require the
translational invariance, i.e., D(x, y) is identical to D(x + z, y + z) if the gauge field is
shifted at the same time U (x, µ) → U (x + z, µ).
Suppose that we have constructed a Dirac operator on a lattice with the size L. When
L → ∞, D(x, y) is promoted to a Dirac operator on the infinite lattice D(x, y) →
D ∞ (x, y). This operator also obeys the Ginsparg–Wilson relation
where δ denotes a local variation of the gauge field. This property can be shown from
the Ginsparg–Wilson relation (2.6) (see Ref. [2], for example). A∞ (x) is thus a local
topological gauge-invariant pseudoscalar field.4 When the gauge group is U(1), we can
then apply the cohomological analysis [1,2] to this quantity. The result is5
4 A field φ(x) is termed local, when its dependence on the gauge field at a point y is exponentially suppressed
as $\|x - y\| \to \infty$. For a more precise definition, see Ref. [1].
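5 $\partial_\mu$ and $\partial_\mu^*$ denote the forward and the backward difference operators, respectively:
$\partial_\mu f(x) = f(x+\hat\mu) - f(x)$ and $\partial_\mu^* f(x) = f(x) - f(x-\hat\mu)$.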
6 For the cohomological argument to apply, the constant ε in Eq. (2.4) has to be smaller than 1 and
$|F_{\mu\nu}(x)| < \pi/3$ [1].
axial anomaly which depend on the details of the Dirac operator adopted. Our aim in this
paper is to show or argue that the structure represented by Eqs. (2.11) and (2.12) persists
even on finite-size lattices and for general gauge groups G.
For the axial anomaly defined on a finite lattice (2.1), a direct cohomological analysis
is not feasible. Nevertheless, we can show the following
Theorem 3.1. When G = U(1), if the lattice is sufficiently large compared to the
localization range of the Dirac operator, say $L/\rho \ge n$,
We emphasize that, for a sufficiently large L, Eq. (3.1) is an exact statement for the axial
anomaly A(x). Eq. (3.2) shows that the current kµ (x) differs from the local current kµ∞ (x)
defined on the infinite lattice only by an exponentially small amount. Hence, when the
lattice size becomes large compared to ρ and thus when the concept of the locality becomes
meaningful, the current kµ (x) can be regarded as a local current. In this way, Eq. (3.1)
shows that the structure of axial anomaly on finite-size lattices is basically identical to that
on the infinite lattice (2.11). The validity of this theorem has been argued intuitively by
Chiu [25].
Proof. The configuration space of the gauge fields allowed by the admissibility (2.4)
consists of many components. Each component is uniquely characterized [15] by the
magnetic flux
$$m_{\mu\nu} = \frac{1}{2\pi i}\sum_{s,t=0}^{L-1} F_{\mu\nu}(x + s\hat\mu + t\hat\nu), \tag{3.3}$$
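which is an integer. For a configuration with the magnetic flux $m_{\mu\nu}$, from Eq. (2.12), one
has [26]
$$\sum_{x\in\Gamma} A^\infty(x) = \sum_{x\in\Gamma} q(x) = \frac{N(-1)^{d/2}}{2^{d/2}(d/2)!}\,\epsilon_{\mu_1\nu_1\cdots\mu_{d/2}\nu_{d/2}}\,m_{\mu_1\nu_1}m_{\mu_2\nu_2}\cdots m_{\mu_{d/2}\nu_{d/2}} = \text{an integer}, \tag{3.4}$$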
where the first equality follows from the translational invariance of $k_\mu^\infty(x)$ (namely, $k_\mu^\infty(x)$
is a periodic current on Γ when the gauge field is periodic).7 Combined with the index
theorem (2.3), we see that $\sum_{x\in\Gamma}A(x) - \sum_{x\in\Gamma}A^\infty(x)$ is an integer. This integer is,
however, bounded by an exponentially small quantity: from the assumed property (2.7),
one infers that
there exists a periodic current bµ (x) which is given by a sum of c(y), the precise meaning
of which is given in Eq. (3.9) below, such that
Applying this lemma to Eq. (3.6), we see that there exists a gauge-invariant periodic
current ∆kµ (x) such that A(x) − A∞ (x) = ∂µ∗ ∆kµ (x). This field is exponentially small,
$|\Delta k_\mu(x)| \le \kappa_1 L^{\nu_1} e^{-L/\rho}$; thus kµ (x) = kµ∞ (x) + ∆kµ (x), which proves the theorem. ✷
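The integrality used in the proof can be checked directly for small cases. The following is a minimal sketch, assuming that the right-hand side of Eq. (3.4) is the fully contracted Levi-Civita expression in the flux integers mµν with prefactor N(−1)^{d/2}/(2^{d/2}(d/2)!). The function names and the default normalization N = 1 are illustrative choices, not taken from the paper.

```python
from fractions import Fraction
from itertools import permutations
from math import factorial


def permutation_sign(perm):
    """Sign of a permutation of 0..d-1, obtained by counting inversions."""
    inversions = sum(
        1
        for i in range(len(perm))
        for j in range(i + 1, len(perm))
        if perm[i] > perm[j]
    )
    return -1 if inversions % 2 else 1


def flux_charge(m, normalization=1):
    """Evaluate N (-1)^{d/2} / (2^{d/2} (d/2)!) * eps * m * ... * m exactly.

    m is an antisymmetric d x d matrix of integers (d even), given as nested lists.
    """
    d = len(m)
    half = d // 2
    contraction = 0
    for perm in permutations(range(d)):
        term = permutation_sign(perm)
        for i in range(half):
            term *= m[perm[2 * i]][perm[2 * i + 1]]
        contraction += term
    prefactor = Fraction(normalization * (-1) ** half, 2 ** half * factorial(half))
    return prefactor * contraction


# d = 2 with m_12 = 3, and d = 4 with m_12 = 1, m_34 = 2 (all other fluxes zero)
m2 = [[0, 3], [-3, 0]]
m4 = [[0, 1, 0, 0], [-1, 0, 0, 0], [0, 0, 0, 2], [0, 0, -2, 0]]
print(flux_charge(m2), flux_charge(m4))
```

In this sketch the contraction is 2^{d/2}(d/2)! times the Pfaffian of m, so the prefactor cancels and the result is an integer for any integer fluxes.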
The assertions of the Lemma 3.1 immediately follow from the explicit construction
of bµ (x) (though this is not unique):
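$$ b_\mu(x) = \frac{1}{L^{d-\mu}} \sum_{y_\mu=0}^{x_\mu} \sum_{y_{\mu+1}=0}^{L-1} \cdots \sum_{y_d=0}^{L-1} c(x_1,\ldots,x_{\mu-1},y_\mu,\ldots,y_d) - \frac{x_\mu+1}{L^{d-\mu+1}} \sum_{y_\mu=0}^{L-1} \cdots \sum_{y_d=0}^{L-1} c(x_1,\ldots,x_{\mu-1},y_\mu,\ldots,y_d). \tag{3.9} $$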
Note that since bµ (x) is given by a sum of the field c(x), bµ (x) is gauge-invariant if so
is c(x).
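The construction (3.9) can be tested numerically on a small lattice. The sketch below rests on two assumptions that are not spelled out in the text above: that ∂µ∗ denotes the backward difference, ∂µ∗ f(x) = f(x) − f(x − µ̂), and that the lemma asserts a decomposition of the form c(x) = L^{−d} ∑y c(y) + ∂µ∗ bµ (x). The helper names and the test lattice (L = 4, d = 3) are hypothetical.

```python
from itertools import product
import random


def build_currents(c, L, d):
    """Return [b_1, ..., b_d] as dicts keyed by site tuples, following Eq. (3.9)."""
    sites = list(product(range(L), repeat=d))
    currents = []
    for mu in range(d):  # 0-based direction; the paper's index is mu + 1
        n_tail = d - mu - 1  # number of directions after mu, each fully summed
        b = {}
        for x in sites:
            head = x[:mu]
            # first term: y_mu runs from 0 to x_mu, later y's over the whole lattice
            first = sum(
                c[head + (y_mu,) + tail]
                for y_mu in range(x[mu] + 1)
                for tail in product(range(L), repeat=n_tail)
            ) / L ** n_tail
            # second term: y_mu and all later y's run over the whole lattice
            second = (x[mu] + 1) * sum(
                c[head + tail] for tail in product(range(L), repeat=n_tail + 1)
            ) / L ** (n_tail + 1)
            b[x] = first - second
        currents.append(b)
    return currents


def backward_divergence(currents, x, L):
    """Sum over mu of b_mu(x) - b_mu(x - mu_hat), with periodic wrap-around."""
    total = 0.0
    for mu, b in enumerate(currents):
        shifted = x[:mu] + ((x[mu] - 1) % L,) + x[mu + 1:]
        total += b[x] - b[shifted]
    return total


random.seed(0)
L, d = 4, 3
field = {x: random.uniform(-1.0, 1.0) for x in product(range(L), repeat=d)}
currents = build_currents(field, L, d)
mean = sum(field.values()) / L ** d
worst = max(abs(field[x] - mean - backward_divergence(currents, x, L)) for x in field)
print(worst)
```

The divergence telescopes over µ, which is why only the lattice average of c(x) is left over. Each bµ vanishes at xµ = L − 1, so the periodic wrap-around in the backward difference is consistent.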
7 Eq. (2.3) and Theorem 3.1 show that the index is given by the combination (3.4) in terms of the magnetic
flux. For the overlap-Dirac operator, this relation has been verified numerically for d = 2 and d = 4 [27,28] and
proven analytically for d = 2 [28].
4. Non-Abelian cases
The explicit expression of Lüscher’s topological density is known only for d = 2 and for
d = 4. In our context, it is given by N times Eq. (32) of Ref. [29]. We simply assume
that the construction can be pursued for higher-dimensional cases.8 The construction of
Ref. [29] does not provide a pseudoscalar q(x). However, we may always enforce this
pseudoscalar property by taking average over lattice symmetries; we assume that this has
been done and q(x) is a pseudoscalar. The topological density has the classical continuum
limit
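$$ \lim_{a\to 0} \frac{1}{a^d}\, q(x) = \frac{N\, i^{d/2}}{(4\pi)^{d/2}(d/2)!}\, \epsilon_{\mu_1\nu_1\cdots\mu_{d/2}\nu_{d/2}}\, \mathrm{tr}\, F_{\mu_1\nu_1} F_{\mu_2\nu_2} \cdots F_{\mu_{d/2}\nu_{d/2}}(x). \tag{4.2} $$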
At the moment, we cannot prove the above conjecture at the non-perturbative level.
However, we see that the conjecture holds to all orders in perturbation theory; the
following theorem guarantees that a gauge-invariant topological field is unique (up to a
total divergence) under certain conditions.
Theorem 4.1. Let p(x) be a local, gauge-invariant, translation-invariant pseudoscalar field
on the infinite lattice whose dependence on the lattice spacing a arises only
through the gauge field.9 If it is topological,
$$ \sum_{x\in\mathbb{R}^d} \delta p(x) = 0, \tag{4.3} $$
and the classical continuum limit lim a→0 p(x)/a^d vanishes, then to all orders in
perturbation theory,
8 For G = U(1), the construction of Ref. [29] can be generalized to arbitrary dimensions [26]. The equivalence
of Eq. (4.1) with Eq. (2.11) for G = U(1) has been shown [26]. See also Ref. [30].
9 Recall that in the classical continuum limit the gauge potential is introduced as U (x, µ) =
Proof. Our proof is rather similar to the cohomological argument of Ref. [5]. We expand
p(x) with respect to the bare gauge coupling constant g0 introduced by $U(x,\mu) = e^{g_0 A_\mu(x)}$:
$$ p(x) = \sum_{k=1}^{\infty} p^{(k)}(x), \qquad p^{(k)}(x) = \frac{g_0^k}{k!} \sum_{y_1,\ldots,y_k} p^{(k)}(x,y_1,\ldots,y_k)^{a_1\cdots a_k}_{\mu_1\cdots\mu_k}\, A^{a_1}_{\mu_1}(y_1) \cdots A^{a_k}_{\mu_k}(y_k), \tag{4.5} $$
Moreover, since p(1) (x) is a local topological pseudoscalar field and Eq. (4.6) is the gauge
transformation in Abelian theory, one can invoke the cohomological analysis in Abelian
theory. The result is
p(1) (x) = ∂µ∗ ,(1)
µ (x), µ (x) = g0
,(1) ,(1) a a
µ (x, y)ν Aν (y). (4.8)
y
The local axial vector current ,(1)µ (x) is invariant under Eqs. (4.6) and (4.7). A key
(1) (1)
observation is that, from ,µ (x), one can construct a field ,̂µ (x) such that it is invariant
under the original non-Abelian gauge transformation and its lowest-order O(g0 ) term
coincides with ,(1) a
µ (x). This can be accomplished by substituting the gauge potential Aµ (y)
in Eq. (4.8) by the expression [5]
$$ \hat A^a_\mu(x,y) = \frac{2}{g_0}\, \mathrm{tr}\left\{ T^a \left[ 1 - W(x,y)\, U(y,\mu)\, W(x,y+\hat\mu)^{-1} \right] \right\}, \tag{4.9} $$
where W (x, y) is the ordered product of the link variables from y to x along the shortest
path that goes first in direction 1, then direction 2, and so on. Note that Âaµ (x, y) behaves
gauge covariantly under the original non-Abelian gauge transformation. Thus the resulting
expression,
µ (x) = g0
,̂(1) ,(1) a a
µ (x, y)ν Âν (x, y), (4.10)
y
(1)
is invariant under the non-Abelian gauge transformation due to the invariance of ,µ (x)
under Eq. (4.7). Moreover, since
with ω(x, y) the oriented line sum of the gauge potential from y to x, the invariance under
Eq. (4.6) implies that ,̂(1) (1) (1)
µ (x) = ,µ (x) + O(g0 ). Using ,̂µ (x), we may define a local
2 10
Going back to Eq. (4.1), we note that both A∞ (x) and q(x) are local gauge-invariant
topological pseudoscalar fields (for the latter, those properties follow from the construction
of q(x) [29]). Moreover, they have the same classical continuum limit (4.2). Thus, applying
Theorem 4.1 to A∞ (x) − q(x), we see that the conjecture holds to all orders in perturbation
theory.
Now, in the proof of Theorem 3.1 in Abelian theory, every step is valid even for
non-Abelian theories, except for the crucial relation (3.4), namely, that ∑x∈Γ A∞ (x) is an
integer.
With our Conjecture 4.1 for non-Abelian cases, this last condition is also satisfied;
∑x∈Γ q(x) is Lüscher’s topological charge on a periodic lattice, which is an integer. So,
repeating the proof of Theorem 3.1, we have
Corollary of Conjecture 4.1. For general G, if the lattice is sufficiently large compared to
the localization range ρ of the Dirac operator, say L/ρ > n,
A(x) = q(x) + ∂µ∗ kµ (x), (4.14)
where kµ (x) is a gauge-invariant periodic current on Γ . The current kµ (x), moreover,
satisfies the bound
$$ \left| k_\mu(x) - k_\mu^\infty(x) \right| \le \kappa_1 L^{\nu_1} e^{-L/\rho}, \tag{4.15} $$
(1)
10 The current ,̂ (x) so constructed is not an axial vector under the lattice symmetries. However, we can
µ
always enforce this by taking average over lattice symmetries.
with constants κ1 and ν1 . The topological density q(x) is given by Lüscher’s topological
density [29] and its higher-dimensional extensions.
This corollary states that the basic structure of the axial anomaly on finite lattices is identical
to that on the infinite lattice. Summing Eq. (4.14) over the lattice Γ , one has an equality
between the index of the Dirac operator (2.3) and the geometrically defined lattice
topological charge [29]. This equivalence (“lattice index theorem”) has long been thought to
be true, since the analyses in Refs. [27,31]. Our argument provides further
support for this equivalence.
5. Conclusion
In this paper, we have studied the axial anomaly defined on a finite-size lattice by using
a Ginsparg–Wilson–Dirac operator. For G = U(1), we showed that the basic structure of the
axial anomaly on the infinite lattice, which has a form quite analogous to its continuum
counterpart, persists even on sufficiently large finite-size lattices. For general G, we
conjectured that the axial anomaly on the infinite lattice is basically given by Lüscher’s
topological density; this holds to all orders in perturbation theory. With this
conjecture, we showed that this structure again persists even on finite-size lattices. Since
Lüscher’s topological density is a geometrically natural definition of the Chern form in
lattice gauge theory (note that it is proportional to $\mathrm{str}\, T^{a_1} \cdots T^{a_{d/2}}$), our analysis indicates
that the basic structure of the axial anomaly in continuum theory is quite robust and persists
even in a system with finite ultraviolet and infrared cutoffs. Of course, we demonstrated this
persistence only in a framework with the Ginsparg–Wilson relation. To understand the precise
conditions on the formulation for this persistence to hold is an interesting open question;
for example, one may enlarge the set of formulations by using the generalized Ginsparg–
Wilson relation [32].
In the gauge-invariant lattice formulation of Abelian chiral gauge theories [15],
knowledge of the structure of the U(1) gauge anomaly on finite-size lattices was of crucial
importance. Recalling this fact, we believe that our analyses will be useful in extending the
construction of Ref. [15] to non-Abelian gauge theories.
Acknowledgements
H.S. would like to thank Takanori Fujiwara, Takahiro Fukui, Yoshio Kikukawa and
Martin Lüscher for valuable discussions. We are grateful to Kazuo Fujikawa for a careful
reading of the manuscript.
References
[1] M. Lüscher, Topology and the axial anomaly in abelian lattice gauge theories, Nucl. Phys. B 538 (1999)
515, hep-lat/9808021.
[2] T. Fujiwara, H. Suzuki, K. Wu, Noncommutative differential calculus and the axial anomaly in abelian lattice
gauge theories, Nucl. Phys. B 569 (2000) 643, hep-lat/9906015;
T. Fujiwara, H. Suzuki, K. Wu, Axial anomaly in lattice abelian gauge theory in arbitrary dimensions, Phys.
Lett. B 463 (1999) 63, hep-lat/9906016.
[3] H. Suzuki, Anomaly cancellation condition in lattice gauge theory, Nucl. Phys. B 585 (2000) 471, hep-
lat/0002009;
H. Igarashi, K. Okuyama, H. Suzuki, Errata and addenda to “Anomaly cancellation condition in lattice gauge
theory”, hep-lat/0012018.
[4] Y. Kikukawa, Y. Nakayama, Gauge anomaly cancellation in SU(2)L × U(1)Y electroweak theory on the
lattice, Nucl. Phys. B 597 (2001) 519, hep-lat/0005015.
[5] M. Lüscher, Lattice regularization of chiral gauge theories to all orders of perturbation theory, J. High
Energy Phys. 06 (2000) 028, hep-lat/0006014.
[6] Y. Kikukawa, Domain wall fermion and chiral gauge theories on the lattice with exact gauge invariance,
Phys. Rev. D 65 (2002) 074504, hep-lat/0105032.
[7] P. Hasenfratz, Prospects for perfect actions, Nucl. Phys. (Proc. Suppl.) 63 (1998) 53, hep-lat/9709110;
P. Hasenfratz, Lattice QCD without tuning, mixing and current renormalization, Nucl. Phys. B 525 (1998)
401, hep-lat/9802007.
[8] H. Neuberger, Exactly massless quarks on the lattice, Phys. Lett. B 417 (1998) 141, hep-lat/9707022;
H. Neuberger, More about exactly massless quarks on the lattice, Phys. Lett. B 427 (1998) 353, hep-
lat/9801031.
[9] P.H. Ginsparg, K.G. Wilson, A remnant of chiral symmetry on the lattice, Phys. Rev. D 25 (1982) 2649.
[10] P. Hasenfratz, V. Laliena, F. Niedermayer, The index theorem in QCD with a finite cut-off, Phys. Lett. B 427
(1998) 125, hep-lat/9801021.
[11] M. Lüscher, Exact chiral symmetry on the lattice and the Ginsparg–Wilson relation, Phys. Lett. B 428 (1998)
342, hep-lat/9802011.
[12] F. Niedermayer, Exact chiral symmetry, topological charge and related topics, Nucl. Phys. (Proc. Suppl.) 73
(1999) 105, hep-lat/9810026.
[13] P. Hernández, K. Jansen, M. Lüscher, Locality properties of Neuberger’s lattice Dirac operator, Nucl. Phys.
B 552 (1999) 363, hep-lat/9808010.
[14] H. Neuberger, Bounds on the Wilson Dirac operator, Phys. Rev. D 61 (2000) 085015, hep-lat/9911004.
[15] M. Lüscher, Abelian chiral gauge theories on the lattice with exact gauge invariance, Nucl. Phys. B 549
(1999) 295, hep-lat/9811032.
[16] I. Horvath, Ginsparg–Wilson relation and ultralocality, Phys. Rev. Lett. 81 (1998) 4063, hep-lat/9808002;
I. Horvath, Ginsparg–Wilson–Lüscher symmetry and ultralocality, Phys. Rev. D 60 (1999) 034510, hep-
lat/9901014.
[17] W. Bietenholz, On the absence of ultralocal Ginsparg–Wilson fermions, hep-lat/9901005.
[18] Y. Kikukawa, A. Yamada, Weak coupling expansion of massless QCD with a Ginsparg–Wilson fermion and
axial U(1) anomaly, Phys. Lett. B 448 (1999) 265, hep-lat/9806013.
[19] K. Fujikawa, A continuum limit of the chiral Jacobian in lattice gauge theory, Nucl. Phys. B 546 (1999) 480,
hep-th/9811235.
[20] D.H. Adams, Axial anomaly and topological charge in lattice gauge theory with overlap-Dirac operator,
Ann. Phys. (N.Y.) 296 (2002) 131, hep-lat/9812003;
D.H. Adams, On the continuum limit of fermionic topological charge in lattice gauge theory, J. Math.
Phys. 42 (2001) 5522, hep-lat/0009026.
[21] H. Suzuki, Simple evaluation of chiral Jacobian with the overlap Dirac operator, Prog. Theor. Phys. 102
(1999) 141, hep-th/9812019.
[22] T.W. Chiu, T.H. Hsieh, Perturbation calculation of the axial anomaly of Ginsparg–Wilson fermion, hep-
lat/9901011.
[23] T. Reisz, H.J. Rothe, The axial anomaly in lattice QED: a universal point of view, Phys. Lett. B 455 (1999)
246, hep-lat/9903003.
[24] M. Frewer, H.J. Rothe, Universality of the axial anomaly in lattice QCD, Phys. Rev. D 63 (2001) 054506,
hep-lat/0004005.
[25] T.W. Chiu, The axial anomaly of Ginsparg–Wilson fermion, Phys. Lett. B 445 (1999) 371, hep-lat/9809013.
[26] T. Fujiwara, H. Suzuki, K. Wu, Topological charge of lattice Abelian gauge theory, Prog. Theor. Phys. 105
(2001) 789, hep-lat/0001029.
[27] R. Narayanan, H. Neuberger, Chiral fermions on the lattice, Phys. Rev. Lett. 71 (1993) 3251, hep-
lat/9308011;
R. Narayanan, H. Neuberger, A construction of lattice chiral gauge theories, Nucl. Phys. B 443 (1995) 305,
hep-th/9411108.
[28] T. Fujiwara, A numerical study of spectral flows of Hermitian Wilson–Dirac operator and the index theorem
in Abelian gauge theories on finite lattices, Prog. Theor. Phys. 107 (2002) 163, hep-lat/0012007;
H. Kurokawa, T. Fujiwara, Spectrum of the hermitian Wilson–Dirac operator for a uniform magnetic field
in two dimensions, hep-lat/0206014.
[29] M. Lüscher, Topology of lattice gauge fields, Commun. Math. Phys. 85 (1982) 39.
[30] A. Phillips, Characteristic numbers of U(1) valued lattice gauge fields, Ann. Phys. (N.Y.) 161 (1985) 399.
[31] R. Narayanan, P. Vranas, A numerical test of the continuum index theorem on the lattice, Nucl. Phys. B 506
(1997) 373, hep-lat/9702005.
[32] K. Fujikawa, Algebraic generalization of the Ginsparg–Wilson relation, Nucl. Phys. B 589 (2000) 487, hep-
lat/0004012;
K. Fujikawa, M. Ishibashi, Chiral anomaly for a new class of lattice Dirac operators, Nucl. Phys. B 587
(2000) 419, hep-lat/0005003;
K. Fujikawa, M. Ishibashi, Locality properties of a new class of lattice Dirac operators, Nucl. Phys. B 605
(2001) 365, hep-lat/0102012;
K. Fujikawa, M. Ishibashi, A perturbative study of a general class of lattice Dirac operators, Phys. Rev. D 65
(2002) 114504, hep-lat/0201016.
Nuclear Physics B 644 (2002) 395–400
Abstract
In this paper we calculate the (g − 2)µ contribution due to a light stabilized radion using the radion
couplings both to the kinetic energy and the mass of the muon. We find that the radion-mediated muon
anomaly (ar ) is a calculable quantity free from powerlike ultraviolet divergences. We have estimated
ar both for Λ ≫ mφ ≫ mµ and Λ ≫ mφ ≈ mµ . Our results show that under the first (second)
condition the radion-mediated muon anomaly can be detected with the ultimate future precision for
measuring the muon anomaly provided ⟨φ⟩ is less than 425 (600) GeV, whereas with the present
precision ar can be detected provided ⟨φ⟩ is less than 250 GeV in the first case and 375 GeV in the
second case.
© 2002 Elsevier Science B.V. All rights reserved.
Recently there has been a lot of interest in studying the phenomenology of models of
large [1] and small [2] extra dimensions. Phenomenological data from a variety of sources
have been used to constrain the unknown (free) parameters of models of extra dimensions.
In particular, the precision measurement of muon anomaly by the BNL Collaboration has
been used to constrain the free parameters of extra dimension models [3]. In this paper we
calculate the muon anomaly due to a light stabilized radion [4] in the Randall–Sundrum
model. Using the radion couplings both to the kinetic energy (K.E.) and the mass of the
muon we find that the radion mediated muon anomaly is a calculable quantity free from
powerlike ultraviolet divergences. This is unlike the Kaluza–Klein graviton contribution to
the oblique electroweak parameters S, T and U, which is plagued by incalculable powerlike
divergences [5]. We also find that the radion coupling to the K.E. of the muon leads to a
muon anomaly contribution that could be tested with the present (future) experimental
precision even for mφ ≫ mµ provided ⟨φ⟩ is less than 250 (425) GeV. Our result is
396 P. Das, U. Mahanta / Nuclear Physics B 644 (2002) 395–400
Fig. 1. Feynman diagrams, panels (a)–(d), that give rise to the radion contribution to the muon anomaly.
significantly greater than previous estimates, which reported very small values of the
radion-mediated muon anomaly for mφ ≫ mµ . The reason the previous estimates
obtained very small values of ar is that they used only the radion coupling to the mass
of the muon, neglecting its coupling to the K.E. of the muon.
The Feynman diagrams that could give rise to the radion-mediated muon anomaly are
shown in Fig. 1.
The Feynman rules for evaluating these diagrams can be found in Ref. [6]. The results
of our calculation are briefly presented below.
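$$
\begin{aligned}
-ie\Gamma_{1\mu} &= \frac{9e}{2\langle\phi\rangle^2} \int \frac{d^4l}{(2\pi)^4} \int x\,dx\,dy\, \Big[ \slashed{l} - \slashed{p}(1-x) - \big(\tfrac{2}{3} + xy\big) m_\mu \Big] \\
&\quad \times \big[ \slashed{l} + \slashed{p}'(1-xy) - \slashed{p}(1-x) + m_\mu \big]\, \gamma_\mu\, \big[ \slashed{l} - \slashed{p}' xy + \slashed{p}\, x + m_\mu \big] \\
&\quad \times \Big[ \slashed{l} - \slashed{p}' xy - \big(\tfrac{5}{3} - x\big) m_\mu \Big]\, \frac{1}{[(l+p)^2 - m_\mu^2][(l+p')^2 - m_\mu^2](l^2 - m_\phi^2)} \\
&= \frac{9iem_\mu}{64\pi^2\langle\phi\rangle^2} \int x\,dx\,dy\, \Big( \ln\frac{\Lambda^2}{A^2} - \frac{3}{2} \Big) \big[ C_1(x,y)(p_\mu + p'_\mu) + C_2(x,y)\, m_\mu \gamma_\mu \big] \\
&\quad - \frac{3iem_\mu^3}{32\pi^2\langle\phi\rangle^2} \int x\,dx\,dy\, \frac{1}{A^2} \big[ D_1(x,y)(p_\mu + p'_\mu) + D_2(x,y)\, m_\mu \gamma_\mu \big],
\end{aligned}
\tag{1}
$$
$$
\begin{aligned}
-ie\Gamma_{2\mu} &= -\frac{3e}{\langle\phi\rangle^2} \int \frac{d^4l}{(2\pi)^4}\, \frac{\big[\tfrac{3}{2}(\slashed{l} + 2\slashed{p}') - 4m_\mu\big]\big[\slashed{l} + \slashed{p}' + m_\mu\big]}{[(l+p')^2 - m_\mu^2][l^2 - m_\phi^2]}\, \gamma_\mu \\
&= \frac{-3ie}{16\pi^2\langle\phi\rangle^2} \int_0^1 dx\, \Big[ \frac{3}{2}\Lambda^2 + \frac{3}{2} B^2 \Big( 1 - 2\ln\frac{\Lambda^2}{B^2} \Big) + \Big( 1 + \frac{3}{2}x \Big)(2-x)\, m_\mu^2 \Big( \ln\frac{\Lambda^2}{B^2} - 1 \Big) \Big] \gamma_\mu,
\end{aligned}
\tag{2}
$$
$$
\begin{aligned}
-ie\Gamma_{3\mu} &= -\frac{3e}{\langle\phi\rangle^2} \int \frac{d^4l}{(2\pi)^4}\, \gamma_\mu\, \frac{\big[\slashed{l} + \slashed{p} + m_\mu\big]\big[\tfrac{3}{2}(\slashed{l} + 2\slashed{p}) - 4m_\mu\big]}{[(l+p)^2 - m_\mu^2][l^2 - m_\phi^2]} \\
&= \frac{-3ie}{16\pi^2\langle\phi\rangle^2} \int_0^1 dx\, \Big[ \frac{3}{2}\Lambda^2 + \frac{3}{2} B^2 \Big( 1 - 2\ln\frac{\Lambda^2}{B^2} \Big) + \Big( 1 + \frac{3}{2}x \Big)(2-x)\, m_\mu^2 \Big( \ln\frac{\Lambda^2}{B^2} - 1 \Big) \Big] \gamma_\mu
\end{aligned}
\tag{3}
$$
and
$$
-ie\Gamma_{4\mu} = \frac{3e}{\langle\phi\rangle^2} \int \frac{d^4l}{(2\pi)^4}\, \frac{1}{(l^2 - m_\phi^2)}\, \gamma_\mu = -\frac{3ie}{16\pi^2\langle\phi\rangle^2} \Big[ \Lambda^2 - m_\phi^2 \ln\frac{\Lambda^2}{m_\phi^2} \Big] \gamma_\mu.
\tag{4}
$$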
In the above
Fig. 2. ρ parameter constraints on the radion VEV ⟨φ⟩ and radion mass mφ . The allowed region lies above the curve.
Fig. 3. Plot of muon anomaly ar against the radion VEV in the first case. The horizontal line corresponds to the
ultimate future precision of the experiment.
Fig. 4. Plot of muon anomaly ar against the radion VEV in the second case. The horizontal line corresponds to
the ultimate future precision of the experiment.
applying power counting it can be shown that the anomalous magnetic moment term is at
most log divergent. Using the Gordon identity to replace the convective current (pµ + p′µ )
by a vector current (γµ ) and a spin current (iσµν q ν ) we get
$$ a_r = \frac{9m_\mu^2}{32\pi^2\langle\phi\rangle^2} \int x\,dx\,dy\, \Big( \ln\frac{\Lambda^2}{A^2} - \frac{3}{2} \Big) C_1(x,y) - \frac{3m_\mu^2}{16\pi^2\langle\phi\rangle^2} \int x\,dx\,dy\, \frac{m_\mu^2}{A^2}\, D_1(x,y). \tag{11} $$
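The Gordon-identity step can be written out as follows. This is a sketch in one common convention, which is an assumption here since the conventions are not stated in the text: q = p′ − p, on-shell spinors, m the muon mass, and the form-factor decomposition shown in the second line.

```latex
% Gordon identity for on-shell spinors, with q = p' - p and muon mass m:
\bar u(p')\,(p+p')^{\mu}\,u(p)
  = \bar u(p')\left[\,2m\,\gamma^{\mu} - i\sigma^{\mu\nu}q_{\nu}\right]u(p).

% Assumed decomposition of the vertex; the anomaly is a = F_2(0):
\Gamma^{\mu} = \gamma^{\mu}F_1(q^2) + \frac{i\sigma^{\mu\nu}q_{\nu}}{2m}\,F_2(q^2).

% Hence a term c\,(p+p')^{\mu} in \Gamma^{\mu} shifts the two form factors by
\delta F_1 = 2m\,c, \qquad \delta F_2 = -2m\,c .
```

In this convention only the coefficient of the convective current feeds into the anomaly, while the pieces proportional to γµ renormalize the charge form factor.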
We shall now estimate the radion mediated muon anomaly under the following two
conditions:
Case I: Λ ≫ mφ ≫ mµ
In this case we shall consider radion masses in the range 1 GeV to a few hundred GeV,
because such radion masses do not require a large fine tuning of the parameters
in the Goldberger and Wise stabilization scheme. Under this condition m²µ /A² ≪ 1 and the
second term becomes negligible in comparison to the first (logarithmic) term. We find
that for ⟨φ⟩ ≈ 250 GeV the radion-mediated muon anomaly turns out to be 1.24 × 10⁻⁹.
On the other hand for ⟨φ⟩ ≈ 500 GeV the value of ar drops to 3.2 × 10⁻¹⁰.
Case II: Λ ≫ mφ ≈ mµ
Under this condition the first term becomes comparable to the second and both have
to be kept in estimating ar . We find that for ⟨φ⟩ ≈ 375 GeV the radion-mediated muon
anomaly is about 1.1 × 10⁻⁹. On the other hand for ⟨φ⟩ ≈ 600 GeV the value of ar falls to
4 × 10⁻¹⁰.
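The quoted numbers can be cross-checked against the overall 1/⟨φ⟩² prefactor of Eq. (11). The sketch below assumes that the parameter integrals, including the logarithm ln(Λ²/A²), do not change appreciably with ⟨φ⟩. That is an assumption made for illustration only and would fail if the cutoff were tied to the VEV. The function name is hypothetical.

```python
def rescale_anomaly(a_ref, vev_ref, vev_new):
    """Rescale a_r from vev_ref to vev_new assuming a pure 1/<phi>^2 dependence."""
    return a_ref * (vev_ref / vev_new) ** 2


# Case I: reference value 1.24e-9 at <phi> = 250 GeV
case1_at_500 = rescale_anomaly(1.24e-9, 250.0, 500.0)
case1_at_425 = rescale_anomaly(1.24e-9, 250.0, 425.0)

# Case II: reference value 1.1e-9 at <phi> = 375 GeV
case2_at_600 = rescale_anomaly(1.1e-9, 375.0, 600.0)

print(case1_at_500, case1_at_425, case2_at_600)
```

Under this assumption the rescaled values land close to the quoted 3.2 × 10⁻¹⁰ at 500 GeV and to the 4 × 10⁻¹⁰ level at 425 GeV and 600 GeV, which suggests that the 1/⟨φ⟩² prefactor dominates the variation.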
The precision measurement of the muon anomaly currently constitutes one of the most
stringent tests for new physics beyond the standard model (SM), particularly in the light
of the recent and promised future results from the E821 experiment at BNL. The most recent
experimental value of the muon anomaly is $a_\mu^{\rm exp} = (11659203 \pm 15) \times 10^{-10}$ [8]. The
SM prediction is $a_\mu^{\rm SM} = (11659176 \pm 6.7) \times 10^{-10}$ [9]. The SM prediction for the muon
anomaly therefore differs from the experimental value by (27 ± 16.4) × 10⁻¹⁰, i.e., by 1.6
standard deviations. Since this is less than a 2σ effect we shall not use it to set limits on the
unknown parameters mφ and ⟨φ⟩. Rather we shall use the precision of the BNL experiment
as a benchmark for the testability of the radion-mediated muon anomaly. It is hoped that the BNL
Collaboration will be able to reach a precision of 4 × 10⁻¹⁰. Our results show that in the
first case the radion-mediated muon anomaly can be detected with the present (future)
precision for measuring the anomaly provided ⟨φ⟩ is less than 250 (425) GeV, whereas in
the second case the detectability of ar with the present (future) precision requires ⟨φ⟩ to
be less than 375 (600) GeV. It is worthwhile to compare these bounds with those obtained
from precision measurement of the oblique electroweak parameters. At a 2σ difference from
the central value of the T parameter one obtains a lower bound of 400 GeV on ⟨φ⟩ for
radion masses lying between 1 and 100 GeV [10]. The muon anomaly bounds on ⟨φ⟩ are
therefore comparable with those obtained from the T parameter.
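The quoted discrepancy between experiment and the SM follows from combining the two uncertainties in quadrature. A minimal sketch of that arithmetic, assuming the two errors are independent (the function name is illustrative):

```python
from math import hypot


def discrepancy(exp_value, exp_error, sm_value, sm_error):
    """Return (difference, combined error, significance), errors added in quadrature."""
    difference = exp_value - sm_value
    combined_error = hypot(exp_error, sm_error)
    return difference, combined_error, difference / combined_error


# All inputs in units of 1e-10, as quoted in the text
diff, err, sigma = discrepancy(11659203, 15, 11659176, 6.7)
print(diff, round(err, 1), round(sigma, 1))
```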
We would like to note that in the first case (mφ ≫ mµ ) our result for ar is significantly
greater than previously published estimates. Previous estimates of the radion-mediated
muon anomaly neglected the radion coupling to the K.E. of the muon and used only
the radion coupling to the mass of the muon. As a result they obtained only a subdominant
contribution proportional to m²µ × m²µ /m²φ . It can be shown that the radion coupling to the
muon reduces to the mass of the muon only if both muon lines are on shell. However, in
calculating the loop diagrams shown in Fig. 1 one certainly cannot assume that the muon
lines at each vertex are on shell. The previous estimates of the radion-mediated muon anomaly
are therefore not reliable.
References
Erratum
There is an error in the integrated cross-sections due to the one-loop corrections to the
reactions q + q̄ → V + g and q(q̄) + g → V + q(q̄). Therefore the coefficient functions
$\Delta^{(2),C_A}_{q\bar q}$ Eq. (B.9), $\Delta^{(2),C_F}_{q\bar q}$ Eq. (B.10), $\Delta^{(2),C_A}_{q\bar q}$ Eq. (B.19), $\Delta^{(2),C_F}_{q\bar q}$ Eq. (B.20), are changed.
The correct coefficient functions are obtained by the following modifications.
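To Eq. (B.9) one has to add
$$ \left(\frac{\alpha_s}{4\pi}\right)^2 C_A C_F \left[ -16x \ln x \ln(1-x) + 8x \ln^2 x - 16x\, \mathrm{Li}_2(1-x) \right]. $$
To Eq. (B.10) one has to add
$$ \left(\frac{\alpha_s}{4\pi}\right)^2 C_F^2 \left[ -(48+16x) \ln x \ln(1-x) - 16 \ln x + (24+8x) \ln^2 x - (48+16x)\, \mathrm{Li}_2(1-x) \right]. $$
To Eq. (B.19) one has to add
$$ \left(\frac{\alpha_s}{4\pi}\right)^2 C_A T_f \left[ 8x \ln x \ln(1-x) - 8x \ln x - 4x \ln^2 x + 8x\, \mathrm{Li}_2(1-x) \right]. $$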
✩
PII of original article: S0550-3213(91)00343-8.
E-mail address: neerven@lorentz.leidenuniv.nl (W.L. van Neerven).
404 R. Hamberg et al. / Nuclear Physics B 644 (2002) 403–404
To Eq. (B.20) one has to add
$$ \left(\frac{\alpha_s}{4\pi}\right)^2 C_F T_f \left[ -(24-8x) \ln x \ln(1-x) - 24(1-x) \ln(1-x) + (28-44x) \ln x + (12-4x) \ln^2 x - (24-8x)\, \mathrm{Li}_2(1-x) + 12(1-x) \right]. $$
The new coefficient functions are now in agreement with the results found in [1]. Further,
there are some misprints. The function $\mathrm{Li}_3\big(\frac{1+x}{1-x}\big)$ on line 8 in Eq. (B.19) and on line 4
in Eq. (B.24) has to be read as $\mathrm{Li}_3\big(\frac{1-x}{1+x}\big)$. Likewise the function $\mathrm{Li}_3\big(-\frac{1+x}{1-x}\big)$ on line 8 in
Eq. (B.19) and on line 4 in Eq. (B.24) has to be read as $\mathrm{Li}_3\big(-\frac{1-x}{1+x}\big)$.
Acknowledgement
W.L. van Neerven would like to thank W.B. Kilgore for discussions. We also would like
to thank V. Ravindran for recalculating the coefficient functions, which led to the agreement
with the results in [1].
References
[1] R.V. Harlander, W.B. Kilgore, Phys. Rev. Lett. 88 (2002) 201801.
Nuclear Physics B 644 (2002) 405–406
Erratum
which guarantees the gauge invariance taking into account Eqs. (1.7), (1.11), (2.20)
and (2.21)
5
δλ SCS = k5 dζ (−4) du Tr λD (+4) D (+2) V −− , ∇ ++ V −− = 0. (2.27)
k5
5
SCS = d 5 x d 8 θ du Tr V ++ V1−− V ++ , D (+2) V1−− V ++ + · · · , (2.28)
3
✩
PII of original article: S0550-3213(99)00267-9.
E-mail address: zupnik@thsun1.jinr.ru (B. Zupnik).
406 B. Zupnik / Nuclear Physics B 644 (2002) 405–406
where higher-order terms are omitted and the linear approximation of the perturbative
solution for V −− is used
$$ V_1^{--}\big(V^{++}\big) = \int du_1\, \frac{V^{++}(z,u_1)}{(u^+ u_1^+)^2}. \tag{2.29} $$
References
[1] T. Kugo, K. Ohashi, Prog. Theor. Phys. 105 (2001) 323, hep-ph/0010288.
[2] E. Bergshoeff, et al., hep-th/0205230.
Nuclear Physics B 644 [FS] (2002) 409–432
Abstract
We study descendants of inhomogeneous vertex models with boundary reflections when the spin–spin
scattering is assumed to be quasi-classical. This corresponds to considering a certain power expansion
of the boundary Yang–Baxter equation (or reflection equation). As the final product, integrable
su(2)-spin chains with long-range interactions and XXZ anisotropy are obtained. The spin–spin coupling
constants are non-uniform, and a non-uniform tunable external magnetic field is applied; the latter
can be obtained when the boundary conditions are assumed to be quasi-classical as well. The exact
spectrum is obtained by the algebraic Bethe ansatz. Having realized the su(2) operators in terms of
fermions, the class of models we found turns out to describe confined fermions with pairing force
interactions. The class of models presented in this paper is a one-parameter extension of certain
Hamiltonians constructed previously. Extensions to su(n)-spin open chains are discussed.
2002 Elsevier Science B.V. All rights reserved.
1. Introduction
410 A. Di Lorenzo et al. / Nuclear Physics B 644 [FS] (2002) 409–432
and more applications in contemporary physics. The key toward this powerful synthesis
is to notice that the “scattering” of the degrees of freedom of both the VM and the spin
chains is described by the same matrix. The Quantum Inverse Scattering Method (QISM)
exploits this fact systematically [3]. The method relies on the observation that transfer
matrices tˆ = Tr(T ) span a one-parameter family of commuting operators if a (scattering)
matrix R exists such that T , R satisfy the celebrated Quantum Yang–Baxter equation. The
equivalence between VM and Heisenberg chains consists in the fact that these models have
the same R-matrix. Due to the property of the scattering R(u, v) = R(u − v), ∀u, v ∈ C
the integrability of VM is preserved if disorder is added at each lattice site such that the
scattering “wave momenta” u, v are shifted arbitrarily. In this case, however, it
is difficult to extract a Hamiltonian. A route to simplify the problem is to resort to the
so-called “quasi-classical” limit of the QISM. The term “quasi-classical” here indicates that
the scattering between the degrees of freedom of the model is assumed to be quasi-classical.
Quantitatively, this means that a parameter η does exist such that the scattering matrix is
of the form R(u) ∝ 1 ⊗ 1 + ηr(u) in the limit η → 0 (η plays the role of h̄). The quantity
r(u) fulfills the classical Yang–Baxter equation (that is a restatement of the Jacobi identity
for the Poisson brackets of suitable action-angle variables). It is worthwhile to mention,
however, that the systems obtained by this quasi-classical expansion consist of quantum
spins (by no means quasi-classical). The quasi-classical expansion of the transfer matrix
of disordered VM (in the lowest spin representation) is non-trivial and it produces the
Gaudin’s magnet Hamiltonians [4,5] containing a long range spin interaction (in contrast
with the range of the Heisenberg chains which involves nearest neighbour spins).
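The classical Yang–Baxter equation mentioned above can be verified explicitly in the simplest case. The sketch below uses the rational r-matrix r(u) = P/u, with P the permutation operator on two spin-1/2 sites. This is a standard textbook solution chosen here purely for illustration and is not the XXZ r-matrix used in the paper. The equation is taken in the form [r12(u1 − u2), r13(u1 − u3)] + [r12(u1 − u2), r23(u2 − u3)] + [r13(u1 − u3), r23(u2 − u3)] = 0, and all helper names are hypothetical.

```python
from fractions import Fraction
from itertools import product


def swap_operator(i, j, n_sites=3):
    """Permutation operator P_ij on n spin-1/2 sites as a dense matrix (list of rows)."""
    states = list(product((0, 1), repeat=n_sites))
    index = {s: k for k, s in enumerate(states)}
    dim = len(states)
    mat = [[Fraction(0)] * dim for _ in range(dim)]
    for s in states:
        t = list(s)
        t[i], t[j] = t[j], t[i]
        mat[index[tuple(t)]][index[s]] = Fraction(1)
    return mat


def matmul(a, b):
    n = len(a)
    return [[sum(a[r][k] * b[k][c] for k in range(n)) for c in range(n)] for r in range(n)]


def scale(a, factor):
    return [[factor * entry for entry in row] for row in a]


def add(*matrices):
    n = len(matrices[0])
    return [[sum(m[r][c] for m in matrices) for c in range(n)] for r in range(n)]


def commutator(a, b):
    return add(matmul(a, b), scale(matmul(b, a), Fraction(-1)))


def cybe_residual(u1, u2, u3):
    """Left-hand side of the classical Yang-Baxter equation for r(u) = P/u."""
    r12 = scale(swap_operator(0, 1), Fraction(1) / (u1 - u2))
    r13 = scale(swap_operator(0, 2), Fraction(1) / (u1 - u3))
    r23 = scale(swap_operator(1, 2), Fraction(1) / (u2 - u3))
    return add(commutator(r12, r13), commutator(r12, r23), commutator(r13, r23))


residual = cybe_residual(Fraction(3), Fraction(1), Fraction(-2))
print(all(entry == 0 for row in residual for entry in row))
```

Exact rational arithmetic is used so that the cancellation between the three commutators is checked without rounding error. The individual commutators do not vanish, so the check is not trivial.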
A richer variety of integrable models by QISM comes from imposing non-trivial
boundary conditions different from the periodic ones. Twisted boundary conditions, for
example, imposed on the six-vertex model [6] produce the Gaudin magnet in a non-uniform
local magnetic field, which is very important for physical applications. In fact, having
realized the (pseudo)spin algebra in terms of fermions the XXX Gaudin Hamiltonians
in a non-uniform magnetic field are the constants of the motion of the BCS model [7] that
describes pairs of electrons (in time reversed states) interacting with a long range uniform
pairing coupling. The exact solution of the BCS model was found long ago by Richardson
[8] and rediscovered recently. In particular it was used to study small metallic grains
[9,10]; the picture was merged in the scenario of QISM in Ref. [11]. Connections with
WZNW models in field theory have been deeply investigated [12] based on the relation
between solution of KZ equation and Gaudin model found in Refs. [13,14]. The class
of pairing Hamiltonians was generalized by investigating the quasi-classical expansion
of the disordered twisted six vertex model with XXZ R-matrix. In terms of fermions
this class of Hamiltonians represents interacting electron pairs with certain non-uniform
long-range coupling strengths [15–18]. Twisted rings can be cut to open chains, and loops
then include two reflections at the boundaries. The possibility of including such reflections in
integrable theory was established and systematically investigated by Sklyanin [19]. The
quasi-classical limit of the disordered six-vertex model with boundaries was investigated first
by one of the authors [20,21]. This led to a model in a vanishing external magnetic field
where the spin couplings contain an additional parameter with respect to the original
Gaudin magnet (see Eq. (27) and the discussion below it). In the present
work, we proceed along this line. We still consider an inhomogeneous six-vertex model
with boundary reflection, following closely Ref. [20]. By properly choosing the reflection
parameters, we introduce an external non-uniform magnetic field of tunable strength in the
Hamiltonian. The trick consists in assuming that the boundary conditions also have
a quasi-classical expansion (see Eq. (25) and Section 3.2). To the best of our knowledge, this
idea is pursued for the first time in the present paper. In the following we summarize the
main results obtained in the bulk of the paper. The class of spin-Sj (j = 1, . . . , N ) models
that we find has a Hamiltonian of the form
$$ H = \sum_j 2h_j S^z_j - \sum_{\substack{j,k \\ j\neq k}} \left[ J^{(z)}_{jk} S^z_j S^z_k + J_{jk} S^+_j S^-_k + \mathrm{h.c.} \right]. \tag{1} $$
We agree that the Latin indices j, k will run from 1 to N , where N is the number of spins.
The operators S α , α = ±, z are su(2) operators. The couplings are
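$$
\begin{aligned}
J^{(z)}_{jk} &= I_{jk} \left[ \cosh(2pz_j) + \cosh(2pz_k) - 2\cos(2pt) \right], \\
J_{jk} &= I_{jk}\, \sinh\left[ p(z_j - it) \right] \sinh\left[ p(z_k + it) \right], \\
I_{jk} &= J(S^z)\, \frac{h_j - h_k}{\cosh(2pz_j) - \cosh(2pz_k)}, \\
J(S^z) &= J \left[ 1 - J S^z \right]^{-1},
\end{aligned}
\tag{2}
$$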
where S z is the total z-component of the spin. The quantities hj , zj are two arbitrary
sets of real parameters; t is also a real arbitrary parameter and it directly comes from the
boundary terms (see Eq. (16) with ξ = it); finally, p can be 1, i or can be tending to zero
corresponding to hyperbolic, trigonometric, and rational couplings, respectively.
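The eigenstates in the sector with $S^z = \sum_j S_j - M$ are
$$ |\Psi\rangle = \prod_{\alpha=1}^{M} S^-(e_\alpha)\, |H\rangle, \tag{3} $$
where
$$ S^-(u) = \sum_j \frac{\cosh[p(u + z_j + 2it)] - \cosh[p(u - z_j)]}{\cosh(2pu) - \cosh(2pz_j)}\, S^-_j \tag{4} $$
and
$$ |H\rangle = \bigotimes_{j=1}^{N} |S^z_j = S_j\rangle. $$
where
$$ \tau_j = S_j \left[ 1 - J(S^z) \sum_{k\neq j} \frac{1 - x_j x_k}{x_j - x_k}\, S_k + J(S^z) \sum_{\alpha} \frac{1 - x_j \lambda_\alpha}{x_j - \lambda_\alpha} \right], \tag{6} $$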
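where we defined
$$ \frac{1 + x_j}{1 - x_j} = \cosh(2pz_j) - \cos(2pt), \qquad \frac{1 + \lambda_\alpha}{1 - \lambda_\alpha} = \cosh(2pe_\alpha) - \cos(2pt). \tag{7} $$
The rapidities λα satisfy, in the sector having total spin S z , the Bethe equations:
$$ \sum_{\beta\neq\alpha} \frac{1}{\lambda_\alpha - \lambda_\beta} - \sum_j \frac{S_j}{\lambda_\alpha - x_j} + \frac{1}{2J(S^z)} \left[ \frac{1 + J(S^z)(1 + S^z)}{1 + \lambda_\alpha} + \frac{1 - J(S^z)(1 + S^z)}{1 - \lambda_\alpha} \right] = 0. \tag{8} $$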
We point out that the reflection parameters enter explicitly only in the couplings
and in the eigenvectors (Eqs. (2) and (3)), while the eigenvalues depend on t only implicitly
(through Eq. (7)).
The rational limit of the models is recovered for p → 0.
For t = 0 and pt = π/2, the models reduce to the ones that we presented in Ref. [15]
(see Section 3.3). Thus the class of models we discuss in the present paper is a one-
parameter extension of the former class.
Using the fermionic realizations of su(2) the Hamiltonian (1) can be rephrased to
describe confined fermions interacting with pairing and exchange forces (see Eq. (48)).
The paper is organized as follows. In the next section we summarize the main
ingredients of the inverse scattering of VM with boundaries. In Section 3 we construct the
integrable models we deal with together with their exact solution. In Section 4 we use the
fermionic realization of the su(2) algebra to rewrite the Hamiltonians in a second quantized
form. Section 5 is devoted to final remarks. In Appendix A we review basic properties of
VM. In Appendix B we prove the integrability of a class of models when a more general
(off-diagonal) reflection at the boundary is applied (see Eqs. (B.1)–(B.4)). We also discuss
a generalization to su(n) case in Appendix C.
In this section, we review how the QISM is applied to VM, in order to obtain a family
of commuting transfer matrices. VM describe a system of interacting classical objects on a
two-dimensional lattice. As described in Appendix A, the partition function of the system
can be written as Z = Tr{tˆ(1) · · · tˆ(K)}, where tˆ(i) are operators in some appropriate
many-body linear space (in the sense that it is the direct product of N elementary linear
spaces). The VM is exactly solvable if [tˆ(i), tˆ(i′)] = 0 for all i, i′. Usually, it is assumed that the
dependence on the ith row of the lattice comes through a parameter ui , which takes values
on some domain of the complex plane. Then the requirement for exact solvability becomes
[tˆ(u), tˆ(v)] = 0, ∀u, v belonging to the domain.
The QISM provides a way of constructing classes of commuting operators tˆ(u), finding
their eigenvalues and their common eigenstates, and extracting Hamiltonians whose
integrals of motion are tˆ(u). The QISM is a procedure which starts from the R-matrix
and from the Lax operator to yield the transfer matrix tˆ(u). From the transfer matrix, a
class of Hamiltonians can be extracted in various ways, described below. The QISM
has a built-in Algebraic Bethe Ansatz (ABA) which provides the diagonalization of the
tˆ(u), and hence of the Hamiltonian.
The XXZ R-matrix is
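\[
R(u,v) = \begin{pmatrix}
a(u,v) & 0 & 0 & 0\\
0 & b(u,v) & c(u,v) & 0\\
0 & c(u,v) & b(u,v) & 0\\
0 & 0 & 0 & a(u,v)
\end{pmatrix}, \tag{9}
\]
where
\[
a(u,v) = \frac{\sinh[p(u - v + \eta)]}{p}, \qquad
b(u,v) = \frac{\sinh[p(u - v)]}{p}, \qquad
c(u,v) = \frac{\sinh(p\eta)}{p}.
\]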
It is connected to the Ř-matrix defined for VM by $R(u,v) = P_{12}\,\check{R}(1,2)$, where
\[
P_{12} = \begin{pmatrix}
1 & 0 & 0 & 0\\
0 & 0 & 1 & 0\\
0 & 1 & 0 & 0\\
0 & 0 & 0 & 1
\end{pmatrix}
\]
is the permutation operator, and it is assumed that the dependence of R upon the rows
comes through a parameter assigned to each row.
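As a numerical sanity check of the R-matrix (9), the following sketch verifies the Yang–Baxter equation R12(u − v) R13(u) R23(v) = R23(v) R13(u) R12(u − v) on three copies of the auxiliary space. It assumes the additive form R(u, v) = R(u − v) and the weights a, b, c written above; the function names (`r_matrix`, `ybe_residual`) and the sample values of u, v, η are illustrative choices, not taken from the paper.

```python
import numpy as np

def r_matrix(u, eta, p=1.0):
    """XXZ R-matrix of Eq. (9), written as a function of the difference u."""
    a = np.sinh(p * (u + eta)) / p
    b = np.sinh(p * u) / p
    c = np.sinh(p * eta) / p
    return np.array([[a, 0, 0, 0],
                     [0, b, c, 0],
                     [0, c, b, 0],
                     [0, 0, 0, a]], dtype=complex)

I2 = np.eye(2, dtype=complex)
# Permutation operator P12 on the tensor product of two 2-dimensional spaces.
P12 = np.array([[1, 0, 0, 0],
                [0, 0, 1, 0],
                [0, 1, 0, 0],
                [0, 0, 0, 1]], dtype=complex)

def on_12(m):
    """Embed a two-site operator on sites 1 and 2 of a three-site space."""
    return np.kron(m, I2)

def on_23(m):
    """Embed a two-site operator on sites 2 and 3."""
    return np.kron(I2, m)

def on_13(m):
    """Embed on sites 1 and 3 by swapping sites 2 and 3 around the (1,2) embedding."""
    swap23 = on_23(P12)
    return swap23 @ on_12(m) @ swap23

def ybe_residual(u, v, eta, p=1.0):
    """Largest entry of R12 R13 R23 - R23 R13 R12 (zero if the YBE holds)."""
    r12 = on_12(r_matrix(u - v, eta, p))
    r13 = on_13(r_matrix(u, eta, p))
    r23 = on_23(r_matrix(v, eta, p))
    return float(np.max(np.abs(r12 @ r13 @ r23 - r23 @ r13 @ r12)))

print(ybe_residual(0.37, -0.81, 0.25))          # hyperbolic case, p = 1
print(ybe_residual(0.37, -0.81, 0.25, p=1j))    # trigonometric case, p = i
```

Both residuals should vanish to machine precision; the same check with p = 1j exercises the trigonometric couplings.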
The corresponding Lax operators are
\[
L_j(u) = \frac{1}{p}\begin{pmatrix}
\sinh[p(u + \eta S_j^z)] & \sinh(p\eta)\, S_j^-\\
\sinh(p\eta)\, S_j^+ & \sinh[p(u - \eta S_j^z)]
\end{pmatrix}. \tag{10}
\]
Here p is the anisotropy parameter, in the sense that, when p ≠ 0 (in which case one can
put either p = 1 or p = i), the QISM yields a Hamiltonian with XXZ-type couplings,
while in the limit p → 0 the hyperbolic/trigonometric functions reduce to rational ones,
and the QISM generates a Hamiltonian having XXX couplings; η, instead, is the so-called
quantum parameter, which plays the role of ħ; as we shall see later, it gives the degree of
deformation of the classical algebra su(2) into the quantum algebra su_q(2). We remark that
the terminology is somewhat misleading: since we associate the algebra su(2) with spins,
realized either by true spins or by pairs of time-reversed electrons, in the limit η → 0 we
obtain genuine quantum Hamiltonians.
The Lax operators act on the auxiliary two-dimensional vector space V, and on the
quantum space Hn . They obey the fundamental Yang–Baxter relation Eq. (A.1), which in
terms of the R-matrix now reads
\[
R(u - v)\,\overset{1}{L}_j(u - z_j)\,\overset{2}{L}_j(v - z_j) = \overset{2}{L}_j(v - z_j)\,\overset{1}{L}_j(u - z_j)\,R(u - v). \tag{11}
\]
Due to the additive property of the R-matrix R(u, v) = R(u − v), parameters zj taking
into account on-site disorder through the lattice can be introduced.
As customary, $\overset{1}{L}_j(u) = L_j(u) \otimes 1$ and $\overset{2}{L}_j(u) = 1 \otimes L_j(u)$; the external product is
meant between two copies of the space V, while the multiplication of the elements of $L_j$,
which are operators on $H_j$, is an internal product. The relation (11) is actually obeyed only
for spin 1/2, i.e., for dim($H_j$) = 2. The order of the representation (that is, the dimension
of $H_j$) can be extended to larger values, keeping the dimension of V fixed to 2; however,
one then has to give up the algebra su(2) and introduce instead the quantum algebra su_q(2),
which ensures that the relation (11) is obeyed whatever the representation of the algebra.
The parameter q is related to the parameters p and η by q = exp(pη). The commutation
rules are
\[
\big[S_j^z, S_j^\pm\big] = \pm S_j^\pm, \qquad
\big[S_j^+, S_j^-\big] = \frac{\sinh(2p\eta\, S_j^z)}{\sinh(p\eta)}. \tag{12}
\]
In the quasi-classical limit η → 0, or in the isotropic limit p → 0, suq (2) reduces to su(2).
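The reduction of su_q(2) to su(2) can be illustrated numerically. The sketch below builds a (2S + 1)-dimensional matrix representation of the relations (12) and checks that the commutator [S+, S−] approaches 2S^z as η → 0. The explicit matrix elements, built from the q-number [x] = sinh(pηx)/sinh(pη), are the standard highest-weight construction and are an assumption of this example; they are not written out in the text. Real p is assumed so that the square roots are real.

```python
import numpy as np

def q_number(x, eta, p=1.0):
    """q-number [x] = sinh(p*eta*x) / sinh(p*eta); tends to x as eta -> 0."""
    return np.sinh(p * eta * x) / np.sinh(p * eta)

def suq2_matrices(S, eta, p=1.0):
    """Spin-S matrices (Sz, S+, S-) obeying the deformed relations of Eq. (12).

    Assumed matrix elements: S+ |m> = sqrt([S - m][S + m + 1]) |m + 1>.
    """
    dim = int(round(2 * S)) + 1
    m = S - np.arange(dim)              # basis ordered as m = S, S-1, ..., -S
    Sz = np.diag(m)
    Sp = np.zeros((dim, dim))
    for k in range(1, dim):
        # S+ raises the state with m[k] to the state with m[k-1] = m[k] + 1.
        Sp[k - 1, k] = np.sqrt(q_number(S - m[k], eta, p) *
                               q_number(S + m[k] + 1, eta, p))
    return Sz, Sp, Sp.T

def commutator(a, b):
    return a @ b - b @ a

# Deformed algebra at finite eta.
Sz, Sp, Sm = suq2_matrices(1.0, eta=0.3)
deformed_rhs = np.diag(np.sinh(2 * 0.3 * np.diag(Sz)) / np.sinh(0.3))
print(np.allclose(commutator(Sp, Sm), deformed_rhs))

# Quasi-classical limit: the commutator approaches 2 Sz.
Sz0, Sp0, Sm0 = suq2_matrices(1.5, eta=1e-4)
print(np.max(np.abs(commutator(Sp0, Sm0) - 2 * Sz0)))
```

The deviation from 2S^z scales as η², consistent with the statement that η measures the degree of deformation.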
Next, we consider the monodromy matrix $T(u) \equiv L_1(u - z_1) \cdots L_N(u - z_N)$. We have
an internal product over V and an external one over $H_j$ and $H_{j'}$; thus T(u) is an operator
over $V \otimes H_1 \otimes \cdots \otimes H_N$. It has the form
\[
T(u) = \begin{pmatrix} A(u) & B(u)\\ C(u) & D(u) \end{pmatrix},
\]
with A, B, C, D operators over $H = \bigotimes_j H_j$.
The local relation (11), and the ultra-locality property, $[L_j^{ab}(u), L_k^{cd}(v)] = 0$ for $j \neq k$,
\[
K_-(u) = K(u, \xi_-), \qquad K_+(u) = K(u + \eta, \xi_+),
\]
where
\[
K(u, \xi) = \frac{1}{p}\begin{pmatrix} \sinh[p(u + \xi)] & 0\\ 0 & -\sinh[p(u - \xi)] \end{pmatrix}. \tag{16}
\]
The family of commuting transfer matrices is
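\[
\hat{t}(u) = \mathrm{Tr}\,\big[K_+(u)\,U(u)\big], \tag{17}
\]
where
\[
U(u) = T(u)\,K_-(u)\,T^{-1}(-u) = \begin{pmatrix} \mathcal{A}(u) & \mathcal{B}(u)\\ \mathcal{C}(u) & \mathcal{D}(u) \end{pmatrix}.
\]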
The inverse of the monodromy matrix is defined as [3]
where $\sigma^y$ is the Pauli matrix in the representation where $\sigma^z$ is diagonal, and
$\det_q T(u) = A(u - \eta/2)D(u + \eta/2) - C(u - \eta/2)B(u + \eta/2)$ is the quantum determinant, which is a
su(2)-number.
The eigenvectors of $\hat{t}(u)$, in the sector with $S^z = \sum_j S_j - M$, are given by [19]
\[
\prod_{\alpha=1}^{M} \mathcal{B}(e_\alpha)\,|H\rangle,
\]
where
\[
|H\rangle = \bigotimes_j |S^z = S_j\rangle_j
\]
is the pseudo-vacuum state having all maximum $S_j^z$ eigenvalues. The eigenvalues are
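\[
\begin{aligned}
t(u) ={}& \frac{\sinh[2p(u + \eta)]}{\sinh[p(2u + \eta)]}\,\big\{\cosh[2p(u + \sigma)] - \cosh(2p\delta)\big\}\, a(u)\,d(-u + \eta)
\prod_{\alpha=1}^{M}\frac{\sinh[p(u - e_\alpha - \eta)]\,\sinh[p(u + e_\alpha)]}{\sinh[p(u + e_\alpha + \eta)]\,\sinh[p(u - e_\alpha)]}\\
&- \frac{\sinh(2pu)}{\sinh[p(2u + \eta)]}\,\big\{\cosh[2p(u - \sigma + \eta)] - \cosh(2p\delta)\big\}\, a(-u + \eta)\,d(u)
\prod_{\alpha=1}^{M}\frac{\sinh[p(u - e_\alpha + \eta)]\,\sinh[p(u + e_\alpha + 2\eta)]}{\sinh[p(u + e_\alpha + \eta)]\,\sinh[p(u - e_\alpha)]},
\end{aligned} \tag{18}
\]
where we put $\xi_+ + \xi_- = 2\sigma$, $\xi_+ - \xi_- = 2\delta$ and $a(u) = \prod_{j=1}^{N} \sinh[p(u - z_j + \eta S_j)]/p$,
$d(u) = \prod_{j=1}^{N} \sinh[p(u - z_j - \eta S_j)]/p$. The $e_\alpha$ satisfy the Bethe equations:
\[
\begin{aligned}
&\frac{\cosh[2p(e_\alpha + \sigma)] - \cosh(2p\delta)}{\cosh[2p(e_\alpha - \sigma + \eta)] - \cosh(2p\delta)}
\prod_{\beta\neq\alpha}\frac{\sinh[p(e_\alpha - e_\beta - \eta)]\,\sinh[p(e_\alpha + e_\beta)]}{\sinh[p(e_\alpha - e_\beta + \eta)]\,\sinh[p(e_\alpha + e_\beta + 2\eta)]}\\
&\qquad = \prod_j \frac{\sinh[p(e_\alpha - z_j - \eta S_j)]\,\sinh[p(e_\alpha + z_j - \eta(S_j + 1))]}{\sinh[p(e_\alpha - z_j + \eta S_j)]\,\sinh[p(e_\alpha + z_j + \eta(S_j - 1))]}.
\end{aligned} \tag{19}
\]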
The final step to obtain integrable models from the procedure above is to observe that
transfer matrices can be used as generating functional of Hamiltonians. A possibility is
\[
H \equiv \frac{\partial}{\partial u}\ln \hat{t}(u)\Big|_{u = u_c}. \tag{20}
\]
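\[
L_j(u) = \frac{\sinh(pu)}{p}\Big[1 + \eta\, l_j^{(1)}(u) + \eta^2\, l_j^{(2)}(u) + O(\eta^3)\Big], \qquad
R(u) = 1 \otimes 1 + \eta\, r(u) + O(\eta^2),
\]
\[
l_j^{(2)}(u) = \frac{p^2}{2}\big(S_j^z\big)^2\, 1. \tag{24}
\]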
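Thus, the monodromy matrix is:
\[
\begin{aligned}
T(u) \simeq P(u)\Big\{ 1 &+ p\eta \sum_j \frac{1}{\sinh(pu_j^-)}\Big[\cosh(pu_j^-)\, S_j^z\sigma^z + S_j^+\sigma^- + S_j^-\sigma^+\Big]\\
&+ p^2\eta^2 \Big[\sum_{j<k} \frac{\cosh(pu_j^-)\cosh(pu_k^-)\, S_j^z S_k^z\, 1 + S_j^+ S_k^-\,\sigma^-\sigma^+ + S_j^- S_k^+\,\sigma^+\sigma^-}{\sinh(pu_j^-)\,\sinh(pu_k^-)}
+ \frac{1}{2}\sum_j \big(S_j^z\big)^2\, 1\Big] + \text{irrelevant terms}\Big\},
\end{aligned}
\]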
+
detq T (−u + η/2) P (−u) 1 − pη
2
coth puj ,
j
1 z z
T −1
(−u) P −1
(−u) 1 + pη cosh pu+ + − − +
j Sj σ + Sj σ + Sj σ
j
sinh (pu+
j )
+ z z + − − + − + + −
cosh (pu+
j ) cosh (puk )Sj Sk 1 + Sj Sk σ σ + Sj Sk σ σ
+ p2 η2
j <k
sinh (pu+ +
j ) sinh (puk )
1 z 2
+ Sj 1 + irrelevant terms ,
2
j
where we defined $u_j^\pm \equiv u \pm z_j$, $P(u) = \prod_j \sinh(pu_j^-)/p$. The irrelevant terms are either
C-numbers or off-diagonal matrices, contributing to the trace with terms of order $\eta^3$. The
expansion of $K_\pm$ reads
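\[
K_\pm(u, \xi_\pm) \simeq \frac{1}{p}\begin{pmatrix} \sinh[p(u + \xi_\pm^{(0)})] & 0\\ 0 & -\sinh[p(u - \xi_\pm^{(0)})] \end{pmatrix}
+ \eta \begin{pmatrix} \big(\delta_{\pm,+} + \xi_\pm^{(1)}\big)\cosh[p(u + \xi_\pm^{(0)})] & 0\\ 0 & -\big(\delta_{\pm,+} - \xi_\pm^{(1)}\big)\cosh[p(u - \xi_\pm^{(0)})] \end{pmatrix},
\]
where $\delta_{\pm,+}$ is the usual Kronecker δ, and we took into account the expansion
\[
\xi_\pm \simeq \xi_\pm^{(0)} + \eta\,\xi_\pm^{(1)}. \tag{25}
\]
We define for convenience $2\xi = \xi_+^{(0)} - \xi_-^{(0)}$, $2\Sigma = \xi_+^{(0)} + \xi_-^{(0)}$, and $2\varsigma = \xi_+^{(1)} + \xi_-^{(1)}$. Then
the terms in the expansion of the transfer matrix given in Eq. (21) read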
1
τ̂ (0) (u) = 2
P (u)P −1 (−u) cosh (2pu) cosh (2pΣ) − cosh (2pσ ) ,
p
1
τ̂ (1) (u) = C-numbers + P (u)P −1 (−u) sinh (2pu) sinh(2pΣ)
p
− z
× coth puj + coth pu+
j Sj ,
j
τ̂ (2)
(u) = C-numbers
+ P (u)P −1 (−u) 2ς sinh (2pu) cosh(2pΣ)
+ z
× coth pu−j + coth puj Sj
j
+ cosh (2pu) sinh (2pΣ) − sinh (2pξ )
+ z
× coth pu−j + coth puj Sj
j
1 + z
+ sinh (2pu) sinh(2pΣ) coth pu+
j coth puk Sj
2
jk
1
+ cosh (2pu) cosh (2pΣ) − cosh (2pξ )
2
+ + z z
× coth pu− coth pu−
j + coth puj k + coth puk Sj Sk
jk
1 1 1
+ +
2 sinh(pu− −
j ) sinh(puk ) sinh(pu+ +
j ) sinh(puk )
+ −
× Sj Sj−
Sk + Sk+
1 Sk− +
Sj+ Sj−
Sk+
− cosh (2pu) cosh (2pξ ) − cosh (2pΣ)
2 sinh (pu− +
j ) sinh (puk )
jk
1 Sk− −
Sj+ Sj−
Sk+
+ sinh (2pu) sinh (2pξ ) . (26)
2 sinh (pu−
jk
+
j ) sinh (puk )
As can be seen from Eqs. (22), the operators $\hat\tau^{(2)}(u)$ commute with each other if
\[
\big[\hat\tau^{(1)}(u), \hat\tau^{(3)}(v)\big] + \big[\hat\tau^{(3)}(u), \hat\tau^{(1)}(v)\big] = 0.
\]
A sufficient condition for this relation to be fulfilled is that τ̂ (1) (u) is just a C-number. This
requires that Σ = 0.
First, we assume a classical boundary. By this we mean that the boundary
parameters ξ± are assumed to be independent of η, i.e., ς = 0. This case was analyzed
by one of the authors in Ref. [20]. The commuting family of operators τ̂ (2)(u) given above
reduces to
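\[
\begin{aligned}
\hat\tau_0^{(2)}(u) = 2P(u)P^{-1}(-u)\sinh^2(2pu)\Big\{ &\sum_j \frac{S^z}{\cosh(2pu) - \cosh(2pz_j)}\, S_j^z
+ \sum_{\substack{j,k\\ j\neq k}} \frac{1}{\big(\cosh(2pu) - \cosh(2pz_j)\big)\big(\cosh(2pu) - \cosh(2pz_k)\big)}\\
&\times \Big( \tfrac{1}{2}\big[\cosh(2pz_j) + \cosh(2pz_k) - 2\cosh(2p\xi)\big]\, S_j^z S_k^z\\
&\quad + \tfrac{1}{2}\big[\cosh p(z_j + z_k) - \cosh(2p\xi)\cosh p(z_j - z_k)\big]\big(S_j^+ S_k^- + S_j^- S_k^+\big)\\
&\quad - \tfrac{1}{2}\sinh(2p\xi)\,\sinh p(z_j - z_k)\,\big(S_j^+ S_k^- - S_j^- S_k^+\big)\Big)\Big\},
\end{aligned}
\]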
where we dropped the C-numbers and the Casimir coming from the term j = k in the
sums. A finite subset of u-independent operators in involution can be obtained by taking the
limits $u \to z_j$ of $\hat\tau^{(2)}(u)$, and dividing by the factor
\[
\sinh(2pz_j)\,P^{-1}(-z_j)\prod_{k\neq j}\sinh[p(z_j - z_k)]/p.
\]
They are
\[
\begin{aligned}
\hat\tau_{0j} = \sum_k S_k^z S_j^z + \sum_{k\neq j}\frac{1}{\cosh(2pz_j) - \cosh(2pz_k)}\Big\{ &\big[\cosh(2pz_j) + \cosh(2pz_k) - 2\cosh(2p\xi)\big]\, S_j^z S_k^z\\
&+ \big[\cosh p(z_j + z_k) - \cosh(2p\xi)\cosh p(z_j - z_k)\big]\big(S_j^+ S_k^- + S_j^- S_k^+\big)\\
&- \sinh(2p\xi)\,\sinh p(z_j - z_k)\,\big(S_j^+ S_k^- - S_j^- S_k^+\big)\Big\}.
\end{aligned} \tag{27}
\]
The $\hat\tau_j$ form a complete set, in the sense that any $\hat\tau^{(2)}(u)$ can be built from them according
to the formula
\[
\hat\tau_0^{(2)}(u) = 2P(u)P^{-1}(-u)\sinh^2(2pu)\sum_j \frac{1}{\cosh(2pu) - \cosh(2pz_j)}\,\hat\tau_{0j}.
\]
We notice the term $\sum_k S_k^z S_j^z \equiv S^z S_j^z$ in the operators $\hat\tau_j$. It describes a self-interaction of
the spins with the magnetic field generated by the spins themselves. In the next section we
shall see how to add an external magnetic field.
The eigenstates of operators (27) are given by
\[
|\Psi\rangle = \prod_{\alpha=1}^{M} S^-(e_\alpha)\,|H\rangle, \tag{28}
\]
where
\[
S^-(u) = \sum_j \frac{\cosh[p(u + z_j + 2\xi)] - \cosh[p(u - z_j)]}{\cosh(2pu) - \cosh(2pz_j)}\, S_j^- \;\propto\; \frac{d\mathcal{B}(u)}{d\eta}\Big|_{\eta = 0}.
\]
The rapidities $e_\alpha$ fulfill the first order term in the expansion of Eqs. (19) around η = 0
\[
\sum_{\beta\neq\alpha}\frac{1}{\cosh(2pe_\alpha) - \cosh(2pe_\beta)} - \sum_j \frac{S_j}{\cosh(2pe_\alpha) - \cosh(2pz_j)}
+ \frac{1/2}{\cosh(2pe_\alpha) - \cosh(2p\xi)} = 0. \tag{29}
\]
Putting $\cosh(2pe_\alpha) = \exp(2E_\alpha) + \cosh(2p\xi)$ and $\cosh(2pz_j) = \exp(2w_j) + \cosh(2p\xi)$,
the equations above reduce to the modified Gaudin’s equations presented in Ref. [15]:
\[
\sum_{\beta\neq\alpha}\coth(E_\alpha - E_\beta) - \sum_j S_j\coth(E_\alpha - w_j) + S^z = 0. \tag{30}
\]
In this section we show how to include a scalable term proportional to $S_j^z$ in the
operators $\hat\tau_j$. Such a term is crucial for physical applications since, as we shall see, it allows one
to introduce a non-uniform magnetic field in the Hamiltonian. Furthermore, when the spins
are realized by pairs of time-reversed electrons, $S_j^+ = c_{j\uparrow}c_{j\downarrow}$, $S_j^z = -\frac{1}{2}(\hat n_{j\uparrow} + \hat n_{j\downarrow} - 1)$, a
non-uniform magnetic field corresponds to a kinetic energy term (see Section 4). In order
to reach our goal, we have exploited the fact that $\xi_\pm$ can depend on η, i.e., $\xi_+^{(1)} + \xi_-^{(1)}$ is not
necessarily zero. We refer to this kind of boundary conditions as a non-classical boundary.
Thus, we put ς ≠ 0. We obtain
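\[
\begin{aligned}
\hat\tau^{(2)}(u) = 2P(u)P^{-1}(-u)\sinh^2(2pu)\Big\{ &\sum_j \frac{2\varsigma + S^z}{\cosh(2pu) - \cosh(2pz_j)}\, S_j^z
+ \sum_{\substack{j,k\\ j\neq k}} \frac{1}{\big(\cosh(2pu) - \cosh(2pz_j)\big)\big(\cosh(2pu) - \cosh(2pz_k)\big)}\\
&\times \Big( \tfrac{1}{2}\big[\cosh(2pz_j) + \cosh(2pz_k) - 2\cosh(2p\xi)\big]\, S_j^z S_k^z\\
&\quad + \tfrac{1}{2}\big[\cosh p(z_j + z_k) - \cosh(2p\xi)\cosh p(z_j - z_k)\big]\big(S_j^+ S_k^- + S_j^- S_k^+\big)\\
&\quad - \tfrac{1}{2}\sinh(2p\xi)\,\sinh p(z_j - z_k)\,\big(S_j^+ S_k^- - S_j^- S_k^+\big)\Big)\Big\}.
\end{aligned} \tag{32}
\]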
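The integrals of motion are obtained again by taking the limits $u \to z_j$, dividing now also by
$2\varsigma + S^z$:
\[
\begin{aligned}
\hat\tau_j \doteq \tau_j(S^z) = S_j^z - J(S^z)\sum_{k\neq j}\frac{1}{\cosh(2pz_j) - \cosh(2pz_k)}\Big\{ &\big[\cosh(2pz_j) + \cosh(2pz_k) - 2\cosh(2p\xi)\big]\, S_j^z S_k^z\\
&+ \big[\cosh p(z_j + z_k) - \cosh(2p\xi)\cosh p(z_j - z_k)\big]\big(S_j^+ S_k^- + S_j^- S_k^+\big)\\
&- \sinh(2p\xi)\,\sinh p(z_j - z_k)\,\big(S_j^+ S_k^- - S_j^- S_k^+\big)\Big\},
\end{aligned} \tag{33}
\]
\[
\hat\tau^{(2)}(u) = 2\big(2\varsigma + S^z\big)P(u)P^{-1}(-u)\sinh^2(2pu)\sum_j \frac{1}{\cosh(2pu) - \cosh(2pz_j)}\,\hat\tau_j.
\]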
For real J , the τ̂j are Hermitian if zj are real and ξ is pure imaginary, or vice versa. Their
eigenstates are still given by Eq. (28)
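\[
|\Psi\rangle = \prod_{\alpha=1}^{M} S^-(e_\alpha)\,|H\rangle, \tag{34}
\]
where
\[
S^-(u) = \sum_j \frac{\cosh[p(u + z_j + 2\xi)] - \cosh[p(u - z_j)]}{\cosh(2pu) - \cosh(2pz_j)}\, S_j^- \;\propto\; \frac{d\mathcal{B}(u)}{d\eta}\Big|_{\eta = 0}.
\]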
The difference with respect to the previous subsection is that the first order term in the expansion of
Eqs. (19) around η = 0 contains an additional term
\[
\sum_{\beta\neq\alpha}\frac{1}{\cosh(2pe_\alpha) - \cosh(2pe_\beta)} - \sum_j \frac{S_j}{\cosh(2pe_\alpha) - \cosh(2pz_j)}
+ \frac{\frac{1}{2}(1 + 1/J)}{\cosh(2pe_\alpha) - \cosh(2p\xi)} = 0. \tag{35}
\]
Putting $\cosh(2pe_\alpha) = \exp(2E_\alpha) + \cosh(2p\xi)$ and $\cosh(2pz_j) = \exp(2w_j) + \cosh(2p\xi)$,
the equations above reduce as well to the modified Gaudin’s equations presented in Ref.
[15]:
\[
\sum_{\beta\neq\alpha}\coth(E_\alpha - E_\beta) - \sum_j S_j\coth(E_\alpha - w_j) + \frac{1}{J(S^z)} = 0. \tag{36}
\]
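The change of variables leading from Eq. (35) to Eq. (36) can be checked numerically. The sketch below relies on the identity 1/(e^{2a} − e^{2b}) = e^{−2a}[1 + coth(a − b)]/2 and on J(S^z) = J/(1 − J S^z) from Eq. (2), which together imply that the left-hand side of (36) equals 2 exp(2E_α) times the left-hand side of (35) for arbitrary E_α and w_j, not only at the Bethe roots. The sample values of E, w, S_j and J are arbitrary illustrative numbers.

```python
import numpy as np

def lhs_eq35(E, w, S, J):
    """Left-hand side of Eq. (35), written with
    y_alpha = cosh(2 p e_alpha) - cosh(2 p xi) = exp(2 E_alpha) and
    Y_j     = cosh(2 p z_j)     - cosh(2 p xi) = exp(2 w_j)."""
    y, Y = np.exp(2 * E), np.exp(2 * w)
    out = np.empty(len(E))
    for a in range(len(E)):
        others = np.delete(y, a)
        out[a] = (np.sum(1.0 / (y[a] - others))
                  - np.sum(S / (y[a] - Y))
                  + 0.5 * (1.0 + 1.0 / J) / y[a])
    return out

def lhs_eq36(E, w, S, J):
    """Left-hand side of Eq. (36), with 1/J(S^z) = 1/J - S^z."""
    M = len(E)
    Sz = np.sum(S) - M                  # sector S^z = sum_j S_j - M
    inv_J_eff = 1.0 / J - Sz
    out = np.empty(M)
    for a in range(M):
        others = np.delete(E, a)
        out[a] = (np.sum(1.0 / np.tanh(E[a] - others))
                  - np.sum(S / np.tanh(E[a] - w))
                  + inv_J_eff)
    return out

# Arbitrary (non-root) sample point: the two forms agree up to the factor 2 exp(2E).
E = np.array([0.3, -0.4, 1.1])
w = np.array([0.9, -1.2, 0.1, 0.55])
S = np.array([0.5, 1.0, 1.5, 0.5])
J = 0.7
print(np.allclose(2 * np.exp(2 * E) * lhs_eq35(E, w, S, J), lhs_eq36(E, w, S, J)))
```

Since the two expressions differ only by a nonvanishing factor, they have the same roots.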
There are parameterizations yielding rational Bethe equations. Among these, we consider
\[
\frac{1 + x_j}{1 - x_j} = \cosh(2pz_j) - \cosh(2p\xi), \qquad
\frac{1 + \lambda_\alpha}{1 - \lambda_\alpha} = \cosh(2pe_\alpha) - \cosh(2p\xi). \tag{38}
\]
Then the eigenvalues are
\[
\tau_j = S_j\Big[ 1 - J(S^z)\sum_{k\neq j} S_k\,\frac{1 - x_j x_k}{x_j - x_k} + J(S^z)\sum_{\alpha}\frac{1 - x_j\lambda_\alpha}{x_j - \lambda_\alpha} \Big]. \tag{39}
\]
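The rational parametrization (38) can be checked in the same spirit: substituting y = (1 + λ)/(1 − λ) into the hyperbolic Bethe equations (35) and dividing by (1 − λ_α)²/2 should reproduce the rational form quoted in Eq. (8). The sketch below verifies this identity at an arbitrary point; the helper names and the sample values of λ_α, x_j, S_j and J are illustrative assumptions.

```python
import numpy as np

def to_hyperbolic(v):
    """Map of Eq. (38): v -> (1 + v) / (1 - v)."""
    return (1.0 + v) / (1.0 - v)

def lhs_hyperbolic(y, Y, S, J):
    """Left-hand side of Eq. (35) in the variables y_alpha, Y_j."""
    out = np.empty(len(y))
    for a in range(len(y)):
        others = np.delete(y, a)
        out[a] = (np.sum(1.0 / (y[a] - others))
                  - np.sum(S / (y[a] - Y))
                  + 0.5 * (1.0 + 1.0 / J) / y[a])
    return out

def lhs_rational(lam, x, S, J):
    """Left-hand side of the rational Bethe equations, Eq. (8)."""
    M = len(lam)
    Sz = np.sum(S) - M                  # sector S^z = sum_j S_j - M
    J_eff = J / (1.0 - J * Sz)          # J(S^z) of Eq. (2)
    out = np.empty(M)
    for a in range(M):
        others = np.delete(lam, a)
        boundary = ((1.0 + J_eff * (1.0 + Sz)) / (1.0 + lam[a])
                    + (1.0 - J_eff * (1.0 + Sz)) / (1.0 - lam[a])) / (2.0 * J_eff)
        out[a] = (np.sum(1.0 / (lam[a] - others))
                  - np.sum(S / (lam[a] - x))
                  + boundary)
    return out

lam = np.array([0.15, -0.35, 0.6])
x = np.array([0.8, -0.7, 0.05, 0.4])
S = np.array([0.5, 1.0, 1.5, 0.5])
J = 0.7
hyper = lhs_hyperbolic(to_hyperbolic(lam), to_hyperbolic(x), S, J)
print(np.allclose(2 * hyper / (1 - lam) ** 2, lhs_rational(lam, x, S, J)))
```

The prefactor (1 − λ_α)²/2 never vanishes for |λ_α| < 1, so the two sets of equations share their solutions in that range.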
We show that, at ξ = 0 and pξ = iπ/2, these integrals of motion are equivalent to the
modified Gaudin’s Hamiltonians introduced in Ref. [15]. The integrals of motion reduce to
\[
\hat\tau_j = S_j^z - J(S^z)\sum_{k\neq j}\left\{\frac{\cosh(2pz_j) + \cosh(2pz_k) \mp 2}{\cosh(2pz_j) - \cosh(2pz_k)}\, S_j^z S_k^z
+ \frac{\cosh[p(z_j + z_k)] \mp \cosh[p(z_j - z_k)]}{\cosh(2pz_j) - \cosh(2pz_k)}\big(S_j^+ S_k^- + S_j^- S_k^+\big)\right\}, \tag{41}
\]
where the upper sign refers to ξ = 0 and the lower one to pξ = iπ/2. We make the change
of variable $\sinh(pz_j) = \exp w_j$ if ξ = 0, and $\cosh(pz_j) = \exp w_j$ if pξ = iπ/2, obtaining
\[
\hat\tau_j = S_j^z - J(S^z)\sum_{k\neq j}\left[\coth(w_j - w_k)\, S_j^z S_k^z + \frac{1}{2\sinh(w_j - w_k)}\big(S_j^+ S_k^- + S_j^- S_k^+\big)\right]. \tag{42}
\]
In this section we employ the fermionic realization of su(2) to write the integrable
models we found Eq. (42) in second quantization. The two orthogonal Dj - and Dl -
dimensional realizations are
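\[
K_j^+ = \sum_{\delta_j=1}^{D_j} c_{j\delta_j\downarrow}\,c_{j\delta_j\uparrow}, \qquad
K_j^- = \big(K_j^+\big)^\dagger, \qquad
K_j^z = \frac{1}{2}\sum_{\delta_j=1}^{D_j}\big(1 - \hat n_{j\delta_j\uparrow} - \hat n_{j\delta_j\downarrow}\big), \tag{43}
\]
and
\[
S_l^+ = \sum_{\rho_l=1}^{D_l} c^\dagger_{l\rho_l\uparrow}\,c_{l\rho_l\downarrow}, \qquad
S_l^- = \big(S_l^+\big)^\dagger, \qquad
S_l^z = \frac{1}{2}\sum_{\rho_l=1}^{D_l}\big(\hat n_{l\rho_l\uparrow} - \hat n_{l\rho_l\downarrow}\big), \tag{44}
\]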
\[
H_S = \sum_{l=1}^{\Omega_S} 2\zeta_l\,\tau_l(S) + \sum_{l=1}^{\Omega_S} J_{ll}^{xx}\, S_l^2, \tag{47}
\]
where the operators τ(O), O = K, S, are defined in Eq. (33). Due to the orthogonality of the
realizations (43), (44) we observe that [HK , HS ] = 0. Furthermore, HK and HS are block-
diagonal, and their common eigenstates are the direct product of the eigenstates of HK
and of HS , each restricted to the subspace corresponding to one of its blocks [16]. The
integrability together with the exact solution of the Hamiltonian (45) follows from the
integrability of each HK , HS proved in Section 3.2 and from Eqs. (34), (36), and (37).
Finally, the second quantized form of the Hamiltonian (45) reads
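\[
\begin{aligned}
H = \sum_{a\sigma}\varepsilon_{a\sigma}\hat n_{a\sigma} + \sum_{ab}\Big[ &U_{ab}\,(\hat n_{a\uparrow} + \hat n_{a\downarrow})(\hat n_{b\uparrow} + \hat n_{b\downarrow}) + g_{ab}\, c^\dagger_{a\uparrow}c^\dagger_{a\downarrow}c_{b\downarrow}c_{b\uparrow}\\
&+ J^{z}_{ab}\,(\hat n_{a\uparrow} - \hat n_{a\downarrow})(\hat n_{b\uparrow} - \hat n_{b\downarrow}) + J^{xx}_{ab}\, c^\dagger_{a\uparrow}c^\dagger_{b\downarrow}c_{b\uparrow}c_{a\downarrow}\Big],
\end{aligned} \tag{48}
\]
where a, b = 1, . . . , Ω number the levels, and $c_{a,\sigma} \equiv c_{j(a),\delta_{j(a)}(a),\sigma}$; the constant in
Eq. (45) turns out to be $E_0 = \sum_j D_j\varepsilon_j + \sum_{jk} D_j D_k U_{jk}$.
The kinetic energy term reads
\[
\sum_{a\sigma}\varepsilon_{a\sigma}\hat n_{a\sigma} = \sum_a \Big[\frac{1}{2}(\varepsilon_{a\uparrow} + \varepsilon_{a\downarrow})(\hat n_{a\uparrow} + \hat n_{a\downarrow}) + \frac{1}{2}(\varepsilon_{a\uparrow} - \varepsilon_{a\downarrow})(\hat n_{a\uparrow} - \hat n_{a\downarrow})\Big].
\]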
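The splitting of the kinetic energy into a part coupled to the total occupation and a part coupled to the spin polarization is elementary arithmetic, which the following sketch checks for every occupation of a single level. The function name `kinetic_split` and the sample level energies are illustrative assumptions.

```python
from itertools import product

def kinetic_split(eps_up, eps_dn, n_up, n_dn):
    """Return the (charge, spin) parts of eps_up*n_up + eps_dn*n_dn for one level.

    charge part: (1/2)(eps_up + eps_dn)(n_up + n_dn)
    spin part:   (1/2)(eps_up - eps_dn)(n_up - n_dn)
    """
    charge = 0.5 * (eps_up + eps_dn) * (n_up + n_dn)
    spin = 0.5 * (eps_up - eps_dn) * (n_up - n_dn)
    return charge, spin

eps_up, eps_dn = 1.3, 0.4               # hypothetical single-particle energies
for n_up, n_dn in product((0, 1), repeat=2):
    charge, spin = kinetic_split(eps_up, eps_dn, n_up, n_dn)
    direct = eps_up * n_up + eps_dn * n_dn
    print(n_up, n_dn, abs(charge + spin - direct) < 1e-12)
```

The first part defines the class energies ε_a = (ε_{a↑} + ε_{a↓})/2 and the second the fields ζ_a = (ε_{a↑} − ε_{a↓})/2 used in the partitions of the levels.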
We choose a partition of the single particle levels into equivalence classes in such a
way that all levels having the same value of $\varepsilon_a \equiv \frac{1}{2}(\varepsilon_{a\uparrow} + \varepsilon_{a\downarrow})$ belong to the same class
(hence we write $\varepsilon_j$ instead of $\varepsilon_a$, where j labels the class), and, analogously, a
second partition is defined in such a way that all the levels having the same value of
$\zeta_a \equiv \frac{1}{2}(\varepsilon_{a\uparrow} - \varepsilon_{a\downarrow})$ belong to the same class (hence we write the common value as $\zeta_l$).
The couplings between levels a and b depend only on the equivalence classes of the two
levels. For $j \equiv j(a) \neq k \equiv k(b)$ and $l \equiv l(a) \neq m \equiv m(b)$, they are
cosh[p(zj + zk )] − cosh[p(zj − zk − 2itK )]
gab = gj k = 2JK K z (ηj − ηk ) ,
cosh(2pzj ) − cosh(2pzk )
cosh(2pzj ) + cosh(2pzk ) − 2 cos (2ptK )
4Uab = 4Uj k = JK K z (ηj − ηk ) ,
cosh(2pzj ) − cosh(2pzk )
cosh[p (yl + ym )] − cosh[p (yl − ym + 2itS )]
Jabxx
= Jlmxx
= −2JS S z (ζl − ζm ) ,
cosh(2p yl ) − cosh(2p ym )
z z cosh(2p yl ) + cosh(2p ym ) − 2 cos (2p tS )
Jab = Jlm = −JS S z (ζl − ζm ) . (49)
cosh(2p yl ) − cosh(2p ym )
For j = k, we have the relation gjj = 4Ujj , and gjj can be chosen arbitrarily.4
Analogously, for l = m, we have Jllz = Jllxx .
5. Conclusions
procedure changing the values of the horizontal legs, associating the left one to a row, and
the right one to a column; a block matrix $L_j(i)$ is finally obtained, which is conventionally
called the Lax operator. It is a ($H_i \times H_i$) matrix whose entries $L_j^{h_{i,j},h_{i+1,j}}(i)$ are in turn
($V_j \times V_j$) matrices, i.e., operators over the linear space $H_j$. The partition function of the
(1 × N) lattice with periodic boundary conditions in vertical and horizontal direction is,
in terms of the Lax operators, Z1 = TrV TrH {L1 (1) · · · LN (1)} ≡ TrV tˆ(1), where by TrH
we mean the trace over the horizontal space, and by TrV the trace over the vertical ones;
we introduced the transfer matrix $\hat t(i) \equiv \mathrm{Tr}_H\{L_1(i) \cdots L_N(i)\}$, where the hat is meant to
remind that the transfer matrix is an operator over $H = \bigotimes_j H_j$, the direct product of the
linear spaces associated to the vertical legs. For a (K × N) lattice, the partition function is
$Z = \mathrm{Tr}_V\{\hat t(1) \cdots \hat t(K)\}$. If $[\hat t(i), \hat t(i')] = 0$, ∀i, i′, it is possible to simultaneously diagonalize
the $\hat t(i)$, obtaining $Z = \sum_r \prod_{i=1}^{K} t_r(i)$, where $t_r(i)$ is the rth eigenvalue of $\hat t(i)$. In this
case, the VM is exactly solvable.
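The statement that commuting transfer matrices reduce the partition function to a sum over products of eigenvalues can be illustrated with a toy example. The sketch below does not construct an actual vertex model: it simply builds K symmetric matrices sharing one random eigenbasis (so that they commute by construction) and compares Tr{t(1)···t(K)} with Σ_r Π_i t_r(i). The function names, dimensions and random seed are assumptions made for the illustration.

```python
import numpy as np

def commuting_family(dim, K, seed=0):
    """Return K commuting symmetric matrices and their eigenvalues.

    All matrices share the same orthonormal eigenbasis q, hence they commute.
    eigs[i, r] is the r-th eigenvalue of the i-th matrix.
    """
    rng = np.random.default_rng(seed)
    q, _ = np.linalg.qr(rng.normal(size=(dim, dim)))
    eigs = rng.uniform(0.5, 1.5, size=(K, dim))
    mats = [q @ np.diag(e) @ q.T for e in eigs]
    return mats, eigs

def partition_from_trace(mats):
    """Z = Tr{t(1) ... t(K)} computed by explicit matrix multiplication."""
    prod = np.eye(mats[0].shape[0])
    for t in mats:
        prod = prod @ t
    return np.trace(prod)

def partition_from_eigenvalues(eigs):
    """Z = sum_r prod_i t_r(i), valid when the t(i) are simultaneously diagonalizable."""
    return np.sum(np.prod(eigs, axis=0))

mats, eigs = commuting_family(dim=6, K=4)
print(partition_from_trace(mats), partition_from_eigenvalues(eigs))
```

For a genuine vertex model the common eigenbasis is what the algebraic Bethe ansatz provides; here it is put in by hand.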
From the tˆ(i), it is commonly possible to extract many-body Hamiltonians of interest.5
Thus, a given exactly solvable vertex model corresponds uniquely to a family of
commuting many-body operators.
It turns out that the transfer matrices commute with each other, and thus the
corresponding vertex model is exactly solvable, if Hi = Hi′ = H , ∀i, i′, and a family of
(H 2 × H 2 ) matrices, the Ř-matrices, exists, such that the Lax operators obey the relation
Given a Ř-matrix, this is a very strict requirement, which in general implies that many legs
configurations are not allowed, i.e., their weight is zero, while the allowed ones are related
to each other by some parametrization.
A relevant case is when the dimensions of vertical and horizontal space are equal, and
they do not depend on the row or column: Hi = Vj ≡ 2S + 1. Then, the Ř-matrices are
nothing but the Lax operators where the matrix elements have been written down explicitly in
their matrix representation. It turns out that the entries of the Lax operators are matrices
5 In general, such Hamiltonians are not by any means related to the Hamiltonian of the VM.
belonging to the (2S + 1)-dimensional realization of su(2), i.e., spins over the Hilbert6
space Hj . There is the drawback that the Ř-matrix is difficult to determine and to handle,
since its dimension increases very fast with S [29,30]. A technique to build larger Ř-
matrices using the (4 × 4) Ř-matrices (the simplest ones) as building blocks was devised
by Kulish, Reshetikhin, and Sklyanin [31]. In the present paper, by means of the quasi-
classical expansion, we will build up operators for spins higher than 1/2 still making use
of (4 × 4) Ř-matrices.
In this appendix we construct the Hamiltonian when the general solution of the
reflection equation (15) is considered [33,34]. In this case we have
\[
K_+(u) = \frac{1}{p}\begin{pmatrix}
\sinh[p(u + \eta + \xi_+)] & \kappa_+ \sinh[2p(u + \eta)]\\
\kappa_+ \sinh[2p(u + \eta)] & -\sinh[p(u + \eta - \xi_+)]
\end{pmatrix}. \tag{B.1}
\]
Since we want $\hat\tau^{(1)}$ to be a C-number, we must impose $\kappa_+ \simeq i\eta c$. Thus, the final effect of
the general reflection results in an additional term in the second order of the transfer matrix
\[
\hat\tau^{(2)}(u) \to \hat\tau^{(2)}(u) + \frac{2ic}{p}\,P(u)P^{-1}(-u)\sinh^2(2pu)
\left[\sum_j \frac{\sinh[p(z_j - \xi)]}{\cosh(2pu) - \cosh(2pz_j)}\, S_j^+ - \sum_j \frac{\sinh[p(z_j + \xi)]}{\cosh(2pu) - \cosh(2pz_j)}\, S_j^-\right]. \tag{B.2}
\]
The Hamiltonian is again built according to
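\[
H = \sum_j 2h_j\,\hat\tau_j, \tag{B.3}
\]
where the integrals of motion are
\[
\begin{aligned}
\hat\tau_j ={}& \big(1 - J S^z\big) S_j^z - icJ\left[\frac{\sinh[p(z_j - \xi)]}{p}\, S_j^+ - \frac{\sinh[p(z_j + \xi)]}{p}\, S_j^-\right]\\
&- J\sum_{k\neq j}\frac{1}{\cosh(2pz_j) - \cosh(2pz_k)}\Big\{ \big[\cosh(2pz_j) + \cosh(2pz_k) - 2\cosh(2p\xi)\big]\, S_j^z S_k^z\\
&\qquad + \big[\cosh p(z_j + z_k) - \cosh(2p\xi)\cosh p(z_j - z_k)\big]\big(S_j^+ S_k^- + S_j^- S_k^+\big)\\
&\qquad - \sinh(2p\xi)\,\sinh p(z_j - z_k)\,\big(S_j^+ S_k^- - S_j^- S_k^+\big)\Big\}.
\end{aligned} \tag{B.4}
\]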
We point out that the Hamiltonian is Hermitian for real c and ξ = it with real t. The
diagonalization of this class of Hamiltonians might be achieved by the functional Bethe ansatz
[6]. Nevertheless, it seems worthwhile to study the models (B.3), (B.4) because of their potential
application to condensed matter (see also Eqs. (43), (44)).
1 pλ 1 −pλ
F− n
K− (λ) = e sinh p(ξ− − λ) E aa + e sinh p(ξ− + λ) E aa ,
p p
a=1 a=F− +1
1 −pλ−2pηa
F+
K+ (λ) = e sinh p(ξ+ + λ) E aa
p
a=1
1
n
+ epλ+pη(n−2a) sinh p(ξ+ − ηn − λ) E aa , (C.2)
p
a=F+ +1
sinh(2pzj )
+ (n − F) aa
E
sinh(p(ξ − zj )) sinh(p(ξ + zj )) j
n
! aa
+ ξ−(1) coth p(ξ + zj ) + ξ+(1) coth p(ξ − zj ) E
j
a=F+1
n
+ coth p(zj − zk ) + coth p(zj + zk ) aa E
E aa
j k
k=j a
n
ep(zj −zk ) sgn(a−b)
+ ba E
E ab
sinh(p(zj − zk )) j k
a=b
F
n
sinh(p(ξ + zj )) ep(zk −zj )
+ E ab
ba E
sinh(p(ξ − zj )) sinh(p(zj + zk )) j k
a=1 b=F+1
n
F
sinh(p(ξ − zj )) ep(zj −zk )
+ ba E
E ab
sinh(p(ξ + zj )) sinh(p(zj + zk )) j k
a=F+1 b=1
F
ep(zj +zk ) sgn(b−a) ba ab
n
ep(zj +zk ) sgn(b−a) ba ab
+ E E + E E ,
sinh(p(zj + zk )) j k sinh(p(zj + zk )) j k
a,b=1 a,b=F+1
a=b a=b
M1
(1) (1)
+ coth p zj + ek + coth p zj − ek . (C.3)
k
The Bethe ansatz equations can be obtained in the same limit of a result in Ref. [35], and
we have (for a = 1, 2, . . . , n − 1)
Ma
(a) (a) (a) (a)
2 coth p ej + ek + coth p ej − ek
k=j
+ δa,F n + ξ−(1) − ξ+(1) coth p(z − ξ ) + coth p(z + ξ )
Ma+1
= coth p ej(a) + ek(a+1) + coth p ej(a) − ek(a+1)
k
Ma−1
+ coth p ej(a) + ek(a−1) + coth p ej(a) − ek(a−1) . (C.4)
k
References
[1] R.J. Baxter, Exactly Solved Models in Statistical Mechanics, Academic Press, 1982.
[2] M. Gaudin, La fonction d’onde de Bethe, Masson, 1983.
[3] V.E. Korepin, N.M. Bogoliubov, A.G. Izergin, Quantum Inverse Scattering Method and Correlation
Functions, Cambridge Univ. Press, 1993.
[4] M. Gaudin, J. Physique 37 (1976) 1087.
[5] K. Hikami, P.P. Kulish, M. Wadati, J. Phys. Soc. Jpn. 61 (1992) 3071.
[6] E.K. Sklyanin, J. Sov. Math. 47 (1989) 2473.
[7] M.C. Cambiaggio, A.M.F. Rivas, M. Saraceno, Nucl. Phys. A 624 (1997) 157.
[8] R.W. Richardson, Phys. Lett. 3 (1963) 277.
[9] J. von Delft, D.C. Ralph, Phys. Rep. 345 (2001) 61.
[10] A. Mastellone, G. Falci, R. Fazio, Phys. Rev. Lett. 80 (1998) 4542.
[11] L. Amico, G. Falci, R. Fazio, J. Phys. A 34 (2001) 6425–6434.
[12] G. Sierra, Nucl. Phys. B 572 (2000) 517–534.
[13] H.M. Babujian, J. Phys. A 26 (1993) 6981.
[14] N. Reshetikhin, A. Varchenko, in: Geometry Topology, and Physics, 1995, p. 293.
[15] L. Amico, A. Di Lorenzo, A. Osterloh, Phys. Rev. Lett. 86 (2001) 5759.
[16] L. Amico, A. Di Lorenzo, A. Osterloh, Nucl. Phys. B 614 (2001) 449.
[17] J. Dukelsky, C. Esebbag, P. Schuck, Phys. Rev. Lett. 87 (2001) 66403.
[18] J. von Delft, R. Poghossian, Algebraic Bethe ansatz for a discrete-state BCS pairing model, cond-
mat/0106405.
[19] E.K. Sklyanin, J. Phys. A 21 (1988) 2375.
[20] K. Hikami, J. Phys. A 28 (1995) 4997.
[21] K. Hikami, J. Phys. A 28 (1995) 4053.
[22] I. Cherednik, Theor. Math. Phys. 61 (1984) 35.
[23] H.J. de Vega, Nucl. Phys. B 240 (1984) 495.
[24] H.M. Babujian, Phys. Lett. A 90 (1982) 479.
[25] L.A. Takhtajan, Phys. Lett. A 87 (1982) 479.
[26] L. Amico, A. Di Lorenzo, A. Mastellone, A. Osterloh, R. Raimondi, Ann. Phys. 299 (2002) 228.
[27] A. Di Lorenzo, A new class of exactly solvable models, PhD thesis, Università di Catania, Italy, 2001.
[28] R.W. Richardson, private communication.
[29] V.I. Fateev, A.B. Zamolodchikov, Sov. J. Nucl. Phys. 32 (1980) 298.
[30] K. Sogo, Y. Akutsu, T. Abe, Prog. Theor. Phys. 70 (1983) 730.
[31] P.P. Kulish, N. Reshetikhin, E.K. Sklyanin, Lett. Math. Phys. 5 (1981) 393.
[32] P.P. Kulish, N. Manojlovic, Lett. Math. Phys. 55 (2001) 77.
[33] H.J. de Vega, A. Gonzalez-Ruiz, J. Phys. A 26 (1993) L519.
[34] S. Ghoshal, A. Zamolodchikov, Int. J. Mod. Phys. A 9 (1994) 3841.
[35] H.J. de Vega, A. González-Ruiz, Mod. Phys. Lett. A 9 (1994) 2207.
[36] A. Doikou, R.I. Nepomechie, Nucl. Phys. B 530 (1998) 641.
Nuclear Physics B 644 [FS] (2002) 433–450
www.elsevier.com/locate/npe
Abstract
We compute the O(1/N) correction to the stability critical exponent, ω, in the Landau–Ginzburg–
Wilson model with O(N) × O(m) symmetry at the stable chiral fixed point and the stable direction
at the unstable antichiral fixed point. Several constraints on the O(1/N) coefficients of the four loop
perturbative β-functions are computed.
2002 Elsevier Science B.V. All rights reserved.
1. Introduction
0550-3213/02/$ – see front matter 2002 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 ( 0 2 ) 0 0 8 1 8 - 0
434 J.A. Gracey / Nuclear Physics B 644 [FS] (2002) 433–450
above the Heisenberg one, depending on the value of N , which are either stable or unstable,
[10,11]. However, the properties of the phase transitions in this Landau–Ginzburg–Wilson
class of models are controversial, [9,11–21]. First, the two-dimensional nonlinear sigma
model with the same symmetry is believed to reside in the same universality class, [10,
11]. Therefore, it ought to be possible to use either model to compute useful information
on the critical properties of the physically interesting stable chiral fixed point. However,
it has been pointed out in [12] that the results from a (d = 2 + ε)-dimensional study do
not match those deduced from the higher-dimensional theory. Second, in field theoretical
calculations such as perturbation theory the phase transitions are regarded as second order
whilst numerical or Monte Carlo simulations would appear to indicate transitions are first
order in nature, [9,13–18]. Moreover, the particular behaviour depends on the value of N
though the precise range where this occurs is still undetermined. Further, one recent work
has suggested an interesting point of view for the origin of the disagreements for N = 2
and 3 in both two and three dimensions. In [22] it is argued that it is due to the fact that
one flows to the stable chiral fixed point along a spiral-like trajectory in contrast to the
usual descent. To endeavour to clarify the issue the perturbative analysis of the Landau–
Ginzburg–Wilson critical behaviour has recently been extended to a higher loop order in
[11]. Previous one and two loop computations were carried out in [6–8]. The new three
loop MS calculations of the renormalization group functions such as the β-functions of
both coupling constants and anomalous dimensions, [11], have provided more accurate
information on the fixed point locations and the range of parameters for which they exist
and are stable or not. Indeed in this respect the models with the more general symmetry
group of O(N) × O(m) were studied with m only set to m = 2 at the end, [11]. Such three
loop calculations represent the current perturbative status of the model.
However, it is in principle possible to extend this three loop MS dimensionally
regularized calculation to the next order, though it will involve a huge number of Feynman
diagrams. In previous work in the simpler O(N) models the perturbative computations
were complemented with large N calculations of the same renormalization group functions
to several orders in powers of 1/N . For instance, various critical exponents are available
at both O(1/N 2 ) and O(1/N 3 ), [23–26], as functions of d, with 2 < d < 4, which
correspond through the critical renormalization group equation with the renormalization
group functions. More correctly these critical exponents were computed in d-dimensions
at the nontrivial Wilson–Fisher fixed point of the d-dimensional β-function which
corresponds to the Heisenberg fixed point in three dimensions in O(N) φ 4 theory or the
O(N) nonlinear sigma model which does lie in the same universality class. The coefficients
of the powers of ε in the ε-expansion of such exponents in d = 4 − 2ε dimensions are in
exact agreement with the perturbative coefficients in the renormalization group function
to the perturbative order they are known, at a particular order in 1/N . More significantly
the large N critical exponents contain new higher order information in the uncomputed
coefficients which would therefore assist future perturbative calculations. Indeed in the
Landau–Ginzburg–Wilson context various critical exponents have already been computed
in the model with O(N) × O(m) symmetry at the two nontrivial fixed points which exist
in addition to the Heisenberg fixed point, [6,11]. These are known as the chiral stable, (CS),
and antichiral unstable, (AU), fixed points. Moreover, the results for the critical exponents η
and ν at O(1/N 2 ) at both CS and AU are in agreement with the new perturbative results of
[11]. However, to extract the new information encoded in the exponents at four and higher
loops in these d-dimensional functions in relation to the four-dimensional theory one
requires knowledge of the location of the fixed points to the same loop and large N order.
From the critical renormalization group equation such information is encoded in the critical
exponent ω which relates to the critical β-function slope of the model in the universality
class which is renormalizable in four dimensions. In the Landau–Ginzburg–Wilson model
this has not yet been computed at O(1/N) at either CS or AU. Therefore, it is the purpose
of this article to rectify this gap and thereby unlock the door to higher order information on
the structure of the perturbative renormalization group equations such as the β-function.
In O(N) φ^4 theory this problem has already been resolved at O(1/N^2), [26], where the
O(1/N) value for ω is relatively trivial to establish, [27], with the elegant machinery
of the large N critical point method of [23,24]. However, for the CS and AU
Landau–Ginzburg–Wilson fixed points the leading order, O(1/N), analysis is much more involved
since one is studying a model with two independent coupling constants. Therefore, whilst
our calculation also opens the road to an O(1/N^2) computation, it represents a nontrivial
example of how one treats the large N formalism for ω exponents explicitly in a quantum
field theory with more than one coupling constant which deserves detailed treatment.
The paper is organized as follows. In Section 2 we recall the background details of
the model we are interested in and derive explicit expressions for the location of the
various fixed points from the explicit three loop perturbative results at O(1/N) as well
as the perturbative values of the eigenexponents of the stability matrix at criticality. These
are related to the exponents ω which we are interested in. Section 3 is devoted to the
development of the large N formalism for computing these various ω and the explicit
d-dimensional expressions are given at O(1/N). Various concluding remarks are given in
Section 4.
2. Background
structure of (2.1) by considering the β-functions for each coupling which have been
computed to several orders, [8,11]. At three loops these are
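\[
\begin{aligned}
\beta_u(u,v) ={}& \frac{1}{2}(d-4)u + \frac{(mn+8)}{6}u^2 - \frac{1}{3}(m-1)(n-1)v\left(u - \frac{v}{2}\right) \\
& - \frac{1}{6}(3mn+14)u^3 + (m-1)(n-1)\left[\frac{11}{9}u^2 - \frac{13}{12}uv + \frac{5}{18}v^2\right]v \\
& + \left[33m^2n^2 + 922mn + 2960 + \zeta(3)(480mn + 2112)\right]\frac{u^4}{432} \\
& - \Bigl[4\bigl(79mn + 1318 + 768\zeta(3)\bigr)u^3 - \bigl(555mn - 460(m+n) + 6836 + 4032\zeta(3)\bigr)u^2v \\
& \qquad + 2\bigl(213mn - 358(m+n) + 1933 + 960\zeta(3)\bigr)uv^2 \\
& \qquad - \bigl(121mn - 309(m+n) + 817 + 216\zeta(3)\bigr)v^3\Bigr]\frac{(m-1)(n-1)v}{864} \\
& + n^3\Bigl[[a_1 - a_2 - a_3 - a_4 - a_5 - a_6]u^5 + a_2u^4v + a_3u^3v^2 \\
& \qquad + a_4u^2v^3 + a_5uv^4 + a_6v^5\Bigr] + O\left(u^6; \frac{1}{N^2}\right)
\end{aligned}
\tag{2.3}
\]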
and
\[
\begin{aligned}
\beta_v(u,v) ={}& \frac{1}{2}(d-4)v + 2uv + \frac{1}{6}(m+n-8)v^2 - \frac{1}{18}(5mn+82)u^2v \\
& + \frac{1}{9}\bigl(5mn - 11(m+n) + 53\bigr)uv^2 - \frac{1}{36}\bigl(13mn - 35(m+n) + 99\bigr)v^3 \\
& + \Bigl[52m^2n^2 - 57mn(m+n) - 2206mn - 111\bigl(m^2+n^2\bigr) + 4291(m+n) - 8084 \\
& \qquad - \zeta(3)\bigl(1416mn - 3216(m+n) + 7392\bigr)\Bigr]\frac{v^4}{864} \\
& - \Bigl[39m^2n^2 - 35mn(m+n) - 1302mn - 36\bigl(m^2+n^2\bigr) + 2401(m+n) \\
& \qquad - 5725 - \zeta(3)\bigl(768mn - 1824(m+n) + 4896\bigr)\Bigr]\frac{v^3u}{216} \\
& + \Bigl[78m^2n^2 - 35mn(m+n) - 2114mn + 3182(m+n) - 12520 \\
& \qquad - \zeta(3)\bigl(1152mn - 2304(m+n) + 10368\bigr)\Bigr]\frac{u^2v^2}{432} \\
& - \Bigl[13m^2n^2 - 368mn - 3284 - \zeta(3)(192mn + 2688)\Bigr]\frac{u^3v}{216} \\
& + N^3\Bigl[(b_2 - b_3 - b_4 - b_5 - b_6)u^4v + b_3u^3v^2 + b_4u^2v^3 + b_5uv^4 + b_6v^5\Bigr] \\
& + O\left(u^6; \frac{1}{N^2}\right),
\end{aligned}
\tag{2.4}
\]
where we have rescaled the coupling constants by a numerical factor to ensure the
expressions are in the correct format for comparing with the Heisenberg large N value
for ω and the ones we compute here. Also we have included the d-dependent terms as we
are interested in the fixed point structure in d-dimensions. To assist with determining new
information on the four loop structure of both β-functions at O(1/N) we have introduced
parameters, {ai } and {bi }, for the coefficients of the possible terms.
By examining the solutions to βu = 0 and βv = 0, [8,11], several fixed points emerge.
First, there are the two obvious ones of the Gaussian fixed point, uc = 0, vc = 0, and the
Heisenberg fixed point, uc ≠ 0, vc = 0. For the latter point setting v = 0 and m = 1 in
(2.3) one recovers the usual O(N) symmetric φ^4 theory whose β-function is known at five
loops in MS in four dimensions, [2]. Indeed in our choice of parametrization of βv (u, v)
at four loops we used this fact to restrict the function to be proportional to v. These two
fixed points clearly lie on the axes of the (u, v) coupling plane. However, for a range of
values of N and m there are two other fixed points which both have uc ≠ 0 and vc ≠ 0.
One is known as CS and the other AU. The ultraviolet renormalization group flow of the
four fixed points in the (u, v) plane is shown graphically in Fig. 1. The range of values for
N and m for which such a renormalization group flow is present has been detailed in [8,11],
for example. However, for the purposes of the large N calculation we will require the
values of uc and vc to several orders in ε and powers of 1/N where we take the convention
d = 4 − 2ε. In [8,11] the explicit functions of N and m were presented at two loops. The
full expressions at three loops can also be derived but are large, [11], and the full form is
not necessary for our purposes.
Fig. 1. Renormalization group flow on the (u, v) plane illustrating the Gaussian (G), Heisenberg (H), stable chiral
(CS) and unstable antichiral (AU) fixed points.
Indeed if we write
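\[
u_c = \sum_{r=1}^{\infty}\left(\frac{u_{r1}}{N} + \frac{u_{r2}}{N^2}\right)\epsilon^r + O\left(\frac{1}{N^3}\right), \qquad
v_c = \sum_{r=1}^{\infty}\left(\frac{v_{r1}}{N} + \frac{v_{r2}}{N^2}\right)\epsilon^r + O\left(\frac{1}{N^3}\right)
\tag{2.5}
\]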
for the 1/N expansion of the critical couplings in powers of ε the four fixed points are
determined to O(ε^4) and O(1/N^2) as follows. For the Gaussian fixed point u_{ri} = v_{ri} = 0.
At the Heisenberg fixed point u_{11} = 6/m, u_{r1} = 0 for r ≠ 1, u_{12} = −48/m^2,
u_{22} = 108/m^2, u_{32} = −99/m^2, u_{42} = −7776(a1 − a2 − a3 − a4 − a5 − a6)/m^5 and
v_{ri} = 0. At the stable fixed point, CS,
These agree to two loops with the expressions given in [8,11]. As we will be computing
the critical exponents ω which relate to the critical slope of the β-functions in large N we
can use these values to determine the O(1/N) form of the critical exponents. As we are
working with a two coupling model the stability exponents of each fixed point are related
to the eigenvalues, λI , of the matrix of derivatives, Ω(u, v), evaluated at the appropriate
fixed point where
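\[
\Omega(u,v) = \begin{pmatrix}
\dfrac{\partial\beta_u(u,v)}{\partial u} & \dfrac{\partial\beta_u(u,v)}{\partial v} \\[2mm]
\dfrac{\partial\beta_v(u,v)}{\partial u} & \dfrac{\partial\beta_v(u,v)}{\partial v}
\end{pmatrix}.
\tag{2.8}
\]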
For the Gaussian case this is trivial and will not concern us here. For the Heisenberg fixed
point Ω(u, v) becomes triangular because βv (u, v) has no terms involving only u at any
order which implies
\[
\left.\frac{\partial\beta_v(u,v)}{\partial u}\right|^{\mathrm{Heis}}_{u_c,\,v_c} = 0.
\tag{2.9}
\]
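The triangular structure in (2.9) can be illustrated numerically. The sketch below is not part of the original calculation: it keeps only the one loop terms of (2.3) and (2.4) as they read in the extracted text, sets d = 4 − 2ε, and approximates Ω(u, v) of (2.8) by central differences. The function names and the sample values of m, n and ε are illustrative assumptions.

```python
def beta_u(u, v, m, n, eps):
    # One loop truncation of (2.3); (d - 4)/2 = -eps in the convention d = 4 - 2*eps.
    return -eps * u + (m * n + 8) / 6 * u**2 - (m - 1) * (n - 1) / 3 * v * (u - v / 2)


def beta_v(u, v, m, n, eps):
    # One loop truncation of (2.4); every term carries at least one power of v.
    return -eps * v + 2 * u * v + (m + n - 8) / 6 * v**2


def stability_matrix(u, v, m, n, eps, h=1e-6):
    """Central-difference approximation to the matrix Omega(u, v) of (2.8)."""
    def diff(f, du, dv):
        return (f(u + du, v + dv, m, n, eps) - f(u - du, v - dv, m, n, eps)) / (2 * h)
    return [[diff(beta_u, h, 0.0), diff(beta_u, 0.0, h)],
            [diff(beta_v, h, 0.0), diff(beta_v, 0.0, h)]]


# Illustrative values (assumptions, not taken from the paper).
m, n, eps = 2, 50, 0.1
u_heis = 6 * eps / (m * n + 8)      # one loop Heisenberg fixed point, v_c = 0
omega = stability_matrix(u_heis, 0.0, m, n, eps)
for row in omega:
    print(row)
```

Because the lower-left entry vanishes, the eigenvalues are the diagonal entries. In this truncation the upper-left entry equals ε, which is (4 − d)/2 in the convention used here.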
1 It is worth stressing that our convention, d = 4 − 2ε, and the form we take for the β-functions, (2.3) and
(2.4), implies that for a stable direction the eigenexponent is (4 − d)/2 + O(1/N ). This differs from the standard
result for the stability eigenexponent of (4 − d) + O(1/N ) by a factor of 2 which ought to be taken into account
when comparing with other calculations. (See, for example, [2].)
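The Heisenberg coefficients u_{11}, u_{12}, u_{22} and u_{32} quoted after (2.5) can be cross-checked against (2.3). The following sketch is an illustration under stated assumptions, not the method of the paper: it sets v = 0 and n = N in the three loop part of (2.3) as read from the extracted text, sets the unknown four loop parameters a_i to zero, and solves β_u = 0 by Newton iteration at large N. All function names are hypothetical.

```python
ZETA3 = 1.2020569031595943  # numerical value of zeta(3)


def three_loop_coefficient(m, N):
    # Coefficient of u^4 in beta_u(u, 0), read from (2.3) with n = N.
    mn = m * N
    return (33 * mn**2 + 922 * mn + 2960 + ZETA3 * (480 * mn + 2112)) / 432


def reduced_beta_u(u, m, N, eps):
    """beta_u(u, 0)/u through three loops with d = 4 - 2*eps (a_i set to zero)."""
    mn = m * N
    return (-eps + (mn + 8) / 6 * u - (3 * mn + 14) / 6 * u**2
            + three_loop_coefficient(m, N) * u**3)


def reduced_beta_u_prime(u, m, N):
    # Derivative of reduced_beta_u with respect to u.
    mn = m * N
    return ((mn + 8) / 6 - (3 * mn + 14) / 3 * u
            + 3 * three_loop_coefficient(m, N) * u**2)


def heisenberg_fixed_point(m, N, eps, iterations=20):
    """Newton iteration started from the leading value 6*eps/(m*N)."""
    u = 6 * eps / (m * N)
    for _ in range(iterations):
        u -= reduced_beta_u(u, m, N, eps) / reduced_beta_u_prime(u, m, N)
    return u


m, N, eps = 2, 10**6, 0.1
u_num = heisenberg_fixed_point(m, N, eps)
# Strip off the O(1/N) term and rescale to expose the O(1/N^2) coefficient.
second_order = N**2 * (u_num - 6 * eps / (m * N))
predicted = (-48 * eps + 108 * eps**2 - 99 * eps**3) / m**2
print(second_order, predicted)
```

The two printed numbers should agree up to corrections of relative size 1/N, which is the expected accuracy of a check truncated at O(1/N^2).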
− 4 1944(m + 1)4 a1 + 972(m − 1)(m + 1)4 b2
4 5
− 2m + m − 13 (m + 2)(m − 1)
2
+O
(m + 1)5
1
+O ,
N2
1 (13m2 + 26m + 25)
2
λCS
− =
− 6(m + 1)
−
N (m + 1)
(3m4 + 12m3 + 82m2 + 116m − 5) 3
+
2(m + 1)3
+ 4 324(2m − 1)(m − 1)(m + 1)4 a1
+ 3m2 − 3m + 1 (m − 1)b5
4
+ (2m − 1) 2m2 − 2m + 1 b6 3
m
5 1
+O
+O ,
N2
(m − 1)(m + 2) 3 3 4 1
λAU
− = −
+ 6
− 13
2
+
+ O
+ O , (2.11)
mN 2 N2
where the sign of the O(ε) terms relates to the stability property of the fixed point
when viewed from the ultraviolet renormalization group flow of Fig. 1. For the exponents
corresponding to the stable direction we have included the O(ε^4) terms which depend on
the unknown parameters of the O(1/N) four loop β-functions of (2.3) and (2.4) and which
will be constrained by our O(1/N) critical exponents.
3. Large N formalism
and at AU we have
\[
z_1 = 0, \qquad y_1 = -\frac{2\Gamma(2\mu-2)}{\Gamma(2-\mu)\Gamma(\mu-2)}.
\tag{3.7}
\]
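Moreover, for completeness the exponent η at O(1/N) is given by
\[
\eta = \sum_{i=1}^{\infty}\frac{\eta_i}{N^i},
\tag{3.8}
\]
where
\[
\eta_1^{\mathrm{CS}} = -\frac{2(m+1)\Gamma(2\mu-2)}{\Gamma(\mu+1)\Gamma(\mu-1)\Gamma(\mu-2)\Gamma(2-\mu)}, \qquad
\eta_1^{\mathrm{AU}} = -\frac{2(m-1)(m+2)\Gamma(2\mu-2)}{m\,\Gamma(\mu+1)\Gamma(\mu-1)\Gamma(\mu-2)\Gamma(2-\mu)}.
\tag{3.9}
\]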
Any subsequent exponent at either fixed point will be expressed in terms of their respective
value for η1 .
Since the exponents ωI relate to corrections to scaling then to compute them one
considers corrections to the asymptotic scaling forms (3.2), [23,24,26]. In coordinate space
we take
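\[
\phi(x) \sim \frac{A}{(x^2)^{\alpha}}\left[1 + A'\left(x^2\right)^{\omega}\right], \qquad
\sigma(x) \sim \frac{B}{(x^2)^{\beta}}\left[1 + B'\left(x^2\right)^{\omega}\right], \qquad
\sigma_T(x) \sim \frac{C}{(x^2)^{\gamma}}\left[1 + C'\left(x^2\right)^{\omega}\right]
\tag{3.10}
\]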
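where A′, B′ and C′ are the x-independent correction to scaling amplitudes whose
values are not important here. In addition to (3.10) one requires the scaling form of the
inverse propagators which are determined by inverting the Fourier transform of (3.10) in
momentum space. Thus
\[
\begin{aligned}
\phi^{-1}(x) &\sim \frac{p(\alpha)}{A\,(x^2)^{2\mu-\alpha}}\left[1 - q(\alpha,\omega)A'\left(x^2\right)^{\omega}\right], \\
\sigma^{-1}(x) &\sim \frac{p(\beta)}{B\,(x^2)^{2\mu-\beta}}\left[1 - q(\beta,\omega)B'\left(x^2\right)^{\omega}\right], \\
\sigma_T^{-1}(x) &\sim \frac{p(\gamma)}{C\,(x^2)^{2\mu-\gamma}}\left[1 - q(\gamma,\omega)C'\left(x^2\right)^{\omega}\right],
\end{aligned}
\tag{3.11}
\]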
where the functions p(x) and q(x, y) are defined by
\[
p(x) = \frac{a(x-\mu)}{a(x)}, \qquad q(x,y) = \frac{a(x-y)\,a(x+y-\mu)}{a(x)\,a(x-\mu)}
\tag{3.12}
\]
with a(x) = Γ(µ − x)/Γ(x). Further, in our notation each exponent ωI in the stable
direction will have the 1/N expansion
\[
\omega = (\mu - 2) + \sum_{i=1}^{\infty}\frac{\omega_i}{N^i}
\tag{3.13}
\]
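The functions of (3.12) are simple enough to evaluate directly, which helps when checking individual entries of the consistency equations. The following is a minimal sketch using only the Python standard library. Passing µ as an explicit argument is a choice made here for illustration, and the sample values of µ, x and y are arbitrary.

```python
from math import gamma


def a(x, mu):
    """a(x) = Gamma(mu - x)/Gamma(x); math.gamma raises ValueError at the poles."""
    return gamma(mu - x) / gamma(x)


def p(x, mu):
    """p(x) of (3.12)."""
    return a(x - mu, mu) / a(x, mu)


def q(x, y, mu):
    """q(x, y) of (3.12)."""
    return a(x - y, mu) * a(x + y - mu, mu) / (a(x, mu) * a(x - mu, mu))


# Arbitrary sample point, chosen to avoid the poles of the Gamma function.
mu, x, y = 1.3, 0.4, 0.25
print(a(x, mu) * a(mu - x, mu))       # a(x) a(mu - x) = 1 by definition
print(q(x, 0.0, mu))                  # q(x, 0) = 1
print(q(x, y, mu), q(x, mu - y, mu))  # q is symmetric under y -> mu - y
```

The three printed identities follow directly from the definition of a(x) and serve only as sanity checks of the implementation.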
Fig. 3. Basic topologies for the corrections to the σ and T ab Schwinger–Dyson equations to determine ω at
O(1/N ).
the consistency equation. For instance, whilst the product q(α, ω)q(β, ω)q(γ, ω) in (3.15)
will be O(1/N), this equation, (3.15), would give an incorrect value for the exponents
since the contributions in (3.15) from σ^{−1} and σ_T^{−1} are of the same order in 1/N as those
from the two and three loop correction graphs of Fig. 3 which we had naively omitted.
Thus to determine ω1 correctly one must include the contributions from these diagrams to
the B and C parts of the Schwinger–Dyson representation in the asymptotic approach to
the fixed points. Hence, the σ and T^{ab} equations in (3.14) are modified to
ω 1 ω
0 = p(β) 1 − q(β, ω) x 2 B + Nmz 1 + 2 x 2 A
2
1 1
+ Nmz Π1 B + (m + 2)(m − 1)NyzΠ1 C
2
2 4
1 ω
+ N 2 m2 z3 Π2 B + (m + 2)(m − 1)N 2 y 2 zΠ2 C x 2 ,
2
2 ω 1 ω
0 = p(γ ) 1 − q(γ , ω) x C + Ny 1 + 2 x 2 A
2
1 (m − 2) 2
+ NyzΠ1 B + Ny Π1 C + N 2 y 2 zΠ2 B
2 4m
(m − 2)(m + 4) 2 3 ω
+ N 2 y 2 zΠ2 C + N y Π2 C x 2 . (3.16)
4m
In these equations to ensure the correct contribution to the ω Schwinger–Dyson equation
after decoupling one correction term, (x 2 )ω , is on one internal σ or T ab line. There are
corrections to the φ lines but these only contribute to ω2 after examining where they appear
in the consistency determinant. This feature of having to include higher order graphs to
obtain the correct ω1 is not peculiar to the CS fixed point as it already occurs at the
Heisenberg point, [26]. With these additional diagrams the correct consistency equation
is given by setting the determinant of the matrix
1 + q(α, ω)z + (m−1)(m+2) y z (m−1)(m+2)
y
2m 2m
2 q(β, ω) + zΠ1 + 2mN z2 Π2 (m−1)(m+2)
y(Π1 + 2NyΠ2 )
2m
2 z(Π1 + 2NyΠ2 ) q(γ , ω) + (m−2)(m+4) Ny 2 Π2
2m
+ (m−2)
2m yΠ1 + 2NyzΠ2
(3.17)
to zero.
Therefore, to solve for the respective ω’s from (3.17) the values of these additional
diagrams must be computed. As the graphs are similar to those used in the O(N) φ 4
calculation of ω if the group theory of each graph is suppressed we merely quote the
d-dimensional values for the respective two and three loop graphs of Fig. 3. They are,
ignoring symmetry factors,
2
Π1 = ν(2, µ − 1, µ − 1) ,
2
Π2 = ν(2, µ − 1, µ − 1) ν(1, 2, 2µ − 3)ν(4 − µ, µ − 1, 2µ − 3), (3.18)
where ν(x, y, z) = a(x)a(y)a(z). These were calculated using the method of uniqueness
of [28] which was extended from three dimensions to d-dimensions in [23,24]. Further,
since the O(1/N) values of q(β, ω) and q(γ , ω) now need to be included it transpires that
the O(1/N) expression for the vertex anomalous dimensions, χ and χT , are required. This
is due to
\[
q(\beta,\omega) = \frac{a(4-\mu)\,\Gamma(\mu)\,[\omega_1 - \eta_1 - \chi_1]}{a(2)\,a(2-\mu)\,N} + O\left(\frac{1}{N^2}\right).
\tag{3.19}
\]
We have computed each vertex anomalous dimension at each fixed point and find
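\[
\begin{aligned}
\chi_1^{\mathrm{CS}} &= -\frac{\mu(4\mu-5)\,\eta_1^{\mathrm{CS}}}{(\mu-2)}, \\
\chi_{T,1}^{\mathrm{CS}} &= -\frac{\mu[(2\mu-3)m + (4\mu-5)]\,\eta_1^{\mathrm{CS}}}{(\mu-2)(m+1)}, \\
\chi_{T,1}^{\mathrm{AU}} &= -\frac{\mu(m-2)[(m+4)(2\mu-3) + 1]\,\eta_1^{\mathrm{AU}}}{(m-1)(m+2)(\mu-2)}.
\end{aligned}
\tag{3.20}
\]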
The expression for χ1CS is formally the same as that for the O(1/N) vertex dimension at
the Heisenberg fixed point. For each exponent we have computed the value by applying
the technique of [29] of large N critical point renormalization of 3-point functions to each
vertex at the appropriate fixed point. For χ we have checked that the value agrees with that
given by the scaling law which emerges from the theory which is in the same universality
class as (2.2). This is believed to be the O(N) × O(m) two-dimensional nonlinear sigma
model where instead of the term quadratic in σ of (2.2) one has a term linear in σ where its
coupling constant is related to the critical exponent ν. However, in such a model it is clear
from group theory that there can be no linear term in T ab and therefore both expressions
for χT cannot be derived from a scaling law but only direct calculation.
With these values we can now determine the solution for the consistency equation for
each fixed point. For CS since the matrix is 3 × 3 two values for ω emerge. These are
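\[
\begin{aligned}
\omega_{\pm}^{\mathrm{CS}} ={}& (\mu - 2) + \frac{(2\mu-1)\,\eta_1^{\mathrm{CS}}}{2(m+1)(\mu-2)N} \\
& \times \Bigl[ m(\mu-1)(\mu-4) + 2\mu^2 - 7\mu + 4 \\
& \qquad \pm \mu\bigl[(m^2-1)(\mu-1)^2 + 2(m-1)(2\mu-3)(\mu-1) + (5\mu-8)^2\bigr]^{1/2} \Bigr] \\
& + O\left(\frac{1}{N^2}\right),
\end{aligned}
\tag{3.21}
\]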
where the ± subscript refers to the sign in front of the discriminant. Both are perfectly
acceptable solutions since one corresponds to one stable direction at CS and the other to
the eigenexponent from the second direction.
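As a consistency check, the d-dimensional expression (3.21) can be evaluated at µ = 3/2 and compared with the three-dimensional values 4(m + 4)/(3π²) and 16(m + 1)/(3π²) for the O(1/N) coefficients at CS. The sketch below assumes the reading of (3.21) in which the factor µ multiplies the square root from outside, together with η1^CS from (3.9). The function names are illustrative.

```python
from math import gamma, pi, sqrt


def eta1_cs(m, mu):
    """eta_1 at the CS fixed point, from (3.9)."""
    return (-2 * (m + 1) * gamma(2 * mu - 2)
            / (gamma(mu + 1) * gamma(mu - 1) * gamma(mu - 2) * gamma(2 - mu)))


def omega1_cs(m, mu, sign):
    """O(1/N) coefficient of omega_{+/-}^{CS} as read from (3.21); sign is +1 or -1."""
    radicand = ((m * m - 1) * (mu - 1)**2
                + 2 * (m - 1) * (2 * mu - 3) * (mu - 1)
                + (5 * mu - 8)**2)
    bracket = (m * (mu - 1) * (mu - 4) + 2 * mu**2 - 7 * mu + 4
               + sign * mu * sqrt(radicand))
    return (2 * mu - 1) * eta1_cs(m, mu) / (2 * (m + 1) * (mu - 2)) * bracket


m, mu = 2, 1.5   # three dimensions corresponds to mu = d/2 = 3/2
print(omega1_cs(m, mu, +1), 4 * (m + 4) / (3 * pi**2))
print(omega1_cs(m, mu, -1), 16 * (m + 1) / (3 * pi**2))
```

At µ = 3/2 the radicand reduces to m²/4, so the square root collapses to m/2 and the three-dimensional coefficients are rational multiples of 1/π².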
For AU the derivation of the consistency equation follows the same pattern as that for
CS in that two and three loop graphs have also to be included due to the large N counting.
The consistency equation itself can be deduced from (3.17) by deleting the second row and
column from the matrix and setting z = 0 in the remaining entries. Since only the T ab field
which agrees with the explicit four loop φ 4 computation when one sets m = 1 and
determines the O(1/N) coefficient of the u5 term of βu (u, v). From the CS fixed point
we have
\[
2a_1 + (m-1)b_2 = -\frac{(3m+7)(m-3)}{15552}
\tag{3.26}
\]
and
(m + 1) a2 + 2a3 + 3a4 + 4a5 + 5a6 − b3 − 2b4 − 3b5 − 4b6
+ (2m − 1)(m − 1)a1 − 2m(m − 2)b2
ζ (3) (3m2 + 6m + 19)
= + . (3.27)
108(m + 1)3 5184(m + 1)5
The final constraint arises from the stable direction at the AU fixed point which gives
\[
\begin{aligned}
& (m-1)^4 b_2 + (m-1)^3 b_3 + (2m-1)(m-1)^2 b_4 + (3m^2 - 3m + 1)(m-1) b_5 \\
& \quad + (2m-1)(2m^2 - 2m + 1) b_6 = -\frac{[3m^2 - 5m - 54]}{15552}.
\end{aligned}
\tag{3.28}
\]
One can now deduce the values for the exponents in the four stable directions in three
dimensions. These are, with our conventions,
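\[
\begin{aligned}
\omega_+^{\mathrm{CS}} &= -\frac{1}{2} + \frac{4(m+4)}{3\pi^2 N} + O\left(\frac{1}{N^2}\right), \\
\omega_-^{\mathrm{CS}} &= -\frac{1}{2} + \frac{16(m+1)}{3\pi^2 N} + O\left(\frac{1}{N^2}\right), \\
\omega_+^{\mathrm{AU}} &= -\frac{1}{2} + \frac{4[m^2 + 4m - 8]}{3\pi^2 mN} + O\left(\frac{1}{N^2}\right), \\
\omega_+^{\mathrm{Heis}} &= -\frac{1}{2} + \frac{32}{3\pi^2 mN} + O\left(\frac{1}{N^2}\right).
\end{aligned}
\tag{3.29}
\]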
For CS the corrections both have the same sign and interestingly neither involves a square
root which appears in the d-dimensional expression. For the specific case of m = 2 we
have from (3.29)
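\[
\begin{aligned}
\left.\omega_+^{\mathrm{CS}}\right|_{m=2} &= -\frac{1}{2} + \frac{8}{\pi^2 N} + O\left(\frac{1}{N^2}\right), \\
\left.\omega_-^{\mathrm{CS}}\right|_{m=2} &= -\frac{1}{2} + \frac{16}{\pi^2 N} + O\left(\frac{1}{N^2}\right), \\
\left.\omega_+^{\mathrm{AU}}\right|_{m=2} &= -\frac{1}{2} + \frac{8}{3\pi^2 N} + O\left(\frac{1}{N^2}\right), \\
\left.\omega_+^{\mathrm{Heis}}\right|_{m=2} &= -\frac{1}{2} + \frac{16}{3\pi^2 N} + O\left(\frac{1}{N^2}\right).
\end{aligned}
\tag{3.30}
\]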
With these values we can comment on the possible breakdown of stability in this model. Our
computations so far have relied on the fact that the stability picture for the fixed points
represented in Fig. 1 is valid for the full range of N in the large N expansion. However, it
has been suggested that for certain values of N this scenario may be different and that
the underlying assumption of the existence of a second order phase transition, around
which the large N critical point formalism is built, could break down. Indeed there is
some controversy in this model in three dimensions about the existence of CS and whether
the phase transition is first or second order for relatively low values of N. Moreover, the
precise value of N where the order changes has not been determined consistently from
different methods. (A recent review is given in [17].) For instance, a value has been found
for this critical value of N by using (d = 4 − 2ε)-dimensional perturbation theory and
it was extended to three dimensions using standard resummation techniques, [30], giving
Nc = 3.39. By contrast, Monte Carlo methods and another ε-expansion extraction have
suggested Nc < 2, [6–8]. With the large N corrections (3.30) we can naively examine the
range of values of N for which either CS stability exponent changes sign. For ω_−^{CS}|_{m=2}
this will occur when Nc = 3.24 whilst ω_+^{CS}|_{m=2} changes sign when Nc = 1.62. The
former would suggest a critical N in a similar range to that of [30]. However, these
remarks ought to be qualified with various observations. First, we have only computed
the O(1/N) correction to stability where N is assumed to be large. Therefore, one has to
ask whether the approximation will still be valid for such a low value of N . Second, the
nature of the large N expansion is a reordering of perturbation theory such that a certain
class of diagrams are summed first. Therefore, if one could compute to all orders one would
reproduce ordinary perturbation theory and so obtaining a value for Nc in three dimensions
which is not inconsistent with the resummed value determined from several loop orders in
ordinary perturbation theory would seem only to reinforce that particular value. In other
words if nonperturbative effects become significant at CS for low values of N to affect the
precise location of Nc these will have been omitted in perturbation theory. Third, for our
value of Nc we have naively assumed the large N series is convergent and therefore that
our simple assumption that when the O(1/N) correction exceeds 1/2 the character of the
fixed point changes is valid. Only a higher order calculation would resolve this.
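The naive estimates of Nc quoted above follow from setting the truncated exponent −1/2 + ω1/N to zero, so that Nc = 2ω1. A short sketch of this arithmetic for the m = 2 values of (3.30) is given below. The dictionary keys and the function name are labels chosen here for illustration, and the result inherits all of the caveats listed in the text.

```python
from math import pi

# O(1/N) coefficients at m = 2 in three dimensions, taken from (3.30).
OMEGA1_M2 = {
    "CS+": 8 / pi**2,
    "CS-": 16 / pi**2,
    "AU+": 8 / (3 * pi**2),
    "Heis+": 16 / (3 * pi**2),
}


def naive_critical_N(omega1, omega0=-0.5):
    """Value of N at which omega0 + omega1/N changes sign (O(1/N) truncation only)."""
    return -omega1 / omega0


for label, omega1 in OMEGA1_M2.items():
    print(label, round(naive_critical_N(omega1), 2))
```

For the two CS exponents this gives 32/π² ≈ 3.24 and 16/π² ≈ 1.62, the values quoted in the text.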
4. Discussion
We have computed the correction to scaling exponents ω in all the stable directions
of the Landau–Ginzburg–Wilson model with O(N) × O(m) symmetry at O(1/N) in
d-dimensions. This allows one to extract information in all the available large N exponents
of [11] at O(1/N) in relation to the four-dimensional theory in the same way that the two-
dimensional critical slope information contained in the exponent ν does for the underlying
two-dimensional theory. It is also worth stressing again, [11], that from the point of view
of the large N formalism a consistent picture emerges in terms of the active fields of the
theory formulated in terms of the auxiliary fields σ and T ab of (2.2). At the Heisenberg and
AU fixed points only σ and T ab , respectively, propagate which corresponds in the large
N formalism developed here to one stable direction and hence only one eigenexponent
emerged. However, at the only fully stable fixed point both fields are relevant leading
to two independent stability exponents. This is a natural way to picture this particular
model which we assume persists to higher orders in large N . Moreover, our consistency
with three loop MS perturbation theory provides an important internal cross check on the
values of quantities, such as the vertex anomalous dimensions, which we had to compute en
route to our expressions for ωi . Further, the information contained within the expressions
(3.21), (3.22) and (3.23) will provide important checks on any future explicit four loop MS
perturbative calculations which would improve the accuracy of the numerical estimates
deduced from (2.3), (2.4) and other renormalization group functions. Such four loop
calculations are certainly viable since five loop results are available in MS in ordinary
O(N) φ 4 theory. For example, the integration routines for the four loop Feynman diagrams
have already been constructed. However, one can also attack this problem from the large
N point of view. For instance, we have demonstrated the elegance of the formalism to
produce the critical eigenexponents at O(1/N). However, this machinery has already been
extended in [26] to compute ω2 in the Heisenberg model. Therefore, we would expect
there to be no serious obstacles to extending the present computation to find ω2CS and ω2AU .
For example, the values of the underlying three, four, five and six loop Feynman diagrams
which are analogous to the corrections of Fig. 3 for the O(1/N 2 ) calculation have been
determined. We hope to return to this in a future article.
References
Received 16 May 2002; received in revised form 29 August 2002; accepted 4 September 2002
Abstract
We study the Euclidean two-point function of Fermi fields in the SU(2)-Thirring model on the
whole distance (energy) scale. We perform perturbative and renormalization group analyses to obtain
the short-distance asymptotics, and numerically evaluate the long-distance behavior by using the
form factor expansion. Our results illustrate the use of bosonization and conformal perturbation
theory in the renormalization group analysis of a fermionic theory, and numerically confirm the
validity of the form factor expansion in the case of the SU(2)-Thirring model.
2002 Elsevier Science B.V. All rights reserved.
PACS: 11.10.Kk
Keywords: Integrable quantum field theory; Correlation function; Conformal perturbation theory;
Renormalization group; SU(2)-Thirring model
1. Introduction
* Corresponding author.
E-mail address: doyon@physics.rutgers.edu (B. Doyon).
0550-3213/02/$ – see front matter 2002 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 ( 0 2 ) 0 0 7 9 5 - 2
452 B. Doyon, S. Lukyanov / Nuclear Physics B 644 [FS] (2002) 451–475
1 The definition of the coupling constant g is not conventional, but convenient for our purposes.
Our analysis of the fermion correlator is based, on the one hand, on recently
proposed expressions for the form factors of soliton-creating operators (or topologically
charged fields) in the sine-Gordon model [13],2 and on the other hand, on a conformal
perturbative analysis of two-point correlation functions involving such fields. The form
factor expressions can be used to obtain the long-distance behavior of these two-point
functions, whereas conformal perturbation theory gives their short-distance expansion
([16]). The interest in some of these topological fields stems from their rôle in fermionic
theories. For instance, it is well-known that the sine-Gordon model is equivalent to the
massive Thirring model [17]. The components of the Thirring fermion field are then
associated with soliton-creating operators of topological charge ±1 and Lorentz spin
±1/2, and correlators of these operators in the sine-Gordon model are related to fermion
correlators in the massive Thirring model [18]. More interestingly, the sine-Gordon
theory is closely related to a model which is an integrable deformation of (1.1) [5].
This “deformed” (or anisotropic) SU(2)-Thirring model exhibits the so-called spin-charge
separation, which is translated by its representation in terms of two bosonic theories, one
for the charge part, one for the spin part. The spin part of the fermion field corresponds
to soliton-creating operators of topological charge ±1 and Lorentz spin ±1/4 in the
sine-Gordon model, and its charge part is related to similar operators in a free massless bosonic
theory.
Although form factor expansions and conformal perturbation theory are very effective
tools for the study of, respectively, the long-distance and the short-distance asymptotics of
Schwinger’s functions [16,19–21], one usually gets into trouble when trying to compare
both predictions in a region where they are expected to be accurate enough. Indeed,
in general, one has the freedom of choosing the overall multiplicative normalization in
the expansion arising from conformal perturbation theory as well as in the form factor
expansion, and there is no systematic way of relating both normalizations. For the case of
the soliton-creating operators, the constant relating both normalizations was conjectured
in [13]. It allows one to make unambiguous numerical predictions on the correlation
functions of soliton-creating fields on the whole distance scale using the combined
conformal perturbation theory and form factor data. We performed this calculation for
the case of the SU(2)-Thirring fermion.
The paper is organized as follows. In Section 2, we recall some standard results
concerning the anisotropic SU(2)-Thirring model and its relation to the sine-Gordon
theory. In Section 3, the short-distance behavior of correlators of the soliton-creating
operators is examined by means of conformal perturbation theory. Here we also perform a
Renormalization Group (RG) resummation of the perturbative expansion in the vicinity
of the Kosterlitz-Thouless point which corresponds to the SU(2) limit of the fermion
theory. In Section 4, the perturbative calculation is adapted to the momentum space fermion
Schwinger’s function; we give the two-point function in the SU(2)-Thirring model to third
order in the running coupling. This particular result was recently obtained by standard
perturbation theory in the modified Minimal Subtraction ($\overline{\mathrm{MS}}$) scheme [22] (calculations
2 Without taking normalization into consideration, some of such form factors were considered previously in
Refs. [14,15]
The SU(2)-invariant Thirring model admits an integrable generalization such that the
underlying SU(2) symmetry is explicitly broken down to U (1) ⊗ Z2 :
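\[
\mathcal{A}_{\mathrm{ATM}} = \int d^2x\,\left[\sum_{\sigma=\uparrow,\downarrow}\bar\Psi_\sigma\gamma_\mu\partial_\mu\Psi_\sigma
+ \frac{\pi g_\parallel}{8}\,J^3_\mu J^3_\mu + \frac{\pi g_\perp}{8}\left(J^1_\mu J^1_\mu + J^2_\mu J^2_\mu\right)\right],
\tag{2.1}
\]
where
\[
J^A_\mu = \bar\Psi\gamma_\mu\tau^A\Psi
\tag{2.2}
\]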
are vector currents (and, as before, τ^A are Pauli matrices). The model (2.1) is
renormalizable, and its coupling constants g∥, g⊥ should be understood as “running” ones. In
particular, in the RG-invariant domain g∥ ≥ |g⊥|, all RG trajectories originate from the line
g⊥ = 0 of UV stable fixed points, and (2.1) indeed defines a quantum field theory.3 Hence,
in this domain (which is the only one that we discuss here), each RG trajectory is uniquely
characterized by the limiting value
\[
\rho = \frac{1}{2}\lim_{\ell\to 0} g_\parallel(\ell)
\tag{2.3}
\]
of the running coupling g∥(ℓ) at extremely short distances (ℓ stands for the length scale),
i.e., the theory (2.1) depends only on the dimensionless parameter ρ, besides the mass
scale M appearing through dimensional transmutation.
As is well-known (see, e.g., [3,5]), the model (2.1) can be bosonized in terms of the
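sine-Gordon field ϕ(x),
\[
\mathcal{A}_{\mathrm{sG}} = \int d^2x\,\left[\frac{1}{16\pi}(\partial_\nu\varphi)^2 - 2\mu\cos(\beta\varphi)\right],
\tag{2.4}
\]
with the coupling constant β in (2.4) related to ρ (2.3) by
\[
\beta^2 = \frac{1}{1+\rho},
\tag{2.5}
\]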
3 The Hamiltonians corresponding to opposite choices of the sign of g⊥ are unitary equivalent, so the sign of
this coupling does not affect the physical observables.
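and a free massless boson. Then the mass scale M is identified with the mass of the
sine-Gordon solitons, which is related to the parameter µ by [24]
\[
\mu = \frac{\Gamma\left(\frac{1}{1+\rho}\right)}{\pi\,\Gamma\left(\frac{\rho}{1+\rho}\right)}
\left[M\,\frac{\sqrt{\pi}\,\Gamma\left(\frac{1}{2} + \frac{1}{2\rho}\right)}{2\,\Gamma\left(\frac{1}{2\rho}\right)}\right]^{\frac{2\rho}{1+\rho}}.
\tag{2.6}
\]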
The precise operator relations between (2.1) and (2.3) can be found in [13]. In particular,
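for the two-point fermion correlator, the bosonization implies that
\[
\left\langle\Psi_\sigma(x)\bar\Psi_{\sigma'}(0)\right\rangle = \frac{\delta_{\sigma\sigma'}\,\gamma_\mu x^\mu}{2\pi|x|^{3/2}}\,F^{(1)}_{1/4}(r),
\tag{2.7}
\]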
where we use the notation F_ω^{(n)} (n = 1, ω = 1/4) for the real function which depends
only on the distance r = |x| (and implicitly on the mass scale M and the parameter ρ),
and which, in essence, coincides with the Euclidean correlator of non-local topologically
charged fields in the model (2.4):
\[
\left\langle O^{n}_{-\omega\beta}(x)\,O^{-n}_{\omega\beta}(0)\right\rangle = \left(e^{i\pi}\,\frac{\bar z}{z}\right)^{\omega n} F^{(n)}_{\omega}(r),
\tag{2.8}
\]
where z = x1 + ix2, z̄ = x1 − ix2. Again we refer the reader to the paper [13] for the precise
definition of the field O^n_a (a = ωβ). Here we note that it carries an integer topological
charge n, a scale dimension
\[
d = \frac{2\omega^2}{1+\rho} + \frac{n^2}{8}(1+\rho),
\tag{2.9}
\]
and a Lorentz spin ωn.
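For orientation, (2.5) and (2.9) can be evaluated for the spin part of the fermion, which has n = 1 and ω = 1/4. The sketch below uses exact rational arithmetic. The function names are illustrative, and the choice ρ = 0 (the limit in which the sine-Gordon perturbation becomes marginal) is made here only as a simple test point.

```python
from fractions import Fraction


def beta_squared(rho):
    """Sine-Gordon coupling beta^2 in terms of rho, from (2.5)."""
    return 1 / (1 + rho)


def scale_dimension(omega, n, rho):
    """Scale dimension d of the field O^n_{omega*beta}, from (2.9)."""
    return 2 * omega**2 / (1 + rho) + Fraction(n**2, 8) * (1 + rho)


def lorentz_spin(omega, n):
    """Lorentz spin omega*n of the same field."""
    return omega * n


omega, n = Fraction(1, 4), 1
for rho in (Fraction(0), Fraction(1, 2), Fraction(1)):
    print(rho, beta_squared(rho), scale_dimension(omega, n, rho), lorentz_spin(omega, n))
```

At ρ = 0 the dimension is d = 1/4. Combined with the prefactor of (2.7), a leading behavior F ∼ r^{−2d} then reproduces the free-fermion power law |x|^{−1} for the correlator.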
3. Short-distance expansion
We now turn to the analysis of the short-distance behavior of the correlator (2.8). In
general, one can examine this behavior via the operator product expansion, for instance:
\[
F^{(n)}_{\omega}(r) = C_I(r) + C_{\cos(\beta\varphi)}(r)\,\langle\cos(\beta\varphi)\rangle + \cdots.
\tag{3.1}
\]
The structure functions (C_I(r), C_{cos(βϕ)}(r), etc.) admit power series expansions in µ^2,
which can be obtained by using the standard rules of conformal perturbation theory [16,25],
whereas the vacuum expectation values of the associated operators are in general
non-analytical at µ = 0. In the perturbative treatment, we regard the sine-Gordon model (2.4)
as a Gaussian conformal field theory
\[
\mathcal{A}_{\mathrm{Gauss}} = \frac{1}{16\pi}\int d^2x\,(\partial_\nu\varphi)^2
\tag{3.2}
\]
perturbed by the relevant operator cos(βϕ). Notice that in the limit µ → 0, the non-local
topologically charged fields O^n_a can be expressed in terms of the right and left moving parts
Here we discuss the short-distance expansion of the correlator (2.8) for β^2 sufficiently
close to unity. For this purpose, it is convenient to use the notation
\[
\epsilon = 1 - \beta^2 \ll 1.
\tag{3.10}
\]
Our previous short-distance analysis suggests the following expansion for the structure
function C_I:
\[
C_I(r) = r^{-2d}\left\{1 + \sum_{k=1}^{\infty} c_k\left(\mu r^{2\epsilon}\right)^{2k}\right\},
\tag{3.11}
\]
where the coefficients c_k are given by certain 4k-fold Coulomb-type integrals. Evidently,
this expansion cannot be directly applied in the limit ϵ → 0, where the perturbation
cos(βϕ) of the Gaussian action (3.2) becomes marginal. However, being expressed as
a function of the scaling distance Mr, the structure function C_I(r) should admit the
following form:
Here the function Γg is supposed to have a regular power series expansion in terms of the
running coupling constants g∥,⊥ = g∥,⊥(r):
\[
\Gamma_g = \sum_{l,k=0}^{\infty}\gamma_{lk}\,g_\parallel^{\,l}\,g_\perp^{2k},
\tag{3.14}
\]
where γ_{lk} are constant coefficients. Notice that only even powers of the coupling g⊥
appear in this expansion (see footnote 3). In writing (3.13), we use the normalization
condition (3.7), and take into account that the UV limiting value of Γg coincides with
the scale dimension (2.9),
\[
\lim_{r\to 0}\Gamma_g = d.
\tag{3.15}
\]
We have also assumed that there is no resonance mixing of the operator O^n_{ωβ} with other
fields, so it is renormalized as a singlet. One can easily check that this is indeed the case
for the operators with |ω| < 1/2 + |n|/4.
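Condition (3.15) already encloses an important restriction on the series (3.14). Indeed,
using Eqs. (2.3) and (2.9) along with the condition that the line of UV stable fixed points
corresponds to g⊥ = 0, one obtains
\[
\Gamma_g = \Gamma^{(0)}(g_\parallel) + \Gamma^{(1)}(g_\parallel)\,g_\perp^2 + \Gamma^{(2)}(g_\parallel)\,g_\perp^4 + O\left(g_\perp^6\right),
\tag{3.16}
\]
where
\[
\Gamma^{(0)}(g_\parallel) = \frac{2\omega^2}{1 + \frac{g_\parallel}{2}} + \frac{n^2}{8}\left(1 + \frac{g_\parallel}{2}\right).
\]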
The values of the other coefficients γ_{l,k≥1} appearing in (3.14) essentially depend on
the choice of a renormalization scheme, i.e., on the precise specification of the running
coupling constants. The latter obey the RG equations
\[
r\,\frac{dg_\parallel}{dr} = \frac{g_\perp^2}{f_\parallel(g_\parallel, g_\perp)}, \qquad
r\,\frac{dg_\perp}{dr} = \frac{g_\parallel\,g_\perp}{f_\perp(g_\parallel, g_\perp)}.
\tag{3.17}
\]
Perturbatively, f∥(g∥, g⊥) and f⊥(g∥, g⊥) admit loop expansions as power series in g∥ and
g⊥. In this work, we will use the scheme introduced by Al.B. Zamolodchikov [24,29]. He
showed that under a suitable diffeomorphism in g∥ and g⊥, the functions f∥ and f⊥ can be
chosen to be equal to each other, and furthermore, to be equal to
\[
f_\parallel = f_\perp = 1 + \frac{g_\parallel}{2}.
\tag{3.18}
\]
With this choice for the β-function, the RG equations (3.17) can be integrated. To do this,
we note that this system of differential equations has a first integral, the numerical value of
which is determined through the condition (2.3),
\[
g_\parallel^2 - g_\perp^2 = (2\rho)^2.
\tag{3.19}
\]
Using (3.19), (3.10) and (2.5), Eqs. (3.17) are solved as
\[
g_\parallel = 2\rho\,\frac{1+q}{1-q}, \qquad g_\perp = \frac{4\rho\sqrt{q}}{1-q},
\tag{3.20}
\]
where
\[
q\left(\frac{1-q}{\rho}\right)^{-2\epsilon} = (r\Lambda)^{4\epsilon}.
\tag{3.21}
\]
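The statement that (3.20) and (3.21) solve the RG equations (3.17) with f∥ = f⊥ = 1 + g∥/2 can be checked numerically. The sketch below is an illustration, not part of the original derivation: it solves the implicit relation (3.21) for q by bisection, using ϵ = 1 − β² = ρ/(1 + ρ), and then compares a finite-difference derivative of g∥ with the right-hand side of (3.17). The function names and sample values of ρ and rΛ are assumptions.

```python
from math import exp, log, sqrt


def solve_q(r_lambda, rho, iterations=200):
    """Solve q*((1 - q)/rho)**(-2*eps) = (r*Lambda)**(4*eps) for 0 < q < 1 by bisection."""
    eps = rho / (1 + rho)
    target = 4 * eps * log(r_lambda)

    def residual(q):
        # Logarithm of (3.21); monotonically increasing in q on (0, 1).
        return log(q) - 2 * eps * log((1 - q) / rho) - target

    low, high = 1e-300, 1 - 1e-16
    for _ in range(iterations):
        mid = 0.5 * (low + high)
        if residual(mid) > 0:
            high = mid
        else:
            low = mid
    return 0.5 * (low + high)


def running_couplings(r_lambda, rho):
    """Running couplings (g_parallel, g_perp) of (3.20) at the scale r*Lambda."""
    q = solve_q(r_lambda, rho)
    return 2 * rho * (1 + q) / (1 - q), 4 * rho * sqrt(q) / (1 - q)


rho, r_lambda, h = 0.2, 0.1, 1e-5
g_par, g_perp = running_couplings(r_lambda, rho)

# First integral (3.19).
first_integral = g_par**2 - g_perp**2

# r dg_par/dr by a central difference in log r, compared with (3.17) and (3.18).
g_plus, _ = running_couplings(r_lambda * exp(h), rho)
g_minus, _ = running_couplings(r_lambda * exp(-h), rho)
lhs = (g_plus - g_minus) / (2 * h)
rhs = g_perp**2 / (1 + g_par / 2)
print(first_integral, (2 * rho)**2)
print(lhs, rhs)
```

Bisection is used because the logarithm of (3.21) is monotonic in q on (0, 1), so a root is bracketed for any rΛ within floating-point range.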
The normalization scale Λ is another integration constant of the system (3.17). It is of the
order of the physical mass scale and supposed to have a regular loop expansion,
\[
\Lambda = M\exp\left(\tau_0 + \tau_1\rho + \tau_2\rho^2 + \cdots\right).
\tag{3.22}
\]
It should be noted that the even coefficients τ0, τ2, . . . are essentially ambiguous and can
be chosen at will. A variation of these coefficients corresponds to a smooth redefinition of
the coupling constants which does not affect the β-function. By contrast, the odd constants
τ_{2k+1} are unambiguous and precisely specified once the form of the RG equations is fixed.
It is possible to show [24,29] that the odd constants vanish in Zamolodchikov's scheme:
τ_{2k+1} = 0 (k = 0, 1, . . .).
Once the coefficients τ2k in (3.22) are chosen, the running coupling constants are
completely specified, and all coefficients in the power series expansion (3.14) are
determined unambiguously. They can be explicitly calculated by comparing the conformal
perturbative result (3.5) with the form (3.13). From (3.13),
\[ \Gamma_g = -\frac{1}{2}\, r\,\partial_r \log(C_I) \]
and, as it follows from the general conformal perturbative expansion (3.11) and the
definition (3.21) of q, the function Γg can be expanded in powers of q. Explicitly, using
the conformal perturbative result (3.5),
\[ \Gamma_g = d - 2\nu\, \mu^2\, J_n\bigl(2\omega(1-\nu),\, 2\nu - 2\bigr) \left(\frac{\sqrt{\rho}}{\Lambda}\right)^{4\nu} q + O\bigl(q^2\bigr). \tag{3.23} \]
Moreover, the coefficients in this expansion are power series in ρ. For example, using
Eqs. (2.6) and (3.22), it is easy to show that
√ 2.
πµ ρ 1 2
= exp −2τ̄0 ρ + 2τ̄0 − ρ
. Λ 2
2 1 3
− 2τ2 + 2τ̄0 − ζ(3) − ρ + O ρ4 . (3.24)
3 2
Here and after, we set for convenience
\[ e^{\tau_0} = \sqrt{\frac{\pi}{8}}\; e^{\gamma_E + \bar\tau_0}, \tag{3.25} \]
where $\gamma_E = 0.5772\ldots$ is the Euler constant. The integral $J_n(2\omega(1-\nu), 2\nu-2)$ appearing
in (3.23) can also be expanded in powers of ρ, using $\nu = \rho/(1+\rho)$. In Appendix A,
we quote the first few terms in the expansion of $J_n(a, c)$ (3.6) around $c = -2$, which are
obtained through the use of (3.9). From this expansion, it is easy to obtain the expansion
of $J_n(2\omega(1-\nu), 2\nu-2)$ in powers of ρ. Then, one can compare the conformal perturbative
expansion of Γg in q and ρ (3.23) with the corresponding expansion (3.14) coming from
the RG analysis (where of course one should expand g and g⊥ 2 in q and ρ from (3.20)).
This determines the coefficients γl,1 for l = 0, 1, 2. If we want an expression valid to order
g 4 , we need one more coefficient: γ0,2 . In principle, it can be obtained from the expansion
in ρ of the coefficients c2 in the series (3.11). In Section 5, we describe a way to find γ0,2
without the cumbersome calculation beyond the lowest order in conformal perturbation
theory.
In order to simplify the form of the structure function (3.13), it is convenient, instead of
using the coefficients γl,k , to parametrize the first few terms of the power series expansions
Γ (1,2)(g ) (3.16) as:
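\[ \Gamma^{(1)}(g_\parallel) = -\frac{1}{1 + g_\parallel/2}\left[\frac{n^2}{32} - \frac{u_1}{2} + v_1\, g_\parallel + \Bigl(v_2 - \frac{3u_2}{2}\Bigr) g_\parallel^2 + O\bigl(g_\parallel^3\bigr)\right], \]
\[ \Gamma^{(2)}(g_\parallel) = -\frac{v_2}{2} + O(g_\parallel). \tag{3.26} \]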
The explicit values of the coefficients u1 , u2 , v1 and v2 in (3.26) are given in Appendix B.
Let us substitute (3.16) and (3.26) into Eq. (3.13). The RG flow Eq. (3.17) allow one to
evaluate the integral and to write the structure function in the form (3.12) with
2 ω2 −n2 (1−ρ 2 )/16
= (Mr)−4ω
(ren) 2 −n2 (1+ρ 2 )/4
CI g⊥
× e−u1 g −u2 g 1 + g⊥
3
2
(v1 + v2 g ) + O g 4 , (3.27)
and
√ n2 /2−2d 2ρu +(2ρ)3 u +···
Zn,ω = M 2d 2ρ+1 ρeτ0 ρ+τ2 ρ +···
3
e 1 2 . (3.28)
Notice that the transformation
does not affect the structure function CI (3.12) due to relation (3.19).
Our prime interest in this work is the correlation function (2.7). For n = 1 and ω = 1/4,
the relations obtained above lead to the following perturbative expansion for the two-point
fermion correlator in the anisotropic SU(2)-Thirring model:
ZΨ δσ σ γµ x µ 2 ρ 2 3 τ̄0 3
Ψσ (x)Ψ̄σ (0) = g⊥
16 exp − g − g
2π ρ2 16 32
|x|2+ 4
3 1 2 3 1 1
× exp τ̄0 − g⊥ − τ̄02 − τ̄0 − 2
g g⊥ + O g4 ,
16 4 16 6 16
(3.30)
where
− ρ 3
ρ2
− 8(1+ρ) π 4(1+ρ) 3ρ γE 3 4
ZΨ = (4ρ) M exp − ρ +O ρ .
2 8 4
In Eq. (3.30), we use the notation τ̄0 defined by (3.25).
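We now set ρ = 0 and $g_\parallel = g_\perp = g$ in (3.30) to obtain the perturbative expansion of the
scaling function F (1.3) for the SU(2)-Thirring model,
\[ F^{(\mathrm{pert})} = \exp\left\{ -\frac{3}{16}\, g + \frac{3}{16}\Bigl(\bar\tau_0 - \frac{1}{4}\Bigr) g^2 - \frac{3}{16}\Bigl(\bar\tau_0^2 - \frac{1}{16}\Bigr) g^3 + O\bigl(g^4\bigr) \right\}. \tag{3.31} \]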
Here the running coupling constant g solves the equation
\[ -g^{-1} + \frac{1}{2}\ln(g) = \ln\left(\sqrt{\frac{\pi}{2}}\; e^{\gamma_E + \bar\tau_0}\, M r\right), \tag{3.32} \]
which is the limit ρ = 0 of Eqs. (3.20) and (3.21).
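As an illustration of how (3.31) and (3.32) are used in practice, the following sketch solves the transcendental equation for g by bisection and evaluates the truncated series. It assumes that (3.32) reads −1/g + (1/2) ln g = ln(√(π/2) e^(γE + τ̄0) Mr) and that the exponent in (3.31) is −(3/16)g + (3/16)(τ̄0 − 1/4)g² − (3/16)(τ̄0² − 1/16)g³. The function names are hypothetical, and the bracketing interval for g is an arbitrary choice that covers the range of Mr in Table 1.

```python
import math

EULER_GAMMA = 0.5772156649015329

def running_coupling(mr, tau0_bar):
    """Solve -1/g + 0.5*ln(g) = ln(sqrt(pi/2) * exp(gamma_E + tau0_bar) * Mr) for g > 0."""
    rhs = math.log(math.sqrt(math.pi / 2.0) * mr) + EULER_GAMMA + tau0_bar
    lo, hi = 1e-12, 1e6  # the left-hand side is monotonically increasing in g
    for _ in range(200):
        mid = math.sqrt(lo * hi)  # bisection on a logarithmic scale
        if -1.0 / mid + 0.5 * math.log(mid) < rhs:
            lo = mid
        else:
            hi = mid
    return math.sqrt(lo * hi)

def f_pert(mr, tau0_bar):
    """Series (3.31) truncated after the g^3 term."""
    g = running_coupling(mr, tau0_bar)
    exponent = (-3.0 / 16.0 * g
                + 3.0 / 16.0 * (tau0_bar - 0.25) * g ** 2
                - 3.0 / 16.0 * (tau0_bar ** 2 - 1.0 / 16.0) * g ** 3)
    return math.exp(exponent)

if __name__ == "__main__":
    for tau0_bar in (-0.25, 0.25):
        print(tau0_bar, f_pert(0.1, tau0_bar))
```

For Mr = 0.1 this should reproduce the two perturbative columns of Table 1 (0.850520 for τ̄0 = −0.25 and 0.852013 for τ̄0 = 0.25) to the quoted accuracy. The small difference between the two values is a direct measure of the residual τ̄0 dependence of the truncated series.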
Let us stress here that, if the perturbation series could be summed, then the function F
should not depend on the auxiliary parameter τ̄0 :
\[ \frac{\partial F}{\partial \bar\tau_0} = 0. \]
This is, however, not true if we truncate the series (3.31) at some order N (for instance, if
one leaves only the terms explicitly written in (3.31)). In this case,
\[ \frac{\partial}{\partial \bar\tau_0}\, F_N^{(\mathrm{pert})} = O\bigl(g^{N+1}\bigr), \]
where the truncated series is denoted by $F_N^{(\mathrm{pert})}$. In fitting numerical data with (3.31), we
may treat τ̄0 as an optimization parameter, allowing us to minimize or at least develop a
feeling for the effects of the remainder of the series. Similar ideas have been discussed for
QCD in Ref. [30].
It may be worth mentioning that Eq. (3.27), along with explicit values of the coefficients
quoted in Appendix B, allows one to immediately determine the short-distance expansion
of some other conventional correlators in the (anisotropic) SU(2)-Thirring model. For
example, since the sine-Gordon field ϕ (2.4) itself can be defined by the relation
\[ \varphi = -i\, \frac{\partial}{\partial a}\, O_a^n \Big|_{n=0,\; a=0}, \]
where
g 1 g2 g3
I1 = 1 − + τ̄0 + − τ̄0 (τ̄0 + 1)
2 4 2 2
3
τ̄0 τ̄0 13 7
+ + τ̄02 + − − ζ(3) g 4 + O g 5 ,
2 4 128 16
2
g 7 2 τ̄0 13 7
I2 = 1 − 2τ̄0 g + τ̄0 (3τ̄0 + 1)g − 4τ̄0 + τ̄0 + −
2 3
− ζ(3) g 3
4 2 2 16 2
+ O g4 ,
where
\[ Q(a) = 2^{1-2a}\, \frac{\Gamma\bigl(\frac{3}{2} - a\bigr)}{\Gamma\bigl(\frac{1}{2} + a\bigr)}, \]
and
\[ d_\Psi = \frac{1}{2} + \frac{\rho^2}{4(1+\rho)} \]
is the scale dimension of the fermion field. The factor $Q(d_\Psi - 2\nu)/Q(d_\Psi)$ is essentially
the only source of differences between the RG treatments in coordinate space and in
momentum space. The RG analysis in momentum space goes as in the previous section.
The perturbative part in µ of F obeys the Callan–Symanzik equation, so it can be written
as
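\[ F^{(\mathrm{pert})} = Q(d_\Psi)\, \bigl(p^2\bigr)^{d_\Psi - \frac{1}{2}} \exp\left\{ -\int_{p^2}^{\infty} \frac{ds}{s}\, \bigl(\Gamma_g - d_\Psi\bigr) \right\}, \tag{4.3} \]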
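where the function $\Gamma_g$ admits a power series expansion in terms of the momentum-space
running coupling constants $g_{\parallel,\perp} = g_{\parallel,\perp}(p^2)$ depending on the Lorentz invariant $p^2$:
\[ \Gamma_g = \sum_{l,k=0}^{\infty} \tilde\gamma_{l,k}\, g_\parallel^{\,l}\, g_\perp^{2k}. \tag{4.4} \]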
Notice that, with some abuse of notation, we use here the same symbols $g_{\parallel,\perp}$ for
the momentum-space running couplings as we used for the coordinate-space running
couplings. In order to fix the coefficients in (4.4), we have to choose a renormalization
scheme. Substituting r by $1/\sqrt{p^2}$ in (3.21) defines Zamolodchikov’s scheme in momentum
space. It is a simple matter to repeat the steps of the previous section in order to determine
the first few coefficients $\tilde\gamma_{l,1}$ in (4.4). Just compare the logarithmic derivatives of the
expressions (4.2) and (4.3); the only difference is that the factor $Q(d_\Psi - 2\nu)/Q(d_\Psi)$
in (4.2) will have to be expanded in ρ, giving non-trivial contributions. As for the
coefficients γ̃l,2 , one would in principle need the next order in conformal perturbation
theory. However, again as in the previous section, it is possible to determine γ̃0,2 without
this calculation, as described in the next section. From these coefficients, and from the form
of the RG flow equation, one can evaluate the integral in (4.3) and obtain the asymptotic
behavior of the two-point function in the Euclidean region at p2 → +∞. We quote here
the result in the case of the SU(2)-Thirring model,
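\[ F^{(\mathrm{pert})} = \exp\left\{ -\frac{3}{16}\, g + \frac{3}{16}\Bigl(\tilde\tau_0 - \frac{1}{4}\Bigr) g^2 - \frac{3}{16}\Bigl(\tilde\tau_0^2 - \frac{1}{16}\Bigr) g^3 + O\bigl(g^4\bigr) \right\}. \tag{4.5} \]
Here
\[ -g^{-1} + \frac{1}{2}\ln(g) = \ln\left(\sqrt{2\pi}\, M e^{\tilde\tau_0} / \sqrt{p^2}\right), \tag{4.6} \]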
2
and τ̃0 is an arbitrary parameter which can be chosen at will. Notice the strong similarity
between (4.5) and (3.31).
We also quote here the corresponding function Γg (4.4) in the case g = g⊥ :
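\[ \Gamma_g = \frac{1}{2} + \frac{3}{32}\, g^2 - \frac{3}{16}\, \tilde\tau_0\, g^3 + \frac{3}{32}\Bigl(3\tilde\tau_0^2 + \tilde\tau_0 - \frac{3}{16}\Bigr) g^4 + O\bigl(g^5\bigr). \tag{4.7} \]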
In [22], the anomalous dimension for the fermion field in the MS scheme was found
to fourth order for a general non-Abelian Thirring model (see also [31] and references
where the function hλ and the anomalous dimension γλ were given in [22] to fourth order
in λ for the Thirring model with a general non-Abelian symmetry. In the particular case of
the SU(2)-symmetry, they specialize to
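\[ h_\lambda = 1 + \frac{15}{128\pi^2}\, \lambda^2 - \frac{11}{512\pi^3}\, \lambda^3 + \frac{3(80\zeta(3) - 511)}{32768\pi^4}\, \lambda^4 + O\bigl(\lambda^5\bigr), \tag{4.13} \]
and
\[ \gamma_\lambda = -\frac{3}{16\pi^2}\, \lambda^2 + \frac{15}{64\pi^3}\, \lambda^3 + \frac{3}{1024\pi^4}\, \lambda^4 + O\bigl(\lambda^5\bigr). \tag{4.14} \]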
Comparing (4.3) in the case ρ = 0 with the above expressions, one has the following
relation:
\[ \Gamma_g = \frac{1}{2} - \frac{\gamma_\lambda}{2} - \frac{\beta_\lambda}{2}\, \frac{d}{d\lambda} \log(h_\lambda). \tag{4.15} \]
Using Eqs. (4.10)–(4.14), one can check that our result (4.7) agrees with (4.15), provided
that
τ = τ̃0 . (4.16)
Notice that the relation between the normalization scale ΛMS and M,
\[ \Lambda_{\mathrm{MS}} = \sqrt{2\pi}\, M, \tag{4.17} \]
which is a consequence of (4.12) and (4.16), was previously found in Ref. [12].
5. Long-distance behavior
implies that the non-vanishing form factors of the operator $O^{+1}_{\beta/4}$ are of the form
\[ \langle \mathrm{vac} |\, O^{+1}_{\beta/4}(0)\, | A_-(\theta_1) \cdots A_-(\theta_{N+1})\, A_+(\theta'_1) \cdots A_+(\theta'_N) \rangle, \tag{5.1} \]
where $\theta_i$ and $\theta'_j$ denote rapidities of solitons and antisolitons respectively. Up to an overall
normalization, all these form factors can be written down in closed form, as certain N -fold
integrals [14,32,33]. The spectral decomposition for the correlation function (2.7) then
gives
+∞
dθ −Mr cosh(θ) 2
+1
(0)A− (θ )
(1)
F1/4 (r) = e vac | Oβ/4
2π
−∞
∞
1 dθ1d θ2 dθ3 −Mr 3 cosh(θk )
+ e k=1
3! (2π)3
−∞
× vac|O+1 (0)Aσ (θ1 )Aσ (θ2 )Aσ (θ3 ) 2 + · · · , (5.2)
β/4 1 2 3
σ1 +σ2 +σ3 =−1
where the dots stand for the five-particle and higher contributions, which are of the order
of e−5Mr . The long-distance asymptotic behavior of the correlation function is dominated
by the contribution of the one-particle states,
+1
(0)A− (θ ) = Z1 (β/4) e 4 (θ+ 2 ) ,
1 iπ
vac|Oβ/4
and has an especially simple form,
−Mr
e
(1)
F1/4 (r) = Z1 (β/4) √ + O e−3Mr . (5.3)
2πMr
Here we use the notation $Z_n(a)$ ($a = \omega\beta$) from work [13] for the field-strength renormalization
which controls the long-distance asymptotics of the correlation function (2.8). Let
us stress here that the overall multiplicative normalization of the field $O^{1}_{\beta/4}$ was already
fixed by the condition (3.7), hence, the constant $Z_1(\beta/4)$ is totally unambiguous. In [13],
the following explicit formula for $Z_n(\omega\beta)$ was proposed:
Zn (ωβ)
n
n2
√π MΓ ( 3 + 1 ) 2d
C2 2 C2 − 4 2 2ρ
=
2C12 16ρ 2Γ (1 + 2ρ )
1
∞
dt cosh(4ωt)e−(1+ρ)nt − 1 n −2t
× exp + − 2de .
t 2 sinh(t) sinh((1 + ρ)t) cosh(tρ) 2 sinh(t)
0
(5.4)
In this formula, d is the scale dimension (2.9) and the constants C1 , C2 read
∞
2− 12 e 4 Γ ( 14 )
5 1
dt sinh2 ( 2 )e−t
tρ
C1 = √ exp ,
π A3G t 2 cosh2 (tρ) sinh(t)
0
∞
dt sinh2 ( 2 )e−t
tρ
Γ 4 ( 14 )
C2 = exp −2 ,
4π 3 t cosh(tρ) sinh(t)
0
where AG = 1.282427 . . . is the Glaisher constant.
We do not write down explicitly the general formula for the three-particle contribution
in (5.3) because it is a rather mechanical substitution of relations presented in [13]. (For
β 2 = 1 the corresponding formulas can be found in Appendix C.) Here we make the
Z1 (β/4) 1√
= 2− 3 π e− 4 A3G exp w1 ρ 2 + O ρ 3 .
1
(5.5)
Z1,1/4
The explicit form of the coefficient w1 is not essential here. What is important is that
the linear term in ρ does not appear in the expansion (5.5). This observation can be
immediately generalized and checked for any n and ω. Furthermore, we expect that
\[ \log \frac{Z_n(\omega\beta)}{Z_{n,\omega}} = \sum_{k=0}^{\infty} w_k\, \rho^{2k} + O\bigl(\rho^{\infty}\bigr), \tag{5.6} \]
where Zn (ωβ) is the normalization constant (5.4). In other words, by means of the
transformation (3.29) with properly chosen coefficients wk , the constant Zn,ω in (3.12)
can be set to be equal (in the sense of formal power series) to $Z_n(\omega\beta)$. At the moment, we
do not have a rigorous proof of (5.6). But it leads to an interesting prediction that can be
checked. As was already mentioned, the calculations performed in the leading order in
conformal perturbation theory determine only the combination v2 − 3u2 /2, but do not fix
the individual values of the coefficients u2 and v2 in the series (3.27). Accepting (5.6),
one can immediately find the values of the coefficients u2 (see Appendix B). In the case
n = 1, ω = 1/4, it allows one to extend the perturbative expansion (3.31), as well as the
equivalent expansion (4.5), to order g 3 . As was discussed in Section 4, Eq. (4.5) is in a
complete agreement with the result of four-loop perturbative calculations from [22]. This
in fact shows that the ρ 3 -term really is absent in the series (5.5).
6. Spectral density
The spectral density is an important quantity related to the two-point function and its
analytical structure in momentum space. It is often what is measured in actual condensed
matter experiments [4,23], and it allows one to completely reconstruct the two-point
function. In this section, we discuss the properties of the spectral density in the SU(2)-
Thirring model.
The spectral decomposition of the fermion Green’s function yields the following form
for the function F (4.1):
\[ F\bigl(p^2\bigr) = 1 - \int_{M^2}^{+\infty} ds\, \frac{A_F(s)}{p^2 + s}. \tag{6.1} \]
p2 = −M 2 , and that the spectral density can be recovered from the discontinuity along
this cut:
\[ A_F(s) = \frac{1}{2\pi i} \Bigl[ F\bigl(e^{i\pi} s\bigr) - F\bigl(e^{-i\pi} s\bigr) \Bigr]. \tag{6.2} \]
The easiest way to obtain the large s asymptotics of the spectral density is to use
the expansion (4.5) along with knowledge of the analytical properties of the coupling
constant g (4.6) as a function of the complex variable p2 . Notice that g can be expressed
in terms of the principal branch of the product log (or Lambert) function, which gives the
solution for W in W eW = z (see, e.g., [34]):
\[ g = 2 \left[ W\!\left( \frac{p^2\, e^{-2\tilde\tau_0}}{\pi M^2} \right) \right]^{-1}. \tag{6.3} \]
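The relation between the running coupling and the Lambert function can be checked with a few lines of code. The sketch below is a hedged illustration: it assumes that (4.6) reads −1/g + (1/2) ln g = ln(√(2π) M e^(τ̃0)/√(p²)), which is equivalent to w e^w = p² e^(−2τ̃0)/(πM²) with w = 2/g, and it implements the principal branch of W for positive real argument with a plain Newton iteration instead of relying on a library routine. The function names are hypothetical.

```python
import math

def lambert_w0(z, tol=1e-14):
    """Principal branch W(z) for real z > 0, computed by Newton iteration on w*exp(w) = z."""
    w = math.log1p(z)  # starts at or above the root, so the iteration decreases monotonically
    for _ in range(100):
        ew = math.exp(w)
        step = (w * ew - z) / (ew * (w + 1.0))
        w -= step
        if abs(step) <= tol * (1.0 + abs(w)):
            break
    return w

def momentum_coupling(p2_over_m2, tau0_tilde):
    """g(p^2) = 2 / W(p^2 exp(-2 tau0_tilde) / (pi M^2)), with p^2 measured in units of M^2."""
    z = p2_over_m2 * math.exp(-2.0 * tau0_tilde) / math.pi
    return 2.0 / lambert_w0(z)

def flow_residual(p2_over_m2, tau0_tilde):
    """Left minus right side of the assumed form of (4.6); should vanish."""
    g = momentum_coupling(p2_over_m2, tau0_tilde)
    rhs = 0.5 * math.log(2.0 * math.pi) + tau0_tilde - 0.5 * math.log(p2_over_m2)
    return -1.0 / g + 0.5 * math.log(g) - rhs

if __name__ == "__main__":
    print(momentum_coupling(50.0, 0.1), flow_residual(50.0, 0.1))
```

Since W grows logarithmically for large argument, this also makes the asymptotic freedom of the coupling explicit: g decays like 2/ln(p²/M²) as p² → +∞.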
The principal branch of the W-function analytically maps the complex z-plane minus the
branch cut $z \in\, ]-\infty, -e^{-1}]$ to the part of the complex W-plane enclosing the real axis and
delimited by the curve $\Re e\, W = -\Im m\, W \cot(\Im m\, W)$ for $-\pi < \Im m\, W < \pi$. The analyticity
implies that the power series
\[ \sum_{n=0}^{\infty} \frac{1}{n!} \left( i\phi\, z\, \frac{d}{dz} \right)^{n} W(z) \Big|_{z=s} \]
converges for real positive $s > e^{-1}$ and $|\phi| \le \pi$ and coincides with $W(e^{i\phi} s)$. Similar
considerations are, of course, valid for the coupling constant g (6.3). In particular, for
sufficiently large s,
\[ g\bigl(e^{\pm i\pi} s\bigr) = \sum_{n=0}^{\infty} \frac{1}{n!} \left( \pm i\pi\, p^2\, \frac{d}{dp^2} \right)^{n} g\bigl(p^2\bigr) \Big|_{p^2=s}. \]
This then gives us, with (4.5) and the RG flow equation (4.9), the asymptotic expansion of
the spectral density for large s. It can be written in the following form:
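\[ A_F(s) = -\frac{g^2}{2} \left[ 1 - \frac{g}{2} - \frac{\pi^2 - 1}{4}\, g^2 + O\bigl(g^3\bigr) \right] \frac{\partial F^{(\mathrm{pert})}}{\partial g} \bigg|_{p^2=s}. \tag{6.4} \]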
Here the function F(pert) is given by (4.5) and g is defined by Eq. (4.6).
Now let us consider the threshold behavior of the spectral density. According to the
analyses of the previous section, the long-distance asymptotic behavior of the scaling
function F (1.3) is described by the expansion
\[ F = F^{(1)} + F^{(3)} + O\bigl(e^{-5Mr}\bigr), \tag{6.5} \]
where
\[ F^{(1)} = C\, e^{-Mr}, \]
The function F (3) in (6.5) gives the three-particle contribution to the correlation function.
Using the definitions (1.3), (4.1) and the above relation, one can obtain:
\[ F\bigl(p^2\bigr) = C \left( 1 - \frac{1}{\sqrt{1 + p^2/M^2}} \right) + \cdots. \tag{6.6} \]
Here the dots stand for contributions of the massive multiparticle intermediate states. The
last relation implies that the spectral density (6.2) can be written as
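\[ A_F(s) = \frac{C}{\pi}\, \frac{\Theta(s - M^2)}{\sqrt{s/M^2 - 1}} + \Theta\bigl(s - 9M^2\bigr)\, A_F^{(3)}(s), \tag{6.7} \]
where
\[ \Theta(s) = \begin{cases} 1 & \text{for } s \ge 0, \\ 0 & \text{for } s < 0 \end{cases} \]
and $A_F^{(3)}$ is some function which contributes to the spectral density only above the
threshold $s = 9M^2$.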
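The link between the one-particle term of the spectral density and the momentum-space function can be made concrete with a small numerical check. The sketch below assumes the one-particle density (C/π)Θ(s − M²)/√(s/M² − 1) and a dispersion integral of the form ∫ ds A(s)/(p² + s) taken from s = M² to infinity; under these assumptions the integral should equal C/√(1 + p²/M²), which is the p²-dependent part of the one-particle term C(1 − 1/√(1 + p²/M²)). The substitution s = M²/cos²θ and the midpoint rule are choices made here for illustration only, and the function name is hypothetical.

```python
import math

def one_particle_dispersion_integral(p2, mass=1.0, c=1.0, n=2000):
    """(C/pi) * integral over s >= M^2 of ds / ((p^2 + s) * sqrt(s/M^2 - 1)).

    With s = M^2 / cos(theta)^2 the integrand becomes the smooth function
    2 M^2 / (M^2 + p^2 cos(theta)^2) on 0 <= theta <= pi/2, integrated by the midpoint rule.
    """
    m2 = mass * mass
    h = (math.pi / 2.0) / n
    total = 0.0
    for i in range(n):
        theta = (i + 0.5) * h
        total += 2.0 * m2 / (m2 + p2 * math.cos(theta) ** 2)
    return c / math.pi * total * h

if __name__ == "__main__":
    p2 = 3.0
    print(one_particle_dispersion_integral(p2), 1.0 / math.sqrt(1.0 + p2))
```

The inverse square-root singularity at s = M² is integrable, which is why the one-particle state produces a branch cut starting at p² = −M² rather than a pole in this function.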
7. Numerics
8. Conclusion
Table 1
The scaling function F (1.2), (1.3). The first three data columns give the results of the long-distance expansion:
the one-particle contribution, the three-particle contribution, and their sum. The data in the last two columns
correspond to the perturbative expansion (3.31) for two different values of the auxiliary parameter τ̄0.

Mr        F^(1)      F^(3)      F^(1) + F^(3)   F^(pert) (τ̄0 = −0.25)   F^(pert) (τ̄0 = 0.25)
0         0.921862   0.068      0.990           1.00000                  1.00000
0.00001   0.921853   0.0553     0.9771          0.980129                 0.980130
0.00005   0.921816   0.0522     0.9740          0.976311                 0.976314
0.0001    0.921770   0.0504     0.9722          0.974192                 0.974196
0.0002    0.921678   0.0483     0.9700          0.971674                 0.971678
0.001     0.920941   0.0415     0.9624          0.963508                 0.963520
0.002     0.920020   0.0375     0.9575          0.958435                 0.958454
0.01      0.912689   0.0252     0.9379          0.939386                 0.939460
0.025     0.899101   0.0168     0.9159          0.919294                 0.919494
0.05      0.876902   0.0106     0.8875          0.894050                 0.894547
0.075     0.855251   0.00738    0.86263         0.871796                 0.872717
0.1       0.834135   0.00541    0.83955         0.850520                 0.852013
0.15      0.793454   0.00317    0.79662         0.808380                 0.811548
0.2       0.754757   0.00200    0.75676         0.765139                 0.770842
0.25      0.717947   0.00131    0.71926         0.719980                 0.729252
0.3       0.682932   0.000889   0.683822        0.672640                 0.686654
0.35      0.649625   0.000617   0.650243        0.623153                 0.643171
0.4       0.617942   0.000436   0.618379        0.571774                 0.599063
0.45      0.587805   0.000313   0.588118        0.518942                 0.554677
0.5       0.559137   0.000227   0.559365        0.465257                 0.510405
behaviors needs an adjustment of the normalization used in one method with respect
to that used in the other method. The necessary formula for such an adjustment was
proposed in [13]: there the exact form factors, with appropriate normalization constant,
were conjectured assuming a “conformal” normalization of the fields. This allowed us to
numerically compare both methods and to observe an agreement to within 1% in the region
0 < Mr < 0.05. Moreover, using these exact form factors we conjectured an infinite set of
relations between expansion coefficients of the fermion anomalous dimension arising in
the RG treatment of the anisotropic model. This was done essentially by identifying the
singular part in ρ 2 (see Eqs. (2.3) and (2.5)) of the normalization constant conjectured
in [13] with the singular part of the normalisation constant obtained in the RG treatment
(see (5.6)). Using one of these relations along with a first non-trivial order calculation in
conformal perturbation theory, we obtained the desired fermion Schwinger’s function to
third order in the coupling of the SU(2) model, and observed agreement with what was
obtained in [22] by standard perturbation theory. These results, numerical and analytical,
suggest the validity of the conjectured exact form factors of [13] in the case of the SU(2)-
Thirring model. It might be interesting to apply similar methods to Thirring-like models
with other symmetries.
Acknowledgements
S.L. is grateful to F. Essler and A. Tsvelik for their helpful collaboration during the
early stages of this work. We have also benefited greatly from discussions with A.
Zamolodchikov and Al. Zamolodchikov.
This research is supported in part by DOE grant #DE-FG02-96ER10919. B.D. is
supported in part by an NSERC Postgraduate Scholarship.
Appendix A
In this appendix, we give the first few terms in the expansion of $J_n(a, c)$ (3.6) around
$c = -2$. The coefficients in this expansion involve standard functions of a, which could
then easily be used to obtain an expansion of $J_n\bigl(\frac{2\omega}{1+\rho}, \frac{-2}{1+\rho}\bigr)$ in powers of ρ, as is needed
in (3.23). To simplify the result, we will use the parameter
\[ b = c + 2. \]
We find the following expansions in b of the functions A(q, b − 2), B(q, b − 2) (3.8)
involved in (3.9):
2
Γ (2 − b − q)Γ (b) 2 π ψ(1 − q) + γE 3
A(q, b − 2) = 1+b + +O b ,
Γ (2 − b)Γ (1 + b − q) 6 q
Γ (q + b)Γ (b) b b2
B(q, b − 2) = 1+ + 2 q 2 ψ (q) − 1
Γ (q)Γ (2b) 2q 2q
b3
+ qψ (q) + 2ψ (q) + O b4 .
4q
Hence,
π 2 4a 2 − n2 + 2nb
Jn (a, b − 2) =
2 b2 (1 − b)2
Gn (a) aGn (a) + Gn (a) 10
× exp −Gn (a)b + −2 + ζ(3) b 3
12 n2 − 4a 2 3
4
+O b ,
where
\[ G_n(a) = \psi(a + n/2) + \psi(-a + n/2) + 2\gamma_E, \]
and $G_n'(a) = \frac{d}{da} G_n(a)$, $G_n''(a) = \frac{d^2}{da^2} G_n(a)$.
Appendix B
In this appendix, we write down explicit expressions for the coefficients $u_1$, $u_2$, $v_1$ and
$v_2$ appearing in the expansion (3.16), (3.26) of the function $\Gamma_g$.
On the one hand, from the assumption (5.6), the coefficients of odd powers of ρ in the
exponential factor of Zn,ω (3.28) are completely fixed by the conjectured constant Zn (ωβ)
(5.4). This fixes u1 , u2 uniquely, giving
n2 3 n(n − 2)
u1 = ω 2 − Tn (2ω) − + ,
16 2 16
1 n2 n2 1 ω(4ω2 − 1)
u2 = ω2 − ω2 + − Tn (2ω) + Tn (2ω)
12 16 16 2 12
1 2 1 n(n + 4) 11ω2 1 τ2 n2
+ ω − Tn (2ω) − − + τ̄0 + ω −
2
,
4 12 768 48 24 2 16
(B.1)
where
n2 Tn (2ω) 1 2 n(n − 4) 1
− ω2 − + ω + − ωTn (2ω)
16 2 4 16 2
1 n2 1 2 n(n − 4)
− ω −
2
T (2ω) −
3
ω + Tn2 (2ω)
12 16 n 8 16
1 2 n(n − 2) 1
− ω − − Tn (2ω)
8 8 2
1 2 n2 n(n − 8) u1 v1 3u2 τ̄0
− ω − 2τ2 − 14ζ(3) − 3 − + + + − .
8 16 256 8 2 2 8
Appendix C
In this appendix, we give the formula for the three-particle contribution F (3) (6.5) to the
fermion two-point function in the SU(2)-Thirring model that we used for our numerical
calculations. We first specialize the expression written in [13] to the case of three-particle
form factors of the field $O^{1}_{\beta/4}$ for $\beta^2 = 1$:
vac|O1/4
1
(0)A− (θ1 ) · · · A+ (θk ) · · · A− (θ3 ) in
9
AG2 Γ 3 ( 14 ) iπ 1
3
θm
=− 15 3 9
G(θm − θj )
e 8 M4 e 4
2 e π
4 8
m=1 4
m<j
dγ − γ
k 3
× e 2 W (θp − γ ) W (γ − θp )
2π
C+ p=1 p=k+1
dγ − γ
k−1 3
+ e 2 W (θp − γ ) W (γ − θp ) . (C.1)
2π
C− p=1 p=k
and
Γ ( 34 − iθ
2π )Γ (− 4
1
+ iθ
2π )
W (θ ) = 2 . (C.3)
Γ 2 ( 14 )
The contour C+ starts from −∞ on the real axis of the complex γ -plane, goes above the
poles located at γ = θp + iπ/2, p = 1, . . . , k, and below those located at γ = θp − iπ/2,
p = k + 1, . . . , 3, always staying in the strip $-\pi/2 - 0 < \Im m\, \gamma < \pi/2 + 0$, and finally
extends to +∞ on the real axis. Similarly, the contour C− goes above the poles located at
γ = θp + iπ/2, p = 1, . . . , k − 1, and below those at γ = θp − iπ/2, p = k, . . . , 3. Notice
that the integrals in (C.1) can be expressed in terms of the generalized hypergeometric
function ${}_3F_2$ at unity.
Using the expressions (C.1) and performing one of the rapidity integrals in (5.2), one
can obtain the following form for the function F (3) in (6.5):
∞ √
2e− 4 A9G
3
e−Mr 3+2 cosh x+2 cosh y+2 cosh(x−y)
F (3)
= dx dy
3πΓ 6 ( 14 )
1
(3 + 2 cosh x + 2 cosh y + 2 cosh(x − y)) 4
−∞
2
× 2|R1 (x, y)|2 + |R2 (x, y)|2 G(x)G(y)G(x − y)
−x 1
x+y e + e−y + 1 4
×e 2 .
ex + ey + 1
The functions R1 and R2 here are
y
R2 (x, y) = e− 2 + 4 R1 (−x, y − x) − e− 2 − 4 R1∗ (−y, x − y)
x iπ iπ
and
cosh x2 cosh y2 1 1 ix 1 iy ix iy
R1 (x, y) = − U − ,− − ,− − ;− ,−
2 sinh x sinh y 2 2 2π 2 2π 2π 2π
cosh y−x
2 cosh 2
x
+ e− 2
x
2 sinh(y − x) sinh x
1 1 i(y − x) 1 ix i(y − x) ix
× , − , + ;1− ,2 +
2 2 2π 2 2π 2π 2π
y cosh x−y
2 cosh 2
y
+ e− 2
2 sinh(x − y) sinh y
1 1 i(x − y) 1 iy i(x − y) iy
×U , − , + ;1− ,2 + ,
2 2 2π 2 2π 2π 2π
where $U(a, b, c; d, e)$ is related to the generalized hypergeometric function ${}_3F_2$ by
\[ U(a, b, c; d, e) = \frac{\Gamma(a)\Gamma(b)\Gamma(c)}{\Gamma(d)\Gamma(e)}\; {}_3F_2(a, b, c; d, e; 1). \]
References
[17] S. Coleman, The quantum sine-Gordon equation as the massive Thirring model, Phys. Rev. D 11 (1975)
2088–2097.
[18] S. Mandelstam, Soliton operators for the quantized sine-Gordon equation, Phys. Rev. D 11 (1975) 3026–
3030.
[19] B. Berg, M. Karowski, P. Weisz, Construction of Green’s functions from an exact S-matrix, Phys. Rev. D 19
(1979) 2477–2479.
[20] J. Balog, M. Niedermaier, Off-shell dynamics of the O(3) non-linear sigma model beyond Monte Carlo and
perturbation theory, Nucl. Phys. B 500 (1997) 421–461.
[21] C. Acerbi, G. Mussardo, A. Valleriani, Form-factors and correlation functions of the stress-energy tensor in
massive deformation of the minimal models $E^{(1)}_{(N)} \times E^{(1)}_{(N)} / E^{(2)}_{(N)}$, Int. J. Mod. Phys. A 11 (1996) 5327–5364.
[22] D.B. Ali, J.A. Gracey, Four loop wave function renormalization in the non-Abelian Thirring model, Nucl.
Phys. B 605 (2001) 337–364.
[23] F.H.L. Essler, A.M. Tsvelik, Weakly coupled one-dimensional Mott insulator, cond-mat/0108382.
[24] Al.B. Zamolodchikov, Mass scale in the sine-Gordon model and its reduction, Int. J. Mod. Phys. A 10 (1995)
1125–1150.
[25] R. Guida, N. Magnoli, All order IR finite expansion for short distance behavior of massless theories
perturbed by a relevant operator, Nucl. Phys. B 471 (1996) 361–388.
[26] S.D. Mathur, Quantum Kac–Moody symmetry in integrable field theories, Nucl. Phys. B 369 (1992) 433–
460.
[27] V. Dotsenko, M. Picco, P. Pujol, Renormalization group calculation of correlation functions for the 2D
random bond Ising and Potts models, Nucl. Phys. B 455 (1995) 701–723.
[28] R. Guida, N. Magnoli, Tricritical Ising model near criticality, Int. J. Mod. Phys. A 13 (1998) 1145–1158.
[29] Al.B. Zamolodchikov, unpublished.
[30] P.M. Stevenson, Optimized perturbation theory, Phys. Rev. D 23 (1981) 2916–2943.
[31] J.F. Bennett, J.A. Gracey, Three-loop renormalization of the SU(Nc ) non-Abelian Thirring model, Nucl.
Phys. B 563 (1999) 390–436.
[32] F.A. Smirnov, Form-factors in Completely Integrable Models of Quantum Field Theory, World Scientific,
Singapore, 1992.
[33] S. Lukyanov, Free field representation for massive integrable models, Commun. Math. Phys. 167 (1) (1995)
183–226;
S. Lukyanov, Form-factors of exponential fields in the sine-Gordon model, Mod. Phys. Lett. A 12 (1997)
2543–2550.
[34] R.M. Corless, G.H. Gonnet, D.E.G. Hare, D.J. Jeffrey, D.E. Knuth, On the Lambert W function, Adv.
Comput. Math. 5 (4) (1996) 329–359;
D.J. Jeffrey, D.E.G. Hare, R.M. Corless, Unwinding the branches of the Lambert W function, Math. Sci. 21
(1996) 1–7.
Nuclear Physics B 644 [FS] (2002) 476–494
Received 23 May 2002; received in revised form 5 August 2002; accepted 12 September 2002
Abstract
By constructing the reflection spin-Dunkl operators, the integrable Sutherland–Römer model
(SRM) with open boundary condition is established, which describes a one-dimensional, two-
component, quantum many-particle system in which like particles interact with a pair potential
g(g + 1)/ sinh2 (r), while unlike particles interact with a pair potential −g(g + 1)/ cosh2 (r). By
solving the Schrödinger equation and using the properties of the hypergeometric functions and
gamma functions, the two-particle scattering matrix and the reflection matrix are obtained in the
framework of the asymptotic Bethe ansatz method. The Bethe ansatz equations of the system are
obtained. The Hamiltonians of SRM with some other open boundary conditions are expressed
explicitly. Our method can be generalized, as an example, to the boundary Calogero–Sutherland model
which is also constructed by the reflection spin-Dunkl operators.
2002 Elsevier Science B.V. All rights reserved.
Keywords: Sutherland–Römer model; Reflection spin-Dunkl operator; Asymptotic Bethe ansatz; Reflection
equation
1. Introduction
* Corresponding author.
E-mail address: caojp@phy.nwu.edu.cn (J.-P. Cao).
J.-P. Cao et al. / Nuclear Physics B 644 [FS] (2002) 476–494 477
where xj and σjz are the coordinate and z component Pauli matrix of the particle j ,
respectively, N the total number of particles, g the coupling constant. The two kinds of
quasi-particles are distinguished by a quantum number σjz = ±1, which may be thought of
as either spin or charge. When like quasi-particles are near, the repulsive potential increases
as 1/r 2 , while for large separations, both potentials decay exponentially. This system
was first introduced by Calogero [16], who showed it to be integrable. Sutherland [17]
soon afterward showed that the system could be exactly solved, and gave the solution for
a single-component system. Sutherland and Römer discussed the exact solution (in the
thermodynamic limit) for the two-component system. All of the above works deal only
with systems with periodic boundary conditions.
Impurities and boundary effects play a relevant role in 1D systems. The strong coupling
fixed point of impurity models frequently renormalizes to the equivalent open boundary
impurity problem [18]. This is also expected to be the case for models with long-range
potentials and could be of relevance for the edge states of the FQHE. Open boundaries
have been studied in the context of the $BC_N$-type CSM [19–21] and the open Haldane–Shastry
spin chain [22].
In this paper, we construct the Hamiltonian of the SRM with open boundary conditions
(see Eq. (2)) by means of the reflection spin-Dunkl operators and prove its integrability. By solving
the Schrödinger equation and using the ABAM, we obtain the two-particle scattering matrix
and the reflection matrix. Then, with the help of the quantum inverse scattering
method, we obtain the eigenvalues of the transfer matrix and of the Hamiltonian of the system.
This paper is organized as follows. In Section 2, we describe the reflection
spin-Dunkl operators and construct the system considered in this paper. In Section 3,
the two-particle scattering matrix and the reflection matrix are calculated. The Hamiltonian is
diagonalized in Section 4. Section 5 includes a brief summary and some discussion.
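\[ \cdots\; + \sum_{j \neq l}^{N} \frac{g(g+1)\bigl(1 + \sigma_j^z \sigma_l^z\bigr)}{2 \sinh^2(x_j + x_l)} - \sum_{j \neq l}^{N} \frac{g(g+1)\bigl(1 - \sigma_j^z \sigma_l^z\bigr)}{2 \cosh^2(x_j + x_l)} \]
\[ \qquad + \sum_{j=1}^{N} \frac{\rho(\rho+1)\bigl(1 + \sigma_j^z\bigr)}{\sinh^2(x_j)} - \sum_{j=1}^{N} \frac{\rho(\rho+1)\bigl(1 - \sigma_j^z\bigr)}{\cosh^2(x_j)}, \tag{2} \]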
where xj and σjz are the coordinate and the z component Pauli matrix of the particle
j , respectively, N the total number of particles, g and ρ the coupling constants. The
system (2) describes the SRM with boundary fields. By rescaling xj → πxj /(2L), it is
seen that the last two terms describe two boundary fields at 0 and L, respectively. The
terms of sinh(xj + xl ) and cosh(xj + xl ) represent a typical feature of the open boundary
system, which describe the interaction between the j th electron and the mirror image of the
lth electron or vice versa. The inclusion of the image terms is just equivalent to removing
the infinite wall at the boundary.
First, we show the construction and the integrability of the system (2). For a one-
dimensional system of N particles, we define the reflection spin-Dunkl operators
N
Dj = pj + j l + iuj Mj ,
vj l Mj l + v̄j l M (3)
l=1,l=j
where pj (j = 1, 2, . . . , N) are the momenta of the 1D quantum mechanical particles. The
vj l = v(xj − xl , σjz , σlz ), v̄j l = v(xj + xl , σjz , σlz ) and uj = u(xj , σjz ) are yet undetermined
N
[Di , Dj ] = Uj l − Wj lk , (5)
k=1,k=j,l
where
Uj l = uj vj l + uj vl,−j − ul v̄j,−l − u−l v̄j,l Mj Mj l
− ul vj l + ul v−j,l − uj v̄−j,l − u−j v̄j,l Ml Mj l ,
Wj lk = (vj k vlj − vlj vlk − vlk vj k )Mlk Mj l − (vlk vj l − vj l vj k − vj k vlk )Mj l Mlk
+ vj k v̄lj − v̄lj v−lk − v̄lk v̄j,−k Mj k M lk
+ vj k v̄lk − v̄lk vj,−l + v̄j l v̄−j,k M lk Mj k
+ v̄j k vl,−j − v̄lj v̄−l,k − vlk v̄j k M j k Mlk
+ v̄j l v−j,k + v̄j k v̄l,−k − vlk v̄j l Mlk M j k
+ v̄j k v̄l,−j − vlj v̄lk − v̄lk vj,−k Mj l M j k
+ vj l v̄j k + v̄j k vl,−k − v̄lk v̄j,−l M j k Mj l .
If the reflection spin-Dunkl operators commute with each other, $[D_j, D_l] = 0$, we can
define a commuting family of conserved quantities from the reflection spin-Dunkl
operators as $I_n = \sum_j (D_j)^n$. If one of the $I_n$ is chosen as the Hamiltonian, then the model
is integrable. For the SRM, we seek solutions of the equations $U_{jl} = W_{jlk} = 0$. After
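\[ v_{jl} = \frac{ig}{2} \bigl[ \coth(x_j - x_l) - \mathrm{sgn}(j - l) \bigr] \bigl( 1 + \sigma_j^z \sigma_l^z \bigr) + \frac{ig}{2} \bigl[ \tanh(x_j - x_l) - \mathrm{sgn}(j - l) \bigr] \bigl( 1 - \sigma_j^z \sigma_l^z \bigr), \tag{6} \]
\[ \bar v_{jl} = \frac{ig}{2} \bigl[ \coth(x_j + x_l) + 1 \bigr] \bigl( 1 + \sigma_j^z \sigma_l^z \bigr) + \frac{ig}{2} \bigl[ \tanh(x_j + x_l) + 1 \bigr] \bigl( 1 - \sigma_j^z \sigma_l^z \bigr), \tag{7} \]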
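\[ u_j = \rho \Bigl\{ \bigl[ \coth(x_j) + 1 \bigr] \bigl( 1 + \sigma_j^z \bigr) + \bigl[ \tanh(x_j) + 1 \bigr] \bigl( 1 - \sigma_j^z \bigr) \Bigr\}, \tag{8} \]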
where g, ρ are coupling constants and the sign function sgn(j − l) is defined as
\[ \mathrm{sgn}(j - l) = \begin{cases} 1, & \text{for } j > l, \\ -1, & \text{for } j < l. \end{cases} \]
It is necessary to point out that permutation operators Mj l and Mj commute with the sign
function. The other three solutions can be found in Appendix A.
We choose I2 as the Hamiltonian
N
N
H= pj2 + j l
vj2l + v̄j2l + vj l Mj l + v̄j l M
j =1 j =l
N
1 N
1
N
+ uj Mj + u2j + Uj l − Wj lk , (9)
2 3
j =1 j =l j =l=k=j
where $u_j' = \partial_j u_j$, $v_{jl}' = \partial_j v_{jl}$ and $\bar v_{jl}' = \partial_j \bar v_{jl}$. The Hamiltonian (9) can also be explicitly
written as
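$$\begin{aligned}
H ={}& -\sum_{j=1}^{N}\frac{\partial^2}{\partial x_j^2} + \sum_{j\neq l}^{N}\frac{g(g - M_{jl})(1 + \sigma_j^z\sigma_l^z)}{2\sinh^2(x_j - x_l)} - \sum_{j\neq l}^{N}\frac{g(g - M_{jl})(1 - \sigma_j^z\sigma_l^z)}{2\cosh^2(x_j - x_l)} \\
&+ \sum_{j\neq l}^{N}\frac{g(g - \bar M_{jl})(1 + \sigma_j^z\sigma_l^z)}{2\sinh^2(x_j + x_l)} - \sum_{j\neq l}^{N}\frac{g(g - \bar M_{jl})(1 - \sigma_j^z\sigma_l^z)}{2\cosh^2(x_j + x_l)} \\
&+ \sum_{j=1}^{N}\frac{\rho(\rho - M_j)(1 + \sigma_j^z)}{\sinh^2(x_j)} - \sum_{j=1}^{N}\frac{\rho(\rho - M_j)(1 - \sigma_j^z)}{\cosh^2(x_j)}. \qquad (10)
\end{aligned}$$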
The permutation operators Mj l acting on the bosonic or fermionic subspace of the Hilbert
space will simply become ±1. In addition [Mj , H ] = 0, which means the Hamiltonian
is invariant under reflections. Hence, Mj can be substituted by its eigenvalues ±1 or by
σjz [22]. Here Mj = ±1 corresponds to a scalar impurity potential, while Mj = σjz yields
a scalar potential and a boundary magnetic field. In the case of Mj l = −1 and Mj = −1 of
system (10), we arrive at the Hamiltonian (2).
The asymptotic Bethe ansatz method (ABAM) is a powerful tool for deriving the energy of
models with nonlocal interactions. Since the model is integrable, the ABAM requires only
the asymptotic behavior of the wave function at long distances, i.e., the two-body phase
shift can be obtained without full knowledge of the many-particle wave function.
We summarize below the results in the center-of-mass frame for $M_{jl} = -1$ and $M_j = -1$.
For like particles, the potential is $g(g + 1)/\sinh^2(x)$. Splitting off the center of
mass ($R = (x_1 + x_2)/2$), the Schrödinger equation is straightforwardly reduced to a single
differential equation for the relative coordinate ($x = x_1 - x_2$)
$$\frac{d^2\Psi}{dx^2} + \left[\frac{1}{2}E - \frac{g(g+1)}{\sinh^2 x}\right]\Psi = 0, \qquad (11)$$
where $E = 2k^2$ with $k = (k_1 - k_2)/2$. If we put $\xi = \coth x$, $\epsilon = \sqrt{-E} = ik$ and
$\Psi = (\xi^2 - 1)^{\epsilon/2}\, w\!\left(\tfrac{1}{2}(1 + \xi)\right)$, the Schrödinger equation (11) can be changed into the
hypergeometric equation
$$u(1-u)\,w''(u) + \left[\gamma - (\alpha + \beta + 1)u\right]w'(u) - \alpha\beta\, w(u) = 0, \qquad (12)$$
where we have used the notations $\alpha = ik - g$, $\beta = ik + g + 1$, $\gamma = 1 + ik$ and $u = \tfrac{1}{2}(1 + \xi)$.
The solution of Eq. (12) can be expressed in terms of hypergeometric functions as
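$$\begin{aligned}
\Psi(\xi) ={}& C_1\left(\xi^2 - 1\right)^{ik/2} F\!\left(\alpha, \beta, \gamma, \tfrac{1}{2}(1+\xi)\right) \\
&+ C_2\left(\xi^2 - 1\right)^{ik/2}\left(\frac{1+\xi}{2}\right)^{1-\gamma} F\!\left(\beta - \gamma + 1, \alpha - \gamma + 1, 2 - \gamma, \tfrac{1}{2}(1+\xi)\right). \qquad (13)
\end{aligned}$$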
As $x \to 0$ the wave function has two terms, one diverging as $x^{-g}$ and another
proportional to $x^{1+g}$. The coefficient of the $x^{-g}$ term has to vanish for the energy
expectation value to be properly defined. This determines the relation between the
constants $C_1$ and $C_2$,
$$\frac{C_1}{C_2} = -(-1)^{ik}\,\frac{\Gamma(1 - ik)\,\Gamma(1 + g + ik)}{\Gamma(1 + ik)\,\Gamma(1 + g - ik)}. \qquad (14)$$
Asymptotically as x → −∞, the wave function becomes
scattering amplitudes and this can be taken by the choice of either the statistics of the
particles or the number of particles are even or odd.
For unlike particles, with the potential −g(g + 1)/ cosh2 (x), the Schrödinger equation
is
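$$\frac{d^2\Psi}{dx^2} + \left[\frac{1}{2}E - \frac{g(g+1)}{\cosh^2 x}\right]\Psi = 0. \qquad (17)$$
If we put $\xi = \tanh x$, $\epsilon = \sqrt{-E} = ik$ and $\Psi = (1 - \xi^2)^{\epsilon/2}\, w\!\left(\tfrac{1}{2}(1 - \xi)\right)$, Eq. (17) can be
changed into the hypergeometric equation
$$u(1-u)\,w''(u) + \left[\gamma - (\alpha + \beta + 1)u\right]w'(u) - \alpha\beta\, w(u) = 0, \qquad (18)$$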
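$$S_{12}(k_1 - k_2)\,S_{13}(k_1 - k_3)\,S_{23}(k_2 - k_3) = S_{23}(k_2 - k_3)\,S_{13}(k_1 - k_3)\,S_{12}(k_1 - k_2), \qquad (25)$$
where $S_{12}(k)$, $S_{13}(k)$ and $S_{23}(k)$ act in $\mathbb{C}^n \otimes \mathbb{C}^n \otimes \mathbb{C}^n$ with $S_{12}(k) = S(k) \otimes 1$, $S_{23}(k) =
1 \otimes S(k)$, etc.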
Now, we consider the reflection at the boundary. For the spin-up particles ($\sigma = 1$),
the potential is $\rho(\rho + 1)/\sinh^2(x)$. Using a similar method, the wave function is given
asymptotically as
We find that this reflecting matrix R(k) satisfies the reflection equation [25,26]
$$S_{12}(k_1 - k_2)\,R_1(k_1)\,S_{21}(k_1 + k_2)\,R_2(k_2) = R_2(k_2)\,S_{12}(k_1 + k_2)\,R_1(k_1)\,S_{21}(k_1 - k_2), \qquad (30)$$
where $R_1(k) = R(k) \otimes I$, $R_2(k) = I \otimes R(k)$ and I is the 2 × 2 unit matrix.
In the system (2), we assume that the reflecting matrix of the left boundary is the unit
matrix, which means that particles reflected by the left boundary only acquire a phase
shift π. The periodic motion of particle j consists of scattering with each of the particles
to its right, reflecting off the right boundary, scattering with all other particles while it
moves to the left, reflecting at the left boundary with a phase shift π, and scattering again until
it reaches its original position. The transfer matrix then consists of a product of 2(N − 1)
particle–particle scattering matrices and the reflecting matrix of the right boundary. If the
momentum of particle j ($j = 1, 2, \dots, N$) is $k_j$ and the initial wave function is $\xi_0$, we
have
$$\begin{aligned}
&S_{j,1}(k_j - k_1)\cdots S_{j,j-1}(k_j - k_{j-1})\,S_{j,j+1}(k_j - k_{j+1})\cdots S_{j,N}(k_j - k_N)\,R_{j,0}(k_j) \\
&\quad\times S_{j,N}(k_j + k_N)\cdots S_{j,j+1}(k_j + k_{j+1})\,S_{j,j-1}(k_j + k_{j-1})\cdots S_{j,1}(k_j + k_1)\,\xi_0 = e^{2ik_jL}\,\xi_0. \qquad (31)
\end{aligned}$$
The above N eigenvalue equations are simultaneously solved by Bethe ansatz, diagonalizing
the Hamiltonian with eigenvalue $E = \sum_{j=1}^{N} k_j^2$.
We put
$$X(k_j) = S^-_{j,1}\cdots S^-_{j,j-1}\,S^-_{j,j+1}\cdots S^-_{j,N}\,R_{j,0}\,S^+_{j,N}\cdots S^+_{j,j+1}\,S^+_{j,j-1}\cdots S^+_{j,1}, \qquad (32)$$
where $S^\pm_{j,l} = S(k_j \pm k_l)$. The monodromy matrix of the system is
$$\begin{aligned}
T(k) ={}& S^-_{\tau,j}(k)\,S^-_{\tau,1}(k)\cdots S^-_{\tau,j-1}(k)\,S^-_{\tau,j+1}(k)\cdots S^-_{\tau,N}(k)\,R_{\tau,0}(k) \\
&\times S^+_{\tau,N}(k)\cdots S^+_{\tau,j+1}(k)\,S^+_{\tau,j-1}(k)\cdots S^+_{\tau,1}(k)\,S^+_{\tau,j}(k), \qquad (33)
\end{aligned}$$
where $S^\pm_{\tau,j}(k) = S(k \pm k_j)$. The monodromy matrix satisfies the reflection equation
$$S_{1,2}(k_1 - k_2)\,T_1(k_1)\,S_{1,2}(k_1 + k_2)\,T_2(k_2) = T_2(k_2)\,S_{1,2}(k_1 + k_2)\,T_1(k_1)\,S_{1,2}(k_1 - k_2), \qquad (34)$$
where the T1 (k) and T2 (k) act on the τ1 and τ2 auxiliary space. T1 (k) = T (k) ⊗ I ,
T2 (k) = I ⊗ T (k) and I is the 2 × 2 unit matrix.
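We find $S^-_{\tau,j}(k_j) = P_{\tau j}$ and
$$\begin{aligned}
\operatorname{tr}_\tau T(k_j) &= X(k_j)\,\operatorname{tr}_\tau\!\left[P_{\tau j}\,S^+_{\tau j}(k_j)\right] \\
&= \frac{\Gamma(1 + ik_j)\,\Gamma(1 + g - ik_j)}{\Gamma(1 - ik_j)\,\Gamma(1 + g + ik_j)}\left[1 + \frac{\sin\pi(1+g)}{\sin\pi(1 + g + ik_j)}\right]X(k_j). \qquad (35)
\end{aligned}$$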
Therefore, the eigenstate of the X(kj ) can be obtained by constructing the eigenstate of
the monodromy matrix T (k).
Define
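$$\begin{aligned}
U(k) &= S_{\tau,j}(k - k_j)\,S_{\tau,1}(k - k_1)\cdots S_{\tau,j-1}(k - k_{j-1})\,S_{\tau,j+1}(k - k_{j+1})\cdots S_{\tau,N}(k - k_N) \\
&= \begin{pmatrix} A(k) & B(k) \\ C(k) & D(k) \end{pmatrix}, \qquad (36)
\end{aligned}$$
which satisfies the Yang–Baxter relation
$$S_{12}(k_1 - k_2)\,U_1(k_1)\,U_2(k_2) = U_2(k_2)\,U_1(k_1)\,S_{12}(k_1 - k_2). \qquad (37)$$
Then the monodromy matrix can also be written as
$$T(k) = U(k)\,R_{\tau,0}(k)\,U^{-1}(-k) = \begin{pmatrix} \alpha(k) & \beta(k) \\ \gamma(k) & \delta(k) \end{pmatrix}, \qquad (38)$$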
where
$$\begin{aligned}
U^{-1}(-k) ={}& S^{-1}_{\tau,N}(-k - k_N)\cdots S^{-1}_{\tau,j+1}(-k - k_{j+1})\,S^{-1}_{\tau,j-1}(-k - k_{j-1})\cdots S^{-1}_{\tau,1}(-k - k_1)\,S^{-1}_{\tau,j}(-k - k_j) \\
={}& \prod_{j=1}^{N}\left[\frac{\Gamma(1 + i(k + k_j)/2)\,\Gamma(1 + g - i(k + k_j)/2)}{\Gamma(1 - i(k + k_j)/2)\,\Gamma(1 + g + i(k + k_j)/2)}\,\frac{\sin\pi((ik + ik_j)/2)}{\sin\pi((ik + ik_j)/2 + 1 + g)}\right] \\
&\times \sigma^y\,U^{t}\!\left(-k + i(1+g)\right)\sigma^y.
\end{aligned}$$
In the derivation, we have used the property of the scattering matrix,
$$S^{-1}_{\tau,l}(-k - k_l) = S_{\tau,l}(k + k_l). \qquad (39)$$
The transfer matrix is the trace of the monodromy matrix
$$t(k) = \operatorname{tr} T(k) = \alpha(k) + \delta(k) = \frac{\Gamma(1 + ik)\,\Gamma(1 + g - ik)}{\Gamma(1 - ik)\,\Gamma(1 + g + ik)}\,\frac{\sin\pi(1 + g + ik) + \sin\pi(1 + g)}{\sin\pi(1 + g + ik)}\,X(k). \qquad (40)$$
The eigenvalue of X(k) can be obtained from that of the transfer matrix because
$$X(k) = \frac{\Gamma(1 - ik)\,\Gamma(1 + g + ik)}{\Gamma(1 + ik)\,\Gamma(1 + g - ik)}\,\frac{\sin\pi(1 + g + ik)}{\sin\pi(1 + g + ik) + \sin\pi(1 + g)}\left[\alpha(k) + \delta(k)\right]. \qquad (41)$$
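$$\begin{aligned}
U(k)_{11}|\Omega\rangle &= \prod_{j=1}^{N}\frac{\Gamma(1 + i(k - k_j)/2)\,\Gamma(1 + g - i(k - k_j)/2)}{\Gamma(1 - i(k - k_j)/2)\,\Gamma(1 + g + i(k - k_j)/2)}\,|\Omega\rangle, \\
U(k)_{22}|\Omega\rangle &= \prod_{j=1}^{N}\frac{\Gamma(1 + i(k - k_j)/2)\,\Gamma(1 + g - i(k - k_j)/2)}{\Gamma(1 - i(k - k_j)/2)\,\Gamma(1 + g + i(k - k_j)/2)}\,\frac{\sin\pi(i(k - k_j)/2)}{\sin\pi((ik - ik_j)/2 + 1 + g)}\,|\Omega\rangle, \\
U(k)_{12}|\Omega\rangle &= 0, \\
U^{-1}(-k)_{11}|\Omega\rangle &= \prod_{j=1}^{N}\frac{\Gamma(1 + i(k + k_j)/2)\,\Gamma(1 + g - i(k + k_j)/2)}{\Gamma(1 - i(k + k_j)/2)\,\Gamma(1 + g + i(k + k_j)/2)}\,|\Omega\rangle, \\
U^{-1}(-k)_{22}|\Omega\rangle &= \prod_{j=1}^{N}\frac{\Gamma(1 + i(k + k_j)/2)\,\Gamma(1 + g - i(k + k_j)/2)}{\Gamma(1 - i(k + k_j)/2)\,\Gamma(1 + g + i(k + k_j)/2)}\,\frac{\sin\pi(i(k + k_j)/2)}{\sin\pi((ik + ik_j)/2 + 1 + g)}\,|\Omega\rangle, \\
U^{-1}(-k)_{12}|\Omega\rangle &= 0, \qquad (43)
\end{aligned}$$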
where $U(k)_{nm}$ means the mth row and nth column element of the U(k) matrix. In order
to obtain the eigenvalues of the monodromy matrix acting on this vacuum state, we must exchange the
positions of $C(k_1)$ and $B(k_2)$. From the Yang–Baxter equation (37), we find the following
commutation relation
$$C(k_1)B(k_2) = \frac{\sin\pi(1+g)}{\sin\pi(i(k_1 - k_2)/2)}\left[D(k_2)A(k_1) - D(k_1)A(k_2)\right] + B(k_1)C(k_2). \qquad (44)$$
From Eqs. (43) and (44), we obtain the eigenvalues of the monodromy matrix acting on
the vacuum state,
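$$\begin{aligned}
\alpha(k)|\Omega\rangle ={}& \prod_{j=1}^{N}\frac{\Gamma(1 + i(k - k_j)/2)\,\Gamma(1 + g - i(k - k_j)/2)\,\Gamma(1 + i(k + k_j)/2)\,\Gamma(1 + g - i(k + k_j)/2)}{\Gamma(1 - i(k - k_j)/2)\,\Gamma(1 + g + i(k - k_j)/2)\,\Gamma(1 - i(k + k_j)/2)\,\Gamma(1 + g + i(k + k_j)/2)} \\
&\times \frac{\Gamma(1 + ik)\,\Gamma(1 + \rho - ik)}{\Gamma(1 - ik)\,\Gamma(1 + \rho + ik)}\,|\Omega\rangle, \\
\delta(k)|\Omega\rangle ={}& \prod_{j=1}^{N}\frac{\Gamma(1 + i(k - k_j)/2)\,\Gamma(1 + g - i(k - k_j)/2)\,\Gamma(1 + i(k + k_j)/2)\,\Gamma(1 + g - i(k + k_j)/2)}{\Gamma(1 - i(k - k_j)/2)\,\Gamma(1 + g + i(k - k_j)/2)\,\Gamma(1 - i(k + k_j)/2)\,\Gamma(1 + g + i(k + k_j)/2)} \\
&\times \frac{\Gamma(1 + ik)\,\Gamma(1 + \rho - ik)}{\Gamma(1 - ik)\,\Gamma(1 + \rho + ik)} \\
&\times \Bigg\{\prod_{j=1}^{N}\frac{\sin\pi((ik + ik_j)/2)\,\sin\pi((ik - ik_j)/2)}{\sin\pi((ik + ik_j)/2 + 1 + g)\,\sin\pi((ik - ik_j)/2 + 1 + g)} \\
&\qquad\times\left[\frac{\sin\pi(ik) + \sin\pi(1 + \rho)}{\sin\pi(ik + 1 + \rho)} - \frac{\sin\pi(1 + g)}{\sin\pi(ik + 1 + g)}\right] + \frac{\sin\pi(1 + g)}{\sin\pi(ik + 1 + g)}\Bigg\}\,|\Omega\rangle, \\
\gamma(k)|\Omega\rangle ={}& 0, \\
\beta(k)|\Omega\rangle \neq{}& 0. \qquad (45)
\end{aligned}$$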
The eigenstate of the system is constructed by acting with the β operators on the vacuum state (see
Eq. (54)). In order to get the eigenvalues of the transfer matrix, we must exchange the
positions of α(k), δ(k) with β(k). From the reflection equation (34), we obtain the following
commutation relations between the elements of the monodromy matrix, which play an
important role in the algebraic Bethe ansatz method
Transfer matrix acting on the eigenstate, we should exchange the position of α(k), δ̄(k) and
β(λ1 )β(λ2 ) · · · β(λM ). Repeatedly using the commutation relations (46), (50) and (51), we
have
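$$\begin{aligned}
t(k)|\Phi\rangle ={}& \frac{\sin\pi(ik + 1 + g) + \sin\pi(1 + g)}{\sin\pi(ik + 1 + g)} \\
&\times\Bigg\{\prod_{m=1}^{M}\frac{\sin\pi((ik + i\lambda_m)/2)\,\sin\pi((ik - i\lambda_m)/2 - 1 - g)}{\sin\pi((ik - i\lambda_m)/2)\,\sin\pi((ik + i\lambda_m)/2 + 1 + g)}\,\Phi_M\,\alpha(k)|\Omega\rangle \\
&\quad - \sum_{m=1}^{M}\frac{\sin\pi(1 + g)}{\sin\pi(i\lambda_m + 1 + g)\,\sin\pi((ik + i\lambda_m)/2 + 1 + g)}\,\Phi_M^{j}\,\bar\delta(\lambda_m)|\Omega\rangle \\
&\qquad\times\prod_{l\neq m}^{M}\frac{\sin\pi((i\lambda_m - i\lambda_l)/2 + 1 + g)\,\sin\pi((i\lambda_m + i\lambda_l)/2 + 2 + 2g)}{\sin\pi((i\lambda_m - i\lambda_l)/2)\,\sin\pi((i\lambda_m + i\lambda_l)/2 + 1 + g)} \\
&\quad + \sum_{m=1}^{M}\frac{\sin\pi(1 + g)\,\sin\pi(i\lambda_m)}{\sin\pi((ik - i\lambda_m)/2)\,\sin\pi(i\lambda_m + 1 + g)}\,\Phi_M^{j}\,\alpha(\lambda_m)|\Omega\rangle \\
&\qquad\times\prod_{l\neq m}^{M}\frac{\sin\pi((i\lambda_m + i\lambda_l)/2)\,\sin\pi((i\lambda_m - i\lambda_l)/2 - 1 - g)}{\sin\pi((i\lambda_m - i\lambda_l)/2)\,\sin\pi((i\lambda_m + i\lambda_l)/2 + 1 + g)}\Bigg\} \\
&+ \frac{1}{\sin\pi(ik + 1 + g)} \\
&\times\Bigg\{\prod_{m=1}^{M}\frac{\sin\pi((ik - i\lambda_m)/2 + 1 + g)\,\sin\pi((ik + i\lambda_m)/2 + 2 + 2g)}{\sin\pi((ik - i\lambda_m)/2)\,\sin\pi((ik + i\lambda_m)/2 + 1 + g)}\,\Phi_M\,\bar\delta(k)|\Omega\rangle \\
&\quad - \sum_{m=1}^{M}\frac{\sin\pi(1 + g)\,\sin\pi(ik + 2 + 2g)}{\sin\pi((ik - i\lambda_m)/2)\,\sin\pi(i\lambda_m + 1 + g)}\,\Phi_M^{j}\,\bar\delta(\lambda_m)|\Omega\rangle \\
&\qquad\times\prod_{l\neq m}^{M}\frac{\sin\pi((i\lambda_m - i\lambda_l)/2 + 1 + g)\,\sin\pi((i\lambda_m + i\lambda_l)/2 + 2 + 2g)}{\sin\pi((i\lambda_m - i\lambda_l)/2)\,\sin\pi((i\lambda_m + i\lambda_l)/2 + 1 + g)} \\
&\quad + \sum_{m=1}^{M}\frac{\sin\pi(1 + g)\,\sin\pi(ik + 2 + 2g)\,\sin\pi(i\lambda_m)}{\sin\pi(i\lambda_m + 1 + g)\,\sin\pi((ik + i\lambda_m)/2 + 1 + g)}\,\Phi_M^{j}\,\alpha(\lambda_m)|\Omega\rangle \\
&\qquad\times\prod_{l\neq m}^{M}\frac{\sin\pi((i\lambda_m + i\lambda_l)/2)\,\sin\pi((i\lambda_m - i\lambda_l)/2 - 1 - g)}{\sin\pi((i\lambda_m - i\lambda_l)/2)\,\sin\pi((i\lambda_m + i\lambda_l)/2 + 1 + g)}\Bigg\}, \qquad (55)
\end{aligned}$$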
where
$$\cdots = \prod_{j=1}^{N}\frac{\sin\pi((i\lambda_m + ik_j)/2)\,\sin\pi((i\lambda_m - ik_j)/2)}{\sin\pi((i\lambda_m + ik_j)/2 + 1 + g)\,\sin\pi((i\lambda_m - ik_j)/2 + 1 + g)}, \qquad m = 1, 2, \dots, M. \qquad (57)$$
With the help of Eq. (41), we obtain the eigenvalue of the X(k) matrix,
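$$\begin{aligned}
\Lambda_{X}(k) ={}& \prod_{j=1}^{N}\frac{\Gamma(1 + i(k - k_j)/2)\,\Gamma(1 + g - i(k - k_j)/2)\,\Gamma(1 + i(k + k_j)/2)\,\Gamma(1 + g - i(k + k_j)/2)}{\Gamma(1 - i(k - k_j)/2)\,\Gamma(1 + g + i(k - k_j)/2)\,\Gamma(1 - i(k + k_j)/2)\,\Gamma(1 + g + i(k + k_j)/2)} \\
&\times \frac{\Gamma(1 + ik)\,\Gamma(1 + \rho - ik)}{\Gamma(1 - ik)\,\Gamma(1 + \rho + ik)}\,\frac{\Gamma(1 - ik)\,\Gamma(1 + g + ik)}{\Gamma(1 + ik)\,\Gamma(1 + g - ik)} \\
&\times \prod_{m=1}^{M}\frac{\sin\pi((ik + i\lambda_m)/2)\,\sin\pi((ik - i\lambda_m)/2 - 1 - g)}{\sin\pi((ik - i\lambda_m)/2)\,\sin\pi((ik + i\lambda_m)/2 + 1 + g)}
\end{aligned}$$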
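$$\begin{aligned}
e^{2ik_jL} ={}& \prod_{l\neq j}^{N}\frac{\Gamma(1 + i(k_j - k_l)/2)\,\Gamma(1 + g - i(k_j - k_l)/2)\,\Gamma(1 + i(k_j + k_l)/2)\,\Gamma(1 + g - i(k_j + k_l)/2)}{\Gamma(1 - i(k_j - k_l)/2)\,\Gamma(1 + g + i(k_j - k_l)/2)\,\Gamma(1 - i(k_j + k_l)/2)\,\Gamma(1 + g + i(k_j + k_l)/2)} \\
&\times \frac{\Gamma(1 + ik_j)\,\Gamma(1 + \rho - ik_j)}{\Gamma(1 - ik_j)\,\Gamma(1 + \rho + ik_j)} \\
&\times \prod_{m=1}^{M}\frac{\sin\pi((ik_j + i\lambda_m)/2)\,\sin\pi((ik_j - i\lambda_m)/2 - 1 - g)}{\sin\pi((ik_j - i\lambda_m)/2)\,\sin\pi((ik_j + i\lambda_m)/2 + 1 + g)}, \qquad j = 1, 2, \dots, N. \qquad (59)
\end{aligned}$$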
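Letting $i\lambda_m \to i\lambda_m - (1 + g)$, the Bethe ansatz equations (57) and (59) take the form
$$\begin{aligned}
&\frac{\cos\pi((i\lambda_m - 1 - g)/2)\,\cos\pi((i\lambda_m - 1 - g)/2 + 1 + \rho)}{\cos\pi((i\lambda_m + 1 + g)/2)\,\cos\pi((i\lambda_m + 1 + g)/2 - 1 - \rho)} \\
&\quad\times\prod_{l\neq m}^{M}\frac{\sin\pi((i\lambda_m + i\lambda_l)/2 - 1 - g)\,\sin\pi((i\lambda_m - i\lambda_l)/2 - 1 - g)}{\sin\pi((i\lambda_m + i\lambda_l)/2 + 1 + g)\,\sin\pi((i\lambda_m - i\lambda_l)/2 + 1 + g)} \\
&\qquad = \prod_{l=1}^{N}\frac{\sin\frac{\pi}{2}(i\lambda_m + ik_l - 1 - g)\,\sin\frac{\pi}{2}(i\lambda_m - ik_l - 1 - g)}{\sin\frac{\pi}{2}(i\lambda_m + ik_l + 1 + g)\,\sin\frac{\pi}{2}(i\lambda_m - ik_l + 1 + g)}, \qquad m = 1, 2, \dots, M, \qquad (60)
\end{aligned}$$
$$\begin{aligned}
e^{2ik_jL} ={}& \prod_{l\neq j}^{N}\frac{\Gamma(1 + i(k_j - k_l)/2)\,\Gamma(1 + g - i(k_j - k_l)/2)\,\Gamma(1 + i(k_j + k_l)/2)\,\Gamma(1 + g - i(k_j + k_l)/2)}{\Gamma(1 - i(k_j - k_l)/2)\,\Gamma(1 + g + i(k_j - k_l)/2)\,\Gamma(1 - i(k_j + k_l)/2)\,\Gamma(1 + g + i(k_j + k_l)/2)} \\
&\times \frac{\Gamma(1 + ik_j)\,\Gamma(1 + \rho - ik_j)}{\Gamma(1 - ik_j)\,\Gamma(1 + \rho + ik_j)} \\
&\times \prod_{m=1}^{M}\frac{\sin\frac{\pi}{2}(ik_j + i\lambda_m - 1 - g)\,\sin\frac{\pi}{2}(ik_j - i\lambda_m - 1 - g)}{\sin\frac{\pi}{2}(ik_j + i\lambda_m + 1 + g)\,\sin\frac{\pi}{2}(ik_j - i\lambda_m + 1 + g)}, \qquad j = 1, 2, \dots, N. \qquad (61)
\end{aligned}$$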
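The shift of the rapidities can be checked numerically for a single factor of the product over m. The following is a minimal sketch, not part of the original derivation: the function names and the sample values of k, λ and g are illustrative assumptions. It uses the fact that replacing iλ by iλ − (1 + g) is the same as replacing λ by λ + i(1 + g).

```python
import cmath

def sine_factor_59(k, lam, g):
    """One factor of the product over m in Eq. (59), before the shift."""
    num = (cmath.sin(cmath.pi * ((1j * k + 1j * lam) / 2))
           * cmath.sin(cmath.pi * ((1j * k - 1j * lam) / 2 - 1 - g)))
    den = (cmath.sin(cmath.pi * ((1j * k - 1j * lam) / 2))
           * cmath.sin(cmath.pi * ((1j * k + 1j * lam) / 2 + 1 + g)))
    return num / den

def sine_factor_61(k, lam, g):
    """One factor of the product over m in Eq. (61), after the shift."""
    h = cmath.pi / 2
    num = (cmath.sin(h * (1j * k + 1j * lam - 1 - g))
           * cmath.sin(h * (1j * k - 1j * lam - 1 - g)))
    den = (cmath.sin(h * (1j * k + 1j * lam + 1 + g))
           * cmath.sin(h * (1j * k - 1j * lam + 1 + g)))
    return num / den

def shift_agrees(k, lam, g, rel_tol=1e-9):
    """True if Eq. (59) evaluated at the shifted rapidity equals Eq. (61)."""
    shifted = lam + 1j * (1 + g)   # i*shifted = i*lam - (1 + g)
    a = sine_factor_59(k, shifted, g)
    b = sine_factor_61(k, lam, g)
    return abs(a - b) <= rel_tol * abs(b)

# Arbitrary sample point chosen away from the zeros of the denominators.
print(shift_agrees(0.7, 0.3, 0.4))
```

The same substitution applied to the left-hand side of Eq. (57) is what turns the sine ratios into the cosine ratios of Eq. (60); that part is not checked here because the left-hand side of Eq. (57) is not available in this text.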
5. Discussion
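$$\begin{aligned}
D_j ={}& p_j + \frac{ig}{2}\sum_{l=1,\,l\neq j}^{N}\Big\{\left[\coth(x_j - x_l) - \operatorname{sgn}(j-l)\right]\left(\sigma_j^z + \sigma_l^z\right)M_{jl} + \left[\coth(x_j + x_l) + 1\right]\left(\sigma_j^z + \sigma_l^z\right)\bar M_{jl}\Big\} \\
&+ i\Big\{(\beta - \delta)(\coth x_j + 1)\,\sigma_j^z + 2\delta\left[\coth(2x_j) + 1\right]\sigma_j^z\Big\}M_j. \qquad (62)
\end{aligned}$$
$$[D_i, D_j] = 0. \qquad (63)$$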
The conserved quantities can be constructed from the reflection spin-Dunkl operators as
$I_n = \sum_j (D_j)^n$ and they commute among themselves. So the system is integrable. The
Hamiltonian is defined as $H = \sum_j (D_j)^2$, which can also be explicitly written as
$$\begin{aligned}
H ={}& -\sum_{j=1}^{N}\frac{\partial^2}{\partial x_j^2} + \sum_{j\neq l}^{N}\frac{g^2(1 + \sigma_j^z\sigma_l^z) - g(\sigma_j^z + \sigma_l^z)M_{jl}}{2\sinh^2(x_j - x_l)} + \sum_{j\neq l}^{N}\frac{g^2(1 + \sigma_j^z\sigma_l^z) - g(\sigma_j^z + \sigma_l^z)\bar M_{jl}}{2\sinh^2(x_j + x_l)} \\
&+ \sum_{j=1}^{N}\frac{\beta(\beta - \sigma_j^z M_j)}{\sinh^2(x_j)} - \sum_{j=1}^{N}\frac{\delta(\delta - \sigma_j^z M_j)}{\cosh^2(x_j)}. \qquad (64)
\end{aligned}$$
The SRM with general open boundary conditions can be constructed by the following
reflection spin-Dunkl operators
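$$\begin{aligned}
D_j ={}& p_j + \frac{ig}{2}\sum_{l=1,\,l\neq j}^{N}\Big\{\left[\coth(x_j - x_l) - \operatorname{sgn}(j-l)\right]\left(1 + \sigma_j^z\sigma_l^z\right) + \left[\tanh(x_j - x_l) - \operatorname{sgn}(j-l)\right]\left(1 - \sigma_j^z\sigma_l^z\right)\Big\}M_{jl} \\
&+ \frac{ig}{2}\sum_{l=1,\,l\neq j}^{N}\Big\{\left[\coth(x_j + x_l) + 1\right]\left(1 + \sigma_j^z\sigma_l^z\right) + \left[\tanh(x_j + x_l) + 1\right]\left(1 - \sigma_j^z\sigma_l^z\right)\Big\}\bar M_{jl} + iu_jM_j, \qquad (A.1)
\end{aligned}$$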
where $u_j$ is an undetermined function. The integrability of the system requires that the $D_j$
commute with each other. Thus, we should seek solutions of $[D_i, D_j] = 0$. After a
tedious calculation, we found that $u_j$ may have the following forms besides Eq. (8),
$$(1)\quad u_j = \rho\left[(\tanh x_j + 1) + (\coth x_j + 1)\right], \qquad (A.2)$$
$$(2)\quad u_j = \beta(\tanh x_j - \coth x_j)\,\sigma_j, \qquad (A.3)$$
$$(3)\quad u_j = \rho\left[(\tanh x_j + 1) + (\coth x_j + 1)\right] + \beta(\tanh x_j - \coth x_j)\,\sigma_j, \qquad (A.4)$$
where ρ and β are coupling constants. The corresponding Hamiltonians are
H = H0 + Hb , b = 1, 2, 3, (A.5)
where
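$$\begin{aligned}
H_0 ={}& -\sum_{j=1}^{N}\frac{\partial^2}{\partial x_j^2} + \sum_{j\neq l}^{N}\frac{g(g - M_{jl})(1 + \sigma_j^z\sigma_l^z)}{2\sinh^2(x_j - x_l)} - \sum_{j\neq l}^{N}\frac{g(g - M_{jl})(1 - \sigma_j^z\sigma_l^z)}{2\cosh^2(x_j - x_l)} \\
&+ \sum_{j\neq l}^{N}\frac{g(g - \bar M_{jl})(1 + \sigma_j^z\sigma_l^z)}{2\sinh^2(x_j + x_l)} - \sum_{j\neq l}^{N}\frac{g(g - \bar M_{jl})(1 - \sigma_j^z\sigma_l^z)}{2\cosh^2(x_j + x_l)}, \qquad (A.6)
\end{aligned}$$
$$H_1 = \sum_{j=1}^{N}\frac{\rho(\rho - M_j)}{\sinh^2(x_j)} - \sum_{j=1}^{N}\frac{\rho(\rho - M_j)}{\cosh^2(x_j)}, \qquad (A.7)$$
$$H_2 = \sum_{j=1}^{N}\frac{\beta(\beta - \sigma_j M_j)}{\sinh^2(x_j)} - \sum_{j=1}^{N}\frac{\beta(\beta - \sigma_j M_j)}{\cosh^2(x_j)}, \qquad (A.8)$$
$$H_3 = \sum_{j=1}^{N}\frac{\rho(\rho - M_j)}{\sinh^2(x_j)} - \sum_{j=1}^{N}\frac{\rho(\rho - M_j)}{\cosh^2(x_j)} + \sum_{j=1}^{N}\frac{\beta(\beta - \sigma_j M_j)}{\sinh^2(x_j)} - \sum_{j=1}^{N}\frac{\beta(\beta - \sigma_j M_j)}{\cosh^2(x_j)}. \qquad (A.9)$$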
All these are interesting integrable systems and can be studied straightforwardly by the
method suggested in this paper.
References
Received 7 May 2002; received in revised form 11 July 2002; accepted 7 August 2002
Abstract
Recent results concerning the topological properties of random geometrical sets have been
successfully applied to the study of the morphology of clusters in percolation theory. This approach
provides an alternative way of inspecting the critical behaviour of random systems in statistical
mechanics.
For the 2d, q-state Potts model on the square lattice with q ≤ 6, intensive and accurate numerics
indicate that the average of the Euler characteristic (taken with respect to the Fortuin–Kasteleyn
random cluster measure) changes sign at the critical threshold of the magnetization transition.
© 2002 Elsevier Science B.V. All rights reserved.
1. Introduction
Recently, new insights in the study of the critical properties of clusters in percolation
theory have emerged based on ideas coming from mathematical morphology [1] and
integral geometry [2–4]. These mathematical theories provide a set of geometrical
and topological measures that make it possible to quantify the morphological properties of random
496 P. Blanchard et al. / Nuclear Physics B 644 [FS] (2002) 495–508
systems. In particular these tools have been applied to the study of random cluster
configurations in percolation theory and statistical physics [5–9].
One of these measures is the Euler–Poincaré characteristic χ, which is a well-known
descriptor of the topological features of geometric patterns. It belongs to the finite set
of Minkowski functionals, whose origin lies in the mathematical study of convex bodies
and integral geometry (see [2–4]). These measures, as we shall explain below, share
the following remarkable property: any homogeneous, additive, isometry-invariant and
conditionally continuous functional on a compact subset of the Euclidean space $\mathbb{R}^d$ can
be expressed as a linear combination of the Minkowski functionals. This is the well-known
Hadwiger theorem [2] of integral geometry, which has a wide scope of applications in
mathematical physics due to its rather general setting.
The use of these measures in image analysis [10], problems of shape recognition [1],
determination of the large scale structures of the universe [11], modelling of porous media
[12], microemulsions [13] and fractal analysis [14] has been a topic of growing interest
recently.
We also recall that for the problem of bond percolation on regular lattices, Sykes and
Essam [15] were able to show, using standard planar duality arguments, that for the case of
self-dual matching lattices (e.g., Z2 ), the mean value of the Euler–Poincaré characteristic
changes sign at the critical point (this even led them to announce a proof for the value of
the critical probability of bond percolation on Z2 ), see also [16].
More recently, Wagner [8] was able to compute the Euler–Poincaré characteristic on
the set of all plane regular mosaics (the 11 Archimedean lattices) as a function of the
site occupancy probability p ∈ [0, 1] and showed that a close connection exists between
the threshold for site percolation on these lattices and the point where the Euler–Poincaré
characteristic (expressed as a function of p) changes sign.
The aim of this article is to further investigate the role played by this morphological
indicator in statistical physics and to present new results concerning its behaviour in
the case of the 2-dimensional Potts model. We present clear evidence,
based on Monte Carlo simulations, that for the 2d Potts model the Euler–Poincaré
characteristic changes sign at the critical point of the thermal transition. More precisely, we
find that for q = 1, . . . , 4 it changes sign continuously at the transition point, while
for q = 5, 6 it changes discontinuously, as in a first order transition, at the critical point. As far as we know,
this is the first example of a discontinuous behaviour of this parameter in a physical
model.
The paper is organized as follows. In Section 2 we introduce basic facts concerning
Minkowski functionals and the necessary definitions for our model. The numerical results
are presented in Section 3, followed by some comments and discussion in Section 4.
2. The model
We first briefly summarize the basic facts from integral geometry and give the
definition of the Minkowski functionals including the Euler–Poincaré characteristic,
see [2–4] for more complete expositions. We will then show how to compute the
Euler–Poincaré characteristic in the case of a random configuration of sites and bonds
$$W_d(A) = \omega_d\,\chi(A), \qquad \omega_d = \frac{\pi^{d/2}}{\Gamma(1 + d/2)} \qquad (2.3)$$
where $E_\alpha$ is an α-dimensional plane in $\mathbb{R}^d$, $d\mu(E_\alpha)$ its density normalized such that, for
the d-dimensional ball $B_d(r)$ with radius r, $W_\alpha(B_d(r)) = \omega_d\, r^{d-\alpha}$, and $\omega_d$ is the volume
of the unit ball in $\mathbb{R}^d$. Obviously, additivity of the Minkowski functionals is inherited from
(2.1); furthermore, they are usually conveniently normalized through
$$M_\alpha(A) = \frac{\omega_{d-\alpha}}{\omega_d\,\omega_\alpha}\,W_\alpha(A).$$
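As a quick consistency check of these normalizations, the unit-ball volumes can be evaluated directly. The sketch below is illustrative only (the function names are assumptions, not from the paper). It combines $W_d = \omega_d\chi$ from (2.3) with the normalization above to obtain the factor multiplying χ in the top-order functional $M_d$, which should reproduce the entries $\chi_1/2$, $\chi_2/\pi$ and $3\chi_3/4\pi$ listed in Table 1.

```python
import math

def unit_ball_volume(d):
    """omega_d = pi^(d/2) / Gamma(1 + d/2), the volume of the unit ball in R^d."""
    return math.pi ** (d / 2) / math.gamma(1 + d / 2)

def top_order_factor(d):
    """Factor c such that M_d(A) = c * chi(A).

    Uses W_d = omega_d * chi and M_alpha = omega_{d-alpha} / (omega_d * omega_alpha) * W_alpha
    with alpha = d, so that c = omega_0 / omega_d.
    """
    omega_0 = unit_ball_volume(0)
    omega_d = unit_ball_volume(d)
    return omega_0 / (omega_d * omega_d) * omega_d

for d in (1, 2, 3):
    print(d, unit_ball_volume(d), top_order_factor(d))
```

For d = 1, 2, 3 the volumes are 2, π and 4π/3, so the factors are 1/2, 1/π and 3/(4π).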
The computation of these normalized functionals in dimensions 1, 2, 3 in terms of the
usual geometric measures (length, area, volume, . . . ) is given in Table 1. An important
result of integral geometry is Hadwiger’s completeness theorem which asserts that,
under not too restrictive and furthermore physically reasonable assumptions, namely:
additivity, motion invariance (under translations and rotations) and conditional continuity
(which states that any convex body can be smoothly approximated by convex polyhedra),
Fig. 1. Examples of calculation of the Euler–Poincaré characteristic in dimension d = 2 for some combinations
of convex subsets of R2 .
Table 1
Values of the normalized Minkowski functionals for d-dimensional subsets of $\mathbb{R}^d$, d = 1, 2, 3, in terms of
geometric measures. L: length, S: area, V: volume, C: circumference, H: integral mean curvature, $\chi_d$: Euler–Poincaré characteristic

d    M0    M1      M2        M3
1    L     χ1/2    ···       ···
2    S     C       χ2/π      ···
3    V     S       H/2π²     3χ3/4π
$$M(A) = \sum_{\nu=0}^{d} c_\nu\, M_\nu(A),$$
where integration is over the group of motions G (i.e., rotations and translations, g =
(r, Θ)). This formula is very useful to calculate mean values of Minkowski functionals
for random distributions of objects. For instance, it can be applied to the computation of
the excluded volume of convex bodies (leading, in case of spherical objects, to Steiner’s
formula) which has important applications in continuum percolation theory (see [18]).
It is one of the topics of algebraic topology to show that the above general definitions of
Minkowski functionals (including Euler–Poincaré characteristic) extend to cell complexes
to which random bond configurations on regular lattices belong.
The square lattice Z2 can be viewed as a cell-complex L = {L0 , L1 , L2 }, where L0 = Z2
is the set of sites, L1 is the set of bonds and L2 is the set of plaquettes (see [20–23]).
To a set of bonds B ⊂ L1 we associate the subcomplex Λ(B) ⊂ L defined as the
maximal closed subcomplex containing B. A subcomplex K0 of a complex K is said to be
closed if every element of K which precedes (i.e., is less than) some element of K0 is itself
an element of K0 . K0 is maximal in the sense that if K0 is a subcomplex of K and K0 is
strictly contained in K0 , then K0 is not closed.
This subcomplex can be written
The partition function for the q-state Potts model on a lattice $\Lambda \subset \mathbb{Z}^d$ at inverse
temperature β reads
$$Z^{\mathrm{Potts}}_{\beta,q}(\Lambda) = \sum_{\sigma}\exp\Big(\beta\sum_{\langle i,j\rangle\subset\Lambda}\delta(\sigma_i, \sigma_j)\Big), \qquad (2.6)$$
where the first sum runs over all configurations $\sigma \in \{1, \dots, q\}^{|\Lambda|}$, the second one is over
each nearest neighbour pair of Potts spins on Λ and δ is the Kronecker symbol. We
recall that, whenever q is large enough, in any dimension d ≥ 2, this model exhibits a
unique (inverse) temperature $\beta_c$ where the mean energy is discontinuous (see [24–26]). In
dimension d = 2 for q ≥ 5 this is an exact result [28] and it is expected to be true, in d = 3,
for q ≥ 3 [29].
After performing the (FK) transformation [17], the partition function (2.6) leads to the
following random cluster representation
$$Z^{\mathrm{FK}}_{\beta,q}(\Lambda) = \sum_{X}\left(e^{\beta} - 1\right)^{N^1(X)} q^{N_\Lambda(X)}. \qquad (2.7)$$
Here the summation is over all graphs X which can be drawn inside the domain Λ and
NΛ (X) is the number of connected components of X (including isolated sites). As before,
we shall call $N^1(X)$ the number of bonds of the configuration X, $N^0(X)$ the number of
sites which are endpoints of a bond in X and $N^2(X)$ the number of plaquettes of the
configuration X, i.e., the number of cells in Λ having 4 occupied bonds on their boundary (see
Fig. 2). Formula (2.4) leads to
$$\chi(X) = N^0(X) - N^1(X) + N^2(X). \qquad (2.8)$$
This expression will allow us to compute the mean value of the Euler–Poincaré characteristic
with respect to the FK measure.
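The equality of the spin representation (2.6) and the random cluster representation (2.7) can be verified by brute force on a very small graph. The sketch below is only an illustration under stated assumptions: the function names, the single-plaquette test graph and the sample values of q and β are not from the paper. It enumerates all spin configurations for (2.6) and all bond subsets X for (2.7), counting connected components (isolated sites included) with a small union-find.

```python
import itertools
import math

def potts_partition_function(sites, bonds, q, beta):
    """Eq. (2.6): sum over spin configurations of exp(beta * number of equal-spin bonds)."""
    total = 0.0
    for spins in itertools.product(range(q), repeat=len(sites)):
        state = dict(zip(sites, spins))
        aligned = sum(1 for a, b in bonds if state[a] == state[b])
        total += math.exp(beta * aligned)
    return total

def count_components(sites, occupied_bonds):
    """Number of connected components of the graph, isolated sites included."""
    parent = {s: s for s in sites}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b in occupied_bonds:
        parent[find(a)] = find(b)
    return len({find(s) for s in sites})

def fk_partition_function(sites, bonds, q, beta):
    """Eq. (2.7): sum over bond subsets X of (e^beta - 1)^{N^1(X)} * q^{N_Lambda(X)}."""
    weight = math.exp(beta) - 1.0
    total = 0.0
    for n_bonds in range(len(bonds) + 1):
        for occupied in itertools.combinations(bonds, n_bonds):
            total += weight ** n_bonds * q ** count_components(sites, occupied)
    return total

# A single plaquette of the square lattice: 4 sites, 4 bonds.
sites = [(0, 0), (1, 0), (0, 1), (1, 1)]
bonds = [((0, 0), (1, 0)), ((0, 1), (1, 1)), ((0, 0), (0, 1)), ((1, 0), (1, 1))]
z_spin = potts_partition_function(sites, bonds, q=3, beta=0.7)
z_fk = fk_partition_function(sites, bonds, q=3, beta=0.7)
print(z_spin, z_fk)
```

The enumeration is exponential in the number of sites and bonds, so it is useful only as a sanity check of the identity, not as a simulation method.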
3. Numerical results
We have performed Monte Carlo simulations of the 2-dimensional q-state Potts model
on the square lattice for q ranging from 2 to 6. We have always simulated the models near
the critical temperature, whose value can be exactly determined through the well-known
formula [29]: $\beta_c(q) = J/kT_c(q) = \log(1 + \sqrt{q})$. In order to extract a value of the Euler
characteristic χ as close as possible to the value at the infinite volume limit, we have
taken rather large lattices: for the three models with a continuous transition (q = 2, 3, 4)
we used lattice sizes up to 2000². The algorithm we used is the Wolff cluster update
[30]. The identification of FK cluster configurations has been performed via the Hoshen–
Kopelman algorithm [31]; we always considered free boundary conditions for the cluster
labeling. The calculation of the Euler characteristic is done during the cluster labeling
procedure and takes basically zero CPU time: the number of active bonds is stored during
the main Hoshen–Kopelman sweep of the lattice, the number of sites joined by bonds
is trivially obtained right after the clusters are labeled (it is just the total size of the
clusters containing at least two sites). The determination of the number of plaquettes is
more involved, but it requires only an additional sweep over the elementary squares of the
lattice. This procedure is relatively fast since the analysis of a plaquette stops as soon as a
“broken” bond is found.
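The counting described above can be sketched as follows for an L × L lattice with free boundary conditions. This is a minimal illustration, not the authors' code: the bond layout (`horiz[y][x]` for the bond between (x, y) and (x+1, y), `vert[y][x]` for the bond between (x, y) and (x, y+1)) and the function name are assumptions, and the sites joined by bonds are collected in a set rather than read off the Hoshen–Kopelman cluster sizes.

```python
def euler_characteristic(horiz, vert):
    """chi = N0 - N1 + N2 of Eq. (2.8) for a bond configuration on an L x L lattice.

    horiz has L rows of L-1 booleans, vert has L-1 rows of L booleans.
    """
    size = len(horiz)
    touched = set()   # sites that are endpoints of at least one occupied bond
    n_bonds = 0
    for y in range(size):
        for x in range(size - 1):
            if horiz[y][x]:
                n_bonds += 1
                touched.update([(x, y), (x + 1, y)])
    for y in range(size - 1):
        for x in range(size):
            if vert[y][x]:
                n_bonds += 1
                touched.update([(x, y), (x, y + 1)])
    n_plaquettes = 0
    for y in range(size - 1):
        for x in range(size - 1):
            # 'and' short-circuits, so the test stops at the first broken bond
            if horiz[y][x] and horiz[y + 1][x] and vert[y][x] and vert[y][x + 1]:
                n_plaquettes += 1
    return len(touched) - n_bonds + n_plaquettes

# A closed loop around the perimeter of a 3 x 3 lattice: 8 sites, 8 bonds, no plaquette.
ring_h = [[True, True], [False, False], [True, True]]
ring_v = [[True, False, True], [True, False, True]]
print(euler_characteristic(ring_h, ring_v))  # 0

# All bonds occupied on a 3 x 3 lattice: 9 sites, 12 bonds, 4 plaquettes.
full_h = [[True, True] for _ in range(3)]
full_v = [[True, True, True] for _ in range(2)]
print(euler_characteristic(full_h, full_v))  # 1
```

In a production run the site and bond counts would come for free from the cluster labeling sweep, as the text explains; only the plaquette loop is an extra pass.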
The Wolff algorithm becomes less efficient as the number q of states grows. Particularly
dramatic is what happens when one passes from the 2-state (Ising) to the 3-state model:
in the former case, on the 2000² lattice a few updates (5–10) are enough to get
uncorrelated configurations for the cluster variables, whereas in the latter one needs about 1000
updates. Because of that, the simulations for q = 3, 4 on the 2000² lattice were very
slow, and the corresponding data could not reach high statistics. This is also the reason why
we used the Hoshen–Kopelman algorithm instead of extracting the information from the
single cluster of the update: since for q > 2 basically all the CPU time is taken by the
update phase of the program, the analysis of all clusters of a given configuration allows us
to improve considerably the statistics of the percolation data without time losses.
If χ changes sign at the threshold, on a finite lattice the values measured at each iteration
would be distributed around zero, provided the lattice is large enough. Therefore one would
see both positive and negative values. For this reason, it is helpful to look at the distribution
of χ . The values of the Euler characteristic we shall refer to are meant per lattice site; this
has the advantage that we can see if and how the data concentrate around some value by
increasing the lattice size. In the following we present separately the results for q = 2, 3, 4
and q = 5, 6.
The first case we consider here is the Ising model. Fig. 3 shows the χ distribution
for three different lattice sizes: 500², 1000² and 2000². In each case we have taken
100 000 iterations, measuring the variables of interest every 5 updates. The peak of the
distribution shifts towards χ = 0 as the lattice grows. The average values of χ are:
χ(500²) = 0.00101(2), χ(1000²) = 0.00053(2), χ(2000²) = 0.00024(1). We notice that
the averages are quite small and decrease appreciably if we go to larger sizes, roughly
halving when we pass from one lattice to the next. This approximately
linear scaling of χ with the inverse lattice side suggests that the Euler characteristic at the infinite
volume limit indeed vanishes.
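The halving of χ at each doubling of the lattice side can be turned into a rough exponent estimate. The following sketch fits a straight line to log χ against log L using the three Ising averages quoted above; the helper name is an assumption, and the quoted statistical errors are ignored, so the result is only indicative.

```python
import math

# Lattice sides and average Euler characteristic per site for the Ising model (from the text).
lattice_sides = [500, 1000, 2000]
chi_means = [0.00101, 0.00053, 0.00024]

def loglog_slope(xs, ys):
    """Unweighted least-squares slope of log(y) against log(x)."""
    log_x = [math.log(x) for x in xs]
    log_y = [math.log(y) for y in ys]
    mean_x = sum(log_x) / len(log_x)
    mean_y = sum(log_y) / len(log_y)
    covariance = sum((a - mean_x) * (b - mean_y) for a, b in zip(log_x, log_y))
    variance = sum((a - mean_x) ** 2 for a in log_x)
    return covariance / variance

slope = loglog_slope(lattice_sides, chi_means)
print(f"chi ~ L^{slope:.2f}")
```

The fitted exponent comes out close to −1, in line with the statement that χ decreases roughly in proportion to the inverse lattice side.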
Fig. 3. Distribution of χ for the FK cluster configurations of the 2d Ising model at the critical point.
Fig. 4. Distribution of χ for the FK configurations of the 2d, 3-state Potts model at the critical point.
Fig. 5. Distribution of χ for the FK cluster configurations of the 2d, 4-state Potts model at the critical point.
Let us now examine the case q = 3. In Fig. 4 we again plot the χ distribution for
the same three lattice sizes we have considered for the Ising model. Since we collected
a different number of measurements for the different lattices, for a real comparison of the
distributions we needed to renormalize the total number of measurements on each lattice
to the same value: we decided to renormalize all data sets to the number of measurements
on the 500² lattice (10 000). The distributions are broader than the Ising ones but they
appear almost exactly centered at χ = 0. The average values are in fact much smaller than
before: χ(500²) = 0.00024(4), χ(1000²) = 0.00012(3), and χ(2000²) is zero within errors.
We then deduce that also for q = 3, χ = 0 at the critical point. To complete our analysis
we studied the case q = 4. In Fig. 5 we present a comparison of the χ distributions for two
lattice sizes, 800² and 2000². There is a clear shift of the center of the distribution towards
zero when one goes from the smaller to the larger lattice. The average values of the Euler
characteristic in the two cases are χ(800²) = −0.00145(8) and χ(2000²) = −0.0006(2).
Here, too, there is no apparent convergence to a fixed value, even though the lattices are rather
large: |χ| drops to less than half when going from the smaller to the larger lattice. From all this
we also deduce that the Euler characteristic of the FK clusters of the 2-dimensional 4-state
Potts model vanishes at criticality.
We also found that the variance Δχ of the χ distribution for the Ising model scales
as $L^{-0.95} \approx L^{-1}$, where L is the lattice side; for the 3-state Potts model it seems that
Δχ scales as $L^{-0.78}$. We notice that, in the Ising case, if the correct scaling behaviour
of Δχ indeed goes as the inverse lattice side, the fluctuations of the Euler characteristic
would behave like those of the energy. But the fluctuations of the energy are proportional
to the specific heat of the system, and the $L^{-1}$ behaviour is a consequence of the fact
that the specific heat of the 2D Ising model diverges logarithmically at criticality (α = 0).
That could mean that, in the Ising case, the fluctuation of the Euler characteristic diverges
with temperature like the specific heat. This impression is confirmed by the results on the
3-state Potts model: since in that case α/ν = 2/5, the scaling behaviour of the energy
fluctuations with L goes like $L^{-4/5} = L^{-0.8}$, in excellent agreement with our numerical
estimate.
3.2. Results for the models with a first order phase transition
For q > 4 the 2d q-state Potts model undergoes a first order phase transition, i.e., the
thermal variables vary discontinuously at the critical threshold. The magnetization, for
instance, makes a jump, varying from zero to a non-zero value. Because of that, we expect
that the cluster configurations change abruptly at the critical point, and that the cluster
variables exhibit as well discontinuities. In particular, the Euler characteristic may jump
from a value to another.
We analyze here the 5- and 6-state Potts models. In both cases we have performed
simulations on three lattices: 100², 200² and 300². In Figs. 6 and 7 we compare
the distribution histograms on the 300² lattice of the magnetization M and the Euler
characteristic χ at three different temperatures near $T_c$. We define the magnetization by
taking the excess of sites in the majority spin state (per lattice site) with respect to the
value 1/q in the paramagnetic phase, when all spin states are equally distributed. Therefore
we always measure M > 0 and that removes the degeneracy of the magnetization states
due to the Z(q) symmetry of the Hamiltonian. In this way, if one finds a double peak
structure in some temperature range, one can suspect that the transition is discontinuous.
Looking at the magnetization histograms in the figures, one clearly sees the spontaneous
symmetry breaking as the temperature is reduced. The double peak structure of M suggests
that the transition is first order, as is known. The corresponding histograms of the
Euler characteristic show a perfectly analogous pattern. To check whether the transition
is indeed first order, we determined the temperature $\beta_H$ at which the two peaks of the
Euler characteristic distribution are equally high. For 5-state Potts we found $\beta_H(100^2) =
1.17343$, $\beta_H(200^2) = 1.17405$ and $\beta_H(300^2) = 1.17422$; for 6-state Potts $\beta_H(100^2) =
1.23763$, $\beta_H(200^2) = 1.23804$ and $\beta_H(300^2) = 1.23812$. We then analyzed the
scaling of the hump between the two peaks with the linear dimension L of the lattice. The
height of the hump $\chi_m$ decreases with L according to the law $\log(\chi_m) \propto -L^2$, which is
the typical behaviour at a first order phase change. As the result is valid for the 5-state and
the 6-state Potts model, it is likely to be valid also for q > 6, when the discontinuity of
M at the threshold is sharper. Looking at both figures we remark that the centers of the
peaks of χ look approximately symmetric with respect to zero. If this symmetry exists,
it would be an interesting feature, and at the moment we have no arguments to justify it.
Fig. 6. Distribution histograms of the magnetization M and the Euler characteristic χ for the 2-dimensional
5-state Potts model at three different temperatures. The lattice size is $300^2$. The behaviour of χ is driven by M.
In order to determine with some accuracy the values of χ in the two coexisting phases
we would need to increase considerably the size of the lattice, but the required computer
time would increase dramatically for the reasons we explained at the beginning of this
section.
Fig. 7. Distribution histograms of the magnetization M and the Euler characteristic χ for the 2-dimensional
6-state Potts model at three different temperatures. The lattice size is $300^2$. The behaviour of χ is driven by M.
4. Conclusions
the modification of the clusters topology induced by the FK representation has non-trivial
consequences on the critical behaviour of spin systems [32–34].
The fact that χ changes sign at Tc for the 2d Ising model in FK representation can be
understood heuristically in the following way. From Tc up to T = ∞ the system is in its
disordered phase and the only excitations one can get in the FK-bond representation are
made of isolated bonds (the probability to see any plaquette vanishes exponentially). Applying
the Euler–Poincaré formula (2.5), one sees that χ behaves like $\pi_0(X)$ times a term
of the order of the volume of the system. However, from Tc down to T = 0, the system
is in its ordered phase and the corresponding FK-configuration is (with high probability)
made of O(1) connected bond components. Missing bonds constitute the excitations and
their number scales with the volume of the system so, using again (2.5), one gets that χ
behaves like $-\pi_1(X)$ times a term of the order of the volume of the system. This explains
in the case of the Ising model the change of sign of χ at Tc . For Potts spins, when the
transition is first order (q 3), a similar argument holds in FK representation at the critical
point.
The striking phenomenon is the vanishing of the Euler–Poincaré characteristic at Tc
when the transition is second order.
Of primary interest is of course to understand how to relate the Euler–Poincaré
characteristic to the order parameter of the phase transition. This is probably not a simple
task and has not been done so far, even for models where an exact formula can be derived
for the Euler–Poincaré characteristic (see [5]). The scaling properties and critical exponents of
this quantity are also subjects of great interest.
Another important question concerns the critical behaviour in gauge models. For
example a similar study could provide some insights concerning the deconfining transition
in SU(N ) gauge theory. Indeed, some works [35] tend to indicate that this transition could
be probed by percolation of some physical clusters related to color fields in lattice QCD.
These models have been thoroughly investigated in the past and the tools coming from
algebraic topology have been of primary importance to uncover profound duality results
concerning their phase structure [23,36].
Other spin systems have to be investigated in order to see whether this property of the
Euler–Poincaré characteristic is shared by models with continuous symmetries such as the
(X–Y )-model or the Widom–Rowlinson model. How does χ behave in spin glasses for
example?
Acknowledgements
We are indebted to H. Wagner, who initiated our interest in the topics developed in this
paper, and to J. Ruiz for fruitful discussions.
Financial support from the BiBoS Research Center (University of Bielefeld), TMR
network ERBFMRX-CT-970122 and the DFG under grant FOR 339/1-2 is gratefully
acknowledged.
References
[35] S. Fortunato, H. Satz, Polyakov loop percolation and deconfinement in SU(2) gauge theory, Phys. Lett.
B 475 (2000) 311;
S. Fortunato, F. Karsch, P. Petreczky, H. Satz, Effective Z(2) spin models of deconfinement and percolation
in SU(2) gauge theory, Phys. Lett. B 502 (2000) 321.
[36] F.J. Wegner, Duality in general Ising models and phase transitions without local order parameter, J. Math.
Phys. 12 (1971) 2259.
Nuclear Physics B 644 [FS] (2002) 509–532
Abstract
We consider N = 1 supersymmetric sine-Gordon theory (SSG) with supersymmetric integrable
boundary conditions (boundary SSG = BSSG). We find two possible ways to close the boundary
bootstrap for this model, corresponding to two different choices for the boundary supercharge. We
argue that these two bootstrap solutions should correspond to the two integrable Lagrangian boundary
theories considered recently by Nepomechie.
© 2002 Elsevier Science B.V. All rights reserved.
Keywords: Supersymmetry; Sine-Gordon model; Integrable quantum field theory; Boundary scattering;
Bootstrap
1. Introduction
510 Z. Bajnok et al. / Nuclear Physics B 644 [FS] (2002) 509–532
As a result of integrability, the boundary scattering factorizes, and the general solution
of the boundary Yang–Baxter equation was found in [5], but the constraint of supersymmetry
was not imposed. Nepomechie was the first to consider supersymmetric boundary
scattering [4], building on previous results obtained in the case of supersymmetric sinh-Gordon
theory [6]. However, no one has exposed the full structure of the solitonic reflection
amplitude, although the results obtained in the case of the tricritical Ising model
[7] are closely related to this problem. Besides that, the closure of the bootstrap and the
spectrum of boundary states have not even been touched before. Therefore our aim is to
clear up the issue of supersymmetric boundary scattering in the BSSG model and to find
the complete spectrum of boundary states and their associated reflection factors. The main
idea, motivated by the successful description of the bulk scattering, is to look for the
reflection amplitudes in a form where there is no mixing between the supersymmetric and
other internal quantum numbers. This means an ansatz for the reflection amplitudes as a
product of two terms, one of which is the ordinary (bosonic) sine-Gordon reflection amplitude,
while the other describes the scattering of the SUSY degrees of freedom.
Within the bootstrap procedure, we consider first the solitonic reflection amplitudes
on the ground state and on the first two excited boundaries. The SUSY factors in these
solutions have no poles in the physical strip, so the masses of the emerging boundary states
are the same as in the bosonic theory; however, SUSY introduces a nontrivial degeneracy.
The spectrum of general higher excited boundaries is easily extracted from these results.
We also determine the various reflection amplitudes on these excited boundaries.
There are two ways to close the bootstrap, starting from two different ground state
reflection amplitudes, corresponding to two possible choices of the boundary supercharge.
Both solutions lead to the same spectrum of boundary states, but the reflection amplitudes
are different. In one case the reflections conserve fermionic parity, while in the other they
do not.
The layout of the paper is as follows. Section 2 briefly recalls some important facts
about supersymmetric sine-Gordon theory. In Section 3 the reader is reminded of the
supersymmetric and integrable boundary interactions that can be added to the theory,
and we also discuss the boundary supercharge and derive a formula relating it to the
boundary Hamiltonian. Section 4 gives a quick review (containing only the most necessary
facts) of the spectrum and reflection factors of the ordinary (nonsupersymmetric) sine-Gordon
model with integrable boundary conditions. In Section 5 we present the main
result of the paper, which is a conjecture for the spectrum and the full set of reflection
factors of the BSSG model. We consider the breather reflection amplitudes in Section 6
and then give our conclusions in Section 7.
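$$\mathcal{L}_{SSG} = \frac{1}{2}\partial_\mu\Phi\,\partial^\mu\Phi + i\bar\Psi\gamma^\mu\partial_\mu\Psi + m\bar\Psi\Psi\cos\frac{\beta}{2}\Phi + \frac{m^2}{\beta^2}\cos\beta\Phi, \tag{1}$$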
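$$S_{SG}(u)^{++}_{++} = S_{SG}(u)^{--}_{--} = -\prod_{l=1}^{\infty}\left[\frac{\Gamma\left(2(l-1)\lambda - \frac{\lambda u}{\pi}\right)\Gamma\left(2l\lambda + 1 - \frac{\lambda u}{\pi}\right)}{\Gamma\left((2l-1)\lambda - \frac{\lambda u}{\pi}\right)\Gamma\left((2l-1)\lambda + 1 - \frac{\lambda u}{\pi}\right)}\Big/(u \to -u)\right],$$
$$S_{SG}(u)^{+-}_{+-} = S_{SG}(u)^{-+}_{-+} = \frac{\sin(\lambda u)}{\sin(\lambda(\pi - u))}\,S_{SG}(u)^{++}_{++}, \qquad \lambda = \frac{2\pi}{\beta^2} - \frac{1}{2},$$
$$S_{SG}(u)^{-+}_{+-} = S_{SG}(u)^{+-}_{-+} = \frac{\sin(\lambda\pi)}{\sin(\lambda(\pi - u))}\,S_{SG}(u)^{++}_{++}, \qquad u = -i\theta, \tag{4}$$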
1 Note that the relation between the parameter λ and the coupling β is different from the sine-Gordon case.
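while the SUSY factor is identical to the S matrix of the tricritical Ising model perturbed
by the primary field of dimension 3/5 [8]:
$$S_{SUSY}\begin{pmatrix} 0 & \frac12 \\ \frac12 & 0 \end{pmatrix}(\theta) = S_{SUSY}\begin{pmatrix} 1 & \frac12 \\ \frac12 & 1 \end{pmatrix}(\theta) = 2^{(i\pi-\theta)/2\pi i}\cos\left(\frac{\theta}{4i} - \frac{\pi}{4}\right)K(\theta),$$
$$S_{SUSY}\begin{pmatrix} \frac12 & 0 \\ 0 & \frac12 \end{pmatrix}(\theta) = S_{SUSY}\begin{pmatrix} \frac12 & 1 \\ 1 & \frac12 \end{pmatrix}(\theta) = 2^{\theta/2\pi i}\cos\left(\frac{\theta}{4i}\right)K(\theta),$$
$$S_{SUSY}\begin{pmatrix} 0 & \frac12 \\ \frac12 & 1 \end{pmatrix}(\theta) = S_{SUSY}\begin{pmatrix} 1 & \frac12 \\ \frac12 & 0 \end{pmatrix}(\theta) = 2^{(i\pi-\theta)/2\pi i}\cos\left(\frac{\theta}{4i} + \frac{\pi}{4}\right)K(\theta),$$
$$S_{SUSY}\begin{pmatrix} \frac12 & 1 \\ 0 & \frac12 \end{pmatrix}(\theta) = S_{SUSY}\begin{pmatrix} \frac12 & 0 \\ 1 & \frac12 \end{pmatrix}(\theta) = 2^{\theta/2\pi i}\cos\left(\frac{\theta}{4i} - \frac{\pi}{2}\right)K(\theta),$$
$$K(\theta) = \frac{1}{\sqrt{\pi}}\prod_{k=1}^{\infty}\frac{\Gamma(k - 1/2 + \theta/2\pi i)\,\Gamma(k - \theta/2\pi i)}{\Gamma(k + 1/2 - \theta/2\pi i)\,\Gamma(k + \theta/2\pi i)}.$$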
As the SUSY factor has no poles in the physical strip, the solitonic amplitudes (3) have
poles at exactly the same locations as the sine-Gordon soliton S matrix. These correspond
to bound states (breathers) Bn of mass
$$m_n = 2M\sin\frac{\pi n}{2\lambda}, \qquad n = 1, \dots, [\lambda].$$
The S matrix of the breathers was first found in [9]. The breathers form a particle multiplet
composed of a boson and a fermion, on which supersymmetry is represented in a standard
way [10,11].
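The breather spectrum above is easy to tabulate. The following is a minimal sketch, assuming the relation $\lambda = 2\pi/\beta^2 - 1/2$ quoted with the soliton S matrix and reading $[\lambda]$ as the integer part of λ; the function names are illustrative only.

```python
import math

def ssg_lambda(beta):
    """Parameter lambda of the SSG model, lambda = 2*pi/beta^2 - 1/2."""
    return 2 * math.pi / beta**2 - 0.5

def breather_masses(beta, soliton_mass):
    """Masses m_n = 2 M sin(pi n / (2 lambda)) for n = 1, ..., [lambda]."""
    lam = ssg_lambda(beta)
    return [2 * soliton_mass * math.sin(math.pi * n / (2 * lam))
            for n in range(1, math.floor(lam) + 1)]

# Example: beta^2 = 2*pi/3 gives lambda = 2.5, hence two breathers.
masses = breather_masses(math.sqrt(2 * math.pi / 3), 1.0)
```

Each mass lies below the two-soliton threshold 2M, as expected for bound states.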
For the ordinary sine-Gordon theory, the correspondence between the Lagrangian theory
and the bootstrap S matrix (4) is very well established. There is much less evidence for the
correctness of the S matrix (3) as the scattering amplitude of SUSY sine-Gordon theory.
Besides the original construction [2] (based on arguments related to N = 1 supersymmetric
minimal models), another indication is that at a particular value of the coupling β where it
is expected to have a restriction to the SUSY version of Lee–Yang theory (superconformal
minimal model SM(2/8) perturbed by the relevant superconformal primary field Φ(1, 3),
which is equivalent to Virasoro minimal model M(3/8) perturbed by the primary field
Φ(1, 5)), the first breather supermultiplet has the same scattering amplitude as predicted
from RSOS restriction of imaginary coupled $a_2^{(2)}$ Toda theory in [12] (see also [13]).
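The bulk theory has two supersymmetry charges of opposite chirality, $Q$ and $\bar{Q}$, which
together form a Majorana spinor. They act on one-particle states $|A_i(\theta)\rangle$ in the following
way [8,10,11]:
$$Q|A_i(\theta)\rangle = \sqrt{m_i}\,e^{\theta/2}\,\mathcal{Q}|A_i(\theta)\rangle, \qquad \bar{Q}|A_i(\theta)\rangle = \sqrt{m_i}\,e^{-\theta/2}\,\bar{\mathcal{Q}}|A_i(\theta)\rangle,$$
where $m_i$ are the particle masses and $\mathcal{Q}$, $\bar{\mathcal{Q}}$ are matrices satisfying
$$\mathcal{Q}^2 = 1, \qquad \bar{\mathcal{Q}}^2 = 1.$$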
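In the one-particle basis $\{K_{0\frac12}, K_{1\frac12}, K_{\frac12 0}, K_{\frac12 1}\}$ (we omit the upper index $\epsilon$, as the
SUSY action does not depend on the topological charge), the supersymmetry algebra is
represented by the matrices
$$\mathcal{Q} = \begin{pmatrix} 0 & i & 0 & 0 \\ -i & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & -1 \end{pmatrix}, \qquad \bar{\mathcal{Q}} = \begin{pmatrix} 0 & i & 0 & 0 \\ -i & 0 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix}.$$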
The SUSY algebra of the sine-Gordon theory has a central charge and the matrix
$$Z = \frac{1}{2}\{\mathcal{Q}, \bar{\mathcal{Q}}\} = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & -1 \end{pmatrix}$$
describes the SUSY central charge in the above basis. This is not to be confused with
the topological charge T of the sine-Gordon solitons, which is represented by the upper
indices $\epsilon$. Z can take the values 0 or ±1, and it distinguishes between solitons/antisolitons
mediating from odd to even and from even to odd vacua of the bosonic potential [14] (this
means that using the terminology of [15] the theory is 2-folded).
The above representation of SUSY describes BPS saturated objects. There was a
controversy in the literature whether the solitons in SSG are BPS saturated, since N = 1
SUSY does not protect their mass from acquiring radiative corrections. However, it was
shown in [16] that they remain BPS saturated at one-loop and probably to all orders, due
to anomalous quantum corrections to the classical formula for the central charge Z.
The fermionic parity operator $\Gamma = (-1)^F$ is given by the matrix
$$\Gamma = \begin{pmatrix} 0 & 1 & 0 & 0 \\ 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 1 & 0 \end{pmatrix}.$$
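The algebraic relations among these 4 × 4 matrices can be checked mechanically. The sketch below uses plain nested lists, with the matrix entries as read from the text in the basis $\{K_{0\frac12}, K_{1\frac12}, K_{\frac12 0}, K_{\frac12 1}\}$; the helper names are illustrative. It also checks that, on a one-particle state of mass m and rapidity θ, the combination $Q \pm \bar{Q}$ squares to $2(m\cosh\theta \pm mZ)$, which is the one-particle form of the relation between the supercharges, the energy and the central charge.

```python
import math

def matmul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def lincomb(ca, a, cb, b):
    """Entrywise linear combination ca*a + cb*b."""
    n = len(a)
    return [[ca * a[i][j] + cb * b[i][j] for j in range(n)] for i in range(n)]

def is_close(a, b, tol=1e-12):
    """True if two matrices agree entrywise within tol."""
    n = len(a)
    return all(abs(a[i][j] - b[i][j]) < tol for i in range(n) for j in range(n))

def diag(*entries):
    n = len(entries)
    return [[entries[i] if i == j else 0 for j in range(n)] for i in range(n)]

IDENT = diag(1, 1, 1, 1)
ZERO = diag(0, 0, 0, 0)
Q = [[0, 1j, 0, 0], [-1j, 0, 0, 0], [0, 0, 1, 0], [0, 0, 0, -1]]
QBAR = [[0, 1j, 0, 0], [-1j, 0, 0, 0], [0, 0, -1, 0], [0, 0, 0, 1]]
GAMMA = [[0, 1, 0, 0], [1, 0, 0, 0], [0, 0, 0, 1], [0, 0, 1, 0]]

# Central charge Z = (1/2){Q, QBAR}
Z = lincomb(0.5, matmul(Q, QBAR), 0.5, matmul(QBAR, Q))

def squared_combination(m, theta, sign):
    """(Q + sign*Qbar)^2 acting on one-particle states of mass m, rapidity theta."""
    pref = math.sqrt(m)
    s = lincomb(pref * math.exp(theta / 2), Q,
                sign * pref * math.exp(-theta / 2), QBAR)
    return matmul(s, s)

def expected_square(m, theta, sign):
    """2*(E + sign*m*Z) with E = m*cosh(theta)."""
    return lincomb(2 * m * math.cosh(theta), IDENT, 2 * sign * m, Z)
```

The same helpers confirm that Γ squares to the identity and anticommutes with both matrices, as a fermionic parity operator should.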
Using the definition of Γ it is possible to specify a basis of pure bosonic and pure
fermionic states for any given (fixed) number of particles. However, the composition
(coproduct) rules of the kink states as given in (2) are not free and therefore in the boson-
fermion internal space supersymmetry acts nonlocally. The action of supersymmetry on
multi-particle states involves braiding factors depending on Γ that are defined by the
coproduct ∆:
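$$\Delta(\mathcal{Q}) = \mathcal{Q} \otimes I + \Gamma \otimes \mathcal{Q},$$
$$\Delta(\bar{\mathcal{Q}}) = \bar{\mathcal{Q}} \otimes I + \Gamma \otimes \bar{\mathcal{Q}},$$
$$\Delta(\Gamma) = \Gamma \otimes \Gamma.$$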
The action on breather states can be derived using the bootstrap, but can also be obtained
from the representation theory of the SUSY algebra. It turns out that the central charge Z
(as well as the topological charge T ) vanishes identically for the breathers. For further
details we refer to [10,11].
$$QQ + \bar{Q}\bar{Q} = 2H, \tag{7}$$
where $H = \int h(x,t)\,dx$ is the Hamiltonian, Z is the SUSY central charge and M is the
soliton mass. In a boundary theory with supersymmetric integrable boundary condition,
the conserved supercharge can be written as follows:
$$\tilde{Q}_\pm = \int_{-\infty}^{0}\left(q(x,t) \pm \bar{q}(x,t)\right)dx + Q_B(x=0,t),$$
$$\tilde{H} = \int_{-\infty}^{0} h(x,t)\,dx + H_B(x=0,t),$$
where $H_B$ is the boundary interaction. Let us for the moment neglect the contribution from
the central charge Z (this is possible, e.g., in a sector containing only breathers). Then,
using Eq. (7) it is easy to see that
$$\tilde{Q}_\pm^2 = 2\int_{-\infty}^{0} h(x,t)\,dx + 2H_B(x=0,t),$$
$$\tilde{Q}_\pm^2 = 2\tilde{H}.$$
$$\tilde{Q}_\pm^2 = 2(\tilde{H} \pm M\tilde{Z}), \tag{8}$$
where $\tilde{Z}$ is an appropriate extension of Z to the boundary situation. We shall see that the
two bootstrap solutions we propose correctly reproduce this formula.
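$$\sigma(x,u) = \frac{\cos x}{\cos(x + \lambda u)}\prod_{l=1}^{\infty}\left[\frac{\Gamma\left(\frac12 + \frac{x}{\pi} + (2l-1)\lambda - \frac{\lambda u}{\pi}\right)\Gamma\left(\frac12 - \frac{x}{\pi} + (2l-1)\lambda - \frac{\lambda u}{\pi}\right)}{\Gamma\left(\frac12 - \frac{x}{\pi} + (2l-2)\lambda - \frac{\lambda u}{\pi}\right)\Gamma\left(\frac12 + \frac{x}{\pi} + 2l\lambda - \frac{\lambda u}{\pi}\right)}\Big/(u \to -u)\right]$$
describes the boundary condition dependence. The reflection factors of the breathers can
be obtained by the bulk bootstrap procedure [19].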
In the odd sector, i.e., when k is odd, the same formulae apply if in the ground state
reflection factors the η ↔ η̄ and s ↔ s̄ changes are made. The breather sector can be
obtained again by bulk fusion.
$$R_{SUSY}(\theta) \times R_{SG}(\theta).$$
In this special form constraints such as unitarity, the boundary Yang–Baxter equation and the
crossing-unitarity relation [18] can be satisfied separately for the two factors. Since the
or in detail
and
$$K_{\frac12 a}(\theta)|B_a\rangle = R^{\frac12}_{aa}(\theta)\,K_{\frac12 a}(-\theta)|B_a\rangle + R^{\frac12}_{ab}(\theta)\,K_{\frac12 b}(-\theta)|B_b\rangle, \qquad b \neq a, \quad a, b = 0, 1. \tag{11}$$
In the second process the label of the boundary state has changed, which shows that $|B_0\rangle$
and $|B_1\rangle$ form a doublet. All of the constraints mentioned above factorize in the sense
that they give independent equations for the reflections on the boundary $|B_{1/2}\rangle$ and on the
doublet $|B_{0,1}\rangle$. Since the ground state boundary is expected to be nondegenerate we first
concentrate on reflection factors off the singlet boundary $|B_{1/2}\rangle$. The most general solution
of the boundary Yang–Baxter equation is of the form [5]
$$R^{0}_{\frac12\frac12}(\theta) = \left(1 + A\sinh(\theta/2)\right)M(\theta); \qquad R^{1}_{\frac12\frac12}(\theta) = \left(1 - A\sinh(\theta/2)\right)M(\theta)$$
$$\tilde{Q}_\pm = Q \pm \bar{Q} + Q_B,$$
where $Q$, $\bar{Q}$ act on the particles as in the bulk theory (Section 2.2) [footnote 2]. The reason for this
is that they are given by integrals of local (fermionic) densities, and asymptotic particles
are localized far away from the wall, so the action of these charges is not affected by the
presence of the boundary. QB is the action of the boundary contribution, which we take to
be
QB = γ Γ, (12)
where γ is some unknown parameter (related to the energy of the boundary ground
state—see later). The reason for this choice is that we expect the boundary supercharge to
commute with the bulk S-matrix, which is symmetric under the action of $Q$, $\bar{Q}$ and Γ by
construction [10,11]. (12) is also supported by classical considerations in [4], showing also
that the classical version of γ is a function of the parameters in the boundary Lagrangian
(6).
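Next we need to give the action of $Q$, $\bar{Q}$ and Γ on the boundary ground state $|B_{1/2}\rangle$.
Following [7], we choose
$$Q|B_{\frac12}\rangle = 0, \qquad \bar{Q}|B_{\frac12}\rangle = 0, \qquad \Gamma|B_{\frac12}\rangle = |B_{\frac12}\rangle. \tag{13}$$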
The first two relations express that the boundary ground state is supersymmetric, while
the last one shows that it is an eigenvector of Γ . We expect that because the ground
state is nondegenerate. The choice of the eigenvalue (±1) is not important, as it could
be compensated by a redefinition of γ. It is a consequence of (8) and (13) that the ground
state energy is $\gamma^2/2$, which will be shown later to be consistent with the action of $\tilde{Q}_\pm$ on
the excited boundary states.
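The value of the ground state energy follows in one line from (8), (12) and (13). The sketch below assumes that the extended central charge vanishes on the ground state, $\tilde{Z} = 0$, which is the convention the text adopts later for $|B_{1/2}\rangle$.

```latex
\tilde{Q}_\pm |B_{1/2}\rangle
  = \left(Q \pm \bar{Q} + \gamma\,\Gamma\right)|B_{1/2}\rangle
  = \gamma\,|B_{1/2}\rangle
\quad\Longrightarrow\quad
\tilde{Q}_\pm^{2}\,|B_{1/2}\rangle = \gamma^{2}\,|B_{1/2}\rangle .
% Compare with (8) at \tilde{Z} = 0:
\tilde{Q}_\pm^{2} = 2\tilde{H}
\quad\Longrightarrow\quad
\tilde{H}\,|B_{1/2}\rangle = \frac{\gamma^{2}}{2}\,|B_{1/2}\rangle .
```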
$$= 2^{-\theta/\pi i}P(\theta). \tag{14}$$
If, however, it is $\tilde{Q}_-$ that commutes with the reflections (BSSG−) then the result is
$$R^{0}_{\frac12\frac12}(\theta) = \left(\cos\frac{\xi}{2} + i\sinh\frac{\theta}{2}\right)K(\theta - i\xi)\,K(i\pi - \theta - i\xi)\,2^{-\theta/\pi i}P(\theta),$$
$$R^{1}_{\frac12\frac12}(\theta) = \left(\cos\frac{\xi}{2} - i\sinh\frac{\theta}{2}\right)K(\theta - i\xi)\,K(i\pi - \theta - i\xi)\,2^{-\theta/\pi i}P(\theta), \tag{15}$$
2 Note that our charge $\bar{Q}$ differs by a sign from that used in [7], while it agrees with the convention in [11]. This
is important when comparing our results with those in [7].
where ξ is related to γ as
$$\gamma = -2\sqrt{M}\cos\frac{\xi}{2}. \tag{16}$$
Note that symmetry of the reflection under Γ requires
$$R^{0}_{\frac12\frac12}(\theta) = R^{1}_{\frac12\frac12}(\theta),$$
thus in the first case (BSSG+ ) the reflections also commute with the operator Γ , while in
the other case (BSSG− ) they do not. We remark that there are no poles in the physical strip
in any of the reflection factors above.
In the case BSSG+ , the supersymmetry constraints do not determine the value of γ
in contrast to the results of [7,22]. The reason is that it is the supersymmetry of the
reflections on $|B_a\rangle$, a = 0, 1 which connects γ with a parameter in the reflection matrix
itself. However, we construct these reflections by the bootstrap, which determines them
completely, and γ is left as a free parameter. As it was argued above and as will also be
seen later γ is connected to the vacuum energy, so it is not a new parameter of the theory
(in principle it is expressible in terms of the Lagrangian parameters). The only independent
parameters introduced by the boundary are η and ϑ which are present in the bosonic sine-
Gordon reflection factors RSG .
$$R^{a}_{\frac12\frac12}(\theta) \times R_{SG}(\theta),$$
where the SUSY component has the form (14). Since the only poles of these reflection
factors are due to the sine-Gordon part their explanation has to be similar to that in the
bosonic theory. However, we have to supplement the formulae for the bosonic theory
with RSOS indices in a consistent way. The sine-Gordon reflection factor has boundary
independent poles at $i\frac{n\pi}{2\lambda}$ for n = 1, 2, . . . , which can be described by Fig. 1. This
is identical to the nonsupersymmetric diagram except that it is decorated with RSOS
indices, which are displayed inside circles. Clearly the dashed line denotes the full breather
supermultiplet (now consisting of a boson and a fermion).
The boundary dependent poles of $R_{SG}$ are located at
$$-i\theta = \nu_n = \frac{\eta}{\lambda} - \frac{(2n+1)\pi}{2\lambda}, \qquad n = 0, 1, \dots.$$
At the position of these poles we associate boundary bound states to the reflection
amplitudes $R^{a}_{\frac12\frac12}$, a = 0, 1
$$|a, \tfrac12|n\rangle = \frac{1}{g^{|1/2\rangle}_{|a,1/2|n\rangle}}\,K_{a\frac12}(i\nu_n)\,|\tfrac12\rangle, \qquad \text{where } |\tfrac12\rangle \equiv |B_{\frac12}\rangle, \tag{17}$$
where the g-factor is the SUSY part of the boundary coupling, coming from the SUSY
component of the reflection factor (for definitions of boundary couplings, see [18]). The
two states (a = 0, 1) for a given n form a doublet which realizes the structure (11), that
is, the $K_{\frac12 a}$ kinks can scatter on it. The action of the boundary supercharge on these states
can be calculated using the coproduct rules in [10,11], taking into account the action of the
charges on the boundary ground state:
$$\tilde{Q}_+|0, \tfrac12|n\rangle = r\left(\gamma - 2i\sqrt{M}\cos\frac{\nu_n}{2}\right)|1, \tfrac12|n\rangle,$$
$$\tilde{Q}_+|1, \tfrac12|n\rangle = r^{-1}\left(\gamma + 2i\sqrt{M}\cos\frac{\nu_n}{2}\right)|0, \tfrac12|n\rangle,$$
$$r = \frac{g^{|1/2\rangle}_{|1,1/2|n\rangle}}{g^{|1/2\rangle}_{|0,1/2|n\rangle}}. \tag{18}$$
The boundary supercharge satisfies
$$\tilde{Q}_+^2|a, \tfrac12|n\rangle = 2\left(\frac{\gamma^2}{2} + M\cos\nu_n + M\right)|a, \tfrac12|n\rangle$$
which is exactly the relation $\tilde{Q}_+^2 = 2(\tilde{H} + M\tilde{Z})$, since the central charge of this state is
$\tilde{Z} = 1$ (we take the ground state $|\tfrac12\rangle$ to have $\tilde{Z} = 0$, while the bulk soliton $K_{a\frac12}$ has
Z = 1), the ground state has energy $\gamma^2/2$ by virtue of the relations (12), (13) and $M\cos\nu_n$
is the energy that the excited state has relative to the ground state (10).
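The squared supercharge can be checked numerically from the off-diagonal action (18) alone. This is a minimal sketch: the doublet is treated as a two-dimensional space, the ratio r of g-factors is an arbitrary nonzero number (it drops out), and the function names are illustrative.

```python
import math

def q_plus_squared_eigenvalue(gamma, soliton_mass, nu, r=1.0):
    """Eigenvalue of Q_+^2 on the doublet |a,1/2|n>, from the action (18).

    Q_+ |0> = up * |1>  and  Q_+ |1> = down * |0>, so Q_+^2 acts as up * down.
    """
    root = 2j * math.sqrt(soliton_mass) * math.cos(nu / 2)
    up = r * (gamma - root)
    down = (gamma + root) / r
    return up * down

def expected_eigenvalue(gamma, soliton_mass, nu):
    """2 * (gamma^2/2 + M cos(nu) + M), i.e. 2(H + M Z) with Z = 1."""
    return 2 * (gamma**2 / 2 + soliton_mass * math.cos(nu) + soliton_mass)
```

The agreement rests on the identity $4\cos^2(\nu_n/2) = 2(1 + \cos\nu_n)$, and it holds for any value of r, consistent with the remark that no physical quantity depends on the g-factors.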
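The SUSY reflection factors of $K_{\frac12 a}$ off $|a, \tfrac12|n\rangle$ can be computed from the bootstrap
principle (Figs. 2 and 3):
$$g_a^{\frac12}\,R^{\frac12}_{ab}(\theta) = g_b^{\frac12}\sum_{x=0,1} S\begin{pmatrix} \frac12 & x \\ a & \frac12 \end{pmatrix}(\theta - i\nu_n)\;S\begin{pmatrix} \frac12 & b \\ x & \frac12 \end{pmatrix}(\theta + i\nu_n)\;R^{x}_{\frac12\frac12}(\theta), \qquad \text{where } g_a^{\frac12} \equiv g^{|1/2\rangle}_{|a,1/2|n\rangle}. \tag{19}$$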
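$$R^{\frac12}_{ab}(\theta) = P(\theta)\,K(\theta + i\nu_n)\,K(\theta - i\nu_n)\,\frac{g_b^{\frac12}}{g_a^{\frac12}}\left[\delta_{ab}\cos\frac{\nu_n}{2} + \delta_{a,1-b}\sin\frac{\theta}{2i}\right]. \tag{20}$$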
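The trigonometric structure of (20) can be checked against the sum over x in the bootstrap equation (19). The sketch below keeps only the angular parts of the SUSY S-matrix amplitudes of the type $S(\tfrac12\, x;\, a\, \tfrac12)$, namely $\cos(\theta/4i)$ when the two integer labels coincide and $\cos(\theta/4i - \pi/2)$ otherwise; the common factors $2^{\theta/2\pi i}K(\theta)$ and the ground state factor (14) are stripped, since they only supply the prefactor $P(\theta)K(\theta + i\nu_n)K(\theta - i\nu_n)$. The index assignment is the one read from the garbled source and should be treated as an assumption.

```python
import cmath
import math

def angular_part(theta, labels_equal):
    """Angular part of S(1/2 x; a 1/2)(theta), common prefactor removed."""
    z = theta / 4j
    return cmath.cos(z) if labels_equal else cmath.cos(z - math.pi / 2)

def bootstrap_sum(a, b, theta, nu):
    """Sum over the intermediate RSOS label x = 0, 1 in the bootstrap equation."""
    total = 0
    for x in (0, 1):
        total += (angular_part(theta - 1j * nu, x == a)
                  * angular_part(theta + 1j * nu, b == x))
    return total

def expected_structure(a, b, theta, nu):
    """delta_ab cos(nu/2) + delta_{a,1-b} sin(theta/2i), as in (20)."""
    return math.cos(nu / 2) if a == b else cmath.sin(theta / 2j)
```

The diagonal entries reduce to $\cos(A - B)$ and the off-diagonal ones to $\sin(A + B)$ with $A = (\theta - i\nu_n)/4i$ and $B = (\theta + i\nu_n)/4i$, which is where the two terms of (20) come from.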
Note the appearance of the g factors in the result. They are the SUSY parts of the boundary
couplings and come in two types: one corresponds to the absorption of the particle while
creating a higher excited boundary state, the other describing the emission of the particle
and transition to some lower excited boundary state. The ones above are of the absorption
type. The residues of the full reflection factor are described by the product of an emission
and an absorption type full boundary coupling (for a definition of boundary couplings and
their relation to the residue of the reflection factor see [18]). Due to the tensor product
structure the full boundary coupling is given by the bosonic part multiplied with the SUSY
g-factor, as in the case of bulk scattering [11]. The product of the appropriate emission
and absorption SUSY g-factors is constrained to coincide with the value of the SUSY part
of the reflection factor at the position of the pole in the bosonic factor. It can be seen in
general that this does not give enough constraints to determine their value unambiguously
due to the degeneracy introduced by the RSOS indices a, b, and as no physical quantity
should explicitly depend on their value (see the example of the relation $\tilde{Q}_+^2 = 2(\tilde{H} + M\tilde{Z})$
discussed above) we do not present any solution for them.
Being constructed by the bootstrap, the reflection factors (20) necessarily satisfy the
constraints of boundary factorization and crossing-unitarity; in addition, they commute
with $\tilde{Q}_+$, which is guaranteed by the fact that the action of the boundary supercharge is also
derived from the bootstrap as in (18). The full reflection factor on the $|a, \tfrac12|n\rangle$ excited
boundary can be obtained by multiplying this result with the appropriate excited bosonic
reflection factor:
$$R^{\frac12}_{ab}(\theta) \times Q_{|n\rangle}(\eta, \vartheta, \theta) \qquad \text{or} \qquad R^{\frac12}_{ab}(\theta) \times P^{\pm}_{|n\rangle}(\eta, \vartheta, \theta). \tag{21}$$
Clearly (20) has neither pole nor zero in the physical strip. So the poles of the reflection
factors on the first excited wall (21) are exactly the same as in the nonsupersymmetric
theory: that is, they are at $i\nu_k$ or at $iw_m$.
The decoration of the nonsupersymmetric diagrams shows that Fig. 5 explains the ν
type of poles, while Fig. 4 explains the w type, but only for $w_m > \nu_n$. For $w_m < \nu_n$ we
have a boundary bound state which we denote by
$$|\tfrac12, a, \tfrac12|m, n\rangle = \frac{1}{g^{|a,1/2|n\rangle}_{|\frac12,a,\frac12|m,n\rangle}}\,K_{\frac12 a}(iw_m)\,|a, \tfrac12|n\rangle,$$
so this is also a doublet, but now it is the $K_{a\frac12}$ type kinks that are able to reflect on it. It
can be checked easily that the relation $\tilde{Q}_+^2 = 2(\tilde{H} + M\tilde{Z})$ holds for these states as well,
consistently with the previous interpretation of γ (for these states $\tilde{Z} = 0$).
At this point the question emerges whether the two states a = 0, 1 forming the doublet
$|\tfrac12, a, \tfrac12|m, n\rangle$ are physically different or there is a possibility for some identification
so that a single state can explain the pole in the reflection matrix (21). This can be
decided by examining whether one can describe the residue of the reflection factor with a
single intermediate state, which implies a relation between the $R^{1/2}_{00}$, $R^{1/2}_{11}$, $R^{1/2}_{10}$ and $R^{1/2}_{01}$
components of the reflection factor at the pole. This relation is violated (for generic values
of the parameters) and so one must really introduce the two states above.
Following the same analysis we performed in [21], but now using a decorated version
of the Coleman–Thun diagrams, it can be seen that the poles in the reflection matrix on the
above boundary excited state which cannot be explained by Coleman–Thun diagrams are
located at $i\nu_k$. Since the poles appear in association with both reflection factors $R^{b}_{\frac12\frac12}(\theta)$,
the corresponding boundary states, which are denoted by
$$|b, \tfrac12, a, \tfrac12|k, m, n\rangle$$
have a fourfold degeneracy.
Fig. 6.
It is clear that the general boundary bound state has the structure
$$|a_k, \dots, \tfrac12, a_1, \tfrac12|n_k, \dots, m_1, n_1\rangle \qquad \text{or} \qquad |\tfrac12, a_k, \dots, \tfrac12, a_1, \tfrac12|m_k, n_k, \dots, m_1, n_1\rangle. \tag{22}$$
From this we see that in the supersymmetric case the boundary excited states have a
nontrivial degeneracy in contrast to the bosonic theory. The degeneracy is labeled by RSOS
sequences starting from 1/2. In both states in (22) the labels $a_i$ can freely take the values
0 and 1, and, as a result, the degeneracy of the states is $2^k$. The associated reflection
factors can be computed from successive application of the bootstrap procedure, which
is illustrated in Fig. 6.
The result depends on the Z charge of the scattering particles. In the Z
= 0 case the
result can be written in the following form
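where $f^{a_1 a_2}_{b_1 b_2}(w_{m_1}, \nu_{n_2}, \theta)$ is the contribution of the dotted square summing over $x_1 = 0, 1$,
that is
$$f^{a_1 a_2}_{b_1 b_2}(w_{m_1}, \nu_{n_2}, \theta) = \sum_{x_1=0,1} S\begin{pmatrix} x_1 & \frac12 \\ \frac12 & a_1 \end{pmatrix}(\theta - iw_{m_1})\;S\begin{pmatrix} x_1 & \frac12 \\ \frac12 & b_1 \end{pmatrix}(\theta + iw_{m_1}) \times S\begin{pmatrix} \frac12 & x_1 \\ a_2 & \frac12 \end{pmatrix}(\theta - i\nu_{n_2})\;S\begin{pmatrix} \frac12 & b_2 \\ x_1 & \frac12 \end{pmatrix}(\theta + i\nu_{n_2}).$$
Collecting the common factors we have
$$f^{a_1 a_2}_{b_1 b_2}(w_{m_1}, \nu_{n_2}, \theta) = K(\theta - iw_{m_1})\,K(\theta + iw_{m_1})\,K(\theta - i\nu_{n_2})\,K(\theta + i\nu_{n_2}) \times \frac{g^{|a_1,\frac12|n_1\rangle}_{|\frac12,a_1,\frac12|m_1,n_1\rangle}\;g^{|\frac12,a_1,\frac12|m_1,n_1\rangle}_{|a_2,\frac12,a_1,\frac12|n_2,m_1,n_1\rangle}}{\{a \leftrightarrow b\}}\;h^{a_1 a_2}_{b_1 b_2}(w_{m_1}, \nu_{n_2}, \theta),$$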
where
where
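$$h^{a_k, b_k}_{x_k} = 2^{-\theta/i\pi}\,\frac{g^{|a_k \frac12 \dots\rangle}_{|\frac12 a_{k-1} \dots\rangle}}{g^{|b_k \frac12 \dots\rangle}_{|\frac12 b_{k-1} \dots\rangle}}\left[-\sin\left(\frac{\theta}{2i} - \frac{\pi}{2}(\delta_{x_k,a_k} + \delta_{x_k,b_k})\right) + \cos\left(\frac{w_{m_k}}{2} + \frac{\pi}{2}(\delta_{x_k,a_k} - \delta_{x_k,b_k})\right)\right].$$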
$$R^{a}_{\frac12\frac12}(\theta) \times R_{SG}(\theta)$$
the supersymmetry factors, $R^{a}_{\frac12\frac12}(\theta)$, a = 0, 1, are taken now from (15). These factors
depend explicitly on γ, and this dependence persists in the (SUSY) reflection amplitudes
on excited boundaries. Nevertheless, since none of these amplitudes has a pole in the
physical strip, following the steps of the previous considerations leads to the same
conclusion regarding the indexing and degeneracies of the boundary states. Therefore we
concentrate here mainly on the differences between the two solutions.
At the position of the $\nu_n$ poles in the ground state reflection amplitude we again associate
boundary bound states $|a, \tfrac12|n\rangle$ to $R^{a}_{\frac12\frac12}$, a = 0, 1 as in (17) (though of course the present
values of boundary couplings may differ from the previous ones). The action of the present
boundary supercharge, $\tilde{Q}_-$, on these states is
$$\tilde{Q}_-|0, \tfrac12|n\rangle = r\left(\gamma + 2\sqrt{M}\sin\frac{\nu_n}{2}\right)|1, \tfrac12|n\rangle,$$
$$\tilde{Q}_-|1, \tfrac12|n\rangle = r^{-1}\left(\gamma - 2\sqrt{M}\sin\frac{\nu_n}{2}\right)|0, \tfrac12|n\rangle.$$
The action of $\tilde{Q}_-^2$ on these states is compatible with the relation $\tilde{Q}_-^2 = 2(\tilde{H} - M\tilde{Z})$,
provided we keep the interpretation of $\gamma^2/2$ as the ground state energy.
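The compatibility can be made explicit in two lines. This is a sketch that uses only the off-diagonal action of the boundary supercharge on the doublet given above, together with the assignments $\tilde{Z} = 1$ for the doublet and $M\cos\nu_n$ for its energy above the ground state, as in the BSSG+ case.

```latex
\tilde{Q}_-^{2}\,|a,\tfrac12|n\rangle
  = \left(\gamma + 2\sqrt{M}\sin\frac{\nu_n}{2}\right)
    \left(\gamma - 2\sqrt{M}\sin\frac{\nu_n}{2}\right)|a,\tfrac12|n\rangle
  = \left(\gamma^{2} - 4M\sin^{2}\frac{\nu_n}{2}\right)|a,\tfrac12|n\rangle .
% Using 4 sin^2(nu/2) = 2(1 - cos nu):
\gamma^{2} - 4M\sin^{2}\frac{\nu_n}{2}
  = 2\left(\frac{\gamma^{2}}{2} + M\cos\nu_n - M\right)
  = 2\left(E - M\tilde{Z}\right), \qquad
  E = \frac{\gamma^{2}}{2} + M\cos\nu_n,\ \ \tilde{Z} = 1 .
```

The ratio r of g-factors cancels in the product, just as in the BSSG+ case.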
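Using the SUSY factors of Eq. (15) in the bootstrap equation (19) for the reflections of the
$K_{\frac12 a}$ kinks on these boundary states gives
$$R^{\frac12}_{ab}(\theta) = Z_-(\theta)\,\frac{g_b^{\frac12}}{g_a^{\frac12}}\left[\delta_{ab}\left(\cos\frac{\xi}{2}\cos\frac{\nu_n}{2} + (-1)^a\,i\sinh\frac{\theta}{2}\cosh\frac{\theta}{2}\right) - i\,\delta_{a,1-b}\sinh\frac{\theta}{2}\left(\cos\frac{\xi}{2} + (-1)^b\sin\frac{\nu_n}{2}\right)\right],$$
where ξ is expressed in terms of γ in Eq. (16), and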
The breathers have vertex type scattering matrices in contrast to the RSOS type ones
of the kinks. These scattering matrices enter into the equations determining the reflection
factors of the breathers, nevertheless there is no need for their explicit form as the breather
reflection factors on the various boundaries can be obtained from that of the soliton
kinks by using the (bulk) fusion and the bootstrap [19]; the procedure is summarized
schematically in Fig. 7.
Fig. 7. The bootstrap procedure for the breather reflection factors on the boundary ground state $|\tfrac12\rangle$.
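If two bulk kinks form a bound state at a rapidity difference iρ (0 < ρ < π) the bound
state is identified with a supermultiplet (φ, ψ) of mass 2M cos(ρ/2). (In the case of the kth
breather $\rho = \rho_k = \pi - \frac{k\pi}{\lambda}$.) The fusing coefficients of these processes are defined via [11]:
$$K_{ab}(\theta + i\rho/2)\,K_{bc}(\theta - i\rho/2) = f^{\phi}_{abc}\,\phi(\theta) + f^{\psi}_{abc}\,\psi(\theta)$$
with the nonvanishing coefficients being:
$$f^{\phi}_{0\frac12 0} = f^{\phi}_{1\frac12 1} = 2^{(\pi-2\rho)/4\pi}f^{\phi}_{\frac12 0\frac12} = 2^{(\pi-2\rho)/4\pi}f^{\phi}_{\frac12 1\frac12} = \sqrt{K(i\rho)\,2^{(\pi-\rho)/2\pi}\cos\left(\frac{\rho-\pi}{4}\right)}$$
and
$$f^{\psi}_{1\frac12 0} = -f^{\psi}_{0\frac12 1} = 2^{(\pi-2\rho)/4\pi}\,i\,f^{\psi}_{\frac12 0\frac12} = -2^{(\pi-2\rho)/4\pi}\,i\,f^{\psi}_{\frac12 1\frac12} = \sqrt{K(i\rho)\,2^{(\pi-\rho)/2\pi}\cos\left(\frac{\rho+\pi}{4}\right)}.$$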
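To describe the ground state reflection amplitudes of the bosonic (φ) and fermionic (ψ)
components we represent them as
$$\phi(\theta) = \frac{1}{2f^{\phi}_{\frac12 0\frac12}}\left[K_{\frac12 0}(\theta + i\rho/2)\,K_{0\frac12}(\theta - i\rho/2) + K_{\frac12 1}(\theta + i\rho/2)\,K_{1\frac12}(\theta - i\rho/2)\right],$$
$$\psi(\theta) = \frac{1}{2f^{\psi}_{\frac12 0\frac12}}\left[K_{\frac12 0}(\theta + i\rho/2)\,K_{0\frac12}(\theta - i\rho/2) - K_{\frac12 1}(\theta + i\rho/2)\,K_{1\frac12}(\theta - i\rho/2)\right].$$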
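These two combinations follow from inverting the fusing relation for the kink pairs that begin and end in the vacuum 1/2. The sketch below assumes only the relations $f^{\phi}_{\frac12 0\frac12} = f^{\phi}_{\frac12 1\frac12}$ and $f^{\psi}_{\frac12 0\frac12} = -f^{\psi}_{\frac12 1\frac12}$ quoted for the fusing coefficients; the arguments $\theta \pm i\rho/2$ of the kinks are suppressed.

```latex
K_{\frac12 0}K_{0\frac12}
  = f^{\phi}_{\frac12 0\frac12}\,\phi + f^{\psi}_{\frac12 0\frac12}\,\psi ,
\qquad
K_{\frac12 1}K_{1\frac12}
  = f^{\phi}_{\frac12 1\frac12}\,\phi + f^{\psi}_{\frac12 1\frac12}\,\psi
  = f^{\phi}_{\frac12 0\frac12}\,\phi - f^{\psi}_{\frac12 0\frac12}\,\psi .
% Adding and subtracting the two relations:
K_{\frac12 0}K_{0\frac12} + K_{\frac12 1}K_{1\frac12}
  = 2 f^{\phi}_{\frac12 0\frac12}\,\phi ,
\qquad
K_{\frac12 0}K_{0\frac12} - K_{\frac12 1}K_{1\frac12}
  = 2 f^{\psi}_{\frac12 0\frac12}\,\psi .
```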
These expressions show that they also provide an ordinary doublet representation of the
boundary supercharge $\tilde{Q}_\pm$ and that the fermionic parity Γ acts on them in the standard
way. The actual reflection factors are obtained from the bootstrap equation in Fig. 7, where
the dashed lines represent either φ or ψ. The bosonic and fermionic reflection factors are
qualitatively different in the Γ symmetric and Γ nonsymmetric cases, since the bootstrap
equations contain both the $R^{0}_{\frac12\frac12}$ and the $R^{1}_{\frac12\frac12}$ ground state kink reflection amplitudes, and
these are significantly different in the two cases. Writing the breather reflection factors on
the ground state boundary as
$$\begin{pmatrix} \phi \\ \psi \end{pmatrix}(\theta)\,|\tfrac12\rangle = \begin{pmatrix} A_+ & B \\ \tilde{B} & A_- \end{pmatrix}\begin{pmatrix} \phi \\ \psi \end{pmatrix}(-\theta)\,|\tfrac12\rangle$$
in the Γ symmetric (BSSG+) case one obtains
$$B = \tilde{B} = 0, \qquad A_+ = Z(\theta)\cos\left(\frac{\theta}{2i} - \frac{\pi}{4}\right), \qquad A_- = Z(\theta)\cos\left(\frac{\theta}{2i} + \frac{\pi}{4}\right),$$
$$Z(\theta) = P(\theta + i\rho/2)\,P(\theta - i\rho/2)\,\sqrt{2}\,K(2\theta)\,2^{-\theta/i\pi},$$
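in the bootstrap procedure. To emphasize that even in the BSSG+ case there are φ → ψ
type reflections on excited boundaries we give here the reflection matrix of breathers
on the $|a, \tfrac12|n\rangle$ states. In the basis of $|\phi(\theta)\,0, \tfrac12|n\rangle$, $|\phi(\theta)\,1, \tfrac12|n\rangle$, $|\psi(\theta)\,0, \tfrac12|n\rangle$,
$|\psi(\theta)\,1, \tfrac12|n\rangle$ it can be written as
$$\tilde{Z}(\theta)\begin{pmatrix} C_+ & 0 & 0 & -D/r \\ 0 & C_+ & rD & 0 \\ 0 & D/r & C_- & 0 \\ -rD & 0 & 0 & C_- \end{pmatrix}, \tag{25}$$
where $D = \frac{1}{\sqrt{2}}\sqrt{\cos\frac{\rho}{2}}\,\cos\frac{\nu_n}{2}\,\sin\frac{\theta}{i}$ and
$$C_\pm = \cos^2\frac{\nu_n}{2}\cos\left(\frac{\theta}{2i} \mp \frac{\pi}{4}\right) + \frac{1}{2}\left(\cos\frac{\rho}{2} - \cos\frac{\theta}{i}\right)\cos\left(\frac{\theta}{2i} \pm \frac{\pi}{4}\right).$$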
It is easy to show that in spite of the nontrivial boson fermion reflection the operator Γ
commutes with this reflection matrix.
A nontrivial check on the consistency of the bootstrap solution can be obtained by
considering the pole structure of the full reflection amplitude containing the SUSY factor
(25). The bosonic reflection factor of $B_k$ on the bosonic boundary excited state $|n\rangle$ has a
pole at $-i\theta = \frac{\pi}{2} - \frac{\eta}{\lambda} + \frac{\pi}{2\lambda}(k + 2n + 1)$ [21]. In the supersymmetric case, this means that
a boundary excited state of the form
$$\left|a, \tfrac12\right\rangle |n + k\rangle \tag{26}$$
enters as an on-shell intermediate state in the scattering of $B_k$ on $|a, \tfrac12\rangle|n\rangle$. However, due
to the doublet (boson/fermion) structure of the breather naively one would expect 4 states
to explain the residue of the 4 × 4 reflection factor. In the conjectured spectrum, on the
other hand, the only possible process goes via (26) and it allows for only two intermediate
states (a = 0, 1). Therefore one expects that the determinant of the matrix (25) should have
a double zero there. It can be verified by direct calculation that this double zero is indeed
there without imposing any restriction on the parameters.
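One way to see why a double zero is natural can be sketched numerically. Assuming the matrix layout read off from (25), with rows (C+, 0, 0, −D/r), (0, C+, rD, 0), (0, D/r, C−, 0), (−rD, 0, 0, C−), the matrix splits into two 2 × 2 blocks that share the determinant C+C− − D², so the full determinant would be the perfect square (C+C− − D²)². The Python sketch below checks this identity for random real values. The function names are illustrative, the overall factor Z(θ) is dropped, and nothing here uses the explicit forms of C± and D.

```python
import random

def det(m):
    """Determinant by Laplace expansion along the first row (fine for 4x4)."""
    if len(m) == 1:
        return m[0][0]
    total = 0.0
    for col, entry in enumerate(m[0]):
        minor = [row[:col] + row[col + 1:] for row in m[1:]]
        total += (-1) ** col * entry * det(minor)
    return total

def susy_matrix(c_plus, c_minus, d, r):
    """Matrix part of (25) as assumed above, without the factor Z(theta)."""
    return [
        [c_plus, 0.0, 0.0, -d / r],
        [0.0, c_plus, r * d, 0.0],
        [0.0, d / r, c_minus, 0.0],
        [-r * d, 0.0, 0.0, c_minus],
    ]

def det_check(trials=100, seed=0):
    """Largest deviation between det and (C+ C- - D^2)^2 over random inputs."""
    rng = random.Random(seed)
    worst = 0.0
    for _ in range(trials):
        c_plus, c_minus, d = (rng.uniform(-2, 2) for _ in range(3))
        r = rng.uniform(0.5, 2.0)
        lhs = det(susy_matrix(c_plus, c_minus, d, r))
        rhs = (c_plus * c_minus - d * d) ** 2
        worst = max(worst, abs(lhs - rhs))
    return worst
```

If this layout is right, any simple zero of C+C− − D² is automatically a double zero of the determinant, independently of the parameter r.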
In the reflection of the kth breather on the nth excited boundary, there is another family
of poles at $-i\theta = \frac{\eta}{\lambda} - \frac{\pi}{2\lambda}(k - 2l + 1)$, l = 0, . . . , n − 1, [21] that in the supersymmetric
case should correspond to intermediate states of the form
$$\left|b, \tfrac12, a, \tfrac12\right\rangle |l, k - l, n\rangle.$$
At these poles, the number of intermediate states is 4 (a, b = 0, 1) and so we expect that
the determinant of the SUSY factor does not vanish, which indeed turns out to be the case.
7. Conclusions
To start we summarize the results of this paper. We considered the boundary scattering
amplitudes in boundary supersymmetric sine-Gordon theory (BSSG). Imposing the con-
straint of supersymmetry on solutions of the boundary Yang–Baxter equation, we found
two consistent sets of amplitudes that describe the reflection of solitons off the boundary in
its ground state. Then we considered the two bootstrap systems built from these fundamen-
tal amplitudes and conjectured the closure of this bootstrap, i.e., the set of boundary states
and the reflection factors on them. We also derived a relation between the boundary super-
charge and the Hamiltonian and checked that this relation holds for the bootstrap solutions.
Although the reflection amplitudes are different, the spectrum of states is the same in the
two bootstrap solutions. This common spectrum is characterized partly by a sequence of
integers, just like in the case of the ordinary sine-Gordon model [21], but also by an RSOS
sequence of length k + 1 (if the length of the integer sequence is k) starting from 1/2. The
energy of the state depends only on the integer labels, the different RSOS sequences cor-
respond to degenerate states. It is interesting to note that the nonsupersymmetric boundary
spectrum allows for a tensor product type supersymmetrization, and no further constraints
are obtained in accord with the bulk case [11].
In the case of the BSSG+ theory, the reflection amplitudes depend on two parameters η
and ϑ, that are inherited from the bosonic reflection factors and were originally introduced
in [18]. In the bosonic case it is known how these parameters are related to the parameters
of the boundary Lagrangian in the perturbed CFT formalism [25]. Besides that, the SUSY
algebra introduces a further parameter γ , which is related to the energy of the boundary
ground state and so must be a function of the parameters of the BSSG Lagrangian. In the
BSSG− theory the difference is that γ appears also in the expression for the reflection fac-
tors themselves. In the bosonic case the expression for the boundary energy in terms of La-
grangian parameters is also known [25]. The existence of two different families of solutions
and the number of parameters are in accordance with the expectations that they describe
the scattering in the Lagrangian theories corresponding to the boundary interaction (6) [4].
It is a very interesting and important issue to connect the bootstrap parameters η, ϑ
and the vacuum energy parameter γ to the parameters of the Lagrangian description for
the supersymmetric case as well. In the case of the nonsupersymmetric boundary sine-
Gordon theory this was achieved by considering it as a combined bulk and boundary
perturbation of a c = 1 free massless boson with Neumann boundary condition. However,
even the interpretation of the bulk SSG theory as a perturbed CFT is nontrivial, and we are
investigating this problem. We are also working on getting more evidence to link the bulk
S matrix and the reflection factors to the Lagrangian theory. Work is in progress in these
directions and we hope to report on the results in the very near future.
Acknowledgements
The authors would like to thank G.M.T. Watts for very useful discussions and
R.I. Nepomechie for comments on the manuscript. G.T. thanks the Hungarian Ministry
of Education for a Magyary Postdoctoral Fellowship, while B.Z. acknowledges partial
support from a Bolyai János Research Fellowship. This research was supported in part by
the Hungarian Ministry of Education under FKFP 0043/2001 and the Hungarian National
Science Fund (OTKA) grants T037674/02 and T34299/01.
References
[1] S. Ferrara, L. Girardello, S. Sciuto, An infinite set of conservation laws of the supersymmetric sine-Gordon
theory, Phys. Lett. B 76 (1978) 303.
[2] C. Ahn, Complete S-matrices of supersymmetric sine-Gordon theory and perturbed superconformal minimal
model, Nucl. Phys. B 354 (1991) 57–84.
[3] T. Inami, S. Odake, Y.-Z. Zhang, Supersymmetric extension of the sine-Gordon theory with integrable
boundary interactions, Phys. Lett. B 359 (1995) 118–124, hep-th/9506157.
[4] R.I. Nepomechie, The boundary supersymmetric sine-Gordon model revisited, Phys. Lett. B 509 (2001)
183–188, hep-th/0103029.
[5] C. Ahn, W.M. Koo, Exact boundary S matrices of the supersymmetric sine-Gordon theory on a half line,
J. Phys. A 29 (1996) 5845–5854, hep-th/9509056.
[6] C. Ahn, R.I. Nepomechie, Exact solution of the supersymmetric sinh-Gordon model with boundary, Nucl.
Phys. B 586 (2000) 611–640, hep-th/0005170.
[7] R.I. Nepomechie, Supersymmetry in the boundary tricritical Ising field theory, preprint UMTG-234 hep-
th/0203123.
[8] A.B. Zamolodchikov, Fractional-spin integrals of motion in perturbed conformal field theory, in: H. Guo,
Z. Qiu, H. Tye (Eds.), Fields, Strings and Quantum Gravity, Gordon and Breach, 1989.
[9] R. Shankar, E. Witten, The S matrix of the supersymmetric nonlinear sigma model, Phys. Rev. D 17 (1978)
2134.
[10] K. Schoutens, Supersymmetry and factorizing scattering, Nucl. Phys. B 344 (1990) 665–695.
[11] T.J. Hollowood, E. Mavrikis, The N = 1 supersymmetric bootstrap and Lie algebras, Nucl. Phys. B 484
(1997) 631–652, hep-th/9606116.
[12] G. Takács, A new RSOS restriction of the Zhiber–Mikhailov–Shabat model and Φ(1, 5) perturbations of
nonunitary minimal models, Nucl. Phys. B 489 (1997) 532–556, hep-th/9604098.
[13] C. Ahn, R.I. Nepomechie, The scaling supersymmetric Yang–Lee model with boundary, Nucl. Phys. B 594
(2001) 660–684, hep-th/0009250.
[14] E. Witten, D.I. Olive, Supersymmetry algebras that include topological charges, Phys. Lett. B 78 (1978) 97.
[15] Z. Bajnok, L. Palla, G. Takács, F. Wágner, The k-folded sine-Gordon model in finite volume, Nucl. Phys.
B 587 (2000) 585–618, hep-th/0004181.
[16] N. Graham, R.L. Jaffe, Energy, central charge, and the BPS bound for (1 + 1)-dimensional supersymmetric
solitons, Nucl. Phys. B 544 (1999) 432–447, hep-th/9808140;
M.A. Shifman, A.I. Vainshtein, M.B. Voloshin, Anomaly and quantum corrections to solitons in two-
dimensional theories with minimal supersymmetry, Phys. Rev. D 59 (1999) 045016, hep-th/9810068;
A. Litvintsev, P. van Nieuwenhuizen, Once more on the BPS bound for the SUSY kink, preprint YITP-00-18
hep-th/0010051;
A. Rebhan, P. van Nieuwenhuizen, R. Wimmer, The anomaly in the central charge of the supersymmetric
kink from dimensional regularization and reduction, preprint TUW-02-16, YITP-02-35, hep-th/0207051.
[17] R. Chatterjee, A.B. Zamolodchikov, Local magnetization in critical Ising model with boundary magnetic
field, preprint RU-93-54, hep-th/9311165.
[18] S. Ghoshal, A.B. Zamolodchikov, Boundary S matrix and boundary state in two-dimensional integrable
quantum field theory, Int. J. Mod. Phys. A 9 (1994) 3841–3886;
S. Ghoshal, A.B. Zamolodchikov, Boundary S matrix and boundary state in two-dimensional integrable
quantum field theory, Int. J. Mod. Phys. A 9 (1994) 4353, hep-th/9306002, Erratum.
[19] S. Ghoshal, Bound state boundary S matrix of the sine-Gordon model, Int. J. Mod. Phys. A 9 (1994) 4801,
hep-th/9310188;
A. Fring, R. Köberle, Factorized scattering in the presence of reflecting boundaries, Nucl. Phys. B 421
(1994) 159, hep-th/9304141.
[20] P. Mattsson, P. Dorey, Boundary spectrum in the sine-Gordon model with Dirichlet boundary conditions, J.
Phys. A 33 (2000) 9065–9094, hep-th/0008071;
P. Mattsson, Integrable Quantum Field Theories in the Bulk and with a Boundary, PhD thesis, hep-
th/0111261.
[21] Z. Bajnok, L. Palla, G. Takács, G.Z. Tóth, The spectrum of boundary states in sine-Gordon model with
integrable boundary conditions, Nucl. Phys. B 622 (2002) 548–564, hep-th/0106070.
[22] L. Chim, Boundary S matrix for the tricritical Ising model, Int. J. Mod. Phys. A 11 (1996) 4491–4512,
hep-th/9510008.
[23] C. Ahn, W.M. Koo, Supersymmetric sine-Gordon model and the eight-vertex free fermion model with
boundary, Nucl. Phys. B 482 (1996) 675, hep-th/9606003.
[24] M. Moriconi, K. Schoutens, Reflection matrices for integrable N = 1 supersymmetric theories, Nucl. Phys.
B 487 (1997) 756–778, hep-th/9605219.
[25] Al.B. Zamolodchikov, unpublished;
Z. Bajnok, L. Palla, G. Takács, Finite size effects in boundary sine-Gordon theory, Nucl. Phys. B 622 (2002)
565–592, hep-th/0108157.
Nuclear Physics B 644 [FS] (2002) 533–567
www.elsevier.com/locate/npe
Abstract
We numerically investigate the fractal structure of two-dimensional quantum gravity coupled to
matter with central charge c for −2 ≤ c ≤ 1. We reformulate the Q-state Potts model as a model that
can be identified with a weighted percolation cluster model and that allows Q, and hence c, to be
varied continuously on the dynamically triangulated lattice. The c-dependence of the critical coupling
is measured from the percolation probability and susceptibility. The c-dependence of the string
susceptibility of the quantum surface is evaluated and agrees very well with the theoretical
predictions. The c-dependence of the fractal dimension, based on the finite size scaling hypothesis, is
measured and agrees excellently with one of the previously proposed theoretical predictions,
except for the region near c ≈ 1.
© 2002 Elsevier Science B.V. All rights reserved.
1. Introduction
Perhaps one of the most vexing questions is: “what observable of quantum gravity can be
used to check whether the quantum theory of gravity is well defined?” In two-dimensional
quantum gravity the answer to this question is at hand: “the fractal dimension
of the quantum surface is a well-defined observable which can be calculated both analytically
and numerically.”
The importance of the fractal nature of two-dimensional quantum gravity was first
recognized by KPZ [1], where the critical exponent was understood to represent the fractal
structure of the two-dimensional quantum surface. It was, however, recognized later
534 N. Kawamoto, K. Yotsuji / Nuclear Physics B 644 [FS] (2002) 533–567
that a more direct observable for detecting the quantum fractal nature of space-time would be the
Hausdorff dimension, or fractal dimension [2,3].
Serious numerical study of the fractal nature of the two-dimensional quantum
surface was initiated by Agishtein and Migdal [4]. They attempted a direct
measurement of the fractal dimension of the two-dimensional quantum surface, proposing
a recursive sampling algorithm for the c = 0 model, but could not observe the fractal nature of
quantum gravity in two dimensions: the triangulations were not large enough
to reveal the fractal nature of the quantum surface of the c = 0 model. The first numerical
confirmation of the fractal structure of two-dimensional quantum gravity was carried out
by Kawamoto, Kazakov, Saeki and Watabiki using the recursive sampling algorithm for
the c = −2 model [5]. In these numerical analyses large lattice triangulations (up to
5 million triangles) were necessary to confirm the fractal nature by direct measurement
of the fractal dimension. This numerical confirmation triggered a wide variety of numerical
and analytic investigations of the fractal nature of two-dimensional quantum gravity.
There are three analytic derivations of the c-dependence of the Hausdorff dimension, or
fractal dimension, of two-dimensional quantum gravity coupled to matter with central charge c:
by Distler, Hlousek and Kawai [2], by Kawai and Ninomiya [3], and later by Watabiki,
Kawamoto and Saeki [6]. It was pointed out that the measured fractal dimension
of the c = −2 model is very close to the third formula, given by Watabiki et al. [6]. In the
meantime Kawai, Kawamoto, Mogami and Watabiki tried to understand the fractal nature
of quantum gravity from the matrix model point of view and succeeded in deriving the
transfer matrix of the quantum surface of two-dimensional pure gravity (c = 0) [7]. This
formulation made it possible to show that the fractal dimension of pure gravity is exactly
$d_F = 4$, which is consistent with the values of the first and third formulae. This analytic
investigation of the random surface triggered further analytic and numerical investigations
of two-dimensional gravity [8–11].
The baby universe idea was proposed and proved useful for calculating the string
susceptibility numerically [12,13]. A finite size scaling hypothesis was proposed later,
inspired by the analytic derivation of the two-point function of pure gravity [14].
The finite size scaling hypothesis made it possible to measure the fractal dimension very
accurately. Using this formulation we made a systematic and highly reliable numerical
measurement of the fractal dimension for the c = −2 model [15,16]. The accurately
measured fractal dimension of the c = −2 model agreed with the theoretical value of
the third formula: $d_F = 3.56 \pm 0.04$ (numerical) versus 3.561… (theoretical).
So far the numerical investigations of the fractal dimension have been carried out mainly
for the c = −2 and c = 0 models, for several unitary series of conformal field theory [17],
for several values in the region c > 1 [18], and for c = −5 [19].
In this article we carry out a systematic investigation of the c-dependence
of the fractal dimension of the two-dimensional quantum surface. For c = −2, 0 analytic
formulae were available with the help of the matrix model, while for other, continuous values
of the matter central charge it was not obvious how to formulate the models in a way convenient
for the numerical study of the fractal dimension. It has, however, been known that a
continuous central charge dependence can be accommodated by the Q-state Potts model on the
flat lattice. In this paper we reformulate the Q-state Potts model into the model which
The fractal nature of two-dimensional quantum gravity was first recognized by KPZ
[1]. It was, however, not clear in the beginning what kind of fractal was meant. Serious
analytic studies of the fractal dimension of quantum gravity were given by Distler,
Hlousek and Kawai [2], by Kawai and Ninomiya [3], and later by Watabiki, Kawamoto
and Saeki [6]. Here we summarize the analytic derivation of the fractal dimension.
In the derivation of the fractal dimension of two-dimensional quantum gravity coupled
to matter with central charge c, we use the formulation of Liouville theory, in particular the
conformal gauge formulation given by DDK [24]. We first summarize the main results
of the formulation.
Formally the continuum partition function for matter coupled to two-dimensional
gravity is given by
$$Z(A) = \int \mathcal{D}g\; \delta\!\left(\int d^2x \sqrt{g} - A\right) Z_M[g], \tag{2.1}$$
where $Z_M[g]$ is the matter part of the partition function in a gravitational background and
A is the total area.
The regularized counterpart of the above partition function by dynamical triangulation is
$$Z_{\rm reg}(A) = \sum_{G} Z_M[G]\,\delta_{Na^2,A} \sim Z_M[G_0], \tag{2.2}$$
where N is the number of equilateral triangles and $a^2$ is the area of a triangle. G denotes
a triangulation and $G_0$ is the typical triangulation which we select from the huge set of
triangulations. The last approximate equality in Eq. (2.2) is valid up to the normalization
factor and provided the selection of the typical surface is carried out by a correct procedure. Since
the path integration of the metric is carried out after the selection of the typical surface, $G_0$
effectively carries the information of the quantum fluctuation of spacetime.
Following David, Distler and Kawai (DDK) [24], we obtain the gauge fixed version of
two-dimensional gravity coupled to matter with central charge c in the conformal gauge
$g_{\mu\nu} = \hat g_{\mu\nu} e^{\phi}$:
$$Z(A) = \int \mathcal{D}_{\hat g}\phi\; \Delta_{FP}[\hat g]\, Z_M[\hat g]\; \delta\!\left(\int d^2x \sqrt{\hat g}\, e^{\alpha_1\phi} - A\right) \exp\!\left(\frac{c-25}{48\pi}\, S_L[\phi,\hat g]\right), \tag{2.3}$$
where $\Delta_{FP}$ is the Faddeev–Popov contribution. $S_L[\phi,\hat g]$ is the Liouville term given by
$$S_L[\phi,\hat g] = \int d^2x \sqrt{\hat g}\left(\frac12\, \hat g^{\mu\nu}\partial_\mu\phi\,\partial_\nu\phi + \phi R\right), \tag{2.4}$$
where we set the renormalized cosmological constant equal to zero for simplicity. The $\alpha_1$
appearing in Eq. (2.3) is obtained from the following general formula:
$$\alpha_n = \frac{25 - c - \sqrt{(25 - c)(25 - c - 24n)}}{12}. \tag{2.5}$$
DDK have shown that a primary conformal field of weight n − 1 can be made Weyl
invariant at the quantum level with a quantum correction:
$$\int d^2x \sqrt{\hat g}\, e^{\alpha_n\phi}\, \Phi_n, \tag{2.6}$$
where $\Phi_n$ transforms as $\Phi_n|_{\hat g e^{\sigma}} = e^{(n-1)\sigma}\, \Phi_n|_{\hat g}$. The term $e^{\alpha_1\phi}$ in the delta function of Eq.
(2.3) is needed to keep the world sheet volume fixed to A at the quantum level.
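Let us define the expectation value of an observable O(g) by
$$\langle O(g)\rangle_A = Z(A)^{-1}\int \mathcal{D}_{\hat g}\phi\; \Delta_{FP}[\hat g]\, Z_M[\hat g]\; \delta\!\left(\int d^2x \sqrt{\hat g}\, e^{\alpha_1\phi} - A\right) O(\hat g,\phi)\, \exp\!\left(\frac{c-25}{48\pi}\, S_L[\phi,\hat g]\right). \tag{2.7}$$
Here we define two types of critical exponents which characterize the fractal nature of the
two-dimensional random surface.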
Let us first define the intrinsic area A(r) of the random surface as a function of the mean
square size $\langle r^2\rangle$ of the world sheet as viewed in the embedding space. We define
the Hausdorff dimension $d_H$ as
$$A(r) = \left(\sqrt{\langle r^2\rangle}\right)^{d_H}. \tag{2.8}$$
It should be noted that this definition of the Hausdorff dimension refers to the embedding space.
Let us next define N(r) as the number of lattice points or number of triangles within r
steps of a marked site. A step on the original lattice is one link step of a triangle, while
a step on the dual lattice is one dual link (edge) step. We define the fractal
dimension $d_F$ as
$$N(r) = r^{d_F}. \tag{2.9}$$
This definition of the fractal dimension characterizes the connectivity of the random
surface, how the random surface is composed by the connection of triangles, and thus
could be called connectivity dimension.
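The counting in Eq. (2.9) can be illustrated with a short Python sketch: a breadth-first search collects the sites within r link steps of a marked site, and the local slope of log N(r) against log r gives an estimate of $d_F$. The flat square lattice used here is only a stand-in chosen because its answer is known; the helper names `ball_sizes`, `square_lattice` and `slope` are hypothetical and not taken from the paper.

```python
from collections import deque
from math import log

def ball_sizes(neighbors, start, r_max):
    """N(r) for r = 0..r_max: number of sites within r link steps of start."""
    dist = {start: 0}
    queue = deque([start])
    while queue:
        site = queue.popleft()
        if dist[site] == r_max:
            continue  # do not explore beyond the largest radius
        for nxt in neighbors(site):
            if nxt not in dist:
                dist[nxt] = dist[site] + 1
                queue.append(nxt)
    counts = [0] * (r_max + 1)
    for d in dist.values():
        counts[d] += 1
    sizes, running = [], 0
    for c in counts:
        running += c
        sizes.append(running)
    return sizes

def square_lattice(site):
    """Neighbours on an infinite flat square lattice (illustrative stand-in)."""
    x, y = site
    return [(x + 1, y), (x - 1, y), (x, y + 1), (x, y - 1)]

def slope(sizes, r1, r2):
    """Estimate d_F from N(r) ~ r^d_F between two radii."""
    return log(sizes[r2] / sizes[r1]) / log(r2 / r1)

sizes = ball_sizes(square_lattice, (0, 0), 40)
estimate = slope(sizes, 20, 40)
```

On the flat lattice the estimate approaches 2 as the radii grow. On a dynamically triangulated surface the same counting would be applied with `neighbors` returning the sites linked to a given site.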
Distler, Hlousek and Kawai [2] evaluated the mean square size of the quantum surface
embedded in a D-dimensional space by calculating the two-point Green’s function of
vertex operator
1 2 ∂
x A = 2 2 ln d 2 x1 ĝ(x1 ) d 2 x2 ĝ(x2 ) eik(X(x1)−X(x2 ))
D ∂k A k=0
1 2
= 2 d x1 ĝ(x1 ) d x2 ĝ(x2 ) X(x1 ) − X(x2 )
2 2
A Z(A) A
|γs |
A
= (A → ∞), (2.10)
A0
where $\gamma_s$ is the string susceptibility given by
$$\gamma_s = \frac{D - 1 - \sqrt{(25 - D)(1 - D)}}{12}. \tag{2.11}$$
We can then obtain the first formula for the Hausdorff dimension as a function of D,
$$d_H^{(1)} = \frac{2}{|\gamma_s|}, \tag{2.12}$$
where D could later be identified with the matter central charge c in two dimensions.
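As a quick numerical illustration of Eqs. (2.11) and (2.12), the sketch below evaluates the string susceptibility and the first formula for a given D. The function names are hypothetical, and identifying D with c is the assumption stated in the text.

```python
from math import sqrt

def string_susceptibility(D):
    """gamma_s of Eq. (2.11); real only for D <= 1."""
    return (D - 1 - sqrt((25 - D) * (1 - D))) / 12

def hausdorff_dim_1(D):
    """First formula, Eq. (2.12): d_H = 2 / |gamma_s|."""
    return 2 / abs(string_susceptibility(D))

gamma_pure = string_susceptibility(0)   # pure gravity
d_pure = hausdorff_dim_1(0)
```

For D = 0 this gives $\gamma_s = -1/2$ and $d_H^{(1)} = 4$, while for D = −2 it gives $\gamma_s = -1$ and $d_H^{(1)} = 2$. At D = 1 the susceptibility vanishes and the expression diverges.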
Let us next consider a derivation of the fractal dimension using a fermion as a test particle in
the gravitational background, following Kawai and Ninomiya [3]. The Lagrangian for
a matter fermion coupled to gravity can be given by
$$L = -\frac{1}{16\pi}\sqrt{g}\,R + \Lambda\sqrt{g} + e\bar\psi i\not{D}\psi - m e\bar\psi\psi, \tag{2.13}$$
where Λ and e are, respectively, the cosmological constant and the determinant of the vielbein
in D dimensions. In D = 2 + ε dimensions gravitational quantum corrections can be
evaluated by the ε-expansion formulation [25]. Then the fermion mass term is expected
to acquire an anomalous dimension via wave function renormalization of the matter fermion.
Here we try to identify the anomalous dimension of the fermion mass term by the use
of the DDK formulation of Liouville theory. Under the scaling of the cosmological constant
$$\Lambda \to \beta^{-D}\Lambda, \tag{2.14}$$
the change in the Lagrangian can be absorbed by the field redefinition
$$g_{\mu\nu} \to \beta^{2} g_{\mu\nu}. \tag{2.15}$$
Since the vielbein transforms like the square root of the metric, the fermion kinetic term transforms
as $e\bar\psi i\not{D}\psi \to \beta^{D-1} e\bar\psi i\not{D}\psi$. Then the scaling parameter can be absorbed by the field
redefinition $\psi \to \beta^{(1-D)/2}\psi$. The fermion mass term changes as $m e\bar\psi\psi \to \beta m e\bar\psi\psi$, in
particular $\bar\psi\psi \to \beta^{-1}\bar\psi\psi$ in D = 2.
In two dimensions the fermion mass term $m\int d^2x\, e\bar\psi\psi$ is expected to have the form
of Eq. (2.6) after the introduction of the gravitational quantum correction. Since $\sqrt{g}$ and
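e acquire the same scale change, $\bar\psi\psi$ can be identified as the primary conformal field $\Phi_{1/2}$
because of the same scale change: $\Phi_{1/2}|_{\hat g\beta^2} = \beta^{-1}\Phi_{1/2}|_{\hat g}$ and $\bar\psi\psi \to \beta^{-1}\bar\psi\psi$. Then the
Weyl invariant fermion mass term with the quantum correction is given by
$$m\int d^2x\, \hat e\, e^{\alpha_{1/2}\phi}\, \bar\psi\psi. \tag{2.16}$$
We now consider the following quantum average of the fermion mass term:
$$\left\langle m\int d^2x\, \hat e\, e^{\alpha_{1/2}\phi}\, \bar\psi\psi\right\rangle_A, \tag{2.17}$$
where the quantum average $\langle\cdots\rangle_A$ is defined in Eq. (2.7). Under the constant shift of the
conformal field $\phi \to \phi - 2\ln\beta/\alpha_1$, the quantum average should be unchanged, and yet the
following relation holds:
$$\left\langle m\int d^2x\, \hat e\, e^{\alpha_{1/2}\phi}\, \bar\psi\psi\right\rangle_A = m\beta^{-2\alpha_{1/2}/\alpha_1}\left\langle \int d^2x\, \hat e\, e^{\alpha_{1/2}\phi}\, \bar\psi\psi\right\rangle_{\beta^2 A}, \tag{2.18}$$
where the following change of the delta function is taken into account:
$$\delta\!\left(\int d^2x \sqrt{\hat g}\, e^{\alpha_1\phi} - A\right) \to \beta^{2}\,\delta\!\left(\int d^2x \sqrt{\hat g}\, e^{\alpha_1\phi} - \beta^{2}A\right).$$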
This relation suggests that the theories with the two different sets of parameters (A, m) and
$(\beta^2 A, m\beta^{-2\alpha_{1/2}/\alpha_1})$ are equivalent. The two-dimensional volume measured by the length
scale of the fermion field leads to another definition of the fractal dimension $d_F^{(2)}$ [3],
$$A \sim L^2 = \left(L^{2\alpha_{1/2}/\alpha_1}\right)^{d_F^{(2)}}, \tag{2.19}$$
where
$$d_F^{(2)}(c) = \frac{\alpha_1}{\alpha_{1/2}} = 2\times\frac{\sqrt{25 - c} + \sqrt{13 - c}}{\sqrt{25 - c} + \sqrt{1 - c}}, \tag{2.20}$$
with $\alpha_n$ given by the formula (2.5).
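The equality of the two expressions in Eq. (2.20) follows from Eq. (2.5) by rationalizing the numerator, and it can be checked numerically. The Python sketch below is only an illustration; `alpha`, `fractal_dim_2` and `fractal_dim_2_closed` are hypothetical names.

```python
from math import sqrt, isclose

def alpha(n, c):
    """alpha_n of Eq. (2.5); real for c <= 1 when n <= 1."""
    return (25 - c - sqrt((25 - c) * (25 - c - 24 * n))) / 12

def fractal_dim_2(c):
    """Second formula as the ratio alpha_1 / alpha_{1/2}."""
    return alpha(1, c) / alpha(0.5, c)

def fractal_dim_2_closed(c):
    """Closed form on the right-hand side of Eq. (2.20)."""
    return 2 * (sqrt(25 - c) + sqrt(13 - c)) / (sqrt(25 - c) + sqrt(1 - c))

agreement = all(
    isclose(fractal_dim_2(c), fractal_dim_2_closed(c), rel_tol=1e-12)
    for c in (-2, -1, 0, 0.5, 1)
)
```

For c = 0 both expressions evaluate to approximately 2.87.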
Here we provide yet another derivation of the fractal dimension by using the solution of
the diffusion equation on the random surface.
We define the Laplacian on the dynamically triangulated lattice. We first define the
adjacency matrix $K_{ij}$ on the dynamically triangulated lattice. For a chosen typical surface
$G_0$ we number the sites of the triangulated lattice. Then the (i, j) component of the
adjacency matrix is defined as: $K_{ij} = 1$ if the ith site and the jth site are connected by a link,
$K_{ij} = 0$ if they are not connected by a link. It is interesting to note that the $(n, n_0)$ component
of $K^T$ counts the number of possible random walks reaching a site n from a marked site $n_0$
after T steps. The Laplacian defined on the dynamically triangulated lattice is
given by
$$\Delta_L = 1 - S, \qquad S_{ij} = \frac{1}{q_j} K_{ij}, \tag{2.21}$$
where $q_j$ is called the coordination number and denotes the number of links connected to the
site j. $S_{ij}$ is thus the probability of a one-step random walk from the site j to the neighboring
site i. The diffusion equation on a triangulated surface $G_0$ with N triangles is given by
$$\frac{1}{a_i^2}\left[\Psi_A^{(G_i)}(T + a_i^2; x, x_0) - \Psi_A^{(G_i)}(T; x, x_0)\right] = \frac{1}{a_i^2}\,\Delta_L(G_i)\,\Psi_A^{(G_i)}(T; x, x_0), \tag{2.23}$$
where the location of the site x is measured with respect to the lattice constant ai . Thus
we identify the dimension of T as that of area: dim[T ] = dim[A]. In the continuum limit
the solution of the diffusion equation (2.23) is expected to approach the continuum wave
function: $\Psi_A^{(G_i)}(T; x, x_0) \to \Psi_A^{(G_\infty)}(T; x, x_0)$. Numerically we approximate the limiting
surface as the typical surface ($G_0$) of the maximum size triangulation: $G_\infty \simeq G_0$. As
we have already noted in Eq. (2.2), the metric integration is effectively carried out for
Eq. (2.23) since we have chosen a typical surface. This means that the quantum effect
is included for the wave function of Eq. (2.23). On the other hand the solution of the
continuum counterpart of the diffusion equation: ∂τ Ψ (τ ; x, x0) = ∆(g)Ψ (τ ; x, x0) is still
background metric dependent in general. Furthermore the dimensions of T and τ may not
necessarily be equal.
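The lattice objects entering this construction can be made concrete with a small Python sketch. It builds the adjacency matrix $K_{ij}$ and the one-step matrix $S_{ij} = K_{ij}/q_j$ of Eq. (2.21) for a toy triangulation, and then applies the matrices repeatedly to a distribution concentrated on a marked site. The tetrahedron (four sites, all linked) is an assumption made purely for illustration, as are the function names.

```python
def adjacency(n_sites, links):
    """K_ij = 1 if sites i and j are connected by a link, else 0."""
    k = [[0] * n_sites for _ in range(n_sites)]
    for i, j in links:
        k[i][j] = k[j][i] = 1
    return k

def step_matrix(k):
    """S_ij = K_ij / q_j, with q_j the coordination number of site j."""
    n = len(k)
    q = [sum(k[i][j] for i in range(n)) for j in range(n)]
    return [[k[i][j] / q[j] for j in range(n)] for i in range(n)]

def apply(matrix, vec):
    """Matrix-vector product."""
    return [sum(m * v for m, v in zip(row, vec)) for row in matrix]

def walk(matrix, start, steps):
    """Apply `matrix` `steps` times to a unit vector on site `start`."""
    vec = [0.0] * len(matrix)
    vec[start] = 1.0
    for _ in range(steps):
        vec = apply(matrix, vec)
    return vec

# Toy triangulation of the sphere: a tetrahedron, every pair of sites linked.
links = [(0, 1), (0, 2), (0, 3), (1, 2), (1, 3), (2, 3)]
K = adjacency(4, links)
S = step_matrix(K)
n_closed_walks = walk(K, 0, 2)[0]   # (K^T)_{00} for T = 2
p_return = walk(S, 0, 2)[0]         # return probability after 2 steps
```

With K the entries count walks, so `n_closed_walks` equals the coordination number of the marked site. With S the columns sum to one, so the total probability is conserved and `p_return` is the comeback probability after two steps.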
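Let us now define the comeback probability of the random walk on the triangulated lattice
and relate it to the continuum expression of Liouville theory as follows:
$$G(T) \equiv \Psi_A^{(G_0)}(T; x = x_0) \simeq \frac{\left\langle\int d^2x \sqrt{g}\, \Psi(\tau; x = x_0)\right\rangle_A}{\left\langle\int d^2x \sqrt{g}\right\rangle_A} = \frac{1}{A}\left\langle\int d^2x \sqrt{g}\, e^{\tau\Delta}\, \Psi(0; x = x_0)\right\rangle_A \sim \frac{1}{A} \sim \frac{1}{T}, \tag{2.24}$$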
where $\langle\cdots\rangle_A$ is the quantum average defined in Eq. (2.7) and the last similarity relations
are dimensional relations. Recall that the metric integration is
effectively carried out, since we have chosen the typical surface $G_0$ for the wave function of
the comeback probability. The initial wave function can be formally written as
$\Psi(0; x = x_0) = \lim_{x\to x_0} \delta_\epsilon(x - x_0)\,/\sqrt{g}$, where a regularized delta function is needed, such as
$\delta_\epsilon(x - x_0) = (1/\pi)\times\epsilon/((x - x_0)^2 + \epsilon^2)$.
We next consider how to accommodate Weyl invariance in the diffusion equation
of the random walk at the quantum level by using the formulation of Liouville theory. Let us
consider the following quantity in Liouville theory:
$$\left\langle\int d^2x \sqrt{g}\, \Psi(\tau; x = x_0)\right\rangle_A = \left\langle\int d^2x \sqrt{g}\, \Psi(0; x = x_0)\right\rangle_A + \tau\left\langle\int d^2x \sqrt{g}\, \Delta\Psi(0; x = x_0)\right\rangle_A + \cdots, \tag{2.25}$$
This relation suggests that the theories with the two different sets of parameters (τ, A) and
$(\beta^{-2\alpha_{-1}/\alpha_1}\tau, \beta^2 A)$ are equivalent. Then we obtain the following dimensional relation:
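We now point out that the expectation value of the mean squared geodesic distance is
evaluated by the standard continuum treatment
$$\int d^2x \sqrt{g}\, \big(r(x, x_0)\big)^2\, \Psi(\tau; x, x_0) = \int d^2x \sqrt{g}\, \big(r(x, x_0)\big)^2 \left[\frac{1}{\sqrt{\hat g}}\,\delta(x - x_0) + \tau\Delta_x \frac{1}{\sqrt{\hat g}}\,\delta(x - x_0) + \cdots\right] = -4\tau + O(\tau^2), \tag{2.29}$$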
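which is related to the quantum version of the mean squared geodesic distance in the small
τ region
$$\langle r^2\rangle \equiv \sum_x \big(r(x, x_0)\big)^2\, \Psi_A^{(G_0)}(T; x, x_0) \simeq \frac{\left\langle\int d^2x \sqrt{g}\int d^2x_0 \sqrt{g}\, \big(r(x, x_0)\big)^2\, \Psi(\tau; x, x_0)\right\rangle_A}{\left\langle\int d^2x \sqrt{g}\right\rangle_A} \sim \tau \sim A^{-\alpha_{-1}/\alpha_1} \sim T^{-\alpha_{-1}/\alpha_1}. \tag{2.30}$$
The last similarity relations are dimensional relations. Here we give the third definition of
the fractal dimension
$$A = \left(\sqrt{\langle r^2\rangle}\right)^{d_F^{(3)}}, \tag{2.31}$$
where
$$d_F^{(3)}(c) = -2\,\frac{\alpha_1}{\alpha_{-1}} = 2\times\frac{\sqrt{25 - c} + \sqrt{49 - c}}{\sqrt{25 - c} + \sqrt{1 - c}}, \tag{2.32}$$
with $\alpha_n$ given by Eq. (2.5).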
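The special values quoted in the text for the third formula can be reproduced directly from Eqs. (2.5) and (2.32). The following Python sketch is only a numerical check; the function names are hypothetical.

```python
from math import sqrt

def alpha(n, c):
    """alpha_n of Eq. (2.5)."""
    return (25 - c - sqrt((25 - c) * (25 - c - 24 * n))) / 12

def fractal_dim_3(c):
    """Third formula, Eq. (2.32): d_F = -2 alpha_1 / alpha_{-1}."""
    return -2 * alpha(1, c) / alpha(-1, c)

d_pure = fractal_dim_3(0)     # pure gravity, c = 0
d_minus2 = fractal_dim_3(-2)  # c = -2 model
```

For c = 0 one has $\alpha_1 = 5/3$ and $\alpha_{-1} = -5/6$, giving $d_F^{(3)} = 4$. For c = −2 one has $\alpha_1 = 3/2$ and $\alpha_{-1} = 3(3 - \sqrt{17})/4$, giving $d_F^{(3)} = (3 + \sqrt{17})/2 \approx 3.5616$.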
Eq. (2.32), we have recognized that the numerical value of the fractal dimension is close to
the theoretical value of the third formula $d_F^{(3)}(-2) = (3 + \sqrt{17})/2 = 3.561\ldots$.
Except for the analytic derivation of the fractal dimension, one obtains the following
analytic relations of comeback probability in (2.24) and mean squared geodesic distance
in (2.30) from the diffusion equation:
which can be numerically checked. These relations were numerically confirmed for c = −2
model in [6].
An alternative analytic investigation of the fractal structure of pure gravity (c = 0)
was carried out by Kawai, Kawamoto, Mogami and Watabiki, who derived the transfer matrix
of the random surface. They found a scaling function ρ(L; D) which
counts the number of boundaries whose boundary lengths lie between L and L + dL
located at geodesic distance D measured from a marked point. It is evaluated by taking the
continuum limit of the transfer matrix and the disk amplitude of dynamical triangulation.
The functional form of ρ(L; D) for the c = 0 model is given by
$$\rho(L; D)\, D^2 = \frac{3}{7\sqrt{\pi}}\left(x^{-5/2} + \frac{1}{2}\, x^{-3/2} + \frac{14}{3}\, x^{1/2}\right) e^{-x}, \tag{2.33}$$
where $x = L/D^2$ is a scaling parameter. The quantity $\rho(L; D)D^2$ for the c = 0 model was
measured numerically and showed excellent agreement with the theoretical scaling function
(2.33) [11]. One of the important results of this analysis is that the fractal dimension of the
c = 0 model turns out to be $d_F = 4$, which is consistent with the first and third formulae.
Based on this theoretical and numerical evidence, the third formula for the fractal
dimension would appear to be the correct formula for the c-dependence of the fractal dimension.
In order to clear up the situation we started a serious, systematic and very accurate numerical
study of the c = −2 model using the finite size scaling hypothesis [15,16]. It was concluded that
the fractal dimension measured in this analysis is $d_F = 3.56 \pm 0.04$, which is
consistent with the theoretical value of the third formula.
The Potts model [20] was defined as a generalization of the Ising model [26]. Fortuin and
Kasteleyn [21] showed that the Q-state Potts model is equivalent to a weighted percolation
cluster model, which we explain in this section. Their construction allows the Potts model
to be generalized to non-integral values of Q. We may call this model the generalized Potts
model or, equivalently, the weighted percolation cluster model. These models were originally
formulated on the square lattice, while we extend the formulation to a dynamically
triangulated lattice.
We define a planar $\varphi^3$-graph G dual to a triangulated lattice. Let a graph $G_N$ have
N vertices which are dual to triangles in the triangulated lattice. With each vertex i, we
associate a spin that can take Q different values $\sigma_i = 1, 2, \ldots, Q$. Two adjacent spins $\sigma_i$
and $\sigma_j$ interact with interaction energy $-J\delta(\sigma_i, \sigma_j)$, where
$$\delta(\sigma_i, \sigma_j) = \begin{cases} 1, & \sigma_i = \sigma_j,\\ 0, & \sigma_i \neq \sigma_j. \end{cases} \tag{3.1}$$
Thus the Hamiltonian is
$$H = -K\sum_{\langle i,j\rangle} \delta(\sigma_i, \sigma_j), \tag{3.2}$$
where $K = J/k_B T$ can be reinterpreted as a coupling constant and the summation runs
over all the pairs of nearest-neighbor vertices $\langle i, j\rangle$. Then the partition function of this
model is given by
$$Z_N(K; G_N) = \sum_{\{\sigma\}\ {\rm on}\ G_N} \exp\left(K\sum_{\langle i,j\rangle} \delta(\sigma_i, \sigma_j)\right), \tag{3.3}$$
where the σ-summation runs over all possible values $\{\sigma_i = 1, 2, \ldots, Q\}$ for the spin
variables $\sigma_1, \sigma_2, \ldots, \sigma_N$ on $G_N$. Thus there are $Q^N$ terms in the summation.
It has been shown [21,27] that the partition function (3.3) can be expressed as a
dichromatic polynomial [28]. In order to see this, let us expand the partition function as
a product of terms associated with nearest-neighbor vertices. This can be worked out by
using the relation $[\delta(\sigma_i, \sigma_j)]^m = \delta(\sigma_i, \sigma_j)$ (m = 1, 2, . . .) as follows
$$Z_N(K; G_N) = \sum_{\{\sigma\}}\prod_{\langle i,j\rangle}\left[1 + v\,\delta(\sigma_i, \sigma_j)\right], \tag{3.4}$$
Fig. 1. A fragment of a cluster configuration on a planar $\varphi^3$-graph. An isolated vertex is regarded as a cluster.
The summation is over all cluster configurations that can be drawn on $G_N$. The expression
(3.5) is a dichromatic polynomial. Note that Q in Eq. (3.5) need not be an integer. We can
allow it to be any positive real number, which provides a useful generalization of the Q-state
Potts model to non-integer real Q. We may call this model the generalized Q-state
Potts model or the weighted percolation cluster model.
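The equivalence between the spin sum (3.3) and a sum over cluster configurations can be checked by brute force on a small graph. The Python sketch below assumes the standard Fortuin–Kasteleyn form of the cluster sum, in which each configuration of b bonds and C clusters carries the weight $v^b Q^C$ with $v = e^K - 1$; this is consistent with Eqs. (3.4), (3.6) and (3.7), but Eq. (3.5) itself is not reproduced in the text above, so the form is an assumption. The complete graph on four vertices (the $\varphi^3$-graph dual to a tetrahedron) and all function names are illustrative choices.

```python
from itertools import product
from math import exp, isclose

def potts_spin_sum(n_vertices, edges, q, k):
    """Eq. (3.3): sum over all q**n_vertices spin configurations."""
    total = 0.0
    for spins in product(range(q), repeat=n_vertices):
        aligned = sum(1 for i, j in edges if spins[i] == spins[j])
        total += exp(k * aligned)
    return total

def count_clusters(n_vertices, bonds):
    """Number of connected clusters; an isolated vertex counts as one."""
    parent = list(range(n_vertices))
    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x
    for i, j in bonds:
        parent[find(i)] = find(j)
    return len({find(x) for x in range(n_vertices)})

def cluster_sum(n_vertices, edges, q, k):
    """Assumed cluster form: sum over bond subsets of v**b * q**C."""
    v = exp(k) - 1.0
    total = 0.0
    for mask in product((0, 1), repeat=len(edges)):
        bonds = [e for e, keep in zip(edges, mask) if keep]
        total += v ** len(bonds) * q ** count_clusters(n_vertices, bonds)
    return total

edges = [(0, 1), (0, 2), (0, 3), (1, 2), (1, 3), (2, 3)]
same = isclose(potts_spin_sum(4, edges, 3, 0.7), cluster_sum(4, edges, 3, 0.7))
```

Only `cluster_sum` accepts a non-integer value of `q`, which is the point of the generalization: the spin sum requires an integer number of states, while the cluster sum does not.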
Let us consider Eq. (3.5) for a few particular values of Q. Firstly, we consider the
model $Z_N^{Q=1}(K)$ formulated on a given triangulated lattice. For the $Q \to 1$ limit, if we
set $v = p/(1-p)$ the partition function is
\[ Z_N^{Q=1}(K) = \sum_{\{\mathrm{cluster}\}} v^{b} = \Bigl( \frac{1}{1-p} \Bigr)^{E} \sum_{\{\mathrm{cluster}\}} p^{b} (1-p)^{E-b}, \tag{3.6} \]
where $E$ is the total number of edges of $G_N$.
Therefore, $Z_N^{Q=1}(K)$ becomes a sum over all possible bond percolation configurations
with the correct weight, where p is the probability of a bond being present [21]. This result
holds in any dimension and on any lattice on which one defines the Potts models. Since the
sum $\sum_{\{\mathrm{cluster}\}}$ in Eq. (3.6) is the total probability and thus equal to 1, this model coupled
to two-dimensional quantum gravity corresponds to the pure gravity model (c = 0).
Next, let us examine the $Q \to 0$ limit. At the critical point of the Q-state Potts
model on the two-dimensional square lattice, it is known that $v \sim Q^{1/2}$ [20]. In general,
on a graph $G_N$ we can assume $v \sim Q^{\alpha}$ in the $Q \to 0$ limit ($0 < \alpha < 1$). Then the partition
function $Z_N^{Q \to 0}(K)$ becomes
\[ Z_N^{Q \to 0}(K) \sim Q^{\alpha N} \sum_{\{\mathrm{cluster}\}} Q^{\alpha l + (1-\alpha) C}, \tag{3.7} \]
where l is the number of independent loops in a cluster configuration. We have used the
Euler relation (See Fig. 1):
b = N + l − C. (3.8)
For 0 < α < 1 the leading terms in Eq. (3.7) in the Q → 0 limit can be obtained by
taking C = 1 (one cluster) and l = 0 (no loops). These dominant configurations are just the
spanning trees of the graph GN [21,29]. Then each spanning tree configuration contributes
to $Z_N^{Q \to 0}(K)$ with an equal weight. Therefore, this model coupled to two-dimensional
quantum gravity is equivalent to the c = −2 scalar fermion model coupled to quantum
gravity [5].
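As a worked step, the exponent in Eq. (3.7) can be traced back to the Euler relation (3.8). The lines below are a sketch that assumes the cluster weight $Q^C v^b$ of Eq. (3.10) together with $v \sim Q^{\alpha}$:

```latex
\begin{align*}
Q^{C} v^{b} &\sim Q^{C + \alpha b}
             = Q^{C + \alpha (N + l - C)}
             = Q^{\alpha N}\, Q^{\alpha l + (1-\alpha) C}, \\
\min_{C \ge 1,\; l \ge 0} \bigl[ \alpha l + (1-\alpha) C \bigr] &= 1 - \alpha
  \quad \text{at } C = 1,\ l = 0 .
\end{align*}
```

Since both $\alpha$ and $1-\alpha$ are positive for $0 < \alpha < 1$, every additional loop or cluster costs a further positive power of Q, so that the spanning trees ($C = 1$, $l = 0$) dominate as $Q \to 0$, each with the same weight $Q^{\alpha N + 1 - \alpha}$.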
Temperley and Lieb [30] used the result of Fortuin and Kasteleyn to prove the
equivalence of the Potts model to the six-vertex or square-ice model, with staggered
polarizations. Baxter, Kelland and Wu (BKW) [27] have later found a very elegant
derivation of the result of Temperley and Lieb. They use a construction known as the BKW
construction which makes many exact results obvious, including the critical temperature,
self-duality and energy at criticality of the Potts model.
Thus the Q-state Potts models are analytically solved and the relations with the
conformal field theories are well known [31,32]. Q is related to central charge c in the
following particular form:
\[ Q = 4 \cos^2 \frac{\pi}{m+1}, \qquad c = 1 - \frac{6}{m(m+1)}. \tag{3.9} \]
The minimal unitary conformal field theories with central charge c between 0 and 1
correspond to integer m; m = 2, 3, 4, . . . . The generalization of Q to any positive real
number corresponds to a continuous change of the central charge c, a generalization from
minimal to non-minimal series of conformal field theories.
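The map from Q to the central charge in Eq. (3.9) is easy to evaluate numerically. The sketch below inverts the first relation for m and inserts it into the second; the function names are hypothetical, and the branch $0 < \pi/(m+1) \le \pi/2$ (so that $m \ge 1$) is an assumption made here to obtain a single-valued map.

```python
import math

def potts_m(Q):
    """Invert Q = 4 cos^2(pi/(m+1)) for m on the branch m >= 1 (assumed)."""
    if not 0.0 <= Q <= 4.0:
        raise ValueError("Eq. (3.9) is used here only for 0 <= Q <= 4")
    if Q == 4.0:
        return math.inf  # m -> infinity corresponds to c = 1
    return math.pi / math.acos(math.sqrt(Q) / 2.0) - 1.0

def central_charge(Q):
    """Central charge c(Q) from Eq. (3.9)."""
    m = potts_m(Q)
    if math.isinf(m):
        return 1.0
    return 1.0 - 6.0 / (m * (m + 1.0))
```

With this branch, Q = 1, 2, 3 and 4 give c = 0, 1/2, 4/5 and 1, and Q → 0 gives c = −2, in line with the pure gravity and spanning tree limits discussed above.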
Within the framework of dynamical triangulations the generalized Potts model, or
equivalently the weighted percolation cluster models, coupled to two-dimensional quantum
gravity is described by the following partition function:
\[ Z_N^{Q}(K) = \sum_{G_N \in \{\phi^3(T_N)\}} \frac{1}{S_{G_N}} \sum_{\{\mathrm{cluster}\}} Q^{C} v^{b}, \tag{3.10} \]
where {ϕ 3 (TN )} denotes the set of ϕ 3 (TN )-graphs dual to triangulations TN of fixed
topology (which we always assume to be sphere) and SGN is a symmetry factor.
\[ P[\{C\}_k] = \frac{Q^{C^{(k)}} v^{b^{(k)}}}{Z_N^{Q}(K; G_N)}, \tag{4.1} \]
where $C^{(k)}$ and $b^{(k)}$ are the number of clusters and the total number of bonds, $b^{(k)} = \sum_{i=1}^{C^{(k)}} b_i^{(k)}$,
for the given cluster configuration $\{C\}_k$, respectively.
We need to define a transition function t[{C}k , {C}l ] for a transition {C}k → {C}l ,
which satisfies ergodicity and the following detailed balance condition:
\[ P[\{C\}_k]\, t[\{C\}_k, \{C\}_l] = P[\{C\}_l]\, t[\{C\}_l, \{C\}_k]. \tag{4.2} \]
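Here we choose to use the Glauber function [33] as the transition function,
\[ t[\{C\}_k, \{C\}_l] = \frac{\delta W_{lk}}{1 + \delta W_{lk}}, \tag{4.3} \]
where
\[ \frac{t[\{C\}_k, \{C\}_l]}{t[\{C\}_l, \{C\}_k]} = \delta W_{lk} = Q^{\delta C_{lk}} v^{\delta b_{lk}} \tag{4.4} \]
with $\delta C_{lk} = C^{(l)} - C^{(k)}$ and $\delta b_{lk} = b^{(l)} - b^{(k)}$.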
For a given cluster configuration {C}k , our updating proceeds as follows:
(1) We randomly pick an edge on the graph $G_N$ and change the edge by the following
procedure, so that the cluster configuration changes into $\{C\}_l$.
(2) We determine the change of the probability weight $\delta W_{lk} = Q^{\delta C_{lk}} v^{\delta b_{lk}}$ caused by
changing the edge, where $\delta b_{lk}$ is the change in the total number of bonds. If the edge
originally has a bond, the bond is removed and thus $\delta b_{lk} = -1$. If the edge originally does not
have a bond, a bond is added to the edge and thus $\delta b_{lk} = +1$. $\delta C_{lk}$ is the corresponding
change in the number of clusters.
(3) Next, we generate a pseudorandom number r uniformly distributed from 0 to 1 and
change the edge according to procedure (2) if and only if
\[ r \le \frac{\delta W_{lk}}{1 + \delta W_{lk}}. \tag{4.5} \]
This procedure ensures the transition $\{C\}_k \to \{C\}_l$ with the correct probability. We
use the Glauber function for the transition function because of its faster convergence to
the equilibrium distribution in our model.
(4) Return to (1) unless the system is sufficiently equilibrated.
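The steps (1)–(3) above can be sketched as follows. This is an illustration rather than the authors' implementation: the data layout (a list of edges and a set of bonded edge indices) and all function names are assumptions, and the connectedness test is the straightforward search, not the faster pointer-based algorithm of [34].

```python
import random

def connected(edges, bonds, a, b, skip):
    """Depth-first search along bonds (ignoring edge `skip`) from a towards b."""
    neighbours = {}
    for j in bonds:
        if j != skip:
            x, y = edges[j]
            neighbours.setdefault(x, []).append(y)
            neighbours.setdefault(y, []).append(x)
    seen, stack = {a}, [a]
    while stack:
        x = stack.pop()
        if x == b:
            return True
        for y in neighbours.get(x, ()):
            if y not in seen:
                seen.add(y)
                stack.append(y)
    return False

def delta_weight(edges, bonds, k, Q, v):
    """Return (db, dC, dW) for toggling the bond on edge k, cf. Eq. (4.4)."""
    a, b = edges[k]
    linked = connected(edges, bonds, a, b, skip=k)
    if k in bonds:                      # bond removal
        db, dC = -1, (0 if linked else +1)
    else:                               # bond addition
        db, dC = +1, (0 if linked else -1)
    return db, dC, Q ** dC * v ** db

def glauber_step(edges, bonds, Q, v, rng=random):
    """One update: accept the toggle with probability dW / (1 + dW), Eq. (4.5)."""
    k = rng.randrange(len(edges))
    db, dC, dW = delta_weight(edges, bonds, k, Q, v)
    if rng.random() <= dW / (1.0 + dW):
        if db == +1:
            bonds.add(k)
        else:
            bonds.remove(k)
        return True
    return False
```

Rebuilding the neighbour lists at every step costs a time proportional to the number of bonds, which is acceptable only for a small illustration.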
Fig. 2. Cluster configurations: (a), (b), (c), (d). Dotted lines are edges without a bond while solid lines are edges
with a bond. A and B are vertices in (a) and (b) while C and D are vertices in (c) and (d). Li denotes the ith surrounding
loop. The arrows are pointers and compose the segments of the surrounding loops.
the bond is added. For example, we pick the edge A–B with vertices A and B in
Fig. 2(a), or the edge C–D with vertices C and D in Fig. 2(c), where the edges A–B
and C–D do not have a bond. After the bonds A–B and C–D are added, (a) and
(c) of Fig. 2 turn into (b) and (d) of Fig. 2, respectively. In order to find δClk we need to
know whether both vertices of the edge belong to the same cluster before the bond is
added. For example, the vertices A and B in Fig. 2(a) belong to different clusters, while
the vertices C and D in Fig. 2(c) belong to the same cluster. This distinction is numerically
cheap, since we only need the data set recording which numbered vertices belong to the
same cluster. If the two vertices originally belong to
different clusters then δClk = −1, while δClk = 0 if they originally belong to the same
cluster. It is thus numerically not difficult to identify δClk in the case of δblk = +1.
Let us suppose that we pick an edge which already has a bond. Then we remove
the bond with the probability of step (3), and thus δblk = −1 if it is removed. We may
pick cluster configurations (b) and (d) of Fig. 2 as particular examples which already
have a bond at the edges A–B and C–D, respectively, and which turn into (a) and (c) of Fig. 2,
respectively, after the bonds A–B and C–D are removed. In order to find δClk we need to
know whether both vertices of the edge belong to the same cluster
after the bond is removed. The vertices A and B belong to different clusters in (a) and
thus δClk = +1, while the vertices C and D belong to the same cluster in (c) and thus
δClk = 0. The crucial difference between the case δblk = +1 and the case δblk = −1 is that the
data set labeling the clusters is not sufficient to decide whether
two vertices are in the same cluster after the bond is removed. The straightforward
way to determine the connectedness of two vertices is to start from the first vertex and
enumerate all vertices connected to it until either the second vertex is reached or the entire
connected region has been enumerated. This method is adequate if the clusters are small
enough (tens of vertices), but if large clusters are involved (in the vicinity of the critical
point) the CPU time requirements can grow unreasonably. Since the connectedness, a non-local
property, must be determined at every iteration to know δClk , it is important to find a
faster algorithm to evaluate δClk . In order to quickly determine the connectedness of large
clusters, we have implemented an algorithm [34] using an auxiliary data structure based
on the BKW construction [27].
Let us reconsider a ϕ 3 -graph GN , with N vertices, together with its dual triangular lattice
TN . If a bond is present on GN , then its dual bond on TN is absent, and vice versa. The
boundaries between clusters on GN and their dual clusters on TN will form a collection of
closed loops. We call these closed loops the surrounding loops. We then have the Euler
relation,
b + 2C = N + L, (4.6)
which relates the number of clusters and bonds to the number of surrounding loops L.
By saving information of the surrounding loops as a data set, we transform the problem
of determining connectedness of two vertices to the problem of determining whether two
surrounding loop segments are part of the same surrounding loop or not. The surrounding
loops are represented as a chain of pointers in the computer. A pointer is a memory location
containing the address of the next pointer in the chain. There are three surrounding pointers
represented by arrows for each vertex, as is shown in Fig. 3. We have shown surrounding
loops composed of pointers for the panels (a), (b), (c), and (d) of Fig. 2. For example
in Fig. 2(b) pointer P1 points to pointer P2 , P2 points to P3 , etc. In this manner the
loops are represented by chains of pointers. Because of the (differential) Euler relation,
δblk + 2δClk = δLlk , we can determine δClk if we can determine δLlk , the change in the
number of surrounding loops.
Now, let us consider the cases of removing a bond in the process of a Metropolis update
in Fig. 2. The change Fig. 2(b) → Fig. 2(a) illustrates the case in which removing the
bond A–B divides the surrounding loop L1 into two surrounding loops L1 and L2
in Fig. 2(a), while the change Fig. 2(d) → Fig. 2(c) illustrates the case in which two
surrounding loops L4 and L6 in Fig. 2(d) are joined into the loop L4 in Fig. 2(c) if
the bond C–D is removed. In either case, two cuts must be made in the chains of pointers
and the four ends rejoined if the bond is removed. In the case of the A–B bond in Fig. 2(b),
the P1 → P2 and the P4 → P5 connections must be replaced by P1 → P5 and P4 → P2
connections, respectively, in Fig. 2(a). Similarly, in the case of the bond C–D in Fig. 2(d),
the P6 → P7 and the P8 → P9 connections must be replaced by P6 → P9 and P8 → P7
connections, respectively, in Fig. 2(c). From the information on the chains we can
immediately conclude that δClk = +1 (for δblk = −1) in Fig. 2(b), since P1 and P4 are in the same
chain, while δClk = 0 (for δblk = −1) in Fig. 2(d), since P6 and P8 are in different chains.
It is important to note that the connectedness
of the cluster configurations (a) and (c) can be obtained from the chains of
pointers of (b) and (d), respectively. It is numerically much easier to decide whether two pointers
are in the same loop chain.
In summary, in order to find δWlk for the Metropolis algorithm of the generalized Potts
model in Monte Carlo simulation, we first pick an edge and determine δblk depending
on whether the edge has a bond or not. We then cut and rejoin our chains of pointers
according to the addition or removal of the bond on the edge. Using the (differential) Euler
relation, δblk + 2δClk = δLlk , we evaluate δClk by judging whether two pointers attached
to the different vertices of the given edge belong to the same chain or not.
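The cut-and-rejoin step and the use of the differential Euler relation can be illustrated with a toy pointer table. The sketch below is not the auxiliary data structure of [34]: the dictionary `nxt`, which maps each pointer label to the label it points to, and the function names are assumptions made for illustration only.

```python
def same_chain(nxt, p, q):
    """Follow the chain that starts at pointer p; True if q lies on the same loop."""
    cur = nxt[p]
    while cur != p:
        if cur == q:
            return True
        cur = nxt[cur]
    return p == q

def cut_and_rejoin(nxt, p, q):
    """Replace p -> nxt[p], q -> nxt[q] by p -> nxt[q], q -> nxt[p].

    Returns dL: +1 if one surrounding loop is divided into two,
    -1 if two surrounding loops are joined into one.
    """
    dL = +1 if same_chain(nxt, p, q) else -1
    nxt[p], nxt[q] = nxt[q], nxt[p]
    return dL

def delta_clusters(dL, db):
    """Differential Euler relation db + 2 dC = dL, solved for dC."""
    return (dL - db) // 2
```

For a single loop P1 → P2 → P3 → P4 → P5 → P1, calling `cut_and_rejoin(nxt, 1, 4)` produces the two loops P1 → P5 → P1 and P4 → P2 → P3 → P4, so δL = +1 and, for a bond removal (δb = −1), δC = +1, as in the A–B example above.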
When we apply Monte Carlo simulations to quantum gravity coupled to the generalized
Potts model or equivalently the weighted percolation clusters, we have to update the
cluster configuration for a given triangulation and at the same time we have to update the
triangulation for a given cluster configuration. The updating of cluster configurations on a
given triangulation can be carried out as described above. In order to update
triangulations corresponding to the given ϕ 3 -graphs GN , we use the standard flip-flop
algorithm. We first choose two neighboring triangles randomly and flip the common link to
generate a new triangulation. This flip move changes the triangulation locally as in Fig. 4.
This move is enough to make the process ergodic for the chosen class of triangulations;
fixed number of triangles and fixed topology [35].
In our simulations we avoid generating configurations corresponding to tadpole and
self-energy graphs. In other words we consider the class of triangulations satisfying the
following conditions:
Fig. 4. The flip move. This move is ergodic in the class of triangulations with a fixed number of triangles and
fixed topology.
Fig. 5. Bond assignment after a flip move in a fragment of a cluster configuration. The bond is assigned to the
new edge with 50 % probability.
It is a well accepted observation that lattice models with a second order phase transition
lead to corresponding continuum theories at the second order phase transition point.
In particular minimal conformal lattice models lead to the corresponding continuum
conformal field theory models at the critical point in two dimensions. It is well known that
the Q-state Potts models on regular two-dimensional lattice correspond to the field theories
of minimal unitary conformal series at the corresponding critical coupling constant Kc . The
correspondence is given in Eq. (3.9). It is analytically known that the Q-state Potts models
undergo a continuous (second or even higher order) phase transition at the critical point
for $0 \le Q \le 4$, while they have a first order phase transition for $Q > 4$.
It is not known analytically whether the nature of the phase transition changes when
gravity is coupled to the Potts models. For several examples of minimal unitary models, it
is known that the order of the phase transition can be raised when gravity
is coupled [36,37]. In the case of Q = 10 and 200, numerical results provide strong
evidence in favor of continuous transitions [23]. In our simulations we assume that the
Q-state Potts models coupled to gravity represented by Eq. (3.10) have continuous phase
transitions for $0 \le Q \le 4$ even at the non-unitary values of c.
In order to locate the critical coupling by using finite-size scaling method, we investigate
the percolation probability P (K) and the cluster size distribution ns (K) of percolation
theory [38]. Let us briefly summarize the physical meaning of the percolation probability
P (K) and the cluster size distribution ns (K) by studying the simplest Potts model, the Q = 1
case. The partition function of the Q = 1 Potts model $Z_N^{Q=1}(K)$ is given by Eq. (3.6). By the
last expression of Eq. (3.6) we can recognize that p with $v = e^{K} - 1 = p/(1-p)$ can be
identified as the probability of a bond being present at an edge [21]. When K → 0, v → 0
and thus p → 0, i.e., the probability of a bond being present at an edge becomes small,
and the probability of finding a maximum cluster (one whose bonds reach
from one end of the lattice to the other) is expected to be zero. Let
us define a quantity PN (K) = {the maximum cluster size}/N , where the cluster size is
defined as the number of bonds of the cluster. PN (K) is the probability of a vertex being on
the maximum cluster, where N is the total number of vertices. $P(K) = \lim_{N \to \infty} P_N(K)$
is called the percolation probability. This quantity is the order parameter of the percolation
transition and is expected to show the following critical singularity in the infinite lattice for
|K − Kc | → 0:
\[ P(K) \sim \begin{cases} (K - K_c)^{\beta_p}, & K > K_c, \\ 0, & K < K_c, \end{cases} \tag{5.1} \]
where $\beta_p$ is the critical exponent associated with the percolation probability.
In a finite lattice with the size N , the finite-size percolation probability PN (K) cannot
vanish at any K > 0, and its behavior depends on the lattice size. In Fig. 6 we show the
size dependence of PN (K) for Q = 2.5 and 0.6 as examples. As we can see, PN (K)
at finite size does not have the sharp rise of Eq. (5.1) but has a
milder rise with respect to K. These simulations were performed with the lattice sizes
N = 100, 200, 400, 800 and 1600 for Q = 0.2, 0.4, 0.6, 0.8, 1.0, 1.5, 2.0, 2.5, 3.0, 3.5 and
4.0. For each lattice size the number of independent samples is 5000. The
range of the coupling constant K in which we measure the observables depends on the
models, and they are shown in Table 1. So far as the behavior of PN (K) is concerned, the
order of phase transition is consistent with second order.
For the behavior of PN (K) near K → Kc we suppose the following scaling behavior
based on the finite-size scaling hypothesis [39]
\[ P_N(K) = L^{-\beta_p/\nu} F_P\bigl( (K - K_c) L^{1/\nu} \bigr), \tag{5.2} \]
Fig. 6. Size-dependence of PN (K) for Q = 2.5 and 0.6 as examples. The sizes of the systems are N = 100 (the
highest curve), 200, 400, 800 and 1600 (the lowest curve).
Table 1
The ranges and intervals of the coupling constant K in which we
measure the finite-size percolation probability PN (K) for various Q
Q Range of K Interval of K
0.2 0.25 ∼ 0.82 0.03 × 20 (points)
0.4 0.50 ∼ 1.07 0.03 × 20 (points)
0.6 0.70 ∼ 1.27 0.03 × 20 (points)
0.8 0.80 ∼ 1.37 0.03 × 20 (points)
1.0 0.83 ∼ 1.40 0.03 × 20 (points)
1.5 1.05 ∼ 1.62 0.03 × 20 (points)
2.0 1.23 ∼ 1.65 0.03 × 15 (points)
2.5 1.30 ∼ 1.87 0.03 × 20 (points)
3.0 1.42 ∼ 1.84 0.03 × 15 (points)
3.5 1.45 ∼ 2.02 0.03 × 20 (points)
4.0 1.55 ∼ 1.97 0.03 × 15 (points)
Fig. 7. The functions $\Phi_{N,N'}(K)$, defined by Eq. (5.3), for all pairs of sizes $(N, N')$ with N, N' = 100, 200, . . . , 1600. The
curves are for Q = 2.5 and 0.6 as examples.
where (5.1) is assumed for the infinite system, N → ∞. L is the linear extension of
the system and ν is the critical exponent associated with the correlation length
$\xi(K) \sim 1/|K - K_c|^{\nu}$. Since the total volume is proportional to N , we should make the identification
$N = L^{d_F}$ with $d_F$ the fractal dimension. We may view ξ(K) as an average geodesic
distance. In order to extract Kc using Eq. (5.2), we define the following function $\Phi_{N,N'}(K)$
[40]:
\[ \Phi_{N,N'}(K) = \frac{\ln[P_N(K)/P_{N'}(K)]}{\ln[N/N']}, \tag{5.3} \]
for a pair of sizes $(N, N')$. The functions $\Phi_{N,N'}(K)$ and $\Phi_{N'',N'''}(K)$ for two different pairs
of sizes $(N, N')$ and $(N'', N''')$ should thus intersect at Kc , and the intersection point should
yield $-\beta_p/(\nu d_F)$ if corrections to finite-size scaling can be neglected. In Fig. 7 we plot the
functions $\Phi_{N,N'}(K)$ for all pairs of given sizes in the cases of Q = 2.5 and 0.6. As we
can see from Fig. 6, PN (K) grows with the increase of Q, while the intersection point is
relatively clear for larger Q. It becomes more difficult to find the intersection point for
smaller Q.
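The crossing argument behind Eq. (5.3) can be checked on synthetic data that obey the scaling form (5.2) exactly. The sketch below is only an illustration: the scaling function $F_P(x) = e^{x}$ and all parameter values are arbitrary assumptions and are not taken from the simulations.

```python
import math

def phi(p_n, p_m, n, m):
    """Eq. (5.3): ln[P_N / P_N'] / ln[N / N'] for one pair of sizes."""
    return math.log(p_n / p_m) / math.log(n / m)

def synthetic_p(n, k, kc, beta_over_nu, nu, d_f):
    """P_N(K) built from Eq. (5.2) with L = N^(1/d_F) and F_P(x) = exp(x)."""
    length = n ** (1.0 / d_f)
    x = (k - kc) * length ** (1.0 / nu)
    return length ** (-beta_over_nu) * math.exp(x)

def phi_pair(n, m, k, kc=1.5, beta_over_nu=0.2, nu=1.2, d_f=4.0):
    """Phi for the pair (n, m) at coupling k, using the synthetic P_N(K)."""
    return phi(synthetic_p(n, k, kc, beta_over_nu, nu, d_f),
               synthetic_p(m, k, kc, beta_over_nu, nu, d_f), n, m)
```

At K = Kc every pair of sizes returns the same value, $-\beta_p/(\nu d_F)$ (here −0.05), whereas away from Kc different pairs give different values, which is why the curves of different pairs intersect at the critical coupling.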
Fig. 8. Size-dependence of χN (K) for Q = 2.5 and 0.6 as examples. The sizes of system are N = 100 (the lowest
peak), 200, 400, 800, 1600 and 3200 (the highest peak).
Let us next define a quantity ns (K) = ps (K)/s, where ps (K) is the probability of a vertex
being on a cluster of size s. ns (K) can be recognized as a cluster size distribution. This
can be understood as follows: suppose we have ms clusters of size s; then
ps (K) = ms s/N and thus ns (K) = ps (K)/s = ms /N , which is proportional to the
number of clusters of size s and can thus be understood as a cluster size distribution. Using the cluster
size distribution we define the so-called percolation susceptibility χ(K),
\[ \chi(K) = {\sum_{s}}' \, s^2 n_s(K), \tag{5.4} \]
where the primed sum means that the maximum cluster is omitted from the summation.
This quantity is the average number of bonds of a (finite) cluster and is expected to show
the following critical singularity in the infinite lattice for |K − Kc | → 0:
\[ \chi(K) \sim \begin{cases} (K - K_c)^{-\gamma_p}, & K > K_c, \\ (K_c - K)^{-\gamma'_p}, & K < K_c, \end{cases} \tag{5.5} \]
where $\gamma_p$ and $\gamma'_p$ are the critical exponents associated with the percolation susceptibility.
Since these quantities P (K) and χ(K) play a major role in usual percolation theory, we
use them as crucial quantities of determining the critical point of generalized Potts models.
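Given the list of cluster sizes of one configuration, the finite-size observables follow directly from the definitions above. The sketch below is a minimal illustration with a hypothetical function name; it assumes that the sizes are measured in the same units as N, so that $p_s = m_s s/N$ is a probability, and it returns single-configuration values that would still have to be averaged over samples.

```python
from collections import Counter

def percolation_observables(cluster_sizes, n_vertices):
    """Return (P_N, n_s, chi_N) for one cluster configuration.

    P_N  : size of the maximum cluster divided by N
    n_s  : dict s -> m_s / N over the remaining clusters
    chi_N: primed sum of s^2 n_s, Eq. (5.4), maximum cluster omitted
    """
    sizes = sorted(cluster_sizes)
    p_n = sizes[-1] / n_vertices
    finite = sizes[:-1]                  # drop the maximum cluster once
    m = Counter(finite)                  # m_s: number of clusters of size s
    n_s = {s: ms / n_vertices for s, ms in m.items()}
    chi = sum(s * s * ns for s, ns in n_s.items())
    return p_n, n_s, chi
```

For example, cluster sizes 5, 2, 2 and 1 on N = 10 vertices give $P_N = 0.5$, $n_2 = 0.2$, $n_1 = 0.1$ and $\chi_N = 0.9$.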
In a finite lattice with the size N , the finite-size percolation susceptibility χN (K)
cannot diverge at the critical point Kc but reaches a maximum of finite height only. The
magnitude of this maximum depends on the size of the lattice. In Fig. 8 we show the size
dependence of χN (K) for Q = 2.5 and 0.6 as examples. In these simulations we take lattice
sizes N = 100, 200, 400, 800, 1600 and 3200 for various values of Q’s. The range of the
coupling constant K and the number of independent samples in which we measure the
observables depend on the models and the lattice sizes, and they are shown in Tables 2–4.
In a finite lattice there is a so-called pseudo-critical point $\tilde{K}_c(N)$ instead of the true critical
point Kc , where χN (K) reaches its maximum. The finite size scaling hypothesis for
the correlation length [39] means that at the pseudo-critical point the correlation length
coincides with the linear extension of the system, i.e.,
\[ \xi\bigl( \tilde{K}_c(N) \bigr) \sim L \sim N^{1/d_F}, \tag{5.6} \]
Table 2
The lattice sizes, ranges of the coupling constant K and the number of samples in which we measure the finite-size
percolation susceptibility χN (K) for Q = 0.2, 0.4, 0.6 and 0.8
Q # triangles Range of K Interval of K # samples
0.2 100 0.20 ∼ 0.58 0.02 × 20 (points) 20000
200 0.30 ∼ 0.58 0.02 × 15 (points) 20000
400 0.36 ∼ 0.64 0.02 × 15 (points) 20000
800 0.40 ∼ 0.67 0.03 × 10 (points) 20000
1600 0.48 ∼ 0.66 0.02 × 10 (points) 20000
3200 0.47 ∼ 0.74 0.03 × 10 (points) 5000
0.4 100 0.42 ∼ 0.80 0.02 × 20 (points) 20000
200 0.53 ∼ 0.81 0.02 × 15 (points) 20000
400 0.61 ∼ 0.89 0.02 × 15 (points) 20000
800 0.66 ∼ 0.93 0.03 × 10 (points) 20000
1600 0.75 ∼ 0.93 0.02 × 10 (points) 20000
3200 0.74 ∼ 1.01 0.03 × 10 (points) 5000
0.6 100 0.57 ∼ 1.01 0.02 × 23 (points) 20000
200 0.71 ∼ 1.05 0.02 × 18 (points) 20000
400 0.77 ∼ 1.09 0.02 × 17 (points) 20000
800 0.83 ∼ 1.10 0.03 × 10 (points) 20000
1600 0.86 ∼ 1.16 0.03 × 11 (points) 20000
3200 0.86 ∼ 1.19 0.03 × 12 (points) 10000
0.8 100 0.70 ∼ 1.14 0.02 × 23 (points) 20000
200 0.83 ∼ 1.11 0.02 × 15 (points) 20000
400 0.91 ∼ 1.19 0.02 × 15 (points) 20000
800 0.95 ∼ 1.22 0.03 × 10 (points) 20000
1600 0.97 ∼ 1.27 0.03 × 11 (points) 20000
3200 1.02 ∼ 1.29 0.03 × 10 (points) 10000
Table 3
The same as Table 2 for Q = 1.0, 1.5, 2.0 and 2.5
Q # triangles Range of K Interval of K # samples
1.0 100 0.82 ∼ 1.20 0.02 × 20 (points) 20000
200 0.94 ∼ 1.22 0.02 × 15 (points) 20000
400 1.00 ∼ 1.32 0.02 × 17 (points) 20000
800 1.05 ∼ 1.32 0.03 × 10 (points) 20000
1600 1.13 ∼ 1.31 0.02 × 10 (points) 20000
3200 1.14 ∼ 1.34 0.02 × 11 (points) 10000
1.5 100 1.01 ∼ 1.39 0.02 × 20 (points) 20000
200 1.13 ∼ 1.41 0.02 × 15 (points) 20000
400 1.19 ∼ 1.47 0.02 × 15 (points) 20000
800 1.28 ∼ 1.46 0.02 × 10 (points) 20000
1600 1.31 ∼ 1.49 0.02 × 10 (points) 20000
3200 1.33 ∼ 1.51 0.02 × 10 (points) 10000
2.0 100 1.10 ∼ 1.60 0.02 × 26 (points) 20000
200 1.20 ∼ 1.60 0.02 × 21 (points) 20000
400 1.28 ∼ 1.64 0.02 × 19 (points) 20000
800 1.37 ∼ 1.61 0.02 × 13 (points) 20000
1600 1.41 ∼ 1.63 0.02 × 12 (points) 20000
3200 1.45 ∼ 1.63 0.02 × 10 (points) 10000
2.5 100 1.26 ∼ 1.64 0.02 × 20 (points) 20000
200 1.37 ∼ 1.65 0.02 × 15 (points) 20000
400 1.42 ∼ 1.70 0.02 × 15 (points) 20000
800 1.50 ∼ 1.68 0.02 × 10 (points) 20000
1600 1.53 ∼ 1.71 0.02 × 10 (points) 20000
3200 1.54 ∼ 1.72 0.02 × 10 (points) 10000
Table 4
The same as Table 2 for Q = 3.0, 3.5 and 4.0
Q # triangles Range of K Interval of K # samples
3.0 100 1.35 ∼ 1.73 0.02 × 20 (points) 20000
200 1.46 ∼ 1.74 0.02 × 15 (points) 20000
400 1.50 ∼ 1.78 0.02 × 15 (points) 20000
800 1.59 ∼ 1.77 0.02 × 10 (points) 20000
1600 1.60 ∼ 1.80 0.02 × 11 (points) 20000
3200 1.62 ∼ 1.80 0.02 × 10 (points) 10000
3.5 100 1.40 ∼ 1.78 0.02 × 20 (points) 20000
200 1.53 ∼ 1.81 0.02 × 15 (points) 20000
400 1.58 ∼ 1.86 0.02 × 15 (points) 20000
800 1.66 ∼ 1.84 0.02 × 10 (points) 20000
1600 1.68 ∼ 1.86 0.02 × 10 (points) 20000
3200 1.69 ∼ 1.87 0.02 × 10 (points) 10000
4.0 100 1.42 ∼ 1.90 0.02 × 25 (points) 20000
200 1.56 ∼ 1.90 0.02 × 18 (points) 20000
400 1.65 ∼ 1.93 0.02 × 15 (points) 20000
800 1.73 ∼ 1.91 0.02 × 10 (points) 20000
1600 1.74 ∼ 1.92 0.02 × 10 (points) 20000
3200 1.75 ∼ 1.93 0.02 × 10 (points) 10000
Fig. 9. The best linear fit to Eq. (5.7) for Q = 2.5 and 0.6, where δ = 1/νdF .
Fig. 10. The best values of Kc versus Q. The thick solid line is the fitting curve to the data points with polynomials
of fifth order in $Q^{1/2}$ (gravity). We also show the theoretical critical coupling for the Q-state Potts models on the
honeycomb and square lattice (flat).
The string susceptibility exponent γs is one of the simplest quantities which characterize
the fractal structure of quantum gravity. This quantity is introduced as the exponent of
the subleading correction to the canonical partition function for random surfaces of fixed
volume A
for A → ∞, where Λc denotes the critical cosmological constant. As we show in Eq. (2.2),
the total area A is proportional to the number of triangles N in the regularized counterpart
of Eq. (5.8). In the case of pure gravity, Z(A = N) ($a^2 = 1$) is equal to the number of
inequivalent triangulations with volume N .
Physically the string susceptibility can be identified as an order parameter for the
branching probability of random surface. This could be understood by the following
relation:
\[ \frac{1}{Z(A)} \int_0^{A} dB \, B Z(B) \, (A - B) Z(A - B) \sim A^{\gamma_s}, \tag{5.9} \]
where BZ(B) can be identified as the number of triangulations with a marked point
on a triangulated surface of area B. Thus the left-hand side of Eq. (5.9) measures the
average branching rate of the total surface branching into two parts [43]. For γs > 0 the
surface has a tendency to branch more, while for γs < 0 the surface tends to be smooth.
Using the analytical approach in a continuum framework, c-dependence of γs was first
derived by KPZ [1] and later rederived by DDK [24] by using conformal gauge formulation
of Liouville theory,
\[ \gamma_s(c) = \frac{c - 1 - \sqrt{(25 - c)(1 - c)}}{12}, \tag{5.10} \]
for two-dimensional quantum gravity coupled to the matter central charge c with spherical
topology.
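For reference, Eq. (5.10) can be evaluated directly. The function name below is hypothetical, and the restriction to c ≤ 1 is imposed here only because the square root becomes imaginary for 1 < c < 25.

```python
import math

def gamma_s(c):
    """String susceptibility exponent of Eq. (5.10) for central charge c <= 1."""
    if c > 1.0:
        raise ValueError("Eq. (5.10) is real only for c <= 1 (or c >= 25)")
    return (c - 1.0 - math.sqrt((25.0 - c) * (1.0 - c))) / 12.0
```

This gives $\gamma_s = -1$ for c = −2, $-1/2$ for pure gravity (c = 0), $-1/3$ for c = 1/2 and 0 for c = 1.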
In this subsection we investigate the string susceptibility exponent γs by measuring the
distributions of so-called baby universes [12,13,18]. It has already been pointed out that
the numerical values of string susceptibility are in perfect agreement with the theoretical
results (5.10) of Q-state Potts models for integral values of Q’s. We then expect that the
agreement will be perfect even for the non-integral values of Q’s. We intend to use the
numerical investigations of γs (c) as the cross check of the critical values of Kc calculated
in the previous subsection.
The branching probability of the surface with total area A into B and A − B is given by
the integrand of Eq. (5.9). The lattice counterpart of this branching probability is given by
\[ b_N(B) \sim \frac{3 (B + 1) Z(B + 1) \, (N - B + 1) Z(N - B + 1)}{Z(N)} \sim N \Bigl[ B \Bigl( 1 - \frac{B}{N} \Bigr) \Bigr]^{\gamma_s - 2} \quad (1 \ll B < N), \tag{5.11} \]
where two baby universes are separated by a single triangle in the current formulation of
triangulations. We may call the smaller part separated by the minimum neck a baby universe
(minbu). In numerical simulations it is easy to find the shortest loops in a given
triangulation and then to enumerate the area of the corresponding minbu’s.
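As a worked step, the second form of Eq. (5.11) follows from the large-volume behavior of the partition function. The sketch below assumes $Z(N) \sim e^{\lambda_c N} N^{\gamma_s - 3}$, i.e., the form of Eq. (5.14) without the logarithmic factor, and neglects the shifts by one, which is justified for large B:

```latex
\begin{align*}
\frac{B\,Z(B)\,(N-B)\,Z(N-B)}{Z(N)}
 &\sim \frac{e^{\lambda_c B} B^{\gamma_s-2}\; e^{\lambda_c (N-B)} (N-B)^{\gamma_s-2}}
            {e^{\lambda_c N} N^{\gamma_s-3}} \\
 &= N \, B^{\gamma_s-2} \Bigl(1-\frac{B}{N}\Bigr)^{\gamma_s-2}
  = N \Bigl[ B \Bigl(1-\frac{B}{N}\Bigr) \Bigr]^{\gamma_s-2} .
\end{align*}
```

The exponential factors cancel, so the minbu distribution depends only on $\gamma_s$ and not on the non-universal constant $\lambda_c$.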
The simulations were done with lattice sizes N = 1000 and 2000 for various values
of Q. For each lattice size the number of independent samples is 100K. The values of Q’s
and the critical couplings Kc (Q) are shown in Table 5.
In order to extract γs we have fitted the distributions expressed in the form
\[ \ln b_N(B) = a_1 + (\gamma_s - 2) \ln \Bigl[ B \Bigl( 1 - \frac{B}{N} \Bigr) \Bigr] + \frac{a_2}{B}, \tag{5.12} \]
where a1 and a2 are fitting parameters and the last term is a finite size correction term
for small B [13]. We have introduced a lower cut-off Bc , because Eq. (5.11) is only
Table 5
The critical coupling constants Kc for various values of Q
Q Kc
0.00 0.000
0.05 0.428
0.20 0.764
0.50 1.053
0.80 1.212
1.00 1.288
1.50 1.434
2.00 1.547
2.50 1.647
3.00 1.732
3.50 1.800
4.00 1.848
Fig. 11. The measured string susceptibility γs versus central charge c and the theoretical curve. No logarithmic
corrections are introduced in the fits.
asymptotically correct, so that deviations can be expected for small B. Moreover, we have cut off
the large-B part, because the distributions of minbu’s are not universal for large B.
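Since Eq. (5.12) is linear in the parameters $a_1$, $\gamma_s - 2$ and $a_2$, the fit reduces to a linear least-squares problem. The following is a minimal sketch with hypothetical function names that solves the normal equations in plain Python; the fitting procedure and error analysis actually used for the results in this paper are not specified here, and in practice the B-window (lower cut-off Bc and upper cut) would be applied to the input data beforehand.

```python
import math

def solve3(A, y):
    """Gaussian elimination with partial pivoting for a 3x3 linear system."""
    M = [row[:] + [rhs] for row, rhs in zip(A, y)]
    n = 3
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[piv] = M[piv], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = M[i][n] - sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = s / M[i][i]
    return x

def fit_gamma_s(B_values, ln_b, N):
    """Least-squares fit of Eq. (5.12); returns (gamma_s, a1, a2)."""
    rows = [(1.0, math.log(B * (1.0 - B / N)), 1.0 / B) for B in B_values]
    A = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    y = [sum(r[i] * t for r, t in zip(rows, ln_b)) for i in range(3)]
    a1, slope, a2 = solve3(A, y)
    return slope + 2.0, a1, a2
```

Applied to data generated exactly from Eq. (5.12), the routine should return the input value of $\gamma_s$ up to rounding errors.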
The values γs (Bc ) extracted from Eq. (5.12) approach a limiting value exponentially
for large Bc . Thus in order to extract the limiting value γs we fit the values γs (Bc ) to the
form
\[ \gamma_s(B_c) = \gamma_s - a_1 e^{-a_2 B_c}. \tag{5.13} \]
In Fig. 11 we plotted the limiting value γs obtained by the above method versus central
charge c together with the theoretical curve (5.10).
The reason why the results for c ≈ 1 disagree with the theoretical curve is possibly due
to the fact that logarithmic corrections are not yet introduced. It is well known [12,18] that
logarithmic corrections play an important role in the vicinity of c ≈ 1. Thus we assume
that the partition function has the following asymptotic behavior for large N
\[ Z(N) \sim e^{\lambda_c N} N^{\gamma_s - 3} (\ln N)^{\alpha}, \tag{5.14} \]
Fig. 12. The measured string susceptibility γs versus central charge c and the theoretical curve. Logarithmic
corrections are introduced in the fits for Q = 2.5, 3.0, 3.5 and 4.0.
where α is an additional parameter. The measured distributions bN (B) can now be fitted
to the following parameterization:
\[ \ln[b_N(B)] = a_1 + (\gamma_s - 2) \ln \Bigl[ B \Bigl( 1 - \frac{B}{N} \Bigr) \Bigr] + \alpha \ln \bigl[ \ln B \, \ln(N - B) \bigr] + \frac{a_2}{B}, \tag{5.15} \]
for Q = 2.5, 3.0, 3.5 and 4.0. Then γs with the logarithmic corrections versus central
charge c are plotted in Fig. 12.
The string susceptibility γs with the logarithmic corrections for the various values of
Q’s are in very good agreement with the theoretical curve. We can then conclude that the
values of critical coupling Kc (Q) evaluated numerically in the previous subsection are
correct within errors.
\[ N(r) = r^{d_F}, \tag{5.16} \]
where we count the number of triangles N(r) inside r steps from a marked triangle. In
fact the fractal structure of the two-dimensional quantum gravity was first confirmed in
this way by Kawamoto, Kazakov, Saeki and Watabiki for c = −2 scalar fermion model
[5]. In these analyses they needed 5 million triangles to obtain the reliable value of the
fractal dimension. It was later recognized that finite size scaling hypothesis is very useful
to evaluate the fractal dimension numerically and thus relatively small number of triangles
is enough to obtain very accurate fractal dimension for c = −2 model [15,16]. Here we use
finite size scaling hypothesis to obtain the c-dependence of the fractal dimension.
It has already been recognized numerically in [5] that the total perimeter length sN (r)
measured at geodesic distance r from a marked triangle grows
\[ s_N(r) = r^{d_F - 1}. \tag{5.17} \]
This fact triggered the investigation of analytic derivation of transfer matrix for two-
dimensional random surface of quantum gravity for c = 0 model [7]. It has then been
recognized that two point function of random surface can be related to the measurement of
sN (r) [9,14,44].
In the case of pure gravity (c = 0) “two-point function” with fixed volume A is defined
by
\[ S_A(R) = \frac{1}{Z(A)} \int D[g] \, \delta\Bigl( \int d^2x \sqrt{g} - A \Bigr) \frac{1}{A} \int d^2x \sqrt{g(x)} \int d^2x' \sqrt{g(x')} \, \delta\bigl( D_g(x, x') - R \bigr), \tag{5.18} \]
where $D_g(x, x')$ denotes the geodesic distance between the points labeled by $x$ and $x'$
measured with respect to g. Then SA (R) dR is the average area of a spherical shell of
thickness dR and radius R from a marked point on the manifold. We recognize that the
lattice triangulation version of SA (R) corresponds to sN (r). According to the numerical
result, we expect to have a relation
SA (R) ∼ R dF −1 for R ∼ 0. (5.19)
In the case of pure gravity (c = 0), $S_A(R)$ was calculated analytically:
\[ S_A(R) = R^{3} f\big( R/A^{1/4} \big), \tag{5.20} \]
where we can extrapolate $1/d_F(\infty)$ by a linear fit to Eq. (5.26). This is shown in Fig. 13 for
Q = 2.5 and 0.5 as examples, where the estimation of the error is based on non-linear fits to
$s_N(r)/N$. The values of $d_F$ obtained in this way for the various values of Q are plotted
Fig. 13. The best linear fit to Eq. (5.26) for Q = 2.5 and 0.5 as examples, by the decay of the peak of sN (r)/N
with N .
Fig. 14. The best linear fit to Eq. (5.27) for Q = 2.5 and 0.5 as examples, by the decay of the inverse average
radius 1/rN with N .
Fig. 15. The measured fractal dimension dF by the decay of peak versus central charge c and the three theoretical
curves given by Eqs. (2.12), (2.20) and (2.32).
in Fig. 15. The three theoretical curves given by formulae (2.12), (2.20) and (2.32) are
shown for comparison. It is clear that formula (2.32) lies closest to the numerical values
of the fractal dimension.
Fig. 16. The measured fractal dimension dF by the average radius versus central charge c and the three theoretical
curves given by Eqs. (2.12), (2.20) and (2.32).
Secondly, in order to make use of the whole information of $s_N(r)$ we have used the
average radius $\langle r \rangle_N$ of universes with volume N,
\[ \langle r \rangle_N \equiv \frac{1}{N} \sum_{r=0}^{\infty} r\, s_N(r) \sim N^{1/d_F}. \tag{5.27} \]
We can then expect $1/\langle r \rangle_N \sim N^{-1/d_F}$, where $d_F$ can be determined in the same way as
above. In Fig. 14 we show the corresponding linear fits to Eq. (5.27). The values of dF ’s
obtained in this way for the various values of Q’s are plotted in Fig. 16.
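The average-radius estimate of Eq. (5.27) can be sketched in a few lines. This is only an illustration under stated assumptions: the function names are hypothetical, the shell profile is normalised by the sum of $s_N(r)$ (which equals N only if every triangle is counted in exactly one shell), and a plain log-log least-squares fit is used in place of the extrapolation procedure of the paper.

```python
import math

def average_radius(s):
    """<r>_N = sum_r r * s_N(r) / sum_r s_N(r) for a shell profile s[r]."""
    n_total = sum(s)  # assumed to equal the volume N
    return sum(r * s_r for r, s_r in enumerate(s)) / n_total

def fit_fractal_dimension(volumes, radii):
    """Fit log<r>_N = const + (1/d_F) log N by least squares and return d_F."""
    xs = [math.log(n) for n in volumes]
    ys = [math.log(r) for r in radii]
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    slope = (sum((x - mx) * (y - my) for x, y in zip(xs, ys))
             / sum((x - mx) ** 2 for x in xs))
    return 1.0 / slope

# Synthetic check (not data from the paper): radii scaling exactly as N^(1/4).
volumes = [1000, 2000, 4000, 8000]
radii = [n ** 0.25 for n in volumes]
d_f = fit_fractal_dimension(volumes, radii)
```

With the synthetic input above the fit recovers a fractal dimension of 4 up to rounding, which is the pure-gravity value implied by Eq. (5.20).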
These two independent results are consistent with each other and support the theoretical
prediction $d_F^{(3)}(c)$.
In this article we have shown the results of numerical investigations of the fractal
dimension of two-dimensional quantum gravity coupled to matter with central charge c for
−2 ≤ c ≤ 1. The c-dependence is introduced by reformulating the
Q-state Potts model as a model which can be identified as a generalization of the
percolation cluster model, the weighted percolation cluster model, on the random lattice.
In this formulation Q can be generalized to non-integer values, so that a continuous
c-dependence is realized; we have called this model simply the generalized
Potts model. Since the model has a percolation cluster feature, we have formulated a new
Metropolis algorithm to generate clusters on dynamically triangulated surfaces.
The c-dependence of the critical coupling $K_c$ is not known theoretically. We have
evaluated the c-dependence of $K_c$ by measuring the percolation probability and the percolation
susceptibility with the help of the scaling hypothesis. The c-dependence of the critical coupling
behaves similarly to that on the flat lattice. It is then very natural to expect that there
\[ d_F^{(3)}(c) = 2 \times \frac{\sqrt{25 - c} + \sqrt{49 - c}}{\sqrt{25 - c} + \sqrt{1 - c}}. \tag{6.1} \]
We consider that the deviation of the numerical values of the fractal
dimension from the theoretical values near c ≈ 1 is possibly due to the fact that the
lattice is not large enough to observe the possible discrete jump from the fractal
dimension of Eq. (6.1) for c < 1 to the values of the branched polymer phase, $d_F = 2$, $\gamma_s = 1/2$
[45–47].
It would be interesting to measure the change of the fractal dimension very accurately in this
delicate region near c ≈ 1. The theoretical curve of the c-dependence of the fractal
dimension has infinite slope at c = 1 and becomes complex for c > 1. We conjecture
that the fractal phase of two-dimensional quantum gravity for c < 1 turns into the branched
polymer phase for c > 1. We expect that there is a discrete jump of the fractal dimension
at c = 1. In order to measure this discrete jump numerically we may need a huge number of
triangles in the simulation.
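As a quick numerical illustration of Eq. (6.1), the sketch below evaluates the formula for a few central charges. The function name is hypothetical, and the use of a complex square root for c > 1 is a choice made here simply to show that the expression leaves the real axis beyond c = 1.

```python
import cmath

def fractal_dimension(c):
    """Evaluate Eq. (6.1); complex square roots keep it defined for c > 1."""
    s25 = cmath.sqrt(25 - c)
    return 2 * (s25 + cmath.sqrt(49 - c)) / (s25 + cmath.sqrt(1 - c))

d_pure = fractal_dimension(0)    # pure gravity
d_edge = fractal_dimension(1)    # end point of the fractal phase
d_beyond = fractal_dimension(2)  # formally continued past c = 1
```

For c = 0 the formula gives exactly 4, at c = 1 it gives 2(1 + √2) ≈ 4.83, and for c = 2 the result has a non-zero imaginary part, in line with the statement that the curve ceases to be real beyond c = 1.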
It is interesting to compare with the measurement of the string susceptibility in
three-dimensional simplicial gravity. In three dimensions the tetrahedron is the fundamental
simplex, corresponding to the triangle of two-dimensional quantum gravity. In
three-dimensional dynamical triangulation the gravitational constant can be a free parameter and
plays the role of the central charge c of two-dimensional quantum gravity. It was measured that
the string susceptibility changes from a negative region to a positive region where the branched
polymer phase is expected [48]. Here again a phase change from the fractal phase to the
branched polymer phase is expected. For realistic higher dimensional simplicial quantum
gravity it would be important to understand phase changes such as the fractal-branched
polymer phase change.
Acknowledgements
We would like to thank I. Kostov and Y. Watabiki for useful discussions at the very
early stage of this work. This work is supported in part by Japanese Ministry of Education,
Science, Sports and Culture under the grant number 13640250.
References
[1] V. Knizhnik, A.M. Polyakov, A. Zamolodchikov, Mod. Phys. Lett. A 3 (1988) 819.
[2] J. Distler, Z. Hlousek, H. Kawai, Mod. Phys. Lett. A 5 (1990) 1093.
[3] H. Kawai, M. Ninomiya, Nucl. Phys. B 336 (1990) 115.
[4] M.E. Agishtein, A.A. Migdal, Int. J. Mod. Phys. C 1 (1990) 165;
M.E. Agishtein, A.A. Migdal, Nucl. Phys. B 350 (1991) 690.
[5] N. Kawamoto, V.A. Kazakov, Y. Saeki, Y. Watabiki, Phys. Rev. Lett. 68 (1992) 2113.
[6] N. Kawamoto, Y. Saeki, Y. Watabiki, in preparation;
Y. Watabiki, Prog. Theor. Phys. Suppl. 114 (1993) 1;
N. Kawamoto, Quantum gravity, in: K. Kikkawa, M. Ninomiya (Eds.), Proceedings of Nishinomiya–Yukawa
Workshop, Nishinomiya, World Scientific, 1992, p. 112;
N. Kawamoto, Current topics in theoretical physics, in: Y.M. Cho (Ed.), First Asia-Pacific Winter School
for Theoretical Physics, World Scientific, 1993.
[7] H. Kawai, N. Kawamoto, T. Mogami, Y. Watabiki, Phys. Lett. B 306 (1993) 19.
[8] Y. Watabiki, Nucl. Phys. B 441 (1995) 119;
Y. Watabiki, Phys. Lett. B 346 (1995) 46.
[9] J. Ambjørn, Y. Watabiki, Nucl. Phys. B 445 (1995) 129.
[10] J. Ambjørn, C.F. Kristjansen, Y. Watabiki, Nucl. Phys. B 504 (1997) 555.
[11] S. Oda, N. Tsuda, T. Yukawa, Prog. Theor. Phys. 99 (1998) 875.
[12] S. Jain, S.D. Mathur, Phys. Lett. B 286 (1992) 239.
[13] J. Ambjørn, S. Jain, G. Thorleifsson, Phys. Lett. B 307 (1993) 34.
[14] J. Ambjørn, J. Jurkiewicz, Y. Watabiki, Nucl. Phys. B 454 (1995) 313.
[15] J. Ambjørn, K.N. Anagnostopoulos, T. Ichihara, L. Jensen, N. Kawamoto, Y. Watabiki, K. Yotsuji, Phys.
Lett. B 397 (1997) 177.
[16] J. Ambjørn, K.N. Anagnostopoulos, T. Ichihara, L. Jensen, N. Kawamoto, Y. Watabiki, K. Yotsuji, Nucl.
Phys. B 511 (1998) 673.
[17] J. Ambjørn, K.N. Anagnostopoulos, Nucl. Phys. B 497 (1997) 445.
[18] J. Ambjørn, G. Thorleifsson, Phys. Lett. B 323 (1994) 7.
[19] K.N. Anagnostopoulos, P. Bialas, G. Thorleifsson, J. Stat. Phys. 94 (1999) 321.
[20] R.B. Potts, Proc. Cambridge Philos. Soc. 48 (1952) 106.
[21] C.M. Fortuin, P.W. Kasteleyn, J. Phys. Soc. Jpn. Suppl. 26 (1969) 11;
C.M. Fortuin, P.W. Kasteleyn, Physica 57 (1972) 536.
[22] J. Jurkiewicz, A. Krzywicki, B. Petersson, B. Soderberg, Phys. Lett. B 213 (1988) 511;
C.F. Baillie, D.A. Johnston, Phys. Lett. B 286 (1992) 44;
S. Catterall, J. Kogut, R. Renken, Phys. Lett. B 292 (1992) 277;
J. Ambjørn, B. Durhuus, T. Jonsson, G. Thorleifsson, Nucl. Phys. B 398 (1993) 568;
J. Ambjørn, G. Thorleifsson, M. Wexler, Nucl. Phys. B 439 (1995) 187.
[23] J. Ambjørn, G. Thorleifsson, M. Wexler, Nucl. Phys. B 439 (1995) 187.
[24] F. David, Mod. Phys. Lett. A 3 (1988) 651;
J. Distler, H. Kawai, Nucl. Phys. B 321 (1989) 509.
[25] S. Weinberg, in: S.W. Hawking, W. Israel (Eds.), General Relativity, an Einstein Centenary Survey,
Cambridge Univ. Press, 1979, p. 790;
R. Gastmans, R. Kallosh, C. Truffin, Nucl. Phys. B 133 (1978) 417;
S.M. Christensen, M.J. Duff, Phys. Lett. B 79 (1978) 213.
$A_{n-1}^{(1)}$ reflection K-matrices
A. Lima-Santos
Universidade Federal de São Carlos, Departamento de Física, Caixa Postal 676,
CEP 13569-905 São Carlos, Brazil
Received 17 July 2002; received in revised form 6 September 2002; accepted 6 September 2002
Abstract
We investigate the possible regular solutions of the boundary Yang–Baxter equation for the vertex
models associated with the $A_{n-1}^{(1)}$ affine Lie algebra. We have classified them in two classes of
solutions. The first class consists of n(n − 1)/2 K-matrix solutions with three free parameters. The
second class are solutions that depend on the parity of n. For n odd there exist n reflection K-matrices
with 2 + [n/2] free parameters. It turns out that for n even there exist n/2 K-matrices with 2 + n/2
free parameters and n/2 K-matrices with 1 + n/2 free parameters.
2002 Elsevier Science B.V. All rights reserved.
1. Introduction
The search for integrable models through the Yang–Baxter equation [1–3]
\[ R_{12}(u - v)\, K_1(u)\, R_{21}(u + v)\, K_2(v) = K_2(v)\, R_{12}(u + v)\, K_1(u)\, R_{21}(u - v). \tag{1.2} \]
A. Lima-Santos / Nuclear Physics B 644 [FS] (2002) 568–584 569
With this goal in mind, the study of boundary quantum groups was initiated in
[8]. However, as observed by Nepomechie [9], an independent systematic method of
constructing the boundary quantum group generators is not yet available. In contrast to
the bulk case [5], one cannot exploit boundary affine Toda field theory, since appropriate
classical integrable boundary conditions are not yet known [10].
We also share the hope that by studying the known examples of boundary quantum
group generators, it may become possible to uncover their basic algebraic structure, and
to find generalizations to all affine Lie algebras. Independent of the lack of an algebraic
solution from the quantum group approach, there has been an increasing amount of
effort towards the understanding of two-dimensional integrable theories with reflecting
boundaries via solutions of the reflection equation (1.2). In field theory, attention is focused
on the boundary S-matrix. In statistical mechanics, the emphasis has been on deriving
solutions of (1.2) and the calculation of various surface critical phenomena, both at and
away from criticality [11]. In condensed matter physics the actual target is the impurity
problem.
The classification of all possible solutions of the reflection equation (1.2) by direct
computation has been seen as a very difficult problem. However, recently we have proposed
a method which allows the classification of the $D_{n+1}^{(2)}$ reflection K-matrices [12] as well
as the K-matrices of the 19-vertex models [13]. In spite of these papers we decided to
continue along this line in order to include the $A_{n-1}^{(1)}$ reflection K-matrices, which will reveal
their algebraic structure.
We have organized this paper as follows. In Section 2 we choose the $A_{n-1}^{(1)}$ reflection
equations and in Section 3 their solutions are derived and classified in two types. The
last section is reserved for the conclusion. The first models have their K-matrices written
explicitly in Appendices A–D.
2. The $A_{n-1}^{(1)}$ reflection equations
The R-matrix for the vertex models associated with the $A_{n-1}^{(1)}$ ($n \geq 2$) affine Lie algebra
was originally found in the articles [14,15] and as presented in [5] it has the form
\[ R(u) = a_1(u) \sum_{i} E_{ii} \otimes E_{ii} + a_2(u) \sum_{i \neq j} E_{ii} \otimes E_{jj} + a_3(u) \sum_{i<j} E_{ij} \otimes E_{ji} + a_4(u) \sum_{i>j} E_{ij} \otimes E_{ji}, \tag{2.1} \]
where $E_{ij}$ denotes the elementary n by n matrices ($(E_{ij})_{ab} = \delta_{ia}\delta_{jb}$) and the Boltzmann
weights with functional dependence on the spectral parameter u are given by
\[ a_1(u) = e^{u} - q^{2}, \quad a_2(u) = q\,(e^{u} - 1), \quad a_3(u) = -(q^{2} - 1), \quad a_4(u) = -e^{u}(q^{2} - 1). \tag{2.2} \]
Here q denotes an arbitrary parameter.
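To make the structure of (2.1) and (2.2) concrete, the following sketch assembles R(u) as an explicit n² × n² array and checks two properties that follow directly from the weights: R(0) is proportional to the permutation operator, and P R(u) P equals the transpose of R(u) (PT invariance). The helper names, the basis ordering and the numerical values of n, q and u are assumptions made for this illustration only, and the weights are read with the parenthesization $a_2 = q(e^u - 1)$, $a_3 = -(q^2 - 1)$, $a_4 = -e^u(q^2 - 1)$.

```python
import math

def r_matrix(u, q, n):
    """R(u) of Eq. (2.1) as a nested list; |a, b> is stored at index a*n + b."""
    a1 = math.exp(u) - q**2
    a2 = q * (math.exp(u) - 1.0)
    a3 = -(q**2 - 1.0)
    a4 = -math.exp(u) * (q**2 - 1.0)
    dim = n * n
    R = [[0.0] * dim for _ in range(dim)]
    for i in range(n):
        for j in range(n):
            if i == j:
                R[i * n + i][i * n + i] = a1   # E_ii (x) E_ii
            else:
                R[i * n + j][i * n + j] = a2   # E_ii (x) E_jj
                # E_ij (x) E_ji sends |j, i> to |i, j>
                R[i * n + j][j * n + i] = a3 if i < j else a4
    return R

def permutation(n):
    """Permutation operator P|a, b> = |b, a>."""
    P = [[0.0] * (n * n) for _ in range(n * n)]
    for a in range(n):
        for b in range(n):
            P[b * n + a][a * n + b] = 1.0
    return P

def matmul(A, B):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*B)] for row in A]

def transpose(A):
    return [list(col) for col in zip(*A)]

def is_close(A, B, tol=1e-12):
    return all(abs(x - y) <= tol for ra, rb in zip(A, B) for x, y in zip(ra, rb))

n, q = 3, 0.7
P = permutation(n)
# Regularity: R(0) = (1 - q^2) P
regular = is_close(r_matrix(0.0, q, n), [[(1 - q**2) * x for x in row] for row in P])
# PT invariance: P R(u) P = R(u)^T
R = r_matrix(0.3, q, n)
pt_invariant = is_close(matmul(matmul(P, R), P), transpose(R))
print(regular, pt_invariant)
```

Plain nested lists are used so that the sketch runs without external libraries; for larger n an array library would be the more natural choice.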
For n > 2, the R-matrix (2.1) does not enjoy P and T symmetry but just PT invariance
Second, these algebraic equations are denoted by E[i, j] = 0 and collected into blocks
B[i, j], $i = 1, \ldots, n^2 - i$ and $j = i, i + 1, \ldots, n^2 - i$, defined by
\[ B[i, j] = \begin{cases} E[i, j] = 0, \\ E[j, i] = 0, \\ E[n^2 + 1 - i,\, n^2 + 1 - j] = 0, \\ E[n^2 + 1 - j,\, n^2 + 1 - i] = 0. \end{cases} \tag{2.12} \]
For a given block B[i, j], the equation $E[n^2 + 1 - i, n^2 + 1 - j] = 0$ can be obtained from
the equation E[i, j] = 0 by interchanging
\[ k_{ij} \leftrightarrow k_{n+1-i\, n+1-j}, \quad \beta_{ij} \leftrightarrow \beta_{n+1-i\, n+1-j}, \quad a_3(u) \leftrightarrow a_4(u), \tag{2.13} \]
and the equation E[j, i] = 0 is obtained from the equation E[i, j] = 0 by interchanging
\[ k_{ij} \leftrightarrow k_{ji}, \quad \beta_{ij} \leftrightarrow \beta_{ji}. \tag{2.14} \]
In this way, we can control all equations and a particular solution is simultaneously
connected with at least four equations.
3. The $A_{n-1}^{(1)}$ K-matrix solutions
Analyzing the $A_{n-1}^{(1)}$ reflection equations one can see that they possess a very special
structure. Several equations exist involving only the elements out of the diagonal, $k_{ij}$
($i \neq j$); these are the simplest equations and we will solve them first.
By direct inspection one can see that the diagonal blocks B[i, i] are uniquely solved by
the relations
\[ \beta_{ij}\, k_{ji}(u) = \beta_{ji}\, k_{ij}(u), \quad \forall\, i \neq j. \tag{3.1} \]
It means that we only need to find the n(n − 1)/2 elements $k_{ij}$ (i < j). Now we choose
a particular $k_{ij}$ (i < j) to be different from zero, with $\beta_{ij} \neq 0$, and try to express all
remaining elements in terms of this particular element. We have verified that this is possible
provided that
\[ k_{pq}(u) = \begin{cases} \dfrac{a_4(u)}{a_3(u)} \dfrac{\beta_{pq}}{\beta_{ij}}\, k_{ij}(u), & \text{if } p > i \text{ and } q > j, \\[2mm] \dfrac{\beta_{pq}}{\beta_{ij}}\, k_{ij}(u), & \text{if } p > i \text{ and } q < j, \end{cases} \qquad (p \neq q). \tag{3.2} \]
Combining (3.1) with (3.2) we obtain a very strong constraint on the elements out of the
diagonal
\[ k_{ij}(u) \neq 0 \;\Rightarrow\; \begin{cases} k_{pj}(u) = 0, & \text{for } p \neq i, \\ k_{iq}(u) = 0, & \text{for } q \neq j. \end{cases} \tag{3.3} \]
It means that for a given $k_{ij}$, the only elements different from zero in the ith row and in
the jth column of $K_-(u)$ are $k_{ii}$, $k_{ij}$, $k_{jj}$ and $k_{ji}$.
Analyzing these equations more carefully with the conditions (3.1) and (3.3), we have
found from the n(n − 1)/2 matrix elements $k_{ij}$ (i < j) that there are two possibilities to
choose a particular $k_{ij} \neq 0$:
• Only one non-diagonal element and its symmetric are allowed to be different from
zero. Thus we have n(n − 1)/2 reflection K-matrices with n + 2 non-zero elements.
Here we will denote by $K^{I}_{ij}$ (i < j) the K-matrix for which the non-diagonal element
$k_{ij}$ is the one chosen to be the non-zero matrix element. These matrices will be named
type-I solutions.
• For each $k_{ij} \neq 0$, additional non-diagonal elements and its asymmetric are allowed to
be different from zero provided they satisfy the equations
type-II solutions.
\[ K^{I}_{23} = \begin{pmatrix} k_{11} & 0 & 0 \\ 0 & k_{22} & k_{23} \\ 0 & k_{32} & k_{33} \end{pmatrix}. \tag{3.5} \]
One can expect that these are the three possibilities to write the same solution for the $A_2^{(1)}$
model.
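The counting of type-I solutions can be made explicit with a short enumeration of their sparsity patterns: the full diagonal plus one symmetric pair of off-diagonal entries. This is only an illustrative sketch; the function names are hypothetical and the 0/1 masks record which entries may be non-zero, not their values.

```python
from itertools import combinations

def type_one_pattern(n, i, j):
    """0/1 mask of K^I_ij (1-based i < j): diagonal plus k_ij and k_ji."""
    mask = [[1 if a == b else 0 for b in range(n)] for a in range(n)]
    mask[i - 1][j - 1] = 1
    mask[j - 1][i - 1] = 1
    return mask

def all_type_one_patterns(n):
    """One mask per pair i < j, i.e. n(n - 1)/2 masks in total."""
    return {(i, j): type_one_pattern(n, i, j)
            for i, j in combinations(range(1, n + 1), 2)}

patterns = all_type_one_patterns(3)
for row in patterns[(2, 3)]:
    print(row)
```

For n = 3 the enumeration yields the three masks corresponding to $K^{I}_{12}$, $K^{I}_{13}$ and $K^{I}_{23}$, each with n + 2 = 5 allowed entries, and the mask printed for (2, 3) has the same shape as (3.5).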
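For the $A_3^{(1)}$ model we have six type-I solutions $\{K^{I}_{12}, K^{I}_{13}, K^{I}_{14}, K^{I}_{23}, K^{I}_{24}, K^{I}_{34}\}$, all with
six non-zero elements. In this model we also have two type-II solutions $\{K^{II}_{12}, K^{II}_{14}\}$:
\[ K^{II}_{12} = \begin{pmatrix} k_{11} & k_{12} & 0 & 0 \\ k_{21} & k_{22} & 0 & 0 \\ 0 & 0 & k_{33} & k_{34} \\ 0 & 0 & k_{43} & k_{44} \end{pmatrix} \quad \text{and} \quad K^{II}_{14} = \begin{pmatrix} k_{11} & 0 & 0 & k_{14} \\ 0 & k_{22} & k_{23} & 0 \\ 0 & k_{32} & k_{33} & 0 \\ k_{41} & 0 & 0 & k_{44} \end{pmatrix} \]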
similarity transformation among these K’s matrices, even after a gauge transformation.
Even for the type-I solutions the similarity account is not simple due to the presence
of three types of scalar functions and the constraint equations for the parameters βij .
Nevertheless, as we have found a way to write all solutions, we can leave the similarity
account to the reader.
Having identified these possibilities we may proceed to find the n diagonal
elements $k_{ii}(u)$ in terms of the non-diagonal elements $k_{ij}(u)$ for each $K_{ij}$ matrix.
This procedure is now standard [13]. For instance, if we are looking for $K^{II}_{12}$, the
non-diagonal elements $k_{ij}$ (i + j = 3 mod n) in terms of $k_{12}$ are given by
\[ k_{ij}(u) = \begin{cases} \dfrac{\beta_{ij}}{\beta_{12}}\, k_{12}(u), & \text{for } i + j = 3, \\[2mm] \dfrac{\beta_{ij}}{\beta_{12}}\, e^{u} k_{12}(u), & \text{for } i + j = 3 \bmod n, \\[2mm] 0, & \text{otherwise}, \end{cases} \tag{3.9} \]
for i, j = 1, 2, . . . , n ($i \neq j$).
Substituting (3.9) into the reflection equations we can now easily find the $k_{ii}$ elements
up to an arbitrary function, here identified as $k_{12}(u)$. Moreover, their consistency relations
will yield some constraint equations for the parameters $\beta_{ij}$.
After we have found all diagonal elements in terms of $k_{ij}(u)$, we can, without loss of
generality, choose the arbitrary functions as
\[ k_{ij}(u) = \frac{1}{2} \beta_{ij} \left( e^{2u} - 1 \right), \quad i < j. \tag{3.10} \]
This choice allows us to work out the solutions in terms of the functions $f_{ii}(u)$ and $h_{ij}(u)$
defined by
\[ f_{ii}(u) = \beta_{ii} \left( e^{u} - 1 \right) + 1 \quad \text{and} \quad h_{ij}(u) = \frac{1}{2} \beta_{ij} \left( e^{2u} - 1 \right), \tag{3.11} \]
for i, j = 1, 2, . . . , n.
Now, we will simply present the general solutions and write them explicitly for the first
values of n in Appendices A–D. Let us start considering the type-I solutions.
Here we have n(n − 1)/2 reflection K-matrices with n + 2 non-zero elements. For
$1 < i < j \leq n$ we get (n − 2)(n − 1)/2 solutions
\[ K^{I}_{ij} = f_{ii}(u) E_{ii} + e^{2u} f_{ii}(-u) E_{jj} + h_{ij}(u) E_{ij} + h_{ji}(u) E_{ji} + Z_i(u) \sum_{l=1}^{i-1} E_{ll} + Y^{(i)}_{i+1}(u) \sum_{l=i+1}^{j-1} E_{ll} + e^{2u} Z_i(u) \sum_{l=j+1}^{n} E_{ll}, \tag{3.12} \]
where $Z_i(u)$ and $Y^{(i)}_{i+1}(u)$ are scalar functions defined by
\[ Z_i(u) = f_{ii}(-u) + \frac{1}{2} (\beta_{ii} + \beta_{11})\, e^{-u} \left( e^{2u} - 1 \right) \tag{3.13} \]
and
\[ Y^{(i)}_{l}(u) = f_{ii}(u) + \frac{1}{2} (\beta_{ll} - \beta_{ii}) \left( e^{2u} - 1 \right). \tag{3.14} \]
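The scalar functions of (3.13) and (3.14) can be checked numerically against the limits quoted in Eq. (C.17). The sketch below assumes the reading $f_{ii}(u) = \beta_{ii}(e^u - 1) + 1$, $Z_i(u) = f_{ii}(-u) + \frac{1}{2}(\beta_{ii} + \beta_{11}) e^{-u}(e^{2u} - 1)$ and $Y^{(i)}_l(u) = f_{ii}(u) + \frac{1}{2}(\beta_{ll} - \beta_{ii})(e^{2u} - 1)$; the Python names and the sample values of u and β are hypothetical.

```python
import math

def f(u, beta):
    """f_ii(u) = beta_ii (e^u - 1) + 1, Eq. (3.11)."""
    return beta * (math.exp(u) - 1.0) + 1.0

def Z(u, beta_ii, beta_11):
    """Z_i(u) as read from Eq. (3.13)."""
    return (f(-u, beta_ii)
            + 0.5 * (beta_ii + beta_11) * math.exp(-u) * (math.exp(2.0 * u) - 1.0))

def Y(u, beta_ii, beta_ll):
    """Y_l^(i)(u) as read from Eq. (3.14)."""
    return f(u, beta_ii) + 0.5 * (beta_ll - beta_ii) * (math.exp(2.0 * u) - 1.0)

u, b = 0.37, 0.8
checks = {
    "Z, beta_11 -> -beta_ii": abs(Z(u, b, -b) - f(-u, b)),
    "Z, beta_11 -> +beta_ii": abs(Z(u, b, b) - f(u, b)),
    "Y, beta_ll -> beta_ii": abs(Y(u, b, b) - f(u, b)),
    "Y, beta_ll -> 2 - beta_ii": abs(Y(u, b, 2.0 - b) - math.exp(2.0 * u) * f(-u, b)),
}
```

All four residuals vanish to rounding accuracy, and each function equals 1 at u = 0, which is consistent with a regular K-matrix.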
For i = 1 and $1 < j \leq n$ we get the n − 1 remaining solutions
\[ K^{I}_{1j} = f_{11}(u) E_{11} + e^{2u} f_{11}(-u) E_{jj} + h_{1j}(u) E_{1j} + h_{j1}(u) E_{j1} + Y^{(1)}_{2}(u) \sum_{l=2}^{j-1} E_{ll} + X_{j+1}(u) \sum_{l=j+1}^{n} E_{ll}, \tag{3.15} \]
Due to the property (3.4) we have found three type-II general solutions for each $A_{n-1}^{(1)}$
model:
\[ \text{Type-IIa} = K^{II}_{1\,2p}, \quad \text{Type-IIb} = K^{II}_{1\,2p+1}, \quad \text{Type-IIc} = K^{II}_{2n}, \quad p = 1, 2, \ldots, \left[ \frac{n}{2} \right], \tag{3.22} \]
where $[\frac{n}{2}]$ is the integer part of $\frac{n}{2}$.
For n odd, the type-IIa solution is
\[ K^{II}_{1\,2p} = f_{11}(u) \sum_{j=1}^{p} E_{jj} + e^{2u} f_{11}(-u) \sum_{j=p+1}^{[\frac{n}{2}]+p} E_{jj} + e^{2u} f_{11}(u) \sum_{j=[\frac{n}{2}]+p+2}^{n} E_{jj} \]
\[ K^{II}_{2n} = Z_2(u) E_{11} + e^{2u} f_{22}(-u) \sum_{j=2}^{[\frac{n}{2}]+1} E_{jj} + e^{2u} f_{22}(u) \sum_{j=[\frac{n}{2}]+2}^{n} E_{jj} + \sum_{\substack{i+j=2 \bmod n \\ i \neq j}} h_{ij}(u) E_{ij}, \tag{3.27} \]
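\[ K^{II}_{1\,2p} = f_{11}(u) \sum_{j=1}^{p} E_{jj} + e^{2u} f_{11}(-u) \sum_{j=p+1}^{\frac{n}{2}+p} E_{jj} + e^{2u} f_{11}(u) \sum_{j=\frac{n}{2}+p+1}^{n} E_{jj} + \Bigg( \sum_{\substack{i+j=1+2p \\ i \neq j}} + \; e^{u} \sum_{\substack{i+j=1+2p \bmod n \\ i \neq j}} \Bigg) h_{ij}(u) E_{ij}, \tag{3.29} \]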
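\[ + \, X_{\frac{n}{2}+p+1}(u)\, E_{\frac{n}{2}+p+1\, \frac{n}{2}+p+1} + e^{2u} f_{11}(u) \sum_{j=\frac{n}{2}+p+2}^{n} E_{jj} + \Bigg( \sum_{\substack{i+j=2+2p \\ i \neq j}} + \; e^{u} \sum_{\substack{i+j=2+2p \bmod n \\ i \neq j}} \Bigg) h_{ij}(u) E_{ij}, \tag{3.31} \]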
4. Conclusion
The absence of an algebraic method such as the quantum group approach leads us to
believe that a direct computation from the reflection equations should be the starting point
for obtaining their classification.
After a systematic study of the functional equations we find that there are two types of
solutions for the $A_{n-1}^{(1)}$ models. We call type-I the K-matrices with three free parameters
and n + 2 non-zero matrix elements. These solutions were denoted by $K^{I}_{ij}$ to emphasize
the non-zero element out of the diagonal and its symmetric, which results in n(n − 1)/2
reflection K-matrices.
The type-II solutions are more interesting because they have many free parameters. The
$A_{n-1}^{(1)}$ models for n odd, in addition to the type-I solutions, have n type-II solutions with
2n − 1 non-zero matrix elements and $2 + [\frac{n}{2}]$ free parameters. It turns out that for n even
we also have n type-II solutions, but half of them are K-matrices with 2n non-zero matrix
elements and $2 + \frac{n}{2}$ free parameters, while the remaining ones have 2(n − 1) non-zero
matrix elements with $1 + \frac{n}{2}$ free parameters.
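The counting stated above can be tabulated with a small helper. This is only a bookkeeping sketch of the general statements in the text (function names are hypothetical); the appendices treat the smallest values of n as special cases, so the tally is meant for the generic pattern rather than for every n.

```python
def type_one_summary(n):
    """(number of solutions, non-zero elements, free parameters) for type-I."""
    return (n * (n - 1) // 2, n + 2, 3)

def type_two_summary(n):
    """List of (number of solutions, non-zero elements, free parameters)."""
    if n % 2 == 1:
        # n odd: n solutions with 2n - 1 entries and 2 + [n/2] parameters
        return [(n, 2 * n - 1, 2 + n // 2)]
    # n even: two families of n/2 solutions each
    return [(n // 2, 2 * n, 2 + n // 2),
            (n // 2, 2 * (n - 1), 1 + n // 2)]

for n in (4, 5, 6, 7):
    print(n, type_one_summary(n), type_two_summary(n))
```

For n = 5 this gives five type-II solutions with nine non-zero elements and four free parameters, matching the $A_4^{(1)}$ case written out in the appendix. For n = 4 the second family has 2(n − 1) = 6 = n + 2 non-zero elements, the same number as a type-I solution.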
The corresponding $K_+(u)$ are obtained from the isomorphism (2.8). Out of this
classification we have the trivial solution ($K_- = 1$, $K_+ = M$) for these models. Thus we
end our discussion of the reflection matrices for the vertex models associated with the
$A_{n-1}^{(1)}$ affine Lie algebra.
To complete the classification for all non-exceptional Lie algebras we still have to
consider the vertex models associated with the $B_n^{(1)}$, $C_n^{(1)}$, $D_n^{(1)}$, $A_{2n}^{(2)}$ and $A_{2n-1}^{(2)}$ Lie
algebras.
Acknowledgement
This work was supported in part by Fundação de Amparo à Pesquisa do Estado de São
Paulo-FAPESP-Brasil and by Conselho Nacional de Desenvolvimento-CNPq-Brasil.
This is a very special case among the $A_{n-1}^{(1)}$ models. We note that there is only one
general K-matrix with 4 non-zero matrix elements [21,22]. From the type-IIa solutions
(3.29) or from the type-I solutions (3.15) it is the $K_{12}$ matrix
\[ K^{I}_{12} = \begin{pmatrix} f_{11}(u) & h_{12}(u) \\ h_{21}(u) & e^{2u} f_{11}(-u) \end{pmatrix}. \]
Although there is no constraint equation in this case, the regular condition (2.10) fixes
the number of free parameters at three, in agreement with all type-I reflection K-matrices.
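A minimal numerical sketch of this 2 × 2 solution is given below. It assumes that the regular condition referred to as (2.10) means that the K-matrix reduces to the identity at u = 0, and it uses the functions of (3.11) read as $f_{11}(u) = \beta_{11}(e^u - 1) + 1$ and $h_{ij}(u) = \frac{1}{2}\beta_{ij}(e^{2u} - 1)$; the Python names and the parameter values are hypothetical.

```python
import math

def f(u, beta):
    """f_11(u) = beta_11 (e^u - 1) + 1."""
    return beta * (math.exp(u) - 1.0) + 1.0

def h(u, beta):
    """h_ij(u) = (1/2) beta_ij (e^{2u} - 1)."""
    return 0.5 * beta * (math.exp(2.0 * u) - 1.0)

def k12_two_by_two(u, beta_11, beta_12, beta_21):
    """K^I_12 for n = 2, depending on three free parameters."""
    return [[f(u, beta_11), h(u, beta_12)],
            [h(u, beta_21), math.exp(2.0 * u) * f(-u, beta_11)]]

K0 = k12_two_by_two(0.0, 0.3, 1.1, -0.4)   # value at u = 0
K1 = k12_two_by_two(0.5, 0.3, 1.1, -0.4)   # a generic point
```

At u = 0 the matrix is the 2 × 2 identity for any choice of the three parameters, while at generic u all four entries are non-zero.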
This is also a special case because it has only the type-I solutions $K^{I}_{12}$, $K^{I}_{13}$ and $K^{I}_{23}$.
From (3.15) we have
\[ K^{I}_{12} = f_{11}(u) E_{11} + e^{2u} f_{11}(-u) E_{22} + h_{12}(u) E_{12} + h_{21}(u) E_{21} + X_3(u) E_{33} = \begin{pmatrix} f_{11}(u) & h_{12}(u) & 0 \\ h_{21}(u) & e^{2u} f_{11}(-u) & 0 \\ 0 & 0 & X_3(u) \end{pmatrix}, \tag{B.1} \]
with the four parameters $\beta_{11}$, $\beta_{12}$, $\beta_{21}$ and $\beta_{33}$ satisfying the constraint equation
and
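\[ K^{I}_{23} = f_{22}(u) E_{22} + e^{2u} f_{22}(-u) E_{33} + h_{23}(u) E_{23} + h_{32}(u) E_{32} + Z_2(u) E_{11} = \begin{pmatrix} Z_2(u) & 0 & 0 \\ 0 & f_{22}(u) & h_{23}(u) \\ 0 & h_{32}(u) & e^{2u} f_{22}(-u) \end{pmatrix}, \tag{B.9} \]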
with the constraint
Due to the constraint equations these reflection K-matrices have only three free
parameters and the corresponding diagonal solutions have only one free parameter.
Here we observe that only four of these diagonal solutions are independent because
$D_6 = D_4$ and $D_3 = D_1$. Here we also note that the solutions $D_1$ and $D_4$ are the diagonal
solutions derived for the first time in [21] and $K^{I}_{13}$ is the non-diagonal solution derived in
[20].
In a certain sense these solutions are particular because they do not reveal all the properties
shared by the regular $A_{n-1}^{(1)}$ reflection K-matrices for n odd. Before we consider the next
odd case, let us consider the case n = 4.
Appendix C. The $A_3^{(1)}$ reflection K-matrices
In this case the structure of the general solution begins to appear but it is still particular
because half of the type-II solutions are type-I solutions.
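The $K_{1j}$ matrices for the type-I solutions are given by (3.15). For $K^{I}_{12}$ we get
\[ K^{I}_{12} = \begin{pmatrix} f_{11}(u) & h_{12}(u) & 0 & 0 \\ h_{21}(u) & e^{2u} f_{11}(-u) & 0 & 0 \\ 0 & 0 & X_3(u) & 0 \\ 0 & 0 & 0 & X_3(u) \end{pmatrix} \tag{C.1} \]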
with the constraint
β13 β31 = (β44 + β11 − 2)(β44 − β11 − 2) = (β22 + β11 − 2)(β22 − β11 ). (C.4)
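The $K^{I}_{14}$ matrix is
\[ K^{I}_{14} = \begin{pmatrix} f_{11}(u) & 0 & 0 & h_{14}(u) \\ 0 & Y^{(1)}_{2}(u) & 0 & 0 \\ 0 & 0 & Y^{(1)}_{2}(u) & 0 \\ h_{41}(u) & 0 & 0 & e^{2u} f_{11}(-u) \end{pmatrix} \tag{C.5} \]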
with
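The remaining type-I K-matrices are given by (3.12). For $K^{I}_{23}$ we get
\[ K^{I}_{23} = \begin{pmatrix} Z_2(u) & 0 & 0 & 0 \\ 0 & f_{22}(u) & h_{23}(u) & 0 \\ 0 & h_{32}(u) & e^{2u} f_{22}(-u) & 0 \\ 0 & 0 & 0 & e^{2u} Z_2(u) \end{pmatrix} \tag{C.7} \]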
with constraint
β24 β42 = (β11 + β22 )(β11 − β22 ) = (β33 + β22 − 2)(β33 − β22 ) (C.10)
and finally for $K^{I}_{34}$
\[ K^{I}_{34} = \begin{pmatrix} Z_3(u) & 0 & 0 & 0 \\ 0 & Z_3(u) & 0 & 0 \\ 0 & 0 & f_{33}(u) & h_{34}(u) \\ 0 & 0 & h_{43}(u) & e^{2u} f_{33}(-u) \end{pmatrix} \tag{C.11} \]
with
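\[ \lim_{\beta_{ll} \to -\beta_{ii} + 2} Y^{(i)}_{l}(u) = e^{2u} f_{ii}(-u), \quad \lim_{\beta_{ll} \to \beta_{ii}} Y^{(i)}_{l}(u) = f_{ii}(u), \quad \lim_{\beta_{11} \to -\beta_{ii}} Z_i(u) = f_{ii}(-u), \quad \lim_{\beta_{11} \to \beta_{ii}} Z_i(u) = f_{ii}(u). \tag{C.17} \]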
we can see that only half of these diagonal solutions are independent:
\[ \begin{aligned} D_1 &= \mathrm{diag}\big( f(u), e^{2u} f(-u), e^{2u} f(-u), e^{2u} f(-u) \big), \\ D_2 &= \mathrm{diag}\big( f(u), e^{2u} f(-u), e^{2u} f(u), e^{2u} f(u) \big), \\ D_3 &= \mathrm{diag}\big( f(u), f(u), e^{2u} f(-u), e^{2u} f(-u) \big), \\ D_4 &= \mathrm{diag}\big( f(u), e^{2u} f(-u), e^{2u} f(-u), e^{2u} f(u) \big), \\ D_5 &= \mathrm{diag}\big( f(u), f(u), e^{2u} f(-u), e^{2u} f(u) \big), \\ D_6 &= \mathrm{diag}\big( f(u), f(u), f(u), e^{2u} f(-u) \big), \\ D_7 &= \mathrm{diag}\big( f(-u), f(u), e^{2u} f(-u), e^{2u} f(-u) \big), \\ D_8 &= \mathrm{diag}\big( f(-u), f(u), f(u), e^{2u} f(-u) \big), \\ D_9 &= \mathrm{diag}\big( f(-u), f(-u), f(u), e^{2u} f(-u) \big), \end{aligned} \tag{C.18} \]
where we have used a compact notation for the functions $f_{ii}(u)$
\[ f_{ii}(u) \equiv f(u) = \beta \left( e^{u} - 1 \right) + 1 \tag{C.19} \]
where β is the free parameter.
Here we will only write explicitly the five type-II solutions and their constraint equations
for the $A_4^{(1)}$ model. They have nine non-zero matrix elements and four free parameters:
\[ K^{II}_{12} = \begin{pmatrix} f_{11}(u) & h_{12}(u) & 0 & 0 & 0 \\ h_{21}(u) & e^{2u} f_{11}(-u) & 0 & 0 & 0 \\ 0 & 0 & e^{2u} f_{11}(-u) & 0 & e^{u} h_{35}(u) \\ 0 & 0 & 0 & X_4(u) & 0 \\ 0 & 0 & e^{u} h_{53}(u) & 0 & e^{2u} f_{11}(u) \end{pmatrix}, \]
\[ \beta_{12} \beta_{21} = \beta_{35} \beta_{53} = (\beta_{44} + \beta_{11} - 2)(\beta_{44} - \beta_{11} - 2), \tag{D.1} \]
References
[1] R.J. Baxter, Exactly Solved Models in Statistical Mechanics, Academic Press, 1982.
[2] V.E. Korepin, A.G. Izergin, N.M. Bogoliubov, Quantum Inverse Scattering Method and Correlation
Functions, Cambridge Univ. Press, 1992.
[3] E. Abdalla, M.C.B. Abdalla, K. Rothe, Nonperturbative Methods in Two-Dimensional Quantum Field
Theory, 2 edn., World Scientific, Singapore, 2001.
[4] P.P. Kulish, N.Yu. Reshetikhin, J. Sov. Math. 23 (1983) 2435.
[5] M. Jimbo, Commun. Math. Phys. 102 (1986) 537.
[6] I.V. Cherednik, Theor. Math. Phys. 61 (1984) 977.
[7] E.K. Sklyanin, J. Phys. A 21 (1988) 2375.
[8] L. Mezincescu, R.I. Nepomechie, Int. J. Mod. Phys. A 13 (1998) 2747.
[9] R.I. Nepomechie, Boundary quantum group generators of type A, hep-th/0204181.
[10] P. Bowcock, E. Corrigan, P.E. Dorey, R.H. Rietdijk, Nucl. Phys. B 445 (1995) 469.
[11] M.T. Batchelor, V. Fridkin, A. Kuniba, Y.K. Zhou, Phys. Lett. B 376 (1996) 266.
[12] A. Lima-Santos, Nucl. Phys. B 612 (2001) 446.
[13] A. Lima-Santos, Nucl. Phys. B 558 (1999) 637.
[14] I.V. Cherednik, Theor. Math. Phys. 43 (1980) 356.
[15] O. Babelon, H.J. de Vega, C.M. Viallet, Nucl. Phys. B 180 (1981) 542.
[16] N.Yu. Reshetikhin, M. Semenov-Tian-Shansky, Lett. Math. Phys. 19 (1990) 133.
[17] L. Mezincescu, R.I. Nepomechie, Int. J. Mod. Phys. A 7 (1992) 5657.
[18] G.M. Gandenberger, New non-diagonal solutions to the $a_n^{(1)}$ boundary Yang–Baxter equation,
hep-th/9911178.
[19] G.W. Delius, N.J. Mackay, Quantum group symmetry in sine-Gordon and affine Toda field theories on the
half-line, hep-th/0112023.
[20] J. Abad, M. Rios, Phys. Lett. B 352 (1995) 92.
[21] H.J. de Vega, A. González-Ruiz, J. Phys. A 26 (1993) 519.
[22] S. Ghoshal, A.B. Zamolodchikov, Int. J. Mod. Phys. A 9 (1994) 3841.
Nuclear Physics B 644 (2002) 585–587
www.elsevier.com/locate/npe