Professional Documents
Culture Documents
$SSOLFDWLRQV
9RO
(GLWRU
,*RKEHUJ
(GLWRULDO2IILFH
6FKRRORI0DWKHPDWLFDO
6FLHQFHV
7HO$YLY8QLYHUVLW\
5DPDW$YLY,VUDHO
(GLWRULDO%RDUG
'$OSD\%HHU6KHYD
-$UD]\+DLID
$$W]PRQ7HO$YLY
-$%DOO%ODFNVEXUJ
$%HQ$UW]L7HO$YLY
+%HUFRYLFL%ORRPLQJWRQ
$%|WWFKHU&KHPQLW]
.&ODQFH\$WKHQV86$
/$&REXUQ%XIIDOR
5(&XUWR,RZD&LW\
.5'DYLGVRQ:DWHUORR2QWDULR
5*'RXJODV&ROOHJH6WDWLRQ
$'LMNVPD*URQLQJHQ
+'\P5HKRYRW
3$)XKUPDQQ%HHU6KHYD
%*UDPVFK0DLQ]
-$+HOWRQ/D-ROOD
0$.DDVKRHN$PVWHUGDP
+*.DSHU$UJRQQH
6XEVHULHV
$GYDQFHVLQ3DUWLDO'LIIHUHQWLDO
(TXDWLRQV
67.XURGD7RN\R
3/DQFDVWHU&DOJDU\
/(/HUHU+DLID
%0LW\DJLQ&ROXPEXV
92OVKHYVN\6WRUUV
03XWLQDU6DQWD%DUEDUD
/5RGPDQ:LOOLDPVEXUJ
-5RYQ\DN&KDUORWWHVYLOOH
'(6DUDVRQ%HUNHOH\
,06SLWNRYVN\:LOOLDPVEXUJ
67UHLO3URYLGHQFH
+8SPHLHU0DUEXUJ
609HUGX\Q/XQHO/HLGHQ
'9RLFXOHVFX%HUNHOH\
';LD1DVKYLOOH
'<DIDHY5HQQHV
+RQRUDU\DQG$GYLVRU\
(GLWRULDO%RDUG
&)RLDV%ORRPLQJWRQ
35+DOPRV6DQWD&ODUD
7.DLODWK6WDQIRUG
+/DQJHU9LHQQD
3'/D[1HZ<RUN
06/LYVLF%HHU6KHYD
+:LGRP6DQWD&UX]
0LFKDHO'HPXWK
7HFKQLVFKH8QLYHUVLWlW&ODXVWKDO
*HUPDQ\
6XEVHULHVHGLWRUV
%HUW:ROIJDQJ6FKXO]H
8QLYHUVLWlW3RWVGDP
*HUPDQ\
-HURPH$*ROGVWHLQ
7KH8QLYHUVLW\RI0HPSKLV71
86$
6HUJLR$OEHYHULR
8QLYHUVLWlW%RQQ
*HUPDQ\
1REX\XNL7RVH
.HLR8QLYHUVLW\<RNRKDPD
-DSDQ
6\PSOHFWLF*HRPHWU\
/FX5SFOETJOUIF
DQG
5IFPSZPG)ZQFSCPMJD
4XDQWXP0HFKDQLFV
&RVBUJPOT
0DXULFHGH*RVVRQ
.JDIBFM3FJTTJH
#FSU8PMGHBOH4DIVM[F
&EJUPST
"EWBODFTJO
1BSUJBM
%JGGFSFOUJBM
&RVBUJPOT
#JSLIVTFS7FSMBH
#BTFM #PTUPO#FSMJO
&EJUPST
-AURICE DE 'OSSON
.JDIBFM3FJTTJH
)NSTITUT FR -ATHEMATIK
56#FSHBLBEFNJF'SFJCFSH
5NIVERSITT 0OTSDAM
'BLVMUUGS.BUIFNBUJLVOE*OGPSNBUJL
0OSTFACH
*OTUJUVUGS"OHFXBOEUF"OBMZTJT
0OTSDAM
'SFJCFSH
'ERMANY
*HUPDQ\
E
MAIL MAURICEDEGOSSON
GMAILCOM
FNBJMSFJTTJH!NBUIUVGSFJCFSHEF
#FSU8PMGHBOH4DIVM[F
*OTUJUVUGS.BUIFNBUJL
6OJWFSTJUU1PUTEBN
1PUTEBN
(FSNBOZ
FNBJMTDIVM[F!NBUIVOJQPUTEBNEF
.BUIFNBUJDT4VCKFDU$MBTTJGJDBUJPO/
/
%1'[[6[[
"$*1DBUBMPHVFSFDPSEGPSUIJTCPPLJTBWBJMBCMFGSPNUIF
-JCSBSZPG$POHSFTT
8BTIJOHUPO%$
64"
#JCMJPHSBQIJDJOGPSNBUJPOQVCMJTIFECZ%JF%FVUTDIF#JCMJPUIFL
%JF%FVUTDIF#JCMJPUIFLMJTUTUIJTQVCMJDBUJPOJOUIF%FVUTDIF/BUJPOBMCJCMJPHSBGJFEFUBJMFECJCMJPHSBQIJDEBUBJTBWBJMBCMF
JOUIF*OUFSOFUBUIUUQEOCEECEF
*4#/#JSLIVTFS7FSMBH
#BTFMo#PTUPOo#FSMJO
5IJTXPSLJTTVCKFDUUPDPQZSJHIU"MMSJHIUTBSFSFTFSWFE
XIFUIFSUIFXIPMFPSQBSUPGUIFNBUFSJBMJTDPODFSOFE
TQFDJGJDBMMZUIFSJHIUTPGUSBOTMBUJPO
SFQSJOUJOH
SFVTFPGJMMVTUSBUJPOT
SFDJUBUJPO
CSPBEDBTUJOH
SFQSPEVDUJPOPONJDSPGJMNT
PSJOPUIFSXBZT
BOETUPSBHFJOEBUBCBOLT'PSBOZLJOEPGVTFQFSNJTTJPOPGUIFDPQZSJHIUPXOFSNVTUCFPCUBJOFE
#JSLIVTFS7FSMBH
10#PY
$)#BTFM
4XJU[FSMBOE
.FNCFSPGUIF#FSUFMTNBOO4QSJOHFS1VCMJTIJOH(SPVQ
1SJOUFEPOBDJEGSFFQBQFSQSPEVDFEGSPNDIMPSJOFGSFFQVMQ5$'e
$PWFSEFTJHO)FJO[)JMUCSVOOFS
#BTFM
1SJOUFEJO(FSNBOZ
*4#/
*4#/
XXXCJSLIBVTFSDI
Contents
Preface . . . . . . . .
Introduction . . . .
Organization . . .
Prerequisites . . .
Bibliography . . .
Acknowledgements
About the Author
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
xiii
xiii
xiv
xv
xv
xv
xvi
Notation . . . . . . . . . . . . . . . . . . . . .
Number sets . . . . . . . . . . . . . . . . .
Classical matrix groups . . . . . . . . . .
Vector calculus . . . . . . . . . . . . . . .
Function spaces and multi-index notation
Combinatorial notation . . . . . . . . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
xvii
xvii
xvii
xviii
xix
xx
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
3
3
7
9
11
11
12
15
15
18
viii
Contents
1.4
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
19
20
23
24
. . . .
. . . .
. . . .
. . . .
. . . .
Sp(n)
. . . .
. . . .
. . . .
. . . .
. . . .
. . . .
. . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
27
29
33
36
38
38
42
45
50
51
55
58
61
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
66
66
70
72
73
74
75
79
80
84
85
87
91
.
.
.
.
.
. 95
. 95
. 96
. 98
. 100
Contents
4.3
ix
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
100
101
102
104
104
106
108
112
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
123
124
125
129
130
130
133
136
139
139
141
143
145
145
147
149
152
152
154
156
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
160
160
163
166
167
.
.
.
.
.
Contents
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
170
174
174
179
183
185
186
189
192
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
196
196
199
204
208
208
209
212
214
215
220
222
223
227
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
238
238
239
239
240
240
244
244
246
248
6.3
6.4
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
Contents
xi
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
252
253
253
255
258
261
262
263
265
266
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
272
272
279
282
282
285
288
291
291
296
298
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
304
304
307
309
310
310
314
317
317
319
321
324
324
325
327
331
8.4
8.5
xii
Contents
Preface
Introduction
We have been experiencing since the 1970s a process of symplectization of Science especially since it has been realized that symplectic geometry is the natural
language of both classical mechanics in its Hamiltonian formulation, and of its
renement, quantum mechanics. The purpose of this book is to provide core material in the symplectic treatment of quantum mechanics, in both its semi-classical
and in its full-blown operator-theoretical formulation, with a special emphasis
on so-called phase-space techniques. It is also intended to be a work of reference
for the reading of more advanced texts in the rapidly expanding areas of symplectic geometry and topology, where the prerequisites are too often assumed to be
well-known by the reader. This book will therefore be useful for both pure mathematicians and mathematical physicists. My dearest wish is that the somewhat
novel presentation of some well-established topics (for example the uncertainty
principle and Schr
odingers equation) will perhaps shed some new light on the
fascinating subject of quantization and may open new perspectives for future interdisciplinary research.
I have tried to present a balanced account of topics playing a central role
in the symplectization of quantum mechanics but of course this book in great
part represents my own tastes. Some important topics are lacking (or are only
alluded to): for instance Kirillov theory, coadjoint orbits, or spectral theory. We
will moreover almost exclusively be working in at symplectic space: the slight
loss in generality is, from my point of view, compensated by the fact that simple
things are not hidden behind complicated intrinsic notation.
The reader will nd the style in which this book has been written very traditional: I have been following the classical pattern DenitionLemmaTheorem
Corollary. Some readers will inevitably nd this way of writing medieval practice;
it is still, in my opinion, the best way to make a mathematical text easily accessible. Since this book is intended to be used in graduate courses as well as for
reference, we have included in the text carefully chosen exercises to enhance the
understanding of the concepts that are introduced. Some of these exercises should
be viewed as useful complements: the reader is encouraged to spend some time on
them (solutions of selected exercises are given at the end of the book).
xiv
Preface
Organization
This book consists of three parts which can to a large extent be read independently
of each other:
The rst part (partly based on my monograph [61] Maslov Classes, Metaplectic Representation and Lagrangian Quantization) is joint work with Serge
de Gosson. It is intended to be a rigorous presentation of the basics of symplectic geometry (Chapters I and II) and of its multiply-oriented extension
q-symplectic geometry (Chapter III); complete proofs are given, and some
new results are presented. The basic tool for the understanding and study of
q-symplectic geometry is the ArnoldLerayMaslov (For short: ALM ) index
and its topological and combinatorial properties. In Chapter IV we study and
extend to the degenerate case diverse Lagrangian and symplectic intersection
indices with a special emphasis on the ConleyZehnder index; the latter not
only plays an important role in the modern study of periodic Hamiltonian
orbits, but is also essential in the theory of the metaplectic group and its
applications to the study of quantum systems with chaotic classical counterpart. A remarkable fact is that all these intersection indices are easily reduced
to one mathematical object, the ALM index.
In the second part we begin by studying thoroughly the notion of phase of
a Lagrangian manifold (Chapter V). That notion, together with the properties of the ALM index dened in Chapter III, allows us to view quantized
Lagrangian manifolds as those on which one can dene a generalized notion
of wave function. Another attractive feature of the phase of a Lagrangian
manifold is that it allows a geometric denition of the HeisenbergWeyl operators, and hence of the Heisenberg group and algebra; these are studied
in detail in Chapter VI, together with the related notions of Weyl operator
and WignerMoyal transform, which are the keys to quantum mechanics in
phase space. In Chapter VII we study the metaplectic group and the associated Maslov indices, which are, surprisingly enough, related to the ALM
index in a crucial way.
In the third and last part we begin by giving a rigorous geometrical treatment
of the uncertainty principle of quantum mechanics Chapter VIII). We show
that this principle can be expressed in terms of the notion of symplectic
capacity, which is closely related to Williamsons diagonalization theorem
in the linear case, and to Gromovs non-squeezing theorem in the general
case. We thereafter (Chapter IX) expose in detail the machinery of Hilbert
Schmidt and trace-class operators, which allows us a rigorous mathematical
treatment of the fundamental notion of density matrix. Finally, in Chapter X
(and this is denitely one of the novelties compared to traditional texts) we
extend the Weyl pseudo-dierential calculus to phase space, using Stone and
von Neumanns theorem on the irreducible representations of the Heisenberg
group. This allows us to derive by a rigorous method a Schr
odinger equation
Preface
xv
in phase space whose solutions are related to those of the usual Schrodinger
equation by a wave-packet transform generalizing the physicists Bargmann
transform.
For the readers convenience I have reviewed some classical topics in a series of
Appendices at the end of the book (Classical Lie Groups, Covering Spaces, PseudoDierential Operators, Elementary Probability Theory). I hope that this arrangement will help the beginner concentrate on the main text with a minimum of
distraction and without being sidetracked by technicalities.
Prerequisites
The mathematical prerequisites for reading with prot most of this book are relatively modest: solid undergraduate courses in linear algebra and advanced calculus, as well as the most basic notions of topology and functional analysis (Hilbert
spaces, distribution theory) in principle suce. Since we will be dealing with problems having their origin in some parts of modern physics, some familiarity with
the basics of classical and quantum mechanics is of course helpful.
Bibliography
A few words about the bibliography: I have done my very best to give an accurate
and comprehensive list of references. Inevitably, there are omissions; I apologize in
advance for these. Some of these omissions are due to sheer ignorance; on the other
hand this book exposes techniques and results from diverse elds of mathematics
(and mathematical physics); to give a complete account of all contributions is an
impossible task!
Enough said. The book and the work! is now yours.
Acknowledgements
While the main part of this book was written during my exile in Sweden, in
the lovely little city of Karlskrona, it has benetted from many visits to various
institutions in Europe, Japan, USA, and Brazil. It is my duty and great pleasure
to thank Professor B.-W. Schulze (Potsdam) for extremely valuable comments
and constructive criticisms, and for his kind hospitality: the rst part of this book
originates in research having been done in the magnicent environment of the
Neues Palais in the Sans-Souci park where the Department of Mathematics of
the University of Potsdam is located.
xvi
Preface
I would like to express very special thanks to Ernst Binz (Mannheim) and
Kenro Furutani (Tokyo) for having read and commented upon a preliminary version of this book. Their advice and encouragements have been instrumental. (I am,
needless to say, soley responsible for remaining errors or misconceptions!)
I also wish to acknowledge debt and to express my warmest thanks for useful discussions, criticisms, and encouragements to (in alphabetical order): Bernhelm Booss-Bavnbek (Roskilde), Matthias Brack (Regensburg), Serge de Gosson (Vaxj
o), J
urgen Eichhorn (Greifswald), Karl Gustafson (Boulder), Katarina
Habermann (G
ottingen), Basil Hiley (London), Ronnie Lee (Yale), Robert Littlejohn (Berkeley), James Meiss (Boulder), Walter Schempp (Siegen), Gijs Tuynman
(Lille).
The third part of this book (the one devoted to quantum mechanics in phase
space) has been deeply inuenced by the work of Robert Littlejohn on semiclassical mechanics: may he nd here my gratitude for having opened my eyes
on the use of Weyl calculus and symplectic methods in quantum mechanics!
I had the opportunity to expose parts of this book during both a graduate
course I gave as an Ulam Visiting Professor at the University of Colorado at Boulder during the fall term 2001, and during a course as First Faculty in Residence
at the same University during the summer session 2003 (I take the opportunity to
thank J. Meiss for his kind hospitality and for having provided me with extremely
congenial environment).
This work has been partially supported by the State of S
ao Paulo Research
Foundation (Fapesp) grant 2005/517667; a great thanks to P. Piccione (University
of S
ao Paulo) for his hospitality and fascinating conversations about Morse theory
and the Maslov index.
About the Author
Maurice A. de Gosson is scientically aliated with Professor B.-W. Schulzes
Partielle Dierentialgleichungen und Komplexe Analysis research group of the
University of Potsdam. He has held longer visiting positions at several prestigious
Universities, among them Yale University, the University of Colorado at Boulder,
at the Science University of Tokyo, at the Universite Paul Sabatier of Toulouse,
at Potsdam University, at Utrecht University, and at the University of S
ao Paulo.
He regularly visits his alma mater, the University of Paris 6. De Gosson has done
extensive research in the area of symplectic geometry, the combinatorial theory of
the Maslov index, the theory of the metaplectic group, partial dierential equations, and mathematical physics. He obtained his Ph.D. in 1978 on the subject of
microlocal analysis at the University of Nice under the supervision of J. Chazarain,
and his Habilitation in 1992 at the University Pierre et Marie Curie in Paris, under
the mentorship of the late Jean Leray (Coll`ege de France).
Notation
Our notation is as standard (and simple) as conicting usages in the mathematical
and physical literature allow.
Number sets
R (resp. C) is the set of all real (resp. complex) numbers. We denote respectively by
N = {1, 2, . . . } and Z = {. . . , 2, 1, 0, +1, +2, . . . }
the set of positive integers and the set of all integers.
xviii
Notation
Vector calculus
The elements of Rm should be viewed as column vectors
x1
..
x= .
xn
when displayed; for typographic simplicity we will usually write x = (x1 , . . . , xn )
in the text. The Euclidean scalar product , and norm | | on Rm are dened by
x, y = xT y =
xj yj , |x| =
x, x.
j=1
x or ... .
xm
f1
f1
1
x
x1
x2
m
f2 f2
f2
x1 x2 xm
Df = .
(2)
..
..
..
.
.
.
.
.
.
fm
fn
fm
x
x1
x2
m
Let y = f (x); we will indierently use the notation
Df (x) ,
y (y1 , . . . , ym )
,
x (x1 , . . . , xm )
for the Jacobian matrix. If f is invertible, the inverse function theorem says that
D(f 1 )(y) = [Df (x)]1 .
(3)
Notation
xix
2f
2f
2 f
2
x1 x2
x1 xn
x2 f1
2f
2 f
2
x2 xn
x2
x2 x1
2
D f (x) = .
.
(4)
..
..
..
.
.
.
.
.
2f
2f
2 f
xn x1
xn x2
x2
n
Notice that the Jacobian and Hessian matrices are related by the formula
D(f )(x) = D2 f (x).
(5)
12 Mx,x
= M Bx, x e
12 Mx,x
(6)
(7)
for all x Rn .
(8)
xx
Notation
m/2
1
2
e y,x F f (y)dm y.
Combinatorial notation
Let X be a set, k an integer 0, (G, +) an Abelian group. By denition, a (Gvalued) k-cochain on X (or just cochain when the context is clear) is a mapping
c : X k+1 G.
To every k-cochain one associates its coboundary: it is the (k + 1)-cochain c
dened by
k+1
c(x0 , . . . , xk+1 ) =
(1)j c(x0 , . . . , x
j , . . . , xk+1 ),
(9)
j=0
Part I
Symplectic Geometry
Chapter 1
1.1.1 Generalities
Let E be a real vector space; its generic vector will be denoted by z. A symplectic
form (or: skew-product ) on E is a mapping : E E R which is
pj xj pj xj
(1.1)
(z, z ) =
j=1
(1.2)
(1.4)
R
two
symplectic
forms and by
z
z
(z1 , z2 ; z1 , z2 ) = (z1 , z1 ) + (z2 , z2 ),
(z1 , z2 ; z1 , z2 ) = (z1 , z1 ) (z2 , z2 ).
2n
2n
The corresponding symplectic spaces are denoted (R2n
z Rz , ) and (Rz
2n
Rz , ).
where 0 and i stand for the nn zero and identity matrices. The matrix J is called
the standard symplectic matrix. Alternatively, we can view R2n
z as the complex
vector space Cn by identifying (x, p) with x + ip. The standard symplectic form
can with this convention be written as
(z, z ) = Im z, z Cn
(1.6)
where , Cn is the usual (Hermitian) scalar product on Cn . Notice that multiplication of x + ip by i then corresponds to multiplication of (x, p) by J. These
considerations lead to the following denition:
Denition 1.4. A complex structure on a vector space E is any linear isomorphism j : E E such that j 2 = I.
Since det(j 2 ) = (1)dim E > 0 we must have dim E = 2n so that only evendimensional vector spaces can have a complex structure. It turns out that the
existence of a complex structure on E identies it with the standard symplectic
space as the following exercises show:
Exercise 1.5. Let j be a complex structure on a vector space E. Show that one
can dene on E a structure of complex vector space E C by setting
( + i)z = + jz.
(1.7)
[Hint: the condition j 2 = I is necessary to ensure that u(u z) = (uu )z for all
u, u C].
Exercise 1.6. Let f be a linear mapping E C E C such that f j = j f .
(i) Assume that the matrix of f in a basis B C = {e1 , . . . , en } of E C is U = A+iB
(A, B real nn matrices). Viewing f : E C E C as a real endomorphism f R
shows that the matrix of f R in the basis B = {j(e1 ), . . . , j(en ); e1 , . . . , en ; }
is then
A B
U=
.
B A
(ii) Show that det f R = | det f |2 .
Here is an example of a nonstandard symplectic structure. Let B be an
antisymmetric (real) n n matrix: B T = B and set
B I
JB =
.
I 0
We have
2
JB
B2 I
B
B
I
2
= I if B = 0. We can however associate to JB the symplectic form B
hence JB
dened by
B (z, z ) = (z, z ) Bx, x ;
(1.8)
(1.9)
Taking for granted the existence of symplectic bases we can prove that all
symplectic vector spaces of the same nite dimension 2n are isomorphic: let (E, )
and (E , ) have symplectic bases {ei , fj ; 1 i, j n} and {ei , fj ; 1 i, j n}
and consider the linear isomorphism s : E E dened by the conditions s(ei ) =
ei and s(fi ) = fi for 1 i n. That s is symplectic is clear since we have
(s(ei ), s(ej )) = (ei , ej ) = 0,
(s(fi ), s(fj )) = (fi , fj ) = 0,
(s(fj ), s(ei )) = (fj , ei ) = ij
for 1 i, j n.
The set of all symplectic automorphisms (E, ) (E, ) form a group
Sp(E, ) the symplectic group of (E, ) for the composition law. Indeed,
the identity is obviously symplectic, and so is the composition of two symplectic
transformations. If (s(z), s(z )) = (z, z ) then, replacing z and z by s1 (z) and
s1 (z ), we have (z, z ) = (s1 (z), s1 (z )) so that s1 is symplectic as well.
It turns out that all symplectic groups corresponding to symplectic spaces of
the same dimension are isomorphic:
Proposition 1.9. Let (E, ) and (E , ) be two symplectic spaces of the same
dimension 2n. The symplectic groups Sp(E, ) and Sp(E , ) are isomorphic.
Proof. Let be a symplectic isomorphism (E, ) (E , ) and dene a mapping f : Sp(E, ) Sp(E , ) by f (s) = f s f 1 . Clearly f (ss ) =
f (s)(s ) hence f is a group monomorphism. The condition f (S) = I (the
identity in Sp(E , )) is equivalent to f s = f and hence to s = I (the identity
in Sp(E, )); f is thus injective. It is also surjective because s = f 1 s f is a
solution of the equation f s f 1 = s .
These results show that it is no restriction to study nite-dimensional symplectic geometry by singling out one particular symplectic space, for instance the
standard symplectic space, or its variants. This will be done in the next section.
Note that if B1 = {e1i , f1j ; 1 i, j n1 } and B2 = {e2k , f2 ; 1 k, n2 }
are symplectic bases of (E1 , 1 ) and (E2 , 2 ), then
B = {e1i e2k , f1j f2 : 1 i, j n1 + n2 }
is a symplectic basis of (E1 E2 , 1 2 ).
Exercise 1.10. Construct explicitly an isomorphism
2n
2n
2n
(R2n
z Rz , ) (Rz Rz , )
Exercise 1.11. Denote by Sp (2n) and Sp (2n) the symplectic groups of (R2n
z
2n
2n
,
)
and
(R
R
,
),
respectively.
Show
that
R2n
z
z
z
where I
I
0
where the bij are (usually) chosen to be C functions, and the wedge product
dxi dxj is dened by
dxi dxj = dxi dxj dxj dxi
where dxi : Rm R is the projection on the ith coordinate. Returning to R2n
z ,
we have
dpj dxj (z, z ) = pj xj pj xj
hence we can identify the standard symplectic form with the dierential 2-form
dp dx =
dpj dxj = d(
pj dxj );
j=1
j=1
pj dxj
j=1
10
never vanishes on M and is thus a volume form on M . We will call the exterior
power n the symplectic volume form. When M is the standard symplectic space,
then the usual volume form on R2n
z ,
Vol2n = (dp1 dpn ) (dx1 dxn ),
is related to the symplectic volume form by
Vol2n = (1)n(n1)/2
1 n
.
n!
(1.10)
1.2. Skew-Orthogonality
11
1.2 Skew-Orthogonality
All vectors in a symplectic space (E, ) are skew-orthogonal (one also says isotropic) in view of the antisymmetry of a symplectic form: (z, z ) = 0 for all
z E. The notion of length therefore does not make sense in symplectic geometry
(whereas the notion of area does). The notion skew orthogonality is extremely
interesting in the sense that it allows the denition of subspaces of a symplectic
space having special properties. We begin by dening the notion of a symplectic
basis, which is the equivalent of an orthonormal basis in Euclidean geometry.
M N = N M and (M ) M .
It is traditional to classify subsets M of a symplectic space (E, ) as follows:
M E is said to be:
(1.11)
(M1 + M2 ) = M1 M2 , (M1 M2 ) = M1 + M2 .
(1.12)
12
k
ker((ej ))
j=1
Thus z (M1 + M2 ) and M1 M2 (M1 + M2 ) .
Let M be a linear subspace of (E, ) such that M M = {0}; in the
terminology introduced above M is a symplectic subset of E.
Exercise 1.14. If M M = {0}, then (M, |M ) and (M , |M ) are complementary symplectic spaces of (E, ):
(E, ) = (M M , |M |M ).
(1.13)
(1.14)
1.2. Skew-Orthogonality
13
k ek +
i ei +
j ej = 0
iA
iA
jB
i (fk , ei ) +
j (fk , ej ) = k = 0
jB
14
The following exercise shows that the symplectic form is essentially the standard symplectic form in any symplectic basis:
Exercise 1.17. Show that if xj , pj , xj , pj are the coordinates of z, z in any given
symplectic basis, then (z, z ) takes the standard form
(z, z ) =
pj xj pj xj .
(1.15)
j=1
15
The canonical basis is trivially an orthosymplectic basis. It is easy to construct orthosymplectic bases starting from an arbitrary set of vectors {e1 , . . . , en }
satisfying the conditions (ei , ej ) = 0: let be the vector space (Lagrangian
plane) spanned by these vectors; using the classical GramSchmidt orthonormalization process we can construct an orthonormal basis {e1 , . . . , en } of . Dene
now f1 = Je1 , . . . , fn = Jen . The vectors fi are orthogonal to the vectors ej
and are mutually orthogonal because J is a rotation; in addition
(fi , fj ) = (ei , ej ) = 0 , (fi , ej ) = ei , ej = ij ,
hence the basis
B = {e1 , . . . , en } {f1 , . . . , fn }
is both orthogonal and symplectic.
We leave it to the reader as an exercise to generalize this construction to any
set
{e1 , . . . , ek } {f1 , . . . , fm }
of normed pairwise orthogonal vectors satisfying in addition the symplectic conditions (fi , fj ) = (ei , ej ) = 0 and (fi , ej ) = ij .
16
Denition 1.20. The set of all Lagrangian planes in a symplectic space (E, ) is denoted by Lag(E, ) and is called the Lagrangian Grassmannian of (E, ). When
(E, ) is the standard symplectic space (R2n
z , ) the Lagrangian Grassmannian is
denoted by Lag(n), and we will use the notation
X = Rnx 0
and P = 0 Rnp .
X and P are called the horizontal and vertical Lagrangian planes in (R2n
z , ).
Other common notation for the Lagrangian Grassmannian is (E, ) or
(n, R).
Example 1.21. Suppose n = 1; Lag(1) consists of all straight lines passing through
the origin in the symplectic plane (R2n
z , det).
When n > 1 the Lagrangian Grassmannian is a proper subset of the set of
all n-dimensional planes of (R2n
z , ).
Let us study the equation of a Lagrangian plane in the standard symplectic
space.
In what follows we work in an arbitrary symplectic basis
B = {e1 , . . . , en } {f1 , . . . , fn }
of the standard symplectic space; the corresponding coordinates are denoted by x
and p.
Proposition 1.22. Let be an n-dimensional linear subspace of the standard
symplectic space (R2n
z , ).
(i) is a Lagrangian plane if and only if it can be represented by an equation
Xx + P p = 0 with rank(X, P ) = n and XP T = P X T .
(1.16)
17
(z, z ) = 0 for all vectors z, z of that space implies that we must have XP T =
P XT .
(ii) It is clear from (i) that p = M x represents a Lagrangian plane ; it is also clear
that this plane is transversal to Span{f1 , . . . , fn }. The converse follows from the
observation that if : Xx + P p = 0 is transversal to Span{f1 , . . . , fn }, then P
must invertible; the property follows taking M = P 1 X which is symmetric
since XP T = P X T .
Two Lagrangian planes are said to be transversal if = 0; since dim =
dim = 12 dim E this is equivalent to saying that E = . For instance the
horizontal and vertical Lagrangian planes X = Rnx 0 and P = 0 Rnp are
obviously transversal in (R2n
z , ).
(1.18)
A
C
B
D
(1.19)
Proof. (i) The condition is necessary, taking for B the canonical symplectic bases.
If conversely is the graph of a symmetric matrix M , then it is immediate to
check that (z; z ) = 0 for all z .
(ii) The intersection P consists of all (x, p) which satisfy both conditions
Xx + P p = 0 and x = 0. It follows that
(x, p) P P p = 0
and hence (1.18).
(iii) Formula (1.19) follows from the trivial equivalence
(x, p) SP P Bp = 0.
18
19
bases of (R2n
z , ). There exists U O(2n) such that U (ei ) = ei and U (fi ) = fi
where fi = Jei , fi = Jei . We have U Sp(n) hence
20
1 if a > 0,
0 if a = 0,
(X , a , P ) =
+1 if a < 0
(1.20)
21
The line : x cos + p sin = 0 is here identied with = 2 and []anti is the
symmetrized integer part function, that is
[s]anti = 12 ([s] [s]) , [s] = k if k s < k + 1
(for k an integer).
Let us now return to the general case. The following properties of the signature are immediate:
is Sp(E, )-invariant: for every S Sp(E, ) and , , (Lag(E, ))3
we have
(1.22)
(S, S , S ) = (, , )
(because (Sz, Sz ) = (z, z ), and so on);
is totally antisymmetric: for any permutation p of the set {1, 2, 3} we have
(p(1) , p(2) , p(3) ) = (1)sgn(p) (1 , 2 , 3 )
(1.23)
22
(1.24)
23
(1.25)
0 = 0 = 0 = 0
z=
xi ei +
pji fj ;
i=1
j=1
0
R = M M
M M
M M
0
M M
M M
M M .
0
24
The quadratic form R has the same signature (, , ) as RV for any invertible
matrix V ; choosing
0 I I
V = I 0 I
I I 0
the matrix of the quadratic form R V is
0
M M
1 T
0
M M
2 V RV =
0
0
0
M M
and hence
(, , ) = sign(M M ) + sign(M M ) + sign(M M ).
(1.26)
Formula (1.25) has the following combinatorial interpretation. Let us view the
WallKashiwara index as a 2-cochain on Lag(E, ) and denote by the coboundary operator (see (9) in the Notation section in the Preface). Then, by denition
of ,
(1 , 2 , 3 ) = (1 , 2 , 3 ) (2 , 3 , 4 ) + (1 , 3 , 4 ) (1 , 2 , 4 )
so that Theorem 1.32 can be restated in concise form as:
= 0, that is: is 2-cocycle on Lag(n).
25
such a way that their intersections do not change, the signature will remain unaltered. The same property remains true in higher dimensions. To prove this, we
need the following elementary lemma which describes the kernel of the quadratic
form dening ; in order to avoid a deluge of multiple primes in the proof
we slightly change notation and write (1 , 2 , 3 ) instead of (, , ) so that the
dening quadratic form becomes
Q(z1 , z2 , z3 ) = (z1 , z2 ) + (z2 , z3 ) + (z3 , z1 )
with (z1 , z2 , z3 ) 1 2 3 .
Recall that the kernel of a quadratic form is the kernel of the matrix of the
associated bilinear form.
Lemma 1.33. Let Ker Q be the kernel of the quadratic form Q. There exists an
isomorphism
(1.27)
Ker Q
= (1 2 ) (2 3 ) (3 1 ).
Proof. Let A be the matrix of Q. The condition u Ker Q is equivalent to
v T Au = 0 for all v 1 2 3 .
(1.28)
(1.29)
that is, to
Q(z1 + z1 , z2 + z2 , z3 + z3 ) Q(z1 + z2 , z3 )
(1.30)
26
It follows that
(z1 + z2 z3 = z1 + (z2 z3 ) = (z1 z3 ) + z2 1 2 ,
(z2 + z3 z1 = z2 + (z3 z1 ) = (z2 z1 ) + z3 2 3 ,
(z3 + z1 z2 = z3 + (z1 z2 ) = (z3 z2 ) + z1 3 1 .
The restriction to Ker Q of the automorphism of z3 dened by (z1 , z2 , z3 )
(z1 , z2 , z3 ) with
z1 = z1 + z2 z3 , z2 = z2 + z3 z1 , z3 = z3 + z1 z2
is thus an isomorphism of Ker Q onto (1 2 ) (2 3 ) (3 1 ).
We are now in position to prove the main topological property of the signature. Let us introduce the following notation: if k, k , k are three integers such
that 0 k, k , k n, we dene a subset Lag3k,k ,k (n) of Lag3 (n) by
Lag3k,k ,k (E,) = {(, , ) : dim( ) = k,dim( ) = k ,dim( ) = k }.
Proposition 1.34. The WallKashiwara signature has the following properties:
(i) It is locally constant on each set Lag3k,k ,k (E, );
(ii) If the triple (, , ) move continuously in Lag3 (E, ) in such a way that
dim , dim = k , and dim do not change, then (, , )
remains constant;
(iii) We have
(, , ) = n + dim + dim + dim mod 2.
(1.31)
Proof. Properties (i) and (ii) are equivalent since Lag(E, ) (and hence also
Lag3 (E, )) is connected. Property (iii) implies (i), and hence (ii). It is therefore sucient to prove the congruence (1.31). Let A be the matrix in the proof of
Lemma 1.33. In view of the isomorphism statement (1.27) we have
rank(A) = 3n (dim + dim + dim ).
Let ( + , ) be the signature of A, so that (by denition) (, , ) = + ;
since rank(A) = + + we thus have
(, , ) rank(A) mod 2
hence (1.31).
(1.32)
Chapter 2
28
Denition 2.2. Let (E, ), (E , ) be two symplectic vector spaces. A dieomorphism f : (E, ) (E , ) is called a symplectomorphism1 if the dierential
dz f is a linear symplectic mapping E E for every z E. [In the physical
literature one often says canonical transformation in place of symplectomorphism.]
It follows from the chain rule that the composition g f of two symplectomorphisms f : (E, ) (E , ) and g : (E , ) (E , ) is a symplectomorphism (E, ) (E , ). When
(E, ) = (E , ) = (R2n
z , )
a dieomorphism f of (R2n
z , ) is a symplectomorphism if and only if its Jacobian
matrix (calculated in any symplectic basis) is in Sp(n). Summarizing:
f is a symplectomorphism of (R2n
z , )
29
(2.2)
(2.3)
30
(2.5)
symmetric, and AD BC = I.
(2.6)
AB , CD
It follows from the second of these sets of conditions that the inverse of S is
T
D
B T
S 1 =
.
(2.7)
C T
AT
Example 2.5. Here are three useful classes of symplectic matrices: if P and L are,
respectively, a symmetric and an invertible n n matrix, we set
1
I
0
P I
L
0
, UP =
, ML =
.
(2.8)
VP =
P I
I 0
0
LT
The matrices VP are sometimes called symplectic shears.
It turns out as we shall prove later on that both sets
G = {J} {VP : P Sym(n, R)} {ML : L GL(n, R)}
and
31
A1 0 B1 0
0 A2 0 B2
S1 S2 =
C1 0 D1 0 Sp(n1 + n2 ).
0 C2 0 D2
(2.9)
32
(2.10)
Property (i) will follow, since for real matrices, eigenvalues appear in conjugate
pairs. Since S T JS = J we have S = J(S T )1 J and hence
PS () = det(J(S T )1 J I)
= det((S T )1 J + I)
= det(J + S)
= 2n det(S 1 I)
which is precisely (2.10).
(j)
33
Notice that an immediate consequence of this result is that if 1 is an eigenvalue of S Sp(n), then its multiplicity is necessarily even.
We will see in the next subsection (Proposition 2.13) that any positivedenite symmetric symplectic matrix can be diagonalized using an orthogonal
transformation which is at the same time symplectic.
(2.11)
of Sp(n).
Proof. In view of (2.7) the inverse of U = (u), u U(n, C), is
T
A
BT
= UT ,
U 1 =
B T AT
hence U O(2n, R) which proves the inclusion U(n) Sp(n) O(2n, R). Suppose
conversely that U Sp(n) O(2n, R). Then
JU = (U T )1 J = U J
which implies that U U(n) so that Sp(n) O(2n, R) U(n).
We will loosely talk about U(n) as of the unitary group when there is no
risk of confusion; notice that it immediately follows from conditions (2.5), (2.6)
that we have the equivalences:
A + iB U(n)
(2.12)
A B symmetric and AT A + B T B = I
(2.13)
AB
(2.14)
34
of course these conditions are just the same thing as the conditions
(A + iB) (A + iB) = (A + iB)(A + iB) = I
for the matrix A + iB to be unitary.
In particular, taking B = 0 we see the matrices
A 0
R=
with AAT = AT A = I
0 A
(2.15)
also are symplectic, and form a subgroup O(n) of U(n) which we identify with the
rotation group O(n, R). We thus have the chain of inclusions
O(n) U(n) Sp(n).
Let us end this subsection by mentioning that it is sometimes useful to identify elements of Sp(n) with complex symplectic matrices. The group Sp(n, C) is
dened, in analogy with Sp(n), by the condition
Sp(n, C) = {M M (2n, C) : M T JM = J}.
Let now K be the complex matrix
1 I
K=
2 iI
iI
U(2n, C)
I
(2.16)
35
Proof. Since S > 0 its eigenvalues occur in pairs (, 1/) of positive numbers
(Proposition 2.11); if 1 n are n eigenvalues then 1/1 , . . . , 1/n are the
other n eigenvalues. Let now U be an orthogonal matrix such that S = U T U
with, being given by (2.16). We claim that U U(n). It suces to show that
we can write U in the form
A B
U=
B A
with
AB T = B T A , AAT + BB T = I.
(2.17)
A
B
=
,
B
A
A
U = [e1 , . . . , en ; Je1 , . . . , Jen ] =
B
B
.
A
36
(2.18)
The one-parameter family (St ) thus dened is a group: St St = St+t and St1 =
St .
The following result gives an explicit description of the elements of the symplectic algebra:
Proposition 2.15. Let X be a real 2n 2n matrix.
(i) We have
X sp(n) XJ + JX T = 0 X T J + JX = 0.
(ii) Equivalently, sp(n) consists of all block-matrices X such that
U
V
X=
with V = V T and W = W T .
W U T
(2.19)
(2.20)
(t)k
(JXJ)k = JetX J.
exp(tJXJ) =
k!
k=0
It follows that
37
Remark 2.16. The symmetric matrices of order n forming an n(n + 1)/2-dimensional vector space (2.20) implies, by dimension count, that sp(n) has dimension
n(2n + 1). Since Sp(n) is connected we consequently have
dim Sp(n) = dim sp(n) = n(2n + 1).
(2.21)
1
exp X = cos I + sin X if > 0,
1
exp X = cosh I +
sinh X if < 0.
Tr(exp X) = 2 cos 2
and in the case < 0,
Tr(exp X) = 2 cosh 1.
38
However:
Proposition 2.18. A symplectic matrix S is symmetric positive denite if and only
if S = exp X with X sp(n) and X = X T . The mapping exp is a dieomorphism
sp(n) Sym(2n, R) Sp(n) Sym+ (2n, R)
(Sym+ (2n, R) is the set of positive denite symmetric matrices).
Proof. If X sp(n) and X = X T , then S is both symplectic and symmetric
positive denite. Assume conversely that S is symplectic and symmetric positive denite. The exponential mapping is a dieomorphism exp : Sym(2n, R)
Sym+ (2n, R) (the positive denite symmetric matrices) hence there exists a unique
X Sym(2n, R) such that S = exp X. Let us show that X sp(n). Since S = S T
we have SJS = J and hence S = JS 1 J. Because J = J 1 it follows that
exp X = J 1 (exp(X))J = exp(J 1 XJ)
and J 1 XJ being symmetric, we conclude that X = J 1 XJ, that is JX = XJ,
showing that X sp(n).
We will rene the result above in Subsection 2.2.1, Proposition 2.22, by using
the Cartan decomposition theorem. This will in particular allow us to obtain a
precise formula for calculating X in terms of the logarithm of S = exp X.
39
(2.24)
It turns out that formula (2.23) denes a bona de logarithm for matrices
having no eigenvalues on the negative half-axis:
Proposition 2.21. Assume that M has no eigenvalues 0. Then
(i) Log M dened by (2.23) exists;
(ii) We have
eLog M = M , (Log M )T = Log M T
and also
Log M 1 = Log M , Log(AM A1 ) = A(Log M )A1
(2.25)
f (M ) =
( M )1 ( 1)1 I d.
We have
(I M )1 = (( )I N )1 =
k0
j=0
( )k+1 N j
40
and hence
f (M ) =
1
1
Id +
k0
k=1
( )k1 d N k
k0
(1)k+1 1 k
( N )
k
k=1
= (Log )I +
k0
(1)k+1 1
( M I)k .
k
k=1
Direct substitution of the sum in the right-hand side in the power series for the
exponential yields the matrix 1 M ; hence exp f (M ) = M which we set out to
prove. Formulae (2.25) readily follow from denition (2.23) of the logarithm, and
so does the equality (Log M )T = Log M T .
The following consequence of Proposition 2.21, which renes Proposition
2.18, will be instrumental in the proof of the symplectic version of Cartans decomposition theorem:
Proposition 2.22. If S Sp(n) is positive denite, then X = Log S belongs to the
symplectic Lie algebra sp(n). That is, for every S Sp(n) Sym+ (2n, R) (the set
of symmetric positive denite symplectic matrices) we have
S = eLog S
Log S sp(n).
Let us rene the results above by using Cartan decomposition theorem from
the theory of Lie groups (see Appendix A):
Proposition 2.23. Every S Sp(n) can be written S = U eX where U U(n) and
X = 12 Log(S T S) is in sp(n) Sym(2n, R).
41
Proof. The symplectic matrix S T S has no negative eigenvalues, hence its logarithm Log(S T S) exists and is in sp(n) in view of Proposition 2.22; it is moreover
obviously symmetric. It follows that X sp(n) and hence eX and R are both in
Sp(n). Since we also have U O(2n) in view of Cartans theorem, the proposition
follows since we have Sp(n) O(2n) = U(n).
A rst consequence of the results above is that the symplectic group Sp(n)
is contractible to its subgroup U(n) (which, by the way, gives a new proof of the
fact that Sp(n) is connected):
Corollary 2.24.
(i) The standard symplectic group Sp(n) can be retracted to the unitary group
U(n).
(ii) The set Sp(n) Sym+ (2n, R) is contractible to a point.
Proof. (i) Let t S(t), 0 t 1, be a loop in Sp(n); in view of Proposition 2.23
we can write S(t) = U (t)eX(t) where U (t) U(n) and X(t) = 12 Log(S T (t)S(t)).
Since t S(t) is continuous, so is t X(t) and hence also t U (t). Consider
now the continuous mapping h : [0, 1] [0, 1] Sp(n) dened by
42
will
facand
role
(2.26)
Taking into account Exercise 2.28 above, it follows from Proposition 2.29 that S
can always be factored as
I 0 LT
X Y
0
S=
P I
0 L1 Y X
43
(2.27)
T 1/2
T 1/2
L = (AA + BB )
X + iY = (AA + BB )
(ii) Equivalently:
S=
L
Q
0
L1
X
Y
(2.28)
(2.29)
(A + iB).
Y
X
(2.30)
(2.31)
(2.32)
Proof. Part (ii) of the corollary immediately follows from the formulae (2.27)
(2.27). Let us prove (i). Expanding the matrix product in the right-hand side of
(2.27) we see that we must have A = LX and B = LY . These conditions, together
with the fact that X + iY is unitary, imply that
AAT + BB T = L(XX T + Y Y T )LT = LLT ,
hence det(AAT + BB T ) = 0 (cf. Exercise 2.8). Let us choose L and X + iY as
in formulas (2.29), (2.30). The matrix L is then evidently symmetric and we have
X + iY U(n, C); to prove the corollary it thus suces to show that
I 0
A B X T Y T L1 0
=
(2.33)
0
L
P I
XT
C D YT
where P is given by (2.28); we notice that the matrix P is then automatically
symmetric since the condition
I 0
Sp(n)
P I
is equivalent to P = P T (see the conditions (2.6) characterizing symplectic matrices). Expanding the product on the left-hand side of (2.33) this amounts to
44
(2.34)
L = (A2 + B 2 )1/2 ,
(2.35)
2 1/2
X + iY = (A + B )
(A + iB).
(2.36)
Exercise
2.32. Verify formulae (2.28)(2.30) in the case n = 1, i.e., when S =
a b
with ad bc = 1.
c d
45
(2.37)
x
p (z0 )
is invertible.
A very useful property is that every symplectic matrix is the product of two
free symplectic matrices. This is a particular case of the following very useful
result which will yield a precise factorization result for symplectic matrices, and
is in addition the key to many of the properties of the metaplectic group we will
study later on:
Proposition 2.36. For every (S, 0 ) Sp(n) Lag(n) there exist two matrices
S1 , S2 such that S = S1 S2 and S1 0 0 = S2 0 0 = 0. In particular, choosing
0 = P , every symplectic matrix is the product of two free symplectic matrices.
46
Proof. The second assertion follows from the rst choosing 0 = 0 Rnp . Recall
that Sp(n) acts transitively on the set of all pairs (, ) such that = 0. Choose
transverse to both 0 and S. There exists S1 Sp(n) such that S1 (0 , ) =
( , S0 ), that is S1 0 = and S0 = S1 . Since Sp(n) acts transitively on Lag(n)
we can nd S2 such that = S2 0 and hence S0 = S1 S2 0 . It follows that there
exists S Sp(n) such that S 0 = 0 and S = S1 S2 S . Set S2 = S2 S ; then
S = S1 S2 and we have
S1 0 0 = 0 = 0 , S2 0 0 = S2 0 0 = 0 = 0.
Another interesting property of free symplectic matrices are that they can
be generated by a function W dened on Rnx Rnx , in the sense that:
(x, p) = S(x , p ) p = x W (x, x ) and p = x W (x, x ).
Suppose that
A
S=
C
B
D
(2.39)
(2.40)
1
2
P x, x Lx, x +
1
2
Qx , x
(2.42)
(2.43)
47
A
=
C
DT
C T
B
D
(2.44)
B T
AT
(2.45)
(2.46)
A
C
B
.
D
(x0 , p0 ) = (B, Da + )
(2.47)
48
We have
pdx p dx = (pdx p dx ) + (p dx p dx )
= (pdx (p p0 )d(x x0 ) + dW (x , x )
= d(p0 , x + W (x x0 , x ))
which shows that Wz0 is a generating function. Finally, formula (2.47) is obtained
by a direct computation, expanding the quadratic form W (x x0 , x ) in its variables.
Corollary 2.38. Let f = [SW , z0 ] be a free ane symplectic transformation, and
set (x, p) = f (x , p ). The function z0 dened by
z0 (x, x ) = 12 p, x 12 p , x + 12 (z, z0 )
(2.48)
1
2
p0 , x0 .
(2.49)
Proof. Setting (x , p ) = S(x, p), the generating function W satises
W (x , x ) = 12 p , x 12 p , x )
in view of Eulers formula for homogeneous functions. Let z0 be dened by formula (2.48); in view of (2.45) we have
Wz0 (x, x ) z0 (x, x ) = 12 p0 , x 12 p, x0 12 p0 , x0
which is (2.49); this proves the corollary since all generating functions of a symplectic transformation are equal up to an additive constant.
We are going to establish a few factorization results for symplectic matrices.
Recall (Example 2.5) that if P and L are, respectively, a symmetric and an
invertible n n matrix, then
1
I
0
P I
L
0
VP =
, UP =
, ML =
.
(2.50)
P I
I 0
0
LT
Proposition 2.39. If S is a free symplectic matrix (2.37), then
S = VDB 1 MB 1 UB 1 A
(2.51)
S = VDB 1 MB 1 JVB 1 A .
(2.52)
and
49
0
DB 1 A C
1
B A I
I
0
(2.53)
for any matrix (2.37), symplectic or not. If now S is symplectic, then the middle
factor in the right-hand side of (2.53) also is symplectic, since the rst and the
third factors obviously are. Taking the condition ADT BC T = I in (2.6) into
account, we have DB 1 A C = (B T )1 and hence
so that
B
0
S=
B
0
=
0
DB 1 A C
I
DB 1
0
I
B
0
0
(B T )1
0
(B T )1
1
B A I
.
I
0
(2.54)
The factorization (2.51) follows (both DB 1 and B 1 A are symmetric, as a consequence of the relations B T D = DT B and B T A = AT B in (2.5)). Noting that
0
B 1 A I
=
I
0
I
I
0
0
B 1 A I
and
{UP , ML : P = P T , det L = 0}
generates Sp(n).
Proof. Taking 0 = 0 Rn in Proposition 2.36 every S Sp(n) is the product of
two free symplectic matrices. It now suces to apply Proposition 2.39.
50
Let us nally mention that the notion of free generating function extends
without any particular diculty to the case of symplectomorphisms; this will be
useful to us when we discuss HamiltonJacobi theory. Suppose in fact that f is
a free symplectomorphism in some neighborhood U of a phase space point. We
then have dp dx = dp dx and this is equivalent, by Poincares lemma, to the
existence of a function G C (R2n
z ) such that
pdx = p dx + dG(x , p ).
Assume now that Df (z ) is free for z U; then the condition det(x/p ) = 0
implies, by the implicit function theorem, that we can locally solve the equation
x = x(x , p ) in p , so that p = p (x, x ) and hence G(x , p ) is, for (x , p ) U, a
function of x, x only: G(x , p ) = G(x , p (x, x )). Calling this function W :
W (x, x ) = G(x , p (x, x ))
we thus have
pdx = p dx + dW (x, x ) = p dx + x W (x, x )dx + x W (x, x )dx
which requires p = x W (x, x ) and p = x W (x, x ) and f is hence free in U. The
proof of the converse goes along the same lines and is therefore left to the reader.
Since f is a symplectomorphism we have dp dx = dp dx and this is equivalent,
by Poincares lemma, to the existence of a function G C (R2n
z ) such that
pdx = p dx + dG(x , p ).
Assume now that Df (z ) is free for z U; then the condition det(x/p ) = 0
implies, by the implicit function theorem, that we can locally solve the equation
x = x(x , p ) in p , so that p = p (x, x ) and hence G(x , p ) is, for (x , p ) U, a
function of x, x only: G(x , p ) = G(x , p (x, x )). Calling this function W :
W (x, x ) = G(x , p (x, x ))
we thus have
pdx = p dx + dW (x, x ) = p dx + x W (x, x )dx + x W (x, x )dx
which requires p = x W (x, x ) and p = x W (x, x ) and f is hence free in U. The
proof of the converse goes along the same lines and is therefore left to the reader.
51
(2.56)
(2.57)
where J is the standard symplectic matrix. Dening the Hamilton vector eld by
XH = Jz H = (p H, x H)
(2.58)
Lagranges M
ecanique Analytique, Vol. I, pp. 217226 and 267270.
52
which is the same thing as (2.59). This formula is the gate to Hamiltonian mechanics on symplectic manifolds. In fact, formula (2.59) can be rewritten concisely as
iXH + dH = 0
(2.60)
(2.61)
H
of ows only holds when H is time-independent; in general ftH ftH = ft+t
and
H 1
H
H
(ft ) = ft (but of course we still have the identity f0 = I).
For notational and expository simplicity we will implicitly assume (unless
otherwise specied) that for every z0 R2n
z there exists a unique solution t zt
of the system (2.57) passing through z0 at time t = 0. The modications to
diverse statements when global existence (in time or space) does not hold are
rather obvious and are therefore left to the reader.
As we noted in previous subsection the ow of a time-dependent Hamiltonian
vector eld is not a one-parameter group; this fact sometimes leads to technical
complications when one wants to perform certain calculations. For this reason it
is helpful to introduce two (related) notions, those of suspended Hamilton ow
and time-dependent Hamilton ow. We begin by noting that Hamiltons equations
z = Jz H(z, t) can be rewritten as
d
H (z(t), t)
(z(t), t) = X
dt
(2.62)
H = (Jz H, 1) = (p H, x H, 1).
X
(2.63)
where
53
Denition 2.42.
H on the extended phase space
(i) The vector eld X
2n+1
Rz,t
R2n
z Rt
is called the suspended Hamilton vector eld; its ow (ftH ) is called the
suspended Hamilton ow determined by H.
2n
(ii) The two-parameter family of mappings R2n
z Rz dened by the formula
H
H
(ft,t
(z ), t) = ftt (z , t )
(2.64)
H (fH ).
=X
t
(2.65)
(2.66)
(2.67)
Note that it immediately follows from the group properties (2.66) of the
suspended ow that we have:
H
H
H
H
ft,t
= I , ft,t ft ,t = ft,t
H 1
, (ft,t
= ftH ,t
)
(2.68)
H
H
for all times t, t and t . When H does not depend on t we have ft,t
= ftt ; in
H
particular ft,0
= ftH .
Let H be some (possibly time-dependent) Hamiltonian function and ftH =
H
H
ft,0 . We say that ftH is a free symplectomorphism at a point z0 R2n
z if Dft (z0 )
H
H
is a free symplectic matrix. Of course ft is never free at t = 0 since f0 is the
identity. In Proposition 2.44 we will give a necessary and sucient condition for
H
the symplectomorphisms ft,t
to be free. Let us rst prove the following lemma,
the proof of which makes use of the notion of generating function:
2n
Lemma 2.43. The symplectomorphism f : R2n
z Rz is free in a neighborhood
2n
U of z0 Rz if and only if Df (z ) is a free symplectic matrix for z U, that is,
if and only if det(x/p ) = 0.
54
x
(z )
Df (z ) = x
p
(z )
x
x
(z
)
p
p
(z )
p
x
(z ) = 0.
p
55
and hence
x(t) = x0 + (t t0 )p H(z0 , t0 ) + O((t t0 )2 ).
It follows that
x(t)
= (t t0 )Hpp (z0 , t0 ) + O((t t0 )2 ),
p
hence there exists > 0 such that x(t)/p is invertible in [t0 , t0 []t0 , t0 + ]
if and only if Hpp (z0 , t0 ) is invertible; in view of Lemma 2.43 this is equivalent to
saying that ftH is free at z0 .
Example 2.45. The result above applies when the Hamiltonian H is of the physical
type
n
1 2
pj + U (x, t)
H(z, t) =
2m
j
j=1
since we have
1
1
Hpp (z0 , t0 ) = diag[ 2m
, . . . , 2m
].
1
n
In this case ftH is free for small non-zero t near each z0 where it is dened.
(2.69)
H
H
where D2 H(ft,t
(z)) is the Hessian matrix of H calculated at ft,t (z);
H
(ii) We have St,t
(z) Sp(n) for every z and t, t for which it is dened, hence
H
ft,t is a symplectomorphism.
H
H
= ftH and St,t
Proof. (i) It is sucient to consider the case t = 0. Set ft,0
=
St . Taking Hamiltons equation into account the time-derivative of the Jacobian
matrix St (z) is
d H
d
d
St (z) = (DftH (z)) = D
ft (z) ,
dt
dt
dt
56
that is
Using the fact that XH
d
St (z) = D(XH (ftH (z))).
dt
= Jz H together with the chain rule, we have
(2.70)
(2.71)
57
d Y
dt t
(2.72)
(2.73)
which is (2.72).
Remark 2.51. Proposition 2.50 can be restated as follows: set (x , p ) = f (x, p)
and K = H f ; if f is a symplectomorphism then we have the equivalence
x = p K(x , p ) and p = x K(x , p )
(2.74)
Exercise 2.52.
58
1
(pj Aj (x, t))2 + U (x, t)
2m
j
j=1
(2.75)
x(t)
= tHpp (z0 , 0) + O(t2 )
p
(R2n
z , ) is Hamiltonian if there exists a function H C (Rz,t , R) and a number
59
ftH
with
with
(2.76)
(2.77)
Proof. Let us rst prove (2.76). By the product and chain rules we have
d H K
d
d
(f f ) = ( ftH )ftK + (DftH )ftK ftK = XH (ftH ftK ) + (DftH )ftK XK (ftK )
dt t t
dt
dt
and it thus suces to show that
(DftH )ftK XK (ftK ) = XK(ftH )1 (ftK ).
(2.78)
Writing
(DftH )ftK XK (ftK ) = (DftH )((ftH )1 ftH ftK ) XK ((ftH )1 ftH ftK )
the equality (2.78) follows from the transformation formula (2.72) in Proposition
2.50. Formula (2.77) is now an easy consequence of (2.76), noting that (ftH ftH ) is
the ow determined by the Hamiltonian
H 1
H
and hence ftH = fat
. We may thus assume that f = f1H and g = f1K for some
Hamiltonians H and K. Now, using successively (2.76) and (2.77) we have
60
(2.79)
so we are done.
The result above motivates the following denition:
d
Proof. By denition of X we have dt
ft = Xft so that all we have to do is to prove
that X is a (time-dependent) Hamiltonian eld. For this it suces to show that the
contraction iX of the symplectic form with X is an exact dierential one-form, for
then iX = dH where H is given by (2.80). The ft being symplectomorphisms,
they preserve the symplectic form and hence LX = 0. In view of Cartans
homotopy formula we have
LX = iX d + d(iX ) = d(iX ) = 0
so that iX is closed; by Poincares lemma it is also exact.
(ftH )0t1
(ftK )0t1
61
+ p H((t)), p(t)
dt
= p(t),
x(t)
+ x(t),
p(t)
=0
where we have taken into account Hamiltons equations.
Assume now that is a periodic orbit through z0 . We will use the notation
St (z0 ) = DftH (z0 ).
Denition 2.60. Let : t ftH (z0 ) be a periodic orbit with period T . The symplectic matrix ST (z0 ) = DfTH (z0 ) is called a monodromy matrix. The eigenvalues of ST (z0 ) are called the Floquet multipliers of .
The following property is well known in Floquet theory:
Lemma 2.61.
(i) Let ST (z0 ) be the monodromy matrix of the periodic orbit . We have
St+T (z0 ) = St (z0 )ST (z0 )
(2.81)
62
Proof. (i) The mappings ftH form a group, hence, taking into account the equality
fTH (z0 ) = z0 :
H
ft+T
(z0 ) = ftH (fTH (z0 ))
so that by the chain rule,
H
(z0 ) = DftH (fTH (z0 ))DfTH (z0 ),
Dft+T
d H H
f (f (z0 ))
dt t t
t =0
hence ST0 (z0 )XH (z0 ) = XH (z0 ) setting t = T ; XH (z0 ) is thus an eigenvector of
ST (z0 ) with eigenvalue 1; the lemma follows the eigenvalues of a symplectic matrix
1/).
63
One shows, using normal form techniques that when the conditions of the
theorem above are fullled, the monodromy matrix of 0 can be written as
0
T U
ST (z0 ) = S0
0 ) S0
0 S(z
1
Chapter 3
Multi-Oriented Symplectic
Geometry
Multi-oriented symplectic geometry, also called q-symplectic geometry, is a topic
which has not been studied as it deserves in the mathematical literature; see
however Leray [107], Dazord [28], de Gosson [57, 61]; also [54, 55]. The idea is the
following: one begins by observing that since symplectic matrices have determinant
1, the action of Sp(n) on a Lagrangian plane preserves the orientation of that
Lagrangian plane. Thus, ordinary symplectic geometry is not only the study of
the action
Sp(n) Lag(n) Lag(n)
but it is actually the study of the action
Sp(n) Lag2 (n) Lag2 (n)
where Lag2 (n) is the double covering of Lag(n). More generally, q-symplectic geometry will be the study of the action
Spq (n) Lag2q (n) Lag2q (n)
where Spq (n) is the qth order covering of Sp(n) and Lag2q (n) is the 2qth order
covering of Lag(n). In the case q = 2 this action highlights the geometrical role of
the Maslov indices on the metaplectic group, which we will study in Chapter 7.
The study of q-symplectic geometry makes use of an important generalization
of the Maslov index, which we call the ArnoldLerayMaslov index. That index
plays a crucial role in at least two other areas of mathematics and mathematical
physics:
It is instrumental in giving the correct phase shifts through caustics in semiclassical quantization (Leray [107], de Gosson [60, 61, 62, 64]) because it
66
allows one to dene the argument of the square root of a de Rham form on
a Lagrangian manifold;
It allows a simple and elegant calculation of Maslov indices of both Lagrangian and symplectic paths; these indices play an essential role in the
study of spectral ow properties related to the theory of the Morse index (see Piccione and his collaborators [130, 131] and BoossBavnbek and
Wojciechowski [15]).
A + iB
A
B
B
A
identies U(n, C) with a subgroup U(n) of Sp(n); that subgroup consists of all
A B
U=
B A
where A and B satisfy the conditions
AT A + B T B = I , AT B = B T A,
T
AA + BB = I , AB = BA .
(3.1a)
(3.1b)
67
Also recall that the unitary group U(n, C) acts transitively on the Lagrangian
Grassmannian Lag(n) by the law u = U where U U(n) is associated to
u U(n, C).
We denote by W(n, C) the set of all symmetric unitary matrices:
"
!
W(n, C) = w U(n, C) : w = wT
and by W the image of w W(n, C) in U(n) Sp(n). The set of all such matrices
W is denoted by W(n). Applying the conditions (3.1) we thus have
A2 + B 2 = I,
A B
AB = BA,
W =
W(n)
B A
A = AT , B = B T .
Observe that neither W(n, C) or W(n) are groups: the product of two symmetric matrices is not in general symmetric.
Interestingly enough, the set W(n, C) is closed under the operation of taking
square roots:
Lemma 3.1. For every w W(n, C) [respectively W W(n)] there exists u
W(n, C) [resp. U W(n)] such that w = u2 [respectively W = U 2 ].
Proof. Let w = A + iB. The condition ww = I implies that AB = BA. It follows
that the symmetric matrices A and B can be diagonalized simultaneously: there
exists R O(n, R) such that G = RART and H = RBRT are diagonal. Let gj and
hj (1 j n) be the eigenvalues of G and H, respectively. Since A2 + B 2 = I
we have gj2 + h2j = 1 for every j. Choose now real numbers xj , yj such that
x2j yj2 = gj and 2xj yj = hj for 1 j n and let X and Y be the diagonal
matrices whose entries are these numbers xj , yj . Then (X + iY )2 = G + iH, and
u = RT (X + iY ) R is such that u2 = w.
We are now going prove that Lag(n) can be identied with W(n, C) (and
hence with W(n)). Let us begin with a preparatory remark:
Remark 3.2. Suppose that uP = P ; writing u = A + iB this implies B = 0,
and hence u O(n, R). This is immediately seen by noting that the condition
uP = P can be written in matrix form as: for every p there exists p such that
A B 0
0
= .
B A
p
p
Theorem 3.3.
(i) For Lag(n) and u U(n, C) such that = uP the product w = uuT only
depends on and not on the choice of u; the correspondence uuT is
thus the mapping
w() : Lag(n) W(n, C) , w() = w = uuT .
(3.2)
68
(3.3)
as claimed.
The Souriau mapping is a very useful tool when one wants to study transversality properties for Lagrangian planes; for instance
= 0 det(w() w( )) = 0.
(3.4)
(3.6)
69
Proof. Since Sp(n) acts transitively on Lag(n) it suces to consider the case =
P , in which case formula (3.6) reduces to
rank(w() I) = 2(n dim( P )).
T
T
B
0
0
BT
B A
.
A B
It follows that
rank(W () I) = 2 rank B;
The Souriau mapping w() can also be expressed in terms of projection operators on Lagrangian planes. Let be a Lagrangian plane and denote by P the
orthogonal projection in R2n
z on :
P2 = P , Ker(P ) = J , (P )T = P .
We have:
Proposition 3.5. The image W () of w() in U(n) is given by
W () = (I 2P )C
where P is the orthogonal projection in R2n
z
I
on and C =
0
(3.7)
0
is the conI
jugation matrix.
Proof. Since U(n) acts transitively on Lag(n), there exists U U(n) such that
= U P . Writing
A B
U=
B A
it follows from the relations (3.1) that the vector (Ax, Bx) is orthogonal to ; one
immediately checks that the projection operator on has matrix
BB T
AB T
P =
BAT
AAT
and hence
AAT BB T
(I 2P )C =
2BAT
that is:
(I 2P )C =
A
B
B
A
AT
BT
2AB T
,
BB T AAT
B T
= W ().
AT
70
(3.8)
1
2i
W
d(det w)
Z
det w
(3.9)
71
(3.10)
(iii) In fact 1 [Lag(n)] has a generator such that mLag ( r ) = r for every r Z.
Proof. The statement (ii) is an obvious consequence of the statement (i). Let us
prove (i). Since W(n, C) U(n, C) It follows from Lemma 3.6 that
d(det w)
1
Z
2i W det w
for every W 1 [W(n, C)] and that the homomorphism (3.9) is injective. Let us
show that this homomorphism is also surjective; it suces for that to exhibit the
generator in (iii). Writing (x, p) = (x1 , p1 ; . . . ; xn , pn ) the direct sum Lag(1)
Lag(1) (n terms) is a subset of Lag(n). Consider the loop (1) : t e2it ,
0 t 1, in W (1, C) Lag(1). Set now = (1) I2n2 where I2n2 is the
r
I2n2 and
identity in W (n 1, C). We have r = (1)
1
mLag ( ) =
2i
d(e2irt )
=r
e2irt
1
=
2i
d(det w)
det w
(3.11)
72
(t) =
(n)
Proposition 3.9. The family (mLag )nN is the only family of mappings mLag :
Lag(n) Z having the following properties:
(i) Homotopy: two loops and in Lag(n) are homotopic if and only if
(n)
(n)
mLag () = mLag ( );
(ii) Additivity under concatenation: for all loops and in Lag(n) with the
same origin,
mLag ( ) = mLag () + mLag ( );
(n)
(n)
(n)
(n)
(n )
(n )
Lag(n) Z having the same property; then the dierence (Lag )nN has the
(1)
properties (i), (ii), (iv) and (iii) is replaced by Lag ((1) ) = 0. Every loop in
Lag(n) being homotopic to r for some r Z, it follows from the concatenation
(n)
(n)
property (ii) that Lag () = Lag ( r ) = 0.
Remark 3.10. Notice that we did not use in the proof of uniqueness in Proposition 3.9 the dimensional additivity property: properties (i), (ii), and (iii) thus
characterize the Maslov index.
73
(3.13)
74
(3.14)
(n)
Proposition 3.14. The family (mSp )nN is the only family of mappings mSp :
Sp(n) Z having the following properties:
(i) Homotopy: two loops and in Sp(n) are homotopic if and only if mSp () =
(n)
mSp ( );
(ii) Additivity under concatenation: for all loops and in Sp(n) with the same
origin
(n)
(n)
(n)
mSp ( ) = mSp () + mSp ( );
(n)
(n)
(iii) Normalization: the generator of 1 [Sp(n)] has Maslov index mSp () = +1.
(iv) Dimensional additivity: identifying Sp(n1 ) Sp(n2 ) with a subset of Lag(n),
n = n1 + n2 we have
(n)
(n )
(n )
75
(3.15)
The integer Inert(, , ) is the index of inertia of the triple (, , ) (Leray [107]);
it is dened as follows: the conditions
(z, z , z ) , z + z + z = 0
dene three isomorphisms z z , z z , z z whose product is the
identity. It follows that
(z, z ) = (z , z ) = (z , z)
is the value of a quadratic form at z (or z , or z ); these quadratic
forms have the same index of inertia, denoted by Inert(, , ).
Since Lerays index of inertia Inert(, , ) is dened in terms of quadratic
forms which only exist when the transversality conditions (3.15) are satised, it
is not immediately obvious how to extend m( , ) to arbitrary pairs ( , ).
The extension presented in this chapter is due to the author; it rst appeared in
[54] and was then detailed in [57]. (Dazord has constructed in [28] a similar index
using methods from algebraic topology, for a dierent approach see Leray [109]).
The main idea is to use the signature (, , ) instead of Inert(, , ); this idea
probably goes back to Lion and Vergne [111], albeit in a somewhat incomplete
form: see the remarks in de Gosson [54, 55]. For a very detailed study of the ALM
and related indices see the paper [22] by Cappell et al.
The theory of the ArnoldLerayMaslov index which we will call for short
the ALM index is a beautiful generalization of the theory of the Maslov index
of Lagrangian loops. It is a very useful mathematical object, which can be used to
express various other indices: Lagrangian and symplectic path intersection indices,
and, as we will see, the ConleyZehnder index.
76
(3.17)
(3.18)
77
Notice that property (3.18) implies, together with the antisymmetry of the
signature, that if the ALM index exists then it must satisfy
( , ) = ( , )
(3.19)
( , ) ( , ) + ( , ) = (, , ).
It follows that = is such that
( , ) = ( , ) ( , );
(3.20)
( r , r ) = ( , ) + 2(r r )
(3.21)
(3.22)
78
(3.24)
79
1
d(det u)
k =
2i det u
is an isomorphism 1 [U(n, C)]
= (Z, +). Set now
"
!
U (n, C) = (u, ) : u U(n, C), det u = ei
and equip this set with the topology induced by the product U(n, C) R. Dene
a projection : U (n, C) U(n, C) by (U, ) = W , and let the group
1 [U(n, C)] act on U (n, C) by the law
(u, ) = (u, + 2k ).
That action is clearly transitive, hence U (n, C) is the universal covering group
of U(n, C), the group structure being given by
(U, )(U , ) = (U U , + ).
Let us now identify the Maslov bundle with a subset of U (n, C):
Proposition 3.17. The universal covering of Lag(n) W(n, C) is the set
"
!
W (n, C) = (w, ) : w W(n, C), det w = ei
equipped with the topology induced by U (n), together with the projection :
W (n, C) W(n, C) dened by (w, ) = w.
Proof. In view of Theorem 3.7 of Subsection 3.1.2 where the fundamental group
of W(n, C) is identied with (Z, +) it is sucient to check that W (n, C) is
connected because it will then indeed be the universal covering of W(n, C). Let
U (n, C) act on W (n, C) via the law
(u, )(w, ) = (uwuT , + 2).
(3.25)
The stabilizer of (I, 0) in U (n) under this action is the subgroup of U (n)
consisting of all pairs (U, ) such that U U T = I and = 0 (and hence det U = 1);
it can thus be identied with the rotation group SO(n) and hence
W (n, C) = U (n, C)/ SO(n, R).
Since U (n, C) is connected, so is W (n, C).
80
The Maslov bundle Lag (n) is the universal covering of the Lagrangian
Grassmannian. Quite abstractly, it is constructed as follows (the construction is
not specic of Lag (n), it is the way one constructs the universal covering of any
topological space: see the Appendix B). Choose a base point 0 in Lag(n): it is
any xed Lagrangian plane; let be an arbitrary element of Lag(n). Since Lag(n)
is path-connected, there exists at least one continuous path : [0, 1] Lag(n)
going from 0 to : (0) = 0 and (1) = . We say that two such paths and
are homotopic with xed endpoints if one of them can be continuously deformed
into the other while keeping its origin 0 and its endpoint xed. Homotopy with
xed endpoints is an equivalence relation; denote the equivalence class of the path
by . The universal covering of Lag (n) is the set of all the equivalence classes
as ranges over Lag(n); the mapping : Lag (n) Lag(n) which to
associates the endpoint of a path in is called a covering projection. One
shows that it is possible to endow the set Lag (n) with a topology for which it is
both connected and simply connected, and such that every Lag(n) has an open
1
(U ) is the disjoint union of open neighborhoods
neighborhood U such that
(1)
(k)
(k)
1
U , . . . , U , . . . of the points of
(), and the restriction of to each U
(k)
is a homeomorphism U U .
1
+ i Tr log(w(w )1
(3.26)
when = 0;
When has arbitrary dimension, one chooses such that = =
0 and one then calculates ( , ) using the property
( , ) = ( , ) ( , ) + (, , )
(3.27)
(3.28)
more generally
( , ) n + dim( ) mod 2.
81
(3.29)
1
+ i Tr Log(w(w )1 ;
= = = 0;
the formula will then hold in the general case as well in view of (3.27). We are going
to proceed along the lines in [84], p. 126. Since is locally constant on its domain. It
follows that the composed mapping S (S , S ) is (for xed ( , )
such that = 0) is locally constant on Sp (n); since Sp(n) is connected this
mapping is in fact constant so we have (S , S ) = ( , ) and it is
thus sucient to show that
(S , S ) (S , S ) + (S , S ) = (, , )
for some convenient S Sp(n). Since Sp(n) acts transitively on pairs of Lagrangian planes, there exists S Sp(n) such that S(, ) = (P , X ) where
82
(3.30)
1
(2k (2k + n) + i Tr Log I) = 2(k k ) n.
83
1
=
2k ( 12 (q p) + 2r) + i( 12 (q p)i)
= 2(k r) + p q.
(P, , A, ) =
Similarly
(A, , X, ) = (X, , A, )
1
= (2k + n) ( 12 (q p) + 2r) + i Tr Log(iA)
= 2(r k ) n,
hence
(P, , A, ) (P, , X, ) + (A, , X, ) = p q
which ends the proof since p q = (P , A , X ).
84
(3.32)
1
+ 2r 2r + i Tr Log(w(w )1
= ( , ) + 2(r r ).
( r , r ) =
The general case immediately follows using formula (3.27) and the fact that r
and r have projections and on Lag(n).
85
1
(( , ) + n + dim )
2
(3.33)
1
( (, , ) + n + dim(, , ))
2
(3.34)
(3.35)
(ii) Let be the generator of 1 [Lag(n)] whose natural image in Z is +1; then
m( r , r ) = m( , ) + r r
for all (r, r ) Z2 .
(3.36)
86
Proof. Formula (3.35) is equivalent to the property (3.18) of the ALM index.
Formula (3.36) follows from (3.31) in Corollary 3.22.
Remark 3.25. Formula (3.36) shows that the range of the mapping
( , ) m( , )
is all of Z.
Exercise 3.26. Check that Inert(, , ) coincides with Lerays index of inertia
dened at the beginning of Section 3.2 when = = = 0.
Let us state and prove the main result of this subsection:
Theorem 3.27. Let , be an arbitrary element of Lag (n) and dene a mapping
: Lag (n) Lag(n) Z
by the formula
( ) = (, m( , , )) , = Lag ( ).
(i) The mapping is a bijection whose restriction to the subset { : = 0}
of Lag (n) is a homeomorphism onto { : = 0} Z.
(ii) The set of all bijections ( ), form a system of local charts of Lag (n)
whose transitions = 1
are the functions
(, ) = (, + Inert(, , ) m(, , , )).
(3.37)
(, m( , , )) hence
( ) = (, + m( , , ) m( , , ))
which is the same thing as (3.37) in view of formula (3.35).
87
(3.38)
where we have used successively (3.31), (3.50), again (3.31), and nally the
Sp (n)-invariance (3.38) of the ALM index.
Here are a few properties which immediately follow from those of the ALM
index:
In view of property (3.29) (Proposition 3.19) of the ALM index we have
(S ) n dim(S ) mod 2
(3.39)
(3.40)
(3.41)
88
Proposition 3.29.
in Sp (n),
(i) For all S , S
(S S
) = (S ) + (S
) + (S, S )
(3.42)
(3.43)
(3.44)
(S S
, S ).
In view of the cocycle property = of the ALM index the right-hand side
of this equality is equal to
(SS , , S) = (, S, SS ) = (S, S ),
hence (3.42).
Property (ii) immediately follows from the two following observations: the ALM
index is locally constant on
{( , ) : = 0} (Lag(n))2
and the signature (, , ) is locally constant on
{(, , ) : = = = 0} (Lag(n))3 .
(iii) Using again the property = and the Sp (n)-invariance of we have
(S , ) (S , ) + (S , S ) = (S, , ),
(S , S ) (S , ) + (S , ) = (S, S , )
which yields (3.44) subtracting the rst identity from the second.
89
The practical calculation of (S ) does not always require the determination of an ALM index; formula (3.42) can often be used with prot. Here is an
example:
Example 3.30. Assume that n = 1. Let (I) be the homotopy class of the
symplectic path t etJ , 0 t 1, joining I to I in Sp(1). We have (I)2 =
(the generator of 1 [Sp(n)]), hence
((I)2 ) = () = 4.
But (3.42) implies that
((I)2 ) = 2 ((I) ) + (, , ) = 2 ((I) ),
hence ((I) ) = 2.
It turns out that the properties (i) and (ii) of the Maslov index listed in
Proposition 3.29 characterize that index. More precisely:
Proposition 3.31. Assume that : Sp (n) Z is locally constant on {S :
dim(S ) = 0} and satises
(S S
) = (S ) + (S
) + (S, S )
(3.45)
for all S , S
in Sp (n) (S = Sp (S ), S = Sp (S
)). Then = .
Exercise 3.32. Use the uniqueness property above to prove the conjugation formula
((S )1 S S ) = S (S )
where (S )1 S S denotes the homotopy class of the path t (S )1 S(t)S if
S is the homotopy class of a t S(t), 0 t 1 in Sp(n).
Let us mention the following result which will be proven in Chapter 7 (Lemma
7.24) in connection with the study of the metaplectic group: assume that = P
90
(3.46)
1
( (S ) + n + dim(S )),
2
(3.47)
that is
m (S ) = m(S , )
where has projection ( ) = . Since m( , ) Z for all ( , )
(Lag (n))2 it follows that m (S ) Sp (n).
Lag
(3.48)
91
(ii) The set of all bijections ( ) form a system of local charts of Sp (n) whose
transitions 1
are the functions
(S, m) = (S, m + Inert(S , , ) Inert(S , S , ).
Proof. (i) By denition of m (S) the range of consists of all pairs (S, m)
)); then S = S and
with m m (S). Assume that (S, m (S )) = (S , m (S
r
S = S for some r Z (cf. the proof of (i) in Theorem 3.27). In view of
so that is injective.
(3.48) we must have r = 0 and hence S = S
(ii) It is identical to that of the corresponding properties in Theorem 3.27 and is
therefore left to the reader as an exercise.
The theorem above allows us to describe in a precise way the composition
law of the universal covering group Sp (n):
Corollary 3.35. Let Lag(n). Identifying Sp (n) with the subset
{(S, m) : S Sp(n), m m (S)}
of Sp(n) Z the composition law of Sp (n) is given by the formula
(S, m) (S , m ) = (SS , m + m + Inert( , S , SS ).
(3.49)
(3.50)
92
(3.51)
(3.52)
93
(3.53)
Similarly, since
1 [Sp(n)] = {k : k Z}
we have
Spq (n) = Sp(n)/{qk : k Z}.
(3.54)
Let us now identify 1 [Lag(n)] with Z; recalling that the natural homomorphism
1 [Sp(n)] 1 [Lag(n)] is multiplication by 2 in Z (cf. formula (3.50) 1 [Sp(n)]
is then identied with 2Z. This leads us, taking (3.53) and (3.54) into account, to
the identications
Lagq (n) Lag (n)/qZ , Spq (n) Sp (n)/2qZ.
(3.55)
The ALM index on Lagq (n) is now dened as being the function
[]q : (Lagq (n))2 Zq = Z/qZ
given by
if ( , ) (Lag (n))2 covers ((q) , (q) ) Lagq (n))2 ; similarly the Maslov index
relative to Lag(n) on Spq (n) is the function
[ ]2q : Spq (n) Z2q = Z/2qZ
dened by
[ ]2q (S(q) ) = (S ) mod 2q
if S Sp (n) covers S(q) Spq (n).
Exactly as was the case for -symplectic geometry the study of q-symplectic
geometry requires the use of reduced indices:
94
Chapter 4
Intersection Indices
in Lag(n) and Sp(n)
In this chapter we generalize the notion of Maslov index to arbitrary paths (not
just loops) in Lag(n) and Sp(n). We will study two (related) constructions of these
intersection indices: the Lagrangian and symplectic Maslov indices, which
extend the usual notion of Maslov index for loops to arbitrary paths in Lag(n)
and Sp(n), and which are directly related to the notion of spectral ow, and
the ConleyZehnder index, which plays a crucial role in Morse theory, and its
applications to mathematical physics (we will apply the latter to a precise study
of metaplectic operators in Chapter 7).
The results in the two rst sections are taken from de Gosson [63] and de
Gosson and de Gosson [67]. For applications to Morse theory and to functional
analysis, see BoossBavnbek and Furutani [14], Javaloyes and Piccione [130], Nostre Marques et al. [131]
96
97
(4.1)
98
(4.2)
Proof. Suppose that and dene the paths 0 and 1 joining (0) to
(0) and (1) to (1), respectively, by 0 (s) = h(0, 1 s) and 1 (s) = h(1, s)
(0 s 1). Then 1 1 0 is homotopic to a point, and hence, in view of
(L2 ) and (L4 ):
Lag (, ) + Lag (1 , ) + Lag (1 , ) + Lag (0 , ) = 0.
But, in view of (L3 ),
Lag (1 , ) = Lag (0 , ) = 0
and thus
We will not discuss the uniqueness of an index dened by the axioms above;
the interested reader is referred to Serge de Gossons thesis [74] for a study of this
question.
Let us next construct explicitly a Lagrangian intersection index using the
properties of the ALM index studied in the previous chapter.
99
(4.3)
100
that is
Lag (12 , ) + Lag (23 , ) = Lag (13 , )
which we set out to prove. Axiom L3 . Let 12 be a path in the stratum Lag (n; k)
and denote by (t) the equivalence class of 01 12 (t) for 0 t 1. The mapping t (t) being continuous, the composition mapping t ( (t), ) is
locally constant on the interval [0, 1]. It follows that it is constant on that interval
since Lag (n; k) is connected; its value is
( (0), ) = ( (1), )
hence (12 , ) = 0. Axiom L4 . Let 1 [Lag(n), 0 )]. In view of formula (3.32)
in Corollary 3.22 we have the equality
Lag (, ) = (0, , ) (0, , ) = 2m()
101
(4.4)
102
dened by
(, ) Sp ( , )
is an intersection index on Lag(n).
In Theorem 4.5 we expressed a Lagrangian intersection index as the dierence between two values of the ALM index. A similar result holds for symplectic
intersection indices:
Proposition 4.10. Let 12 C(Sp(n)) be a symplectic path joining S1 to S2 in
Sp(n). Let S1, be an arbitrary element of Sp (n) covering S1 and S2, the
homotopy class of 01 12 (01 a representative of S1, ). The function Sp :
C(Sp(n)) Z dened by
Sp (12 , ) = (S2, ) (S1, )
(4.5)
which is (4.5).
Let us illustrate the notions studied above on a simple example.
(4.6)
where sign A(t) is the dierence between the number of eigenvalues > 0 and the
number of eigenvalues < 0 of A(t).
We have the following result, which has been established in a particular
case by Duistermaat [34], and which relates the spectral ow to the notions of
Lagrangian and symplectic intersection indices:
103
Proposition 4.11. Let be the Lagrangian path associated to the family (A(t))0t1
by
(t) = {(x, A(t)x) : x Rn } , 0 t 1.
(4.7)
Then the spectral ow of (A(t))0t1 is given by
SF(A(t))0t1 = Lag (, X )
(4.8)
SF(A(t))0t1 = Sp (, X )
(4.9)
or, equivalently
where A the symplectic path dened by
(t) = VA(t)
I
=
A(t)
0
I
for 0 t 1.
Proof. Formula (4.9) follows immediately from formula (4.8) observing that
I
0 x
x
=
.
A(t) I 0
A(t)x
To prove formula (4.8) we begin by noting that dim((t) P ) = n for 0 t 1,
and hence
(4.10)
Lag (, P ) = 0
in view of the axiom (L3 ) of nullity in the strata. By denition of Lag we have,
with obvious notation,
Lag (, X ) = ((1) , X, ) ((0) , X, ).
In view of the property = of the ALM index,
((t) , X, ) ((t) , P, ) = (X, , P, ) + ((t), X , P )
for 0 t 1, and hence, taking (4.10) into account,
Lag (, X ) = ((1), X , P ) ((0), X , P )
= (P , (1), X ) (P , (1), X ).
In view of formula (1.24) in Corollary 1.31 (Subsection 1.4.1) we have
(P , (t), X ) = sign(A(t))
and formula (4.8) follows.
The result above is rather trivial in the sense that the spectral ow (4.6)
depends only on the extreme values A(1) and A(0). The situation is however
far more complicated in the case of innite-dimensional symplectic spaces and
its analysis requires elaborated functional analytical techniques (see for instance
BoossBavnbek and Furutani [14]).
104
where
detC U = det(A + iB) if U =
A
B
B
.
A
105
(cf. the denition of the Maslov index on Sp(n), formula (3.12)). When t varies
from 0 to 2 the complex number detC U (t) = ei(t) varies from ei(0) = 1 to
ei(2) = 1 so that (2) Z.
Denition 4.12. The mapping iCZ : C (2n, R) Z dened by
iCZ () =
(2)
(4.11)
1
where S
is the homotopy class of the path t St1 ;
(4.12)
iCZ (r S ) = iCZ (S ) + 2r
(4.13)
for every r Z.
The uniqueness of a mapping Sp (n) Z satisfying these properties is actually
rather obvious: suppose iCZ : Sp (n) Z has the same properties and set
= iCZ iCZ . In view of (CZ3 ) we have (r S ) = (S ) for all r Z hence
is dened on Sp (n) = Sp+ (n) Sp (n) so that (S ) = (S) where S = S1 ,
the endpoint of the path t St . Property (CZ2 ) implies that this function
Sp (n) Z is constant on both Sp+ (n) and Sp (n). We next observe that since
det S = 1 we have det(S 1 I) = det(S I) so that S and S 1 always belong
to the same set Sp+ (n) or Sp (n) if det(S I) = 0. Property (CZ1 ) then implies
that must be zero on both Sp+ (n) or Sp (n).
106
(4.14)
(4.15)
(4.16)
107
(4.17)
(4.18)
(4.19)
(4.20)
108
noting that
(S I)1 (S I)1 (S I)(SS I)1
= (S I)1 (SS I S + I)(SS I)1 )
that is
(S I)1 (S I)1 (S I)(SS I)1 = (S I)1 (SS S )(SS I)1
= S (SS I)1 )
we get
MS + M = J( 12 I (S I)(SS I)1 + S (SS I)1 )
= J( 12 I + (SS I)1 )
= MSS
which we set out to prove.
(ii) Formula (4.18) follows from the sequence of equalities
MS 1 = 12 J + J(S 1 I)1
= 12 J JS(S I)1
= 12 J J(S I + I)(S I)1
= 21 J J(S I)1
= MS .
1
((I S) , )
2
(4.21)
109
1
(S )
2
(4.22)
where
is the relative Maslov index on Sp (2n) corresponding to the choice
Lag (2n).
Note that replacing n by 2n in the congruence (3.28) (Proposition 3.19) we
have
1
dim Ker(S I) mod 1.
2
Since the eigenvalue 1 of S has even multiplicity, (S ) is thus always an integer.
The index has the following three important properties; the third is essential
for the calculation of the index of repeated periodic orbits (it clearly shows that
is not in general additive):
(S )
Proposition 4.15.
(i) For all S Sp (n) we have
1
(S
) = (S ) ,
(I ) = 0
(4.23)
(4.24)
(iii) Let S be the homotopy class of a path in Sp(n) joining the identity to
S Sp (n), and let S Sp(n) be in the same connected component Sp (n)
) = (S ) where S
is the homotopy class in Sp(n) of the
as S. Then (S
concatenation of and a path joining S to S in Sp0 (n).
1
Proof. (i) Formulae (4.23) immediately follows from the equality (S
) = (I
1
S ) and the antisymmetry of .
(ii) The second formula (4.24) follows from the rst using (4.23). To prove the
rst formula (4.24) it suces to observe that to the generator of 1 [Sp(n)]
corresponds the generator I of 1 [Sp (2n)]; in view of property (3.41) of the
Maslov indices it follows that
r
(r S ) = 12
((I ) S )
= 12 (
(S ) + 4r)
= (S ) + 2r.
110
(iii) Assume in fact that S and S belong to, say, Sp+ (n). Let S be the homotopy
class of the path , and a path joining S to S in Sp+ (n) (we parametrize both
paths by t [0, 1]). Let t be the restriction of to the interval [0, t ], t t and
S (t ) the homotopy class of the concatenation t . We have det(S(t) I) > 0
(t) = 0 as t varies from 0 to 1. It follows from the
for all t [0, t ], hence S
fact that the is locally constant on the set {S
: S
= 0} (property (ii)
in Proposition 3.29) that the function t (S (t)) is constant, and hence
(S ) = (S (0))
=
(S (1))
=
(S )
The following consequence of the result above shows that the indices and
iCZ coincide on their common domain of denition:
Corollary 4.16. The restriction of the index to Sp (n) is the ConleyZehnder
index:
(S ) = iCZ (S ) if det(S I) = 0.
Proof. The restriction of to Sp (n) satises the properties (CZ1 ), (CZ2 ), and
(CZ3 ) of the ConleyZehnder index listed in 4.3.1; we showed that these proper
ties uniquely characterize iCZ .
Let us prove a formula for the index of the product of two paths:
Proposition 4.17. If S , S
, and S S
are such that det(SI) = 0, det(S I) =
0, and det(SS I) = 0, then
) = (S ) + (S
)+
(S S
1
2
sign(MS + MS )
(4.25)
(4.26)
111
Q(z) = (MS + MS )1 Jz, Jz
and the signature of Q is thus the same as that of the quadratic form
Q (z) = (MS + MS )1 z, z ,
that is sign(MS + MS ) proving formula (4.25). Formula (4.26) follows by a
straightforward induction on the integer r.
It is not immediately obvious that the index of the periodic orbit is
independent of the choice of the origin of the orbit. Let us prove that this is in
fact the case:
Proposition 4.18. Let (ft ) be the ow determined by a (time-independent) Hamiltonian function on R2n and z = 0 such that fT (z) = z for some T > 0. Let
z = ft (z) for some t and denote by ST (z) = DfT (z) and ST (z ) = DfT (z ) the
corresponding monodromy matrices. Let ST (z) and ST (z ) be the homotopy
classes of the paths t St (z) = Dft (z) and t St (z ) = Dft (z ), 0 t T .
We have (ST (z) ) = (ST (z ) ).
Proof. We have proven in Lemma 2.61 that monodromy matrices ST (z) and ST (z )
are conjugate of each other. Since we will need to let t vary we write ST (z ) =
ST (z , t ) so that
ST (z , t ) = St (z )ST (z)St (z )1 .
The paths
t St (z )
112
being homotopic with xed endpoints ST (z , t ) is also the homotopy class of the
path t St (z )St (z)St (z )1 . We thus have, by denition (4.21) of ,
(ST (z , t ) ) =
1
(S (z) )
2 t T
1
2
P x, x Lx, x +
1
2
Qx , x
where
P = DB 1 , L = B 1 , Q = B 1 A
then generates S in the sense that
(x, p) = S(x , p ) p = x W (x, x ) , p = x W (x, x )
(observe that P and Q are symmetric). We have:
(4.28)
113
(4.29)
0
I
and hence
det(SW I) = det(B) det(C (D I)B 1 (A I))
= (1)n det B det(C (D I)B 1 (A I)).
Since S is symplectic we have C DB 1 A = (B T )1 and hence
C (D I)B 1 (A I)) = B 1 A + DB 1 B 1 (B T )1 ;
1
1
P (t)x, x L(t)x, x + Q(t)x , x
2
2
114
1
( (S ) + sign Wxx
) = mP (S ) Inert Wxx
2 P
(4.30)
where Inert Wxx
is the index of concavity corresponding to the endpoint S of the
path t St .
Proof. We will divide the proof in three steps. Step 1. Let L Lag (4n). Using
successively formulae (4.22) and (3.44) we have
(S ) =
1
( (S ) + (S , , L) (S , S L, L)).
2 L
(4.31)
= (P, , P, ) (P, , S P, )
= (P, , S P, )
= P (S )
so that there remains to prove that
(S , , L0 ) (S , S L0 , L0 ) = 2 sign Wxx
.
115
this will complete the proof of the proposition. The condition det(S I) = 0 is
equivalent to S = 0 hence, using Proposition 1.29, the number
(S , , L0 ) = (S , L0 , )
is the signature of the quadratic form Q on L0 dened by
Q(0, p, 0, p) = (PrS , (0, p, 0, p); 0, p, 0, p )
where
PrS ,
(S I)1
=
S(S I)1
(S I)1
S(S I)1
is the projection on S along in R2n R2n . It follows that the quadratic form
Q is given by
Q(0, p, 0, p) = ((I S)1 (0, p ), S(I S)1 (0, p ); 0, p, 0, p)
where we have set p = p p ; by denition of this is
Q(0, p, 0, p) = ((I S)1 (0, p ), (0, p)) + (S(I S)1 (0, p ), (0, p )).
Let now MS be the symplectic Cayley transform (4.14) of S; we have
(I S)1 = JMS + 12 I , S(I S)1 = JMS 12 I
and hence
Q(0, p, 0, p ) = ((JMS + 12 I)(0, p ), (0, p)) + ((JMS 12 I)(0, p ), (0, p ))
= (JMS (0, p ), (0, p)) + (JMS (0, p ), (0, p ))
= (JMS (0, p ), (0, p ))
= MS (0, p ), (0, p ) .
Let us calculate explicitly MS . Writing S in usual block-form we have
0
B
C (D I)B 1 (A I) 0
S I =
,
B 1 (A I)
I DI
I
that is
0
SI =
I
B
DI
0
Wxx
1
B (A I) I
116
1
) (I D)B 1
(Wxx
=
1
B 1 (I A)(Wxx
) (I D)B 1 + B 1
and hence
1
) (I D)B 1 + B 1
B 1 (I A)(Wxx
MS =
1
1
2 I (Wxx ) (I D)B 1
1
2I
1
(Wxx
)
1
1
B (I A)(Wxx
)
1
+ B 1 (I A)(Wxx
)
1
(Wxx )
Remark 4.21. Lemma 4.19 above shows that if S is free then we have
1
arg det(S I) n + arg det B + arg det Wxx
mod 2
n arg det B + arg det Wxx
mod 2.
The reduced Maslov index mP (S ) corresponds to a choice of arg det B modulo 4;
Proposition 4.20 thus justies the following denition of the argument of det(S I)
modulo 4:
1
arg det(S I) n (S ) mod 4.
2
(p + x2 );
2
117
all the orbits are periodic with period 2/. The monodromy matrix is simply the
identity: T = I where
cos t sin t
t =
.
sin t cos t
Let us calculate the corresponding index ( ). The homotopy class of path
t t as t goes from 0 to T = 2/ is just the inverse of , the generator of
1 [Sp(1)] hence ( ) = 2 in view of (4.24). If we had considered r repetitions
of the orbit we would likewise have obtained ( ) = 2r.
Consider next a two-dimensional harmonic oscillator with Hamiltonian function
y 2
x 2
(p + x2 ) +
(p + y 2 );
H=
2 x
2 y
we assume that the frequencies y , x are incommensurate, so that the only periodic orbits are librations along the x and y axes. Let us focus on the orbit x
along the x axis; its prime period is T = 2/x and the corresponding monodromy
matrix is
1
0
0
0
0 cos 0 sin
, = 2 y ;
S1 =
0
0
1
0
x
0 sin 0 cos
it is the endpoint of the symplectic path t
St , 0 t 1, consisting of the
matrices
cos 2t
0
sin 2t
0
0
cos t
0
sin t
.
St =
sin 2t
0
cos 2t
0
0
sin t
0
cos t
In Gutzwillers trace formula [86] the sum is taken over periodic orbits, including
their repetitions; we are thus led to calculate the ConleyZehnder index of the path
t St with 0 t r where the integer r indicates the number of repetitions
of the orbit. Let us calculate the ConleyZehnder index (Sr, ) of this path. We
have St = t St where
cos 2t sin 2t
cos t sin t
t =
, St =
;
sin 2t cos 2t
sin t cos t
in view of the additivity property of the relative Maslov index we thus have
(Sr, ) = (r, ) + (Sr, )
where the rst term is just
(r, ) = 2r
in view of the calculation we made in the one-dimensional case with a dierent
parametrization. Let us next calculate (Sr, ). We will use formula (4.30) relating the index to the Maslov index via the index of concavity, so we begin by
118
1
2r + i Log(e2ir ) +
2
'
( 1
i(2r+)
2r + i Log(e
) + .
2
$ r %
);
$ r %
mP (Sr, ) =
.
(4.33)
of St ; the latter is
W (x, x , t) =
1
((x2 + x2 ) cos t 2xx ),
2 sin t
= tan(t/2). We thus have, taking (4.33) into account,
hence Wxx
'
$ r %
r (
(Sr, ) =
Inert tan
;
119
Part II
Heisenberg Group,
Weyl Calculus,
and Metaplectic
Representation
Chapter 5
Lagrangian Manifolds
and Quantization
Lagrangian manifolds are (immersed) submanifolds of the standard symplectic
space R2n
z whose tangent space at every point is a Lagrangian plane. What makes
Lagrangian manifolds interesting are that they are perfect candidates for the (semiclassical) quantization of integrable Hamiltonian systems. Moreover, every Lagrangian manifold has a phase, which is dened on its universal covering; this
notion will ultimately lead us, using only classical arguments, to the Heisenberg
Weyl operators, which are the rst step towards quantum mechanics. In the next
chapter they will be used to dene the notion of Weyl pseudo-dierential operator.
In his Bulletin review paper1 [177] Weinstein sustains that Everything is
a Lagrangian manifold! . Weinstein has made a point here, because mathematically and physically interesting examples of Lagrangian manifolds abound in the
literature on both classical and quantum mechanics.
also contains an interesting review of the state of symplectic geometry in the beginning
of the 1980s.
124
R
dened
on
a
neighborhood
of
V
. The manifold f (Vn )
tomorphism R2n
z
z
is also Lagrangian.
Proof. It is clear that f (Vn ) is a dierentiable manifold since a symplectomorphism
is a dieomorphism. The tangent plane to f (Vn ) at a point f (z) is Df (z)(z); the
result follows since Df (z) Sp(n) by denition of a symplectomorphism and using the fact that the image of a Lagrangian plane by a symplectic linear mapping
is a Lagrangian plane.
It follows, in particular, that the image by a Lagrangian plane by a symplectic
dieomorphism is a Lagrangian manifold.
A basic (but not generic) example of Lagrangian manifold in (R2n
z , ) is that
of the graph of the gradient of a function:
Proposition 5.4. Let C (U, R) where U is an open subset of Rn .
(i) The graph
Vn = {(x, x (x)) : x U }
is a Lagrangian submanifold of (R2n
z , ).
(ii) The orthogonal projection on X = Rnx is a dieomorphism Vn U , and
thus a global chart.
125
Proof. (i) It suces to notice that an equation for the tangent space (z0 ) to Vn
at z0 = (x0 , p0 ) is
p = [D(x )](x0 )x = [D2 (x0 )]x
where D2 (x0 ) is the Hessian matrix of calculated at x0 . The latter being
symmetric, (z0 ) is a Lagrangian plane in view of Corollary 1.23 in Chapter 1,
Section 1.3.
xI
pI
T
T
xI
pI
pI
,
=
pI
xI
xI
T
x
pI
= I.
pI
xI
Proof. The proof of this result relies on Proposition 1.25 on canonical coordinates
for a Lagrangian plane; since we will not use it in the rest of this chapter we omit its
proof and refer to, for instance Vaisman [168], 3.3, or Mischenko et al. [124].
126
d =
is called a phase of Vn .
The dierential form on Vn is closed: since the manifold Vn is Lagrangian,
we have
d (pdx) = (dp dx) = = 0.
In view of Poincares relative lemma (see Vaisman [168], Weinstein [178]) the
phase is thus always locally dened on Vn ; if Vn is simply connected it is even
globally dened. In the general case one immediately encounters the usual cohomological obstructions for the global existence of a phase. Here is a simple but
typical example:
2 it
127
R2
(cos sin ).
2
(5.1)
The rub in this example comes from the fact that is not dened on the circle
itself, because (+2) = ()R2 = (). There is, however, a way out: we can
view () as dened on the universal covering of S 1 (R), identied with the real
line R , the projection : R S 1 (R) being given by () = (R cos , R sin ).
This trick extends without diculty to the general case as well:
n Vn be the universal covering of the Lagrangian
Theorem 5.9. Let : V
n
manifold V .
Proposition 5.10.
n R such that
(i) There exists a dierentiable function : V
d(
z ) = pdx
if
(
z ) = z = (x, p)
(5.2)
(
z) =
(5.3)
(z0 ,z)
where the last equality follows from the fact that D is a subset of a Lagrangian
manifold. It follows that
pdx =
pdx
(z0 ,z)
(z0 ,z)
128
hence the integral of pdx along (z0 , z) only depends on the homotopy class in Vn
of the path joining z0 to z; it is thus a function of z Vn . There remains to show
that the function
(
z) =
pdx
(5.4)
(z0 ,z)
is such that d(z) = pdx. The property being local, we can assume that Vn is sim n = Vn and write (z) = (
ply connected, so that V
z ). Since Vn is dieomorphic
n
to (z) = Tz V in a neighborhood of z, we can reduce the proof to the case where
Vn is a Lagrangian plane . Let Ax + Bp = 0 (AT B = BAT ) be an equation of ,
and
(z) : t (B T u(t), AT u(t)) , 0 t 1
be a dierentiable curve starting from 0 and ending at z = (B T u(1), AT u(1)).
We have
1
1
T
(z) =
dt =
dt
A u(t), B T u(t)
BAT u(t), u(t)
0
129
Remark 5.12. Gromov has proved in [81] (also see [91]) that if Vn is a closed
Lagrangian
manifold (i.e., compact and without boundary) then we cannot have
*
pdx
=
0
for
all loops in Vn ; to construct the phase of such a manifold we thus
n Vn is a local dieomorphism
the restriction U to U of the projection : V
, U ) is a local chart
U U . It follows that we can always assume that (U
U
n
of V .
Proposition 5.14. Let be the local expression of the phase in any of the local
, U ) such that U Vn = :
charts (U
U
(x) = ((U U )1 (x)).
(5.6)
(5.7)
Proof. Let us rst show that the equation (5.7) remains unchanged if we replace
, U U ) by a chart (U
, U ) such that (U
) = (U
). There exists
(U
U
n
C() =
130
It follows that
x ((U U )1 (x)) = x ((U U )1 (x))
and hence the right-hand side of the identity (5.7) does not depend on the choice
, U U ). Set now (U U )1 (x) = (p(x), x); we have, for
of local chart (U
x U U (U ),
d(x) = d(p(x), x) = p(x)dx
hence (5.7).
Exercise 5.15. For S Sp(n) set (xS , pS ) = S(x, p).
(i) Show that
pS dxS xS dpS = pdx xdp.
(5.8)
(5.9)
dS (
z ) = pS dxS if (
z ) = (x, p).
(5.10)
Show that
131
The interest of the PoincareCartan form comes from the fact that it is a
relative integral invariant. This means that the contraction iX H dH of the exterior
derivative
dH = dp dx dH dt
of H with the suspended Hamilton vector eld
H = (XH , 1) = (Jz H, 1)
X
is zero:
iX H dH = 0.
There actually is a whole family of closely related invariants having the same
property. These invariants are dened, for R, by
()
(5.12)
(1)
dH = dH for every R.
Example 5.17. The case = 1/2. The corresponding form can be formally written as
1
(1/2)
H
= (z, dz) Hdt;
2
we will see in Chapter 10 that this form is particularly convenient in the sense
that it leads to a Schr
odinger equation in phase space where the variables x and
p are placed on similar footing, very much as in Hamiltons equations.
Let us prove the main result of this subsection, namely the relative invariance
()
of the forms H :
()
dH (X
H (z, t), Y (z, t)) = 0
(5.13)
dH = dH dt
132
we have
()
dH (X
H (z, t), Y (z, t)) =
(5.14)
By denition of we have
(XH , Y ) = x H, Yx p H, Yp ;
on the other hand
dH dt = x H(dx dt) + p H(dp dt)
and
(dx dt)(XH , 1; Y, ) = p H Yx ,
(dp dt)(XH , 1; Y, ) = x H Yp ,
so that
(dH dt)(XH , 1; Y, ) = x H, Yx p H, Yp = (XH , Y )
The relative invariance of the forms H has the following important consequence: let : [0, 1] R2n
z Rt be a smooth curve in extended phase space
on which we let the suspended ow ftH act; as time varies, will sweep out a
two-dimensional surface t whose boundary t consists of , ftH (
), and two
arcs of phase-space trajectory, 0 and 1 : 0 is the trajectory of the origin (0) of
, and 1 that of its endpoint (1). We have
()
H = 0.
(5.15)
t
Here is a sketch of the proof (for details, and more on invariant forms in general,
see Libermann and Marle [110]). Using the multi-dimensional Stokes formula we
have
()
()
H =
dH =
dH .
t
133
of terms of the type dH (XH , Y ), which are equal to zero in view of (5.13). We
thus have
dH = 0
t
whence (5.15).
+ H(x, x , t) = 0.
(5.16)
t
The interest of this equation comes from the fact that the knowledge of a suciently general solution yields the solutions of Hamiltons equations for H. (At
rst sight it may seem strange that one replaces a system of ordinary dierential
equations by a non-linear partial dierential equation, but this procedure is often
the only available method; see the examples in Goldstein [53].)
Proposition 5.19. Let = (x, t, ) be a solution of
+ H(x, x , t) = 0
t
(5.17)
(5.18)
(5.19)
134
Proof. We assume n = 1 for notational simplicity; the proof extends to the general
case without diculty. Condition (5.18) implies, in view of the implicit function
theorem, that the equation (x, t, ) = has a unique solution x(t) for each t;
this denes a function t x(t). Inserting x(t) in the formula p = x (x, t, ) we
also get a function t p(t) = x (x(t), t, ). Let us show that t (x(t), p(t))
is a solution of Hamiltons equations for H. Dierentiating the equation (5.17)
with respect to yields, using the chain rule,
H 2
2
+
= 0;
t
p x
(5.20)
(5.21)
H
x = 0,
p
(5.22)
2
2
+
x.
tx
x2
(5.23)
+ p = 0,
x
x2
p x2
hence p = x H since x = p H.
135
Exercise 5.20.
1 2
(i) Let H = 2m
p be the Hamiltonian of a particle with mass m moving freely
along the x-axis. Use (5.24) to nd a complete family of solutions of the
time-dependent HamiltonJacobi equation for H.
1
(ii) Do the same with the harmonic oscillator Hamiltonian H = 2m
(p2 +m2 2 x2 ).
It turns out that for a wide class of physically interesting Hamiltonians the
HamiltonJacobi equation can be explicitly solved using the notion of free generating function dened in Chapter 2, Subsection 2.2.3.
Proposition 5.21. Suppose that there exists such that for 0 < |t| < the mappings
ftH are free symplectomorphisms when dened. The Cauchy problem
(5.25)
has a solution , dened for 0 < |t| < , and given the formula
(x, t) = 0 (x ) + W (x, x ; t)
(5.26)
(5.27)
x,t
x ,0
pdx Hdt
x
x
p
x
x
p
p
p
is the identity as t 0, it follows that dx/dx is dierent from zero in some interval
[, ], > 0, hence x ftH (x , x 0 (x )) is a local dieomorphism for each
t [, ]. Obviously limt0 (x, t) = 0 (x) since x x as t 0, so that the
136
and hence
(x, t + t) (x, t)
=
t
(x, t) = H(x, p, t)
t
since p 0 when t 0. Similarly,
(5.28)
1
(x + x, t) (x, t) = px + px
2
and p 0 as x 0 so that
(x, t) = p.
(5.29)
x
Combining (5.28) and (5.29) shows that satises HamiltonJacobis equation.
Vnt
(
z (t) being the homotopy class in
of a path (zt , z(t))) obviously is a phase of
Vnt when zt = ftH (z0 ) is chosen as base point in Vnt . The following result relates
t to the phase 0 of the initial manifold Vn = Vn0 :
137
n= V
n and z(t) its image in V
n by f H (i.e.,
Lemma 5.22. Let z be a point in V
0
t
t
n
H
z(t) is the homotopy class in Vt of the image by ft of a path representing z). We
have
z(t)
z (t)) (
z) =
t (
H
z
zt
H .
(5.31)
z0
(zt ,z(t))
(z0 ,z)
pdx
(z0 ,z)
pdx
(zt ,z(t))
zt
H = 0
z0
that is to (5.31).
function (, t) : Vt R dened by
z,t
H
(5.33)
(
z , t) = (
z) +
z ,0
(5.34)
n Vn .
where t is the projection V
t
t
Proof. In view of Lemma 5.22 we have
zt ,t
(
z , t) =
H +
z0,0
pdx
(5.35)
(zt ,z)
where H is integrated along the arc of extended phase space trajectory leading
n of a path
from z0 at time t = 0 to zt at time t and (zt , z) is the image in V
t
n
138
n R dened by (5.33),
Denition 5.24. We will call the function (, t) : V
n
(5.35) the Hamiltonian phase of Vt (as opposed to the phase (5.30).
Here is an interesting particular case of the result above:
Exercise 5.25. Let H be a Hamiltonian which is quadratic and homogeneous in the
position and momentum variables; its ow thus consists of symplectic matrices StH .
(i) Show that the Hamiltonian phase of StH (V) is
(
z , t) = (
z ) + 12 (pt , xt p, x)
(5.36)
(5.37)
where E is the (constant) value of H on V and z(t) the homotopy class of the
path (z0 , z) + (z, z(t)) in Vn .
n
(z0 ,z(t))
where (z0 , z(t)) = (z0 , z) + (z, z(t)). The result follows since the rst integral
in the right-hand side of this equality is by denition (
z (t)).
Proposition 5.23 links the notion of phase of a Lagrangian manifold to the
standard HamiltonJacobi theory:
Proposition 5.27. Let z Vn have a neighborhood U in Vn projecting dieomorphically on Rnx .
(i) There exists > 0 such that the local expression = (x, t) of the phase
is dened for |t| < ;
(ii) satises the HamiltonJacobi equation
+ H(x, x ) = 0
t
for x in the projection of U and |t| < .
139
Proof. The rst part (i) is an immediate consequence of Proposition 5.14 (the
existence of follows from the fact that the caustic is a closed subset of Vn ). To
prove (ii) we observe that
(x, t) = (x , 0) +
z,t
z ,0
pdx Hdt
in view of formula (5.33) in Proposition 5.23; now we can parametrize the arc
joining z , 0 to z, t by x and t hence
x,t
(x, t) = (x , 0) +
pdx Hdt
x ,0
F G
F G
x
p
p
j
j
j xj
j=1
(5.38)
(5.39)
, XG = Jz G
140
(5.41)
The terminology comes from the following remark: let t z(t) = (x(t), p(t))
be a solution of Hamiltons equations z = Jz H(z). Then
d
F (z(t)) = x F, x + p F, p
dt
= x F, p H p F, x H
=0
hence the function t F (z(t)) is constant along the motion. In particular
{H, H} = 0 hence the Hamiltonian itself is a constant of the motion: this is the
theorem of conservation of energy for time-independent Hamiltonian systems.
The notion of constant of the motion makes sense for time-dependent Hamiltonians as well: a constant of the motion for a time-dependent Hamiltonian H is
a function F : R2n+1 R which is constant along the curves t (z(t), t) in
141
extended phase space. Notice that if F is dened along t (z(t), t) then, by the
chain rule
F
dF
=
+ {F, H} .
(5.42)
dt
t
Thus F : R2n+1 R is a constant of the motion if and only if it satises
Liouvilles equation
F
+ {F, H} = 0.
(5.43)
t
Since {H, H} = 0 we see that in particular the energy is a constant of the motion
if and only if H/t = 0 that is if and only if H is time-independent.
Exercise 5.31. Let H be the one-dimensional harmonic oscillator Hamiltonian
H=
(t) 2
(p + x2 )
2
and E(t) = H(z(t), t) the energy along a solution t z(t). In view of (5.42) we
have E(t)
= ((t)/(t))E(t),
j (t) 2
(pj + x2j )
H=
2
j=1
and t z(t) is a solution of Hamiltons equations for H. If there exists a real
number k such that (t) = k(t ) ( = (1 , . . . , n )) for some instants t and t ,
then E(t)/j (t) = E(t )/j (t ).
j
j=1
(p2j + x2j )
(5.44)
142
for 1 j n; we assume Ij 0 and the angles j are chosen such that 0 j <
2. In these new variables the Hamiltonian H takes the simple form
K(I) =
j Ij
(5.45)
j=1
(5.46)
for K and then to return to the original variables (x, p) using the inverse change
of variables. The equations (5.46) have the obvious solutions
j (t) = j t + j (0) , Ij (t) = Ij (0).
Inserting these values in (5.44) we get
+
xj (t) = 2Ij (0) cos(j t + j (0)),
+
pj (t) = 2Ij (0) sin(j t + j (0))
for 1 j n.
This example leads us quite naturally to the following general denition:
Denition 5.32. A time-independent Hamiltonian H on R2n
z is completely integrable (for short: integrable) if there exists a symplectomorphism f : (x, p)
(, I) (in general not globally dened) such that the composed function K = H f
only depends explicitly on the action variables I = (I1 , . . . , In ):
H(z) = H(f (, I)) = K(I).
(5.47)
K
(I)
Ij
(5.48)
143
(5.49)
(5.51)
j
j=1
(p2j + x2j )
(5.52)
K=
j Ij .
(5.53)
j=1
144
145
1
pdx m() Z for all one-cycles on Vn
4
(5.54)
146
where m() is the Maslov index. In the physical literature these conditions are often called the EBK (EinsteinBrillouinKeller) quantum conditions, or the Bohr
SommerfeldMaslov conditions. Since the integral of the action form pdx only
depends on the homotopy class of in Vn and the Maslov index is a homotopy
invariant, the same is true of conditions (5.54).
Denition 5.35. When the conditions (5.54) hold, the Lagrangian manifold Vn is
said to be quantized (in the sense of Keller and Maslov)
We will not justify the conditions (5.54) here; we will in fact see in Subsection
5.4.3 that they are equivalent to the existence of waveforms on the Lagrangian
manifold Vn . In Chapter 8 we will derive (5.54) using a topological argument
involving the notion of symplectic capacity.
Remark 5.36. Souriau [156] has proven that if the Lagrangian manifold Vn is
oriented then m() is even. We will re-derive this result in the more general context
of q-oriented Lagrangian manifold in Subsection 5.4.2.
The semiclassical values of the energy corresponding to the KellerMaslov
conditions are obtained as follows: let I = (I1 , . . . , In ) be the action variables corresponding to the basic one-cycles 1 , . . . , n on Vn . These are dened as follows:
let 1 , . . . , n be the loops in Tn (R1 , . . . , Rn ) dened, for 0 t 2, by
1 (t) = R1 (cos t, 0, . . . , 0; sin t, 0, . . . , 0),
2 (t) = R2 (0, cos t, . . . , 0; 0, sin t, . . . , 0),
n (t) = Rn (0, . . . , 0, cos t; 0, . . . , 0, sin t).
The basic one-cycles 1 , . . . , 1 of Vn are then just
1 = f 1 (
1 ), . . . , n = f 1 (
n ).
The action variables being given by
#
1
Ij =
pdx , 1 j n,
2 j
the quantization conditions (5.54) imply that we must have
Ij = (Nj + 14 m( j ))
for 1 j n,
(5.55)
(5.56)
147
Remark 5.37. We do not discuss here the ambiguity that might arise in calculation
of the energy because of the non-uniqueness of the angle action coordinates; that
ambiguity actually disappears if one requires that the system under consideration
is non-degenerate, that is 2 K(I) = 0. (See Arnold [3]), Ch. 10, 52.)
Applying the KellerMaslov quantization conditions (5.54) to a system with
quadratic Hamiltonian, or even an atom placed in a constant magnetic eld, gives
the same energy levels as those predicted by quantum mechanics. However in
general these conditions only lead to approximate values of observable quantities.
This reects the fact that they are semi-classical quantization conditions, obtained
by imposing the quantum selection rule on a Lagrangian torus, which is an object
associated to a classical system. We notice that Giacchetta et al. give in [47] a
geometric quantization scheme for completely integrable Hamiltonian systems in
the action-angle variables around an invariant torus with respect to polarizations
spanned by almost-Hamiltonian vector elds of angle variables; also see these
authors recent book [48].
148
For instance every oriented Lagrangian manifold has exactly two 1-orientations, namely z ((z), +) and z ((z), ) where + and correspond
to the positive or negative orientation of all the tangent planes.
Example 5.39. The circle S 1 is oriented and hence 1-oriented. The same is true of
the n-torus Tn = (S 1 )n . Tn is not q-oriented if q > 1.
If the Lagrangian manifold Vn is simply connected, then it is -orientable
(and hence orientable). The converse is not true: in [64], p. 175177 we have constructed, in relation with the quantization of the plane rotator, an -orientable
Lagrangian manifold which is not simply connected; that manifold is the submanifold of R4x,y,px ,py dened by the equations
x2 + y 2 = r2 , xpy ypx = L0 .
The following result provides us with a very useful test for deciding whether
a given Lagrangian manifold is q-oriented for some integer q (or +). It is also the
key to the understanding of Maslov quantization for other Lagrangian manifolds
than tori.
Theorem 5.40. A connected Lagrangian manifold Vn is q-oriented (1 q < )
if and only if the Maslov index mVn is such that
m() 0 mod 2q
(5.57)
(1 [Vn , z0 ])
149
z U U .
One proves (see for instance Godbillon [51], Ch. X, 5, or Abraham et al. [2], Ch.
7) that (Vn )o is an orientable manifold and that:
Vn is oriented if and only if (Vn )o is trivial, that is (Vn )o = Vn {1, +1};
(Vn )o is connected if and only if Vn is non-orientable.
These properties motivate the following denition:
150
Denition 5.42.
(i) The manifold (Vn )o is called the two-sheeted orientable covering of Vn .
(ii) A de Rham form (or: twisted form) on Vn is a dierential form on (Vn )o .
(iii) A de Rham (or: twisted) volume form is a volume form on (Vn )o .
Locally, in a neighborhood U Vn of z Vn , a de Rham form on Vn can
be written Orient(U ) where Orient(U ) = 1 is an orientation of the set U and
a dierential form on U .
From now on we again assume that Vn is a Lagrangian manifold coming
equipped with a Rham form . In [64], Ch. 5, 5.3, we showed that the pull-back
n Vn can be written outside the caustic
of to the universal covering : V
Vn as
z ) = (1)m(z) (
z ) = (1)m(z) (z)
(5.58)
(
where is a density on Vn and
z ))
m(
z ) = m(P, , (
corresponds to a choice of P, such that Lag (P, ) = P (m being the reduced
ALM index: Denition 3.23, Chapter 3.2, Section 3.3). The mapping
n Lag (n) , z (
z)
() : V
is a lift of the tangent mapping
() : Vn Lag(n) , z (z) = Tz Vn .
Formula (5.58) allows us to dene the square root of the pulled-back de Rham
form outside the caustic by
(
z ) = im(z) (z)
z ) = m(, , (
z )), m being the reduced ALM
with , Lag (n) and m (
index.
Exercise 5.44. Show that the expressions and corresponding to choices
, Lag(n) are related by the formula = im () where
m (z) = m(, , , ) Inert( , , (z)).
151
n of Vn ; to use an older
Waveforms are dened on the universal covering V
n
terminology they are multi-valued on V . They are single-valued if and only if
Vn is a quantized Lagrangian manifold in the sense of Denition 5.35.
Proposition 5.45. The Lagrangian manifold Vn satises the KellerMaslov quantization conditions
#
1
1
(5.59)
pdx m() Z
2
4
n ) if and only if
for every 1 [Vn , z0 ] (z0 an arbitrary point of V
z)
( z) = (
for every , that is, if and only if is dened on Vn .
Proof. By denition of we have
i
( z) = e ( z) im ( z) (z).
On the other hand, using property (3.36) of the reduced ALM index (Proposition
3.24, Subsection 3.3.1) and the fact that ( z) = m() (
z ), we have
m ( z) = m(, , ( z))
= m(, , m() (
z ))
z )) m(),
= m(, , (
hence ( z) = (
z ) is equivalent to the quantization condition (5.59).
i
z,t)
(
m (
z ,t)
(5.60)
+
(ftH ) (z)
and:
(
z , t) is the phase of the Lagrangian manifold ftH (Vn ):
z,t
(
z , t) = (
z )+
pdx Hdt
z ,0
(5.61)
152
m (
z , t) is the integer
z , t) = m(, , St (z) (
z ))
m (
(5.62)
will use for a while the subscript a in place of 0 to avoid confusion with the basepoint of
the Lagrangian manifold.
153
(5.64)
1
2
pa , xa .
(5.65)
Proof. We have, since the energy has constant value (z, za ) along the trajectory,
t
t
p + (s t)pa , xa ds
(z, za )ds
a (z) = (z tza ) +
0
154
In view of formulae (5.60), (5.61), (5.62) the action of the ow (ft0 ) determined
by the translation Hamiltonian (5.63) is given by
i
pa ,xa )
(5.66)
155
(5.67)
(5.68)
R2n
z .
i
T(za )T(zb ) = T(za )(e (pb ,x 2 pb ,xb ) T (zb ))
i
T(zb )T(za ) = e (pb ,x 2 pb ,xb ) e (pa ,xxb 2 pa ,xa ) T (za + zb ).
1
1
2 pa , xa + pb , x xa 2 pb , xb ,
1
1
2 pb , xb + pa , x xb 2 pa , xa ,
i
T(za )T(zb ) = e ( ) T(zb )T(za )
with
= pa + pb , x
1
2
pa + pb , xa + xb .
so that
i
T(za + zb ) = e ( ) T(za )T(zb ).
1
2
pb , xa
1
2
pa , xb = 21 (za , zb ),
156
=
(
z , za )
t
(5.69)
(
z , za ) = ix , xa pa , x .
(5.70)
(5.71)
one easily checks by a direct calculation that this solution is explicitly given by
the formula
(5.72)
(x, t) = exp i (tpa , x 12 t2 pa , xa ) 0 (x txa )
which we can as well rewrite as
(x, t) = T(tza )(z).
(5.73)
We will come back to this relationship between the HeisenbergWeyl operators in the next chapter, where we will begin to let them act in conformity with
tradition! on functions dened on position space Rnx , even though the denition
we have given (and which is not standard) shows that their true vocation is to act
on phase space objects (functions, or waveforms, for instance). This fact will be
exploited later in this book to construct a quantum mechanics in phase space.
(5.74)
157
Each of the operators T&(z0 ) can be obtained from the reection operator
T&(0) using conjugation by HeisenbergWeyl operators:
Proposition 5.52. The intertwining relation
T&(z0 ) = T(z0 )T&(0)T(z0 )1
(5.75)
2i
The fact that the product of two reections is a translation has its counterpart
in terms of the GrossmannRoyer operators:
Proposition 5.53. The GrossmannRoyer operators satisfy the product formula
2i
T&(za )T&(zb ) = e (za ,zb ) T(2(za zb ))
(5.76)
&
for all za , zb R2n
z ; in particular T (z) is an involution.
Proof. We have
$ 2i
%
T&(za )T&(zb )(x) = T&(za ) e pb ,2xa xxb (2xb x)
2i
2i
= e (x 2(xa xb ))
with
= 2((pa pb )x pa xa pb xb + 2pb xa ).
On the other hand
i
T(2(za zb ))(x) = e (x 2(xa xb ))
with
Exercise 5.54. Express the product T&(za )T&(zb )T&(zc ) in terms of the Heisenberg
Weyl operator T(za zb + zc ).
Let us now proceed to the next chapter, where the study of the Heisenberg
Weyl operators is taken up from an algebraic point of view.
Chapter 6
Heisenberg Group
and Weyl Operators
The algebraic approach to the Heisenberg group we outline in the rst section
goes back to Weyls work [179] on the applications of group theory to quantum
mechanics, a precursor of which is Heisenbergs matrix mechanics,1 which was
rigorously presented by Born and Jordan in [13]. It is remarkable that such an
ad hoc inductive argument has led to one of the most important developments in
mathematical physics, with ramications in many areas of pure mathematics. For
explicit applications of the Heisenberg group to various problems in information
theory see Binz et al. [9, 10], Schempp [140, 141]. Gaveau et al. [45] study the
symbolic calculus on the Heisenberg group; also see the books by Folland [42] or
Stein [158] where interesting material can be found.
The second part of this chapter is devoted to the denition and study of the
notion of Weyl pseudo-dierential calculus. Due to practical limitations we do not
pretend to give a full account of the Weyl pseudo-dierential calculus; the interested reader is referred to the (vast) literature on the subject. Some references for
Weyl calculus I have in mind are Leray [107], Tr`eves [164], Wong [181]. For recent
advances and a new point of view see Unterbergers book [167]. For modern aspects
of pseudo-dierential calculus in general, including a thorough study of the fascinating topic of pseudo-dierential operators on manifolds with conical singularities
the reader is referred to Schulze [144, 145, 146] and Egorov and Schulze [36].
1 It was during a time dubbed Hexenmathematik which is German for witch mathematics because of its non-commutativity, which was a quite unfamiliar feature for physicists at
that time.
160
(6.1)
we choose matrices
X1 , X2 , . . . , Xn ; P1 , P2 , . . . , Pn
satisfying the canonical commutation relations
[Xi , Xj ] = 0 , [Pi , Pj ] = 0 , [Xi , Pj ] = iij I.
(6.2)
X j = [Xj , H]
It turns out that the relations (6.2), viewed as commutation relations for
operators, lead to the denition of a Lie algebra, which is called the Heisenberg
algebra in the literature. We will see that it is the Lie algebra of a group closely
related to the HeisenbergWeyl operators dened at the end of the last chapter.
161
1 , . . . , X
n ;
Denition 6.1. A Heisenberg algebra is a Lie algebra hn with a basis {X
2
P1 , . . . , Pn ; T} satisfying the canonical commutation relations
i , X
j ] = 0 , [Pi , Pj ] = 0 , [X
i , Pj ] = ij T
[X
i , T] = 0 , [Pi , T] = 0
[X
(6.3)
i + pi Pi + tT , U
=
xi X
i=1
i + pi Pi + t T
xi X
i=1
(6.4)
(6.5)
(6.6)
Clearly (0, 0) is a unit for the composition law , and each (z, t) is invertible with
inverse (z, t). One veries that the law is associative, so it denes a group
2n
structure on R2n
z Rt , identifying Hn with the set Rz Rt .
2 Often
162
(6.7)
It is immediate to check that Hn is the universal covering group of this exponenof Hn ; since the covering groups of a Lie group all have the
tiated version Hexp
n
same Lie algebra as the group itself, the Lie algebra of Hexp
indeed is hn .
n
There is a useful identication of Hn with a subgroup Hpol
n of GL(2n + 2, R).
That group (the polarized Heisenberg group) consists of all (2n + 2) (2n + 2)
upper-triangular matrices of the type
1 p1 pn t
0 1 0 x1
..
..
..
M (z, t) = ... ...
.
.
.
0 0 1 xn
0 0 0
1
(the entries of the principal diagonal are all equal to 1); we will write these matrices
for short as
1 pT t
M (z, t) = 0 1 x .
0 0 1
A simple calculation shows that the determinant of M (z, t) is 1 and that its inverse is
1 pT t + p, x
;
1
x
M (z, t)1 = 0
0
0
1
moreover
M (z, t)M (z , t ) = M (z + z , t + t + p, x ).
(6.8)
Exercise 6.3.
(i) Show that the mapping : Hpol
n Hn dened by
(M (z, t)) = (z, t
1
2
p, x)
is a group isomorphism.
3 It
is sometimes also called the Weyl group, especially in the physical literature.
(6.9)
163
pol
(ii) Show that the Lie algebra hpol
n of Hn consists of all matrices
0 pT t 12 p, x
.
x
X pol (z, t) = 0 0
0 0
0
(6.10)
(6.11)
164
We will need the following classical result from harmonic analysis (see, e.g.,
Stein and Weiss [159], Chapter 1, 3):
Lemma 6.4. Let A be a bounded operator L2 (Rnx ) L2 (Rnx ). If A commutes with
the translations x x + x0 , then there exists a function a L (Rnx ) such that
F (A) = aF for all S(Rnx ).
Proof. We will give the proof of this property in the special case where A is an
integral operator
A(x) = K(x, y)(y)dn y
with kernel K L1 (Rnx Rnx ) satisfying
sup |K(x, y)| dx C , sup |K(x, y)| dx C.
y
We have
A(x)
(F 1 K)(x, )F ()dn
where F 1 is the inverse Fourier transform acting on the second variable. We can
rewrite this formula as
1
n/2
eix, a(x, )F ()dn
A(x) = 2
where the symbol a is given by
a(x, ) = (2)n/2 eix, F 1 K(x, ),
that is
a(x, ) =
165
Let us now state and prove the main result of this subsection, namely the
existence of an irreducible representation of the Heisenberg group Hn compatible
(in a sense we will make precise) with the Schrodinger equation. (For dierent
approaches see Wallach [175], or Folland [42]; the latter gives a proof related to
the Stonevon Neumann theorem which we will discuss below); there are also
relations with Kirillov theory [175] which we do not discuss here because of lack
of space.
Theorem 6.5. The mapping T(z0 , t0 ) : Hn U(L2 (Rnx )) dened by
i
T(z0 , t0 )(x) = e t0 T(z0 )(x)
(6.12)
166
A natural question that arises at this stage is whether there are other irreducible unitary representations of Hn in the Hilbert space L2 (Rnx ). A famous
result, the Stonevon Neumann theorem, asserts that the Schr
odinger representation of it is, up to unitary equivalences, the only irreducible representation of Hn
in that Hilbert space (we are emphasizing the word that because the Stonevon
Neumann theorem does not prevent us from constructing non-trivial irreducible
representations of Hn in other Hilbert spaces than L2 (Rnx ). We will return to this
important question in Chapter 10 when we discuss the Schr
odinger equation in
phase space; we will see that it is perfectly possible to construct (innitely many)
representations of the Heisenberg group on closed subspaces of L2 (R2n
z ).
167
(6.14)
2
2n
Since F extends into a (unitary) operator L2 (R2n
z ) L (Rz ) and, by duality,
2n
2n
into an operator S (Rz ) S (Rz ) so does F . Summarizing:
(6.15)
Exercise 6.7. Prove the formula above using the relation (6.14) and the properties
of the ordinary Fourier transform. The following symplectic covariance property
generalizes formula (6.14):
Proposition 6.8. For every S Sp(n) we have
F (f S) = (F f ) S for every S Sp(n).
Proof. Since (Sz, z ) = (z, S 1 z ) we have
i
1
n
1
e (z,S z ) f (z )d2n z
F f (Sz) = 2
(6.16)
168
that is, performing the change of variable z = Sz and taking into account the
fact that the determinant of a symplectic matrix is 1,
i
1
n
e (z,z ) f (Sz )| det S|d2n z
F f (Sz) = 2
i
1
n
= 2
e (z,z ) f (Sz )d2n z ,
hence (6.16).
Let us now dene the Weyl operator associated to a symbol. We will see
below that our denition just represents the Fourier decomposition of a pseudodierential operator of a particular type.
Denition 6.9. Let a S(R2n
z ).
(i) The Weyl operator associated to the symbol a is the operator
: S(Rn ) S(Rn )
A
x
x
dened by
A(x)
=
1 n
2
(6.17)
(6.18)
i.e., an
symbol
(6.19)
(6.20)
169
(x) =
=
hence a (z) = (2)n (z) as claimed. Since on the other hand we have, in view of
the inversion formula (6.15),
i
a(z) = F a (z) = e (z,z ) (z )d2n z = 1,
this proves the proposition.
170
(6.22)
where the integral should be interpreted, for xed x, as the distributional bracket
A(x)
= KA (x, ), () .
The following theorem shows that Weyl operators are just pseudo-dierential
operators of a special kind:
Weyl
Theorem 6.12. Let a S(R2n
z ) and A a.
is given by
(i) The kernel of A
KA (x, y) =
and hence
A(x)
=
1 n
2
for S(Rnx ).
(ii) Conversely
a(x, p) =
1 n
2
1 n
2
1 n
2
(6.23)
(6.24)
(6.25)
171
We thus have
A(x)
=
where the kernel KA is given by
KA (x, y) =
1 n
2
(6.26)
Part (i) of the theorem above shows that Weyl operators can be expressed
in an amazingly simple way in terms of the GrossmannRoyer operators
2i
T&(z0 )(x) = e p0 ,xx0 (2x0 x)
(6.27)
where the integral is interpreted in the sense of Bochner; in particular we have the
reproducing property
1
n
T&(z)d2n z = I
(6.28)
(I the identity operator).
172
1 n
2i
Notice that if we interpret formula (6.24) in the distributional sense we recover the fact that the identity operator has symbol a = 1. In fact it suces to
observe that in view of the Fourier inversion formula,
1
n
i
(6.29)
e p,xy (y)dn ydn p = (x).
2
In fact, Schwartzs kernel theorem implies:
: S(Rnx ) S (Rnx ) there exists
Corollary 6.14. For every continuous operator A
2n
is given by formula (6.23) interpreted in the
a S (Rz ) such that the kernel of A
distributional sense:
- i
.
KA (x, y) = e ,xy , a( 12 (x + y), ) .
(6.30)
Proof. Formulae (6.23) and (6.25) in Theorem 6.12 show that the linear mapping
is an automorphism S(Rnx ) S(Rnx ). It is easy to verify that this automorphism
is continuous and hence extends to an automorphism S (Rnx ) S (Rnx ). In view
: S(Rnx ) S (Rnx ) can
of Schwartzs kernel theorem every continuous operator A
be written as
A(x)
= KA (x, ), ()
where KA S (R2n
z ). The corollary follows.
(6.31)
173
is dened by
of A
Proof. (i) The adjoint A
L2 (Rn )
, )L2 (Rn ) = (, A)
(A
x
x
for , S(Rnx ); this means that
n
n
n
KA (x, y)(y)d y (x)d x =
KA (y, x)(x)d x (y)dn y
which is equivalent to the condition
KA
(y, x).
(x, y) = KA
(6.32)
174
1 n
2
(6.34)
1 n
2
/
One can work with oscillatory integrals very much as with ordinary integrals provided that some care is taken in their evaluation (for example, one should not in
general attempt to apply Fubinis rule to them!).
175
is given by
Proof. The kernel of A
KA (x, y) =
1 n/2
2
F2 ( 12 (x + y), y x)
1 n
2
1 n
2
1 n
2
|F a(, y x)|dn dn x,
|F a(, y x)|dn dn y.
Setting = y x we have
n
n
|F a(, y x)|d d x = |F a(, )|dn dn ,
176
that is
2
dx C
|A(x)|
|KA (x, y)|d x |(y)|2 dn xdn y
2
dx C 2
|A(x)|
|(y)|2 dn y
which is precisely (6.36). The last statement of the lemma immediately follows
and the density of S(Rnx ) in L2 (Rnx ).
from the continuity of A
Exercise 6.20. By modifying in an adequate way the proof above, show that the
conclusions of Proposition 6.19 remain true if one replaces the condition F a
1
2n
L1 (R2n
z ) by a L (Rz ).
Proposition 6.19 implies the following rather general L2 -continuity result:
1
2n
Corollary 6.21. If a C n+1 (R2n
z ) and z a L (Rz ) for every multi-index
Weyl
a is a bounded operator on L2 (Rn ).
N2n such that || n + 1, then A
x
0n
2
2
where m
z is the mth power of the generalized Laplacian z =
j=1 xj + pj . It
follows that we have the estimate
2 m
2n
(1 + |z| ) |F a(z)| |(1 2 m
z )a(z)|d z <
if 2m 2(n + 1), and we can then nd C > 0 such that
|F a(z)| C(1 + |z|2 )n1
which implies that F a L1 (R2n
z ).
177
In most cases of interest one cannot expect L2 -boundedness for a Weyl op it is usually only dened on some domain D L2 (Rn ). An excellent
erator A;
x
A
substitute for Proposition 6.19 (and its corollary) is then to look for conditions on
: S(Rnx ) S(Rnx ). We are going to prove below
the symbol that ensure that A
that it is actually sucient to assume that the symbol a and its derivatives are
growing, for |z| , slower than some polynomial to ensure S(Rnx ) continuity.
Let us dene this concept precisely:
Denition 6.22. We will call a function a C (R2n
z ) a polynomially bounded
symbol if there exists real numbers m R, > 0, such that
1
(6.37)
f (p) =
we have, for 1 j n,
(6.39)
(6.40)
0
where y = nj=1 y2j . Using Leibnizs rule it is easily seen that the conditions
(6.38) imply that the function y a( 12 (x + y), p)(y) and all its derivatives
vanish at innity, hence, performing partial integrations in (6.40):
n
1
i
2N
2N
e p,xy N
|p| f (p) = (i)
y a( 2 (x + y), p)(y) d y
178
1
n
|N
y a( 2 (x + y), p)(y) |d y.
1
Using once again Leibnizs rule, N
y a( 2 (x + y), p)(y) is a linear combination
of terms of the type
g (z) = y a( 12 (x + y), p)y (y) , || + || = N
and each of these terms satises, by (6.38), an estimate
1
(1 + |z|2 ) 2 (m+N )
CN
and hence |f (p)| = O(|p| ) and the integral in p in (6.38) is thus absolutely
S(Rn ) when S(Rn ) (the proof of the
convergent. Let us now show that A
x
x
continuity easily follows: see the exercise below). For any multi-indices , in Nn
is, by Leibnizs rule, a linear combination with complex
the function x x A(x)
coecients of terms of the type
i
F, (x) = p
x e p,xy x a( 12 (x + y), p)(y)dn y dn p
(6.41)
with ||+|| = | |+| |; since x x a is polynomially bounded with the same value
of as a, it follows that the integral in y is again O(|p| ) so that |F, (x)| < .
is a bounded function for every pair ( , ) of multi-indices,
Hence x x A(x)
n
S(R ) as claimed.
and A
x
on
Exercise 6.24. Finish the proof of Theorem 6.23 by showing the continuity of A
i
n
i p,xy
||
p,xy
S(Rx ). [Hint: notice that p e
= (i) y (e
) and use thereafter
partial integrations in y in (6.41) to rewrite F, (x) in a convenient way making
terms x (x ) appear in the integrand. Conclude using Leibnizs rule.]
179
1 n
2
1 n
2
/
/
Since each of these operators sends S(Rnx ) into itself we may compose them. The
B
is itself a
natural question which immediately arises is whether the product A
Weyl operator, and if it is the case, what is then its Weyl symbol? A rst remark
is that:
Lemma 6.25. Assume that A and B are operators with kernels KA and KB be2n
longing to S(R2n
x,y ). Then the kernel KC of C = AB is in S(Rx,y ) and is given by
the formula
KC (x, y) =
(6.42)
|KC (x, y)|2 dn xdn y = ||KA ||L2 (R2n
||KB ||L2 (R2n
< .
x,y )
x,y )
Let us next show that KC (x, y) given by (6.42) indeed is the kernel of AB. We
have, for S(Rnx ),
AB(x) = KA (x, y)B(y)dn y
= KA (x, y)
KB (y, z)(z)dn z dn y;
180
since the integrands are rapidly decreasing we may apply Fubinis theorem, which
yields
AB(x) =
and B
are in S(R2n ). In view of
Proof. Assume that the Weyl symbols a, b of A
z
B,
and formula (6.23) expressing the kernel in
formula (6.42) for the kernel of A
terms of the symbol, we have
KA B (x, y) =
1 2n
2
181
and hence
1
2n
c(z) =
2i
Remark 6.28. Expanding the integrand in formula (6.43) and using repeated integrations by parts one nds that one can formally write
i
c(z) = a(z)e 2
(z)
(6.44)
where L = zT J z , the arrows indicating the direction in which the partial derivatives act5 . This notation is commonly used in physics.
Remark 6.29. If we view as a variable parameter, then the symbol c of the
composed operator will generally depend on , even if a and b do not.
We now assume that the Weyl operators
= 1 n b (z)T(z)d2n z
= 1 n a (z)T(z)d2n z , B
A
2
2
can be composed (this is the case for instance if a and b are in S(R2n
z )) and set
=A
B.
Assuming that we can write
C
1
n
C = 2
c (z)T(z)d2n z
we ask: what is c ? The answer is given by the following theorem:
=A
B
is the funcTheorem 6.30. The twisted symbol of the composed operator C
tion c , dened by
1
n
i
c (z) = 2
(6.45)
e 2 (z,z ) a (z z )b (z )d2n z
or, equivalently,
c (z) =
1 n
2
e 2 (z,z ) a (z )b (z z )d2n z .
Proof. Writing
=
A
5 Littlejohn
1 n
2
=
a (z0 )T(z0 )d2n z0 , B
1 n
2
(6.46)
182
1 2n
2
1 n
2
1 n
2
i
e 2 (z0 ,z1 ) a (z0 )b (z1 )T(z0 + z1 )d2n z0 d2n z1 .
(6.48)
183
= A
B
is
Corollary 6.33. The twisted Weyl symbol of the composed operator C
(when dened) the function
1
2n
a b .
(6.49)
c = 2
Exercise 6.34. Let f, g S(R2n
z ). Show that:
F (a 2 b) = (F a) 2 b = a 2 F b,
F a 2 F b = a 2 b
(6.50)
(6.51)
(6.52)
184
(6.53)
The problem of dequantization can now, within our framework, be stated as fol : S(R2n ) S (R2n ) how can we nd a satisfying
lows: given an operator A
z
z
(6.53)? We are going to answer this question; let us rst introduce the following
is an operator with kernel K S(Rnx Rny ) we call the real
terminology: if A
number
= K(x, x)dn x
Tr A
(We will study in detail the notion of trace, and that of trace-class
the trace of A.
operator, in Chapter 9.)
Weyl
&
&
Theorem 6.35. Let a S(R2n
z ) and A a. The trace of AT (z) and T (z)A are
dened in the sense above, and:
(i) We have
T&(z)) = 2n Tr(T&(z)A);
a(z) = 2n Tr(A
(6.54)
be a bounded operator. The mapping
(ii) Let A
S(R2n
z ) C , b (2) Tr(AB)
n
(6.55)
1 n
2
2i
p0 ,x0 x
2i
Integrating in x yields
1
n
2i
2i
K(x, x)dn x = 2
e pp0 ,x dn x e p0 p,x0 a(x0 , p)dn p
2i
= 2n (p p0 )e p0 p,x0 a(x0 , p)dn p
= 2n a(x0 , p0 )
which proves the rst equality (6.54); the second is proven in the same way.
185
1 n
1 n
Tr
T&(z)d2n z
b(z)A
T&(z))d2n z
b(z) Tr(A
(see exercise below for the justication of the second equality). Taking into account
(6.54) this is
B)
= 1 n b(z)a(z)d2n z = 1 n a, b .
Tr(A
2
2
follows by continuity.
The case of a general bounded A
% )
$)
T&(z))d2n z.
T&(z)d2n z = b(z) Tr(A
Exercise 6.36. Justify the fact that Tr b(z)A
[Hint: pass to the kernels; alternatively use Proposition 9.19 in Chapter 9.
186
(6.58)
187
Proposition 6.39. The WignerMoyal transform of a pair (, ) of functions belonging to S(Rnx ) is given by
1
n
(T&(z), )L2 (Rnx )
(6.59)
W (, )(z) =
where T&(z) is the GrossmannRoyer operator associated to z; hence in particular:
1
n
W (z) =
(T&(z), )L2 (Rnx ) .
(6.60)
Proof. We have
(T&(z0 ), )L2 (Rnx ) =
2i
= () W (, )(z0 )
proving (6.59).
Note that the WignerMoyal transform obviously exists for larger classes of
functions than those which are rapidly decreasing; in fact W (, ) is dened for
all , L2 (Rnx ), as follows from the CauchySchwarz inequality
1
n
||||L2 (Rnx ) ||||L2 (Rnx )
|W (, )(z)|
which immediately follows from formula (6.59). More generally, it can be extended
to pairs of tempered distributions; to prove this we will establish an important
formula, called the Moyal identity in the mathematical and physical literature:
Proposition 6.40. The WignerMoyal transform is a bilinear mapping
W : S(Rnx ) S(Rnx ) S(Rnx )
satisfying the Moyal identity
=
(W (, ), W ( , ))L2 (R2n
z )
1 n
2
(6.61)
188
we have
i
A=
e p,yy (x + 12 y) (x + 12 y )
(x 12 y) (x 12 y )dn ydn y dn xdn p,
the integral in p is (2)n (y y ) and hence
n
A = (2)
(x + 12 y) (x 12 y)(x + 12 y) (x 12 y)dn ydn y dn x.
Setting u = x + 12 y and v = x 12 y we get
n
n
A = (2)
(u) (u)d u (v) (v)dn v,
whence (6.61).
For further use, let us compute the symplectic Fourier transform
W (, ) = F W (, )
W (, )(z) = 2n W (, )( 12 z).
(6.63)
(6.64)
1 2n
2
189
(6.65)
and hence
W (T(z0 )) =
1 n
2
(6.67)
190
1 n
2
W (z)d x =
= F (p)F (p)
It immediately follows from any of the two formulae (6.67) above that
W (z)dn z = ||||2L2 (Rnx ) = ||F ||2L2 (Rnp ) .
(6.68)
If the function is normalized: ||||2L2 (Rn ) = 1, then so is W (z):
191
Weyl
we have, for all , S(Rn ),
Proposition 6.45. Let a A;
x
)L2 (Rn ) =
(A,
x
and, in particular
)L2 (Rn ) =
(A,
x
Proof. Let us write
W (, )(z)a(z)d2n z
(6.69)
W (z)a(z)d2n z.
(6.70)
W =
W (, )a(z)d2n z.
By denition of W (, ) we have
W =
1 n
2
1 n
2
)L2 (Rn ) .
that is W = (A,
x
Proof. It is an immediate consequence of formula (6.70) taking into account denition (8.1) of the mathematical expectation value.
This result is very important, both mathematically, and from the point of
view of the statistical interpretation of quantum mechanics. It shows again that the
vocation of the Wigner transform is probabilistic, since formula (6.71) expresses
in a given normalized state by averaging its
the expectation value of the operator A
symbol against) the Wigner transform
of that state. This result, and the fact that
)
the quantities W (z)dn p and W (z)dn x indeed are probability densities have
led to metaphysical discussions about negative probabilities; see for instance
Feynmans contribution in [88].
192
1 n
2
||||2L2 (Rnx ) .
(6.73)
(6.74)
1
(a)
(6.75)
193
2i
(6.77)
W (a, p)dn p = 0.
1 n
2
shows that the function p (2)n/2 W (x, p) is the Fourier transform of the
function y (x + 12 y)(x 12 y); it follows that
(x + 12 y)(x 12 y) =
2i
1
(a)
then the right-hand side must be independent of the choice of a and hence
2i
Chapter 7
196
(7.1)
, Q = Q , and det L = 0
T
(7.2)
(7.4)
mod 2.
(7.5)
(7.6)
where
= 2m n.
(7.7)
197
Denition 7.1.
(i) The operator SW,m is called a quadratic Fourier transform associated to
the free symplectic matrix SW .
(ii) The class modulo 4 of the integer m is called the Maslov index of SW,m .
The quadratic Fourier transform corresponding to the choices SW = J and
m = 0 is denoted by J.
The generating function of J being simply W (x, x ) = x, x , it follows that
1
n/2
Jf (x) = 2i
eix,x f (x )dn x = in/2 F f (x)
(7.8)
for f S(Rnx ); F is the usual Fourier transform. It follows from the Fourier
inversion formula that the inverse J1 of J is given by the formula
i
n/2
1
J f (x) = 2
eix,x f (x )dx = in/2 F 1 f (x).
Note that the identity operator cannot be represented by an operator SW,m
since it is not a free symplectic matrix.
Of course, if m is one choice of Maslov index, then m + 2 is another equally
good choice: to each function W formula (7.3) associates not one but two operators
SW,m and SW,m+2 = SW,m (this reects the fact that the operators SW,m are
elements of the two-fold covering group of Sp(n)).
,L,m by
Let us dene operators VP and M
i
,L,m f (x) = im
VP f (x) = e 2 P x,x f (x) , M
| det L|f (Lx).
(7.9)
(7.10)
(ii) The operators SW,m extend to unitary operators L2 (Rnx ) L2 (Rnx ) and the
inverse of SW,m is
1
SW,m
= SW ,m with W (x, x ) = W (x , x) , m = n m.
1 n/2
2i
(7.11)
198
n/2
i
2
1
,L1 ,m VP
SW,m
= VQ J1 M
i n/2 m
i
| det L1 |
2
1 n/2 m+n
i
| det L|
2i
eiL
x,x
f (x )dn x
(x),
,LT ,nm Jf
=M
the inversion formulas (7.11) follow.
It follows from the proposition above that the operators SW,m form a subset, closed under the operation of inversion, of the group U(L2 (Rnx )) of unitary
operators acting on L2 (Rnx ). They thus generate a subgroup of that group.
Denition 7.3. The subgroup of U(L2 (Rnx )) generated by the quadratic Fourier
transforms SW,m is called the metaplectic group Mp(n). The elements of Mp(n)
are called metaplectic operators.
Every S Mp(n) is thus, by denition, a product SW1 ,m1 SWk ,mk of
metaplectic operators associated to free symplectic matrices.
We will use the following result which considerably simplies many arguments
(cf. Proposition 2.36, Subsection 2.2.3, Chapter 2):
Lemma 7.4. Every S Mp(n) can be written as a product of exactly two quadratic
Fourier transforms: S = SW,m SW ,m . (Such a factorization is, however, never
unique: for instance I = SW,m SW ,m for every generating function W .)
We will not prove this here, and refer to Leray [107], Ch. 1, or de Gosson [61].
Exercise 7.5. Show that quadratic Fourier transform SW,m cannot be a local oper ) Supp(f )
ator (a local operator on S (Rnx ) is an operator S such that Supp(Sf
n
for f S (Rx )).
199
2n
We will denote the elements of the dual (R2n
z ) of Rz by a, b, etc. Thus
a(z) = a(x, p) is the value of the linear form a at the point z = (x, p).
To every a we associate a rst order linear partial dierential operator A
obtained by formally replacing p in a(x, p) by Dx :
A = a(x, Dx ) , Dx = ix ;
thus, if
a(x, p) = , x + , p
for = (1 , . . . , n ), = (1 , . . . , n ) in Rn , then
A = , x + , Dx = , x i , x .
(7.12)
Obviously the sum of two operators of the type above is an operator of the same
type, and so is the product of such an operator by a scalar. It follows that these
operators form a 2n-dimensional vector space, which we denote by Di (1) (n).
(1)
2n
The vector spaces R2n
(n) are isomorphic since they all
z , (Rz ) and Di
have the same dimension 2n. The following result explicitly describes three canonical isomorphisms between these spaces:
Lemma 7.6.
(i) The linear mappings
2n
, 1 : z0 a,
1 : R2n
z (Rz )
(1)
(n) , 2 : a A
2 : (R2n
z ) Di
(7.13)
200
(1)
2n
Proof. (i) The vector spaces R2n
(n) having the same dimenz , (Rz ) , and Di
sion, it suces to show that ker(1 ) and ker(2 ) are zero. Now, 1 (z0 ) = 0 is
equivalent to the condition (z, z0 ) = 0 for all z, and hence to z0 = 0 since a
symplectic form non-degenerate. If 2 (a) = 0 then
and hence
[(z1 ), (z2 )] = i(x1 , p2 x2 , p1 )
We are next going to show that the metaplectic group Mp(n) acts by conjugation on Di (1) (n). This will allow us to explicitly construct a covering mapping
Mp(n) Sp(n).
(1)
Lemma 7.7. For z0 = (x0 , p0 ) R2n
(n) by
z dene A Di
(7.14)
(7.15)
(7.16)
and
201
,L,m x0 , Dx (M
,L,m )1 f (x) = x0 (L1 )T Dx f (x).
M
which is (7.16).
M
,L,m , VP .
Property (ii) immediately follows since S is a product of operators J,
(iii) The mapping S is trivially a linear mapping Di (1) (n) Di (1) (n). If
S1 Di (1) (n), then we have also A = S1 B S Di (1) (n) since
B = SA
A = S1 B(S1 )1 . It follows that S is surjective and hence bijective.
M
,L,m , VP generate Mp(n) the lemma above shows
Since the operators J,
that for every S Mp(n) there exists a linear automorphism S of R2n
z such that
S (A) = a S, that is
S ((z0 )) = (Sz0 ).
(7.17)
Let us show that the automorphism S preserves the symplectic form. For z, z
R2n
z we have, in view of (7.13),
(Sz, Sz ) = i [(Sz), (Sz )]
= i S (z), S (z )
%
$
1
)S
= i S(z)
S1 , S(z
= iS [(z), (z )] S1
= (z, z ),
hence S Sp(n) as claimed.
This result allows us to dene a natural projection
Mp : Mp(n) Sp(n) , Mp : S S;
it is the mapping which to S Mp(n) associates the element S Sp(n) dened
by (7.17), that is
(7.18)
S = 1 S .
That this mapping is a covering mapping follows from:
202
Proposition 7.8.
(i) The mapping Mp is a continuous group epimorphism of Mp(n) onto Sp(n)
such that:
= J , Mp (M
,L,m ) = ML , Mp (VP ) = VP
Mp (J)
and hence
Mp (SW,m ) = SW .
(7.19)
(7.20)
hence (7.19). Formula (7.20) follows since every quadratic Fourier transform SW,m
can be factorized as
,L,m JVQ
SW,m = VP M
in view of Proposition 7.2 above. To establish the continuity of the mapping Mp
(1)
we rst remark that the isomorphism : R2n
(n) dened in Lemma 7.6
z Di
is trivially continuous, and so is its inverse. Since S S = S S it suces to show
that for every A Di (1) (n), fS (A) has A as limit when S I in Mp(n). Now,
Mp(n) is a group of continuous automorphisms of S(Rn ) hence, when S I, then
S1 f f .
S1 f f for every f S(Rn ), that is AS1 f Af and also SA
S1 = A for every A Di (1) (n) and
(ii) Suppose that 1 S = I. Then SA
this is only possible if S is multiplication by a constant c with |c| = 1 (see Exercise
below); thus ker( Mp ) S 1 . In view of Lemma 7.4 we have S = SW,m SW ,m
for some choice of (W, m) and (W , m ) hence the condition S ker( Mp ) is
equivalent to
SW ,m = c(SW,m )1 = cSW ,m
which is only possible if c = 1, hence S = I as claimed.
203
Mp(n) Mp(n)
Mp
Mp
2
2
Sp(n)
Sp(n)
(7.21)
204
Suppose for instance that S = SW,m ; it is easily checked, using the fact that
W is homogeneous of degree 2 in (x, x ), that
1
n/2
i
SW,m
f (x) = 2i
(W ) e W (x,x ) f (x ) dn x
or, equivalently,
SW,m
= n/2 SW/,m .
Also,
)1 f (x) =
(SW,m
n/2
i
2
(W )
eW
(x,x )
f (x ) dn x .
The projection of SW,m
on Sp(n) is the free matrix SW :
Mp (SW,m
) = SW .
(7.22)
Exercise 7.10. Show that if and are two positive numbers, then we have
SW,m
= M / SW,m
M/
(i.e., Mp(n) and Mp (n) are equivalent representations of the metaplectic group).
Remark 7.11. The metaplectic group Mp(n) is not an irreducible representation
of the double covering group Sp2 (n) (see Folland [42], p. 194, for a proof).
In what follows we will use the following convention, notation, and terminology which is consistent with quantum mechanics:
Notation 7.12. The projection Mp(n) Sp(n) will always assumed to be the
homomorphism
Mp : Mp(n) Sp(n)
and we will drop all the superscripts referring to : we will write Mp for Mp ,
SW,m for SW,m
and S for S . The functions on which these -dependent metaplectic operators are applied will be denoted by Greek letters , , etc.
205
(7.23)
S.
a S S1 A
Weyl
(7.24)
Proof. (i) To prove the rst formula (7.23) it is sucient to assume that S is a
quadratic Fourier transform SW,m since these generate Mp(n). Suppose indeed we
have shown that
1
T(SW z0 ) = SW,m T(z0 )SW,m
.
(7.25)
Writing an arbitrary element S of Mp(n) as a product SW,m SW ,m we have
1
1
T(Sz0 ) = SW,m (SW ,m T(z0 )SW
,m )SW,m
1
= SW,m T(SW z0 )SW,m
= T(SW SW z0 )
= SW,m SW ,m T(z0 )(SW,m SW ,m )1
= ST(z0 )S1
and the case of a general S Mp(n) follows by induction on the number of terms
(the argument above actually already proves (7.23) in view of (7.4)). Let us thus
prove (7.25); equivalently:
1
T(z0 )SW,m = SW,m T(SW
z0 ).
(7.26)
(7.27)
206
where SW0 ,m is one of the metaplectic operators associated to W0 . Let us now set
1
1
h(x) = SW,m T(SW
z0 )(x) and (x0 , p0 ) = SW,m
(x0 , p0 );
we have
h(x) =
n/2
1
2i
(W )
1
2
1
2
p0 , x0 .
1
2
1
2
p0 , x0 + p0 , x .
But this equality immediately follows from Proposition 2.37 and its Corollary 2.38.
To prove the second formula (7.23) recall (Proposition 5.52, Subsection 5.5.3 of
Chapter 5) that we have
T&(z0 ) = T(z0 )T&(0)T(z0 )1 .
It follows that
T(z0 )1 S1 .
T&(Sz0 ) = ST(z0 )(S1 T&(0)S)
We claim that S1 T&(0)S = T&(0). It suces of course to prove this when S = SW,m .
For S(Rnx ) we have
1
n/2
i
&
T (0)SW,m (x) = 2i
(W ) e W (x,x ) (x ) dn x
1
n/2
i
= 2i
(W ) e W (x,x ) (x ) dn x
= SW,m T&(0)(x)
and hence
the second formula (7.23).
1 &
SW,m
T (0)SW,m = T&(0);
207
Let us now prove part (ii) of the theorem. In view of formula (6.23) in Theorem
6.12 we have
w
(a S) = a (Sz)T(z)d2n z
where (a S)w is the Weyl operator associated to the symbol a S, that is,
performing the change of variables Sz z and taking into account the fact that
det S = 1,
(a S)w = a (z)T(S 1 z)d2n z.
By formula (7.23) in Theorem 7.13 we have S1 T(z)S = T(S 1 z) and hence
(a S) =
w
2n z = S1
a (z)S1 T(z)Sd
2n
a (z)T (z)d z S
which is (7.24).
(7.29)
W (S)(z)
= W (S 1 z).
(7.30)
2n
S,
)L2 (Rn ) .
S,
S)
L2 (Rn ) = (S1 A
S)a(z)d
z = (A
W (S,
x
x
W (, )(z)(a S)(z)d2n z
W (, )(S 1 z)a(z)d2n z,
which establishes (7.29) since and are arbitrary; (7.30) trivially follows taking
= .
208
it = H
where is a function (or distribution) in the x, t variables. For arbitrary H this
equation is dicult to solve explicitly; it turns out, however, that when H is a
quadratic form in z, then the solutions can be expressed using metaplectic operators. To prove this remarkable fact we will have to identify the Lie algebra of the
metaplectic group.
(7.32)
H
Recall the following notation: (St,t
) is the time-dependent ow determined by H,
that is if t z(t) is the solution of (7.32) with z(t ) = z then
H
z(t) = St,t
(z ).
(7.33)
H
We will set St,0
= StH . Assume that H does not depend on t; then (StH ) is the
one-parameter subgroup of Sp(n) given by
StH = etJH .
Conversely, if (St ) is an arbitrary one-parameter subgroup of Sp(n), then St =
etX for some X sp(n) and we have (St ) = (StH ) where H is the quadratic
Hamiltonian
(7.34)
H(z) = 12 JXz, z .
The following elementary result is useful:
209
T
Y JXz, z =
1
2
(Y T JX X T JY )z, z ,
1
2
J(Y X XY )z, z
mp(n) sp(n)
exp
exp
Mp(n) Sp(n)
(7.35)
Mp
commutative.
In Chapter 2, Subsection 2.1.3, we called sp(n) the symplectic algebra; we
will call mp(n) the metaplectic algebra. For an approach using the beautiful
and important notion of symplectic Cliord algebra see Crumeyrolle [26] and the
monograph by Habermann and Habermann [87].
210
Theorem 7.16.
(i) The linear mapping F which to X sp(n) associates the anti-Hermitian
Weyl
where H
H with H given by (7.34) is injective,
operator F (X) = i H
and we have
[F (X), F (X )] = F ([X, X ])
(7.36)
for all X, X sp(n);
(ii) The image F (sp(n)) of F is the metaplectic algebra mp(n).
Proof. It is clear that the mapping F is linear and injective. Consider the matrices
1 0 jk + kj
jk
0
Xjk =
,
, Yjk =
0
jk
0
2 0
1
0
0
Zjk =
(1 j k n)
2 jk + kj 0
with 1 the only non-vanishing entry at the jth row and kth column; these matrices
form a basis of sp(n) (see Exercise 2.17). For notational simplicity we will assume
that n = 1 and set X = X11 , Y = Y11 , Z = Z11 :
1 0
0 1
0 0
X=
,Y =
,Z=
;
0 1
0 0
1 0
the case of general n is studied in an exactly similar way. To the matrices X, Y ,
Z correspond via formula (7.34) the Hamiltonians
HX = px , HY = 12 p2 , HZ = 12 x2 .
X = F (X), H
Y = F (Y ), H
Z = F (Z) form a basis of the vector
The operators H
space F (sp(1)); they are given by
X = ixx 1 i , H
Y = 1 2 x2 , H
Z = 1 x2 .
H
2
2
2
Let us show that formula (7.36) holds. In view of the linearity of F it is sucient
to check that
X ] = H
[X,Y ] ,
X , H
[H
Z ] = H
[X,Z] ,
X , H
[H
Z ] = H
[Y,Z] .
Y , H
[H
We have thus proved that F is a Lie algebra isomorphism. To show that F (sp(1)) =
mp(1) it is thus sucient to check that the one-parameter groups
i
t Ut = e HX t ,
i
t Vt = e HY t ,
i
t Wt = e HZ t
211
are subgroups of Mp(1). Let 0 S(R) and set (x, t) = Ut 0 (x). The function
is the unique solution of the Cauchy problem
it = (ixx + 12 i) , (, 0) = 0 .
A straightforward calculation (using for instance the method of characteristics)
yields
(x, t) = et/2 0 (et x)
,L(t),0 where L(t) = et and we thus have
hence the group (Ut ) is given by Ut = M
Ut Mp(1) for all t. Leaving the detailed calculations to the reader one similarly
veries that
1/2
i
1
(x x )2 0 (x )dx ,
exp
Vt 0 (x) =
2it
2t
1 2
Wt 0 (x) = exp x 0 (x),
2
so that Vt is a quadratic Fourier transform corresponding to the generating function W = (xx )2 /2t and Wt is the operator VtI ; in both cases we have operators
belonging to Mp(1).
We leave it to the reader to check that the diagram (7.35) is commutative.
Exercise 7.17. Show that exp F 1 = Mp exp where exp is a collective notation
for the exponential mappings mp(n) Mp(n) and sp(n) Sp(n) [Use the
generators of mp(n) and sp(n).]
Let us apply the result above to the Schr
odinger equation associated to a
quadratic Hamiltonian (7.31). Since Mp(n) covers Sp(n) it follows from the unique
path lifting theorem from the theory of covering manifolds that we can lift the path
H
= StH in a unique way into a path t StH in Mp(n) such that S0H = I.
t St,0
Let 0 S(Rnx ) and set
(x, t) = St 0 (x).
It turns out that satises Schrodingers equation
it = H
Weyl
is the Weyl operator H
where H
H. The following two exercises will be used
in the proof of this property:
is given by
Exercise 7.18. Verify that the operator H
2
= Hpp x , x i Hpx x, x + 1 Hxx x, x i Tr(Hpx )
H
2
2
2
(7.37)
212
Weyl
Weyl
Exercise 7.19. Let H
H, K
K where H and K are quadratic Hamiltonians (7.31). Show that
K]
= i{H,
[H,
K} = i(XH , XK )
where {, } is the Poisson bracket (5.38). (You might want to use the previous
Exercise.)
odingers equation:
Let us now show that = St 0 is a solution of Schr
Corollary 7.20. Let t St be the lift to Mp(n) of the ow t StH . For every
0 S(Rnx ) the function dened by (x, t) = St 0 (x) is a solution of the partial
dierential equation
it = H
, (, 0) = 0
Weyl
where H
H.
Proof. We have
it = i lim
t0
t0
1
(St I) St 0 ,
t
1
(St I) f = Hf
t
(7.38)
(7.39)
213
, (, 0) = z0 ,M0
= H
t
(7.40)
1 t
a(t) = exp
T r(Hpp M )dt
2 0
(7.43)
and M (t) = X(t)Y (t)1 where the matrices X(t) and Y (t) are calculated as follows: nd X0 and Y0 such that M0 = X0 Y01 ; then X = X(t) and Y = Y (t) are
given by
X
H X
(7.44)
, X(0) = X0 , Y (0) = Y0
= St
Y
Y
where (StH ) is the linear symplectic (time-dependent) ow determined by the quadratic Hamiltonian H.
214
Proof. (Cf. [128], 2.1, for computational details). Making the Ansatz
i
(7.45)
(7.46)
and
with initial conditions M (0) = M0 , a(0) = 1. The equation (7.46) is recognized to
be a matrix Riccati equation; setting M = XY 1 we have
M = XY 1 XY 1 Y X 1 ;
since M = M T , insertion shows that the matrices X and Y satisfy the equations
X + Hxx Y + Hxp X = 0,
Y Hpx Y Hpp X = 0
or, equivalently,
d X
X
,
= JH
Y
dt Y
hence (7.44) (we leave it to the reader to check that M = XY 1 only depends
on M0 ). One veries by a straightforward computation that the matrix M =
XY 1 thus obtained indeed is symmetric, and one nally solves the equation
(7.46) explicitly, which yields (7.43).
Note that the center of the Gaussian t follows the Hamiltonian trajectory
determined by the quadratic function H; this typical behavior, well-known in
quantum mechanics, is related to Ehrenfests theorem on average values (see for
instance Messiah [123], Chapter VI, for a proof and comments).
215
(7.47)
7.3.1 The Maslov index
(S)
In this section we dene and study in detail the Maslov index of an arbitrary
metaplectic operator S Mp(n). We will see that it can be very simply related to
the Maslov index on the universal covering group and that it is in fact identical
to the Maslov index on Spq (n) studied in Chapter 3 if we take q = 2.
Theorem 7.22. Let W = (P, L, Q) be a quadratic form (7.47).
(i) Let W = (P , L , Q ) and W = (P , L , Q ) be such that SW ,m =
SW,m SW ,m (cf. Lemma 7.4), then the metaplectic Maslov indices m, m ,
and m satisfy the relation
m m + m Inert(P + Q) mod 4
(7.48)
1
1
sign(P + Q) m + m + sign(P + Q ) mod 4 (7.49)
2
2
and also
rank(P + Q) rank(P + Q ) mod 4.
(7.50)
216
Proof. We only sketch the proof, since it is rather long and technical. Part (i)
(formula (7.48)) is due to Leray [107], Chapter I, 1,2. Part (ii) is due to the author
[56, 58] (also see [61], Chapter 3). The idea is the following: let k (x) = (x k)
(k > 0) where (x) = exp(|x|2 /2) (any other radially symmetric element of
S(Rnx ) would do as well). Using carefully the method of stationary phase one nds
that if S = SW,m SW ,m , then
k (0) C im+m n/2 (ei/4 )sign(P +Q) k rank(P +Q)/2 for k
S
where C > 0 is a constant depending on W and W . If S = SW ,m SW ,m we
will thus have
k (0) C im +m n/2 (ei/4 )sign(P +Q ) k rank(P +Q )/2 for k
S
for some new constant C > 0 depending on W and W . This is only possible
if (7.49) and (7.50) hold.
Denition 7.23.
(SW,m ) = 2m n
(7.51)
(S)
(SW,m ) +
(7.52)
[The
(which is uniquely dened modulo 4) is called the Maslov index of S.
hats should be understood as class modulo 4.]
Notice that the Maslov index is well dened in view of (ii) in Theorem 7.22
is indeed independent of the factorization of S in a product
which shows that
(S)
of two quadratic Fourier transforms.
It turns out (not so unexpectedly!) that we can rewrite denition (7.52) in
terms of the signature of a triple of Lagrangian plane. Let us begin by proving a
technical result:
Lemma 7.24. Let SW and SW be two free symplectic matrices and write
A B
A B
A
B
, SW =
SW =
, SW SW =
.
C D
C D
C D
Let P = 0 Rnp . We have
(P , SW P , SW SW P ) = sign(B 1 B (B )1 ).
(7.53)
217
Proof. Let us rst prove (7.53) in the particular case where SW P = X , that is
when SW is of the type
A B
SW =
C 0
in which case B = AB . Using the Sp(n)-invariance and the antisymmetry of the
signature, we have, since SW P = X :
1
P , P ).
(P , SW P , SW SW P ) = (X , SW
(7.54)
0
p
=
B T p
AT p
1
so the Lagrangian plane SW
P has for equation Ax + Bp = 0; since B is invertible
(because SW is free) this equation can be rewritten p = B 1 Ax. Using the second
formula (1.24) (Chapter 1, Section 1.4) together with the identity (7.54) we have
(P , SW P , SW SW P ) = sign(B 1 A)
= sign(B 1 (AB )(B )1 )
= sign(B 1 B (B )1 )
which proves (7.53) in the case SW P = X . The general case reduces to the
former, using the fact that the symplectic group acts transitively on all pairs of
transverse Lagrangian planes. In fact, since
SW P P = X P = 0
we can nd S0 Sp(n) such that (P , SW P ) = S0 (P , X ), that is SW P = S0 X
and S0 P = P . It follows, using again the antisymmetry and Sp(n)-invariance of
that:
(P , SW P , SW SW P ) = (X , (SW S0 )1 P , P )
which is (7.54) with SW replaced by SW S0 . Changing SW into S01 SW (and hence
leaving SW SW unchanged) we are led back to the rst case. Since S0 P = P , S0
must be of the type
L
0
S0 =
, det(L) = 0, P = P T ;
P (L1 )T
writing again SW in block-matrix form, the products SS0 and S01 SW are thus
of the type
B(LT )1
L1 B
SS0 =
, S01 SW =
218
(the stars * are block-entries that are easily calculated, but that we do not need
to write down), and hence
(P , SW P , SW SW P ) = sign(LT B 1 B B 1 L) = sign(B 1 B B 1 )
proving (7.53) in the general case.
(S)
(SW,m ) +
(SW ,m ) + (P , SW P , SW SW P ).
(7.55)
(SS ) =
S on Sp(n).
where S, S are the projections of S,
Proof. (i) If W = (P, L, Q) and W = (P , L , Q ) we have
L1
L1 Q
SW =
,
P L1 Q LT L1 P
L1
L1 Q
SW =
P L1 Q LT L1 P
(formula (2.43) in Chapter 1, Subsection 2.2.3) and thus
L1 (P + Q)L1
SW = SW SW =
.
Writing
(7.56)
219
(ii) (We are following de Gosson [56, 61].) Assume rst that S = SW,m and let us
show that
+
(S)
(SW,m ) + (P , SP , SSW P )
(SSW,m ) =
for every S Mp(n). Let Lag(n) and dene functions fW : Mp(n) Z and
by gW, : Mp(n) Z by
=
(P , SP , SSW P ),
fW (S)
(SSW,m )
(S)
=
(SP , P , ).
(S)
gW, (S)
In view of the cocycle property of we have
(P , SP , SSW P )
= (P , SP , ) (P , SSW P , ) + (SP , SSW P , );
from which follows that
= gW, (SSW,m ) gW, (S)
(SP , SSW P , P ).
fW (S)
Choose now such that
SP = SSW P = P = 0;
gW, is constant on a neighborhood of S in Mp(n); since on the other hand
SP SSW P = SW P P = 0,
the function S (SP , SSW P , ) is constant in a neighborhood of S in Sp(n)
is locally constant on Mp(n) and hence constant since Mp(n) is
hence fW (S)
connected. That constant value is
fW (SW ,m ) =
(SW ,m SW,m )
(SW ,m )
(P , SW P , SW SW P ),
that is
fW (SW ,m ) =
(SW,m )
proving formula (7.56) in the case S = SW,m . To prove it in the general case we
proceed as follows: writing S = SW,m SW ,m we have
(SS ) =
(SSW,m ) +
(SW ,m ) + (P , SSW P , SS P )
+
=
(S)
(SW,m ) + (P , SP , SSW P ) +
(SW ,m )
+ (P , SSW P , SS P ).
Set
= (P , SP , SSW P ) + (P , SSW P , SS P );
220
using successively the antisymmetry, the cocycle property, and the Sp(n)-invariance, and of we have
= (P , SSW P , SS P ) (P , SSW P , SS P )
= (SSW P , SS P , SP ) (P , SS P , SP )
= (SW P , S P , P ) + (P , SP , SS P ),
hence, noting that S = SW SW :
+
(S)
(S ) + (P , SP , SS P ) mod 4
(SS ) =
As an application let us calculate the indices of the identity and of the inverse
of a metaplectic operator:
Corollary 7.27. We have
=0 ,
(I)
(S1 ) =
(S).
(7.57)
(I)
(SW,m SW,m
)
1
(SW,m
) + (P , SW P , P )
=
(SW,m ) +
1
(SW,m
)
=
(SW,m ) +
(I)
The second formula (7.57) follows from the rst in view of (7.56) since
+
=
(S)
(S1 ) + (P , SP , P )
(I)
(SS1 ) =
using the fact that (P , SP , P ) = 0.
7.3.2 The Maslov indices
(S)
The reader will certainly have noted that in all the formulae above a special role
is played by the Lagrangian plane P = 0 Rnp . It turns out that it is possible
(and useful!) to dene a Maslov index associated to an arbitrary Lagrangian plane
Lag(n), whose properties are quite similar to those of the standard Maslov
index.
221
Denition 7.28. Let Lag(n) and S0 Sp(n) be such that = S0 P . Let S0 be
one of the two operators in Mp(n) such that Mp (S0 ) = S0 . The integer modulo
4 dened by
=
(S01 SS0 ) mod 4
(S)
is called the Maslov index of S relatively to .
That the integer
(S01 SS0 ) is independent of the choice of S0 is clear, since
for a given S0 Sp(n) there are only two operators S0 with projection S0
Sp(n). Slightly more delicate is the proof that
(S01 SS0 ) does not depend on the
choice of S0 Sp(n) such that = S0 P . To show this we proceed as in de Gosson
[56]. Assume that = S1 P . Then S0 S11 P = P so that R = S0 S11 is in St(P ),
Mp(n) with projection R. We have,
the stabilizer of P in Sp(n). Now choose R
using (7.56) together with the cocycle property and Sp(n)-invariance of :
=
1 ) +
1 S1 SS0 R)
(R
(S01 SS0 ) +
(R)
(R
0
+ (P , R1 P , R1 S01 SS0 P ) + (P , S01 SS0 P , S01 SS0 RP ),
1 ) =
(R),
that is, since
(R
1 S1 SS0 R)
=
+
(R
(R)
(S01 SS0 ) +
(R)
0
+ (RP , P , S01 SS0 P ) + (P , S01 SS0 P , S01 SS0 RP )
=
(S1 SS0 ) + (P , P , S 1 SS0 P ) + (P , S 1 SS0 P , S 1 SS0 P )
0
(R
0
This proves that
(S01 SS0 ) only depends on S and , as claimed, which completely
justies Denition 7.28.
detailed in
The properties of
are immediately deduced from those of
Theorem 7.26:
Corollary 7.29. The Maslov index
on Mp(n) has the following properties:
(i) For all S, S in Mp(n) we have
+
(S)
(S ) + (, S, SS ) mod 4
(SS ) =
(7.58)
S on Sp(n).
where S, S are the projections of S,
(ii) We have
= 0 and
(I)
(S1 ) =
(S).
Proof. This is an immediate consequence of the denition of
and Theorem 7.26
and Corollary 7.27.
222
The relation (7.58) clearly indicates a relationship between the Maslov index
and the Maslov index on Sp (n) studied in Chapter 3, Subsection 3.3.2 (such
a relation was of course already apparent in formula (7.56) of Theorem 7.26).
Corollary 7.30. Let S Sp (n) have projection S Mp(n). For every
Lag(n) we have
= (S ) mod 4,
(S)
that is
= [ ]2 ([S]2 )
(S)
where [ ]2 is the Maslov index on Sp2 (n).
Proof. This is an immediate consequence of Proposition 3.31 on the uniqueness of
a function : Sp (n) Z satisfying
(S S
) = (S ) + (S
) + (, S, SS )
(S = Sp (S ), S = Sp (S
)) and locally constant on the set {S : S = 0}.
Remark 7.31. In [58] we have shown that it is possible to reconstruct the ALM
index modulo 4 using only the properties of the Maslov indices
on Mp(n).
T(z0 + z1 ) =
i
e 2 (z0 ,z1 ) T(z0 )T(z1 ).
(7.59)
(7.60)
223
(S)
7.4.1 The operators R
Let MS be the symplectic Cayley transform of S Sp(n) such that det(S I) = 0
(Denition 4.13 in Chapter 4, Section 4.3): it is the symmetric matrix
MS =
1
J(S + I)(S I)1 .
2
(7.62)
interpreted as a Bochner integral. We will show that provided we choose the integer
=S
correctly this is the Weyl representation of S Mp(n) such that Mp (S)
(the denition of the operators (7.62) is due to Mehlig and Wilkinson [121], who
however do not make precise the integer ). Anyhow, formula (7.62) denes a Weyl
operator with complex Gaussian twisted symbol
i
i
(z) =
e 2 MS z,z .
a(S),
| det(S I)|
(7.63)
Assume in addition that det(S + I) = 0. Since the symbol and twisted symbol of
a Weyl operator are symplectic Fourier transforms of each other, the symbol of
(S) is a(S), = F a(S),
R
, that is
n
i
i
1
i
(S),
a
(z) =
e (z,z ) e 2 MS z ,z d2n z ;
2
| det(S I)|
applying the Fresnel formula (7.61) with m = 2n we then get
1
(S),
1
i
i+ 2 sgn MS
(z) =
| det MS |1/2 e 2 JMS Jz,z .
| det(S I)|
Since, by denition of MS ,
det MS = 2n det(S + I) det(S I),
we can rewrite the formula above as
1
1
i
i+ 2 sgn MS
e 2 JMS Jz,z ;
a(S), (z) = 2n/2
| det(S + I)|
(7.64)
224
(S) can
Lemma 7.32. Let S Sp(n) be such that det(S I) = 0. The operator R
be written as
n
1
i
R (S) =
i | det(S I)| e 2 (Sz,z) T((S I)z)d2n z,
(7.65)
2
that is, as
(S) =
R
1
2
n
i
| det(S I)| T(Sz)T(z)d2n z.
(7.66)
Proof. We have
1
1
J(S + I)(S I)1 = J + J(S I)1 ,
2
2
hence, in view of the antisymmetry of J,
MS z, z = J(S I)1 z, z = ((S I)1 z, z).
Performing the change of variables z (S I)1 z we can rewrite the integral
in the right-hand side of (7.62) as
i
i
MS z,z
2n
2
T (z)d z = e 2 (z,(SI)z) T((S I)z)d2n z
e
i
= e 2 (Sz,z) T((S I)z)d2n ,
hence (7.66). Taking into account the relation (7.60) we have
i
(S):
Let us begin by studying composition and inversion for the operators R
Theorem 7.33. Let S and S in Sp(n) be such that det(S I) = 0, det(S I) = 0.
(i) If det(SS I) = 0, then
(S)R
(S ) = R
R
+ + 12 sgn M (SS ).
(7.67)
(7.68)
225
(S) and R
(S ) are, respectively,
Proof. (i) The twisted symbols of R
i
i
a (z) =
e 2 MS z,z ,
| det(S I)|
i
i
b (z) =
e 2 MS z,z .
| det(S I)|
(S)R
(S ) is given by
The twisted symbol c of the compose R
1
n
i
c (z) = 2
e 2 (z,z ) a (z z )b (z )d2n z
(formula (6.45), Theorem 6.30, Subsection 6.3.1 of Chapter 6), that is
i
i
c (z) = K e 2 (z,z ) e 2 (z,z ) d2n z
where the constant in front of the integral is
n
1
i+
K=
2
| det(S I)(S I)|
and the phase (z, z ) is
(z, z ) = MS (z z ), z z + MS z , z
that is
Observing that
(z, z ) 2 MS z, z = (J 2MS )z, z
= 2 J(S I)1 z, z ,
we have
(z, z ) + (z, z )
= 2 J(S I)1 z, z + MS z, z + (MS + MS )z , z
and hence
c (z) = Ke
i
2 MS z,z)
e J(SI)
i
z,z
i
e 2 (MS +MS )z ,z d2n z .
(7.69)
Applying the Fresnel formula (7.61) with m = 2n and replacing K with its value
we get
1
n
i
i
c (z) = 2
| det[(MS + MS )(S I)(S I)]|1/2 e 4 sgn M e (z)
(7.70)
226
227
(S)
Corollary 7.34. Let S Sp(n) be such that det(S I) = 0. The operators R
1
(S) .
(S) = R
are unitary: R
Proof. In view of Proposition 6.16 (Subsection 6.2.2, Chapter 6) giving the symbol
of the adjoint of a Weyl operator is the complex conjugate of the symbol of that
operator. Since the twisted and Weyl symbol are symplectic Fourier transforms of
(S) is thus given by
each other, the symbol a of R
i
i
i
(2)n a(z) =
e (z,z ) e 2 MS z ,z d2n z .
| det(S I)|
We have
n
(2) a(z) =
| det(S I)|
i
i
e (z,z ) e 2 MS z ,z d2n z .
(7.71)
iCZ (r S ) = iCZ (S ) + 2r
228
iCZ (S S
) = iCZ (S ) + iCZ (S
)+
1
sign MS .
2
(7.72)
which suggests that the operators R
(S) generate a true (two-sheeted) unitary
iCZ (S)
representation of the symplectic group, that is the metaplectic group. Formula
(7.72) is however not sucient for this claim, because the R
(S) have only
iCZ (S)
been dened for det(S I) = 0. What we must do is to show that these operators
generate a group, and that this group is indeed the metaplectic group Mp(n).
Let us recall the following notation, used in Section 4.3 of Chapter 4: if W
is a quadratic form
W (x, x ) =
1
2
P x, x Lx, x +
1
2
Qx , x
A
C
WS = DB 1 + B 1 A B 1 (B T )1
(7.74)
B
is the free symplectic matrix generated by W . Also recall that
D
= (1) det L
det(P + Q L L )
(7.75)
229
Proposition 7.36. Let SW,m Mp(n) be a quadratic Fourier transform with projection S = SW .
(SW ) = SW,m provided that
(i) We have R
mod 4.
iCZ (S)
(7.76)
(7.77)
(x0 )dn x0 = 1,
(S)(0) =
R
1
2
n
| det(S I)|
(7.78)
MS (0, p0 ), (0, p0 ) = WS1 p0 , p0
(7.79)
230
(SW )(0) =
R
1 n i
i e 4 sgn WS | det L|1/2 .
2
n m
1
i
| det L|
2i
1 n mn/2
i
| det L|
2
and hence
i e
i
4
sgn WS
= imn/2 .
1
2
sgn WS m 12 n mod 4
Recall from Chapter 7 that every S Mp(n) can be written (in innitely
many ways) as a product S = SW,m SW ,m . We are going to show that SW,m and
SW ,m always can be chosen such that det(SW,m I) = 0 and det(SW ,m I) = 0.
(SW ) generate Mp(n). In fact, every S Mp(n)
Corollary 7.37. The operators R
can be written as a product
(SW )R
(SW )
S = R
where det(SW I) = 0, det(SW I) = 0.
(7.80)
231
Proof. Recall that the quadratic Fourier transforms SW,m generate Mp(n); in fact
every S Mp(n) can be written in the form S = SW,m SW ,m . In view of Proposition 7.36 it thus suces to show that W and W can be chosen so that SW =
Mp (SW,m ) and SW = Mp (SW ,m ) satisfy det(SW I) = 0, det(SW I) = 0.
(SW ) generate Mp(n) follows from formula (7.80) since Let us write
That the R
S = SW,m SW ,m and apply the factorization (2.51) to each of the factors; writing
W = (P, L, Q), W = (P , L , Q ) we have
,L,m JV(P +Q) M
,L ,m JVQ .
S = VP M
(7.81)
We claim that SW,m and SW ,m can be chosen in such a way that det(SW I) = 0
and det(SW I) = 0, that is,
det(P + Q L LT ) = 0 and det(P + Q L LT ) = 0;
this will prove the assertion. We rst remark that the right-hand side of (7.81)
obviously does not change if we replace P by P + I and Q by Q I where
R. Choose now such that it is not an eigenvalue of P + Q L LT and
is not an eigenvalue of P + Q L LT ; then
det(P + Q I L LT ) = 0,
det(P + I + Q L LT ) = 0
and we have S = SW1 ,m1 SW1 ,m1 with
1
1
P x, x Lx, x + (Q I)x , x ,
2
2
1
1
W1 (x, x ) = (P + I)x, x L x, x + Q x , x ;
2
2
W1 (x, x ) =
So far, so good. But we havent told the whole story yet: there remains to prove
(S).
that every S Mp(n) such that det(S I) = 0 can be written in the form R
Proposition 7.38. For every S Mp(n) such that det(S I) = 0, we have S =
(S) (S) with
R
(S) = + + 12 sgn(M + M ).
(7.82)
(SW )R
(SW ). A straightforward calculation using
Proof. Let us write S = R
the composition formula for Weyl operators (6.45) in Theorem 6.30 of Chapter 6,
together with the Fresnel integral (7.61), shows that we have
S =
1 n
2
i+ + 2 sgn(M+M )
| det(SW I)(SW I)(M + M )|
e 2 N z,z T(z)d2n z
i
(7.83)
232
and N is given by
N = M (M + 12 J)(M + M )1 (M 12 J).
We claim that
det(SW I)(SW I)(M + M ) = det(S I)
(7.84)
(7.85)
which is precisely (7.84). Formula (7.85) is at rst sight more cumbersome; there
is however an easy way out: assume that S = SW ,m ; we know that we must
have in this case
N = 12 J(SW SW + I)(SW SW I)1
and this algebraic identity then holds for all S = SW SW since the free symplectic
matrices are dense in Sp(n). Thus,
S =
1
2
n
i+ + 2 sgn(M+M
| det(S I)|
1
2
sgn(M + M ))
is eectively one of the two possible choices for arg det(S I). We have
( + +
1
2
sgn(M + M ))
= arg det(SW I) arg det(SW I) + 12 sgn(M + M );
233
1
2
Exercise 7.39. Prove in detail the equality (7.83) used in the proof of Proposition
7.38.
Part III
Quantum Mechanics
in Phase Space
Chapter 8
238
239
so that is indeed a state. More general states which are genuinely compactly
supported distributions occur naturally in statistical mechanics; here is one basic
example:
Example 8.2. The microcanonical Gibbs state. Let H be a time-dependent
Hamiltonian function and assume that the energy shell E : H(z) = E has compact smooth boundary. The microcanonical state is dened by
micro (z) =
1
(H(z) E)
(E, V )
240
=
, = Cov(X, P ).
(P )2
This example is a particular case of the following situation: consider the
random vector
Z = (X1 , . . . , Xn ; P1 , . . . , Pn )
where Xj : Rnx R and Pj : Rnp R; denoting by XX and P P the covariance
matrices of X = (X1 , . . . , Xn ) and P = (P1 , . . . , Pn ), and by
XP = (Cov(Xj , Pk ))1j,kn
the covariance matrix of Z is the symmetric matrix
XX XP
=
with P X = TXP .
P X P P
Remark 8.5. The functions Xj and Pk are usually interpreted, as the notation is
intended to suggest, as the random variables corresponding to measurements of
the position and momentum coordinates xj and pk .
)L2 (Rn )
(A,
x
.
(, )L2 (Rnx )
241
(8.1)
This shows that what counts in practice is actually the ray { : = 0, C}:
the replacement of by in formula (8.1) has no eect, whatsoever, on the
mathematical expectation: the true state space of quantum mechanics is actually
is motivated by the following argument:
a projective space. The denition of A
assume that the spectrum of A is discrete, and consists of real numbers j , j N
is assumed to be self-adjoint); let (j )jN be the corresponding
(recall that A
A basic axiom of quantum mechanics is
orthonormal basis of eigenfunctions of A.
when measuring
that the probability of obtaining the value j for the observable A
it in the normalized state is
Pr ( = j ) = |(, j )L2 (Rnx ) |2 .
in the state is in this case
The mathematical expectation of the observable A
=
A
)L2 (Rn )
j |(, j )L2 (Rnx ) |2 = (A,
x
j=1
242
Proposition 8.8. If the variances and covariances of two self-adjoint operators A
and B exist and satisfy the commutation relation
B]
=A
B
B
A
= iI,
[A,
then the following inequality holds:
2
2 (B)
(A)
1
4
B)
+ 1 2 .
Cov(A,
4
(8.2)
1
4
B)
+ 1 2
Cov(A,
4
(8.3)
= B
= 0. We have, since A
is self-adjoint,
when A
2 = (A
2 , )L2 (Rn ) = ||A||
22 n ,
A
L (Rx )
x
2 = (B
2 , )L2 (Rn ) = ||B||
22 n ,
B
L (Rx )
x
hence, using CauchySchwarzs inequality:
2 |(A,
B)
L2 (Rn ) |2 = |(A
B,
)L2 (Rn ) |2 .
2 B
A
x
x
Noting that
B
= 1 (A
B
+B
A)
+ 1 [A,
B]
= 1 (A
B
+B
A)
+
A
2
2
2
i
2 Id
i 2
2| ;
243
H ||P ||H ;
(k + 1) 2||X||
(i) We have
S )2 (B
S )2
(A
S
S
1
4
B)
1 [A,
B]
2
Cov(A,
(8.5)
j , Pj ) + 1 2 .
Cov(X
4
(8.6)
= S.
for every S Mp(n) with (S)
(ii) In particular
j )2 (Pj )2
(X
S
S
1
4
Proof. Using the metaplectic covariance property (7.24) in Theorem 7.13 we have
2
2 n
2 2,
AS S
|SA(x)|
d x = ||SA||
=
L
hence, since S is a unitary operator:
2 2 = A
2 .
S = ||A||
A
L
S
Writing similar relations for BS , A2S and BS2 formulae (8.5) and (8.6)
follow.
We will use this result in the forthcoming subsections to restate the uncertainty principle in a geometric form. Let us rst explain what we mean by a
quantum mechanically admissible covariance matrix.
244
0
D=
0
where the diagonal elements of are the moduli of the eigenvalues of JM . This
is a truly remarkable result which will allow us to construct a precise phase space
quantum mechanics in the ensuing chapters. One can without exaggeration say
that this theorem carries the germs of the recent developments of symplectic topology; it leads immediately to a proof of Gromovs famous non-squeezing theorem
in the linear case and has many applications both in mathematics and physics.
Williamson proved this result in 1963 and it has been rediscovered several times
since that with dierent proofs.
ST M S =
0
, diagonal,
(8.7)
(8.8)
245
, K 2 fj = 2j fj
and that the vectors of the basis {ei , fj }1i,jn satisfy the relations
(ei , ej ) = ei , Kej M = j ei , fj M = 0,
(fi , fj ) = fi , Kfj M = j fi , ej M = 0,
(fi , ej ) = fi , Kej M = i ei , fj M = i ij .
1/2
1/2
Setting ei = i
ei and fj = j
fj , the basis {ei , fj }1i,jn is symplectic. Let S be the element of Sp(n) mapping the canonical symplectic basis to
{ei , fj }1i,jn . The , M -orthogonality of {ei , fj }1i,jn implies (8.7) with =
diag[1 , . . . , n ]. To prove the uniqueness statement (ii) it suces to show that if
there exists S Sp(n) such that S T LS = L with L = diag[, ], L = diag[ , ],
then = . Since S is symplectic we have S T JS = J and hence S T LS = L is
equivalent to S 1 JLS = JL from which follows that JL and JL have the same
eigenvalues. These eigenvalues are precisely the complex numbers i/j .
The diagonalizing matrix S in the theorem above has no reason to be unique.
However:
Proposition 8.12. Assume that S and S are two elements of Sp(n) such that
M = (S )T DS = S T DS
where D is the Williamson diagonal form of M . Then S(S )1 U(n).
Proof. Set U = S(S )1 ; we have U T DU = D. We are going to show that U J =
JU ; the lemma will follow. Setting R = D1/2 U D1/2 we have
RT R = D1/2 (U T DU )D1/2 = D1/2 DD1/2 = I
246
(8.9)
Denition 8.13. With the ordering convention above (,1 , . . . , ,n ) is called the
symplectic spectrum of M and is denoted by Spec (M ):
Spec (M ) = (,1 , . . . , ,n )
Here are two important properties of the symplectic spectrum:
Proposition 8.14. Let Spec (M ) = (,1 , . . . , ,n ) be the symplectic spectrum of
M.
(i) Spec (M ) is a symplectic invariant:
Spec (S T M S) = Spec (M ) for every S Sp(n);
(8.10)
1
1
(ii) the sequence (1
:
,n , . . . , ,1 ) is the symplectic spectrum of M
(8.11)
247
(8.12)
Proof. When two matrices A and B have the same eigenvalues we will write A B.
When those of A are smaller than or equal to those of B (for a common ordering)
we will write A B. Notice that when A or B is invertible we have AB
BA.
With this notation, the statement is equivalent to
M M = (JM )2 (JM )2
since the eigenvalues of JM and JM occur in pairs i, i with and
real. The relation M M is equivalent to z T M z z T M z for every z R2n
z .
Replacing z by successively (JM 1/2 )z and (JM 1/2 )z in z T M z z T M z we thus
have, taking into account the fact that J T = J, that is, since J T = J,
M 1/2 JM JM 1/2 M 1/2 JM JM 1/2 ,
(8.13)
(8.14)
M JM J,
M JM J
M JM J,
(8.15)
Since we have
M 1/2 JM JM 1/2
(M J)2
248
(M) :
j, (x2j + p2j ) 1.
j=1
Denition 8.16. The number R (M) = 1/ 1, is called the symplectic radius of
the phase-space ellipsoid M; c (M) = R2 = /1, is its symplectic area.
The properties of the symplectic area are summarized in the following result,
whose hard part follows from Theorem 8.15:
Corollary 8.17. Let M and M be two ellipsoids in (R2n
z , ).
(i) If M M then c (M) c (M );
(ii) For every S Sp(n) we have c (S(M)) = c (M);
(iii) For every > 0 we have c (M) = 2 c (M).
Proof. (i) Assume that M : M z, z 1 and M : M z, z 1. If M M then
M M and hence Spec (M ) Spec (M ) in view of the implication (8.12) in
Theorem 8.15; in particular 1, 1, .
Let us prove (ii). We have S(M) : M z, z 1 with S = (S 1 )T M S 1 and M
thus have the same symplectic spectrum as M in view of Proposition 8.14, (i).
In the next subsection we generalize the notion of symplectic radius and area
to arbitrary subsets of phase space.
249
(8.16)
f ISp(n)
inf
f ISp(n)
{R2 : f () Zj (R)}.
(8.17)
250
251
and if R R there exists no f Symp(n) such that f (B(R )) B(R) because symplectomorphisms are volume-preserving. There remains to show that
cG (Zj (R)) = R2 ; it is at this point and only at this point! we will use
Gromovs theorem. If R R then the identity sends B(R ) in Zj (R) hence
cG (Zj (R)) R2 . Assume that cG (Zj (R)) > R2 ; then there exists a ball B(R )
with R > R and a symplectomorphism f such that f (B(R )) Zj (R) and this
would violate Gromovs theorem.
The reader is invited to show that, conversely, the existence of a symplectic
capacity implies Gromovs theorem:
Exercise 8.24. Assume that you have constructed a symplectic capacity c on
(R2n
z , ). Use the properties of c to prove Gromovs Theorem 8.22.
The number R dened by cG () = R2 is called the symplectic radius
of ; that this terminology is consistent with that introduced in Denition 8.16
above follows from the fact that all symplectic capacities (linear or not) agree on
ellipsoids. Let us prove this important property:
Proposition 8.25. Let M : M z, z 1 be an ellipsoid in R2n
z and c an arbitrary linear symplectic capacity on (R2n
,
).
Let
n, be the
1,
2,
z
symplectic spectrum of the symmetric matrix M . We have
c(M) =
= clin (M)
(8.18)
n,
where clin is any linear symplectic capacity.
Proof. Let us choose S Sp(n) such that the matrix S T M S = D is in Williamson
normal form; S 1 (M) is thus the ellipsoid
n
j, (x2j + p2j ) 1.
(8.19)
j=1
Since c(S 1 (M)) = c(M) it is sucient to assume that the ellipsoid M is represented by (8.19). In view of the double inequality
n, (x2n + p2n )
j, (x2j + p2j ) n,
j=1
we have
(x2j + p2j )
(8.20)
j=1
1/2
B(1/2
n, ) M Z(n, ),
The rst equality in formula (8.18) now follows from Gromovs Theorem 8.22;
the second equality is obvious since we have put M in normal form using a linear
symplectomorphism.
252
XP
P P
, P X = TXP
(8.21)
(8.22)
253
(8.23)
(8.24)
(8.25)
of R2n
z the Wigner ellipsoid associated to Z. (Beware the factor 1/2 in the
left-hand side!) We will say that W is a centered Wigner ellipsoid if z = 0.
254
' 2
$
1
exp 12 x2 +
x
2x p
p2
p2
p2
p2
(%
.
1.
The matrix is admissible if and only if j, 12 for every j and this is equivalent
to the conditions ,j for every j.
We are going to see that Wigner ellipsoids have quite remarkable properties;
in particular they will allow us to give a purely geometrical statement of the
uncertainty principle. We will for instance prove that:
An ellipsoid in R2n
z is a Wigner ellipsoid if and only if its intersection
with any ane symplectic plane passing through its center has area at
least 12 h.
Let us begin by reviewing some very basic facts about matrices and ellipsoids.
Recall that a Hermitian matrix M is positive-denite (M > 0) if the two following
equivalent conditions are satised:
M z, z > 0 for every z = 0;
Every eigenvalue of M is > 0.
Similarly, M is semi-denite positive (M 0) if these two conditions hold with >
replaced by . When M M > 0 (resp. M M 0) we will write M > M or
M < M (resp. M M or M M ).
255
(8.26)
for all z. Suppose that we do not have M M . Then there exists z0 = 0 such
that (M M )z0 , z0 < 0. The inequality (8.26) implies that we have
2 (M M )z0 , z0 2 M z0 , z M z, z ,
for every R; dividing both sides of this inequality by 2 and letting thereafter
we get (M M )z0 , z0 0 which is only possible if z0 = 0. This
contradiction shows that M M , as claimed. The second statement in the lemma
immediately follows.
256
h2n
()n
= n .
n!
2 n!
(8.27)
(It is thus smaller than that of the cubic quantum cells used in statistical mechanics by a factor of n!2n .)
Lemma 8.35. An ellipsoid M : M z, z 1 in R2n
z is a symplectic ball with radius
one if and only M Sp(n) and we then have M = S(B(1)) with M = (SS T )1 .
Assume
Then M is the set of all z R2n
z such that
Proof.
that M = S(B(1).
1
1
1 T 1
S z, S z 1 hence M = (S ) S is a symmetric denite-positive symplectic matrix. Assume conversely that M Sp(n). Since M > 0 we also
have M 1
1
>0
1
T
and there exists S Sp(n) such that M
= SS . Hence M : S z, S 1 z 1
is just S(B(1).
Another very useful observation is that we do not need all symplectic matrices
to produce all symplectic balls:
Lemma 8.36. For every centered symplectic ball B2n = S(B 2n (R)) there exist
unique real symmetric n n matrices L (det L = 0) and Q such that B2n =
S0 (B 2n (R)) and
L
0
S0 =
Sp(n).
(8.28)
Q L1
Proof. In view of Corollary 2.30 in Chapter 2, Subsection 2.2.2 we can factorize
S Sp(n) as S = S0 U where U U(n) and S0 is of the type (8.28). The claim
follows since U (B 2n (R)) = B 2n (R). (That L and Q are uniquely dened is clear
for if S0 (B 2n (R)) = S0 (B 2n (R)) then S0 (S0 )1 U(n) and S0 can only be of the
type (8.28) if it is identical to S0 .)
257
The proof above shows that every symplectic ball (and hence every quantum
blob) can be obtained from a ball with the same radius by rst performing a
symplectic rotation which takes it into another ball, and by thereafter applying
two successive symplectic transformations of the simple types
L
0
I
0
(L = LT , P = P T );
ML1 =
, VP =
0 L1
P 0
the rst of these transformations is essentially a symplectic rescaling of the coordinates, and the second a symplectic shear.
One of the main interests of quantum blobs comes from the following property
which shows that, if we cut a quantum blob by an ane symplectic plane, we will
always obtain an ellipse with area exactly 12 h. Assuredly, this property is not really
intuitive: if we cut an arbitrary ellipsoid in Rm by 2-planes, we will always get
elliptic sections, but these do not usually have the same areas! Here is a direct
elementary proof. It is of course sucient to assume that the ball is centered at 0.
Proposition 8.37. The intersection of B = S(B(R)) with a symplectic plane P is
an ellipse with area R2 . Hence the intersection of a quantum blob with any ane
symplectic plane has area at most 12 h (and equal to 12 h if that plane passes through
the center of that blob).
Proof. We have B P = S|P (B(R) P ) where S|P is the restriction of S to the
symplectic plane P = S 1 (P). The intersection B(R) P is a circle with area R2
and S|P is a symplectic isomorphism P P, and is hence area preserving. The
area of the ellipse B P is thus R2 as claimed.
This property is actually a particular case of a more general result, which
shows that the intersection of a symplectic ball with any symplectic subspace is a
symplectic ball of this subspace. We will prove this in detail below (Theorem 8.41).
We urge the reader to notice that the assumption that we are cutting
S(B 2n (R)) with symplectic planes is essential. The following exercise provides
a counterexample which shows that the conclusion of Proposition 8.37 is falsied
if we intersect S(B 2n (R)) with a plane that is not symplectic.
Exercise 8.38. Assume n = 2 and take S = diag[1 , 2 , 1/1 , 1/2 ] with 1 > 0,
2 > 0, and 1 = 2 . Show that S is symplectic, but that the intersection of
S(B 4 (R)) with the x2 , p1 plane (which is not conjugate) does not have area R2 .
The assumption that S is symplectic is also essential in Proposition 8.37:
Exercise 8.39. Assume that we swap the two last diagonal entries of the matrix S
in the exercise above so that it becomes S = diag[1 , 2 , 1/2 , 1/1 ].
(i) Show that S is not symplectic;
(ii) show that the section S (B 2n (R)) by the symplectic x2 , p2 plane does not
have area R2 .
258
,j (x2j + p2j ) 1,
j=1
1/2
1/2
1
hence
B(,n ); since ,n
S (M) contains the ball
B( ) and hence M S(B( ).
it contains a fortiori
Let us next prove that (ii) = (iii). Assume that M contains a quantum blob
Q; In view of Lemma 8.33 we may
assume that this quantum blob has the same
)). By denition, we can nd S Sp(n) such
center as M, that
is
Q
=
S(B(
259
An ellipsoid M of R2n
z containing a Wigner ellipsoid W is itself a Wigner
W then S(B 2n (z0 , )) M. (Notice that this property does not even require
that M and W be concentric in view of Lemma 8.33).
260
More generally:
Proposition 8.42. Let M : M z, z 1 be an ellipsoid in R2n
z .
(i) If M is a Wigner ellipsoid W , then M F for every symplectic subspace
(F, |F ) of (R2n
z , ).
(ii) Let (F1 , 1 ), . . . , (Fm , m ) (j = |Fj for j = 1, . . . , m) be symplectic subspaces of (R2n
z , ) such that
(F1 , 1 ) (Fm , m ) = (R2n
z , ).
M is a Wigner ellipsoid if and only if each M Fj is a Wigner ellipsoid in
(Fj , j ) for 1 j m.
Proof. (i) In view of Theorem 8.40 above W contains a quantum blob Q hence
W F contains the set Q F which is a quantum blob in W F by Theorem
8.41. A new use of Theorem 8.40 implies that W F is a Wigner ellipsoid in F.
(ii) The case m = 1 is trivial and it is sucient to give the proof for m = 2, the
general case following by an immediate induction on the number of terms in the
direct sum. Writing
(R2n
z , ) = (F1 , 1 ) (F2 , 2 )
with dim F1 = 2n1 and dim F2 = 2n2 , we thus assume that W F1 and W F2
are Wigner ellipsoids in F1 and F2 respectively. Let us prove that W contains a
quantum blob Q2n , and is thus a Wigner ellipsoid. In view of Theorem 8.40 we
can nd S1 Sp(F1 , 1 ) and S2 Sp(F2 , 2 ) such that
Set now S = S1 S2 and Q2n = S(B 2n ( )). We claim that Q2n W ; this will
prove (ii). Let z Q2n , that is |Sz|2 ; writing z = z1 + z2 with z1 F1 and
z2 F2 we have |S1 z1 |2 + |S2 z2 |2 , hence
z1 Q2n1 W F1 , z2 Q2n2 W F2
so that
z (W F1 ) + (W F2 ) W
and Q
2n
W as claimed.
261
(8.29)
1
2h
262
1
1 2
(x1 + p21 ) +
(x2 + p2j ) 1
Rj2 j
2jn
with Rj2 . Assume that there exists a quantum blob Q2n = S(B 2n (z0 , )
contained in M. It follows from Corollary 8.43 that we must then
have z0 = 0.
2n
2n
LetS, S Sp(n) and z0 Rz be such that S(B ( )) W and
S (B 2n ( ))
W . In view
of Proposition 8.12 we then have S(S )1 U(n)
2n
2n
hence S(B ( )) = S (B ( ).
Let us end this subsection by shortly discussing the notion of joint quantum probability. Some quantum physicists contend that there is no true collective
and P when
(joint) probability density for pairs of conjugate observables, say X
n = 1. It is in fact possible to construct innitely many density probabilities
having arbitrary densities X and P as marginal probability densities, as the
following exercise shows:
Exercise 8.46. Suppose n = 1 and let X and P be dened as above and set
(z) = X (x)P (p)(1 f (u(x), v(p)))
where f is an arbitrary function and
x
X (x )dx , v(p) =
u(x) =
P (p )dp .
263
that minimized the uncertainty relations. Lisiecki [105] gives a very interesting
survey of coherent state representations.
We will use the following classical result about the Fourier transform of a
complex Gaussian:
Let M be a symmetric complex m m matrix such that Re M > 0. We have
1
m/2
1
i
1
1
(8.30)
e u,v e 2 Mv,v dm v = (det M )1/2 e 2 M u,u
2
where (det M )1/2 is given by the formula
1/2
(det M )1/2 = 1
1/2
1/2
,
m
1/2
of
M 1 .
the eigenvalues 1
m
1
Formula (8.30) can be easily proven using the standard Gauss integral
2
ex /2 dx = 2
and diagonalizing of M .
(8.31)
1 n/2
Y X 1
.
X 1
(8.32)
(8.33)
264
1
1 n
e Xx,x
2
1/2
i
1
e p+Y x,y e 4 Xy,y dn y = (2)n/2 det( 12 X)
exp 1 X 1 (p + Y x), p + Y x
and hence
W (z) =
1 n/2
where
Gz, z = (X + Y X 1 )x, x + 2X 1 Y x, p + X 1 p, p;
this proves part (i) of the proposition. The symmetry of G is of course obvious.
Property (ii) immediately follows since
X + Y X 1 Y
X 1 Y
1/2
Y X 1
X
=
X 1
0
and
X 1/2
S=
X 1/2 Y
Y X 1/2
X 1/2
X 1/2
X 1/2 Y
X 1/2
0
X 1/2
obviously is symplectic.
|(x)|2 dn x =
'
det X
()n
(1/2
(8.35)
265
(8.36)
1
1 n Gz,z
e
(8.37)
It follows, as announced in Chapter 6, that the Wigner transform of a Gaussian always is a positive function. It is noticeable that, conversely, if is a function
such that W 0, then must be a Gaussian, of the type (x) = C exp [Q(x)]
where C is a complex number and Q a positive denite quadratic form (this is a
well-known result going back to Hudson [94]).
)
Exercise 8.49. Verify that we have W 0 (z)d2n z = 1 when 0 is the normalized
Gaussian (8.36).
(8.39)
T 1/2
(A + iB),
U + iV = (AA + BB )
T 1/2
Q = (CA + DB )(AA + BB )
(8.40)
.
(8.41)
266
1
1 n Gz,z
e
, G = ST S
(8.42)
2
2
Weyl operator. The functions N (x) = hN (x/ )ex /2 (hN the N th Hermite
function) form a complete orthonormal system and the corresponding eigenvalues
N = N N hence A
0; but
are N = (N + 12 ). Set now a = H 12 ; then A
a is not a positive function.
We will see that this fact is not a weakness of Weyl calculus but rather a
manifestation of the uncertainty principle, which can be alleviated by averaging the
symbol a over a quantum blob in a sense to be dened. Let us however mention that
m
(R2n
if a 0 and belongs to some of the standard pseudo-dierential classes S,
z )
is the sum of a positive operator with
with 0 < 1, then one proves that A
m()
m
2n
positive symbol in S, (Rz ) and of an operator with symbol in S,
(R2n
z )
267
(see Folland [42], Chapter 2, 6; for related results see Feerman and Phong [41]
and the references therein).
The possible non-positivity of the Weyl operator associated to a positive symbol is of course related to the generic non-positivity of the Wigner transform: recall
(formula (6.70) in Proposition 6.45, Chapter 6) that the mathematical expectation
in a (normalized) state is, when dened, given by the formula
of A
= W (z)a(z)d2n z.
A
It has been known since de Bruijn [17] that the average W R over a
2
2
Gaussian R (z) = e|z| /R satises
W R 0 if R2 = , W R > 0 if R2 > ;
(8.43)
this actually follows from a positivity result for convolutions of Wigner transforms:
Proposition 8.53. Let , S(Rnx ). We have W W 0; more precisely
W W = (2) |W (, )|2 0
n
(8.44)
(T (z), )L2 (Rnx ) = e (p,x 2 p,x) (x + x) (x )dn x .
268
Setting x = y 12 x this is
(T(z), )L2 (Rnx ) =
e p,y (y + 12 x) (y 12 x)dn x
Let us extend de Bruijns result (8.43) to arbitrary Gaussians. To any symmetric 2n 2n matrix > 0 we associate the normalized Gaussian dened by
(z) =
1 n
2
(det )1/2 e 2
z,z
A straightforward calculation shows that has integral 1 (and is hence a probability density); computing the Fourier transform of using formula (8.30)
one moreover easily veries that
= +
(8.45)
(i) If there exists S Sp(n) such that M = S(B( )) (and hence c(M ) = 12 h)
then W 0.
(ii) If c(M ) > 12 h, then W > 0.
Using Proposition 8.54 we can prove a positivity result for the average of Weyl
operators with positive symbol. This result is actually no more than a predictable
generalization of calculations that can be found elsewhere (a good summary being
[42]); it however very clearly shows that phase space coarse graining by ellipsoids
with symplectic capacity 12 (and in particular by quantum blobs) eliminates
positivity diculties appearing in Weyl calculus: we are going to see that the
Gaussian average of a symbol a 0 over an ellipsoid with symplectic capacity
12 always leads to a positive operator :
269
Weyl
Corollary 8.55. Assume that a 0. If c(M ) 12 h, then the operator A
a satises
, )L2 (Rn ) 0
(A
x
1 2
(p + m2 2 x2 ) , m > 0, > 0
2m
1 12 (x2 +p2 )
e 2
.
2 2
2
(1 + m 2 ) H(z) + 2 .
2m
Assume that the ellipsoid (x2 + p2 )/2 2 1 is admissible; this is equivalent to the
condition 2 12 and hence H (z) 12 : we have thus recovered the ground
state energy of the one-dimensional harmonic oscillator. We leave it to the reader
to check that we would still get the same result if we had used a more general
Gaussian instead of . We leave it to the reader to generalize this result to a
quadratic Hamiltonian by using Williamsons symplectic diagonalization theorem.
Chapter 9
= H
t
describes the evolution of the matter wave in terms of a self-adjoint operator H
associated to a Hamiltonian function by some quantization rule. For instance,
when H is of the physical type kinetic energy + potential one replaces the
momentum coordinates pj by the dierential operator i/xj and to let the
position coordinates xj stand as they are. Now, if we believe and we have every
reason to do so that quantum mechanics supersedes classical mechanics as being the better theory, we are in trouble from a logical point of view, and risk
inconsistencies. This is because if we view quantum mechanics as the quest for an
algorithm for attaching a self-adjoint operator to a function (or symbol, to use
the terminology of the theory of pseudo-dierential operators) we are at risk of
overlooking several facts (see Mackeys review article [117])). For instance:
It has long been known by scientists working in foundational questions that
the modern formulation of quantum mechanics1 rules out a large number of
classical Hamiltonians;
There are features of quantum mechanics (e.g., spin) which are not preserved
in the classical limit. We therefore postulate, with Mackey [116, 117], that
classical mechanics is only what quantum mechanics looks like under certain limiting conditions, that is, that quantum mechanics is a renement of
1I
272
Hamiltonian mechanics (this is, by the way, the point of view of the deformation quantization of Bayen et al. [7], which is an autonomous theory
originating in Moyals trailblazer [127]).
For all these reasons it is worth trying to construct quantum mechanics ex nihilo
without reference to classical mechanics. This does not mean, of course, that we
will not quantize Hamiltonians (or other observables) when possible, or desirable, but one should understand that dequantization is perhaps, after all, more
important than quantization, which may well be a chimera (as exemplied by wellknown no-go in geometric quantization theories (see for instance Gotay [75] and
Tuynman [166]).
The concept of density operator (or density matrix), as it is commonly
called in physics) comes from statistical physics in dening Gibbs quantum states.
For detailed discussions see Ruelle [138], Khinchin [103], Landau and Lifschitz
[106], Messiah [123]. See Nazaikinskii et al. [128] for an introduction to the topic
from the point of view of asymptotics.
We begin by giving a self-contained account of two essential tools from functional analysis, the HilbertSchmidt and the trace-class operators.
The notion of quantum state represented (up to a complex factor) by a
function is not always convenient, because in practice one is not always certain
what state the system is in. This ignorance adds a new uncertainty of a nonquantum nature to the quantum-mechanical one. This motivates the introduction
of the notion of density operator, which is a powerful tool for describing general
quantum systems. To fully understand and exploit this notion we have to make a
detour and spend some time to study the basic notions of trace-class and Hilbert
Schmidt operators, which besides their interest in quantum mechanics play an
important role in functional analysis.
273
operators on Hilbert spaces. The notion is essential for the denition of the density
operators of quantum mechanics.
Let H be a complex separable Hilbert space and (ej ) an orthonormal system
of vectors in H (not necessarily a basis); we denote by F the closed subspace of
H spanned by the ej . Then the following results hold:
0
For every u H the series j |(u, ej )H |2 is convergent and we have Bessels
inequality
(9.2)
j
j (u, ej )H ej
(u, ej )H ej = PF u;
(9.3)
sup ||Au||H =
||u||H 1
sup ||Au||H
||u||H =1
|(Aei , fj )H | <
(9.4)
i,j
(we are assuming that both bases are indexed by the same sets). The vector space
of all trace-class operators H H is denoted by LTr (H).
Obviously A is of trace class if and only if its adjoint A is.
We are going to prove that if the condition (9.4) characterizing trace-class
operators holds for one pair of orthonormal basis, then it holds for all. This property will allow us to prove that LTr (H) is indeed a vector space, and to dene the
trace of an element of LTr (H) by the formula
(Aei , ei )H |
Tr A =
i
274
|(Aei , fj )H | <
(9.5)
i,j
for all orthonormal bases (ei )i , (fj )j of H with same index set;
(ii) If (ei )i and (fi )i are two orthonormal bases, then the following equality holds:
(Aei , ei )H =
(Afi , fi )H
(9.6)
i
(ei , ej )H ej , f =
(f , fk )H fk
ei =
j
we have
(Aei , f )H =
(9.7)
j,k
|(Aei , f )H |
|(ei , ej )H ||(f , fk )H | |(Aej , fk )H |).
i,
j,k
(9.8)
i,
Using successively the trivial inequality ab 12 (a2 + b2 ) and the Bessel inequality
(9.1) we get
|(ei , ej )H ||(f , fk )H | 12
|(ei , ej )H |2 + 12
|(f , fk )H |2
i
i,
2
1
2 (||ej ||H
||fk ||2H )
= 12 ,
and hence
|(Aei , f )H |
i,
1
2
|(Aej , fk )H |) <
j,k
(Aei , ei )H =
(ei , ej )H (ei , ek )H (Aej , ek )H
j,k
and hence
(Aei , ei )H
j,k
In view of (9.2)
275
9
(ei , ej )H (ei , ek )H
(Aej , ek )H ).
which establishes (9.6); that the series is absolutely convergent follows from (9.5)
with the choice ei = fi .
(iii) That A LTr (H) if C and A LTr (H) is obvious. Let A, B LTr (H);
then
((A + B)ei , ei )H =
(Aei , ei )H +
(Bei , ei )H
i
and each sum on the right-hand side is absolutely convergent, implying that A +
B LTr (H).
Formula (9.6) motivates the following denition:
Denition 9.3. Let A be a trace-class operator in H and (ei )i an orthonormal basis
of H. The sum of the absolutely convergent series
Tr(A) =
(Aei , ei )H
(9.9)
i
(which does not depend on the choice of (ei )i ) is called the trace of the operator A.
The following properties of the trace are obvious consequences of (9.9):
Tr(A + B) = Tr(A) + Tr(B) , Tr(A ) = Tr(A).
(9.10)
276
(ABei , ei )H =
(Bei , ej )H (Aei , ej )H
(9.11)
i,j
and hence
(ABei , ei )H
|(Bei , ej )H |
|(Aei , ej )H | <
i
i,j
i,j
so that AB is indeed of trace class in view of Lemma 9.2 (i). Formula (9.11) implies
that
(ABei , ei )H =
(BAei , ei )H
(9.12)
i
(Bei , Aei )H ,
(9.13)
277
hence, noting that Re Tr(A B) | Tr(A B)| and using the CauchySchwarz inequality,
1/2
1/2
(Bei , Bei )H (Aei , Aei )H ,
Re Tr(A B)
i
that is
Re Tr(A B)
1/2
=
(uj , ei )H (uj , ek )H (A Aek , ei )H .
i,k
(A Aek , ei )H ;
i,k
278
j Pj
(9.14)
j dim Hj
(9.15)
j dim Hj <
j
holds.
Proof. (i) A is compact, so it veries the properties listed before the statement
of the proposition. Choose an orthonormal basis (eij )i in each eigenspace Hj and
complete the union i (eij )i of these bases into a full orthonormal basis of H by
selecting orthonormal vectors (fi )i in Ker A such that (fi , ejk )H = 0 for all j, k
279
Au =
(Au, fi )H fi +
(Au, eij )H eij .
i
i,j
9
(u, eij )H eij
Tr(A) =
(Afi , fi )H +
(Aeij , eij )H ;
i
i,j
Tr(A) =
j (eij , eij )H =
j
(eij , eij )H ,
i,j
(9.16)
hence (9.15) since the sum indexed by i is equal to the dimension of the eigenspace Hj .
(iii) Any operator A that can be written in the form (9.14) is self-adjoint because
each projection is self-adjoint (notice that it is also compact because the spaces
H
0j are nite-dimensional); moreover the operator A is positive and the condition
j j dim Hj < is precisely equivalent to A being of trace class in view of the
second equality (9.16).
280
||Aej ||2H =
||Afj ||2H <
j
(9.17)
||A||2HS =
||Aei ||2H
(9.18)
i
||Aei ||2H =
|(Aej , fj )2H
j
||Aei ||2H =
|(Aej , fj )2H =
|(ej , A fj )2H ,
i
i,j
i,j
||Aei ||2H =
||A fi ||2H .
281
||Afi ||2H =
||A fi ||2H
i
and hence
(9.19)
||Aei ||2H =
||Afi ||2H
as claimed. Property
(ii) Immediately follows from (9.19).
(iii) If A and B are HilbertSchmidt operators then A is trivially a Hilbert
Schmidt operator and ||A||HS = ||||A||HS for every C; on the other hand,
by the triangle inequality
||Aej ||2H =
||Aej ||2H <
jJ
jJ\N
282
(ii) Assume that the set {Aej : j J} is nite: then A is of nite rank and there is
nothing to prove in view of (i). If that set is innite, it is no restriction, replacing
if necessary J by a smaller set to assume that all the Aej are distinct, and that
J = N. Let Ak be the linear operator dened by Ak ej = Aej for j k and
Ak ej = 0 for all other indices j. We have limk Ak = A and each Ak is of nite
rank.
Property (iii) follows because operators of nite rank are compact, and the limit
of a sequence of compact operators is compact.
A particularly interesting situation occurs when the trace-class operator is
self-adjoint (this will be the case for the density matrices we will study in the next
section).
Proposition 9.13. Every self-adjoint trace-class operator in a Hilbert space H is a
HilbertSchmidt operator (and is hence compact).
Proof. Let (ei )i be an orthonormal basis of H; we have
||Aei ||2H = (Aei , Aei )H = (A2 ei , ei )H
and in view of Proposition 9.5 the operator A2 is of trace class. It follows that
||Aei ||2H =
(A2 ei , ei )H <
i
It turns out that, more generally, every trace-class operator can be obtained
by composing two Hilbert-Schmidt operators; since we will not use this fact we
propose its proof as an exercise:
Exercise 9.14. Show that every trace-class operator is the product of two Hilbert
Schmidt operators, and is hence compact. [Hint: use an adequate polar decomposition A = U (A A)1/2 .]
283
K(x, y)(y)dn y dn x
2
n
KA (x, y)(y)d y
|KA (x, y)| d y
2 n
|(y)|2 dn y
(9.21)
2 n
|KA (x, x )| d x
2 n
|KB (x , y)| d x
and hence
2 n
n
2 n
|KA (x, x )| d x
|KAB (x, y)| d xd y
2 n
|KAB (x, y)|2 dn xdn y
|KA (x, x )|2 dn x dn x
284
that is
||KB ||L2 (R2n
< .
|KAB (x, y)|2 dn xdn y = ||KA ||L2 (R2n
x,y )
x,y )
Let us next show that KAB (x, y) indeed is the kernel of AB. We have, for
L2 (Rnx ),
AB(x) = KA (x, x )B(x )dn x
= KA (x, x )
KB (x , y)(y)dn y dn x .
The result will follow if we prove that we can change the order of integration, that
is:
n
KB (x , y)(y)d y dn x
KA (x, x )
n
=
KA (x, x )KB (x , y)d x (y)dn y.
For this it suces to show that the function
F (x) =
|KA (x, x )KB (x , y)(y)|dn x dn y
is bounded for almost every x. Now, using again CauchySchwarzs inequality,
F (x) =
|KA (x, x )KB (x , y)|dn x |(y)|dn y
2 n
2 n
n
|(y)| d y
2 n
d y
n
d y
1/2
1/2
1/2
||||L2 (Rnx ) .
285
|KA (x, x)|dn x < ;
Tr(A) =
KA (x, x)dn x.
This statement is false without more stringent conditions on the kernel KA . (See
however the interesting discussion in 8.4.3 of Dubin et al. [33].) Here is one a
rigorous result; for the state of the art see Wongs review paper [182]. Additional
information can be found in Gr
ochenigs book [78] and paper [79]; also see Toft
[160]; an interesting precursor is Pool [133]. A very nice treatment of operators of
trace class in general spaces L2 (X, ) is Duistermaats contribution in [35].
Recall from standard distribution theory that the Sobolev space H s (R2n
z )
(s R) consists of all a S (R2n
z ) such that the Fourier transform F a is a
function satisfying
|F a(z)|2 (1 + |z|2 )s d2n z < .
Theorem 9.17. Let a be a function R2n
z C.
Weyl
a is of trace class and a L1 (R2n
(i) If A
z ), then
1
n
a(z)d2n z.
Tr(A) = 2
(9.23)
Weyl
(ii) If a is such that a and F a both are in H s (R2n
z ) with s > 2n, then A a
is of trace class.
s
2n
Notice that since S(R2n
z ) H (Rz ) for all s R, Weyl operators with
2n
symbols a S(Rz ) are automatically of trace class; since such a symbol a trivially
is in L1 (R2n
z ) the trace of such an operator is indeed given by formula (9.23).
Let us give an independent proof of this property when the kernel of the
operator is rapidly decreasing (this is the case for instance for Gaussian states):
286
Proof. The inequality KA (x, x) 0 trivially follows from KA (x, y) = KA (y, x),
and this condition is equivalent to A = A . Let (j )j be an orthonormal basis of
L2 (Rnx ); then (j k )j,k is an orthonormal basis of L2 (R2n
z ). Expanding KA in
a Fourier series in that basis we have
with
.
cj,k = (KA , j k )L2 (R2n
x,y )
It follows that
n
KA (x, x)d x =
j (x) k (x)dn x =
cj,k
cjj .
j,k
(Aj , j )L2 (Rnx ) =
KA (x, x)dn x
as claimed.
Self-adjoint trace-class operators are, as we have seen, HilbertSchmidt operators with L2 kernels; they are thus also Weyl operators with L2 symbols (Theorem
9.21). The rst part of the following result which expresses the trace in terms of
the symbol is a consequence of Proposition 9.18.
Weyl
Weyl
a and B
b be self-adjoint Weyl operators of trace
Proposition 9.19. Let A
class.
(i) We have
=
Tr A
1 n
2
a(z)d2n z = a (0)
(9.25)
A);
B
is given by the formula
A
B)
= 1 n a(z)b(z)d2n z.
Tr(A
2
(9.26)
in
Proof. (i) The second equality (9.25) is obvious since a = F a. Writing A
pseudo-dierential form
1
n
i
A(x) = 2
e p,xy a( 12 (x + y), p)(y)dn ydn p,
287
is
the kernel KA of A
KA (x, y) =
1 n
2
1 n
2
a(x, p)dn pdn x
which is the rst equality (9.25); the second follows in view of the denition of the
symplectic Fourier transform F a = a .
(ii) We have
B)
=
Tr(A
1 n
2
c(z)d2n z
=A
B;
in view of formula (6.43) (Subsection 6.3.2
where c(z) is the symbol of C
of Chapter 6) we have
1
2n
i
c(z) = 4
e 2 (z ,z ) a(z + 12 z )b(z 12 z )d2n z d2n z .
Performing the change of variables u = z + 12 z , v = z 12 z we have d2n z d2n z =
42n d2n ud2n v and hence
1
2n
i
c(z) =
e 2 (uz,vz) a(u)b(v)d2n ud2n v.
Integrating c(z) yields
1
2n
2i
2n
c(z)d z =
e (uz,vz) a(u)b(v)d2n ud2n vd2n z,
that is, expanding (u z, v z),
1
2n
2i
2i
2n
(z,uv) 2n
d z e (u,v) a(u)b(v)d2n ud2n v.
e
c(z)d z =
Now
e
and hence
2i
(z,uv)
2n
d z=
c(z)d2n z =
=
formula (9.26) follows.
2i
2i
288
=
jk j k
j,k
we have
(KA , )L2 (R2n
=
x,y )
k
x,y
j,k
jk
j,k
j,k
Using CauchySchwarzs inequality and recalling that ||j ||L2 (Rnx ) = 1 we get
|(KA , )L2 (R2n
|2
x,y )
j,k
289
that is
|2 ||||2L2 (R2n
|(KA , )L2 (R2n
x,y )
x,y )
noting that
|jk |2
j,k
0
k
KB (x, y)j (y)dn y j (x)dn x
We are next going to show that the space of integral HilbertSchmidt operators on L2 (Rnx ) is precisely the set of Weyl operators with symbol a L2 (R2n
z )
(Stein [158], Ch. XII, 4; Wong [181, 182]). This is indeed an important result,
since it will allow us a precise description of the density matrices of quantum states
in Section 9.3 using the methods of Weyl calculus.
290
Weyl
induces an isomorphism beTheorem 9.21. The Weyl correspondence a A
2
2n
tween L (Rz ) and the space of HilbertSchmidt operators on L2 (Rnx ):
2
n
(i) If a L2 (R2n
z ), then A is a HilbertSchmidt operator on L (Rx ).
Weyl
(ii) If conversely the kernel of a Weyl operator A
a is in L2 (R2n
x,y ), then
2
2n
a L (Rz ).
(iii) Let A be an arbitrary integral HilbertSchmidt operator on L2 (Rnx ). There
Weyl
exists a L2 (R2n
z ) such that A = A a and we have
||a||2L2 (R2n
= (2)n ||KA ||2L2 (R2n
.
z )
x,y )
(9.27)
Proof. The statement (iii) follows from (i) and (ii) in view of Theorem 9.20.
(i) and (ii). We have seen (formula (6.26) in the proof of Theorem 6.12, Chapter
and its symbol a are related by
6) that the kernel KA of the Weyl operator A
KA (x, y) =
1 n
2
e p,y KA (x + 12 y, x 12 y)dn y.
= (2)
= (2)n
291
(9.29)
(9.30)
292
(9.31)
We are going to see that the pure density operator is in this case a Weyl
operator. We have in fact already encountered its symbol in Chapter 6: it is
the Wigner transform of the function ! Let us prove this essential property:
Proposition 9.24. Let be the density operator associated to a pure state by
(9.30).
(i) The Weyl symbol of is the Wigner transform W of and thus
1
n / i p,xy
W ( 12 (x + y), p)(y)dn ydn p,
(9.32)
(x) = 2
e
that is
=
1 n
2
W (z)T(z)d2n z
(9.33)
(9.34)
= (A,
)L2 (Rn ) the mathematical expectation of A
in the state ).
(A
x
Proof. (i) We have
(x) = (, )L2 (Rnx ) (x) =
(y)(x)(y)dn y,
293
has a
Exercise 9.25. Check directly part (i) of Proposition 9.24 assuming that A
discrete spectrum (j )j and corresponding orthonormal eigenvector basis (j )jN .
Exercise 9.26. Look after formula (9.34) in any classical book on quantum mechanics and analyze in detail the way it is derived.
We have so far been assuming that the quantum system under consideration
was in a well-known state characterized by a function . Unfortunately things are
not always that simple in the quantum world. Suppose for instance that we have
the choice between a nite or innite number of states, described by functions
1 , 2 , . . . , each j having a probability j to be the true description. We can
then form a weighted mixture of the j by forming the convex sum
=
j j ,
j=1
j = 1 , j 0.
(9.35)
j=1
j j
(9.36)
=
j=1
294
Here is a very important result that describes all density matrices in a Hilbert
space H.
Theorem 9.28. Let be an operator on a Hilbert space H.
(i) is a density operator if and only if there exists a (nite or innite) sequence
(j ) of positive numbers and nite-dimensional pairwise orthogonal subspaces
Hj of H such that
j Hj and
j dim Hj = 1
(9.37)
=
j
(9.38)
Proof. The statement (i) is just Proposition 9.8, taking into account that the
orthogonal projections j are self-adjoint.
(ii) Since the spaces
we have Hj Hk = 0 if j = k
0 Hj are pairwise orthogonal
0
and hence 2 = j 2j Hj . The condition j j dim Hj = 1 implies that we must
have j 1 for each j so that
2j dim Hj
j dim Hj = 1.
Tr 2 =
j
2j dim Hj =
j dim Hj = 1.
j
Since dim Hj > 0 for every j, this equality is only possible if the numbers j
are
0 either zero or 1; since the case j = 0 is excluded it follows that the sum
=
j j dim Hj = 1 reduces to one single term, say j0 dim Hj0 = 1 so that
j0 Pj0 and 2 = 2j0 Hj0 . The equality Tr(
) = Tr(
2 ) = 1 can hold if and only if
j0 = 1, hence dim Hj0 = 1 and is a projection of rank 1, and hence a pure-state
density operator.
Specializing to the case where H = L2 (Rnx ) we get:
Corollary 9.29. An operator : L2 (Rnx ) L2 (Rnx ) is a density operator if and
only if there0
exists a family (j )jJ in L2 (Rnx ), a sequence (j )jJ of numbers
j 0 with
j = 1 such that the Weyl symbol of is given by
jJ
jJ
j W j .
(9.39)
295
Proof. Assume that the Weyl symbol of is given by (9.39); in view of Proposition
9.24 and the discussion preceding it we have
j j
=
jJ
=
j Hj ,
mj j = 1
j
m1
W j + 2
m1 +m
2 +1
j=1
W j +
j=m1
Part (iii) of Theorem 9.28 above motivates the denition of the notion of
purity of a quantum state which plays an important role in quantum optics and
the theory of squeezed states:
Denition 9.30. Let be a density operator on H; the number (
) = Tr(
2 ) is
called the purity of the quantum state represents.
The purity of a state satises the double inequality 0 (
) 1. We will
come back to it in the next subsection when we study Gaussian states.
Exercise 9.31. Show that a quantum state is pure if and only if its purity equals 1.
Recalling (formula (9.31) that the operator kernel of the density operator of
a pure state is just the tensor product , we have more generally:
Corollary 9.32. Let the density operator be given by formula (9.37) and let
(jk )j,k be a double-indexed family of orthonormal vectors in L2 (Rnx ) such that
the subfamily (jk )k is a basis of Hj for each j.
(i) The kernel K of is given by
j jk (x) jk (y)
K(x, y) =
with
0
j
j = 1.
(9.40)
j,k
j W jk (z)
j,k
(9.41)
296
j =
j (, jk )H jk
j,k
=
j =
j jk (x)(y)jk (y)dn y
j
j,k
which is (9.40).
(ii) Formula (9.41) for the symbol immediately follows from (9.40) in view of
Proposition 9.24(i).
j j ,
j=1
j = 1 , j 0
(9.42)
j=1
= H.
i
t
t ) the evolution operator determined by that equation: (x, t) =
We denote by (U
Ut (x, 0); this operator is unitary and satises
i
d
= U
U
t , i d U
H.
Ut = H
t
dt
dt t
(9.43)
Proposition 9.33. Let be the density operator at time t = 0 of the mixed state
(9.42).
(i) At time t this density operator is given by the formula
t U
t ;
t = U
(ii) The mapping t t satises the operator equation
i
d
t ].
t = [H,
dt
(9.44)
297
t U
in t now yields, using formulae (9.43),
Dierentiating t = U
t
i
d
d
t = i (U
Ut )
t
dt
dt
U
t U
t + U
t (U
t H)
=H
t t H
=H
Remark 9.34. The evolution equation (9.44) should not be confounded with the
Heisenberg equation of elementary quantum mechanics2 ; it is actually the quantized form of Liouvilles equation
cl
= {H, cl }
t
(9.45)
j j .
=
j=1
Unless the j are all linearly independent (which is generally not the case!), the decomposition above is never unique in opposition to the associated density operator
which is independent of the way the mixture is written.
2 All
298
det X
()n
(1/4
1
exp 2
(X + iY )x, x
1
1 n Gz,z
e
1
WF (z) = 2n det F e F z,z
(9.46)
1 n
2
1 n
2
(det )1/2 e 2
z,z
WF (z)d2n z = 1
so that WF is a priori a good candidate for being the symbol of a density operator.
Dene in fact the Weyl operator ,
1
n
i
(x) = 2
(9.48)
e p,xy WF (z)( 12 (x + y), p)(y)dn ydn p,
that is3
(x) =
299
(9.49)
this operator will be the density operator of some quantum state if the three
following conditions are fullled:
is self-adjoint: = ;
is of trace class and Tr = 1;
is non-negative: 0.
It is clear that = since is real; the operator = is of trace-class
because of Proposition 9.18, which tells us that in addition
1
n
Tr = 2
WF (z)d2n z = 1.
So far, so good. How about the positive semi-deniteness property 0? As
we already have noticed before, in Subsection 8.5.3 of the last chapter, the fact
that the symbol of a Weyl operator is positive does not imply that the operator
itself is positive. The obstruction, as we hinted at, is related to the uncertainty
principle of quantum mechanics: if the Gaussian is too sharply peaked, then
will not be the density of a quantum state. It however follows from Corollary
8.55 of Proposition 8.54 that if we average over a quantum blob, then we
will always obtain the Wigner transform of a mixed state. In fact, the following
precise result holds (cf. de Gosson [72]):
Theorem 9.35. Let Q = S(B 2n ( )), S Sp(n), be a quantum blob and WQ the
associated normalized Gaussian:
WQ (z) =
1
T 1
1 n (SS ) z,z
e
.
300
so there remains to check the positivity of ,Q ; the latter follows from Corollary
8.55 of Proposition 8.54.
(ii) Setting G = (SS T )1 we have
W,Q (z) =
1 2n
1
1
det F e F (zz ),(zz ) e Gz ,z d2n z ;
a(ut)2 bt2
exp
u
e
e
dt =
a+b
a+b
valid for all a, b > 0 together with the fact that the matrix D is diagonal, we nd
n
1
1
W,Q (Sz) =
(det D(I + D)1 )1/2 exp D(I + D)1 z, z ,
that is
Setting
1
W,Q (z) = exp z T (S 1 )T D(I + D)1 S 1 z .
K = (S 1 )T D(I + D)1 S 1
we obtain formula (9.50). There remains to show that the moduli of the eigenvalues
of JK are not superior to 1. Since S Sp(n) we have J(S 1 )T = SJ and hence
JK = SJD(I + D)1 S 1
has the same eigenvalues as JD(I + D)1 . Now,
0
JD(I + D)1 =
(I + )1
(I + )1
0
301
,j
1 + ,j
(9.51)
The result above leads us to ask the following question: which conditions (if
any) should one impose on the covariance matrix in the formula
(z) =
1 n
2
(det )1/2 e 2
z,z
in order that it be the Wigner transform of some quantum state? Writing (9.47)
in the form
1
1
n
W (z) =
det F e F z,z
a partial answer is already given by Proposition 8.47 (Section 8.5, Chapter 8)
which implies that if F is symplectic, then W is the Wigner transform of the pure
Gaussian state
(1/4
'
1
det X
(x) = ()
exp 2
(X + iY )x, x
n
where the symmetric matrices X, Y are determined from F . Let us discuss the
case of more general covariance matrices .
The following result, which is no more than a consequence of Theorem 9.35
above, shows that in a sense the Gaussians which are Wigner transforms of pure
Gaussian states are the lower limit for what is acceptable as a candidate for the
Wigner transform of a quantum state. Recall that the covariance matrix is said
to be quantum mechanically admissible if its symplectic spectrum is such that
Spec () ( 12 , 12 , . . . , 12 )
(9.52)
1 n
2
(det )1/2 e 2
z,z
302
2 (z)d2n z =
1 2n
2
2 (z)d2n z.
(det )1
e
z,z 2n
d z.
2 (z)d2n z =
and hence
(
) =
1 n
4
(det )1/2
n
(det )1/2 .
2
Notice that since det ( 12 )n for an admissible in view of (9.52) we indeed
have (
) 1.
Chapter 10
= H(x, ix )
t
(10.1)
= H( 12 x + ip , 12 p ix )
t
(10.2)
304
in both cases the variables x and p are placed, up to a change of sign, on the same
footing.
305
plausible by using formal analogies: this is what is done in all texts on quantum
mechanics (see for instance Dirac [31], p. 108111). So how did Schr
odinger arrive
at his equation? He actually elaborated on Hamiltons opticalmechanical analogy,
and took several mathematically illegitimate steps applying the HamiltonJacobi
theory (see Jammer [97] or Moore [125] for a thorough discussion of Schr
odingers
argument). Here is the idea, somewhat oversimplied. Schr
odingers starting point
was that a matter wave consists as all waves do of an amplitude and of a
phase. Consider now a system with Hamiltonian H represented by a phase-space
point z(0) at time t = 0. That system evolves with time and is represented by a
point z(t) at time t. Schr
odinger postulated that the phase of the associated
matter wave satised by HamiltonJacobis equation
W
+ H(x, x , t) = 0.
t
(10.3)
(10.4)
1
(z, dz) Hdt
2
(10.5)
306
where the variables x and p play (up to the sign) similar roles. Let us investigate
the quantum-mechanical consequences of this choice. In Section 5.5 of Chapter 5
we considered the Schr
odinger equation
i
=
(
z , z0 ) , (, 0) = 0
t
(10.6)
(
z , z0 ) = ix , x0 p0 , x
was obtained using Weyl correspondence from the translation Hamiltonian
Hz0 (z) = (z, z0 ). We checked that the solution of (10.6) was given by
(x, t) = T(tz0 )0 (x)
where T(z0 ) is the usual HeisenbergWeyl operator; explicitly
i
(1/2)
Hz
is
t
= (z, z0 ).
2
In analogy with the denition of the HeisenbergWeyl operators we can thus dene
an operator Tph (z0 ) acting on functions of z = (x, p) by the formula
i
Tph (z0 )0 (z) = e 2 (z,z0 ) T (z0 )0 (z).
1
= (z, z0 ) i z0 , z ;
t
2
rearranging the terms in the right-hand side of this equation shows that it can be
rewritten formally as
i
= 12 x + ip , 12 p ix ; x0 , p0 .
t
(10.7)
307
(1/2)
= (x + ip , ix ; x0 , p0 )
t
(10.9)
, pj i
.
pj
xj
These have been considered and exploited by Torres-Vega and Frederick [162, 163]
(see our discussion of their phase-space Schrodinger equation in [73]). It turns out,
as we will see, that the symmetrized choice (10.8) is the most convenient for
the determination of which solutions correspond to true quantum states (pure or
mixed), and directly related to the quantum blobs introduced and studied in
Chapter 8.
308
Hz
it
Tph (tz0 )0 (z) = e 2 (z,z0 )t 0 (z tz0 ).
(R2n
z ))
(10.10)
(10.11)
2
(10.12)
i
Tph (z0 ) = e 2 (z,z0 )t T (z0 )
.
|Tph (z0 , t0 )(z)|2 d2n z = ||||2L2 (R2n
z )
(10.13)
309
is that it places, from the very beginning, the variables x and p on the same
footing, and is thus consistent with our program, which is to arrive at a Schr
odinger
equation in phase space having the desired symmetries.
Observe that the operators Tph satisfy the same commutation relation as the
usual WeylHeisenberg operators:
i
Tph (z1 )Tph (z0 ) = e (z0 ,z1 ) Tph (z0 )Tph (z1 )
(10.14)
i
Tph (z0 )Tph (z1 ) = e 2 (z0 ,z1 ) Tph (z0 + z1 ).
(10.15)
We will see in Section 10.2 that the operators Tph (z0 , t0 ) lead to a new irreducible unitary representation of the Heisenberg group Hn on a closed subspace
odinger representation. Let us
of L2 (R2n
z ) which is unitarily equivalent to the Schr
rst briey state and discuss the Stonevon Neumann theorem.
We will not prove this result here, and refer to Wallach [175] (who reproduces
with minor changes a proof by Barry Simon), or to Folland [42] (who reproduces
310
Stones original proof); the reader will nd an alternative proof in the rst chapter
of Lion and Vergne [111].
Stone and von Neumanns theorem is actually a consequence of a much more
general theorem, due to Frobenius and Mackey, the theorem on systems of imprimitivities; see Jauch [99] (Ch. 12, 3) for a detailed discussion of that deep
result and of its consequences for quantum mechanics (Mackey discusses in detail
the genesis of the notion in [117]).
It is important to understand that, contrary to what is sometimes claimed in
the literature on representation theory, Stonevon Neumanns theorem does not
say that the Schr
odinger representation is the only possible irreducible representation of Hn . What it says is that if we succeed in one way or another in constructing
a unitary and irreducible representation T of Hn in some Hilbert space H, then
there will exist an isomorphism U : L2 (Rnx ) H such that
U T(z0 , t0 ) = T (z0 , t0 ) U for all (z0 , t0 ) R2n
z .
We are going to show in the next section that we can in fact construct, using a particular form of wave-packet transform, innitely many unitary and irreducible
representations of Hn , each on a closed space of L2 (R2n
z ). This will automatically
lead us to a Schrodinger equation in phase space.
10.2.1 Denition of U
In what follows will be be a rapidly decreasing function normalized to unity:
S(Rnx ) , ||||2L2 (Rnx ) = 1.
(10.16)
311
we have
U (z) =
1 n/2
2
2n
1 n/2
2
i
2 p,x
(10.19)
Remark 10.4. An interesting (but not at all mandatory!) choice is to take for
the real Gaussian
1
n/4 1 |x|2
e 2 .
(10.20)
(x) =
The corresponding operator U is then (up to an exponential factor) the coherent
state representation familiar to quantum physicists. In [128] Nazaikiinskii et al.
dene an alternative wave-packet transform U by the formula
1
n/2
i
(10.21)
e p,xx (x x )(x )dn x .
U (z) = 2
312
U = e 2 p,x U.
(10.22)
(10.23)
= S.
where Mp (S)
Proof. Recalling the formula
S)(z)
W (S,
= W (, )(S 1 z)
(Proposition 7.14, Chapter 7) we have, by denition of U ,
=
U (S)(z)
=
n/2
2
)( 1 z)
W (S,
2
W (, S1 )( 12 S 1 z)
n/2
= US 1 (S 1 z)
The interest of the Wigner wave-packet transform U comes from the fact
that it is an isometry of L2 (Rnx ) onto a closed subspace H of L2 (R2n
z ) and that
it takes the operators x and ix into the operators x/2 + ip and p/2 ix ,
respectively:
Theorem 10.6. The Wigner wave-packet transform U has the following properties:
(i) U is an isometry: the Parseval formula
(U , U )L2 (R2n
= (, )L2 (Rnx )
z )
(10.24)
xj + i
(10.25a)
U = U (xj ),
2
pj
1
pj i
)
(10.25b)
U = U (i
2
xj
xj
hold for S(Rnx ).
313
(10.26)
xj
xj + i
, xj pj i
;
2
pj
2
xj
observe that these rules are independent of the choice of and are in this sense
intrinsic.
Exercise 10.7. Show that the adjoint U of the Wigner wave-packet transform U
is given by the formula
1
n/2
i
U (x) = 2
e 2 p,xx (x + x , p)(x )dn pdn x
314
or, equivalently,
U (x)
1 n/2
2
2 |z| (z) = 0
e
i
(10.27)
xj
pj
hold for 1 j n.
(ii) For every the range of the Wigner wave-packet transform U is isometric
to H .
i
i
xj
pj
(the proof of this property, based on the Weierstrass theorem, is rather long and
technical and is therefore omitted here). That the range of U is characterized
by (10.27) follows by an immediate calculation that is left to the reader.
(ii) If U1 and U2 are two Wigner wave-packet transforms corresponding to the
choices 1 , 2 then U2 U1 is an isometry of H1 onto H2 .
The result above leads us to address the following related question: given
2
n
L2 (R2n
z ), can we nd and in L (Rx ) such that = U ? We are going
to see that the answer is in general no. Intuitively speaking the reason is the
following: if is too concentrated in phase space, it cannot correspond via the
odinger equation,
inverse transform U1 = U to a solution of the standard Schr
because the uncertainty principle would be violated. Let us make this precise when
the function is a Gaussian. We rst make the following obvious remark: in view
of condition (10.27) every Gaussian
1
0 (z) = Ce 2 |z|
, CC
315
G (z) = e 2 Gz,z. .
(10.28)
(10.29)
2
2 n/2
e Gz,z .
316
and S
are Gaussians of the type
It follows that both S
2
1
1
2
|x|2
S(x)
= (0)e 2 |x| , S(x)
= S(0)e
;
(x) = ()
317
is given by
W (z) = e Gz,z
where G is the positive denite symplectic matrix
X + Y X 1 Y Y X 1
G=
.
X 1 Y
X 1
Conversely, every such S Sp(n) can be put in the form above, which ends the
proof of (ii) since the datum of W determines up to a complex factor with
modulus one.
(10.30)
ph :
(ii) The following intertwining formula holds for every operator A
ph U = U A.
A
(10.31)
(10.32)
318
1 n/2
2
W (z z0 ) =
where we have set x = x + x0 . The overall exponential in Tph (z0 )U (z) is thus
i
( p0 , x0 + p, x 2 p, x + 2 p0 , x ) .
u1 = exp
2
Similarly,
U (T(z0 ))(z) =
i
1 n/2 2
e p,x
2
i
e p,x (x
i
1
x )e (p0 ,x 2 p0 ,x0 ) (x x0 )dn x
319
1 n
2
enjoy the same property which makes the main appeal of ordinary Weyl operators,
namely that they are self-adjoint if and only if their symbols are real. This is part
of the following result where we also investigate the relation between the spectra
of usual and phase-space Weyl operators:
be the operators associated to the Weyl symbol a.
ph and A
Proposition 10.12. Let A
ph is symmetric if and only if
(i) The operator A
a = a.
is also an eigenvalue of
(ii) Every eigenvalue of A
true).
1 n
2
e 2 (z,z ) a (z z ).
1 n
2
e 2 (z,z ) a (z z),
320
ph to be an
Notice that there is no reason for an arbitrary eigenvalue of A
this is only the case if the corresponding eigenvector belongs to
eigenvalue of A;
the range of a Wigner wave-packet transform.
Let us next establish a composition result:
ph
Proposition 10.13. Let a and b be the twisted symbols of the Weyl operators A
ph is the same as that of A
ph . The twisted symbol c of the compose A
ph B
B,
and B
that is
1
n
i
e 2 (z,z ) a (z z )b (z )d2n z.
c (z) = 2
Proof. By repeated use of (10.31) we have
ph )U = A
ph B
ph (B
ph U )
(A
ph U B
=A
B),
= U (A
ph = (A
B)
ph ; the twisted symbol of A
B
is precisely c (Theorem 6.30,
ph B
hence A
Chapter 6, Subsection 6.3.2).
In Theorem 9.21 of Chapter 9 (Subsection 9.2) we showed that the class
of Weyl operators with L2 symbols was identical to the class of HilbertSchmidt
operators. Not very surprisingly this identication carries over to the case of phasespace Weyl operators:
Proposition 10.14. Let S(R2n
z ) and U the corresponding Wigner wave-packet
ph induces an isomorphism between
transform. The correspondence a A
2
n
L (Rx ) and the space of HilbertSchmidt operators on the range H of U .
Proof. This is an immediate consequence of the aforementioned Theorem 9.21
since U is an unitary isomorphism L2 (Rnx ) H .
321
We leave it to the reader to restate the continuity properties proven in Chapter 6, Subsection 6.3.1 for Weyl operators in terms of the corresponding phasespace operators.
One of the most agreeable features of standard Weyl calculus is its covariance
under symplectic transformation of the symbols. In the next subsection we examine
the symplectic covariance of the phase-space calculus.
1
J(SW + I)(SW I)1
2
322
We have:
(S)ph
Lemma 10.16. Let S Sp(n) be such that det(S I) = 0. The operator R
can be written as
n
1
i
i
(S)ph =
R
e 2 (Sz,z) Tph ((S I)z)d2n z (10.34)
2
| det(S I)|
or, equivalently,
(S)ph =
R
1
2
n
i | det(S I)| Tph (Sz)Tph (z)d2n z.
(10.35)
Proof. It is, mutatis mutandis, the same as the proof of Lemma 7.32 in Section
(S).
7.4 of Chapter 7 for the Weyl operators R
We are going to show in a simple way that the well-known metaplectic
covariance relation
S
(10.36)
A
S = S1 A
for standard Weyl operators (Theorem 7.13, Subsection 7.1.3 of Chapter 7) exph , provided one replaces Mp(n) with
tends to the phase-space Weyl operators A
Mpph (n).
Proposition 10.17. Let S be a symplectic matrix and Sph any of the two operators
in Mpph (n) associated with S. The following phase-space metaplectic covariance
formulae hold:
1
1
Sph Tph (z0 )Sph
Aph Sph .
= Tph (Sz) , A
S ph = Sph
(10.37)
Proof. To prove the rst formula (10.37) it is sucient to assume that Sph =
(S)ph with det(S I) = 0 since these operators generate Mp (n). Let us thus
R
ph
prove that
Tph (Sz0 )Sph = Sph Tph (z0 ) if det(S I) = 0
(10.38)
where, in view of (10.35) the operator Sph is the Bochner integral
Sph = CS Tph (Sz)Tph(z)d2n z
with
CS =
1 n (S)
i
| det(S
2
We have
Tph (Sz0 )Sph = CS
and, similarly
Sph Tph (z0 ) = CS
I)|.
323
Since the constant CS does not play any special role in the argument we set
A(z0 ) = Tph (Sz0 )Tph (Sz)Tph (z)d2n z,
B(z0 ) = Tph (Sz)Tph (z)Tph(z0 )d2n z.
We have, by repeated use of formula (10.15),
i
A(z0 ) = e 2 1 (z,z0 ) Tph (Sz0 + (S I)z)d2n z,
i
B(z0 ) = e 2 2 (z,z0 ) Tph (z0 + (S I)z)d2n z
where the phases 1 and 2 are given by
1 (z, z0 ) = (z0 , z) (S(z + z0 ), z),
2 (z, z0 ) = (Sz, z) + ((S I)z, z0 ).
Performing the change of variables z = z + z0 in the integral dening A(z0 ) we
get
i
A(z0 ) = e 2 1 (z z0 ,z0 ) Tph (z0 + (S I)z )d2n z
and we have
1 (z z0 , z0 ) = (z0 , z z0 ) (Sz , z z0 )
= ((S I)z , z0 ) (Sz , z )
= 2 (z , z0 ),
hence A(z0 ) = B(z0 ) proving (10.38). The second formula (10.37) easily follows
from the rst: noting that the symplectic Fourier transform satises
1
n
i
F [A S](z) = 2
e (z0 ,z ) A(Sz )d2n z
1
n
i
= 2
e (Sz0 ,z ) A(z )d2n z
= F A(Sz),
we have
A
S ph =
=
=
which concludes the proof.
1 n
2
1 n
2
1 n
2
F A(Sz)Tph(z)d2n z
F A(z)Tph (S 1 z)d2n z
1
Tph (z)Sph d2n z
F A(z)Sph
324
10.4 Schr
odinger Equation in Phase Space
We now have all the material we need to derive the phase-space Schrodinger equation
ph
i
(10.39)
=H
t
formally written in the beginning of this chapter as
i
= H( 12 x + ip , 12 x ix );
t
ph
the operator on the right-hand side is actually the phase-space Weyl operator H
with symbol the Hamiltonian H.
= H,
t
ph .
=H
t
(10.40)
=H
ph (U ) = H
ph ,
= U (H)
t
hence (i).
Statement (ii) follows.
The result above raises some interesting physical questions: since the solutions of the phase-space Schr
odinger equation (10.40) exist independently of
the choice of any isometry U , what is the relation between the corresponding
conguration-space wave-functions = U and = U ?
10.4. Schr
odinger Equation in Phase Space
325
that is
= (1 , 2 )L2 (Rnx ) , = (1 , 2 )L2 (Rnx ) .
||||2L2 (R2n
z )
G (z) = e 2 Gz,z.
which belong to the range H of some Wigner wave-packet transform U are those
for which we have G Sp(n).
326
= H
t
, (, 0) = 0
(10.41)
ph ,
=H
t
Assume now that the symplectic matrix St is free and det(St I) = 0; then
(St )ph (z, 0)
(z, t) = R
(10.42)
where
(St )ph =
R
1
2
n/2
im(t)Inert WS (t)
| det(St I)|
2
(p + x2 ).
2
ph = +
H
i
p
+ (p2 + x2 ).
2
x2
p2
2
x
p
8
10.4. Schr
odinger Equation in Phase Space
327
1
((x2 + x2 ) cos t 2xx ).
2 sin t
i(t)
2 2 sin( t
2 )
1/2
i 2
t
2
(x + p0 ) cot( ) Tph (z0 )(z, 0)d2 z0 ,
exp
4 0
2
ph
=H
t
ph is a symmetric operator.
where H
Let be in L2 (Rnx ); if is normalized, then so is = U in view of the
Parseval formula (10.24):
||||L2 (Rnx ) = 1 ||||L2 (R2n
= 1.
z )
It follows that ||2 is a probability density in phase space. That this property is
conserved during the time-evolution is straightforward:
328
ph )
ph ) (H
= (H
t
:
ph = H
and hence, since H
ph
%
1 $
2
2n
(Hph , )L2 (R2n
(
H
,
)
(z, t)d2n z =
ph
L (Rz ) = 0
z )
t
i
)
)
so that (z,t)d2n z is constant; taking t = 0 this constant is precisely (z,0)d2n z.
Exercise 10.21. Give another proof of the proposition above when is in the range
of the wave-packet transform.
It turns out that by an appropriate choice of the marginal probabilities can
be chosen arbitrarily close to ||2 and |F |2 :
Theorem 10.22. Let L2 (Rnx ) and set = U .
(i) We have
(10.43)
(10.44)
(10.45)
10.4. Schr
odinger Equation in Phase Space
329
it follows that
2 n
|(z)| d p =
(x x )|(x x )|2 |(x )|2 dn x dn x
n
=
(x x )d x |(x x )|2 |(x )|2 dn x
= |(x x )|2 |(x )|2 dn x ,
hence formula (10.43). To prove (10.44) we note that in view of the metaplectic
covariance formula (10.23) for the wave-packet transform we have
UJ
(J)(x, p) = U (p, x)
where J = in/2 F is the metaplectic Fourier transform. It follows that
UF (F )(x, p) = in U (p, x)
and hence changing (p, x) into (x, p):
U (x, p) = in UF (F )(p, x).
And hence, using (10.43),
2 n
|(x, p)| d x = |UF (F )(p, x)|2 dn x
= |UF (F )(p, x)|2 dn x = (|F |2 |F |2 )(p)
which concludes the proof of (10.44). To prove (10.45) it suces to note that, in
view of the intertwining formula (10.31) and the fact that U = U1 , we have
ph U , U )L2 (R2n )
(Aph , )L2 (R2n
= (A
z )
z
proving (10.45).
The result above shows that the marginal probabilities of ||2 are just the traditional position and momentum probability densities ||2 and |F |2 smoothed
out by convoluting them with ||2 and |F |2 respectively.
Let us investigate the limit 0. Choose for the Gaussian (10.20):
(x) = (x) =
1
|x|2
1 n/4 2
e
.
330
Thus, in the limit 0 the square of the modulus of the phase-space wavefunction
becomes a true joint probability density for the probability densities ||2 and
|F |2 .
Let us now return to the notion of density operator which was discussed
in detail in Chapter 9. We showed in particular that an operator on L2 (Rnx )
was a density operator if and only if its Weyl symbol is a convex sum of Wigner
transforms of functions j L2 (Rnx ):
=
j W j , j 0 ,
j = 1.
j
To we can associate the phase-space operator ph ; notice that ph is automatically self-adjoint since is (Proposition 10.12(i)). The following result is almost
obvious; we nevertheless give its proof in detail.
Proposition 10.23. Let be a density operator on L2 (Rnx ). The restriction of ph
to any of the subspaces H = Range(
), S(Rnx ), is a density operator on H .
Proof. Since ph is self-adjoint there remains to prove that ph is non-negative
and that ph is of trace-class with T r(
ph ) = 1. Since we have ph = U U
for every U we have for such a ,
= (U U , )L2 (R2n
(
ph , )L2 (R2n
z )
z )
= (
U , U )L2 (Rnx )
and hence (
ph , )L2 (R2n
0 since (
, )L2 (Rnx ) 0 for all L2 (Rnx ); ph is
z )
thus a non-negative operator. Let us show that ph is of trace class and has trace
1. Choose an orthonormal basis (j )j of H ; then (j )j with j = U j is an
orthonormal basis of L2 (Rnx ) (because U is an isometry of L2 (Rnx ) onto H ). We
have
= (
U j , U j )L2 (Rnx )
(
ph j , j )L2 (R2n
z )
= (
j , j )L2 (Rnx )
and hence
(
ph j , j )L2 (R2n
=
z )
(
j , j )L2 (Rnx ) = 1
10.5. Conclusion
331
10.5 Conclusion
We have sketched in this last section a theory of Schr
odinger equation in symplectic
phase space which is consistent with the Stonevon Neumann theorem; by the
properties of the wave-packet transform, the theory of the ordinary Schr
odinger
equation on conguration space becomes a particular case of our constructions.
One must however be aware of the fact that the quantization scheme
Weyl
H
ph
H H
we have been using is not the only possible. While our choice was dictated by
considerations of maximal symplectic covariance, there are, however, many other
possibilities. For instance, while it is accepted by a majority of mathematicians
Weyl
is the most natural in standard
and physicists that Weyl quantization H H
quantum mechanics, there are other ways to dene the quantized Hamiltonian;
see for instance [128] for a discussion of some of these. Secondly, the phase-space
quantization scheme
j,ph = 1 xj + i , pj Pj,ph 1 pj i
xj X
2
pj
2
xj
H
ph can be
we have been using, and which leads to the correspondence H
replaced by the more general scheme
= xj + i , pj P = pj i
xj X
j,ph
j,ph
pj
xj
where , , , are any real numbers such that = 1, and this without
altering the position-momentum commutation relations; in fact one veries by an
immediate calculation that
$
%
, P = i
X
j,ph
j,ph
H
ph we have been using correfor all such choices. For instance the rule H
1
sponds to the case = = 2 , = 1, = 1, while the choice = = 1, = 0,
= 1 leads the the quantization rules
xj xj i p j , pj i x
j
which have been considered by some physicists (see e.g. [162, 163]). Notice that
this choice is in a sense very natural, because, as is easily veried, it corresponds to
extending the HeisenbergWeyl operators to phase space functions by the formula
1
i
T(z0 ) = e (p0 ,x 2 p0 ,x0 ) T (z0 )
332
There are furthermore several topics we have not discussed in this section
because of time and space. For instance, what is the physical meaning of functions
L2 (R2n
z ) that do not belong to the range of any wave-packet transform? These
functions are certainly (at least when they are not orthogonal to the range of every
U ) related to the density matrix, but in which way? It would be interesting to
give a precise correspondence between these objects. Another topic which we have
not mentioned is that of the semi-classical approximations to the solutions of the
phase space Schr
odinger equation
i
ph .
=H
t
I have in particular in mind the nearby orbit method (see Littlejohn [112] for a
nice description in the context of semi-classical analysis). The idea is the following: to each z0 R2n
z one associates the quadratic (inhomogeneous) Hamiltonian
function dened by
Hz0 (z, t) = H(zt , t) + H (zt , t), z zt +
1
H (zt , t)(z zt ), z zt
2
, (, 0) = z0
= H
t
(see Littlejohn [112] for a discussion and properties; very precise estimates are
given in Combescure and Robert [24]). It would certainly be interesting (and
perhaps not very dicult) to extend these results to the phase-space Schrodinger
equation.
Appendix A
334
(A.1)
k0
(1)k+1
k=0
k N k
k
(A.2)
M = eL ,L = (log )I + log(I + 1 N )
M = eL eL = eL eL .
Now, conjugating the series (A.2) we get
=
L
k0
(1)k+1
k=0
k N
k
k
335
1
[X, Y ]
2
and Cj (X, Y ) is a linear combination of commutators of higher order; for instance
C1 (X, Y ) = X + Y , C2 (X, Y ) =
1
1
[[X, Y ], Y ] [[X, Y ], X],
12
12
1
C4 (X, Y ) = [Y, [X, [X, Y ]]] [X, [Y, [X, Y ]]].
48
The following immediate consequence of the CampbellHausdor formula is useful
when dealing with the Heisenberg group:
C3 (X, Y ) =
Corollary A.5. Under the conditions in the theorem above assume that all commutators of order superior to 2 are equal to zero. Then
exp X exp Y = exp(X + Y + 12 [X, Y ]).
336
M (, ) =
(x)(x)1 dx.
(A.4)
The invertibility of M (, ) will follow since the series with general term
Sk =
(1)k (M Im )k
j=0
sup
axa
||(x)1 Im ||.
337
Appendix B
(k)
340
:
There is a natural action of the Poincare group 1 [M ] = 1 [M, m0 ] on M
let be the homotopy class of a loop in M originating and ending at m0 , and
let m
be the homotopy class of a path joining m0 to m in M . The homotopy
class of the concatenation (i.e., the loop followed by the path ) is denoted
by m;
the action
M
1 [M ] M
thus dened is transitive on the bers.
Regular coverings
Let be a subgroup of 1 [M ]; we denote by m
the set {
m
: }, and
}.
M = {m
:m
M
The mapping
: M M , m
m
is called the covering of M associated with the subgroup of 1 [M ]; we will use
the shorthand notation
/ (hence M = M
/1 [M ]).
M = M
= (
m)
for every ,
If is a normal subgroup of 1 [M ], then (m)
/ and
hence 1 [M ], or rather 1 [M ]/, acts on M = M
1 [M ]/ = 1 (M ).
The covering M is in this case called a regular covering of M . Notice that
/)/(1 [M ]/) = M
/1 [M ].
M = M /(1 [M ]/) = (M
The order of a covering M is the (constant) number of elements of each bre
1 (m). It is equal to the order of the group 1 [M ]/. We have:
Proposition B.1. If 1 [M ] = (Z, +), then the only covering of M having innite
order is its universal covering.
Proof. Let M be a covering; is thus a subgroup of (Z, +) and hence consists of
the multiples of some integer k. If k = 0 then the quotient group Z/kZ is nite,
.
and so is the order of M . If k = 0, the covering it denes is M
Appendix C
Pseudo-Dierential Operators
In traditional pseudo-dierential calculus, as practiced by most mathematicians
working in the theory of partial dierential equations, one associates to a suitable
n
n
symbol a C (R2n
z ) an operator A dened, for Co (Rx ) (or S(Rx )), by
the formula
n/2
n
A(x) = 1
p
(C.1)
eip,x a(x, p)(p)d
2
a (x)p , a C (Rnx )
a(x, p) =
||m
a (x)Dx .
A=
||m
342
m
C.1 The Classes S,
, Lm
,
One of the most used class of symbols is exhibited in the following denition:
Denition C.1. We say that a is a classical pseudo-dierential symbol on R2n
z if
a C (R2n
z ) and if there exist real numbers m, , with 0 < 1 and such
that for all multi-indices and in Nn and every compact subset K of Rnx we
can nd a constant C,,K > 0 such that
| a(x, p)| C,,K (1 + |p|)m||
(C.2)
n
Notice that in general A Lm
, (Rz ) does not map the space C0 (Rx ) of
compactly supported C functions into itself. One however proves that there
m
2n
exists an operator R L (R2n
z ) (i.e., R L (Rz ) for every m R) such that
n
A = A0 + R and A0 : C0 (Rx ) C0 (Rx )). The operator R is smoothing in
n
the sense that R : E (R2n
z ) C0 (Rx ).
N
1
mN
aj S,
(R2n
z ) for N 1.
j=0
2n
Let A Lm
, (Rz ) have symbol a. The adjoint A is also a pseudo-dierential
operator and its symbol b is then determined by the asymptotic expansion
1
b(x, p)
p Dx a(x, p).
!
n
N
343
Note that this formula is very complicated compared to the easy rule used when
one deals with Weyl pseudo-dierential operators.
m2
2n
2n
1
Let A Lm
1 ,1 (Rz ), B L2 ,2 (Rz ) have respective symbols a and b.
Assume that the composed operator A B exists (this can always be assumed to
be true replacing A by A0 such that A A0 = R L (R2n
z )). Then C = A B
is a pseudo-dierential operator
1 +m2
C Lm
(R2n
z ) , = min{1 , 2 } , = max{1 , 2 }
,
1
D a(x, p)p b(x, p).
! x
n
Appendix D
Var(Z) is called the standard deviation of Z.
346
Here are a few other concepts we will use. The convolution product
( )(z) = (z u) (u)dn u
(D.1)
(D.2)
Cov(Zj , Zk )
= (Zk , Zj )
Zj Zj
347
1 m/2
2
det(1/2 )e 2
(z
z ),z
z
(D.3)
eu /2 du = 1 for > 0;
2
this equality immediately follows, changing variables, from the classical Gauss
integral
2
1
eu /2 du = 1.
2
(ii) Setting y = z z we have
m
Zi zi = zi (z)d z zi zi (z)dm z
1 1
1
m/2
= 2
det(1/2 ) yi e 2 y,y dm y
and the last integral is zero since the integrand is an odd function; hence Zi = zi
as claimed.
(iii) Set ij = Cov(Zi , Zj ); by denition ij = Zi Zj Zi Zj hence, performing
again the change of variables y = z z,
1 1
1
m/2
ij = 2
det(1/2 ) yi yj e 2 y,y dm y.
Let now R = (rij )1i,jm be an orthogonal matrix such that D1 = R1 RT is
diagonal; setting y = Ru, we have
1
1
m/2
1
ij = 2
det(D1/2 ) fij (u)e 2 D u,u dm u,
where the functions fij are given by
fij (u) =
k,=1
rik rj uk u .
348
Noting that
uk u e 2
d u = 0 for k =
u,u m
1 m/2
2
det(D1/2 )
ri rj
u2 e 2 D
u,u m
d u.
(D.4)
=1
t2 et
/2d
dt
2
t2 et /2 dt = 3/2 2( 32 ) = 3/2 2
we thus have
u2k e 2 D
3/2
u,u m
d u = (2)m/2 k
0m
and hence (D.4) becomes ij =
=1 ri rj . The sum on the right-hand side
being the ith row and jth column entry of RDRT = , this formula concludes the
proof.
Proposition D.4. Let and be two Gaussian probability densities centered at
z and z , respectively; then = where = + and is centered
at z = z + z .
Proof. It is sucient to consider the case z = z = 0. The Fourier transform
of is
F () =
1
2
m/2
e
i,z
hence
F ()F () =
m
(z)d z =
1
2
2m
1
2
m
e 2 , ,
e 2 (+ ), .
We have
F ( )() = (2)m F ()F ()
so that = + as claimed.
S 2n
which is absurd. (This example generalizes to any 2n-dimensional compact manifold such that H k (M ) = 0 for k = 0 and k = 2n.)
Solution of Exercise 1.6. Let us prove that if U = A+iB is any matrix, symplectic,
or not (A and B being real n n matrices), then the determinant of the 2n 2n
block matrix
A B
U=
B A
is given by the simple formula
det U = | det(A + iB)|2 .
(D.5)
det
A
B
B
A + iB
= det
A
0
0
A iB
0
A iB
0
= 4n
2In
350
K
for 0 t 1 2s ,
f2t/(2s)
K
f1 for 2s t 1.
We have a(t, 0) = ftH , b(t, 0) = ftK hence h(t, 0) = ftH ftK ; similarly
K
f2t
for 0 t 12 ,
h(t, 1) =
H
f1K for 12 t 1,
f2t1
that is h(t, 1) = ft .
Solution of Exercise 5.48. The phase of T (za )Vn is
a (
z ) = (
z) +
1
2
pa , xa + pa , x ,
351
hence that of StH (T (za )Vn ) is (using (5.36) and the linearity of StH ):
A(t) = (
z0 ) +
1
2
pa , xa + pa , x
+
1
2
1
2
p + pa , x + xa
where z0,t = StH z0 , za,t = StH za . Similarly, the Hamiltonian phase of StH Vn is
(
z , t) = (
z0 ) + 12 (pt , xt p, x),
hence that of T (StH (za ))Vn is
B(t) = (
z ) + 12 (pt , xt p, ) +
1
2
and thus
A(t) B(t) = 12 (pa , x p, xa ) 12 (pa,t , xt pt , xa,t )
= 12 ((za , z) (za,t , zt ))
= 12 ((za , z) (StH za , StH z)).
Since StH Sp(n) we have (StH za , StH z) = (za , z) and hence A(t) = B(t).
Solution of Exercise 2.47. The condition Xt sp(n) is equivalent to JXt being
symmetric. Hence
d T
(S JSt ) = StT XtT JSt + StT JXt St = 0
dt t
so that StT JSt = S0T JS0 = J and St Sp(n) as claimed.
Solution of Exercise 8.29. Write the covariance matrix in the form
XX XP
=
, P X = TXP ,
P X P P
that is
XX = Cov(Xi , Xj )1i,jn , (Xj )2 = Cov(Xj , Xj ),
P P = Cov(Pi , Pj )1i,jn , (Pj )2 = Cov(Pj , Pj )
and XP = Cov(Xi , Pj )1i,jn . We have
(Xi )2 +
Cov(Xi , Pi ) 2i
Cov(Xi , Pi ) + 2i
>0
(Pi )2 +
352
1 n
2
1 n
2
f (Jz) = F 1 g(z) =
f (z) =
e (z,z ) f (z )d2n z .
and hence
so that
1 n
2
1 n
2
=
A
cj ()j j , cj () = (, j )H
j=1
and hence
j ) = (A
j , )H = j (j , )H ,
A(
that is
j) =
A(
j cj ()ck ().
k=1
j |cj ()|2 ;
k=1
.
but this is just the expectation A
Solution of Exercise 8.46. It is clear that 0. To prove that
(z)dpdx = 1
2
t
2m
353
1 n
2
1
0
S=
0
0
0
2
0
0
0
0
1/1
0
0
0
0
1/2
354
0
S =
0
0
and the blocks
A= 1
0
0
2
0
2
0
0
0
0
1/2
0
0
0
0
1/1
1/2
, D=
0
0
1/1
do not satisfy the condition AT D = DT A which is necessary for S to be symplectic. The section S (B 2n (R)) by the symplectic x2 , p2 plane is the ellipse
1 2
x + 2 p21 R2
1 1
1 /2 = R2 .
Solution of Exercise 6.3 (ii). The isomorphism (6.9) is C and induces a Lie
hn ; the Jacobian of at (z, t) = (0, 0)
algebra isomorphism d(0, 0) : hpol
n
pol
being the identity it follows that hn = hpol
n . Let us determine hn . We have
M (z, t) = I + m(z, t) where
0 pT t
m(z, t) = 0 0 x ;
0 0 0
m(z, t) is nilpotent: m(z, t)k = 0 for k > 2 and m(z, t)2 = m(0, p, x) and hence
1
em(z,t) = I + m(z, t) + m(0, p, x = M (z, t +
2
so that hpol
n consists of all matrices
1
X pol (z, t) = 0
0
pT
1
0
1
2
p, x)
p, x
.
x
1
1
2
Solution of Exercise 6.3. This immediately follows from Proposition 10.18, (i),
using the fact that U is an isometry:
= ||(, t)||L2 (Rnx )
(z, t)d2n z = ||U (, t)||L2 (R2n
z )
and hence, since ||(, t)||L2 (Rnx ) is conserved in time,
2n
2
n
2
2n
(z, t)d z = ||(, 0)||L (Rx ) = ||U (, 0)||L (Rz ) = (z, 0)d2n z.
Bibliography
[1] R. Abraham and J.E. Marsden. Foundations of Mechanics. The Benjamin/Cummings Publishing Company, Second Edition, 1978.
[2] R. Abraham, J.E. Marsden, and T. Ratiu. Manifolds, Tensor Analysis, and Applications. Applied Mathematical Sciences 75, Springer, 1988.
[3] V.I. Arnold. Mathematical Methods of Classical Mechanics. Graduate Texts in
Mathematics, second edition, Springer-Verlag, 1989.
[4] V.I. Arnold. A characteristic class entering in quantization conditions. Funkt. Anal.
i. Priloz. 1(1):114 (in Russian), 1967; Funct. Anal. Appl. 1:114 (English translation), 1967.
[5] Arvind, B. Dutta, N. Mukunda and R. Simon. 1995 The Real Symplectic Groups
in Quantum Mechanics and Optics; e-print arXiv:quant-ph/9509002 v3.
[6] A. Banyaga. Sur la structure du groupe des dieomorphismes qui preservent une
forme symplectique. Comm. Math. Helv. 53:174227, 1978.
[7] F. Bayen, M. Flato, C. Fronsdal, A. Lichnerowicz, and D. Sternheimer. Deformation Theory and Quantization. I. Deformation of Symplectic Structures. Annals of
Physics 111, 6110, 1978; II Physical Applications 110, 111151, 1978.
[8] S. Bergia and L. Navarro. On the early history of Einsteins quantization rule of
1917 (Preprint).
[9] E. Binz, S. Pods, and W. Schempp. Heisenberg groups a unifying structure of signal theory, holography and quantum information theory. J. Appl. Math. Comput.
11, 157, 2003.
[10] E. Binz, S. Pods, and W. Schempp. Heisenberg groups the fundamental ingredient
to describe information, its transmission and quantization. J. Phys. A: Math. Gen.
36, 64016421, 2003.
[11] D. Bleecker and B. Booss-Bavnbek. Index Theory with Applications to Mathematics
and Physics. Preprint 2004.
[12] D. Bohm. Quantum Theory. Prentice Hall, New York, 1951.
[13] M. Born and P. Jordan. Zur Quantenmechanik. Zeitschrift f
ur Physik 34, 858888,
1925.
[14] B. Booss-Bavnbek and K. Furutani. The Maslov Index: a Functional Analytical
Denition and the Spectral Flow Formula. Tokyo J. Math. 21(1), 1998.
[15] B. Booss-Bavnbek and K.P. Wojciechowski. Elliptic Boundary problems for Dirac
Operators, Birkh
auser, 1993.
356
Bibliography
[16] M. Brown. The Symplectic and Metaplectic Groups in Quantum Mechanics and
the Bohm Interpretation. Ph.D. Thesis, University of London, 2004.
[17] N.G. de Bruijn. A theory of generalized functions, with applications to Wigner
distribution and Weyl correspondence. Nieuw Archiev voor Wiskunde, 21, 205
280, 1973.
[18] F.A. Berezin and M.A. Shubin. The Schr
odinger Equation. Kluwer Publishers,
1991.
[19] V.C. Buslaev. Quantization and the W.K.B method. Trudy Mat. Inst. Steklov
110:528, 1978 [in Russian].
[20] J. Buttereld. On HamiltonJacobi Theory as a Classical Root of Quantum Theory. Preprint submitted to the proceedings of Quo Vadis Quantum Mechanics?
(Conference at Temple University, Sept. 2002, Ed. A. Elitzur and N. Kolenda,
2003.
[21] A. Cannas da Silva. Lectures on Symplectic Geometry. Springer-Verlag, Berlin
Heidelberg New York, 2001.
[22] S.E. Cappell, R. Lee, and E.Y. Miller. On the Maslov index. Comm. Pure and
Appl. Math. 17 121186, 1994.
[23] Y. Choquet-Bruhat, C. DeWitt-Morette. Analysis, Manifolds and Physics. Part II
Revised and Enlarged. Elsevier Science Publishing Company, 2001.
[24] M. Combescure and D. Robert. Semiclassical spreading of quantum wave packets
and applications to near unstable xed points of the classical ow. Asymptotic
Analysis 14 (1997) 377404
[25] C. Conley and E. Zehnder. Morse-type index theory for ows and periodic solutions
of Hamiltonian equations. Comm. Pure and Appl. Math. 37:207253, 1984.
[26] A. Crumeyrolle. Orthogonal and Symplectic Cliord Algebras. Kluwer Academic
Publishers, 1990.
[27] R.H. Cushman and L.M. Bates. Global Aspects of Classical Integrable Systems,
Birkh
auser, 1997.
[28] P. Dazord. Invariants homotopiques attaches aux bres symplectiques. Ann. Inst.
Fourier 29(2):2578, 1979.
[29] M. Demazure. Classe de Maslov II, Expose numero 10, Seminaire sur le bre
cotangent, Orsay (197576).
[30] J. Dieudonne. Foundations of Modern Analysis. Translated from the French Academic Press.
[31] P.A.M. Dirac. The Principles of Quantum Mechanics (Oxford Science Publications, fourth revised edition, 1999).
[32] D. Dragoman. Phase space formulation of quantum mechanics. Insight into the
measurement problem. Phys. Scripta 72, 290296, 2005.
[33] D.A. Dubin, M.A. Hennings, and T.B. Smith. Mathematical Aspects of Weyl Quantization and Phase. World Scientic, 2000.
[34] J.J. Duistermaat. On the Morse index in variational calculus. Adv. in Math. 21
173195, 1976.
[35] J.J. Duistermaat. On operators of trace class in L2 (X, ). In Geometry and Analysis. Papers Dedicated to the Memory of V.K. Patodi, Ed. Indian Academy of
Science, 2932, Springer-Verlag, 1981.
Bibliography
357
[36] Ju.V. Egorov and B.-W. Schulze. Pseudo-dierential operators, singularities, applications. Vol. 93 of Operator Theory, Advances, and Applications. Birkh
auser,
Basel, 1997.
[37] A. Einstein. Zum Quantensatz von Sommerfeld und Epstein. Verhandlungen der
Deutschen Phys. Ges. 9/10, 1917.
[38] Y. Elskens and D. Escande. Microscopic Dynamics of Plasmas and Chaos. IOP
Publ. Ltd., 2003.
[39] I. Ekeland. Convexity Methods in Hamiltonian Mechanics. Springer-Verlag, Berlin,
1990.
[40] U. Fano. Description of States of Quantum Mechanics and Operator Techniques.
Rev. Mod. Phys. 29(74), 1967.
[41] C. Feerman and D.H. Phong. The uncertainty principle and sharp G
arding inequalities. Comm. Pure Appl. Math. 75, 285331, 1981.
[42] G.B. Folland. Harmonic Analysis in Phase space. Annals of Mathematics studies,
Princeton University Press, Princeton, N.J., 1989.
[43] T. Frankel. The Geometry of Physics, An Introduction. Cambridge University
Press, 1997.
[44] A. Friedman. Foundations of Modern Analysis. Dover, 1982.
[45] B. Gaveau, P. Greiner, and Vauthier. Integrales de Fourier quadratiques et calcul
symbolique exact sur le groupe de Heisenberg. J. Funct. Anal. 68, 1986, 248272.
[46] Geometry and Quantum Field Theory. D.S. Freed and K.K. Uhlenbeck editors.
IAS/Park City Mathematics series, Volume 1, Amer. Math. Soc. and Institute for
Advanced Studies, 1995.
[47] G. Giacchetta, L. Mangiarotti, and G. Sardanashvily. Geometric quantization of
completely integrable Hamiltonian systems in the action-angle variables. Phys.
Lett. A 301, 5357, 2002.
[48] G. Giachetta, L. Mangiarotti, G. Sardanashvily. Geometric and Algebraic Topological Methods in Quantum Mechanics, World Scientic, Singapore, 2005.
[49] G. Giedke, J. Eisert, J.I. Cirac, and M.B. Plenio. Entanglement transformations
of pure Gaussian states. E-print arXiv:quant-ph/0301038 v1, 2003.
[50] C. Godbillon. Geometrie Dierentielle et Mecanique Analytique, Hermann, Paris,
1969.
ements de Topologie Algebrique. Hermann, Paris, 1971.
[51] C. Godbillon. El
[52] I. Gohberg and S. Goldberg. Basic Operator Theory. Birkh
auser, 1981.
[53] H. Goldstein. Classical Mechanics. AddisonWesley, 1950; second edition, 1980;
third edition, 2002.
[54] M. de Gosson. La denition de lindice de Maslov sans hypoth`ese de transversalite.
C. R. Acad. Sci., Paris, Serie I. 309:279281, 1990.
[55] M. de Gosson. La relation entre Sp , revetement universel du groupe symplectique
Sp et Sp Z. C. R. Acad. Sci., Paris, Serie I, 310:245248, 1990.
[56] M. de Gosson. Maslov Indices on Mp(n). Ann. Inst. Fourier, Grenoble, 40(3):537
555, 1990.
[57] M. de Gosson. The structure of q-symplectic geometry. J. Math. Pures et Appl.
71, 429453, 1992.
358
Bibliography
Bibliography
359
[80] H.J. Groenewold. On the principles of elementary quantum mechanics. Physics 12,
405460, 1946.
[81] M. Gromov. Pseudoholomorphic curves in symplectic manifolds. Invent. Math.
82:307347, 1985.
[82] A. Grossmann. Parity operators and quantization of -functions. Commun. Math.
Phys. 48 191193,1976.
[83] A. Grossmann, G. Loupias, and E.M. Stein. An algebra of pseudo-dierential operators and quantum mechanics in phase space, Ann. Inst. Fourier, Grenoble 18(2)
343368, 1968.
[84] V. Guillemin and S. Sternberg. Geometric Asymptotics. Math. Surveys Monographs 14, Amer. Math. Soc., Providence R.I., 1978.
[85] V. Guillemin and S. Sternberg. Symplectic Techniques in Physics. Cambridge University Press, Cambridge, Mass., 1984.
[86] M.C. Gutzwiller. Chaos in Classical and Quantum Mechanics. Interdisciplinary
Applied Mathematics, Springer-Verlag, 1990.
[87] K. Habermann and L. Habermann. Introduction to Symplectic Spinors (preprint),
2005.
[88] Quantum Implications: Essays in Honour of David Bohm. Edited by B.J. Hiley
and F. David Peat. Routledge, London and New York, 1987.
360
Bibliography
Bibliography
361
[123] A. Messiah. Quantum Mechanics (two volumes). NorthHolland Publ. Co., 1991;
translated from the French; original title: Mecanique Quantique. Dunod, Paris,
1961.
[124] A.S. Mischenko, V.E. Shatalov, and B.Yu. Sternin. Lagrangian Manifolds and the
Maslov Operator, Springer-Verlag, Heidelberg, 1990.
[125] W. Moore. Schr
odinger: life and thought Cambridge University Press, 1989.
[126] M. Morse. The Calculus of Variations in the Large. AMS, Providence, R. I., 1935.
[127] J.E. Moyal. Quantum mechanics as a statistical theory. Proc. Camb. Phil. Soc.
45:99124, 1947.
[128] V. Nazaikiinskii, B.-W. Schulze, and B. Sternin. Quantization Methods in Dierential Equations. Dierential and Integral Equations and Their Applications, Taylor
& Francis, 2002.
[129] D. Park. Introduction to the Quantum Theory, 3d edition. McGrawHill, 1992.
[130] M.A. Javaloyes and P. Piccione. Conjugate points and Maslov index in locally
symmetric semi-Riemannian manifolds. Preprint 2005.
[131] R.C. Nostre Marques, P. Piccione, and D. Tausk. On the Morse and the Maslov
index for periodic geodesics of arbitrary causal character. Dierential Geometry
and its Applications. Proc. Conf. Opava, 2001.
[132] L. Polterovich. The Geometry of the Group of Symplectic Dieomorphisms. Lectures in Mathematics, Birkh
auser, 2001.
[133] J.C.T. Pool. Mathematical aspects of the Weyl correspondence. J. Math. Phys. 7,
6676, 1966.
[134] M. Reed and B. Simon. Methods of Modern Mathematical Physics. Academic Press,
New York, 1972.
[135] J. Robbin and D. Salamon. The Maslov index for paths. Topology 32(4), 827844,
1993.
[136] H.P. Robertson. The uncertainty principle. Phys. Rev. 34, 163164, 1929.
[137] A. Royer. Wigner functions as the expectation value of a parity operator. Phys.
Rev. A 15, 449450, 1977.
[138] D. Ruelle. Statistical mechanics: rigorous results. Benjamin, 1974.
[139] M. Schechter. Operator Methods in Quantum Mechanics. North-Holland, 1981.
[140] W. Schempp. Harmonic Analysis on the Heisenberg Nilpotent Lie Group. Pitman
Research Notes in Mathematics 147, Longman Scientical and Technical, 1986.
[141] W. Schempp. Magnetic Resonance Imaging: Mathematical Foundations and Applications. Wiley, 1997.
[142] W.P. Schleich. Quantum Optics in Phase Space. Wiley-VCH, Berlin, 2001.
[143] E. Schr
odinger. Zum Heisenbergschen Unsch
arfeprinzip. Berliner Berichte, 296
303, 1930.
[144] B.-W. Schulze. Pseudo.dierential operators on manifolds with singularities.
North-Holland, Amsterdam, 1991.
[145] B.-W. Schulze. Operators with symbol hierarchies and iterated asymptotics. Publications of RIMS, Kyoto University, 38(4), 735802, 2002.
[146] B.-W. Schulze. Boundary value problems, conical singularities, and asymptotics.
Akademie Verlag Berlin, 1994.
362
Bibliography
[147] I.E. Segal. Foundations of the theory of dynamical systems of innitely many
degrees of freedom (I). Mat. Fys. Medd., Danske Vid. Selsk., 31(12):139, 1959.
[148] H. Seifert and W. Threlfall. A Textbook on Topology. Academic Press, 1980 [Original version: Lehrbuch der Topologie. Teubner, 1934]
[149] D. Shale. Linear Symmetries of free Boson elds. Trans. Amer. Math. Soc. 103:149
167, 1962.
[150] B. Simon. Trace Ideals and Their Applications. Cambridge University Press, 1979.
[151] B. Simon. The Weyl transform and Lp functions on phase space. Amer. Math. Soc.
116, 10451047, 1992.
[152] R. Simon, N. Mukunda and B. Dutta. Quantum Noise Matrix for Multimode
Systems: U(n)-invariance, squeezing and normal forms. Phys. Rev. A 49, 1567
1583, 1994.
[153] R. Simon, E.C.G. Sudarshan, and N. Mukunda. GaussianWigner distributions in
quantum mechanics and optics. Phys. Rev. A 36(8), 38683880, 1987
[154] I.M. Singer and J.A. Thorpe. Lecture Notes on Elementary Geometry and Topology.
Springer-Verlag, 1987.
[155] V.A. Sobolev. Absolute continuity of the periodic magnetic Schr
odinger operator.
Invent. Math. 137(1) 85112, 1999.
[156] J.-M. Souriau. Indice de Maslov des varietes lagrangiennes orientables. C. R. Acad.
Sci., Paris, Serie A, 276:10251026, 1973.
[157] E. Spanier. Algebraic Topology. McGrawHill Book company, 1966.
[158] E.M. Stein. Harmonic Analysis: Real Variable Methods, Orthogonality, and Oscillatory Integrals. Princeton University Press, 1993.
[159] E.M. Stein and G. Weiss. Fourier Analysis on Euclidean Spaces. Princeton University Press, 1971.
[160] J. Toft. Regularizations, decompositions and lower bound problems in the Weyl
calculus. Comm. Partial Di. Eq., 25, 12011234, 2000.
[161] J. Toft. Continuity properties in non-commutative convolution algebras, with applications in pseudo-dierential calculus. Bull. Sci. math. 126, 115142, 2002.
[162] G. Torres-Vega and J.H. Frederick. Quantum mechanics in phase space: New approaches to the correspondence principle. J. Chem. Phys. 93(12), 88628874, 1990.
[163] G. Torres-Vega and J.H. Frederick. A quantum mechanical representation in phase
space. J. Chem. Phys. 98(4), 31033120, 1993.
[164] F. Tr`eves. Introduction to Pseudo-dierential and Fourier Integral Operators (two
Volumes). University Series in Mathematics, Plenum Press, 1980.
[165] F. Tr`eves. Topological Vector Spaces, Distributions, and Kernels. Academic Press,
New York, 1967.
[166] G. Tuynman. What is prequantization, and what is geometric quantization? Proceedings, Seminar 19891990, Mathematical Structures in eld theory, 128. CWI
Syllabus 39. CWI, Amsterdam, 1996.
[167] A. Unterbeger. Automorphic pseudo-dierential Analysis and Higher Level Weyl
Calculi. Progress in Mathematics, Birkh
auser, 2003.
[168] I. Vaisman. Symplectic Geometry and Secondary Characteristic Classes. Birkh
auser, Progress in Mathematics 72, 1987.
Bibliography
363
[169] L. van Hove. Sur certaines representations unitaires dun groupe ni de transformations. Mem. Acad. Roy. Belg. Classe des Sci. 26(6), 1952.
[170] V.S. Varadarajan. Lie groups, Lie algebras, and their Representations. Prentice
Hall, 1974; Springer Verlag 1984.
[171] J. von Neumann. Mathematical Foundations of Quantum Mechanics. Princeton
University Press, reedition, 1996
[172] A. Voros. Asymptotic h-expansions of stationary quantum states. Ann. Inst. H.
Poincare, Sect. A, 26:343403, 1977.
[173] A. Voros. An algebra of pseudo-dierential operators and the asymptotics of quantum mechanics. J. Funct. Anal. 29:104132, 1978.
[174] C.T.C. Wall. Nonadditivity of the signature. Invent. Math. 7, 1969, 269274.
[175] N. Wallach, Lie Groups: History, Frontiers and Applications, 5. Symplectic Geometry and Fourier Analysis, Math. Sci. Press, Brookline, MA, 1977.
[176] A. Weil. Sur certains groupes doperateurs unitaires. Acta Math. 111:143211, 1964;
also in Collected Papers, Vol. III:169, Springer-Verlag, Heidelberg, 1980.
[177] A. Weinstein. Symplectic Geometry. Bull. Amer. Math. Soc. 5(1):111, 1981.
[178] A. Weinstein. Lectures on symplectic manifolds. CBMS Conf. Series 29, Amer.
Math. Soc., Providence R.I., 1977.
[179] H. Weyl. The Theory of Groups and Quantum Mechanics. Methuen, London, 1931
[reprinted by Dover Publ., New York, 1950].
[180] E. Wigner. On the quantum correction for thermodynamic equilibrium, Phys. Rev.
40, 799755, 1932.
[181] M.W. Wong. Weyl Transforms. Springer, 1998.
[182] M.W. Wong. Trace-Class Weyl Transforms. Integr. equ. oper. theory 99, Birkh
auser Verlag, Basel, 2003.
Index
action-angle variables, 141
ane
symplectic group, 31
symplectic isomorphisms, 31
ALM index, 71, 75, 99
properties of, 81
reduced, 85, 150
on Lag2q (n), 94
Arnold-Leray-Maslov index, see ALM
index
Bessels inequality, 273
Bochner integral, 168, 223
canonical
2-form, 10
commutation relations, 160, 161
symplectic basis, 7
transformation, 55
Cartans theorem, 334
Cayley transform (symplectic), 223
characteristic function, 346
coboundary, xx
coboundary operator, 75, 76, 85
cochain, xx, 76
cocycle, xx
property of Kashiwaras signature,
see Kashiwara signature
coisotropic, 11
completely integrable, 142
complex
structure, 6
concatenation, 96
ConleyZehnder index, 104, 227, 327
constant of the motion, 140, 145
correlation of a pair of random
variables, 346
covariance, 241
366
Hamilton
equations, 51
vector eld, 51
suspended, 52
HamiltonJacobi equation, 133
Hamiltonian
function, 51
phase, 138
symplectomorphism, 58
Hamiltonian ow
ow, 52
suspended, see Hamiltonian
time-dependent, 53
Hamiltonian function, see Hamiltonian
Heisenberg
algebra, 161
group, 162, 307
HeisenbergWeyl operator, 152, 163,
304
HilbertSchmidt norm, 280
HilbertSchmidt operator, 279, 291
integral, 288
imprimitivities, 309
integral operator, 282
involution (functions in), 140
isotropic, 11
Jacobian
matrix, xviii
Kashiwara, see WallKashiwara
signature
Kronecker ow, 143
Lagrangian
Grassmannian, 15, 16
manifold, 123
exact, 125, 129
q-oriented, 147
plane, 15
set, 11
submanifold, 124
torus, 143
Liouvilles equation, 141, 297
Index
Maslov
bundle, 79
cycle, 96
Maslov index, 96, 97
denition of, 71
on Lag(n), 66, 70, 75
on Mp(n), 197, 214
on Sp (n), 87, 215
mathematical expectation, 240, 291
metaplectic
group, 195, 198
Maslov index, 215
operator, 198
metaplectic covariance
for the WignerMoyal transform,
207
for wave-packet transforms, 312
of Weyl operators, 204
minimum uncertainty, 261
mixed state, 293
monodromy matrix, 61
Moyal identity, 187
observable
classical, 190, 238
quantum, 239
orientable covering, 150
oscillatory integral, 174
phase of a Lagrangian manifold, 125,
126
PoincareCartan invariant, 130, 305
Poisson
brackets, 139, 209
commuting functions, 140
polarized Heisenberg group, 162
positive operator, 266
primitive (of a cocycle), 76
probability density, 345
pure state
quantum, 291
q-orientation, 94
q-symplectic geometry, 65, 84, 94
quadratic Fourier transform, 197
quantum blob, 255, 265, 266, 307
quantum state, 239
quasi-probability density, 190
Index
367
symplectic group
direct sum, 31
symplectic isomorphism, 14
symplectic shear, 30, 257
symplectic spectrum, 246, 252
symplectomorphism, 27, 55
Schr
odinger
equation, 156, 209, 271
in phase space, 324
representation, 163
Schwartz
kernel theorem, 170
space, xix
skew
orthogonality, 11
product, 3
Souriau mapping, 66
spectral ow, 102
standard deviation, 345
standard symplectic
form, 4
space, 4
Stonevon Neumann theorem, 309
stratication, 95
stratum, 95
subsystem, 258
Symp(n), 28
symplectic
algebra, 36
area, 248, 249
ball, 255
base, 11
basis, 7
block matrix, 27
capacity, 261
linear, 249
Cayley transform, 223, 321
form, 3
Fourier transform, 167, 188
radius, 248, 251
subset, 11
vector space, 3
symplectic covariance
of Hamiltons equations, 56
of the symplectic Fourier transform,
167
symplectic gradient, 52
topology
of Ham(n), 60
of Symp(n), 60
trace (of a trace-class operator), 275
trace-class operator, 273, 275
self-adjoint, 282
translation operator, 152
twisted convolution, 182
twisted form, see de Rham form
twisted symbol, 168
uncertainty principle, 240, 261
universal covering
manifold, 126
of Lag(n), 79, 85
of Sp(n), 87
variance, 241, 345
variational equation, 55
WallKashiwara signature
cocycle property, 23
denition, 19
waveform, 150, 153
Weyl
operator, 166
composition of, 179
operator (adjoint of), 173
symbol, 168, 179
of a metaplectic operator, 222
symbol (twisted), 168, 181
Wigner ellipsoid, 253
Wigner transform, 186, 263, 303
range of, 192
Wigner wave-packet transform, 311
intertwining formulae, 317
range of, 314
WignerMoyal transform, 186, 311
Subseries
Advances in
Partial
Dierential
Equations
Subseries editors:
Bert-Wolfgang Schulze
Institut f
ur Mathematik
Universitat Potsdam
14415 Potsdam
Germany
Sergio Albeverio
Institut f
ur Angewandte Mathematik
Universitat Bonn
53115 Bonn
Germany
Jerome A. Goldstein
The University of Memphis
Mathematical Sciences
273 Dunn Hall
Memphis, TN 38152-3240
USA
Noboyuki Tose
COE of Mathematics
Keio University
4-1-1 Hiyoshi
Kohoku
Yokohama 223-8521
Japan
Michael Demuth
Institut f
ur Mathematik
Technische Universitat Clausthal
38678 Clausthal-Zellerfeld
Germany
The subseries Advances in Partial Dierential Equations is intended to report on recent
developments in all areas of partial dierential equations, in particular, on those belonging to the tradition dened by the series Operator Theory: Advances and Applications
and new areas. The Advances volumes will consist primarily of expository research articles, presenting both an overview of the state of a eld and new results, in such areas as
microlocal analysis and its applications to asymptotic phenomena, mathematical physics,
spectral theory, as well as symplectic geometry, the analysis of singularities of solutions,
and geometric analysis. Selected articles of this kind may also appear as a separate monographic volume. A certain number of short communications will also be published.
Proposals for manuscripts may be sent to B.-W. Schulze or to one of the other subseries
editors.