Professional Documents
Culture Documents
1 Introduction
This paper deals with maximization of classical f -divergence between the distri-
butions of a measurent outputs of a given pair of quantum states. f -divergence
Df between the probability density functions p1 and p2 over a discrete set is
de…ned as
X p1 (x)
Df (p1 jjp2 ) := p2 (x) f ;
x
p2 (x)
if p2 (x) > 0 for all x. (The de…nition for the general case is given later.)
This problem is import since Df has good operational meanings. For some
choice of f , its operational meaning is well-known. For example,
fKL ( ) := ln ; fKL2 ( ) = ln
; ( 0; 1) ;
f ( ) := (1)
; (0 < < 1) :
correspond to Rnyi-type relative entropy (or its monotone function). They play
key role in the theory of large deviation, and thus extensively used in asymptotic
analysis of eorror probability of decoding, hypothesis test, and so on.
Not only these choice of f , but others have also a good operational meaning.
If f is a proper closed convex function f with dom f (0; 1), Df (p1 jjp2 ) is
the optimal gain of a certain Bayes decision problem. For each f satisfying
above mentioned conditions, there is a pair of real valued functios w1 and w2
on decision space representing a gain of decision d, with
X
Df (p1 jjp2 ) = sup (w1 (d (x)) p1 (x) + w2 (d (x)) p2 (x)) : (2)
d( ) x
1
Conversely, for each pair (w1 ( ) ; w2 ( )), there is a proper closed convex function
f satisfying the above idenity.
Also, by (2) and the celebrated randomization criterion [6][7], there is a
Mrkov map which sends (p; q) to (p0 ; q 0 ) if and only if Df (pjjq) Df (p0 jjq 0 )
holds for any proper closed convex function f with dom f (0; 1).
In quantum information and statistics, we make decisions based on the data
out of measurements performed on given quantum states. Thus, a honest quan-
tum version of the concept is
where POVM is the short for positive operator valued measure, and P M is
the probability distribution obtained by applying the measurement M to the
state ( 2 f1; 2g). (Underlying Hilbert space, denoted by H, is …nite dimen-
sional throughout the paper.) A trouble with Dfmin ( 1 jj 2 ) is that the de…ni-
tion involves the intractable maximization of non-linear functional of POVM.
(If they were tractable, quantum asymptotic theory of hypothesis test and so
on should have been easier. As is well-known, these had been solved in quite
non-trivial manner.) This optimization had been so far solved only for f1=2 ,
and f ( ) = j1 j. The former and the latter corresponds to …delity and total
valiation, respectively.
The purposes of the present paper is to advance the study further, by inves-
tigating its properties, rewriting the maximization problem to more tractable
form, and giving closed formulas of the quantity in some special cases. One of
the two core results is Theorem 6: if the convex conjugate f of f is operator
convex, Dfmin ( 1 jj 2 ) is written as the supremum of a concave funcction of Her-
mitian operator in H (If the operators could move extended Hilbert space, the
result would not have been as useful), namely,
Dfmin ( 1 jj 2 ) = Dfmin ( 1 jj 1 2 1 ) + f ( 1) (1 tr 2 1 );
where 1 is the projection onto supp 1 . Using this, closed formula for pure
states is obtained, in case where f satis…es above mentioned condtions.
Using above results, we computed the limit
1 min
lim 2 Df ( jj 0 );
0
! ( 0 )
2
of the parameter . In case that > 0 for all , it have been a forklore that this
equals the constant multiple of SLD (symmetric logarithmic derivative) Fisher
information, which plays important role in the asymptotic theory of statistics.
We analyze the case where is not full-rank, and show this statement requires
non-trivial correction.
2 Classical f -divergence
In this section, we summerize the known or trivial facts about classical f -
divergence and convex functions. Below, f takes values in R [ f1; 1g ,
dom f := f : f ( ) < 1g. f is convex if and only if f( 1 ; 2 ) ; 2 f ( 1 )g is
convex. This way of de…nition is equivalent to the convention de…ning f ( ) :=
1 outside of the domain. A convex function f is proper if and only if f is
nowhere 1 and not 1 everywhere, and is closed if and only if f( 1 ; 2 ) ; 2 f ( 1 )g
is closed, or equivallently, f is lower semicontinuous.
Let f be a proper closed convex function whose e¤ective domain contains
(0; 1). De…ne g ( 1 ; 2 ) by
8
>
> 2f
1
; 1 2 dom f; 2 > 0
>
> 2
<
g ( 1 ; 2 ) := lim 2 #0 2 f 2 ;1
1 2 dom f; 2 = 0;
>
>
>
> 0; 1 = 2 = 0;
:
1; 1 62 dom f or 2 < 0:
Note g is proper, closed, convex, and positively homogeneous (see p. 35 and p.67
of [5]). Especially,
1
lim 2f > 1: (4)
2 #0 2
0 1
= p1 (x) lim
0 2f 0 ;
2 #0 2
3
where B c is the complement of the set B
where
n o
Wf = w = (w ) 2f1;2g ; w1 w10 ; w2 f (w10 ) ; 9w10 2 dom f ;
f (t) := sup (t f ( )) :
Therefore,
Df (P1 jjP2 )
0 1
Z X
= @ sup w p (x)A d (x)
w2W
2f1;2g
8 9
<Z X =
= sup w (x) p (x) d (x) ; w ( ) : measureable, into Wf : (6)
: ;
2f1;2g
Remark 1 (6) indicates (2).To see this, use Wf as a decision space, and let
the the gain of the decision w is w when true probability distribution is P .
t f (t) t0 f (t0 )
4
hlods for any t0 with t0 > t and f (t0 ) f (t),
f ( ) = sup t f (t) ;
t
= sup t f (t)
t2dom f0
= sup t f0 (t)
t2dom f0
= f0 ( ) :
If f0 6= f , how f0 di¤ers from f ? By assumptuin, f is not monotone
increasing. Also, we exclude unintresting case where f is monotone non-
increasing. Then mint f (t) exists. Let a0 be the largest number such that
f (a0 ) = mint f (t). Then
dom f0 = [a0 ; 1) \ dom f .
Let < 0. Then since f0 is monotone increasing on dom f0 = [a0 ; 1) \ dom f ,
f0 ( ) = sup t f (t)
t2[a0 ;1)\dom f
= ta0 f (a0 ) :
Thus, f0 is a¢ ne on the negative half line.
For example, both of
fTV ( ) := j1 j;
j1 j; 0;
fTV2 ( ) :=
1; < 0;
correspond to the total valiation distance,
DfTV (P jjQ) = DfTV2 (P jjQ) = kP Qk1 :
The former is standard but the latter is not,
8
< 1; t < 1;
fTV (t) = t; 1 t 1; ;
:
1; t > 1;
8
< 1; t < 1;
fTV2 (t) = t; 1 t 1; :
:
1; t > 1;
De…ne
f^ ( ) := g (1; ) ; 0;
and extend it to the negative half line so that it becomes standard. Also, De…ne
g^ by in a parallel manner as the de…nition of g, replacing f by f^. Then,
g^ ( 1; 2) = g( 2; 1) ; (8)
5
if 1 0, as will be checked below.
If ^
1 2 dom f and 2 > 0, the relation is checked by easy computation. If
1 = 2 = 0, the = dom f^ and 2 > 0,
both ends of (8) is 0. Finally, if 1 = 0 2
g^ (0; 2) = 1 = f^ (0) = 2f
^ (0) = 2 g (1; 0) = g( 2 ; 0) ;
Dfmin ( 1 jj 2 ) DfQ ( 1 jj 2 ) :
(If the statement is untrue, it obviously contradict with the monotonicity of
DfQ . )
Remark 2 Here, Df and Dfmax have operational meanngs only if the entities
are normalized. But extending them to general positive …nite measures and
positive operators makes statement of the mathematical statements easier. Since
we sometimes consider submatrices of inputs states, especially in the case that
measurement process is involved.
In this section we dtermine the case where Dfmin stays …nite. Note this result
also give, due to the above mentioned fact, the su¢ cient condition that all the
quantum versions of f -divergence becomes in…nite. Before analyzing non-trivial
cases, we …rst present the cases where Dfmin is …nite trivally.
The …rst trivial case is the one where Wf is "bounded from above", or there
are …nite numbers a1 adn a2 with
Wf ( 1; a1 ] ( 1; a2 ]: (9)
Thus, Dfmax is also …nite. Therefore, we consider the cases where the above
assumption is false.
6
Another trivial case is the one where 1 > 0 and 2 > 0. Then there is a
constant b > 0 and b0 > 0 (b b0 ) with
1 b 2 0; (10)
1 b0 2 0; (11)
0
respectively. For latter use, b and b is de…ned as the smallest number with
(10) and the largest number with (11), respectively. Then, denoting by pM the
density of P M with respect to P M1 + P M2 ,
pM1 (x) tr M 1
sup b ;
pM2 (x) 0 M 1H tr M 2
pM2 (x) tr M 1 1
inf :
pM1 (x) 0 M 1H tr M 2 b0
Hence, using the assumption that f is …nite on (0; 1), we have the asserted
statement. Thus, the question is the case where at least one of them is not
full-rank.
Observe by (6) and (3),
Dfmax ( 1 jj 2 ) w1 tr 1 + w2 tr 2 + w10 tr 1 (1 ) + w20 tr 2 (1 )
holds for any projection and w, w0 2 Wf .
Suppose ker 1 is non tirivial. Then let P be the projection onto ker 1 . Then
above inequality shows that Dfmax ( 1 jj 2 ) can be bounded only if w2 stays …nite,
i.e.,
Wf R ( 1; a2 ]: (12)
On the other hand, suppose the above inclusion is true. To avoid the trivial
case, suppose Wf is not bounded from above, and 2 is full-rank. ( If the latter
is false, by the parallel argument as above shows Dfmax = 1.) By the latter
assumption, there is b > 0 with (10). Let t be the number such that b is a
subgradient of f . Then by the convexity of f and (10), we have
X
Dfmin ( 1 jj 2 ) sup (t tr 1 Mt + ( b (t t ) f (t )) tr 2 Mt )
fMt g t
X
= sup t tr ( 1 b 2 ) Mt + b t tr 1 f (t ) tr 2
fMt g t
b t tr 1 f (t ) tr 2;
7
Note (12) and (13) is equivalent to
f ( 1) > 1
and
dom f ( 1; a1 ];
respectively.
4 Continuity of Dfmin
Since Dfmin is jointly convex almost by de…nition, it is continuous in the interior
of dom Dfmin , or at the points where 1 > 0 and 2 > 0. By applying well-known
facts in convex analysis, a sort of continuity at the edge is easily proved. For a
given convex function f , we denote epi f := f( 1 ; 2 ) ; 2 f ( 1 )g.
The de…nition (3) is rewrited as, using (6),
8 9
0 0
<Z X 0
=
sup Df P M1 jjP M2 = sup w (x) dP M (x) ; w ( ) : measureable function into Wf
M0 M 0 ;w : ;
2f1;2g
8 9
<Z X =
= sup w (x) tr M 0 (dx) ; w ( ) :measureable function into Wf
M 0 ;w : ;
2f1;2g
8 ! 9
< X Z =
= sup tr w M (dw) ;
M : w2Wf ;
2f1;2g
(W ) 2 2 WH;f
Z
, 9M : POVM W w M (dw) :
w2Wf
Then
where
~ = (W )
W = (W1 ; W2 ) :
2
8
Lemma 4 Dfmin is a proper closed convex function which is positively homoge-
neous.
where X
hW
~ ( 1; 2) := tr ( W ) :
2f1;2g
T
Since each epi hW
~ is closed, (W ) 2WH;f epi hW
~ is also closed. Thus we have
2
the assertion.
If one of the following two are trule, it is easy to prove that this is the case:
9
(II) f^ (t) (= f 1
( t)) is operator convex on dom f^
V y f (T ) V f V yT V :
R
Therefore, de…ning a PVM E 00 by the spectral decomposition V y T V = s2dom f
sE 00 (ds),
Z Z
00
W1 sE (ds) ; W2 f (s) E 00 (ds) :
Also, observe
spec V y W1 V spec W1 s 2 dom f :
Thus, (15) is veri…ed. In the case that (II) holds, almost in the parallel manner
(15) is checked.
W0H;f can be expressed in the following way:
n o
W0H;f = W~ ; W1 T; W2 f (T ) ; spec T dom f :
~ 2 W0
This is checked by noticing that W H;f is equivallent to
W1 T;
Z Z
W2 w2 E (dw) f (w1 ) E (dw) = f (T ) :
w2Wf w2Wf
10
Theorem 6 Let f be a proper convex function with dom f (0; 1). If (15)
is true, then W0H;f = WH;f and thus
= tr XDf (T ) (Y ) :
11
With all de…nitions and assumptions are made, we now proceed to the analy-
sis of the maximal point of G (T ). Most tractable case is that there is a station-
ary point of G (T ) in dom G. Therefore, if
d G (T0 + sX)
= tr X 1 tr f 2 Df (T0 ) (X)g
ds s=0
= tr X ( 1 Df (T0 ) ( 2 )) = 0
holds for any Hermitian matrix X, T0 achives maximum. (Here we used the
fact that Df (T0 ) ( ) is self-conjugate.) Thus, we have
1 = Df (T0 ) ( 2 ) (19)
Therefore,
Dfmin ( 1 jj 2 ) = tr 1 T0 tr 2f (T0 )
= tr T0 Df (T0 ) ( 2 ) tr 2 f (T0 )
= tr Df (T0 ) (T0 ) 2 tr 2 f (T0 )
= tr fDf (T0 ) (T0 ) f (T0 )g 2
n 0
o
= tr T0 f (T0 ) f (T0 ) 2
0
= tr f f (T0 ) 2; (20)
Here, in the third identity holds since Df (T0 ) ( ) is self-adjoint, the …fth idenity
holds due to (18), and the last idenity is due to
f (x) = f (x) = sup tx f (t) :
t
We give a su¢ cient condition that the supremum is achived in the interior
of the domain, or equivallently, T0 with (19) exists:
12
Then G (T ) = G + G0 + G0 ,
G tr 1 T tr 2 (b (T t ) + f (t ) )
= tr ( 1 b 2 ) T + tr 2 (b t f (t ))
tr ( 1 b 2 ) t + tr 2 (b t f (t ))
= tr 1 (t ) tr 2 f (t ) ;
and
G0 tr 1 (t0 0
) tr 2f (t0 ) 0
:
Therefore,
G (T ) G (t + T + t0 0
):
To summerize:
Lemma 7 Suppose f is a proper closed convex function with dom f (0; 1).
Suppose also the assumption (I) is true. Then, the T in (16) can be restricted
to the set of all the Hermitian operators with spec T [t0 ; t ]
If H is …nite dimensional and 1 > 0, 2 > 0, this lemma means that the
supremum can be restricted to the interior of dom G, and that the supremum
of the di¤erentiable concave function G ( ) is achived by a point satisfying some
point. Thus the solution T0 to (19) achieve the supremum.
So far, we had supposed the assumption (I) is true. Now let us consider the
case where (II) holds and (I) does not. A trivial approach is to exchange 0
and 1 , and apply all the analysis replacing f by f^. This means the change of
variable from T to S := f (T ). But sometimes, the use of the variable T is
more preferable by technical reasons. In such cases, still we can use (19), because
of the following reason. Since the assumption (II) says that f^ (t) = f 1 (t)
is operator convex, it is continuously di¤erentiable. Thus the stable point with
respect to S is also a stable point with respect to T .
Lemma 8 Suppose f is proper, closed, convex, standard and dom f (0; 1).
Suppose also f is operator convex,dom f is not bounded from below, and
f ( 1) > 1. Then f is operator monotone.
Below, TH0 ()is the operator in supp 1 with Tsupp 1 jei = 1 T jei. (it is
essentially identical to 1 T 1 , except that 1 T 1 is de…ned in H. )
13
Lemma 9 Suppose also the domain of a function h is unbounded from below
and h ( 1) is …nite. Then, if X 0 is supported on ker 1 and its eigenvalues
are negative on ker 1 ,
Proof. Let rs and j's i (k's k = 1) be the -th eigenvalue and eigenvector of
T + sX, respectively. Since
1
lim s T + X = X;
s!1
implying
= h (T )supp 1
+ h ( 1) (1 1
):
14
Proof. Since h is operator convex, by Exerceise V.2.2 of [2],
(h (T + sX))supp 1
h (T + sX)supp 1
= h (Tsupp 1 ) ;
(h (T + sX))ker 1
h (T + sX)ker 1
f ( 1) 1ker 1
: (25)
lim 1 h (T + sX) (1 1 )
s!1
X n o
= lim h (rs ) lim 1 j's i h's j (1 1 )
s!1 s!1
= 0;
Theorem 11 Suppose f is proper, closed, convex, standard and dom f (0; 1).
Suppose also f is operator convex and dom f is not bounded from below. Then
if 1 is not full-rank,
Dfmin ( 1 jj 2 ) = Dfmin ( 1 jj 1 2 1
) f ( 1) tr 2 (1 1
): (26)
Proof. By Theorem 3, we only have to prove the assertion for the case 1 <
f ( 1) (otherwise, Dfmin ( 1 jj 2 ) = 1:)
By Lemma 8, f is operator monotone. Thus, if X < 0 is supported on
ker 1 , G (T + sX), where G is as of (17), is non-decrasing in s. Therefore,
15
Hence, by Lemma 10, we have
sup G (T + X) = tr 1 1
T 1
tr ( 2 )supp 1
f (Tsupp 1 )
X<0: supp X= ker 1
f ( 1) tr 2 (1 1 );
indicating (26).
8 Examples
8.1 Renyi-type, f ( )
Here we consider f ( ) which is de…ned by (1) on the non-negative half line.
On the negarive half-line, we de…ne it so that it becomes standard, i.e., f is
monotone increasing on dom f . Since = 0 and 1 give trivial functions, we
omit these cases. The relation
f^ ( ) = f1 ( );
and the condition (I) is satis…ed. Thus, if > 1, the condition (II) is satis…ed,
and
f (t) = ( 1) 1t 1; (t > 0) ;
1; (t 0) :
Thus, if 2, in addition the condition (I) is satis…ed.
For all the values of (6= 0; 1), f 0 (t) moves all over the positive half line
(0; 1). Thus, by (22) and (21), the supremum is achieved by T0 with (19).
In the case of = 1 and 2, we can solve the problem "explicitly". Observe
f2 is operator convex on dom f2 .
1 2
f2 (t) = 4t ; (t 0) ;
;
1; (t < 0) :
By (19), we have a Lyapunov equation
1
Df2 (T0 ) ( 2 ) = (T0 2 + 2 T0 ) = 1:
4
16
If 2 > 0, this can be solved about T0 ,
Z 0
T0 = 4 es 2
1e
s 2
ds 0;
1
If = 12 ,
1; (t > 0) ;
f 1 (t) = 1 1
4t ; (t 0) ;
2
and
1 1 1
Df 1 (T0 ) ( 2 ) = T 2 T0 = 1:
2 4 0
Thus q
1 1=2 1=2 1=2 1=2
T0 = 2 2 2 1 2 2 ;
and
1=2
1 2 1 1
Dfmin ( 1 jj 2 ) = tr 2 T = tr 2 T0
1=2
4 0 2
q
1=2 1=2
= tr 2 1 2 ;
17
8.2 On Cherno¤ and Hoe¤ding bound
The results about Renyi type quantity, especially when two states are pure, gives
another way to compute Cherno¤ bound and Hoe¤ding bound, whose classical
counter part is
1
C (p1 jjp2 ) := sup lim ln ( 1;n + 2;n )
n!1 n
= sup f ln ( Df (p1 jjp2 ))g
0< <1
and
1 1
Hr (p1 jjp2 ) := sup lim ln 1;n ; lim 2;n r
n!1 n n!1 n
r 1
= sup ln ( Df (p1 jjp2 )) ;
0< <1 1 (1 )
respectively. Here, 1;n is the probability that the test mistakenly judges the
true distribution (= p1 n , in fact,) as being p2 n . 2;n the other direction of the
error. Its quantum counterpart is de…ned by repalcing distributions by states,
and explicite form of these quantities are
C ( 1 jj 2 ) = sup ln DfP ( 1 jj 2 ) ;
0< <1
r 1
Hr (p1 jjp2 ) = sup ln DfP ( 1 jj 2 ) ;
0< <1 1 (1 )
where
1
DfP ( 1 jj 2 ) := tr 1 2 ; 0< < 1:
(See [1][3][4])
We con…rm these celebrated results for the case where = j' i h' j. In
fact, we have
1 n n
C ( 1 jj 2 ) = lim sup ln Dfmin 1 jj 2 ;
n!1 n 0< <1
r 1 n n
Hr (p1 jjp2 ) = lim sup ln Dfmin 1 jj 2 :
n!1 0< <1 1 n (1 )
" " is trivial. The achievability is not such straightforward in general since
the optimal measurement di¤ers for each . However, since our states are
pure, the supremum is in…nitely approximated by the sequence of measurements
independent of , and thus the equality holds. Also, the RHS can be computed
explicitly,
1 1
n n 2 (1 ) ln jh'1 j'2 ij ; 0 < 2;
ln Dfmin 1 jj 2 = 1
n 2 ln jh'1 j'2 ij ; 2 < 1:
The supremum is achived at = 0; 1, and the known result is con…rmed. In-
terestingly, even though they give the same supremum, they di¤er for almost
every value of .
18
8.3 Kullback-Leibler divergence
Let
ln ; ( 0) ;
fKL ( ) := ;
1; ( < 0) ;
Then
( R
p2 (x) ln pp12 (x)
(x) d (x) ; if supp p1 supp p2 ;
DfKL (p1 jjp2 ) =
1; otherwise:
Z 1
DfKL (T0 ) ( 2 ) = esT0 2e
(1 s)T0
ds = 1;
0
Z 1
1 1
Df^KL2 (S0 ) ( 1 ) = (s1 S0 ) 1 (s1 S0 ) ds = 2
0
= sup tr ( 1 2) T =k 1 2 k1 :
T : 1 spec T 1
Note, fTV does not satisfy (22) nor (21). Indeed, the supremum is achived
by a T whose eigenvalues are at the both end of the domain of fTV .
19
9 Relation to quantum Fisher information
Consider a parameterized family fp g 2R of probability density functions over
a …nite set, and suppose ! p is smooth, and supp p supp p0 . Suppose
also f is a convex function with all good features. Then using Taylor expansion
of f ,
1 1
lim 2 (Df (p jjp )
0 Df (p jjp )) = f 00 (1) J 2 ;
0!
( 0 ) 2
where
X (@p (x) =@ )2
J :=
x
p (x)
is the Fisher information of the family fp g 2R . This quantity characterizes
asymptotic behaviour of estimate of , and the above mentioned relation is also
important in deriving these results, and in relating estimation and hypothesis
test.
Thus exploring its quantum analogue is also of interest. Let f g 2R be a
family of density operators, and suppose ! is smooth. Then our task
min 0 2 0
here is to evaluate Df ( jj 0 ) up to ( ) for small j j. Naively
exchanging the order of limit and optimization, we have
1 1 00 0 2
lim Dfmin ( jj 0 ) Dfmin ( jj ) = f (1) max J M ( ) ;
!0
( 0 )2 2 M
max J M = J S :
M
Here,
2
J S := tr LS ;
is the SLD Fisher information and LS is called the symmtric logarithmic deriv-
ative of f g 2R is de…ned as a Hermitian operator satisfying the equation
@ 1 S
= L + LS :
@ 2
SLD Fisher information, like its classical analogue, well characterize the asymp-
totic behaviour of the optimal estimate of the unknown parameter . Thus, by
Dfmin ( jj ) = f (1), we obtain
1 1 00
lim Dfmin ( jj 0 ) Dfmin ( jj ) = f (1) J S : (28)
!0
( 0 )2 2
20
has non-trivial kernel, however, the remainder term of the Taylor expansion is
not necessarily bounded due to 1=pM -factor in the de…nition of the f -divergence.
For example, suppose ( 2 R) are pure states. Then Dfmin ( > 1; < 0),
min min
DfKL and DfKL diverge, and (28) is never true. On the other hand, in the case
2
that f ( 1) is …nite, f = f (0 < 1=2) for example, it is easy to see the
LHS of (28) equals constant multiple of J S . Hence, the naive argument as above
is not completely regorous, but not totally false. Below, we give deeper analysis
on this issue. As it will turn out, (28) requires some non-trivial correction, when
the rank of is neither full nor 1.
To use Theorem 11, we suppose the assumption (I) holds and dom f is un-
bounded from below. Also, suppose f is three times continuously di¤erentiable
in the neighbourhood of 1, and
f 00 (1) > 0.
(The last assumption is necessary for the f -divergence not to be constant at
0
0
.) This means t0 := f (1) lies in the interior of dom f , and
f (t) = t f 0 1
(t) f f0 1
(t)
is three times continuously di¤erentiable at the neighbourhood of t0 . Using
above identity,
f (t0 ) = t0 f (1) ;
f 0 (t0 ) = 1;
00 1
f (t0 ) = > 0;
f 00 (1)
For simplicity, in addition we also suppose the rank of is constant for all ,
and the map ! is three times continuously di¤erentiable.
By Theorem 11 we only have to maximize
G (T ; ; 0 ) := tr T tr 0 f (T ) f ( 1) (1 tr 0 )
where T moves over all the Hermitian operators living in the support H of ,
is the projector onto H , and spec T is a subset of dom f . Observe, if = 0,
the optimal T is t0 . Thus, we put
T 0 = t0 + Y:
0
Then, using Lemma 7, we have, for small j j,
1
kY k r0 1 k 0 k sup 00
(29)
t2[t0 c;t0 +c] f (t)
as detailed in Appendix B. Here, r0 is the smallest eigenvalue of and c is a
positive number such that
1
sup 00
< 1:
t2[t0 c;t0 +c] f (t)
21
Such c > 0 exists since f is three times continuously di¤erentiable at the
neighbourhood of t0 and f 00 (t0 ) > 0.
Since f is three times continuously di¤erntiable in the neighbourhood of t0
by assumption,
1 1
f (t0 + ) = f (t0 ) + f 0 (t0 ) + f 00 (t0 ) 2 + f 000 (t0 ) 3
2 6
1 2 1
= f (t0 ) + + 00 + f 000 (t0 ) 3 ;
2f (1) 6
for a certain t0 between t0 + and t0 . Due to the fact that t0 commute with
Y and t0 = f (1) + f (t0 ),
G (t0 + Y ; ; 0)
= tr (t0 + Y ) tr 0 f (t0 +Y) f ( 1) (1 tr 0 )
1
= tr (t0 +Y) tr 0 f (t0 ) + Y + Y2 f ( 1) (1 tr 0 )+C
2f 00 (1)
1 n o
2
= f (1) + (f (t0 ) f ( 1)) (1 tr 0) + tr 0 Y02 tr 0 (Y0 Y) +C
2f 00 (1)
1
f (1) + (f (t0 ) f ( 1)) (1 tr 0) + tr 0 Y02 + C;
2f 00 (1)
where C is bounded from above as
1 000 3
jCj sup f (t0 ) kY k
6 t0 2[t0 kY k;t0 +kY k]
0 3
= O( ) ;
= A Ay ;
where the family fA g 2R satis…es
@ 1
A = LS A :
@ 2
22
Then
1
A 0 = A + LS A ( 0
) + C1 ;
2
0 2
where C1 is O ( ) . De…ne the notations
Then since
1 0
0 = ( ) LS;1 + LS;1 + C2 ;
2
we have, by (30),
Y0 = f 00 (1) ( 0
) LS;1 + C3 ;
0 2
where C2 and C3 are O ( ) . Also,
(1 ) 0 (1 )
y
1 S 0 1 S 0
= (1 ) L A ( ) + C1 L A ( ) + C1 (1 )
2 2
y
1 S;2 0 1 S;2 0
= L A ( ) + (1 )C1 L A ( ) + (1 )C1
2 2
1 0 2
= ( ) LS;2 LS;2 + C4
4
0 3
where C4 is O ( ) . After all, since Dfmin ( jj ) = f (1),
1
lim 2 Dfmin ( jj 0 ) Dfmin ( jj )
0 ! ( 0 )
1 2 f 00 (1) 2
= (f (t0 ) f ( 1)) tr 0 LS;2 + tr LS;1 :
4 2
If is full rank ( in this case, LS;2 = 0) or pure (in this case LS;1 = 0)
the LHS of () equals a constant multiple of J S , though the constant di¤ers
depending on which case it is. But if the rank of is full or 1, the result is a
weighted sum of two components of SLD Fisher information, one is concerned
with the change on the support of and the other is concerned with the change
on the kernel of .
1. What is the necessary and su¢ cient condition of f such that (15) holds?
2. Rewrite the condition that f is operator monotone to a condition on f .
23
References
[1] K.M.R. Audenaert, J. Calsamiglia, Ll. Masanes, R. Munoz-Tapia, A. Acin,
E. Bagan, F. Verstraete, " Discriminating states: the quantum Cherno¤
bound," Physics Review Letters 98 160501, (2007)
[2] R. Bahatia, Matrix Analysis, Springer, (1997)
[3] F. Hiai, M. Mosonyi, D. Petz and C. Beny, "Quantum f-divergences and
error corrections," Reviews in Mathematical Physics, 23, 691–747 (2011)
[4] H. Nagaoka, "The converse part of the theorem for quantum Hoe¤ding
bound," quantph/0611289 .
[5] R. Rockafellar, Convex analysis (Princeton, 1970).
[6] H. Strasser, "Mathematical Theory of Statistics", de Gruyter (1985).
[7] E. Torgersen, "Comparison of Statistical Experiments", Cambridge Univer-
sity Press (1991).
[8] D. Luenberger, Optimization by vector space method (Wiley, 1969).
h (t)
lim = 0:
t!1 t
Thus, by Proposition 8.4 of [3], h can be written as
Z
t
h (t) = h (0) + t d ( )
(0;1) t +
R 1
using a non-negative measure with (0;1) 1+
d ( ) < 1: Since
Z
h (t) h (0) 1
= d ( )
t (0;1) t+
and Z
1
lim d ( )=0
t!1 (0;1) t+
24
by Lebesgue’s dominated convergence theorem, we have
= 0:
Otherwise, there is t > 0 such that h (t) < h (0), contradicting with the assump-
tion that h (t) is monotone decreasing. Since the function
t
t! = 1+
t+ t+
are oprator monotone non-increasing, h (t) is operator monotone decreasing,
implying the assertion.
If dom f = ( 1; a), due to the above argument, f is operator monotone
increasing on ( 1; a "] for any " > 0. Suppose the spectrum of A1 and A2 is
a subset of ( 1; a], and A1 A2 . Then since A1 "1 A2 "1,
f (A1 "1) f (A2 "1) :
Letting " ! 0, we have f (A1 ) f (A2 ), meaning that f is operator monotone
increasing on ( 1; a).
B Proof of (29)
1 1=2 1=2
1 = 1 ( 0 )
b
1=2 1=2
= ( 0 )
r0 1 k 0 k
By rearranging terms, we have
1
b
1 r0 1 k 0 k
1
1 + r0 k 0 k:
Almost in parallel manner,
b0 1 r0 1 k 0 k:
Then by Lemma 7,
kY k max fjt t0 j ; jt0 t0 jg
1
sup 00 (t)
max fjb 1j ; jb0 1jg
f 0 (t)2[b0 ;b ] f
1
r0 1 k 0 k sup 00
:
f 0 (t)2[b0 ;b ] f (t)
25
Since f 0 is continuous, when r0 1 k 0 k is small enough, the set of all
t with f 0 (t) 2 [b0 ; b ] is the subset of [t0 c; t0 + c]. Therefore, we have (29).
26