
CONCENTRATION OF EIGENVALUE SUMS AND GENERALIZED LIEB’S CONCAVITY THEOREM

5.6 Supporting Materials

5.6.2 Exterior Algebra

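Here we give a brief review of exterior algebras on the vector space $\mathbb{C}^n$. For more details, one may refer to [14, 100]. For convenience, the notation in our paper may differ from that in other materials. For any $1 \le k \le n$, let $\wedge^k(\mathbb{C}^n)$ denote the $k$th exterior power of $\mathbb{C}^n$, equipped with the inner product

\[
\langle\cdot,\cdot\rangle_{\wedge^k} : \wedge^k(\mathbb{C}^n) \times \wedge^k(\mathbb{C}^n) \longrightarrow \mathbb{C},
\]
\[
\langle u_1\wedge\cdots\wedge u_k,\ v_1\wedge\cdots\wedge v_k\rangle_{\wedge^k} = \det
\begin{bmatrix}
\langle u_1,v_1\rangle & \langle u_1,v_2\rangle & \cdots & \langle u_1,v_k\rangle \\
\langle u_2,v_1\rangle & \langle u_2,v_2\rangle & \cdots & \langle u_2,v_k\rangle \\
\vdots & \vdots & \ddots & \vdots \\
\langle u_k,v_1\rangle & \langle u_k,v_2\rangle & \cdots & \langle u_k,v_k\rangle
\end{bmatrix},
\]

where $\langle u,v\rangle = u^*v$ is the standard $\ell_2$ inner product on $\mathbb{C}^n$.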
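As a small illustration of this inner product, the following sketch evaluates it as a Gram-type determinant for $k = 2$ in $\mathbb{C}^3$. The vectors and the helper names (`det`, `inner`, `wedge_inner`) are hypothetical choices made for this example, not part of the original text.

```python
from itertools import permutations

def det(m):
    """Leibniz-formula determinant; adequate for the small matrices used here."""
    n = len(m)
    total = 0
    for perm in permutations(range(n)):
        inversions = sum(1 for a in range(n) for b in range(a + 1, n)
                         if perm[a] > perm[b])
        term = -1 if inversions % 2 else 1
        for row, col in enumerate(perm):
            term *= m[row][col]
        total += term
    return total

def inner(u, v):
    """Standard inner product <u, v> = u^* v (conjugate-linear in u)."""
    return sum(a.conjugate() * b for a, b in zip(u, v))

def wedge_inner(us, vs):
    """<u_1 ^ ... ^ u_k, v_1 ^ ... ^ v_k> as the determinant of [<u_i, v_j>]."""
    return det([[inner(u, v) for v in vs] for u in us])

# Hypothetical test vectors in C^3.
u1, u2 = [1, 1j, 0], [0, 1, 2]
v1, v2 = [1, 0, 1], [2, 1j, 0]

value = wedge_inner([u1, u2], [v1, v2])
swapped = wedge_inner([u2, u1], [v1, v2])   # u_2 ^ u_1 = -(u_1 ^ u_2)
repeated = wedge_inner([u1, u1], [v1, v2])  # u_1 ^ u_1 = 0
```

Swapping two factors negates the determinant and a repeated factor makes it vanish, which matches the alternating nature of the wedge product.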

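Let $\mathcal{L}(\wedge^k(\mathbb{C}^n))$ denote the space of all linear operators from $\wedge^k(\mathbb{C}^n)$ to itself. For any matrices $A^{(1)}, A^{(2)}, \dots, A^{(k)} \in \mathbb{C}^{n\times n}$, we can define an element of $\mathcal{L}(\wedge^k(\mathbb{C}^n))$:

\[
\begin{aligned}
M^{(k)}(A^{(1)}, A^{(2)}, \dots, A^{(k)}) :\ \wedge^k(\mathbb{C}^n) &\longrightarrow \wedge^k(\mathbb{C}^n), \\
v_1\wedge v_2\wedge\cdots\wedge v_k &\longmapsto \sum_{\sigma\in S_k} A^{(\sigma(1))}v_1\wedge A^{(\sigma(2))}v_2\wedge\cdots\wedge A^{(\sigma(k))}v_k,
\end{aligned}
\tag{5.45}
\]

where $S_k$ is the symmetric group on $k$ elements. Clearly, the map

\[
(A^{(1)}, A^{(2)}, \dots, A^{(k)}) \longmapsto M^{(k)}(A^{(1)}, A^{(2)}, \dots, A^{(k)})
\]

is symmetric in $A^{(1)}, A^{(2)}, \dots, A^{(k)}$ and is linear in each single $A^{(i)}$. For simplicity, we will use the following notation for any matrices $A, B, C \in \mathbb{C}^{n\times n}$:

\begin{align}
M_0^{(k)}(A) &= \frac{1}{k!}\, M^{(k)}(A, \dots, A), \tag{5.46a} \\
M_1^{(k)}(A;B) &= \frac{1}{(k-1)!}\, M^{(k)}(A, B, \dots, B), \tag{5.46b} \\
M_2^{(k)}(A,B;C) &= \frac{1}{(k-2)!}\, M^{(k)}(A, B, C, \dots, C). \tag{5.46c}
\end{align}

To avoid confusion, we define $M_1^{(1)}(A;B) = M_0^{(1)}(A)$, $M_2^{(1)}(A,B;C) = 0$, and $M_2^{(2)}(A,B;C) = M_1^{(2)}(A;B)$. The identity operator in $\mathcal{L}(\wedge^k(\mathbb{C}^n))$ is $M_0^{(k)}(I_n)$. We will use the following properties: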

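• Invertibility: if $A \in \mathbb{C}^{n\times n}$ is invertible, then $\big(M_0^{(k)}(A)\big)^{-1} = M_0^{(k)}(A^{-1})$.

• Adjoint: for any $A \in \mathbb{C}^{n\times n}$, $\big(M_0^{(k)}(A)\big)^* = M_0^{(k)}(A^*)$, with respect to the inner product $\langle\cdot,\cdot\rangle_{\wedge^k}$.

• Positiveness: if $A \in \mathbf{H}_n$, then $M_0^{(k)}(A)$ is Hermitian; if $A \in \mathbf{H}_n^{+}$, then $M_0^{(k)}(A) \succeq 0$; if $A \in \mathbf{H}_n^{++}$, then $M_0^{(k)}(A) \succ 0$.

• Product properties: for any $A, B, C, D \in \mathbb{C}^{n\times n}$, we have

\begin{align}
M_0^{(k)}(AB) &= M_0^{(k)}(A)\, M_0^{(k)}(B), \tag{5.47a} \\
M_1^{(k)}(A;B)\, M_0^{(k)}(C) &= M_1^{(k)}(AC;BC), \tag{5.47b} \\
M_0^{(k)}(C)\, M_1^{(k)}(A;B) &= M_1^{(k)}(CA;CB), \tag{5.47c} \\
M_1^{(k)}(A;C)\, M_1^{(k)}(B;D) &= M_2^{(k)}(AD, CB; CD) + M_1^{(k)}(AB; CD). \tag{5.47d}
\end{align}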

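• Derivative properties: for any differentiable functions $A(t), B(t) : \mathbb{R} \longrightarrow \mathbb{C}^{n\times n}$, we have

\begin{align}
\frac{\partial}{\partial t} M_0^{(k)}(A(t)) &= M_1^{(k)}(A'(t); A(t)), \tag{5.48a} \\
\frac{\partial}{\partial t} M_1^{(k)}(A(t); B(t)) &= M_1^{(k)}(A'(t); B(t)) + M_2^{(k)}(A(t), B'(t); B(t)). \tag{5.48b}
\end{align}

Next we consider the natural basis of $\wedge^k(\mathbb{C}^n)$,

\[
\{e_{i_1}\wedge e_{i_2}\wedge\cdots\wedge e_{i_k}\}_{1\le i_1<i_2<\cdots<i_k\le n},
\]

which is orthonormal under the inner product $\langle\cdot,\cdot\rangle_{\wedge^k}$. Then the trace function on $\mathcal{L}(\wedge^k(\mathbb{C}^n))$ is defined as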

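\[
\operatorname{Tr} : \mathcal{L}(\wedge^k(\mathbb{C}^n)) \longrightarrow \mathbb{C}, \qquad
\operatorname{Tr} F = \sum_{1\le i_1<i_2<\cdots<i_k\le n} \big\langle e_{i_1}\wedge e_{i_2}\wedge\cdots\wedge e_{i_k},\ F(e_{i_1}\wedge e_{i_2}\wedge\cdots\wedge e_{i_k})\big\rangle_{\wedge^k}. \tag{5.49}
\]

It is not hard to check that this trace function is also invariant under cyclic permutation, i.e. $\operatorname{Tr} FG = \operatorname{Tr} GF$ for any $F, G \in \mathcal{L}(\wedge^k(\mathbb{C}^n))$. Then for any $A^{(1)}, \dots, A^{(k)} \in \mathbb{C}^{n\times n}$, the trace $\operatorname{Tr}\big[M^{(k)}(A^{(1)}, \dots, A^{(k)})\big]$ coincides, up to a constant factor, with the mixed discriminant, as one can check that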

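\begin{align}
\operatorname{Tr} M^{(k)}(A^{(1)}, \dots, A^{(k)})
&= \sum_{\sigma\in S_k}\ \sum_{1\le i_1<\cdots<i_k\le n} \big\langle e_{i_1}\wedge\cdots\wedge e_{i_k},\ A^{(\sigma(1))}e_{i_1}\wedge\cdots\wedge A^{(\sigma(k))}e_{i_k}\big\rangle_{\wedge^k} \notag \\
&= \frac{n!}{(n-k)!}\, D\big(A^{(1)}, \dots, A^{(k)}, \underbrace{I_n, \dots, I_n}_{n-k}\big). \tag{5.50}
\end{align}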

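From this observation, we can now express the $k$-trace of a matrix $A \in \mathbb{C}^{n\times n}$ as

\[
\operatorname{Tr}_k[A] = \operatorname{Tr} M_0^{(k)}(A). \tag{5.51}
\]

For those who are familiar with exterior algebra, it is clear that the spectrum of $M_0^{(k)}(A)$ is just $\{\lambda_{i_1}\lambda_{i_2}\cdots\lambda_{i_k}\}_{1\le i_1<i_2<\cdots<i_k\le n}$, where $\lambda_1, \lambda_2, \dots, \lambda_n$ are the eigenvalues of $A$. In this way it is more convenient to see that

\[
\operatorname{Tr} M_0^{(k)}(A) = \sum_{1\le i_1<\cdots<i_k\le n} \lambda_{i_1}\lambda_{i_2}\cdots\lambda_{i_k} = \operatorname{Tr}_k[A],
\]

the sum of the spectrum of $M_0^{(k)}(A)$. Our proof of Theorem 5.3.1 will be based on the expression (5.51).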
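The identities above can be checked on a small example. The sketch below uses the standard fact that, in the natural basis, the matrix of $M_0^{(k)}(A)$ has the $k\times k$ minors of $A$ as its entries (the $k$th compound matrix). It then compares $\operatorname{Tr} M_0^{(k)}(A)$ with the elementary symmetric polynomial of the eigenvalues, and tests the product property (5.47a). The matrices `A` and `B` and all function names are hypothetical; `A` is taken upper triangular so that its eigenvalues can be read off the diagonal.

```python
from itertools import combinations, permutations
from math import prod

def det(m):
    """Leibniz-formula determinant; adequate for the small matrices used here."""
    n = len(m)
    total = 0
    for perm in permutations(range(n)):
        inversions = sum(1 for a in range(n) for b in range(a + 1, n)
                         if perm[a] > perm[b])
        term = -1 if inversions % 2 else 1
        for row, col in enumerate(perm):
            term *= m[row][col]
        total += term
    return total

def compound(a, k):
    """Matrix of M_0^(k)(A) in the basis e_{i_1} ^ ... ^ e_{i_k}: all k x k minors."""
    subsets = list(combinations(range(len(a)), k))
    return [[det([[a[i][j] for j in cols] for i in rows]) for cols in subsets]
            for rows in subsets]

def k_trace(a, k):
    """Tr_k[A] = Tr M_0^(k)(A), i.e. the sum of the k x k principal minors."""
    c = compound(a, k)
    return sum(c[i][i] for i in range(len(c)))

def elementary_symmetric(values, k):
    """Sum of all products of k distinct entries of `values`."""
    return sum(prod(c) for c in combinations(values, k))

def matmul(x, y):
    return [[sum(x[i][l] * y[l][j] for l in range(len(y)))
             for j in range(len(y[0]))] for i in range(len(x))]

# Upper-triangular A: its eigenvalues are the diagonal entries 2, -1, 3, 5.
A = [[2, 1, 0, 4],
     [0, -1, 3, 1],
     [0, 0, 3, 2],
     [0, 0, 0, 5]]
B = [[1, 0, 2, 1],
     [3, 1, 0, 0],
     [0, 2, 1, 1],
     [1, 0, 0, 2]]
eigenvalues_of_A = [2, -1, 3, 5]

k_traces = [k_trace(A, k) for k in range(1, 5)]
symmetric_sums = [elementary_symmetric(eigenvalues_of_A, k) for k in range(1, 5)]
multiplicative = all(
    compound(matmul(A, B), k) == matmul(compound(A, k), compound(B, k))
    for k in range(1, 5)
)
```

Integer entries keep every comparison exact, so no floating-point tolerance is needed.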

In fact, our proof of the main theorem can be carried out without introducing the exterior algebra, using only the notation of mixed discriminants. The advantage of using exterior algebra is that it interprets the $k$-trace as the ordinary trace of operators on a space of higher dimension, so our $k$-trace functions have a nicer form that imitates the trace function in the original Lieb's theorem. For the same reason, we are able to construct our proof by following the arguments of Lieb's original proof in [67].

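We next introduce some notation to simplify the expressions in what follows. For any $n$ real numbers $\lambda_1, \lambda_2, \dots, \lambda_n \in \mathbb{R}$, we define the three symmetric forms

\begin{align}
p^{(n,k)} &= \sum_{1\le i_1<i_2<\cdots<i_k\le n} \lambda_{i_1}\lambda_{i_2}\cdots\lambda_{i_k}, && 1\le k\le n, \tag{5.52a} \\
d_i^{(n,k)} &= \sum_{\substack{1\le j_1<j_2<\cdots<j_{k-1}\le n \\ i\notin\{j_1,j_2,\dots,j_{k-1}\}}} \lambda_{j_1}\lambda_{j_2}\cdots\lambda_{j_{k-1}}, && 2\le k\le n,\ 1\le i\le n, \tag{5.52b} \\
g_{ij}^{(n,k)} &= \sum_{\substack{1\le l_1<l_2<\cdots<l_{k-2}\le n \\ i,j\notin\{l_1,l_2,\dots,l_{k-2}\}}} \lambda_{l_1}\lambda_{l_2}\cdots\lambda_{l_{k-2}}, && 3\le k\le n,\ 1\le i,j\le n,\ i\ne j. \tag{5.52c}
\end{align}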

For consistency, we define $d_i^{(n,k)} = 1$ if $k = 1$; $g_{ij}^{(n,k)} = 1$ if $k = 2$ and $i \ne j$; $g_{ij}^{(n,k)} = 0$ if $k = 1$ or $i = j$. We also define $p^{(n,k)} = d_i^{(n,k)} = g_{ij}^{(n,k)} = 0$ if $k > n$. Throughout this paper, whenever we are given some real numbers $\lambda_1, \lambda_2, \dots, \lambda_n$, the quantities $p^{(n,k)}, d_i^{(n,k)}, g_{ij}^{(n,k)}$ are always defined with respect to $\{\lambda_i\}_{1\le i\le n}$. The following relations are easy to verify from the definitions above, and will be useful in our proofs of lemmas and theorems. For any $n, k$, and any $1 \le i, j \le n$ such that $i \ne j$, we have the expansion relations

\[
p^{(n,k)} = \lambda_i\, d_i^{(n,k)} + d_i^{(n,k+1)}, \qquad d_i^{(n,k)} = \lambda_j\, g_{ij}^{(n,k)} + g_{ij}^{(n,k+1)}. \tag{5.53}
\]

With the notation defined above, we state Lemma 5.6.4. The proof is straightforward from the definitions, so we omit it here.
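As a quick sanity check of the expansion relations (5.53), the sketch below evaluates $p^{(n,k)}$, $d_i^{(n,k)}$ and $g_{ij}^{(n,k)}$ by brute force for a hypothetical choice of $\lambda_1, \dots, \lambda_n$, including the boundary conventions stated above. The function names are illustrative, and indices are 0-based in the code.

```python
from itertools import combinations
from math import prod

def p(lams, k):
    """p^(n,k): sum of all products of k of the lambdas (0 if k > n)."""
    return sum(prod(c) for c in combinations(lams, k))

def d(lams, k, i):
    """d_i^(n,k): products of k-1 lambdas with index i excluded (1 if k = 1)."""
    rest = [x for idx, x in enumerate(lams) if idx != i]
    return sum(prod(c) for c in combinations(rest, k - 1))

def g(lams, k, i, j):
    """g_ij^(n,k): products of k-2 lambdas with i and j excluded; 0 if k = 1 or i = j."""
    if i == j or k < 2:
        return 0
    rest = [x for idx, x in enumerate(lams) if idx not in (i, j)]
    return sum(prod(c) for c in combinations(rest, k - 2))

def relations_hold(lams):
    """Check both identities in (5.53) for every k = 1..n and every pair i != j."""
    n = len(lams)
    for k in range(1, n + 1):
        for i in range(n):
            if p(lams, k) != lams[i] * d(lams, k, i) + d(lams, k + 1, i):
                return False
            for j in range(n):
                if j == i:
                    continue
                rhs = lams[j] * g(lams, k, i, j) + g(lams, k + 1, i, j)
                if d(lams, k, i) != rhs:
                    return False
    return True

lams = [2, -3, 5, 7]  # hypothetical values of lambda_1..lambda_4
ok = relations_hold(lams)
```

The empty product equals 1, which is why `d` returns 1 for $k = 1$ and `g` returns 1 for $k = 2$ without special cases.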

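Lemma 5.6.4. For any $A, B \in \mathbb{C}^{n\times n}$, and any diagonal matrix $\Lambda \in \mathbb{C}^{n\times n}$ with diagonal entries $\lambda_1, \lambda_2, \dots, \lambda_n$, we have the following identities

\begin{align}
\operatorname{Tr} M_0^{(k)}(\Lambda) &= p^{(n,k)}, \tag{5.54a} \\
\operatorname{Tr} M_1^{(k)}(A;\Lambda) &= \sum_{i=1}^{n} A_{ii}\, d_i^{(n,k)}, \tag{5.54b} \\
\operatorname{Tr} M_2^{(k)}(A,B;\Lambda) &= \sum_{1\le i,j\le n} \big(A_{ii}B_{jj} - A_{ji}B_{ij}\big)\, g_{ij}^{(n,k)}, \tag{5.54c}
\end{align}

for all $1 \le k \le n$, where $p^{(n,k)}, d_i^{(n,k)}, g_{ij}^{(n,k)}$ are defined with respect to $\lambda_1, \lambda_2, \dots, \lambda_n$.

We now provide an alternative proof of Lemma 5.4.4, based on Lemma 5.6.5.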
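The identities of Lemma 5.6.4 can be tested numerically. The sketch below relies on a reading of definitions (5.45) and (5.46): $M_1^{(k)}(A;\Lambda)$ applies $A$ to one wedge slot and $\Lambda$ to the others, summed over the slot, and $M_2^{(k)}(A,B;\Lambda)$ applies $A$ and $B$ to two distinct slots, summed over ordered pairs of slots. Traces are taken in the natural basis via the determinant inner product. The matrices, the values of $\lambda_i$, and all function names are hypothetical.

```python
from itertools import combinations, permutations
from math import prod

def det(m):
    """Leibniz-formula determinant; adequate for the small matrices used here."""
    n = len(m)
    total = 0
    for perm in permutations(range(n)):
        inversions = sum(1 for a in range(n) for b in range(a + 1, n)
                         if perm[a] > perm[b])
        term = -1 if inversions % 2 else 1
        for row, col in enumerate(perm):
            term *= m[row][col]
        total += term
    return total

def slot_trace(n, k, pick):
    """Sum over basis e_I of <e_I, W_1 e_{i_1} ^ ... ^ W_k e_{i_k}> with W_c = pick(c)."""
    return sum(det([[pick(col)[idx[row]][idx[col]] for col in range(k)]
                    for row in range(k)])
               for idx in combinations(range(n), k))

def trace_m0(lam, k):
    return slot_trace(len(lam), k, lambda c: lam)

def trace_m1(a, lam, k):
    return sum(slot_trace(len(a), k, lambda c, s=s: a if c == s else lam)
               for s in range(k))

def trace_m2(a, b, lam, k):
    return sum(slot_trace(len(a), k,
                          lambda c, s=s, t=t: a if c == s else b if c == t else lam)
               for s in range(k) for t in range(k) if s != t)

def p(lams, k):
    return sum(prod(c) for c in combinations(lams, k))

def d(lams, k, i):
    rest = [x for idx, x in enumerate(lams) if idx != i]
    return sum(prod(c) for c in combinations(rest, k - 1))

def g(lams, k, i, j):
    if i == j or k < 2:
        return 0
    rest = [x for idx, x in enumerate(lams) if idx not in (i, j)]
    return sum(prod(c) for c in combinations(rest, k - 2))

lams = [2, -3, 5, 7]  # hypothetical diagonal of Lambda
n = len(lams)
Lam = [[lams[i] if i == j else 0 for j in range(n)] for i in range(n)]
A = [[1, 2, 0, -1], [3, -2, 1, 0], [0, 1, 4, 2], [1, 0, -3, 2]]
B = [[2, 0, 1, 1], [-1, 3, 0, 2], [1, 1, -2, 0], [0, 2, 1, 1]]

checks = []
for k in range(1, n + 1):
    rhs_b = sum(A[i][i] * d(lams, k, i) for i in range(n))
    rhs_c = sum((A[i][i] * B[j][j] - A[j][i] * B[i][j]) * g(lams, k, i, j)
                for i in range(n) for j in range(n))
    checks.append((trace_m0(Lam, k) == p(lams, k),      # (5.54a)
                   trace_m1(A, Lam, k) == rhs_b,        # (5.54b)
                   trace_m2(A, B, Lam, k) == rhs_c))    # (5.54c)
```

For $k = 1$ the sum defining `trace_m2` is empty, which agrees with the convention $M_2^{(1)}(A,B;C) = 0$.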

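Lemma 5.6.5. For any $A \in \mathbf{H}_n$, we have

\[
M_0^{(k)}\big(\exp(A)\big) = \exp\big(M_1^{(k)}(A; I_n)\big).
\]

Proof. We need to show that for any $v_1\wedge v_2\wedge\cdots\wedge v_k \in \wedge^k(\mathbb{C}^n)$,

\[
M_0^{(k)}\big(\exp(A)\big)(v_1\wedge v_2\wedge\cdots\wedge v_k) = \exp\big(M_1^{(k)}(A; I_n)\big)(v_1\wedge v_2\wedge\cdots\wedge v_k). \tag{5.55}
\]

We use the Taylor expansion of $e^x$ to expand

\[
M_0^{(k)}\big(\exp(A)\big) = M_0^{(k)}\Big(\sum_{j=0}^{+\infty} \frac{1}{j!} A^j\Big), \qquad
\exp\big(M_1^{(k)}(A; I_n)\big) = \sum_{j=0}^{+\infty} \frac{1}{j!} \big(M_1^{(k)}(A; I_n)\big)^j.
\]

Then for any integers $j_1, j_2, \dots, j_k \ge 0$, the coefficient of the term $A^{j_1}v_1\wedge A^{j_2}v_2\wedge\cdots\wedge A^{j_k}v_k$ on the left-hand side of (5.55) is

\[
\frac{1}{j_1!\, j_2! \cdots j_k!},
\]

and the coefficient of the same term on the right-hand side of (5.55) is also

\[
\frac{1}{J!} \binom{J}{j_1} \binom{J-j_1}{j_2} \cdots \binom{J-j_1-j_2-\cdots-j_{k-1}}{j_k} = \frac{1}{j_1!\, j_2! \cdots j_k!},
\]

where $J = j_1 + j_2 + \cdots + j_k$.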
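The counting identity used for the right-hand side of (5.55) is the usual multinomial coefficient written as a product of binomial coefficients. A minimal sketch that checks it in exact rational arithmetic for small exponents follows; the function names are illustrative only.

```python
from fractions import Fraction
from itertools import product
from math import comb, factorial, prod

def rhs_coefficient(js):
    """(1/J!) * C(J, j_1) * C(J - j_1, j_2) * ..., where J = j_1 + ... + j_k."""
    total = sum(js)
    remaining = total
    ways = 1
    for j in js:
        ways *= comb(remaining, j)  # choose which remaining factors act on this slot
        remaining -= j
    return Fraction(ways, factorial(total))

def lhs_coefficient(js):
    """1 / (j_1! j_2! ... j_k!)."""
    return Fraction(1, prod(factorial(j) for j in js))

# Compare both coefficients for all exponent triples with entries in 0..4.
all_match = all(rhs_coefficient(js) == lhs_coefficient(js)
                for js in product(range(5), repeat=3))
```

The product of binomials counts the ways to distribute the $J$ factors of $M_1^{(k)}(A; I_n)$ among the $k$ wedge slots so that slot $i$ receives exactly $j_i$ of them.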

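An alternative proof of Lemma 5.4.4. Using Lemma 5.6.5 and the original Golden-Thompson (GT) inequality for the ordinary trace, we have

\begin{align*}
\operatorname{Tr}_k[\exp(A+B)] &= \operatorname{Tr} M_0^{(k)}\big(\exp(A+B)\big) \\
&= \operatorname{Tr} \exp\big(M_1^{(k)}(A+B; I_n)\big) \\
&= \operatorname{Tr} \exp\big(M_1^{(k)}(A; I_n) + M_1^{(k)}(B; I_n)\big) \\
&\le \operatorname{Tr}\big[\exp\big(M_1^{(k)}(A; I_n)\big) \exp\big(M_1^{(k)}(B; I_n)\big)\big] \\
&= \operatorname{Tr}\big[M_0^{(k)}\big(\exp(A)\big)\, M_0^{(k)}\big(\exp(B)\big)\big] \\
&= \operatorname{Tr}_k[\exp(A)\exp(B)],
\end{align*}

where we have used that $M_1^{(k)}(X; I_n)$ is linear in $X$, and the product property (5.47a) in the last step. As shown by Petz [93], in the original GT inequality, the equality $\operatorname{Tr}[\exp(A+B)] = \operatorname{Tr}[\exp(A)\exp(B)]$ holds for $A, B \in \mathbf{H}_n$ if and only if $AB = BA$. Therefore, according to our calculation above, the equality $\operatorname{Tr}_k[\exp(A+B)] = \operatorname{Tr}_k[\exp(A)\exp(B)]$ holds if and only if

\[
M_1^{(k)}(A; I_n)\, M_1^{(k)}(B; I_n) = M_1^{(k)}(B; I_n)\, M_1^{(k)}(A; I_n). \tag{5.56}
\]

However, one can check by definition that (5.56) is true if and only if $AB = BA$.

5.6.3 Derivatives of Some Matrix Functions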

Recall that a basic but important way to prove concavity of a twice differentiable function $f(t)$ is to show that $f''(t) \le 0$. Similarly, one way to prove concavity of a twice differentiable multivariate function $f(x)$ is to show that the second directional derivative satisfies $\frac{\partial^2}{\partial t^2} f(x+ty)\big|_{t=0} \le 0$ for every admissible direction $y$. We will use this idea to prove the concavity of the $k$-trace functions (5.18) and (5.19).

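For this purpose, we need the following matrix derivative formulas.

• Consider a function $A(t) : (a,b) \longrightarrow \mathbf{H}_n$ such that $A(t)$ is differentiable on $(a,b)$. Then we have [132]

\[
\frac{\partial}{\partial t} \exp\big(A(t)\big) = \int_0^1 \exp\big(sA(t)\big)\, A'(t)\, \exp\big((1-s)A(t)\big)\, ds, \tag{5.57}
\]

where $A'(t)$ denotes the derivative of $A(t)$ with respect to $t$.

• Consider a function $A(t) : (a,b) \longrightarrow \mathbf{H}_n^{++}$ such that $A(t)$ is differentiable on $(a,b)$. Then we have [67]

\[
\frac{\partial}{\partial t} \big(A(t)\big)^{-1} = -\big(A(t)\big)^{-1} A'(t) \big(A(t)\big)^{-1}, \tag{5.58}
\]

and

\[
\frac{\partial}{\partial t} \log\big(A(t)\big) = \int_0^\infty \big(A(t)+\tau I_n\big)^{-1} A'(t) \big(A(t)+\tau I_n\big)^{-1}\, d\tau. \tag{5.59}
\]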
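Formula (5.58) is easy to test numerically. The following sketch compares its right-hand side with a central finite difference for a hypothetical path $A(t) = A_0 + tH$ of real symmetric positive definite $2\times 2$ matrices. The path, the step size, and the function names are assumptions made for illustration, and only (5.58) is checked here.

```python
def inv2(m):
    """Inverse of a 2 x 2 matrix via the adjugate formula."""
    (a, b), (c, d) = m
    determinant = a * d - b * c
    return [[d / determinant, -b / determinant],
            [-c / determinant, a / determinant]]

def matmul2(x, y):
    return [[sum(x[i][l] * y[l][j] for l in range(2)) for j in range(2)]
            for i in range(2)]

def path(t):
    """Hypothetical path A(t) = A0 + t * H, positive definite for t near 0.1."""
    return [[2.0 + t, 0.5 - t],
            [0.5 - t, 3.0 + 2.0 * t]]

PATH_DERIVATIVE = [[1.0, -1.0], [-1.0, 2.0]]  # A'(t) = H, constant in t

def derivative_by_formula(t):
    """Right-hand side of (5.58): -A(t)^{-1} A'(t) A(t)^{-1}."""
    a_inv = inv2(path(t))
    sandwich = matmul2(matmul2(a_inv, PATH_DERIVATIVE), a_inv)
    return [[-entry for entry in row] for row in sandwich]

def derivative_by_difference(t, h=1e-5):
    """Central difference approximation of d/dt A(t)^{-1}."""
    plus, minus = inv2(path(t + h)), inv2(path(t - h))
    return [[(plus[i][j] - minus[i][j]) / (2 * h) for j in range(2)]
            for i in range(2)]

def max_abs_diff(x, y):
    return max(abs(x[i][j] - y[i][j]) for i in range(2) for j in range(2))

gap = max_abs_diff(derivative_by_formula(0.1), derivative_by_difference(0.1))
```

The two results should agree up to the discretization error of the central difference, which is of order $h^2$.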
