
POLYNOMIAL INEQUALITIES FOR NON-COMMUTING OPERATORS∗

JOHN E. MCCARTHY† AND RICHARD M. TIMONEY‡

Abstract. We prove an inequality for polynomials applied in a symmetric way to non-commuting operators.

Key words. Ando inequality, Non-commuting, Symmetric functional calculus.

AMS subject classifications. 15A60.

1. Introduction. J. von Neumann [9] proved an inequality about the norm of a polynomial applied to a contraction on a Hilbert space $H$. Let $\mathbb{D}$ be the unit disk and $\mathbb{T}$ the unit circle in $\mathbb{C}$, and for any polynomial $p$ let $\|p\|_X$ be the supremum of the modulus of $p$ on the set $X$. The result is that

\[ T \in \mathcal{B}(H),\ \|T\| \le 1 \ \Rightarrow\ \|p(T)\| \le \|p\|_{\mathbb{D}}. \tag{1.1} \]

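For polynomials $p(z) = p(z_1, z_2, \dots, z_n) = \sum_{|\alpha| \le N} c_\alpha z^\alpha$ in $n$ variables we use the standard multi-index notation (where $\alpha = (\alpha_1, \alpha_2, \dots, \alpha_n)$ has $0 \le \alpha_j \in \mathbb{Z}$ for $1 \le j \le n$, $|\alpha| = \sum_{j=1}^n \alpha_j$, $z^\alpha = \prod_{j=1}^n z_j^{\alpha_j}$). There is an obvious way of applying $p$ to an $n$-tuple $T = (T_1, T_2, \dots, T_n)$ of commuting operators $T_j \in \mathcal{B}(H)$ ($1 \le j \le n$), namely

\[ p(T) = p(T_1, T_2, \dots, T_n) = \sum_{|\alpha| \le N} c_\alpha T^\alpha \]

(with $T^\alpha = \prod_{j=1}^n T_j^{\alpha_j}$ and $T_j^0 = I$).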

T. Andô [2] proved an extension of von Neumann’s inequality to pairs of commuting contractions.

Theorem 1.1 (Andô). If $T_1, T_2 \in \mathcal{B}(H)$, $\max(\|T_1\|, \|T_2\|) \le 1$, $T_1 T_2 = T_2 T_1$ and $p(z) = p(z_1, z_2)$ is a polynomial, then

\[ \|p(T_1, T_2)\| \le \|p\|_{\mathbb{D}^2}. \]

∗ Received by the editors April 13, 2010. Accepted for publication August 8, 2010. Handling Editor: Harm Bart.

† Washington University, St. Louis and Trinity College, Dublin ([email protected]). Partially supported by National Science Foundation Grants DMS 0501079 and DMS 0966845.


The purpose of this note is to look for analogues of Andô’s inequality that are satisfied by non-commuting operators. For a polynomial $p$ in $n$ variables and an $n$-tuple of operators $T = (T_1, \dots, T_n)$ we define $p_{\mathrm{sym}}(T)$ to be a symmetrized version of $p$ applied to $T$ (we make this precise in Section 2). We are looking for results of the form:

For all $n$-tuples $T$ of operators in a certain set, there is a set $K_1$ in $\mathbb{C}^n$ such that

\[ \|p_{\mathrm{sym}}(T)\| \le \|p\|_{K_1}, \tag{1.2} \]

and

For all $n$-tuples $T$ of operators in a certain set, there is a set $K_2$ in $\mathbb{C}^n$ and a constant $M$ such that

\[ \|p_{\mathrm{sym}}(T)\| \le M \|p\|_{K_2}. \tag{1.3} \]

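Our main result is:

Theorem 4.6. There are positive constants $M_n$ and $R_n$ such that, whenever $T = (T_1, T_2, \dots, T_n) \in \mathcal{B}(H)^n$ satisfies

\[ \Big\| \sum_{i=1}^n \zeta_i T_i \Big\| \le 1 \quad \forall \zeta_i \in \mathbb{D}, \]

and $p$ is a polynomial in $n$ variables, then

\[ \|p_{\mathrm{sym}}(T)\| \le \|p\|_{R_n \mathbb{D}^n}, \tag{1.4} \]

\[ \|p_{\mathrm{sym}}(T)\| \le M_n \|p\|_{\mathbb{D}^n}. \tag{1.5} \]

Moreover, one can choose $R_2 = 1.85$, $R_3 = 2.6$, $M_2 = 4.1$ and $M_3 = 16.6$.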

2. Tuples of noncommuting contractions. There are several natural ways one might apply a polynomial $p(z_1, z_2)$ in two variables to pairs $T = (T_1, T_2) \in \mathcal{B}(H)^2$ of operators. A simple case is for polynomials of the form $p(z_1, z_2) = p_1(z_1) + p_2(z_2)$, where we could naturally consider $p(T_1, T_2)$ to mean $p_1(T_1) + p_2(T_2)$.

A recent result of Drury [4] is that if $p(z_1, z_2) = p_1(z_1) + p_2(z_2)$, $T_1, T_2 \in \mathcal{B}(H)$ (no longer necessarily commuting) and $\max(\|T_1\|, \|T_2\|) \le 1$, then

[…]

Moreover, Drury [4] shows that the constant 2 is best possible.

One way to apply a polynomial $p(z_1, z_2) = \sum_{j,k=0}^n a_{j,k} z_1^j z_2^k$ to two noncommuting operators $T_1$ and $T_2$ is by mapping each monomial $z_1^j z_2^k$ to the average over all possible products of $j$ copies of $T_1$ and $k$ copies of $T_2$, and then extending this map by linearity to all polynomials. We use the notation $p_{\mathrm{sym}}(T_1, T_2)$ and the formula

\[ p_{\mathrm{sym}}(T_1, T_2) = \sum_{j,k=0}^n \frac{a_{j,k}}{\binom{j+k}{j}} \sum_{S \in P(j+k,\, j)} \prod_{i=1}^{j+k} T_{2 - \chi_S(i)} \]

where $P(j+k, j)$ denotes the subsets of $\{1, 2, \dots, j+k\}$ of cardinality $j$. The empty product, which arises for $j = k = 0$, should be taken as the identity operator. The notation $\prod_{i=1}^{j+k} T_{2 - \chi_S(i)}$ is intended to mean the ordered product

\[ T_{2 - \chi_S(1)}\, T_{2 - \chi_S(2)} \cdots T_{2 - \chi_S(j+k)}, \]

and $\chi_S(\cdot)$ denotes the indicator function of $S$.
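The definition can be tried out numerically. The following is a minimal sketch for finite matrices, assuming NumPy; the helper names `sym_monomial` and `p_sym` and the dictionary encoding of the coefficients $a_{j,k}$ are illustrative choices, not notation from the paper.

```python
from itertools import combinations
from math import comb

import numpy as np


def sym_monomial(T1, T2, j, k):
    """Average of all ordered products with j factors T1 and k factors T2."""
    d = T1.shape[0]
    total = np.zeros((d, d), dtype=complex)
    for S in combinations(range(j + k), j):   # S = positions occupied by T1
        word = np.eye(d, dtype=complex)        # empty product = identity
        for i in range(j + k):
            word = word @ (T1 if i in S else T2)
        total += word
    return total / comb(j + k, j)


def p_sym(coeffs, T1, T2):
    """coeffs maps (j, k) -> a_{j,k}; extends sym_monomial by linearity."""
    d = T1.shape[0]
    out = np.zeros((d, d), dtype=complex)
    for (j, k), a in coeffs.items():
        out += a * sym_monomial(T1, T2, j, k)
    return out


T1 = np.array([[0.0, 1.0], [0.0, 0.0]])
T2 = np.array([[0.0, 0.0], [1.0, 0.0]])
# p(z1, z2) = 2 z1 z2 symmetrizes to T1 T2 + T2 T1.
result = p_sym({(1, 1): 2.0}, T1, T2)
print(result.real)
```

The cost grows with the binomial coefficient $\binom{j+k}{j}$, so this direct enumeration is only meant for small degrees.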

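Remarks 2.1. The operation $p \mapsto p_{\mathrm{sym}}(T_1, T_2)$ is not an algebra homomorphism (from polynomials to operators). It is a linear operation and does not respect squares in general.

For example, if $p(z_1, z_2) = z_1^2 + z_2^2$, then

\[ p_{\mathrm{sym}}(T_1, T_2) = T_1^2 + T_2^2 \]

but for $q(z_1, z_2) = (p(z_1, z_2))^2 = z_1^4 + z_2^4 + 2 z_1^2 z_2^2$ we have

\[ (p_{\mathrm{sym}}(T_1, T_2))^2 = T_1^4 + T_2^4 + T_1^2 T_2^2 + T_2^2 T_1^2 \ne q_{\mathrm{sym}}(T_1, T_2) \]

in general.

Similarly for $p(z_1, z_2) = 2 z_1 z_2$ and $q(z_1, z_2) = (p(z_1, z_2))^2 = 4 z_1^2 z_2^2$,

\[ p_{\mathrm{sym}}(T_1, T_2) = T_1 T_2 + T_2 T_1, \]

\[ (p_{\mathrm{sym}}(T_1, T_2))^2 = T_1 T_2 T_1 T_2 + T_1 T_2^2 T_1 + T_2 T_1^2 T_2 + T_2 T_1 T_2 T_1 \ne q_{\mathrm{sym}}(T_1, T_2) \]

in general.

However in the very restricted situation that $p(z_1, z_2) = \alpha + \beta z_1 + \gamma z_2$ and $q = p^m$, then we do have $q_{\mathrm{sym}}(T_1, T_2) = (p_{\mathrm{sym}}(T_1, T_2))^m$.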
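The last claim can be checked by expanding words. The derivation below is a sketch we supply for the reader (it is not spelled out in the source); here $\alpha, \beta, \gamma$ are the scalars of the remark, and $B := \beta T_1 + \gamma T_2$ is our shorthand.

```latex
% Expanding B^l = (\beta T_1 + \gamma T_2)^l produces every word of length l exactly once:
\[
  B^{l}
  = \sum_{j+k=l} \beta^{j}\gamma^{k} \sum_{S \in P(l,j)} \prod_{i=1}^{l} T_{2-\chi_S(i)}
  = \sum_{j+k=l} \binom{l}{j}\beta^{j}\gamma^{k}\,\bigl(z_1^{j} z_2^{k}\bigr)_{\mathrm{sym}}(T_1,T_2)
  = \bigl((\beta z_1 + \gamma z_2)^{l}\bigr)_{\mathrm{sym}}(T_1,T_2).
\]
% Since \alpha I commutes with B, the binomial theorem and linearity of p -> p_sym give
\[
  \bigl(p_{\mathrm{sym}}(T_1,T_2)\bigr)^{m}
  = (\alpha I + B)^{m}
  = \sum_{l=0}^{m} \binom{m}{l}\alpha^{m-l} B^{l}
  = \bigl((\alpha + \beta z_1 + \gamma z_2)^{m}\bigr)_{\mathrm{sym}}(T_1,T_2)
  = q_{\mathrm{sym}}(T_1,T_2).
\]
```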

The symmetrizing idea generalizes in the obvious way to $n > 2$ variables. We will


3. Example. The analogue of Andô’s inequality for $n \ge 3$ commuting Hilbert space contractions and polynomial norms on $\mathbb{D}^n$ is known to fail (see Varopoulos [8], Crabb & Davie [3], Lotto & Steger [6], Holbrook [5]).

The explicit counterexamples of Kaijser & Varopoulos [8] and Crabb & Davie [3] have $p(T)$ nilpotent (and so of spectral radius 0). While the examples of Lotto & Steger [6] and Holbrook [5] do not have this property, they are obtained by perturbing examples where $p(T)$ is nilpotent (and so $p(T)$ has relatively small spectral radius).

It is not known whether there is a constant $C_n$ so that the multi-variable inequality

\[ \|p(T)\| = \|p(T_1, T_2, \dots, T_n)\| \le C_n \|p\|_{\mathbb{D}^n} \tag{3.1} \]

holds for all polynomials $p(z)$ in $n$ variables and for all $n$-tuples $T$ of commuting Hilbert space contractions. However, it is well-known that a spectral radius version of Andô’s inequality is true; indeed, it holds in any Banach algebra.

Proposition 3.1. If $p$ is a polynomial in $n$ variables and $T = (T_1, T_2, \dots, T_n)$ is an $n$-tuple of commuting elements in a Banach algebra, each with norm at most one, then

\[ \rho(p(T)) = \lim_{m \to \infty} \|(p(T))^m\|^{1/m} \le \|p\|_{\mathbb{D}^n} \tag{3.2} \]

(for $\rho(\cdot)$ the spectral radius).

Proof. We consider a fixed $n$. It follows from the Cauchy integral formula that if $\max_{1 \le j \le n} \|T_j\| \le r < 1$, then

\[ \|p(T)\| = \|p(T_1, T_2, \dots, T_n)\| \le C_r \|p\|_{\mathbb{D}^n} \tag{3.3} \]

for a constant $C_r$ depending on $r$ (and $n$).

To see this write

\[ p(T) = \frac{1}{(2\pi i)^n} \int_{\zeta \in \mathbb{T}^n} p(\zeta) \prod_{j=1}^n (\zeta_j - T_j)^{-1} \, d\zeta_1 \, d\zeta_2 \cdots d\zeta_n \]

and estimate with the triangle inequality. This shows that $C_r = (1 - r)^{-n}$ will work.
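The triangle-inequality step can be spelled out as follows. This is a sketch of the standard Neumann-series estimate, supplied here for convenience rather than taken from the source; it also uses $\|p\|_{\mathbb{T}^n} = \|p\|_{\mathbb{D}^n}$, which holds by the maximum principle.

```latex
% For |\zeta_j| = 1 and \|T_j\| \le r < 1, expand the resolvent as a Neumann series:
\[
  \bigl\|(\zeta_j - T_j)^{-1}\bigr\|
  = \Bigl\| \bar\zeta_j \sum_{m=0}^{\infty} (\bar\zeta_j T_j)^{m} \Bigr\|
  \le \sum_{m=0}^{\infty} r^{m} = \frac{1}{1-r}.
\]
% Each circle has length 2\pi, which cancels the prefactor (2\pi)^{-n}:
\[
  \|p(T)\|
  \le \frac{1}{(2\pi)^{n}}\,(2\pi)^{n}\,\|p\|_{\mathbb{T}^n}\,(1-r)^{-n}
  = (1-r)^{-n}\,\|p\|_{\mathbb{D}^n}.
\]
```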

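Applying (3.3) to powers of $p$ and using the spectral radius formula, we get

\[ \rho(p(T)) \le \|p\|_{\mathbb{D}^n} \]

(provided $\max_{1 \le j \le n} \|T_j\| \le r < 1$). However, for the general case $\max_{1 \le j \le n} \|T_j\| = 1$, we can apply this to $rT$ to get

\[ \rho(p(T)) = \lim_{r \to 1^-} \rho(p(rT)) \le \|p\|_{\mathbb{D}^n}. \]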

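Example 3.2. Let $p(z, w) = (z - w)^2 + 2(z + w) + 1 = z^2 + w^2 - 2zw + 2(z + w) + 1$,

\[ T_1 = \begin{pmatrix} \cos(\pi/3) & \sin(\pi/3) \\ \sin(\pi/3) & -\cos(\pi/3) \end{pmatrix} = \begin{pmatrix} 1/2 & \sqrt{3}/2 \\ \sqrt{3}/2 & -1/2 \end{pmatrix}, \]

\[ T_2 = \begin{pmatrix} \cos(\pi/3) & -\sin(\pi/3) \\ -\sin(\pi/3) & -\cos(\pi/3) \end{pmatrix}. \]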

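Note that $\|p\|_{\mathbb{D}^2} \ge p(1, -1) = 5$. To show that $\|p\|_{\mathbb{D}^2} \le 5$, consider the homogeneous polynomial

\[ q(z_1, z_2, z_3) = z_1^2 + z_2^2 + z_3^2 - 2 z_1 z_2 - 2 z_1 z_3 - 2 z_2 z_3 \]

and observe first that $p(z, w) = q(z, w, -1)$. Moreover

\[ \|p\|_{\mathbb{D}^2} = \|p\|_{\mathbb{T}^2} = \|q\|_{\mathbb{T}^3} = \|q\|_{\mathbb{D}^3}, \]

by homogeneity of $q$ and the maximum principle. Holbrook [5, Proposition 2] gives a proof that $\|q\|_{\mathbb{D}^3} = 5$.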

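We have

\begin{align*}
p_{\mathrm{sym}}(T_1, T_2) &= (T_1 - T_2)^2 + 2(T_1 + T_2) + I \\
&= \begin{pmatrix} 0 & \sqrt{3} \\ \sqrt{3} & 0 \end{pmatrix}^2 + 2 \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} + I \\
&= \begin{pmatrix} 3 & 0 \\ 0 & 3 \end{pmatrix} + \begin{pmatrix} 2 & 0 \\ 0 & -2 \end{pmatrix} + \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} = \begin{pmatrix} 6 & 0 \\ 0 & 2 \end{pmatrix}.
\end{align*}

So $\|p_{\mathrm{sym}}(T_1, T_2)\| = 6 > 5 = \|p\|_{\mathbb{D}^2}$.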
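The arithmetic of Example 3.2 can be reproduced numerically. The sketch below assumes NumPy; the grid size `N` is an arbitrary choice, and sampling $|p|$ on a grid of the torus only illustrates the value $\|p\|_{\mathbb{D}^2} = 5$, it does not prove it.

```python
import numpy as np

c, s = np.cos(np.pi / 3), np.sin(np.pi / 3)
T1 = np.array([[c, s], [s, -c]])
T2 = np.array([[c, -s], [-s, -c]])
I = np.eye(2)

# p_sym(T1, T2) for p(z, w) = (z - w)^2 + 2(z + w) + 1
D = T1 - T2
P = D @ D + 2 * (T1 + T2) + I
op_norm = np.linalg.norm(P, 2)   # operator norm = largest singular value

# Sample |p| on an equispaced grid of the torus T^2 (it contains (1, 1) and (1, -1)).
N = 360
theta = 2 * np.pi * np.arange(N) / N
z = np.exp(1j * theta)[:, None]
w = np.exp(1j * theta)[None, :]
grid_max = np.abs((z - w) ** 2 + 2 * (z + w) + 1).max()

print(op_norm, grid_max)
```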

Remark 3.3. The example has hermitian $T_1$ and $T_2$ and a polynomial with real coefficients and yet $\rho(p_{\mathrm{sym}}(T_1, T_2)) > \|p\|_{\mathbb{D}^2}$. Thus even Proposition 3.1 does not hold for non-commuting pairs.

The referee has provided an argument to show that for the polynomial $p$ of Example 3.2, one has the inequality $\|p_{\mathrm{sym}}(T_1, T_2)\| \le 6$ for all contractions $T_1$ and $T_2$ (and thus the example is optimal for that $p$). This inequality is a substantial improvement over using the sum of the absolute values of the coefficients of $p$, so one is led to ask


4. $\|\sum \zeta_i T_i\| \le 1$. In this section, we shall consider $n$-tuples $T = (T_1, \dots, T_n)$ of operators, not assumed to be commuting, and we shall make the standing assumption:

\[ \Big\| \sum_{i=1}^n \zeta_i T_i \Big\| \le 1 \quad \forall \zeta_i \in \mathbb{D}. \tag{4.1} \]

This will hold, for example, if the condition

\[ \sum_{i=1}^n \|T_i\| \le 1 \tag{4.2} \]

holds. We wish to derive bounds on $\|p_{\mathrm{sym}}(T)\|$. We start with the following lemma:

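Lemma 4.1. If $S \in \mathcal{B}(H)$ and $\|S\| < 1$ then

\[ \Re\big((I + S)(I - S)^{-1}\big) \ge 0. \]

Proof.

\begin{align*}
2\Re\big((I + S)(I - S)^{-1}\big)
&= (I - S^*)^{-1}(I + S^*) + (I + S)(I - S)^{-1} \\
&= (I - S^*)^{-1}\big[(I + S^*)(I - S) + (I - S^*)(I + S)\big](I - S)^{-1} \\
&= 2(I - S^*)^{-1}\big[I - S^* S\big](I - S)^{-1} \\
&\ge 0.
\end{align*}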
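As a sanity check, the identity in the proof can be tested on a random matrix. This is a minimal sketch assuming NumPy; the dimension, the seed and the choice $\|S\| = 0.9$ are arbitrary.

```python
import numpy as np

rng = np.random.default_rng(0)
d = 4
S = rng.standard_normal((d, d)) + 1j * rng.standard_normal((d, d))
S *= 0.9 / np.linalg.norm(S, 2)          # rescale so that ||S|| = 0.9 < 1
I = np.eye(d)

C = (I + S) @ np.linalg.inv(I - S)
real_part = (C + C.conj().T) / 2         # Re C = (C + C*) / 2

# Right-hand side of the identity in the proof of Lemma 4.1 (divided by 2).
R = np.linalg.inv(I - S.conj().T) @ (I - S.conj().T @ S) @ np.linalg.inv(I - S)

min_eig = np.linalg.eigvalsh(real_part).min()
print(min_eig)
```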

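If $p(z) = \sum c_\alpha z^\alpha$, define

\[ \Gamma p(z) = \sum c_\alpha \frac{\alpha!}{|\alpha|!} z^\alpha \tag{4.3} \]

(as usual, $\alpha!$ means $\alpha_1! \cdots \alpha_n!$). We let $\Lambda$ denote the inverse of $\Gamma$:

\[ \Lambda \sum d_\alpha z^\alpha = \sum d_\alpha \frac{|\alpha|!}{\alpha!} z^\alpha. \]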
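On coefficients, $\Gamma$ and $\Lambda$ are just reweightings. A small sketch with exact rational arithmetic follows; the function names `weight`, `gamma` and `lam` and the dictionary representation of a polynomial are our own illustrative choices.

```python
from fractions import Fraction
from math import factorial, prod


def weight(alpha):
    """alpha! / |alpha|! for a multi-index alpha (a tuple of ints >= 0)."""
    return Fraction(prod(factorial(a) for a in alpha), factorial(sum(alpha)))


def gamma(coeffs):
    """Gamma: c_alpha -> c_alpha * alpha!/|alpha|!."""
    return {a: c * weight(a) for a, c in coeffs.items()}


def lam(coeffs):
    """Lambda, the inverse of Gamma: d_alpha -> d_alpha * |alpha|!/alpha!."""
    return {a: d / weight(a) for a, d in coeffs.items()}


# p(z1, z2) = 1 + 2 z1 z2 + z1^2 z2
p = {(0, 0): Fraction(1), (1, 1): Fraction(2), (2, 1): Fraction(1)}
gp = gamma(p)
print(gp)
```

For instance, the monomial $2 z_1 z_2$ is sent by $\Gamma$ to $z_1 z_2$, since $\alpha!/|\alpha|! = 1/2$ for $\alpha = (1, 1)$.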

Proposition 4.2. Let $T = (T_1, T_2, \dots, T_n) \in \mathcal{B}(H)^n$ satisfy (4.1) and $p(z)$ be a polynomial in $n$ variables. Then

\[ \|p_{\mathrm{sym}}(T)\| \le \|\Gamma p\|_{\mathbb{D}^n}. \tag{4.4} \]

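Proof. We first restrict to the case

\[ \zeta = (\zeta_1, \zeta_2, \dots, \zeta_n) \in \mathbb{T}^n \ \Rightarrow\ \|\zeta \cdot T\| = \Big\| \sum_{j=1}^n \zeta_j T_j \Big\| < 1 \]

and hence by Lemma 4.1 the operator

\[ (I + \zeta \cdot T)(I - \zeta \cdot T)^{-1} = (I + \zeta \cdot T) \sum_{j=0}^\infty (\zeta \cdot T)^j = I + 2 \sum_{j=1}^\infty (\zeta \cdot T)^j \]

has positive real part

\begin{align*}
K(\zeta, T) &= \Re\big[(I + \zeta \cdot T)(I - \zeta \cdot T)^{-1}\big] \\
&= I + \sum_{j=1}^\infty (\zeta \cdot T)^j + \sum_{j=1}^\infty (\bar\zeta \cdot T^*)^j \\
&= 2\Re\Big[ \sum_{\alpha_1, \dots, \alpha_n = 0}^\infty \frac{|\alpha|!}{\alpha!} \zeta^\alpha (z^\alpha)_{\mathrm{sym}}(T) \Big] - I.
\end{align*}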

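We can compute that for polynomials $p(z) = p(z_1, z_2, \dots, z_n)$,

\[ p_{\mathrm{sym}}(T) = \int_{\mathbb{T}^n} \Gamma p(\zeta)\, K(\bar\zeta, T)\, d\sigma(\zeta) \]

with $d\sigma$ indicating normalised Haar measure on the torus $\mathbb{T}^n$ (and $\bar\zeta = (\bar\zeta_1, \bar\zeta_2, \dots, \bar\zeta_n)$).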
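The integral representation can be tested numerically for $n = 2$ on a single monomial. The sketch below assumes NumPy, random $3 \times 3$ matrices with $\|T_1\| + \|T_2\| = 0.5$ (so that (4.1) holds strictly), and an equispaced quadrature rule with `N` nodes per circle; all of these are our choices. For $p(z_1, z_2) = 2 z_1 z_2$ one has $\Gamma p(\zeta) = \zeta_1 \zeta_2$ and $p_{\mathrm{sym}}(T) = T_1 T_2 + T_2 T_1$.

```python
import numpy as np

rng = np.random.default_rng(1)
d = 3
T1 = rng.standard_normal((d, d))
T2 = rng.standard_normal((d, d))
T1 *= 0.25 / np.linalg.norm(T1, 2)   # ||T1|| + ||T2|| = 0.5
T2 *= 0.25 / np.linalg.norm(T2, 2)
I = np.eye(d)


def K(zeta1, zeta2):
    """K(zeta, T) = Re[(I + zeta.T)(I - zeta.T)^{-1}] for zeta = (zeta1, zeta2)."""
    A = zeta1 * T1 + zeta2 * T2
    C = (I + A) @ np.linalg.inv(I - A)
    return (C + C.conj().T) / 2


N = 64                                           # quadrature nodes per circle
nodes = np.exp(2j * np.pi * np.arange(N) / N)
integral = np.zeros((d, d), dtype=complex)
for z1 in nodes:
    for z2 in nodes:
        # integrand: Gamma p(zeta) * K(conj(zeta), T)
        integral += (z1 * z2) * K(np.conj(z1), np.conj(z2))
integral /= N * N                                # normalised Haar measure

expected = T1 @ T2 + T2 @ T1
print(np.abs(integral - expected).max())
```

Because $\|\zeta \cdot T\| \le 0.5$, the aliasing error of the equispaced rule is of order $0.5^{N}$, far below rounding error for $N = 64$.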

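As $K(\bar\zeta, T)\, d\sigma(\zeta)$ is a positive operator valued measure on $\mathbb{T}^n$, we then have a positive unital linear map $C(\mathbb{T}^n) \to \mathcal{B}(H)$ given by $f \mapsto \int_{\mathbb{T}^n} f(\zeta) K(\bar\zeta, T)\, d\sigma(\zeta)$. As this map is then of norm 1, we can conclude

\[ \|p_{\mathrm{sym}}(T)\| \le \|\Gamma p\|_{\mathbb{D}^n}. \]

For the remaining case $\sup_{\zeta \in \mathbb{T}^n} \|\zeta \cdot T\| = 1$, we have

\[ \|p_{\mathrm{sym}}(T)\| = \lim_{r \to 1^-} \|p_{\mathrm{sym}}(rT)\| \le \|\Gamma p\|_{\mathbb{D}^n}. \]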

Remark 4.3. The technique of the above proof is derived from methods of [7].


Proposition 4.4. For each $n \ge 2$ there is a constant $M_n$ so that

\[ \|\Gamma p\|_{\mathbb{D}^n} \le M_n \|p\|_{\mathbb{D}^n}. \]

Moreover,

\[ M_2 \le 4.07, \qquad M_3 \le 16.6. \]

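Proof. Define

\[ J(\eta) = \sum_{\alpha_1 = 0, \dots, \alpha_n = 0}^\infty \frac{\alpha!}{|\alpha|!} \eta^\alpha. \tag{4.5} \]

Then

\[ \Gamma p(z) = \int_{\mathbb{T}^n} p(\zeta) \big[ J(z_1 \bar\zeta_1, \dots, z_n \bar\zeta_n) \big]\, d\sigma(\zeta). \tag{4.6} \]

To use (4.6), we break $J$ into two parts: the sum $J_0$ where the minimum of the $\alpha_i$ is 0, and the remaining terms $J_1$,

\[ J_1(\eta) = \sum_{\alpha_1 = 1, \dots, \alpha_n = 1}^\infty \frac{\alpha!}{|\alpha|!} \eta^\alpha. \]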

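Case: $n = 2$. Here,

\[ \int_{\mathbb{T}^2} p(\zeta) J_0(z_1 \bar\zeta_1, z_2 \bar\zeta_2)\, d\sigma(\zeta) = p(z_1, 0) + p(0, z_2) - p(0, 0). \tag{4.7} \]

So the norm of the left-hand side of (4.7) is dominated by $3\|p\|_{\mathbb{D}^2}$.

For $J_1$, we will use the estimate

\[ \Big| \int_{\mathbb{T}^2} p(\zeta) J_1(z_1 \bar\zeta_1, z_2 \bar\zeta_2)\, d\sigma(\zeta) \Big| \le \|p\|_\infty \|J_1\|_{L^1} \le \|p\|_\infty \|J_1\|_{L^2}. \]

We have

\begin{align*}
\|J_1\|_{L^2}^2 &= \sum_{\alpha_1, \alpha_2 = 1}^\infty \Big( \frac{\alpha_1!\, \alpha_2!}{(\alpha_1 + \alpha_2)!} \Big)^2 \\
&= \sum_{\alpha_1 = 1}^\infty \frac{1}{(\alpha_1 + 1)^2} + \sum_{\alpha_2 = 2}^\infty \frac{1}{(\alpha_2 + 1)^2} + \sum_{\alpha_1, \alpha_2 = 2}^\infty \Big( \frac{\alpha_1!\, \alpha_2!}{(\alpha_1 + \alpha_2)!} \Big)^2 \\
&\le \frac{\pi^2}{3} - \frac{9}{4} + \sum_{k=4}^\infty (k - 3) \Big( \frac{2}{k(k-1)} \Big)^2
\end{align*}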


(In the last sum, we let $k = \alpha_1 + \alpha_2$; there are $k - 3$ terms with this sum, and the largest they can be is when either $\alpha_1$ or $\alpha_2$ is 2.) Adding the two estimates, we get $M_2 \le 4.07$.
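The constant can be cross-checked by summing the series for $\|J_1\|_{L^2}^2$ directly. The sketch below is our own arrangement, not the authors' computation: the terms with $\alpha_1 = 1$ or $\alpha_2 = 1$ are summed in closed form, the rest are summed up to a cutoff `K_MAX` (an arbitrary choice), and the omitted tail is bounded crudely.

```python
from math import comb, pi, sqrt

K_MAX = 400  # truncation level for k = a1 + a2

# Terms with a1 = 1 or a2 = 1 sum exactly to pi^2/3 - 9/4, as in the text.
edge = pi ** 2 / 3 - 9 / 4

# Terms with a1, a2 >= 2, grouped by k = a1 + a2; each term is 1 / C(k, a1)^2.
interior = sum(
    1.0 / comb(k, a1) ** 2
    for k in range(4, K_MAX + 1)
    for a1 in range(2, k - 1)
)

# For each k there are k - 3 omitted terms, each at most (2 / (k (k - 1)))^2,
# so the omitted tail is at most sum_{k > K_MAX} 4 / (k - 1)^3 <= 2 / (K_MAX - 1)^2.
tail_bound = 2.0 / (K_MAX - 1) ** 2

j1_l2 = sqrt(edge + interior + tail_bound)   # upper bound for ||J1||_{L2}
m2_bound = 3 + j1_l2                         # J0 contributes 3, J1 at most ||J1||_{L2}
print(j1_l2, m2_bound)
```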

Case: $n = 3$. Again, we estimate the contributions of $J_0$ and $J_1$ separately. We have

\begin{align*}
\int p(\zeta) J_0(z_1 \bar\zeta_1, z_2 \bar\zeta_2, z_3 \bar\zeta_3)\, d\sigma(\zeta)
&= \Gamma p(0, z_2, z_3) + \big[ \Gamma p(z_1, 0, z_3) - p(0, 0, z_3) \big] \\
&\quad + \big[ \Gamma p(z_1, z_2, 0) - p(z_1, 0, 0) - p(0, z_2, 0) + p(0, 0, 0) \big]
\end{align*}

where we have had to subtract some terms to avoid double-counting. Thus the contribution of $J_0$ is at most $3M_2 + 4$.

To calculate the contribution of $J_1$, we make the following estimate on $\|J_1\|_{L^2}$, which is valid for all $n \ge 3$:

We want to bound

\[ \sum_{\alpha_1 = 1, \dots, \alpha_n = 1}^\infty \Big( \frac{\alpha!}{|\alpha|!} \Big)^2. \tag{4.8} \]

Let $k = |\alpha|$ in (4.8). Note first that the number of terms for each $k$ is the number of ways of writing $k$ as a sum of $n$ positive integers (order matters), and this is exactly $\binom{k-1}{n-1}$. Moreover, as each $\alpha_i$ is at least 1, we have

\[ \frac{\alpha!}{|\alpha|!} \le \frac{1}{k(k-1) \cdots (k-n+2)}. \]

Therefore (4.8) is bounded by

\[ \sum_{k=n}^\infty \binom{k-1}{n-1} \Big( \frac{1}{k(k-1) \cdots (k-n+2)} \Big)^2 = \sum_{k=n}^\infty \frac{k-n+1}{(n-1)!\, k} \cdot \frac{1}{k(k-1) \cdots (k-n+2)}. \]

The terms on the right-hand side of the last display decay like $1/k^{n-1}$, so the series converges for all $n \ge 3$. When $n = 3$, the series is

\[ \sum_{k=3}^\infty \frac{k-2}{2k^2(k-1)} \le (0.381)^2. \]
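The value of the $n = 3$ series can be checked in closed form. The partial-fraction identity and the final combination of constants below are our own consistency check under the assumption that the $J_0$ bound $3M_2 + 4$ (with $M_2 \le 4.07$) and the $J_1$ bound $0.381$ are simply added; the cutoff `K` is arbitrary.

```python
from math import pi

# Partial fractions: (k-2) / (k^2 (k-1)) = 1/k + 2/k^2 - 1/(k-1), so
# sum_{k>=3} (k-2) / (2 k^2 (k-1)) = (pi^2/3 - 3) / 2 after telescoping.
closed_form = (pi ** 2 / 3 - 3) / 2

# Direct partial sum as a cross-check (the tail beyond K behaves like 1/(2K)).
K = 200_000
partial = sum((k - 2) / (2 * k * k * (k - 1)) for k in range(3, K + 1))

j1_l2 = closed_form ** 0.5            # bound for ||J1||_{L2} when n = 3
m3_bound = 3 * 4.07 + 4 + 0.381       # contribution of J0 plus that of J1
print(closed_form, partial, j1_l2, m3_bound)
```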


We now proceed by induction on $n$. The contribution from $J_0$ is dominated by applying $\Gamma$ to the restriction of $p$ to the slices with one or more coordinates equal to 0, and these are bounded by the inductive hypothesis. The contribution from $J_1$ is bounded by (4.8).

We have proved that the polydisk is an $M$-spectral set for $T$; we can make the constant one by enlarging the domain.

Proposition 4.5. There is a constant $R_n$ so that

\[ \|\Gamma p\|_{\mathbb{D}^n} \le \|p\|_{R_n \mathbb{D}^n}. \tag{4.9} \]

Moreover,

\[ R_2 \le 1.85, \qquad R_3 \le 2.6. \]

Proof. Let $L(\eta) = 2\Re J(\eta) - 1$. Adding terms that are not conjugate analytic powers of $\zeta$ inside the bracket in (4.6) will not change the value of the integral, so, writing $z\bar\zeta$ for the $n$-tuple $(z_1 \bar\zeta_1, \dots, z_n \bar\zeta_n)$, we get

\[ \Gamma p(z) = \int_{\mathbb{T}^n} p(\zeta) \big[ L(z\bar\zeta) \big]\, d\sigma(\zeta). \tag{4.10} \]

As $L$ is real and has integral 1, if we can choose $r_n$ so that if $|z_i| \le r_n$ for each $i$ then $L(z\bar\zeta)$ is non-negative for all $\zeta$, then its $L^1$ norm would equal its integral, and so we would get from (4.10) that

\[ |\Gamma p(z)| \le \|p\|_{\mathbb{D}^n}. \]

Letting $R_n = 1/r_n$ gives (4.9). As the series (4.5) converges absolutely for all $\eta \in \mathbb{D}^n$, and $L(0) = 1$, the existence of some $r_n$ now follows by continuity.

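Let us turn now to obtaining quantitative estimates.

Case: $n = 2$. Adding terms to $J$ that are not analytic will not affect the integral (4.10), so let us consider

\[ L'(\eta) = \Re\Big( \frac{1 + \eta_1}{1 - \eta_1} \Big) \cdot \Re\Big( \frac{1 + \eta_2}{1 - \eta_2} \Big) - \sum_{\alpha_1 = 1, \alpha_2 = 1}^\infty \Big( 1 - \frac{\alpha!}{|\alpha|!} \Big) \big( \eta_1^{\alpha_1} - \bar\eta_1^{\alpha_1} \big) \big( \eta_2^{\alpha_2} - \bar\eta_2^{\alpha_2} \big). \]

Then $L'$ has integral 1 and (4.10) is unchanged if $L$ is replaced by $L'$. So we wish to find the largest $r$ so that $L'$ is positive on $r\mathbb{D}^2$.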

It can be checked numerically that $r = 0.5406$ works, so the best $R_2$ is smaller than 1.85.
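A numerical check of this kind can be sketched as follows. This is not the authors' computation: it assumes NumPy, truncates the double series at `terms` (our choice), and samples $L'$ only on a grid of the torus of radius $r$, which is where the minimum over $r\mathbb{D}^2$ is attained because $L'$ is harmonic in each variable separately. A grid search is evidence, not a proof.

```python
from math import comb

import numpy as np


def poisson(eta):
    """Re[(1 + eta) / (1 - eta)] for complex eta with |eta| < 1."""
    return ((1 + eta) / (1 - eta)).real


def L_prime(eta1, eta2, terms=60):
    """Kernel L' for n = 2 with the double series truncated (alpha_i < terms)."""
    out = poisson(eta1) * poisson(eta2)
    for a in range(1, terms):
        for b in range(1, terms):
            w = 1 - 1 / comb(a + b, a)   # 1 - alpha!/|alpha|!
            # (x - conj(x)) (y - conj(y)) = -4 Im(x) Im(y); the minus sign in
            # front of the series turns this into +4 w Im(x) Im(y).
            out = out + 4 * w * (eta1 ** a).imag * (eta2 ** b).imag
    return out


def grid_min(r, n_angles=72):
    """Minimum of L' over an equispaced grid on the torus of radius r."""
    angles = 2 * np.pi * np.arange(n_angles) / n_angles
    eta = r * np.exp(1j * angles)
    return L_prime(eta[:, None], eta[None, :]).min()


print(grid_min(0.2), grid_min(0.5406))
```

At $r = 0.2$ positivity can also be seen by hand: the product of the two Poisson-type factors is at least $((1-r)/(1+r))^2 \approx 0.444$, while the series is at most $4(r/(1-r))^2 = 0.25$ in modulus.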

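Case: $n = 3$. As in the case $n = 2$, we consider the kernel

\[ L'(\eta) = \Re\Big( \frac{1 + \eta_1}{1 - \eta_1} \Big) \cdot \Re\Big( \frac{1 + \eta_2}{1 - \eta_2} \Big) \cdot \Re\Big( \frac{1 + \eta_3}{1 - \eta_3} \Big) - \sum_{\alpha_1 = 1, \alpha_2 = 1, \alpha_3 = 0}^\infty \Big( 1 - \frac{\alpha!}{|\alpha|!} \Big) \big( \eta_1^{\alpha_1} - \bar\eta_1^{\alpha_1} \big) \big( \eta_2^{\alpha_2} - \bar\eta_2^{\alpha_2} \big) \big( \eta_3^{\alpha_3} + \bar\eta_3^{\alpha_3} \big). \]

(Note that there is a plus in the last factor to keep $L'$ real.) Again, a computer search can find $r$ so that $L'$ is positive on $r\mathbb{D}^3$, and $r = 0.39$ works, so $R_3 < 2.6$.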

Combining Propositions 4.2, 4.4 and 4.5, we get the main result of this section.

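Theorem 4.6. There are positive constants $M_n$ and $R_n$ such that whenever $T = (T_1, T_2, \dots, T_n) \in \mathcal{B}(H)^n$ satisfies (4.1) and $p(z)$ is a polynomial in $n$ variables, then

\[ \|p_{\mathrm{sym}}(T)\| \le \|p\|_{R_n \mathbb{D}^n}, \tag{4.11} \]

\[ \|p_{\mathrm{sym}}(T)\| \le M_n \|p\|_{\mathbb{D}^n}. \tag{4.12} \]

Moreover, one can choose $R_2 = 1.85$, $R_3 = 2.6$, $M_2 = 4.1$ and $M_3 = 16.6$.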

Remark 4.7. Another way to estimate $\|p_{\mathrm{sym}}(T)\|$, under the assumption (4.2), would be to crash through with absolute values. Let $\Delta_n = \{ z \in \mathbb{C}^n : \sum_{j=1}^n |z_j| \le 1 \}$ and let $r_n$ denote the Bohr radius of $\Delta_n$, i.e. the largest $r$ such that whenever $p(z) = \sum c_\alpha z^\alpha$ has modulus less than or equal to one on $\Delta_n$, then $q(z) = \sum |c_\alpha| z^\alpha$ has modulus bounded by one on $r\Delta_n$. One then has the estimate that, under the hypothesis (4.2), and writing $C_n = 1/r_n$,

\[ \|p_{\mathrm{sym}}(T)\| \le \|q\|_{\Delta_n} \le \|p\|_{C_n \Delta_n}. \tag{4.13} \]

It was shown by L. Aizenberg [1, Thm. 9] that

\[ \frac{1}{3e^{1/3}} < r_n \le \frac{1}{3}. \]

So the estimate in (4.11) for pairs satisfying (4.2) does not follow from (4.13).
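The size of the constants involved can be made concrete with a few lines of arithmetic. This is only an illustration of the numbers quoted above; note that (4.11) and (4.13) concern differently shaped domains ($R_n \mathbb{D}^n$ versus $C_n \Delta_n$), so comparing radii alone is a heuristic, not a proof of the remark.

```python
from math import exp

lower = 1 / (3 * exp(1 / 3))   # Aizenberg's lower bound for r_n
upper = 1 / 3                  # Aizenberg's upper bound for r_n

# C_n = 1 / r_n therefore satisfies 3 <= C_n < 3 e^{1/3}.
c_min, c_max = 1 / upper, 1 / lower

R2, R3 = 1.85, 2.6             # radii quoted in Theorem 4.6
print(lower, c_min, c_max)
print(R2 < c_min, R3 < c_min)
```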

5. $n$-tuples of contractions. In an attempt to use the above technique for tuples $T \in \mathcal{B}(H)^n$ such that $\max_{1 \le j \le n} \|T_j\| \le 1$, we consider restricting $\zeta$ to belong to $\Delta_n$, and we replace $\sigma$ by some probability measure $\mu$ supported on $\Delta_n$.

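Suppose we can find some function $q$ such that

\[ \Lambda_\mu(q)(z) := \int_{\Delta_n} q(\zeta)\, \Re\Big( \frac{1 + \bar\zeta \cdot z}{1 - \bar\zeta \cdot z} \Big)\, d\mu(\zeta) \tag{5.1} \]

equals $p(z)$. We do not actually need $q$ to be a polynomial; having an absolutely convergent power series on $\Delta_n$ (in $\zeta$ and $\bar\zeta$) is enough.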

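Lemma 5.1. With notation as above, assume $\Lambda_\mu(q) = p$ and that $T \in \mathcal{B}(H)^n$ is an $n$-tuple of contractions. Then

\[ \|p_{\mathrm{sym}}(T)\| \le \|q\|_{\operatorname{supp}(\mu)} \le \sup\{ |q(z)| : z \in \Delta_n \}. \]

Proof. We assume first that $\max_{1 \le j \le n} \|T_j\| < 1$ and use the notation $K(\zeta, T)$ from the proof of Proposition 4.2 (which is permissible as $\|\zeta \cdot T\| < 1$ for $\zeta \in \Delta_n$). We have

\[ (\Lambda_\mu q)_{\mathrm{sym}}(T) = \int_{\Delta_n} q(\zeta)\, K(\bar\zeta, T)\, d\mu(\zeta) \]

and hence the inequality $\|p_{\mathrm{sym}}(T)\| \le \|q\|_{\operatorname{supp}(\mu)}$ follows as in the previous proof.

If $\max_{1 \le j \le n} \|T_j\| = 1$, we deduce the result from $\|p_{\mathrm{sym}}(rT)\| \le \|q\|_{\Delta_n}$ for $0 < r < 1$.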

Remark 5.2. For an arbitrary measure $\mu$, there might be no $q$ such that $\Lambda_\mu(q) = p$. If $\mu$ is chosen to be circularly symmetric, though, one gets

\[ \Lambda_\mu(z^\alpha) = \frac{|\alpha|!}{\alpha_1! \cdots \alpha_n!} \Big( \int |\zeta^\alpha|^2\, d\mu(\zeta) \Big) z^\alpha. \tag{5.2} \]

As long as none of the moments on the right of (5.2) vanish, inverting $\Lambda_\mu$ is now straightforward.

To make use of the lemma to bound $p_{\mathrm{sym}}(T)$ we need to find a way to choose another polynomial $q$ and a $\mu$ on $\Delta_n$ so that $p = \Lambda_\mu q$ and $\|q\|_{\Delta_n}$ is small. We do not know a good way to do this.

Question 1. What is the smallest constant $R_n$ such that, for every $n$-tuple $T$ of contractions and every polynomial $p$, one has

\[ \|p_{\mathrm{sym}}(T)\| \le \|p\|_{R_n \mathbb{D}^n}\,? \tag{5.3} \]

We do not know if one can choose $R_n$ smaller than the reciprocal of the Bohr radius of the polydisk, even when $n = 2$.

Question 2. Is there a constant $M_n$ such that, for every $n$-tuple $T$ of contractions and every polynomial $p$, one has

[…]

REFERENCES

[1] Lev Aizenberg. Multidimensional analogues of Bohr’s theorem on power series. Proc. Amer. Math. Soc., 128(4):1147–1155, 2000.

[2] T. Andô. On a pair of commutative contractions. Acta Sci. Math. (Szeged), 24:88–90, 1963.

[3] M. J. Crabb and A. M. Davie. von Neumann’s inequality for Hilbert space operators. Bull. London Math. Soc., 7:49–50, 1975.

[4] S. W. Drury. Von Neumann’s inequality for noncommuting contractions. Linear Algebra Appl., 428(1):305–315, 2008.

[5] John A. Holbrook. Schur norms and the multivariate von Neumann inequality. In Recent advances in operator theory and related topics (Szeged, 1999), volume 127 of Oper. Theory Adv. Appl., pp. 375–386. Birkhäuser, Basel, 2001.

[6] B. A. Lotto and T. Steger. von Neumann’s inequality for commuting, diagonalizable contractions. II. Proc. Amer. Math. Soc., 120(3):897–901, 1994.

[7] John E. McCarthy and Mihai Putinar. Positivity aspects of the Fantappiè transform. J. Anal. Math., 97:57–82, 2005.

[8] N. Th. Varopoulos. On an inequality of von Neumann and an application of the metric theory of tensor products to operators theory. J. Functional Analysis, 16:83–100, 1974.

[9] Johann von Neumann. Eine Spektraltheorie für allgemeine Operatoren eines unitären Raumes.