B = P-IAP

Any two matrices

A

and

B

are said to be similar to each other if there exists a nonsingular matrix

P

^{such that}

B

p-I AP.

The matrices for a given linear t.ransformation relative to different ordered bases form an equivalence class under the similarity relation. What is the simplest matrix that can be obtained for a linear transformation by changing the basis? This is a difficult question, to which one answer is provided by the Jordan canonical form. Another answer can be given in the context of a finite-dimensional Hilbert space when the linear transformation

L

is Hermitian.

104 Chapter

Hilbert Spaces

Let

[UI

^\^{. .}^.^{, un]}be an ordered orthonormal basis for the n-dimensional Hilbert space X. Let L be Hermitian. The spectral theorem asserts the exis

tence of an ordered orthonormal basis

[VI

^{\ . . .}^,^v^nJand an n-tuple of real numbers (AI , . . . , An) such that

(8) Lx = :L>i(X,Vi)Vi i=1

n As above, we introduce matrices _n

A

and

P

^{such that}

(9)

LUj = Laij Ui ^{( 1}

:::; j :::; n)

,=1 i=1

The matrix B that represents L relative to the v-basis is the diagonal matrix diag(AI , . . . , An

)

^,as we see from Equation (8). Thus from Equation (7) we conclude that

A

is similar to a diagonal matrix having real entries. More can be said, however, because

P

has a special structure. Notice that

/

ⁿ ⁿ

)

^{n n}

(J)jk = Ojk = (Vk,Vj) = ^\ 8PikU;, �Prj Ur = 8�PikPrj(Ui,ur)

n n

= LPikPij = L(P")ji(P)ik = (P"P)jk i=1 i=1

This shows that

( 10) P"P = ^I

(It follows by elementary linear algebra that

PP* = ^I.)

Matrices having the property ( 10) are said to be unitary. We can therefore state that the matrix

A

(representing the Hermitian operator L with respect to an orthonormal base) is unitarily similar to a real uiagonal matrix.

Finally, we note that if an n ^xn complex matrix

A

is such that

A = ^A'

^,^then

A

is the matrix of a Hermitian transformation relative to an orthonormal basis.

Indeed, we have only to select any orthonormal base

[UI,'

^{. .}^{, unJ}and define L by

Then, of course,

LUj = LaijUi i=1

=

(t ^{Cj Uj) =} t t ^{Cj aij Ui}

By straightforward calculation we have

(Lx, y)

=

^{(x, Ly)}

A matrix

A

satisfying

A * ^{= A}

is said to be Hermitian. We have proved therefore the following important result, regarded by many as the capstone of elementary matrix theory:

Theorem.

Every complex Hermitian matrix is unitarily similar to

real diagonal matrix.

Section

2.5

Sturm-Liouville Theory

105 2.5 Sturm-Liouville Theory

In this section differential equations are attacked with the weapons of Hilbert space theory. Recall that in elementary calculus we interpret integration and differentiation as mutually inverse operations. So it is here, too, that dif

ferential operators and integral operators can be inverse to each other. We find that a differential operator is usually ill-behaved, whereas the corresponding in

tegral operator may be well-behaved, even to the point of being compact. Thus, we often try to recast a differential equation as an equivalent integral equation, hoping that the transformed problem will be less troublesome. (This theme will reappear many times in Chapter 4.) This strategy harmonizes with our general impression that differentiation emphasizes the roughness of a function, whereas integration is a smoothing operation, and is thus applicable to a broader class of functions.

Definition. The Sturm-Liouville operator is defined by

(Ax)(t)

⁼

[P(t)x'(t)j'

⁺

q(t)x(t)

^i.e.,

Ax

⁼

(px')'

⁺

qx

where

x

is two-times continuously differentiable,

p

is real-valued and continu

ously differentiable, and

q

is real-valued continuous. The domain of the functions

x, p,

^and

q

is an interval

[a, bj.

^{We permit}

x

to be complex-valued. Let eight real numbers

Qij, f3ij

be specified 1 :::; i, j :::; 2. Assume that

p(a)(f311f322 - f312(321)

⁼

p(b)(QJlQ22 - Q12Q2d

Let X be the subspace of

L2[a,

^bjconsisting of all twice continuously differen

tiable functions x such that

Qllx(a)

⁺

Ql2x'(a)

⁺

f3JJx(b)

⁺

f312X'(b)

⁼

^a Q21x(a)

⁺

Q22x'(a)

⁺

f321X(b)

⁺

f322X'(b)

⁼

^a

Assume also that

f3J1f322

^f;

f312f321

^or

QIl Q22

^f;

Q12Q21'

Theorem 1 .

operator on

Under the preceding hypotheses, A is a Hermitian Proof.

Let

x, y

E X. We want to prove that

(Ax, y)

⁼

(x, Ay).

We compute

(Ax, y) - (x, Ay)

⁼

lb[yAX - xAyj

⁼

lb [y(px')'

⁺

yqx - x(py')' - xqy]

= lb [y(px')' - x(pY)'J lb

a [y(px')'

⁺

y'px' - x(pY)' - x'py'J

lb [px'y - pxy'J'

⁼

[px'y -pxy']!

p(b) [x'(b)y(b) - x(b)fi'(b)] -p(a) [x'(a)y(a) - x(a)y'(a)]

-p(b) [det w(b)J

⁺

p(a}[detw(a)]

106 Chapter

Hilbert Spaces where w(t) is the Wronski matrix

Put also

[ ^{x(t) y(t)} ]

w(t) = x'(t) 'fI(t)

Our hypothesis on p is that p(a) det ,8 = p(b) det o. The fact that x, y E

gives us o w(a)

,8 w(b)

O. This yields (det o)[det w(a)J = (det ,8)[det w(b)J.

Note t.hat det( -,8) = det(,8) because ,8 is of even order. Multiplying this by p(b) gives us p(b) det 0 det w(a)

p(b) det ,8 det w(b). By a previous equation, this is p(a) det ,8 det w(a)

⁼

p(b) det ,8 det w(b). If det ,8 i- 0, we have p(b) det w(b) =

p(a) det w(a). If det 0 i- 0, a similar calculation can be used.

• Lemma.

A second-order linear differential equation

a(t)x"(t)

b(t)x'(t)

c(t)x(t) = ^d(t) (a :::::; t :::::; b)

can be put into the form of a Sturm-Liouville equation (px')'

qx = ^f,

provided that the functions a, b, c are continuous and a(t) i- 0 for all t in the interval [a, bJ.

Proof.

We transform the equation ax"

bx'

cx = d by multiplying by the integrating factor � ^exp J (b(t)ja(t)) dt. Thus

or

Let

x"ef bla + (bja)x'ef bla + (cja)xef bla = ^(dja)ef ^bla

p

ef bla , q = (cja)efbla

^•

Example 1.

If Ax

-x" (i.e., p(t) =

^{- 1}

^{and q(t)}

⁼

^0), what are the eigenvalues and eigenfunctions? The solutions to -x"

..\x are of the form CI sin ..;xt

C2 cos ..;xt. Hence every complex number ..\ is an eigenvalue, and

each eigenspace is of dimension 2.

•

Example 2.

Let Ax

-x" as before, but let the inner-product space be the subspace of L2[O,

^1T^J

consisting of twice continuously differentiable functions that satisfy x(O)

= X(1T)

=

o. The eigenvalues are n2 for n ⁼

0, 1 , 2, . .

, and the

eigenfunctions are sin n2t.

•

The next theorem illustrates one case of the Sturm-Liouville Problem. We take p(t)

1 in the differential equation and let ,8lJ = ^,812 = ⁰²¹ = ⁰²² = ^O.

We assume that 10ld

10121 > 0 and 1,82d

1,8221 >

It is left to Problem

Section

2.5

Sturm-Liouville Theory

107

to prove that the differential operator is Hermitian on the subspace of functions that satisfy the boundary conditions.

Our goal is to develop a method for solving the equation Ax

⁼

y, ^where y ^is

a given function, and x is to be determined. The plan of attack is to find a right inverse of A (say A B

⁼^I)

and to give x = By as the solution to the problem.

It will turn out that the spectral theorem is applicable to B.

( 1 ) (2) (3)

We assume that there exist functions u ^and v ^{such that} u" = qu

v" = qv ollv(a)

⁺

oI2v'(a)

⁼⁰

u'(a)v(a) - u(a)v'(a)

⁼¹

From

(3)

we see that u ¥

⁰

^and v ¥ ^O. The left side of

(3)

is the Wronskian of

u ând In practical terms, v evaluated at a. u ând v can be obtained by solving two initial-value problems. This is often done as follows. Find Uo ând Vo ^{such that}

Uo = quo

₁₁11

Vo = qvo uo(b) =

vo(a) =

⁰

u�(b)

⁼⁰

v�(a)

⁼¹

The Now we observe that for u ^and v required will then be suitable linear combinations of

all

s, Uo ^and Vo.

u'(s)v(s) - u(s)v'(s) =

This is true because the left side takes the value

at s = a and is constant.

Indeed,

- [u'v - uv'J ds d

⁼

u"v

⁺

u'v' - uv" - u'v'

⁼

quv - uqv

⁼⁰

Next we construct a function

called the

Green's function

for the prob

lem: _g(s,t)

₌

{ ^u(s)v(t)

v(s)u(t) a � t � s � b a � s � t � b

The operator A in this case is defined by

(4) Ax

⁼

x" - qx

and the domain of A is the closure in L21a, bJ of the set of all twice continuously differentiable functions x such that

Theorem 2.

A right inverse of A in Equation (4) is the operator B

defined by

(5)

(By)(s)

⁼

lb g(s, t)y(t) dt

108 Chapter

Hilbert Spaces

Proof.

It is to be proved that AB = I. Let

y

C[a,

^bJ^{and put}

x ⁼

^{By. We}

show first that A

x

⁼

y.

From t.he equation

we have

x(s)

⁼

lb ^9(S, ^{t)y(t) dt}

18 u(s)v(t)y(t) dt

⁺

J.b v(s)u(t)y(t) dt

u(s) 18 v(t)y(t) dt

⁺

v(s) J.b u(t)y(t) dt x'(s) ⁼ u'(s) l' v(t)y(t) dt

⁺

u(s)v(s)y(s)

v'(s) J.b u(t)y(t) dt - v(s)u(s)y(s)

u'(s) l' v(t)y(t) dt

⁺

v'(s) J.h u(t)y(t) dt

Another differentiation gives us

x"(s)

⁼

u"(s) l' v(t)y(t) dt

⁺

u'(s)v(s)y(s)

⁺

v"(s) J.b u(t)y(t) dt - v'(s)u(s)y(s

q(s)u(s) l' v(t)y(t) dt

⁺

q(s)v(s) J.b u(t)y(t) dt

⁺

y(s) [u'(s)v(s) - u(s)v'(s))

q(s)x(s)

⁺

y(s)

In the last step, the constant value of the Wronskian was substituted. Our calculation shows that

x" - qx

⁼

y

^{or A}

x

⁼

y,

as asserted. Hence AB

= I.

It remains to prove that

x

E X, i.e., that

x

satisfies the boundary conditions.

We have, from previous equations,

x(a)

⁼

v(a) lb u(t)y(t) dt

⁼

cv(a)

and

x'(a)

⁼

v'(a) lb u(t)y(t) dt

⁼

cv'(a)

Hence

QlJx(a)

⁺

Q12x'(a)

⁼

QlJcv(a)

⁺

Q12cv'(a)

⁼

⁰

Similarly we verify that

• Remark. If it is known that the homogeneous boundary-value problem has only t.he trivial solution, then B is also a

left

inverse of A. In order to verify this,

Section

^2.5

Sturm-Liouville Theory 109

let x

^{E X,}^Y

= Ax, ^and B y = z. The previous theorem shows that y = ABy = Az ^{and that} z

^{E X.}

^Hence x- z

^{E X}

^and A{x- z) =

o. It follows that

x -

= 0,

so that x = By = BAx.

Remark.

The operator B in the previous theorem is Hermitian, because (by Problem 9)

⁹

satisfies the equation

g{s, t) = g{t, s)

Now we apply the Spectral Theorem to the operator B. Notice that B ^is

compact by Theorem

in Section 2.3, ^page

^86.

There exist an orthonormal sequence [un] ⁱⁿ L2[a,

^b]

and real numbers An ^{such that}

By = n=) I >n(y,Un)Un

Since BUk ⁼ Akuk, ^{we have} Uk ⁼ AkAuk, ^and Uk satisfies the boundary con

ditions. This equation shows that Uk is an eigenvector of A corresponding to the eigenvalue 1/ Ak. ^Since Ak

^�

^{0, 1/} Ak

^�^00.

Consequently, a solution to the problem Ax

⁼

y, ^where y is given and x must satisfy the boundary conditions,

is

x = By

⁼

n=) ^L An(y,un)un

Example 3.

Consider the boundary-value problem

Ax

⁼⁼

x" + x =

x'{O) = .T(-rr) = 0

We shall solve

^it

by means of a Green's function. For the functions U ^and v ^we

can take u { t) = ^sin t ^and v{ t) = ^cos t. In this case the Green's function is

{ ^sin ^s ^cos t 0 � t � s � rr g{s, t) = cos s sin t O � s � t � rr

The compact Hermitian integral operator B is given by

(By)(s) = ^sin s 1s ^cos t y{t) dt + ^cos s 1"" ^sin ^{t y{t) dt}

^•

Example 4.

Let us solve the problem in Example 3 by using the Spectral Theorem. The eigenvalues and eigenvectors of the differential operator A ^are

obtained by solving x" +x =

^J-LX.

The general solution of the differential equation is x{t) =

^c)

^sin ^� t +

² ^co

^� t

Imposing the conditions x'{O) = x{rr) = 0, we find that the eigenvalues are J-Ln = 1

^{(n - 4 )2} and the eigenfunctions are vn(t)

⁼

^cos {2 ⁿ - l)t/2. ^The Vn ^{are also}

eigenfunctions of B, corresponding to eigenvalues An ^{= 1/} J-Ln ⁼ ^{(n - n2} ⁺ ^{�) -)} ^.

1 10 Chapter

Hilbert Spaces

Observe that the eigenfunctions [QnvnJ is an orthonormal system, and the spectral resolution of Vn are not of unit norm. If Qn ⁼ B 1/lIvnll, is ^then

By = n=1 L

An(Y, Qnvn)(Qnvn)

A computation reveals that Qn

⁼

(2/,11")1/2. Hence we can write

By = (2/1r) n=1 ^L

An(Y, Vn)Vn

Use of this formula is equivalent to the traditional method for solving the boundary-value problem

x" + x = y x'(O) = x(1r) = 0

2n - 1

The traditional method starts with the functions vn(t)

⁼

^cos -2- ^t, ^which

satisfy the boundary conditions. Then we build a function of the form x =

L�=1 cnVn· This also satisfies the boundary conditions. We hope that with a correct choice of the coefficients we will have Ax = y. Since A Vn

⁼

J-LnVn, ^this

equation reduces to L�=1 CnJ-LnVn ⁼ ^y. To discover the values of the coefficients, take the inner product of both sides with Vm:

By orthogonality, we get CmJ-LmQ;;,2 ⁼ ^(y, vm) ^and Cm ^{= (y,} Vm)J-L;;;IQ�.

Notice that Theorem 2 has given us an alternative method for solving the inhomogeneous boundary-value problem. Namely, we simply use the Green's function to get x:

x(s) = (By)(8) = 1" g(8, t)y(t) dt

•

Our next task is to find out how to determine a Green's function for the more general Sturm-Liouville problem. The differential equation and its boundary conditions are as follows:

(6)

{ Ax = (px')' + qx = y x E C2[a, b]

Ql1x(a) + QI2x'(a) ⁺ .Bllx(b) ⁺ .B12X'(b) ⁼ ⁰

Q21x(a) + Q22x'(a) ⁺ .B21X(b) ⁺ .B22X'(b) ⁼ ⁰

We are looking for a function

⁹

defined on [a, b]

[a, bJ. As usual, the t-sections

of

are given by gt(8) = g(8, t).

Section

^2.5

Sturm-Liouville Theory

Theorem 3.

The Green's function for the above problem is charac- terized by these five properties:

( i ⁾

⁹

is continuous in [a, b]

[a, b].

( .. ) 8g . .

¹¹

8s IS continuous In . a < s < t < b ^{an In} ^{d '} a < t < s < . b

(i ii ⁾ For each t, gt satisfies the boundary conditions.

( iv ⁾ Agt = 0 in the two open triangles described in ⁽ ii ⁾ .

. 8g . 8g

( v ⁾ t.1. s s ^hm -8 (s,t) - hm -8 (s,t) = -ljp(s). t t s s

Proof.

As in the previous proof, we take y ^E eta, b] and define

x(s) = 1b g(s, t)y(t) dt

1 1 1

It is to be shown that x is in the domain of A ^{and that} Ax = y. The domain of A

consists of twice-continuously differentiable functions that satisfy the boundary conditions. Let us use ' to denote partial differentiation with respect to s. ^Since

x(s) = 1s g(s, t)y(t) dt + J.b g(s, t)y(t) dt

we have ( as in the previous proof )

x'(s) = 1s g'(s,t)y(t)dt + J.b g'(s,t)y(t)dt

It follows that

x(a) = 1b g(a, t)y(t) dt x'(a) = 1b g'(a, t)y(t) dt

x(b) = 1b g(b,t)y(t)dt x'(b) = 1b g'(b,t)y(t)dt

Any linear combination of x(a), x(b), x'(a), ^and x'(b) is obtained by an integra

tion of the same linear combination of g(a,t), g(b,t), g'(a,t), ^and g'(b,t). ^Since gt satisfies the boundary conditions, so does x. We now compute x"(s) ^{from the}

equation for x'(s):

x"(s) = g'(s, s-)y(s) + 1s g"(s, t)y(t) dt - g'(s, s+)y(s) + J.b g"(s, t)y(t) dt

= y(s)jp(s) + 1b g"(s, t)y(t) dt

Here the following notation has been used:

g'(s,s+) = t .1. s ^lim g'(s,t) g'(s,s-) = t t s ^lim g'(s,t)

1 12

Chapter

Hilbert Spaces Now it is easy to verify that Ax

⁼

y. ^{We have}

Hence

Ax

⁼

(px')'

⁺

qx

⁼

p'x'

⁺

px" + qx

(Ax)(s)

⁼

p'(s) lb g'(s, t)y(t) dt

⁺

y(s) + p(s) lb g"(s, t)y(t) dt

q(s) lb g(s, t)y(t) dt

y(s) + lb [(p(s)g'(s, t))' + q(s)g(s, t)Jy(t) dt

⁼

y(s)

because l is a solution of the differential equation.

Example 5.

Find the Green's function for this Sturm-Liouville problem:

x"

⁼

y x(O)

⁼

x'(O)

^{= 0}

•

The preceding theorem asserts that gt should solve the homogeneous differential equation in the intervals 0 < s ^< t ^{< 1} ^and ^{0 <} t ^< s < 1 . Furthermore, gt

should be continuous, and it should satisfy the boundary conditions. Lastly,

g'( s, t) should have a jump discontinuity of magnitude -1 as t passes through the value s. One can guess that

is given by

g(s, t)

⁼

{o O � s � t � l s - t O � t � s � l

If we proceed systematically, it will be seen that this is the only solution. In the triangle 0 < s ^< t ^{< 1,} Agt

= 0, and therefore

gt must be a linear function of s.

We write g(s, t)

⁼

a(t)

⁺

b(t)s. ^Since gl must satisfy the boundary conditions, we have g(O, t)

⁼

(oglos)(O, t)

⁼

O. ^Thus a(t)

⁼

b(t)

⁼

⁰ ^and g(s, t)

⁼

⁰ ⁱⁿ

this triangle. In the second triangle, 0 < t ^< s ^<

^1.

^{Again l} must be linear, and we write g( s, t)

⁼

a( t)

⁺

,B( t )s. Continuity of

on the diagonal implies that

a(t)

⁺

,B(t)t

⁼

^0, and we therefore have g(s,t)

⁼

-,B(t)t

⁺

,B(t)s

⁼

,B(t)(s - t).

The condition (oglos)(s,s+) - (oglos)(s, s-)

⁼

-lip leads to the equation

o ^{- ,B(t)}

⁼

^-1. ^Hence ^{g(s, t)}

⁼

^{s - t} in this triangle. The solution to the inhomogeneous boundary-value problem x"

⁼

y is therefore given by

x(s)

⁼

1s (s - t)y(t) dt

Example

6. Find the Green's function for the problem We tentatively set

(7)

x" - x'

^-^2x⁼

y

{ ^u(s)v(t)

g(s, t)

⁼

v(s)u(t)

x(O)

⁼

⁰

⁼

x(l) O � s � t � l O � t � s � l

•

Section

^2.5

Sturm-Liouville Theory ¹¹³

and try to determine the functions

and

^v.

The homogeneous differential equa

tion has as its general solution the function

x(s) = o:e-s + (3e2s

The solution satisfying the condition

^x(O)⁼

0 ^is

The solution satisfying the condition

^{x ( l )}

= 0 ^is

With these choices, the function

⁹

in Equation

(7)

satisfies the first four require

ments in Theorem 3. With a suitable choice of the parameters

^0:

and

^(3,

the fifth requirement can be met as well. The calculation produces the following equation involving the Wronskian of

and

^v:

g'(s, s+) - g'(s, s-) = u'(s)v(s) - v'(s)u(s)

o:{3(3

^{- 3e3)eS}

In this problem, the function

is

^p(^s)= e-s,

because

x"(s) - x' (s)

=

(e-Sx'(s)) '

Hence condition 5 in Theorem 3 requires us to choose

^0:

and {3 ^{such that} o:{3

⁼

- (3 - 3e3)- 1 :::::: .0

0

¹⁷⁴⁶

⁵

^Then

Example 'T.

Find the Green's function for this Sturm-Liouville problem:

x" + 9x = y ^x(O)= x(7r/2) ⁼

0

•

According to the preceding theorem,

⁹

should be a continuous function on the square 0 ^:s:;

^s,

t :s:; 7r /2, and

^gt

should solve the homogeneous problem in the intervals 0 ^:s:;

:s:; t and t :s:;

:s:; 7r/2. Finally,

^ag/as

should have a jump of magnitude -1 as t increases through the value

^s.

These considerations lead us to define {

^� sin 3s cos 3t o :s:;

:s:; t :s:; 7r /2

g(s,

t) =

-� cos3ssin 3t

^{O :S:;}

t :s:;

^{s :S:;}

7r/2

Problems 2.5

•

1 . Find the eigenvalues and eigenfunctions for the Sturm-Liouville operator when p = q = 1

and

a = [6 �]

⁼^t3

114 Chapter

Hilbert Spaces

2. Prove that an operator of the form

(Ax)(s) =

lb

k(s, t)x(t) dt is Hermitian if and only if k(s, t) = k(t,s).

3. Find the Green's function for the problem

x" - 3x' + 2x = y x(O) = 0 = x(l) 4. Find the Green's function for the problem

x" - 9x = y x(O) = 0 = x(l) 5. Prove that if u and v are in el!a, bJ, then the function

{

u(s)v(t) a � s � t � b g(." t) =

v(s)u(t) a � t � s � b has properties (i) and (ii) mentioned in Theorem 3.

6. (Continuation) Show that if

pv2(ulv)' = - 1 then 9 (in Problem 5 ) will have property (v) in Theorem 3.

8. Prove that if p = I in the Sturm-Liouville problem and fil l = fi12 = 021 = 022 = 0 then A is Hermitian.

9. Prove that the function 9 in Equation (4) is symmetric: g(s,t) = g(t, s). 10. Let Ax = (px')' - qx. Prove Lagrange's identity:

xAy - yAx = [p(xy' - yx')J' 1 1 . (Continuation) Prove G reen's formula:

lb

(xAy - yAx) = p(xy' - yx')I�

12. Show that the Wronskian for any two solutions of the equation (px' )' - qx = 0 is a scalar multiple of l ip, and so is either identically zero or never zero. ( Here we assume p(t) '" 0 for a � t � b.)

13. Find the eigenvalues and eigenfunctions for the operator A defined by the equation Ax = -x" +2x' -x. Assume that the domain of A is the set of twice continuously differentiable functions on [0, IJ that have boundary values x(O) = x(l) = o.

Chapter 3 Calculus in Banach Spaces

3.1 The Frechet Derivative

1 15

3.2

The Chain Rule and Mean Value Theorems

121 3.3

Newton's Method

125

3.4

Implicit Function Theorems

135

3.5

Extremum Problems and Lagrange Multipliers

4

⁵

3.6

The Calculus of Variations

152

3.1 The Frechet Derivative

In this chapter we develop the theory of the derivative for mappings between Banach spaces. Partial derivatives, Jacobians, and gradients are all examples of the general theory, as are the Gateaux and Frechet differentials. Kantorovich's theorem on Newton's method is proved. Following that there is a section on implicit function theorems in a general setting. Such theorems can often be used to prove the existence of solutions to integral equations and other similar problems. Another section, devoted to extremum problems, illustrates how the methods of calculus (in Banach spaces) can lead to solutions. A section on the

"calculus of variations" closes the chapter.

The first step is to transfer, with as little disruption as possible, the ele

mentary ideas of calculus to the more general setting of a normed linear space.

Definition.

Let /

D

^-+

^Y be a mapping from an open set D in a normed linear space

into a normed linear space Y. Let x E D. If there is a bounded linear map A

^:^X^-+

Y such that

( 1 )

. II/(x

h) - /(x)

Ahll

l� I/h/l

⁼

⁰

then / is said to be

Frechet differentiable

at x, or simply

differentiable

at x. Furthermore, A is called the (Frechet)

derivative

of f at x.

1 1 5

1 1 6

Chapter

Calculus in Banach Spaces

Them'em 1. I

f J is differentiable at x, then the mapping A ^{in the}

definition is uniquely defined. (It depends on x as well as J.)

Proof. Suppose that

AI

^and

A2

are two linear maps having the required prop

erty, expressed in Equation ^{( 1 ) .}Then to each

e ^> ⁰

there corresponds a

6 ^> ⁰

such that

IIJ(x + h) - J(x) - Aihll

ellhll (i = 1, 2)

whenever

Ilhll

6.

By the triangle inequality,

II A

h - A2hll

2ellhll

^whenever

II hl!

6.

^Since

^AI - A2

is homogeneous, the preceding inequality is true for all

h.

^Hence

IIAI - A211

^�

2e.

^Since

e

was arbitrary,

HAl - A211 = o.

^•

Notation. If

J

is different.iable at

x,

its derivative, denoted by

A

^{in the}

definition, will usually be denoted by

1'(x).

Notice that with this notation

1'(x)

^E^C(X.

^Y).

^{This is}^NOTthe same as saying

l'

E C(X,

Y).

It will be necessary to dist.inguish carefully between

l'

and

1'(x).

Theorem 2. If

J is bounded in a neighborhood of x and if a linear map A has the property in Equation (1), ^then A is a bounded linear map; in other words, A is the Fn§chet derivative of J ^at ^x.

Proof. Choose

6 ^> ⁰

so that whenever

Ilhll

^�

6

we will have

IIJ(x + h)1I

^�

M

^and

IIJ(x + h) - J(x) - Ahll

^�

Ilhl!

Then for

IIhll

^�

6

^{we have}

IIAhl1

^�

2M + IIhll

^�

2M +6.

^For

lIull

^�

^1, 116ull

^�

6,

whence

IIA(6u)11

^�

2M + 6.

^Thus

IIAII

^�

(2M + 6)/6.

^•

Example ^1. Let X

= Y

⁼JR. Let

J

be a function whose derivative (in the elementary sense) at

x

^is

A.

Then the Frechet derivative of

J

^at

^x

is the linear map

h

^>--r

Ah,

^because

lim

IJ(x + h) - ^J(x) - Ahl

⁼ ^lim

I J(X + h) - J(x)

A I

^{= 0}

h--+O

Ihl

^h--+O

h

Thus, the terminology adopted here is slightly different from the elementary

notion of derivative in calculus. •

Example 2. Let X and

Y

be arbitrary normed linear spaces. Define

J

^:^X^-+

Y

J(x)

⁼^Yo,^where^Yois a fixed element of

Y.

(Naturally, such an

J

^{is called}

a constant map.) Then

1'(x)

⁼^O.(This is the

0

element of C(X,

Y).)

• Example ^3. Let

J

be a bounded linear map of X into

Y.

Then

1'(x)

⁼

J.

Indeed,

IIJ ^(x + h) - J(x) - J(h)1I ⁼

^O.Observe that the equation

l'

₌

J

^is

^not

t.rue. This illustrates again the importance of distinguishing carefully between

1'(x)

^and

1'.

^•

Section ^{3. 1} The Frechet Derivative

Theorem 3.

If f is differentiable at x, then it is continuous at x.

1 17

Proof.

Let A ⁼ f'(x). ^Then A ^E

^.C(X,

^Y). ^Given ^{c > 0,} ^{select 15} ^{> 0} ^{so that}

15 <

e/(l + IIAII) and so that the following implication is valid:

IIhll

⁰

^�

IIf(x + h) - f(x) - Ahll/"hll

¹

Then for Ilh"

^<^15,

we have by the triangle inequality

Ilf(x + h) - f(x)"

^�

IIf(x + h) - f(x) - Ahll + IIAhl/

< IIhll + I/Ahll

^�

Ilhll + IIAll llh/!

< 15

( ¹ + IIAID

^<^e ^•

Example 4.

Let

= Y

⁼

e[O, 1 J and let </>

^: ^IR^--+ ^IR

be continuously differentiable. Define element of e[o, 1J. What is f ^:

^X^--+

f'(x)? ^Y by the equation To answer this, we undertake a calculation f(x)

⁼

^</>

⁰

x, ^where x ^{is any}

of f(x + h) - f(x), using the classical mean value theorem:

[J(x + h) - f(x)1(t) ⁼ </>(x(t) + h(t)) - </>(x(t))

⁼

</>' ⁽ x(t) + O(t)h(t) ⁾ h(t)

where 0

O(t)

^1. This suggests that we define A ^by Ah

⁼

(</>' ox)h

With this definition, we shall have at every point t,

[f(x + h) - f(x) - Ah](t) ⁼ </>' ⁽ x(t) + O(t)h(t) ⁾ h(t) - </>'(x(t))h(t)

Hence, upon taking the supremum norm, we have

I/f(x + h) - f(x) - Ahll

^�

II</>'

⁰

(x + Oh) - </>'

⁰

xl/ l1hl/

By comparing this to Equation ( 1 ) and invoking the continuity of </>', we see that

A is indeed the derivative of f ^at x. ^Hence f'(x) ⁼ </>'

⁰

x.

^•

Theorem 4.

Let f

^:^IRn^--+ ^IR.

^If each of the partial derivatives D;/

⁽

⁼

f ^/

x

) exists in a neighborhood ofx and is continuous at x

then f'(x) exists, and a formula for it is

J'(x)h

⁼

^L

_i=1n

^D

^/ (x)

^·

hi

Speaking loosely, we say that the Frechet derivative of f is given by the gradient of f.

Proof.

We must prove that

Dalam dokumen Analysis for Applied Mathematics (Halaman 111-136)

A

B

P

B

p-I AP.

L

104 Chapter

Hilbert Spaces

[UI

[VI

(8) Lx = :L>i(X,Vi)Vi i=1

A

P

LUj = Laij Ui ( 1

,=1 i=1

)

A

P

/

)

(J)jk = Ojk = (Vk,Vj) = \ 8PikU;, �Prj Ur = 8�PikPrj(Ui,ur)

= LPikPij = L(P")ji(P)ik = (P"P)jk i=1 i=1

( 10) P"P = I

PP* = I.)

A

A

A = A'

A

[UI,'

LUj = LaijUi i=1

=

(t Cj Uj) = t t Cj aij Ui

=

A

A * = A

Every complex Hermitian matrix is unitarily similar to

real diagonal matrix.

Section

Sturm-Liouville Theory

(Ax)(t)

[P(t)x'(t)j'

q(t)x(t)

Ax

(px')'

qx

x

p

q

x, p,

q

[a, bj.

x

Qij, f3ij

p(a)(f311f322 - f312(321)

p(b)(QJlQ22 - Q12Q2d

L2[a,

Qllx(a)

Ql2x'(a)

f3JJx(b)

f312X'(b)

a Q21x(a)

Q22x'(a)

f321X(b)

f322X'(b)

a

f3J1f322

f312f321

QIl Q22

Q12Q21'

operator on

Under the preceding hypotheses, A is a Hermitian Proof.

x, y

(Ax, y)

(x, Ay).

(Ax, y) - (x, Ay)

lb[yAX - xAyj

lb [y(px')'

yqx - x(py')' - xqy]

= lb [y(px')' - x(pY)'J lb

LUj = Laij Ui ^{( 1}

(J)jk = Ojk = (Vk,Vj) = ^\ 8PikU;, �Prj Ur = 8�PikPrj(Ui,ur)

( 10) P"P = ^I

PP* = ^I.)

A = ^A'

(t ^{Cj Uj) =} t t ^{Cj aij Ui}

A * ^{= A}

^a Q21x(a)

^a

[ ^{x(t) y(t)} ]

c(t)x(t) = ^d(t) (a :::::; t :::::; b)

qx = ^f,

cx = d by multiplying by the integrating factor � ^exp J (b(t)ja(t)) dt. Thus

x"ef bla + (bja)x'ef bla + (cja)xef bla = ^(dja)ef ^bla

^{and q(t)}

^0), what are the eigenvalues and eigenfunctions? The solutions to -x"

1 in the differential equation and let ,8lJ = ^,812 = ⁰²¹ = ⁰²² = ^O.

y, ^where y ^is

We assume that there exist functions u ^and v ^{such that} u" = qu

^and v ¥ ^O. The left side of