Any two matrices
A
andB
are said to be similar to each other if there exists a nonsingular matrixP
such thatB
=p-I AP.
The matrices for a given linear t.ransformation relative to different ordered bases form an equivalence class under the similarity relation. What is the simplest matrix that can be obtained for a linear transformation by changing the basis? This is a difficult question, to which one answer is provided by the Jordan canonical form. Another answer can be given in the context of a finite-dimensional Hilbert space when the linear transformationL
is Hermitian.104 Chapter
2Hilbert Spaces
Let
[UI
\ . .. , un] be an ordered orthonormal basis for the n-dimensional Hilbert space X. Let L be Hermitian. The spectral theorem asserts the existence of an ordered orthonormal basis
[VI
\ . . . ,vnJ and an n-tuple of real numbers (AI , . . . , An) such that(8) Lx = :L>i(X,Vi)Vi i=1
n As above, we introduce matrices nA
andP
such that(9)
LUj = Laij Ui ( 1
:::; j :::; n),=1 i=1
The matrix B that represents L relative to the v-basis is the diagonal matrix diag(AI , . . . , An
)
, as we see from Equation (8). Thus from Equation (7) we conclude thatA
is similar to a diagonal matrix having real entries. More can be said, however, becauseP
has a special structure. Notice that/
n n)
n n(J)jk = Ojk = (Vk,Vj) = \ 8PikU;, �Prj Ur = 8�PikPrj(Ui,ur)
n n
= LPikPij = L(P")ji(P)ik = (P"P)jk i=1 i=1
This shows that
( 10) P"P = I
(It follows by elementary linear algebra that
PP* = I.)
Matrices having the property ( 10) are said to be unitary. We can therefore state that the matrixA
(representing the Hermitian operator L with respect to an orthonormal base) is unitarily similar to a real uiagonal matrix.Finally, we note that if an n x n complex matrix
A
is such thatA = A'
, thenA
is the matrix of a Hermitian transformation relative to an orthonormal basis.Indeed, we have only to select any orthonormal base
[UI,'
. . , unJ and define L byThen, of course,
LUj = LaijUi i=1
nLx
=
L(t Cj Uj) = t t Cj aij Ui
By straightforward calculation we have
(Lx, y)
=
(x, Ly)A matrix
A
satisfyingA * = A
is said to be Hermitian. We have proved therefore the following important result, regarded by many as the capstone of elementary matrix theory:Theorem.
Every complex Hermitian matrix is unitarily similar to
areal diagonal matrix.
Section
2.5Sturm-Liouville Theory
105 2.5 Sturm-Liouville TheoryIn this section differential equations are attacked with the weapons of Hilbert space theory. Recall that in elementary calculus we interpret integration and differentiation as mutually inverse operations. So it is here, too, that dif
ferential operators and integral operators can be inverse to each other. We find that a differential operator is usually ill-behaved, whereas the corresponding in
tegral operator may be well-behaved, even to the point of being compact. Thus, we often try to recast a differential equation as an equivalent integral equation, hoping that the transformed problem will be less troublesome. (This theme will reappear many times in Chapter 4.) This strategy harmonizes with our general impression that differentiation emphasizes the roughness of a function, whereas integration is a smoothing operation, and is thus applicable to a broader class of functions.
Definition. The Sturm-Liouville operator is defined by
(Ax)(t)
=[P(t)x'(t)j'
+q(t)x(t)
i.e.,Ax
=(px')'
+qx
where
x
is two-times continuously differentiable,p
is real-valued and continuously differentiable, and
q
is real-valued continuous. The domain of the functionsx, p,
andq
is an interval[a, bj.
We permitx
to be complex-valued. Let eight real numbersQij, f3ij
be specified 1 :::; i, j :::; 2. Assume thatp(a)(f311f322 - f312(321)
=p(b)(QJlQ22 - Q12Q2d
Let X be the subspace of
L2[a,
bj consisting of all twice continuously differentiable functions x such that
Qllx(a)
+Ql2x'(a)
+f3JJx(b)
+f312X'(b)
=a Q21x(a)
+Q22x'(a)
+f321X(b)
+f322X'(b)
=a
Assume also that
f3J1f322
f;f312f321
orQIl Q22
f;Q12Q21'
Theorem 1 .
operator on
X.Under the preceding hypotheses, A is a Hermitian Proof.
Letx, y
E X. We want to prove that(Ax, y)
=(x, Ay).
We compute(Ax, y) - (x, Ay)
=lb[yAX - xAyj
=lb [y(px')'
+yqx - x(py')' - xqy]
= lb [y(px')' - x(pY)'J lb
=
a [y(px')'
+y'px' - x(pY)' - x'py'J
=
lb [px'y - pxy'J'
=[px'y -pxy']!
=
p(b) [x'(b)y(b) - x(b)fi'(b)] -p(a) [x'(a)y(a) - x(a)y'(a)]
=
-p(b) [det w(b)J
+p(a}[detw(a)]
106 Chapter
2Hilbert Spaces where w(t) is the Wronski matrix
Put also
[ x(t) y(t) ]
w(t) = x'(t) 'fI(t)
Our hypothesis on p is that p(a) det ,8 = p(b) det o. The fact that x, y E
Xgives us o w(a)
+,8 w(b)
=O. This yields (det o)[det w(a)J = (det ,8)[det w(b)J.
Note t.hat det( -,8) = det(,8) because ,8 is of even order. Multiplying this by p(b) gives us p(b) det 0 det w(a)
=p(b) det ,8 det w(b). By a previous equation, this is p(a) det ,8 det w(a)
=p(b) det ,8 det w(b). If det ,8 i- 0, we have p(b) det w(b) =
p(a) det w(a). If det 0 i- 0, a similar calculation can be used.
• Lemma.A second-order linear differential equation
a(t)x"(t)
+b(t)x'(t)
+c(t)x(t) = d(t) (a :::::; t :::::; b)
can be put into the form of a Sturm-Liouville equation (px')'
+qx = f,
provided that the functions a, b, c are continuous and a(t) i- 0 for all t in the interval [a, bJ.
Proof.
We transform the equation ax"
+bx'
+cx = d by multiplying by the integrating factor � exp J (b(t)ja(t)) dt. Thus
or
Let
x"ef bla + (bja)x'ef bla + (cja)xef bla = (dja)ef bla
p
=ef bla , q = (cja)efbla
•Example 1.
If Ax
=-x" (i.e., p(t) =
- 1and q(t)
=0), what are the eigenvalues and eigenfunctions? The solutions to -x"
=..\x are of the form CI sin ..;xt
+C2 cos ..;xt. Hence every complex number ..\ is an eigenvalue, and
each eigenspace is of dimension 2.
•Example 2.
Let Ax
=-x" as before, but let the inner-product space be the subspace of L2[O,
1TJconsisting of twice continuously differentiable functions that satisfy x(O)
= X(1T)=
o. The eigenvalues are n2 for n =0, 1 , 2, . .
., and the
eigenfunctions are sin n2t.
•The next theorem illustrates one case of the Sturm-Liouville Problem. We take p(t)
=1 in the differential equation and let ,8lJ = ,812 = 021 = 022 = O.
We assume that 10ld
+10121 > 0 and 1,82d
+1,8221 >
o.It is left to Problem
8Section
2.5Sturm-Liouville Theory
107to prove that the differential operator is Hermitian on the subspace of functions that satisfy the boundary conditions.
Our goal is to develop a method for solving the equation Ax
=y, where y is
a given function, and x is to be determined. The plan of attack is to find a right inverse of A (say A B
= I)and to give x = By as the solution to the problem.
It will turn out that the spectral theorem is applicable to B.
( 1 ) (2) (3)
We assume that there exist functions u and v such that u" = qu
v" = qv ollv(a)
+oI2v'(a)
= 0u'(a)v(a) - u(a)v'(a)
= 1From
(3)we see that u ¥
0and v ¥ O. The left side of
(3)is the Wronskian of
u and In practical terms, v evaluated at a. u and v can be obtained by solving two initial-value problems. This is often done as follows. Find Uo and Vo such that
Uo = quo
11 11Vo = qvo uo(b) =
1vo(a) =
0u�(b)
= 0v�(a)
= 1The Now we observe that for u and v required will then be suitable linear combinations of
alls, Uo and Vo.
u'(s)v(s) - u(s)v'(s) =
1This is true because the left side takes the value
1at s = a and is constant.
Indeed,
- [u'v - uv'J ds d
=u"v
+u'v' - uv" - u'v'
=quv - uqv
= 0Next we construct a function
9called the
Green's functionfor the prob
lem: g(s,t)
={ u(s)v(t)
v(s)u(t) a � t � s � b a � s � t � b
The operator A in this case is defined by
(4) Ax
=x" - qx
and the domain of A is the closure in L21a, bJ of the set of all twice continuously differentiable functions x such that
Theorem 2.
A right inverse of A in Equation (4) is the operator B
defined by
(5)
(By)(s)
=lb g(s, t)y(t) dt
108 Chapter
2Hilbert Spaces
Proof.
It is to be proved that AB = I. Lety
EC[a,
bJ and putx =
By. Weshow first that A
x
=y.
From t.he equationwe have
x(s)
=lb 9(S, t)y(t) dt
=
18 u(s)v(t)y(t) dt
+J.b v(s)u(t)y(t) dt
=
u(s) 18 v(t)y(t) dt
+v(s) J.b u(t)y(t) dt x'(s) = u'(s) l' v(t)y(t) dt
+u(s)v(s)y(s)
+
v'(s) J.b u(t)y(t) dt - v(s)u(s)y(s)
=
u'(s) l' v(t)y(t) dt
+v'(s) J.h u(t)y(t) dt
Another differentiation gives us
x"(s)
=u"(s) l' v(t)y(t) dt
+u'(s)v(s)y(s)
+v"(s) J.b u(t)y(t) dt - v'(s)u(s)y(s
=
q(s)u(s) l' v(t)y(t) dt
+q(s)v(s) J.b u(t)y(t) dt
+y(s) [u'(s)v(s) - u(s)v'(s))
=
q(s)x(s)
+y(s)
In the last step, the constant value of the Wronskian was substituted. Our calculation shows that
x" - qx
=y
or Ax
=y,
as asserted. Hence AB= I.
It remains to prove that
x
E X, i.e., thatx
satisfies the boundary conditions.We have, from previous equations,
x(a)
=v(a) lb u(t)y(t) dt
=cv(a)
and
x'(a)
=v'(a) lb u(t)y(t) dt
=cv'(a)
Hence
QlJx(a)
+Q12x'(a)
=QlJcv(a)
+Q12cv'(a)
=0
Similarly we verify that
• Remark. If it is known that the homogeneous boundary-value problem has only t.he trivial solution, then B is also a
left
inverse of A. In order to verify this,Section
2.5Sturm-Liouville Theory 109
let x
E X, Y= Ax, and B y = z. The previous theorem shows that y = ABy = Az and that z
E X.Hence x- z
E Xand A{x- z) =
o. It follows thatx -
z= 0,
so that x = By = BAx.
Remark.
The operator B in the previous theorem is Hermitian, because (by Problem 9)
9satisfies the equation
g{s, t) = g{t, s)
Now we apply the Spectral Theorem to the operator B. Notice that B is
compact by Theorem
5in Section 2.3, page
86.There exist an orthonormal sequence [un] in L2[a,
b]and real numbers An such that
00
By = n=) I >n(y,Un)Un
Since BUk = Akuk, we have Uk = AkAuk, and Uk satisfies the boundary con
ditions. This equation shows that Uk is an eigenvector of A corresponding to the eigenvalue 1/ Ak. Since Ak
�0, 1/ Ak
� 00.Consequently, a solution to the problem Ax
=y, where y is given and x must satisfy the boundary conditions,
is
00x = By
=n=) L An(y,un)un
Example 3.
Consider the boundary-value problem
Ax
==x" + x =
Yx'{O) = .T(-rr) = 0
We shall solve
itby means of a Green's function. For the functions U and v we
can take u { t) = sin t and v{ t) = cos t. In this case the Green's function is
{ sin s cos t 0 � t � s � rr g{s, t) = cos s sin t O � s � t � rr
The compact Hermitian integral operator B is given by
(By)(s) = sin s 1s cos t y{t) dt + cos s 1"" sin t y{t) dt
•Example 4.
Let us solve the problem in Example 3 by using the Spectral Theorem. The eigenvalues and eigenvectors of the differential operator A are
obtained by solving x" +x =
J-LX.The general solution of the differential equation is x{t) =
c)sin � t +
C2 co
S� t
Imposing the conditions x'{O) = x{rr) = 0, we find that the eigenvalues are J-Ln = 1
-(n - 4 )2 and the eigenfunctions are vn(t)
=cos {2 n - l)t/2. The Vn are also
eigenfunctions of B, corresponding to eigenvalues An = 1/ J-Ln = (n - n2 + �) -) .
1 10 Chapter
2Hilbert Spaces
Observe that the eigenfunctions [QnvnJ is an orthonormal system, and the spectral resolution of Vn are not of unit norm. If Qn = B 1/lIvnll, is then
By = n=1 L
00An(Y, Qnvn)(Qnvn)
A computation reveals that Qn
=(2/,11")1/2. Hence we can write
By = (2/1r) n=1 L
00An(Y, Vn)Vn
Use of this formula is equivalent to the traditional method for solving the boundary-value problem
x" + x = y x'(O) = x(1r) = 0
2n - 1
The traditional method starts with the functions vn(t)
=cos -2- t, which
satisfy the boundary conditions. Then we build a function of the form x =
L�=1 cnVn· This also satisfies the boundary conditions. We hope that with a correct choice of the coefficients we will have Ax = y. Since A Vn
=J-LnVn, this
equation reduces to L�=1 CnJ-LnVn = y. To discover the values of the coefficients, take the inner product of both sides with Vm:
By orthogonality, we get CmJ-LmQ;;,2 = (y, vm) and Cm = (y, Vm)J-L;;;IQ�.
Notice that Theorem 2 has given us an alternative method for solving the inhomogeneous boundary-value problem. Namely, we simply use the Green's function to get x:
x(s) = (By)(8) = 1" g(8, t)y(t) dt
•Our next task is to find out how to determine a Green's function for the more general Sturm-Liouville problem. The differential equation and its boundary conditions are as follows:
(6)
{ Ax = (px')' + qx = y x E C2[a, b]
Ql1x(a) + QI2x'(a) + .Bllx(b) + .B12X'(b) = 0
Q21x(a) + Q22x'(a) + .B21X(b) + .B22X'(b) = 0
We are looking for a function
9defined on [a, b]
x[a, bJ. As usual, the t-sections
of
9are given by gt(8) = g(8, t).
Section
2.5Sturm-Liouville Theory
Theorem 3.
The Green's function for the above problem is charac- terized by these five properties:
( i )
9is continuous in [a, b]
x[a, b].
( .. ) 8g . .
118s IS continuous In . a < s < t < b an In d ' a < t < s < . b
(i ii ) For each t, gt satisfies the boundary conditions.
( iv ) Agt = 0 in the two open triangles described in ( ii ) .
. 8g . 8g
( v ) t.1. s s hm -8 (s,t) - hm -8 (s,t) = -ljp(s). t t s s
Proof.
As in the previous proof, we take y E eta, b] and define
x(s) = 1b g(s, t)y(t) dt
1 1 1
It is to be shown that x is in the domain of A and that Ax = y. The domain of A
consists of twice-continuously differentiable functions that satisfy the boundary conditions. Let us use ' to denote partial differentiation with respect to s. Since
x(s) = 1s g(s, t)y(t) dt + J.b g(s, t)y(t) dt
we have ( as in the previous proof )
x'(s) = 1s g'(s,t)y(t)dt + J.b g'(s,t)y(t)dt
It follows that
x(a) = 1b g(a, t)y(t) dt x'(a) = 1b g'(a, t)y(t) dt
x(b) = 1b g(b,t)y(t)dt x'(b) = 1b g'(b,t)y(t)dt
Any linear combination of x(a), x(b), x'(a), and x'(b) is obtained by an integra
tion of the same linear combination of g(a,t), g(b,t), g'(a,t), and g'(b,t). Since gt satisfies the boundary conditions, so does x. We now compute x"(s) from the
equation for x'(s):
x"(s) = g'(s, s-)y(s) + 1s g"(s, t)y(t) dt - g'(s, s+)y(s) + J.b g"(s, t)y(t) dt
= y(s)jp(s) + 1b g"(s, t)y(t) dt
Here the following notation has been used:
g'(s,s+) = t .1. s lim g'(s,t) g'(s,s-) = t t s lim g'(s,t)
1 12
Chapter
2Hilbert Spaces Now it is easy to verify that Ax
=y. We have
Hence
Ax
=(px')'
+qx
=p'x'
+px" + qx
(Ax)(s)
=p'(s) lb g'(s, t)y(t) dt
+y(s) + p(s) lb g"(s, t)y(t) dt
+
q(s) lb g(s, t)y(t) dt
=
y(s) + lb [(p(s)g'(s, t))' + q(s)g(s, t)Jy(t) dt
=y(s)
because l is a solution of the differential equation.
Example 5.
Find the Green's function for this Sturm-Liouville problem:
x"
=y x(O)
=x'(O)
= 0•
The preceding theorem asserts that gt should solve the homogeneous differential equation in the intervals 0 < s < t < 1 and 0 < t < s < 1 . Furthermore, gt
should be continuous, and it should satisfy the boundary conditions. Lastly,
g'( s, t) should have a jump discontinuity of magnitude -1 as t passes through the value s. One can guess that
9is given by
g(s, t)
={o O � s � t � l s - t O � t � s � l
If we proceed systematically, it will be seen that this is the only solution. In the triangle 0 < s < t < 1, Agt
= 0, and thereforegt must be a linear function of s.
We write g(s, t)
=a(t)
+b(t)s. Since gl must satisfy the boundary conditions, we have g(O, t)
=(oglos)(O, t)
=O. Thus a(t)
=b(t)
=0 and g(s, t)
=0 in
this triangle. In the second triangle, 0 < t < s <
1.Again l must be linear, and we write g( s, t)
=a( t)
+,B( t )s. Continuity of
9on the diagonal implies that
a(t)
+,B(t)t
=0, and we therefore have g(s,t)
=-,B(t)t
+,B(t)s
=,B(t)(s - t).
The condition (oglos)(s,s+) - (oglos)(s, s-)
=-lip leads to the equation
o - ,B(t)
=-1. Hence g(s, t)
=s - t in this triangle. The solution to the inhomogeneous boundary-value problem x"
=y is therefore given by
x(s)
=1s (s - t)y(t) dt
Example
6. Find the Green's function for the problem We tentatively set
(7)
x" - x'
-2x =y
{ u(s)v(t)
g(s, t)
=v(s)u(t)
x(O)
=0
=x(l) O � s � t � l O � t � s � l
•
Section
2.5Sturm-Liouville Theory 113
and try to determine the functions
uand
v.The homogeneous differential equa
tion has as its general solution the function
x(s) = o:e-s + (3e2sThe solution satisfying the condition
x(O) =0 is
The solution satisfying the condition
x ( l )= 0 is
With these choices, the function
9in Equation
(7)satisfies the first four require
ments in Theorem 3. With a suitable choice of the parameters
0:and
(3,the fifth requirement can be met as well. The calculation produces the following equation involving the Wronskian of
uand
v:g'(s, s+) - g'(s, s-) = u'(s)v(s) - v'(s)u(s)
=
o:{3(3
- 3e3)eSIn this problem, the function
pis
p( s) = e-s,because
x"(s) - x' (s)=
(e-Sx'(s)) 'Hence condition 5 in Theorem 3 requires us to choose
0:and {3 such that o:{3
=- (3 - 3e3)- 1 :::::: .0
0
17465
.Then
Example 'T.
Find the Green's function for this Sturm-Liouville problem:
x" + 9x = y x(O) = x(7r/2) =
0
•
According to the preceding theorem,
9should be a continuous function on the square 0 :s:;
s,t :s:; 7r /2, and
gtshould solve the homogeneous problem in the intervals 0 :s:;
s:s:; t and t :s:;
s:s:; 7r/2. Finally,
ag/asshould have a jump of magnitude -1 as t increases through the value
s.These considerations lead us to define {
-� sin 3s cos 3t o :s:;
s:s:; t :s:; 7r /2
g(s,
t) =
-� cos3ssin 3t
O :S:;t :s:;
s :S:;7r/2
Problems 2.5•
1 . Find the eigenvalues and eigenfunctions for the Sturm-Liouville operator when p = q = 1
and
a = [6 �]
= t3114 Chapter
2Hilbert Spaces
2. Prove that an operator of the form
(Ax)(s) =
lb
k(s, t)x(t) dt is Hermitian if and only if k(s, t) = k(t,s).3. Find the Green's function for the problem
x" - 3x' + 2x = y x(O) = 0 = x(l) 4. Find the Green's function for the problem
x" - 9x = y x(O) = 0 = x(l) 5. Prove that if u and v are in el!a, bJ, then the function
{
u(s)v(t) a � s � t � b g(." t) =v(s)u(t) a � t � s � b has properties (i) and (ii) mentioned in Theorem 3.
6. (Continuation) Show that if
pv2(ulv)' = - 1 then 9 (in Problem 5 ) will have property (v) in Theorem 3.
8. Prove that if p = I in the Sturm-Liouville problem and fil l = fi12 = 021 = 022 = 0 then A is Hermitian.
9. Prove that the function 9 in Equation (4) is symmetric: g(s,t) = g(t, s). 10. Let Ax = (px')' - qx. Prove Lagrange's identity:
xAy - yAx = [p(xy' - yx')J' 1 1 . (Continuation) Prove G reen's formula:
lb
(xAy - yAx) = p(xy' - yx')I�12. Show that the Wronskian for any two solutions of the equation (px' )' - qx = 0 is a scalar multiple of l ip, and so is either identically zero or never zero. ( Here we assume p(t) '" 0 for a � t � b.)
13. Find the eigenvalues and eigenfunctions for the operator A defined by the equation Ax = -x" +2x' -x. Assume that the domain of A is the set of twice continuously differentiable functions on [0, IJ that have boundary values x(O) = x(l) = o.
Chapter 3
Calculus in Banach Spaces
3.1 The Frechet Derivative
1 153.2
The Chain Rule and Mean Value Theorems
121 3.3Newton's Method
1253.4
Implicit Function Theorems
1353.5
Extremum Problems and Lagrange Multipliers
14
53.6
The Calculus of Variations
1523.1 The Frechet Derivative
In this chapter we develop the theory of the derivative for mappings between Banach spaces. Partial derivatives, Jacobians, and gradients are all examples of the general theory, as are the Gateaux and Frechet differentials. Kantorovich's theorem on Newton's method is proved. Following that there is a section on implicit function theorems in a general setting. Such theorems can often be used to prove the existence of solutions to integral equations and other similar problems. Another section, devoted to extremum problems, illustrates how the methods of calculus (in Banach spaces) can lead to solutions. A section on the
"calculus of variations" closes the chapter.
The first step is to transfer, with as little disruption as possible, the ele
mentary ideas of calculus to the more general setting of a normed linear space.
Definition.
Let /
:D
-+Y be a mapping from an open set D in a normed linear space
Xinto a normed linear space Y. Let x E D. If there is a bounded linear map A
: X -+Y such that
( 1 )
. II/(x
+h) - /(x)
-Ahll
l� I/h/l
=0
then / is said to be
Frechet differentiableat x, or simply
differentiableat x. Furthermore, A is called the (Frechet)
derivativeof f at x.
1 1 5
1 1 6
Chapter
3Calculus in Banach Spaces
Them'em 1. I
f J is differentiable at x, then the mapping A in the
definition is uniquely defined. (It depends on x as well as J.)
Proof. Suppose that
AI
andA2
are two linear maps having the required property, expressed in Equation ( 1 ) . Then to each
e > 0
there corresponds a6 > 0
such that
IIJ(x + h) - J(x) - Aihll
<ellhll (i = 1, 2)
whenever
Ilhll
<6.
By the triangle inequality,II A
Jh - A2hll
<2ellhll
wheneverII hl!
<6.
SinceAI - A2
is homogeneous, the preceding inequality is true for allh.
HenceIIAI - A211
�2e.
Sincee
was arbitrary,HAl - A211 = o.
•Notation. If
J
is different.iable atx,
its derivative, denoted byA
in thedefinition, will usually be denoted by
1'(x).
Notice that with this notation1'(x)
E C(X.Y).
This is NOT the same as sayingl'
E C(X,Y).
It will be necessary to dist.inguish carefully betweenl'
and1'(x).
Theorem 2. If
J is bounded in a neighborhood of x and if a linear map A has the property in Equation (1), then A is a bounded linear map; in other words, A is the Fn§chet derivative of J at x.
Proof. Choose
6 > 0
so that wheneverIlhll
�6
we will haveIIJ(x + h)1I
�M
andIIJ(x + h) - J(x) - Ahll
�Ilhl!
Then for
IIhll
�6
we haveIIAhl1
�2M + IIhll
�2M +6.
ForlIull
�1, 116ull
�6,
whence
IIA(6u)11
�2M + 6.
ThusIIAII
�(2M + 6)/6.
•Example 1. Let X
= Y
= JR. LetJ
be a function whose derivative (in the elementary sense) atx
isA.
Then the Frechet derivative ofJ
atx
is the linear maph
>--rAh,
becauselim
IJ(x + h) - J(x) - Ahl
= limI J(X + h) - J(x)
_A I
= 0h--+O
Ihl
h--+Oh
Thus, the terminology adopted here is slightly different from the elementary
notion of derivative in calculus. •
Example 2. Let X and
Y
be arbitrary normed linear spaces. DefineJ
: X -+Y
byJ(x)
= Yo, where Yo is a fixed element ofY.
(Naturally, such anJ
is calleda constant map.) Then
1'(x)
= O. (This is the0
element of C(X,Y).)
• Example 3. LetJ
be a bounded linear map of X intoY.
Then1'(x)
=J.
Indeed,
IIJ (x + h) - J(x) - J(h)1I =
O. Observe that the equationl'
=J
isnot
t.rue. This illustrates again the importance of distinguishing carefully between
1'(x)
and1'.
•Section 3. 1 The Frechet Derivative
Theorem 3.
If f is differentiable at x, then it is continuous at x.
1 17
Proof.
Let A = f'(x). Then A E
.C(X,Y). Given c > 0, select 15 > 0 so that
15 <
e/(l + IIAII) and so that the following implication is valid:
IIhll
<0
�IIf(x + h) - f(x) - Ahll/"hll
<1
Then for Ilh"
< 15,we have by the triangle inequality
Ilf(x + h) - f(x)"
�IIf(x + h) - f(x) - Ahll + IIAhl/
< IIhll + I/Ahll
�Ilhll + IIAll llh/!
< 15
( 1 + IIAID
< e •Example 4.
Let
X= Y
=e[O, 1 J and let </>
: IR --+ IRbe continuously differentiable. Define element of e[o, 1J. What is f :
X --+f'(x)? Y by the equation To answer this, we undertake a calculation f(x)
=</>
0x, where x is any
of f(x + h) - f(x), using the classical mean value theorem:
[J(x + h) - f(x)1(t) = </>(x(t) + h(t)) - </>(x(t))
=</>' ( x(t) + O(t)h(t) ) h(t)
where 0
<O(t)
<1. This suggests that we define A by Ah
=(</>' ox)h
With this definition, we shall have at every point t,
[f(x + h) - f(x) - Ah](t) = </>' ( x(t) + O(t)h(t) ) h(t) - </>'(x(t))h(t)
Hence, upon taking the supremum norm, we have
I/f(x + h) - f(x) - Ahll
�II</>'
0(x + Oh) - </>'
0xl/ l1hl/
By comparing this to Equation ( 1 ) and invoking the continuity of </>', we see that
A is indeed the derivative of f at x. Hence f'(x) = </>'
0x.
•Theorem 4.
Let f
: IRn --+ IR.If each of the partial derivatives D;/
(=
af /
ax
;) exists in a neighborhood ofx and is continuous at x
then f'(x) exists, and a formula for it is
J'(x)h
=L
i=1 nD
;/ (x)
·hi
Speaking loosely, we say that the Frechet derivative of f is given by the gradient of f.
Proof.