will include the limit $\tau_0$ between them. This observation yields a convenient stopping criterion.
Example. The exact value of the integral

$\int_0^{\pi/2} 5\,(e^{\pi} - 2)^{-1}\, e^{2x} \cos x \, dx$

is 1. Using the polynomial extrapolation method of Romberg, and carrying 12 digits, we obtain for $T_{ik}$, $U_{ik}$, $0 \le i \le 6$, $0 \le k \le 3$, the values given in the following table.
i   T_{i0}              T_{i1}              T_{i2}              T_{i3}
0   0.185 755 068 924
1   0.724 727 335 089   0.904 384 757 145
2   0.925 565 035 158   0.992 510 935 182   0.998 386 013 717
3   0.981 021 630 069   0.999 507 161 706   0.999 973 576 808   0.999 998 776 222
4   0.995 232 017 388   0.999 968 813 161   0.999 999 589 925   1.000 000 002 83
5   0.998 806 537 974   0.999 998 044 836   0.999 999 993 614   1.000 000 000 02
6   0.999 701 542 775   0.999 999 877 709   0.999 999 999 901   1.000 000 000 00
i   U_{i0}              U_{i1}              U_{i2}              U_{i3}
0   1.263 699 601 26
1   1.126 402 735 23    1.080 637 113 22
2   1.036 478 224 98    1.006 503 388 23    1.001 561 139 90
3   1.009 442 404 71    1.000 430 464 62    1.000 025 603 04    1.000 001 229 44
4   1.002 381 058 56    1.000 027 276 51    1.000 000 397 30    0.999 999 997 211
5   1.000 596 547 58    1.000 001 710 58    1.000 000 006 19    0.999 999 999 978
6   1.000 149 217 14    1.000 000 107 00    1.000 000 000 09    1.000 000 000 00
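The $T_{ik}$ tableau can be reproduced with a few lines of code. The following Python sketch (ours, not from the text) assumes the standard Romberg recurrence $T_{i,k} = T_{i,k-1} + (T_{i,k-1} - T_{i-1,k-1})/(4^k - 1)$ on trapezoidal sums over $2^i$ subintervals; the midpoint-based $U_{ik}$ tableau can be generated analogously.

```python
import math

def f(x):
    # integrand of the example: 5 (e^pi - 2)^(-1) e^(2x) cos x
    return 5.0 / (math.exp(math.pi) - 2.0) * math.exp(2.0 * x) * math.cos(x)

def romberg(f, a, b, m):
    """Triangular Romberg tableau T[i][k], 0 <= k <= i <= m."""
    T = [[0.0] * (m + 1) for _ in range(m + 1)]
    T[0][0] = 0.5 * (b - a) * (f(a) + f(b))
    for i in range(1, m + 1):
        n = 2 ** i
        h = (b - a) / n
        # refined trapezoidal sum: reuse T[i-1][0], add only the new midpoints
        T[i][0] = 0.5 * T[i - 1][0] + h * sum(
            f(a + (2 * j - 1) * h) for j in range(1, n // 2 + 1))
        for k in range(1, i + 1):
            T[i][k] = T[i][k - 1] + (T[i][k - 1] - T[i - 1][k - 1]) / (4 ** k - 1)
    return T

T = romberg(f, 0.0, 0.5 * math.pi, 6)
for i in range(7):
    print(i, ["%.12f" % T[i][k] for k in range(min(i, 3) + 1)])
```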
The conditions (3.6.1) are met, for instance, if $\omega(x)$ is positive and continuous on a finite interval $[a, b]$. Condition (3.6.1c) is equivalent to $\int_a^b \omega(x)\,dx > 0$ [see Exercise 14].
We will again examine integration rules of the type
(3.6.2) $\tilde I(f) := \sum_{i=1}^{n} w_i f(x_i).$
The Newton-Cotes formulas [see Section 3.1] are of this form, but there the abscissas $x_i$ were required to form a uniform partition of the interval $[a, b]$.
In this section, we relax this restriction and try to choose the $x_i$ as well as the $w_i$ so as to maximize the order of the integration method, that is, to maximize the degree for which all polynomials are exactly integrated by (3.6.2). We will see that this is possible and leads to a class of well-defined so-called Gaussian integration rules or Gaussian quadrature formulas [see for instance Stroud and Secrest (1966)]. These Gaussian integration rules will be shown to be unique and of order $2n - 1$; also $w_i > 0$ and $a < x_i < b$ for $i = 1, \ldots, n$. In order to establish these results and to determine the exact form of the Gaussian integration rules, we need some basic facts about orthogonal polynomials. We introduce the notation
$\bar\Pi_j := \{\, p \mid p(x) = x^j + a_1 x^{j-1} + \cdots + a_j \,\}$

for the set of normed real polynomials of degree $j$, and, as before, we denote by

$\Pi_j := \{\, p \mid \text{degree}(p) \le j \,\}$

the linear space of all real polynomials whose degree does not exceed $j$. In addition, we define the scalar product
$(f, g) := \int_a^b \omega(x)\, f(x)\, g(x)\, dx$

on the linear space $L^2[a, b]$ of all functions for which the integral

$(f, f) = \int_a^b \omega(x)\, f(x)^2\, dx$

is well defined and finite. The functions $f, g \in L^2[a, b]$ are called orthogonal if $(f, g) = 0$. The following theorem establishes the existence of a sequence of mutually orthogonal polynomials, the system of orthogonal polynomials associated with the weight function $\omega(x)$.
(3.6.3) Theorem. There exist polynomials $p_j \in \bar\Pi_j$, $j = 0, 1, 2, \ldots$, such that

(3.6.4) $(p_i, p_k) = 0$ for $i \neq k$.

These polynomials are uniquely defined by the recursions

(3.6.5a) $p_0(x) \equiv 1$,
(3.6.5b) $p_{i+1}(x) \equiv (x - \delta_{i+1})\, p_i(x) - \gamma_{i+1}^2\, p_{i-1}(x)$ for $i \ge 0$,

where $p_{-1}(x) :\equiv 0$ and⁷

(3.6.6a) $\delta_{i+1} := (x p_i, p_i)/(p_i, p_i)$ for $i \ge 0$,
(3.6.6b) $\gamma_{i+1}^2 := \begin{cases} 1 & \text{for } i = 0, \\ (p_i, p_i)/(p_{i-1}, p_{i-1}) & \text{for } i \ge 1. \end{cases}$
Proof. The polynomials can be constructed recursively by a technique known as Gram-Schmidt orthogonalization. Clearly $p_0(x) \equiv 1$. Suppose then, as an induction hypothesis, that all orthogonal polynomials with the above properties have been constructed for $j \le i$ and have been shown to be unique. We proceed to show that there exists a unique polynomial $p_{i+1} \in \bar\Pi_{i+1}$ with

(3.6.7) $(p_{i+1}, p_j) = 0$ for $j \le i$,

and that this polynomial satisfies (3.6.5b). Any polynomial $p_{i+1} \in \bar\Pi_{i+1}$ can be written uniquely in the form

$p_{i+1}(x) \equiv (x - \delta_{i+1})\, p_i(x) + c_{i-1} p_{i-1}(x) + c_{i-2} p_{i-2}(x) + \cdots + c_0 p_0(x),$

because its leading coefficient and those of the polynomials $p_j$, $j \le i$, have value 1. Since $(p_j, p_k) = 0$ for all $j, k \le i$ with $j \neq k$, (3.6.7) holds if and only if

(3.6.8a) $(p_{i+1}, p_i) = (x p_i, p_i) - \delta_{i+1}(p_i, p_i) = 0$,
(3.6.8b) $(p_{i+1}, p_{j-1}) = (x p_{j-1}, p_i) + c_{j-1}(p_{j-1}, p_{j-1}) = 0$ for $j \le i$.
The condition (3.6.1c), with $p_i^2$ and $p_{j-1}^2$, respectively, in the role of the nonnegative polynomial, rules out $(p_i, p_i) = 0$ and $(p_{j-1}, p_{j-1}) = 0$ for $1 \le j \le i$. Therefore, the equations (3.6.8) can be solved uniquely; (3.6.8a) gives (3.6.6a). By the induction hypothesis,

$p_j(x) \equiv (x - \delta_j)\, p_{j-1}(x) - \gamma_j^2\, p_{j-2}(x)$

for $j \le i$. From this, by solving for $x\,p_{j-1}(x)$, we have $(x p_{j-1}, p_i) = (p_j, p_i)$ for $j \le i$, so that, in view of (3.6.8),

$c_{j-1} = -\dfrac{(p_j, p_i)}{(p_{j-1}, p_{j-1})} = \begin{cases} -\gamma_{i+1}^2 & \text{for } j = i, \\ 0 & \text{for } j < i. \end{cases}$

Thus (3.6.5b) has been established for $i + 1$.
⁷ $x\,p_i$ denotes the polynomial with values $x\,p_i(x)$ for all $x$.
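The proof is constructive: once the scalar product can be evaluated, the recursion (3.6.5)/(3.6.6) can be carried out numerically. The following Python sketch (our illustration; it evaluates $(f, g)$ by adaptive quadrature, which is adequate only for small $i$ because of rounding) builds the coefficients $\delta_{i+1}$, $\gamma_{i+1}^2$ and the monic polynomials $p_i$ for a given weight function $\omega$.

```python
import numpy as np
from scipy.integrate import quad

def sp(f, g, omega, a, b):
    # scalar product (f, g) := integral_a^b omega(x) f(x) g(x) dx
    return quad(lambda x: omega(x) * f(x) * g(x), a, b)[0]

def recursion_coefficients(omega, a, b, n):
    """delta_{i+1} and gamma_{i+1}^2 of (3.6.6) for i = 0, ..., n-1."""
    x_poly = np.polynomial.Polynomial([0.0, 1.0])   # the polynomial x
    p_prev = np.polynomial.Polynomial([0.0])        # p_{-1} := 0
    p_cur = np.polynomial.Polynomial([1.0])         # p_0  := 1
    deltas, gamma2s = [], []
    for i in range(n):
        norm2 = sp(p_cur, p_cur, omega, a, b)
        delta = sp(x_poly * p_cur, p_cur, omega, a, b) / norm2   # (3.6.6a)
        gamma2 = 1.0 if i == 0 else norm2 / sp(p_prev, p_prev, omega, a, b)
        deltas.append(delta)
        gamma2s.append(gamma2)
        # (3.6.5b): p_{i+1} = (x - delta_{i+1}) p_i - gamma_{i+1}^2 p_{i-1}
        p_prev, p_cur = p_cur, (x_poly - delta) * p_cur - gamma2 * p_prev
    return deltas, gamma2s

# omega == 1 on [-1, 1] yields the monic Legendre polynomials:
print(recursion_coefficients(lambda x: 1.0, -1.0, 1.0, 4))
# the deltas are 0 by symmetry; the gamma2s come out as 1, 1/3, 4/15, 9/35
```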
Every polynomial $p \in \Pi_k$ is clearly representable as a linear combination of the orthogonal polynomials $p_i$, $i \le k$. We thus have:

(3.6.9) Corollary. $(p, p_n) = 0$ for all $p \in \Pi_{n-1}$.
(3.6.10) Theorem. The roots $x_i$, $i = 1, \ldots, n$, of $p_n$ are real and simple. They all lie in the open interval $(a, b)$.
Proof. Consider those roots of $p_n$ which lie in $(a, b)$ and which are of odd multiplicity, that is, at which $p_n$ changes sign:

$a < x_1 < \cdots < x_l < b.$

The polynomial

$q(x) := \prod_{j=1}^{l} (x - x_j) \in \bar\Pi_l$

is such that the polynomial $p_n(x)\, q(x)$ does not change sign in $[a, b]$, so that

$(p_n, q) = \int_a^b \omega(x)\, p_n(x)\, q(x)\, dx \neq 0$

by (3.6.1c). Thus degree$(q) = l = n$ must hold, as otherwise $(p_n, q) = 0$ by Corollary (3.6.9).
Next we have the
(3.6.11) Theorem. The $n \times n$ matrix

$A := \begin{pmatrix} p_0(t_1) & \ldots & p_0(t_n) \\ \vdots & & \vdots \\ p_{n-1}(t_1) & \ldots & p_{n-1}(t_n) \end{pmatrix}$

is nonsingular for mutually distinct arguments $t_i$, $i = 1, \ldots, n$.
Proof. Assume $A$ is singular. Then there is a vector $c^T = (c_0, \ldots, c_{n-1})$, $c \neq 0$, with $c^T A = 0$. The polynomial

$q(x) := \sum_{i=0}^{n-1} c_i p_i(x),$

with degree$(q) < n$, then has the $n$ distinct roots $t_1, \ldots, t_n$ and must vanish identically. Since the polynomials $p_i(\cdot)$ are linearly independent, $q(x) \equiv 0$ implies the contradiction $c = 0$.
Theorem (3.6.11) shows that the interpolation problem of finding a function of the form

$p(x) \equiv \sum_{i=0}^{n-1} c_i p_i(x)$

with $p(t_i) = f_i$, $i = 1, 2, \ldots, n$, is always solvable. The condition of the theorem is known as the Haar condition. Any sequence of functions $p_0, p_1, \ldots$ which satisfies the Haar condition is said to form a Chebyshev system. Theorem (3.6.11) states that sequences of orthogonal polynomials are Chebyshev systems.
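For a concrete case the Haar condition is easy to check numerically. A small sketch (ours; it uses the classical Legendre polynomials, which differ from the $p_i$ of (3.6.18) below only by constant factors and hence form the same Chebyshev system):

```python
import numpy as np
from numpy.polynomial import legendre

n = 5
t = np.array([-0.9, -0.3, 0.1, 0.4, 0.8])    # arbitrary mutually distinct t_i
# row k holds the values of the k-th Legendre polynomial at t_1, ..., t_n
A = np.array([legendre.legval(t, [0.0] * k + [1.0]) for k in range(n)])
print(np.linalg.det(A))    # nonzero: A is nonsingular, the Haar condition holds
```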
Now we arrive at the main result of this section.
(3.6.12) Theorem.

(a) Let $x_1, \ldots, x_n$ be the roots of the $n$th orthogonal polynomial $p_n(x)$, and let $w_1, \ldots, w_n$ be the solution of the (nonsingular) system of equations

(3.6.13) $\sum_{i=1}^{n} p_k(x_i)\, w_i = \begin{cases} (p_0, p_0) & \text{if } k = 0, \\ 0 & \text{if } k = 1, 2, \ldots, n-1. \end{cases}$

Then $w_i > 0$ for $i = 1, 2, \ldots, n$, and

(3.6.14) $\int_a^b \omega(x)\, p(x)\, dx = \sum_{i=1}^{n} w_i\, p(x_i)$

holds for all polynomials $p \in \Pi_{2n-1}$. The positive numbers $w_i$ are called "weights".

(b) Conversely, if the numbers $w_i$, $x_i$, $i = 1, \ldots, n$, are such that (3.6.14) holds for all $p \in \Pi_{2n-1}$, then the $x_i$ are the roots of $p_n$ and the weights $w_i$ satisfy (3.6.13).

(c) It is not possible to find numbers $x_i$, $w_i$, $i = 1, \ldots, n$, such that (3.6.14) holds for all polynomials $p \in \Pi_{2n}$.
Proof. By Theorem (3.6.10), the roots $x_i$, $i = 1, \ldots, n$, of $p_n$ are real and mutually distinct numbers in the open interval $(a, b)$. The matrix

(3.6.15) $A := \begin{pmatrix} p_0(x_1) & \ldots & p_0(x_n) \\ \vdots & & \vdots \\ p_{n-1}(x_1) & \ldots & p_{n-1}(x_n) \end{pmatrix}$

is nonsingular by Theorem (3.6.11), so that the system of equations (3.6.13) has a unique solution.
Consider an arbitrary polynomial $p \in \Pi_{2n-1}$. It can be written in the form

(3.6.16) $p(x) \equiv p_n(x)\, q(x) + r(x),$

where $q$, $r$ are polynomials in $\Pi_{n-1}$, which we can express as linear combinations of orthogonal polynomials

$q(x) \equiv \sum_{k=0}^{n-1} \alpha_k p_k(x), \qquad r(x) \equiv \sum_{k=0}^{n-1} \beta_k p_k(x).$

Since $p_0(x) \equiv 1$, it follows from (3.6.16) and Corollary (3.6.9) that

$\int_a^b \omega(x)\, p(x)\, dx = (p_n, q) + (r, p_0) = \beta_0 (p_0, p_0).$

On the other hand, by (3.6.16) [since $p_n(x_i) = 0$] and by (3.6.13),

$\sum_{i=1}^{n} w_i\, p(x_i) = \sum_{i=1}^{n} w_i\, r(x_i) = \sum_{k=0}^{n-1} \beta_k \sum_{i=1}^{n} w_i\, p_k(x_i) = \beta_0 (p_0, p_0).$

Thus (3.6.14) is satisfied.
We observe that

(3.6.17) If $w_i$, $x_i$, $i = 1, \ldots, n$, are such that (3.6.14) holds for all polynomials $p \in \Pi_{2n-1}$, then $w_i > 0$ for $i = 1, \ldots, n$.

This is readily verified by applying (3.6.14) to the polynomials

$\bar p_j(x) := \prod_{h=1,\, h \neq j}^{n} (x - x_h)^2 \in \Pi_{2n-2}, \qquad j = 1, \ldots, n,$

and noting that

$0 < \int_a^b \omega(x)\, \bar p_j(x)\, dx = \sum_{i=1}^{n} w_i\, \bar p_j(x_i) = w_j \prod_{h=1,\, h \neq j}^{n} (x_j - x_h)^2$

by (3.6.1c). This completes the proof of (3.6.12a).
We prove (3.6.12c) next. Assume that $w_i$, $x_i$, $i = 1, \ldots, n$, are such that (3.6.14) even holds for all polynomials $p \in \Pi_{2n}$. Then

$\bar p(x) :\equiv \prod_{j=1}^{n} (x - x_j)^2 \in \Pi_{2n}$

contradicts this claim, since by (3.6.1c)

$0 < \int_a^b \omega(x)\, \bar p(x)\, dx = \sum_{i=1}^{n} w_i\, \bar p(x_i) = 0.$

This proves (3.6.12c).
To prove (3.6.12b), suppose that $w_i$, $x_i$, $i = 1, \ldots, n$, are such that (3.6.14) holds for all $p \in \Pi_{2n-1}$. Note that the abscissas $x_i$ must be mutually distinct, since otherwise we could formulate the same integration rule using only $n - 1$ of the abscissas $x_i$, contradicting (3.6.12c).
Applying (3.6.14) to the orthogonal polynomials $p = p_k$, $k = 0, \ldots, n-1$, themselves, we find

$\sum_{i=1}^{n} w_i\, p_k(x_i) = \int_a^b \omega(x)\, p_k(x)\, dx = (p_k, p_0) = \begin{cases} (p_0, p_0) & \text{if } k = 0, \\ 0 & \text{if } 1 \le k \le n-1. \end{cases}$

In other words, the weights $w_i$ must satisfy (3.6.13).
Applying (3.6.14) to $p(x) :\equiv p_k(x)\, p_n(x)$, $k = 0, \ldots, n-1$, gives by (3.6.9)

$0 = (p_k, p_n) = \sum_{i=1}^{n} w_i\, p_n(x_i)\, p_k(x_i), \qquad k = 0, \ldots, n-1.$

In other words, the vector $c := (w_1 p_n(x_1), \ldots, w_n p_n(x_n))^T$ solves the homogeneous system of equations $Ac = 0$ with $A$ the matrix (3.6.15). Since the abscissas $x_i$, $i = 1, \ldots, n$, are mutually distinct, the matrix $A$ is nonsingular by Theorem (3.6.11). Therefore $c = 0$ and $w_i p_n(x_i) = 0$ for $i = 1, \ldots, n$. Since $w_i > 0$ by (3.6.17), we have $p_n(x_i) = 0$, $i = 1, \ldots, n$. This completes the proof of (3.6.12b).
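Parts (a) and (c) of the theorem can be observed numerically. The sketch below (ours) uses NumPy's built-in Gauss-Legendre rule, which is the case $\omega(x) \equiv 1$, $[a, b] = [-1, 1]$ treated next: an $n$-point rule reproduces $\int_{-1}^{1} x^k\,dx$ exactly for $k \le 2n - 1$, and a genuine error appears first at $k = 2n$.

```python
import numpy as np

n = 3
x, w = np.polynomial.legendre.leggauss(n)     # abscissas and weights
for k in range(2 * n + 1):
    approx = np.sum(w * x ** k)
    exact = 0.0 if k % 2 else 2.0 / (k + 1)   # integral of x^k over [-1, 1]
    print(k, abs(approx - exact))
# the error is at rounding level for k = 0, ..., 5 and jumps to about
# 0.046 for k = 6 = 2n, in agreement with (3.6.12c)
```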
For the most common weight function $\omega(x) :\equiv 1$ and the interval $[-1, 1]$, the results of Theorem (3.6.12) are due to Gauss. The corresponding orthogonal polynomials are [see Exercise 16]

(3.6.18) $p_k(x) := \dfrac{k!}{(2k)!}\, \dfrac{d^k}{dx^k}(x^2 - 1)^k, \qquad k = 0, 1, \ldots.$

Indeed, $p_k \in \bar\Pi_k$, and integration by parts establishes $(p_i, p_k) = 0$ for $i \neq k$. Up to a factor, the polynomials (3.6.18) are the Legendre polynomials. In the following table we give some values for $w_i$, $x_i$ in this important special case. For further values see the National Bureau of Standards Handbook of Mathematical Functions [Abramowitz and Stegun (1964)].
n    w_i                                   x_i
1    w_1 = 2                               x_1 = 0
2    w_1 = w_2 = 1                         x_2 = -x_1 = 0.577 350 2692...
3    w_1 = w_3 = 5/9                       x_3 = -x_1 = 0.774 596 6692...
     w_2 = 8/9                             x_2 = 0
4    w_1 = w_4 = 0.347 854 8451...         x_4 = -x_1 = 0.861 136 3116...
     w_2 = w_3 = 0.652 145 1549...         x_3 = -x_2 = 0.339 981 0436...
5    w_1 = w_5 = 0.236 926 8851...         x_5 = -x_1 = 0.906 179 8459...
     w_2 = w_4 = 0.478 628 6705...         x_4 = -x_2 = 0.538 469 3101...
     w_3 = 128/225 = 0.568 888 8889...     x_3 = 0
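These values are also produced by standard library routines; for instance (our check, using NumPy's implementation of precisely this $\omega \equiv 1$, $[-1, 1]$ case):

```python
import numpy as np

x, w = np.polynomial.legendre.leggauss(5)
print(x)   # -0.906 179..., -0.538 469..., 0, 0.538 469..., 0.906 179...
print(w)   # 0.236 926..., 0.478 628..., 0.568 888..., 0.478 628..., 0.236 926...
```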
Other important cases which lead to Gaussian integration rules are listed in the following table:

[a, b]          ω(x)                Orthogonal polynomials
[-1, 1]         (1 - x^2)^{-1/2}    T_n(x), Chebyshev polynomials
[0, ∞)          e^{-x}              L_n(x), Laguerre polynomials
(-∞, ∞)         e^{-x^2}            H_n(x), Hermite polynomials
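Library routines exist for these cases as well. A brief sketch (ours; `laggauss` and `hermgauss` are NumPy's Gauss-Laguerre and Gauss-Hermite rules) integrates $f(x) = x^2$ against the respective weights:

```python
import numpy as np

# Gauss-Laguerre: integral_0^inf e^(-x) f(x) dx ~= sum w_i f(x_i)
x, w = np.polynomial.laguerre.laggauss(5)
print(np.sum(w * x ** 2))    # 2.0, the exact value Gamma(3)

# Gauss-Hermite: integral_(-inf)^(inf) e^(-x^2) f(x) dx ~= sum w_i f(x_i)
x, w = np.polynomial.hermite.hermgauss(5)
print(np.sum(w * x ** 2))    # 0.8862... = sqrt(pi)/2
```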
We have characterized the quantities $w_i$, $x_i$ which enter the Gaussian integration rules for given weight functions, but we have yet to discuss methods for their actual calculation. We will examine this problem under the assumption that the coefficients $\delta_i$, $\gamma_i$ of the recursion (3.6.5) are given. Golub and Welsch (1969) and Gautschi (1968, 1970) discuss the much harder problem of finding the coefficients $\delta_i$, $\gamma_i$.
The theory of orthogonal polynomials ties in with the theory of real tridiagonal matrices

(3.6.19) $J_n = \begin{pmatrix} \delta_1 & \gamma_2 & & \\ \gamma_2 & \delta_2 & \ddots & \\ & \ddots & \ddots & \gamma_n \\ & & \gamma_n & \delta_n \end{pmatrix}$

and their principal submatrices

$J_j := \begin{pmatrix} \delta_1 & \gamma_2 & & \\ \gamma_2 & \delta_2 & \ddots & \\ & \ddots & \ddots & \gamma_j \\ & & \gamma_j & \delta_j \end{pmatrix}.$
Such matrices will be studied in Sections 5.5, 5.6, and 6.6.1. In Section 5.5 it will be seen that the characteristic polynomials $p_j(x) = \det(xI - J_j)$ of the $J_j$ satisfy the recursions (3.6.5) with the matrix elements $\delta_j$, $\gamma_j$ as the coefficients. Therefore, $p_n$ is the characteristic polynomial of the tridiagonal matrix $J_n$. Consequently we have
(3.6.20) Theorem. The roots $x_i$, $i = 1, \ldots, n$, of the $n$th orthogonal polynomial $p_n$ are the eigenvalues of the tridiagonal matrix $J_n$ in (3.6.19).
The bisection method of Section 5.6, the QR method of Section 6.6.6, and others are available to calculate the eigenvalues of these tridiagonal matrices.
With respect to the weights $w_i$, we have [Szegő (1959), Golub and Welsch (1969)]:

(3.6.21) Theorem. Let $v^{(i)} := (v_1^{(i)}, \ldots, v_n^{(i)})^T$ be an eigenvector of $J_n$ (3.6.19) for the eigenvalue $x_i$, $J_n v^{(i)} = x_i v^{(i)}$. Suppose $v^{(i)}$ is scaled in such a way that

$v^{(i)T} v^{(i)} = (p_0, p_0) = \int_a^b \omega(x)\, dx.$

Then the weights are given by

$w_i = (v_1^{(i)})^2, \qquad i = 1, \ldots, n.$
Proof. We verify that the vector

$\tilde v^{(i)} = (\rho_0 p_0(x_i),\, \rho_1 p_1(x_i),\, \ldots,\, \rho_{n-1} p_{n-1}(x_i))^T$, where

$\rho_j := 1/(\gamma_1 \gamma_2 \cdots \gamma_{j+1})$ for $j = 0, 1, \ldots, n-1$,

is an eigenvector of $J_n$ for the eigenvalue $x_i$: $J_n \tilde v^{(i)} = x_i \tilde v^{(i)}$. By (3.6.5), for any $x$,

$\delta_1 \rho_0 p_0(x) + \gamma_2 \rho_1 p_1(x) = \delta_1 p_0(x) + p_1(x) = x\, p_0(x) = x\, \rho_0 p_0(x).$

For $j = 2, \ldots, n-1$, similarly

$\gamma_j \rho_{j-2} p_{j-2}(x) + \delta_j \rho_{j-1} p_{j-1}(x) + \gamma_{j+1} \rho_j p_j(x) = \rho_{j-1}\big[\gamma_j^2 p_{j-2}(x) + \delta_j p_{j-1}(x) + p_j(x)\big] = x\, \rho_{j-1} p_{j-1}(x),$

and finally

$\gamma_n \rho_{n-2} p_{n-2}(x) + \delta_n \rho_{n-1} p_{n-1}(x) = \rho_{n-1}\big[\gamma_n^2 p_{n-2}(x) + \delta_n p_{n-1}(x)\big] = x\, \rho_{n-1} p_{n-1}(x) - \rho_{n-1} p_n(x),$

so that

$\gamma_n \rho_{n-2} p_{n-2}(x_i) + \delta_n \rho_{n-1} p_{n-1}(x_i) = x_i\, \rho_{n-1} p_{n-1}(x_i)$

holds, provided $p_n(x_i) = 0$.
Since $\rho_j \neq 0$, $j = 0, 1, \ldots, n-1$, the system of equations (3.6.13) for the $w_i$ is equivalent to

(3.6.22) $(\tilde v^{(1)}, \ldots, \tilde v^{(n)})\, w = (p_0, p_0)\, e_1$ with $w = (w_1, \ldots, w_n)^T$, $e_1 = (1, 0, \ldots, 0)^T$.

Eigenvectors of symmetric matrices for distinct eigenvalues are orthogonal. Therefore, multiplying (3.6.22) by $\tilde v^{(i)T}$ from the left yields

$(\tilde v^{(i)T} \tilde v^{(i)})\, w_i = (p_0, p_0)\, \tilde v_1^{(i)}.$

Since $\rho_0 = 1$ and $p_0(x) \equiv 1$, we have $\tilde v_1^{(i)} = 1$. Thus

(3.6.23) $(\tilde v^{(i)T} \tilde v^{(i)})\, w_i = (p_0, p_0).$

Using again the fact that $\tilde v_1^{(i)} = 1$, we find $v_1^{(i)} \tilde v^{(i)} = v^{(i)}$, and multiplying (3.6.23) by $(v_1^{(i)})^2$ gives

$(v^{(i)T} v^{(i)})\, w_i = (v_1^{(i)})^2 (p_0, p_0).$

Since $v^{(i)T} v^{(i)} = (p_0, p_0)$ by hypothesis, we obtain $w_i = (v_1^{(i)})^2$.

If the QR method is employed for determining the eigenvalues of $J_n$, then the calculation of the first components $v_1^{(i)}$ of the eigenvectors $v^{(i)}$ is readily included in that algorithm: calculating the abscissas $x_i$ and the weights $w_i$ can be done concurrently [Golub and Welsch (1969)].
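Theorems (3.6.20) and (3.6.21) together constitute the Golub-Welsch approach: build the symmetric tridiagonal matrix $J_n$ from the recursion coefficients, compute its eigenvalues and the first components of its eigenvectors, and read off the $x_i$ and $w_i$. A minimal sketch (ours; it uses a general symmetric eigensolver rather than the specially adapted QR method of Golub and Welsch, and takes the $\delta_i$, $\gamma_i$ as given, as assumed in the text):

```python
import numpy as np

def gauss_rule(delta, gamma, mu0):
    """Abscissas and weights from the recursion coefficients of (3.6.5).

    delta = (delta_1, ..., delta_n), gamma = (gamma_2, ..., gamma_n),
    mu0 = (p_0, p_0) = integral of omega(x) dx.
    """
    J = np.diag(delta) + np.diag(gamma, 1) + np.diag(gamma, -1)
    x, V = np.linalg.eigh(J)        # eigenvalues, orthonormal eigenvectors
    # Theorem (3.6.21): after scaling v^T v = mu0, w_i = (v_1^(i))^2
    return x, mu0 * V[0, :] ** 2

# example: omega == 1 on [-1, 1]; monic Legendre, gamma_{k+1}^2 = k^2/(4k^2 - 1)
n = 5
k = np.arange(1, n)
x, w = gauss_rule(np.zeros(n), np.sqrt(k * k / (4.0 * k * k - 1.0)), 2.0)
print(x)    # the abscissas of the n = 5 row of the table above
print(w)    # the corresponding weights
```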
Finally, we will estimate the error of Gaussian integration:
(3.6.24) Theorem. If $f \in C^{2n}[a, b]$, then

$\int_a^b \omega(x) f(x)\, dx - \sum_{i=1}^{n} w_i f(x_i) = \dfrac{f^{(2n)}(\xi)}{(2n)!}\, (p_n, p_n)$

for some $\xi \in (a, b)$.
Proof. Consider the solution $h \in \Pi_{2n-1}$ of the Hermite interpolation problem [see Section 2.1.5]

$h(x_i) = f(x_i), \quad h'(x_i) = f'(x_i), \qquad i = 1, 2, \ldots, n.$

Since degree$(h) < 2n$,

$\int_a^b \omega(x)\, h(x)\, dx = \sum_{i=1}^{n} w_i\, h(x_i) = \sum_{i=1}^{n} w_i\, f(x_i)$
by Theorem (3.6.12). Therefore, the error term has the integral representation

$\int_a^b \omega(x) f(x)\, dx - \sum_{i=1}^{n} w_i f(x_i) = \int_a^b \omega(x)\,(f(x) - h(x))\, dx.$
By Theorem (2.1.5.9), and since the $x_i$ are the roots of $p_n(x) \in \bar\Pi_n$,

$f(x) - h(x) = \dfrac{f^{(2n)}(\zeta)}{(2n)!}\, (x - x_1)^2 \cdots (x - x_n)^2 = \dfrac{f^{(2n)}(\zeta)}{(2n)!}\, p_n^2(x)$

for some $\zeta = \zeta(x)$ in the interval $I[x, x_1, \ldots, x_n]$ spanned by $x$ and $x_1, \ldots, x_n$. Next,

$\dfrac{f^{(2n)}(\zeta(x))}{(2n)!} = \dfrac{f(x) - h(x)}{p_n^2(x)}$

is continuous on $[a, b]$, so that the mean-value theorem of integral calculus applies:

$\int_a^b \omega(x)(f(x) - h(x))\, dx = \dfrac{1}{(2n)!} \int_a^b \omega(x)\, f^{(2n)}(\zeta(x))\, p_n^2(x)\, dx = \dfrac{f^{(2n)}(\xi)}{(2n)!}\, (p_n, p_n)$

for some $\xi \in (a, b)$.
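For $\omega(x) \equiv 1$ on $[-1, 1]$ the factor $(p_n, p_n)$ can be made explicit: the monic polynomial (3.6.18) equals $2^n (n!)^2/(2n)!$ times the classical Legendre polynomial $P_n$, and $\int_{-1}^{1} P_n^2\,dx = 2/(2n+1)$, so $(p_n, p_n) = 2^{2n+1}(n!)^4 / \big((2n+1)((2n)!)^2\big)$. The sketch below (our illustration) compares the resulting bound from (3.6.24) with the actual error for $f(x) = e^x$, where $|f^{(2n)}(\xi)| \le e$ on $[-1, 1]$:

```python
import math
import numpy as np

def pn_norm2(n):
    # (p_n, p_n) for the monic Legendre polynomial, omega == 1 on [-1, 1]
    return (2.0 ** (2 * n + 1) * math.factorial(n) ** 4
            / ((2 * n + 1) * math.factorial(2 * n) ** 2))

exact = math.e - 1.0 / math.e         # integral of e^x over [-1, 1]
for n in range(1, 6):
    x, w = np.polynomial.legendre.leggauss(n)
    err = abs(exact - np.sum(w * np.exp(x)))
    bound = math.e * pn_norm2(n) / math.factorial(2 * n)
    print(n, err, bound)              # err stays below bound for every n
```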
Comparing the various integration rules (Newton-Cotes formulas, extrapolation methods, Gaussian integration), we find that, computational efforts being equal, Gaussian integration yields the most accurate results.
If only one knew ahead of time how to choose $n$ so as to achieve a specified accuracy for any given integral, then Gaussian integration would be clearly superior to other methods. Unfortunately, it is frequently not possible to use the error formula (3.6.24) for this purpose, because the $2n$th derivative is difficult to estimate. For this reason, one will usually apply Gaussian integration for increasing values of $n$ until successive approximate values agree within the specified accuracy. Since the function values calculated for $n$ cannot be reused for $n + 1$ (at least not in the classical case $\omega(x) \equiv 1$), the apparent advantages of Gaussian integration as compared with extrapolation methods are soon lost. There have been attempts to remedy this situation [e.g., Kronrod (1965)]. A collection of FORTRAN programs is given in Piessens et al. (1983).
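In outline, the strategy of increasing $n$ until two successive approximations agree might look as follows (our sketch; the tolerance and the integrand are only illustrative, and note that all $n$ function values are discarded when passing to $n + 1$):

```python
import numpy as np

def gauss_until_converged(f, tol=1e-12, n_max=50):
    """n-point Gauss-Legendre for n = 1, 2, ... until two successive
    approximations agree to within tol."""
    prev = None
    for n in range(1, n_max + 1):
        x, w = np.polynomial.legendre.leggauss(n)
        val = float(np.sum(w * f(x)))   # none of the old f-values is reusable
        if prev is not None and abs(val - prev) <= tol:
            return val, n
        prev = val
    raise RuntimeError("no agreement reached up to n_max")

print(gauss_until_converged(lambda x: np.exp(x) * np.cos(x)))
```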