Example 1. This example follows up Example 4 of the previous section: given $p > 0$, $q > 0$, $p \gg q$, determine the root $y = -p + \sqrt{p^2+q}$ with smallest absolute value of the quadratic equation
$$y^2 + 2py - q = 0.$$
Input data: $p$, $q$. Result: $y = \varphi(p, q) = -p + \sqrt{p^2+q}$.
The problem was seen to be well conditioned for $p > 0$, $q > 0$. It was also shown that the relative input errors $\varepsilon_p$, $\varepsilon_q$ make the following contribution to the relative error of the result $y = \varphi(p, q)$:
$$\frac{-p}{\sqrt{p^2+q}}\,\varepsilon_p + \frac{q}{2y\sqrt{p^2+q}}\,\varepsilon_q = \frac{-p}{\sqrt{p^2+q}}\,\varepsilon_p + \frac{p+\sqrt{p^2+q}}{2\sqrt{p^2+q}}\,\varepsilon_q.$$
Since
$$\frac{p}{\sqrt{p^2+q}} \le 1, \qquad \frac{p+\sqrt{p^2+q}}{2\sqrt{p^2+q}} \le 1,$$
the inherent error $\Delta^{(0)}y$ satisfies
$$\mathrm{eps} \le \varepsilon_y^{(0)} := \frac{\Delta^{(0)}y}{y} \le 3\,\mathrm{eps}.$$
We will now consider two algorithms for computing $y = \varphi(p, q)$.
Algorithm 1:
$$s := p^2, \quad t := s + q, \quad u := \sqrt{t}, \quad y := -p + u.$$
Obviously, $p \gg q$ causes cancellation when $y := -p + u$ is evaluated, and it must therefore be expected that the roundoff error
$$\Delta u := \varepsilon\cdot\sqrt{t} = \varepsilon\cdot\sqrt{p^2+q},$$
generated during the floating-point calculation of the square root,
$$\mathrm{fl}(\sqrt{t}) = \sqrt{t}\,(1+\varepsilon), \qquad |\varepsilon| \le \mathrm{eps},$$
will be greatly amplified. Indeed, the above error contributes the following term to the relative error of $y$:
$$\frac{1}{y}\,\Delta u = \frac{\sqrt{p^2+q}}{-p+\sqrt{p^2+q}}\cdot\varepsilon = \frac{1}{q}\left(p\sqrt{p^2+q} + p^2 + q\right)\varepsilon = k\cdot\varepsilon.$$
Since $p, q > 0$, the amplification factor $k$ admits the following lower bound:
$$k > \frac{2p^2}{q} > 0,$$
which is large, since $p \gg q$ by hypothesis. Therefore, the proposed algorithm is not numerically stable, because the influence of the rounding of $\sqrt{p^2+q}$ alone exceeds that of the inherent error $\varepsilon_y^{(0)}$ by an order of magnitude.
Algorithm 2:
$$s := p^2, \quad t := s + q, \quad u := \sqrt{t}, \quad v := p + u, \quad y := q/v.$$
This algorithm does not cause cancellation when calculating $v := p + u$. The roundoff error $\Delta u = \varepsilon\sqrt{p^2+q}$, which stems from rounding $\sqrt{p^2+q}$, will be amplified according to the remainder map $\psi(u)$:
$$u \to p + u \to \frac{q}{p+u} =: \psi(u).$$
Thus it contributes the following term to the relative error of $y$:
$$\frac{1}{y}\,\frac{\partial\psi}{\partial u}\,\Delta u = \frac{-q}{y\,(p+u)^2}\cdot\Delta u = \frac{-q\,\sqrt{p^2+q}}{\bigl(-p+\sqrt{p^2+q}\bigr)\bigl(p+\sqrt{p^2+q}\bigr)^2}\cdot\varepsilon = \frac{-\sqrt{p^2+q}}{p+\sqrt{p^2+q}}\cdot\varepsilon = k\cdot\varepsilon.$$
The amplification factor $k$ remains small; indeed, $|k| < 1$, and Algorithm 2 is therefore numerically stable.
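The contrast between the two algorithms is easy to reproduce. The following sketch (our own illustration; the function names are not from the text) evaluates both algorithms in IEEE double precision, with eps ≈ 1.1 · 10^{-16} instead of the 40-bit arithmetic used for the numerical example that follows, for the data of that example:

```python
import math

def root_alg1(p, q):
    # Algorithm 1: s := p^2, t := s + q, u := sqrt(t), y := -p + u
    # cancellation occurs in the final subtraction when p >> q
    s = p * p
    t = s + q
    u = math.sqrt(t)
    return -p + u

def root_alg2(p, q):
    # Algorithm 2: s := p^2, t := s + q, u := sqrt(t), v := p + u, y := q/v
    # no cancellation: all quantities involved are positive
    s = p * p
    t = s + q
    u = math.sqrt(t)
    v = p + u
    return q / v

p, q = 1000.0, 0.018000000081
y_exact = 0.9e-5          # q was chosen so that y = 0.9 * 10^-5
err1 = abs(root_alg1(p, q) - y_exact) / y_exact
err2 = abs(root_alg2(p, q) - y_exact) / y_exact
# err1 is of the order k * eps with k > 2 p^2 / q of about 10^8,
# while err2 stays within a few units of eps
```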
The following numerical results illustrate the difference between Algorithms 1 and 2. They were obtained using floating-point arithmetic with a mantissa of 40 binary places (about 13 decimal places), as will be the case in subsequent numerical examples.
$p = 1000$, $q = 0.018\,000\,000\,081$

Result $y$ according to Algorithm 1: 0.900 030 136 108 · 10^{-5}
Result $y$ according to Algorithm 2: 0.899 999 999 999 · 10^{-5}
Exact value of $y$:                  0.900 000 000 000 · 10^{-5}

Example 2. For given fixed $x$, the value of $\cos kx$ may be computed recursively, using for $m = 1, 2, \ldots, k-1$ the formula
$$\cos(m+1)x = 2\cos x\,\cos mx - \cos(m-1)x.$$
In this case, a trigonometric-function evaluation has to be carried out only once, to find $c = \cos x$. Now let $|x| \ne 0$ be a small number. The calculation of $c$ causes a small roundoff error:
$$\tilde{c} = (1+\varepsilon)\cos x, \qquad |\varepsilon| \le \mathrm{eps}.$$
How does this roundoff error affect the calculation of $\cos kx$?
Now $\cos kx$ can be expressed in terms of $c$: $\cos kx = \cos(k \arccos c) =: f(c)$. Since
$$\frac{df}{dc} = \frac{k\sin kx}{\sin x},$$
the error $\varepsilon\cos x$ of $c$ causes, to first approximation, an absolute error
$$(1.4.1)\qquad \Delta\cos kx \doteq \frac{\varepsilon\cos x}{\sin x}\,k\sin kx = \varepsilon\cdot k\cot x\,\sin kx$$
in $\cos kx$.
On the other hand, the inherent error $\Delta^{(0)}c_k$ (1.3.19) of the result $c_k := \cos kx$ is
$$\Delta^{(0)}c_k = \bigl[k\,|x\sin kx| + |\cos kx|\bigr]\,\mathrm{eps}.$$
Comparing this with (1.4.1) shows that $\Delta\cos kx$ may be considerably larger than $\Delta^{(0)}c_k$ for small $|x|$, since then $|\cot x| \approx 1/|x|$ is large; hence the algorithm is not numerically stable.
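The first-order prediction (1.4.1) can be checked numerically. In the sketch below (our own illustration; the names are not from the text), a known relative error $\varepsilon$ is injected into $c = \cos x$, the recursion is run in IEEE double precision, whose own roundoff is negligible against the injected error, and the observed error in $\cos kx$ is compared with $\varepsilon\,k\cot x\,\sin kx$:

```python
import math

def cos_recursive(c, k):
    # cos(m+1)x = 2 cos x * cos mx - cos(m-1)x, started from cos 0 = 1, cos x = c
    cm_prev, cm = 1.0, c
    for _ in range(k - 1):
        cm_prev, cm = cm, 2.0 * c * cm - cm_prev
    return cm

x, k = 0.001, 1000
eps_inj = 1e-9                          # injected relative error in c
c_tilde = (1.0 + eps_inj) * math.cos(x)

observed = cos_recursive(c_tilde, k) - math.cos(k * x)
predicted = eps_inj * k * (math.cos(x) / math.sin(x)) * math.sin(k * x)  # (1.4.1)
# observed and predicted agree to about three digits; the amplification
# factor k * cot x is about 10^6 here, confirming the instability
```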
Example 3. For given $x$ and a "large" positive integer $k$, the numbers $\cos kx$ and $\sin kx$ are to be computed recursively using
$$\cos mx = \cos x\,\cos(m-1)x - \sin x\,\sin(m-1)x,$$
$$\sin mx = \sin x\,\cos(m-1)x + \cos x\,\sin(m-1)x, \qquad m = 1, 2, \ldots, k.$$
How do small errors $\varepsilon_c\cos x$, $\varepsilon_s\sin x$ in the calculation of $\cos x$, $\sin x$ affect the final results $\cos kx$, $\sin kx$? Abbreviating $c_m := \cos mx$, $s_m := \sin mx$, $c := \cos x$, $s := \sin x$, and putting
$$U := \begin{bmatrix} c & -s \\ s & c \end{bmatrix},$$
we have
$$\begin{bmatrix} c_m \\ s_m \end{bmatrix} = U \begin{bmatrix} c_{m-1} \\ s_{m-1} \end{bmatrix}, \qquad m = 1, \ldots, k.$$
Here $U$ is a unitary matrix, which corresponds to a rotation by the angle $x$. Repeated application of the formula above gives
$$\begin{bmatrix} c_k \\ s_k \end{bmatrix} = U^k \begin{bmatrix} c_0 \\ s_0 \end{bmatrix} = U^k \begin{bmatrix} 1 \\ 0 \end{bmatrix}.$$
Now
$$\frac{\partial U}{\partial c} = \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}, \qquad \frac{\partial U}{\partial s} = \begin{bmatrix} 0 & -1 \\ 1 & 0 \end{bmatrix} =: A,$$
and therefore
$$\frac{\partial}{\partial c}U^k = k\,U^{k-1},$$
$$\frac{\partial}{\partial s}U^k = A U^{k-1} + U A U^{k-2} + \cdots + U^{k-1} A = k\,A U^{k-1},$$
because $A$ commutes with $U$. Since $U$ describes a rotation in $\mathbb{R}^2$ by the angle $x$,
$$\frac{\partial}{\partial c}U^k = k \begin{bmatrix} \cos(k-1)x & -\sin(k-1)x \\ \sin(k-1)x & \cos(k-1)x \end{bmatrix}, \qquad \frac{\partial}{\partial s}U^k = k \begin{bmatrix} -\sin(k-1)x & -\cos(k-1)x \\ \cos(k-1)x & -\sin(k-1)x \end{bmatrix}.$$
The relative errors $\varepsilon_c$, $\varepsilon_s$ of $c = \cos x$, $s = \sin x$ effect the following absolute errors of $\cos kx$, $\sin kx$:
$$(1.4.2)\qquad \begin{bmatrix} \Delta c_k \\ \Delta s_k \end{bmatrix} \doteq \frac{\partial}{\partial c}U^k \begin{bmatrix} 1 \\ 0 \end{bmatrix}\varepsilon_c\cos x + \frac{\partial}{\partial s}U^k \begin{bmatrix} 1 \\ 0 \end{bmatrix}\varepsilon_s\sin x = \varepsilon_c\,k\cos x \begin{bmatrix} \cos(k-1)x \\ \sin(k-1)x \end{bmatrix} + \varepsilon_s\,k\sin x \begin{bmatrix} -\sin(k-1)x \\ \cos(k-1)x \end{bmatrix}.$$
The inherent errors $\Delta^{(0)}c_k$ and $\Delta^{(0)}s_k$ of $c_k = \cos kx$ and $s_k = \sin kx$, respectively, are given by
$$(1.4.3)\qquad \Delta^{(0)}c_k = \bigl[k\,|x\sin kx| + |\cos kx|\bigr]\,\mathrm{eps}, \qquad \Delta^{(0)}s_k = \bigl[k\,|x\cos kx| + |\sin kx|\bigr]\,\mathrm{eps}.$$
Comparison of (1.4.2) and (1.4.3) reveals that for big $k$ and $|kx| \approx 1$ the influence of the roundoff error $\varepsilon_c$ is considerably bigger than that of the inherent errors, while the roundoff error $\varepsilon_s$ is harmless. The algorithm is not numerically stable, albeit numerically more trustworthy than the algorithm of Example 2 as far as the computation of $c_k$ alone is concerned.
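The prediction (1.4.2) can likewise be tested numerically: inject a relative error $\varepsilon_c$ into $c = \cos x$ while leaving $s = \sin x$ exact (so $\varepsilon_s = 0$), and compare the observed errors in $c_k$, $s_k$ with the first column of (1.4.2). The following sketch (our own, in IEEE double precision) does this:

```python
import math

def cos_sin_recursive(c, s, k):
    # (c_m, s_m)^T = U (c_{m-1}, s_{m-1})^T with U = [[c, -s], [s, c]]
    cm, sm = 1.0, 0.0                  # c_0 = 1, s_0 = 0
    for _ in range(k):
        cm, sm = c * cm - s * sm, s * cm + c * sm
    return cm, sm

x, k = 0.001, 1000
eps_c = 1e-9                           # injected relative error in cos x
c_tilde = (1.0 + eps_c) * math.cos(x)
s = math.sin(x)

ck, sk = cos_sin_recursive(c_tilde, s, k)
# first column of (1.4.2), with eps_s = 0:
pred_dck = eps_c * k * math.cos(x) * math.cos((k - 1) * x)
pred_dsk = eps_c * k * math.cos(x) * math.sin((k - 1) * x)
obs_dck = ck - math.cos(k * x)
obs_dsk = sk - math.sin(k * x)
# observed and predicted errors agree to several digits
```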
Example 4. For small $|x|$, the recursive calculation of $c_m = \cos mx$, $s_m = \sin mx$, $m = 1, 2, \ldots$, based on
$$\cos(m+1)x = \cos x\,\cos mx - \sin x\,\sin mx, \qquad \sin(m+1)x = \sin x\,\cos mx + \cos x\,\sin mx,$$
as in Example 3, may be further improved numerically. To this end, we express the differences $dc_{m+1}$ and $ds_{m+1}$ of subsequent cosine and sine values as follows:
$$dc_{m+1} := \cos(m+1)x - \cos mx = 2(\cos x - 1)\cos mx - \sin x\,\sin mx - \cos x\,\cos mx + \cos mx = -4\sin^2\frac{x}{2}\,\cos mx + [\cos mx - \cos(m-1)x],$$
$$ds_{m+1} := \sin(m+1)x - \sin mx = 2(\cos x - 1)\sin mx + \sin x\,\cos mx - \cos x\,\sin mx + \sin mx = -4\sin^2\frac{x}{2}\,\sin mx + [\sin mx - \sin(m-1)x].$$
This leads to a more elaborate recursive algorithm for computing $c_k$, $s_k$ in the case $x > 0$:
$$dc_1 := -2\sin^2\frac{x}{2}, \quad t := 2\,dc_1, \quad ds_1 := \sqrt{-dc_1(2 + dc_1)}, \quad s_0 := 0, \quad c_0 := 1,$$
and for $m := 1, 2, \ldots, k$:
$$c_m := c_{m-1} + dc_m, \quad dc_{m+1} := t\cdot c_m + dc_m, \quad s_m := s_{m-1} + ds_m, \quad ds_{m+1} := t\cdot s_m + ds_m.$$
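A direct transcription of this algorithm (a sketch of ours, in IEEE double precision) confirms its accuracy for the data of the numerical example given below:

```python
import math

def cos_sin_diff(x, k):
    # difference form of the recursion, for x > 0
    dc = -2.0 * math.sin(0.5 * x) ** 2      # dc_1 = cos x - 1
    t = 2.0 * dc                            # t = -4 sin^2(x/2)
    ds = math.sqrt(-dc * (2.0 + dc))        # ds_1 = sin x
    c, s = 1.0, 0.0                         # c_0, s_0
    for _ in range(k):
        c += dc
        dc = t * c + dc
        s += ds
        ds = t * s + ds
    return c, s                             # cos kx, sin kx

ck, sk = cos_sin_diff(0.001, 1000)
err_c = abs(ck - math.cos(1.0))
err_s = abs(sk - math.sin(1.0))
# both errors stay close to machine precision, illustrating the
# improved stability compared with Examples 2 and 3
```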
For the error analysis, note that $c_k$ and $s_k$ are functions of $s = \sin(x/2)$:
$$c_k = \cos(2k\arcsin s) =: \varphi_1(s), \qquad s_k = \sin(2k\arcsin s) =: \varphi_2(s).$$
An error $\Delta s = \varepsilon_s \sin(x/2)$ in the calculation of $s$ therefore causes, to a first-order approximation, the following error in $c_k$:
$$\frac{\partial\varphi_1}{\partial s}\,\varepsilon_s\sin\frac{x}{2} = \varepsilon_s\,\frac{-2k\sin kx}{\cos(x/2)}\,\sin\frac{x}{2} = -2k\tan\frac{x}{2}\,\sin kx\cdot\varepsilon_s,$$
and in $s_k$:
$$\frac{\partial\varphi_2}{\partial s}\,\varepsilon_s\sin\frac{x}{2} = 2k\tan\frac{x}{2}\,\cos kx\cdot\varepsilon_s.$$
Comparison with the inherent errors (1.4.3) shows these errors to be harmless for small $|x|$. The algorithm is then numerically stable, at least as far as the influence of the roundoff error $\varepsilon_s$ is concerned.
Again we illustrate our analytical considerations with some numerical results.
Let $x = 0.001$, $k = 1000$.

Algorithm     Result for cos kx           Relative error
Example 2     0.540 302 121 124           -0.34 · 10^{-6}
Example 3     0.540 302 305 776           -0.17 · 10^{-9}
Example 4     0.540 302 305 865           -0.58 · 10^{-11}
Exact value   0.540 302 305 868 140...
Example 5. We will derive some results which will be useful for the analysis of algorithms for solving linear equations in Section 4.5. Given the quantities $c, a_1, \ldots, a_n, b_1, \ldots, b_{n-1}$ with $a_n \ne 0$, we want to find the solution $\beta_n$ of the linear equation
$$(1.4.4)\qquad c - a_1 b_1 - \cdots - a_{n-1} b_{n-1} - a_n \beta_n = 0.$$
Floating-point arithmetic yields the approximate solution
$$(1.4.5)\qquad b_n = \mathrm{fl}\left(\frac{c - a_1 b_1 - \cdots - a_{n-1} b_{n-1}}{a_n}\right)$$
as follows:
$$s_0 := c;$$
for $j := 1, 2, \ldots, n-1$:
$$(1.4.6)\qquad s_j := \mathrm{fl}(s_{j-1} - a_j b_j) = \bigl(s_{j-1} - a_j b_j(1+\mu_j)\bigr)(1+\alpha_j), \qquad b_n := \mathrm{fl}(s_{n-1}/a_n) = (1+\delta)\,s_{n-1}/a_n,$$
with $|\mu_j|, |\alpha_j|, |\delta| \le \mathrm{eps}$. If $a_n = 1$, as is frequently the case in applications, then $\delta = 0$, since $b_n := s_{n-1}$.
We will now describe two useful estimates for the residual
$$r := c - a_1 b_1 - \cdots - a_n b_n.$$
From (1.4.6) follow the equations
$$s_0 - c = 0,$$
$$s_j - (s_{j-1} - a_j b_j) = s_j - \left(\frac{s_j}{1+\alpha_j} + a_j b_j \mu_j\right) = s_j\,\frac{\alpha_j}{1+\alpha_j} - a_j b_j \mu_j, \qquad j = 1, 2, \ldots, n-1,$$
$$a_n b_n - s_{n-1} = \delta\,s_{n-1}.$$
Summing these equations yields
$$r = c - \sum_{i=1}^{n} a_i b_i = \sum_{j=1}^{n-1}\left(-s_j\,\frac{\alpha_j}{1+\alpha_j} + a_j b_j \mu_j\right) - \delta\,s_{n-1},$$
and thereby the first one of the promised estimates,
$$(1.4.7)\qquad |r| \le \frac{\mathrm{eps}}{1-\mathrm{eps}}\Bigl[\delta\cdot|s_{n-1}| + \sum_{j=1}^{n-1}\bigl(|s_j| + |a_j b_j|\bigr)\Bigr], \qquad \delta := \begin{cases} 0 & \text{if } a_n = 1, \\ 1 & \text{otherwise}. \end{cases}$$
The second estimate is cruder than (1.4.7). From (1.4.6),
$$(1.4.8)\qquad b_n = \left[c\prod_{k=1}^{n-1}(1+\alpha_k) - \sum_{j=1}^{n-1} a_j b_j (1+\mu_j) \prod_{k=j}^{n-1}(1+\alpha_k)\right]\frac{1+\delta}{a_n},$$
which can be solved for $c$:
$$(1.4.9)\qquad c = \sum_{j=1}^{n-1} a_j b_j (1+\mu_j) \prod_{k=1}^{j-1}(1+\alpha_k)^{-1} + a_n b_n (1+\delta)^{-1} \prod_{k=1}^{n-1}(1+\alpha_k)^{-1}.$$
A simple induction argument over $m$ shows that
$$1+\sigma = \prod_{k=1}^{m}(1+\sigma_k)^{\pm 1}, \qquad |\sigma_k| \le \mathrm{eps}, \qquad m\cdot\mathrm{eps} < 1$$
implies
$$|\sigma| \le \frac{m\cdot\mathrm{eps}}{1 - m\cdot\mathrm{eps}}.$$
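The required estimate can be obtained from the following elementary bounds (a sketch of ours; Bernoulli's inequality replaces the explicit induction step):

```latex
% Each factor satisfies  1-\mathrm{eps} \le (1+\sigma_k)^{\pm 1} \le (1-\mathrm{eps})^{-1},
% hence  (1-\mathrm{eps})^{m} \le 1+\sigma \le (1-\mathrm{eps})^{-m}.
% By Bernoulli's inequality, (1-\mathrm{eps})^{m} \ge 1 - m\,\mathrm{eps} > 0, so
\begin{align*}
1+\sigma &\le (1-\mathrm{eps})^{-m}
          \le \frac{1}{1 - m\,\mathrm{eps}}
          = 1 + \frac{m\,\mathrm{eps}}{1 - m\,\mathrm{eps}},\\
1+\sigma &\ge (1-\mathrm{eps})^{m}
          \ge 1 - m\,\mathrm{eps}
          \ge 1 - \frac{m\,\mathrm{eps}}{1 - m\,\mathrm{eps}},
\end{align*}
% and the two bounds together give  |\sigma| \le m\,\mathrm{eps}/(1 - m\,\mathrm{eps}).
```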
In view of (1.4.9) this ensures the existence of quantities $\varepsilon_j$ with
$$(1.4.10)\qquad c = \sum_{j=1}^{n-1} a_j b_j (1 + j\cdot\varepsilon_j) + a_n b_n \bigl(1 + (n-1+\delta)\,\varepsilon_n\bigr),$$
$$|\varepsilon_j| \le \frac{\mathrm{eps}}{1 - n\cdot\mathrm{eps}}, \qquad \delta := \begin{cases} 0 & \text{if } a_n = 1, \\ 1 & \text{otherwise}. \end{cases}$$
For $r = c - a_1 b_1 - a_2 b_2 - \cdots - a_n b_n$ we have consequently
$$(1.4.11)\qquad |r| \le \frac{\mathrm{eps}}{1 - n\cdot\mathrm{eps}}\Bigl[\sum_{j=1}^{n-1} j\,|a_j b_j| + (n-1+\delta)\,|a_n b_n|\Bigr].$$
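Both estimates can be verified experimentally. The sketch below (our own; the data are arbitrary, chosen with $a_n \ne 1$ so that $\delta = 1$) runs the loop (1.4.6) in IEEE double precision, computes the residual $r$ exactly with rational arithmetic, and checks it against (1.4.7) and (1.4.11), taking eps $= 2^{-53}$, the unit roundoff of double precision:

```python
from fractions import Fraction

# arbitrary test data; a_n != 1, hence delta = 1
a = [0.3, 0.7, 1.9, 2.6, 0.8]
b = [1.1, 0.4, 2.2, 0.6]           # b_1, ..., b_{n-1}
c = 5.0
n = len(a)
eps = 2.0 ** -53                   # unit roundoff of IEEE double precision

# the loop (1.4.6) in double precision, recording s_1, ..., s_{n-1}
s = c
partial = []
for j in range(n - 1):
    s = s - a[j] * b[j]
    partial.append(s)
b_n = s / a[n - 1]

# exact residual r = c - a_1 b_1 - ... - a_n b_n via rational arithmetic
bb = b + [b_n]
r = Fraction(c) - sum(Fraction(a[i]) * Fraction(bb[i]) for i in range(n))

delta = 0 if a[n - 1] == 1 else 1
bound1 = eps / (1 - eps) * (delta * abs(partial[-1])
         + sum(abs(sj) + abs(a[j] * b[j]) for j, sj in enumerate(partial)))
bound2 = eps / (1 - n * eps) * (sum((j + 1) * abs(a[j] * b[j]) for j in range(n - 1))
         + (n - 1 + delta) * abs(a[n - 1] * b_n))
# |r| <= bound1 and |r| <= bound2, as guaranteed by (1.4.7) and (1.4.11)
```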
In particular, (1.4.8) reveals the numerical stability of our algorithm for computing $\beta_n$. The roundoff error $\alpha_m$ contributes the amount
$$\frac{c - a_1 b_1 - a_2 b_2 - \cdots - a_m b_m}{a_n}\,\alpha_m$$
to the absolute error in $\beta_n$. This, however, is at most equal to the maximal value of
$$\left|\frac{c\,\varepsilon_c - a_1 b_1\,\varepsilon_{a_1} - \cdots - a_m b_m\,\varepsilon_{a_m}}{a_n}\right| \le \frac{\bigl(|c| + \sum_{i=1}^{m} |a_i b_i|\bigr)\,\mathrm{eps}}{|a_n|},$$
that is, no more than the influence of the input errors $\varepsilon_c$ and $\varepsilon_{a_i}$ of $c$ and $a_i$, $i = 1, \ldots, m$, respectively, provided $|\varepsilon_c|, |\varepsilon_{a_i}| \le \mathrm{eps}$. The remaining roundoff errors $\mu_k$ and $\delta$ are similarly shown to be harmless.
The numerical stability of the above algorithm is often shown by interpreting (1.4.10) in the sense of backward analysis: the computed approximate solution $b_n$ is the exact solution of the equation
$$c - \bar{a}_1 b_1 - \cdots - \bar{a}_n b_n = 0,$$
whose coefficients
$$\bar{a}_j := a_j(1 + j\cdot\varepsilon_j), \quad 1 \le j \le n-1, \qquad \bar{a}_n := a_n\bigl(1 + (n-1+\delta)\,\varepsilon_n\bigr),$$
have changed only slightly from their original values $a_j$. This kind of analysis, however, involves the difficulty of having to define how large $n$ can be so that errors of the form $n\varepsilon$, $|\varepsilon| \le \mathrm{eps}$, can still be considered as being of the same order of magnitude as the machine precision eps.