In view of (1.4.9) this ensures the existence of quantities $\varepsilon_j$ with

$$(1.4.10)\qquad c=\sum_{j=1}^{n-1} a_j b_j\,(1+j\,\varepsilon_j) + a_n b_n\bigl(1+(n-1+\delta)\varepsilon_n\bigr),\qquad |\varepsilon_j| \le \frac{\mathrm{eps}}{1-n\cdot \mathrm{eps}},\qquad \delta := \begin{cases}0 & \text{if } a_n = 1,\\ 1 & \text{otherwise.}\end{cases}$$

For $r = c - a_1b_1 - a_2b_2 - \cdots - a_nb_n$ we have consequently

$$(1.4.11)\qquad |r| \le \frac{\mathrm{eps}}{1-n\cdot \mathrm{eps}}\left[\,\sum_{j=1}^{n-1} j\,|a_jb_j| + (n-1+\delta)\,|a_nb_n|\right].$$
In particular, (1.4.8) reveals the numerical stability of our algorithm for computing $\beta_n$. The roundoff error $\alpha_m$ contributes the amount

$$\frac{c-a_1b_1-a_2b_2-\cdots-a_mb_m}{a_n}\,\alpha_m$$

to the absolute error in $\beta_n$. This, however, is at most equal to

$$\frac{|c|+\sum_{i=1}^{m}|a_ib_i|}{|a_n|}\,\mathrm{eps},$$

which represents no more than the influence of the input errors $\varepsilon_c$ and $\varepsilon_{a_i}$ of $c$ and $a_i$, $i=1,\dots,m$, respectively, provided $|\varepsilon_c|, |\varepsilon_{a_i}| \le \mathrm{eps}$. The remaining roundoff errors $\mu_k$ and $\delta$ are similarly shown to be harmless.
The numerical stability of the above algorithm is often shown by interpreting (1.4.10) in the sense of backward analysis: the computed approximate solution $b_n$ is the exact solution of the equation

$$c - \bar a_1 b_1 - \cdots - \bar a_n b_n = 0,$$

whose coefficients

$$\bar a_j := a_j(1 + j\,\varepsilon_j),\quad 1 \le j \le n-1,\qquad \bar a_n := a_n\bigl(1 + (n-1+\delta)\varepsilon_n\bigr)$$

have changed only slightly from their original values $a_j$. This kind of analysis, however, involves the difficulty of having to define how large $n$ can be so that errors of the form $n\varepsilon$, $|\varepsilon| \le \mathrm{eps}$, can still be considered of the same order of magnitude as the machine precision eps.
1.5 Interval Arithmetic; Statistical Roundoff Estimation

In a typical numerical method, however, the number of arithmetic operations, and consequently the number of individual roundoff errors, is very large, and the corresponding algorithm is too complicated to permit the estimation of the total effect of all roundoff errors in this fashion.
A technique known as interval arithmetic offers an approach to determining exact upper bounds for the absolute error of an algorithm, taking into account all roundoff and data errors. Interval arithmetic is based on the realization that the exact values of all real numbers $a \in \mathbb{R}$ which either enter an algorithm or are computed as intermediate or final results are usually not known. At best one knows small intervals which contain $a$. For this reason, the interval-arithmetic approach is to calculate systematically in terms of such intervals

$$\tilde a = [a', a''],$$

bounded by machine numbers $a', a'' \in A$, rather than in terms of single real numbers $a$. Each unknown number $a$ is represented by an interval $\tilde a = [a', a'']$ with $a \in \tilde a$. The arithmetic operations $\circledast \in \{\oplus, \ominus, \otimes, \oslash\}$ between intervals are defined so as to be compatible with the above interpretation.
That is, $\tilde c := \tilde a \circledast \tilde b$ is defined as an interval (as small as possible) with machine-number endpoints satisfying

$$\tilde c \supseteq \{\,a * b \mid a \in \tilde a \text{ and } b \in \tilde b\,\}.$$

In the case of addition, for instance, this holds if $\oplus$ is defined as follows:

$$[c', c''] := [a', a''] \oplus [b', b''],\qquad c' := \max\{\gamma \in A \mid \gamma \le a' + b'\},\quad c'' := \min\{\gamma \in A \mid \gamma \ge a'' + b''\},$$

with $A$ denoting again the set of machine numbers. In the case of multiplication $\otimes$, assuming, say, $a' > 0$, $b' > 0$,

$$[c', c''] := [a', a''] \otimes [b', b'']$$

can be defined by letting

$$c' := \max\{\gamma \in A \mid \gamma \le a' \times b'\},\qquad c'' := \min\{\gamma \in A \mid \gamma \ge a'' \times b''\}.$$
Replacing, in these and similar fashions, every quantity by an interval and every arithmetic operation by its corresponding interval operation (this is readily implemented on computers), we obtain interval algorithms which produce intervals guaranteed to contain the desired exact solutions. The data for these interval algorithms will again be intervals, chosen to allow for data errors.
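The endpoint rules above can be sketched in a few lines of Python. This is a minimal illustration, assuming IEEE double precision as the machine-number set $A$ and using `math.nextafter` (Python 3.9+) to move each endpoint outward by one machine number; all function names are ours:

```python
import math

def round_out(lo, hi):
    # A nearest-rounded result is within half an ulp of the exact value,
    # so stepping one machine number outward guarantees enclosure.
    return math.nextafter(lo, -math.inf), math.nextafter(hi, math.inf)

def iadd(a, b):
    # [a' + b', a'' + b''], endpoints widened outward
    return round_out(a[0] + b[0], a[1] + b[1])

def isub(a, b):
    # [a' - b'', a'' - b'], endpoints widened outward
    return round_out(a[0] - b[1], a[1] - b[0])

def imul(a, b):
    # general case: min/max over all four endpoint products
    p = [a[0] * b[0], a[0] * b[1], a[1] * b[0], a[1] * b[1]]
    return round_out(min(p), max(p))

# e.g. [1,1] (+) [2,2] yields an interval containing 3
print(iadd((1.0, 1.0), (2.0, 2.0)))
```

Stepping one machine number outward is slightly cruder than the max/min definitions in the text, which give the tightest machine-number enclosure, but it is always safe.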
It has been found, however, that an uncritical utilization of interval arithmetic techniques leads to error bounds which, while certainly reliable, are in most cases much too pessimistic. It is not enough simply to substitute interval operations for arithmetic operations without taking into account how the particular roundoff or data errors enter into the respective results. For example, it happens quite frequently that a certain roundoff error $\varepsilon$ impairs some intermediate results $u_1, \dots, u_n$ of an algorithm considerably,

$$\left|\frac{\partial u_i}{\partial \varepsilon}\right| \gg 1 \quad\text{for } i = 1, \dots, n,$$

while the final result $y = f(u_1, \dots, u_n)$ is not strongly affected,

$$\left|\frac{\partial y}{\partial \varepsilon}\right| \le 1,$$

even though it is calculated from the highly inaccurate intermediate values $u_1, \dots, u_n$: the algorithm shows error damping.
Example 1. Evaluate $y = \varphi(x) = x^3 - 3x^2 + 3x = ((x-3)\times x + 3)\times x$ using Horner's scheme:

$$u := x - 3,\quad v := u \times x,\quad w := v + 3,\quad y := w \times x.$$

The value $x$ is known to lie in the interval $x \in \tilde x := [0.9,\,1.1]$. Starting with this interval and using straight interval arithmetic, we find

$$\tilde u = \tilde x \ominus [3, 3] = [-2.1,\,-1.9],$$
$$\tilde v = \tilde u \otimes \tilde x = [-2.31,\,-1.71],$$
$$\tilde w = \tilde v \oplus [3, 3] = [0.69,\,1.29],$$
$$\tilde y = \tilde w \otimes \tilde x = [0.621,\,1.419].$$

The interval $\tilde y$ is much too large compared to the interval

$$\{\varphi(x) \mid x \in \tilde x\} = [0.999,\,1.001],$$

which describes the actual effect of an error in $x$ on $\varphi(x)$.
Example 2. Using just ordinary 2-digit arithmetic gives considerably more accurate results than the interval arithmetic suggests:

         x = 0.9    x = 1.1
    u     -2.1       -1.9
    v     -1.9       -2.1
    w      1.1        0.9
    y      0.99       0.99
For the successful application of interval arithmetic, therefore, it is not sufficient merely to replace the arithmetic operations of commonly used algorithms by interval operations: It is necessary to develop new algorithms producing the same final results but having an improved error-dependence pattern for the intermediate results.
Example 3. In Example 1 a simple transformation of $\varphi(x)$ suffices:

$$y = \varphi(x) = 1 + (x-1)^3.$$

When applied to the corresponding evaluation algorithm and the same starting interval $\tilde x = [0.9,\,1.1]$, interval arithmetic now produces the optimal result:

$$\tilde u_1 := \tilde x \ominus [1, 1] = [-0.1,\,0.1],$$
$$\tilde u_2 := \tilde u_1 \otimes \tilde u_1 = [-0.01,\,0.01],$$
$$\tilde u_3 := \tilde u_2 \otimes \tilde u_1 = [-0.001,\,0.001],$$
$$\tilde y := \tilde u_3 \oplus [1, 1] = [0.999,\,1.001].$$
As far as ordinary arithmetic is concerned, there is not much difference between the two evaluation algorithms of Example 1 and Example 3. Using two digits again, the results are practically identical to those in Example 2:
         x = 0.9    x = 1.1
    u1    -0.1        0.1
    u2     0.01       0.01
    u3    -0.001      0.001
    y      1.0        1.0
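Examples 1 and 3 can be replayed with naive interval operations over exact rational endpoints. This is a sketch; using `Fraction` removes machine rounding from the picture, so the interval widths below are due solely to interval arithmetic itself:

```python
from fractions import Fraction as F

def add(a, b): return (a[0] + b[0], a[1] + b[1])
def sub(a, b): return (a[0] - b[1], a[1] - b[0])
def mul(a, b):
    p = [a[0] * b[0], a[0] * b[1], a[1] * b[0], a[1] * b[1]]
    return (min(p), max(p))

x = (F(9, 10), F(11, 10))             # the starting interval [0.9, 1.1]

# Example 1: Horner evaluation of x^3 - 3x^2 + 3x
u = sub(x, (F(3), F(3)))
v = mul(u, x)
w = add(v, (F(3), F(3)))
y_horner = mul(w, x)                  # the wide interval [0.621, 1.419]

# Example 3: the transformed form 1 + (x - 1)^3
u1 = sub(x, (F(1), F(1)))
u2 = mul(u1, u1)
u3 = mul(u2, u1)
y_transf = add(u3, (F(1), F(1)))      # the optimal interval [0.999, 1.001]

print(y_horner, y_transf)
```

The same operations, in a different algebraic arrangement, produce an enclosure more than 400 times narrower.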
For an in-depth treatment of interval arithmetic the reader should con- sult, for instance, Moore (1966) or Kulisch (1969).
In order to obtain statistical roundoff estimates [Rademacher (1948)], we assume that the relative roundoff error [see (1.2.6)] which is caused by an elementary operation is a random variable with values in the interval $[-\mathrm{eps}, \mathrm{eps}]$. Furthermore we assume that the roundoff errors $\varepsilon$ attributable to different operations are independent random variables. By $\mu_\varepsilon$ we denote the expected value and by $\sigma_\varepsilon^2$ the variance of the above roundoff distribution. They satisfy the general relationship

$$\mu_\varepsilon = E(\varepsilon),\qquad \sigma_\varepsilon^2 = E\bigl((\varepsilon - E(\varepsilon))^2\bigr) = E(\varepsilon^2) - (E(\varepsilon))^2 = \mu_{\varepsilon^2} - \mu_\varepsilon^2.$$

Assuming a uniform distribution in the interval $[-\mathrm{eps}, \mathrm{eps}]$, we get

$$(1.5.1)\qquad \mu_\varepsilon = E(\varepsilon) = 0,\qquad \sigma_\varepsilon^2 = E(\varepsilon^2) = \frac{1}{2\,\mathrm{eps}}\int_{-\mathrm{eps}}^{\mathrm{eps}} t^2\,dt = \frac{\mathrm{eps}^2}{3} =: \bar\varepsilon^2.$$

Closer examinations show the roundoff distribution to be not quite uniform [see Sterbenz (1974), Exercise 22, p. 122]. It should also be kept in mind that the ideal roundoff pattern is only an approximation to the roundoff patterns observed in actual computing machinery, so that the quantities $\mu_\varepsilon$ and $\sigma_\varepsilon^2$ may have to be determined empirically.
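A quick Monte Carlo check of (1.5.1), under the stated assumption of a uniform distribution on $[-\mathrm{eps}, \mathrm{eps}]$; the sample size and seed are arbitrary choices of ours:

```python
import random

random.seed(1)
eps = 5e-4                 # the 4-digit machine precision used later
n = 200_000

samples = [random.uniform(-eps, eps) for _ in range(n)]
mean = sum(samples) / n
var = sum((s - mean) ** 2 for s in samples) / n

# mean should be near 0, var near eps^2 / 3
print(mean, var, eps**2 / 3)
```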
The results $x$ of algorithms subjected to random roundoff errors become random variables themselves, with expected values $\mu_x$ and variances $\sigma_x^2$ connected again by the basic relation

$$\sigma_x^2 = E\bigl((x - E(x))^2\bigr) = E(x^2) - (E(x))^2 = \mu_{x^2} - \mu_x^2.$$

The propagation of previous roundoff effects through elementary operations is described by the following formulas for arbitrary independent random variables $x, y$ and constants $\alpha, \beta \in \mathbb{R}$:

$$(1.5.2)\qquad \mu_{\alpha x \pm \beta y} = E(\alpha x \pm \beta y) = \alpha E(x) \pm \beta E(y) = \alpha\mu_x \pm \beta\mu_y,$$
$$\sigma_{\alpha x \pm \beta y}^2 = E\bigl((\alpha x \pm \beta y)^2\bigr) - \bigl(E(\alpha x \pm \beta y)\bigr)^2 = \alpha^2 E\bigl((x-E(x))^2\bigr) + \beta^2 E\bigl((y-E(y))^2\bigr) = \alpha^2\sigma_x^2 + \beta^2\sigma_y^2.$$

The first of the above formulas follows from the linearity of the expected-value operator; it holds for arbitrary random variables $x, y$. The second formula is based on the relation $E(x\,y) = E(x)E(y)$, which holds whenever $x$ and $y$ are independent. Similarly, we obtain for independent $x$ and $y$

$$(1.5.3)\qquad \mu_{x\times y} = E(x \times y) = E(x)E(y) = \mu_x\mu_y,$$
$$\sigma_{x\times y}^2 = E\bigl[(x\times y) - E(x)E(y)\bigr]^2 = \mu_{x^2}\mu_{y^2} - \mu_x^2\mu_y^2 = \sigma_x^2\sigma_y^2 + \mu_x^2\sigma_y^2 + \mu_y^2\sigma_x^2.$$
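The product formulas (1.5.3) can likewise be checked by simulation for two independent random variables; the means, standard deviations, distribution, and sample size below are arbitrary choices of ours:

```python
import random

random.seed(2)
mu_x, sig_x = 2.0, 0.3
mu_y, sig_y = -1.0, 0.2
n = 400_000

xs = [random.gauss(mu_x, sig_x) for _ in range(n)]
ys = [random.gauss(mu_y, sig_y) for _ in range(n)]
ps = [x * y for x, y in zip(xs, ys)]

mean_p = sum(ps) / n
var_p = sum((p - mean_p) ** 2 for p in ps) / n

predicted_mean = mu_x * mu_y
predicted_var = (sig_x**2 * sig_y**2
                 + mu_x**2 * sig_y**2
                 + mu_y**2 * sig_x**2)
print(mean_p, predicted_mean, var_p, predicted_var)
```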
Example. For calculating $y = a^2 - b^2$ (see Example 2 in Section 1.3) we find, under the assumptions (1.5.1), $E(a) = a$, $\sigma_a^2 = 0$, $E(b) = b$, $\sigma_b^2 = 0$, and using (1.5.2) and (1.5.3), that

$$\eta_1 = a^2(1+\varepsilon_1),\qquad E(\eta_1) = a^2,\qquad \sigma_{\eta_1}^2 = a^4\bar\varepsilon^2,$$
$$\eta_2 = b^2(1+\varepsilon_2),\qquad E(\eta_2) = b^2,\qquad \sigma_{\eta_2}^2 = b^4\bar\varepsilon^2,$$
$$y = (\eta_1 - \eta_2)(1+\varepsilon_3),\qquad E(y) = E(\eta_1 - \eta_2)\,E(1+\varepsilon_3) = a^2 - b^2$$

($\eta_1$, $\eta_2$, $\varepsilon_3$ are assumed to be independent),

$$\sigma_y^2 = \sigma_{\eta_1-\eta_2}^2\sigma_{1+\varepsilon_3}^2 + \mu_{\eta_1-\eta_2}^2\sigma_{1+\varepsilon_3}^2 + \mu_{1+\varepsilon_3}^2\sigma_{\eta_1-\eta_2}^2$$
$$= (\sigma_{\eta_1}^2 + \sigma_{\eta_2}^2)\bar\varepsilon^2 + (a^2-b^2)^2\bar\varepsilon^2 + 1\cdot(\sigma_{\eta_1}^2 + \sigma_{\eta_2}^2)$$
$$= (a^4+b^4)\bar\varepsilon^4 + \bigl[(a^2-b^2)^2 + a^4 + b^4\bigr]\bar\varepsilon^2.$$

Neglecting $\bar\varepsilon^4$ compared to $\bar\varepsilon^2$ yields

$$\sigma_y^2 \doteq \bigl((a^2-b^2)^2 + a^4 + b^4\bigr)\bar\varepsilon^2.$$

For $a := 0.3237$, $b := 0.3134$, $\mathrm{eps} = 5\times 10^{-4}$ (see Example 5 in Section 1.3), we find

$$\sigma_y \doteq 0.144\,\bar\varepsilon = 0.0000415,$$

which is close in magnitude to the true error $\Delta y = 0.00001787$ for 4-digit arithmetic. Compare this with the error bound $0.00010478$ furnished by (1.3.17).
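The closing numbers of this example can be reproduced directly; this sketch takes eps and the data from the example itself:

```python
import math

a, b, eps = 0.3237, 0.3134, 5e-4
eps_bar = eps / math.sqrt(3)          # from (1.5.1): eps_bar^2 = eps^2 / 3

# first-order variance, eps_bar^4 terms neglected
var_y = ((a**2 - b**2)**2 + a**4 + b**4) * eps_bar**2
sigma_y = math.sqrt(var_y)

print(sigma_y)    # approximately 4.15e-5, i.e. 0.144 * eps_bar
```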
We denote by $M(x)$ the set of all quantities which, directly or indirectly, have entered the calculation of the quantity $x$. If $M(x) \cap M(y) \ne \emptyset$ for the algorithm in question, then the random variables $x$ and $y$ are in general dependent.

The statistical roundoff error analysis of an algorithm becomes extremely complicated if dependent random variables are present. It becomes quite easy, however, under the following simplifying assumptions:

(1.5.4)
(a) The operands of each arithmetic operation are independent random variables.
(b) In calculating variances, all terms of an order higher than the smallest one are neglected.
(c) All variances are so small that for elementary operations, in first-order approximation, $E(x * y) \doteq E(x) * E(y) = \mu_x * \mu_y$.
If in addition the expected values $\mu_x$ are replaced by the estimated values $x$, and relative variances $\varepsilon_x^2 := \sigma_x^2/\mu_x^2 \approx \sigma_x^2/x^2$ are introduced, then (1.5.2) and (1.5.3) yield [compare (1.2.6), (1.3.5)]

$$(1.5.5)\qquad z = \mathrm{fl}(x \pm y):\quad \varepsilon_z^2 \doteq \Bigl(\frac{x}{z}\Bigr)^2\varepsilon_x^2 + \Bigl(\frac{y}{z}\Bigr)^2\varepsilon_y^2 + \bar\varepsilon^2,$$
$$z = \mathrm{fl}(x \times y):\quad \varepsilon_z^2 \doteq \varepsilon_x^2 + \varepsilon_y^2 + \bar\varepsilon^2,$$
$$z = \mathrm{fl}(x / y):\quad \varepsilon_z^2 \doteq \varepsilon_x^2 + \varepsilon_y^2 + \bar\varepsilon^2.$$

It should be kept in mind, however, that these results are valid only if the hypotheses (1.5.4), in particular (1.5.4a), are met.
It is possible to evaluate the above formulas in the course of a numerical computation and thereby to obtain an estimate of the error of the final results. As in the case of interval arithmetic, this leads to an arithmetic of paired quantities $(x, \varepsilon_x^2)$ for which elementary operations are defined with the help of the above or similar formulas. Error bounds for the final result $r$ are then obtained from the relative variance $\varepsilon_r^2$, assuming that the final error distribution is normal. This assumption is justified inasmuch as the distributions of propagated errors alone tend to become normal if subjected to many elementary operations. At each such operation the nonnormal roundoff error distribution is superimposed on the distribution of previous errors. However, after many operations, the propagated errors are large compared to the newly created roundoff errors, so that the latter do not appreciably affect the normality of the total error distribution. Assuming the final error distribution to be normal, the actual relative error of the final result $r$ is bounded with probability 0.9 by $2\varepsilon_r$.
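The arithmetic of paired quantities $(x, \varepsilon_x^2)$ just described can be sketched as a small class implementing the first-order rules (1.5.5). It assumes independence of operands as in (1.5.4a); the class name and the eps value are our choices:

```python
import math

EPS_BAR_SQ = (5e-4) ** 2 / 3       # eps_bar^2 for the 4-digit example

class Stat:
    """A value paired with its relative variance eps_x^2."""
    def __init__(self, x, relvar=0.0):
        self.x = x
        self.relvar = relvar

    def _addsub(self, other, z):
        rv = ((self.x / z) ** 2 * self.relvar
              + (other.x / z) ** 2 * other.relvar + EPS_BAR_SQ)
        return Stat(z, rv)

    def __add__(self, other):
        return self._addsub(other, self.x + other.x)

    def __sub__(self, other):
        return self._addsub(other, self.x - other.x)

    def __mul__(self, other):
        return Stat(self.x * other.x,
                    self.relvar + other.relvar + EPS_BAR_SQ)

    def __truediv__(self, other):
        return Stat(self.x / other.x,
                    self.relvar + other.relvar + EPS_BAR_SQ)

    def sigma(self):
        # absolute standard deviation |x| * eps_x
        return abs(self.x) * math.sqrt(self.relvar)

a = Stat(0.3237)       # exact data: zero relative variance
b = Stat(0.3134)
y = a * a - b * b
print(y.sigma())       # approximately 4.15e-5, as in the example
```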
Exercises for Chapter 1
1. Show that with floating-point arithmetic of $t$ decimal places

$$\mathrm{rd}(a) = \frac{a}{1+\varepsilon} \quad\text{with } |\varepsilon| \le 5\cdot 10^{-t}$$

holds in analogy to (1.2.2). [In parallel with (1.2.6), as a consequence, $\mathrm{fl}(a * b) = (a * b)/(1+\varepsilon)$ with $|\varepsilon| \le 5\cdot 10^{-t}$ for all arithmetic operations $* = +, -, \times, /$.]
2. Let $a$, $b$, $c$ be fixed-point numbers with $N$ decimal places after the decimal point, and suppose $0 < a, b, c < 1$. The substitute product $a * b$ is defined as follows: add $10^{-N}/2$ to the exact product $a\cdot b$, and delete the $(N+1)$-st and subsequent digits.
(a) Give a bound for $|(a*b)*c - abc|$.
(b) By how many units of the $N$-th place can $(a*b)*c$ and $a*(b*c)$ differ?
3. Evaluating $\sum_{i=1}^n a_i$ in floating-point arithmetic may lead to an arbitrarily large relative error. If, however, all summands $a_i$ are of the same sign, then this relative error is bounded. Derive a crude bound for this error, disregarding terms of higher order.
4. Show how to evaluate the following expressions in a numerically stable fashion:

$$\frac{1}{1+2x} - \frac{1-x}{1+x} \quad\text{for } |x| \ll 1,$$
$$\sqrt{x + \frac{1}{x}} - \sqrt{x - \frac{1}{x}} \quad\text{for } x \gg 1,$$
$$\frac{1-\cos x}{x} \quad\text{for } x \ne 0,\ |x| \ll 1.$$
5. Suppose a computer program is available which yields values for $\arcsin y$ in floating-point representation with $t$ decimal mantissa places and for $|y| \le 1$, subject to a relative error $\varepsilon$ with $|\varepsilon| \le 5\times 10^{-t}$. In view of the relation

$$\arctan x = \arcsin\frac{x}{\sqrt{1+x^2}},$$

this program could also be used to evaluate $\arctan x$. Determine for which values $x$ this procedure is numerically stable by estimating the relative error.
6. For given $z$, the function $\tan(z/2)$ can be computed according to the formula

$$\tan\frac{z}{2} = \pm\left(\frac{1-\cos z}{1+\cos z}\right)^{1/2}.$$

Is this method of evaluation numerically stable for $z \approx 0$, $z \approx \pi/2$? If necessary, give numerically stable alternatives.
7. The function

$$f(\varphi, k_c) := \frac{1}{\cos^2\varphi + k_c^2\sin^2\varphi}$$

is to be evaluated for $0 \le \varphi \le \pi/2$, $0 < k_c \le 1$. The method

$$k^2 := 1 - k_c^2,\qquad f(\varphi, k_c) := \frac{1}{1 - k^2\sin^2\varphi}$$

avoids the calculation of $\cos\varphi$ and is faster. Compare this with the direct evaluation of the original expression for $f(\varphi, k_c)$ with respect to numerical stability.
8. For the linear function $f(x) := a + bx$, where $a \ne 0$, $b \ne 0$, compute the first derivative $D_h f(0) = f'(0) = b$ by the formula

$$D_h f(0) = \frac{f(h) - f(-h)}{2h}$$

in binary floating-point arithmetic. Suppose that $a$ and $b$ are binary machine numbers, and $h$ a power of 2. Multiplication by $h$ and division by $2h$ can therefore be carried out exactly. Give a bound for the relative error of $D_h f(0)$. What is the behavior of this bound as $h \to 0$?
9. The square root $\pm(u + iv)$ of a complex number $x + iy$ with $y \ne 0$ may be calculated from the formulas

$$u = \pm\sqrt{\frac{x + \sqrt{x^2+y^2}}{2}},\qquad v = \frac{y}{2u}.$$

Compare the cases $x \ge 0$ and $x < 0$ with respect to their numerical stability. Modify the formulas if necessary to ensure numerical stability.
10. The variance $S^2$ of a set of observations $x_1, \dots, x_n$ is to be determined. Which of the formulas

$$S^2 = \frac{1}{n-1}\left(\sum_{i=1}^n x_i^2 - n\bar x^2\right),$$
$$S^2 = \frac{1}{n-1}\sum_{i=1}^n (x_i - \bar x)^2 \quad\text{with } \bar x := \frac{1}{n}\sum_{i=1}^n x_i,$$

is numerically more trustworthy?
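A numerical illustration of the effect behind this exercise (not a substitute for the requested analysis): for data with a large mean and a small spread, the one-pass formula can suffer heavy cancellation. The data values below are arbitrary choices of ours:

```python
# Five observations with mean 10000000.2 and exact variance 0.025
xs = [10_000_000.0 + 0.1 * i for i in range(5)]
n = len(xs)
xbar = sum(xs) / n

# one-pass formula: difference of two huge, nearly equal sums
s2_onepass = (sum(x * x for x in xs) - n * xbar**2) / (n - 1)

# two-pass formula: squares of small deviations, no cancellation
s2_twopass = sum((x - xbar) ** 2 for x in xs) / (n - 1)

print(s2_onepass, s2_twopass)    # the exact variance is 0.025
```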
11. The coefficients $a_r$, $b_r$ $(r = 0, \dots, n)$ are, for fixed $x$, connected recursively:

$$(*)\qquad b_n := a_n;\qquad \text{for } r = n-1, n-2, \dots, 0:\quad b_r := x\,b_{r+1} + a_r.$$

(a) Show that the polynomials

$$A(z) := \sum_{r=0}^n a_r z^r,\qquad B(z) := \sum_{r=1}^n b_r z^{r-1}$$

satisfy

$$A(z) = (z - x)\cdot B(z) + b_0.$$
(b) Suppose $A(x) = b_0$ is to be calculated by the recursion $(*)$ for fixed $x$ in floating-point arithmetic, the result being $\bar b_0$. Show, using the formulas (compare Exercise 1)

$$\mathrm{fl}(u + v) = \frac{u+v}{1+\sigma},\quad |\sigma| \le \mathrm{eps},\qquad \mathrm{fl}(u\cdot v) = \frac{u\cdot v}{1+\pi},\quad |\pi| \le \mathrm{eps},$$

the inequality

$$|A(x) - \bar b_0| \le \frac{\mathrm{eps}}{1-\mathrm{eps}}\,(2e_0 - |\bar b_0|),$$

where $e_0$ is defined by the following recursion:

$$e_n := |a_n|/2;\qquad \text{for } r = n-1, n-2, \dots, 0:\quad e_r := |x|\,e_{r+1} + |\bar b_r|.$$
Hint: From $b_n := a_n$,

$$p_r := \mathrm{fl}(x\,\bar b_{r+1}) = \frac{x\,\bar b_{r+1}}{1+\pi_{r+1}},\qquad \bar b_r := \mathrm{fl}(p_r + a_r) = \frac{p_r + a_r}{1+\sigma_r} = x\,\bar b_{r+1} + a_r + \delta_r,\qquad r = n-1, \dots, 0,$$

derive

$$\delta_r = -x\,\bar b_{r+1}\,\frac{\pi_{r+1}}{1+\pi_{r+1}} - \sigma_r\,\bar b_r \qquad (r = n-1, \dots, 0);$$

then show $\bar b_0 = \sum_{k=0}^n (a_k + \delta_k)x^k$, $\delta_n := 0$, and estimate $\sum_{k=0}^n |\delta_k|\,|x|^k$.

12. Assuming the earth to be spherical, two points on its surface can be expressed
in Cartesian coordinates

$$p_i = [x_i, y_i, z_i] = [r\cos\alpha_i\cos\beta_i,\ r\sin\alpha_i\cos\beta_i,\ r\sin\beta_i],\qquad i = 1, 2,$$

where $r$ is the earth radius and $\alpha_i$, $\beta_i$ are the longitudes and latitudes of the two points $p_i$, respectively. If

$$\cos\sigma = \frac{p_1^T p_2}{r^2} = \cos(\alpha_1 - \alpha_2)\cos\beta_1\cos\beta_2 + \sin\beta_1\sin\beta_2,$$

then $r\sigma$ is the great-circle distance between the two points.
(a) Show that using the arccos function to determine $\sigma$ from the above expression is not numerically stable.
(b) Derive a numerically stable expression for σ.
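As a numerical illustration of part (a): for two nearby points the arccos-based formula can return a drastically wrong distance. The chord-based variant shown is one standard stable alternative, not necessarily the one the exercise intends; the radius value and the test points are our choices:

```python
import math

r = 6.371e6                       # assumed earth radius in meters
alpha1, beta1 = 0.0, 0.0
alpha2, beta2 = 1.0e-8, 0.0       # roughly 6.4 cm to the east

# arccos formula: cos(sigma) rounds to exactly 1.0 for such close points
cos_sigma = (math.cos(alpha1 - alpha2) * math.cos(beta1) * math.cos(beta2)
             + math.sin(beta1) * math.sin(beta2))
d_arccos = r * math.acos(cos_sigma)

# stable variant: chord length in Cartesian coordinates, then arcsin
def cart(alpha, beta):
    return (r * math.cos(alpha) * math.cos(beta),
            r * math.sin(alpha) * math.cos(beta),
            r * math.sin(beta))

p1, p2 = cart(alpha1, beta1), cart(alpha2, beta2)
chord = math.dist(p1, p2)
d_stable = 2 * r * math.asin(chord / (2 * r))

print(d_arccos, d_stable)    # typically 0.0 versus roughly 0.0637
```

The instability comes from the derivative of arccos blowing up near 1, so that a half-ulp rounding of $\cos\sigma$ destroys all information about small $\sigma$.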