D + optimisation modelling versus least- - Practical Magnetotellurics: A Guide to Electrical Co

structure philosophy, or a treatise on the difference between the mathematical and the physical

viewpoint

Many inversion schemes are founded on linearisation about an arbitrary starting model of an inherently non-linear system of equa- tions, as described in the previous section. However, Parker (1980) presented a non-linear formulation whereby the optimal 1-D solution is described by a stack of delta functions (infinitesimally thin sheets of finite conductance) embedded in a perfectly insulating

7.2D⁺optimisation versus least-structure modelling 133

half-space. This is known as theD⁺model. The fit of the modelled D⁺ data to the measured data can be expressed in terms of the weighted least-square statistic given in Equation (6.6), withjsites=1.

It should be noted that this"²misfit statistic depends, in part, on the distribution of errors in the measured data. Because of measurement errors and inevitable departures from the theoretically idealised 1-D dimensional sounding environment,"²is not expected to vanish to zero. The expected value of"²for data with independent Gaussian errors isifreq. If models whose misfits are less than two standard deviations of "² above its expected value are deemed acceptable, then this defines a misfit inequality (cf. Parker and Whaler, 1981):

"²5ifreqþ2 ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi 2ðifreqÞ

p : (7:3)

Failure to satisfy this inequality implies that data cannot be satis- factorily modelled following the assumption of a 1-D Earth, and that multi-dimensional conductivity structures must be considered.

In limited situations, the calculation ofD⁺solutions may represent a useful initial stage to modelling, offering confirmation, or otherwise, as to the existence of 1-D solutions. However, such sharp interfaces as delta functions are physically unrealistic. Parker and Whaler (1981), therefore, developed theH⁺model, in which the best-fit to a model composed of a stack of homogeneous layers overlying a perfect con- ductor placed at the maximumpenetration depthis sought. However, depending on the error structure of measured data, approaches to modelling aimed solely at minimising the misfit between modelled and measured responses are not necessarily justified. Fitting electromagnetic data as closely as possible maximises theroughnessof a model, and it is generally desirable to sacrifice a degree of fit to a dataset in favour of introducing gradational conductivity contrasts (Constable et al., 1987).

The conductivity structure apparent in models derived from layered 1-D inversion schemes can depend quite critically on an often arbitrarily pre-determined number of layers that should be present in the final model. Under-parameterisation of the inverse problem tends to suppress structure that could be significant, whilst over-parameterisation introduces structure that is redundant, and not truly resolvable by the available data (Constableet al., 1987).

Also, models with too many layers may develop oscillations as the modelled structure tends to the delta function (D⁺) solution that optimises the misfit. One solution to this problem is the minimum layer inversion (Fischeret al.1981), in which additional layers are introduced into a model at progressively greater depths only when successively longer-period data demands their presence. On the

134 Inversion of MT data

other hand, Constable et al. (1987), drawing on the philosophy embodied in the tenet known as Occam’s razor¹⁰, propose finding the smoothest possible model (known as theleast-structure model) consistent with an acceptable (user-definable) fit to the data. Given the diffusive nature of electromagnetic fields, passive EM tech- niques resolve conductivity gradients rather than sharp bound- aries at depth, giving least-structure models a certain appeal. By explicitly minimising structural discontinuities, unjustifiably com- plex interpretations of data are avoided, and any structure present within the model should lie within the resolving power of the data.

Therefore, the temptation of over-interpreting data is reduced.

However, as we shall see in Chapters 8 and 9, in certain depth ranges such as at the lithosphere–asthenosphere boundary and at the 410 km transition zone, the existence of discontinuities is widely expected. Therefore, layered models and least-structure (smooth) models represent two extremes between which the resistivity–depth distribution of the real Earth is likely to lie.

2-D models are constructed on a rectangular grid of rows and columns intersecting at nodes in they–zco-ordinate plane (as, for example, delineated in Figure 2.5) to form blocks or cells, each of which is attributed a uniform conductivity. Rules of thumb for constructing such a mesh are dealt with in Section 6.2. Once defined, the mesh remains fixed from one iteration to the next. Most 2-D inversion algorithms (e.g.,Occam Inversion(de Groot-Hedlin and Constable, 1990);Rapid Relaxation Inversion(RRI) (Smith and Booker, 1991)) are founded on the least-structure philosophy, and involve joint minimisation of data misfit (e.g., expressed as an rms statistic as given in Equation (6.6)) and model roughness. For 1-D models, roughness can be defined as

M_R1¼

ð dm zð Þ dz

dz (7:4)

M_R2¼

ð d²m zð Þ dz²

dz (7:5)

wheremðzÞis either conductivity,, orlnðÞ(Constableet al., 1987).

In 2-D models, both horizontal and vertical conductivity gradients can be minimised (e.g., by minimising conductivity differences

10Named after William of Ockham, a fourteenth-century Franciscan monk who wrote ‘Plutias non est ponenda sine necessitate’. (Entities should not be multiplied unnecessarily.)

7.2D⁺optimisation versus least-structure modelling 135

between laterally and vertically adjacent cells in the model). This process is sometimes known as ‘regularisation’, and the grid on which it is performed is called the regularisation mesh(de Groot- Hedlin and Constable, 1990). For example, in this case, we can write the roughness function, QðyiÞ, as a scaled norm of the Laplacian beneath siteyi(Smith and Booker, 1991):

Q yð Þ ¼_i

ð d²m yð i;zÞ

df zð Þ² þg zð Þd²m yð i;zÞ dy²

_y¼y

d²z df zð Þ²

df zð Þ; (7:6)

wheregðzÞallows for a trade-off between penalising horizontal and vertical structures, andfðzÞcontrols the scale length for measuring structure at different depths. Choosingm¼lnð Þ andf¼lnðzþz0Þ generally produces misfits between modelled and measured data that are uniform across the frequency spectrum (Smith and Booker, 1988). The non-dimensional weight function gðzÞ can be generalised as (Smith and Booker, 1991):

gðzÞ ¼ i zþz₀

; (7:7)

whereiis the distance between measurement sites neighbouringyi, and and are constants (that are unrelated to phase-sensitive strike and skew). As i! 1, g zð Þ ! 1. Therefore, horizontal gradients are penalised more relative to vertical gradients the greater the site spacing. The constantscales the horizontal versus vertical derivatives in Equation (7.7). The larger the value of , the greater the degree of horizontal smoothing. Small values of favour shallow structure, but as is increased from 0 to 1.5, the penalty for horizontal structure becomes independent of depth.

Default values of ¼4 and ¼1:5 are suggested by Smith and Booker (1991).

The roughness operator and misfit are jointly minimised via the operatorWi:

W_ið Þ ¼y_i Q yð Þ þ_i _ie²_i; (7:8)

where _i is a trade-off parameter between model structure and misfit, andP

e²_i is the standard²statistic. Since the inverse problem is non-linear,²is reduced in small steps over a number of iterations so that assumptions inherent in the linearisation (e.g., Equation (7.2)) are not violated.

Changes in the model structure that are required by data at (an)other site(s) can produce a local increase inWi. This is particu- larly true for B-polarisation data, which are sensitive to electric

136 Inversion of MT data

currents flowing along the modelled profile. A global measure,WG, of the misfit function expressed in Equation (7.8) that is not dominated by sites with very large or very small_i is (Smith and Booker, 1991):

W_G¼X^isites

i¼1

W_i

_iþ_median: (7:9)

Convergence to the model with the smallest roughness commensur- ate with an acceptable (minimum) misfit requires several iterations (typically 10–20). If a priori knowledge about the presence of sharp discontinuities in resistivity is available, then the penalty for roughness can be removed at cells that border on these expected discontinuities (e.g., de Groot-Hedlin and Constable, 1990).

A flowchart of the steps involved in the inversion process is shown in Figure 7.2. It is advisable to experiment with different starting models to ensure that a similar solution is reached independent of the starting model. Smith and Booker (1991) suggest producing a 2-D model by introducing data components over a series of RRI inversions (Figure 7.3). The computation time required for RRI is less than for the Occam inversion scheme, because in the former lateral electric and magnetic field gradients calculated from the previous iteration are incorporated into the forward-modelling stage as an approximation to the new fields, and the forward model responses are computed as successive per- turbations on a string of 1-D models that represent the structure beneath individual sites. Another popular 2-D inversion scheme, which is simple to install and implement has been programmed by Mackieet al.(1997).

Some inversion algorithms (e.g., Smith and Booker, 1991;

de Groot-Hedlin, 1991) allow static shiftparameters to be calculated simultaneously with the modelling procedure by favouring incorporation of surface structure at adjacent sites that minimises horizontal conductivity gradients at greater depths in the model (see Section 5.8).

Dalam dokumen Practical Magnetotellurics: A Guide to Electrical Conductivity Exploration of the Earth (Halaman 149-153)