
It is typical in gas turbine engine testing to replicate measurements at these locations in the tangential direction (into the page in Figure 4.2) as a safeguard against thermocouple failure. Often, only two replicates ($N_d = 2$) are available at a given axial-radial position due to the cost of instrumentation and measurement channel limitations.

Since synthetic data is used in this analysis, $N_d = 2$ will be compared with $N_d = 10$.

The columns of $U_1 \in \mathbb{R}^{n \times n_{pc}}$ form a new orthonormal basis for the reduced $n_{pc}$-dimensional subspace of the $\mathbb{R}^n$ output space, and become the PCs.

Truncation error is the difference between the original dataset and the truncated dataset, which is reconstructed to recover the approximate original outputs. Reconstruction is achieved by back-transforming the truncated set $\hat{y}_{pc}$ to the original space, $\hat{y}_0 = \hat{y}_{pc} U_1^T$, where $U_1 \in \mathbb{R}^{n \times n_{pc}}$ is the reduced set of eigenvectors. The error is then computed as $(y_0 - \hat{y}_0) \odot S_y$, where $S_y = [\sigma_{y_1}, \sigma_{y_2}, \ldots, \sigma_{y_n}] \in \mathbb{R}^{N \times n}$ is a matrix with $N$ rows, each containing a copy of the standard deviations of the original outputs, and $\odot$ is the Hadamard product (element-wise multiplication).
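To make the transform-truncate-reconstruct sequence concrete, the following is a minimal numpy sketch. The array names (y, U1, y_pc) and the random placeholder data are illustrative, not the thesis code; the only assumptions are those stated above (column-wise standardization, eigendecomposition of the output covariance, truncation to $n_{pc}$ components).

```python
import numpy as np

# Illustrative shapes: N samples of n = 48 outputs; placeholder data.
rng = np.random.default_rng(0)
N, n, n_pc = 200, 48, 5
y = rng.normal(size=(N, n))           # stand-in for FE model outputs

# Standardize columns (zero mean, unit variance).
mu, sigma = y.mean(axis=0), y.std(axis=0)
y0 = (y - mu) / sigma

# Eigendecomposition of the output covariance gives the PC basis U.
C = np.cov(y0, rowvar=False)
eigvals, U = np.linalg.eigh(C)
order = np.argsort(eigvals)[::-1]     # sort by decreasing variance
eigvals, U = eigvals[order], U[:, order]

# Truncate to the first n_pc eigenvectors and transform.
U1 = U[:, :n_pc]
y_pc = y0 @ U1                        # truncated PC scores, (N, n_pc)

# Back-transform and compute the truncation error in original units:
# (y0 - y0_hat) ⊙ S_y, via broadcasting against the column std devs.
y0_hat = y_pc @ U1.T                  # reconstruction, standardized space
err = (y0 - y0_hat) * sigma
print("max reconstruction error:", np.abs(err).max())
```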

Finally, since calibration will be performed in the PC-space, the test data is also transformed to the PC-space of the model. The data $y_d$ are first standardized with the model output column means and standard deviations to obtain the standardized $y_{d0}$, then transformed using the eigenvectors from the model transformation, similar to Eq. 4.4:

$y_d^{pc} = y_{d0} U$   (4.5)

which may again be truncated by using $U_1$ in place of $U$.
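As a sketch of Eq. 4.5, the projection of measurements into the model's PC-space reuses the model's standardization statistics and eigenvectors. The helper below is hypothetical; it assumes mu, sigma, and U1 were retained from the model PCA.

```python
import numpy as np

def to_model_pc_space(y_d, mu, sigma, U1):
    """Project measurements y_d (N_d, n) into the model's truncated
    PC-space (Eq. 4.5, with U1 in place of U).

    mu, sigma : (n,) model output column means and standard deviations
    U1        : (n, n_pc) truncated eigenvectors from the model PCA
    """
    y_d0 = (y_d - mu) / sigma   # standardize with *model* statistics
    return y_d0 @ U1            # y_d^pc, shape (N_d, n_pc)
```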

Figure 4.4: Percentage contribution of PCs to the total variance (red), and maximum output reconstruction error of temperatures (T, K) and time constants (τ, s).

4.3.2 Input transformation with active subspace (AS)

The transformed model outputs result in a mapping $F : \mathbb{R}^p \to \mathbb{R}$ of the parameter set $\theta \in \mathbb{R}^p$ to a given PC ($y_k^{pc}$), to which the AS method [65] is applied ($F$ is used here as a generic function to simplify notation in this section). If this mapping $F$ is differentiable and square integrable, a symmetric positive semi-definite matrix may be defined with the eigenvalue decomposition,

$C = \int_{\theta} \nabla F(\theta)\, \nabla F(\theta)^T \rho(\theta)\, d\theta = W \Xi W^T$   (4.6)

where $W = [w_1, \ldots, w_p] \in \mathbb{R}^{p \times p}$ is an orthogonal matrix of eigenvectors, $\Xi = \mathrm{diag}[\xi_1, \ldots, \xi_p] \in \mathbb{R}^{p \times p}$ contains the magnitude-ordered eigenvalues, and $\rho(\theta)$ is the sampling density. The parameters $\theta$ are scaled to a $[-1, 1]^p$ hypercube.

Since LHS has been used, $\rho(\theta)$ is interpreted as a uniform distribution of the random variables $\theta$, as in prior work [76–79]. Another interpretation of Eq. 4.6, based on the definition of expectation $E[x] = \int x \rho(x)\, dx$ [80], is that

$C = E\left[\nabla F\, \nabla F^T\right]$   (4.7)

which is to say, $C$ is the expected value (average) of the gradient outer product. The eigenspace of $C$ defines important directions in the domain of $F$. Ordering the eigenpairs $[\xi_i, w_i]$ of this result in decreasing magnitude of $\xi_i$ indicates that $w_1$ is the most important direction, followed by $w_2$, etc. Identifying these important directions has the potential for dramatic computational implications when considering quadrature rules for integration [81], optimization to minimize or maximize $F$ [76–79, 82], or approximation of $F$ [76, 78, 79, 82, 83]. By means analogous to PCA above, a heuristic based on eigenvalue magnitudes is used to truncate to $p_a < p$ such that a $p_a$-dimensional partition of $W$, the active subspace $W_a = [w_1, \ldots, w_{p_a}]$, captures the majority of the change in the function. The remaining columns of $W$ are the inactive subspace. The function approximation $F_a : \mathbb{R}^{p_a} \to \mathbb{R}$ is called a ridge approximation over the active subspace [84],

$F(\theta) \approx F_a(W_a^T \theta)$   (4.8)
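Under the interpretation of Eq. 4.7, $C$ can be estimated as a sample average of gradient outer products, followed by the eigendecomposition and truncation. The sketch below uses a made-up gradient function purely for illustration; it is not the gradient-free linear approximation actually used later in this section, and all names are assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)
p, M = 5, 200                                 # parameters, sample count

def grad_f(theta):
    # Placeholder gradient of a generic F: R^p -> R, illustration only.
    return np.array([2.0, 1.0, 0.5, 0.1, 0.01]) * (1.0 + 0.1 * theta)

thetas = rng.uniform(-1.0, 1.0, size=(M, p))  # rho: uniform on [-1, 1]^p
grads = np.array([grad_f(t) for t in thetas]) # (M, p) gradient samples

C = grads.T @ grads / M                       # C ~ E[grad grad^T]  (Eq. 4.7)
xi, W = np.linalg.eigh(C)                     # C = W Xi W^T        (Eq. 4.6)
order = np.argsort(xi)[::-1]                  # decreasing eigenvalue order
xi, W = xi[order], W[:, order]

p_a = 1                                       # truncate via eigenvalue gap
W_a = W[:, :p_a]                              # active subspace
active = thetas @ W_a                         # ridge inputs W_a^T theta (Eq. 4.8)
```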

Plots of the model parameters and PC outputs based on Eq. 4.8 are known as shadow plots [85]. In the case that strong trends over the first one or two important directions are observed [75–78, 82, 83], the ability to visualize how the function changes in these important directions allows better selection of the most appropriate type of approximation, i.e., providing strong empirical evidence that a function is predominantly linear, quadratic, or more complicated. Moreover, this change of variables is simply $p_a$ linear combinations of the $p$ original parameters.

Therefore, the entries of the eigenvectors may be considered weights indicating the importance of each parameter, ordered by the corresponding eigenvalue. Thus, the magnitudes of the entries of the first eigenvector offer a sensitivity analysis. More information on sensitivity analysis interpretations can be found in [86].

The definition of Eq. 4.6 depends on gradients, which are often not available in standard FE tools. Therefore, gradient approximations will be based on the space-filling model samples. In this problem, a global linear gradient approximation is assumed for a single eigenvector (see Algorithm 1.3 in [65]). This eigenvector $w_1$ is calculated from the linear gradients $b$ as $w_1 = b / \|b\|$, resulting in active parameters $w_k^T \theta$, where the subscript of this single eigenvector is repurposed from here on to represent PC $k$.
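A minimal sketch of this global linear model heuristic (Algorithm 1.3 in [65]): fit $F(\theta) \approx b_0 + b^T \theta$ by least squares over the samples, then normalize the slope vector. The sample data here are synthetic placeholders.

```python
import numpy as np

rng = np.random.default_rng(2)
M, p = 200, 5
thetas = rng.uniform(-1.0, 1.0, size=(M, p))       # scaled LHS parameters
f_vals = thetas @ np.array([2.0, 1.0, 0.5, 0.1, 0.01]) \
         + 0.05 * rng.normal(size=M)               # stand-in for one y_k^pc

A = np.column_stack([np.ones(M), thetas])          # design matrix [1, theta]
coef, *_ = np.linalg.lstsq(A, f_vals, rcond=None)
b = coef[1:]                                       # global linear gradient

w1 = b / np.linalg.norm(b)                         # single active direction
active_param = thetas @ w1                         # w_1^T theta per sample
```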

4.3.3 The resulting PC-AS surrogate model

By this combination of PCA and AS, the surrogate modeling problem has been significantly simplified: the 5-input, 48-output problem is compressed into five 1-input, 1-output quadratic surrogate models. Furthermore, AS dimension reduction makes visualization of the surrogate models feasible, as shown by the five shadow plots in Figure 4.5. The shadow plots are the PC-space outputs $y_k^{pc}$ plotted as a function of the active subspace parameters $w_k^T \theta$ (blue dots), which are fit with quadratic polynomial regression surrogate models (red curves).

The measurements are also transformed into the model's PC-space using Eq. 4.5 (green horizontal lines). The goal of calibration is to find the best values of $w_k^T \theta$ which result in a surrogate model output equivalent to the data, i.e., the intersection of the green and red lines for all $k$ PC-AS surrogates shown in Figure 4.5. Re-fitting the PC-AS surrogate model for different numbers of $N$ (retaining 50 test points) and $n_{pc}$ resulted in $R^2 > 0.95$ for $N > 20$ and $n_{pc} \geq 5$.
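Each per-PC surrogate amounts to a quadratic fit over its shadow plot, and the calibration target is where that fit crosses the transformed measurement. A minimal sketch with synthetic data standing in for one PC; the measured value y_d_pc is illustrative.

```python
import numpy as np

rng = np.random.default_rng(3)
active = rng.uniform(-1.5, 1.5, size=200)     # w_k^T theta (blue dots)
y_pc_k = 0.8 * active**2 - 1.2 * active + 0.1 * rng.normal(size=200)

c = np.polyfit(active, y_pc_k, deg=2)         # quadratic surrogate (red curve)
surrogate = np.poly1d(c)

resid = y_pc_k - surrogate(active)            # fit quality (text: R^2 > 0.95)
r2 = 1.0 - resid.var() / y_pc_k.var()
print(f"R^2 = {r2:.3f}")

# Calibration seeks w_k^T theta where the surrogate equals the transformed
# measurement (red/green intersection): roots of surrogate - y_d_pc.
y_d_pc = 0.5                                  # illustrative measured value
roots = (surrogate - y_d_pc).roots
print("candidate active-parameter values:", roots[np.isreal(roots)].real)
```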

Figure 4.5: Active subspace shadow plots for the first 5 PCs (blue dots), surrogate model fits (red line), and measurements in PC-space (green lines).

4.3.4 Sensitivity analysis of PCs vs eigenvector

Another benefit, discussed in Section 4.3.2, is that the eigenvectors of the active parameters provide first-order parameter sensitivity analysis for each PC. These eigenvectors are plotted in Figure 4.6 and are compared to first-order Sobol' indices. The first-order Sobol' index for $\theta_i$ and PC $k$ was generated using a modularized sample-based method [87],

$S_{ki} = \dfrac{\mathrm{Var}\left[E\left[y_k^{pc} \mid \theta_i\right]\right]}{\mathrm{Var}\left[y_k^{pc}\right]}$   (4.9)

where the expectation is taken over samples within bins of each $\theta_i$ to determine the variance of $y_k^{pc}$ due to $\theta_i$. Both approaches use the available 200 LHS samples from the FE model. The magnitudes of these two first-order sensitivity measures are in good agreement, but the AS approach requires no additional calculation.
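A minimal sketch of this binned, sample-based estimate of Eq. 4.9, reusing the existing samples rather than requiring a dedicated Sobol' design; the binning details of the modularized method in [87] may differ, and the test function here is synthetic.

```python
import numpy as np

def first_order_sobol(theta_i, y, n_bins=10):
    """Estimate S_i = Var[E[y | theta_i]] / Var[y] from existing samples."""
    edges = np.quantile(theta_i, np.linspace(0.0, 1.0, n_bins + 1))
    idx = np.clip(np.searchsorted(edges, theta_i, side="right") - 1,
                  0, n_bins - 1)                      # bin index per sample
    bin_means = np.array([y[idx == b].mean() for b in range(n_bins)])
    return bin_means.var() / y.var()                  # variance of cond. means

rng = np.random.default_rng(4)
theta = rng.uniform(-1.0, 1.0, size=(200, 5))         # the 200 LHS samples
y = 2.0 * theta[:, 0] + 0.5 * theta[:, 1] + 0.1 * rng.normal(size=200)
S = [first_order_sobol(theta[:, i], y) for i in range(5)]
print(np.round(S, 3))
```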

Note that in both methods, the sensitivity results are computed for outputs in PC-space. Alternatively, sensitivity results could be calculated for all physical outputs. Calculating sensitivities for all outputs results in many conflicting relationships when the objective is dimension reduction, i.e., parameters that are important to one output may not be important to another. By computing sensitivity results for the PCs, the effect of a parameter is considered across all outputs. And, in this case, there are only 5 PCs versus 48 physical outputs, potentially improving the interpretability of the results. How best to use the sensitivity analysis of multivariate outputs merits further investigation (e.g., [88, 89]), particularly as the dimensionality of the output space increases (i.e., more thermocouples). An approach toward combining these into a single Pareto chart of importance ranking is shown later in Figure 7.7. Furthermore, interaction effects may be of interest, which would require the calculation of higher-order Sobol' indices.

Figure 4.6: First-order Sobol' indices (top) compared to active subspace eigenvectors (bottom) for each PC.