Professional Documents
Culture Documents
net/publication/352993637
CITATIONS READS
0 533
1 author:
Bruno Sudret
ETH Zurich
416 PUBLICATIONS 14,762 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
Rice Crop Growth Stage Estimation using High-Frequency PolSAR Data View project
All content following this page was uploaded by Bruno Sudret on 05 July 2021.
Bruno Sudret
4th Int. Conf. on Uncertainty Quantification in Computational Sciences and Engineering, (UNCECOMP’2021),
Athens (Greece) – June 30, 2021
How to cite?
This presentation is a plenary lecture given at the 4th International Conference on Uncertainty
Quantification in Computational Sciences and Engineering on June 30, 2021.
How to cite ?
Sudret, B., Recent Developments on Surrogate Models for Stochastic Simulators,
4th International Conference on Uncertainty Quantification in Computational Sciences and
Engineering, streamed from Athens (Greece), June 30th, 2021.
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 1 / 42
The SAMOS project
• The SAMOS project (“SurrogAte Modelling for stOchastic Simulators”) is funded by the Swiss National
Science Foundation under Grant # 175524 (May 2018 - April 2022).
Xujia Zhu
Use of generalized lambda distributions and sparse polynomial chaos ex-
pansions
Nora Lüthen
UNCECOMP’2021 paper U18995: “Surrogating stochastic simulators using
Karhunen-Loève expansion, sparse PCE and advanced statistical modelling”
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 2 / 42
Computational models in engineering
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 3 / 42
Computational models: the abstract viewpoint
A computational model may be seen as a black box program that computes quantities of interest (QoI)
(a.k.a. model responses) as a function of input parameters
Vector of input
Computational Model response
parameters
model M y = M(x) ∈ RQ
x ∈ RM
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 4 / 42
Deterministic vs. stochastic simulators
Stochastic
2 simulator 2
*
Deterministic
lue
simulator
va
an
1 1
Me
*
*
0 0
0 0.5 1 1.5 2 0 0.5 1 1.5 2
Input parameters Input parameters
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 5 / 42
Example: wind turbine simulation
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 6 / 42
Example: epidemiology
Terminology
• St : number of susceptible individuals at time t
• It : number of infected individuals at time t
• Rt : number of recovered individuals at time t
System dynamics
• Susceptible individuals can get infected due to
close contact with infected individuals (S → I )
• Infected individuals can recover and becomes
immune to future infections (I → R)
• Random contact and recovery modelled with
Poisson processes
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 7 / 42
Example: mathematical finance
dSt = µ St dt + σ St dWt
Asian option
• The payoff (of a call option) is contingent on the average price of the underlying asset
Z t
1
C = max {AT − K, 0} , with At = Su du.
t 0
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 8 / 42
Outline
Application examples
Analytical example
Stochastic SIR model
Wind turbine application
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 8 / 42
Formal definition
Ms : DX × Ω → R
(x, ω) 7→ Ms (x, ω)
Latent variables
Ms can be seen as a deterministic function M of input parameters x and latent variables Z :
Ms (x, ω) = M (x, Z(ω))
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 9 / 42
Computational costs induced by stochastic simulators
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 10 / 42
Surrogate models (deterministic simulators)
• It is built from a limited set of runs of the original model M called the experimental design
X = x(i) , i = 1, . . . , N
Simulated data
• It is fast to evaluate!
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 11 / 42
Surrogate models for uncertainty quantification
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 12 / 42
Ingredients for building a surrogate model
h 2 i
• Validate the surrogate model, e.g. estimate the generalization error ε = E M(X) − M̃(X)
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 13 / 42
Back to stochastic simulators
Our goal is to develop emulators that are:
• Data-driven (i.e. non-intrusive w.r.t. to the original simulator)
• General-purpose: no restrictive assumption (e.g. Gaussian) on the family of the output distribution is
made
• Able to tackle the full distribution of Y |X = x, but also quantities of interest (e.g. mean, variance,
quantiles)
• Providing a representation of Y |X = x easy to sample from
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 14 / 42
Existing approaches
• Gaussian models
1
Yan & Perdikaris (2019), Comput. Mech.
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 15 / 42
Replication-based approaches
Main idea
Literature
• Stochastic Kriging: Ankenman et al. (2006) Stochastic Kriging for simulation metamodeling, Oper. Res.
• Quantile Kriging: Plumlee & Tuo (2014) Building accurate emulators for stochastic simulations via quantile Kriging, Technometrics
• Kernel density estimation: Moutoussamy et al. (2015) Emulators for stochastic simulation codes, ESAIM: Math. Model. Num. Anal.
• Generalized lambda model: Zhu & Sudret (2020) Replication-based emulation of the response distribution of stochastic simulators
using generalized lambda distributions, Int. J. Uncertainty Quantification
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 16 / 42
Assuming normality: Kriging models
Main idea
• Response distributions are normal
• Mean function µ(x) and log-variance function log(V (x)) are modeled by Gaussian processes
Literature
• Full Bayesian setup: Goldberg et al. (1997) Regression with input-dependent noise: a Gaussian process treatment, NIPS10
• Iterative fitting: Marrel et al. (2012) Global sensitivity analysis of stochastic computer models with joint metamodels, Stat. Comput.
• Maximum likelihood: Binois et al. (2018) Practical heteroscedastic Gaussian process modeling for large simulation experiments, J.
Comput. Graph. Stat.
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 17 / 42
Conditional distribution estimation
Main approaches
• Estimate the joint distribution of (X, Y ) by kernel smoothing, then compute the conditional PDF by:
f (x, y)
f (y | x) =
f (x)
• Use parametric models to represent the conditional distribution directly
Literature
• Kernel smoothing: Hall et al. (2004) Cross-validation and the estimation of conditional probability densities, J. Amer. Stat. Assoc.
• Vine copula: Kraus & Czado (2017) D-vine copula based quantile regression, Comput. Stat. Data Anal.
• Generalized lambda model: Zhu & Sudret (2021) Emulation of stochastic simulators using generalized lambda models, Submitted to
SIAM/ASA J. Unc. Quant.
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 18 / 42
Outline
Application examples
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 18 / 42
Definition
• The Freimer-Mudholkar-Kollia-Lin (FMKL) lambda distribution is defined through its quantile function
Q(u; λ) by 4 parameters
1 uλ3 − 1 (1 − u)λ4 − 1
Q(u; λ) = λ1 + −
λ2 λ3 λ4
where:
– λ1 is the location parameter
– λ2 > 0 is the scale parameter
– λ3 , λ4 are shape parameters
1 λ2 −1
fY (y; λ) = = λ −1 with u = Q (y; λ)
Q0 (u; λ) u 3 + (1 − u)λ4 −1
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 19 / 42
Properties
• GLDs approximate well unimodal PDFs (bell-, U-shaped, bounded and unbounded)
• λ3 and λ4 control the shape and boundedness
( (
−∞, λ3 ≤ 0 +∞, λ4 ≤ 0
Bl (λ) = 1
, Bu (λ) = 1
λ1 − λ2 λ3
, λ3 > 0 λ1 + λ2 λ4
, λ4 > 0
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 20 / 42
Summary chart
Moments of order
Zhu & Sudret (2020), Replication-based emulation of the response distribution of stochastic simulators using generalized lambda distributions, Int. J. Uncertainty Quantification
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 20 / 42
Outline
Application examples
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 20 / 42
Problem statement
Let us consider a stochastic simulator MS and the following experimental design with replications
• Experimental design of size N in the X -space: X = x(1) , x(2) , . . . , x(N )
• R replications for each x(i) ∈ X : Y (i) = y (i,1) , y (i,2) , . . . , y (i,R)
Two approaches
Zhu & Sudret (2020) Replication-based emulation of the response distribution of stochastic simulators using generalized lambda distributions, Int. J. Uncertainty Quantification
• Infer-and-fit: infer a lambda distribution for each point x(i) of the experimental design, then fit a sparse
polynomial chaos expansion to the parameters λ
• Joint inference: improve the previous results by maximum likelihood optimization
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 21 / 42
Local inference of lambda distributions (“Infer”)
(i) (i)
(i,1) (i,R)
Estimate λ based on Y = y , ... ,y
• Maximum likelihood estimation
R
!
(i)
X λ2
λ̂ = arg max log
λ
r=1
uλi,r3 −1 + (1 − ui,r )λ4 −1
where !
−1 (i,r)
(i,r) 1 uλi,r3 − 1 (1 − ui,r )λ4 − 1
ui,r = Q y ;λ , y = Q(ui,r ; λ) = λ1 + −
λ2 λ3 λ4
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 22 / 42
Polynomial chaos expansions in a nutshell
Ghanem & Spanos (1991; 2003); Xiu & Karniadakis (2002); Soize & Ghanem (2004)
• We assume here for simplicity that the input parameters are independent with
Xi ∼ fXi , i = 1, . . . , M
• PCE is also applicable in the general case using an isoprobabilistic transform X 7→ Ξ
where:
• Ψα (X) are basis functions (multivariate orthonormal polynomials)
• aα are coefficients to be computed (coordinates)
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 23 / 42
Polynomial chaos expansions
Truncation schemes q=1
α2
( M
) 3
X 2
α ∈ Ap,M = α ∈ NM ; αi ≤ p 1
0
0 1 2 3 4 5 6
i=1 α1
q = 0.5
• q -norm truncation: 6
5
4
α2
M
! q1 3
2
p,q,M M def
X
αiq
1
α∈A = α ∈ N ; kαkq = ≤p , 0<q<1 0
0 1 2 3 4 5 6
α1
i=1
Sparse expansion: A only contains relevant basis functions taken out of a candidate basis:
• Least-angle regression
• Orthogonal matching pursuit, subspace pursuit
• etc.
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 24 / 42
Fitting polynomial chaos expansions (“Fit”)
• Gather the data from the “Fit”-step: x(1) , λ(1) , x(2) , λ(2) , . . . , x(N ) , λ(N )
• Represent the distribution parameters λ by polynomial chaos expansions
X
λk (x) ≈ λPC
k (x; a) = ak,α ψα (x) for k = 1, 3, 4
α∈Ak
!
PC
X
λ2 (x) ≈ λ2 (x; a) = exp a2,α ψα (x)
α∈A2
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 25 / 42
Improvement: joint fitting
Rationale
• The results of the Infer-and-fit algorithm depends on the accuracy of the local inference
• Idea: build a global model for the joint distribution of inputs and outputs:
Procedure
Find the optimal PCE coefficients a that minimize the Kullback-Leibler divergence between fX,Y and
GLD
fX,Y :
GLD
â = arg min DKL fX,Y || fX,Y ( . ; a)
a
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 26 / 42
Corresponding loss function
Solution
After some basic algebra, the minimization problem reduces to:
Estimator
• The sparse PCE bases for λPC (x) are taken from the “Infer-and-Fit” approach
• The expectation w.r.t. X and Y is computed from the experimental design X = x(1) , . . . , x(N ) ,
with R replications in each point (outputs y (i,r) ):
N R
1 XX
log fˆY |X y (i,r) ; λPC (x(i) ; a)
â = arg min
a NR
i=1 r=1
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 27 / 42
Illustration of the algorithm
Toy example
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 28 / 42
Illustration of the algorithm
Joint fitting
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 29 / 42
Outline
Application examples
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 29 / 42
Introduction
Goal
Propose a framework to build a stochastic emulator without replications (... that can also be applied if
there is replicated data)
Rationale
The joint fitting estimator does not require replications!
With Without
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 30 / 42
Procedure
Ingredients
• An experimental design X = x(1) , . . . , x(N ) and model evaluations y (i) def
= M(x(i) , ωi )
• Pre-selected PC bases for the four polynomial chaos expansions of λi (x), i = 1, . . . , 4
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 31 / 42
Outline
Application examples
Analytical example
Stochastic SIR model
Wind turbine application
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 31 / 42
Error metrics for stochastic emulators
Wasserstein distance
• It is the L2 distance between the quantile functions:
Z 1
def 2
d2WS (Y, Ŷ ) = kQY − Q̂Y k2L2 = QY (u) − Q̂Y (u) du
0
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 32 / 42
Geometric Brownian motion
dSt = x1 St dt + x2 St dWt
x1 − x22 /2 t, x2 t
St (x)/S0 ∼ LN
Setup
• X1 ∼ U (0, 0.1), X2 ∼ U (0.1, 0.4)
• Y = S1 for t = 1 is of interest, i.e. Y (x) ∼ LN x1 − x22 /2, x2
• Experimental design: X are generated using the Latin hypercube sampling
• No replications
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 33 / 42
PDF predictions (ED with N = 500)
� -.....
=::: ;:: �:: j:: : : :
�- �-...............
�- �- �- �- � �- ._�-... __-
...�
:•• :• :•� :� :� : � ·� \.; \� :� :_:: :_:::. ..-
• -=, �- �- �- �- �- �- �- �- �- -� � T- �-�-�-�-�-�-�-�-�-�-�-� - - -
+ ..• :_:. :_:. :_:. :_:. :_:. :: : :: : :: :
. ::: . ----
11111111!1. �- �- �- �- �- �- �- �- �- �· - - -- �
1
... .. ...1.. .. ...... ...... ...... ...... ...... .. " ........ � ..... ......--
..... --. -c-:;a;:-,
±
...... ...... ...... ...... ...... .. . ·+· ...... ..
...... ...... ...... ...... ...... ......
1 -- ----
... ...... ...... ...... ...... ...... ...... ...... . ...... ...... ...... .... �............ ...... .... � .......... .... 1 .... ...... ...... ...... ...... ...... ·=t=· ...... ..
...... � �
:
-@ . .
1 1 -- �-----
···. l ...... ...... ...... ..... . .... . .... l ...... ...... ...... ...... ...... ......
--
� 1
_l_ .... ...... . : . .� .._ .••. •..••. • �.-:.� •.
••• •••••• •••••• •••••• •••••• •••••• •••••• •••••• •••••• •••••• •••••• •••••• •••••• •••••• •••••• •••••• •r•••••• •••••• •••••• •••••• •••••• ••••••···
... ...... ...... ...... ...... ...... ...... ...... ...... ...... ...... ...... ...... ...... ...... ...... ...... ...... ...... ...... ...... ...... ......
: 1
�.�<22•••••• ••
. ...... ...... ...... ...... � ............ ...... ...... ..
+ · ·• •
.. .... .... .... .... .... .... .... .... .... .... .... .... .... .... .... ...1 .... .... .... .... .... .... .... 1.... .... .... .... .... .... .... .
1 1
..1.... ..1....
1
..1....
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 35 / 42
Outline
Application examples
Analytical example
Stochastic SIR model
Wind turbine application
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 35 / 42
Stochastic SIR model
• Mt = St + It + Rt : total population
• St : number of susceptible individuals at time t
• It : number of infected individuals at time t
• Rt : number of recovered individuals at time t
Setup
• Total population Mt = 2000
• The initial condition: S0 ∼ U(1300, 1800),
I0 ∼ U(20, 200)
• Y (x): total number of infected individuals
during the outbreak (without counting I0 )
Binois et al. (2018) Practical heteroscedastic Gaussian process modeling for large simulation experiments, J. Comput. Graph. Stat.
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 36 / 42
PDF predictions (ED with N = 500)
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 37 / 42
Convergence study
• Experimental design of size 250, 500, 1000, 2000, 4000
• 50 independent runs for each scenario EX d2WS Y (X), Y GLaM (X)
ε=
• Normalized Wasserstein distance as a performance indicator Var [Y ]
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 38 / 42
Outline
Application examples
Analytical example
Stochastic SIR model
Wind turbine application
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 38 / 42
Wind turbine application
• Five input variables:
– Mean speed U , turbulence intensity I and shear exponent α are uniformly
distributed in the following domain Mean speed
Drag
Lift
pitch control
yaw control
Turbulence
Wave
Simulation
• 485 training points with 50 replications
• 120 test points with 500 replications
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 39 / 42
PDF predictions
The normalized Wasserstein distance is ε = 0.013
x = (8.3, 2.2, −0.21, 0.98, −7.2) x = (11.8, 2.9, 0.29, 0.97, 0.21)
x = (9.1, 1.0, 0.47, 1.3, 4.0) x = (4.0, 1.4, 0.52, 1.2, 3.5)
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 40 / 42
One word on sensitivity analysis
Due to the stochasticity in the simulator MS (x) ≡ Md (x, z), various Sobol’ indices can be defined:
• Classical indices based on partial variances:
Var [E [Y |X u ]]
Su = u = {i1 , . . . , ik } ⊂ {1, . . . , M }
Var [Y ]
• QoI-based indices, corresponding to a particular feature of Y (x), e.g. mean, variance, quantile
Var [Y | X u , Z]
Sutraj (Z) =
Var [Y | Z]
See details in: Zhu & Sudret, Global sensitivity analysis for stochastic simulators based on generalized lambda surrogate models,
Reliab. Eng. Sys. Safety (2021)
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 41 / 42
Conclusions
• Stochastic simulators are used in many fields of applied sciences and engineering
• We propose a framework based on generalized lambda distributions and polynomial chaos expansions
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 42 / 42
www.uqlab.com
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 42 / 42
UQLab: The Uncertainty Quantification Software
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 42 / 42
UQWorld: the community of UQ https://uqworld.org/
Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 42 / 42
Questions ?
View publication stats Surrogate models for stochastic simulators UNCECOMP’2021 – June 30, 2021 B. Sudret 42 / 42