Gradient-enhanced kriging: Difference between revisions
m Disambiguating links to Hyperparameter (link changed to Hyperparameter (Bayesian statistics)) using DisamAssist. |
|||
(193 intermediate revisions by 31 users not shown) | |||
Line 1: | Line 1: | ||
{{Short description|Prediction model used in Engineering}} |
|||
{{New unreviewed article|source=ArticleWizard|date=November 2016}} |
|||
{{COI|date=April 2017}} |
|||
'''Gradient-Enhanced Kriging (GEK)''' is a [[surrogate model|surrogate modeling]] technique used in engineering. A surrogate model (alternatively known as a [[metamodeling|metamodel]], [[response surface methodology|response surface]] or emulator) is a prediction of the output of an expensive computer code. This prediction is based on a small number of evaluations of the expensive computer code. |
|||
{{original research|date=April 2017}} |
|||
'''Gradient-enhanced kriging''' ('''GEK''') is a [[surrogate model]]ing technique used in engineering. A surrogate model (alternatively known as a [[metamodeling|metamodel]], [[response surface methodology|response surface]] or emulator) is a prediction of the output of an expensive computer code.<ref name = mitchell1992>{{cite journal | last1 = Mitchell | first1 = M. | last2 = Morris | first2 = M. | title = Bayesian design and analysis of computer experiments: two examples | url = http://www3.stat.sinica.edu.tw/statistica/oldpdf/A2n23.pdf | journal = Statistica Sinica | year = 1992 | issue = 2 | pages = 359–379}}</ref> |
|||
This prediction is based on a small number of evaluations of the expensive computer code. |
|||
== Introduction == |
== Introduction == |
||
[[File:GEK test-function.png|thumb|400px|Example of one-dimensional data interpolated by Kriging and GEK. The black line indicates the test-function, while the gray circles indicate 'samples' or evaluations of the test-function. The blue line is the Kriging mean, the shaded blue area illustrates the Kriging standard deviation. With GEK we can add the gradient information, illustrated in red, which increases the accuracy of the prediction.]] |
[[File:GEK test-function.png|thumb|400px|Example of one-dimensional data interpolated by Kriging and GEK. The black line indicates the test-function, while the gray circles indicate 'observations', 'samples' or 'evaluations' of the test-function. The blue line is the Kriging mean, the shaded blue area illustrates the Kriging standard deviation. With GEK we can add the gradient information, illustrated in red, which increases the accuracy of the prediction.]] |
||
Adjoint solvers are now becoming available in a range of [[computational fluid dynamics]] (CFD) solvers, such as [[Ansys|Fluent]], [[OpenFOAM]], [[SU2 code|SU2]] and US3D. Originally developed for [[optimization]], adjoint solvers are now finding more and more use in [[uncertainty quantification]]. |
|||
=== Linear speedup === |
|||
An adjoint solver allows one to compute the [[gradient]] of the quantity of interest with respect to all design parameters at the cost of one additional solve. This, potentially, leads to a [[linear]] speedup: the computational cost of constructing an accurate surrogate decrease, and the resulting computational speedup <math>s</math> scales linearly with the number <math>d</math> of design parameters. |
|||
The reasoning behind this linear speedup is straightforward. Assume we run <math>N</math> primal solves and <math>N</math> adjoint solves, at a total cost of <math>2N</math>. This results in <math>N+dN</math> data; <math>N</math> values for the quantity of interest and <math>d</math> partial derivatives in each of the <math>N</math> gradients. Now assume that each partial derivative provides as much information for our surrogate as a single primal solve. Then, the total cost of getting the same amount of information from primal solves only is <math>N+dN</math>. The speedup is the ratio of these costs: |
|||
<ref name = debaar2012>{{cite journal | last1 = de Baar | first1 = J.H.S. | last2 = Scholcz | first2 = T.P. | last3 = Verhoosel | first3 = C.V. | last4 = Dwight | first4 = R.P. | last5 = van Zuijlen | first5 = A.H. | last6 = Bijl | first6 = H. | title = Efficient uncertainty quantification with gradient-enhanced Kriging: Applications in FSI | url = https://aerodynamics.lr.tudelft.nl/~bayesiancomputing/files/GEK_UQ_ECCOMAS_submitted.pdf | journal = ECCOMAS, Vienna, Austria, September 10–14 | year = 2012}}</ref> |
|||
<ref name = debaar2015>{{cite journal | last1 = de Baar | first1 = J.H.S. | last2 = Scholcz | first2 = T.P. | last3 = Dwight | first3 = R.P. | year = 2015 | title = Exploiting Adjoint Derivatives in High-Dimensional Metamodels | journal = AIAA Journal | volume = 53 | issue = 5 | pages = 1391–1395 | doi =10.2514/1.J053678| bibcode = 2015AIAAJ..53.1391D }}</ref> |
|||
:<math>s = \frac{N+dN}{2N} = \frac{1}{2} + \frac{1}{2}d.</math> |
|||
A linear speedup has been demonstrated for a [[fluid-structure interaction]] problem <ref name = debaar2012/> and for a [[transonic]] [[airfoil]].<ref name = debaar2015/> |
|||
=== Noise === |
|||
One issue with adjoint-based gradients in CFD is that they can be particularly [[Noise (signal processing)|noisy]]. |
|||
<ref name = dwight2006>{{cite journal | last1 = Dwight | first1 = R. | last2 = Brezillon | first2 = J. | title = Effect of Approximations of the Discrete Adjoint on Gradient-Based Optimization | journal = AIAA Journal | volume = 44 | issue = 12 | pages = 3022–3031 | year = 2006| doi = 10.2514/1.21744 | citeseerx = 10.1.1.711.4761 | bibcode = 2006AIAAJ..44.3022D }}</ref> |
|||
<ref name = giles2003>{{cite journal | last1 = Giles | first1 = M. | last2 = Duta | first2 = M. | last3 = Muller | first3 = J. | last4 = Pierce | first4 = N. | s2cid = 2106397 | title = Algorithm Developments for Discrete Adjoint Methods | journal = AIAA Journal | volume = 41 | issue = 2 | pages = 198–205 | year = 2003| doi = 10.2514/2.1961 | bibcode = 2003AIAAJ..41..198G | url = https://ora.ox.ac.uk/objects/uuid:47cb04d7-d913-4d0b-a7c4-03b6137418e2 }}</ref> |
|||
When derived in a [[Thomas Bayes|Bayesian]] framework, GEK allows one to incorporate not only the gradient information, but also the [[Measurement uncertainty|uncertainty]] in that gradient information.<ref name = debaar2014a>{{cite journal | last1 = de Baar | first1 = J.H.S. | last2 = Dwight | first2 = R.P. | last3 = Bijl | first3 = H. | year = 2014 | title = Improvements to gradient-enhanced Kriging using a Bayesian interpretation | journal = International Journal for Uncertainty Quantification | volume = 4 | issue = 3 | pages = 205–223 | doi =10.1615/Int.J.UncertaintyQuantification.2013006809| doi-access = free }}</ref> |
|||
== Approach == |
|||
When using GEK one takes the following steps: |
|||
# '''Create a design of experiment (DoE):''' The DoE or 'sampling plan' is a list of different locations in the design space. The DoE indicates which combinations of parameters one will use to sample the computer simulation. With Kriging and GEK, a common choice is to use a Latin Hypercube Design (LHS) design with a 'maximin' criterion. The LHS-design is available in scripting codes like [[MATLAB]] or [[Python (programming language)|Python]]. |
|||
# '''Make observations:''' For each sample in our DoE one runs the computer simulation to obtain the Quantity of Interest (QoI). |
|||
# '''Construct the surrogate:''' One uses the GEK predictor equations to construct the surrogate conditional on the obtained observations. |
|||
Once the surrogate has been constructed it can be used in different ways, for example for surrogate-based [[uncertainty quantification]] (UQ) or [[optimization]]. |
|||
== Predictor equations == |
== Predictor equations == |
||
In a [[Thomas Bayes|Bayesian]] framework, we use [[Bayes' Theorem]] to predict the Kriging mean and |
In a [[Thomas Bayes|Bayesian]] framework, we use [[Bayes' Theorem]] to predict the [[Kriging]] mean and covariance conditional on the observations. When using GEK, the observations are usually the results of a number of computer simulations. GEK can be interpreted as a form of [[Gaussian process]] regression. |
||
=== Kriging === |
=== Kriging === |
||
Along the lines of, <ref>{{cite journal | last1 = Wikle | first1 = C.K. | last2 = Berliner | first2 = L.M. | year = 2007 | title = A Bayesian tutorial for data assimilation | journal = Physica D | volume = 230 | issue = 1–2| pages = 1–16 | doi =10.1016/j.physd.2006.09.017| bibcode = 2007PhyD..230....1W }}</ref> we are interested in the output <math>X</math> of our computer simulation, for which we assume the [[normal distribution|normal]] [[prior probability|prior probability distribution]]: |
|||
:<math> |
:<math>X\sim\mathcal{N}(\mu,P),</math> |
||
with prior mean <math>\mu</math> and prior [[covariance matrix]] <math>P</math>. |
with prior mean <math>\mu</math> and prior [[covariance matrix]] <math>P</math>. The observations <math>y</math> have the normal [[likelihood]]: |
||
:<math>Y\mid x\sim\mathcal{N}(Hx,R),</math> |
|||
=== GEK === |
|||
with <math>H</math> the observation matrix and <math>R</math> the observation error covariance matrix, which contains the [[measurement uncertainty|observation uncertainties]]. After applying [[Bayes' Theorem]] we obtain a normally distributed [[posterior probability|posterior probability distribution]], with Kriging mean: |
|||
:<math>\operatorname E(X\mid y) = \mu + K(y-H\mu),</math> |
|||
and Kriging covariance: |
|||
:<math>\operatorname{cov}(X\mid y) = (I-KH)P,</math> |
|||
where we have the gain matrix: |
|||
:<math>K = PH^T(R+HPH^T)^{-1}.</math> |
|||
In Kriging, the prior covariance matrix <math>P</math> is generated from a covariance function. One example of a covariance function is the Gaussian covariance: |
|||
:<math>P_{ij} = \sigma^2 \exp\left(-\sum_k\frac{|\xi_{jk}-\xi_{ik}|^2}{2 \theta_k^2}\right),</math> |
|||
where we sum over the dimensions <math>k</math> and <math>\xi</math> are the input parameters. The [[Hyperparameter (Bayesian statistics)|hyperparameter]]s <math>\mu</math>, <math>\sigma</math> and <math>\theta</math> can be estimated from a [[maximum likelihood estimation|Maximum Likelihood Estimate]] (MLE).<ref name = debaar2014a /> |
|||
<ref name = dwight2009>{{cite book | last1 = Dwight | first1 = R.P. | last2 = Han | first2 = Z.-H. | year = 2009 | title = Efficient uncertainty quantification using gradient-enhanced Kriging | journal = 11th AIAA Non-Deterministic Approaches Conference, Palm Springs, California, 4–7 May | doi =10.2514/6.2009-2276| isbn = 978-1-60086-975-4 | s2cid = 59019628 | url = https://elib.dlr.de/63754/1/PV2009_2276.pdf }}</ref> <ref name = zhang2021>{{cite journal |last1=Zhang |first1=Sheng |last2=Yang |first2=Xiu |last3=Tindel |first3=Samy |last4=Lin |first4=Guang |title=Augmented Gaussian random field: Theory and computation |journal=Discrete & Continuous Dynamical Systems - S |date=2021 |volume=15 |issue=4 |page=931 |doi=10.3934/dcdss.2021098|arxiv=2009.01961 |s2cid=221507566 }}</ref> |
|||
=== Indirect GEK === |
|||
There are several ways of implementing GEK. The first method, indirect GEK, defines a small but finite stepsize <math>h</math>, and uses the gradient information to append synthetic data to the observations <math>y</math>, see for example.<ref name = dwight2009 /> Indirect Kriging is sensitive to the choice of the step-size <math>h</math> and cannot include [[measurement uncertainty|observation uncertainties]]. |
|||
=== Direct GEK (through prior covariance matrix) === |
|||
Direct GEK is a form of co-Kriging, where we add the gradient information as co-variables. This can be done by modifying the prior covariance <math>P</math> or by modifying the observation matrix <math>H</math>; both approaches lead to the same GEK predictor. When we construct direct GEK through the prior covariance matrix, we append the partial derivatives to <math>y</math>, and modify the prior covariance matrix <math>P</math> such that it also contains the derivatives (and second derivatives) of the covariance function, see for example |
|||
<ref name = laurenceau2008>{{cite journal | last1 = Laurenceau | first1 = J. | last2 = Sagaut | first2 = P. | s2cid = 17895486 | year = 2008 | title = Building efficient response surfaces of aerodynamic functions with Kriging and coKriging | journal = AIAA Journal | volume = 46 | issue = 2 | pages = 498–507 | doi =10.2514/1.32308| bibcode = 2008AIAAJ..46..498L }}</ref> |
|||
.<ref name = debaar2014a /> |
|||
The main advantages of direct GEK over indirect GEK are: 1) we do not have to choose a step-size, 2) we can include [[measurement uncertainty|observation uncertainties]] for the gradients in <math>R</math>, and 3) it is less susceptible to poor [[condition number|conditioning]] of the gain matrix <math>K</math>. |
|||
<ref name = debaar2014a /> |
|||
<ref name = dwight2009 /> |
|||
=== Direct GEK (through observation matrix) === |
|||
Another way of arriving at the same direct GEK predictor is to append the partial derivatives to the observations <math>y</math> and include partial derivative operators in the observation matrix <math>H</math>, see for example.<ref>{{cite journal | last1 = de Baar | first1 = J.H.S. | year = 2014 | title = Stochastic Surrogates for Measurements and Computer Models of Fluids | url = http://repository.tudelft.nl/islandora/object/uuid%3A79d2dc5a-9a7c-4179-a417-a50cbba2bc5f?collection=research | journal = PhD Thesis, Delft University of Technology | pages = 99–101 }}</ref> |
|||
=== Gradient-enhanced kriging for high-dimensional problems (Indirect method) === |
|||
Current gradient-enhanced kriging methods do not scale well with the number of sampling points due to the rapid growth in the size of the correlation matrix, where new information is added for each sampling point in each direction of the design space. Furthermore, they do not scale well with the number of independent variables due to the increase in the number of hyperparameters that needs to be estimated. |
|||
To address this issue, a new gradient-enhanced surrogate model approach that drastically reduced the number of hyperparameters through the use of the [[Partial least squares regression|partial-least squares]] method that maintains accuracy is developed. In addition, this method is able to control the size of the correlation matrix by adding only relevant points defined through the information provided by the partial-least squares method. For more details, see.<ref name = bouhlel2018>{{cite journal | last1 = Bouhlel | first1 = M.A. | last2 = Martins | first2 = J.R.R.A. | year = 2018 | title = Gradient-enhanced kriging for high-dimensional problems | journal = Engineering with Computers | volume = 35 | pages = 157–173 | doi =10.1007/s00366-018-0590-x| arxiv = 1708.02663 | s2cid = 3540630 }}</ref> |
|||
This approach is implemented into the Surrogate Modeling Toolbox (SMT) in Python (https://github.com/SMTorg/SMT), and it runs on Linux, macOS, and Windows. SMT is distributed under the New BSD license. |
|||
=== Augmented gradient-enhanced kriging (direct method) === |
|||
A universal augmented framework is proposed in <ref name = zhang2021>{{cite journal |last1=Zhang |first1=Sheng |last2=Yang |first2=Xiu |last3=Tindel |first3=Samy |last4=Lin |first4=Guang |title=Augmented Gaussian random field: Theory and computation |journal=Discrete & Continuous Dynamical Systems - S |date=2021 |volume=15 |issue=4 |page=931 |doi=10.3934/dcdss.2021098|arxiv=2009.01961 |s2cid=221507566 }}</ref> to append derivatives of any order to the observations. This method can be viewed as a generalization of Direct GEK that takes into account higher-order derivatives. Also, the observations and derivatives are not required to be measured at the same location under this framework. |
|||
== Example: Drag coefficient of a transonic airfoil == |
== Example: Drag coefficient of a transonic airfoil == |
||
[[File:Transonic airfoil.png|thumb|400px|Transonic airfoil.]] |
|||
[[File:GEK airfoil reference.png|thumb|400px|Reference results for the drag coefficient of a transonic airfoil, based on a large number of CFD simulations. The horizontal and vertical axis show the deformation of the shape of the airfoil.]] |
[[File:GEK airfoil reference.png|thumb|400px|Reference results for the drag coefficient of a transonic airfoil, based on a large number of CFD simulations. The horizontal and vertical axis show the deformation of the shape of the airfoil.]] |
||
Line 24: | Line 99: | ||
[[File:GEK airfoil GEK.png|thumb|400px|GEK surrogate model of the drag coefficient of a transonic airfoil. The gray dots indicate the configurations for which the CFD solver was run, the arrows indicate the gradients.]] |
[[File:GEK airfoil GEK.png|thumb|400px|GEK surrogate model of the drag coefficient of a transonic airfoil. The gray dots indicate the configurations for which the CFD solver was run, the arrows indicate the gradients.]] |
||
As an example, consider the flow over a [[transonic]] [[airfoil]].<ref name = debaar2015/> The airfoil is operating at a [[Mach number]] of 0.8 and an [[angle of attack]] of 1.25 degrees. We assume that the shape of the airfoil is uncertain; the top and the bottom of the airfoil might have shifted up or down due to manufacturing tolerances. In other words, the shape of the airfoil that we are using might be slightly different from the airfoil that we designed. |
|||
== References == |
|||
<!--- See [[Wikipedia:Footnotes]] on how to create references using <ref></ref> tags which will then appear here automatically --> |
|||
{{Reflist}} |
|||
On the right we see the reference results for the [[drag coefficient]] of the airfoil, based on a large number of CFD simulations. Note that the lowest drag, which corresponds to 'optimal' performance, is close to the undeformed 'baseline' design of the airfoil at (0,0). |
|||
== External links == |
|||
<!-- Use the format: * [http://www.example.com/ example.com] --> |
|||
After designing a sampling plan (indicated by the gray dots) and running the CFD solver at those sample locations, we obtain the Kriging surrogate model. The Kriging surrogate is close to the reference, but perhaps not as close as we would desire. |
|||
In the last figure, we have improved the accuracy of this surrogate model by including the adjoint-based gradient information, indicated by the arrows, and applying GEK. |
|||
== Applications == |
|||
GEK has found the following applications: |
|||
* 1993: Design problem for a borehole model test-function.<ref name = morris1993>{{cite journal | last1 = Morris | first1 = M.D. | last2 = Mitchell | first2 = T.J. | last3 = Ylvisaker | first3 = D. | year = 1993 | title = Bayesian Design and Analysis of Computer Experiments: Use of Derivatives in Surface Prediction | journal = Technometrics | volume = 35 | issue = 3 | pages = 243–255 | doi =10.1080/00401706.1993.10485320| url = https://digital.library.unt.edu/ark:/67531/metadc1100641/ }}</ref> |
|||
* 2002: Aerodynamic design of a supersonic business jet.<ref name = chung2002>{{cite journal | last1 = Chung | first1 = H.-S. | last2 = Alonso | first2 = J.J. | year = 2002 | title = Using Gradients to Construct Cokriging Approximation Models for High-Dimensional Design Optimization Problems | journal = AIAA 40th Aerospace Sciences Meeting and Exhibit | pages = 2002–0317| doi =10.2514/6.2002-317| citeseerx = 10.1.1.12.4149 }}</ref> |
|||
* 2008: Uncertainty quantification for a transonic airfoil with uncertain shape parameters.<ref name = laurenceau2008 /> |
|||
* 2009: Uncertainty quantification for a transonic airfoil with uncertain shape parameters.<ref name = dwight2009 /> |
|||
* 2012: Surrogate model construction for a panel divergence problem, a [[fluid-structure interaction]] problem. Demonstration of a linear speedup.<ref name = debaar2012 /> |
|||
* 2013: Uncertainty quantification for a transonic airfoil with uncertain angle of attack and Mach number.<ref name = han2013>{{cite journal | last1 = Han | first1 = Z.-H. | last2 = Gortz | first2 = S. | last3 = Zimmermann | first3 = R. | year = 2013 | title = Improving variable-fidelity surrogate modeling via gradient-enhanced kriging and a generalized hybrid bridge function |doi=10.1016/j.ast.2012.01.006 | journal = Engineering with Computers | volume = 32 | issue = 1 | pages = 15–34 }}</ref> |
|||
* 2014: Uncertainty quantification for the RANS simulation of an airfoil, with the model parameters of the k-epsilon turbulence model as uncertain inputs.<ref name = debaar2014a /> |
|||
* 2015: Uncertainty quantification for the Euler simulation of a transonic airfoil with uncertain shape parameters. Demonstration of a linear speedup.<ref name = debaar2015 /> |
|||
* 2016: Surrogate model construction for two [[fluid-structure interaction]] problems.<ref name = ulaganathan2016>{{cite journal | last1 = Ulaganathan | first1 = S. | last2 = Couckuyt | first2 = I. | last3 = Dhaene | first3 = T. | last4 = Degroote | first4 = J. | last5 = Laermans | first5 = E. | year = 2016 | title = Performance study of gradient-enhanced Kriging | url = http://www.sciencedirect.com/science/article/pii/S127096381200017X | journal = Aerospace Science and Technology | volume = 25 | issue = 1 | pages = 177–189 }}</ref> |
|||
* 2017: Large review of gradient-enhanced surrogate models including many details concerning gradient-enhanced kriging.<ref name = laurent2017>{{cite journal | last1 = Laurent | first1 = L. | last2 = Le Riche | first2 = R. | last3 = Soulier | first3 = B. | last4 = Boucard | first4 = P.-A. | year = 2017 | title = An overview of gradient-enhanced metamodels with applications | journal = Archives of Computational Methods in Engineering | volume = 26 | pages = 1–46 | doi =10.1007/s11831-017-9226-3| s2cid = 54625655 | url = https://hal-emse.ccsd.cnrs.fr/emse-01525674/file/paperHAL.pdf }}</ref> |
|||
* 2017: Uncertainty propagation for a nuclear energy system.<ref name = lockwood2010>{{cite journal | last1 = Lockwood | first1 = B.A. | last2 = Anitescu | first2 = M. | year = 2012| title = Gradient-Enhanced Universal Kriging for Uncertainty Propagation | url = http://www.mcs.anl.gov/papers/1808-1110.pdf | journal = Nuclear Science and Engineering| volume = 170| issue = 2| pages = 168–195| doi =10.13182/NSE10-86| citeseerx = 10.1.1.187.6097 | s2cid = 18465024 }}</ref> |
|||
* 2020: Molecular geometry optimization.<ref name = Raggi2020>{{cite journal | last1 = Raggi | first1 = G. | last2 = Fdez. Galván | first2 = I. | last3 = Ritterhoff | first3 = C. L. | last4 = Vacher | first4 = M. | last5 = Lindh | first5 = R. | year = 2020 | title = Restricted-Variance Molecular Geometry Optimization Based on Gradient-Enhanced Kriging | journal = Journal of Chemical Theory and Computation | volume = 16 | issue = 6 | pages = 3989–4001 | doi = 10.1021/acs.jctc.0c00257| pmid = 32374164 | pmc = 7304864 | doi-access = free }}</ref> |
|||
== References == |
|||
{{Reflist}} |
|||
[[Category:Mathematical modeling]] |
|||
<!--- Categories ---> |
|||
[[Category: |
[[Category:Computational fluid dynamics]] |
Latest revision as of 10:49, 5 October 2024
A major contributor to this article appears to have a close connection with its subject. (April 2017) |
This article possibly contains original research. (April 2017) |
Gradient-enhanced kriging (GEK) is a surrogate modeling technique used in engineering. A surrogate model (alternatively known as a metamodel, response surface or emulator) is a prediction of the output of an expensive computer code.[1] This prediction is based on a small number of evaluations of the expensive computer code.
Introduction
[edit]Adjoint solvers are now becoming available in a range of computational fluid dynamics (CFD) solvers, such as Fluent, OpenFOAM, SU2 and US3D. Originally developed for optimization, adjoint solvers are now finding more and more use in uncertainty quantification.
Linear speedup
[edit]An adjoint solver allows one to compute the gradient of the quantity of interest with respect to all design parameters at the cost of one additional solve. This, potentially, leads to a linear speedup: the computational cost of constructing an accurate surrogate decrease, and the resulting computational speedup scales linearly with the number of design parameters.
The reasoning behind this linear speedup is straightforward. Assume we run primal solves and adjoint solves, at a total cost of . This results in data; values for the quantity of interest and partial derivatives in each of the gradients. Now assume that each partial derivative provides as much information for our surrogate as a single primal solve. Then, the total cost of getting the same amount of information from primal solves only is . The speedup is the ratio of these costs: [2] [3]
A linear speedup has been demonstrated for a fluid-structure interaction problem [2] and for a transonic airfoil.[3]
Noise
[edit]One issue with adjoint-based gradients in CFD is that they can be particularly noisy. [4] [5] When derived in a Bayesian framework, GEK allows one to incorporate not only the gradient information, but also the uncertainty in that gradient information.[6]
Approach
[edit]When using GEK one takes the following steps:
- Create a design of experiment (DoE): The DoE or 'sampling plan' is a list of different locations in the design space. The DoE indicates which combinations of parameters one will use to sample the computer simulation. With Kriging and GEK, a common choice is to use a Latin Hypercube Design (LHS) design with a 'maximin' criterion. The LHS-design is available in scripting codes like MATLAB or Python.
- Make observations: For each sample in our DoE one runs the computer simulation to obtain the Quantity of Interest (QoI).
- Construct the surrogate: One uses the GEK predictor equations to construct the surrogate conditional on the obtained observations.
Once the surrogate has been constructed it can be used in different ways, for example for surrogate-based uncertainty quantification (UQ) or optimization.
Predictor equations
[edit]In a Bayesian framework, we use Bayes' Theorem to predict the Kriging mean and covariance conditional on the observations. When using GEK, the observations are usually the results of a number of computer simulations. GEK can be interpreted as a form of Gaussian process regression.
Kriging
[edit]Along the lines of, [7] we are interested in the output of our computer simulation, for which we assume the normal prior probability distribution:
with prior mean and prior covariance matrix . The observations have the normal likelihood:
with the observation matrix and the observation error covariance matrix, which contains the observation uncertainties. After applying Bayes' Theorem we obtain a normally distributed posterior probability distribution, with Kriging mean:
and Kriging covariance:
where we have the gain matrix:
In Kriging, the prior covariance matrix is generated from a covariance function. One example of a covariance function is the Gaussian covariance:
where we sum over the dimensions and are the input parameters. The hyperparameters , and can be estimated from a Maximum Likelihood Estimate (MLE).[6] [8] [9]
Indirect GEK
[edit]There are several ways of implementing GEK. The first method, indirect GEK, defines a small but finite stepsize , and uses the gradient information to append synthetic data to the observations , see for example.[8] Indirect Kriging is sensitive to the choice of the step-size and cannot include observation uncertainties.
Direct GEK (through prior covariance matrix)
[edit]Direct GEK is a form of co-Kriging, where we add the gradient information as co-variables. This can be done by modifying the prior covariance or by modifying the observation matrix ; both approaches lead to the same GEK predictor. When we construct direct GEK through the prior covariance matrix, we append the partial derivatives to , and modify the prior covariance matrix such that it also contains the derivatives (and second derivatives) of the covariance function, see for example [10] .[6] The main advantages of direct GEK over indirect GEK are: 1) we do not have to choose a step-size, 2) we can include observation uncertainties for the gradients in , and 3) it is less susceptible to poor conditioning of the gain matrix . [6] [8]
Direct GEK (through observation matrix)
[edit]Another way of arriving at the same direct GEK predictor is to append the partial derivatives to the observations and include partial derivative operators in the observation matrix , see for example.[11]
Gradient-enhanced kriging for high-dimensional problems (Indirect method)
[edit]Current gradient-enhanced kriging methods do not scale well with the number of sampling points due to the rapid growth in the size of the correlation matrix, where new information is added for each sampling point in each direction of the design space. Furthermore, they do not scale well with the number of independent variables due to the increase in the number of hyperparameters that needs to be estimated. To address this issue, a new gradient-enhanced surrogate model approach that drastically reduced the number of hyperparameters through the use of the partial-least squares method that maintains accuracy is developed. In addition, this method is able to control the size of the correlation matrix by adding only relevant points defined through the information provided by the partial-least squares method. For more details, see.[12] This approach is implemented into the Surrogate Modeling Toolbox (SMT) in Python (https://github.com/SMTorg/SMT), and it runs on Linux, macOS, and Windows. SMT is distributed under the New BSD license.
Augmented gradient-enhanced kriging (direct method)
[edit]A universal augmented framework is proposed in [9] to append derivatives of any order to the observations. This method can be viewed as a generalization of Direct GEK that takes into account higher-order derivatives. Also, the observations and derivatives are not required to be measured at the same location under this framework.
Example: Drag coefficient of a transonic airfoil
[edit]As an example, consider the flow over a transonic airfoil.[3] The airfoil is operating at a Mach number of 0.8 and an angle of attack of 1.25 degrees. We assume that the shape of the airfoil is uncertain; the top and the bottom of the airfoil might have shifted up or down due to manufacturing tolerances. In other words, the shape of the airfoil that we are using might be slightly different from the airfoil that we designed.
On the right we see the reference results for the drag coefficient of the airfoil, based on a large number of CFD simulations. Note that the lowest drag, which corresponds to 'optimal' performance, is close to the undeformed 'baseline' design of the airfoil at (0,0).
After designing a sampling plan (indicated by the gray dots) and running the CFD solver at those sample locations, we obtain the Kriging surrogate model. The Kriging surrogate is close to the reference, but perhaps not as close as we would desire.
In the last figure, we have improved the accuracy of this surrogate model by including the adjoint-based gradient information, indicated by the arrows, and applying GEK.
Applications
[edit]GEK has found the following applications:
- 1993: Design problem for a borehole model test-function.[13]
- 2002: Aerodynamic design of a supersonic business jet.[14]
- 2008: Uncertainty quantification for a transonic airfoil with uncertain shape parameters.[10]
- 2009: Uncertainty quantification for a transonic airfoil with uncertain shape parameters.[8]
- 2012: Surrogate model construction for a panel divergence problem, a fluid-structure interaction problem. Demonstration of a linear speedup.[2]
- 2013: Uncertainty quantification for a transonic airfoil with uncertain angle of attack and Mach number.[15]
- 2014: Uncertainty quantification for the RANS simulation of an airfoil, with the model parameters of the k-epsilon turbulence model as uncertain inputs.[6]
- 2015: Uncertainty quantification for the Euler simulation of a transonic airfoil with uncertain shape parameters. Demonstration of a linear speedup.[3]
- 2016: Surrogate model construction for two fluid-structure interaction problems.[16]
- 2017: Large review of gradient-enhanced surrogate models including many details concerning gradient-enhanced kriging.[17]
- 2017: Uncertainty propagation for a nuclear energy system.[18]
- 2020: Molecular geometry optimization.[19]
References
[edit]- ^ Mitchell, M.; Morris, M. (1992). "Bayesian design and analysis of computer experiments: two examples" (PDF). Statistica Sinica (2): 359–379.
- ^ a b c de Baar, J.H.S.; Scholcz, T.P.; Verhoosel, C.V.; Dwight, R.P.; van Zuijlen, A.H.; Bijl, H. (2012). "Efficient uncertainty quantification with gradient-enhanced Kriging: Applications in FSI" (PDF). ECCOMAS, Vienna, Austria, September 10–14.
- ^ a b c d de Baar, J.H.S.; Scholcz, T.P.; Dwight, R.P. (2015). "Exploiting Adjoint Derivatives in High-Dimensional Metamodels". AIAA Journal. 53 (5): 1391–1395. Bibcode:2015AIAAJ..53.1391D. doi:10.2514/1.J053678.
- ^ Dwight, R.; Brezillon, J. (2006). "Effect of Approximations of the Discrete Adjoint on Gradient-Based Optimization". AIAA Journal. 44 (12): 3022–3031. Bibcode:2006AIAAJ..44.3022D. CiteSeerX 10.1.1.711.4761. doi:10.2514/1.21744.
- ^ Giles, M.; Duta, M.; Muller, J.; Pierce, N. (2003). "Algorithm Developments for Discrete Adjoint Methods". AIAA Journal. 41 (2): 198–205. Bibcode:2003AIAAJ..41..198G. doi:10.2514/2.1961. S2CID 2106397.
- ^ a b c d e de Baar, J.H.S.; Dwight, R.P.; Bijl, H. (2014). "Improvements to gradient-enhanced Kriging using a Bayesian interpretation". International Journal for Uncertainty Quantification. 4 (3): 205–223. doi:10.1615/Int.J.UncertaintyQuantification.2013006809.
- ^ Wikle, C.K.; Berliner, L.M. (2007). "A Bayesian tutorial for data assimilation". Physica D. 230 (1–2): 1–16. Bibcode:2007PhyD..230....1W. doi:10.1016/j.physd.2006.09.017.
- ^ a b c d Dwight, R.P.; Han, Z.-H. (2009). Efficient uncertainty quantification using gradient-enhanced Kriging (PDF). doi:10.2514/6.2009-2276. ISBN 978-1-60086-975-4. S2CID 59019628.
{{cite book}}
:|journal=
ignored (help) - ^ a b Zhang, Sheng; Yang, Xiu; Tindel, Samy; Lin, Guang (2021). "Augmented Gaussian random field: Theory and computation". Discrete & Continuous Dynamical Systems - S. 15 (4): 931. arXiv:2009.01961. doi:10.3934/dcdss.2021098. S2CID 221507566.
- ^ a b Laurenceau, J.; Sagaut, P. (2008). "Building efficient response surfaces of aerodynamic functions with Kriging and coKriging". AIAA Journal. 46 (2): 498–507. Bibcode:2008AIAAJ..46..498L. doi:10.2514/1.32308. S2CID 17895486.
- ^ de Baar, J.H.S. (2014). "Stochastic Surrogates for Measurements and Computer Models of Fluids". PhD Thesis, Delft University of Technology: 99–101.
- ^ Bouhlel, M.A.; Martins, J.R.R.A. (2018). "Gradient-enhanced kriging for high-dimensional problems". Engineering with Computers. 35: 157–173. arXiv:1708.02663. doi:10.1007/s00366-018-0590-x. S2CID 3540630.
- ^ Morris, M.D.; Mitchell, T.J.; Ylvisaker, D. (1993). "Bayesian Design and Analysis of Computer Experiments: Use of Derivatives in Surface Prediction". Technometrics. 35 (3): 243–255. doi:10.1080/00401706.1993.10485320.
- ^ Chung, H.-S.; Alonso, J.J. (2002). "Using Gradients to Construct Cokriging Approximation Models for High-Dimensional Design Optimization Problems". AIAA 40th Aerospace Sciences Meeting and Exhibit: 2002–0317. CiteSeerX 10.1.1.12.4149. doi:10.2514/6.2002-317.
- ^ Han, Z.-H.; Gortz, S.; Zimmermann, R. (2013). "Improving variable-fidelity surrogate modeling via gradient-enhanced kriging and a generalized hybrid bridge function". Engineering with Computers. 32 (1): 15–34. doi:10.1016/j.ast.2012.01.006.
- ^ Ulaganathan, S.; Couckuyt, I.; Dhaene, T.; Degroote, J.; Laermans, E. (2016). "Performance study of gradient-enhanced Kriging". Aerospace Science and Technology. 25 (1): 177–189.
- ^ Laurent, L.; Le Riche, R.; Soulier, B.; Boucard, P.-A. (2017). "An overview of gradient-enhanced metamodels with applications" (PDF). Archives of Computational Methods in Engineering. 26: 1–46. doi:10.1007/s11831-017-9226-3. S2CID 54625655.
- ^ Lockwood, B.A.; Anitescu, M. (2012). "Gradient-Enhanced Universal Kriging for Uncertainty Propagation" (PDF). Nuclear Science and Engineering. 170 (2): 168–195. CiteSeerX 10.1.1.187.6097. doi:10.13182/NSE10-86. S2CID 18465024.
- ^ Raggi, G.; Fdez. Galván, I.; Ritterhoff, C. L.; Vacher, M.; Lindh, R. (2020). "Restricted-Variance Molecular Geometry Optimization Based on Gradient-Enhanced Kriging". Journal of Chemical Theory and Computation. 16 (6): 3989–4001. doi:10.1021/acs.jctc.0c00257. PMC 7304864. PMID 32374164.