Matérn covariance function: Difference between revisions
LucasBrown (talk | contribs) Adding short description: "Tool in multivariate statistical analysis" |
|||
(46 intermediate revisions by 25 users not shown) | |||
Line 1: | Line 1: | ||
{{Short description|Tool in multivariate statistical analysis}} |
|||
In [[statistics]], the '''Matérn covariance''' |
In [[statistics]], the '''Matérn covariance''', also called the '''Matérn kernel''',<ref name="Genton2002">{{cite journal |last1=Genton |first1=Marc G. |title=Classes of kernels for machine learning: a statistics perspective |journal=The Journal of Machine Learning Research |date=1 March 2002 |volume=2 |issue=3/1/2002 |pages=303–304 |url=https://dl.acm.org/doi/10.5555/944790.944815 |language=EN}}</ref> is a [[covariance function]] used in [[spatial statistics]], [[geostatistics]], [[machine learning]], image analysis, and other applications of multivariate statistical analysis on [[metric space]]s. It is named after the Swedish forestry statistician [[Bertil Matérn]].<ref>{{Cite journal| first1 = B. | last2 = McBratney|first2= A. B. | title = The Matérn function as a general model for soil variograms| last1 = Minasny| journal = Geoderma | volume = 128| issue = 3–4 | pages = 192–207 | year = 2005 | doi = 10.1016/j.geoderma.2005.04.003}}</ref> It specifies the covariance between two measurements as a function of the distance <math>d</math> between the points at which they are taken. Since the covariance only depends on distances between points, it is [[stationary process|stationary]]. If the distance is [[Euclidean distance]], the Matérn covariance is also [[isotropic]]. |
||
== Definition == |
== Definition == |
||
The Matérn covariance between two points separated by ''d'' distance units is given by <ref>Rasmussen, Carl Edward (2006) [http://www.gaussianprocess.org/gpml/chapters/RW4.pdf Gaussian Processes for Machine Learning]</ref> |
The Matérn covariance between measurements taken at two points separated by ''d'' distance units is given by <ref name=RasmWill2006>Rasmussen, Carl Edward and Williams, Christopher K. I. (2006) [http://www.gaussianprocess.org/gpml/chapters/RW4.pdf Gaussian Processes for Machine Learning]</ref> |
||
:<math> |
:<math> |
||
C_\nu(d) = \sigma^2\frac{2^{1-\nu}}{\Gamma(\nu)}\Bigg(\sqrt{2\nu}\frac{d}{\rho}\Bigg)^\nu K_\nu\Bigg(\sqrt{2\nu}\frac{d}{\rho}\Bigg) |
C_\nu(d) = \sigma^2\frac{2^{1-\nu}}{\Gamma(\nu)}\Bigg(\sqrt{2\nu}\frac{d}{\rho}\Bigg)^\nu K_\nu\Bigg(\sqrt{2\nu}\frac{d}{\rho}\Bigg), |
||
</math> |
</math> |
||
where <math>\Gamma</math> is the [[gamma function]], <math>K_\nu</math> is the modified [[Bessel function]] of the second kind, and ρ and |
where <math>\Gamma</math> is the [[gamma function]], <math>K_\nu</math> is the modified [[Bessel function]] of the second kind, and ''ρ'' and <math>\nu</math> are positive [[parameter]]s of the covariance. |
||
A [[Gaussian process]] with Matérn covariance |
A [[Gaussian process]] with Matérn covariance is <math>\lceil \nu \rceil-1</math> times differentiable in the mean-square sense.<ref name=RasmWill2006/><ref name=R>Santner, T. J., Williams, B. J., & Notz, W. I. (2013). ''The design and analysis of computer experiments.'' Springer Science & Business Media.</ref> |
||
== Spectral density == |
|||
⚫ | |||
The power spectrum of a process with Matérn covariance defined on <math>\mathbb{R}^n</math> is the (''n''-dimensional) Fourier transform of the Matérn covariance function (see [[Wiener–Khinchin theorem]]). Explicitly, this is given by |
|||
⚫ | |||
When <math>\nu = p+1/2,\ p\in \mathbb{N}^+</math> , the '''Matérn covariance''' can be written as a product of an exponential and a polynomial of order <math>p</math>:<ref>{{Cite book|title=Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables|last=Abramowitz and Stegun|first=|publisher=|year=|isbn=0-486-61272-4|location=|pages=}}</ref> |
|||
⚫ | |||
⚫ | |||
S(f)=\sigma^2\frac{2^n\pi^{n/2}\Gamma(\nu+\frac{n}2)(2\nu)^\nu}{\Gamma(\nu)\rho^{2\nu}}\left(\frac{2\nu}{\rho^2} + 4\pi^2f^2\right)^{-\left(\nu+\frac{n}2\right)}. |
|||
</math><ref name=RasmWill2006 /> |
|||
⚫ | |||
⚫ | |||
When <math>\nu = p+1/2,\ p\in \mathbb{N}^+</math> , the '''Matérn covariance''' can be written as a product of an exponential and a polynomial of degree <math>p</math>.<ref name=Stein1999>Stein, M. L. (1999). ''Interpolation of spatial data: some theory for kriging.'' Springer Series in Statistics.</ref><ref name=GuttorpGneiting2006>Peter Guttorp & Tilmann Gneiting, 2006. "Studies in the history of probability and statistics XLIX On the Matern correlation family," Biometrika, Biometrika Trust, vol. 93(4), pages 989-995, December.</ref> The modified Bessel function of a fractional order is given by Equations 10.1.9 and 10.2.15<ref name=AbramowitzStegun1965>{{Cite book|title=Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables|last=Abramowitz and Stegun|year=1965 |isbn=0-486-61272-4|url-access=registration|url=https://archive.org/details/handbookofmathe000abra}}</ref> as |
|||
<math display="block">\sqrt{\frac{\pi}{2z}} K_{p+1/2}(z) = \frac{\pi}{2z}e^{-z}\sum_{k=0}^n \frac{(n+k)!}{k!\Gamma(n-k+1)} \left( 2z \right) ^{-k} </math> . |
|||
This allows for the Matérn covariance of half-integer values of <math>\nu</math> to be expressed as |
|||
⚫ | |||
which gives: |
which gives: |
||
* for <math>\nu = 1/2\ (p=0)</math>: <math>C_{1/2}(d) = \sigma^2\exp\left(-\frac{d}{\rho}\right)</math> |
* for <math>\nu = 1/2\ (p=0)</math>: <math>C_{1/2}(d) = \sigma^2\exp\left(-\frac{d}{\rho}\right),</math> |
||
* for <math>\nu = 3/2\ (p=1)</math>: <math>C_{3/2}(d) = \sigma^2\left(1+\frac{\sqrt{3}d}{\rho}\right)\exp\left(-\frac{\sqrt{3}d}{\rho}\right)</math> |
* for <math>\nu = 3/2\ (p=1)</math>: <math>C_{3/2}(d) = \sigma^2\left(1+\frac{\sqrt{3}d}{\rho}\right)\exp\left(-\frac{\sqrt{3}d}{\rho}\right),</math> |
||
* for <math>\nu = 5/2\ (p=2)</math>: <math>C_{5/2}(d) = \sigma^2\left(1+\frac{\sqrt{5}d}{\rho}+\frac{5d^2}{3\rho^2}\right)\exp\left(-\frac{\sqrt{5}d}{\rho}\right)</math> |
* for <math>\nu = 5/2\ (p=2)</math>: <math>C_{5/2}(d) = \sigma^2\left(1+\frac{\sqrt{5}d}{\rho}+\frac{5d^2}{3\rho^2}\right)\exp\left(-\frac{\sqrt{5}d}{\rho}\right).</math> |
||
=== The Gaussian case in the limit of infinite ν === |
=== The Gaussian case in the limit of infinite ''ν'' === |
||
As <math>\nu\rightarrow\infty</math>, the '''Matérn covariance''' converges to the |
As <math>\nu\rightarrow\infty</math>, the '''Matérn covariance''' converges to the squared exponential [[covariance function]] |
||
:<math> |
:<math> |
||
\lim_{\nu\rightarrow\infty}C_\nu(d) = \sigma^2\exp\left(-\frac{d^2}{2\rho^2}\right) |
\lim_{\nu\rightarrow\infty}C_\nu(d) = \sigma^2\exp\left(-\frac{d^2}{2\rho^2}\right). |
||
</math> |
</math> |
||
== Taylor series at zero and spectral moments == |
== Taylor series at zero and spectral moments == |
||
The behavior for <math> |
The behavior for <math> |
||
d\rightarrow0 |
d\rightarrow0 |
||
</math> can be obtained by the following Taylor series: |
</math> can be obtained by the following [[Taylor series]] (reference is needed, the formula below leads to division by zero in case <math>\nu = 1</math>): |
||
<math display="block"> |
<math display="block"> |
||
Line 41: | Line 54: | ||
When defined, the following spectral moments can be derived from the Taylor series: |
When defined, the following spectral moments can be derived from the Taylor series: |
||
: <math> |
|||
\begin{align} |
|||
\lambda_0 = C_\nu(0) = \sigma^2 |
\lambda_0 & = C_\nu(0) = \sigma^2, \\[8pt] |
||
</math>, |
|||
⚫ | |||
⚫ | |||
\end{align} |
|||
⚫ | |||
</math> |
</math> |
||
==See also== |
==See also== |
||
Line 56: | Line 69: | ||
{{DEFAULTSORT:Matern covariance function}} |
{{DEFAULTSORT:Matern covariance function}} |
||
[[Category:Geostatistics]] |
[[Category:Geostatistics]] |
||
[[Category:Spatial |
[[Category:Spatial analysis]] |
||
[[Category:Covariance and correlation]] |
[[Category:Covariance and correlation]] |
Latest revision as of 06:46, 15 May 2024
In statistics, the Matérn covariance, also called the Matérn kernel,[1] is a covariance function used in spatial statistics, geostatistics, machine learning, image analysis, and other applications of multivariate statistical analysis on metric spaces. It is named after the Swedish forestry statistician Bertil Matérn.[2] It specifies the covariance between two measurements as a function of the distance between the points at which they are taken. Since the covariance only depends on distances between points, it is stationary. If the distance is Euclidean distance, the Matérn covariance is also isotropic.
Definition
[edit]The Matérn covariance between measurements taken at two points separated by d distance units is given by [3]
where is the gamma function, is the modified Bessel function of the second kind, and ρ and are positive parameters of the covariance.
A Gaussian process with Matérn covariance is times differentiable in the mean-square sense.[3][4]
Spectral density
[edit]The power spectrum of a process with Matérn covariance defined on is the (n-dimensional) Fourier transform of the Matérn covariance function (see Wiener–Khinchin theorem). Explicitly, this is given by
Simplification for specific values of ν
[edit]Simplification for ν half integer
[edit]When , the Matérn covariance can be written as a product of an exponential and a polynomial of degree .[5][6] The modified Bessel function of a fractional order is given by Equations 10.1.9 and 10.2.15[7] as
.
This allows for the Matérn covariance of half-integer values of to be expressed as
which gives:
- for :
- for :
- for :
The Gaussian case in the limit of infinite ν
[edit]As , the Matérn covariance converges to the squared exponential covariance function
Taylor series at zero and spectral moments
[edit]The behavior for can be obtained by the following Taylor series (reference is needed, the formula below leads to division by zero in case ):
When defined, the following spectral moments can be derived from the Taylor series:
See also
[edit]References
[edit]- ^ Genton, Marc G. (1 March 2002). "Classes of kernels for machine learning: a statistics perspective". The Journal of Machine Learning Research. 2 (3/1/2002): 303–304.
- ^ Minasny, B.; McBratney, A. B. (2005). "The Matérn function as a general model for soil variograms". Geoderma. 128 (3–4): 192–207. doi:10.1016/j.geoderma.2005.04.003.
- ^ a b c Rasmussen, Carl Edward and Williams, Christopher K. I. (2006) Gaussian Processes for Machine Learning
- ^ Santner, T. J., Williams, B. J., & Notz, W. I. (2013). The design and analysis of computer experiments. Springer Science & Business Media.
- ^ Stein, M. L. (1999). Interpolation of spatial data: some theory for kriging. Springer Series in Statistics.
- ^ Peter Guttorp & Tilmann Gneiting, 2006. "Studies in the history of probability and statistics XLIX On the Matern correlation family," Biometrika, Biometrika Trust, vol. 93(4), pages 989-995, December.
- ^ Abramowitz and Stegun (1965). Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables. ISBN 0-486-61272-4.