Jump to content

Scale parameter: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
Undid revision 562864011 by 112.135.252.35 (talk) rate was correct
 
(30 intermediate revisions by 21 users not shown)
Line 1: Line 1:
{{Short description|Statistical measure}}
{{Unreferenced|date=December 2009}}
{{More citations needed|date=December 2009}}
In [[probability theory]] and [[statistics]], a '''scale parameter''' is a special kind of [[numerical parameter]] of a [[parametric family]] of [[probability distribution]]s. The larger the scale parameter, the more spread out the distribution.

In [[probability theory]] and [[statistics]], a '''scale parameter''' is a special kind of [[numerical parameter]] of a [[parametric family]] of [[probability distribution]]s. The larger the scale parameter, the more [[dispersion (statistics)|spread out]] the distribution.


==Definition==
==Definition==
Line 7: Line 9:
:<math>F(x;s,\theta) = F(x/s;1,\theta), \!</math>
:<math>F(x;s,\theta) = F(x/s;1,\theta), \!</math>


then ''s'' is called a '''scale parameter''', since its value determines the "[[scale (ratio)|scale]]" or [[statistical dispersion]] of the probability distribution. If ''s'' is large, then the distribution will be more spread out; if ''s'' is small then it will be more concentrated.
then ''s'' is called a '''scale parameter''', since its value determines the "[[scale (ratio)|scale]]" or [[statistical dispersion]] of the probability distribution. If ''s'' is large, then the distribution will be more spread out; if ''s'' is small then it will be more concentrated.

[[File:Effects of a scale parameter on a positive-support probability distribution.gif|thumb|300px|Animation showing the effects of a scale parameter on a probability distribution supported on the positive real line.]]
[[File:Effect of a scale parameter over a mixture of two normal probability distributions.gif|thumb|300px|Effect of a scale parameter over a mixture of two normal probability distributions]]


If the [[probability density function|probability density]] exists for all values of the complete parameter set, then the density (as a function of the scale parameter only) satisfies
If the [[probability density function|probability density]] exists for all values of the complete parameter set, then the density (as a function of the scale parameter only) satisfies
:<math>f_s(x) = f(x/s)/s, \!</math>
:<math>f_s(x) = f(x/s)/s, \!</math>
where ''f'' is the density of a standardized version of the density.
where ''f'' is the density of a standardized version of the density, i.e. <math>f(x) \equiv f_{s=1}(x)</math>.


An [[estimator]] of a scale parameter is called an '''estimator of scale.'''
An [[estimator]] of a scale parameter is called an '''estimator of scale.'''

===Families with Location Parameters===
In the case where a parametrized family has a [[location parameter]], a slightly different definition is often used as follows. If we denote the location parameter by <math>m</math>, and the scale parameter by <math>s</math>, then we require that <math>F(x;s,m,\theta)=F((x-m)/s;1,0,\theta)</math> where <math>F(x,s,m,\theta)</math> is the cmd for the parametrized family.<ref>{{cite web |url= http://www.encyclopediaofmath.org/index.php?title=Scale_parameter&oldid=13206 |title= Scale parameter
|last=Prokhorov |first=A.V. |date= 7 February 2011 |website=Encyclopedia of Mathematics |publisher= Springer |access-date=7 February 2019}}</ref> This modification is necessary in order for the standard deviation of a non-central Gaussian to be a scale parameter, since otherwise the mean would change when we rescale <math>x</math>. However, this alternative definition is not consistently used.<ref>{{cite web |url=https://www.math.kth.se/matstat/gru/sf2955/scaleparameter |title= Scale parameter
|last=Koski |first=Timo |website=KTH Royal Institute of Technology|access-date=7 February 2019}}</ref>


===Simple manipulations===
===Simple manipulations===
We can write <math>f_s</math> in terms of <math>g(x) = x/s</math>, as follows:
We can write <math>f_s</math> in terms of <math>g(x) = x/s</math>, as follows:


:<math>f_s(x) = f(x/s) \times 1/s = f(g(x)) \times g'(x). \!</math>
:<math>f_s(x) = f\left(\frac{x}{s}\right) \cdot \frac{1}{s} = f(g(x))g'(x).</math>


Because ''f'' is a probability density function, it integrates to unity:
Because ''f'' is a probability density function, it integrates to unity:
Line 25: Line 35:
1 = \int_{-\infty}^{\infty} f(x)\,dx
1 = \int_{-\infty}^{\infty} f(x)\,dx
= \int_{g(-\infty)}^{g(\infty)} f(x)\,dx.
= \int_{g(-\infty)}^{g(\infty)} f(x)\,dx.
\!</math>
</math>


By the [[substitution rule]] of integral calculus, we then have
By the [[substitution rule]] of integral calculus, we then have


:<math>
:<math>
1 = \int_{-\infty}^{\infty} f(g(x)) \times g'(x)\,dx
1 = \int_{-\infty}^{\infty} f(g(x)) g'(x)\,dx
= \int_{-\infty}^{\infty} f_s(x)\,dx.
= \int_{-\infty}^{\infty} f_s(x)\,dx.
\!</math>
</math>


So <math>f_s</math> is also properly normalized.
So <math>f_s</math> is also properly normalized.


==Rate parameter==
==Rate parameter==
Some families of distributions use a '''rate parameter''' which is simply the reciprocal of the ''scale parameter''. So for example the [[exponential distribution]] with scale parameter β and probability density
Some families of distributions use a '''rate parameter''' (or '''inverse scale parameter'''), which is simply the reciprocal of the ''scale parameter''. So for example the [[exponential distribution]] with scale parameter β and probability density
:<math>f(x;\beta ) = \frac{1}{\beta} e^{-x/\beta} ,\; x \ge 0 </math>
:<math>f(x;\beta ) = \frac{1}{\beta} e^{-x/\beta} ,\; x \ge 0 </math>
could equally be written with rate parameter λ as
could equivalently be written with rate parameter λ as
:<math>f(x;\lambda) = \lambda e^{-\lambda x} ,\; x \ge 0. </math>
:<math>f(x;\lambda) = \lambda e^{-\lambda x} ,\; x \ge 0. </math>


==Examples==
==Examples==
* The [[Uniform distribution (continuous)|uniform distribution]] can be parameterized with a [[location parameter]] of <math>(a+b)/2</math> and a scale parameter <math>|b-a|</math>.
* The [[normal distribution]] has two parameters: a [[location parameter]] <math>\mu</math> and a scale parameter <math>\sigma</math>. In practice the normal distribution is often parameterized in terms of the ''squared'' scale <math>\sigma^2</math>, which corresponds to the [[variance]] of the distribution.
* The [[normal distribution]] has two parameters: a [[location parameter]] <math>\mu</math> and a scale parameter <math>\sigma</math>. In practice the normal distribution is often parameterized in terms of the ''squared'' scale <math>\sigma^2</math>, which corresponds to the [[variance]] of the distribution.

* The [[gamma distribution]] is usually parameterized in terms of a scale parameter <math>\theta</math> or its inverse.
* The [[gamma distribution]] is usually parameterized in terms of a scale parameter <math>\theta</math> or its inverse.

* Special cases of distributions where the scale parameter equals unity may be called "standard" under certain conditions. For example, if the location parameter equals zero and the scale parameter equals one, the [[normal distribution]] is known as the ''standard'' normal distribution, and the [[Cauchy distribution]] as the ''standard'' Cauchy distribution.
* Special cases of distributions where the scale parameter equals unity may be called "standard" under certain conditions. For example, if the location parameter equals zero and the scale parameter equals one, the [[normal distribution]] is known as the ''standard'' normal distribution, and the [[Cauchy distribution]] as the ''standard'' Cauchy distribution.


Line 54: Line 63:
* Scales linearly with the scale parameter, and
* Scales linearly with the scale parameter, and
* Converges as the sample size grows.
* Converges as the sample size grows.
Various [[Statistical_dispersion#Measures_of_statistical_dispersion|measures of statistical dispersion]] satisfy these.
Various [[Statistical dispersion#Measures of statistical dispersion|measures of statistical dispersion]] satisfy these.
In order to make the statistic a [[consistent estimator]] for the scale parameter, one must in general multiply the statistic by a constant [[scale factor]]. This scale factor is defined as the theoretical value of the value obtained by dividing the required scale parameter by the asymptotic value of the statistic. Note that the scale factor depends on the distribution in question.
In order to make the statistic a [[consistent estimator]] for the scale parameter, one must in general multiply the statistic by a constant [[scale factor]]. This scale factor is defined as the theoretical value of the value obtained by dividing the required scale parameter by the asymptotic value of the statistic. Note that the scale factor depends on the distribution in question.


For instance, in order to use the [[median absolute deviation]] (MAD) to estimate the [[standard deviation]] of the [[normal distribution]], one must multiply it by the factor
For instance, in order to use the [[median absolute deviation]] (MAD) to estimate the [[standard deviation]] of the [[normal distribution]], one must multiply it by the factor
:<math>1/\Phi^{-1}(3/4) \approx 1.4826,</math>
:<math>1/\Phi^{-1}(3/4) \approx 1.4826,</math>
where Φ<sup>−1</sup> is the [[quantile function]] (inverse of the [[cumulative distribution function]]) for the standard normal distribution. (See [[Median_absolute_deviation#Relation_to_standard_deviation|MAD]] for details.)
where Φ<sup>−1</sup> is the [[quantile function]] (inverse of the [[cumulative distribution function]]) for the standard normal distribution. (See [[Median absolute deviation#Relation to standard deviation|MAD]] for details.)
That is, the MAD is not a consistent estimator for the standard deviation of a normal distribution, but 1.4826... MAD is a consistent estimator.
That is, the MAD is not a consistent estimator for the standard deviation of a normal distribution, but 1.4826... MAD is a consistent estimator.
Similarly, the [[average absolute deviation]] needs to be multiplied by approximately 1.2533 to be a consistent estimator for standard deviation. Different factors would be required to estimate the standard deviation if the population did not follow a normal distribution.
Similarly, the [[average absolute deviation]] needs to be multiplied by approximately 1.2533 to be a consistent estimator for standard deviation. Different factors would be required to estimate the standard deviation if the population did not follow a normal distribution.
Line 68: Line 77:
* [[Location parameter]]
* [[Location parameter]]
* [[Location-scale family]]
* [[Location-scale family]]
* [[Mean-preserving spread]]
* [[Scale mixture]]
* [[Shape parameter]]
* [[Statistical dispersion]]
* [[Statistical dispersion]]

== References ==
{{Reflist}}

== Further reading ==
* {{cite book|last1=Mood|first1=A. M.|last2=Graybill|first2=F. A.|last3=Boes|first3=D. C.|title=Introduction to the theory of statistics|edition=3rd|publisher=McGraw-Hill|place=New York|chapter=VII.6.2 ''Scale invariance''|year=1974}}

{{Statistics|inference}}


{{DEFAULTSORT:Scale Parameter}}
{{DEFAULTSORT:Scale Parameter}}
[[Category:Theory of probability distributions]]
[[Category:Statistical parameters]]
[[Category:Statistical terminology]]

Latest revision as of 21:51, 23 November 2024

In probability theory and statistics, a scale parameter is a special kind of numerical parameter of a parametric family of probability distributions. The larger the scale parameter, the more spread out the distribution.

Definition

[edit]

If a family of probability distributions is such that there is a parameter s (and other parameters θ) for which the cumulative distribution function satisfies

then s is called a scale parameter, since its value determines the "scale" or statistical dispersion of the probability distribution. If s is large, then the distribution will be more spread out; if s is small then it will be more concentrated.

Animation showing the effects of a scale parameter on a probability distribution supported on the positive real line.
Effect of a scale parameter over a mixture of two normal probability distributions

If the probability density exists for all values of the complete parameter set, then the density (as a function of the scale parameter only) satisfies

where f is the density of a standardized version of the density, i.e. .

An estimator of a scale parameter is called an estimator of scale.

Families with Location Parameters

[edit]

In the case where a parametrized family has a location parameter, a slightly different definition is often used as follows. If we denote the location parameter by , and the scale parameter by , then we require that where is the cmd for the parametrized family.[1] This modification is necessary in order for the standard deviation of a non-central Gaussian to be a scale parameter, since otherwise the mean would change when we rescale . However, this alternative definition is not consistently used.[2]

Simple manipulations

[edit]

We can write in terms of , as follows:

Because f is a probability density function, it integrates to unity:

By the substitution rule of integral calculus, we then have

So is also properly normalized.

Rate parameter

[edit]

Some families of distributions use a rate parameter (or inverse scale parameter), which is simply the reciprocal of the scale parameter. So for example the exponential distribution with scale parameter β and probability density

could equivalently be written with rate parameter λ as

Examples

[edit]
  • The uniform distribution can be parameterized with a location parameter of and a scale parameter .
  • The normal distribution has two parameters: a location parameter and a scale parameter . In practice the normal distribution is often parameterized in terms of the squared scale , which corresponds to the variance of the distribution.
  • The gamma distribution is usually parameterized in terms of a scale parameter or its inverse.
  • Special cases of distributions where the scale parameter equals unity may be called "standard" under certain conditions. For example, if the location parameter equals zero and the scale parameter equals one, the normal distribution is known as the standard normal distribution, and the Cauchy distribution as the standard Cauchy distribution.

Estimation

[edit]

A statistic can be used to estimate a scale parameter so long as it:

  • Is location-invariant,
  • Scales linearly with the scale parameter, and
  • Converges as the sample size grows.

Various measures of statistical dispersion satisfy these. In order to make the statistic a consistent estimator for the scale parameter, one must in general multiply the statistic by a constant scale factor. This scale factor is defined as the theoretical value of the value obtained by dividing the required scale parameter by the asymptotic value of the statistic. Note that the scale factor depends on the distribution in question.

For instance, in order to use the median absolute deviation (MAD) to estimate the standard deviation of the normal distribution, one must multiply it by the factor

where Φ−1 is the quantile function (inverse of the cumulative distribution function) for the standard normal distribution. (See MAD for details.) That is, the MAD is not a consistent estimator for the standard deviation of a normal distribution, but 1.4826... MAD is a consistent estimator. Similarly, the average absolute deviation needs to be multiplied by approximately 1.2533 to be a consistent estimator for standard deviation. Different factors would be required to estimate the standard deviation if the population did not follow a normal distribution.

See also

[edit]

References

[edit]
  1. ^ Prokhorov, A.V. (7 February 2011). "Scale parameter". Encyclopedia of Mathematics. Springer. Retrieved 7 February 2019.
  2. ^ Koski, Timo. "Scale parameter". KTH Royal Institute of Technology. Retrieved 7 February 2019.

Further reading

[edit]
  • Mood, A. M.; Graybill, F. A.; Boes, D. C. (1974). "VII.6.2 Scale invariance". Introduction to the theory of statistics (3rd ed.). New York: McGraw-Hill.