Lehmann–Scheffé theorem: Difference between revisions

Content deleted Content added

Inline

Latest revision as of 16:14, 9 December 2024

In statistics, the Lehmann–Scheffé theorem is a prominent statement, tying together the ideas of completeness, sufficiency, uniqueness, and best unbiased estimation.^[1] The theorem states that any estimator that is unbiased for a given unknown quantity and that depends on the data only through a complete, sufficient statistic is the unique best unbiased estimator of that quantity. The Lehmann–Scheffé theorem is named after Erich Leo Lehmann and Henry Scheffé, given their two early papers.^[2]^[3]

If T is a complete sufficient statistic for θ and E(g(T)) = τ(θ) then g(T) is the uniformly minimum-variance unbiased estimator (UMVUE) of τ(θ).

Statement

Let ${\vec {X}}=X_{1},X_{2},\dots ,X_{n}$ be a random sample from a distribution that has p.d.f (or p.m.f in the discrete case) $f(x:\theta )$ where $\theta \in \Omega$ is a parameter in the parameter space. Suppose $Y=u({\vec {X}})$ is a sufficient statistic for θ, and let $\{f_{Y}(y:\theta ):\theta \in \Omega \}$ be a complete family. If $\varphi :\operatorname {E} [\varphi (Y)]=\theta$ then $\varphi (Y)$ is the unique MVUE of θ.

Proof

By the Rao–Blackwell theorem, if $Z$ is an unbiased estimator of θ then $\varphi (Y):=\operatorname {E} [Z\mid Y]$ defines an unbiased estimator of θ with the property that its variance is not greater than that of $Z$ .

Now we show that this function is unique. Suppose $W$ is another candidate MVUE estimator of θ. Then again $\psi (Y):=\operatorname {E} [W\mid Y]$ defines an unbiased estimator of θ with the property that its variance is not greater than that of $W$ . Then

\operatorname {E} [\varphi (Y)-\psi (Y)]=0,\theta \in \Omega .

Since $\{f_{Y}(y:\theta ):\theta \in \Omega \}$ is a complete family

\operatorname {E} [\varphi (Y)-\psi (Y)]=0\implies \varphi (y)-\psi (y)=0,\theta \in \Omega

and therefore the function $\varphi$ is the unique function of Y with variance not greater than that of any other unbiased estimator. We conclude that $\varphi (Y)$ is the MVUE.

Example for when using a non-complete minimal sufficient statistic

An example of an improvable Rao–Blackwell improvement, when using a minimal sufficient statistic that is not complete, was provided by Galili and Meilijson in 2016.^[4] Let $X_{1},\ldots ,X_{n}$ be a random sample from a scale-uniform distribution $X\sim U((1-k)\theta ,(1+k)\theta ),$ with unknown mean $\operatorname {E} [X]=\theta$ and known design parameter $k\in (0,1)$ . In the search for "best" possible unbiased estimators for $\theta$ , it is natural to consider $X_{1}$ as an initial (crude) unbiased estimator for $\theta$ and then try to improve it. Since $X_{1}$ is not a function of $T=\left(X_{(1)},X_{(n)}\right)$ , the minimal sufficient statistic for $\theta$ (where $X_{(1)}=\min _{i}X_{i}$ and $X_{(n)}=\max _{i}X_{i}$ ), it may be improved using the Rao–Blackwell theorem as follows:

{\hat {\theta }}_{RB}=\operatorname {E} _{\theta }[X_{1}\mid X_{(1)},X_{(n)}]={\frac {X_{(1)}+X_{(n)}}{2}}.

However, the following unbiased estimator can be shown to have lower variance:

{\hat {\theta }}_{LV}={\frac {1}{k^{2}{\frac {n-1}{n+1}}+1}}\cdot {\frac {(1-k)X_{(1)}+(1+k)X_{(n)}}{2}}.

And in fact, it could be even further improved when using the following estimator:

{\hat {\theta }}_{\text{BAYES}}={\frac {n+1}{n}}\left[1-{\frac {{\frac {X_{(1)}(1+k)}{X_{(n)}(1-k)}}-1}{\left({\frac {X_{(1)}(1+k)}{X_{(n)}(1-k)}}\right)^{n+1}-1}}\right]{\frac {X_{(n)}}{1+k}}

The model is a scale model. Optimal equivariant estimators can then be derived for loss functions that are invariant.^[5]

References

^ Casella, George (2001). Statistical Inference. Duxbury Press. p. 369. ISBN 978-0-534-24312-8.
^ Lehmann, E. L.; Scheffé, H. (1950). "Completeness, similar regions, and unbiased estimation. I." Sankhyā. 10 (4): 305–340. doi:10.1007/978-1-4614-1412-4_23. JSTOR 25048038. MR 0039201.
^ Lehmann, E.L.; Scheffé, H. (1955). "Completeness, similar regions, and unbiased estimation. II". Sankhyā. 15 (3): 219–236. doi:10.1007/978-1-4614-1412-4_24. JSTOR 25048243. MR 0072410.
^ Tal Galili; Isaac Meilijson (31 Mar 2016). "An Example of an Improvable Rao–Blackwell Improvement, Inefficient Maximum Likelihood Estimator, and Unbiased Generalized Bayes Estimator". The American Statistician. 70 (1): 108–113. doi:10.1080/00031305.2015.1100683. PMC 4960505. PMID 27499547.
^ Taraldsen, Gunnar (2020). "Micha Mandel (2020), "The Scaled Uniform Model Revisited," The American Statistician, 74:1, 98–100: Comment". The American Statistician. 74 (3): 315. doi:10.1080/00031305.2020.1769727. S2CID 219493070.

[Casella-1] Casella, George (2001). Statistical Inference. Duxbury Press. p. 369. ISBN 978-0-534-24312-8.

[LS1-2] Lehmann, E. L.; Scheffé, H. (1950). "Completeness, similar regions, and unbiased estimation. I." Sankhyā. 10 (4): 305–340. doi:10.1007/978-1-4614-1412-4_23. JSTOR 25048038. MR 0039201.

[LS2-3] Lehmann, E.L.; Scheffé, H. (1955). "Completeness, similar regions, and unbiased estimation. II". Sankhyā. 15 (3): 219–236. doi:10.1007/978-1-4614-1412-4_24. JSTOR 25048243. MR 0072410.

[4] Tal Galili; Isaac Meilijson (31 Mar 2016). "An Example of an Improvable Rao–Blackwell Improvement, Inefficient Maximum Likelihood Estimator, and Unbiased Generalized Bayes Estimator". The American Statistician. 70 (1): 108–113. doi:10.1080/00031305.2015.1100683. PMC 4960505. PMID 27499547.

[5] Taraldsen, Gunnar (2020). "Micha Mandel (2020), "The Scaled Uniform Model Revisited," The American Statistician, 74:1, 98–100: Comment". The American Statistician. 74 (3): 315. doi:10.1080/00031305.2020.1769727. S2CID 219493070.

[1]

[2]

[3]

[4]

[5]

@@ Line 1: / Line 1: @@
+{{Short description|Theorem in statistics}}
 {{Refimprove|date=April 2011}}
-In [[statistics]], the '''Lehmann–Scheffé theorem''' is a prominent statement, tying together the ideas of completeness, sufficiency, uniqueness, and best unbiased estimation.<ref name=Casella/> The theorem states that any [[estimator]] which is [[unbiased estimator|unbiased]] for a given unknown quantity and that depends on the data only through a [[completeness (statistics)|complete]], [[sufficiency (statistics)|sufficient statistic]] is the unique [[best unbiased estimator]] of that quantity. The Lehmann–Scheffé theorem is named after [[Erich Leo Lehmann]] and [[Henry Scheffé]], given their two early papers.<ref name=LS1/><ref name=LS2/>
+In [[statistics]], the '''Lehmann–Scheffé theorem''' is a prominent statement, tying together the ideas of completeness, sufficiency, uniqueness, and best unbiased estimation.<ref name=Casella/> The theorem states that any [[estimator]] that is [[unbiased estimator|unbiased]] for a given unknown quantity and that depends on the data only through a [[completeness (statistics)|complete]], [[sufficiency (statistics)|sufficient statistic]] is the unique [[best unbiased estimator]] of that quantity. The Lehmann–Scheffé theorem is named after [[Erich Leo Lehmann]] and [[Henry Scheffé]], given their two early papers.<ref name=LS1/><ref name=LS2/>
 If ''T'' is a complete sufficient statistic for ''θ'' and E(''g''(''T''))&nbsp;=&nbsp;''&tau;''(''&theta;'') then ''g''(''T'') is the [[uniformly minimum-variance unbiased estimator]] (UMVUE) of&nbsp;''τ''(''&theta;'').
@@ Line 28: / Line 29: @@
 == Example for when using a non-complete minimal sufficient statistic ==
-An example of an improvable Rao–Blackwell improvement, when using a minimal sufficient statistic that is '''not complete''', was provided by Galili and Meilijson in 2016.<ref>{{cite journal|title= An Example of an Improvable Rao–Blackwell Improvement, Inefficient Maximum Likelihood Estimator, and Unbiased Generalized Bayes Estimator | authors = Tal Galili & Isaac Meilijson | date = 31 Mar 2016 | journal = The American Statistician | volume = 70 | issue = 1 | url = http://www.tandfonline.com/doi/abs/10.1080/00031305.2015.1100683?journalCode=utas20 | pages = 108–113 |doi=10.1080/00031305.2015.1100683}}</ref> Let <math>X_1, \ldots, X_n</math> be a random sample from a scale-uniform distribution <math>X \sim U ( (1-k) \theta, (1+k) \theta),</math> with unknown mean <math>\operatorname{E}[X]=\theta</math> and known design parameter <math>k \in (0,1)</math>. In the search for "best" possible unbiased estimators for <math>\theta</math>, it is natural to consider <math>X_1</math> as an initial (crude) unbiased estimator for <math>\theta</math> and then try to improve it. Since <math>X_1</math> is not a function of <math>T = \left( X_{(1)}, X_{(n)}  \right)</math>, the minimal sufficient statistic for <math>\theta</math> (where <math>X_{(1)} = \min_i X_i </math> and <math>X_{(n)} = \max_i X_i </math>), it may be improved using the Rao–Blackwell theorem as follows:
+An example of an improvable Rao–Blackwell improvement, when using a minimal sufficient statistic that is '''not complete''', was provided by Galili and Meilijson in 2016.<ref>{{cite journal|title= An Example of an Improvable Rao–Blackwell Improvement, Inefficient Maximum Likelihood Estimator, and Unbiased Generalized Bayes Estimator |author1=Tal Galili |author2=Isaac Meilijson | date = 31 Mar 2016 | journal = The American Statistician | volume = 70 | issue = 1 | pages = 108–113 |doi=10.1080/00031305.2015.1100683| pmc = 4960505 | pmid=27499547}}</ref> Let <math>X_1, \ldots, X_n</math> be a random sample from a scale-uniform distribution <math>X \sim U ( (1-k) \theta, (1+k) \theta),</math> with unknown mean <math>\operatorname{E}[X]=\theta</math> and known design parameter <math>k \in (0,1)</math>. In the search for "best" possible unbiased estimators for <math>\theta</math>, it is natural to consider <math>X_1</math> as an initial (crude) unbiased estimator for <math>\theta</math> and then try to improve it. Since <math>X_1</math> is not a function of <math>T = \left( X_{(1)}, X_{(n)}  \right)</math>, the minimal sufficient statistic for <math>\theta</math> (where <math>X_{(1)} = \min_i X_i </math> and <math>X_{(n)} = \max_i X_i </math>), it may be improved using the Rao–Blackwell theorem as follows:
 :<math>\hat{\theta}_{RB} =\operatorname{E}_\theta[X_1\mid X_{(1)}, X_{( n)}] = \frac{X_{(1)}+X_{(n)}} 2.</math>
@@ Line 39: / Line 40: @@
 :<math>\hat{\theta}_\text{BAYES}=\frac{n+1} n \left[1- \frac{\frac{X_{(1)} (1+k)}{X_{(n)} (1-k)}-1}{ \left (\frac{X_{(1)} (1+k)}{X_{(n)} (1-k)}\right )^{n+1} -1} \right] \frac{X_{(n)}}{1+k}</math>
+The model is a [[Scale parameter|scale model]]. Optimal [[Equivariant Estimator|equivariant estimators]] can then be derived for [[loss function]]s that are invariant.<ref>{{Cite journal|last=Taraldsen|first=Gunnar|date=2020|title=Micha Mandel (2020), "The Scaled Uniform Model Revisited," The American Statistician, 74:1, 98–100: Comment|url=https://doi.org/10.1080/00031305.2020.1769727|journal=The American Statistician|volume=74|issue=3|pages=315|doi=10.1080/00031305.2020.1769727|s2cid=219493070 |issn=}}</ref>
 ==See also==
 *[[Basu's theorem]]
+*[[Completeness (statistics)]]
-*[[Complete class theorem]]
 *[[Rao–Blackwell theorem]]
@@ Line 53: / Line 56: @@
  |journal=[[Sankhya (journal)|Sankhyā]]
  |volume=10 |issue=4 |year=1950 |pages=305–340
- |mr=39201 |jstor=25048038}}
+ |mr=39201 |jstor=25048038 |doi=10.1007/978-1-4614-1412-4_23|doi-access=free }}
 </ref>
 <ref name=LS2>{{cite journal
@@ Line 61: / Line 64: @@
  |journal=[[Sankhya (journal)|Sankhyā]]
  |volume=15 |issue=3 |year=1955 |pages=219–236
- |mr=72410 |jstor=25048243}}
+ |mr=72410 |jstor=25048243 |doi=10.1007/978-1-4614-1412-4_24|doi-access=free }}
 </ref>
 <ref name=Casella>{{cite book
@@ Line 67: / Line 70: @@
  |title=Statistical Inference
  |year=2001 |publisher=Duxbury Press
- |isbn=0-534-24312-6 |page=369}}
+ |isbn=978-0-534-24312-8 |page=369}}
 </ref>
 }}
@@ Line 74: / Line 77: @@
 {{DEFAULTSORT:Lehmann-Scheffe theorem}}
-[[Category:Statistical theorems]]
+[[Category:Theorems in statistics]]
 [[Category:Estimation theory]]