User:Slava3087

In classical statistical decision theory, where we are faced with the problem of estimating a deterministic parameter (vector) $\theta \in \Theta$ from observations $x\in {\mathcal {X}}$ , an estimator (estimation rule) $\delta ^{M}\,\!$ is called minimax if its maximal risk is minimal among all estimators of $\theta \,\!$ . In a sense this means that $\delta ^{M}\,\!$ is an estimator which performs best in the worst possible case allowed in the problem.

Problem Setup

Consider the problem of estimating a deterministic (not Bayesian) parameter $\theta \in \Theta$ from noisy or corrupted data $x\in {\mathcal {X}}$ related through the conditional probability distribution $P(x|\theta )\,\!$ . Our goal is to find a "good" estimator $\delta (x)\,\!$ of the parameter $\theta \,\!$ , one which minimizes some given risk function $R(\theta ,\delta )\,\!$ . Here the risk function is the expectation of some loss function $L(\theta ,\delta )\,\!$ with respect to $P(x|\theta )\,\!$ . A popular example of a loss function is the squared error loss $L(\theta ,\delta )=\|\theta -\delta \|^{2}\,\!$ , and the risk function for this loss is the mean squared error (MSE).

Unfortunately, in general the risk cannot be minimized, since it depends on the unknown parameter $\theta \,\!$ itself (if we knew the actual value of $\theta \,\!$ , we would not need to estimate it). Therefore an additional criterion is required to define an estimator that is optimal in some sense. One such criterion is the minimax criterion.
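The dependence of the risk on $\theta \,\!$ can be seen in a small simulation. The sketch below is only an illustration: it assumes Gaussian noise $x\sim N(\theta ,I\sigma ^{2})\,\!$ and two hypothetical estimators, `identity` and `shrink`, chosen here for the comparison. It estimates the MSE by Monte Carlo at two parameter values.

```python
import random

def mse_risk(estimator, theta, sigma=1.0, trials=20000, seed=0):
    """Monte Carlo estimate of R(theta, delta) = E||theta - delta(x)||^2,
    assuming x ~ N(theta, sigma^2 I) (an assumption made for this sketch)."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        x = [rng.gauss(t, sigma) for t in theta]
        d = estimator(x)
        total += sum((ti - di) ** 2 for ti, di in zip(theta, d))
    return total / trials

def identity(x):
    # delta(x) = x
    return list(x)

def shrink(x):
    # delta(x) = x / 2, a hypothetical estimator that pulls toward the origin
    return [0.5 * xi for xi in x]

for theta in ([0.0, 0.0], [3.0, 3.0]):
    print(theta, mse_risk(identity, theta), mse_risk(shrink, theta))
```

Under these assumptions `shrink` has the smaller risk near the origin and the larger risk far from it, so neither estimator is better for every $\theta \,\!$ .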

Definition

Definition : An estimator $\delta ^{M}:{\mathcal {X}}\rightarrow \Theta \,\!$ is called minimax with respect to a risk function $R(\theta ,\delta )\,\!$ if it achieves the smallest maximum risk among all estimators, meaning it satisfies

\sup _{\theta \in \Theta }R(\theta ,\delta ^{M})=\inf _{\delta }\sup _{\theta \in \Theta }R(\theta ,\delta )\,\!

.
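The definition can be tried out numerically on a restricted problem. The sketch below is not a general minimax solver. It assumes a scalar observation $x\sim N(\theta ,\sigma ^{2})\,\!$ , a bounded parameter $|\theta |\leq B\,\!$ , and only the hypothetical family of linear estimators $\delta _{c}(x)=cx\,\!$ , whose exact risk is $c^{2}\sigma ^{2}+(1-c)^{2}\theta ^{2}\,\!$ . It therefore finds the best worst-case risk within this family, not the infimum over all estimators.

```python
def risk(c, theta, sigma=1.0):
    # Exact MSE of delta_c(x) = c * x for x ~ N(theta, sigma^2):
    # variance term plus squared bias.
    return c ** 2 * sigma ** 2 + (1 - c) ** 2 * theta ** 2

B = 1.0                                                   # assumed bound on |theta|
thetas = [-B + 2 * B * k / 200 for k in range(201)]       # grid over the parameter set
cs = [i / 100 for i in range(101)]                        # candidate estimators

def max_risk(c):
    # Worst-case risk of delta_c over the grid of theta values.
    return max(risk(c, t) for t in thetas)

best_c = min(cs, key=max_risk)                            # smallest maximal risk
print(best_c, max_risk(best_c))
```

For this family the worst case is attained at $|\theta |=B\,\!$ , so the grid search should agree with the closed form $c=B^{2}/(\sigma ^{2}+B^{2})\,\!$ .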

Least Favorable Distribution

Logically, an estimator is minimax when it is the best in the worst case. Continuing this logic, a minimax estimator should be a Bayes estimator with respect to a least favorable prior distribution of $\theta \,\!$ . To make this notion precise, denote the average risk of the Bayes estimator $\delta _{\pi }\,\!$ with respect to a prior distribution $\pi \,\!$ as

r_{\pi }=\int R(\theta ,\delta _{\pi })d\pi (\theta )\,\!

Definition : A prior distribution $\pi \,\!$ is called least favorable if for any other distribution $\pi '\,\!$ the average risk satisfies $r_{\pi }\geq r_{\pi '}\,\!$ .

Theorem : If $r_{\pi }=\sup _{\theta }R(\theta ,\delta _{\pi })\,\!$ , then:

1) $\delta _{\pi }\,\!$ is minimax.

2) If $\delta _{\pi }\,\!$ is a unique Bayes estimator, it is also the unique minimax estimator.

3) $\pi \,\!$ is least favorable.

Corollary: If a Bayes estimator has constant risk, it is minimax. Note that constant risk is not a necessary condition.
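A standard textbook illustration of a constant-risk Bayes estimator, which is not part of the text above and is assumed here only as an example, is the binomial model $x\sim \mathrm {Bin} (n,p)\,\!$ with a $\mathrm {Beta} ({\sqrt {n}}/2,{\sqrt {n}}/2)\,\!$ prior under squared error loss. The posterior mean is $\delta (x)=(x+{\sqrt {n}}/2)/(n+{\sqrt {n}})\,\!$ . The sketch below computes its exact risk at several values of $p\,\!$ and compares it with $1/(4(1+{\sqrt {n}})^{2})\,\!$ .

```python
import math

def binom_pmf(k, n, p):
    # Probability of k successes in n Bernoulli(p) trials.
    return math.comb(n, k) * p ** k * (1 - p) ** (n - k)

def risk(n, p):
    # Exact MSE of the posterior mean under the assumed Beta(sqrt(n)/2, sqrt(n)/2) prior.
    a = math.sqrt(n) / 2
    return sum(
        binom_pmf(k, n, p) * ((k + a) / (n + 2 * a) - p) ** 2
        for k in range(n + 1)
    )

n = 25
ps = (0.0, 0.1, 0.3, 0.5, 0.9, 1.0)
risks = [risk(n, p) for p in ps]
constant = 1 / (4 * (1 + math.sqrt(n)) ** 2)

for p, r in zip(ps, risks):
    print(p, r, constant)
```

The risk does not depend on $p\,\!$ , so the average risk equals the maximal risk and the condition of the theorem is met for this assumed model.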

Example: Consider the problem of estimating the mean of an $n\,\!$ dimensional Gaussian white random vector, $x\sim N(\theta ,I_{n}\sigma ^{2})\,\!$ . The maximum likelihood (ML) estimator for $\theta \,\!$ in this case is simply $\delta _{ML}=x\,\!$ , and its risk is

R(\theta ,\delta _{ML})=E\left\{\|\delta _{ML}-\theta \|^{2}\right\}=\sum _{i=1}^{n}E\left\{(x_{i}-\theta _{i})^{2}\right\}=n\sigma ^{2}\,\!

.

So the risk is constant, and the ML estimator is indeed minimax. Nonetheless, minimaxity does not imply admissibility. In fact, in this example the ML estimator is known to be inadmissible (not admissible) whenever $n>2\,\!$ . The famous James-Stein estimator dominates the ML estimator whenever $n>2\,\!$ . Though both estimators have the same risk $n\sigma ^{2}\,\!$ when $\|\theta \|\rightarrow \infty \,\!$ , and both are minimax, the James-Stein estimator has smaller risk for any finite $\|\theta \|\,\!$ . This fact is illustrated in the following figure.
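The comparison can also be checked by simulation. The sketch below assumes the usual form of the James-Stein estimator, $\delta _{JS}=\left(1-(n-2)\sigma ^{2}/\|x\|^{2}\right)x\,\!$ , which is not written out in the text. The function name `simulate_risks` and the chosen parameter values are hypothetical. Both risks are estimated by Monte Carlo from the same samples.

```python
import random

def simulate_risks(theta, sigma=1.0, trials=20000, seed=0):
    """Monte Carlo MSE of the ML and James-Stein estimators for x ~ N(theta, sigma^2 I)."""
    rng = random.Random(seed)
    n = len(theta)
    ml_total = 0.0
    js_total = 0.0
    for _ in range(trials):
        x = [rng.gauss(t, sigma) for t in theta]
        norm_sq = sum(xi * xi for xi in x)
        # James-Stein shrinkage factor (assumed standard form, requires n > 2).
        factor = 1.0 - (n - 2) * sigma ** 2 / norm_sq
        js = [factor * xi for xi in x]
        ml_total += sum((xi - t) ** 2 for xi, t in zip(x, theta))
        js_total += sum((ji - t) ** 2 for ji, t in zip(js, theta))
    return ml_total / trials, js_total / trials

n = 10
for scale in (0.0, 1.0, 5.0):
    theta = [scale] * n
    ml, js = simulate_risks(theta)
    print(scale, ml, js)
```

In such a run the ML risk should stay close to $n\sigma ^{2}\,\!$ for every $\theta \,\!$ , while the James-Stein risk is lowest at the origin and approaches $n\sigma ^{2}\,\!$ as $\|\theta \|\,\!$ grows.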

Strictly speaking, the ML estimator is not an actual Bayes estimator but rather the limit of a sequence of Bayes estimators, so the notion of a least favorable prior has to be extended to sequences of priors.

Definition : A sequence of prior distributions ${\pi }_{n}\,\!$ is called least favorable if for any other distribution $\pi '\,\!$ ,

\lim _{n\rightarrow \infty }r_{\pi _{n}}\geq r_{\pi '}\,\!

.

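Theorem : If there are a sequence of priors ${\pi }_{n}\,\!$ and an estimator $\delta \,\!$ such that $\sup _{\theta }R(\theta ,\delta )=\lim _{n\rightarrow \infty }r_{\pi _{n}}\,\!$ , then:

1) $\delta \,\!$ is minimax.

2) The sequence ${\pi }_{n}\,\!$ is least favorable.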
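For the Gaussian mean example, one natural sequence of priors is $\theta \sim N(0,\tau ^{2}I)\,\!$ with $\tau \rightarrow \infty \,\!$ . This choice is an assumption made here and is not stated in the text. Under squared error loss the Bayes estimator is $\tau ^{2}x/(\sigma ^{2}+\tau ^{2})\,\!$ and its average risk is $n\sigma ^{2}\tau ^{2}/(\sigma ^{2}+\tau ^{2})\,\!$ . The short sketch below evaluates this expression for growing $\tau \,\!$ .

```python
def bayes_risk(n, sigma, tau):
    # Average risk of the posterior mean under the assumed N(0, tau^2 I) prior.
    return n * sigma ** 2 * tau ** 2 / (sigma ** 2 + tau ** 2)

n, sigma = 5, 1.0
taus = (1.0, 10.0, 100.0, 1000.0)
values = [bayes_risk(n, sigma, tau) for tau in taus]

for tau, r in zip(taus, values):
    print(tau, r, n * sigma ** 2)
```

The Bayes risks increase toward $n\sigma ^{2}\,\!$ , the constant risk of the ML estimator, and the Bayes estimators themselves tend to $x\,\!$ . This is the sense in which the ML estimator is a limit of Bayes estimators.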