Neyman construction

The Neyman construction, named after Jerzy Neyman, is a frequentist method for constructing an interval at a confidence level $C$ such that, if the experiment is repeated many times, the interval will contain the true value of some parameter a fraction $C$ of the time.

Theory

Assume $X_{1},X_{2},...,X_{n}$ are random variables with joint pdf $f(x_{1},x_{2},...,x_{n}|\theta _{1},\theta _{2},...,\theta _{k})$ , which depends on $k$ unknown parameters. For convenience, let $\Theta$ be the sample space defined by the $n$ random variables, and define a sample point in the sample space as $X=(X_{1},X_{2},...,X_{n})$ .

Neyman originally proposed defining two functions $L(x)$ and $U(x)$ such that, for any sample point $X$ ,

$L(X)\leq U(X)$ $\forall X\in \Theta$
$L$ and $U$ are single valued and defined.

Given an observation, $X^{'}$ , the probability that $\theta _{1}$ lies between $L(X^{'})$ and $U(X^{'})$ is $P(L(X^{'})\leq \theta _{1}\leq U(X^{'})|X^{'})$ , which is either $0$ or $1$ . Such a probability supports no meaningful inference about $\theta _{1}$ , because under the frequentist construct the model parameters are unknown constants and not permitted to be random variables.^[1] For example, if the observed limits are $2$ and $10$ and $\theta _{1}=5$ , then $P(2\leq 5\leq 10)=1$ . Likewise, if $\theta _{1}=11$ , then $P(2\leq 11\leq 10)=0$ .

As Neyman describes in his 1937 paper, suppose that we consider all points in the sample space, that is, $\forall X\in \Theta$ , which are a system of random variables defined by the joint pdf described above. Since $L$ and $U$ are functions of $X$ they too are random variables and one can examine the meaning of the following probability statement:

Under the frequentist construct the model parameters are unknown constants and not permitted to be random variables. Considering all the sample points in the sample space as random variables defined by the joint pdf above, that is all $X\in \Theta$ , it can be shown that $L$ and $U$ are functions of random variables and hence random variables. Therefore one can look at the probability of $L(X)$ and $U(X)$ for some $X\in \Theta$ . If $\theta _{1}^{'}$ is the true value of $\theta _{1}$ , we can define $L$ and $U$ such that the probability of $L(X)\leq \theta _{1}^{'}$ and $\theta _{1}^{'}\leq U(X)$ is equal to a pre-specified confidence level, $C$ .

That is, $P(L(X)\leq \theta _{1}^{'}\leq U(X)|\theta _{1}^{'})=C$ where $0\leq C\leq 1$ and $L(X)$ and $U(X)$ are the lower and upper confidence limits for $\theta _{1}$ .^[1]

Coverage probability

The coverage probability, $C$ , of a Neyman construction is the long-run fraction of experiments in which the confidence interval contains the actual value of interest. A common choice is $C=95\%$ , but the construction works for any pre-specified value $C$ where $0<C<1$ .
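The frequency interpretation of coverage can be checked by simulation. The following is a minimal sketch, not part of the original construction: it assumes normally distributed data with a known standard deviation, and the function name `estimate_coverage` and all parameter values are hypothetical choices made for illustration.

```python
import random
from statistics import NormalDist, fmean


def estimate_coverage(theta, sigma, n, C, trials, seed=0):
    """Fraction of simulated experiments whose interval contains theta."""
    rng = random.Random(seed)
    # Two-sided standard normal critical value for confidence level C.
    z = NormalDist().inv_cdf(0.5 + C / 2)
    half_width = z * sigma / n ** 0.5  # assumes sigma is known
    hits = 0
    for _ in range(trials):
        # One repetition of the experiment: draw n observations.
        sample = [rng.gauss(theta, sigma) for _ in range(n)]
        xbar = fmean(sample)
        if xbar - half_width <= theta <= xbar + half_width:
            hits += 1
    return hits / trials


coverage = estimate_coverage(theta=5.0, sigma=2.0, n=20, C=0.95, trials=20000)
print(f"estimated coverage: {coverage:.3f}")
```

With enough repetitions the printed fraction should settle near the chosen $C$ , whatever true value of the parameter is used.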

Implementation

A Neyman construction can be carried out by performing multiple experiments that construct data sets corresponding to a given value of the parameter. The experiments are fitted with conventional methods, and the space of fitted parameter values constitutes the band from which the confidence interval can be selected.
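The procedure described above can be sketched with pseudo-experiments. This is an illustrative sketch under stated assumptions rather than a definitive implementation: the data are assumed normal with known standard deviation, the "fit" is simply the sample mean, the acceptance range for each trial parameter value is taken to be central, and the names `neyman_belt` and `invert_belt` are hypothetical.

```python
import random
from statistics import fmean


def neyman_belt(theta_grid, sigma, n, C, n_experiments, seed=0):
    """For each trial theta, simulate experiments and record the central
    range that holds a fraction C of the fitted estimates."""
    rng = random.Random(seed)
    k = int((1 - C) / 2 * n_experiments)  # estimates cut from each tail
    belt = []
    for theta in theta_grid:
        # "Fit" each pseudo-experiment; here the fit is just the sample mean.
        estimates = sorted(
            fmean([rng.gauss(theta, sigma) for _ in range(n)])
            for _ in range(n_experiments)
        )
        belt.append((theta, estimates[k], estimates[-k - 1]))
    return belt


def invert_belt(belt, observed):
    """Return the range of theta whose acceptance range contains `observed`."""
    accepted = [theta for theta, lo, hi in belt if lo <= observed <= hi]
    return min(accepted), max(accepted)


theta_grid = [3.0 + 0.04 * i for i in range(101)]  # trial values 3.0 .. 7.0
belt = neyman_belt(theta_grid, sigma=1.0, n=4, C=0.95, n_experiments=2000)
lower, upper = invert_belt(belt, observed=5.0)
print(f"approximate 95% interval: [{lower:.2f}, {upper:.2f}]")
```

For these assumed settings the result should land close to $5\pm 1.96\cdot \sigma /{\sqrt {n}}=5\pm 0.98$ , up to the grid spacing and Monte Carlo noise.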

Classic example

Plot of 50 confidence intervals from 50 samples generated from a normal distribution.

Suppose $X\sim N(\theta ,\sigma ^{2})$ , where $\theta$ and $\sigma ^{2}$ are unknown constants and we wish to estimate $\theta$ . We can define two single-valued functions, $L$ and $U$ , by the process above such that, given a pre-specified confidence level $C$ and a random sample $X^{*}=(x_{1},x_{2},...,x_{n})$ ,

$L(X^{*})={\bar {x}}-t{\frac {s}{\sqrt {n}}}$

$U(X^{*})={\bar {x}}+t{\frac {s}{\sqrt {n}}}$

where $s/{\sqrt {n}}$ is the standard error, and the sample mean and standard deviation are:

${\bar {x}}={\frac {1}{n}}\sum _{i=1}^{n}x_{i}={\frac {1}{n}}(x_{1}+x_{2}+\cdots +x_{n})$

$s={\sqrt {{\frac {1}{n-1}}\sum _{i=1}^{n}(x_{i}-{\bar {x}})^{2}}}$

The factor $t$ is the upper $(1-C)/2$ critical value of the t distribution with $n-1$ degrees of freedom, $t=t_{(1-C)/2,\,n-1}$ .^[2]
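The limits $L$ and $U$ of this example can be computed directly from a sample. The sketch below uses only the Python standard library, so the critical value is passed in rather than computed; the function name `t_interval`, the sample values, and the tabulated value $2.262$ (the usual table entry for $C=0.95$ with $9$ degrees of freedom) are assumptions made for illustration.

```python
from math import sqrt
from statistics import mean, stdev


def t_interval(sample, t_crit):
    """Return (L, U) = xbar -/+ t * s / sqrt(n) for the given sample."""
    n = len(sample)
    xbar = mean(sample)
    s = stdev(sample)  # sample standard deviation, n - 1 in the denominator
    half_width = t_crit * s / sqrt(n)
    return xbar - half_width, xbar + half_width


# Hypothetical sample of n = 10 observations.
sample = [4.9, 5.6, 5.1, 4.4, 5.3, 5.8, 4.7, 5.0, 5.2, 5.0]
# Assumed tabulated critical value for C = 0.95 and n - 1 = 9 degrees of freedom.
lower, upper = t_interval(sample, t_crit=2.262)
print(f"95% interval for theta: [{lower:.3f}, {upper:.3f}]")
```

Passing the critical value as an argument keeps the sketch dependency-free; a statistics library with a t-distribution quantile function could supply it instead.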

Another example

Let $X_{1},X_{2},...,X_{n}$ be iid random variables with $X_{i}\sim N(\mu ,\sigma ^{2})$ , where $\sigma$ is known, and let $T=(X_{1},X_{2},...,X_{n})$ . To construct a confidence interval for $\mu$ at confidence level $C$ , write $\alpha =1-C$ and let $Z_{\frac {\alpha }{2}}$ be the upper $\alpha /2$ critical value of the standard normal distribution. We know ${\bar {x}}$ is sufficient for $\mu$ , and $({\bar {x}}-\mu )/(\sigma /{\sqrt {n}})$ follows a standard normal distribution. So,

$P(-Z_{\frac {\alpha }{2}}\leq {\frac {{\bar {x}}-\mu }{\sigma /{\sqrt {n}}}}\leq Z_{\frac {\alpha }{2}})=C$

$P(-Z_{\frac {\alpha }{2}}{\frac {\sigma }{\sqrt {n}}}\leq {\bar {x}}-\mu \leq Z_{\frac {\alpha }{2}}{\frac {\sigma }{\sqrt {n}}})=C$

$P({\bar {x}}-Z_{\frac {\alpha }{2}}{\frac {\sigma }{\sqrt {n}}}\leq \mu \leq {\bar {x}}+Z_{\frac {\alpha }{2}}{\frac {\sigma }{\sqrt {n}}})=C$

This produces a $100(C)\%$ confidence interval for $\mu$ where

$L(T)={\bar {x}}-Z_{\frac {\alpha }{2}}{\frac {\sigma }{\sqrt {n}}}$

$U(T)={\bar {x}}+Z_{\frac {\alpha }{2}}{\frac {\sigma }{\sqrt {n}}}$ .^[3]
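As a numerical illustration of this interval, the sketch below evaluates the limits from summary statistics. It assumes that $\sigma$ is known and that the sample mean has standard error $\sigma /{\sqrt {n}}$ ; the function name `z_interval` and the input values are hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def z_interval(xbar, sigma, n, C):
    """Return (L, U) = xbar -/+ Z_{alpha/2} * sigma / sqrt(n), alpha = 1 - C."""
    alpha = 1 - C
    # Upper alpha/2 critical value of the standard normal distribution.
    z = NormalDist().inv_cdf(1 - alpha / 2)
    half_width = z * sigma / sqrt(n)
    return xbar - half_width, xbar + half_width


# Hypothetical summary statistics: sample mean 10 from n = 25 draws, sigma = 2.
lower, upper = z_interval(xbar=10.0, sigma=2.0, n=25, C=0.95)
print(f"95% interval for mu: [{lower:.3f}, {upper:.3f}]")
```

Here the standard error is $2/{\sqrt {25}}=0.4$ , so the half-width is about $1.96\times 0.4\approx 0.784$ .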

References

[1] Neyman, J. (1937). "Outline of a Theory of Statistical Estimation Based on the Classical Theory of Probability". Philosophical Transactions of the Royal Society of London. Series A, Mathematical and Physical Sciences. 236 (767): 333–380. doi:10.1098/rsta.1937.0005. JSTOR 91337.
[2] Rao, C. Radhakrishna (1973). Linear Statistical Inference and its Applications: Second Edition. John Wiley & Sons. pp. 470–472. ISBN 9780471708230.
[3] Samaniego, Francisco J. (2014). Stochastic Modeling and Mathematical Statistics. Chapman and Hall/CRC. p. 347. ISBN 9781466560468.
