next up previous
Next: Challenges Up: Robust Bootstrap Previous: Robust Bootstrap for Location

Robust Bootstrap for Location and Scale

Assume the location model (2) with unknown scale, and let Tn and Sn be defined by (4) and (5). Under mild assumptions (see Maronna and Yohai, 1981) we have

 \begin{displaymath}
\sqrt{n} \left( \left(
\begin{array}{c} T_n \\ S_n \end{...
...) \right)
\longrightarrow N_2 \left( {\bf0}, \Sigma \right),
\end{displaymath} (32)

where N2 denotes a normal distribution in ${\mbox{I}\!\mbox{R}}^2$, and the asymptotic variance-covariance matrix is

 \begin{displaymath}
\Sigma \ \ = \ \ D^{-1} \ C \ \left( D^{-1} \right)',
\end{displaymath} (33)

where

 \begin{displaymath}
C = \left( \begin{array}{cc}
E_F \psi^2 \left( Y \right) ...
...t] &
E_F \chi^2 \left( Y \right) - b^2
\end{array} \right),
\end{displaymath} (34)


 \begin{displaymath}
D = -\frac{1}{S_\infty} \ \left( \begin{array}{cc}
E_F \p...
...\left[ \chi' \left( Y \right) Y \right]
\end{array} \right),
\end{displaymath} (35)

and $Y = ( X - T_\infty) / S_\infty$. Let the weights \( \omega_i \left( T_n, S_n \right) \) and \( v_i \left( T_n, S_n \right) \) be defined as

\begin{eqnarray*}\omega_i \left( T_n, S_n \right) & = & \left. \frac{
\psi \lef...
...( \frac{
x_i - T_n }{S_n} \right) \hspace{.5in} 1 \le i \le n.
\end{eqnarray*}


Similarly define \( \omega_i \left( T_\infty, S_\infty
\right) \) and \( v_i \left( T_\infty, S_\infty\right) \) for \(
1 \le i \le n\). We have

\begin{displaymath}\left(
\begin{array}{c}
T_n \\ S_n \end{array} \right)
= \...
...n, S_n \right)
\left( x_i - T_n \right) \end{array} \right).
\end{displaymath}

To simplify the notation call

\begin{displaymath}\theta_n \equiv
\left( \begin{array}{c} \theta_{n1} \\ \thet...
...ft( \begin{array}{c}
T_\infty\\ S_\infty\end{array} \right),
\end{displaymath}

and

 \begin{displaymath}
\hat{\theta}_n \equiv \left( \begin{array}{c}
\hat{\thet...
...\right) \left(
x_i - T_\infty\right)
\end{array} \right).
\end{displaymath} (36)

The motivation for theorem 5 below is as follows. We have the identity

 \begin{displaymath}
\theta_n =
\left( \begin{array}{c} \theta_{n1} \\ \theta_...
...n, S_n \right)
\left( x_i - T_n \right) \end{array} \right).
\end{displaymath} (37)

Rearranging the terms after a Taylor expansion around \(
\theta_\infty = \left( T_\infty, S_\infty\right)' \) we get

 \begin{displaymath}
\sqrt{n} \left( \theta_n - \theta_\infty \right) =
\left[...
...1} \
\sqrt{n} \left(
\hat{\theta}_n - \theta_\infty \right)
\end{displaymath} (38)

where

\begin{displaymath}A_n \left( \tilde{\theta}_n \right)= \left( \begin{array}{cc}...
...ial S_n \left( \tilde{\theta}_n \right)
\end{array} \right),
\end{displaymath}

and $\tilde{\theta}_n$ is a point between $\theta_n$ and $\theta_\infty$. Equation (37) motivates our bootstrap, and (38) suggests the correction needed to account for the use of fixed weights. As before, we can complete our argument by proving that the sequence of matrices $\left[ I - A_n \left( \tilde{\theta}_n \right) \right]$ converges almost surely to a non-singular matrix. The necessary regularity conditions for this proof are very restrictive. The next theorem is the location-scale extension of theorem 4.

Theorem 5   Assume that (32) and (33) to (35) hold, and that $ 0 < \left\vert E_F \left( \psi \left( Y \right) / Y \right)
\right\vert < +\infty$, with $Y = \left( X - T_\infty\right) / S_\infty$.
1.
Let $\hat{\theta}_n$ be defined by (36). Then

\begin{displaymath}\sqrt{n} \left( \hat{\theta}_n - \theta_\infty \right)
\longrightarrow N_2 \left( {\bf0}, \Sigma_2 \right),
\end{displaymath}

where

 \begin{displaymath}
\Sigma_2 =
\left( \begin{array}{cc}
\frac{ E_F \psi^2 \...
...F \chi^2 \left( Y \right) - b ^2 }{b^2}
\end{array} \right),
\end{displaymath} (39)

and $Y = ( X - T_\infty) / S_\infty$.
2.
There exists a matrix $A \in {\mbox{I}\!\mbox{R}}^{2 \times 2}$ such that

 \begin{displaymath}
\Sigma = A \ \Sigma_2 \ A^t
\end{displaymath} (40)

where $\Sigma$ is given by (33) to (35) and $\Sigma_2$ is given by (39).
3.
If the functions $\psi'\left( t \right)$, $\psi'
\left( t \right)t$, $\chi' \left( t \right)$, $\chi' \left( t \right)t$ and $\psi \left( t \right) / t$ are uniformly continuous, then there exists a sequence of matrices $\left\{ A_n \right\}_{n \ge 1} =
\left\{ A_n \left( \theta_n \right) \right\}_{n \ge 1}$ such that

\begin{displaymath}A_n \longrightarrow A \mbox{ almost surely},
\end{displaymath}

where A satisfies (40).

Proof of theorem 5:
1.
The following argument will show that there exists a function $g:{\mbox{I}\!\mbox{R}}^3 \rightarrow {\mbox{I}\!\mbox{R}}^2$, and n independent and identically distributed random vectors ${\bf y}_i
\in {\mbox{I}\!\mbox{R}}^3, \ 1 \le i \le n$, with mean $\mu = E_F \left( {\bf y}_i \right)$, such that

\begin{displaymath}\sqrt{n} \left( \hat{\theta}_n - \theta_\infty \right)
= \sq...
...eft(
\bar{{\bf y}} \right) - g
\left( \mu \right) \right).
\end{displaymath}

This representation will allow us to calculate the asymptotic variance-covariance matrix of $\sqrt{n} \left( \hat{\theta}_n - \theta_\infty \right)$. By Slutzky's Theorem it will be given by

\begin{displaymath}\nabla g \left( \mu \right) \ \Sigma_{\bf y}
\ \nabla g \left( \mu \right)'
\end{displaymath}

where \( \nabla g \left( \mu \right) \) denotes the matrix of first derivatives of g at $\mu$, and $\Sigma_{\bf y}$ is the matrix of variance-covariance of the random vectors ${\bf y}_i$. Note that

\begin{eqnarray*}\hat{\theta}_{n1} & = &
\sum_{i=1}^nx_i \ \frac{ \Psi \left( \...
...nfty}
\right) / \left( X- T_\infty\right) \right) } + T_\infty
\end{eqnarray*}


and

\begin{eqnarray*}\hat{\theta}_{n2} & = &
\frac{1}{nb} \sum \chi \left( \frac{x...
...\frac{x-T_\infty}{
S_\infty} \right) \right) \ \right/ \ b \ .
\end{eqnarray*}


For each $i=1, \ldots, n$ define the random vectors \( {\bf y}_i \in {\mbox{I}\!\mbox{R}}^3\) as

\begin{displaymath}{\bf y}_i = \left( \Psi \left( \frac{x_i-T_\infty}{
S_\infty...
...
\chi \left( \frac{x_i -T_\infty}{S_\infty} \right) \right).
\end{displaymath}

Their sample mean is

\begin{displaymath}\bar{{\bf y}} = \left(
E_{F_n} \Psi \left( \frac{X-T_\infty...
...F_n} \chi \left( \frac{X-T_\infty}{S_\infty} \right) \right).
\end{displaymath}

Define \( g : {\mbox{I}\!\mbox{R}}^3 \rightarrow
{\mbox{I}\!\mbox{R}}^2 \) by

\begin{displaymath}g \left( x,\ y,\ z \right) \ = \ \left( x / y + T_\infty, \ \ S_\infty\ z
/ b \right),
\end{displaymath}

and note that

\begin{displaymath}\hat{\theta}_n = g \left( \bar{{\bf y}} \right).
\end{displaymath}

Now let \( \mu \equiv \left( \mu_1, \mu_2, \mu_3 \right)
= E_F \left( {\bf y}_1 \right) \). We have $\mu_1 = 0$, $0 < \left\vert \mu_2 \right\vert < \infty$, and $\mu_3 = b$. This implies

\begin{displaymath}g \left( \mu \right) = \left( \begin{array}{c}
T_\infty\\ S_\infty\end{array} \right) = \theta_\infty.
\end{displaymath}

We have then expressed $\hat{\theta}_n - \theta_\infty$ as a smooth function of means:

\begin{displaymath}\sqrt{n} \left( \hat{\theta}_n - \theta_\infty \right)
= \sq...
...eft(
\bar{{\bf y}} \right) - g
\left( \mu \right) \right).
\end{displaymath}

It is easy to check that for any ${\bf x}= \left( x_1,
x_2, x_3 \right) \in {\mbox{I}\!\mbox{R}}^3$ with $x_2 \ne 0$ we have

\begin{displaymath}\nabla g \left( {\bf x}\right) = \left( \begin{array}{ccc}
1...
...x_1 / x_2^2 & 0 \\
0 & 0 & S_\infty/ b \end{array} \right),
\end{displaymath}

so that

\begin{eqnarray*}\nabla g \left( \mu \right) &
= & \left( \begin{array}{ccc}
1...
...ight)^{-1}& 0 & 0 \\
0 & 0 & S_\infty/ b \end{array} \right),
\end{eqnarray*}


where $ y = \left( X - T_\infty\right) / S_\infty$. From here, part 1 of the theorem follows.
2.
Consider the matrix A such that

\begin{displaymath}A^{-1} = \left( \begin{array}{cc}
\left. E_F \psi' \left( Y ...
... \left( Y \right) \ Y \right] \right/ b
\end{array} \right),
\end{displaymath}

where $Y = \left( X - T_\infty\right) / S_\infty$. Note that A is non-singular if and only if D in equation (35) is non-singular. It is easy to see that A satisfies (40).
3.
From Lemma A in page 253 of Serfling (1980) and the assumed regularity conditions we get the result with

\begin{displaymath}A_n^{-1} =
\left( \begin{array}{cc}
\left. E_{F_n} \psi' \...
...} \right) \ \tilde{Y} \right] \right/ b
\end{array} \right),
\end{displaymath}

where $\tilde{Y} = \left( X - T_n \right) / S_n$. This finishes the proof of the theorem.
The questions that remain to be answered are: The representation

\begin{displaymath}\sqrt{n} \left( \hat{\theta}_n - \theta_\infty \right) =
\s...
...ft( \bar{{\bf y}} \right) -
g \left(
\mu \right)
\right), \end{displaymath}

we get from the proof of Part 1 in Theorem 5 together with Theorem 3 positively answer the first question. We are still working on the second question. Empirical results obtained are very encouraging.
next up previous
Next: Challenges Up: Robust Bootstrap Previous: Robust Bootstrap for Location
Department Web Master
2000-05-29