Davis, A.W.; (1970)On the marginal distribution of the latent roots of the multivariate beta matrix."

I
I
••
I
I
I
I
I
I
••
,I
I
I
I
I
I
I
I_
I
ON THE MARGINAL DISTRIBUTIONS OF THE LATENT
ROOTS OF THE MULTIVARIATE BETA MATRIX
By
A. W. Davis
to
Department of Biostatistics
University of North Carolina at Chapel Hill
Institute of Statistics Mimeo Series No. 690
July 1970
I
.I
I
I
I
I
I
••I
I
I
I
I
I
I
1_
I
ABSTRACT
On the marginal distributions of the latent roots of
the multivariate beta matrix
The marginal distributions of the latent roots of the multivariate beta
matrix are shown to constitute a complete system of solutions of an ordinary
differential equation (d.e.), which is related to the author's d.e. 's for
Rotelling's generalized T2 and Pillai's V(m) statistics.
o
Similar results are
given for the latent roots of the multivariate F and Wishart matrices (E=I).
Pillai's approximations to the distributions of the largest and smallest roots
are interpreted as exact solutions, the contributions of higher order solutions
being neglected.
I
I
1
••
I
On the marginal distributions of the latent roots
of the multivariate beta mattix*
A. W. Davis
C.S.I.R.O., Adelaide, and University of North Carolina
I,
I
I
I
I
••I
I
I
I
I
I
I
I_
I
1.
Introduction.
Let S(mXm) and T(mxm) have independent Wishart distributions
W(q,E) and W(n,E), respectively, where E is the population covariance matrix and
q,n
~
m.
The latent roots £1 > ••• > £m > 0 of the multivariate beta matrix
B = S(S+T)-l are well known to have the joint density function
(1)
m ~(q-m-l) m
~(n-m-l)
TI'l(l-£,)
TI,<,(£,-£,),
<l>m;q,n(&) = K(m;q,n)IT,~= l£i
~=
~
~ J
~
J
(2)
K(m;q,n)
= ~~~:~[r(~(q+n-i»/r(~(m~i»r(~(q-i»r(~(n-i»],
The marginal distributions of the individual £, have been investigated by
~
.
Roy [14], [15], who showed that the largest root £1 is of basic importance in
testing hypotheses and constructing confidence regions in multivariate analysis
of va~iance; also by Pillai [10], Khatri [8], Sugiyama and Fukutomi [17],
Sugiyama [16], and AI-Ani [1].
Pillai [11] gave very accurate approximations
to the upper and lower tails of the distributions of £1 and £m' respectively,
and £1 has been extensively tabulated by Heck [7] for m<5, using Pillai's
approximation, and Pillai ([11], [12], etc.) for m<20.
Studies of the noo-
central distributions have been made by Khatri [9] and Pillai and Dotson [13].
*This research was· supported by the National Institutes of Health Grant No. GM-12868.
I
I
••
I
I
I
I
I
I
••I
I
I
I
I
I
I
~
I
2
As
n~,
matrix.
nB~WI'
say, having the distribution W(q,I) where I(rnxm) is the unit
Ranumara and Thompson [6] have tabulated the largest and smallest roots
of WI using limiting forms of Pi11ai's approximations, and discussed their
application.
The present author [3], [5] has shown that the null distributions of tr B
(Pi11ai's V(m»
and tr F (Rote11ing's genera1izedT 2 ), where F = ST- 1 , satisfy
o
certain ordinary linear differential equations (d.e. 's) of order m which are
related by a simple transformation.
distributions of the 2
i
In Section 2 it is shown that the marginal
form a complete system of solutions of a similar d.e.
Thus, the power-series of Sugiyama and Fukutomi are solutions at the regular
singularities 0 and 1.
Pi11ai's approximations are also shown in Section 4 to
be exact solutions of the d.e., but approximations to the distributions insofar
as contributions from higher-order solutions' are neglected.
Corresponding results
for the latent roots of F and WI are readily deduced (Sections 6 and 7).
2.
The differential eguation.
r
CR , where R is the real line.
{O<x <••• <x <2<x
.
r
,8
s- 1< .•• <x 1<1}
The marginal density function f (2) of
s
given by
(3)
f
s
(2) =
fDm- 1 (s,2) ~m;q,n (x1 '···,xs- l'~'x8 , ••• ,xml)dx
--
~1
-
where dx = IT =l dx ; it is proportional to
i
i
~~(q-m-1)(1_~)~(n-m-1)f
. ~(x)~-l(~-x.)dx
~-'
. Dm-1 (s,~) - ~=1
(4)
in which
~
denotes
~
m-1;q-1,n-1'
Define
~
s
is
I
I
••
I
I
I
I
I
I
••I
I
I
I
I
I
I
Ie
I
3
(5)
'1' (J/,;x) = <1>(x)I (J/,-x (l»· •• (J/,-x (
r
-
a
a m- 1-r
a
»,
(r=O,l, ••• ,m-l),
the summation being extended over the (m-l) selections of integers 0'.(1) < ••• < O'.(m-l-r)
r
from the set 1,2, ••• ,m-l.
When r=m-l, the sum is taken to be unity.
We now
introduce the mfunctions
L
(J/,) = f m. 1
'1' (J/,;x)dx, (r=O,l, ••• ,m-l),
s,r
D - (s,J/,) r
- -
(6)
noting that f (J/,) is proportional to J/,~(q-m-l)(l_J/,)~(n-m-l)L
s
s,O
Our object is
to show that, for each s, the L
are related by a system of first-order differential
s,r
equations which are independent of s.
Differentiating (6),
L' (J/,) = - Z(l) + Z(2) + (r+l)L
+1'
s,r
s,r
s,r
s,r
(7)
where
Z(l)
s,r
=
f Dm- 2 (s-l,J/,) '1'r (J/,;xl,···,x s- 2'J/"x s- l'···'xm2)dx,
-.-
(8)
Z(2) =
f Dm- 2 (s,J/,) '1'r (J/,;xl,···,x s- l'J/"x s , ••• ,xm2)dx.
s,r
-Now let a(l), ••• ,S(r) denote the set of subscripts complementary to
a(l), ••• ,O'.(m-l-r).
We have
rJ/, L
s,r
(9)
= (m-r)L
. s,r+1 + 0 s,r ,
say.
Integration by parts with respect to the xS(i) yields
I
I
,e
4
~(q+n-2m+2)G
(10)
s,r
= t(1-t)[Z(1)_Z(2)]
s,r s,r
+ ~(q-m+r)L
s,r
+~
s,r
,
where
I
I
I
I
I
I
I.
I
I
I
I
I
I
I
I_
I
~
s,r
(11)
A term similar to (11) occurred in the derivation of a d.e. for Rote11ing's
generalized To2 ([3], Equation (2.13», and the same approach yields
~
(12)
s,r
= ~(m-r)(m+r-3)L
s,r- 1+l~(r-1)(1-2t)L s,r +l~(r+1)t(1-t)L s,r+1.
Finally, eliminating the Z's, G's and
~'s
from equations (7), (9), (10) and (12),
we find that
t(l-t)L'
=
s,r
~(m-r)(q+n-m+r-1)L
+
~(r+1)(r+2)t(1-t)L
s,x-
l+l~r[(l-t)(q-m+r)-t(n-m+r)]L
s,r
( 13)
where Ls,- 1
s,r+1' (r=0,1, ••• ,m-1),
=Ls,m =O.
We observe that the system (13) is independent of s, and in principle one
could successively eliminate L 1, ••• ,L
l' arriving at a homogeneous linear
s,
s,md.e. of order m having each L 0 as a solution.
s,
Clearly the f
s
will be solutions
of a similar d.e.; furthermore, they will constitute a linearly independent and
hence complete system of solutions, since as
(14) .
where
t~O+
fS(t)/t~(q-m-1) ~ ks(m;q,n)t~(m-s)(q-S+2), (s=l, ••• ,m),
I
I
••
I
I
I
I
I
I
••I
I
I
I
I
I
I
I_
I
5
k (m;q,n) = K(m;q,n)/[K(s-1;q+m-s+l,n-m+s-l)K(m-s;q-s,m-s+3)].
(15)
s
This is easily proved by writting x. =
.
J
(j=s, ••• ,m-l) in (4) and letting
~Wj
~+o.
We note in addition that (13) is invariant under
(16)
q+n, n+q,
~+1-~,
r
provided that we also replace L,
by (-1) Ls,r; this reflects the obvious result
. s,r
that (16) transforms fs(~) into fm+l-s(~).
3.
Solutions of the d.e.
It is convenient to introduce H =
r
.
(l-~)
-r
L
s,r
(r=O,1, ••• ,m-1) and to express (13) as a matrix d.e. for H = (H , ••• ,H _ )':
O
m1
(17)
" .
where
The d.e. (17) is of Fuchsian type, with regular singularities at
~=O,l
and
infinity, and we refer to [2], Chapter 4, for the general theory of such d.e.'s.
Assuming a series solution H =
Lr : o
hr~p+r in 1~1<1, we obtain Ah O .= phO' so that
p must be one of the latent roots a , ••• ,a _ of.A, and h the corresponding
O
m1
O
•
il
'I
6
I
1
il·
il
il.
,
'I
latent vector.
To relate this fundamental set of solutions to the f (t), we
s
first obtain a non-singular transformation H=PM, where P(mxm) is independent of
t, such that P- 1AP = diag(a ).
A suitable choice is
r
(19)
P = {pij },
P ij
= (-1)
i-j m-j-1 i-I
(i.)IT . (q+n-m+r)/(q-m+j+r+l) ,
(20)
Both P and P
-1
are lower triangular, and MO=H '
O
p- 1 CP = G =
(21)
v
A ,
1
III
o
0
0
v
1
Am-1
m-2
llm-l
where
A. = (m-i) (n-i) (q+i-1) (q-m+i-1) (q-m+i-2) (q+n-m+i-1)/[2(q-m+2i-2)
~
(q-m+2i-1) 2 (q-m+2i)],
(22)
~I
ll.
~
,
il
11
llO'
It may be shown that
'v
il
il.
11j
r=J
with inverse
;1
!I
-J
= i2-~i(m+n-3)-m+1
+ ~i(i+1)(m-i)(n-i)/(q-m+2i-l)
-~(i+1)(i+2)(m-i-1)(n-i-1)/(q-m+2i+1),
v. = ~(i+1)(i+2).
~
The d.e. (17) now takes the form
,
]1
(23)
dM/dt
=
[t- 1 diag(a ) + (l-t)-l G]M,
r
~I
)1
a +r
corresponding to the latent root a
and assuming a solution M = lr:o nrt p
p
,
(no ,r , ••• ,n~,r
-1 ,) of the nr :
1
'
~I·
1
il
of A, we obtain the following recurrence relations for the components
I
I
••
I
I
I
I
I
I
••I
I
I
I
I
I
I
I_
I
7
np,O = 1,
(24)
ni,O = 0 (i~p),
(r-a.+a
1
l+[ll.+(r-l)-a.+a
1
P)n i ,r = A.n.
1 1- ,r1
1
P]n i ,r- 'l+v.n,·+l
1 1
,r-l'
(i=O, ••• ,m-l; r=1,2, •••• ).
This form of solution unfortunately breaks down if ai-a
integer for some i.
In fact, a +l-a
p
p
=
~(q-m+l)+p
if q-m is odd, while a + -a = q-m+2(p+l) (p
p 2 p
~
p
is a positive
(p < m-2) , which is an integer
-
m-3) is always an integer.
Generally in such situations the solution must be obtained by limiting procedures
which may produce logarithmic terms.
However, it may be seen from (6) that the
Ls,r are in fact representable by power series, and it appears that if a.-a
is
1
p
a positive integer for i>p, then the right-hand side of the ith equation in (24)
vanishe~
identically when r = a.-a.
1
p
Thus n
i,r
is an undetermined constant
introducing the a.-solution at this stage, and the power series form is preserved.
1
We also see from (24) that the nO
,r
are zero for r<p in the a -solution,
p
'while
(25)
= (p+l)!/ni~l(q-m+p+i+l) = i; p , say.
a+p
-Hence M (!) = O(! p ) as !+O+, and since a
+(m-s) = ~(m-s)(q-s+2), it follows
O
m-s
from (14) that L ,s must be some linear combination of the am-s ,am-s +1'.'.' and
O
a
m-
I-solutions.
The coefficient of the a
m-s
-solution is clearly k (m;q,n)/i;
,
s
m-s
but the remaining coefficients have not been determined for general s.
However,
the density function f (!) of the largest root corresponds to the largest root
l
am-I of A, and is thus completely specified by (24).
The resulting power series
coincides with the result of Sugiyama and Fukutomi [17].
I
I
••
I
I
I
I
I
I
8
4.
Pillai's approximations.
A particular solution corresponding to the smallest
root a =0 of A may be given explicitly as an (m-l)th degree polynomial.
o
(Z)i = z(z+l) ••• (z+i-l),(z) . = z(z-l) ••• (z-i+l), it may be shown that
-~
(26)
Thus we obtain the following approximation to the lower tail of f (t) for
m
large q by neglecting the al, ••• ,a _ solutions (i.e., terms of order t q- m+l
ml
at least):
f (t) ~ k(m;q,n)t~(q-m-l)(l_t)~(n-m-l)
m
(27)
tm-I(m-l) ( +n) ( 1)
( t)r
lr=O r
q -m r q- -(m-l-r) ,
••I
where
I
I
I
I
I
I
large n (near the regular singularity t=l) is obtained.
I_
I
Writing
k(m;q,n) =
(28)
TI~r(~(q+n-m+l»/[2m-lr(~)r(~q)r(~)].
Using (16), a corresponding approximation to the upper tail of fl(t) for
The result is found
to be simply the right-hand side of (27) multiplied by (-1)
m-l
•
The integrated form of the approximation was arrived at by Pillai [11]
using a different approach, and used in a series of tabulations of the upper
5% and 1% points of
when n 2
~
~l.
Its accuracy to essentially five places of decimals
m+ll was demonstrated at least for m ~ 10 by substituting inexplicit
expressions for the distribution function [10].
In order to investigate the
usefulness of the d.e. (23), some percentage points were calculated by following
the a
m- I-solution .>ut from. the origin, using the same computation procedure as
in [4].
The method,tPI'<red to be effective at least up to m=7, since on
I
I
••
I
I
I
I
I
I
••I
I
I
I
I
I
I
I_
I
9
comparing the 1% points, i.e., the less ,accurate results of the d.e. and the
more accurate results of the approximation, these were generally found to differ
by no more than a unit in the fifth decimal place.
On the other hand, the 5%
points obtained from the d.e. tended to exceed Pillai's by about three units
in the fifth decimal place.
The d.e. approach should be more accurate at lower
significance levels, and a tabulation of upper 10% points has been made.
5.
The median root for m=3.
The success of the Pillai approximation suggests
a similar approach to the tails of the distributions of the other roots, approximating the lower tail of f
s
by the a
m-s
-solution for large q, and deducing a
corresponding result' for the upper tail of f
m-s
+1 when n is large using (16).
However, so far it has proved difficult to obtain the solutions in closed forms
analogous to (27), except in the case of the median root 2
2
when m=3.
The
aI-solution is
1
1
1
M = (~q+2[2(n-l)/(q-2)-~(q+n-4)])2~q- (1-2)~,
l
(29)
kq
!m
M = -[(n-2) (q-3) (q+n-2)/2q(q-l)]2 2 (1-2)2 •
2
Hence, for large q and 2 near zero,
(30)
the beta density with parameters q-l and n-l.
the upper tail and large n, and
of 2 ,
2
ha~been
The same approximation holds for
used to compute upper 5% and 1% points
The results are identical to five decimal places with those published
by Pillai and Dotson [13], except where the latter have employed extrapolation,
in which case differences of up to 0.006 occur for the 5% points and up to .003
for the 1% points.
I
I
.e
I
I
I
I
I
I
Ie
I
I
I
I
I
I
I
.e
I
10
6.
The latent roots of the multivariate F matrix.
Substituting t = v/(v+l) in
(23), we obtain the d.e.
l
) + (l+v)-l [G-diag(a )])M*,
dM*/dv = (v- diag(ar
r
(31)
which has regular singularities at 0, -1 and infinity.
f*(v) of the sth largest root v
s
a
m-s
-
of F is a certain linear combination of the
, ••. ,a I-solutions of (31), multiplied by the factor v~(q-m-l)/(v+l)~(q+n)-m+l.
m-
a
+(m-s)
f*(v)/v~(q-m-l) - k (m·q n) v m-s
.
(32)
s
where k
7.
s
The density function
s
s
"
is defined by (14).
As
The latent roots of the Wishart matrix WI.
and nv
s
n~,
the densities of nt
converge to f (u), say, the density function of the sth largest root
s
To obtain a d.e. for the f , we substitute t = u/n and M.
s
n~
(i=0, ••• ,m-1) in (23); 1eting
~
it is found that
1
dM/du = [u- diag(a
. r) +
(33)
where M = (M ' ••• ,M _ )', and
O
ml
(34)
G=
110
-
V
~
II
-
0
o
.
VI •
·V
m-2
0
where
s
~
m-l
-
llm-1
G]
M,
= n
i -
M.
~
I
I
••
I
I
I
I
I
I
••I
I
I
I
I
I
I
Ie
I
11
(35)
Xi
= (m-i) (q+i-1) (q-m+i-1) (q-m+i-2)/[2(q-m+2i-2) (q-m+2i-1) 2 (q-m+2i)]
-
= -[i(q-m+i) (q+m-5)+2(m-1)(q-m-1)]/[2(q-m+2i-1) (q-m+2i+1) ].
11 i
The d.e. (33) has a regular singularity at u=O, but an irregular singularity
at infinity.
As before, f
s
is a linear combination of the a
at the origin, multiplied by e -~ u ~(q-m-1) •
, ••• ,a 1-solutions
m-
In particular, as u+O+,
a
-f (u)/u~(q-m-1) - k- (m;q ) u m-s
s·
.s
(36)
m-s
+(m-s)
where
ks (m;q) = K(m;q)/[K(s-1;q+m-s+1)K(m-s;q-s,m-s+3)]
(37)
K(m;q) =
~~/[2~q ~:~ r(~(m-i»r(~(q-i»] •
Taking limits in (27), we find that f (u) ~ (_1)m-1 gm,q (u) for large u,
1
and f (u) ~ g
(u) for u near zero, where
m
m,q
. gm,q(u) = K(m;q)e
-~ ~(q-m-1)
u
~m-1
m-1
r
lr=O( r )(q-1)_(m_1_r)(-u) ,
(38)
Again, g
corresponds to a particular a -so1ution of (33) at the origin.
O
m,q
Banumara and Thompson [6] used integrated forms of (38) in their tabulation of
u •
m
Some percentage points of u
1
were calculated using the a _ -so1ution
m1
of the d.e., and these differed from the results of the approximation by no more
than a unit in the second decimal place.
The d.e.(33} is in fact closely connected with the author's d.e. [5] for
the moment generating function'of Pi11ai's V(m) = tr B.
If we write
I
I
••
I
I
I
I
I
I
••
I
I
I
I
I
I
I
I_
I
12
A
(s) =lexp (-s v(m», then it is easily seen that fl(u) is proportional
m;q,n
to e
-~
u
~q-l
A
l l ,m+2(~)'
m- ;q-
The author's thanks are due to Mrs. Darien Wei and Dr. R. W. Helms of
the Department of Biostatistics, School of Public Health, University of
North Carolina, for their assistance with the computing.
REFERENCES
[1]
AI-Ani, S. (1968). On the distribution of the ith latent root under null
hypotheses concerning complex multivariate normal populations. Mimeo
Series 145, Department of Statistics, Purdue University.
[2]
Coddington, E. A. and Levinson, N. (1955).
Equations. McGraw-Hill, New York •
[3]
Davis, A. W. (1968). A system of linear differential equations for the
distribution of Hotelling's generalized T2 • Ann. Math. Statist. 39, 815-832.
Theory of Ordinary Differential
o
[4]
Davis, A. W. (1970). Exact distributions of Hotelling's generalized To2 •
Biometrika 57, 187-191.
[5]
Davis, A. W. (1970). On the null distribution of the sum of the roots of
a multivariate beta distribution. To appear in Ann. Math. Statist.
[6]
Hanumara, R. C. and Thompson, W. A. (1968). Percentage points of .the
extreme roots of a Wishart matrix. Biometrika 55, 505-512.
[7]
Heck, D. L. (1960). Charts of some upper percentage points of the distribution of the largest characteristic root. Ann. Math Statist., 31, 625-642.
[8]
Khatri, C. G. (1964). Distribution of the largest or the smallest
characteristic root under null hypothesis concerning complex multivariate
normal populations. Ann. Math. Statist. 35, 1807-1810.
[9]
Khatri, C. G. (1969). Non-central distributions of ith largest characteristic roots of three matrices concerning complex multivariate normal
populations. Ann. Inst. Statist. Math. 21, 23-32.
[10]
Pillai, K. C. S. (1954). On some distribution problems in multivariate
analysis. Mimeo Series 88, Institute of Statistics, University of
North Carolina.
I
I
.e
I
I
I
I
I
I
••I
I
I
I
I
I
I
I_
I
13
[11]
Pi11ai, K. C. S. (1965). On the distribution of the largest characteristic
root of a matrix in multivariate analysis. Biometrika 52, 405-414.
[12]Pi11ai, K. C. S. (1967). Upper percentage points of the largest root of
a matrix in multivariate analysis. Biometrika 54, 189-194.
[13]
Pillai, K. C. S. and Dotson, C. O. (1969). Power comparisons of tests of
two multivariate hypotheses based on individual characteristic roots.
Ann lnst. Statist. Math. 21, 49-66.
[14]
Roy, S. N. (1945). The individual sampling distribution of the maximum,
the minimum and any intermediate of the p-statistics on the null hypothesis.
Sankhya 7, 133-158.
[15]
Roy, S. N. (1957).
[16]
Sugiyama, T. (1967). Distribution of the largest latent root and the smallest
latent root of the generalized B statistic and F statistic in multivariate
analysis. Ann. Math. Statist. 38, 1152-1159.
[17]
Sugiyama, T. and Fukutomi, K. (1966). On the distribution of the extreme
characteristic roots of the matrices in multivariate analysis. Reports
of statistical application on research, union of Japanese scientists and
en~ineers, 13 •
Some Aspects of Multivariate Analysis.
Wiley, New York.

Download Report

Davis, A.W.; (1970)On the marginal distribution of the latent roots of the multivariate beta matrix."

Paperzz.com

Your Paperzz