1 ~1is paper is based in part on the author's 1974 Ph.D. dissertation at Purdue University. TI1is research was supported in part by the Office of Naval Research Contract N00014-67-A-0226-00014 at Purdue University. M'S 1970 Subject Classifications: Primary 62G31; Secondary 62G45, 62G70 Key Words and Phrases: Ranking and Selection; Sequential Selection; Tail Orderings; Stochastic Orderings; Quantiles; Nonparametric Selection. ASYMPTOTICALLY NONPARAMETRIC SEQUENTIAL SELECTION PROCEDURES 1 by Ra.Yinond J. Carroll Department of Statistics University of North CaroZina at Chapel Hill Institute of Statistics Hi:[~eo Series #944 August 1974 ASYrl?TOTICALLY NONPARAMETRIC SEQUE~rrIAL SELECTION PROCEDURES I by naymond J. Carroll University of Porth Carolina SUM~~RY Let F(x; e.) ~ ITI, ... ,IT K be (i = 1, ... ,K) K independent populations with distributions which are stochastically ordered or tail ordered. The ranking goal is to select the stochastically largest population or the population with the lightest tail. Sequential selection rules in the spirit of Robbins, Sobel, and Starr (lS68) and Geertsema (1972) are proposed and studied. The above problems are corollaries of a general TheoreM proposed, and the solutions are nonpararnetric in an asymptotic sense. 1This paper is based in part on the author's 1974 Ph.D. dissertation at Purdue University. This research was supported in part by the Office of Naval Research Contract N00014-67-A-0226-000l4 at Purdue University. -2- 1. INTRODUCTION independent populations with distributions F (x; e.) 1 6.1 < ~ (i = 1, ... ,K) , where is some unknown indexing parameter. 6. 1 e. indicates that e. is preferred to J J 6. , the most fundamental prob1 lem in ranking theory revolves around finding the 6. S 1 unkno~m population TI[K] ai ) = F(x - 6i ) , then 6i ~ 8 j 6. , in which case the goal would be to select the largest which is most preferred. could mean If For example, if F(x; J location 1: .':ITameter. Suppose the true (unlmo'Vm) preference ordering is 6[1]~"'~ 6[K] • Using a natural decision procedure; Bechhofer's (1954) indifference zone method consists of determining the smallest sar~le size nCo) for which the proba~ili of correctly selecting the preferred population whenever F(x; 6[K])) ~ 0 > 0 is at least as large as a specified constant o cl(o, 0) > 0 is known and is some distance function. case, using sample means, e.J - (1.1) (1. 2) (1.3) N(o) = first p* For future use, define 6. 1 integer n ~ (b%)2 , where = f~oo ~K-l (x + b)d~(x) . p* , where For the normal means -3- If 0 2 is unknown, Robbins, Sabel, and Starr (1968) and Geertsema (1972) investigate and generalize the natural rule which takes N(o) Qbservations from each population, where N(o) (1.5) = first integer n ~ (bS nl',"/0)2 2 Sni~v is the usual pooled sanp1e variance based on and freedom. degrees of K(n - 1) Geertsema (1972), using translation invariant estimates, derives asymptotic (as'o ~ 0 ) results lV'hich are nonparal'letric in the sense that, under certain conditions; they hold independently of the underlying distribution F. H(o) is a stopping rule of Chow and Robbins type (1965); in order to in- vestigate its a.symptotic properties, the follol.ring definition is needec: DEFINITION 1.1 (Ansconbe (1952)) A sequence of r.v.'s uniformly continuous in probability if c (e,o) :3 (1.6) p{lyl'l - where d.f. if n ~ V e > 0 , {Y } is said to be n > 0 , 3 J(e,o) 0 J (e: ,0) Yn I < ow-n 1 V integers ~:3 1m - nl < n c(e,o)} > 1 - e , e and is a sequence of norming constants such that for some real F, P{y n - e $ xw- l } ~ F(x) n The sample means satisfy (1.6) so that from Anscombe (1952), as (1.7) where and indicates convergence in law. }Ience 0 ~ 0 , -4- lin inf peeS) = p* , (1.8) 0-+0 where CS indicates a correct selection and the This paper atten~ts inf is taken over nco) . to generalize the location parameter results to such preference patterns as stochastic ordering (Lehpann (10S9)) and tail orderinf (Doksum (1969)). In Section 2, the sequential rules to be used are Section 3 motivates and presents the results and includes presente~. so~e applications. For i = 1, ... ,!~ , Section 4 gives the proofs. 2. STRUCTURE The initi?,l structure of Section 1 will be assumed. x.1. 1 ' ... ,x.In independent observations and statistics T. (n) , g. (n) T.(n) (2.2) where - ll(e.)) 1 CJ. 1. 1 g. (n) -+ (2.3) 1. 1l(6 ) i = lli CJ (6 and are formed such that 11 ~) (T.1 (n) (2.1) are taken according to a distribution i o(6 i ) 1 = 0i g.(n)>0 1. are constants. distance function will be (2.4) (2.5) 6. < 6. <='> ll· 1 ~ J n-+ oo a.s. -+ lI(e.) a. s., and .) 1 as <p 1 ~ ll. , J a.s. , The preference pattel~ and -5- nco) will be as in (1.4), with the obvious replacement of and e[~]. For N.(o) = first integer 1 i rn) for Since no sinple location parameter structure is assumed, there will be K independent stopping rules (2.6) ~(e = 1, ... ,1(, we will take N (0), ... l n ;:: (bg defined by ,1\)1((0) ? (n)/o)·· (i = 1, ... ,K) i ~!.(o) 1 observations from T1· (ll1' (0)) , and select the population with the largest 1T. 1 • , form T. (~J. (0)) . 1 1 Examples considered in this paper for stochastically orderec families will include cases where the T.(n) 1 are sample fleans or sample mepians; the interquartile range will be used for selecting the population with the lightest tail. will be assumed throughout that the REHARK 2.1. {T. (n)} 1 It satisfy (1.6). The stopping rules (2.6) being independent has certain drawbacks. Eowever, improvement here must await results in the location case, for which there does not appear to be a nonparanetric rule which eliminates obviously inferior populations and satisfies (1.8). 3. RESULTS AND APPLICATIONS The structure of Section 2 (especially (2.4) and (2.5)) will be and the goal will be to guarantee (1.3). assu~ed At this point, no specific assump- tions concerning the preference pattern (2.4) will be made; only later ,dl1 such orders as stochastic orderings or tail orderings be used. The major difficulty encountered in r,uaranteeing (1.8) is the lack of a recognizable least favorable configuration (see Bechhofer ,(1954)). (3.1) Letting -6- (3.2) For = 1, ... ,K i - I it is easy to see that lim inf peeS) (3.4) where the latter case, inf N. (0) :: N(o) 1 ~ n* is taken over (i = I, ... , K) Now, in the location parameter and P(~,o) (3.5) = Q* (say), lim inf P(!,O) _ P(Q.,o) so that lim inf P (CS) (3.6) ~ lim P(Q.,o) = P* • However, in the cases which are dealt with here, the parameter point at which pees) attains its infinum is as yet unknown. (en, 0) n However, there exists such that (3.7) If there were a p(~n, 0 ) + Q* n e such that and 0 ~n + + n ~ , 0 as since (3.8) P (en, 0 ) + Q* , n n + 00 • -7- some sort of a continuity criterion for suggests itself. P(',c) Eelly-Bray LeJI1..ma, it is clear th?t a sufficient condition for (3.9) 1 P{C-n b[T.(N. (0 )) Il n ~(S~)) ~ z} ~(z) + 1 • as n ~ (i 00 0* By the ~ p* is = 1, ... ,K) This fact, together wit!1 the continuity criterion, suggests Theorem 3.1. the sufficient condition (3.9) depends only on the tions, the rest of the paper will use the and ~eneric one-di~ensional terms cince distribu- T(n), g(n), N(o) • 11 (S) This intuitive continuity criterion suggests that close to F(x;S) should he is srall. so that F(x; SO) (AI) (A2) For each 8 and for all 0 e: > 0 • there is a d such that The other continuity conditions needed are that (2.1). (2.2), and (2.3) hold uniformly for THEOREf43.1 Y > (A3) 0 , c > jl(8) close to jl(8 ) ; this is summed up in 0 Suppose that for each 0 , such that if n >- J P8{lg(r) - 0(8)1> o(S)H o< H < e: SO' e: > 0 , and 1(1 > 0 , and ze:R' , :1 Ijl(S) - jl(8 ) I ~ y , then 0 n ~ for some is such that 0 ± J} ~ H)2 - 11 e: , where < cf4 .J , -e1 2 Pe {n / IT(m) - T(n) I (AS) 'Then, i f RE~~RK Q* 3.1. > for some cr m 1m - nl < en} 3 is compact and (AI) and (A2) are true, (1.8) holds. TI1eorem 3.1 is nonparametric in the sense that it holds indepen- dently of the underlying class of distributions class satisfies (AI) - (AS) anQ REMARK 3.2. S E is compact. Q* Although the proof of {F(x;e)}, as lone as this TIleore~ 3.1 depends on the rules (2.6), the Theorem is a fixed-sample result and thus can be verified in individual cases. Q* compact is used to guarantee that a subsequence of (e , 0 ) n n converges; this condition does not appear to be serious in practice, since most measuring devices are finite. REMARK 3.3. It is quite easy to see that the conditions of Theorem 3.1 suffice for the goals (i) selecting the selecting the smallest ~(e[l]) i" largest ~(e[K-t+lJ), ... ,~(e[K]) ; (ii) ; (iii) as selecting a restricted subset (see Santner (1974)). LEMMA 3.1. (Location Parameters) hold, then (1.8) holds and LEMMA 3.2. F (x; (~,cr)) Q* If F(x;e) = F(x - e) need not be compact. (Location Parameters with Unequal Scales) = F [x-~ 1 ,where j (j and (2.1) - (2.3) cr£!( compact and ~ER. Suppose Then, i f (2.1) - (2.3) hold, (1. 8) holds. The following Proposition 3.1 is adapted from Geertsema (1972) and Bahadur (1966) and follows in a similar manner. -9- PROPOSITION 3.1. Let c > 0 and 0 < a 1 • and suppose < F(x) has a bounded second derivative in a neighborhood of the Q.th population quantile 1;, with l 2 1/2 /2, and 1 F' (1;) = f(1;) > 0 . Let bn = fna - -' - cn / /2 and a = [na] + cn n to be the define [an]th and (bn]th order statistics. Tr.en It is clear that some sort of ilclosenessll condition must be placed on the underlying distributions in order that the continuity criteria (AI) - (AS) hold. One convenient Lipschitz type condition is (3.11) eO' a numbers For each Be + 1 that for all and Ce 1 as + (Sample Quantiles in Stochastic Orderings) tributions F(x;e) T(n) = X[na] and holds for all LEMMA 3.4. F(x; e) x Suppose the conditions of Proposition 3.1 hold. and as in (3.10). y Then, if ~* are stochastically ordered, that is the sample variance. Suppose that , (1.8) holds. Suppose the distributions 2 T(n) ~(e) Define is compact, and (3.11) ~(eO) in some neighborhood of (Sample r1eans in Stochastic Orderings) tl'ueifF(o;O) Suppose that the disth be the a popu~(e) are stochastically ordered and let g(n) such x,y, LEMMA 3.3. lation quantile. ~(e) + ~(eO) is the sample mean, and g en) = F- l [F(~(eo) ; eo) : e) (l11hich is is synunetricJ, and that for all 8 0 , -10- (3.12) is bounded in a neighborhood of eo' If (3.11) holds and Q* is compact, then (1.8) holds. LEMMA 3.5. (Interquantile Ranges in Tail Orderings) Suppose that the F(x;8) are tail ordered and it is desired to select the population witr. the lightest tail. If o < a < 1/2 < a < 1 , and ].11 (e) , ].12 (8) are the a l st and l 2 nd v and let g(n) be quantiles of F(x; e) , set T(n) = X[ (12 na 2J - A[na I ] the obvious estimate based on functions of the type (3.10). If Q* is compact and (3.11) holds for all x,y in some neighborhood of ].11 (e) (and ].12(8) ), then (1.8) holds. 4. PROOFS DEFINITION 4.1. Let = [ba(e)/a]2 Ml(a,e) = [ba(e)(l - M(a,e) (4.1) M (0,8) 2 Recall that M(a,e)/N(a) PROOF OF THEOREM 3.1. ~ = [ba(e) (4.2) + 1 a.s. under H)/0]2 F(o;e) as a It is sufficient to prove (3.9). and (A3) , one sees that d 00' J, 1].1(e) - ].l(8 0) 1 ~ y (1 E)/a]2 and y such that if ~ 0 . Then, by using (A7.) ° ~ 00 and -11- Since bO(El)/o(M(o,El))1/2 = (bo(El)/o)/[bo(El)/o] , one can easily show that (4.3) IPa{bo-1[T(M(O,B)) - "(a)] Choosing 00 and y <z ± o} - o(z ± 0)1 < < , small enough that 1M.1 (o,El) - r'!(o,El) I (i = 1,2) , < cH(o,El) from (A3) and (4.2), (4.5) How, (4.6) Pe{o-lb[T(N(O)) - )J(El)) :;; Z} - Pa{o-lb[T(M(O,El)) - )J(El)) :;; :;; 2€ + , ~ I • Pe{ho~lIT(N(o)) ~ and IN( 0) /H ( 0 , a) T(M(o,a)) -l i I :; 2€ + PEl{bo-1IT(m) - T(M(o,a)) for some Similar ly, Z+ m 3 Im/M(o,a) - a} I~ a c} I~ a 11 :; c} -12- By choosing 0 small. 0 $ 0 ' and 0 Ill(S) - ll(8 0 ) I y , this means from $ (4.3) that and hence that (3.9) holds. PROOF OF L8~ to 3.3 from Bahadur (1966). Len~a 3.3. For convenience, assume that (3.11) holds; one can extend Since (AI) and (A2) follow from (3.11). (A3) follows from the probability integral transformation. (3.11). and (A2). For (A4). assuming and z positive. ll(e o) I small. since F-l(F(X; eO) ; e) Ill(S) ~(Cez) (4.10) - £ $ Pe{O(8)-ln l/2 (T(n) - ll(e)) n large, is increasing, $ Z} $ ~(Bez) + £ • (AS) follows from the integral transformation, (3.11). and Proposition 3.1. PROOF OF LEr~~ 3.4. Assume that ICe - 11 $ IB e - 11 • and let g2(n:s) be F-l(F(X ; eO) ; e) •...• p-l(F(X ; eo) ~ e) . l n Then, from (3.11). one can show that the sample variance obtained from (4.11) where G n ~ c > 0 almost surely. With arbitrarily large probability under -13(4.12) where Gn* c * > 0 almost surely. -+ follOi;T from (4.12). (AI) The proof of fo11o\'JS from (4.11) and (A2) and (.'\3) (1\4) is somewhat involved. It is possible to show that, if then (A4) 1tJill follow if it can be shown that V e: > 0 , SUdl that n (4.14 ) ~ Pa Ie - eol and N(e) {Iii I!- I o n i=l (E.1 (e) < nee) imply - 1) (X. - eo)1 1 >o} It turns out that (4.14) is true if for any sequence with n. J -+ and 00 ej -+ J 11 :<;; IB e - 11 , convergence criterion n (e) e. (nle l ) , (n 2 , e2), ... n. J L i=l (il. 1 (e .) - 1) (X. - eO) J 1 converges to that of a random variable putting all mass at IHi(e) - 0 , 3 T-l(e) , eO ' the cistribution of -1/2 n. (4.15) < e> if (Lo~ve Elx i - eol2 o. C' ,Ince exists, one !'lay use the degenerate (1963), page 317) to achieve the result. The proof of (AS) requires an extension of Anscombe's (1 0 52) proof that the sample mean satisfies (2.6), and uses (A4), (3.12), and Yolnogorovis Inequality. PHOOF OF LH~\1A 3.5. Again, as in Lcr~ma follow in a manner similar to the proof cerived from Proposition 3.1 and t~e involves (3.11), the monotonicity of 3.3, assume (3.11). o~ (AI) - (1\3) Le!"lr-1a 3.4, although expression F-l(F(x; ~or are) g(n:e) is The proof of (A4J en) ; e) , and decol'lposing t~,e -14probability space into the 4 sets where -x [net.] 1 i = 1,2. REt~.RK - ~. 1 (8) > n or < 0 for US) fo11o\l1s from LeJ"!'.ma 3.3. 4.1. ~~te that a stochastic ordering was not really or 3.4 except to insure that (2.4) makes sense. use~ in Le~na 3.3 One could also combine Lemma 3.1 ",lith Lel'll1las 3.2 - 3.5 to extend the set over t'!hich the infinuJ11 in (1.3) is taken. ACKNOHLEDGEMENT I would like to e:q>ress my deepest appreciation to my advisor Shanti S. Gupta for his advice and support. -14- REFERENCES [1] Pnscombe, F.J. (1952). Proa. Camb. PhiZ. Soa. [2] Bahadur, n.R. (1966). A note on Quantiles in large samples. Statist. (37) 577-580. [3] Bechhofer, ~.E. (1954). A single-sample nultiple decision procedure for ranking means of normal populations "Ti th Imo"m variances. Ann. Math. Statist. (25) 16-39. [4] Chow, Y.S. and Robbins, E. (1965). On the aSYf.1.ptotic theory of fixedwidth sequential confidence intervals for e,e !Jean. Ann. Hath. statist. (36) 463-467. [5] Doksum, K. (1969) . Starshaped transformations and the power of ranI: tests. Ann. Hath. Statist. (40) 1167-1176. [6] Geertsema, J.e. (1972). Nonparametric sequential procedures for selecting the best of k populations. J. Am. Statist. Assoa. (67) 614-616. [7] Lehmann, E. L. York. [ o" ] Lo~ve, [9] 0.obbins, E., Sobel, r:., and Starr, f':. (1960) . A sequential procedure for selecting the largest of k means. Ann. Hath. Statist. (39) 88-92. [10] Santner, T.J. (1974). A restricted subset selection approach to ranking and selection problens. To appear in Ann. Math. Statist. N. (1959). (1963). Large sample theory of sequential estimation. (L!-8) 600-617. Ann. Math. Testing Statistical Hypotheses, John Fiiley, New Probability Theory, D. Van Nostrand, Princeton.
© Copyright 2025 Paperzz