4.2 Sampling Distributions Associated with Normal Populations Ulrich Hoensch Monday, March 8, 2010 Linear Combinations of Normal Distributions Theorem 4.2.1 Let X1 , X2 , . . . , Xn be independent random variables with Xi ∼ P N(µi , σi2 ). Let Y be a linear combination of the Xi ’s, that is: Y = ni=1 ai Xi with real constants Pn ai . Then, Y has a normal distribution with mean µ = Y i=1 ai µi and variance P σY2 = ni=1 ai2 σi2 . Corollary 4.2.2 Let X1 , X2 , . . . , Xn be a random sample of size n from a normally distributed population with mean µ and variance σ 2 . Then, X σ2 X ∼ N µ, , and X ∼ N nµ, nσ 2 . n Remark. This corollary asserts that if the population has a normal distribution, the statements of the Central Limit Theorem are true for any sample size, not only for “n large.” Chi-Square Distributions Theorem 4.2.3 Let X1 , X2 , . . . , Xk be independent Pkrandom variables with 2 Xi ∼ χ (ni ). Then the sum V = i=1 Xi has the distribution V ∼ χ2 (n1 + n2 + . . . + nk ). Theorem 4.2.4 Let X1 and X2 be independent random variables, and suppose X1 ∼ χ2 (n1 ) and Y = X1 + X2 ∼ χ2 (n) with n > n1 , Then, X2 = Y − X1 ∼ χ2 (n − n1 ). Remarks. I Recall that for X ∼ χ2 (n), µ = E (X ) = n, σ 2 = Var (X ) = 2n, and MX (t) = (1 − 2t)−n/2 . I X ∼ χ2 (n) is a special case of a gamma distribution with α = n/2 and β = 2. Chi-Square Distributions Theorem 4.2.5 If X has a gamma distribution with parameters α > 0 and β > 0, then 2X ∼ χ2 (2α). Y = β Theorem 4.2.6 If X ∼ N(0, 1), then X 2 ∼ χ2 (1). Theorem 4.2.7 Let X1 , X2 , . . . , Xn be a random sample of size n from a normally distributed population with mean µ and variance σ 2 . Then, Zi = (Xi − µ)/σ are independent N(0, 1)-distributed random variables, and n n X X Xi − µ 2 2 ∼ χ2 (n). Zi = σ i=1 i=1 Distribution of the Sample Variance Theorem 4.2.8 Let X1 , X2 , . . . , Xn be a random sample of size n from a normally distributed population with mean µ and variance σ 2 . Let P S 2 = (1/(n − 1)) ni=1 (X − X )2 be the sample variance. Then, (n − 1)S 2 ∼ χ2 (n − 1). σ2 (b) X and S 2 are independent random variables. (a) Remarks. I Corollary 4.2.2 and Theorem 4.2.8 give us information about the distribution of the sample mean and the sample variance of a random sample taken from a normally distributed population. I We can use χ2 cdf (a, b, n) to find the probability that a chi-square distributed random variable with n degrees of freedom is between a and b. Example Suppose X1 , X2 , . . . , X10 is a random sample taken from an N(12, 25)-distributed population. I Find the range for the middle 90% of the distribution of the sample mean X . Example I Find the probability that the sample variance S 2 is between 20 and 30. Example I Find the range for the middle 90% of the distribution of the sample variance. Student t-Distribution Definition 4.2.2 2 Suppose Y and Z are independent p random variables, Y ∼ χ (n) and Z ∼ N(0, 1). Then, T = Z / Y /n has a (Student) t-distribution with n degrees of freedom. We write T ∼ t(n) in this situation. Remarks. I The PDF of T ∼ t(n) is symmetric about the line t = 0 and equal to Γ((n + 1)/2) f (t) = √ πnΓ(n/2) −(n+1)/2 t2 1+ . n I If T ∼ t(n), then E (T ) = 0 and Var (T ) = n/(n − 2). I For n ≥ 30, we have that a t(n)-distribution can be well-approximated by a N(0, 1)-distribution. Student t-Distribution n=2 n=5 n=10 n=30 -4 -2 0 The PDF of T ∼ t(n) for n = 2, 5, 10, 30. 2 4 Student t-Distribution Theorem 4.2.9 Suppose X is the mean and S 2 is the variance of a random sample of size n taken from a normally distributed population with mean µ and variance σ 2 . Then the random variable T = X −µ √ ∼ t(n − 1). S/ n Proof: By Corollary 4.2.2, X ∼ N(µ, σ 2 /n), so √ Z = (X − µ)/(σ/ n) ∼ N(0, 1). By Theorem 4.2.8, Y = (n − 1)S 2 /σ 2 ∼ χ2 (n − 1). Since X and S 2 are independent by Theorem p 4.2.8, so are Z and Y . Thus, by Definition 4.2.2, T = Z / Y /(n − 1) ∼ t(n − 1). But now X −µ √ σ/ n X −µ √ σ/ n X −µ √ . T =p =q = = 2 (n−1)S S/σ S/ n Y /(n − 1) 2 Z σ (n−1) Example A random sample of 20 soda can yields an average of 235 ml, with a sample standard deviation of 10 ml. Assuming that the volume of soda in the cans is normally distributed and that the population mean is 240 ml, find the probability that the sample mean is equal to or less than the observed value. F -Distribution Definition 4.2.3 Suppose U and V are independent random variables, U ∼ χ2 (n1 ) and V ∼ χ2 (n2 ). Then, F = (U/n1 )/(V /n2 ) has a (Fisher) F -distribution with n1 numerator and n2 denominator degrees of freedom. We write F ∼ F (n1 , n2 ) in this situation. Remarks. I The PDF of F ∼ F (n1 , n2 ) is zero if x ≤ 0 and for x > 0 it is equal to Γ((n1 + n2 )/2) f (x) = Γ(n1 /2)Γ(n2 /2) I n1 n2 n1 /2 x (n1 /2)−1 n1 1+ x n2 −(n1 +n2 )/2 The F -distribution is skewed to the right, and if Fα (n1 , n2 ) is chosen so that P(F > Fα (n1 , n2 )) = α, then Fα (n1 , n2 )F1−α (n2 , n1 ) = 1. . F -Distribution 0 1 2 3 4 5 The PDF of F ∼ F (10, 5). From Table AIV.5, we find that F0.1 (10, 5) ≈ 3.30. The shaded area equals 10%. (You can verify that Fcdf (3.30, 1E 99, 10, 5) ≈ 0.1.) F -Distribution Theorem 4.2.10 Suppose a random sample of size n1 is drawn from a normally distributed population with variance σ12 , and (independently) another random sample of size n2 is drawn from a normally distributed population with variance σ22 . If S12 is the variance of the first sample, and S22 is the variance of the second sample, then F = S12 /σ12 ∼ F (n1 − 1, n2 − 1). S22 /σ22 Proof: By Theorem 4.2.8, U = (n1 − 1)S12 /σ12 ∼ χ2 (n1 − 1) and V = (n2 − 1)S22 /σ22 ∼ χ2 (n2 − 1). Note that U and V are independent. By Definition 4.2.3, F = (U/(n1 − 1))/(V /(n2 − 1)) ∼ F (n1 − 1, n2 − 1). But now F = S 2 /σ 2 U/(n1 − 1) = 12 12 . V /(n2 − 1) S2 /σ2 Example Two random samples are taken from two different normally distributed populations. It is assumed that these populations have the same variance σ 2 = σ12 = σ22 . Suppose the size of the first sample is n1 = 20 and the size of the second sample is n2 = 30. I Find the probability that the ratio of the sample variances S12 /S22 is between 0.8 and 1.2. Example I Find two values a and b so that P(a ≤ S12 /S22 ≤ b) = 0.90. Homework Problems for Section 4.2 (Points) p.204-206: 4.2.1 (3), 4.2.7 (2), 4.2.8 (2), 4.2.10 (3), 4.2.11 (2), 4.2.19 (3). Homework problems are due at the beginning of the class on Wednesday, March 17.
© Copyright 2025 Paperzz