STAT 1342 - Exam One (Test Form A) 1-3. Consider a sample of 26 guinea pigs. We counted the pigs in six weight ranges and record them in the following table. Weight counts range 0.91-1.00kg 5 1.01-1.10kg 12 1.11-1.20kg 5 1.21-1.30kg 2 1.31-1.40kg 1 1.41-1.50kg 1 1. The sample mode weight is in the category of____kg. (a)1.01-1.10 (b) 1.11-1.20 (c) 1.21-1.30 (d) none of above Highest frequency is between 1.01-1.10. 2. If the heaviest guinea pig were recorded as 2.00 kg by mistake instead of the correct weight, the mode weight would be greater. (a) True (b) False If the weight of the heaviest pig is corrected then also the mode of the data will be same, because the mode lies between 1.01-1.10Kg. 3. Suppose that the Q1=1.03 kg and Q3=1.17 kg, about what percent of pigs are weighed between 1.03kg and 1.17 kg? (5 points) (a) 75 % (b) 68% (c) 50% (d) 25% We know, between Q1 and Q3 there are 50% of the observations. 4-5. For a Physics course containing 10 students, the maximum point total for the quarter was 200. The point totals for the 10 students are given in the following stem plot. Stem Leaf 11 7 8 12 1 4 8 13 0 7 14 2 6 15 16 17 8 4. For the above number set calculate the five number summary Min, Q1, Median, Q2, Maximum (a) 116, 121 , 129, 142, 179 (b) 117, 121, 129, 142, 178 (c) 116, 121, 133.50, 142, 179 (d)117, 120.25, 133.50, 143.5, 178 1 5. This distribution of sample scores is best described as ________. (a) symmetric about its mean (b) skewed to the right (c) skewed to the left (d) None of above The rate at which the frequency increased is more than the rate at which the frequency decreased. This is evident from the stem-leaf plot. 6. To rate TV shows, phone surveys are sometimes used. Such a survey might record several variables, some of which are listed below. Which of these variables is categorical? (a) the number of persons watching the show (b) the ages of all persons watching the show (c) the number of times the show has been watched in the last month (d) the name of the show (if any) being watched Categorical or qualitative variable, is basically the odd-man-out problem. 7. A survey is conducted on students taking a statistics class. Several variables are measured in the survey. Which of these variables listed below is quantitative? (a) the number of credit hours taken during the quarter (b) the gender (c) the parents name (d) the state of the High School Quantitative variable, again is a odd-man-out problem. 8-10. Scores on a University exam have a mean of 80 and a standard deviation of 7. 8. Atleast what percentage of students have scores between 66 and 94? (a) 68% (b) 75% (c) 95% (d) 90% Chebyshev’s rule, because we don’t know anything about the distribution of data. 9. If the distribution of data is symmetric, about 95% data will fall within what two values? (a) (73, 87) (b) (60,100) (c) (66,94) (d) (68,95) Emperical rule, because we know that the distribution of data is symmetric. 10. If the distribution of data is symmetric, about what percent of data will be between 2 standard deviations from the mean. (a) 68% (b) 75% (c) 95% (d) 90% Emperical rule: Between mean -2*SD and mean + 2*SD we can get 95% of the observations. 2 11-12. Consider the following scatterplot of the weight of several models of cars (in pounds) vs. their horsepower. 300 Horsepower 250 200 150 100 50 2000 2500 3000 Weight 3500 4000 4500 11. A plausible value for the correlation between weight and mpg is (a) -0.8 (b) -0.1 (c) +0.1 (d) +0.8 12. Correlation between height and weight will be decreased if we measure the weight using kilograms instead of pounds. (a)True (b) False Correlation is unitless. 13-18. We want to investigate if the average age at which infants begin to crawl (y) can be predicted from the average outdoor temperature (x) six months after birth when they are likely to begin crawling. We decide to fit a least-squares regression line to the data with x as the explanatory variable and y as the response variable. We compute the following quantities. r = correlation between x and y = 0.9 x = mean of the values of x = 55.85 y = mean of the values of y = 30.25 sx = standard deviation of the values of x = 15.25 sy = standard deviation of the values of y = 2.78 13-18 Use of formulas (see my notes titled Regression) 13. The slope of the least-squares line is? (a) 0.16 (b) -.99 (c) 0.11 (d) 4.13 14. Y-intercept of the least squares line is? (a) 21.09 (b) 24.67 (c) 36.83 (d) 51.87 3 15. Equation of the line is? (a) Y=36.83-0.11X (b) Y=21.09-0.16X (c) Y=36.83+4.1X (d) Y=21.09+0.16X 16. Predict crawl age at room temperature of 50oC? (a) 12.38 (b) 29.09 (c) 8.32 (d) 11.33 17. Suppose the crawl age at room temperature of 50oC is 28, then, calculate the residual. (a) 3.33 (b) -3.33 (c) 1.09 (d) -1.09 18. What is the coefficient of determination? (a) 90% (b) 81% (c) 1.09 (d) none of above 19-20. There are 4 red balls and 6 green balls in a jar. You draw two balls without replacement. 19. What is the probability of drawing a red ball then a green ball? 4 6 4 6 46 46 (a) (b) (c) (d) 10 9 10 10 10 9 20. What is the probability of drawing a red and a green ball (in any order)? 4 6 4 6 4 6 6 4 + (a) (b) 10 10 10 9 10 9 10 9 46 46 4 6 6 4 (c) (d) 9 10 10 10 10 10 Please look at my notes titled “Probability-2”. 21-22. (Q. 22 is a bonus question and contains 5 points) All human blood can be typed as one of O, A, B, or AB. The distribution of the types varies a bit with race. Choose a black American at random. Here are the approximate probabilities that the person you choose will have blood type O, B, or AB. Blood Type O A B AB Probability 0.40 ? 0.32 0.08 21. The probability that the person chosen has a blood type other than O is (a) 0.3 (b) 0.6 (c) 0.5 (d) 0.4 Here you need to find the probability of the compliment of an event. P( choosing a person with blood type O) = 0.4 So, P( choosing a person with blood type other than O) = 1 - 0.4 = 0.6 4 22. An a group of 2 randomly chosen students, what is the probability that none of them is Type O blood (a) 0.6×0.6 (b) 0.6×0.4 (c) 0.4×0.4 (d) None of these Hints: This problem is similar to drawing marbles with replacement. So, required probability = = P (First person don’t have blood type O) × P(Second person don’t have blood type O) 23-26. Diagnostic tests of medical conditions can have several types of results. Test results can be either positive (TR+) or negative (TR-). A TR+ indicates that the patient has the condition and TR- indicates that the patient does not have the consition. (Note: A TR+ does not PROVE that the patient has the condition. To understand the effectiveness of the diagnostic test 200 persons were selected. Based on this answer the following: CP: The condition is present CA: The condition is absent CP CA Total TR+ 100 30 130 TR- 20 50 70 Total 120 80 200 For a person selected at random, compute: 23. P(TR+|CP) (a) 100/200 (b) 100/130 (c) 100/120 (d) 30/80 24.P(TR+|CA) (a) 100/200 (b) 30/130 (c) 100/120 (d) 30/80 25.P(TR-|CP) (a) 20/120 (b) 20/70 (c) 50/80 26. P(CP) (a) 20/120 (b) 30/130 (c) 120/200 (d) 50/70 (d) 80/200 Similar (only numbers changed) to Question 26, chapter 5 (page 203). 5
© Copyright 2024 Paperzz