Math 138 Final Exam Review Although there are many problems on

Math 138 Final Exam Review
Although there are many problems on this review, it does not fully cover all the material in
MATH 138. For additional review problems, review your homework problems, projects, exams,
and other classroom handouts.
Unit 1
1) Classify the variables as categorical or quantitative:
a) name
b) employee ID number
c) annual income
2) A survey of patients at a hospital classified the patients by gender and blood type, as seen in
the table.
Gender
Male Female
Blood A
105
93
type
B
98
84
O
160
145
AB
15
18
a) What percentage of the patients with type-B blood are male?
b) What percentage of the female patients have type-O blood?
c) What percentage of the patients are male and have type-A blood?
d) What percentage of the patients are female or have type-O blood?
e) Give the conditional distribution of blood type for the males
f) Give the conditional distribution of blood type for the females
g) Are blood type and gender independent?
3) The number of days off that 30 police detectives took in a given year are provided below.
Create a histogram of the data. (You can do this by hand or using technology as long as the
bins are reasonable and you can reproduce an accurate sketch).
10
5
5
1
1
4
3
0
1
5
9
7
4
11
7
7
1
11
0
5
1
6
7
5
6
10
6
1
1
0
4) Describe what these boxplots tell you about the relationship between fuel efficiency and the
number of cylinders an engine has.
5) The stem-and –leaf diagram shows the ages of males playing basketball at a public gym over
the course of a day. Describe the shape, center, spread, and unusual features of the
distribution.
4 8 9
4 0 1 2 3
3 6 6 8 8 9
3 0 0 0 1 4 4
2 6 7 9 9 9
2
1 5 5 5 5 6 6 6 6 6 6 7 7 7
1 2 3 4 4 4 4
0
0
6) The volumes of soda in quart soda bottles can be described by a Normal model with a mean
of 32.3 oz and a standard deviation of 1.2 oz.
a) What percentage of bottles can we expect to have a volume less than 32 oz?
b) 5% of bottles have a volume smaller than what amount?
7) The ages of the 21 members of a track and field team are listed below.
15
24
28
18
24
28
18
25
30
19
25
32
22
26
33
23
26
40
24
27
42
a) Report the 5-Number Summary for the data
b) Create a boxplot for the data. Use fences to identify potential outliers, if there are any.
c) Find the mean and the standard deviation for the data
d) What is the z-score for the age of the team member who is 40 years old?
e) Interpret the meaning of this z-score.
8) The distances traveled to work in miles by the employees at a large company are normally
distributed with the mean of 35 ml. and standard deviation of 10 ml.
a) What is the z score of an employee who travels 40 miles?
b) What percent of employees travel more than 45miles ?
c) Find the 90th percentile of the travel distances
d) In what interval do the middle 70% of travel distances fall?
9) Match the given correlation coefficients with the scatter plots.
10) The attendance at Camden Yards during the 12 years from 2000 to 2011 is given as follows:
(Source: http://www.ballparksofbaseball.com/attendance.htm)
Year
Attendance
a) Find the equation of the regression line.
2000
3,296,031
2001
3,094,841
b) Interpret the slope in context.
2002
2,682,439
2003
2,454,523
c) Find and interpret the residual for 2008
2004
2,744,013
d) Overall, is this a good linear fit? Explain.
2005
2,624,804
2006
2,153,150
e) Is it feasible to use this model to predict attendance in 2014?
2007
2,164,822
Why or why not?
2008
1,950,075
2009
1,907,163
f) What is the predicted attendance for 2014? (The actual
2010
1,733,018
attendance was 2,102,240).
2011
1,755,461
11) The March 2000 Consumer Reports compared various brands of supermarket enchiladas in
cost and sodium content. Use the scatterplot and regression analysis to answer the questions.
Fitted Line Plot
Sodium content (mg) = 2185 - 607.0 Cost (per serving)
1750
S
R-Sq
R-Sq(adj)
Sodium content (mg)
1500
250.702
77.3%
74.0%
1250
1000
750
500
1.0
1.5
2.0
Cost (per serving)
2.5
3.0
a) Use the scatterplot above to describe the relationship between Cost and Sodium Content.
b) What is the correlation coefficient for the relationship between cost and sodium content?
c) How much sodium would you expect if the cost is $2.90?
Unit 2
12) A real estate company kept a database on the apartments in a certain city. The percentages of
various types of apartments are listed below.
Number of Percent
bedrooms
0 (Studio)
15.9
1
25.5
2
45.8
3
10.1
a) Are events represented in this table disjoint?
b) What is the probability that a randomly selected apartment in this city is a 1-bedroom or
2-bedroom apartment?
c) What is the expected value for the number of bedrooms that an apartment will have?
13) Explain the difference between an experiment and an observational study
14) Of the coffee makers sold in an appliance store, 6.0% have either a faulty switch or a
defective cord, 2.0% have a faulty switch, and 0.8% have both defects. What percent of the
coffee makers will have a defective cord?
15) Suppose a computer chip manufacturer rejects 15% of the chips produced because they fail
presale testing.
a) What is the probability that the first chip fails and the second chip passes presale testing?
b) If you test 4 chips, what is the probability that not all of the chips fail?
16) Suppose that on any given day, there is a 65% chance of it being sunny, a 10% chance of
rain, and a 25% chance of being cloudy.
a) Clearly explain how you would use the random numbers from 1 to 100 to conduct a
simulation to model this situation
b) If you were asked to find the probability that it will sunny at least two days in a week,
clearly state what the response variable would be.
c) For each trial below, fill in the resulting outcome
Trial #
Random Numbers
1
2
3
4
5
6
65 91 5 72 41 69 48
90 82 93 74 63 67 71
34 35 56 83 42 4 95
91 42 18 37 81 85 67
80 57 63 38 76 35 12
63 84 65 36 82 89 68
Number of Sunny
Days
Response Variable
d) Using your simulation, estimate the probability that it will be sunny at least two days in a
week.
e) Using your simulation, how many sunny days would you estimate per week?
17) Assume that 11% of people are left-handed. If we select 10 people at random, find the
probability that
a) Exactly 3 are left-handed.
b) At most 2 are left-handed
c) At least 2 are left-handed
d) Between 1 and 4 people are left-handed (inclusive)
e) Explain how you know that the Binomial model applies to this situation
f) What is the mean and the standard deviation of the number of left-handed people in a
group of 10?
18) Suppose that in a given suburb 60% of the houses have garages, 40% have decks, and 30% of
the houses have both.
a) What is the probability that a randomly selected house has a garage or a deck?
b) What is the probability that a randomly selected house will have neither a garage nor a
deck?
c) What the probability that a randomly selected house will have a deck but not a garage?
d) What is the probability that a randomly selected house will have a deck if it is known that
the house has a garage?
e) Are the events of a house having a garage and a house having a deck independent?
Justify your answer with a probability test.
19) In a certain college, 33% of the physics majors belong to ethnic minorities. If 10 students are
selected at random from the physics majors, what is the probability that no more than 6
belong to an ethnic minority?
20) Suppose you buy 1 ticket for $1 out of a lottery of 100 tickets where the prize for the one
winning ticket is to be $50. What is your expected value?
21) A tax auditor has a pile of 191 tax returns of which he would like to select 17 for a special
audit. Describe a method for selecting the sample which involves
a) systematic sampling
b) stratified sampling
c) simple random sampling
22) At a college there are 120 freshmen, 90 sophomores, 110 juniors, and 80 seniors. A school
administrator selects a random sample of 12 of the freshmen, 9 of the sophomores, 11 of the
juniors and 8 of the seniors. She then interviews all the students selected. Identify the type
of sampling used in this example.
23) A car insurance company is interested in the association between age and the frequency of
car accidents. They obtained the following sample data.
Number of
accidents in
past 3 years
0
1
More than 1
total
Age Group
Under 25 25-45 Over 45
74
89
82
18
8
12
8
3
6
100
100
100
total
245
38
17
300
a) What is the probability that a randomly selected participant was under age 25?
b) What is the probability that a randomly selected participant was under age 25 or in more
than 1 car accident?
c) What is the probability that a randomly selected participant was over age 45 and in 0 car
accidents?
d) What is the probability that a randomly selected participant was over age 45 if it is known
that they were in 0 car accidents?
e) Do the events of being over 45 and having had 0 accidents in the past 3 years appear to
be independent? Justify your answer using probabilities.
24) Suppose that in a given court system 20% of defendants are truly innocent and 80% of
defendants are truly guilty. Defendants are given the choice to plead innocent or plead
guilty. Suppose that defendants who are innocent plead innocent 99% of the time and that
defendants who are guilty plead innocent 70% of the time.
a) Find the probability that a defendant is guilty and pleads guilty
b) Find the probability that a defendant is innocent and pleads innocent
c) Find the probability that a defendant pleads innocent
d) Find the probability that a defendant who pleads innocent is actually innocent
Unit 3
25) Assume that 25% of students at a university wear contact lenses. We randomly select 200
students.
a) What is the mean and standard deviation of the proportion of students in this group who
may wear contact lenses?
b) What is the probability that we observe a sample proportion (𝑝̂ ) of 30% or more students
who wear contact lenses?
26) The number of hours per week that high school seniors spend on computers is normally
distributed, with a mean of 4 hours and a standard deviation of 2 hours. 60 students are
chosen at random. Let y be the mean number of hours spent on the computer for this group.
Find the probability that y is between 4.2 and 4.4 hours.
27) A researcher wishes to estimate the proportion of fish in a certain lake that is inedible due to
pollution of the lake. How large a sample should be tested in order to be 99% confident that
the true proportion of inedible fish is estimated to within 6%?
28) A mayoral election race is tightly contested. In a random sample of 2200 likely voters, 1144
said that they were planning to vote for the current mayor. Based on a 95% confidence
interval, would you claim that the mayor will win a majority of the votes? Explain.
29) 7 of 8,500 people vaccinated against a certain disease later developed the disease. 18 of
10,000 people vaccinated with a placebo later developed the disease. Test the claim that the
vaccine is effective in lowering the incidence of the disease. Use a significance level of 0.02.
30) Suppose the proportion of sophomores at a particular college who purchased used textbooks
in the past year is p s and the proportion of freshmen at the college who purchased used
textbooks in the past year is p f . A study found a 95% confidence interval for ps  p f is
0.235,0.427 .
Does this interval suggest that sophomores are more likely than freshmen to
buy used textbooks? Explain.
31) A skeptical paranormal researcher claims that the proportion of Americans that have seen a
UFO, p, is less than 4%. He surveys 500 randomly selected Americans and finds that 11 of
them claim to have seen a UFO.
a) What type of test would be appropriate
b) Write the hypotheses
c) Check the assumptions and conditions
d) Conduct the hypothesis test. Report the test statistic and p-value, sketch the curve, and
make an appropriate conclusion.
e) What type of error might have occurred?
32) A police officer pulls over an individual that was driving recklessly. The police office is
trying to determine if the individual should be arrested for driving under the influence of
alcohol. Given Null and alternative hypotheses below, identify the type of error:
Ho: The individual is not driving under the influence of alcohol.
Ha: The individual is driving under the influence alcohol.
a) The police officer determines that the individual should be arrested and is driving under
the influence of alcohol when the individual is not driving under the influence of alcohol.
b) The police officer determines that the individual should not be arrested and is not driving
under the influence of alcohol when the individual is under the influence of alcohol.
33) In the past, the mean running time for a certain type of flashlight battery has been 8.5 hours.
That manufacturer has introduced a change in the production method and wants to perform a
hypothesis test to determine whether the mean running time has increased as a result. He
samples 30 newly produced batteries and find a sample mean of 9.2 hours with a standard
deviation of 0.6 hours.
a) What type of test would be appropriate?
b) Write the hypotheses:
c) Conduct the test (report the test statistic, p-value, sketch the curve, and write an
appropriate conclusion). You can assume that all assumptions and conditions have been
met.
34) Using the data below and a 0.05 significance level, test the claim that the responses occur
with percentages of 15%, 20%, 25%, 25%, and 15% respectively.
Response
A
B
C
D
E
Frequency
12 15 16 18 19
35) A coach uses a new technique to train gymnasts. 7 gymnasts were randomly selected and
their competition scores were recorded before and after the training. The results are shown
below.
Subject A
B
C
D
E
F
G
Before
9.4 9.5 9.6 9.6 9.4 9.6 9.6
After
9.5 9.7 9.6 9.5 9.5 9.9 9.4
Do the data suggest that the training technique is effective in raising the gymnasts’ scores?
Perform a hypothesis test at the 5% significance level.
36) A laboratory tested twelve chicken eggs and found that the mean amount of cholesterol was
240 milligrams with s  19.8 milligrams.
a) Construct a 95% confidence interval for the true mean cholesterol content of all such
eggs.
b) Interpret this confidence interval.
37) Suppose you have obtained a confidence interval for  , but wish to obtain a greater degree of
precision. Which of the following would result in a narrower confidence interval?
a)
b)
c)
d)
Increasing the sample size while keeping the confidence level fixed
Decreasing the sample size while keeping the confidence level fixed
Increasing the confidence level while keeping the sample size fixed
Decreasing the confidence level while keeping the sample size fixed
38) A car insurance company performed a study to determine whether an association exists
between age and the frequency of car accidents. They obtained the following sample data.
Perform a test to see if there is an association between age and frequency of car accidents.
  0.05
Age Group
Under 25 25-45 0ver 45 total
Number of
0
74
89
82
245
accidents in
1
18
8
12
38
past 3 years
More than 1
8
3
6
17
total
100
100
100
300