continuous random variable X

1
Chapter 6 Notes AP STATS
Name_________________________________
Introduction –
Station Number -
Bottled Water cup? --
Looking at the class data:
1)
2)
3)
4)
How many students in class guessed correctly?
What percent of the class is this?
What’s the probability that an individual student would guess correctly?
How many correct identifications would you need to see to be convinced that the students in your class aren’t
just guessing?
5) Design and carry out a simulation to answer this question. What do you conclude about your classes ability to
distinguish tap water from bottled water?
6.1 Discrete and Continuous Random Variables
Definition:
A random variable takes numerical values that describe the outcomes of some chance process. The probability
distribution of a random variable gives its possible values and their probabilities.
Suppose we toss a fair coin three times. The
Sample space is
HHH HHT HTH THH HTT THT TTH TTT
Let X = The number of heads obtained
X=0 X=1 X=2 X=3
Probability Distribution of X:
What is the probability that we get at least one head in three tosses of a coin?
P(X > 1) =
Interpret P(x > 1) in context. This probability means that if we were to repeatedly flip 3 coins and record the number of
heads, then in the long-run about _____% of the time we will get 1 or more heads.
There are two main types of random variables, discrete and continuous!
2
Discrete Random Variables and Their Probability Distributions
Example: APGAR SCORES: BABIES HEALTH AT BIRTH Discrete random variables
In 1952, Dr. Virginia Apgar suggested five criteria for measuring a baby’s health at birth: skin color, heart rate, muscle
tone, breathing, and response when stimulated. She developed a 0-1-2 scale to rate a newborn on each of the five
criteria. A baby’s Apgar score is the sum of the rating on each of the five scales, which gives a whole-number vale from 0
to 10. Apgar scores are still used today to evaluate the health of newborns.
What Apgar scores are typical? To find out, researchers recorded the Apgar scores of over 2 million newborn babies in a
single year. Imagine selecting one of these newborns at random. Define the random variable X = Apgar score of a
randomly selected baby one minute after birth. The table below gives the probability distribution for X.
(a) Show that the probability distribution for X is legitimate.
(b) Make a histogram of the probability distribution. Describe what you see.
(c) Apgar scores of 7 or higher indicate a healthy baby. What is P(X ≥ 7)?
3
1. TOSS 4 TIMES. Suppose you toss a fair coin 4 times. Let X = the number of heads you get.
a) Find the probability distribution of X.
b) Make a histogram of the probability distribution. Describe what you see.
c) Find P(X < 3) and interpret the result.
5. BENFORD’s LAW. Faked numbers in tax returns, invoices, or expense account claims often display patterns that
aren’t present in legitimate records. Some patterns, like too many round numbers, are obvious and easily avoided by a
clever crook. Others are more subtle. It is a striking fact that the first digits of numbers in legitimate records often
follow a model known as Benford’s law. Call the first digit of a randomly chosen record X for short. Benford’s law gives
this probability model for X. (note that the first digit cannot be 0):
First Digit X: 1
2
3
4
5
6
7
8
9
Probability 0.301 0.176 0.125 0.097 0.079 0.067 0.058 0.051 0.046
a) Show that this is a legitimate probability distribution
b) Make a histogram of the probability distribution. Describe what you see.
c) Describe the event X > 6 in words. What is P(X> 6)
d) Express the event “the first digit is at most 5” in terms of X. What is the probability of this event?
4
7. Refer to problem above. The first digit of a randomly chosen expense account claim follows Benford’s law. Consider
the events A = First digit is 7 or greater and B = first digit is odd.
a) What outcomes make up the event A? What is P(A)?
b) What outcomes make up the event B? What is P(B)?
c) What outcomes make up the event “A or B”? What is P(A or B)? Why is this probability not equal to P(A)+P(B)?
Mean (Expected Value) of a Discrete Random Variable
Consider the random variable X = Apgar Score
Compute the mean of the random variable X and interpret it in context.
Value:
0
1
2
3
4
5
6
7
8
9
10
Probability:
0.001
0.006
0.007
0.008
0.012
0.020
0.038
0.099
0.319
0.437
0.053
5
9. Keno. Keno is a favorite game in casinos, and similar games are popular with the states that operate lotteries. Balls
numbered 1 to 80 are tumbled in a machine as the bets are placed, then 20 of the balls are chosen at random. Players
select numbers by marking a card. The simplest of the many wagers available is "mark 1 Number." your payoff is $3 on a
$1 bet if the number you select is one of those chosen. Because 20 of 80 numbers are chosen, your probability of
winning is 20/80 or 0.25. Let X = the amount you gain on a simple play of the game.
a) Make a table that shows the probability distribution of X.
b) Computer the expected value of X. Explain what this result means for the player.
13. Benford's law and fraud. A not so clever employee decided to fake his monthly expense report. He believed that
the first digits of his expense accounts should be equally likely to be any of the number from 1 to 9. In that case, the
first digits, Y, of a randomly selected expense amount would have the probability distribution shown in the histogram.
a) Explain why the mean of the random variable Y is located at the solid line in the diagram.
b) The first digits of randomly selected expense amounts actually follow Benford's law. What's the expected value of the
first digit? Explain how this information could be sued to detect a fake expense account.
First Digit X: 1
2
3
4
5
6
7
8
9
Probability 0.301 0.176 0.125 0.097 0.079 0.067 0.058 0.051 0.046
c) What's P(Y >6)? According to Benford's law, what proportion of first digits in the employee's expense amounts should
be greater than 6? how could this information be sued to detect a fake expense report?
6
HOMEWORK PROBLEM #14
14. Life Insurance. A life insurance company sells a term insurance policy to a 21 year old male that pays $100,000 if
the insured dies within the next 5 years. The probability that a randomly chosen male will die each year can be found in
mortality tables. The company collects a premium of $250 each year as payment for the insurance. The amount Y that
the company earns on this policy is $250.00 per year, less the $100,000 that is must pay if the insured dies. Here is a
partially competed table that shows information about risk of mortality and the values of Y = profit earned by the
company:
A) Fill in the missing values of Y.
b) Find the missing probability.
c) calculate mean µx. Interpret this value in context.
Day 2 Standard Deviation and Variance. Continuous Random Variables
Recall: Variance -- an average of the standard deviation of the vales of the variable X from its mean µx
Recall: standard Deviation is the Square Root of the variance. -- the measure of how much the values of the variable X
tend to vary, on average, from the mean.
Example: Consider the random variable X = Apgar Score
Compute the standard deviation of the random variable X and interpret it in context.
Value:
0
1
2
3
4
5
6
7
8
9
10
Probability:
0.001
0.006
0.007
0.008
0.012
0.020
0.038
0.099
0.319
0.437
0.053
7
18. It would be quite risky for you to insure the life of a 21 year old friend under the terms of Exercise 14. There is a
high probability that your friend would live and you would gain $1250 in premiums. But if he were to die, you would
lose almost $100,000. Explain carefully why selling insurance is not risky for an insurance company that insures many
thousands of 21 year old men.
b) The risk of an investment is often measured by the standard deviation of the return on the investment. The more
variable the return is, the riskier the investment. We can measure the great risk of insuring a single person's life in
Exercise 14 by computing the standard deviation of the income Y that the insurer will receive. Find σY using the
distribution and mean found in Exercise 14.
Common AP ERROR:
Students often lose credit for not showing adequate work -or for not showing any work at all. To get full credit, students
need to go beyond just reporting commands used on a
graphing calculator. In this case, showing the first couple of
terms with an ellipsis (...) for the calculation of the mean or
standard deviation is sufficient.
Calculator steps to find Expected Mean and
Standard Deviation
19. Housing in San Jose. How do rented housing units differ from units occupied by their owners? Here are the
distributions of the number of rooms for owner occupied units and renter-occupied units in San Jose, California?
Number of Rooms
1
2
3
4
5
6
7
8
9
10
Owned 0.003 0.002 0.023 0.104 0.210 0.224 0.197 0.149 0.053 0.035
Rented 0.008 0.027 0.287 0.363 0.164 0.093 0.039 0.013 0.003 0.003
8
Let X = the number of rooms in a randomly selected owner-occupied unit and Y - the number of rooms in a randomly
chosen center-occupied unit.
a) Make histograms suitable for comparing the probability distribution of X and Y. Describe any differences you that you
observe.
b) Find the mean number of rooms for both types of housing unit. Explain why this difference makes sense.
c) Find the standard deviation of both X and Y. Explain why this difference makes sense.
Example: The heights of young women closely follow the Normal distribution with mean µ = 64 inches and standard
deviation σ = 2.7 inches. This is a distribution for a large set of data. Now choose one young woman at random. call
her height Y. What is the probability that a randomly chosen young woman has height between 68 and 70 inches?
9
Exam Tip: When you solve problems involving random variables, start by defining the random variable of interest.
For example, let X = the Apgar score of a randomly selected baby or let Y = the height of a randomly selected
young women. Then state the probability you are trying to find in terms of the random variable:P(68<Y<70) or
P(X>7)
Another Example: Weights of Three-Year-Old Females
The weights of three-year-old females closely follow a Normal distribution with a mean of  = 30.7 pounds and a
standard deviation of 3.6 pounds. Randomly choose one three-year-old female and call her weight X. Find the
probability that the randomly selected three-year-old female weighs at least 30 pounds.
Using your calculator:
23. ITBS Scores. The Normal distribution with mean µ = 6.8 and standard deviation σ = 1.6 is a good description of the
Iowa Test of Basic Skills (ITBS) vocabulary scores of seventh grade students in Gary, Indiana. call the score of a randomly
chosen student X for short. Find P(X>9) and interpret the result.
25. Did you vote? A sample survey contacted an SRS of 663 registered voters in Oregon shortly after an election and
asked respondents whether they had voted. Voter records shows that 56% of registered voters had actually voted. We
will see later than in repeated samples of size 663, the proportion in the sample who voted (V) will vary according to the
Normal distribution with mean μ = 0.56 and standard deviation σ = 0.019
a) If the respondents answer truthfully, what is P(0.52 < V< 0.60)? This is the probability that the sample proportion V
estimates the population proportion 0.56 within ± 0.04.
b) In fact, 72% of the respondents said they had voted (V= 0.72). if respondents answered truthfully, what is P(V>0.72)?
This probability is so small that it is good evidence that some people who did not vote claimed that they did vote.
10
Section 6.2 Linear Transformations
Review Questions 27 - 30.
Choose an American household at random and let the random variable X be the number of cars (including SUVs and
light trucks) they own. there is the probability model if we ignore the few households that own more than 5 cars.
Number of Cars X 0
1
2
3
4
5
Probability
0.09 0.36 0.35 0.13 0.05 0.02
27. A housing company builds houses with two car garages. What percent of the households have more cars than the
garage can hold?
A. 13%
B. 20%
C. 45%
D. 55%
E. 80%
28. What's the expected number of cars in a randomly selected American household?
A. Between 0 and 5
B. 1.00
C. 1.75
D. 1.84
E. 2.00
29. A deck of cards contains 52 cards, of which 4 are aces. You are offered the following wagers: Draw one card at
random from the deck. You win $10 if the card drawn is an ace. Otherwise you lose $1. If you make this wager very
many times, what will be the mean amount you win?
A. About -$1, because you will lose most of the time.
B. About $9, because you win $10 but lose only $1.
C. About -$0.15; that is, on average you lose about 15 cents
D. About $0.77; that is on average you win about 77₵
E. About $0, because the random draw gives you a fair bet.
30. The deck of 52 cards contains 13 hearts. Here is another wager: draw one card at random from the deck. If the card
drawn is a heart, you win $2. Otherwise you lose $1. Compare this wager (Call it Wager 2) with that of the pervious
exercise (call it Wager 2). Which one should you prefer?
A. Wager 1, because it has a higher expected value.
B. Wager 2, because it has a higher expected value.
C. Wager 1, because it has a higher probability of winning
D. Wager 2, because it has a higher probability of winning
E. Both wagers are equally favorable.
6.2 Linear Transformations
11
Note: Multiplying a random variable by a constant b multiplies the variance by b2.
Example:
El Dorado Community College considers a student to be full-time if he or she is taking between 12 and 18 units. The
number of units X that a randomly selected El Dorado Community College full-time student is taking in the fall semester
has the following distribution.
Number of Units (X) 12
13
14
15
16
17
18
Probability
0.25 0.10 0.05 0.30 0.10 0.05 0.15
Find µx = ______________ σx2 = _________ σx = _______
At El Dorado Community College, the tuition for full-time students is $50 per unit. That is, if T = tuition charge for a
randomly selected full-time student, T = 50X. Here is the probability distribution for T and a histogram of the probability
distribution:
Tuition Charge (T) 600 650 700 750 800 850 900
Probability
0.25 0.10 0.05 0.30 0.10 0.05 0.15
Find µx = ______________ σx2 = _________ σx = _______
Notice that



the shapes of both distributions are the same
the mean of the distribution of T is 50 times bigger than the mean of X: 732.5 = 50(14.65)
the standard deviation of T is 50 times bigger than the standard deviation of X: 103 = 50(2.06)
12
El Dorado Community College
In addition to tuition charges, each full-time student at El Dorado Community College is assessed student fees of $100
per semester. If C = overall cost for a randomly selected full-time student, C = 100 + T.
Here is the probability distribution for C and a histogram of the probability distribution:
Overall Cost (C)
Probability
700 750 800 850 900 950 1000
0.25 0.10 0.05 0.30 0.10 0.05 0.15
Find µx = ______________ σx2 = _________ σx = _______
Notice that



the shapes of both distributions are the same
the mean of the distribution of C is $100 larger than the mean of T: 832.5 = 100 + 732.5
the standard deviation of C is the same as the standard deviation of T
AP TEST COMMON ERROR:
When students are asked about the effect of a linear transformation on summary statistics, some students
forget that adding (subtracting) a constant to (from) every value in the distribution has no effect on measures of
spread, including the standard deviation, variance, range and interquartile range.
Scaling a Test
Problem: In a large introductory statistics class, the distribution of X = raw scores on a test was approximately normally
distributed with a mean of 17.2 and a standard deviation of 3.8. The professor decides to scale the scores by multiplying
the raw scores by 4 and adding 10.
(a) Define the variable Y to be the scaled score of a randomly selected student from this class. Find the mean and
standard deviation of Y.
(b) What is the probability that a randomly selected student has a scaled test score of at least 90?
13
37. Get on the boat! A small ferry runs every half hour from one side of a large river to the other. The number of cars
X on a randomly chosen ferry trip has the probability distribution shown below. You can check that μx = 3.87 and σx =
1.29
Cars
0
1
2
3
4
5
Probability 0.02 0.05 0.06 0.16 0.27 0.42
a) the cost for the ferry trip is $5. Make a graph of the probability distribution for the random variable M = money
collected on a randomly selected ferry trip. Describe its shape.
b) Find and interpret μx
c) Compute and interpret σx
39-40 Ms. Hall gave her class a 10 question multiple choice quiz. Let X = the number of questions that a randomly
selected student in the class answered correctly. The computer output below gives information about the probability
distribution of X. To determine each student's grade on the quiz (out of 100). Ms. Hall will multiply his or her number of
correct answers by 10. let G = the grade of a randomly chosen student in the class.
N
30
Mean
7.6
Median
8.5
St. Dev
1.32
Min
4
Max
10
Q1
8
Q3
9
39. a) Find the mean of G. Show your method
b) Find the standard deviation of G. Show your method.
c) How do the variance of X and the variance of G compare? Justify your answer.
40. a) Find the median of G. Show your method.
b) find the IQR of G. Show your method
c) What shape would the probability distribution of G have? Justify your answer.
41. Refer to exercise 37. The ferry company's expenses are $20 per trip. Define the random variable Y to be the
amount of profit (money collected minus expenses) made by the ferry company on a randomly selected trip. that is,
Y = M - 20.
a) How does the mean Y relate to the mean of M? Justify your answer. What is the practical importance of μY?
14
b) How does the standard deviation of Y relate to the standard deviation of M? Justify our answer. What is the practical
importance of σY?
43. Get on the boat. Based on the analysis of #41, the ferry company decides to increase the cost of a trip to $6. We
can calculate the company's profit Y on a randomly selected trip from the number of cars X. Find the mean and standard
deviation of Y. Show your work.
45. Too cool at the cabin. During the winter months, the temperatures at the Starneses' Colorado cabin can stay well
below freezing for weeks at a time. To prevent the pipes from freezing Mrs. Starnes sets the thermostat at 50° F. She
also buys a digital thermometer that records the indoor temperature each night at midnight. Unfortunately, the
thermometer is programmed to measure the temperature in °C. based on several years' worth of data, the temperature
T in the cabin at midnight follows a Normal Distribution with mean 8.5°C and standard deviation 2.25 °C.
a) Let Y = the temperature in the cabin at midnight on a randomly selected night in °F. F = (9/5)C + 32. Find the mean
and the standard deviation.
b) Find the probability that the midnight temperature in the cabin is below 40 °F. Show your work.
49. Review Independence: In which of the following games of chance would you be willing to assume independence of
X and Y in making a probability model? Explain your answer in each case.
a) In blackjack, you are dealt two cards and examine the total points X on the cards (face cards count 10 points). You can
choose to be dealt another card and compete based on the total points Y on all three cards.
b) In craps, the betting is based on successive rolls of two dice. X is the sum of the faces on the first roll, and Y is the sum
of the faces on the next roll.
15
6.2 Combining Random Variables
Example of mean of the Sum: How many total passengers can Pete and Erin expect on a randomly selected day?
If Pete expects µX = 3.75 and Erin expects µY = 3.10 , they will average a total of 3.75 + 3.10 = 6.85 passengers per trip.
Example of Variance of the Sum:
Let S = X + Y as before. Assume that X and Y are independent, which is reasonable since each student was
selected at random.
Here are all the possible combinations of X and Y:
X
12
12
12
13
13
13
14
14
14
15
15
15
16
16
16
17
17
17
18
18
18
P(X)
0.25
0.25
0.25
0.10
0.10
0.10
0.05
0.05
0.05
0.30
0.30
0.30
0.10
0.10
0.10
0.05
0.05
0.05
0.15
0.15
0.15
Y
12
15
18
12
15
18
12
15
18
12
15
18
12
15
18
12
15
18
12
15
18
P(Y)
0.3
0.4
0.3
0.3
0.4
0.3
0.3
0.4
0.3
0.3
0.4
0.3
0.3
0.4
0.3
0.3
0.4
0.3
0.3
0.4
0.3
S=X+Y
24
27
30
25
28
31
26
29
32
27
30
33
28
31
34
29
32
35
30
33
36
P(S)=P(X)P(Y)
0.075
0.10
0.075
0.03
0.04
0.03
0.015
0.02
0.015
0.09
0.12
0.09
0.03
0.04
0.03
0.015
0.02
0.015
0.045
0.06
0.045
Find μx = ___________
Find μy = ___________
Find μx+Y = ____________
Find σx = ____________
Find σy = ____________
Find σx+y = ____________
Find σX2 = ____________
Find σY2 = ____________
Find σX+Y2 = ____________
16
Example: Mr. Starnes likes between 8.5 and 9 grams of sugar in his hot tea. Suppose the amount of sugar in a randomly
selected packet follows a Normal distribution with mean 2.17 g and standard deviation 0.08 g. If Mr. Starnes selects 4
packets at random, what is the probability his tea will taste right?
Let X = the amount of sugar in a randomly selected packet. Then, T = X1 + X2 + X3 + X4. We want to find P(8.5 ≤ T ≤ 9).
Step 1: Find the mean amount of sugar in tea:
Step 2: Find the Standard Deviation of sugar in tea:
 T2   X2   X2   X2   X2  (0.08)2  (0.08)2  (0.08)2  (0.08)2  0.0256
1
2
3
4
 T  0.0256  0.16
Step 3: Find the z score for the sugar amount in his tea: z  8.5  8.68  1.13
0.16
and
z
9  8.68
 2.00
0.16
Step 4: Draw a normal distribution and shade in the appropriate area under the curve.

Step 5: calculate the probability using correct notation: P(-1.13 ≤ Z ≤ 2.00) = 0.8480
RECALL THAT IS NORMALCDF in calculator but we never write calculator talk on test!
Step 6: Conclude: There is about an 85% chance Mr. Starnes’s tea will taste right.
Practice Problem: Suppose that the weights of a certain variety of apples have weights that are approximately Normally
distributed with a mean of 9 ounces and a standard deviation of 1.5 ounces. If bags of apples are filled by randomly
selecting 12 apples, what is the probability that the sum of the weights of the 12 apples is less than 100 ounces?
17
Practice Problem #2: Have you ever been to a fast food restaurant and the lid didn't quite fit?
The diameter C of a randomly selected large drink cup at a fast food restaurant follows a normal distribution with a
mean of 3.96 inches and a standard deviation of 0.01 inches. The diameter L of a randomly selected large lid at this
restaurant follows a Normal distribution with mean 3.98 inches and standard deviation 0.02 inches. For a lid to fit on a
cup, the value of L has to be bigger than the value of C, but not by more than 0.06 inches.
State: __________________________________________________________________________________
Plan: Let ____ = L - C to represent the difference between the lid's diameter and the cup's diameter. Find
P(0<___<0.06)
Do: Find the mean: _______________________________
Find the variance: ______________________________
Find the standard deviation: ________________________
Find the Z scores:
Draw the Area under the curve:
Find the Probability: P(__________) = ________________
Conclude: _________________________________________________________________________________________
__________________________________________________________________________________________________
51. His and her earnings. A study of working couples measures the income X of the husband and the income Y of the
wife in large number of couples in which both parties are employed. Suppose that you knew the mean μx and μY the
variances σx2 and σY2 of both variables in the population.
a) Is it reasonable to take the mean of the total income X + Y to be μx + μY ? Explain your answer.
b) Is it reasonable to take the variance of the total income to be σx2 + σY2? Explain your answer.
18
57. Let X = the amount a life insurance company earns on a 5 year term life policy. Calculations reveal that μx = $303.35
and σx = $9707.57
The risk of insuring one person's life is reduced if we insure may people. Suppose that w insure two 21 year old males,
and that their ages at death are independent. If X1 and X2 are the insurer's income from the two insurance policies, the
insurer's average income W on the two policies is W =
𝑋1 +𝑋2
2
= 0.5𝑋1 + 0.5𝑋2
Find the mean and standard deviation of W.
58. If four 21 year old men are insured, the insurer's average income is:
V=
𝑋1 +𝑋2 +𝑋3 +𝑋4
4
= 025𝑋1 + 0.25𝑋2 + 0.25𝑋3 + 0.25𝑋4
When Xi is the income from insuring one man. Assuming that the amount of income earned on individual policies is
independent, find the mean and standard deviation of V.
59. Time and Motion. A time and motion study measures the time required for an assembly line worker to perform a
repetitive task. The data show that the time required to bring a part from a bin to its position on an automotive chassis
varies from car to car according to a Normal Distribution with mean 11 seconds and standard deviation 2 seconds. The
time required to attached the part to the chases follows a Normal distribution with mean 20 seconds and standard
deviation 4 seconds. The study fines the time required for the two steps are independent. A par t that takes a long time
to position, for example, does not take more or less time to attach than other parts.
a) Find mean and standard deviation of the time required for the entire operation of positioning and attaching the part.
b) Management's goal is for the entire process to take less than 30 seconds. Find the probability that this goal will be
met. Show you work.
19
63. Swim Team. Hanover High School has the best women's swimming team in the region. The 400 meter freestyle
relay team is undefeated this year. In the 400 meter freestyle relay, each swimmer swims 100 meters. The times, in
seconds, for the four swimmers this season are approximately Normally distributed with means and standard deviations
as shown. Assuming that the swimmer's individual times are independent, find the probability that the total team time
in the 400 meter freestyle relay is less than 220 seconds.
Swimmer
Kalin
Megan
Ashley
Lyndie
Mean
55.2
58.0
56.3
54.7
Std. Dev
2.8
3.0
2.8
2.7
TRY ON YOUR OWN PROBLEM (HOMEWORK)
61. Auto Emissions. The amount of nitrogen oxides (NOX) present in the exhaust of a particular type of car varies from
car to car according to a Normal distribution with mean 1.4 grams per mile (g/mi) and standard deviation 0.3 g/mi. Two
randomly selected cars of this type are tested. One has 1.1 g/mi of NOX; the other has 1.9 g/mi. The test station
attendant find this difference in emissions between two similar cars surprising. If the NOX levels for two randomly
chosen cars of this type are independent, find the probability that the difference is at least as large as the value the
attendant observed.
20
6.3 Binomial and Geometric Random Variables
Binomial Settings
When the same chance process is repeated several times, we are often interested in whether a particular outcome
does or doesn’t happen on each repetition. In some cases, the number of repeated trials is fixed in advance and we
are interested in the number of times a particular event (called a “success”) occurs. If the trials in these cases are
independent and each success has an equal chance of occurring, we have a binomial setting.
Example: ROLL THE PIGS game: PIG OUT
Land on sides: one has dot up, one has dot down
Toss two pigs 50 times. Success = number of times Pig Out occurs. Failure = number of times it did not.
Keep track of the number of successes.
Total successes = ______/50
__________________________________________________________________________________________________
Binary?
Yes. The outcome was either Pig out or not
Independent? Yes The outcome of one roll does not effect the outcome of the next roll.
Number?
n = 50
Success?
Probability of success = approximately 20%
_________________________________________________________________________________________________
21
Example: Determine whether the random variables below have a binomial distribution. Justify your answer.
(a) Roll a fair die 10 times and let X=the number of sixes.
(b) Shoot a basketball 20 times from various distances on the court. Let Y=number of shots made.
(c) Observe the next 100 cars that go by and let C=color.
Binomial Coefficient: Used to determine the number of successes given a binomial event. Much easier than listing out
all possible outcomes, which could take too long!
22
Example: Each child of a particular pair of parents has probability 0.25 of having type O blood. Genetics says that
children receive genes from each of their parents independently. If these parents have 5 children, the count X of
children with type O blood is a binomial random variable with n = 5 trials and probability p = 0.25 of a success on each
trial. In this setting, a child with type O blood is a “success” (S) and a child with another blood type is a “failure” (F).
What’s P(X = 2)?
1) Define n = 5 (number of trials) k = 2 (number of successes). Find the number of successful events:
Examine the number of successes from all possible outcomes: It would take forever to list all possible outcomes! This is
why we use the binomial coefficient :)
2) The probability of a success = 0.25 (from problem) The probability of failure = 0.75 (complement of success)
Write it just like above to get full credit on the AP Stat test, however, you can perform the entire calculation in your
calculator:
Go to Distributions: binompdf(n,p,k) Reminder that pdf is for a specific number and cdf is for an interval
DO NOT USE CALCULATOR SPEAK ON THE TEST. WRITE EVERYTHING JUST AS BELOW BUT YOU CAN FIND THE SOLUTION
IN YOUR CALCULATOR 0.2637 all at once!
Binomial Probability Formula is given on the formula sheet!
EXAMPLE 2: Each child of a particular pair of parents has probability 0.25 of having blood type O. Suppose the parents
have 5 children
(a) Find the probability that exactly 3 of the children have type O blood.
(b) Should the parents be surprised if more than 3 of their children have type O blood?
23
Conclusion to part b: Since there is only a 1.5% chance that more than 3 children out of 5 would have Type O blood, the
parents should be surprised!
Using your calculator to answer part B. You may have to use 1 - Binomcdf if your calculator does not have the new ROM
1-Binomcdf(5,0.25,3). If it has the new ROM, then just Binomcdf(5,0.25,3)
69. Sowing Seeds. Seed Depot advertises that 85% of its flower seeds will germinate (grow). Supposed that the
company's claim is true. Judy buys a packet with 20 flower seeds from Seed Depot and plants them in her garden. Let X
= the number of seeds that germinate
Binomial or Not?
71. Lefties. Exactly 10% of the students in a school are left-handed. Select students at random from the school, one at a
time, until you find one who is left-handed. Let V = the number of students chosen.
Binomial or Not?
73. Binomial Setting? A binomial distribution will be approximately correct as a model for one of these two sports
settings and not for the other. Explain why by briefly discussing both settings
a) A NFL kicker has make 80% of his field goal attempts in the past. This season he attempts 20 field goals. The attempts
differ widely in distance, angle, wind, and so on.
b) A NBA player has made 80% of his free-throw attempts in the past. This season he takes 150 free throws. Basketball
free throws are always attempted from 15 feet away with no interference from other players.
75. Blood Types. In the US 44% of adults have type O blood. Suppose we chose 7 US adults at random. Let X = the
number who have type O blood. Use the binomial probability to find P(Y=1). Interpret this answer in context.
77. . Blood Types. Using the information from 75. How surprising would it be to get more than 4 adults with type O
blood in the sample?
24
79. Seed Depot advertises that 85% of its flower seeds will germinate (grow). Supposed that the company's claim is
true. Judy buys a packet with 20 flower seeds from Seed Depot and plants them in her garden. Let X = the number of
seeds that germinate.
a) Find the probability that exactly 17 seeds germinate. Show your work.
b) If only 12 seeds actually germinate, should Judy be suspicious that the company's claim is not true? Computer P(X<12)
and use this results to support your answer.
6.3 day 2 Mean and Standard Deviation of a Binomial Distribution
Both Formulas are
given on Formula Sheet
:)
81. Random digit dialing. When an opinion poll calls residential telephone numbers at random, only 20% of the calls
reach a live person. You watch the random digit dialing machine make 15 calls.
Let X = the number of calls that reach a live person.
a) Find and interpret μx
b) find and interpret σx
83. Random digit dialing. Let Y = the number of calls that don't reach a live person.
a) Find the mean of Y. How is it related to the mean of X? Explain why this makes sense.
25
b) Find the standard deviation of Y. How is it related to the standard deviation of X? Explain why this makes sense.
85. Aircraft Engines. Engineers define reliability as the probability that an item will perform its function under specific
conditions for a specific period of time. A certain model of aircraft engine is designed so that each engine has
probability 0.999 of performing property for an hour of flight. Company engineers test an SRS of 350 engines of this
model. Let X = the number that operate for an hour without failure.
a) Explain why X is a binomial random variable.
b) Find the mean and standard deviation of X. Interpret each value in context.
c) Two engines failed the test. Are you convinced that this model of engine is less reliable than its supposed to be?
Compute P(X<348) and use the results to justify your answer.
Binomial Distributions in Statistical Sampling
n
1
N
10
87. Airport Security. The TSA is responsible for airport safety. On some flights, TSA officers randomly select passengers
for an extra
security check before boarding. One such flight had 76 passengers - 12 in first class and 64 in coach. Some
passengers were surprised when none of the 10 passengers chosen for screening were seated in first class. Can we use
a binomial distribution to approximate this probability?
26
Example: Attitudes Toward Shopping
Sample surveys show that fewer people enjoy shopping than in the past. A survey asked a nationwide random sample of
2500 adults if they agreed or disagreed that “I like buying new clothes, but shopping is often frustrating and timeconsuming.” Suppose that exactly 60% of all adult US residents would say “Agree” if asked the same question. Let X =
the number in the sample who agree. Estimate the probability that 1520 or more of the sample agree.
B
I
N
S
Is np > 10?
Is n(1 - p) > 10?
27
Geometric Settings - the goal is to repeat a chance behavior until a success occurs.
Geometric Random Variable
NOT GIVEN ON
FORMULA SHEET!
Example: In the board game Monopoly, one way to get out of jail is to roll doubles. How likely is it that someone in jail
would roll doubles on his first, second, or third attempt? If this was the only way to get out of jail, how many turns
would it take, on average?
Let Y = number of attempts it takes to roll doubles one time.
Geometric Distribution?
Binary? Yes, success = doubles, failure = not doubles
Independent? Yes, knowing the outcome of previous rolls does not give additional information about future rolls.
Trials? Yes, we are counting the number of trials needed to get doubles once.
Success? Yes, the probability of success is always 1/6.
28
(a) Find the probability that it takes 3 turns to roll doubles to get out of jail.
(b) Find the probability that it takes more than 3 turns to roll doubles and interpret this value in context.
89. No replacement. To use a binomial distribution to approximate the count of success in an SRS, why do we require
that the sample size n be no more than 10% of the population size N?
If the sample size is more than 10% of the population, the amount of change to the make-up of the population is too
much. The probability of success changes too much to be considered constant.