Binomial Distribution We’ve been using simulation techniques to create a picture or what outcomes seem reasonable based on the “no preference” or “no discrimination” scenario. However, instead of repeatedly “flipping” coins and ____________________ probabilities, we will focus on the situation in which we simulate this experiment an ____________________ number of times. This will provide us with the ___________________ probabilities of interest. Also, this will fix the “problem” of different people getting different answers to the research question using the same data. Consider the Helper vs. Hinderer study once again. The following graphic shows what the distribution of “no preference” would look like if we kept flipping 16 coins over and over again (i.e. an infinite number of times), each time counting and plotting the number of heads obtained. This is known as the ____________________ distribution. Using the Binomial Distribution to Make Decisions To answer research questions which involve a single categorical variable, statisticians do not necessarily always turn to simulations involving taking random samples or flipping coins over and over again. Instead, we use the Binomial distribution which shows us how often each of our possible outcomes would occur if we repeated the previous simulations an infinite number of times. Binomial Distribution – when can we use it? The Binomial distribution can be used whenever the following conditions are met: 1. There are a fixed number of trials, _____. 2. There are only _____ possible outcomes for each trial – a “success” and a “failure” 3. The probability of ____________________ (p) remains the same for __________ trial. 4. The trials are ____________________. 1 Example: Check the conditions in context for the Helper vs. Hinderer study. There are a fixed number of trials, n. There are only 2 possible outcomes for each trial – a “success” and a “failure”. The probability of success (p) remains the same for each trial. The trials are independent. Using JMP to Calculate Binomial Probabilities There is a file on the course website called Binomial_Probabilities.jmp which we will use to calculate binomial probabilities. We will need the following pieces of information to model the Helper vs. Hinderer situation: p = probability infants choose the helper toy (assuming no toy preference) = __________ n = sample size (number of infants in the study) = __________ To enter these values into JMP, right-click on the n column and select Formula: Then, change the value of n to 16 (since there were 16 infants in the study) as follows: Click Apply and OK. 2 Similarly, right-click on the p column and select Formula. Change the value to p = 0.50 (since there is a 50% chance of choosing the helper toy, assuming infants really have no preference.) Click Apply and OK. JMP should return the following output: The graph below shows the individual binomial probabilities for this scenario. 3 Questions: 1. What do the “Individual Binomial Probabilities” represent? 2. What do the “Cumulative Binomial Probabilities” represent? 3. What is the probability of seeing exactly 8 infants choose the helper toy, assuming they have no preference? 4. What is the probability of seeing exactly 14 infants choose the helper toy, assuming they have no preference? 5. What is the probability of seeing 14 or more infants choose the helper toy, assuming they have no preference? 6. Does the p-value from Question 5 provide evidence that infants prefer the helper toy? Explain. 4 Example Revisited: Are Females Passed Over for Managerial Training? Recall, this example involves the possibly discrimination against female employees. Suppose a large supermarket chain occasionally selects employees to receive management training. A group of women employees have claimed that female employees are less likely than male employees of similar qualifications to be chosen for training. The large employee pool that can be tapped for management training is 60% female and 40% male; however, since the management program began, 9 of the 20 (40%) employees chosen for management training were female. Research Question – Is there evidence of gender discrimination against females? We have already carried out a simulation study by repeatedly taking samples of size 20 from a population which was 60a5 female. Question: 7. Based on the simulated results above, what is the estimated probability of observing 9 or fewer women selected out of 20 if there is no discrimination (i.e. find the estimated p-value)? However, we can find the exact probability of observing 9 or fewer women selected out of 20 using the Binomial distribution. First, we’ll need to make sure the conditions for the Binomial distribution have been satisfied. There are a fixed number of trials, n. There are only 2 possible outcomes for each trial – a “success” and a “failure”. 5 The probability of success (p) remains the same for each trial. The trials are independent. Now, we’ll need the following information to set up the JMP file for this scenario: p = proportion of females in the employee pool (no discrimination) = __________ n = sample size (number of employees selected) = __________ To enter these values into JMP, right-click on the n column and select Formula. Then change the value of n to 20 as follows: Click Apply and then OK. Next, right-click on the p column and select Formula. Then change the value of p to 0.60 as follows: 6 Click Apply and then OK. JMP should then return the following output: The graph below shows the individual Binomial probabilities: Questions: 8. Based on the Binomial probabilities, what is the probability of observing 9 or fewer women selected if there really is no discrimination? How does this compare to the estimated probability found from the simulation (Question 7 above)? 7 9. If the supermarket chain cannot prove that the males selected for the management program were more qualified than the women in the pool, is there evidence of gender discrimination? Explain. Example: Effectiveness of an Experimental Drug Suppose a commonly prescribed drug for relieving nervous tension is believed to be only 70% effective. A new drug was formulated and administered to a random sample of 20 adults who were suffering from nervous tension. Of those 20 adults, 18 experienced relief when taking the new drug. Research Question – Is the new experimental drug more effective than the old drug? Questions: 10. Define the population of interest. 11. Define the variable of interest. 12. Can this scenario be modeled using the Binomial distribution? Describe in context how it meets the four conditions of a Binomial. 13. Assuming that the new drug is no better than the previous drug, how many of the 20 adults would you expect to experience relief? 8 14. Recall that in the actual study, 18 out of the 20 adults experienced relief. Researchers were hoping to show that the new drug was more effective than the previous drug (i.e. better than 70% effective). What possible outcomes for the number of subjects who experience relief are at least as extreme as the observed results? Explain. 15. Assuming the new drug is no better than the previous drug, what is the probability of observing results at least as extreme as observed? You’ll need to use JMP to find this probability. (Hint: You’ll use your answer from Question 14 to help you find the appropriate probability.) 16. Does this study provide evidence that the new drug is superior to the one previously prescribed? Explain. (Hint: You’ll use your answer from Question 15 in your explanation.) 9
© Copyright 2026 Paperzz