Teacher Version

Lesson 1
NYS COMMON CORE MATHEMATICS CURRICULUM
6•6
Lesson 1: Posing Statistical Questions
Student Outcomes

Students distinguish between statistical questions and those that are not statistical.

Students formulate a statistical question and explain what data could be collected to answer the question.

Students distinguish between categorical data and numerical data.
Classwork
Example 1 (5 minutes): Using Data to Answer Questions
Have students read Example 1, and then pose the following questions:

What questions was the beekeeper trying to answer?


Did the beekeeper collect data to answer his questions?


The beekeeper wanted to know if beekeepers have fewer hives this year than last year. The beekeeper
also wanted to know about how many fewer hives beekeepers have this year.
Yes. He collected data on whether there were fewer hives this year and on how many fewer hives there
were.
What did the beekeeper conclude?

The beekeeper concluded that most beekeepers had fewer hives this year and that a typical decrease
was about 4 hives.
Example 1: Using Data to Answer Questions
Honeybees are important because they produce honey and pollinate plants. Since 2007, there has been a decline in the
honeybee population in the United States. Honeybees live in hives, and a beekeeper in Wisconsin notices that this year,
he has 𝟓 fewer hives of bees than last year. He wonders if other beekeepers in Wisconsin are also losing hives. He
decides to survey other beekeepers and ask them if they have fewer hives this year than last year, and if so, how many
fewer. He then uses the data to conclude that most beekeepers have fewer hives this year than last and that a typical
decrease is about 𝟒 hives.
Next, have students read the information below. Then, point out how these steps relate to the beekeeper example
(Example 1). Step 3 is not visible in the paragraph students read, but point out that the beekeeper would have had to
summarize the information he collected in order to draw his conclusions.
Statistics is about using data to answer questions. In this module, you will use the following four steps in your work with
data:
Step 1: Pose a question that can be answered by data.
Step 2: Determine a plan to collect the data.
Step 3: Summarize the data with graphs and numerical summaries.
Step 4: Answer the question posed in Step 1 using the data and summaries.
You will be guided through this process as you study these lessons. This first lesson is about the first step: What is a
statistical question, and what does it mean that a question can be answered by data?
Lesson 1:
Posing Statistical Questions
This work is derived from Eureka Math ™ and licensed by Great Minds. ©2015 Great Minds. eureka-math.org
This file derived from G6-M6-TE-1.3.0-10.2015
11
This work is licensed under a
Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.
NYS COMMON CORE MATHEMATICS CURRICULUM
Lesson 1
6•6
Example 2 (15 minutes): What Is a Statistical Question?
Introduce the situation described in Example 2 to the class. Consider showing students an example of a baseball card so
that they can see the different kinds of information that appears on the card.
Example 2: What Is a Statistical Question?
Jerome, a sixth grader at Roosevelt Middle School, is a huge baseball fan. He loves to collect baseball cards. He has cards
of current players and of players from past baseball seasons. With his teacher’s permission, Jerome brought his baseball
card collection to school. Each card has a picture of a current or past major league baseball player, along with
information about the player. When he placed his cards out for the other students to see, they asked Jerome all sorts of
questions about his cards. Some asked:

What is Jerome’s favorite card?

What is the typical cost of a card in Jerome’s collection? For example, what is the average cost of a card?

Are more of Jerome’s cards for current players or for past players?

Which card is the newest card in Jerome’s collection?
Then, consider the questions posed in Example 2, and ask students the following:

Which of these questions do you think might be statistical questions?


Answers will vary. Some students do not know how to answer this question, but some may go back to
the definition of Step 1 in the process described earlier to look for questions that can be answered by
data. Even so, they still might think at this point that all of the questions are statistical questions.
What do you think I mean when I say a statistical question?

Answers will vary. If no one relates the idea of a statistical question to the need for data, consider
asking students to go back and look at the four-step process described earlier in this lesson.
Students do not yet understand what a statistical question is. Allow them to discuss and make conjectures about what it
might mean before guiding them to the following:

A statistical question is one that can be answered with data and for which it is anticipated that the data
(information) collected to answer the question will vary.

The second and third questions are statistical questions because the data collected to answer the questions
would vary. To decide on a typical value (i.e., to describe what is typical for values in a data set using a single
number) for the cost of a card, you would collect data on what each card cost. You would expect variability in
the costs because some cards probably cost more than others. For the third question, you would need to
collect data on whether or not each card was for a current player or for a past player. There would be
variability in the data collected because some cards would be for current players, and some would be for past
players. The first question—“What is Jerome’s favorite card?”—is not a statistical question because there is no
variability in the data collected to answer this question. There is only one data value and no variability. The
same is true for the fourth question.
Convey the main idea that a question is statistical if it can be answered with data that vary. Point out that the concept of
variability in the data means that not all data values have the same value.

The question “How old am I?” is not a statistical question because it is not answered by collecting data that
vary. The question “How old are the students in my school?” is a statistical question because to answer it, you
would collect data on the ages of students at the school, and the ages will vary—not all students are the same
age.
Lesson 1:
Posing Statistical Questions
This work is derived from Eureka Math ™ and licensed by Great Minds. ©2015 Great Minds. eureka-math.org
This file derived from G6-M6-TE-1.3.0-10.2015
12
This work is licensed under a
Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.
Lesson 1
NYS COMMON CORE MATHEMATICS CURRICULUM

6•6
Would the following questions be answered by collecting data that vary?
How tall is your sixth-grade math teacher?


This question would not be answered by collecting data that vary because my sixth-grade math teacher
can only be one height.
What is your handspan (measured from the tip of the thumb to the tip of the small finger)?


This question is not a statistical question because I would just measure my handspan. It would not be
answered by collecting data that vary.
Which of these data sets would have the most variability?
The number of minutes students in your class spend getting ready for school or the number of pockets on the
clothes of students in your class

The number of minutes students spend getting ready for school would vary the most because some
students take a long time to get ready, and some students only require a short time to get ready for
school. The number of pockets will vary from student to student, but most values will be 0, 1, or 2.
After arriving at this understanding as a class, write the informal definition of statistical question on the board so that
students may refer to it for the remainder of the class.
Exercises 1–5 (10 minutes)
These question sets are designed to reinforce the definition of a statistical question. The main focus is on whether there
is variability in the data that would be used to answer the question. Consider having students share their answers to
Exercise 3 with a partner and having the partner decide whether or not the question is a statistical question.
Exercises 1–5
1.
For each of the following, determine whether or not the question is a statistical question. Give a reason for your
answer.
a.
Who is my favorite movie star?
This is not a statistical question because it is not answered by collecting data that vary.
b.
What are the favorite colors of sixth graders in my school?
This is a statistical question because to answer this question, you would collect data by asking students about
their favorite colors, and there would be variability in the data. The favorite color would not be the same for
every student.
c.
How many years have students in my school’s band or orchestra played an instrument?
This is a statistical question because to answer this question, you would collect data by asking students in the
band about how many years they have played an instrument, and there would be variability in the data. The
number of years would not be the same for every student.
d.
What is the favorite subject of sixth graders at my school?
This is a statistical question because to answer this question, you would collect data by asking students about
their favorite subjects, and there would be variability in the data. The favorite subject would not be the same
for every student.
e.
How many brothers and sisters does my best friend have?
This is not a statistical question because it is not answered by collecting data that vary.
Lesson 1:
Posing Statistical Questions
This work is derived from Eureka Math ™ and licensed by Great Minds. ©2015 Great Minds. eureka-math.org
This file derived from G6-M6-TE-1.3.0-10.2015
13
This work is licensed under a
Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.
Lesson 1
NYS COMMON CORE MATHEMATICS CURRICULUM
2.
6•6
Explain why each of the following questions is not a statistical question.
a.
How old am I?
This is not a statistical question because I only have one age at the time the question is asked.
b.
What’s my favorite color?
This is not a statistical question because it is not answered by data that vary. I only have one favorite color.
c.
How old is the principal at our school?
This is not a statistical question because the principal has just one age at the time I ask the principal’s age.
Therefore, the question is not answered by data that vary.
3.
Ronnie, a sixth grader, wanted to find out if he lived the farthest from school. Write a statistical question that
would help Ronnie find the answer.
Answers will vary. One possible answer is, “What is a typical distance from home to school (in miles) for students at
my school?”
4.
Write a statistical question that can be answered by collecting data from students in your class.
Answers will vary. For example, “What is the typical number of pets owned by students in my class?” or “How many
hours each day do students in my class play video games?”
5.
Change the following question to make it a statistical question: How old is my math teacher?
What is a typical age for teachers at my school?
Example 3 (5 minutes): Types of Data
To answer statistical questions, we collect data. For example, in the context of baseball cards, we might record the cost
of a card for each of 25 baseball cards. This would result in a data set with 25 values. We might also record the age of
each card or the team of the player featured on each card.
Example 3: Types of Data
We use two types of data to answer statistical questions: numerical data and categorical data. If you recorded the ages
of 𝟐𝟓 baseball cards, we would have numerical data. Each value in a numerical data set is a number. If we recorded the
team of the featured player for each of 𝟐𝟓 baseball cards, you would have categorical data. Although you still have 𝟐𝟓
data values, the data values are not numbers. They would be team names, which you can think of as categories.

What are other examples of categorical data?


Answers will vary. Eye color, the month in which you were born, and your favorite subject are examples
of categorical data.
What are other examples of numerical data?

Answers will vary. Height, number of pets, and minutes to get to school are all examples of numerical
data.
Lesson 1:
Posing Statistical Questions
This work is derived from Eureka Math ™ and licensed by Great Minds. ©2015 Great Minds. eureka-math.org
This file derived from G6-M6-TE-1.3.0-10.2015
14
This work is licensed under a
Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.
Lesson 1
NYS COMMON CORE MATHEMATICS CURRICULUM
6•6
To help students distinguish between the two data types, encourage them to think of possible data values. If the
possible data values are words or categories, then the variable is categorical.

Suppose that you collected data on the following. What are some of the possible values that you might get?

Eye color

Favorite TV show

Amount of rain that fell during each of 20 storms

High temperatures for each of 12 days
Exercises 6–7 (5 minutes)
Have students complete the exercises below to reinforce their understanding of the two types of data.
Exercises 6–7
6.
7.
Identify each of the following data sets as categorical (C) or numerical (N).
a.
Heights of 𝟐𝟎 sixth graders N
b.
Favorite flavor of ice cream for each of 𝟏𝟎 sixth graders
c.
Hours of sleep on a school night for each of 𝟑𝟎 sixth graders
N
d.
Type of beverage drunk at lunch for each of 𝟏𝟓 sixth graders
C
e.
Eye color for each of 𝟑𝟎 sixth graders
f.
Number of pencils in the desk of each of 𝟏𝟓 sixth graders
C
C
N
For each of the following statistical questions, identify whether the data Jerome would collect to answer the
question would be numerical or categorical. Explain your answer, and list four possible data values.
a.
How old are the cards in the collection?
The data are numerical data, as I anticipate the data will be numbers.
𝟏
𝟐
Possible data values: 𝟐 years, 𝟐 years, 𝟒 years, 𝟐𝟎 years
b.
How much did the cards in the collection cost?
The data are numerical data, as I anticipate the data will be numbers.
Possible data values: $𝟎. 𝟐𝟎, $𝟏. 𝟓𝟎, $𝟏𝟎. 𝟎𝟎, $𝟑𝟓. 𝟎𝟎
c.
Where did Jerome get the cards in the collection?
The data are categorical, as I anticipate the data will represent the names of places or people.
Possible data values: a store, a garage sale, from my brother, from a friend
Lesson 1:
Posing Statistical Questions
This work is derived from Eureka Math ™ and licensed by Great Minds. ©2015 Great Minds. eureka-math.org
This file derived from G6-M6-TE-1.3.0-10.2015
15
This work is licensed under a
Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.
Lesson 1
NYS COMMON CORE MATHEMATICS CURRICULUM
6•6
Closing (2 minutes)

What is a statistical question?


A statistical question is one that can be answered by collecting data that vary. All of the data values
are not the same.
What is the difference between a numerical data set and a categorical data set?

Every value in a numerical data set is a number. The values in a categorical data set are not numerical.
They are categories.
While reviewing the four steps students follow in their work with data, remind students of the opening example about
the beekeeper. Consider revisiting that example here to show students how the four steps allow them to pose and
answer statistical questions.
Lesson Summary
Statistics is about using data to answer questions. In this module, the following four steps summarize your work
with data:
Step 1: Pose a question that can be answered by data.
Step 2: Determine a plan to collect the data.
Step 3: Summarize the data with graphs and numerical summaries.
Step 4: Answer the question posed in Step 1 using the data and summaries.
A statistical question is one that can be answered by collecting data and where there will be variability in the data.
Two types of data are used to answer statistical questions: numerical and categorical.
Exit Ticket (3 minutes)
Lesson 1:
Posing Statistical Questions
This work is derived from Eureka Math ™ and licensed by Great Minds. ©2015 Great Minds. eureka-math.org
This file derived from G6-M6-TE-1.3.0-10.2015
16
This work is licensed under a
Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.
NYS COMMON CORE MATHEMATICS CURRICULUM
Name ___________________________________________________
Lesson 1
6•6
Date____________________
Lesson 1: Posing Statistical Questions
Exit Ticket
1.
2.
Indicate whether each of the following two questions is a statistical question. Explain why or why not.
a.
How much does Susan’s dog weigh?
b.
How much do the dogs belonging to students at our school weigh?
If you collected data on the weights of dogs, would the data be numerical or categorical? Explain how you know the
data are numerical or categorical.
Lesson 1:
Posing Statistical Questions
This work is derived from Eureka Math ™ and licensed by Great Minds. ©2015 Great Minds. eureka-math.org
This file derived from G6-M6-TE-1.3.0-10.2015
17
This work is licensed under a
Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.
Lesson 1
NYS COMMON CORE MATHEMATICS CURRICULUM
6•6
Exit Ticket Sample Solutions
1.
Indicate whether each of the following two questions is a statistical question. Explain why or why not.
a.
How much does Susan’s dog weigh?
This is not a statistical question. This question is not answered by collecting data that vary.
b.
How much do the dogs belonging to students at our school weigh?
This is a statistical question. This question would be answered by collecting data on weights of dogs. There is
variability in these weights.
2.
If you collected data on the weights of dogs, would the data be numerical or categorical? Explain how you know the
data are numerical or categorical.
The data collected would be numerical data because the weight values are numbers.
Problem Set Sample Solutions
1.
For each of the following, determine whether the question is a statistical question. Give a reason for your answer.
a.
How many letters are in my last name?
This is not a statistical question because this question is not answered by collecting data that vary.
b.
How many letters are in the last names of the students in my sixth-grade class?
This is a statistical question because it would be answered by collecting data on name lengths, and there is
variability in the lengths of the last names.
c.
What are the colors of the shoes worn by students in my school?
This is a statistical question because it would be answered by collecting data on shoe colors, and we expect
variability in the colors.
d.
What is the maximum number of feet that roller coasters drop during a ride?
This is a statistical question because it would be answered by collecting data on the drop of roller coasters,
and we expect variability in how many feet different roller coasters drop. They will not all be the same.
e.
What are the heart rates of students in a sixth-grade class?
This is a statistical question because it would be answered by collecting data on heart rates, and we expect
variability. Not all sixth graders have exactly the same heart rate.
f.
How many hours of sleep per night do sixth graders usually get when they have school the next day?
This is a statistical question because it would be answered by collecting data on hours of sleep, and we do not
expect that all sixth graders sleep the same number of hours.
g.
How many miles per gallon do compact cars get?
This is a statistical question because it would be answered by collecting data on fuel efficiency, and we expect
variability in miles per gallon from one car to another.
Lesson 1:
Posing Statistical Questions
This work is derived from Eureka Math ™ and licensed by Great Minds. ©2015 Great Minds. eureka-math.org
This file derived from G6-M6-TE-1.3.0-10.2015
18
This work is licensed under a
Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.
Lesson 1
NYS COMMON CORE MATHEMATICS CURRICULUM
2.
6•6
Identify each of the following data sets as categorical (C) or numerical (N). Explain your answer.
a.
Arm spans of 𝟏𝟐 sixth graders
N; the arm span can be measured as number of inches, for example, so the data set is numerical.
b.
Number of languages spoken by each of 𝟐𝟎 adults
N; number of languages is clearly numerical.
c.
Favorite sport of each person in a group of 𝟐𝟎 adults
C; a sport falls into a category, such as “soccer” or “hockey.” Favorite sport is not measured numerically.
d.
Number of pets for each of 𝟒𝟎 third graders
N; number of pets is clearly numerical.
e.
Number of hours a week spent reading a book for a group of middle school students
N; number of hours is clearly numerical.
3.
Rewrite each of the following questions as a statistical question.
Answers will vary. Sample answers are given below.
a.
How many pets does your teacher have?
How many pets do teachers in our school have?
b.
How many points did the high school soccer team score in its last game?
What is a typical number of points scored by the high school soccer team in its games this season?
c.
How many pages are in our math book?
What is a typical number of pages for the books in the school library?
d.
Can I do a handstand?
Can most sixth graders do a handstand?
4.
Write a statistical question that would be answered by collecting data from the sixth graders in your classroom.
Answers will vary. Check to make sure the question would be answered by collecting data that vary.
5.
Are the data you would collect to answer the question you wrote in Problem 2 categorical or numerical? Explain
your answer.
Answers will vary. Check to make sure that the answer here is consistent with the statistical question from Problem
4. If the possible values for the data collected would be numerical, students should answer numerical here, but if the
possible data values are categories rather than numbers, students should answer categorical.
Lesson 1:
Posing Statistical Questions
This work is derived from Eureka Math ™ and licensed by Great Minds. ©2015 Great Minds. eureka-math.org
This file derived from G6-M6-TE-1.3.0-10.2015
19
This work is licensed under a
Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.