Drawing Lines of Best Fit

Lines of Best Fit
Pediatricians
measure the size
and weight of babies
as a standard part of a
physical examination.
Parents often keep
measurements of
their babies
as well.
16.1 Where Do You Buy Your Books?
Drawing Lines of Best Fit ............................................ 831
16.2 Mia Is Growing Like a Weed!
Analyzing the Line of Best Fit .................................... 843
16.3 Stroop Test
Performing an Experiment ..........................................853
© 2011 Carnegie Learning
© 2011 Carnegie Learning
16.4 Human Chain: Shoulder Experiment
Using Technology to Determine a Linear Regression Equation .................................................. 863
16.5 Jumping
Correlation ................................................................. 873
829
Chapter 16 Overview
16.2
Drawing
Lines of Best
Fit
Analyzing
the Line of
Best Fit
16.3
Performing
an
Experiment
16.4
Using
Technology
to
Determine
a Linear
Regression
Equation
16.5
Correlation
8.SP.1
8.SP.2
8.SP.3
8.SP.1
8.SP.2
8.SP.3
8.SP.1
8.SP.2
8.SP.3
8.SP.1
8.SP.2
8.SP.3
8.SP.1
8.SP.2
8.SP.3
1
This lesson introduces the concept of line
of best fit and explains how although a
straight line will not pass through all of
the points on a scatter plot, a line can be
drawn to approximate the data as closely
as possible.
X
Technology
X
Talk the Talk
Highlights
Peer Analysis
Pacing
X
Questions ask students to identify the
slope and y-intercept of the line of best
fit in terms of a problem situation, and
then use their line of best fit to make
predictions.
This lesson explores the accuracy of a line
of best fit that models real-world data.
1
Questions ask students to use a line of best
fit equation to make predictions about a
related problem situation.
X
This lesson explores the line of best fit that
models the results of a class experiment.
1
1
1
829A • Chapter 16 Lines of Best Fit
Questions ask students to interpret
their line of best fit equation and make
predictions in regards to the experiment.
This lesson explores how to use a graphing
calculator to determine a linear regression
equation.
Questions ask students to compare the
linear regression equations from wrist and
shoulder experiments.
This lesson uses data collected by another
class experiment to determine whether a
data set is positively, negatively, or
not correlated.
Questions ask students to interpret the
correlation coefficient.
X
X
X
X
X
X
X
X
X
© 2011 Carnegie Learning
16.1
CCSS
Worked Examples
Lesson
Models
This chapter explores real-world bivariate data and the concept of line of best fit. Class experiments will be
conducted with the data recorded and plotted in a scatter plot. The line of best is then calculated and used
to make predictions.
Skills Practice Correlation for Chapter 16
Lesson
Problem
Set
Objectives
Vocabulary
16.1
16.2
16.3
© 2011 Carnegie Learning
16.4
Drawing Lines
of Best Fit
Analyzing
the Line of
Best Fit
Performing
an
Experiment
Using
Technology
to
Determine
a Linear
Regression
Equation
1–6
Create scatter plots representing tables
7 – 12
Draw lines of best fit on scatter plots
13 – 18
Draw lines of best fit then write the equation of the lines
19 – 24
Use given equations to answer questions
1–6
Estimate the equation of lines of best fit on scatter plots
7 – 12
Use given lines of best fit to make predictions 13 – 18
Compare given graphs to determine which lines is a better fit for data
19 – 22
Use given equations to answer questions
1–4
Create scatter plots from tables, estimate the equation of lines of best
fit, and use equations to answer questions
5–8
Write equations for lines of best fit
9 – 12
Use given equations to answer questions
13 – 16
Use given information about experiments to answer questions
1-6
Use given correlation coefficients to indicate how close data are to
being straight lines
7 - 12
Determine linear regression equations using a graphing calculator
13 - 16
Calculate linear regression equations using a graphing calculator and
given data from tables
17 - 20
Write corresponding linear regression equations using a graphing
calculator and given data from tables
Vocabulary
16.5
Correlation
1 - 10
Determine whether points have a positive correlation, a negative
correlation, or no correlation
11 - 14
Draw lines of best fit and explain what they indicate about the
relationships between two variables
15 - 20
Estimate equations for given lines of best fit
21 - 24
Use given equations to answer questions
Chapter 16 Lines of Best Fit • 829B
© 2011 Carnegie Learning
830 • Chapter 16 Lines of Best Fit
Where Do You
Buy Your Books?
Drawing Lines of Best Fit
Learning Goals
Key Terms
In this lesson, you will:





 Determine the definition of a line of best fit.
 Use a line of best fit to make predictions.
 Compare two lines of best fit.
Essential Ideas
• A scatter plot is a graph of data points.
• A line of best fit is a straight line that is as close to
as many points as possible, but does not have to go
through all the points on a scatter plot.
• Equations can be written for a line of best fit.
• A line of best fit can be used to make predictions
about data.
• A line of best fit and its equations are often referred
© 2011 Carnegie Learning
to as a model of the data.
line of best fit
model
trend line
interpolating
extrapolating
Common Core State Standards for
Mathematics
8.SP Statistics and Probability
Investigate patterns of association in bivariate data.
1. Construct and interpret scatter plots for bivariate
measurement data to investigate patterns of
association between two quantities. Describe
patterns such as clustering, outliers, positive
or negative association, linear association, and
nonlinear association.
2. Know that straight lines are widely used to model
relationships between two quantitative variables.
For scatter plots that suggest a linear association,
informally fit a straight line, and informally assess
the model fit by judging the closeness of the data
points to the line.
3. Use the equation of a linear model to solve
problems in the context of bivariate measurement
data, interpreting the slope and intercept.
16.1 Drawing Lines of Best Fit • 831A
Overview
The terms line of best fit, trend line, interpolating, and extrapolating are defined in this lesson.
Students create two scatter plots to show percent of book sales from book stores and the internet for
time since 2004. They then will write an equation of the line of best fit for each scatter plot and draw it
© 2011 Carnegie Learning
in their plot. Using their line of best fit, students will predict the percent of book sales.
831B • Chapter 16 Lines of Best Fit
Warm Up
1. Solve for x.
50 5 3.5x 1 24.2
50 5 3.5x 1 24.2
50-24.2 5 3.5x 1 24.2 224.2
25.8 5 3.5x
x ¯ 7.37
2. Solve for x.
30 5 22.9x 1 50.3
30 5 22.9x 1 50.3
30 2 50.3 5 22.9x 1 50.3 2 50.3
220.3 5 22.9x
x57
3. Solve for y when x 5 6
y 5 3.5x 1 24.2
y 5 3.5x 1 24.2
y 5 3.5(6) 1 24.2
y 5 21 1 24.2
y 5 45.2
4. Solve for y when x 5 3
© 2011 Carnegie Learning
y 5 22.9 x 1 50.3
y 5 22.9x 1 50.3
y 5 22.9(3) 1 50.3
y 5 28.7 1 50.3
y 5 41.6
16.1 Drawing Lines of Best Fit • 831C
© 2011 Carnegie Learning
831D • Chapter 16 Lines of Best Fit
Where Do You
Buy Your Books?
Drawing Lines of Best Fit
Learning Goals
Key Terms
In this lesson, you will:
 Determine the definition of a line of best fit.
 line of best fit
 model
 Use a line of best fit to make predictions.
 Compare two lines of best fit.
 trend line
 interpolating
 extrapolating
E
-books are becoming more and more popular. First, the way people bought books changed a lot. Now, the actual books people read have changed too! E-books offer readers the opportunity to download books onto their reader devices. However, as trendy and convenient as e-books are, many technology experts feel that e-book reader devices may actually have a very short life. Because of new advancements in computers and cell phones, these same experts believe that e-books will be able to be read directly from portable laptop computers—computers not much bigger that a small stack of loose-leaf paper—and cell phones. What do you think are the advantages of combining technologies onto © 2011 Carnegie Learning
© 2011 Carnegie Learning
a single device? What are some disadvantages? Do you ever think the printed paper book will one day become completely obsolete?
16.1 Drawing Lines of Best Fit • 831
16.1 Drawing Lines of Best Fit • 831
Problem 1
Students analyze data for the
percent of book sales from a
book store from 2004 through
2010. They then create a
scatter plot to display data
with decreasing y-values as
time increases. A line of best
fit is drawn on the scatter plot
and two points are used to
determine the slope of the line
of best fit and the equation of
the line of best fit. Using the
equation for the line of best fit,
students will use the equation
to predict past and future sales.
Problem 1
Purchasing Books from Different Places
You can purchase books from many different places: a bookstore, a department store, the
Internet, a book club, and many other places. The source for purchasing books changes
as the available formats for books change.
Suppose the table shows the percent of book sales that came from bookstores for the
years 2004 through 2010.
Percent of Book Sales from Book Stores
Year
2004
2005
2006
2007
2008
2009
2010
Percent of
Total Sales
50.8
44.5
42.4
42.5
36.8
33.2
32.5
1. Identify the independent and dependent variables in this problem situation.
Time is the independent variable and percent of total sales is the dependent
variable.
Grouping
• Ask a student to read the
Because the x-coordinates represent time, you can define time as the number of years
introduction to Problem 1
aloud. Discuss the context
and complete Question 1
as a class.
since 2004. In this problem situation, you could represent 2004 as 0 on the x-axis.
2. How would you represent:
a. 2005?
• Have students complete
I would use 1 on the x-axis to represent 2005 because it is one year since 2004.
Questions 2 through 4 with
a partner. Then share the
responses as a class.
b. 2006?
I would use 3 since 2007 is 3 years since 2004.
your books from considered
a bookstore or a nonbookstore? Explain.
• Do these two types of stores
represent all the possible
sources where you can buy
books? Explain.
832 • Chapter 16 Lines of Best Fit
Share Phase, Question 2
• What year does the x-value of 0 represent on the graph of the scatter plot?
• What year does the x-value of 8 represent on the graph of the scatter plot?
• What year does the x-value of 24 represent on the graph of the scatter plot?
832 • Chapter 16 Lines of Best Fit
© 2011 Carnegie Learning
c. 2007?
Discuss Phase,
Question 1
• Is the place where you buy
© 2011 Carnegie Learning
I would use 2 because 2006 is 2 years since 2004.
Share Phase,
Questions 3 and 4
• What does the broken line
3. Create a scatter plot of the ordered pairs on the coordinate plane shown.
on the y-axis represent?
Why is it there?
circumstances under which
using a broken line on an
axis is appropriate?
Percent of Book Sales
from Bookstores
y
55
When you use 0
to indicate a particular
year, such as 2004, you should
indicate this on your graph with
the appropriate axis label. One
way to do this is to use a double
arrow: 0 ↔ 2004. You can
think of the double arrow as
meaning "is the same as."””
50
Percent of Total Sales
When you want to
show data points are
clustered together but
• What would the graph of
are away from the origin,
the scatter plot look like if
you can break the graph.
the broken line was not on
The squiggly line drawn
from the origin to the
the y-axis?
first interval shows a
• How would you describe the break in the graph.
45
40
35
30
• How does using a broken
0
0.0
line on the axis affect the
appearance of a graph?
x
1.0 2.0 3.0 4.0 5.0 6.0 7.0 8.0
Time (years)
(0
2004)
4. Do all the points in your scatter plot lie on the same line? What does
this tell you about the percent of total sales as the time changes?
Grouping
The points do not lie on the same line. The percent of total sales is
Ask a student to read the
information following
Question 4 aloud. Discuss the
definitions and the following
worked examples as a class.
not changing at the same rate as the time changes.
Sometimes, it may seem that there is not a linear relationship between the
data points in a scatter plot. However, some of the data points may be
clustered where a straight line might pass. Although a straight line will not
pass through all of the points in your scatter plot, you can use a line to
approximate the data as closely as possible. This kind of line is called a line
© 2011 Carnegie Learning
© 2011 Carnegie Learning
of best fit. A line of best fit is a line that is as close to as many points as
possible, but doesn’t have to go through all the points. When you use a line
of best fit, the line and its equation are often referred to as a model of the
data, or a trend line.
When data is displayed with a scatter plot, constructing a line of best fit is
helpful to predict values that may not be displayed on the plot. You want to
begin by analyzing the data and ask yourself:
●
Does the data look like a line?
●
Does the data seem to have a positive or negative correlation?
16.1 Drawing Lines of Best Fit • 833
16.1 Drawing Lines of Best Fit • 833
Let’s construct a line of best fit.
Step 1: Begin by plotting all the data.
Percent of Book Sales
from Bookstores
y
55
Percent of Total Sales
50
45
40
35
30
0
1
2
3 4
5 6
Time (years)
(0
2004)
7
8
x
Step 2: Draw a shape that encloses all of the data.
“Try to draw a
smooth and relatively
even shape.
Percent of Book Sales
from Bookstores
y
55
45
40
30
0
834 • Chapter 16 Lines of Best Fit
834 • Chapter 16 Lines of Best Fit
1
2
3 4
5 6
Time (years)
(0
2004)
7
8
x
© 2011 Carnegie Learning
35
© 2011 Carnegie Learning
Percent of Total Sales
50
Discuss Phase,
Steps 3 and 4
• Where did the ordered pair
(0, 49.2) come from? If you
were going to estimate the
coordinates of this point on
the scatter plot, what values
would you use?
Step 3: Draw a line that divides the enclosed area of the data in half.
Percent of Book Sales
from Bookstores
y
55
50 x
• Where did the ordered pair
Percent of Total Sales
(3, 40.5) come from? If you
were going to estimate the
coordinates of this point on
the scatter plot, what values
would you use?
45
x
40
35
30
• If you used your estimated
values of these two points,
what would be your
equation?
0
1
2
3 4
5 6
Time (years)
(0
2004)
7
8
x
Note that the line of best fit does not have to go through
any of the data values.
Step 4: Determine the equation of your line of
“The idea is that
you want to
identify a line
that has an equal
number of points
on either side.
best fit.
●
Begin by identifying two points on your trend line.
In this example, two points were chosen and
marked with an “x.” The estimated ordered pairs
are (0, 49.2) and (3, 40.5).
© 2011 Carnegie Learning
●
Calculate the slope of the line through your
two points.
●
49.2 2 40.5 5 ___
8.7
m 5 ___________
023
23
m 5 22.9
Write the equation of the line.
Let x represent the number of years since 2004, and y represent
the percent of all sales.
© 2011 Carnegie Learning
y 5 2.9x 1 49.2
16.1 Drawing Lines of Best Fit • 835
16.1 Drawing Lines of Best Fit • 835
Grouping
• Have students complete
It is possible to choose two different points and estimate those ordered pairs in a slightly
Questions 5 and 6 with a
partner. Then share the
responses as a class.
different way. Determining the line of best fit may lead to different equations depending
upon the estimated ordered pairs chosen to construct the line.
5. Identify the slope of the line of best fit and what it represents in this problem situation.
• Have students complete
The slope of the line of best fit is 22.9. The slope represents the change in percent
Questions 7 through 11 with
a partner. Then share the
responses as a class.
of book sales from bookstores per year.
6. Identify the y-intercept of the line of best fit and what it represents in this problem
situation.
The y-intercept is 49.2. The y-intercept represents the percent of book sales from
Share Phase,
Questions 5 and 6
• What would be the slope of
bookstores in 2004.
If you are predicting values that fall within the plotted values, you are interpolating. If you
are predicting values that fall outside the plotted values, you are extrapolating.
your line of best fit?
7. Use the line of best fit equation to predict the percent of book sales from bookstores:
• What would be the
a. in 2011.
y-intercept of your line of
best fit?
y 5 22.9(7) 1 49.2 5 28.9
About 29% of total sales from bookstores should occur in 2011.
b. in 2013.
Share Phase,
Question 7
• When the percent of sales
y 5 22.9(9) 1 49.2 5 23.1
About 23% of total sales from bookstores should occur in 2013.
of books from book stores
was 29%, what do you know
about the percent of sales of
books from other sources?
c. Explain how you determined your predictions.
First, I wrote the year as the number of years since 2004.
Then, I substituted this value for x-value into the equation
• When the percent of sales
8. Use the line of best fit equation to make predictions.
of books from book stores
was 23%, what do you know
about the percent of sales of
books from other sources?
a. In what year was the percent of book sales from
bookstores 60%?
60 5 22.9x 1 49.2
10.8 5 22.9x
23.7 ¯ x
2004 2 3.7 5 2000.3
Are printed book
sales trending
up or trending
down?
© 2011 Carnegie Learning
and solved for y.
836
•
Chapter 16
7719_CL_C3_CH16_pp829-886.indd 836
836 • Chapter 16 Lines of Best Fit
Lines of Best Fit
© 2011 Carnegie Learning
In 2000, bookstore sales were 60% of total sales.
11/04/15 5:32 PM
Share Phase,
Questions 8 through 11
• When the percent of sales
of books from book stores
was 60%, what do you know
about the percent of sales of
books from other sources?
b. In what year will the percent of book sales from bookstores be 20%?
20 5 22.9x 1 49.2
229.2 5 22.9x
10.1 ¯ x
2004 1 10.1 5 2014.1
Near the end of 2014, bookstore sales should be 20% of sales.
• When the percent of sales
of books from book stores
was 20%, what do you know
about the percent of sales of
books from other sources?
c. Explain how you determined your predictions.
First, I substituted the given value for y and solved for x. Then, I added the
value for x to 2004 to determine the year.
• What is the meaning of the
x-intercept in this situation?
9. Use the line of best fit equation to determine the percent of book sales from
bookstores in 2006.
• What year do you think the
y 5 22.9(2) 1 49.2 5 43.4
sales of books from book
stores will be 0%?
About 43% of the total book sales should come from bookstores.
• Why do you suppose that
the percent of sales of books
from books stores has
declined since 2004?
10. Compare your answer from Question 9 to the actual data from the table. What do
you notice?
The answer, about 43%, is fairly close to the actual data from the table,
which is 42.4%.
11. Do you think that the line of best fit model provides reasonable answers to the
questions posed in Questions 7 and 8? Explain your reasoning.
Yes. Because the line is reasonably close to all the data points, the model provides
© 2011 Carnegie Learning
© 2011 Carnegie Learning
reasonable answers.
16.1
7719_CL_C3_CH16_pp829-886.indd 837
Drawing Lines of Best Fit
•
837
13/04/15 11:31 AM
16.1 Drawing Lines of Best Fit • 837
Problem 2
Problem 2
Purchasing Books from the Internet
Suppose the table shows the percent of all book sales that came from the Internet for the
years 2004 through 2010.
Percent of Printed Book Sales from the Internet
Year
2004
2005
2006
2007
2008
2009
2010
Percent of
Total Sales
34.4
38.5
40.8
42.4
50.7
52.8
53.8
1. Identify the independent and dependent variables in this problem situation.
Time is the independent variable and percent of total sales is the
dependent variable.
Grouping
2. Create a scatter plot from the table data.
Have students complete
Questions 1 through 10 with
a partner. Then share the
responses as a class.
y
55
Percent of Printed Book Sales
from the Internet
Percent of Total Sales
50
Share Phase,
Questions 1 through 3
• How does the data in this
table look different than
the data in the table of
Problem 1?
45
40
35
30
• How does the scatter plot in
this problem look different
than the scatter plot in
Problem 1?
1.0 2.0 3.0 4.0 5.0 6.0 7.0 8.0
Time (years)
(0
2004)
3. Analyze your scatter plot. Does the data look like a line?
If so, does the data seem to have a positive correlation
or negative correlation?
The data seems to form a line with a positive correlation.
838 • Chapter 16 Lines of Best Fit
838 • Chapter 16 Lines of Best Fit
x
Remember to
start by drawing a
shape around all the
data. Then, divide
that in half.
© 2011 Carnegie Learning
0
0.0
© 2011 Carnegie Learning
Problem 2 is very similar
mathematically to Problem 1.
Students analyze data for the
percent of book sales from
the internet. Students create
a scatter plot to display the
percent of sales data with
increasing y-values as time
increases. They will then make
predictions for time in the future
as well as for time in the past
using the equation of their line
of best fit.
Share Phase,
Questions 4 through 7
• What is the meaning of the
x-intercept in this situation?
4. Use a ruler to draw the line that best fits your data on the graph. Then, write the
equation of the line. Define your variables and include the units.
Answers will vary, but should be close to y 5 3.5x 1 34.3, where x is the number of
• When the percent of sales
years since 2004, and y is the percent of all sales.
of books from the Internet
was 59%, what do you know
about the percent of sales of
books from other sources?
• When the percent of sales
5. What does the slope of your line represent in this problem situation?
The slope represents the change in the percent of total sales per year.
6. What does the y-intercept represent in this problem situation?
of books from the Internet
was 66%, what do you know
about the percent of sales of
books from other sources?
The y-intercept represents the percent of total sales in 2004.
7. Use your equation to predict the percent of book sales from the Internet:
a. in 2011.
y 5 3.5(7) 1 34.3 5 58.8
About 59% of the total book sales should come from the Internet.
b. in 2013.
y 5 3.5(9) 1 34.3 5 65.8
About 66% of the total book sales should come from the Internet.
c. Explain how you calculated your answers.
First, I wrote the year as the number of years since 2004. Then, I substituted
© 2011 Carnegie Learning
© 2011 Carnegie Learning
this value for x into the equation and solved for y.
16.1 Drawing Lines of Best Fit • 839
16.1 Drawing Lines of Best Fit • 839
Share Phase,
Questions 8
through 10
• When the percent of sales
8. Use your equation to predict the year in which Internet sales will be a certain percent
of total book sales. Show all your work.
a. 60% of total book sales
of books from the Internet
was 60%, what do you know
about the percent of sales of
books from other sources?
60 5 3.5x 1 34.3
25.7 5 3.5x
7.3 < x
2004 1 7.3 5 2011.3
In 2011, Internet sales will be 60% of total sales.
• When the percent of sales
of books from the Internet
was 20%, what do you know
about the percent of sales of
books from other sources?
b. 20% of total book sales
20 5 3.5x 1 34.3
214.3 5 3.5x
24.1 < x
• When the percent of sales
2004 2 4.1 5 1999.9
Near the end of 1999, Internet sales were 20% of sales.
of books from the Internet
was 41%, what do you
know about the percent of
sales of books from all other
sources?
c. Explain how you calculated your answers.
First, I substituted the given value for y and solved for x. Then, I added the
value for x to 2004 to determine the year.
• What year do you think the
sales of books from the
Internet was 0%?
9. Compare the actual values to values using your equation.
a. How close is the value of the y-intercept to the actual value?
• What year do you think the
The y-intercept, 34.3%, is very close to the actual value, 34.4%.
sales of books from the
Internet will be 100%?
• Why do you suppose that
is this answer to the actual data?
y 5 3.5(2) 1 34.3 5 41.3
About 41% of the total sales should come from the Internet.
The answer, about 41%, is close to the actual data, 40.8%.
© 2011 Carnegie Learning
b. Use your equation to predict the percent of Internet book sales in 2006. How close
the percent of sales of
books from the Internet has
increased since 2004?
10. Do you think that your model provides reasonable answers to Questions 7 and 8?
Yes. Because the line is reasonably close to all the points, the model provides
reasonable answers.
840 • Chapter 16 Lines of Best Fit
840 • Chapter 16 Lines of Best Fit
© 2011 Carnegie Learning
Explain your reasoning.
Talk the Talk
Students compare the data
from Problems 1 and 2.
Talk the Talk
1. Do you think that the data from Problems 1 and 2 are related?
Grouping
Yes. As the bookstores’ percent of sales decrease, the Internet’s percent of
Have students complete
Questions 1 through 3 with
a partner. Then share the
responses as a class.
sales increase.
2. Which percent of book sales is changing faster: bookstore sales or Internet sales?
Explain your reasoning.
The Internet’s percent of sales is changing faster because that line
is steeper.
3. Which equation from Problem 1 and Problem 2 models its data better? Explain your
reasoning.
Answers will vary, but students should make a comparison between how well each
model fits its data. They should do this by checking how close each percent given
© 2011 Carnegie Learning
© 2011 Carnegie Learning
by their models from 2004 through 2010 is to the actual value given in the tables.
Be prepared to share your solutions and methods.
16.1 Drawing Lines of Best Fit • 841
16.1 Drawing Lines of Best Fit • 841
Follow Up
Assignment
Use the Assignment for Lesson 16.1 in the Student Assignments book. See the Teacher’s Resources
and Assessments book for answers.
Skills Practice
Refer to the Skills Practice worksheet for Lesson 16.1 in the Student Assignments book for additional
resources. See the Teacher’s Resources and Assessments book for answers.
Assessment
See the Assessments provided in the Teacher’s Resources and Assessments book for Chapter 16.
Check for Students’ Understanding
Estimate the equation of the line of best fit shown in each scatter plot.
1.
y
20
18
16
14
12
10
8
6
4
2
0
2
4
6
8
10 12 14 16 18 20 x
y 2y
y 2 y1 5 m (x 2 x1) m = _______
​ x2 2 x1 ​ 
5 _______
​  2 2 1 
 ​ 
52
0.5 2 0
2
1
y 2 1 5 2(x 2 0)
y 5 2x 1 1
842 • Chapter 16 Lines of Best Fit
© 2011 Carnegie Learning
Answers will vary but should be close to the following equation:
2.
y
20
18
16
14
12
10
8
6
4
2
0
2
4
6
8
10 12 14 16 18 20 x
Answers will vary but should be close to the following equation:
y 2y
y 2 y1 5 m (x 2 x1) m 5 _______
​ x2 2 x1 ​ 
5 ________
​ 15 2 11 
 ​5 1
14 2 10
2
1
y 2 11 5 1(x 2 10)
© 2011 Carnegie Learning
y5x11
16.1 Drawing Lines of Best Fit • 842A
© 2011 Carnegie Learning
842B • Chapter 16 Lines of Best Fit
Mia’s Growing
Like a Weed!
Analyzing the Line of Best Fit
Learning Goals
In this lesson, you will:




Create a scatter plot.
Draw a line of best fit.
Write an equation of a line of best fit.
Use a line of best fit to make predictions.
Essential Ideas
• A scatter plot is a graph of data points.
• A line of best fit is a straight line that is as close to
as many points as possible, but does not have to go
through all the points on a scatter plot.
• Equations can be written for a line of best fit.
• A line of best fit can be used to make predictions
about data.
• A line of best fit and its equations are often referred
© 2011 Carnegie Learning
to as a model of the data.
Common Core State Standards for
Mathematics
8.SP Statistics and Probability
Investigate patterns of association in bivariate data.
1. Construct and interpret scatter plots for bivariate
measurement data to investigate patterns of
association between two quantities. Describe
patterns such as clustering, outliers, positive
or negative association, linear association, and
nonlinear association.
2. Know that straight lines are widely used to model
relationships between two quantitative variables.
For scatter plots that suggest a linear association,
informally fit a straight line, and informally assess
the model fit by judging the closeness of the data
points to the line.
3. Use the equation of a linear model to solve
problems in the context of bivariate measurement
data, interpreting the slope and intercept.
16.2 Analyzing the Line of Best Fit • 843A
Overview
Students create a scatter plot for age and height, and a scatter plot for age and weight. They will draw
the line of best fit and determine the equation of the line of best fit for each scatter plot. Students
then make predictions for height and for weight based on age using the equation of each line of
© 2011 Carnegie Learning
best fit.
843B • Chapter 16 Lines of Best Fit
Warm Up
Complete each statement.
1. If a line has a positive slope, then as the x- values increase, the y-values will _________________.
increase
2. If a line has a negative slope, then as the x-values increase, the y-value will _________________.
decrease
3. If a line has a slope of 7, then as the x-values increase by one unit, the y-values will
_______________ by ____________.
increase by 7 units
4. If a line has a slope of 7, then as the x-values increase by 5 units, the y-values will
_______________ by ____________.
© 2011 Carnegie Learning
increase by 35 units
16.2 Analyzing the Line of Best Fit • 843C
© 2011 Carnegie Learning
843D • Chapter 16 Lines of Best Fit
Mia Is Growing
Like a Weed!
Analyzing the Line of Best Fit
Learning Goals
In this lesson, you will:
 Create a scatter plot.
 Draw a line of best fit.
 Write an equation of a line of best fit.
 Use a line of best fit to make predictions.
T
he birth of an endangered species is reason to celebrate! And on July 9, 2005, the birth of a new bundle of joy took Washington, D.C. by storm. The birth of Tai Shan, the first giant panda to be born at the National Zoo, stole front page headlines from other events in the world. And when he made his first appearance in December 2005, 13,000 tickets were distributed to the public—all to get a glimpse of Tai Shan! But while he was gaining star attention by the people of D.C., zoologists were carefully monitoring his diet, behavior, weight, length, and exercise. Why do you think zoologists were so interested in Tai Shan’s daily activities and growth? What are other newborn creatures that are monitored © 2011 Carnegie Learning
© 2011 Carnegie Learning
constantly during their first years of life? Do you think there is a special type of doctor that only takes care of babies?
16.2 Analyzing the Line of Best Fit • 843
16.2 Analyzing the Line of Best Fit • 843
Problem 1
Problem 1
How Quickly is She Growing?
1. Mia was born a healthy, happy baby girl to the Sanchez family. At each doctor’s visit,
Mia’s height and weight were recorded. Her records from birth until she was 18
months old are shown in the table.
Weight
(lbs)
0.0
6.1
1.0
8.1
1.8
10.0
Grouping
2.3
10.3
Ask a student to read the
information at the beginning of
Question 1 aloud. Discuss the
context and complete
Question 1, part (a) as a class.
4.0
13.7
6.0
17.0
8.0
21.0
10.0
22.0
12.0
23.0
15.0
23.0
18.0
25.1
a. Consider the relationship between Mia’s age and her weight. What happens to
Mia’s weight as she gets older?
Mia’s weight increases as she gets older.
844 • Chapter 16 Lines of Best Fit
844 • Chapter 16 Lines of Best Fit
© 2011 Carnegie Learning
Age
(months)
© 2011 Carnegie Learning
Students analyze a table of
data of a child’s growth rate for
weight. They will consider the
relationships between age and
weight, and determine the unit
rate to compare her weight at
different ages. Weight and age
are then graphed using a
scatter plot. Finally, students
will approximate the line of best
fit and use this information to
predict trends in weight.
Grouping
Have students complete
Questions 1, part (b) through 3
with a partner. Then share the
responses as a class.
b. Write a unit rate that compares Mia’s weight change to her change in age from age
4 months to age 6 months. Explain how you calculated your answer.
3.3 pounds ____________
1.65 pounds
17 pounds 2 13.7 pounds ___________
_________________________
5
5
6 months 2 4 months
2 months
1 month
Mia’s weight changed by 1.65 pounds per month
between age 4 months and age 6 months.
Share Phase,
Question 1, parts (b)
through (d)
• What would you expect to
I divided the difference in weight by
the change in age.
happen to a baby’s weight as
she got older?
• Would she grow heavier at a
Remember, a unit
rate is a comparison of
two measurements in
which the denominator
has a value of
one unit.
c. Write a unit rate that compares Mia’s weight change
to her change in age from 6 months to 8 months. Explain
how you calculated your answer.
constant rate, a faster rate, or
a slower rate during her first
month of life or her fifteenth
month of life? Explain.
4 pounds _________
2 pounds
21 pounds 2 17 pounds _________
_______________________
5
5
8 months 2 6 months
2 months
1 month
Mia’s weight changed by 2 pounds per month between
6 months and 8 months.
I divided the difference in weight by the change in age.
• How can you check your
conjectures using the data
given for Mia?
d. Is Mia gaining weight faster from 4 months to 6 months, or
from 6 months to 8 months? Explain your reasoning.
Mia is gaining weight faster from 6 months to 8 months
because the rate of her change in weight during this time is
greater than the rate of her change in weight from 4 months
© 2011 Carnegie Learning
© 2011 Carnegie Learning
to 6 months.
16.2 Analyzing the Line of Best Fit • 845
16.2 Analyzing the Line of Best Fit • 845
Share Phase,
Questions 2 and 3
• How many points on the
2. Create a scatter plot that shows Mia’s age as the independent variable and her weight
as the dependent variable.
scatter plot are above your
line of best fit?
y
Mia’s Weight over Time
30
• How many points on the
28
26
scatter plot are below your
line of best fit?
24
22
Weight (lbs)
20
• How many points on the
scatter plot are on your line
of best fit?
18
16
14
12
10
8
6
4
Grouping
2
0
Have students complete
Questions 4 through 9 with
a partner. Then share the
responses as a class.
0
2
4
6
8 10 12 14 16 18 20 22 24 26 28 30
x
Age (months)
3. Do all the points in your scatter plot lie on the same line? What does this tell you
about Mia’s weight change as time changes? Explain your reasoning.
No. All the points do not lie on the same line. This means that Mia’s weight is not
changing at the same rate as time changes.
Share Phase,
Questions 4 and 5
• What is the slope of your line
4. Use a ruler to draw the line that best fits the data on the graph.
The graph of the line should be close to y 5 1.08x 1 8.61.
of best fit?
The equation should be close to
y 5 1.08x 1 8.61, where x is the age in
• What ordered pairs did you
months, and y is the weight in pounds.
use to determine the slope of
the line of best fit?
• How does your equation for
the line of best fit compare to
your classmate’s equations?
846 • Chapter 16 Lines of Best Fit
846 • Chapter 16 Lines of Best Fit
Remember, after
you draw the line, pick
two points from your line
to write the equation.
© 2011 Carnegie Learning
define your variables and include the units.
with respect to the problem
situation?
© 2011 Carnegie Learning
5. Write the equation of your line. Be sure to
• What does the slope mean
6. According to your line, approximately how many pounds did Mia gain each month
from the time she was born until she was 18 months old? How did you determine
your answer?
Mia gained about 1.08 pounds each month. The slope of my line of best fit is about
1.08 pounds per month.
7. If Mia continues to grow at this rate, how much will she weigh when she is:
a. 2 years old?
y 5 1.08(24) 1 8.61 5 34.53
Mia will weigh about 34.53 pounds.
b. 5 years old?
y 5 1.08(60) 1 8.61 5 73.41
Mia will weigh about 73.4 pounds.
c. 18 years old?
y 5 1.08(216) 1 8.61 5 241.89
Mia will weigh about 241.9 pounds.
8. Do all your answers to Question 7 make sense? Explain your reasoning.
No. The weights seem unreasonable. Some 2-year-olds might reach 34.5 pounds,
but the other two weights seem unreasonable. Most adults will never reach a
© 2011 Carnegie Learning
© 2011 Carnegie Learning
weight of 241 pounds.
9. What can you conclude about the accuracy of your model?
The model seems to be approximately accurate for the first 2 years, but not
beyond that.
16.2 Analyzing the Line of Best Fit • 847
16.2 Analyzing the Line of Best Fit • 847
Problem 2
A table of values that describes
Mia’s age and height is given.
Students use the table of data
to create a scatter plot for the
child’s height over time. They
will draw a line of best fit and
analyze the graph. They then
write the equation of the line of
best fit and use the equation to
predict future heights.
Problem 2
How Does Mia’s Height Change?
Analyze the table shown with the data of Mia’s age and her height.
Age
(months)
Height
(in.)
0.0
17.9
1.0
20.5
1.8
21.0
2.3
21.8
4.0
25.0
6.0
25.8
8.0
27.0
10.0
27.0
12.0
29.3
15.0
30.5
18.0
32.5
Grouping
Have students complete
Questions 1 through 8 with
a partner. Then share the
responses as a class.
Share Phase,
Question 1
• What is the same in Problem
2 as in Problem 1?
• What is different in Problem 2
than in Problem 1?
1. Consider the relationship between Mia’s age and her height. What happens to Mia’s
• What would you expect to
• Would her height increase
at a constant rate, a faster
rate, or a slower rate during
her first month of life or
her fifteenth month of life?
Explain.
• What do you think is the
range of typical heights for
most adults? Describe the
range of heights using units
of inches.
848 • Chapter 16 Lines of Best Fit
848 • Chapter 16 Lines of Best Fit
© 2011 Carnegie Learning
Mia’s height increases as she gets older.
© 2011 Carnegie Learning
height as she gets older?
happen to a baby’s height as
she got older?
Share Phase,
Questions 2 through 5
• How many points on the
scatter plot are above your
line of best fit?
2. Create a scatter plot that shows Mia’s age as the independent variable and her height
as the dependent variable. First, label the axes to represent the independent and
dependent variables. Next, choose the appropriate intervals for your scatter plot.
y
• How many points on the
Mia’s Height over Time
45
42
scatter plot are below your
line of best fit?
39
36
33
• How many points on the
Height (in.)
30
scatter plot are on your line
of best fit?
• What is the slope of your line
of best fit?
27
24
21
18
15
12
9
6
• What does the slope mean
3
0
with respect to the problem
situation?
0
2
4
6
8 10 12 14 16 18 20 22 24 26 28 30
x
Age (months)
3. Can these data be exactly represented by a linear equation? Explain your reasoning.
• What ordered pairs did you
No. All the points do not lie in a straight line.
use to determine the slope of
the line of best fit?
• How does your equation for
4. Use a ruler to draw the line that best fits your data on your graph. Then, write the
the line of best fit compare to
your classmate’s equations?
equation of your line. Be sure to define your variables and include the units.
The equation should be close to y 5 0.73x 1 20.08, where x is the age in months,
and y is the height in inches.
5. According to your line, approximately how many inches did Mia grow each month
© 2011 Carnegie Learning
© 2011 Carnegie Learning
from the time she was born until she was 18 months old? How did you determine
your answer?
Mia grew about 0.73 inch each month. The answer is given by the slope of
the line in Question 5.
16.2 Analyzing the Line of Best Fit • 849
16.2 Analyzing the Line of Best Fit • 849
6. If Mia continues to grow at this rate, how tall will she be when she is:
a. 2 years old?
y 5 0.73(24) 1 20.08 5 37.6
Mia will be about 37.6 inches tall.
b. 5 years old?
y 5 0.73(60) 1 20.08 5 63.88
Mia will be about 63.9 inches tall.
c. 18 years old?
y 5 0.73(216) 1 20.08 5 177.76
Mia will be about 177.8 inches tall.
7. Do all of your answers to Question 6 make sense? Explain your reasoning.
No. The heights at age 5 years and age 18 years seem to be unreasonable because
most 5-year-olds are not over 5 feet tall. Also, no adult humans have ever been
almost 15 feet tall.
8. What can you conclude about the accuracy of your model?
850 • Chapter 16 Lines of Best Fit
850 • Chapter 16 Lines of Best Fit
© 2011 Carnegie Learning
© 2011 Carnegie Learning
The model seems to be accurate for the first 2 years, but not beyond that.
Problem 3
Students are given additional
data values for the data table.
They plot these values and
compare them to their predicted
values. Students then update
their graphs and lines of best fit
with the additional data. They
will write an equation of best fit
and use their equation to make
predictions about the height
and weight of the child when
she is 18 years old.
Problem 3
More Data about Mia
1.
The table shows Mia’s growth from age 2 to age 5 __
2
Grouping
Have students complete
Questions 1 through 6 with
a partner. Then share the
responses as a class.
Age
(years)
Age
(months)
Weight
(lbs)
Height
(in.)
2.0
24
27.3
34.5
2.5
30
30.0
35.8
3.0
36
32.0
36.6
3.5
42
33.0
38.0
4.5
54
39.0
42.0
5.5
66
44.0
45.0
1. Complete the table shown by converting each age from years to months.
Share Phase,
Questions 1 and 2
• What unit was given for Mia’s
2. The scatter plot shown relates Mia’s age to her weight. Include the new data from the
table on the scatter plot. Then, draw the line of best fit and determine the equation of
the line.
y
age in Problem 1?
56
ages in Problem 3 into
months?
• How many months are in a
year? Is that an exact value?
© 2011 Carnegie Learning
• What units were given for
Mia’s weight in Problem 1?
What units were given for
Mia’s height in Problem 3?
• Why is a line of best fit called
52
48
44
40
Weight (lbs)
• How can you convert the
© 2011 Carnegie Learning
• What unit was given for Mia’s
age in Problem 3?
Mia’s Weight over Time
60
36
32
28
24
20
16
12
8
4
0
0
5 10 15 20 25 30 35 40 45 50 55 60 65 70 75
x
Age (months)
The equation should be close to y 5 0.52x 1 12.47.
16.2 Analyzing the Line of Best Fit • 851
that? What is fitting best?
16.2 Analyzing the Line of Best Fit • 851
Share Phase,
Questions 3 through 6
• How many points on each
3. The scatter plot shown relates Mia’s age to her height. Include the new data from the
table on the scatter plot. Then, draw the line of best fit and determine the equation of
scatter plot are above your
line of best fit?
the line.
y
• How many points on each
60
Mia’s Height over Time
56
scatter plot are below your
line of best fit?
52
48
44
• How many points on each
Height (in.)
40
scatter plot are on your line
of best fit?
• What is the slope of each line
of best fit?
36
32
28
24
20
16
12
8
• What does the slope mean
4
0
with respect to the problem
situation?
0
5 10 15 20 25 30 35 40 45 50 55 60 65 70 75
x
Age (months)
The equation should be close to y 5 0.38x 1 22.66.
• What ordered pairs did you
use to determine the slope of
each line of best fit?
4. Use your new lines of best fit to determine Mia’s height and weight when she is
18 years old.
y 5 0.52(216) 1 12.47 5 124.79
• How do your equations
y 5 0.38(216) 1 22.66 5 104.74
for each line of best fit
compare to your classmate’s
equations?
According to the lines of best fit, Mia will weigh about 124.8 pounds and be about
104.7 inches tall.
• Is it reasonable for the line of
5. How do these predictions compare to the predictions in Problems 1 and 2? Are these
best fit to be accurate on a
limited domain but not for the
set of all real numbers?
predictions reasonable? Explain your reasoning.
• How can you calculate the
line of best fit?
6. Do you think that extending the lines of best fit for Mia’s weight and height over time
made sense to make predictions about her weight and height beyond 6 years?
• How can you use a line of
No. The rate of Mia’s height and weight will change over time, but not at a
best fit to predict future
values?
constant rate.
Be prepared to share your solutions and methods.
852 • Chapter 16 Lines of Best Fit
852 • Chapter 16 Lines of Best Fit
© 2011 Carnegie Learning
but the height is still unreasonable.
© 2011 Carnegie Learning
The weight seems to be more reasonable than the prediction in Problems 1 and 2,
Follow Up
Assignment
Use the Assignment for Lesson 16.2 in the Student Assignments book. See the Teacher’s Resources
and Assessments book for answers.
Skills Practice
Refer to the Skills Practice worksheet for Lesson 16.2 in the Student Assignments book for additional
resources. See the Teacher’s Resources and Assessments book for answers.
Assessment
See the Assessments provided in the Teacher’s Resources and Assessments book for Chapter 16.
Check for Students’ Understanding
1. Use the given data to create a scatter plot.
x
y
3
7
4
6
5
6
8
2
9
3
y
9
8
7
6
5
© 2011 Carnegie Learning
4
3
2
1
0
1
2
3
4
5
6
7
8
9
x
2. Draw a line of best fit on the scatter plot.
3. Determine the equation for the line of best fit.
Answers will vary.
The equation should be close to the equation y 5 20.79x 1 9.39.
16.2 Analyzing the Line of Best Fit • 852A
© 2011 Carnegie Learning
852B • Chapter 16 Lines of Best Fit
Stroop Test
Performing an Experiment
Learning Goals
In this lesson, you will:
 Perform an experiment.
 Write and use the equations of lines of best fit.
 Compare results of an experiment.
Essential Ideas
• A Stroop Test studies a person’s perception of words
and colors by using lists of color words that are
written in colors.
• A scatter plot is a graph of data points.
• A line of best fit is a straight line that is as close to
as many points as possible, but does not have to go
through all the points on a scatter plot.
• Equations can be written for a line of best fit.
• A line of best fit can be used to make predictions
© 2011 Carnegie Learning
about data.
Common Core State Standards for
Mathematics
1. Construct and interpret scatter plots for bivariate
measurement data to investigate patterns of
association between two quantities. Describe
patterns such as clustering, outliers, positive
or negative association, linear association, and
nonlinear association.
2. Know that straight lines are widely used to model
relationships between two quantitative variables.
For scatter plots that suggest a linear association,
informally fit a straight line, and informally assess
the model fit by judging the closeness of the data
points to the line.
3. Use the equation of a linear model to solve
problems in the context of bivariate measurement
data, interpreting the slope and intercept.
8.SP Statistics and Probability
Investigate patterns of association in bivariate data.
Materials
Matching Lists of different lengths
Non-matching Lists of different lengths
Stopwatches (one for each time keeper)
16.3 Performing an Experiment • 853A
Overview
The Stroop Test studies a person’s perception of words and colors by using lists of color words (red,
green, black, and blue) that are written in one of the four colors. Students conduct the Stroop Test
experiment to gather data. They will calculate the mean time for various matching and non-matching
lists of words and create scatter plots of the list length versus the amount of time. Students then draw
the line of best fit for each scatter plot and make predictions for the amount of time based on the list
length using the equations of the lines of best fit.
Materials
Lists of words using different color inks will need to be prepared before this lesson is used. Several
lists will need to be made to conduct this test for each group of students performing the experiment.
Each list should be different lengths. Half of the lists should have the name of the color matching the
color of ink in which it is printed and the other half of the lists should have the name of the color not
matching the color of the ink in which it is printed.
© 2011 Carnegie Learning
Also, more than one stop watch or watches with a second hand will be needed for each group.
853B • Chapter 16 Lines of Best Fit
Warm Up
1. Use the given data to create a scatter plot.
x
y
5
0
4
2
3
3
2
5
9
3
y
5.5
5
4.5
4
3.5
3
2.5
2
1.5
1
0.5
0
1
2
3
4
5
6
7
8
9
x
2. Draw a line of best fit on the scatter plot.
3. Determine the equation for the line of best fit.
© 2011 Carnegie Learning
Answers will vary.
The equation should be close to the equation y 5 20.20x 1 3.51.
16.3 Performing an Experiment • 853C
© 2011 Carnegie Learning
853D • Chapter 16 Lines of Best Fit
Stroop Test
Performing an Experiment
Learning Goals
In this lesson, you will:
 Perform an experiment.
 Write and use the equations of lines of best fit.
 Compare results of an experiment.
T
he purpose of cognitive psychology is to understand and explain how the human brain works. One way to study how the brain works is to have human subjects perform experiments. The Stroop Test is one such experiment. The Stroop Test studies a person’s perception of words and colors by using lists of color words (red, green, black, and blue) that are written in one of the four colors. A person who participates in the Stroop Test receives one of two lists, a matching list or a non-matching list, with a varying number of words. In a matching list, the ink color matches the color of the word. In a non-matching list, the ink color does not match the color of the word.
© 2011 Carnegie Learning
© 2011 Carnegie Learning
The person participating in the experiment is given either kind of list. The person says aloud the ink color in which each word is written. The time it takes for the person to say the ink color for the list and the number of words in the list are recorded. The experiment is repeatedly performed with different people until enough data are collected to make a conclusion about the experiment. Why do you think a Stroop Test is important to study? Why do you think scientists and psychologists would be interested in the results?
16.3 Performing an Experiment • 853
16.3 Performing an Experiment • 853
Problem 1
Students gather data and
analyze the data for the Stroop
Test; the data is usually quite
linear. They create scatter plots
of the data, and determine the
line of best fit for each situation.
Students will then use their
equations to make predictions
and consider the accuracy of
their line of best fit.
Problem 1
Running a Stroop Test
In this lesson, you will perform the Stroop Test and calculate a line of best fit to make
predictions.
1. Before you perform this experiment, what results would you expect to see for either
the matching lists or non-matching lists? How do you think the results for the
matching lists will compare to the non-matching lists?
Please note: the answers supplied for the experiment are meant to be a sample.
As either list gets longer, the time increases.
It will probably take longer to say the ink color for words in the non-matching lists
than it will take to say the ink color for words in the matching lists.
Materials
Matching Lists
Non-Matching Lists
2. Identify the independent variable and the dependent variable in this problem situation.
The list length (number of words) is the independent variable.
Stopwatches
The amount of time to say the list aloud is the dependent variable.
Grouping
Have students complete
Questions 1 and 2 with a
partner. Then share the
responses as a class.
© 2011 Carnegie Learning
Share Phase,
Questions 1 and 2
• What will this experiment
measure?
• Do you think that the data
that you gather will be
somewhat linear?
will be a different amount of
time for the non-matching
lists than for the matching
lists?
• Why will you need to use
854 • Chapter 16 Lines of Best Fit
more than one timer?
• Why do you suppose the
colors red, green, black, and
blue were chosen to conduct
this test?
• What are other colors that
could have been used?
854 • Chapter 16 Lines of Best Fit
© 2011 Carnegie Learning
• Why do you think that there
Grouping
• Ask a student to read
Question 3 aloud. Discuss
the steps to completing the
experiment as a class.
3. Perform a Stroop Test and record your data in the tables shown. Make sure to
conduct three trials of the Stroop Test for each matching and non-matching list. Then,
vary the test by lengthening or shortening the number of words in the matching and
• Have students complete the
non-matching lists. Be sure to record the matching or non-matching list’s data in the
experiment and Questions 3
and 4 as directed.
correct table. You will complete the last column of the table in Question 4.
The data values listed are meant to be sample data. Your students’ responses
will vary.
Matching Lists
© 2011 Carnegie Learning
© 2011 Carnegie Learning
A trial is the
number of times
you conduct an
experiment.
List length
(words)
Time 1
(seconds)
Time 2
(seconds)
Time 3
(seconds)
Mean Time
(seconds)
10
7.20
7.22
7.13
7.18
15
9.59
9.22
9.43
9.41
6
3.35
3.22
3.43
3.33
12
5.74
5.77
5.79
5.77
20
14.34
13.92
13.56
13.94
26
12.12
12.20
12.01
12.11
13
5.63
5.75
5.60
5.66
8
3.37
3.06
3.33
3.25
11
4.00
4.40
4.20
4.20
5
3.20
3.28
3.36
3.28
Why should you
record three trials for
each time you vary the length
of the Stroop Test?
16.3 Performing an Experiment • 855
16.3 Performing an Experiment • 855
Share Phase,
Question 4
How did you determine the
mean time in seconds for
each list?
Non-Matching Lists
List length
(words)
Time 1
(seconds)
Time 2
(seconds)
Time 3
(seconds)
Mean Time
(seconds)
8
6.85
6.78
6.49
6.71
13
9.76
9.87
11.09
10.24
12
9.91
9.41
9.20
9.51
6
6.70
6.68
6.53
6.64
10
8.12
8.50
8.27
8.30
26
21.31
21.85
21.90
21.69
20
16.72
16.70
14.60
16.01
7
8.49
7.63
7.12
7.75
15
10.01
9.99
10.03
10.01
11
4.90
4.94
4.98
4.94
4. In the fifth column of each table, record the mean time in seconds for each list. Round
856 • Chapter 16 Lines of Best Fit
856 • Chapter 16 Lines of Best Fit
© 2011 Carnegie Learning
© 2011 Carnegie Learning
your answers to the nearest hundredth.
Grouping
Have students complete
Questions 5 through 10 with
a partner. Then share the
responses as a class.
5. Write the ordered pairs from the matching lists table that show the average time as
the dependent variable, and the average list length (number of words) as the
independent variable.
The ordered pairs for the matching list are (10, 7.18), (15, 9.41), (6, 3.33), (12, 5.77),
(20, 13.94), (26, 12.11), (13, 5.66), (8, 3.25), (11, 4.2), and (5, 3.28).
Share Phase,
Questions 5 through 8
• What two points did you use
to determine the slope of the
lines of best fit?
6. Create a scatter plot of the ordered pairs on the grid shown. First, label the axes to
represent the independent and dependent variables. Next, choose the appropriate
intervals for your scatter plot. Finally, name your scatter plot.
y
Matching Lists
22.5
21.0
• What is the y-intercept for
19.5
18.0
the line of best fit?
16.5
Time (seconds)
• What is the equation of the
line of best fit?
• What are the units for the
y-intercept in your line of
best fit?
15.0
13.5
12.0
10.5
9.0
7.5
6.0
4.5
3.0
• What does the x-value
1.5
0.0
represent in your line of
best fit?
0
2
4
6
8 10 12 14 16 18 20 22 24 26 28 30
x
List Length (words)
7. Use a ruler to draw the line of best fit. Then, write the equation of your line.
• What are the units for the
The graph of the line should be close to y 5 0.54x 1 0.06.
x-value?
© 2011 Carnegie Learning
represent in your line of
best fit?
© 2011 Carnegie Learning
• What does the y-value
8. State the y-intercept of your line. What does the y-intercept represent in
this situation?
The y-intercept is 0.06. It represents the amount of time it takes to read 0 words.
16.3 Performing an Experiment • 857
16.3 Performing an Experiment • 857
Share Phase,
Question 9
9. State the slope of your line. What does the slope represent in this situation?
What are the units for the slope
in your line of best fit?
The slope is 0.54. It represents the amount of time it takes to say the ink color for
a new word.
Share Phase,
Question 10
• How can you use your line of
10. Use your equation to answer each question.
a. About how many seconds should it take a person to say the ink color of a
matching list of 25 words?
best fit to predict an amount
of time if given a matching
list length?
y 5 0.54(25) 1 0.06 5 13.56
It should take a person about 13.56 seconds to say a matching list of 25 words.
• How can you use your line of
best fit to predict a matching
list length if given an amount
of time?
b. About how many seconds should it take a person to say the ink color of a
matching list of 10 words?
y 5 0.54(10) 1 0.06 5 5.46
It should take a person about 5.46 seconds to say a matching list of 10 words.
c. About how many words should a person be able to say the ink color from a
matching list in 2 minutes?
120 5 0.54x 1 0.06
119.94 5 0.54x
222.11 ¯ x
matching list in 2 minutes.
d. About how many words should a person be able to say the ink color from a
matching list in 5 minutes?
300 5 0.54x 1 0.06
© 2011 Carnegie Learning
A person should be able to say the ink color for approximately 222 words from a
299.94 5 0.54x
A person should be able to say the ink color for approximately 555 words from a
matching list in 5 minutes.
858 • Chapter 16 Lines of Best Fit
858 • Chapter 16 Lines of Best Fit
© 2011 Carnegie Learning
555.44 ¯ x
Grouping
Have students complete
Questions 11 through 16 with
a partner. Then share the
responses as a class.
11. Write the ordered pairs from the non-matching lists table that show the average time
as the dependent variable, and the list length as the independent variable.
The ordered pairs for the non-matching list are (8, 6.71), (13, 10.24), (12, 9.51),
(6, 6.64), (10, 8.3), (26, 21.69), (20, 16.01), (7, 7.75), (15, 10.01), and (11, 4.94).
Share Phase,
Questions 11
through 15
• What is a line of best fit?
• Why were the times for the
12. Create a scatter plot of the ordered pairs for the non-matching lists on the grid. First,
label the axes to represent the independent and dependent variables. Next, choose
the appropriate intervals for your scatter plot. Finally, name your scatter plot.
Non-Matching Lists
y
30
28
26
non-matching lists longer
than for the matching lists?
24
Time (seconds)
22
• What may have made the
results less accurate than
they could have been?
20
18
16
14
12
10
8
• How can you calculate a line
6
of best fit?
4
2
0
0
2
4
6
8 10 12 14 16 18 20 22 24 26 28 30
x
List Length (words)
13. Use a ruler to draw the line of best fit. Then, write the equation of your line.
© 2011 Carnegie Learning
The graph of the line should be close to y 5 0.76x 1 0.42.
14. State the y-intercept of your line. What does the y-intercept represent in
this situation?
The y-intercept is 0.42. It represents the amount of time it takes to say the
ink color for 0 words.
15. State the slope of your line. What does the slope represent in this situation?
© 2011 Carnegie Learning
Sample Answer: The slope is 0.76. It represents the amount of time it takes to say
the ink color for a new word from the non-matching list.
16.3 Performing an Experiment • 859
16.3 Performing an Experiment • 859
Share Phase,
Question 16
16. Use your equation to answer each question.
How can you use a line of best
fit to predict future values?
a. About how many seconds should it take a person to say the color of the ink for a
non-matching list of 25 words?
y 5 0.76(25) 1 0.42 5 19.42
It should take a person about 19.42 seconds to say a non-matching list of
Note
25 words.
The difference between the
slopes in the lines of best fit is
the average amount of time for
the students’ brains to override
their impulse to read a word.
The unit for the difference is
seconds per word.
b. About how many seconds should it take a person to say the color of the ink for a
non-matching list of 10 words?
y 5 0.76(10) 1 0.42 5 8.02
It should take a person about 8.02 seconds to say a non-matching list of
10 words.
c. About how many words should a person be able to say the ink color from a
non-matching list in 2 minutes?
120 5 0.76x 1 0.42
119.58 5 0.76x
157.34 ¯ x
A person should be able to say the ink color for a non-matching list for about
d. About how many words should a person be able to say the ink color from a
non-matching list in 5 minutes?
300 5 0.76x 1 0.42
299.58 5 0.76x
394.18 ¯ x
A person should be able to say the ink color of a non-matching list for about
© 2011 Carnegie Learning
157 words from a non-matching list in 2 minutes.
860 • Chapter 16 Lines of Best Fit
860 • Chapter 16 Lines of Best Fit
© 2011 Carnegie Learning
394 words from a non-matching list in 5 minutes.
Talk the Talk
Students answer questions
related to the data gathered
from performing the experiment.
Talk the Talk
1. Compare your results for the matching lists to the results for the non-matching lists.
Do your results seem reasonable? Explain your reasoning.
Grouping
According to the models, a person takes more time to say the ink color of a
Have students complete
Questions 1 through 3 with
a partner. Then share the
responses as a class.
non-matching list than a matching list when the lists are the same length.
This seems reasonable because it should take longer to say the ink color that
does not match the written word.
2. Are the results what you expected? Explain your reasoning.
No. I thought there would be a greater difference in time between matching and
non-matching lists of the same length.
3. What conclusions do you think a cognitive psychologist might draw from your
experiment results?
The psychologist might conclude that the ink color and the written color word are
pieces of information that can conflict in a person’s mind. The psychologist might
© 2011 Carnegie Learning
also conclude that if a person can say the ink color of either list in approximately
the same amount of time, the person’s brain can easily block certain information.
© 2011 Carnegie Learning
Be prepared to share your solutions and methods.
16.3 Performing an Experiment • 861
16.3 Performing an Experiment • 861
Follow Up
Assignment
Use the Assignment for Lesson 16.3 in the Student Assignments book. See the Teacher’s Resources
and Assessments book for answers.
Skills Practice
Refer to the Skills Practice worksheet for Lesson 16.3 in the Student Assignments book for additional
resources. See the Teacher’s Resources and Assessments book for answers.
Assessment
See the Assessments provided in the Teacher’s Resources and Assessments book for Chapter 16.
Check for Students’ Understanding
Two experiments are conducted to compare how long it takes inkjet printers to print in black-andwhite and how long it takes them to print in color. The number of pages printed using black-and-white
can be expressed by the line of best fit pb 5 33.8t 1 5.3, and the number of pages printed using color
can be expressed by the line of best fit pc 5 21t 1 2.7, where p is the total number of pages printed,
and t is the time in minutes. If you only have 15 minutes to use an inkjet printer, how many more
black-and-white pages could you print than color pages?
Pages printed in black-and-white:
p 5 33.8(15) 1 5.3 5 507 1 5.3 5 512.3
Pages printed in color:
p 5 21(15) 1 2.7 5 315 1 2.7 5 317.7
© 2011 Carnegie Learning
You can print 512.3 2 317.7 5 194.6 (approximately 195) more black-and-white pages in
15 minutes than color pages.
862 • Chapter 16 Lines of Best Fit
Human Chain:
Shoulder Experiment
Using Technology to Determine
a Linear Regression Equation
Learning Goals
Key Terms
In this lesson, you will:
 linear regression
 linear regression equation
 Perform an experiment.
 Use technology to determine a linear
regression equation.
 Use a linear regression equation to
predict results.
Essential Ideas
• Technology can be used to calculate the line of
best fit.
8.SP Statistics and Probability
• The linear regression equation is the equation used
by a calculator or spreadsheet program to find the
line of best fit.
• The least squares method is the method used by a
calculator or spreadsheet program to find the line of
best fit.
• The correlation coefficient indicates how close the
data are to forming a straight line.
• A linear regression equation can be used to predict
© 2011 Carnegie Learning
the results of an experiment.
Materials
Stopwatch
Graphing Calculators
Common Core State Standards for
Mathematics
Investigate patterns of association in bivariate data.
1. Construct and interpret scatter plots for bivariate
measurement data to investigate patterns of
association between two quantities. Describe
patterns such as clustering, outliers, positive
or negative association, linear association, and
nonlinear association.
2. Know that straight lines are widely used to model
relationships between two quantitative variables.
For scatter plots that suggest a linear association,
informally fit a straight line, and informally assess
the model fit by judging the closeness of the data
points to the line.
3. Use the equation of a linear model to solve
problems in the context of bivariate measurement
data, interpreting the slope and intercept.
16.4 Using Technology to Determine a Linear Regression Equation • 863A
Overview
The terms linear regression and linear regression equation are defined in this lesson. Students conduct
an experiment to gather data. They will then create a scatter plot of the amount of time compared
to the number of people in a chain. A calculator is used to calculate the linear regression equation.
Students then predict values for time and for chain length for given values using the linear
© 2011 Carnegie Learning
regression equation.
863B • Chapter 16 Lines of Best Fit
Warm Up
Plot the data in the table on the coordinate plane shown. Draw the line of best fit. Write the equation
for the line of best fit.
1.
x
y
4
1
5
3
2
4
1
3
8
2
y
4
3
2
1
0
1
2
3
4
5
6
7
8
9
x
Answers will vary but should be close to the equation y 5 20.20x 1 3.4.
© 2011 Carnegie Learning
2.
x
y
6
30
5
40
4
80
3
80
2
90
y
120
110
100
90
80
70
60
50
40
30
20
10
0
1
2
3
4
5
6
x
Answers will vary but should be close to the equation y 5 216x 1 128.
16.4 Using Technology to Determine a Linear Regression Equation • 863C
© 2011 Carnegie Learning
863D • Chapter 16 Lines of Best Fit
Human Chain:
Shoulder Experiment
Using Technology to Determine
a Linear Regression Equation
Learning Goals
Key Terms
In this lesson, you will:
 linear regression
 Perform an experiment.
 Use technology to determine a linear  linear regression equation
regression equation.
 Use a linear regression equation to predict results.
A
s you learned previously, distance can affect the nerve impulses that affect your central nervous system. But does distance affect how people and businesses operate? In the past, distance was a crucial factor on how businesses operated. For example, there was no such term as “telecommuting,” which is another way of describing when a person who works from a remote location not inside a company’s office. Distance was also a factor for professional sports teams not to expand to the West Coast until the mid 1950s. So, for most of the time, distance was a factor for many decisions, but just how long will distance remain being a © 2011 Carnegie Learning
© 2011 Carnegie Learning
factor in business decisions with the technological advances?
16.4 Using Technology to Determine a Linear Regression Equation • 863
16.4 Using Technology to Determine a Linear Regression Equation • 863
Problem 1
Students gather another set of
data to analyze and model with
a linear regression equation.
They will create a scatter
plot and calculate the linear
regression equation using
technology. Students then
analyze their linear regression
equations and use their linear
regression equation to predict
values for times given various
lengths of chains.
Problem 1
A Chain Connected at the Shoulder
Previously, your class performed an experiment to measure the speed of your nerve
impulses. Now, your class is going to perform an experiment to see if distance can affect
how quick your nerve impulses travel.
Like the previous experiment you and your classmates will form a circular chain. This time,
each student will gently hold the shoulder of the student to his or her right. One student
begins the chain, and another student ends the chain. Also, one student must be the
timekeeper. This experiment is very similar to the wrist experiment. The group members
must keep their eyes closed. To begin, the timekeeper says, “Go,” and the first student
carefully but quickly squeezes the next student’s shoulder. This next student squeezes a
shoulder, and so on. Once the last student’s shoulder is squeezed, he or she says, “Stop,”
and lets go of the next student’s shoulder. The amount of time it takes to complete the
chain is recorded by the timekeeper.
Materials
1. How do you think this experiment’s results will be different from the results in the
wrist experiment?
Stopwatch
Please note the answers supplied for the experiment
Graphing Calculator
are meant to be a sample—your students’ answers
will vary from the sample answers provided within
this lesson.
Grouping
It should take less time because the distance
from a person’s wrist on one hand to his or her
Ask a student to read the
information before Question 1
aloud. Discuss the experiment
and complete Questions 1 and
2 as a class.
other hand is greater than the distance from a
equation in this human chain
shoulder experiment different
from the linear regression
equation that you calculated
in the human chain wrist
experiment?
equation in this human chain
shoulder experiment similar
to the linear regression
equation that you calculated
in the human chain wrist
experiment?
864 • Chapter 16 Lines of Best Fit
864 • Chapter 16 Lines of Best Fit
© 2011 Carnegie Learning
© 2011 Carnegie Learning
person’s shoulder to the hand on the other arm.
Share Phase,
Question 1
• How is the linear regression
• How is the linear regression
Remember, to
accurately
perform the experiment,
you should use a
stopwatch.
2. Perform the experiment 10 times, using different chain lengths each time. Perform
3 trials for each chain that involves a different number of students. Record the data in
the table. Then, determine the mean time of each row and record the results in the
Grouping
© 2011 Carnegie Learning
Have students complete
Questions 3 through 5 with
a partner. Then share the
responses as a class.
© 2011 Carnegie Learning
table. Round your averages to the nearest hundredth, if necessary.
Chain Length
(students)
Time 1
(seconds)
Time 2
(seconds)
Time 3
(seconds)
Mean Time
(seconds)
15
4.79
4.10
4.11
4.33
10
2.60
2.41
2.23
2.41
5
1.41
1.28
1.18
1.29
30
6.45
6.84
7.01
6.77
45
10.95
11.23
11.07
11.08
8
2.16
2.40
1.96
2.17
20
4.72
4.54
4.63
4.63
27
6.03
6.24
6.26
6.18
18
4.47
4.29
4.32
4.36
10
2.23
2.15
2.16
2.18
3. Write the ordered pairs from the table with the chain length as the independent
variable, and the mean time as the dependent variable.
The ordered pairs are (15, 4.33), (10, 2.41), (5, 1.29), (30, 6.77), (45, 11.08), (8, 2.17),
(20, 4.63), (27, 6.18), (20, 4.36), and (10, 2.18).
16.4 Using Technology to Determine a Linear Regression Equation • 865
16.4 Using Technology to Determine a Linear Regression Equation • 865
Share Phase,
Questions 4 and 5
• What is the meaning and
4. Create a scatter plot of the ordered pairs on the grid shown. Do not forget to label
each axis and name your scatter plot.
what are the units of the
y-value in your linear
regression equation from
the wrist experiment? What
about from the shoulder
experiment?
y
30
Shoulder Experiment
28
26
24
Time (seconds)
22
• What is the meaning and
what are the units of the
x- value in your linear
regression equation from
the wrist experiment? What
about from the shoulder
experiment?
20
18
16
14
12
10
8
6
4
2
0
0
5 10 15 20 25 30 35 40 45 50 55 60 65 70 75
x
Chain Length (students)
5. Did you need to break your graph? Why or why not.
No. I did not need to break my graph because the data values spread out through
the graph and many data points occurred near the origin.
Grouping
So far, you have drawn the line of best fit for a data set. You have also determined the
equation of the line of best fit. Now, you will use a graphing calculator to determine a line of
Ask a student to read the
information following Question
5 aloud. Discuss the definitions
and complete the steps to using
a graphic calculator as a class.
best fit. The graphing calculator uses a method and an equation for a line of best fit called
the linear regression equation. Throughout this chapter, you have been performing linear
regression, which is to model the relationship of two variables in a data set by drawing a
866 • Chapter 16 Lines of Best Fit
866 • Chapter 16 Lines of Best Fit
© 2011 Carnegie Learning
© 2011 Carnegie Learning
line of best fit. The linear regression equation is the equation for the line of best fit.
To determine the linear regression equation for the ordered pairs of a data set you must
first enter the data into your calculator.
Step 1: Press STAT . The EDIT , and 1: Edit selection should be highlighted
on your screen.
Step 2: Key in your data points from your experiment. Let the chain length data
values be entered into L1, and let the mean time data be entered in L2 for each
ordered pair.
L1
L2
45
8
20
27
18
10
---
11.08
2.17
4.63
6.18
4.36
2.18
L3
2
L2(11)=
When you have completed entering your data points in your calculator, you then need the
graphing calculator to plot the points in a scatter plot.
Step 3: Press 2nd and then Y5 . The Stat Plots and 1: Plot 1 should be
highlighted. Press ENTER and select the first type of graph.
Step 4: Next, press WINDOW . You must select the least and greatest distances along
© 2011 Carnegie Learning
© 2011 Carnegie Learning
the x- and y-axes. Select Xmin 5 0, Xmax 5 60 , and Xscl 5 5. Then, perform
the same steps for the y-axis. Finally, press GRAPH .
16.4 Using Technology to Determine a Linear Regression Equation • 867
16.4 Using Technology to Determine a Linear Regression Equation • 867
Next, you will use your graphing calculator to determine the
linear regression equation.
Step 1: Press STAT and use your arrow key to scroll to
CALC . Use your arrow key and scroll to 4:
LineReg(ax 1 b) and then press ENTER .
Step 2: Use lists L1 and L2 for your lists. Do this by
pressing 2nd and L1 and press , for the
To ensure
your calculator will display
the value of r, “``Diagnostics,
must be turned on. Press 2nd
and 0 to display Catalog. Scroll
to the Diagnostics On and then
press ENTER twice.
The calculator should display
the word Done.
x-coordinate data points. Then, put in the
y-coordinate data points by pressing 2nd and L2 .
LinReg
y = ax+b
a =.2362746199
b = 0.978491456
r2 =.9868599065
r =.9934082275
To determine the linear regression equation, simply substitute the values for the
terms a and b in the equation.
On your screen, you should also see a value for the variable r. The variable, r, is
used to represent the correlation coefficient. The correlation coefficient shows
how close the data points are to forming a straight line.
If the data set has a positive correlation, then r has a value between 0 and 1. The
closer the data is to forming a straight line, the closer the value of r is to 1.
If the data has a negative correlation, then r has a value between 0 and 21. The
closer the data is to forming a straight line, the closer the value of r is to 21.
Grouping
6. Use a graphing calculator to determine the linear regression equation, and correlation
coefficient for the data. If necessary, round the values a and b to the nearest
Have students complete
Questions 6 through 10 with
a partner. Then share the
responses as a class.
hundredth. Then, graph the line on your coordinate plane in Question 4.
The linear regression equation is y 5 0.24x 1 0.06.
868 • Chapter 16 Lines of Best Fit
868 • Chapter 16 Lines of Best Fit
© 2011 Carnegie Learning
As a shortcut, you can say: 21 # r # 1
© 2011 Carnegie Learning
If there is no linear relationship in the data, the value of r is 0.
Share Phase,
Questions 7 through 9
• What is the meaning
and what are the units of
the slope in your linear
regression equation from
the wrist experiment? What
about from the shoulder
experiment?
7. How close are the data to forming a straight line? Explain how you determined
your answer.
The data are very close to forming a straight line because the r-value is
approximately 0.992.
• What is the meaning and
8. What is the slope of the linear regression equation? What does the slope mean in the
problem situation?
what are the units of the
y-intercept in your linear
regression equation from
the wrist experiment? What
about from the shoulder
experiment?
The slope is 0.24.
The slope indicates the number of seconds added to the time for each student
added to the chain.
9. Use your linear regression equation and your calculator to determine the number of
seconds it should take to perform the experiment if:
• What is the correlation
a. 100 people are in the chain.
coefficient for your linear
regression equation for
the wrists experiment?
What about the shoulder
experiment? Why would they
be different?
y 5 0.24(100) 1 0.06 5 24.06
It should take about 24 seconds.
b. 50 people are in the chain.
Do you
remember how
to convert different
units of measure to
the same measure?
You will need to do
this to complete
this task.
• How can you use the linear
© 2011 Carnegie Learning
regression equation to
predict the time for a given
chain length?
y 5 0.24(50) 1 0.06 5 12.06
It should take about 12 seconds.
c. 10,000 people are in the chain.
y 5 0.24(10,000) 1 0.06 5 2400.06
It should take about 2400 seconds.
d. 6.9 billion people (the world’s population) are in the chain.
y 5 0.24(6,900,000,000) 1 0.06 5 1,656,000,000.1
It should take about 1,656,000,000 seconds.
e. Write your answer to part (d) in years.
© 2011 Carnegie Learning
1 day _________
1 yr
1 hr 3 ______
1 min 3 _______
3
¯ 52.51 years
1,656,000,000 sec 3 _______
60 sec 60 min
24 hr
365 days
It should take about 53 years.
16.4 Using Technology to Determine a Linear Regression Equation • 869
16.4 Using Technology to Determine a Linear Regression Equation • 869
Share Phase,
Question 10
• How can you use the linear
10. Use your linear regression equation to determine the length of the chain if it takes:
a. 1 hour to complete the chain.
regression equation to
predict the chain length for a
given time?
60 min 3 _______
60 sec 5 3600 sec
1 hr 3 _______
1 hr
1 min
3600 5 0.24x1 0.06
3599.94 5 0.24x
• How accurate will the
14,999.75 ¯ x
results be if you use the
linear regression equation to
predict values?
There were about 15,000 people in the chain.
• How do you measure the
accuracy of the line?
b. 1 day to complete the chain.
60 min 3 _______
60 sec 5 86,400 sec
24 hr 3 _______
1 day 3 ______
1 day
1 hr
1 min
86,400 5 0.24x 1 0.06
86,399.94 5 0.24x
359,999.75 ¯ x
There were about 360,000 people in the chain.
c. 1 year to complete the chain.
365 days
60 min 3 _______
60 sec 5 31,536,000 sec
24 hr 3 _______
1 yr 3 _________ 3 ______
1 yr
1 day
1 hr
1 min
31,536,000 5 0.24x 1 0.06
870 • Chapter 16 Lines of Best Fit
870 • Chapter 16 Lines of Best Fit
© 2011 Carnegie Learning
131,399,999.8 ¯ x
There were about 131,400,000 people in the chain.
© 2011 Carnegie Learning
31,535,999.94 5 0.24x
Talk the Talk
Students enter the data from
the wrist experiment in a
previous lesson into a graphing
calculator to determine the
linear regression equation.
They will then graph the linear
regression equation from
the wrist experiment and the
shoulder experiment on the
same coordinate plane and
compare the slopes.
Talk the Talk
1. How does this experiment differ from the wrist experiment you previously conducted?
In this experiment, I might think that the nerve impulses do not travel as far, so
they are quicker than the wrist experiment.
2. Enter the data from the wrist experiment into your calculator.
a. Use your calculator to determine the liner regression equation for the
wrist experiment.
b. Graph the equation on the grid shown.
c. Graph the equation for the shoulder experiment on the grid shown.
Grouping
y
30
Have students complete
Questions 1 through 3 with
a partner. Then share the
responses as a class.
Wrist and Shoulder Experiment
Title your
graph and label
your axes.
28
26
24
Time (seconds)
22
20
18
16
Wrist Experiment
14
12
10
8
6
4
Shoulder Experiment
2
0
0
5 10 15 20 25 30 35 40 45 50 55 60 65 70 75
x
Chain Length (students)
3. What do the slopes in each experiment (wrist and shoulder) represent?
© 2011 Carnegie Learning
© 2011 Carnegie Learning
What do you think is the cause of the difference between the slopes?
The slope indicates the number of seconds added to the time for each
person added to the chain. The difference is caused by the distance
nerve impulses have to travel in each experiment. The distance from a
person’s wrist on one hand to his or her other hand is greater than the
distance from a person’s shoulder to the hand on the other arm. So,
the number of seconds per person is less in the second situation.
Be prepared to share your solutions and methods.
16.4 Using Technology to Determine a Linear Regression Equation • 871
16.4 Using Technology to Determine a Linear Regression Equation • 871
Follow Up
Assignment
Use the Assignment for Lesson 16.4 in the Student Assignments book. See the Teacher’s Resources
and Assessments book for answers.
Skills Practice
Refer to the Skills Practice worksheet for Lesson 16.4 in the Student Assignments book for additional
resources. See the Teacher’s Resources and Assessments book for answers.
Assessment
See the Assessments provided in the Teacher’s Resources and Assessments book for Chapter 16.
Check for Students’ Understanding
Use a graphing calculator and the data from each table to calculate the linear regression equation.
Round any answers to the nearest hundredth where necessary. Sketch the screen shot that contains
the scatter plot and the regression equation.
1.
x
y
4
1
5
3
2
4
1
3
8
2
y
4
3
1
0
1
2
3
4
5
6
7
8
9
x
Answers will vary but should be close to the equation y 5 20.20x 1 3.4.
872 • Chapter 16 Lines of Best Fit
© 2011 Carnegie Learning
2
2.
x
y
6
30
5
40
4
80
3
80
2
90
y
120
110
100
90
80
70
60
50
40
30
20
10
0
1
2
3
4
5
6
x
Answers will vary but should be close to the equation y 5 216x 1 128.
3. Compare the equations you wrote in Questions 1 and 2 above to the equations you wrote in the
warm up activity in this lesson. Are they close?
© 2011 Carnegie Learning
The regression equations in this activity are more accurate.
16.4 Using Technology to Determine a Linear Regression Equation • 872A
© 2011 Carnegie Learning
872B • Chapter 16 Lines of Best Fit
Jumping
Correlation
Learning Goals
In this lesson, you will:




Perform an experiment.
Draw a line of best fit.
Write and use an equation of a line of best fit.
Determine whether data are positively correlated, negatively correlated, or not correlated.
Essential Ideas
• A line of best fit is a straight line that is as close to
as many points as possible, but does not have to go
through all the points on a scatter plot.
• Equations can be written for a line of best fit.
• A line of best fit can be used to make predictions
about data.
• There is a correlation between the x- and y- values
when it is appropriate to use a line of best fit to
approximate a collection of points.
• The points are positively correlated when the line of
© 2011 Carnegie Learning
best fit has a positive slope.
• The points are negatively correlated when the line of
best fit has a negative slope.
Materials
Measuring Tape
Graphing Calculator
Common Core State Standards for
Mathematics
8.SP Statistics and Probability
Investigate patterns of association in bivariate data.
1. Construct and interpret scatter plots for bivariate
measurement data to investigate patterns of
association between two quantities. Describe
patterns such as clustering, outliers, positive
or negative association, linear association, and
nonlinear association.
2. Know that straight lines are widely used to model
relationships between two quantitative variables.
For scatter plots that suggest a linear association,
informally fit a straight line, and informally assess
the model fit by judging the closeness of the data
points to the line.
3. Use the equation of a linear model to solve
problems in the context of bivariate measurement
data, interpreting the slope and intercept.
16.5 Correlation • 873A
Overview
An experiment is used to explore the relationship between the height of a student and the height
which a student can jump. Students conduct an experiment to gather data, record the data in a table,
use the table to write ordered pairs, and use the ordered pairs to graph the scatter plot. A graphing
calculator is used to determine the line of best fit and students draw the line of best fit on the scatter
plot. In the second problem, the term correlation is introduced and lines with positive correlations,
negative correlations, and no correlation are described. Using the r-value, or correlation coefficient,
© 2011 Carnegie Learning
students identify the most accurate correlation coefficient for various scatter plots given.
873B • Chapter 16 Lines of Best Fit
Warm Up
Sketch a scatter plot that matches the given description.
1. Data that may have a linear regression with a positive slope.
y
7
6
5
4
3
2
1
0
1
2
3
4
5
6
7
x
2. Data that may have a linear regression with a negative slope.
y
7
6
5
4
3
2
1
1
2
3
4
5
6
7
x
© 2011 Carnegie Learning
0
16.5 Correlation • 873C
© 2011 Carnegie Learning
873D • Chapter 16 Lines of Best Fit
Jumping
Correlation
Learning Goals
In this lesson, you will:
 Perform an experiment.
 Draw a line of best fit.  Write and use an equation of a line of best fit.
 Determine whether data are positively correlated, negatively correlated, or not correlated.
J
umping is probably something you do during sports like basketball or during track-and-field events like the long jump. However, imagine if jumping was your usual way of travel! Well for certain creatures, jumping is as natural for them as walking is to humans. Crickets and grasshoppers jump to get from different blades of grass or other vegetation to another location—and jumping is important to get away from predators. Of course frogs and jumping spiders jump to attack prey for their next meals! And still, one of the largest leapers in the animal kingdom is the kangaroo. Native to Australia, kangaroos can leap for miles! Even though these leapers are both insects, arachnids, reptiles, and mammals, what special bodily © 2011 Carnegie Learning
© 2011 Carnegie Learning
feature do they each share?
16.5 Correlation • 873
16.5 Correlation • 873
Problem 1
Problem 1
How High Can You Jump?
There is a debate whether a person’s height affects how high a person can jump.
In this lesson, you will conduct an experiment, collect, and analyze the data to
determine the answer.
1. Create a table that you can use for your experiment. The table should include columns
for the person’s name, the person’s height in inches, and the height jumped in inches.
Please note the answers supplied for the experiment students will perform are
meant to be a sample.
Measuring Tape
Graphing Calculator
Grouping
Ask a student to read the
introduction to Problem 1 aloud.
Discuss the directions and
complete Questions 1 through 3
as a class
Discuss Phase,
Introduction
• What will this experiment
Person Height
(inches)
Jump Height
(inches)
Student A
67
13
Student B
65
9.5
Student C
71
7.5
Student D
67
10.5
Student E
62
12
Student F
68
9
Student G
60
5
Student H
75
20
Student I
73
19.5
Student J
72
18
measure?
• How can you measure the
heights that the students
jump?
• Which suggestion will give
the most accurate way
to measure the heights
jumped?
874 • Chapter 16 Lines of Best Fit
• Do you think the data that you gather will be somewhat linear?
• What units will you use to measure the heights of the students? What units will
you use to measure the heights jumped?
• How accurate will you be able to measure the heights jumped?
• Do you feel that the height that a person can jump would depend on the height
of the person? If so, how do you think that they are related?
874 • Chapter 16 Lines of Best Fit
© 2011 Carnegie Learning
Materials
Name
How will you
measure the
height each person
can jump?
© 2011 Carnegie Learning
In this experiment, students
collect data for the heights
jumped by ten students in
the class. They enter the data
collected in a table, identify the
independent and dependent
variables, and write the ordered
pairs from the data in the table.
Line breaks are used when
scaling the x-axis and students
create a scatter plot using the
ordered pairs. Using a graphing
calculator, they will determine
and draw the line of best fit
and interpret the regression
equation with respect to the
problem situation.
2. Perform the experiment with your classmates and record the results in the table.
Describe how you measured the height each person jumped.
Answers will vary.
3. Identify the independent and dependent variables in this problem situation.
The independent variable is the person’s height. The dependent variable is the
person’s jump height.
4. Write the ordered pairs from your table.
Grouping
The ordered pairs are (67, 13), (65, 9.5), (71, 7.5), (67, 10.5), (62, 12), (68, 9), (60, 5),
Have students complete
Questions 4 through 6 with
a partner. Then share the
responses as a class.
© 2011 Carnegie Learning
© 2011 Carnegie Learning
(75, 20), (73, 19.5), and (72, 18).
16.5 Correlation • 875
16.5 Correlation • 875
Share Phase,
Worked Example
• When is it appropriate to
create a break in the scale of
a graph?
When data points are clustered together but are far from the origin, it can be hard
to draw a line of best fit. This is especially true if the portion of the graph from the
• Would it be appropriate to
origin is included in the graph. To help you draw a line of best fit, you can break
break both the horizontal
scale and the vertical scale
of the same graph?
the graph and show the portion where the data appear. In fact, there have been
many occurrences where the graph has been broken in previous lessons.
Consider the two graphs shown that display the same data.
• Can you think of an
example where it would be
appropriate to break both
the horizontal scale and the
vertical scale of the same
graph?
• Does using a break in the
scale of the graph appear to
spread the data out? Explain.
y
30
27
24
When you
break a graph, draw a
squiggly line from the
origin to the first
interval on the x-axis.
It actually looks like
you did BREAK the
graph!
21
18
15
12
9
6
3
0
0
15 30 45 60 75 90 105 120 135 150
x
y
30
27
24
21
18
15
This symbol
indicates a break
in the graph.
12
9
6
0
90 96 102 108 114 120 126 132 138 144
x
The data values all occur greater than 90 and less than 150. By breaking the graph,
you can make it easier to draw a line of best fit. Therefore you can show a break in
the scale for the x-axis so that the portion of the graph containing the data appears.
876 • Chapter 16 Lines of Best Fit
876 • Chapter 16 Lines of Best Fit
© 2011 Carnegie Learning
0
© 2011 Carnegie Learning
3
Share Phase,
Questions 5 and 6
Is it easier to draw a line of best
fit with or without the break in
the scale of the graph? Explain.
5. What is/are the advantage(s) of using a graph with a break in it?
The advantages are that I can see all the data, easily draw a line of best fit, and
determine the slope of the line.
6. What is/are the disadvantage(s) of using a graph with a break in it?
One disadvantage is that I cannot see the y-intercept.
Grouping
7. Create a scatter plot of the data you collected. For this
Have students complete
Questions 7 through 11 with
a partner. Then share the
responses as a class.
scatter plot, make sure you break the graph to make it
easier to draw a line of best fit for your data.
y
30
If you are
using your
graphing calculator, you
can adjust your Window
view to see the data
better.
A Person’s Height and Jumping Height
28
26
24
Jump Height (in.)
22
20
18
16
14
12
10
8
6
4
2
0
0 52 54 56 58 60 62 64 66 68 70 72 74 76 78 80
x
Person Height (in.)
8. Use your graphing calculator to determine the line of
best fit.
© 2011 Carnegie Learning
© 2011 Carnegie Learning
See graph. The graph of the line should be close to
y5 0.8x2 42.
9. Describe the slope of your line.
The slope is positive.
How can you
write the equation
of the line when
you can't see the
y -intercept?
16.5 Correlation • 877
16.5 Correlation • 877
Share Phase,
Questions 10 and 11
• What function of a graphing
10. What does your graph and line of best fit indicate about the relationship between a
person’s height and how high a person can jump?
calculator has to be turned
on to view the r-values
(correlation coefficient)?
The graph and line of fit seem to indicate that the taller a person is, the higher the
person can jump.
• What is the correlation
coefficient in your regression
equation? Is your regression
equation considered a
good fit?
11. What factors could cause an incorrect jump height measurement or an inconsistency
in the measurements? Describe situations in which these errors or inconsistencies
could occur.
Some inconsistencies could occur due to inconsistent measurement of jump
height, the types of shoes each person wears, the physical ability of each person,
and the type of clothing each person wears. For example, if one person was
wearing sandals and another person was wearing sneakers, the person wearing
sandals would probably not be able to jump as high as the person
wearing sneakers.
Problem 2
Correlation
In many situations, you will have to determine if it is appropriate to use a line of best fit to
approximate a collection of points. When it is appropriate, you say that there is a correlation
between the x- and y-values. Otherwise, you can say that there is no correlation. When the
line of best fit has a positive slope, the points are positively correlated. When the line of best
fit has a negative slope, the points are negatively correlated.
1. Describe the correlation of the points in your graph. Explain your reasoning.
The points are positively correlated because the line of best fit has a
positive slope.
878 • Chapter 16 Lines of Best Fit
Discuss Phase, Question 1
• What is the difference between a positive association and a positive
correlation?
Grouping
Ask a student to read the
introduction to Problem 2 aloud.
Discuss this information and
complete Question 1 as a class.
• What is the difference between a negative association and a negative correlation?
• What is the difference between no association and no correlation?
• How would you describe the slope of the line of best fit in the previous problem?
• How would you describe the correlation of a line of best fit with a positive slope?
878 • Chapter 16 Lines of Best Fit
© 2011 Carnegie Learning
In this problem, students are
informally introduced to the
concept of correlation and
develop an intuitive sense
of correlation. The concepts
of positive correlation,
negative correlation, and no
correlation are described.
Students will classify the type
of correlation that exists for
the data represented in given
scatter plots by choosing the
most appropriate correlation
coefficient.
© 2011 Carnegie Learning
Problem 2
Grouping
Have students complete
Question 2 with a partner. Then
share the responses as a class.
2. Determine whether the points in each scatter plot have a positive correlation, a
negative correlation, or no correlation. Then determine which of the values of r you
think is most accurate. Explain why you chose your answer.
a.
y
8
Share Phase,
Question 2
• What does it mean for two
6
5
• What does it mean for two
quantities to be correlated?
b.
3
Since the data are negatively correlated,
2
the value of r must be negative. Also,
1
because the data are close to forming a
1
2
3
4
5
6
7
8
x
y
8
the most accurate.
●
r 5 0.7
●
r 5 20.7
●
r 5 0.07
●
r 5 20.07
7
6
5
4
The correct answer is r 5 0.7.
3
Since the data are positively correlated,
2
the value of r must be positive.
1
Also, because the data are fairly close
to forming a straight line, a value of 0.7
0
1
2
3
4
5
6
7
8
x
would be the most accurate.
Positive correlation
• You create a scatter plot of
© 2011 Carnegie Learning
r 5 20.09
Negative correlation
the x- values increase, on
the line of best fit, what type
of slope does the line have?
What type of correlation do
the data have?
another situation that
you think has a negative
correlation?
●
straight line, a value of 20.9 would be
• If the y- values decrease as
• What is an example of
r 5 0.09
The correct answer is r 5 20.9.
0
the x- values increase on the
line of best fit, what type of
slope does the line have?
What type of correlation do
the data have?
another situation that
you think has a positive
correlation?
r 5 20.9
●
4
• If the y- values increase as
• What is an example of
r 5 0.9
●
7
quantities to be related?
c.
© 2011 Carnegie Learning
data that shows the amount
of time since you made an
ice cream cone and the
amount of ice cream that
remains in the cone. What
type of correlation do the
data have?
●
y
8
●
r51
●
r 5 0.5
●
r 5 0.01
7
6
The correct answer is r 5 0.01.
5
Since there is not a linear relationship in
4
the data, the value of r is close to 0,
3
and 0.01 is close to 0.
2
1
0
1
2
3
4
5
6
7
8
x
No correlation
16.5 Correlation • 879
• What an example of a
situation that you think has
no correlation?
16.5 Correlation • 879
Talk the Talk
Students answer questions
about correlation and a line of
best fit.
Talk the Talk
1. Some lines of best fit model their data better than other lines of best fit. If a line of
Grouping
best fit models the data very well, what would you expect to see in a graph of the
data and the line?
Have students complete
Questions 1 and 2 with a
partner. Then share the
responses as a class.
The line is very close to all of the points.
2. Describe the graph of a collection of points that has no correlation.
Be prepared to share your solutions and methods.
880 • Chapter 16 Lines of Best Fit
880 • Chapter 16 Lines of Best Fit
© 2011 Carnegie Learning
© 2011 Carnegie Learning
The data do not fall in a straight line.
Follow Up
Assignment
Use the Assignment for Lesson 16.5 in the Student Assignments book. See the Teacher’s Resources
and Assessments book for answers.
Skills Practice
Refer to the Skills Practice worksheet for Lesson 16.5 in the Student Assignments book for additional
resources. See the Teacher’s Resources and Assessments book for answers.
Assessment
See the Assessments provided in the Teacher’s Resources and Assessments book for Chapter 16.
Check for Students’ Understanding
Determine whether the points in each scatter plot have a positive correlation, a negative correlation, or
no correlation. Explain your answer.
1.
y
20
20
18
18
16
16
14
14
12
12
10
10
8
8
6
6
4
4
2
2
0
© 2011 Carnegie Learning
2.
y
2
4
6
8
10 12 14 16 18 20 x
0
2
4
6
8
10 12 14 16 18 20 x
The points in this scatter plot have a
The points in this scatter plot have a
negative correlation because the line of
positive correlation because the line of
best fit has a negative slope.
best fit has a positive slope.
16.5 Correlation • 880A
© 2011 Carnegie Learning
880B • Chapter 16 Lines of Best Fit
Chapter 16
Summary
Key Terms
     extrapolating (16.1)
 linear regression (16.4)
 linear regression equation (16.4)
line of best fit (16.1)
model (16.1)
trend line (16.1)
interpolating (16.1)
Creating Scatter Plots and Drawing a Line of Best Fit
A scatter plot is a graph of data points. A line of best fit approximates the data in a scatter
plot as closely as possible. A line of best fit does not have to pass through all or any of the
data points. To determine how to construct a line of best fit, begin by plotting all the data.
Next, draw a shape that encloses all of the data. Then, draw a line that divides the
enclosed area of the data in half.
Example
Candice is an environmental engineer who is measuring the temperature of the ocean at
different depths. Her results are listed in the table. A scatter plot is shown from the
© 2011 Carnegie Learning
© 2011 Carnegie Learning
information in the table. A line of best fit is also shown on the graph.
Depth
(in meters)
0
100
200
300
400
500
600
700
800
900
Temperature
(°F)
81
76
73
70
66
61
56
52
48
43
Make sure you
stay organized
and not scattered!
It is easier for your
brain to understand
neat, organized
information.
Chapter 16 Summary • 881
Ocean Water Temperature
y
80
76
72
Temperature (ºF)
68
64
60
56
52
48
44
40
0
0
100 200 300 400 500 600 700 800 900 x
Water Depth (in meters)
Writing a Line of Best Fit Equation and Making Predictions
A line of best fit equation is usually written using the slope-intercept form of a line. The
slope-intercept form of a line is y 5 mx 1 b, where m represents the slope of the line and
b represents the y-intercept of the line. The equation for the line of best fit can be used to
make predictions about the related problem.
Example
An equation can be written that represents the line of best fit from the previous example.
That equation can be used to predict the water temperature at depths of 1000 feet and
1100 feet.
The line goes through the points (200, 73) and (500, 61). The slope of the line is
61 2 73 5 2____
12 5 20.04.
__________
500 2 200
300
73 5 20.04(200) 1 b
73 5 28 1 b
81 5 b
Therefore, the equation of the line of best fit is y 5 20.04x 1 81, where y represents the
water temperature in degrees Fahrenheit, and x represents the ocean depth in meters.
The water temperature at a depth of 1000 feet is predicted to be
y 5 20.04(1000) 1 81 5 240 1 81 5 41°F.
The water temperature at a depth of 1100 feet is predicted to be
y 5 20.04(1100) 1 81 5 244 1 81 5 37°F.
882 • Chapter 16 Lines of Best Fit
© 2011 Carnegie Learning
y 5 mx 1 b
© 2011 Carnegie Learning
Use the slope and one of the data points to determine the y-intercept.
Performing an Experiment and Comparing the Results
Lines of best fit can be used to model the results of experiments. The Stroop Test studies
a person’s perception of words and colors by using lists of color words (red, green, black,
and blue) that are written in one of the four colors. In a matching list, the color of the ink
matches the color of the word. In a non-matching list, the color of the ink does not match
the color of the word.
Example
Jin conducts an experiment using matching lists and non-matching lists. The matching list
consists of a list of repeated digits. The number of digits on each line matches the
numerical value of the repeated digit. For instance, one line in the list reads 5 5 5 5 5.
The non-matching list also consists of a list of repeated digits. The number of digits on
each line does not match the numerical value of the repeated digit. For instance, one line
in the non-matching list reads 7 7 7 7. In the experiment, Jin gives either the matching
list or the non-matching list to a person and asks them to say aloud the number of
digits in each row. Jin has determined that the line of best fit for the matching list data is
y 5 0.3x 1 0.2, where y represents the time in seconds it takes to read a list consisting
of x lines of digits. The line of best fit for the non-matching list is y 5 0.8x 1 0.2.
The slope in each line of best fit represents the predicted amount of time it takes to read
each line in the given list. The y-intercept represents the amount of time it takes to read
0 lines.
The predicted amount of time it takes a person to read a matching list containing
20 lines is
y 5 0.3(20) 1 0.2
y 5 6 1 0.2
y 5 6.20 seconds
© 2011 Carnegie Learning
© 2011 Carnegie Learning
The predicted amount of time it takes a person to read a non-matching list containing
20 lines is
y 5 0.8(20) 1 0.2
y 5 16 1 0.2
y 5 16.20 seconds
The predicted times make sense based on the experiment. It should take longer to read a
non-matching list of digits than to read a matching list of digits because the person must
count the digits in each line of the non-matching list.
Chapter 16 Summary • 883
Using Technology to Determine a Linear Regression Equation
Calculators and spreadsheet programs use methods to determine the line of best fit.
The resulting equation for the line of best fit is called the linear regression equation. The
calculator can also determine the correlation coefficient, r, which indicates how close the
data are to forming a straight line. The value of r is between 0 and 1 if the linear regression
equation has a positive slope. The value of r is between 0 and 21 if the linear regression
equation has a negative slope. The closer r is to 1 (if the data are positively correlated) or
21 (if the data are negatively correlated), the closer the data are to being in a straight line.
Example
Sabrina uses her calculator to determine the linear regression equation for a set of data.
The display screen of her calculator is shown. In this case, the value of a represents the
slope, and the value of b represents the y-intercept.
/LQ5HJ
\ D[E
D E U U The linear regression equation is y 5 21.33x 1 60.29. The data are negatively correlated,
because the slope of the linear regression equation is negative and the value of r is also
negative. The data are very close to being in a straight line because the value for r is very
884 • Chapter 16 Lines of Best Fit
© 2011 Carnegie Learning
© 2011 Carnegie Learning
close to 21.
Determining the Correlation of Data
When the line of best fit representing a set of data points on a scatter plot has a positive
slope, the points are positively correlated. When the line of best fit has a negative slope,
the points are negatively correlated. When there appears to be no relationship between
the x-values and the y-values and there is not a suitable line of best fit to represent the
data, the points have no correlation.
Example
The scatter plots shown give an example of points that are positively correlated, points
that are negatively correlated, and points that have no correlation.
y
y
9
9
8
8
7
7
6
6
5
5
4
4
3
3
2
2
1
1
1
2
3
4
5
6
7
8
x
9
1
Negative Correlation
2
3
4
5
6
7
8
9
x
No Correlation
y
9
8
7
© 2011 Carnegie Learning
© 2011 Carnegie Learning
6
5
4
3
2
1
1
2
3
4
5
6
7
8
9
x
Positive Correlation
Chapter 16 Summary • 885
886 • Chapter 16 Lines of Best Fit
© 2011 Carnegie Learning
© 2011 Carnegie Learning