Professional Documents
Culture Documents
10
Mathematics
Semester
2
Bivariate
Data
–
Chapter
13
Why
learn
this?
Observations
of
two
or
more
variables
are
often
recorded,
for
example,
the
heights
and
weights
of
individuals.
Studying
the
data
allows
us
to
investigate
whether
there
is
any
relationship
between
the
variables,
how
strong
the
relationship
is,
and
whether
one
variable
can
be
effectively
predicted
from
information
about
another
variable.
Statistics
can
be
applied
to
medical
research,
sport,
agriculture,
sustainability,
weather
forecasting
and
fashion
trends,
to
name
but a
few
fields.
The
capacity
to
analyse
data
and
draw
conclusions
is
an
essential
skill
in
a
world
where
information
is
readily
available
and
often
manipulated.
Example:
An
ice
cream
shop
keeps
track
of
how
much
ice
cream
they
sell
versus
the
temperature
on
that
day.
The
two
variables
are
Ice
Cream
Sales
and
Temperature.
Here
are
their
figures
for
the
last
12
days:
Temperature
°C
24.2°
26.4°
21.9°
25.2°
28.5°
32.1°
29.4°
35.1°
33.4°
28.1°
32.6°
27.2°
Ice
Cream
Sales
$215
$325
$185
$332
$406
$522
$412
$614
$544
$421
$445
$408
Now
we
can
easily
see
that
warmer
weather
and
more
ice
cream
sales
are
linked,
but
the
relationship
is
not
perfect.
Questions
Section
Title
Questions
13.2
Bivariate
Data
1d–g,
2,
3,
4,
5,
7,
9,
12
13.3
Lines
of
best
fit
1,
2,
3,
4,
5,
6,
7,
8,
10,
11
13.4
Time
Series
1,
2,
3,
4,
5,
6,
7,
9,
10
More
resources
http://drweiser.weebly.com
Worked
Example
2
State
the
type
of
correlation
between
the
variables
y
x
and
y,
shown
on
the
scatterplot.
Correlation
and
causation
Even
a
strong
correlation
does
not
necessarily
mean
that
the
increase
or
decrease
in
the
level
of
one
variable
causes
an
increase
or
decrease
in
the
level
of
the
other.
It
is
best
to
avoid
statements
such
as: ‘An
increase
in
rainfall
causes
an
increase
in
the
wheat
growth.’
The
following
guidelines
should
be
closely
followed
in
order
to
draw
a
conclusion
about
the
relationship
between
the
two
variables
based
on
the
scatterplot.
• If
the
correlation
between
x
and
y
is
weak,
we
can
conclude
that
there
is
little
evidence
to
show
that
the
larger
x
is,
the
larger
(positive
correlation)
or
smaller
(negative
correlation)
y
is.
• If
the
correlation
between
x
and
y
is
moderate,
we
can
conclude
that
there
is
evidence
to
show
that
the
larger
x
is,
the
larger
(positive
correlation)
or
smaller
(negative
correlation)
y
is.
• If
the
correlation
between
x
and
y
is
strong,
we
can
conclude
that
the
larger
x
is,
the
larger
(positive
correlation)
or
smaller
(negative
correlation)
y
is.
Open a Data & Statistics page.
Press TAB e
to
locate the label of the horizontal axis
and select the variable ‘price’.
Press TAB e
again to locate the label of the vertical
axis and select the variable ‘sold’.
To
change
the
colour
of
the
scatterplot,
place
the
pointer
over
one
of
the
data
points.
Then
press
DOC
~.
Press:
•
2:
Edit
2
•
7:
Colour
7.
•
2:
Fill
Colour
2.
Select
a
colour
from
the
palette
for
the
scatterplot.
Press
ENTER
· .
b) State
the
type
of
correlation
between
the
two
variables
and,
hence,
draw
a
corresponding
conclusion.
The
points
on
the
plot
form
a
path
that
resembles a
straight,
narrow
band,
directed
from
the
top
left
corner
to
the
bottom
right
corner.
The
points
are
close
to
forming
a
straight
line.
There
is
a
strong,
negative,
linear
correlation
between
the
two
variables.
The
price
of
the
shirt
appears
to
affect
the
number
sold;
that
is,
the
more
expensive
the
shirt
the
fewer
sold.
Open a Data & Statistics page.
Press TAB e
to
locate the label of the horizontal axis
and select the variable ‘price’.
Press TAB e
again to locate the label of the vertical
axis and select the variable ‘sold’.
Page
6
of
14
b) Draw
in
the
line
of
best
fit
by
eye.
c) Find
the
equation
of
the
line
of
best
fit
in
terms
of
the
variables
n
(number
of
hours)
and
C
(monthly
cost).
Alternatively
on
the
CAS
Press menu b
4:
Analyze
6:
Regression
1: Show linear (mx+b)
The equation in terms of y=mx+b will be shown on the
graph.
Making
predictions
The
line
of
best
fit
can
be
used
to
predict
the
value
of
one
variable
from
that
of
another.
Because
of
the
subjective
nature
of
the
line,
it
should
be
noted
that
predictions
are
not
accurate
values,
but
rough
estimates.
Although
this
is
the
case,
predictions
using
this
method
are
considered
valuable
when
no
other
methods
are
available.
If
the
equation
of
the
line
of
best
fit
is
known,
or
can
be
derived,
predictions
can
be
made
by
substituting
known
values
into
the
equation
of
the
line
of
best
fit.
Worked
Example
6
The
table
below
shows
the
number
of
boxes
of
tissues
purchased
by
hayfever
sufferers
and
the
number
of
days
affected
by
hayfever
during
the
blooming
season
in
spring.
The equation and values of m and
b (c) are shown.
Correlation
coefficient
Once
a
relationship
between
two
variables
has
been
established,
it
is
helpful
to
develop
a
quantitative
value
to
measure
the
strength
of
the
relationship.
One
way
is
to
calculate
a
correlation
coefficient,
r.
This
is
easily
done
using
a
CAS
calculator,
but
a
manual
method
is
shown
below.
The
formula
for
the
Correlation
coefficient
r
is:
𝑥−𝑥 𝑦−𝑦
𝑟=
𝑥−𝑥 ) 𝑦−𝑦 )
• A
cyclical
pattern
displays
fluctuations
that
repeat
• Random
patterns
do
not
show
any
regular
but
will
usually
take
longer
than a
year
to
repeat.
fluctuation.
They
are
usually
caused
by
An
example
of
this
is
shown
in
the
graph
below,
unpredictable
events
such
as
the
economic
which
depicts
software
products
sold.
recession
illustrated
in
the graph
below.
63.6, 63.8, 63.5, 63.7, 63.2, 63.0, 62.8, 63.3, 63.1, 62.7, 62.6, 62.5, 62.9, 63.0,
63.1, 62.9, 62.6, 62.8, 63.0, 62.6, 62.5, 62.1, 61.8, 62.2, 62.0, 61.7, 61.5, 61.2
Trend
lines
• A
trend
line
is
a
type
of
line
of
best
fit.
Trend
lines
indicate
the
general
trend
of
the
data.
• Trend
lines
are
useful
in
forecasting,
or
making
predictions
about
the
future,
by
extrapolation.
Extrapolation
can
have
limited
reliability,
as
predictions
are
based
on
the
assumption
that
the
current
trend
will
continue
into
the
future.
Worked
Example
10
The
graph
at
shows
the
average
cost
of
renting
a
one-‐bedroom
flat,
as
recorded
over
a
10-‐year
period.
a) Use
technology
to
construct
a
time
series
graph,
with
a
line
of
best
fit,
that
represents
the
data.
• In a new problem on a Lists & Spreadsheet page
• label column A as ‘year’
• label column B as ‘numbermoved’.
• Enter the data as in the table above.
• Open
a
Data
&
Statistics
page.
• Press
TAB e
to locate the label of the horizontal axis
and select the variable ‘year’.
• Press TAB e
again to locate the label of the vertical
axis and select the variable ‘numbermoved’.
• Press: MENUb
• 4: Analyze 4
• 2: Add Moveable Line 2
A line and its equation appear automatically on the graph as shown.
Repositioning the line is done in two steps by moving the position of
the y-intercept and altering the gradient. To change the y-intercept,
move the pointer until it rests somewhere around the middle of the
line. Press the Click key a, then use the Touchpad to move the line
parallel to itself until the y-intercept is in an appropriate position. To
change the gradient, move the pointer until it rests somewhere near
one end of the line. Press the Click key a, then use the Touchpad to
rotate the line and change the gradient. Continue to use these tools
until you are satisfied with the line of best fit by eye.
b) Describe
the
trend:
There
appears
to
be
an
upward
trend
over
the
10
years.
c) Measure
the
correlation.
Back on the Spreadsheet page
Press: MENUb
• 4: Statistics 4
• 1: Stat Calculations 1
• 3: Linear Regression (mx + b) 3.
Select ‘year’ as the X List and ‘numbermoved’ as the Y List, leave
the next fields blank, TAB e
to OK, then press ENTER ·.