Professional Documents
Culture Documents
If you ask me which tool, out of the seven basic tools of quality, appears most
difficult to understand, my answer would be the scatter diagram.
Did you notice that I asked, “Which tool appears most difficult”?
I did not ask which tool is difficult to understand, difficult to draw, or difficult to
interpret.
Other charts use lines or bars to show findings, while a scatter diagram uses only
dots. This is why this graph looks a bit different. However, this chart is just as
easy to understand as line and bar charts.
I hope, after reading this blog post, you will not face any problems in
understanding the scatter diagram.
The scatter diagram is used to find the correlation between these two variables.
This diagram helps you determine how closely the two variables are related.
After determining the correlation between the variables, you can then predict the
behavior of the dependent variable based on the measure of the independent
variable. This chart is very useful when one variable is easy to measure and the
other is not.
Example
You are analyzing the pattern of accidents on a highway. You select the two
variables: motor speed and number of accidents, and draw the diagram.
According to the PMBOK Guide 5th edition, the scatter diagram is, “A correlation
chart that uses a regression line to explain or to predict how the change in an
independent variable will change a dependent variable.”
The PMBOK Guide states that scatter diagram helps you see the changes in the
dependent variable if you make any change to the independent variable. Since
this diagram shows you the correlation between the variables, it is also known as
a correlation chart.
Usually the independent variable is plotted along the horizontal axis (x-axis) and
the dependent variable is plotted on the vertical axis (y-axis). The independent
variable is also known as the control parameter because it influences the
behavior of the dependent variable.
It is not necessary for one parameter to be a controlling parameter. You can draw
the scatter diagram with both variables independent to each other. In this case
you can draw any variable on any axis.
I am giving you two types because it will show you the same chart with two
different perspectives, this will help you build a solid understanding regarding the
scatter diagram.
In this type of scatter diagram, data points are spread so randomly that you
cannot draw any line through them.
In this case you can say that there is no relation between these two variables.
Here, the data points are little closer together and you can feel that some kind of
relation exists between these two variables.
In this diagram, data points are grouped very close to each other such that you
can draw a line by following their pattern.
In this case you will say that the variables are closely related to each other.
As discussed earlier, you can also divide the scatter diagram according to the
slope, or trend, of the data points:
This type of diagram is also known as Scatter Diagram with Positive Slant.
In positive slant, the correlation will be positive, i.e. as the value of x increases,
the value of y will also increase. You can say that the slope of straight line drawn
along the data points will go up. The pattern will resemble the straight line.
For example, if the temperature goes up, cold drink sales will also go up.
Here as the value of x increases the value of y will also tend to increase, but the
pattern will not closely resemble a straight line.
This type of diagram is also known as Scatter Diagram with Negative Slant.
In negative slant, the correlation will be negative, i.e. as the value of x increases,
the value of y will decrease. The slope of a straight line drawn along the data
points will go down.
For example, if the temperature goes up, sales of winter coats goes down.
Here as the value of x increases the value of y will tend to decrease, but the
pattern will not be as well defined.
• Scatter diagrams are unable to give you the exact extent of correlation.
• Scatter diagram does not show you the quantitative measure of the
relationship between the variable. It only shows the quantitative expression of
the quantitative change.
• This chart does not show you the relationship for more than two variables.
Please keep in mind that the scatter diagram is different than the Ishikawa
diagram. The Ishikawa diagram shows you only the variables; it does not show
you the relationship between these variables. However, the Ishikawa diagram
can help you draw the scatter diagram; for example, you can find the two
variables (cause and effect), and then draw the scatter diagram to analyze the
relationship between them.
Here is where this blog post on the scatter diagram ends. If you have something
to share or any questoins, you can do so through the comments section.
1. The time, in minutes, that seven teenagers spent using their computer and spent watching TV on one day
is recorded in the table.
50
40
Time spent
watching TV 30
(minutes)
20
10
0
0 10 20 30 40 50 60
Time spent using computer (minutes)
(2)
.....................................................................................................................................
.....................................................................................................................................
(1)
(Total 4 marks)
100
80
60
Paper 2 mark
40
20
0
0 20 40 60 80 100
Paper 1 mark
.......................................................................................................................................
.......................................................................................................................................
.......................................................................................................................................
(1)
(d) Kay was absent for Paper 2, but scored a mark of 56 on Paper 1.
Use your line of best fit to estimate Kay’s mark on Paper 2.
.......................................................................................................................................
200
180
160
140
Number of
ice creams 120
sold
100
80
60
40
20
0
50 52 54 56 58 60 62 64 66 68 70
Midday temperature (ºF)
(b) Describe the relationship between the number of ice creams sold and the midday temperature.
.....................................................................................................................................
.....................................................................................................................................
(1)
(Total 2 marks)
80
75 ×
× ×
70 × × ×
× ×
65 ×
Resting pulse rate ×
(beats per minute) 60
×
× × ×
55
50 × × ×
×
45
40
20 25 30 35 40 45 50 55 60 65 70 75
Miles run per week
(a) How many runners have a resting pulse rate of 57 beats per minute?
Answer ................................................................
(1)
(c) Predict the resting pulse rate of a runner who runs 40 miles per week.
(d) Describe the relationship between the resting pulse rate and miles run per week.
.....................................................................................................................................
.....................................................................................................................................
(1)
(Total 4 marks)
40
30 ×
× ×
Trunk diameter
(centimetres) 20 × ×
×
10 × ×
0
0 2 4 6 8
Height (metres)
(b) Draw a line of best fit through the points on the scatter graph.
(1)
.....................…………………………………………………………………………
.....................…………………………………………………………………………
(1)
.....................…………………………………………………………………………
.....................…………………………………………………………………………
(1)
(Total 4 marks)
Compiled By: Uzair Yaqoob (City School PECHS Boys Campus)
6. The value of six houses in 2000 is compared to the value of similar houses in 2004.
Here are the results.
200
180
160
140
120
House value
in 2004
(thousands 100
of pounds)
80
60
40
20
0
0 10 20 30 40 50 60 70 80 90 100
House value in 2000 (thousands of pounds)
(2)
.....................................................................................................................................
.....................................................................................................................................
(1)
(c) In 2000 a house was valued at £80 000.
Compiled By: Uzair Yaqoob (City School PECHS Boys Campus)
Estimate the value of a similar house in 2004.
.....................................................................................................................................
Answer £ .......................................................................
(2)
(Total 5 marks)
Time (hours) 2 3 5 7 8 10
Mark 30 26 34 38 45 48
55
50
45
40
Mark
35
30
25
20
0
0 1 2 3 4 5 6 7 8 9 10
Time (hours)
(2)
(c) Use your line of best fit to estimate the mark of a pupil who revised for 4 hours.
Answer ………………………………….........
(1)
(Total 4 marks)
30 ×
Number 20
× ×
of cats
×
10
× ×
0
0 10 20 30 40 50 60 70
Number of dogs
(a) Ayville has the lowest number of cats of the six villages.
Use the graph to find the number of cats in Ayville.
Answer ..................…….........................…..
(1)
(b) The point plotted for Beeville does not fit the general trend.
Circle the point for Beeville on the scatter graph.
(1)
(Total 2 marks)
9. Eight teenagers took part in a general knowledge quiz and a pop-music quiz.
The scatter graph shows their scores.
14
12
10
Score in
pop-music 8
quiz
6
0
0 2 4 6 8 10 12 14 16 18 20
Score in general knowledge quiz
......................................................................................................................................
......................................................................................................................................
(1)
10. The table shows the school year and the reaction time of eight people who took part in the same test.
School year 5 7 8 9 10 11 12 13
Reaction time
6 5 4.8 4.5 4 4.2 3.5 3
(seconds)
4
Reaction
time
(seconds)
3
0
0 5 6 7 8 9 10 11 12 13
School year
(2)
......................................................................................................................................
......................................................................................................................................
......................................................................................................................................
(1)
(Total 4 marks)
2. (a) 88 B1
(b) The higher the mark on paper 1, the higher the mark on paper 2 B1
oe
accept positive correlation
(c) Straight line passing between (20, 14) to (90, 82) and (20, 25) to (90, 93) B1
(d) Correct mark (± 1) from their line B1 ft
[4]
4. (a) 3 B1
(b) Line between limits B1
Passing between (20, 71) and (20, 76)
(inclusive) at one end and (65, 47) and (70, 50)
(inclusive) at the other. If not ’ruled’ BO.
(c) (63) B1ft
ft their line. ± lmm
(d) Negative correlation or
More miles run, lower pulse rate B1
oe
[4]