Professional Documents
Culture Documents
INTRODUCTION
Correlation refers to the statistical relationship between two entities. In other words, it's
how two variables move in relation to one another. This means the two variables
moved either up or down in the same direction together.
It is a statistical measure that expresses the extent to which two variables are linearly
related (meaning they change together at a constant rate). It's a common tool for
describing simple relationships without making a statement about cause and effect.
Correlation is used to test relationships between quantitative variables or categorical
variables. In other words, it’s a measure of how things are related. The study of how
variables are correlated is called correlation analysis.
OBJECTIVES
At the end of this discussion we can be able to:
1. Understand the meaning and purpose of correlational analysis.
2. Know and identify the possible results of a correlational study.
3. Understand and apply the methods in solving correlational relationships.
4. Identify the types of correlational researches.
5. Know the basics in interpreting correlational data using SPSS.
DISCUSSION
As written above, correlation focuses on relationship of variables or bivariate data.
Correlation was invented by Francis Galton and dates from late in the year 1888, and it
arose when he recognized a common thread in three different scientific problems he
was studying.
Some examples of data that have a low correlation (or none at all):
Correlations are useful because if you can find out what relationship variables have,
you can make predictions about future behavior. Knowing what the future holds is
very important in the social sciences like government and healthcare. Businesses also
use these statistics for budgets and business plans.
There are three possible results of a correlational study: a positive correlation, a
negative correlation, and no correlation.
POSITIVE CORRELATION
■ A positive correlation occurs whenever the
change in the variables is occurring in the
same direction. If an increase in variable A
occurs and results in an increase in variable B,
there is a positive correlation. For example, an
increase in the number of hours that students
study could result in an increase in test scores
or lower ACT scores may indicate poorer
performance in college. These are both
examples of positive correlations because the
variables are moving in the same direction.
NEGATIVE CORRELATION
A negative (inverse) correlation occurs when one variable
increases and the other variable decreases. An example would
be the relationship between increasing exercise and reducing
the number of doctor visits for colds and common illnesses.
The data from a correlational study is often represented graphically using a scatterplot
or scatter diagram. Scatterplots are used to summarize the relationship between two
variables (X and Y) by plotting the discrete data points and then looking for overall
trends. The following graph represents the three main types of correlational
relationships:
The strength of the relationship is measure of how consistently the values of each
variable change in relation to each other. Graphically, the stronger the relationship, the
closer the data points will fall along a line as seen in the example below:
Example:
The more hours students spend studying for a test, the higher their test
grades tend to be. This correlation would be positive because as the values on
hours spent studying go up, test grades tend to go up (they move in the same
direction).
Example:
The more hours students spend partying the night before an exam, the
lower their test grades tend to be. This correlation would be negative because as
the values on hours spent partying go up, the values on test grades tend to go
down (they move in opposite direction).
Reference: Research Methods, Design, and Analysis, Twelfth Edition. Pg. 394Copyright ©
2014, 2011, 2007 Pearson Education, Inc
Formula
: Where:
N = Number of pair Scores
∑ x = Summation of the Values of x
∑ y = Summation of the Values of y
∑ xy = Summation of the Product of x and y
∑ x 2= Summation x
∑ y2= Summation y
Example 1
The more hours students spend studying for a test, the higher their test score
tend to be. This correlation would be positive because as the values on hours spent
studying go up, test grades tend to go up (they move in the same direction). Test items
is 20
X = Hours spend studying Y= Test Grade
x y xy x2 y2
1 10 10 1 10
3 15 45 9 225
4 18 72 16 324
5 20 100 25 400
N ∑ xy −∑ x ∑ y
∑ r= √¿¿¿
¿
Very strong
positive
association/correl
ation
130 130
r= √(50)(340) r= √17000 r = 0.997
Kendall’s Tau coefficient of correlation is usually smaller values than Spearman’s rho
correlation. The calculations are based on concordant and discordant pairs. Insensitive
to error. P values are more accurate with smaller sample sizes.
Assumptions
REFERENCE: https://towardsdatascience.com/kendall-rank-correlation-explained-
dee01d99c535
Step 2: Count the number of concordant pairs, using the second column. Concordant
pairs are how many larger ranks are below a certain rank. For example, the first rank in
the second interviewer’s column is a “1”, so all 11 ranks below it are larger.
However, going down the list to the third row (a rank of 4), the rank immediately below
(3) is smaller, so it doesn’t count for a concordant pair.
Step 3: Count the number of discordant pairs and insert them into the next column. The
number of discordant pairs is similar to Step 2, only you’re looking for smaller ranks,
not larger ones.
Perfect Correlation
Counting how many values are below the second column seems very odd when you
first do it. But it does work. Just as a thought experiment, here’s what the spreadsheet
looks like if both interviewers were in perfect agreement:
If you want to calculate statistical significance for your result, use this formula to get a
z-value:
Finding the area for a z-score of 3.85 on a z-table gives an area of .0001 — a tiny
probability value which tells you this result is statistically significant.
Formula :
Where,
Sr valu e
Example problem: Find the correlation of the scores for Mathematics and
English
math English Math English D D2
56 66 6 2 4 16
75 70 2 1 1 1
45 40 7 7 0 0
6 Σ D²
Sr= 1- -------------
n³- n
ΣD²=42
n=7 6 (42)
Sr= 1- -------------
7³ -7
Solve for the value of S?
6(42)
S= 1- --------------
7³ -7
25
Sr=1 --------------
343-7
257
Sr= 1--------------
336
S= 1- 0/75
3. Archival Research - involves analyzing data that has previously been collected by
others and looking for correlations. The researcher does not have control over the data
or how it was gathered, however, the researcher may have access to large amounts of
data with relatively little effort and often the data is free. For example, a researcher
may examine the crime statistics of several neighborhoods to see if there is any
correlation with crime and a sluggish housing market in particular areas.
STRENGHTS OF CORRELATION
■ Correlation allows the researcher to investigate naturally occurring variables that
maybe unethical or impractical to test experimentally. For example, it would be
unethical to conduct an experiment on whether smoking causes lung cancer.
■ Correlation allows the researcher to clearly and easily see if there is a
relationship between variables. This can then be displayed in a graphical form.
LIMITATIONS OF CORRELATION
■ Correlation is not and cannot be taken to imply causation. Even if there is a very
strong association between two variables we cannot assume that one causes the
other.
■ For example suppose we found a positive correlation between watching violence
on T.V. and violent behavior in adolescence. It could be that the cause of both
these is a third (extraneous) variable - say for example, growing up in a violent
home - and that both the watching of T.V. and the violent behavior are the
outcome of this.
■ Correlation does not allow us to go beyond the data that is given. For example
suppose it was found that there was an association between time spent on
SUMMARY
"Correlation is not causation" means that just because two variables are
related it does not necessarily mean that one causes the other.
ASSESSMENT
1.
2.
3.
4.
5.
REFERENCES
https://www.simplypsychology.org/correlation.html?
fbclid=IwAR3Yv9ncJDFWwkIoym_y7HhDR7gqDPxK27YcVue519gllyGL
Mkro5j2BYc
THANK YOU!