Professional Documents
Culture Documents
Persons CorrelationEconomicStatistics 1910
Persons CorrelationEconomicStatistics 1910
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide
range of content in a trusted digital archive. We use information technology and tools to increase productivity and
facilitate new forms of scholarship. For more information about JSTOR, please contact support@jstor.org.
Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at
https://about.jstor.org/terms
Taylor & Francis, Ltd. and American Statistical Association are collaborating with JSTOR to
digitize, preserve and extend access to Publications of the American Statistical Association
Economics deals mainly with correlation rather than with simple causa?
tion (p. 287); necessity of a method of measuring the correspondence between
two series of statistics (p. 288); illustrations of the use of the graphic method
(p. 290); the coefficient of correlation defined and illustrated (p. 298); the coeffi?
cient of correlation as a measure of the grouping of points about the line of
regression (p. 303); the equation of the line of regression (p. 303); the coeffi?
cients of correlation computed for the illustrations cited above (p. 306); two
influences affect the size of the coefficient of correlation, i. e., short-time fluctua?
tions and secular tendency (p. 306); three methods of isolating the two influ?
ences are described and illustrated (p. 310); the measurement of the correlation
among three variables (p. 319); conclusion (p.?
fSee Quarterly Journal of Economics, Feb., 1908, p. 274, for derivation of equation. In the
current article, I quote from the review of Doctor Kemmerer's book.
p=MR+CRc *
r* NE+NCEC
borne out by the facts?
All of the questions to be tested by the statistics co
are questions of correlation. Dr. Kemmerer makes t
graphically, as has been stated, by comparing the fluc
of the two curves based upon the pair of series of sta
being considered. The charts presented by Dr. Kem
from which his conclusions are drawn are given below
In the case of the correlation of bank reserves and m
in circulation, inclusive of bank reserves, Dr. Kemmer
cludes, "There can be no question but that when du
ance is made for fluctuations in business confidence,
dence of Chart I strongly supports the contention th
exists a close relationship between the amount of m
circulation and the amount of the country's bank res
In the case of the correlation of business distrust and the ratio
of bank reserves to check circulation the conclusion is, "the
chart substantiates the contention . . . that the ratio
of check circulation to bank reserves is a function of business
o I .r-l
io
o
?H
CTN
00
c3 !
U!
#|
CD I a
o
? PH
?$=H
'c3
r-J
J=J I CO
O o <D
O
?l-l ?H
cd o
rH CU
?*
o ox
?ali ap
"OT"
O rn
cs
<D
> ac; CD
CS
r-l
Q>
Od
ON
CO
S
^
a a
Per capita
imports plus
exports.
30
1881 18^5 1890 1895 1900 1205
? Index Numbers of Prices.
no
n^i^'
where:
X = l, 2, 3, 4, 5 Mi= 3
Y = 6, 8, 10, 12, 14 M2 = 10
-2 -4 16
-1 -2 4
0 0 0
<72=2\/2
20
+1 +2 4 =1
+2 +4 16 r"~"5\/2-2V/2
There
Assume r = l
then (2xy)2-2x2-2y2=0
Letting Xi = A1y1, x2 = A2y2 . . . x? = ^?y? the above expression
becomes
yi2y22(^-^)2+yi2y32(^i-^)2+ ... +yr2ys2(^-^)2+ .. .=0
The only way in which this expression can equal zero
is by having
Al = A2 = A3 = . . . = An
and it follows that
Xi = A1y1, x2=Aiy2 . . . x? = Aiyn
or
X=AXy
which denotes a linear relationship between X and Y.
That any relation other than a linear one will not lead to
r=l is illustrated by the following:
Let Y=X2
X = l, 2, 3, 4, 5, Mi= 3
Y = l, 4, 9, 16, 25, M2 = ll
-2 -10 100 20
-1 - 7 49 7 ^ = 1.41
0 - 2 4 0 *2 = 8.65
+1 + 5 25 5 r =0.981
+2 +14 196 28
Total.10 374 60
I ??.?
'Vt.
?I ... \-*[t!4 * *
'' ^4---^ ~ - ? -
. . ?./?. ^.?. . ?
Bank Reserves.
200
150
LINS OP REGRESSION
100
y = 1.591 - 46.3
Money in Circulation.
*i
x = r?y
and X = M!+r^-(Y-M2)
The coefficients r? and r? are called the coefficients of
ffl ff2
are the s
ured in
the slop
words, th
being m
the coeff
tive corr
axis for p
of 135?
be paralle
In the p
In some
locus su
. or y = a+bcx or any other empirical function. The
work in fitting such a curve to the points would be great and
the advantages are not sufficient to compensate for the addi?
tional work.*
The following table gives the correlation coefficients for the
various pairs of series of statistics for which Dr. Kemmerer has
attempted to show correlation by means of charts :
Coefficient of Corre?
Statistics of Period Covered. lation
=?= Probable Error.
f Relative circulation.
1879-1901 +0.23=*=0.13
\ General Prices.
*Karl Pearson in an article "On the General Theory of Skew Correlation and Non?
linear Regression" (published by Dulau and Co. in 1905 as one of the Drapers' Company
Research Memoirs) discusses the cases in which the law connecting the two series of sta?
tistics is not linear. He says, "In the great bulk of biometrical and economical enquiries,
however, the regression does not diverge very markedly from the linear form. In the cases
of non-linear regression that I have hitherto had to deal with, I find that parabolae of the
second or third order will suffice as a rule to describe the deviations from linearity" (p. 21).
Equations of parabolae of the second and third order are of the form
y=a0+a1x+a2x2
and y=a0+a1x+a2x2+a3x3
Coefficient of
Statistics of Period Correlation
fe Probable Error
Coefficient
Deviations
Series Period of
from
Correlation
(Marriage rate.
Hartley's index numbers of unem?
ployment .
1870-1895 ll yr. means -0.873
Series
Correlated Business item pre? Value of Business item pre? Value of
cedes marriage- Coefficient cedes marriage- Coefficient
rate item by rate item by
Having found that the trade cycle affects marriage rate Mr.
Yule asks the question, "Does the cycle affect the birth-rate,
and how?* The relation between marriage-rate and birth-rate
is shown by the following table :
Series Period
Deviations (a) Correlated Correlation Coeffi?
from with (b) of cient ?P. E.
( X0, Xi .... Xn ^
Lettings > represent two series of
( X'0, X'i .... X'n ) measurements
( di, d2 .... dn "> represent differences be-
and ?< > tween any two consecu-
(d'i, d'2 . . . . d'? ) tive measurem
(dm)
and ?< >* represent the respective means of these differ-
( d'm ) ences,
then dm = ?-?
n
=n?, and
d, =X/n-X/o =g
n n '
and the standard deviations of the differences are
Z(d-dm)2
S(d-dm)(d,-d,m) Sdd/-ndmd/m
P =
nS'S VC^-ndm2) CSd^-ndV)
Comparing this method of differences with the method
described in (1) Mr. Hooker says, "Correlation of the devia?
tions from an instantaneous average (or trend) may be
adopted to test the similarity of more or less marked periodic
influences, correlation of the difference between successive
values will probably prove most useful where the similarity
of the shorter rapid changes (with no apparent periodicity)
are the subject of investigation, or where the normal level of
one or both series of observations does not remain constant." f
He finds that the ordinary correlation coefficient (r) for the
price of corn in Iowa and total production in the United States
for the period 1870-1899 is -0.28, while p = -0.84.
*Ibid., Vol. 68, p. 697.
Ubid., p. 703.
y=-0.0256x+1.132
x=-27.05y+46.42
y = 24,086,000 (1.0238)x,
y * 24,086,000 (1.0258)
Series
There
3. Production and Price combined in the ratio 1:1, and Exports... +0.90
4. Production and Price combined in the ratio 3:1, and Exports... +0.81