Professional Documents
Culture Documents
21
Introduction to Statistical
Techniques in Social Work
* D.K. Lal Das
Introduction
Knowledge of statistics helps the social worker in
two ways. First, the knowledge allows the social
worker to be able to analyse the data as per his/
her research objectives and draw inferences which
in turn enable him/her to expand and improve not
only the knowledge base of his profession but it
also enables him to make his practice effective.
Second, as a consumer of research it enables him/
her to understand the analysis of data and appreciate
the statistical procedures used in research reports.
In the previous Chapter we discussed about preparation
of the code book, master chart and tables. In this
Chapter, we will learn some of the statistical
techniques which social workers need to apply in
course of data analysis.
1 40 11 60 21 43 31 58
2 37 12 38 22 41 32 24
3 61 13 39 23 25 33 71
4 67 14 40 24 42 34 55
5 59 15 51 25 38 35 33
6 70 16 37 26 40 36 65
7 39 17 40 27 50 37 55
8 46 18 72 28 30 38 66
9 68 19 39 29 33 39 40
10 41 20 50 30 54 40 62
24 I 37 II 50 II 63 -
25 I 38 II 51 I 64 -
26 - 39 III 52 - 65 I
27 - 40 53 - 66 I
28 - 41 II 54 I 67 I
29 - 42 I 55 II 68 I
30 I 43 I 56 - 69 -
31 - 44 - 57 - 70 I
32 - 45 - 58 I 71 I
33 II 46 I 59 I 72 I
34 - 47 - 60 I
35 - 48 - 61 I
36 - 49 - 62 I
Total 91
Processing and Analysis of Data 97
152.5
54+58+60+62+70+72+75+77 528
Mean = ————————————————— = ——— = 66
8 8
98 Social Work Intervention with Communities and Institutions
1 110-130 15
2 130-150 30
3 150-170 60
4 170-190 95
5 190-210 82
6 210-230 75
7 230-250 23
Total 380
Solution:
Calculation:
240
= ————— = 120
2
__ ∑ m f m f
X = ———— = ————
∑ f N
71120
= ————— = 187.16
380
Solution:
Table
Deviation
Monthly Mid- No. of from Step Total
wages values workers assumed deviation deviation
(in Rs.) (m) (f) mean. 180 (d) fd
(dx)
110-130 120 15 –60 –3 –45
130-150 140 30 –40 –2 –60
150-170 160 60 –20 –1 –60
170-190 180 95 0 0 0
190-210 200 82 +20 +1 +82
210-230 220 75 +40 +2 +150
230-250 240 23 +60 +3 +69
N = 380 fd = 136
100 Social Work Intervention with Communities and Institutions
__ ∑fd
Mean (X) = a + ——— x i
N
Where ‘a’ stands for the assumed mean, ∑fd for the
sum of total deviations, N for total number of
frequencies and ‘i’ for class interval. Now substituting
the values in the formula from the table we get :
136
= 180 + ——— x 20
380
= 180 + 7.16
Mean = Rs.187.16 (approximately)
The Median
The median is another simple measure of central
tendency. We sometimes want to locate the position
of the middle item when data have been arranged.
This measure is also known as positional averages.
We define the median as the size of the middle
item when the items are arrayed in ascending or
descending order of magnitude. This means that
median divides the series in such a manner that
there are as many items above or larger than the
middle one as there are below or smaller than it.
In continuous series we do not know every observation.
Instead, we have record of the frequencies with
which the observations appear in each of the class-
intervals as in the following Table. Nevertheless,
we can compute the median by determining which
class-interval contains the median.
Processing and Analysis of Data 101
110 – 130 15 15
130 – 150 30 45
N = 380
In the case of data given in Table above the median
value is that value on either side of which N or
380 2
2
380
or 190th items lie. Now the problem is to find
2
the class interval containing the 190 th item. The
cumulative frequency for the first three classes is
only 105. But when we move to the fourth class
interval 95 items are added to 105 for total of 200.
Therefore, the 190 th item must be located in this
fourth class-interval (the interval from Rs. 170 –
Rs. 190).
The median class (Rs.170 – Rs.190) for the series
contains 95 items. For the purpose of determining
the point, which has 190 items on each side, we
assume that these 95 items are evenly spaced over
the entire class interval 170–190. Therefore, we
can interpolate and find the values for 190 th item.
First, we determine that the 190th item is the 95th
102 Social Work Intervention with Communities and Institutions
190 – 170
—————— = 0.21053 (approximately)
95
Where
X = median,
L = lower limit of the class in which median lies
N = total number of items
C = cumulative frequency of the class prior to
the median class.
‘f = frequency of the median class
i = class interval of the median class.
380 – 105
2
= 170 + ——————— x (190 – 170)
95
190 – 105
= 170 + ——————— x (190 – 170)
95
Processing and Analysis of Data 103
85
= 170 + ——— x (190 – 170)
95
100 – 200 5
11 26
200 – 300 6
21
300 – 400 15 31
25
400 – 500 10 30
15
500 – 600 5 19
9
600 –700 4 12
7
700 –800 3 9
5
800 –900 2
1 1
2 1 1
3 1 1
4 1 1 1
5 1 1 1
6 1 1 1
No. of
times a
class
occurs 1 3 6 3 1
f 1 – f 0.
X = L + ——————— x i
2 f1 – f0 – f2
__ 15 – 6
X = 300 + ———————— x 100
2 x 15 – 6 – 10
9 x 100
= 300 + 14
= 300 +
= 300 + 64.29
= 364.29 (approximately)
Measures of Dispersion
In social work research, we often wish to know the
extent of homogeneity and heterogeneity among
respondents with respect to a given characteristic.
Any set of social data is values, which are
heterogeneous. The set of social data is characterized
by the heterogeneity of values. In fact, the extent
to which they are heterogeneous or vary among
themselves is of basic importance in statistics.
Measures of central tendency describe one important
characteristic of a set of data typically but they do
not tell us anything about this other basic
characteristic. Consequently, we need ways of
measuring heterogeneity – the extent to which data
are dispersed and the measures, which provide this
description, are called measures of dispersion or
variability.
Processing and Analysis of Data 107
Range
The range is defined as the difference between the
highest and lowest values. Mathematically,
R(Range) = mh – m L
Where mh and mL stand for the highest and the lowest
value. Thus, for the data set; 10, 22, 20, 14 and
14 the range would be the difference between 22
and 10, i.e., 12. In case of grouped data, we take
the range as the difference between the midpoints
of the extreme classes. Thus, if the midpoint of the
lowest interval is 150 and that of the highest is
850 the range will be 700.
Semi-Inter-Quartile Range or Quartile Deviation
Another measure of dispersion is the semi-inter-
quartile range, commonly known as quartile deviation.
Quartiles are the points, which divide the array or
series of values into four equal parts, each of which
contains 25 per cent of the items in the distribution.
The quartiles are then the highest values in each
of these four parts. Inter-quartile range is the
difference between the values of first and the third
quartiles.
Thus, where Q1 and Q3 stand for first and the third
quartiles, the semi-interquartile range or quartile
deviation.
Q3 – Q 1
Q.D. = —————
2
100 – 200 5
200 – 300 6
300 – 400 15
400 – 500 10
500 – 600 5
600 – 700 4
700 – 800 3
800 – 900 2
Total N = 50
Table
S.No. Weekly Number Cumulative
(1) Income of Frequency
(in Rs.) Families (CF)
(2) (3) (4)
1 100 – 200 5 5
2 200 – 300 6 11 = c
3 Q1 - 300 – 400 15 = f 26
4 400 – 500 10 36 = c
5 Q3 - 500 – 600 5 = f 41
6 600 – 700 4 45
7 700 – 800 3 48
8 800 – 900 2 50
Total N = 50
Processing and Analysis of Data 109
– C
Q1 = 1 1 + ————— (I)
F
12.5 – 11
= 300 + ——————— x 100
15
1.5
= 300 + ——— x 100
15
= 300 + 10
= 310
– C
Q3 = L 1 + —————— (I)
5
37.5 – 36
= 500 + —————— x 100
5
1.5
= 500 + ——— x 100
5
= 530
= Q3 – Q1
= 530 – 310 = 220
Q3 – Q 1
QD = ——————
2
220
= ———
2
= 110
Quartile Deviation is an absolute measure of dispersion.
If quartile deviation is to be used for comparing the
dispersion of series it is necessary to convert the
absolute measure to a coefficient of quartile deviation.
.Q3 – Q1
—————
2 Q3 – Q1
Symbolically, coefficient of Q.D. = ———— = —————
Q3 + Q1 Q3 + Q1
Q3 + Q1
—————
2
Illustration:
Table : Weekly Family Income (in Rs.)
Weekly Income (in Rs.) Number of Families
100 – 200 5
200 – 300 6
300 – 400 15
400 – 500 10
500 – 600 5
600 – 700 4
700 – 800 3
800 – 900 2
Total N = 50
Solution :
Standard Deviation
The most useful and frequently used measure of
dispersion is standard deviation or root-mean square
Table : Weekly Family Income (in Rs.)
Weekly Mid No. of Cumulative Deviation
Income Value families frequency from ‘f | d |
(f) median
400 | d |
N = 50 7400
112 Social Work Intervention with Communities and Institutions
50
——— – 10
2
= 300 + ————— x 100
15
25 – 10
= 300 + ————— x 100
15
15
= 300 + —— x 100
15
= 300 + (1 x 100)
= 300 + 100 = 400
d2
∑ = ———
N
∑ fd2 ∑f d2
∑ = ———— – ———— x t
N N
1 100 – 200 5
2 200 – 300 6
3 300 – 400 15
4 400 – 500 10
5 500 – 600 5
6 600 – 700 4
7 700 – 800 3
8 800 – 900 2
N = 50
Table
Table
2 Assume a mid-points as
average, preferably at the 450 = assumed average
centre
Conclusion
Knowledge of statistics helps the social worker in
two ways. First, the knowledge allows the social
worker to be able to analyse the data and draw
inferences. Second, as a consumer of researches
it enables him/her to understand the analysis of
data used in research reports.
Ungrouped data are not capable of being rapidly or
easily interpreted. In order that data may be more
readily comprehensible data can be grouped.
Mean, median and mode are the three measures
of central tendency. Mean is the arithmetic average
116 Social Work Intervention with Communities and Institutions
References
Elhance, D.N. (1984) Fundamental of Statistics, Kitab
Mahal, Allahabad.
Freud, J.E. (1977), Modern Elementary Statistics, Prentice
Hall, New Delhi.
Gupta, S.P. (1980), Statistical Methods, S.Chand, New
Delhi.
Krishef, Curtis H. (1987), Fundamental Statistics for
Human Services and Social Work, Duxbury Press, Boston.
Lal Das, D.K. (2000), Practice of Social Research
: A Social Work Perspective, Rawat Publications,
Jaipur.
Philip, A.E. et. al. (1975), Social Work Research, and
The Analysis of Social Data, Peragon Press, Oxford.
Sanders, D.H. et. al. (1975), Statistics A Fresh Approach,
Mc Graw Hill, New Delhi.