Professional Documents
Culture Documents
SAMPLING STATISTICS 11
III recent years the problem of recordillg alld handling great maSSflS of daLa
has heen wet, by the development of tlw IJllllCh card, 011 which can be recorded
by maciJiIll' allY type of information that can be coded by a nUltlber SYSLClll,
as well as original data which arc llllIlH·rical. To haIl(lh~ the eocl(~d data of
such cartis, Hnrling machilles and variolls kinds of tabulalors have been
dcvciolwcl (cf. Chapler 2, Sectioll C).
A schcduk fm a child's scho(lllw'ord if; gClwrally usd'ul ill (wo ways. (I) h
is valuahl(~ for Ihe Leacher, pKyeholugisL, 81e., who worke; willi l.lte individual
pupil and all('mpls 10 iroll oul, prohlelllS of IhaL child's adjllsillwilt Lo the
school 01' (lUll'!' situatiolls. This is a 1I001-statislical, illdividllal IISC of sllch
inforlllalioll. (2) ft is valuable for staLit-:I.ical purposc's, which fl}pallS l.lmt it
is IIseflll as olle ease ill hUlldreds 0[' t bOllsallds of slIeil records which, wIlen
eonsideT'('d Pll tIlass(~ 01' accurding to relcvallt groupings, lIHlY provide vaillable
illfoI'1I1,a('iol) ill l.Iw plallJlillg', Ii lIulIci) Ig', alld IllanagclllL'lI L of an educatiollul
sYRLeflJ. WII('II the purpose is statistical, a given child's schedule is more
appropriately ~igllifil·(l hy a COllvl~lIietJ L 1I11mber rat.her thall by hie; nalll(\
since the illyt'RtigaLol" is 110 111l1gL'r dl'alillg with tll~' illdividual child hut
with olle cast' ill a grollP or Illass of sta(.isLieal illfol'tllal.ioll. Such results w,
arc ohtaiIli'cl ill th(~ stalisLknl trealJlWllt of Lhe data will apply lo Lhe grollp
as a \vhole ralhe!' llmll to nil illdividual CH:oe. .
It shuuld he apparent thaL it is illlPossible adcquatdy to illterpret, III!!
l'(~cords of IJIIlldt'l·rls or I.hOU:'JandR of SII('h s!']I('dllles of information unl(·ss 1.I1(~
data arc t-iOIlIl'llIlW clas:,:ifil'd and Slllllltltuizl·d. An iuvestigator or l"(~searclt
work(~r is thus fU!:ml with the very pra('tical problem of n~c1I1('illg a great hulk
of data to a form t.lrat will be more rnadily perceived alld IlIIdefstoou. The
procedures to be adupted for such classificatioll and suwmarizatioll depelld
specifically upon the purposes of the investigation or inquiry. III general,
however, the statistical procedures that. may he used iJlclude ollly a few
alternatives; they will be deseribed ill detail ill the followillg chapLers 011
methods of dl'sCT'ipiire statistics. Here it is elllphasized lhaL descripl.iv() 8Lath;~
tical procedures serve a need whieh arisuH as soon as the obs(_~rvaLiolls or data
of any survey 01' inqlliry become at all sizable or bulky. For, as n. A. Fb]wJ'
says, "No human mind is capable of graspillg in the entirety the mnauillg'
of ally considerable quantity of numerical data." ,I: The statistical methods
used for the reductioll of data are of thrcc kinds:
1. Graphic methods
2. Computational methods yieldillg IlllIlwrieal measures
3. Tabular methods
The aim, theIl, of descriptive statist.ics is the reduction of data so that
the results of observation and measurement may he (1) made more inlllwdi~
atcly meaningful, and (2) presented in a form that will make interpretation
* R. A. Fisher, Statistical Me/hods .for Research Worker.~, Oliver & Boyd, Landau, 7th ccL,
]9~8, p, 6. .
THE NATURE OF STATISTICAL DATA 13
alld comparisou of resulLs (~usy and unumbiguOUf-l. In the light of the preced-
ing discussioll, descriptive statistics can now be ddincd as the orgalli:tatioll
amI sUIlllllarizaLion of collectiulls of lIl111wricul elata, inclndirlg dala arrivt"ri
aL by the simple method of cllllItlcratillg iusLUllCl'S. D('seriptive statisticH
cOllsists ill Llw reduclioll of groups or ma::;ses of daLa by means of tabIcs,
graphs, and numerical nwaSUl't:s such as percentages or proportions, averages,
1l1t'ai:HU'CS of deviation or dispersion, coeHicients of correlatioll, de.
ThaI. tlw methods of d(~seript.ivll sLatistics are essential to the methods of
analytical or salllpling statistics is appan'llt. Whelllt'r the data arc of a censlls
01' of a sumph', tlll' fin,!, sl.t~p ill their treatment comlists ill their appropriate
reduction or simplifkation.
Non-Variable Data
The incidence of the two sexes in a population provides -a common example
of non-variable data. People are either male or female. Scx is a non-variable
attribute. A person can he categorized as either one or the olher. FurtJwr-
more, no order is inherent in the arrangement of \Jwse two categories; that.
is, there is lIO basis in measurement for putting the male class first and the
female class second, or vice versa. A non-variable attribute is thus one that
exists with respect to distinct categories rather than wit.h respect to a par-
ticular degl:ee.
Non-variable data are oftOll rderred to as the data of categories. Categorical
data arc generally ohtained simply by the eIlumeration of instances that
occur, or that arc observed to exist, with respect to the classes or categories
under consideration. .
Variable Data
In conLrm;t to catl~gOl'ical data, variablu data represent quantitaLive differ-
ences (variation) ill the manifeHtaticHl of a property ur LraiL or attribute. Thus,
tbe age and height of persons are examples of atLributes thuL are vat'iablt's.
14 I!'-ITRODUCTION TO STATISTICS
have been devised for their treaLment and analysis.* These methods are
especially relevallt La rnatly market research investigations, as well as to
studies in social psychology and sociology. In Chapters 2-4, we shall present
the basin statistical nwtlJOds for 11on-variables as developed for problems of
descriptive statistics, and ill Chapters 5-9, the fundamental methods that
have been developed for the descriptive treatment of variables. However,
the distinction between these two sets of methods is not always sharp. The
data of variables are sometimes treated by methods developed for categorical
data. For example, ill order to determine whether a particular aptitude test
is satisfactory, the criterion therefor may be taken simply in terms of .mccessful
and nOThmcceo4tzl performance. Obviously, performance is itself a variable
attribute. However, we often lack satisfactOl'y methods for quantitatively
dif1'eren tiating degrees of success or non-success and we obtain, at best, broad
nOll-quantitative distinctions or differentiations of such attributes.
r.c.
, * G. V. Ylile and IVI. G. Kendall, An Introduclion 10 the Theory of SlatisUcs, Griffin,
London, 12th ccl., 1910, ChClJlH.I-G.
16 INTRODUCTION TO STATISTICS
EXERCISES
1. In what sense is statistics a form of appli.ed mathematics?
2. What are the implications of Quetelet's work for the development of descriptive
and sampling statistics? .
3. State tne different ways in which the concept "statistics" is employed.·
18 INTRODUCTION TO STATISTICS
A. INTRODUCTION
In this chapter we shall present some of the elementary but at the same time
indispensable statistical methods for the treatment of the non-variate type
of data often obtained in psychology, anthropology, sociology, and related
fields. The dala of non-variable attributes, of categories, their collection and
statistical treatment are of basic importance to the research worker, even
though amajority of research problems yield variate data.
From the point of view of the practical problems of research the initial
task to be dealt with is classificaLion, or division, of large masses of llon-
variat.e data. The logic of classification and division is essential to a sound
use of methods for their reduction and comparison. Just as the psychologist
and related scientists need to know the logic of measurement underlying the
treatment of the data of variables, so they also need to know how Lo handle
masses of non-variat.e data which first need to be classified and the results
then described through the use of appropriate statistical techniques.
We shall consider first the problem of classification. Then methods for the
reduction of snch data to a useful form will be presented. Basically, these
methods are simply tabulation and enumeration. Methods for the comparison
of such data will be developed in Chapter 3. These methods consist chiefly
ill the calculation of ratios or rates, such as percentages. Finally, in Chapter 4
we shall present methods for the correlation of categorical data.
Polylomous Classification
Eye color is again an attribute that can be divided into two categories, thc
BLUE-EYED and the NOT-BLUg-EYED. This time, however, the dichotomy it-
self is arbitrary. Eye color is an attribute which may, for research purposes,
be more usefully differentiated into more than two categories. In fact, so far
as it can be correlated with variations in degree of pigmentation, human eye
color may be considered as a variable attribute. But at the present time there
are no entirely satisfactory empirical methods for dealing quantitatively with
this attribute. The usual method for field and la.boratory purposes in psy-
chology and anthropology consists in using a sct of artificial eyes differing
in pigment.ation. By a matching technique (a person's eye color being com-
pared with the colors and shades of the artificial eyes until Lhe best match
is obta.ined), the color and lightness of an individual's eyes are identified