Professional Documents
Culture Documents
In this module, we begin our discussion on descriptive statistics. In particular, we will discuss
the different types of variables and classify data according to its level of measurement.
TYPES OF VARIABLES
A continuous quantitative variable, on the other hand, can take on an infinite number of
possible values because its values may not be exact. Some examples of this variable are
weights of produced items, the land area of a certain property, and the time it took for a
machine to produce an item.
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 7
8
TYPES OF DATA
Data are raw materials of statistical investigations. They arise when measurements are
made and/or observations are recorded. If you are involved in statistics, marketing or data
science, it is essential to know the difference between qualitative and quantitative data
and analysis.
Data can be categorized as qualitative and quantitative. Quantitative data sets consist of
measures that take numerical values for which descriptions such as means and standard
deviations are meaningful. Quantitative data is the set of observations collected from
quantitative or numerical variables. Quantitative data are easily amenable to statistical
manipulations and can be represented with a wide variety of statistical types of
graphs and charts such as line graph, bar graph, scatter plot, box and whisker plot and
more.
Just like quantitative variables, there are two general types of quantitative data: discrete
and continuous. Discrete quantitative data can take on a count that involves integers. Only
a limited number of values are possible and the discrete values cannot be subdivided into
parts. For example, the number of children in a school is discrete data. You can count
whole individuals. You can’t count 1.5 kids. Continuous quantitative data are data that can
be meaningfully divided into finer levels. It can be measured on a scale or continuum and
can have almost any numeric value. For example, you can measure your height at very
precise scales — meters, centimeters, millimeters and etc.
As you might guess, qualitative data are information that can’t be expressed as a number
and can’t be measured. Qualitative data, such as eye color of a group of individuals, is not
computable by arithmetic relations. They are labels that advise in which category or class
an individual, object, or process falls. Qualitative data is the set of observations collected
from qualitative or categorical variables. Qualitative data consist of words, pictures,
observations, and symbols, not numbers. It is about qualities. Qualitative data is also
called categorical data. The reason is that the information can be sorted by category, not
by number. Qualitative data is analyzed to look for common themes.
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 8
9
According to source, data can also be categorized as primary and secondary. Primary
data refer to the information which are gathered directly from an original source or which
are based on direct or first-hand experience. Secondary data refer to the information
taken from published or unpublished materials that have been previously gathered by
other individuals or agencies. Published data and the data collected in the past are
considered secondary data.
On the other hand, secondary data are those which have already been collected by
someone, may be sorted, tabulated and has undergone a statistical treatment. Secondary
data may be available from the following sources:
Government organizations or offices like the Philippine Statistics Authority and National
Economic and Development Authority
Commercial and financial institutions like banks
Research organizations or companies
Research journals and newspapers
Internet
It is also important to distinguish between cross-sectional data and time series data for
purposes of statistical analysis. Data which is gathered at the same or approximately the
same point in time is called Cross-Sectional Data while data collected over several periods
of time (like the quarterly inflation rate of the Philippines) is referred to as Time Series Data.
Inferential analysis for these two types of data differ and for this course, we would be
dealing with the analysis of cross-sectional data.
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 9
10
SCALES OF MEASUREMENT
In order for the effective use of statistics, it would be helpful to view data in a different way
in order for it to be analyzed successfully. We do this by considering the level of scale by
which data was measures. The following are the four scales/levels of measurement of data,
presented from the lowest to the highest level:
Nominal. Nominal scales are used simply for labelling cases or items based on the
presence or absence of some attribute. Data obtained from a variable measured at the
nominal level can only be categorized but cannot be ranked or arranged in any order.
Furthermore, numbers can be obtained as data, however, they do not denote quantity;
they are only used as labels. Examples of nominal data are: hair color (black, blonde,
brown, brunette, etc.) and gender (male, female).
Ordinal. It is the simplest scale that arranges or assigns to order people, items, objects, or
events along some continuum. The name of this scale is derived from the use of ordinal
numbers for ranking. The numbers are used only to place the items in order. Moreover,
data obtained in the ordinal level can be used to categorize or classify the items under
study. Examples of ordinal data are: place of a person in a competition (first, second,
third), preference (most preferred, next preferred, least preferred), and quality (poor, fair,
good, very good, outstanding).
Interval. The intervals between each data value are the same. A popular example here
is the temperature in centigrade, where, for instance, the interval between 93oC and
95oC is the same as the distance between 1060C and 1080C. However, the starting point
in the interval scale is arbitrary which means that there is no fixed zero point or point of
origin in the measurement scale. With this, the value zero in the data set does not reflect
the absence of an attribute (e.g. 0°C)
Ratio. This scale does not only depict order and have equal intervals, but it also has the
value of zero as the fixed point of origin. In contrast to the interval scale, meaningful
ratios between values can be established in this scale of measurement. Examples are:
weight, length, and number of customers.
Identifying the levels of measurement where a data set falls under will help you decide
whether or not the data are useful in making calculations. The scales of measurement are
very important because they determine the types of data analysis that can be performed.
Perhaps you have seen that the nominal and ordinal scales are used for qualitative data
while the interval and ratio measurement scales are much more exact and are used for
quantitative data analysis.
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 10
11
Work on the following problems using short bond papers with 1-inch margin on all sides.
You may not copy the problems. Your answers can be handwritten be computerized. If it is
computerized, save it as a single PDF file; if it is handwritten, scan or take a
photo/picture of each page, copy and paste the photo/picture to a WORD document
and save it as a single PDF file (or you can use any means, like a mobile App, to scan
and save it as a single PDF file). The Filename should be:
<SURNAMEFirstName_ReinforcementNo.> For example, BALLENAJaime_Reinforcement2
will be my output for Learning Reinforcement 2.
Property of and for the exclusive use of SLU. Reproduction, storing in a retrieval system, distributing, uploading or posting online, or transmitting in any form or by any
means, electronic, mechanical, photocopying, recording, or otherwise of any part of this document, without the prior written permission of SLU, is strictly prohibited. 11