Table of Contents
UNIT 1: WHAT IS RESEARCH? ............................................................................ 6
Definition of Research: What is research?................................................................. 7
Types of research ....................................................................................................... 9
Scope of research ....................................................................................................... 9
Objectives of research .............................................................................................. 10
The importance of knowing how to conduct research ............................................. 10
Qualities of a researcher........................................................................................... 10
UNIT 2: STATEMENT OF A RESEARCH PROBLEM ...................................... 11
What is a research problem? .................................................................................... 11
What is a problem statement? .................................................................................. 12
Identification of the research problem ..................................................................... 13
Research Questions .................................................................................................. 16
Types of research questions: .................................................................................... 17
Hypotheses ................................................................................................................... 18
Null hypothesis ........................................................................................................ 19
Alternative hypothesis ............................................................................................. 19
Constructs and variables .............................................................................................. 20
Types of variables .................................................................................................... 21
Operational definition and measurement of variables ............................................. 22
Scales of Measurement ............................................................................................ 27
Internal and external validity ................................................................................... 27
What is validity? .......................................................................................................... 27
UNIT 3: LITERATURE REVIEW ........................................................................... 31
What is a literature review? ..................................................................................... 31
What is a literature review not? ............................................................................... 31
Conducting a literature review ................................................................................. 32
UNIT 4: RESEARCH METHODOLOGY AND RESEARCH DESIGN .............. 34
What is research methodology? ............................................................................... 34
Research Methods vs. Methodology ........................................................................ 34
Definition: What is research design? ....................................................................... 34
Research design ....................................................................................................... 34
Research Design: ..................................................................................................... 35
Quantitative research designs .................................................................................. 35
Qualitative research design ...................................................................................... 43
UNIT 5: DATA COLLECTION METHODS ............................................................ 49
Quantitative Research .............................................................................................. 49
Qualitative Research ................................................................................................ 58
UNIT 6: SAMPLING METHODS ............................................................................ 65
What is sampling? .................................................................................................... 65
Sampling Methods ................................................................................................... 66
UNIT 7: DATA ANALYSIS ...................................................................................... 71
Handling and analysing qualitative research data .................................................... 71
The ten steps of content analysis ............................................................................. 77
Presenting Qualitative Research .............................................................................. 78
Handling and analysing Quantitative data ............................................................... 79
Computerised data analysis...................................................................................... 79
Welcome to SDS 2414: RESEARCH METHODS IN SOCIAL
SCIENCES
The main aim of this module is to enable students to conduct and critically evaluate
social research. It will do this in two ways. Firstly, it will explore the different
philosophical and methodological debates within the social sciences. Secondly, the course
will develop students’ practical skills in a selection of these research methods.
_____________________________________________________________________
This course is intended for people who are pursuing Bachelor of Arts Degrees at the
University of Zambia or related fields of study.
_____________________________________________________________________
Timeframe
How long?
Time allocated for this course is 3 hours of lectures and one hour of
tutorial per week.
Recommended self-study time is four hours per week.
______________________________________________________
Study skills
As an adult learner your approach to learning will be different from
that of your school days: you will choose what you want to
study, you will have professional and/or personal motivation for
doing so and you will most likely be fitting your study activities
around other professional or domestic responsibilities.
Your most significant considerations will be time and space i.e. the
time you dedicate to your learning and the environment in which
you engage in that learning.
We recommend that you take time now—before starting your self-
study—to familiarize yourself with these issues. There are a
number of excellent resources on the web. A few suggested links
are:
http://www.how-to-study.com/
The “How to study” web site is dedicated to study skills resources.
You will find links to study preparation (a list of nine essentials for a
good study place), taking notes, strategies for reading text books,
using reference sources, test anxiety.
http://www.ucc.vt.edu/stdysk/stdyhlp.html
This is the web site of the Virginia Tech, Division of Student Affairs.
You will find links to time scheduling (including a “where does time
go?” link), a study skill checklist, basic concentration techniques,
control of the study environment, note taking, how to read essays for
analysis, memory skills (“remembering”).
http://www.howtostudy.org/resources.php
Another “How to study” web site with useful links to time
management, efficient reading, questioning/listening/observing skills,
getting the most out of doing (“hands-on” learning), memory building,
tips for staying motivated, developing a learning plan.
_____________________________________________________________________
Need help?
University of Zambia
Institute of Distance Education
Great East Road Campus
P.O Box 32379
Lusaka
Zambia
E-mail: eliphas.machacha@unza.zm
Website: www.unza.zm
_____________________________________________________________________
UNIT 1: WHAT IS RESEARCH?
LEARNING OUTCOMES
When you have completed this unit you will be able to:
According to Hudson Maxim (1853–1927), “All progress is born of inquiry. Doubt
is often better than overconfidence, for it leads to inquiry, and inquiry leads to
invention.”
Introduction
Research is the cornerstone of any science, including both the hard sciences such as
chemistry and physics and the social sciences such as economics, political science,
psychology, public administration, sociology, management, or education. It refers to
the organized, structured, and purposeful attempt to gain knowledge about a suspected
relationship.
Many argue that the structured attempt at gaining knowledge dates back to Aristotle
and his identification of deductive reasoning. Deductive reasoning refers to a
structured approach utilizing an accepted premise (known as a major premise), a
related minor premise, and an obvious conclusion. This way of gaining knowledge
has been called a syllogism, and by following downward from the general to the
specific, knowledge can be gained about a particular relationship. An example of an
Aristotelian syllogism might be:
Specific Premises: John, Sally, Lenny and Sue attended class regularly
Specific Premises: John, Sally, Lenny, and Sue received high grades
Researchers combine the powers of deductive and inductive reasoning into what is
referred to now as the scientific method. It involves the determination of a major
premise (called a theory or a hypothesis) and then the analysis of the specific
examples (research) that would logically follow. The results might look something
like:
Class Attendance (Suspected Cause):
    Group 1: John, Sally, Lenny, and Sue attend classes regularly
    Group 2: Heather, Lucinda, Ling, and Bob do not attend classes regularly
Grades (Suspected Effect):
    Group 1: John, Sally, Lenny, and Sue received A’s and B’s
    Group 2: Heather, Lucinda, Ling, and Bob received C’s and D’s
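The comparison above can be sketched in a few lines of Python. This is only an illustration: the individual letter grades and the 4-point grade scale are assumptions, since the text says only that Group 1 received A's and B's and Group 2 received C's and D's.

```python
from statistics import mean

# Assumed 4-point scale (not from the module); used only to compare groups.
GRADE_POINTS = {"A": 4, "B": 3, "C": 2, "D": 1}

# Hypothetical individual grades consistent with the description above.
regular_attenders = {"John": "A", "Sally": "B", "Lenny": "A", "Sue": "B"}
irregular_attenders = {"Heather": "C", "Lucinda": "D", "Ling": "C", "Bob": "D"}


def mean_grade_points(grades):
    """Average grade points for a group of students."""
    return mean(GRADE_POINTS[grade] for grade in grades.values())


group1_mean = mean_grade_points(regular_attenders)
group2_mean = mean_grade_points(irregular_attenders)
print(f"Group 1 (attends regularly): {group1_mean}")
print(f"Group 2 (does not attend regularly): {group2_mean}")
```

A higher mean for Group 1 is consistent with the suspected cause, but with so few observations and no control of other factors it does not by itself show that attendance causes high grades.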
Definition box:
Definitions of research are legion, but the following can be employed to embrace
most projects which will involve student researchers
Research is:
(i) A systematic investigation into and study of materials, sources, etc., in order
to establish facts and reach new conclusions
(ii) An endeavour to discover new or collate old facts, events or issues by the
scientific study of a subject or through a course of critical investigation
(iii) A process of gathering data in a strictly organised manner. The end-product
of the data-gathering process may vary along a continuum from simple
description to reflection and interpretation. The emphasis is on structured
investigation, exploration or discovery
(iv) A process of testing a stated idea or assertion (the hypothesis) to see if the
evidence supports it or not
(v) A process of engaging in a planned or unplanned interaction with parts of the
real world, and reporting on what happens and what it seems to mean
Before we do research, we rarely see things as they are. We see them as we are. Then,
in the research process, a sort of waltz begins. Subjectivity leads, objectivity follows.
When the dance is finished, we see things more accurately. As many advances in
social science thinking show, subjective experiences often enhance objective
knowledge in social sciences, leading to the discovery of new problems and new
solutions to old problems. Acknowledging that our experiences inspire us to ask
particular questions about the social world is not the same as saying that those
questions, or the answers we eventually uncover, are biased. Bias arises only when we
remain unaware of our subjectivity. It is the purpose of research to help us become
aware of our biases and to test theories against systematic observations of the social
world that other researchers can repeat to check up on us. On the basis of research, we
reject some theories, modify others, and are forced to invent new ones. Having
provided a definition and brief explanation of research, it is now time to discuss the
research process and cycle in detail.
Ideally, social science research is a cyclical process that involves six steps.
(i) Formulating a research problem. A research problem must be stated so that it
can be answered by systematically collecting and analysing research data.
It should be borne in mind that there are certain questions
or issues which social science research cannot settle. For
example, social science research cannot determine whether God exists or
what the best political system is. Answers to such questions require faith
more than evidence. However, social science research can determine why
some people are more religious than others and which political system
creates most opportunities for higher education. Answers to such questions
require evidence more than faith.
(ii) Reviewing the existing research literature. Researchers must elaborate their
research problems in the clear light of what other social scientists have
already debated and discovered. Why? Because reading the relevant
literature stimulates researchers’ imaginations, allows them to refine their
initial questions, and prevents duplication of effort.
(iii) Selecting a research method. As we will see in detail later in this chapter, each
data collection method has strengths and weaknesses. Each method is
therefore best suited to studying a different kind of problem. When
choosing a method, one must keep these strengths and weaknesses in
mind.
(iv) Collecting data. Researchers collect data by observing subjects, interviewing
them, reading documents produced by or about them, and so forth. Many researchers think this is the
most exciting stage of the research cycle because it brings them face to
face with the puzzling social reality that so fascinates them.
(v) Analysing the data. This stage is the most challenging. During data analysis you can learn
things that nobody knew before. At this stage, data confirm some of your
expectations and confound others, requiring you to think creatively about
familiar issues, reconsider the relevant theoretical and research literature,
and abandon pet ideas.
(vi) Publishing the results. Research is not useful for the social science
community, the subjects of the research, or the wider society if researchers
do not complete this sixth step of publishing results in a report, a scientific
journal, or a book. Publication serves another important function, too.
It allows other social scientists to scrutinize and criticise the research. On that basis,
errors can be corrected and new and more sophisticated research questions can be
formulated for the next round of research. Science is a social activity governed by
rules defined and enforced by the scientific community.
Types of research
Scope of research
Does the research cover a particular research objective?
Does the researcher cover a particular time period?
Does the study cover a specific geographical area?
If the study involves people, what age group, gender and places of origin are
to be included?
Are all dates of publication to be included?
Is the research going to cover publications from other countries?
Will the researcher include other languages and scripts? (Language of
research)
Are all perspectives to be considered? For example, philosophical, political,
sociological, economic, psychological etc.?
Scope of research is all about marking boundaries of the study in order to make it
manageable.
Objectives of research
To gain familiarity with a phenomenon or to achieve new insights into it
To accurately portray the characteristics of a particular individual, group or a
situation
To analyse the frequency with which something occurs
To test a hypothesis of a causal relationship between two variables
Qualities of a researcher
Desire for accuracy of observation and precision of statement
An alert mind
Must practice “The art of enduring intellectual hardships”
Making statements cautiously
UNIT 2: STATEMENT OF A RESEARCH PROBLEM
LEARNING OUTCOMES
When you have completed this module you will be able to:
Identify a research problem
List the criteria of a good research problem
Explain the components of a research problem
Design a study to test selected hypotheses
Explain the different types of variables
When asked, some students do not even know the meaning of a "research
problem". This is understandable given the numerous definitions of the term,
which can further confuse the beginning researcher. Some
supervisors fail to appreciate that, for many students, this is the first time they are
conducting research. Learning the intricacies of research is a long and winding
process. To make matters worse, the most difficult phase of the research process is the
identification of the research problem.
Identification of the research problem is the MOST IMPORTANT step of the
research process. Not only must you be clear about the research problem, you must
also have a passion for it! Let us see whether you will be able to explain your research
problem clearly as well as be passionate about it, after having completed this module.
Definition box:
What is a problem statement?
Definition box:
A problem statement is the description of an issue currently existing which needs to
be addressed. It provides the context for the research study and generates the
questions which the research aims to answer.
The statement of the problem is the focal point of any research. A good problem
statement is just one sentence (with several paragraphs of elaboration). For example, it
could be:
"The frequency of job layoffs is creating fear, anxiety, and a loss of productivity in
middle management workers."
While this problem statement is just one sentence, it should be accompanied by a few
paragraphs that elaborate on the problem. The paragraphs could present
persuasive arguments that make the problem important enough to study. They could
include the opinions of others (politicians, futurists, other professionals), explanations
of how the problem relates to business, social or political trends, and
data that demonstrate the scope and depth of the problem.
A well-articulated statement of the problem establishes the foundation for everything
to follow in the proposal and will render less problematic most of the conceptual,
theoretical and methodological obstacles typically encountered during the process of
proposal development. This means that, in subsequent sections of the proposal, there
should be no surprises, such as categories, questions, variables or data sources that
come out of nowhere: if it can't be found in the problem section, at least at the implicit
level, then it either does not belong in the study or the problem statement needs to be
re-written.
1. The problem itself, stated clearly and with enough contextual detail to establish
why it is important
2. The method of solving the problem, often stated as a claim or a working thesis
3. The purpose, statement of objective and scope of the project being proposed.
These elements should be brief so that the reader does not get lost. One page is
enough for a problem statement.
Hence, a "research problem" is something that bothers you and needs to be resolved
by research. It marks the beginning of the research process, which ends with the solution to
the problem. So the next time you are asked what your research problem is, would
you be able to state it orally or put it in writing?
But I don't have a research problem! Not to worry as there are several sources of
research problems:
The research question is formulated and then restated in the form of a statement that
notes the adverse consequences of the problem. The type of study determines the
kinds of question you should formulate: Is there something wrong in society,
theoretically unclear or in dispute, or historically worth studying? Is there a program,
drug, project, or product that needs evaluation? What do you intend to create or
produce and how will it be of value to you and society?
In a nutshell, sources of a research problem may include the following:
BROAD AREA: You start with a broad area. For example, you are concerned by the poor critical
thinking skills of university graduates.
- In your proposal the statement of the problem is oftentimes the first part to
be read with scrutiny. I am ignoring the title and the abstract because
ideally a title should be born out of a problem statement and an abstract
should be a summary after the problem has already been dealt with. The
problem statement should, therefore, "hook" the reader and establish a
persuasive context for what follows.
- You need to be able to clearly answer the questions "What is the problem?"
and "Why is this problem worth my attention?" At the same time, the
problem statement limits scope by focusing on some variables and not
others. It also provides an opportunity for you to demonstrate why these
variables are important.
- When you set out to write a problem statement you should know that you
are looking for something wrong… or something that needs close
attention. Your problem statement is the statement that makes a point
about the issues and information you are discussing, and is what the rest of
the proposal hinges upon. It is not just your topic, but what you are saying
about your topic. In other words, there must be a close fit
between your topic and the problem statement.
- The importance of the problem should receive considerable and persuasive
attention [note that importance is inevitably subjective and will vary from
researcher to researcher]. Nevertheless objectivity can be injected by
answering questions such as these:
The problem statement should persuasively indicate that major variables can be
measured in some meaningful way. If you can identify likely objections to the study,
identify and respond to them here.
The problem statement could close with a question. Typically, the question could
contain two variables, a measurable relationship, and some indication of population.
The purpose of the literature review that follows is to try to answer the research
problem question. If the literature cannot answer the question, then research is needed
to do so. An example question might be: "What is
the relationship between farm productivity and farmer use of fertilizer?" The
information needed is (1) productivity levels and (2) some measure of fertilizer use. A
bad example might be: "What is the best way to train for use of fertilizer?" This is
insufficient because:
There should be a close relationship between the title of the proposal and the problem
statement question. For example, in the good example above, the title of this research
project would be something like this:
"Fertilizer use by small scale farmers in Bungoma district and their farm
productivity"
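One way to quantify the relationship named in a question like the good example is a correlation coefficient. The sketch below computes Pearson's r for a handful of farms; the figures, units, and variable names are hypothetical and are not taken from any study.

```python
from math import sqrt

# Hypothetical data for five farms (invented for illustration).
fertilizer_kg_per_acre = [0, 10, 20, 30, 40]
maize_bags_per_acre = [5, 8, 9, 13, 15]


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    covariation = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    variation_x = sum((a - mean_x) ** 2 for a in x)
    variation_y = sum((b - mean_y) ** 2 for b in y)
    return covariation / sqrt(variation_x * variation_y)


r = pearson_r(fertilizer_kg_per_acre, maize_bags_per_acre)
print(f"r = {r:.2f}")
```

A value of r close to +1 would indicate that farms using more fertilizer tend to report higher productivity. The question is answerable because both variables can be measured; the bad example offers no such measurable pair.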
Research Questions
As this module has already stated, the word "research" means 'finding out' or
'discovery' using a systematic method. You "research" by asking questions and
searching for answers to the questions. You cannot "research" if you do not want to
know anything, that is, you must have something you would like to know more about
before you can do "research".
You begin with QUESTIONS. If you have none, you will find no answers or will not
know when you have found one. Your task is to conduct RESEARCH. A study
without a question in mind will NOT be a RESEARCH study. You should
MAKE SURE that:
a) The Research Question is clear, straightforward and easily understood by
others
b) The Research Question states the relationship between two or more
variables.
c) The variables mentioned in the Research Question should be measurable.
d) The answer to the Research Question is not immediately obvious.
e) The Research Question indicates the method that is to be adopted, i.e.
the data collection techniques
f) The Research Question can be answered in the time available to you.
g) The Research Question can be answered with the resources available to
you.
Types of research questions:
Generally there are three basic types of questions that research projects can address:
These statistical tools will be covered in the second module: SS 242 (Statistical
Methods in the Social Sciences).
Hypotheses: What is a hypothesis?
Definition box:
Goal of a hypothesis
Regardless of the type of hypothesis, the goal of a hypothesis is to explain the focus
and direction of the research. As such, a hypothesis will:
• State the purpose of the research
• Identify what variables are used
• Give a plausible explanation that will be tested. It can also
explain future phenomena that will need to be tested
• Once a hypothesis has been tested and widely accepted, it is called a law. This
means that it is assumed to be true and will predict the outcome of certain
conditions or experiments.
• A theory is broader in scope and explains more events than a law. After
hypotheses and laws have been tested many times, with accurate results, they
become theories.
Types of hypotheses
• Null hypotheses (there is no relationship between two variables).
• Alternative hypotheses (there is a relationship between two variables).
• Non-directional hypotheses (we don’t know or won’t speculate about the
direction of the relationship between two variables).
• Directional hypotheses (we state the direction of the relationship between two
variables).
NULL HYPOTHESIS
The null hypothesis is a hypothesis (or hunch) about the population. It represents a
theory that has been put forward because it is believed to be true. The word "null"
means nothing or zero. So, a null hypothesis states that 'nothing happened'. For
example, there is no difference between males and females in critical thinking skills
or there is no relationship between socio-economic status and academic performance.
Such a hypothesis is denoted by the symbol “H0”. In other words, you are saying:
You do not expect the groups to be different
You do not expect the variables to be related
ALTERNATIVE HYPOTHESIS
The Alternative Hypothesis (Ha, also written H1) is the opposite of the Null Hypothesis. For example,
the alternative hypothesis for the study discussed earlier is that there is a difference in
science scores between the discovery method group and the lecture method group
represented by the following notation:
Ha: µ1 ≠ µ2
Ha: The Alternative Hypothesis might be that the mean science scores of the
discovery method group and the lecture method group are DIFFERENT.
Ha: µ1 > µ2
Ha: The Alternative Hypothesis might be that the mean science score of the
discovery method group is HIGHER than the mean score of the lecture method
group.
Ha: µ1 < µ2
Ha: The Alternative Hypothesis might be that the mean science score of the
discovery method group is LOWER than the mean score of the lecture method
group.
CONCLUSION:
Based on the findings of the experiment, you found that there was a significant
difference in science scores between the discovery method group and the
lecture method group.
In fact, the mean score of subjects in the discovery method group was higher
than the mean of subjects in the lecture method group. What do you do?
You REJECT the null hypothesis, which stated that the two means would be
equal (H0: µ1 = µ2).
You reject the null hypothesis in favour of the ALTERNATIVE HYPOTHESIS
(i.e. µ1 ≠ µ2).
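One way to judge whether an observed difference is "significant" is a permutation test, sketched below. The scores are hypothetical, and the module does not say which test was used in the study; a permutation test is swapped in here because it needs only the Python standard library. The 0.05 significance level is a common convention, assumed for the sketch.

```python
from itertools import combinations

# Hypothetical science scores (invented for illustration).
discovery = [78, 85, 90, 72, 88, 81]
lecture = [65, 70, 74, 68, 79, 62]


def permutation_p_value(group1, group2):
    """Two-sided exact permutation test for a difference in means.

    H0: mu1 = mu2. The p-value is the share of all possible regroupings
    of the pooled scores whose mean difference is at least as large as
    the one actually observed.
    """
    pooled = group1 + group2
    n1, n = len(group1), len(pooled)
    total = sum(pooled)

    def scaled_diff(sum1):
        # |mean1 - mean2| multiplied by n1 * n2; staying with whole
        # numbers avoids rounding error when scores are integers.
        return abs(sum1 * n - total * n1)

    observed = scaled_diff(sum(group1))
    extreme = 0
    regroupings = 0
    for indices in combinations(range(n), n1):
        regroupings += 1
        if scaled_diff(sum(pooled[i] for i in indices)) >= observed:
            extreme += 1
    return extreme / regroupings


ALPHA = 0.05  # conventional significance level, assumed here
p = permutation_p_value(discovery, lecture)
print(f"p = {p:.4f}")
print("Reject H0" if p < ALPHA else "Fail to reject H0")
```

With these invented scores only 12 of the 924 possible regroupings produce a difference as large as the observed one, so the null hypothesis of equal means is rejected at the 0.05 level.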
What is a variable?
Definition box:
For example, “income” is a variable that changes between data units in a population
(i.e. employees or businesses being studied may not have the same incomes) and can
also vary over time for each data unit (i.e. income can go up or down).
Researchers somewhat loosely call the constructs or properties they study
‘variables’, e.g. gender and social class. A variable is something that varies. A variable is
a symbol to which numerals or values are assigned. For example, the symbol
"intelligence" is assigned a set of numerical values which may be IQ scores ranging
from 50 to 150. The variable "gender" has only two values, e.g. male (1) and female (0);
such variables are called dichotomous variables. Other examples of
two-value variables are: graduate-nongraduate, low income-high income, citizen-
noncitizen. Besides dichotomous variables, some variables are polytomies, e.g.
religion - Islam, Christianity, Buddhism, Hinduism, etc.
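Assigning numerals to the values of a variable can be sketched as a small codebook. The gender codes follow the text (male = 1, female = 0); the religion codes, the function name, and the two respondents are hypothetical and serve only as labels for this illustration.

```python
# Codes for gender follow the text (male = 1, female = 0).
GENDER_CODES = {"male": 1, "female": 0}
# Codes for religion are arbitrary labels chosen for this sketch.
RELIGION_CODES = {"Islam": 1, "Christianity": 2, "Buddhism": 3, "Hinduism": 4}


def variable_kind(codebook):
    """Classify a categorical variable by how many values it can take."""
    if len(codebook) < 2:
        raise ValueError("a variable must take at least two values")
    return "dichotomous" if len(codebook) == 2 else "polytomous"


# Hypothetical respondents.
respondents = [
    {"gender": "female", "religion": "Christianity"},
    {"gender": "male", "religion": "Islam"},
]
coded = [
    (GENDER_CODES[person["gender"]], RELIGION_CODES[person["religion"]])
    for person in respondents
]
print(coded)
print(variable_kind(GENDER_CODES), variable_kind(RELIGION_CODES))
```

The numerals here are names only. A religion coded 4 is not "more" than one coded 1, so arithmetic on these codes would be meaningless.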
TYPES OF VARIABLES
There are many ways of classifying variables but in educational research, the
two most common methods of classification are as follows:
Independent and Dependent Variables
Continuous and Categorical Variables (or nominal variables)
Put another way, the dependent variable (DV) is the variable predicted to, whereas the
independent variable (IV) is the variable predicted from. The DV is the presumed effect, which varies
with changes or variation in the independent variable.
INDEPENDENT VARIABLE (IV)                 DEPENDENT VARIABLE (DV)
Teaching Method (Discussion, Lecture)     Academic Performance
“If you lead a good life, you will not suffer”. This is a specific prediction about the
future, but it cannot be scientifically tested. Such a prediction cannot be scientifically
tested because its terms cannot be defined operationally. How do you define ‘good life’ and
how do you define ‘suffer’? According to Bridgman (1987), an operational definition
means that the variables used in a study must be defined as they are used in the context of
the study and must be publicly observable. This is done to facilitate measurement and to
eliminate confusion.
However, it should be borne in mind that in the social sciences not all variables are
directly observable. For example, we cannot really observe learning, memory,
reasoning, and so forth. Though they cannot be observed directly, their traces can be
measured. With enough indirect evidence, researchers can make a convincing case
for the existence of these invisible variables (Mitchell and Jolley, 1988). For example,
though we cannot observe learning directly, we can see its effect on performance, i.e.
we can operationally define learning as an increase in performance. Thus, if we see
students improve their performance after practising a task, we conclude that learning
has occurred. Similarly, we can provide operational definitions for intangible
variables such as self-esteem, racial stereotypes, attitudes and so forth.
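The operational definition of learning as an increase in performance can be written out as a short calculation. This is a minimal sketch: the scores, the variable names, and the choice of the mean gain as the summary are assumptions made for illustration.

```python
from statistics import mean

# Hypothetical task scores for the same four students,
# measured before and after practising the task.
scores_before = [12, 15, 9, 14]
scores_after = [16, 18, 13, 15]


def performance_gain(before, after):
    """Operational definition of learning: mean increase in performance."""
    return mean(post - pre for pre, post in zip(before, after))


gain = performance_gain(scores_before, scores_after)
learning_occurred = gain > 0
print(f"Mean gain: {gain}, learning occurred: {learning_occurred}")
```

Because the definition is stated in terms of observable scores, another researcher can repeat the measurement and check the conclusion.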
Variable: Excellent Principal
Operational Definition: The person:
    listens to teachers
    looks after the welfare of teachers
    acknowledges effort
    consults teachers
    motivates teachers
2. What is measurement?
The principle in research is: Always use the highest level of measurement that you can.
Definition of Measurement
Definition box
The rules for assigning labels to properties of variables are the most important
components of measurement, because the result of poor rules is meaningless
outcomes.
Concepts often cannot be measured directly, e.g. “intelligence,” so what is
usually measured are indicators of constructs, such as speed, logic, verbal skill
etc.
Levels of measurement
Four levels of measurement have been identified. These levels differ in how
closely they approach the structure of the number system we use
Understanding the level of measurement of variables used in research is
important because the level of measurement determines the types of
statistical analyses that can be conducted.
The conclusions that can be drawn from research depend on the statistical
analysis used.
1. Nominal
This is the most basic level of measurement. At this level we can determine only
whether two observations are alike or different. Nominal level of measurement uses
symbols to classify observations into mutually exclusive and exhaustive categories.
This level of measurement is qualitative. It involves naming things and putting them
into mutually exclusive and exhaustive categories.
Mutually exclusive means the categories must be distinct so that no
observation falls into more than one category
Exhaustive means sufficient categories must exist so that all observations fall
into some category
Example:
In nominal measurement, all observations in one category are alike on some
property and differ from the members in the other category on that property
(e.g. sex, marital status)
No ordering of categories exists. We cannot say one category is better or worse,
or more or less than another.
2. Ordinal
Ordinal level of measurement uses symbols to classify observations into categories
that are not only mutually exclusive and exhaustive but also have some explicit
relationship (an order) among them.
Observations may be classified into categories such as taller and shorter; greater and
lesser; faster and slower; harder and easier; and so on. The categories must be
exhaustive and mutually exclusive.
Most questionnaires use Likert type items. For example, we may ask teachers
about their job satisfaction
Asking whether a teacher is very satisfied, satisfied, neutral, dissatisfied, or
very dissatisfied is using an ordinal scale of measurement
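As a rough illustration of what an ordinal scale allows, the sketch below ranks the five satisfaction categories from the example above and reports the median category. The rank codes 1 to 5, the function name and the sample responses are assumptions made for illustration; the codes express order only, not equal distances between categories.

```python
from statistics import median_low

# Categories from the job-satisfaction example, listed from lowest to highest.
LIKERT_ORDER = [
    "very dissatisfied",
    "dissatisfied",
    "neutral",
    "satisfied",
    "very satisfied",
]
# Rank codes are an illustrative assumption: they carry order, not distance.
RANK = {label: position for position, label in enumerate(LIKERT_ORDER, start=1)}


def median_response(responses):
    """Return the median category of a list of Likert-type responses."""
    # median_low always returns an observed rank, so it maps back to a label.
    middle_rank = median_low(RANK[r] for r in responses)
    return LIKERT_ORDER[middle_rank - 1]


# Hypothetical answers from five teachers.
sample_responses = ["satisfied", "neutral", "very satisfied", "satisfied", "dissatisfied"]
typical = median_response(sample_responses)
```

The median is used here rather than the mean because an ordinal scale tells us which response is higher, not by how much.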
3. Interval
The interval level of measurement classifies observations into mutually exclusive and
exhaustive categories that have some explicit relationship among them, and the
relationship between the categories is known and exact. This is the first quantitative
application of numbers on the scale of measurement.
4. Ratio
The ratio level of measurement is the highest level at which variables can be
measured. It has all the properties of the interval level of measurement with the
addition of a meaningful and non-arbitrary zero point
Variables measured at a higher level can always be converted to a lower level
but not vice versa.
Example: Weight, age, area, speed, velocity.
Observations of actual age (ratio scale) can be collapsed to categories of
younger and older (ordinal scale), but age measured simply as younger or
older cannot be converted to measures of actual age.
Ratio – Equal intervals & absolute zero
- Basic Empirical Operations
  Determination of equality of ratios
- Permissible Statistics
  Same as for interval
  Coefficient of variation
  Logarithmic transformations
- Examples:
  Temperature: Kelvin scale
  Length, weight, force, age etc.
  Money, number of students in class
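The coefficient of variation listed above is only meaningful when the scale has a true zero. The sketch below is one possible way to compute it, assuming the usual definition (sample standard deviation divided by the mean); the function name and the class sizes are hypothetical.

```python
from statistics import mean, stdev


def coefficient_of_variation(values):
    """Sample standard deviation divided by the mean (ratio-level data only)."""
    average = mean(values)
    if average == 0:
        raise ValueError("coefficient of variation is undefined when the mean is zero")
    return stdev(values) / average


# Hypothetical numbers of students in three classes (a ratio-level variable).
class_sizes = [30, 40, 50]
cv = coefficient_of_variation(class_sizes)  # standard deviation 10, mean 40
```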
Remember this rule: Always measure things at the highest level of measurement
possible. Do not measure things at the ordinal level if you can measure them
at the interval level.
For example, if you are researching farmers and you want to know the price they
paid for their seeds, then ask the price. Do not ask them to indicate whether they paid
between “K1m and K2m” etc.
If you want to know how much education people have had, ask them how many years
they went to school. During data analysis you can lump interval level data together
into ordinal or nominal categories.
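The point that detailed data can be collapsed later, but not the other way round, can be sketched as follows. The cut-off points and category names are assumptions chosen for illustration, not values given in this module.

```python
def schooling_category(years):
    """Collapse years of schooling into an ordinal category.

    The cut-offs (7 and 12 years) are hypothetical.
    """
    if years < 0:
        raise ValueError("years of schooling cannot be negative")
    if years <= 7:
        return "primary or less"
    if years <= 12:
        return "secondary"
    return "post-secondary"


# Hypothetical answers to "How many years did you go to school?"
years_reported = [5, 12, 16, 9]
categories = [schooling_category(y) for y in years_reported]
```

Once only the category is recorded, the exact number of years cannot be recovered, which is why the higher level of measurement should be collected first.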
Scales of Measurement
Scale Level   Scale of Measurement   Scale Qualities               Example(s)
                                     Magnitude, Absolute Zero
3             Interval               Magnitude, Equal Intervals    Temperature
What is validity?
Definition box:
Validity refers to the accuracy and trustworthiness of instruments, data, and findings
in research. Nothing in research is more important than validity. There are two types
of validity: Internal and External validity
1. Internal Validity
Internal validity refers both to how well a study was run (research design,
operational definitions used, how variables were measured, what was/wasn't
measured, etc.), and how confidently one can conclude that the observed effect(s)
were produced solely by the independent variable and not extraneous ones. In
experimental research, internal validity answers the question, "Was it really the
treatment that caused the difference between the subjects in the control and
experimental groups?" In descriptive studies (correlational, etc.) internal validity
refers only to the accuracy/quality of the study (e.g., how well the study was run).
In their classic book on experimental research, Campbell and Stanley (1966) identify
and discuss 8 types of extraneous variables that can, if not controlled, jeopardize an
experiment's internal validity.
(i) History-- refers to the effect external events have on subjects between the
various measurements done in an experiment. These experiences function like
extra, and unplanned, independent variables. Compounding this, the
experiences are likely to vary across subjects which have a differential effect
on the subjects' responses. Studies that take repeated measures on subjects
over time are more likely to be affected by history variables than those that
collect data in shorter time periods, or those that do not use repeated measures.
(ii) Maturation-- refers to how subjects naturally can change over the passage of
time (rather than due to the treatment). For example, the more time that passes
in a study the more likely subjects are to become tired and bored, more or less
motivated as a function of hunger or thirst, older, etc. As Isaac and Michael
(1971) point out, subjects may perform better or worse on a dependent
variable not as a result of the independent variable but because they are older,
more/less motivated, etc.
(iii) Testing-- refers to how a pre-test can affect subjects' performance on a post-
test. Many experiments pre-test subjects to establish that all the subjects are
starting the study at approximately the same level, etc. A consequence of
pretesting programs/protocols is that they can contaminate/change the
subjects' performance on later tests (e.g., those used as dependent variables)
that measure the same domain beyond any effects caused by the treatment
itself.
(iv) Instrumentation-- The reliability of the instrument used to gauge the
dependent variable or manipulate the independent variable may change in the
course of an experiment. Examples include changes in the calibration of a
mechanical measuring device as well as the proficiency of a human observer
or interviewer. Suppose that the dependent variable is measured twice for a
group of subjects, once at Time A and later at Time B, and that the
independent variable is introduced in the interim. Suppose also that the ability
of a recording device to detect instances of the target behaviour improves
(declines) as the experiment progresses. If scores on the dependent measure
differ at these two times, the discrepancy may be due to the independent
variable or to more (less) sensitive recordings of the target behaviour at Time
B relative to at Time A. In addition, changing the measurement methods (or
their method of administration) during a study can affect what is measured.
(v) Statistical Regression-- Statistical regression is the phenomenon whereby
extreme pre-test scores tend to regress toward the mean on retesting. When subjects in a study are
selected as participants because they scored extremely high or extremely low
on some measure of performance (e.g., a test, etc.), retesting of the subjects
will almost always produce a different distribution of scores, and the average
for this new distribution will be closer to the population's. For example, if the
chosen subjects all had high scores initially, the group's average on the retest
will tend to be lower (i.e., less extreme) than it was originally. Conversely, if
the group's mean was originally low, their retest mean would be higher.
(vi) Selection-- refers to the effect of non-equivalent groups on a study's validity.
The subjects in comparison (e.g., the control and experimental) groups should
be functionally equivalent at the beginning of a study. If they are, then
observed differences between the groups, as measured by the performance
dependent variable(s), at the end of the study are more likely to be caused only
by the independent variable instead of organismic ones. If the comparison
groups are different from one another at the beginning of the study, then the
observed effect(s) may be due to these differences, as opposed to the result of
the experimental treatment.
(vii) Experimental Mortality/Attrition-- refers to the potential bias that
occurs depending on who stays or drops out of a study. Subjects frequently
'drop out' of studies. If one comparison group experiences a higher level of
subject attrition than other groups, then observed differences between groups
become questionable. Were the observed differences produced by the
independent variable or by the different dropout rates? (Mortality is also a
threat when dropout rates are similar across comparison groups but high.)
(viii) Selection Interactions-- In some studies the selection method can
interact with maturation, history or instrumentation, also biasing the study's
results.
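Threat (v), statistical regression, can be made concrete with a small simulation. The sketch below assumes each observed score is a stable ability plus independent measurement error, selects the subjects with the highest first-test scores, and compares their average on a retest with no treatment at all. The score model, the parameter values and the function name are illustrative assumptions.

```python
import random


def simulate_retest(n_subjects=10000, top_fraction=0.1, seed=1):
    """Return (first-test mean, retest mean) for the top scorers on the first test."""
    rng = random.Random(seed)
    # Observed score = stable ability + independent measurement error (assumed model).
    ability = [rng.gauss(50, 10) for _ in range(n_subjects)]
    test1 = [a + rng.gauss(0, 10) for a in ability]
    test2 = [a + rng.gauss(0, 10) for a in ability]
    # Select subjects because they scored extremely high on the first test.
    cutoff = sorted(test1, reverse=True)[int(n_subjects * top_fraction) - 1]
    chosen = [i for i in range(n_subjects) if test1[i] >= cutoff]
    mean1 = sum(test1[i] for i in chosen) / len(chosen)
    mean2 = sum(test2[i] for i in chosen) / len(chosen)
    return mean1, mean2


first_mean, retest_mean = simulate_retest()
```

Under these assumptions the retest mean of the selected group falls back toward the population mean of 50 even though nothing was done to the subjects, which is exactly the change a careless researcher might attribute to the treatment.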
2. External Validity
External validity represents the extent to which a study's results can be generalized
or applied to other people or settings. Campbell and Stanley (cited in Isaac &
Michael, 1971) have identified 4 factors that can adversely affect a study's external
validity.
(i) An interaction between how the subjects were selected and the treatment
(e.g., the independent variable) can occur. If subjects are not randomly
selected from a population, then their particular characteristics may bias their
performance and the study's results may not be applicable to the population or
to another group that more accurately represents the characteristics of the
population.
(ii) Pretesting subjects in a study may cause them to react more/less strongly to
the treatment than they would have had they not experienced the pre-test. In
such situations the researcher(s) cannot conclude that members of the
population who were not pretested would perform in a similar manner to the
participants in the study. Restated, to generalize the results of the study the
researcher would have to specify that a particular type of pretesting also be
done because the pretesting could be serving as an extra, unintentional
independent variable.
(iii) The performance of subjects in some studies is more a product of, or reaction to,
the experimental setting (e.g., the situation where the study is conducted) than
it is to the independent variable. For example, subjects who know they are
participants in a study, or who are aware of being observed, etc., may react
differently to the treatment than a subject who experienced the treatment but
was not aware of being observed (Hawthorne Effect).
(iv) Studies that use multiple treatments/interventions may have limited
generalizability because the early treatments may have a cumulative effect on
the subjects' performance. If a group experienced treatment X1, and the first
treatment was followed by a second (X2), their measured performance after
X2 will be affected by both treatments not just X2's because the effects of X1
are not erasable.
UNIT 3: LITERATURE REVIEW
LEARNING OUTCOMES
When you have completed this unit you will be able to:
Explain the role of literature review in social research
Explain the importance of literature review in social research
Conduct literature review in your own social research
Most people are aware that a literature review is a process of gathering information
from other sources and documenting it, but few have any idea of how to evaluate the
information, or how to present it.
Definitions box
A literature review is not merely a collection of quotes and paraphrases from other
sources. A good literature review should also evaluate the quality and findings of the
research.
A good literature review should avoid the temptation to stress the importance of
a particular research program. The fact that a researcher is undertaking the research
program speaks for its importance, and an educated reader may well be insulted if
they are not allowed to judge the importance for themselves. They want to be
reassured that it is a serious paper, not a pseudo-scientific sales advertisement.
For example, a review of Victorian-age physics could present J.J. Thomson’s famous
experiments in chronological order. Otherwise, a chronological layout is usually
perceived as a little lazy, and it is better to organize the review around ideas and
individual points.
As a general rule, certainly for a longer review, each paragraph should address one
point, and present and evaluate all of the evidence, from all of the differing points of
view.
The only real way to evaluate is through experience, but there are a few tricks for
evaluating information quickly, yet accurately.
There is such a thing as ‘too much information,’ and Google does not distinguish or
judge the quality of results, only how search engine friendly a paper is. This is why it
is still good practice to begin research in an academic library. Any journals found
there can be regarded as safe and credible.
The next stage is to use the internet, and this is where the difficulties start. It is very
difficult to judge the credibility of an online paper. The main thing is to structure the
internet research as if it were on paper. Bookmark papers that may be relevant in
one folder and make a subfolder for a ‘shortlist.’
The easiest way is to scan the work, using the abstract and introduction as
guides. This helps to eliminate the non-relevant work and also some of the
lower quality research.
If it sets off alarm bells, there may be something wrong, and the paper is
probably of low quality. Be very careful, however, not to fall into the trap of
rejecting research just because it conflicts with your hypothesis. Doing so will
completely invalidate the literature review and potentially undermine the
research project. Any research that may be relevant should be moved to the
shortlist folder.
The next stage is to critically evaluate the paper and decide whether the research is
of sufficient quality. The temptation is to try to include as many sources as
possible, because it is easy to fall into the trap of thinking that a long
bibliography equates to a good paper. A smaller number of quality sources is far
preferable to a long list of irrelevant ones.
Check into the credentials of any source upon which you rely heavily for the
literature review. The reputation of the University or organization is a factor,
as is the experience of the researcher. If their name keeps cropping up, and
they have written many papers, the source is usually OK.
Look for agreements. Good research should have been replicated by other
independent researchers, with similar results, showing that the information is
usually fairly safe to use.
If the process is proving to be difficult, and in some fields, like medicine and
environmental research, there is a lot of poor science, do not be afraid to ask a
supervisor for a few tips. They should know some good and reputable sources
to look at. It may be a little extra work for them, but there will be even more
work if they have to tear apart a review because it is built upon shaky
evidence.
UNIT 4: RESEARCH METHODOLOGY AND RESEARCH DESIGN
LEARNING OUTCOMES
When you have completed this unit you will be able to:
Explain what is meant by research methodology
Explain the difference between research methodology and social research
methods
Identify the various research designs used in social research
Explain the different types of research designs
Definition box:
Definition box:
Research design can be thought of as the structure of research -- it is the "glue" that
holds all of the elements in a research project together. We often describe a design
using a concise notation that enables us to summarize a complex design structure
efficiently. What are the "elements" that a design includes? It is the master plan
specifying the methods and procedures for collecting and analysing the needed
information.
RESEARCH DESIGN
Research design provides the glue that holds the research project together. A design is
used to structure the research, to show how all of the major parts of the research
project -- the samples or groups, measures, treatments or programs, and methods of
assignment -- work together to try to address the central research questions. Here,
after a brief introduction to research design, I'll show you how we classify the major
types of designs. You'll see that a major distinction is between the experimental
designs that use random assignment to groups or programs and the quasi-experimental
designs that don't use random assignment. People often confuse what is meant by
random selection with the idea of random assignment. You should make sure that
you understand the distinction between random selection and random assignment.
Understanding the relationships among designs is important in making design choices
and thinking about the strengths and weaknesses of different designs. Then, I'll talk
about the heart of the art form of designing designs for research and give you some
ideas about how you can think about the design task. Finally, I'll consider some of the
more recent advances in quasi-experimental thinking -- an area of special importance
in applied social research and program evaluation.
Research Design:
It highlights decisions which include
The name of the study
The purpose of the study
The location where the study would be conducted
The nature of data required
The source(s) of the required data
The duration of the study
The type of sample design to employ
The techniques of data collection
The methods of data analysis
These experiments are sometimes referred to as true science, and use traditional
mathematical and statistical means to measure results conclusively.
They are most commonly used by physical scientists, although social sciences,
education and economics have been known to use this type of research. It is the
opposite of qualitative research.
Quantitative experiments all use a standard format, with a few minor inter-
disciplinary differences, of generating a hypothesis to be proved or disproved. This
hypothesis must be provable by mathematical and statistical means, and is the basis
around which the whole experiment is designed.
Ideally, the research should be constructed in a manner that allows others to repeat the
experiment and obtain similar results.
RESEARCH DESIGNS
1. SURVEY
What is a survey?
Definition box
A survey is a type of research in the course of which the researcher tries to gain an
overall picture of a comprehensive phenomenon spread out over a period of time and
space.
Characteristics of a survey
Large number of research units: This can be literally anything the researcher
intends to make statements about
Labour extensive data generation: The researcher uses less time-consuming
methods to generate data. This is essential considering the large number of
research units that need to be approached.
More breadth than depth
A random sample: Taking a random sample is typical for a survey. A random
sample is a sample in which all potential research units in a population
have an equal chance of being included, regardless of their characteristics. A
random sample is required to gain a representative picture of the
whole population, so that the results can be generalised later on.
Quantitative data and analysis
Preferably remote, closed data generation
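The random-sample characteristic above can be sketched in a few lines. The example assumes a complete list of research units (a sampling frame) is available; the frame of 200 teachers, the sample size and the function name are hypothetical.

```python
import random


def simple_random_sample(population, sample_size, seed=None):
    """Draw units without replacement so every unit has an equal chance of inclusion."""
    rng = random.Random(seed)  # a seed only makes the illustration repeatable
    return rng.sample(population, sample_size)


# Hypothetical sampling frame: 200 teachers identified by code.
sampling_frame = [f"teacher_{number:03d}" for number in range(1, 201)]
sample = simple_random_sample(sampling_frame, 20, seed=42)
```

Because selection ignores the characteristics of the units, the sample can be expected to resemble the population, which is what permits generalisation.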
This is a simple design and is aimed at finding out the prevalence of a phenomenon,
problem, attitude or issue by taking a snap-shot or cross-section of the population.
This obtains an overall picture as it stands at the time of the study. It measures units
from a sample of the population at only one point in time. Sample surveys are cross-
sectional studies whose samples are drawn in such a way as to be representative of a
specific population.
This is a type of research during which measuring takes place at various moments in
time within one and the same group. This type of research is especially suitable for
showing changes that are taking place within research units. For example, you would
like to know the influences a further training course for employees would have on
their ability to solve problems that arise during work. In this case you could measure
their problem-solving skills before and after the training course, respectively called
ex-ante and ex-post measurement. After comparing both measurement results, you are able to
determine for each employee whether he or she has made any progress.
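The before-and-after comparison described above amounts to a simple difference per employee. The sketch below assumes problem-solving skill has been scored numerically at both moments; the employee labels, scores and function name are hypothetical.

```python
def progress_per_employee(before, after):
    """Return the after-minus-before score for every employee measured twice."""
    return {name: after[name] - before[name] for name in before if name in after}


# Hypothetical problem-solving scores before and after the training course.
before_training = {"A": 12, "B": 15, "C": 9}
after_training = {"A": 16, "B": 15, "C": 8}
progress = progress_per_employee(before_training, after_training)
# A positive value indicates progress, zero no change, a negative value a decline.
```

Without a comparison group, such differences show change within the group but cannot by themselves show that the course caused it.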
(iii) Time sequence
data that are constantly being gathered by different organisations. This is called
“official statistical data.”
2. EXPERIMENT
What is an experiment?
Definition box
An experiment is the most suitable type of research for gaining experience with newly
created situations or processes, which can be used to assess the effects of these
changes. You can get an idea of these effects by creating (at least) two groups which
are as similar as possible. One group receives special treatment (intervention) and the
other does not (or receives a different treatment). Subsequently you compare the
differences in performance between the two groups.
Characteristics of an Experiment
The formation of (at least) two groups, an experimental group and a control
group
A random assignment of participants or research objects to either group. This
is called randomisation
The researcher determines (and not the people being examined) which group
is subjected to the intervention and what happens further within the groups
The researcher makes sure that there are as few outside influences as possible
It is characterised by ex-post and ex-ante measurements.
Control. Control refers to steps taken to reduce the effects of extraneous
variables (i.e., variables other than the independent variable and the
dependent variable). These extraneous variables are called lurking variables.
experimenter compares results in the treatment group to results in the
control group.
Confounding
Confounding occurs when the experimental controls do not allow the experimenter to
reasonably eliminate plausible alternative explanations for an observed relationship
between independent and dependent variables.
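Consider this example. A drug manufacturer tests a new cold medicine with 200
participants - 100 men and 100 women. The men receive the drug, and the women do
not. At the end of the test period, the men report fewer colds. Because sex and
treatment vary together, the difference could be due to the drug or to sex; the two
are confounded.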
This experiment could be strengthened with a few controls. Women and men could be
randomly assigned to treatments. One treatment group could receive a placebo, with
blinding. Then, if the treatment group (i.e., the group getting the medicine) had
sufficiently fewer colds than the control group, it would be reasonable to conclude
that the medicine was effective in preventing colds.
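Random assignment, as suggested above, might be sketched as follows. The participant records, the even split and the function name are assumptions for illustration.

```python
import random


def randomly_assign(participants, seed=None):
    """Shuffle participants and split them evenly into treatment and control groups."""
    rng = random.Random(seed)
    shuffled = list(participants)  # copy so the original list is left untouched
    rng.shuffle(shuffled)
    half = len(shuffled) // 2
    return {"treatment": shuffled[:half], "control": shuffled[half:]}


# Hypothetical participants: 100 men and 100 women, as in the example above.
participants = [("M", i) for i in range(100)] + [("F", i) for i in range(100)]
groups = randomly_assign(participants, seed=7)
```

Since chance alone decides who receives the medicine, men and women end up in both groups, so sex is no longer tied to treatment.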
Parts of an Experiment
Each factor has two or more levels (i.e., different values of the factor).
Combinations of factor levels are called treatments. The table below shows
independent variables, factors, levels, and treatments for a hypothetical
experiment.
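The relation between factors, levels and treatments can be shown by enumerating every combination of factor levels. The two factors and their levels below are hypothetical and do not reproduce the module's table.

```python
from itertools import product


def list_treatments(factors):
    """Return every combination of factor levels; each combination is one treatment."""
    names = list(factors)
    level_lists = [factors[name] for name in names]
    return [dict(zip(names, combination)) for combination in product(*level_lists)]


# Hypothetical experiment with two factors, each at two levels.
factors = {
    "dosage": ["low", "high"],
    "schedule": ["morning", "evening"],
}
all_treatments = list_treatments(factors)  # 2 levels x 2 levels = 4 treatments
```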
VARIANTS OF EXPERIMENTS
Experimental Designs
(i) Pre-Experimental Design - loose in structure, could be biased
One-shot experimental case study
- Aim: To attempt to explain a consequent by an antecedent
- Notation: X → O
- Comments: An approach that prematurely links antecedents and consequences. The
  least reliable of all experimental approaches.
(ii) True Experimental Design - greater control and refinement, greater control of
validity
Posttest only control group
- Aim: To evaluate a situation that cannot be pretested
- Notation: R -- [ X → O
                 [ - → O
- Comments: An adaptation of the last two groups in the Solomon four-group design.
  Randomness is critical. Probably, the simplest and best test for significance in
  this design is the t-test.
(iii) Quasi-Experimental Design - not randomly selected
Nonrandomized control group pretest-posttest
- Aim: To investigate a situation in which random selection and assignment are not
  possible
- Notation: O → X → O
            O → - → O
- Comments: One of the strongest and most widely used quasi-experimental designs.
  Differs from experimental designs because test and control groups are not
  equivalent. Comparing pretest results will indicate degree of equivalency between
  experimental and control groups.
Ex post facto studies
- Aim: To search backward from consequent data for antecedent causes
- Comments: This approach is experimentation in reverse. Seldom is proof through
  data substantiation possible. Logic and inference are the principal tools of this
  design.
Source: Leedy, P.D. (1997). Practical research: Planning and design (6th Ed.). Upper
Saddle River, NJ: Prentice-Hall, Inc., p. 232-233.
After statistical analysis of the results, a comprehensive answer is reached, and the
results can be legitimately discussed and published. Quantitative experiments also
filter out external factors, if properly designed, and so the results gained can be seen
as real and unbiased.
Quantitative experiments are useful for testing the results gained by a series of
qualitative experiments, leading to a final answer, and a narrowing down of possible
directions for follow up research to take.
In addition, the requirements for the successful statistical confirmation of results are
very stringent, with very few experiments comprehensively proving a hypothesis;
there is usually some ambiguity, which requires retesting and refinement to the
design. This means another investment of time and resources must be committed to
fine-tune the results.
Quantitative research design also tends to generate only proved or unproved results,
with there being very little room for grey areas and uncertainty. For the social
sciences, education, anthropology and psychology, human nature is a lot more
complex than just a simple yes or no response.
Data are not inherently quantitative, and can be bits and pieces of almost anything.
They do not necessarily have to be expressed in numbers. Frequency distributions and
probability tables don't have to be used. Data can come in the form of words, images,
impressions, gestures, or tones which represent real events or reality as it is seen
symbolically or sociologically (If people believe things to be real, they are real in
their consequences - the Thomas Dictum). Qualitative research uses unreconstructed
logic to get at what is really real -- the quality, meaning, context, or image of reality in
what people actually do, not what they say they do (as on questionnaires).
Unreconstructed logic means that there are no step-by-step rules, that researchers
ought not to use prefabricated methods or reconstructed rules, terms, and procedures
that try to make their research look clean and neat (as in journal publications).
For these reasons, these qualitative methods are often closely allied with interviews,
survey design techniques and individual case studies, as a way to reinforce and
evaluate findings over a broader scale.
Qualitative methods are probably the oldest of all scientific techniques, with Ancient
Greek philosophers qualitatively observing the world around them and trying to come
up with answers which explained what they saw.
- Not objectives
- Not hypotheses
- Central research questions: This is a broad question that asks for the
exploration of the central phenomenon
- Sub-questions: These are questions that narrow the focus of the study
RESEARCH DESIGNS
The design of qualitative research is probably the most flexible of the various research
techniques, encompassing a variety of accepted methods and structures.
1. CASE STUDY
Definition box
Case Selection: Single case study versus multiple or comparative case study
Single case study
- Typical case: highlights what is normal or average
- Unique case: this refers to a highly unusual manifestation of the
phenomenon or situation or case
- Intensive case: many manifestations of the phenomenon
Multiple case study
- Minimum differences between cases
- Maximum differences between cases
Purposive sampling
Theory based sampling: In this type of sampling, the interviewer decides on
the basis of his or her expectations
Snowball/chain sampling: The interviewer follows up contacts mentioned by
other respondents
Quota sampling: The interviewer interviews a certain number of people per
category.
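Of the sampling types above, quota sampling is the most mechanical and can be sketched directly. The example assumes respondents are approached one at a time and accepted until each category has reached its quota; the names, categories, quota and function name are hypothetical.

```python
def quota_sample(respondents, category_of, quota):
    """Accept respondents in the order met until each category holds `quota` people."""
    counts = {}
    selected = []
    for person in respondents:
        category = category_of(person)
        if counts.get(category, 0) < quota:
            counts[category] = counts.get(category, 0) + 1
            selected.append(person)
    return selected


# Hypothetical people in the order the interviewer meets them.
people_met = [
    ("Ann", "teacher"), ("Ben", "teacher"), ("Cy", "principal"),
    ("Di", "teacher"), ("Ed", "principal"), ("Flo", "principal"),
]
interviewed = quota_sample(people_met, lambda person: person[1], quota=2)
```

Unlike a random sample, who gets interviewed here depends on the order in which people are encountered, so the result is purposive rather than representative.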
Triangulation
What is Triangulation?
- Triangulation refers to the utilisation of multiple sources of
information to get an overview of a phenomenon
Types of Triangulation
- Triangulation of sources of data
- Triangulation of methods
- Triangulation of researchers
2. LONGITUDINAL STUDY
This follows study subjects over a long period of time with repeated data collection
throughout. Some longitudinal studies last several months, while others can last
decades. Most are observational studies that seek to identify a correlation among
various factors. Thus, longitudinal studies do not manipulate variables and are not
often able to detect causal relationships.
carry out. They are also useful when budgetary decisions have to be taken into
account.
The broader scope covered by these designs ensures that some useful data is always
generated, whereas an unproved hypothesis in a quantitative experiment can mean
that a lot of time has been wasted. Qualitative research methods are not as dependent
upon sample sizes as quantitative methods; a case study, for example, can generate
meaningful results with a small sample group.
Any qualitative research design is usually unique and cannot be exactly recreated,
meaning that qualitative studies lack replicability.
Mode of analysis Inductive (by researcher) Deductive (by statistical
methods)
Merriam, S.B. (1988). Case study research in education: A qualitative approach. San
Francisco: Jossey-Bass, p. 18.
UNIT 5: DATA COLLECTION METHODS
LEARNING OUTCOMES
When you have completed this unit you will be able to:
Explain the various methods of collecting data for qualitative and
quantitative research
Explain the differences between data collection methods
Identify the advantages and disadvantages of each data collection method
Quantitative Research
1. Interviews
Monthly Survey of Manufacturing, the General Social Survey and the Workplace
Employee Survey.
1.3 Self-completed
A major disadvantage of a mail survey is that it usually has lower response rates than
other data collection methods. This may lead to problems with data quality. Also,
people with a limited ability to read or write English or French may experience
problems.
Interviews can be
1. Unstructured
1. Can be referred to as 'depth' or 'in depth' interviews
2. They have very little structure at all
3. The interviewer may just go with the aim of discussing a limited
number of topics, sometimes as few as just one or two
4. The interviewer may frame the interview questions based on the
interviewee and his/her previous response
5. This allows the discussion to cover areas in great detail
6. They involve the researcher wanting to know or find out more about
a specific topic without there being a structure or a preconceived
plan or expectation as to how they will deal with the topic
2. Semi structured
4. 'The open ended nature of the question defines the topic under
investigation but provides opportunities for both interviewer and
interviewee to discuss some topics in more detail'
5. Semi structured interviews allow the researcher to prompt or
encourage the interviewee if they are looking for more information
or find what the interviewee is saying interesting
6. This method gives the researcher the freedom to probe the
interviewee to elaborate or to follow a new line of inquiry introduced
by what the interviewee is saying
7. They work best when the interviewer has a number of areas he/she wants
to be sure to address
3. Structured
1. The interviewer asks each respondent the same questions in the same
way
2. A tightly structured schedule is used
3. The questions may be phrased so that only a limited range of
responses can be given, e.g. 'Do you rate our services as very good,
good or poor?'
4. A researcher needs to consider whether a questionnaire or structured
interview is more appropriate
5. 'If the interview schedule is too tightly structured this may not enable
the phenomena under investigation to be explored in terms of either
breadth or depth.'
Qualitative interviews should be fairly informal, and participants should feel they are
taking part in a conversation or discussion rather than in a formal question and answer
situation.
1. Thought
2. Preparation
3. The development of the interview schedule
4. Conducting and analysing the interview data with care and consideration
2. Questionnaire design
Questionnaires play a central role in the data collection process. A well-designed
questionnaire efficiently collects the required data with a minimum number of errors.
It facilitates the coding and capture of data and it leads to an overall reduction in the
cost and time associated with data collection and processing. The biggest challenge in
developing a questionnaire is to translate the objectives of the survey into a well-
conceptualized and methodologically sound study.
Before you can design the questionnaire, you must plan the survey as a whole,
including the objectives, data needs and analysis. Once the questionnaire is designed,
it must be tested before you can proceed with the data collection.
School name_______________________________
Confidential
This survey provides you with an opportunity to share your thoughts on what is
needed to keep you and your school safe and healthy.
You do not have to complete this survey if you do not wish to do so. However,
everyone’s views are important and the more participation we receive, the better the
results will be. Please understand that this questionnaire is completely confidential.
Once the envelope is sealed, it will only be opened by the team entering your
responses to the questions into the computer system. Your envelope will be placed
with many others and there will be no way to identify individual respondents. The
results of all the questionnaires will be added together and reported back to the
school.
The opening questions of any survey should establish the respondents’ confidence in
their ability to answer the remaining questions. If necessary, the opening questions
should help determine whether the respondent is a member of the survey population.
A good questionnaire ends with a comments section that allows the respondent to
record any other issues not covered by the questionnaire. This is one way of avoiding
any frustration on the part of the respondent, as well as allowing them to express any
thoughts, questions or concerns they might have. Lastly, there should be a message at
the end thanking the respondents for their time and patience in completing the
questionnaire.
One of the most important factors in any survey is the design of the actual
questionnaire. The questions and instructions should be easy to understand and
respond to. The way a question is worded is very important as the same question
worded in a different manner may achieve completely different results. Consider the
following.
Abbreviations and acronyms
Better wording: Did you know that the population figures from the 2010 Census of
Population are available on the Central Statistical website at………..?
Better wording: Have you ever participated in an annual University of Zambia for
undergraduate students?
Example: Do you know who is leading the talks surrounding the impending
amalgamation of surrounding constituencies into the "new metro" areas?
Better wording: Do you know who is leading the talks in each of the provinces
regarding the amalgamation of cities, towns, villages and rural areas into "new
metro" areas?
Frame of reference
Does the word "your" refer to the respondent’s personal income, family income or
household income? Does the word "income" refer to salary and wages only, or does it
include tips or income from other sources? Because there is no specific time period
mentioned, does this question refer to last week’s income, last month’s or last year’s
income?
This question is too vague. It should be reworded so that all of the specific details
concerning the frame of reference are given.
Better wording: What was your household’s total income, from all sources before
taxes and deductions, for last year?
Specific questions
A question’s frame of reference is not the only specific detail required. In order to get
a uniform response from the entire sample, the question sometimes needs to state the
type of response needed.
Example: Respondents are shown a bottle of orange drink and are asked, "How much
orange juice do you think this bottle contains?"
Better wording: This bottle holds 250 millilitres (mL) of orange drink. How many
millilitres of this drink would you say are orange juice?
Double-barreled questions
Examples:
Do you plan to leave your car at home and take the bus to work during the coming
year?
Does your company provide training for new employees and retraining for existing
staff?
Each of the above examples asks two questions rather than one:
In the first example, the question asks respondents if they plan to leave their cars at
home, and whether or not they are taking the bus for the next year.
The second example asks respondents if their company provides training for new
employees as well as providing retraining for existing employees.
In some instances, the answer to each half of the question is the same. However,
sometimes there could be two very separate answers, which would make interpreting
this question difficult.
Loaded questions
The following examples demonstrate how a loaded question can influence
respondents' answers.
Example 1:
In your opinion, should Sunday shopping be allowed in Ontario; that is, should stores
that want to stay open on Sunday be allowed to stay open on Sundays if they want to?
The first question asked whether respondents were in favour of Sunday shopping,
while the second question asked whether they were in favour of not working on
Sundays. As a result, there was a significant change in the data.
A possible explanation for the difference in the results could be that some respondents
did not quite understand the implications of the question. Some people may be
opposed to working on Sundays, but are still in favour of shopping. However, if no
one works on Sundays, then stores cannot stay open for shoppers!
Generally there are two types of questions: open and closed. Open questions give
respondents an opportunity to answer the question in their own words. Closed
questions give respondents a choice of answers and the respondent is supposed to
select one.
- Open question
What is the most important issue facing today’s youth?
- Closed question
Which of these is the most important problem facing today’s youth?
Unemployment
National unity
Environment
Youth violence
Rising tuition fees
Drugs in schools
Need for more computers in schools
Career counseling
There are advantages and disadvantages to using one type of question versus another.
The open question allows the respondent to interpret the question and answer it
any way he or she chooses. The respondent writes the answer or the interviewer
records verbatim what the respondent says in answer to the question.
The closed question restricts the respondent to selecting an answer from the specified
response options. For the respondent, a closed question is easier and faster to answer,
and for the researcher, closed questions are easier and less expensive to code and
analyse. Closed questions also provide consistency, which an open question does not
necessarily offer.
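To illustrate why closed questions are cheap to code, the sketch below tallies answers to the closed question shown earlier. The function name and the sample answers are hypothetical and serve only as an illustration; the response options are taken from the example above.

```python
from collections import Counter

# Response options copied from the closed question above.
OPTIONS = [
    "Unemployment",
    "National unity",
    "Environment",
    "Youth violence",
    "Rising tuition fees",
    "Drugs in schools",
    "Need for more computers in schools",
    "Career counseling",
]


def tally_closed_responses(responses, options=OPTIONS):
    """Count how often each pre-coded option was selected.

    Answers that are not among the options are counted separately, so that
    data-entry errors stay visible instead of being dropped silently.
    """
    counts = Counter(r for r in responses if r in options)
    invalid = sum(1 for r in responses if r not in options)
    # Report every option, including those nobody chose.
    table = {option: counts.get(option, 0) for option in options}
    return table, invalid


# Hypothetical answers from six respondents (illustrative only).
sample_responses = [
    "Unemployment", "Environment", "Unemployment",
    "Drugs in schools", "Unemployment", "Enviroment",  # last entry is a typo
]
table, invalid = tally_closed_responses(sample_responses)
```

An open question would need a human coder to read each answer and assign it to a category before any such count could be made.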
instructions; determine problems caused by the respondent’s inability or
unwillingness to answer the questions; suggest additional response categories that can
be pre-coded on the questionnaire; and provide a preliminary indication of the length
of the interview and any refusal problems. Testing can include the complete
questionnaire or only a particular portion of it. The complete questionnaire will at
some point in time have to be fully tested.
1. Practical
2. Large amounts of information can be collected from a large number of
people in a short period of time and in a relatively cost effective way
3. Can be carried out by the researcher or by any number of people with
limited effect on its validity and reliability
4. The results of the questionnaires can usually be quickly and easily
quantified by either a researcher or through the use of a software package
5. Can be analysed more 'scientifically' and objectively than other forms of
research
6. When data has been quantified, it can be used to compare and contrast other
research and may be used to measure change
7. Positivists believe that quantitative data can be used to create new theories
and / or test existing hypotheses
Coding the answers to open ended questions leaves considerable room for
subjectivity on the part of the researcher.
Qualitative Research
There are many methods of data collection that are used in qualitative research
1. Participant Observation
2. Ethnography
3. Photography
4. Ethnomethodology
5. Dramaturgical Interview
6. Sociometry
7. Natural Experiment
8. Unobtrusive Measures
9. Content Analysis
10. Historiography
(i) Participant-observation
This is the process of immersing yourself in the study of people you're not too
different from. It is almost always done covertly, with the researcher never
revealing their true purpose or identity. If it's a group you already know a lot
about, you need to step back and take the perspective of a "martian", as if you
were from a different planet and seeing things in a fresh light. If it's a group you
know nothing about, you need to become a "convert" and really get committed
and involved. The more secretive and amorphous the group, the more you need
participation. The more localized and turf-conscious the group, the more you need
observation. It's customary in the literature to describe four roles:
It's difficult to say which of these four roles is the most common; probably the
middle two. The key point behind all of them is that the researcher must operate on
two levels: becoming an insider while remaining an outsider. They must avoid
becoming over-socialized, or "going native", as well as being personally revolted or
repulsed by the group conduct. Going native is sometimes described as giving up
research and joining the group for life. For instance, in most criminological circles, it
means losing your objectivity and glorifying criminals. Generally, participant-observation
takes time to carry out, anywhere from several weeks or months to 2-4 years. Gangs, hate
groups, prostitutes, and drug dealers have all been studied by this method.
(ii) Ethnography:
This is the process of describing a culture or way of life from a folk peoples' point of
view. Another name for it is field work. The folk point of view is the idea of a
universe in a dewdrop, each person a reflection of their culture in that all their
gestures, displays, symbols, songs, sayings, and everything else has some implicit,
tacit meaning for others in that culture. It's the job of ethnography to establish the
hidden inferences that distinguish, for example, a wink and a nod in any given culture.
Numerous funding opportunities exist both abroad and domestically for ethnographic
research.
The ethnographic method involves observation and note taking. The anthropologist
Clifford Geertz called it thick description. For about every half hour of observation,
an ethnographic researcher would write notes for about two hours. These notes would
contain rich, detailed descriptions of everything that went on. There would be no
attempt at summarizing, generalizing, or hypothesizing. The notes would capture as
factual a description of the drama as possible to permit multiple interpretations, and
most of all, to later infer cultural meaning. A coding procedure (much like content
analysis) would be used later for this.
- Take notes as soon as possible, and do not talk to anyone before note taking
- Count the number of times key words or phrases are used by members of the
  folk group
- Carefully record the order or sequence of events, and how long each sequence
  lasts
- Do not worry that anything is too insignificant; record even the smallest things
- Draw maps or diagrams of the location, including your movements and any
  reaction by others
- Write quickly and don't worry about spelling; devise your own system of
  punctuation
- Avoid evaluative judgments or summarizing; don't call something "dirty" for
  example, describe it
- Include your own thoughts and feelings in a separate section; your later
  thoughts in another section
- Always make backup copies of your notes and keep them in a separate
  location
(iii)Photography, or filmmaking:
(iv) Ethnomethodology:
(v) Dramaturgical interview:
Also simply called dramaturgy, this is a technique of doing research by role playing or
play acting your own biases in some symbolic interaction or social performance.
Interviewing is conversation with a purpose. Dramaturgy was popularized by the
sociologist Erving Goffman in the early 1960s and is also associated with the pseudo
patient study "On Being Sane in Insane Places" by Rosenhan in 1973. Both
researchers pretended to be mentally ill to find out what it's like in a psychiatric
hospital. It's important to note that the acting out doesn't have to be deceptive. In fact,
it's preferable if the researcher acts out of a self-conscious awareness of their own bias,
and just exaggerates a bit, in order to instigate a more emotional response from the
person being interviewed. A researcher interested in the beliefs of devout Catholics,
for example, might start asking "So you're Catholic, huh? I hear Catholics engage in
cannibalism when they go to Mass, is that true?" Knowing your biases is different
from bracketing those biases, the latter requiring not just an awareness, but being hard
on yourself, and developing a special openness or frankness that is the hallmark of a
dramaturgical researcher. At a minimum, you should examine yourself according to
the following:
- your sex, age, ethnicity, religion, political party, and favourite psychological
  theory
- the ways in which these characteristics might bias you in your efforts at
  interviewing
- the ways in which you might counteract these biases
- the ways in which your efforts to counteract your biases might lead to other
  biases
Rapport and trust come from meeting the interviewee's expectations about ascribed
and achieved characteristics (gender, age, race, mannerisms, etc.), and then the
interview proceeds in a semi-directed manner with the interviewer (always self-
consciously) acting out on some bias believed to be associated with their own
characteristics or those of the interviewee (if different). In the first case, the researcher
is a dramaturgical performer; in the second case, a dramaturgical choreographer. The
thing to focus on with this technique is the nonverbal body language, as it is believed
that affective messages contained therein are more important than verbal messages. A
debriefing session is usually held after the dramaturgical interview. This method is
probably one of the most difficult qualitative methods, as its basis is in
phenomenological theory, but it has many advocates who point to its therapeutic
value for both interviewer and interviewee.
(vi) Sociometry:
This is the measurement of social distance between group members. More precisely, it
is the assessment of attractions and repulsions between individuals in a group and
with the group structure as defined by feelings. The method was first established by
the social psychologist J.L. Moreno in 1934, and to this day, always involves a
graphical depiction of the structure of group relations called a sociogram. The
procedure for constructing a sociogram begins with a questionnaire-based sociometric
test which asks each group member the following:
- name two or three peers you like the most, like working with, or are your best
  friends
- name two or three peers you least like, dislike working with, or that you reject
  as friends
- rate every member of the group in terms of like or dislike on a 5-point scale
After the mean ratings are collated, and one has identified what social structures
exist, the researcher then locates appropriate guides, informants, and gatekeepers to
the group. Fieldwork, or ethnography, is engaged in to obtain field notes. Together
with a coding and analysis of one's field notes and the collated results of sociometric
testing, the researcher draws up a sociogram depicting star and satellite cliques,
dyads, triads, and so forth. The arrows in the sociogram contain a number obtained by
dividing an individual's column score by n-1. A summary table usually accompanies
the sociogram showing the frequency distributions. An example of a sociogram
appears below:
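Separately from the diagram, the number attached to each member (the column score divided by n-1) can be sketched in code. This is a minimal illustration that assumes the column score is the number of times a member is named by others; the function name and the four-person group are hypothetical.

```python
def sociometric_status(choices):
    """Return each member's column score divided by n - 1.

    `choices` maps each group member to the set of peers that member named.
    A member's column score is the number of times others named that member.
    """
    members = list(choices)
    n = len(members)
    if n < 2:
        raise ValueError("a sociometric test needs at least two members")
    column_score = {m: 0 for m in members}
    for chooser, chosen in choices.items():
        for peer in chosen:
            # Ignore self-nominations and names outside the group.
            if peer != chooser and peer in column_score:
                column_score[peer] += 1
    return {m: column_score[m] / (n - 1) for m in members}


# Hypothetical answers to "name the peers you like the most".
liked = {
    "Ann": {"Ben", "Cat"},
    "Ben": {"Ann"},
    "Cat": {"Ann", "Ben"},
    "Dan": {"Ann"},
}
status = sociometric_status(liked)
```

In this made-up group Ann is named by all three of her peers, so she would be drawn as the "star"; Dan, named by nobody, sits on the edge of the sociogram.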
(vii) Natural experiment:
This refers to a situation where a split or division has occurred between group
members, and the researcher is afforded an opportunity to study the differentiation
process of social structure. For example, suppose one group of students at a
university received a campus crime report newsletter in the mail while on vacation,
and another group did not. Both groups, however, had a chance to review a second
newsletter once they got on campus. The researcher could then survey or interview all
of them once they got on campus, and not only make meaningful comparisons about
the perceived helpfulness of the first report versus the second, but also draw inductive
inferences about concern for crime and campus safety generally. Increases or decreases
in posted speed limits are natural experiments, for example.
(viii) Unobtrusive measures:
These are ways of gathering data in which subjects are not aware of their being
studied, and are sometimes called nonreactive measures. They usually involve
clandestine, novel, or oddball collection of trace data that falls into one of two
categories: accretion or erosion. Accretion is the stuff left behind by human activity.
An example would be going through someone's garbage. Erosion is the stuff that is
worn down by human activity. An example would be examining wear and tear on
floor tiles to estimate how much employees use the restroom. Examination of graffiti
and vandalism are examples of unobtrusive measures in criminal justice. Nobody
claims that unobtrusive measures are superior to other research methods. Their only
advantage is that they are useful when the subjects to be studied are very suspicious and
distrustful.
(ix) Content analysis:
This is a technique for gathering and analysing the content of text. The content can be
words, phrases, sentences, paragraphs, pictures, symbols, or ideas. It can be done
quantitatively as well as qualitatively, and computer programs can be used to assist
the researcher. The initial step involves sorting the content into themes, which
depends on the content. If you were studying white collar crime, for example, you
might have themes like planning, action, and cover-up. Then, a coding scheme is
devised, usually in basic terms like frequency (amount of content), direction (who the
content is directed to), intensity (power of content), and space (size of content). The
coding system is used to reorganize the themed content in what is called manifest
coding. Manifest coding is highly reliable because you can train assistants to do it,
ensuring inter-coder reliability, and all you're doing is using an objective method to
count the number of times a theme occurs in your coding scheme. At the next level,
the researcher engages in what is called latent coding. This requires some knowledge,
usually gained from fieldwork or observation, about the language rules, or semiotics,
of your subjects. It is less reliable than manifest coding, but involves the researcher
using some rubric or template to make judgment calls on implicit, ironic, or doubtful
content. Since not everything always fits in categories, there's always some leftover
content to be accounted for, and it must be interpreted in context by a knowledgeable
researcher who knows something about the culture of his/her subjects.
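Manifest coding, as described above, amounts to counting how often each theme occurs. The sketch below shows one simple way to do that for the white collar crime themes; the keyword lists, the function name and the sample sentence are all hypothetical, and a real coding scheme would be far more detailed.

```python
import re

# Hypothetical coding scheme: each theme is defined by a few keyword stems.
CODING_SCHEME = {
    "planning": ["plan", "scheme", "arrange"],
    "action": ["transfer", "forge", "embezzle"],
    "cover-up": ["shred", "conceal", "destroy"],
}


def manifest_code(text, scheme=CODING_SCHEME):
    """Count the words in `text` that begin with a keyword stem of each theme."""
    words = re.findall(r"[a-z]+", text.lower())
    counts = {theme: 0 for theme in scheme}
    for word in words:
        for theme, stems in scheme.items():
            # A word is counted at most once per theme.
            if any(word.startswith(stem) for stem in stems):
                counts[theme] += 1
    return counts


document = (
    "The manager planned the scheme in March, forged two invoices, "
    "and later shredded the records and destroyed the backups."
)
frequencies = manifest_code(document)
```

Because the rule is mechanical, two assistants applying it to the same text get the same counts, which is the source of the inter-coder reliability mentioned above. Latent coding, by contrast, cannot be reduced to a keyword match.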
There are strict limitations on the inferences a researcher can make with content
analysis. For example, inferences about motivation or intent cannot normally be
made, nor can the researcher infer what the effect of seeing such content would be on
a viewer. Content analysis is only analysis of what is in the text. A researcher cannot
use it to prove that newspapers intended, for example, to mislead the public, or that a
certain style of journalism has a particular effect on public attitudes. The most
common inferences in content analysis make use of concepts like unconscious bias or
unintended consequences, and these are not the same as saying intentional bias or
intended effect. Content analysis has been applied extensively to all kinds of media:
newspapers, magazines, television, movies, and the Internet. Intelligence and law
enforcement agencies also do content analysis regularly on diplomatic channels of
communication, overseas phone calls, and Internet emails. A key point to remember is
that the more quantitative aspects of content analysis come first; the qualitative part of
the analysis comes last, although some advocates say the technique involves moving
back and forth between quantitative and qualitative methods.
(x) Historiography:
This is the method of doing historical research or gathering and analysing historical
evidence. There are four types of historical evidence: primary sources, secondary
sources, running records, and recollections. Historians rely mostly on primary sources
which are also called archival data because they are kept in museums, archives,
libraries, or private collections. Emphasis is given to the written word on paper,
although modern historiography can involve any medium. Secondary sources are the
work of other historians writing history. Running records are documentaries
maintained by private or non-profit organizations. Recollections are autobiographies,
memoirs, or oral histories. Archival research, which is the most common, involves
long hours of sifting through dusty old papers, yet inspection of untouched documents
can yield surprising new facts, connections, or ideas. Historiographers are careful to
check and double-check their sources of information, and this lends a good deal of
validity and reliability to their conclusions. Inferences about intent, motive, and
character are common, with the understanding of appropriateness to the context of the
time period. Historical-comparative researchers who do historiography often have to
make even more disclaimers about meanings in context, such as how they avoided
western bias.
An interesting variety of historical research is "prosopography" or prosopographic
analysis (Stone 1972). Although doubts may exist about its proper place in research
methods and the techniques are more akin to "profiling" in political psychology than
anything else, prosopography involves the study of biographical details (family
background, childhood events, educational background, religion, etc.) that are found
"in common" or "in the aggregate" among a group of people. The typical groups
studied by this method are Presidents, political leaders, generals, professors, terrorists,
and/or elites in society. Sometimes the method yields significant insights by
combining the common background elements in individual profiles. The method is
considered a useful corrective to the more one-sided, single biography technique often
found in the more-or-less mass market books aimed at those interested in
biographies. Specifically, it corrects the tendency toward "hagiography" or hero-
worship.
Secondary analysis is the reanalysis of data that was originally compiled by another
researcher for purposes other than the one the present researcher has in mind. Several
datasets in criminal justice and criminology exist just for this purpose. The UCR
(Uniform Crime Reports), for example, can be analysed in a number of ways other
than for its purpose as being a health scorecard for the nation. Often, secondary
analysis will involve adding an additional variable to an existing dataset. This variable
will be something that the researcher collects on their own, from another dataset, or
from a common source of information. For example, one could take police call for
service data and combine it with lunar cycles from the Farmer's Almanac to study the
effect of full moons on weird human behaviour. Secondary data analysis is only
limited by the researcher's imagination. While the technique is mostly quantitative,
limitations exist that often force such researchers to have some qualitative means of
garnering information also. In such cases (as with much Historical-Comparative
research), the qualitative part of the study is used as a validity check on the
quantitative part.
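Adding a variable to an existing dataset, as in the full moon example above, can be sketched as follows. The records, the day labels and the function name are hypothetical and stand in for a real calls-for-service file and a real almanac.

```python
def add_variable(records, key, new_name, lookup):
    """Return copies of `records`, each with one extra variable.

    `lookup` is called with the value stored under `key` and returns the value
    of the new variable for that record. The original records are not changed.
    """
    return [{**record, new_name: lookup(record[key])} for record in records]


# Hypothetical existing dataset: daily counts of police calls for service.
calls_for_service = [
    {"date": "day 1", "calls": 118},
    {"date": "day 2", "calls": 131},
    {"date": "day 3", "calls": 122},
]

# Hypothetical second source: the days on which the moon was full.
full_moon_days = {"day 2"}

merged = add_variable(
    calls_for_service, "date", "full_moon", lambda day: day in full_moon_days
)
```

Once the new variable is attached, the calls on full moon days can be compared with the calls on other days using whatever statistical method suits the data.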
UNIT 6: SAMPLING METHODS
LEARNING OUTCOMES
When you have completed this unit you will be able to:
Explain what is meant by sampling
Explain the different sampling methods
Identify the various types of sampling
Explain the importance of sampling in research
What is sampling?
Definition box
(i) First, collecting data for a sample is less expensive than for a census.
(ii) Second, collecting data from fewer people can be done faster than
conducting a census.
(iii) Third, more attention can be given to each person than would be
possible for a census. More attention to each person can result in
more accurate data of higher quality and higher response rates.
Concepts in sampling:
Sampling Methods
In nonprobability sampling, members are selected from the population in some non-
random manner. These include convenience sampling, judgment sampling, quota
sampling, and snowball sampling. The advantage of probability sampling is that
sampling error can be calculated.
Sampling error is the degree to which a sample might differ from the population.
When inferring to the population, results are reported plus or minus the sampling
error. In nonprobability sampling, the degree to which the sample differs from the
population remains unknown.
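As an illustration of reporting a result "plus or minus the sampling error", the sketch below uses the common textbook approximation for a proportion estimated from a simple random sample. The formula, the 95% confidence level (z = 1.96) and the survey figures are assumptions added here for illustration; they are not taken from this module.

```python
import math


def margin_of_error(p_hat, n, z=1.96):
    """Approximate sampling error for a proportion from a simple random sample.

    Assumes a large population and a sample big enough for the normal
    approximation; z = 1.96 corresponds to roughly 95% confidence.
    """
    if not 0 <= p_hat <= 1:
        raise ValueError("p_hat must be between 0 and 1")
    if n <= 0:
        raise ValueError("n must be positive")
    return z * math.sqrt(p_hat * (1 - p_hat) / n)


# Hypothetical survey: 400 respondents, of whom 50% answer "yes".
moe = margin_of_error(0.5, 400)
print(f"50% plus or minus {moe * 100:.1f} percentage points")
```

No comparable calculation is possible for a nonprobability sample, because the chance of each member being selected is unknown.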
1. Probability Sampling Design
This refers to sampling when the chance of any given individual being selected is
known and these individuals are sampled independently of each other. This is also
known as random sampling. A researcher can simply use a random number generator
to choose participants (known as simple random sampling), or every nth individual
(known as systematic sampling) can be included. Researchers also may break their
target population into strata, and then apply these techniques within each stratum to
ensure that they are getting enough participants from each stratum to be able to draw
conclusions. For example, if there are several ethnic communities in one
geographical area that a researcher wishes to study, that researcher might aim to have
30 participants from each group, selected randomly from within the groups, in order
to have a good representation of all the relevant groups.
Sampling Techniques
(i) Random sampling is the purest form of probability sampling. Each member
of the population has an equal and known chance of being selected. When
there are very large populations, it is often difficult or impossible to
identify every member of the population, so the pool of available subjects
becomes biased.
(ii) Systematic sampling is often used instead of random sampling. It is also
called an Nth name selection technique. After the required sample size has
been calculated, every Nth record is selected from a list of population
members. As long as the list does not contain any hidden order, this
sampling method is as good as the random sampling method. Its only
advantage over the random sampling technique is simplicity. Systematic
sampling is frequently used to select a specified number of records from a
computer file.
(iii) Stratified sampling is a commonly used probability method that is superior to
random sampling because it reduces sampling error. A stratum is a subset
of the population whose members share at least one common characteristic. Examples
of strata might be males and females, or managers and non-managers.
The researcher first identifies the relevant strata and their actual
representation in the population. Random sampling is then used to select a
sufficient number of subjects from each stratum. "Sufficient" refers to a
sample size large enough for us to be reasonably confident that the stratum
represents the population. Stratified sampling is often used when one or
more of the strata in the population have a low incidence relative to the
other strata.
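The stratified procedure described above (identify the strata, then sample randomly within each) can be sketched as follows. The function name, the staff list and the choice of five people per stratum are hypothetical.

```python
import random
from collections import defaultdict


def stratified_sample(population, stratum_of, per_stratum, seed=None):
    """Draw a simple random sample of `per_stratum` members from every stratum.

    `population` is a list of members; `stratum_of` is a function that returns
    the stratum label of a member.
    """
    rng = random.Random(seed)
    strata = defaultdict(list)
    for member in population:
        strata[stratum_of(member)].append(member)
    sample = {}
    for label, members in strata.items():
        if len(members) < per_stratum:
            raise ValueError(f"stratum {label!r} has only {len(members)} members")
        # Sampling without replacement inside the stratum.
        sample[label] = rng.sample(members, per_stratum)
    return sample


# Hypothetical staff list: 10 managers and 90 non-managers.
staff = [
    ("employee%03d" % i, "manager" if i < 10 else "non-manager")
    for i in range(100)
]
chosen = stratified_sample(
    staff, stratum_of=lambda person: person[1], per_stratum=5, seed=1
)
```

A plain random sample of ten people from this list could easily contain no managers at all; sampling within each stratum guarantees that the low-incidence group is represented.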
beyond such a narrow sample. For example, snowball sampling is an approach for
locating information-rich key informants. Using this approach, a few potential
respondents are contacted and asked whether they know of anybody with the
characteristics that you are looking for in your research. Snowball sampling is not a
stand-alone tool; the tool is a way of selecting participants and then using other tools,
such as interviews or surveys.
Sampling Techniques
With a single grain of rice, a housewife tests if all the rice in the pot has boiled; from
a cup of tea, a tea-taster determines the quality of the brand of tea; and a sample of
moon rocks provides scientists with information on the origin of the moon. This
process of testing some data based on a small sample is called sampling.
Definition:
Sampling is the process by which inference is made to the whole by examining a part.
Purpose of Sampling
In this method each item of the data (population) has the same probability of being
selected in the sample. The selection is usually made with the help of random
numbers.
Suppose there are N=850 students in a school from which a sample of n=10
students is to be taken. The students are numbered from 1 to 850. Since our
data runs into three digits we use random numbers that contain three digits. All
numbers exceeding 850 are ignored because they do not correspond to any
serial numbers in the data. In case the same number occurs again, the
repetition is skipped.
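The random-number procedure in this example can be sketched in code. The function name and the seed are hypothetical; the steps (draw a three-digit number, ignore numbers above 850, skip repetitions) follow the description above.

```python
import random


def simple_random_sample(N, n, seed=None):
    """Select n distinct serial numbers from 1..N using random digits."""
    if n > N:
        raise ValueError("sample size cannot exceed population size")
    rng = random.Random(seed)
    digits = len(str(N))  # 850 has three digits, so draw three-digit numbers
    selected = []
    while len(selected) < n:
        number = rng.randrange(10 ** digits)  # 000 to 999 when N = 850
        if number < 1 or number > N:
            continue  # does not correspond to any serial number
        if number in selected:
            continue  # repetition is skipped
        selected.append(number)
    return selected


sample = simple_random_sample(N=850, n=10, seed=42)
```

In practice the same result is usually obtained with a single library call such as `random.sample(range(1, 851), 10)`; the loop above only mirrors the manual procedure with a table of random numbers.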
Systematic Sampling
In this method, the data items are first numbered from 1 to N. If the sample size is n,
the sampling interval is calculated by dividing N by n. A random number between 1
and N/n is then generated, and the data item with that number is selected for the
sample. The other items in the sample are obtained by adding the sampling interval N/n
successively to the random number.
The advantage of this method is that the sample is evenly distributed over the entire data.
The town of Fairfax is divided up into N = 576 blocks which are numbered
consecutively. A 10 percent sample of blocks is to be taken, which gives a
sampling interval of k = 10. If the random number between 1 and 10 is 3, the
blocks with the numbers 3, 13, 23, ..., 573 are in the sample.
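The Fairfax example can be worked through in code. The function name is hypothetical; the values N = 576, k = 10 and the random start of 3 come from the example.

```python
import random


def systematic_sample(N, k, start=None, seed=None):
    """Select every k-th item from items numbered 1..N, beginning at `start`.

    If no start is given, a random start between 1 and k is drawn.
    """
    if start is None:
        start = random.Random(seed).randint(1, k)
    if not 1 <= start <= k:
        raise ValueError("start must lie between 1 and k")
    # Add the sampling interval k successively to the random start.
    return list(range(start, N + 1, k))


blocks = systematic_sample(N=576, k=10, start=3)
```

With a start of 3 the selected blocks are 3, 13, 23 and so on up to 573, which gives 58 blocks, close to 10 percent of 576.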
When the data items vary considerably in size, a simple random or a systematic
random sample of items does not produce a good estimate due to high variability. In
such a situation we get a better estimate by giving higher probability of selection to
the larger data items.
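Giving larger items a higher probability of selection can be sketched as below. This is one simple variant, sampling with replacement with probability proportional to size; the function name and the village figures are hypothetical, and other selection schemes exist.

```python
import random


def pps_sample(sizes, n, seed=None):
    """Draw n items with replacement, each with probability proportional to size.

    `sizes` maps each item to its size measure, which must be positive.
    """
    items = list(sizes)
    weights = [sizes[item] for item in items]
    if any(w <= 0 for w in weights):
        raise ValueError("all sizes must be positive")
    rng = random.Random(seed)
    return rng.choices(items, weights=weights, k=n)


# Hypothetical villages and their numbers of households.
village_households = {"A": 50, "B": 150, "C": 800}
draws = pps_sample(village_households, n=1000, seed=0)
```

Village C holds 80 percent of the households, so it is expected to account for roughly 80 percent of the draws, while village A is selected only occasionally.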
UNIT 7: DATA ANALYSIS
LEARNING OUTCOMES
When you have completed this unit you will be able to:
Understand the process of data analysis
Explain the different steps of data analysis for qualitative and quantitative
research
Analysing survey data is an important and exciting step in the survey process. It is the
stage at which you may reveal important facts about your customers, uncover trends that
you might not otherwise have known existed, or provide irrefutable facts to support
your plans. By doing in-depth data comparisons, you can begin to identify
relationships between various data that will help you understand more about your
respondents, and guide you towards better decisions.
Assuming you need to analyse the data collected from your survey, the process begins
with a quick review of the results, followed by editing, analysis, and reporting. To
ensure you have accurate data before investing significant time in analysis, it is
important that you do not begin analysing results until you have completed the review
and editing process.
Quick Review
Read all your results. Although this seems like an obvious thing to do, many
surveyors think that they can skip this step and dive right into data analysis. A quick
review can tell you a lot about your project, including any flaws in questionnaire
design or response population, before you spend hours analysing the data.
During the quick review, you should look at every question and see if the results
"make sense". This "gut feel" check of the data will often uncover any issues with
your survey project. Most surveyors already have an idea of how they expect their
data to look. A quick review of the data can quickly tell you whether the people
who responded are the right people. For example, if you were conducting a
survey of all the employees in a company and you knew that 10% were in the
marketing department, 20% in sales, 45% in manufacturing, 5% in management,
5% in finance, and 15% in research and development, you could reasonably expect your
responses to be similarly distributed. If your quick review disclosed that 80% of your
respondents were from the sales department, you would know that your survey did not
adequately capture a representative sample of all departments within the company.
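This check can be sketched as a comparison between the expected and observed share of each department. The expected shares come from the example above. The function name share_gaps, the 10-point tolerance, and the list of responses are assumptions made for the illustration.

```python
from collections import Counter

# Expected shares taken from the company example in the text.
expected_share = {
    "marketing": 0.10, "sales": 0.20, "manufacturing": 0.45,
    "management": 0.05, "finance": 0.05, "research and development": 0.15,
}

def share_gaps(departments, expected, tolerance=0.10):
    """Return departments whose observed share differs from the expected
    share by more than the tolerance (an assumed threshold)."""
    total = len(departments)
    if total == 0:
        return {}
    counts = Counter(departments)
    gaps = {}
    for dept, share in expected.items():
        observed = counts[dept] / total
        if abs(observed - share) > tolerance:
            gaps[dept] = round(observed - share, 2)
    return gaps

# Hypothetical responses in which sales is heavily over-represented.
responses = (["sales"] * 80 + ["manufacturing"] * 10
             + ["marketing"] * 5 + ["management"] * 5)
print(share_gaps(responses, expected_share))
```

A positive gap means the department is over-represented among respondents. A negative gap means it is under-represented.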
The quick review can also highlight any problems with the survey instrument. Are
most respondents answering all questions? If not, your questionnaire could be flawed
in such a way that a person cannot complete the survey. A low response rate could
mean your survey invitation was not compelling enough to encourage participation, or
your timing was off and a follow-up reminder is needed.
Lastly, the quick review of the survey can show you what areas to focus on for
detailed analysis. As stated earlier, most surveyors already know what they expect to
get, so your quick review can show you the unexpected.
Editing and cleaning data is an important step in the survey process. Special care must
be taken when editing survey data so that you do not alter or throw out responses in
such a way as to bias your results. Although you can begin editing and cleaning your
data as soon as results are received, caution should be used since any edits can be lost
if the database is rebuilt. To be safe, wait until all data is received before you begin
the editing and cleaning process.
To start, find and delete incomplete and duplicate responses. A response should be
discarded if the respondent did not complete enough of the survey to be meaningful.
For example, if your survey was intended to determine future buying intentions across
various demographic groups and the respondent did not answer any of the
demographic questions, you should delete the response. On the other hand, if the
respondent answered all the demographic questions but omitted their name or email
address, then you should keep the response.
Duplicate responses are a unique issue for electronic surveys. Many tools, such as
eSurveysPro, provide built-in features to help minimize the risk of duplicate
responses. Others, like the popular "infotainment" polls featured on many websites, do
nothing to eliminate duplicates. Without removing duplicates, your data will be
skewed in favour of the duplicate response. Both the count and percentage of the
whole will be affected by duplicate responses, and computed means and medians will
also be thrown off. To find duplicate responses, carefully examine the answers to any
open-ended questions. When two responses give exactly the same answer to an
open-ended question, a duplicate response is likely to exist. Make sure the response is indeed a duplicate by
comparing the answers to all the other questions, and then delete one of the responses
if a match is found.
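The duplicate check described here can be sketched as follows. The sketch assumes each response is stored as a dictionary of question and answer pairs. The field name response_id and the function name drop_duplicates are hypothetical.

```python
def drop_duplicates(responses, id_field="response_id"):
    """Keep the first copy of each response whose answers to every
    question match; the identifier field is ignored in the comparison."""
    seen = set()
    unique = []
    for response in responses:
        # Compare answers to all questions, not only the open-ended one.
        key = tuple(sorted((q, a) for q, a in response.items() if q != id_field))
        if key in seen:
            continue            # an identical response was already kept
        seen.add(key)
        unique.append(response)
    return unique

# Hypothetical responses; the first and third have identical answers.
survey = [
    {"response_id": 1, "rating": 4, "comment": "Delivery was slow"},
    {"response_id": 2, "rating": 5, "comment": "Great service"},
    {"response_id": 3, "rating": 4, "comment": "Delivery was slow"},
]
print(drop_duplicates(survey))
```

An automatic match is only a starting point. It is still worth reading the flagged responses before deleting anything.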
A common problem in any survey that needs attention during the editing and cleaning
process is when a respondent answers an "other, please specify" question by selecting
"other" and then writing in an answer that was one of the listed response options.
Without cleaning these answers, the "other" response will be overstated and the
correct response will be understated. For example, suppose a demographics question
that asks for the respondent's role within the organization offers a response option
like "faculty, teacher, or student", and a respondent selects "other" and types
"professor". You would want to clean the response by switching the "other" choice to
the one for "faculty, teacher, or student".
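The cleaning step in this example can be sketched as a small lookup. The field names role and role_other, the synonyms table, and the function name recode_other are all assumptions for the illustration. In practice the table of write-in answers is built by reading the actual "other" responses.

```python
def recode_other(response, synonyms, choice_field="role", text_field="role_other"):
    """Move a write-in answer back to the listed option it belongs to."""
    if response.get(choice_field) != "other":
        return response
    written = response.get(text_field, "").strip().lower()
    listed = synonyms.get(written)
    if listed is None:
        return response          # a genuine "other" answer, left unchanged
    cleaned = dict(response)     # copy so the raw response is not altered
    cleaned[choice_field] = listed
    cleaned[text_field] = ""
    return cleaned

# Hypothetical lookup table built while reviewing the write-in answers.
synonyms = {"professor": "faculty, teacher, or student"}
raw = {"role": "other", "role_other": "Professor"}
print(recode_other(raw, synonyms))
```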
Once the data preparation is complete, it is time to start analysing the data and turning
it into actionable information.
Detailed Analysis
Analysis is the most important aspect of your survey research project. At this point,
you have collected a set of data that must now be turned into actionable information.
The process of analysis can lead to a variety of alternative courses of action. Mistakes
during analysis can lead to costly decisions down the road, so extreme caution and
careful review must be followed throughout the process. Carelessness during analysis
can lead to disaster. What you do during analysis will ultimately determine whether your
survey project is successful or not.
Depending on what type of information you are trying to learn about your audience,
you will have to decide what analysis makes sense. It can be as simple as reviewing
the graphs that eSurveysPro automatically creates, or as involved as conducting in-depth
comparisons between question sets to identify trends or relationships. For most
surveyors, a basic analysis using charts, cross tabulations, and filters is sufficient. On
the other hand, more sophisticated users may wish to do a more complex statistical
analysis using high-powered analytical tools such as SPSS, Excel, or any number of
number-crunching applications. For our purposes in this article, we will focus on basic
analysis techniques.
Graphical Analysis
Graphical analysis simply means displaying the data in a variety of visual formats that
make it easy to see patterns and identify differences among the results. There are
many different graphing options available to display data; the most common are bar,
pie, and line charts.
Bar charts use solid bars on an X and Y-axis that extend to meet a specific data value
indicated on the chart and can be shown either vertically or horizontally. These charts
are flexible and are most commonly used to display data from multiple-select, rank
order, single-select matrix and numerical questions. Each response option is shown as
an independent bar on the chart, and the length of the bar represents the frequency the
response was chosen relative to all choices.
Pie charts, or circle graphs, have colourful "slices" representing segments of your
data. These charts measure values as compared to a "whole", and the total percentages
of the segments always add up to 100%. Pie charts are most useful with single-select
questions because each response is represented visually as a portion of the entire
pie. It is easy to see which answer received the most responses in a pie chart by
finding the largest portion of the pie. When comparing two sets of data using pie
charts, it is important to make sure the colours used for each response option remain
consistent in each chart. This way, a side-by-side visual comparison can quickly be
made. Pie charts are not appropriate for multiple-select questions because each
respondent can choose more than one option, and the sum of the option percentages
can exceed 100%.
There are other graphing options such as line charts, area charts and scatter graphs,
which are useful when displaying the same data over a period of time. However, these
formats are not as easy to interpret for casual users, so they should be used sparingly.
Frequency Tables
Frequency tables are another form of basic analysis. These tables show the possible
responses, the total number of respondents for each part, and the percentages of
respondents who selected each answer. Frequency tables are useful when a large
number of response options are available, or the differences between the percentages
of each option are small. In most cases, pie or bar charts are easier to work with than
frequency tables.
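A frequency table of this kind can be built with a simple count. In the sketch below the answers are invented and the function name frequency_table is hypothetical.

```python
from collections import Counter

def frequency_table(answers):
    """Return (option, count, percentage) rows, most frequent first."""
    counts = Counter(answers)
    total = sum(counts.values())
    return [(option, count, round(100 * count / total, 1))
            for option, count in counts.most_common()]

# Hypothetical answers to a single-select question.
answers = ["yes"] * 6 + ["no"] * 3 + ["unsure"] * 1
for option, count, percent in frequency_table(answers):
    print(f"{option:<8}{count:>4}{percent:>8}%")
```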
Cross Tabulation
Cross tabulations, or cross tabs, are a good way to compare two subgroups of
information. Cross tabs allow you to compare data from two questions to determine if
there is a relationship between them. Like frequency tables, cross tabs appear as a
table of data showing answers to one question as a series of rows and answers to
another question as a series of columns.
Cross tabs are used most frequently to look at answers to a question among various
demographic groups. The intersections of the various columns and rows, commonly
called cells, are the percentages of people who answered each of the responses. In the
example above, females and males had relatively similar distributions among various
job titles, with the exception of the title of "Technical Product Manager", where 2.5
times as many males had the title as compared to females. For analysis purposes,
cross tabs are a great way to do comparisons.
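A cross tab can be sketched as a count of each combination of answers to two questions. The data below are invented for illustration, the function name cross_tab is hypothetical, and showing each cell as a percentage of its column is an assumption, since a cross tab can equally show row percentages or raw counts.

```python
from collections import Counter

def cross_tab(responses, row_question, column_question):
    """Return {(row answer, column answer): percentage of that column}."""
    cells = Counter((r[row_question], r[column_question]) for r in responses)
    column_totals = Counter(r[column_question] for r in responses)
    return {cell: round(100 * n / column_totals[cell[1]], 1)
            for cell, n in cells.items()}

# Hypothetical respondents: job title broken down by gender.
people = (
    [{"gender": "female", "title": "Analyst"}] * 2
    + [{"gender": "female", "title": "Manager"}] * 2
    + [{"gender": "male", "title": "Analyst"}] * 1
    + [{"gender": "male", "title": "Manager"}] * 3
)
table = cross_tab(people, "title", "gender")
print(table[("Manager", "male")], table[("Manager", "female")])
```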
Filtering
Filtering is the most under-utilized tool in analysis. Filters allow you to select
specific subsets of data to view. Unlike a cross tab, which compares two questions, a
filter allows you to examine all questions for a particular subset of the responses.
For example, by viewing only the data from the people who responded negatively, you
can look at how they answered other questions and find patterns or trends that help
explain why a person answered the way they did. You can even filter on multiple
questions and criteria to do a more detailed search if necessary. For example, if you
wanted to know the buying intentions of men over the age of 40 with an annual income
of about K50,000, you would set a filter that removes all respondents who do not meet
your criteria from the results set, thus enabling you to concentrate on the target population.
By applying filters to the date survey responses were received, you can see how the
answers change from one time frame to the next. For instance, by continually running
a customer satisfaction survey, you can assess changes in customer attitudes over time
by filtering on the date the survey was received. You can also use a filter on date
received to assess the impact of sales incentive programs or new product offerings by
comparing survey responses before and after the change.
Filters do not permanently remove the responses of those people that do not match the
specified criteria; they simply eliminate them from the current view of the data,
making it much easier to perform analysis. By looking at the same question with
different filters applied, differences between the various respondents represented by
the filter can be quickly seen. Because filters remain in effect until cleared, don't
forget to clear them before attempting to analyse your survey responses as a whole,
otherwise your observations will be inaccurate, and your recommendations flawed.
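A filter of this kind can be sketched as a function that returns a view of the matching responses and leaves the full data set untouched. The records and field names are invented, the function name apply_filter is hypothetical, and treating "about K50,000" as the range K45,000 to K55,000 is an assumption.

```python
def apply_filter(responses, *criteria):
    """Return only the responses that satisfy every criterion.
    The original list is not modified."""
    return [r for r in responses if all(test(r) for test in criteria)]

# Hypothetical survey records.
records = [
    {"gender": "male", "age": 45, "income": 50000, "will_buy": True},
    {"gender": "male", "age": 35, "income": 52000, "will_buy": False},
    {"gender": "female", "age": 50, "income": 49000, "will_buy": True},
    {"gender": "male", "age": 61, "income": 90000, "will_buy": False},
]

target = apply_filter(
    records,
    lambda r: r["gender"] == "male",
    lambda r: r["age"] > 40,
    lambda r: 45000 <= r["income"] <= 55000,   # assumed meaning of "about K50,000"
)
print(len(target), "of", len(records), "responses match the filter")
```

Because the filtered list is a separate view, analysing the survey as a whole simply means going back to the full list of records.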
Reporting
After analysing your survey data, it is time to create a report of your findings. The
complexity and detail needed to support your conclusions, along with your intended
audience, will dictate the format of your report.
required. For more complex topics, a detailed report created in Microsoft Word or
Adobe Acrobat is often required. Reports created using Word often include much
more detailed information, report findings that require significant explanation, are
extremely text heavy, and are often studied at great length and in significant detail.
No matter which type of report you use, always remember that information can be
more powerfully displayed in a graphic format versus a text or tabular representation.
Often, trends and patterns are more obvious and recommendations more effective
when presented visually. Ideally, when making comparisons between two or more groups
of respondents, it is best to show a chart of each group's responses side-by-side. This
side-by-side comparison allows your audience to quickly see the differences you are
highlighting and will lead to more support for your conclusions.
At the beginning of your report, you should review your survey objective and
sampling method. This will help your audience understand what the survey was about,
and enable you to avoid many questions that are outside of your original objectives.
Your report should have a description of your sampling method, including who was
invited to participate, over what time frame results were collected, and any issues that
might exist relative to your respondent pool. Next, you should include your analysis
and conclusions in adequate detail to meet the needs of your audience. Include a table
or graph for each area of interest and explain why it is noteworthy. After your analysis
section, you should make recommendations that relate back to your survey objectives.
Recommendations can range from conducting further studies to a major shift in
company direction. In either case, your recommendation must be within the scope of
your survey objective and supported by the data collected. Finally, you can include a
copy of your survey questions and a summary of all the data collected as an appendix
to your report.
1. Transcribing the interview involves taking notes of the interview...it
is the full 'script' of the interview and the aim is to take a full written
version of the interview
2. Transcribing an interview is very time consuming, with an estimated
time ratio of 5:1 (i.e. 5 hours of transcribing a one hour interview)
5. Tape analysis can be used, which is a combination of the two and involves
the researcher taking notes from the recording
6. Bias must be considered when taking notes or using tape analysis
7. Good quality transcribing relies on skills beyond just taking notes and there
is often space for subjectivity
Content analysis can be used when qualitative data has been collected through:
1. Interviews
2. Focus groups
3. Observation
4. Documentary analysis
Content analysis is '...a procedure for the categorisation of verbal or behavioural data,
for purposes of classification, summarisation and tabulation.'
1. Basic level or the manifest level: a descriptive account of the data i.e. this is
what was said, but no comments or theories as to why or how
2. Higher level or latent level of analysis: a more interpretive analysis that is
concerned with the response as well as what may have been inferred or
implied
Content analysis involves coding and classifying data, also referred to as categorising
and indexing. The aim of content analysis is to make sense of the data collected
and to highlight the important messages, features or findings.
1) Copy and read through the transcript - make brief notes in the margin when
interesting or relevant information is found
2) Go through the notes made in the margins and list the different types of
information found
3) Read through the list and categorise each item in a way that offers a description of
what it is about
4) Identify whether or not the categories can be linked in any way and list them as major
categories (or themes) and / or minor categories (or themes)
6) If there is more than one transcript, repeat the first five stages again for each
transcript
7) When you have done the above with all of the transcripts, collect all of the
categories or themes and examine each in detail and consider if it fits and its
relevance
8) Once all the transcript data is categorised into minor and major categories/themes,
review in order to ensure that the information is categorised as it should be.
9) Review all of the categories and ascertain whether some categories can be merged
or if some need to be sub-categorised
10) Return to the original transcripts and ensure that all the information that needs to
be categorised has been.
The process of content analysis is lengthy and may require the researcher to go over
and over the data to ensure they have done a thorough job of analysis.
1. When planning the presentation of qualitative data, consider that the data
are:
1. Subjective
2. Interpretative
3. Descriptive
4. Holistic
5. Copious
2. It may be suggested that the researcher base the structure of the presentation
of the research around the categories or themes that have emerged
3. The themes or categories may be presented as sections with relevant sub-
sections
4. Quotes can be used to demonstrate, inform or support findings, but it
is recommended that the researcher consider the reliability and validity of
each quote
5. Consideration may also be given to whether or not qualitative data can be
represented in a quantitative form (i.e. 6 out of 10 people...)
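As a small illustration of the last point, the sketch below counts how many transcripts mention each theme once the transcripts have been coded by hand. The themes and interviews are invented, and the function name theme_prevalence is hypothetical. The code does not do the coding itself, which remains the researcher's interpretive work.

```python
def theme_prevalence(coded_transcripts):
    """Return {theme: (transcripts mentioning it, total transcripts)}."""
    total = len(coded_transcripts)
    themes = set().union(*coded_transcripts.values())
    return {theme: (sum(theme in codes for codes in coded_transcripts.values()), total)
            for theme in sorted(themes)}

# Hypothetical result of coding five interview transcripts by hand.
coded = {
    "interview 1": {"cost", "trust"},
    "interview 2": {"cost"},
    "interview 3": {"trust", "access"},
    "interview 4": {"cost"},
    "interview 5": {"cost", "access"},
}
for theme, (count, total) in theme_prevalence(coded).items():
    print(f"{theme}: {count} out of {total} people")
```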
The analysis of research in any project involves summarising the mass of data that has
been collected and then presenting the results in a way that communicates the most
important findings or features.
Software packages are available for the analysis of quantitative and qualitative data.
Each package has different features and the researcher needs to choose carefully. The
aim of all of the packages is to assist in the categorisation and matching process. The
packages can save time, but there is still a great deal of time required to set them up
and input the data and check through the process.
The best-known software packages are listed below. Some have links attached,
which you may wish to read through for further information:
SPSS
http://www.spss.com/uk/statistics/?gclid=COqEmJPdw5sCFRISzAodvX4KdA
ATLAS/ti
http://www.psychologysoftwaredistribution.com/ATLAS_ti/atlas_ti.html
NVivo
http://download.qsrinternational.com/Document/NVivo7/NVivo7_Tutorials_Lyn_Richards.pdf
NUD*IST http://www.sdgassociates.demon.co.uk/learnnudist.htm
QUALPRO
Ethnograph
There are also a number of networks available that are accessible via the Internet,
CAQDAS is one of them, available at http://www.soc.surrey.ac.uk/caqdas
Ethics in Social Research
Research ethics deals primarily with the interaction between researchers and the
people they study. Professional ethics deals with additional issues such as
collaborative relationships among researchers, mentoring relationships, intellectual
property, fabrication of data, and plagiarism, among others.
Researchers must be mindful of the need to respect their subjects’ rights throughout
the research cycle. Agreed-upon standards for research ethics help ensure that as
researchers we explicitly consider the needs and concerns of the people we study, that
appropriate oversight for the conduct of research takes place, and that a basis for trust
is established between researchers and study participants.
This means, first, that researchers must do their subjects no harm. This is the right to
safety. Second, research subjects must have the right to decide whether their attitudes
and behaviours may be revealed to the public and, if so, in what way. This is the right
to privacy. Third, researchers cannot use data in a way that allows them to be traced to
a particular subject. This is the subject’s right to confidentiality.
Fourth, subjects must be told how the information they supply will be used. They
must also be allowed to judge the degree of personal risk involved in answering
questions so that they can decide whether they may be studied and, if so, in what way.
This is the right to informed consent.
Ethical issues arise not only in the treatment of subjects but also in the treatment of
research results. For example, plagiarism is a concern in academic life, especially
among students, who write research papers and submit them to professors for
evaluation. For instance, in the United States of America, a 2003 study found that 38
percent of American college students admitted to committing “cut and paste”
plagiarism when writing essays, up from just 10 percent in 2000 (Edmundson, 2003).
Ready-made essays are also widely available for purchase.
Increased plagiarism is a consequence of the spread of the World Wide Web (www)
and the growing view that everything on it is public and therefore does not have to be
acknowledged or cited. That view is wrong. The Code of Ethics of the American
Sociological Association states that we must “explicitly identify, credit, and reference the author” when
we make any use of another person’s written work, “whether it is published,
unpublished, or electronically available” (American Sociological Association, 1999:
16). Making such ethical standards better known can help remedy the problem of
plagiarism.
Powerful web-based applications are now available that can help university
instructors determine whether essays are plagiarized in whole or in part (visit
http://www.turnitin.com). Perhaps the most effective remedy, however, is for lecturers
to ensure that what they teach really matters to their students. If they do, students
won’t be as inclined to plagiarize because they will regard essay writing as a process
of personal discovery. You can’t cut and paste or buy enlightenment (Edmundson,
2003).
Some additional examples of sampling frames are phone books, college student
directories, directories of members of an association, a list of all the teachers in your
county, etc. Note that some sampling frames are better than others; for example, the
phone book excludes many people (that’s why a special technique called random digit
dialling is used to obtain telephone samples rather than relying on the phone book).
6. What do all of the “equal probability selection methods” (i.e., EPSEM)
have in common?
Each member of the population has an equal chance of being selected into the sample
in each of these selection methods. By the way, note that simple random sampling is
not the only equal probability sampling method.
Remember that the primary purpose of an experiment is to make statements
about cause and effect. Making statistical generalizations to populations is of
secondary importance for individual experimental studies.
In Chapter 10, this difference will be discussed under the terms internal
validity (making valid causal statements) and external validity (making valid
generalizations). The bottom line will be that random assignment is very
important for internal validity and random selection is very important for
external validity.
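The difference between the two ideas can be sketched in a few lines. Random selection decides who enters the study, and random assignment decides which group each selected person joins. The population of 250 numbered people, the two equal groups, and the function name select_then_assign are assumptions made for the illustration.

```python
import random

def select_then_assign(population, sample_size, seed=None):
    """Randomly select a sample, then randomly assign it to two groups."""
    rng = random.Random(seed)
    # Random selection: supports generalising to the population.
    sample = rng.sample(population, sample_size)
    # Random assignment: supports causal statements about the treatment.
    # rng.sample already returns a random order; the shuffle is kept only
    # to make the assignment step explicit.
    rng.shuffle(sample)
    half = len(sample) // 2
    return sample[:half], sample[half:]

population = list(range(1, 251))   # hypothetical population of 250 people
treatment, control = select_then_assign(population, 20, seed=7)
print("treatment group:", sorted(treatment))
print("control group:", sorted(control))
```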
13. If your population size is 250,000, then how many participants will you
need, at a minimum, for your research study?
Mixed purposeful sampling (mixing of more than one of the above sampling
strategies).