Professional Documents
Culture Documents
Data Analysis
Topic Preview
The purpose of this section is to consider how to approach data
analysis by reflecting on the following points:
how to organise, analyse and represent qualitative data
Counting, categorising, relating and predicting quantitative data
Topic Content
Introduction
The qualitative data analysis process can be broken down into three
stages: reduction, organisation and interpretation of the data.
Data Reduction
Qualitative research provides a vast amount of data which needs to be
reduced into a manageable form. This can be done by coding the data.
Open interviews require a detailed coding system which allows for the
richness of the data to be processed. Coding is when key words or short
two or three letter codes are used to represent a theme. The researcher
methodically works through the transcript or copy of notes, line by line or
paragraph by paragraph, and assigns the codes in the margin of the text.
This enables the researcher to begin to see patterns, categories or
themes.
Data organisation
The organisation part of the process is the systematic collating of coded
bits of data on the same theme from across all the interviews. If your
records are electronic then is it simply a matter of copying (not cutting)
and pasting the relevant bits of text within a new themed document.
Again, this is a laborious task but relatively simple if the coding has been
carried out efficiently.
There are a limited number of computer packages which help with
analysis of coded qualitative data by pulling together text associated with
each code. However, the researcher still has the laborious task of
examining the text in detail to assign the codes. Packages like this
require time to get to grips with and most researchers, particularly at the
Masters stage, find MS Word adequate.
If the notes or transcripts are on paper then the process is more difficult
and is carried out by copying and then cutting the paper or by rewriting
the information on another thematic sheet. Any manual rewriting like
this of course introduces the chance of error and care must be taken to
do a spot check to ensure that copying has been done accurately.
These new thematic documents of verbatim quotes and notes provide the
basis for the next stage - analysis or interpretation.
Strongly agree 1
Agree 2
Disagree 3
Strongly Disagree 4
The codes are visible to the respondent who may be asked to circle the
relevant number rather than ticking an empty box. The codes have no
scalar meaning – they are simply code numbers which can be input to a
stats package for the purpose of calculating the frequency of the various
responses in the sample.
However, codes can be confusing for respondents who may assign a
significance to the codes which does not exist. In the above example, a
respondent might be tempted to assume that there was some kind of
scalar significance to the codes, and that “strongly disagree” was
somehow considered to be 4 times as important as the “strongly agree”
option. To avoid any chance of misunderstanding of the significance of
the codes, many researchers prefer to assign codes to answers after the
return of questionnaires. This is called post-coding. With post-coding
the respondent is unaware of any code numbers in the questionnaire and
questions would typically invite the respondent to tick a box:
How frequently on average do you use the library?
(Please Tick)
Daily
Once a week
Once a month
Annually
Never
Master Template
Do you get information from any of the following sources? (Tick any
that apply)
for office use
School Policy resource collections 1
Personal buying, borrowing and reading 2
Other teachers 3
Schools Library Service 4
School Librarian(s) 5
Educational Librarians 6
1 2 3 4 for office
use
You and other teachers o o o o 1
You and the library o o o o 2
You and pupils o o o o 3
You and senior management o o o o 4
Your primary and associated
Secondary schools o o o o 5
Open questions may also be coded for analysis but in this case the
procedures follows that for qualitative data although the themes
emerging could easily be assigned a numeric code if you want to
calculate frequency with which the various themes arose within the
sample.
When coding questionnaires, it is good practice to code up answers to
one question across the whole sample of returned questionnaires before
moving on to code the next question across the sample. In other words,
do not be tempted to code a complete questionnaire at a time, where you
will have to move between questions and run the risk of errors occurring
as your eye moves between different sets of codes on the template.
As with coding, data is usually input into a spreadsheet or stats package
on a question by question basis. Data is input into a series of cells. Each
column represents the answers to one question, while each row
represents the answers from one respondent. For example:
Identity Q1 Q2 Age Q3 etc
Gender Attitude
1 2 21 4
2 2 20 3
3 1 20 1
4 1 25 1
etc
Table1: Data Input
The quality and reliability of the analysis depends on accuracy at all
stages - the more stages of manipulation and transcription of data the
more likelihood there is of errors occurring. It is important to carry out
spot checks on the use of codes, especially if you are manually post-
coding the data, to ensure that codes have been assigned accurately.
You might, for example, check every fifth or tenth questionnaire for
accuracy of coding depending on the size of the sample.
Similarly it is important to conduct spot checks on the accuracy of data
input if you are using a statistical package or spreadsheet. Input the
data and go back and check the accuracy against a sample of the original
questionnaires. As with qualitative data, always make sure that you can
cross reference to the original data.
responses. For example, the following table shows how the frequency of
the various responses to a question on abolishing exams can be
calculated by scoring each response as a tally mark (tick or “1”);
counting up the total number of occurrences of each response category;
calculating the frequency of each category as a percentage of the total
number of responses to the question as a whole.
This immediately allows the researcher to describe the data using simple
statistics in a concise way – for example we can see that respondents
were relatively evenly split between those who strongly agreed, agreed
and disagreed, though it is clear that very few strongly disagreed. While
the overall trend is towards agreeing, as many as 35% (17/49) disagree.
The data can be presented in a research report as a table (see Table 2
below) or as a graph (e.g., a simple bar chart).
Frequency
Attitude Number (n = 49) Percentage
Strongly agree 11 22.5
Agree 19 38.8
Disagree 17 34.7
Strongly disagree 2 4.1
Table 2: Attitude towards abolition of exams
(to examine how variable the data is around the mean value), and/or
percentile points (e.g. the 25 th percentile is the value below which 25% of
the data falls; the 75th percentile is the value below which 75% of the
data falls). These kinds of calculations allow the range and distribution of
the data to be examined and interpretations drawn.
Here there are a number of approaches that may be useful such as cross-
tabulation, correlations and regressions. See the topics in this module
that cover statistical analysis for more detailed information about the use
and execution of these statistical tests.
Further Reading
References and
Bibliography
O’Leary, Z. (2004) The Essential Guide to Doing Research. Sage: London
(chapter 12)
Saunders, M.; Lewis, P.; and Thornhill, A. (2003) Research Methods for
Business Students 3E. FT Prentice Hall: Harlow, UK (chapter 11,
12)
Topic Review
This topic has provided infroamtion about quantitative and
qualitative data analysis techniques.