Professional Documents
Culture Documents
Lecture 6 - Introdn To Statistical Analysis Variables
Lecture 6 - Introdn To Statistical Analysis Variables
• Example 1:
A questionnaire meant to be answered by adults over the age of
30 years has also been answered by some persons under the age
of 30 years
• Example 2:
A respondent gives her birthday as 1865 or claims to have a car
insurance but says she doesn‘t own a car
• Editing of data should be done wih a coloured pencil and the original
data must not be erased in case it is required for future reference.
• Data editing should be conducted systematically on the basis of
procedures made by professionals.
• Data editing has drawbacks, such as, the editor does not possess the
required level of intelligence, experience and objectivity.
• Data coding means translating the raw data into values suitable for
computer entry and statistical analysis.
• For example:
• Yes (or positive response) code - Y or 1
• No (or negative response) code - N or 2
• Don’t know code - D or 8
• No response/unknown code - U or 9
• Now, one by one, examine the responses of that same exact open-ended question to
ascertain the similarities and differences.
• If two or more responses are similar in meaning though not necessarily in language,
try to combine them under one category.
• Give a name to the category that is descriptive of the responses . Remember, when
you code the data you code categories, not responses per se.
• A response should not be placed within two categories, every response should be
placed within one of the categories.
The process of transforming raw data into useful information with an aim of suggesting
conclusions and supporting decision making.
It can also be defined as any entity that can take on different values. OK, so
what does that mean?
An attribute is a specific value on a variable. For instance, the variable sex or
gender has two attributes: male and female
04/12/24 BIT 1205 14
variables
Anything that can vary can be considered a variable. For instance, age can
be considered a variable because age can take different values for different
people or for the same person at different times.
• Control variable
• Extraneous variable
• Moderator variable
• A Control variable is anything else that could influence the dependent variable.
• Independent – sunlight,
• dependent- growth rate
• The experimenter randomly assigns students to two groups. All of the students study text
materials on a biology topic for thirty minutes.
• One group uses the new strategy for studying and the other uses a strategy of their choice. Then
all students complete a test over the materials.
• One obvious extraneous variable in this case would be pre-knowledge of the biology topic that
was studied.
• This variable will most likely influence student scores, regardless of which strategy they use.
• Because of this extraneous variable (and surely others like one group being more intelligent
than the other) there will be some spread within each of the groups.
• The third variable is referred to as the moderator variable or simply the moderator.
• Examining the relationship between Gender and ice cream consumption, you need to
include temperature since it moderates that relationship, else your conclusions will be
inaccurate.
• Continuous
• A variable that can take on any of a range of values, including fractions. e.g. height
can be 45.0 inches, 65.334 inches, 70.005 inches, weight, temperature etc.
• can assume an infinite number of values
• Discrete (or Categorical)
• their values vary but, they are constrained to a certain limit or boundary. e.g.
gender, number of goals scored in a football match, number of bedrooms in a house
etc..
• describe a finite set of conditions and take values from a finite, usually small, set of
states.
• Numerical data
• Characters and Strings
(personal reading)