Professional Documents
Culture Documents
observations.
LESSON 1: FUNDAMENTALS OF RESEARCH AND
STATISTICS Population - a complete set of individuals, objects,
or measurements having some common
Statistics - the process of collecting data in a observable characteristic.
systematic manner, examining those data, and
making inferences from them. Sample - a subset of a population that shares the
same characteristics as the population.
COMMONLY USED TERMS IN STATISTICS AND
RESEARCH Statistic - a number resulting from the
Variable - any measurable characteristic of a manipulation of sample data according to
person, environment, or experimental treatment certain specified procedures; statistics for samples
that varies from person to person, environment to are variable rather than constant. They are
environment, or experimental situation to reported by Roman letters (e.g., x, s, r).
experimental situation. - are numbers based on direct observation
and measurement.
Constant - a number that represents a construct
that does not change. Parameter - a value summarizing a measurable
characteristic of a total population that is
Dependent Variable (DV) - an outcome of interest estimated based on the value of a statistic.
(e.g., some aspect of behavior) that is observed Population parameters are constants. Parameters
and measured by a researcher in order to assess are represented by Greek letters.
the effects of the independent variable. - are often inferred values based on
statistics.
Independent Variable (IV)
- the variable that an experimenter uses to
describe or explain the differences in the SAMPLING
dependent variable or to cause changes in the
dependent variable. Simple Random Sample - a subset of a population
- Types of Independent Variables: selected in such a way that each member of the
Subject Variable - based on a measurable population has an equal an independent
characteristic of the subject that the opportunity to be selected.
experimenter does not directly change. A
condition of the subject that exists before Random Assignment - assigning subjects to
the research begins. different treatment conditions in such a way that
Manipulated or Experimental Variable - the each subject has an equal and independent
type that the experimenter systematically opportunity to be placed in each treatment
controls or manipulates and to which the condition. With random assignment we ensure
subjects are assigned. that the groups are equivalent to each other
before the research begins.
Data - numbers or measurements that are
collected as a result of observations.
Real Limits
- boundaries of intervals for scores that are
represented on a continuous number line.
- located exactly halfway between the scores. LESSON 2: MEASURES OF CENTRAL TENDENCY
- Each score has two real limits:
•upper real limit Central Tendency - is a statistical measure to
•lower real limit determine a single score that defines the center
of a distribution.
Nominal Scale - the goal of central tendency
- consists of a set of categories that have different is to find the single score that is most typical or
names most representative of the entire group.
- measurements on a nominal scale label and
categorize observations, but do not make any
quantitative distinctions between observations.
b. Median
- Characteristics of the Mean - the median is the midpoint of the list if the scores
1. Changing the value of any score will change in a distribution are listed in order from smallest to
the mean. largest
2. Adding a new score to a distribution, or - the median is the point on the measurement
removing an existing score, will usually change scale below which 50% of the scores in the
the mean. The exception is when the new score distribution are located.
(or the removed score) is exactly equal to the - the goal of the median is to locate the midpoint
mean. of the distribution.
3. If a constant value is added or subtracted to -there are no specific symbols or notation to
every score in a distribution, the same constant identify the median. APA uses Mdn.
will be added or subtracted to the mean. - Pag odd number yung total number of scores,
Ex.: automatically yung nasa gitna yung median. Pag
even number yung scores, pag add mo yung
dalawang nasa gitna then divide mo sa 2.
c. Mode
- is the score or category that has the greatest
frequency in a distribution.
- it can be used to determine the typical or most
frequent value for any scale of measurement,
including a nominal scale
- there are no symbols or special notation used to
Meron kang 6 scores which is 4, 2, 3, 3, 2, 3, pag identify the mode or to differentiate between a
pinag-add mo sila 17 total, tas 17 divide mo sa sample mode and a population mode.
6 (kase 6 scores) yung mean mo is 2.83. Pag
Frequency Distribution
- is an organized tabulation of the number of
individuals located in each category on the scale
of measurement.
- Two elements:
1. The set of categories that make up the
Yung mode diyan Is Luigi, hindi 42. Yung 42 yung original measurement scale.
highest sa table or ilang beses (frequent) umulit. 2. A record of the frequency, or number of
Kung sino yung nakakuha ng highest na score, individuals in each category.
yun yung mode.
Shape of a Frequency Distribution
- it is possible to have more than one mode. 1. Symmetrical Distribution - in a symmetrical
Specifically, it is possible to have two or more distribution, it is possible to draw a vertical line
scores that have the same highest frequency. through the middle so that one side of the
Bimodal - a distribution with two modes. distribution is a mirror image of the other.
Multimodal - a distribution with more than
two modes.
No mode - Adistribution with several equally
high points.
**WHEN TO USE THE MEDIAN AND MODE IS NOT 2. Skewed Distribution - the scores tend to pile up
INCLUDED HERE** toward one end of the scale and taper off
gradually at the other end.
PRESENTING MEANS AND MEDIANS IN GRAPHS 3. Tail - the section where the scores taper off
toward one end of a distribution.
Graphs
- can be used to report and compare measures 4. Positively Skewed - a skewed distribution with
of central tendency the tail on the right-hand side; the tail points
- it allows several means (or medians) to be shown toward the positive (above-zero) end of the X-
simultaneously so it is possible to make quick axis. (From left na mataas pababang right)
comparisons between groups or treatment
conditions. 5. Negatively Skewed - the tail points to the left of
- Kinds of Graphs: zero. (From left na mababa pataas ng right)
1. Line Graph
2. Histogram - uses numerical variables, walang
space yung bars, mode
3. Bar Graph - uses categories, may space yung
bars, mean and median
LESSON 4: VARIABILITY
This type of graph is used to show the shape of the 2. Variability measures how well an individual
distribution, its central value, and its variability. score (or group of scores) represents the entire
distribution.
In a box and whisker plot: - This aspect of variability is very important for
1. The ends of the box are the upper and lower inferential statistics, in which relatively small
quartiles, so the box spans the interquartile range samples are used to answer questions about
2. The median is marked by a vertical line inside populations
the box
3. The whiskers are the two lines outside the box **In short, mas maiksi yung interval ng bawat
that extend to the highest and lowest score mas maganda or greatest variability siya
observations. non.
RANGE
Formula: or
Formula:
Formula:
or
Variance =
Ex.:
Standard Deviation =
BIASED AND UNBIASED STATISTICS Error variance – is used to indicate that the
A sample statistic is unbiased if the average sample variance represents unexplained and
value of the statistic is equal to the population uncontrolled differences between scores.
parameter. (The average value of the statistic is
obtained from all the possible samples for a *As the error variance increases, it becomes
specific sample size, n.) more difficult to see any systematic differences or
patterns that might exist in the data.
A sample statistic is biased if the average value
of the statistic either underestimates or
overestimates the corresponding population LESSON 5: z-SCORES: LOCATION OF SCORES AND
parameter. STANDARDIZED DISTRIBUTIONS
*Dito naman, pag nag-p’present ng graph, The process of transforming X values into z-scores
dapat maliit lang yung standard deviation and serves two useful purposes:
kitang-kita yung difference ng bawat frequency. 1. Each z-score tells the exact location of the
Yung tipong isang tingin lang madali lang original X value within the distribution.
malaman kung ano yung pinapakita nung 2. The z-scores form a standardized distribution
graph. that can be directly compared to other
distributions that also have been transformed
into z-scores.
TRANSFORMATIONS OF SCALE
**Need i-transform yung x values into z-scores
2. Adding a constant to each score does not para mas malinis tignan yung graph kase liliit
change the standard deviation. yung standard deviation non and di kalat-kalat
yung values at the same time medaling i-
2. The z-scores are then transformed into new X Edi 0.50 multiply sa 10 so 5. Tapos add mo lang
values so that the specific μ and σ are attained. yung 50 so 55 yung sagot. Yun na yung X-value
(Gamitin mo yung formula na to) wag makulit. Ganon din kay Joe tamang gaya
lang ng process haha.
Edi ayan tapos mo na Step 1. Basis mo diyan is Characteristics of the Normal Distribution Curve
raw scores and original mean and standard 1. The scale along the bottom of the distribution is
deviation which is 57 and 14. Step 2 ka na. in z-score units, that is, μ = 0, σ = 1
**Basta lagi mong tatandaan na sa distribution Ex. 1: Corrine scores 93 on the IQ test. What is her
graph, lagging 50% yung sa gitna kase yun nga percentile rank?
yung mean so yun yung gitna okay besss. Solution:
Solution:
a.
Convert ulit ng z-score edi 0.53. Tas hanapin mo
0.53 sa table column C edi 0.20 siya. Di yun yung
sagot kasi yung hinahanap is percentage of test
Transform mo lang yung score ni Frank into z-
takers na mas mataas score sa kanya so need
score. Edi 1.67 siya, tignan mo non yung statistical
mag-subtract ng 50 (dahil nasa right side siya)
table yung 1.67 column D ata to edi .4525 siya.
edi 0.30 or 30% yung final answer.
Gawin mong percentage non either multiply mo
sa 100 or move two decimal places to the right
so magiging 45.25% siya. Yun yung percentage
of cases simula mean hanggang sa score niya.
What percentage of cases fall between a score 3. What percentage of cases fall between a
of 120 and a score of 88? score of 120 and a score of 88?
Step 1: Find the z corresponding to x = 120 Answer: 5,000 x .6963 = 3,482 cases