Professional Documents
Culture Documents
Qualitative Methods
Testing theories using language
Magazine articles/Interviews Conversations Newspapers Media broadcasts
Initial Observation
Find something that needs explaining
Observe the real world Read other research
Hypothesis
A prediction from a theory. E.g. the number of people turning up for a Big Brother audition that have narcissistic personality disorder will be higher than the general level (1%) in the population.
Falsification
The act of disproving a theory or hypothesis.
TV Show
Big Brother is a reality television franchise created by John de Mol. The premise of the show is that a group of people live together in a large house, isolated from the outside world. They are continuously watched by television cameras. Each series lasts for around three months, with 1216 contestants entering the house. In order to win the final cash prize, all the contestants have to do is survive periodic evictions and be the last one standing. The idea for Big Brother is said to have come during a brainstorm session at the Dutch-based international television production firmEndemol, on March 10, 1997. The first-ever version of Big Brother was broadcast in 1999 on Veronica in the Netherlands. Since then the format has become a worldwide TV franchise, airing in many countries in varying adaptations.
narcissism (nrs-szm) also narcism (-szm)n. 1. Excessive love or admiration of oneself. See Synonyms at conceit. 2. A psychological condition characterized by selfpreoccupation, lack of empathy, and unconscious deficits in selfesteem. 3. Pleasure derived from contemplation or admiration of one's own body or self, especially as a fixation on or a regression to an infantile stage of development. 4. The attribute of the human psyche charactized by admiration of oneself but within normal limits.
H1: % of people with ND will be higher is proven (845 /7662 x 100) = 11% having ND in the contestants versus the population.
Another observation: The contestants having high ND, are convinced that the public will love them and that they will win.
H: People with a high ND have a conviction to win. Testing of Hypothesis: by asking the Question: Do you thing you will win? 7/9 say yes, which confirms the observation. Another Theory: These contestants think they will win because they do not realize they have a personality disorder (ND). Hypothesis: These people are similar to others and do not have a different personality (ND). Hypothesis Testing: by asking question to those who thought that they would win, whether their personalities were different from the norm? If all 7 said that their personality was different from the norm. This contradicts my theory and is called Falsification
Other research carried out on BB 1. People with ND think they are more interesting than others 2. They think that they deserve more success than others 3. They think that others like them because they have special personalities.
Rival theory comes from another researcher that the problem is not that the ND contestants do not realize that they have a personality disorder, but they falsely believe that this ND is perceived positively by other people (i.e. they believe that their personality makes them likeable) Hypothesis: from this theory is: If higher ND contestants are asked to evaluate what other people of them, they will overestimate other peoples perceptions.
Independent Variable
The proposed cause A predictor variable A manipulated variable (in experiments) Coca-cola in the hypothesis above The proposed effect An outcome variable Measured not manipulated (in experiments) Sperm in the hypothesis above
Dependent Variable
Levels of Measurement
Categorical (entities are divided into distinct categories):
Binary variable: There are only two categories
e.g. dead or alive.
Ordinal variable: The same as a nominal variable but the categories have a logical order
e.g. whether people got a fail, a pass, a merit or a distinction in their exam.
Ratio variable: The same as an interval variable, but the ratios of scores on the scale must also make sense
e.g. a score of 16 on an anxiety scale means that the person is, in reality, twice as anxious as someone scoring 8.
Measurement Error
Measurement error
The discrepancy between the actual value were trying to measure, and the number we use to represent that value.
Example:
You (in reality) weigh 80 kg. You stand on your bathroom scales and they say 83 kg. The measurement error is 3 kg.
Validity
Whether an instrument measures what it set out to measure. Content validity
Evidence that the content of a test corresponds to the content of the construct it was designed to cover
Ecological validity
Evidence that the results of a study, experiment or test can be applied, and allow inferences, to realworld conditions.
Reliability
Reliability
The ability of the measure to produce the same results under the same conditions.
Test-Retest Reliability
The ability of a measure to produce consistent results when the same entities are tested at two different points in time.
Cross-sectional research:
This term implies that data come from people at different age points with different people representing each age point.
Experimental research:
One or more variable is systematically manipulated to see their effect (alone or in combination) on an outcome variable. Statements can be made about cause and effect.
Types of Variation
Systematic Variation
Differences in performance created by a specific experimental manipulation.
Unsystematic Variation
Differences in performance created by unknown factors.
Age, Gender, IQ, Time of day, Measurement error etc.
Randomization
Minimizes unsystematic variation.
Chimpanzee Experiment
Kurtosis
The heaviness of the tails. Leptokurtic = heavy tails. Platykurtic = light tails.
Skew
Kurtosis
Bimodal
Having two modes
Multimodal
Having several modes
A Bimodal Distribution
Example
Number of friends of 11 Facebook users.
x
i 1
x
i 1
x
i 1
1063 96.64 11
Example
Number of friends of 11 Facebook users. 22, 40, 53, 57, 93, 98, 103, 108, 116, 121, 252 Range = 252 22 = 230 Very biased by outliers
We have talked how frequency distributions can be used to get rough idea of probability of a score occurring. We can be precise. For any distribution of scores, we can calculate the probability of obtaining a score of a certain size. (It will be tedious and complex) Statisticians have defined common distributions known as probability distributions. From these it is possible to calculate the probability of getting particular scores based on the frequencies with which a particular score occurs in a distribution with these common shapes. One of these distributions is the Normal Distribution. Statisticians have calculated the probability of certain scores occurring in a normal distribution with a mean of 0 and a std. deviation of 1. We can use the probability tables to find the likely hood of that score.
We can convert any data set in to a data set that has a mean of 0 and std dev of 1. To centre the data around 0, take each score and subtract it from the mean. Divide the resulting score by the Std dev to ensure that the data has a std dev of 1.
Look at the Example in the Excel File
Z-scores
Standardising a score with respect to the other scores in the group. Expresses a score in terms of how many standard deviations it is away from the mean. The distribution of z-scores has a mean of 0 and SD = 1.
XX z s
Standard Normal Distribution Frequency Distribution in terms of Probability: Beachy Head example
How likely it is that a 70 year old commits suicide?
How likely it is that a 30 year old commits suicide?
No of sucicides by age
Therefore if we have a distribution similiar to Normal , we can use table of probabilities for Normal distribution. Example: Probability of person aged 70 or more. Mean is 36 and St. Deviation is 13 Z = X X / s, Z score is 2.62. Look this value in the Z table and you will find a value of .0044 r .44% The Normal Dist and the z scores allow us to go one step beyond our data in that from a set of scores.
Properties of z-scores
Z scores are important because their value cuts off certain important percentages of the distribution. The first important value is 1.96 followed by 2.58 and 3.29. 1.96 cuts off the top 2.5% of the distribution. 1.96 cuts off the bottom 2.5% of the distribution. As such, 95% of z-scores lie between 1.96 and 1.96. 99% of z-scores lie between 2.58 and 2.58, 99.9% of them lie between 3.29 and 3.29.
1.96 cuts off the top 2.5% of the distribution 95% of z-scores lie between 1.96 and 1.96. 99% of z-scores lie between 2.58 and 2.58, 99.9% of them lie between 3.29 and 3.29.
Fitting statistical models to the data Scientific statements can be be split into testable hypotheses.
The hypothesis or prediction that comes from your theory is usually saying that an effect will be present. This hypothesis is called the alternative hypothesis and is denoted by H1. (It is sometimes also called the experimental) There is another type of hypothesis, though, and this is called the null hypothesis and is denoted by H0. This hypothesis is the opposite of the alternative hypothesis and so would usually state that an effect is absent.
If our data give us confidence to reject the null hypothesis then this provides support for our experimental hypothesis. However, be aware that even if we can reject the null hypothesis, this doesnt prove the experimental hypothesis it merely supports it.
Rather than talking about accepting or rejecting a hypothesis (which some textbooks tell) we should be talking about the chances of obtaining the data weve collected assuming that the null hypothesis is true. i.e. what is the chance of obtaining the data that the big B contestants and the common public are same.
H1: % of people with ND will be higher is proven (845 /7662 x 100) = 11% having ND in the contestants versus the population.