Professional Documents
Culture Documents
Khan
August 7, 2023.
A p-value of 0.05 is often used as a cutoff for statistical significance. This means that if the p-value is less
than 0.05, then we can reject the null hypothesis with a 95% confidence level.For example, let's say we
are conducting a study to test the effectiveness of a new drug. We hypothesize that the drug will reduce
the risk of heart attack. We collect data from a group of patients who have taken the drug and a group of
patients who have not taken the drug. We then compare the rates of heart attack in the two groups.If the
p-value for this test is less than 0.05, then we can reject the null hypothesis and conclude that the drug is
effective in reducing the risk of heart attack. However, if the p-value is greater than 0.05, then we cannot
reject the null hypothesis and we cannot conclude that the drug is effective.
A t-test is a statistical hypothesis test used to compare the means of two groups and determine if there is
a significant difference between them. It's widely used for making inferences about population means
based on sample data. The t-test assesses whether the observed differences between the sample means
are likely to have occurred due to random sampling variation or if they represent true differences in the
population.
The chi-square test is a statistical test used to determine if there is a significant association or relationship
between two categorical variables. It compares the observed frequencies of categories in a contingency
table to the expected frequencies that would occur if the variables were independent. The chi-square test
assesses whether the observed distribution differs significantly from what would be expected by chance.
Purpose Modifies existing variable's Creates a new variable with transformed values
values
Original Variable Values are changed in the same Original variable remains unchanged
variable
Storage and Original variable is overwritten New variable can be used for specific analyses
Usage
Common Use Simplify categorical variable Create groups or categories for analysis
Cases values
Purpose Modifies existing variable's Creates a new variable with transformed values
values
Original Variable Values are changed in the same Original variable remains unchanged
variable
Storage and Original variable is overwritten New variable can be used for specific analyses
Usage
Common Use Simplify categorical variable Create groups or categories for analysis
Cases values
Combine or recode ordinal data Preserve original data while transforming values
Definition Rejecting the null hypothesis Failing to reject the null hypothesis when it is
when it is actually true. actually false.
Symbol α β
1.System BUS:
1.Address bus - carries memory addresses from the processor to other components such as primary
storage and input/output devices. ... 2.Data bus - carries the data between the processor and other
components. ... 3.Control bus - carries control signals from the processor to other components
2.Expansion BUS: Helps system bus, Transmit information between input,output and peripheral devices.
Open SPSS:
Start SPSS and open the dataset where you want to add variables.
Select Dataset: Choose the dataset that contains the variables you want to add.
Matching Variables: Specify the key variables that will be used to match cases between the two datasets.
Options: You can choose whether to replace or append the added cases and handle duplicate cases if
necessary.
(Adding Cases):
Open SPSS: Start SPSS and open the two datasets you want to merge.
Go to "Data" in the top menu, then select "Merge Files" and choose "Add Cases."
Target Dataset: Select the dataset you want to merge into (the one where you want to add cases).
Source Dataset: Choose the dataset you want to merge from (the one containing the cases you want to
add).
Matching Variables: Specify the key variables that will be used to match cases between the two datasets.
Options: Define how you want to handle cases that don't match or duplicate cases.
Descriptive statistics
Kolmogorov-Smirnov test: The Kolmogorov-Smirnov test is a non-parametric test that is based on the
cumulative distribution function of the data. It is a less sensitive test for departures from normality than
the Shapiro-Wilk test, but it is more robust to outliers.
Normality plots
Definition A graphical method for A graphical method for A graphical method for
assessing normality that assessing normality that plots displaying the distribution
removes the trend from the quantiles of the sample of data through its
the data before plotting. distribution against the quartiles. It displays the
quantiles of the standard median, quartiles, and
normal distribution. potential
Uses Used to assess whether Used to assess whether the Used to assess the
the data is normally data is normally distributed. distribution of data,
distributed after including its central
removing the trend. tendency, dispersion, and
outliers.
Strengths Can be used to assess Sensitive to departures from Easy to interpret and
normality even if the normality. understand.
data is not normally
distributed.
Histogram: A histogram is a graphical method for assessing normality. It shows the distribution of the data
in terms of the number of observations in each bin. If the data is normally distributed, then the histogram
will be bell-shaped.
A normal distribution is a continuous probability distribution that is symmetric around the mean, has a
bell-shaped curve, and tapers off towards the tails. The normal distribution is one of the most important
distributions in statistics, and it is used in a wide variety of applications, including: Testing hypotheses,
Estimating parameters, Making predictions, Data visualization
Recoding Variables:
You can recode values of a variable to new values based on specific criteria.
You can create new variables based on calculations or combinations of existing variables.
Aggregating Data:
Merging Datasets:
You can combine multiple datasets by merging them based on common variables.
/TABLE='AnotherDataset.sav'
/BY CommonVariable.