Professional Documents
Culture Documents
HYPOTHESIS TESTING
One sample t test using dummy (one-tailed)
One sample t test using dummy (two-tailed)
One sample t test using test average (one- tailed)
One sample t test using test average (two- tailed)
t test using function (all combinations)
Two sample - Independent sample t test
Two sample - Paired Sample t test
Two sample z test
ANOVA – Single Factor
ANOVA – Two Factor without replication
ANOVA – Two Factor with replication
F test
Chi square test
INTRODUCTION TO R
How to install R Studio
Four Panes in R
Import of Data Sheet in Excel
Descriptive statistics
Correlation
Hypothesis Testing: Two sample - Independent sample t test
Hypothesis Testing: Two sample - Paired Sample t test
Hypothesis Testing: One-way ANOVA
Hypothesis Testing: F test
Hypothesis Testing: Chi square test
FUNCTIONS
COUNT :
The COUNT function counts the number of cells that contain numbers, and counts numbers
within the list of arguments.
COUNTA:
The COUNTA function counts cells containing any type of information, including error values
and empty text ("").
COUNTIF:
The COUNTIF function in Excel counts the number of cells in a range that match one
supplied condition. Criteria can include logical operators (>,<,<>,=) and wildcards (*,?) for
partial matching.
COUNTBLANK:
The Excel COUNTBLANK function returns a count of empty cells in a range.
AVERAGE:
The excel AVERAGE function calculates the average (arithmetic mean) of supplied
numbers.
SUM:
The excel SUM function returns the sum of values supplied. These values can be numbers,
cell references, ranges, arrays and constants, in any combination.
AVERAGE IF:
The excel AVERAGEIF function in excel calculates the average of cells that meet one
criteria. AVERAGEIF calculates the average of cells that meet multiple criteria.
CONCATENATE:
The excel CONCATENATE function concatenates(joins) join up to 30 text items together
and returns the result as text.
VLOOKUP:
It is a function that makes Excel search for a certain value in a column in order to return a
value from a different column in the same row.
HLOOKUP:
The Microsoft Excel HLOOKUP function performs a horizontal lookup by searching for a
value in the top row of the table and returning the value in the same column based on the
index_number.
PIVOT TABLE:
A pivot table is a program tool that allows to reorganize and summarize selected columns
and rows of data in a spreadsheet or database table to obtain a desired report.
PIVOT CHART:
PIVOT SLICER :
Slicers in Excel are visual filters or interactive buttons that allow you to see what items have
been chosen within a Pivot Table.
TRANSPOSE TABLE:
The TRANSPOSE function converts a vertical range of cells to a horizontal range of cells, or
a horizontal range of cells to a vertical range of cells. In other words, TRANSPOSE "flips"
the orientation of a given range or array.
CONDITIONAL FORMATTING:
HIGHLIGHT CELL RULES:
The Excel Conditional Formatting Highlight Cells Rules allow you to apply formatting
to highlight cells that satisfy one or more specific conditions.
These conditions can relate to numeric values (e.g. greater than, between), dates
(e.g. a date occurring...) or text values (e.g. text containing...). We can also use the
Highlight Cells rules to highlight cells containing duplicate values.
TOP/BOTTOM RULES:
The Excel Conditional Formatting Top/Bottom Rules allow you to apply formatting to
cells that satisfy a statistical condition in relation to other cells in the range (e.g.
above average, within top 10%, etc.).
DATA BARS:
Data bars insert a bar chart inside excel, and this helps to visualize the values of the
cells. The larger value will have a large bar line, and the lesser value will have a
small bar line.
COLOR SCALES:
Color Scales in Excel make it very easy to visualize values in a range of cells. The
shade of the color represents the value in the cell.
ICON SETS:
Icon Sets in Excel make it very easy to visualize values in a range of cells. Each icon
represents a range of values.
DATA ANALYSIS:
Data Analysis is a process of inspecting, cleaning, transforming and modeling data
with the goal of discovering useful information, suggesting conclusions and
supporting decision-making.
HISTOGRAM
CUMULATIVE PERCENTAGE:
CHART OUTPUT:
DESCRIPTIVE STATISTICS:
HYPOTHESIS TESTING
RESEARCH PROBLEM: To determine that the population mean of age is greater than 40 at
α=0.05
Age DUMMY
42 0
76 0
56
56
67
65
65
89
76
45
45
65
78
55
52
53
44
65
76
89
44
54
45
56
56
56
76
Hypothesis Testing:
H0: µ ≤ 40
H1: µ > 40
t-Test: Two-Sample Assuming Equal
Variances
DUMM
Age Y
60.9629629
Mean 6 0
189.190883
Variance 2 0
Observations 27 2
182.183813
Pooled Variance 4
Hypothesized Mean Difference 40
df 27
2.11931569
t Stat 2
0.02170259
P(T<=t) one-tail 3
1.70328844
t Critical one-tail 6
0.04340518
P(T<=t) two-tail 7
2.05183051
t Critical two-tail 6
Decision Rule:
If t stats is greater than t critical, reject Null Hypothesis.
If P (T) less than α, reject Null Hypothesis.
Inference:
Since t stats=1.95 is greater than t critical=1.70, reject Null Hypothesis.
Since P=0.03 is less than α=0.05, reject Null Hypothesis.
Conclusion:
Therefore, the mean age of the population is greater than 40 at alpha=0.05 assuming equal
variances.
Research Problem: Is there sufficient evidence to suggest that the mean time
to exhaustion is greater after chocolate milk than after carbohydrate
replacment drink? Use a significance level of 0.05. (Use µ cm-µcd in hypothesis
statements)
Decision rule
If T stat > T critical , we reject Ho
If T stat < T critical , we accept Ho
SAMPLE PROB. 2
Decision rule
If T stat > T critical , we reject Ho
If T stat < T critical , we accept Ho
SAMPLE PROB. 3