
INDEX

TOPIC


FUNCTIONS
 Count
 Count A
 Count Blank
 Sum
 Average
 Count If
 Average If
 Concatenate
 VLOOKUP
 HLOOKUP
OTHER TOOLS
 Transpose table
 Conditional Formatting- Highlight Cell rules (greater than, less than,
 between, equal to, text that contains)
 Conditional Formatting - Duplicate values
 Conditional Formatting - Top/ Bottom rules
 Conditional Formatting - Data Bars
 Conditional Formatting - Color Scales
 Format Cells – Number, Alignment, Font, Border, Fill
 Data validation – settings (Any value, number, custom)
 Data validation – Input message
 Data validation – Error alert
 Customization- Quick access toolbar
 Save as Adobe PDF
DATA VISUALIZATION AND ANALYSIS
 Correlation
 Frequency
 Relative frequency
 Percentage frequency
 Bar Graph
 Histogram using Graph tab
 Pivot Table and its tools
 Pivot Chart and its tools
 Histogram frequency distribution
 Histogram – Chart output
 Histogram – Pareto (sorted diagram)
 Histogram – Cumulative percentage
 Descriptive statistics

HYPOTHESIS TESTING
 One sample t test using dummy (one-tailed)
 One sample t test using dummy (two-tailed)
 One sample t test using test average (one-tailed)
 One sample t test using test average (two-tailed)
 t test using function (all combinations)
 Two sample - Independent sample t test
 Two sample - Paired Sample t test
 Two sample z test
 ANOVA – Single Factor
 ANOVA – Two Factor without replication
 ANOVA – Two Factor with replication
 F test
 Chi square test
INTRODUCTION TO R
 How to install R Studio
 Four Panes in R
 Import of Data Sheet in Excel
 Descriptive statistics
 Correlation
 Hypothesis Testing: Two sample - Independent sample t test
 Hypothesis Testing: Two sample - Paired Sample t test
 Hypothesis Testing: One-way ANOVA
 Hypothesis Testing: F test
 Hypothesis Testing: Chi square test
FUNCTIONS
COUNT :
The COUNT function counts the number of cells that contain numbers, and counts numbers
within the list of arguments.

COUNTA:
The COUNTA function counts cells containing any type of information, including error values
and empty text ("").
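
The difference between COUNT and COUNTA can be illustrated outside Excel. The following is a minimal Python sketch, not Excel's implementation: the cells list and the helper names count and counta are assumptions made for illustration, and None stands in for an empty cell.

```python
# Hypothetical range of cells; None stands in for a truly empty cell.
cells = [10, 2.5, "text", "", None, "#DIV/0!", 7]

def count(values):
    """Mimic COUNT: count only the cells that hold numbers."""
    return sum(
        1 for v in values
        if isinstance(v, (int, float)) and not isinstance(v, bool)
    )

def counta(values):
    """Mimic COUNTA: count every non-empty cell, including empty text and errors."""
    return sum(1 for v in values if v is not None)

print(count(cells))   # numeric cells only
print(counta(cells))  # everything except the empty cell
```

Here the text, the empty string and the error value are ignored by count but included by counta.
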
COUNTIF:
The COUNTIF function in Excel counts the number of cells in a range that match one
supplied condition. Criteria can include logical operators (>,<,<>,=) and wildcards (*,?) for
partial matching.
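
How a criterion with a logical operator or a wildcard is evaluated can be sketched in Python. This is a simplified illustration under stated assumptions, not Excel's actual matching logic: the helper name countif is hypothetical, an operator is assumed to be followed by a number, and wildcard matching is delegated to Python's fnmatch module, which treats * and ? the same way.

```python
import fnmatch
import operator

# Two-character operators are listed first so ">=" is tried before ">".
OPERATORS = {
    ">=": operator.ge, "<=": operator.le, "<>": operator.ne,
    ">": operator.gt, "<": operator.lt, "=": operator.eq,
}

def countif(values, criteria):
    """Count values matching one criterion (simplified COUNTIF sketch)."""
    for symbol, compare in OPERATORS.items():
        if criteria.startswith(symbol):
            # Assumption: the operator is followed by a number.
            target = float(criteria[len(symbol):])
            return sum(
                1 for v in values
                if isinstance(v, (int, float)) and compare(v, target)
            )
    # No operator: treat the criterion as case-insensitive text with wildcards.
    pattern = criteria.lower()
    return sum(
        1 for v in values
        if isinstance(v, str) and fnmatch.fnmatchcase(v.lower(), pattern)
    )

ages = [42, 76, 56, 45]
names = ["Apple", "apricot", "Banana"]
print(countif(ages, ">50"))   # ages above 50
print(countif(names, "ap*"))  # names starting with "ap"
```
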

COUNTBLANK:
The Excel COUNTBLANK function returns a count of empty cells in a range.
AVERAGE:
The Excel AVERAGE function calculates the average (arithmetic mean) of the supplied
numbers.

SUM:
The Excel SUM function returns the sum of the values supplied. These values can be numbers,
cell references, ranges, arrays and constants, in any combination.
AVERAGE IF:
The Excel AVERAGEIF function calculates the average of cells that meet one
criterion. AVERAGEIFS calculates the average of cells that meet multiple criteria.
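
The difference between averaging on one criterion and on several can be sketched in Python. The helper names averageif and averageifs, the use of Python functions as criteria, and the sample lists are all assumptions made for illustration; Excel itself takes criteria as text such as ">50".

```python
from statistics import mean

def averageif(criteria_range, predicate, average_range=None):
    """Average the cells whose matching criteria cell satisfies one condition."""
    if average_range is None:
        average_range = criteria_range
    picked = [a for c, a in zip(criteria_range, average_range) if predicate(c)]
    return mean(picked)  # raises StatisticsError if nothing matches

def averageifs(average_range, *conditions):
    """Average the cells that satisfy every (criteria_range, predicate) pair."""
    picked = [
        value for i, value in enumerate(average_range)
        if all(predicate(rng[i]) for rng, predicate in conditions)
    ]
    return mean(picked)

# Hypothetical columns of equal length.
ages = [42, 76, 56, 45]
groups = ["x", "y", "x", "y"]
scores = [10, 20, 30, 40]

print(averageif(ages, lambda a: a > 50))              # one criterion
print(averageif(groups, lambda g: g == "x", scores))  # criterion on another column
print(averageifs(scores, (groups, lambda g: g == "y"), (ages, lambda a: a > 50)))
```
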

CONCATENATE:
The Excel CONCATENATE function joins up to 255 text items together
and returns the result as text.
VLOOKUP:
VLOOKUP makes Excel search for a value in the first column of a table in order to return a
value from a different column in the same row.
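
The lookup described above can be sketched in Python as an exact-match search. The function name vlookup, the sample table and the choice to return None when nothing is found (where Excel would show #N/A) are assumptions made for illustration.

```python
def vlookup(lookup_value, table, col_index):
    """Search the first column of table; return the value from col_index (1-based)."""
    for row in table:
        if row[0] == lookup_value:
            return row[col_index - 1]
    return None  # Excel would display #N/A here

# Hypothetical table: employee id, name, age.
employees = [
    ("E01", "Asha", 42),
    ("E02", "Ravi", 76),
    ("E03", "Meena", 56),
]

print(vlookup("E02", employees, 3))  # age in the row whose id is E02
print(vlookup("E09", employees, 2))  # no such id
```
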

HLOOKUP:
The Microsoft Excel HLOOKUP function performs a horizontal lookup: it searches for a
value in the top row of the table and returns the value from the same column in the row
given by index_number.
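
A horizontal lookup can be sketched in Python in the same spirit. The function name hlookup and the sample table are assumptions made for illustration, and only exact matches are handled.

```python
def hlookup(lookup_value, table, row_index):
    """Search the top row of table; return the value from row_index (1-based)."""
    top_row = table[0]
    if lookup_value not in top_row:
        return None  # Excel would display #N/A here
    column = top_row.index(lookup_value)
    return table[row_index - 1][column]

# Hypothetical table laid out horizontally: headers in the top row.
quarters = [
    ["Q1", "Q2", "Q3"],
    [100, 150, 120],
    [10, 15, 12],
]

print(hlookup("Q2", quarters, 3))  # third row, Q2 column
```
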
PIVOT TABLE:
A pivot table is a tool that lets you reorganize and summarize selected columns
and rows of data in a spreadsheet or database table to obtain a desired report.
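
The kind of summary a pivot table produces can be sketched in Python by grouping records on two fields and summing a third. The function name pivot_sum and the sales records are hypothetical; the sketch covers only the Sum aggregation.

```python
from collections import defaultdict

def pivot_sum(records, row_field, column_field, value_field):
    """Group records by row and column fields and sum the value field."""
    table = defaultdict(lambda: defaultdict(int))
    for record in records:
        table[record[row_field]][record[column_field]] += record[value_field]
    # Convert nested defaultdicts to plain dicts for a clean result.
    return {row: dict(columns) for row, columns in table.items()}

# Hypothetical source data.
sales = [
    {"region": "North", "product": "Pen", "sales": 10},
    {"region": "South", "product": "Pen", "sales": 5},
    {"region": "North", "product": "Book", "sales": 20},
    {"region": "South", "product": "Book", "sales": 15},
    {"region": "North", "product": "Pen", "sales": 7},
]

report = pivot_sum(sales, "region", "product", "sales")
for region, totals in report.items():
    print(region, totals)
```
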

PIVOT CHART:
PIVOT SLICER:
Slicers in Excel are visual filters: interactive buttons that let you filter a Pivot Table
and see which items are currently selected.
TRANSPOSE TABLE:
The TRANSPOSE function converts a vertical range of cells to a horizontal range of cells, or
a horizontal range of cells to a vertical range of cells. In other words, TRANSPOSE "flips"
the orientation of a given range or array.
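
The "flip" can be sketched in Python, where a range is modelled as a list of rows. The function name transpose and the sample range are assumptions made for illustration.

```python
def transpose(rows):
    """Turn rows into columns and columns into rows."""
    return [list(column) for column in zip(*rows)]

# Hypothetical vertical range: one column, four rows.
vertical = [["Age"], [42], [76], [56]]
horizontal = transpose(vertical)

print(horizontal)             # one row, four columns
print(transpose(horizontal))  # transposing twice restores the original
```
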

CONDITIONAL FORMATTING:
 HIGHLIGHT CELL RULES:
The Excel Conditional Formatting Highlight Cells Rules allow you to apply formatting
to highlight cells that satisfy one or more specific conditions.

These conditions can relate to numeric values (e.g. greater than, between), dates
(e.g. a date occurring...) or text values (e.g. text containing...). We can also use the
Highlight Cells rules to highlight cells containing duplicate values.
 TOP/BOTTOM RULES:
The Excel Conditional Formatting Top/Bottom Rules allow you to apply formatting to
cells that satisfy a statistical condition in relation to other cells in the range (e.g.
above average, within top 10%, etc.).
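
The statistical conditions behind these rules can be sketched in Python by listing which values would be highlighted. The function names and the sample values are assumptions made for illustration, and ties are not given any special treatment.

```python
from statistics import mean

def above_average(values):
    """Values a rule like Above Average would highlight."""
    average = mean(values)
    return [v for v in values if v > average]

def top_n(values, n):
    """Values a rule like Top N Items would highlight (ties not handled)."""
    return sorted(values, reverse=True)[:n]

# Hypothetical range of cell values.
values = [42, 76, 56, 89, 45]

print(above_average(values))  # values greater than the mean of the range
print(top_n(values, 2))       # the two largest values
```
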
 DATA BARS:
Data bars draw a bar inside each cell, which helps to visualize the values of the
cells. A larger value gets a longer bar, and a smaller value gets a
shorter bar.

 COLOR SCALES:
Color Scales in Excel make it very easy to visualize values in a range of cells. The
shade of the color represents the value in the cell.

 ICON SETS:
Icon Sets in Excel make it very easy to visualize values in a range of cells. Each icon
represents a range of values.

DATA ANALYSIS:
Data Analysis is a process of inspecting, cleaning, transforming and modeling data
with the goal of discovering useful information, suggesting conclusions and
supporting decision-making.

HISTOGRAM

 PARETO (sorted histogram):

 CUMULATIVE PERCENTAGE:
 CHART OUTPUT:

DESCRIPTIVE STATISTICS:
HYPOTHESIS TESTING

 One-sample t-test using Dummy (one-tailed)

RESEARCH PROBLEM: To determine whether the population mean of age is greater than 40 at
α = 0.05

Age DUMMY
42 0
76 0
56  
56  
67  
65  
65  
89  
76  
45  
45  
65  
78  
55  
52  
53  
44  
65  
76  
89  
44  
54  
45  
56  
56  
56  
76  

Hypothesis Testing:

Null Hypothesis: The population mean of age is less than or equal to 40.

H0: µ ≤ 40

Alternate Hypothesis: The population mean of age is greater than 40.

H1: µ > 40

t-Test: Two-Sample Assuming Equal Variances

                                Age            DUMMY
Mean                            60.96296296    0
Variance                        189.1908832    0
Observations                    27             2
Pooled Variance                 182.1838134
Hypothesized Mean Difference    40
df                              27
t Stat                          2.119315692
P(T<=t) one-tail                0.021702593
t Critical one-tail             1.703288446
P(T<=t) two-tail                0.043405187
t Critical two-tail             2.051830516

Decision Rule:
If t Stat is greater than t Critical, reject the Null Hypothesis.
If P(T<=t) is less than α, reject the Null Hypothesis.
Inference:
Since t Stat = 2.12 is greater than t Critical one-tail = 1.70, reject the Null Hypothesis.
Since P(T<=t) one-tail = 0.02 is less than α = 0.05, reject the Null Hypothesis.
Conclusion:
Therefore, the mean age of the population is greater than 40 at alpha=0.05 assuming equal
variances.
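
The Mean, Variance, Pooled Variance, df and t Stat figures in the Excel output above can be reproduced from the Age and DUMMY columns. The following Python sketch applies the standard pooled-variance formula with a hypothesized mean difference of 40. The variable names are assumptions made for illustration, and the one-tail critical value is copied from the Excel output rather than recomputed, because the Python standard library has no t distribution.

```python
from math import sqrt
from statistics import mean, variance

# Age and DUMMY columns from the worksheet above.
age = [42, 76, 56, 56, 67, 65, 65, 89, 76, 45, 45, 65, 78, 55,
       52, 53, 44, 65, 76, 89, 44, 54, 45, 56, 56, 56, 76]
dummy = [0, 0]
hypothesized_difference = 40

n1, n2 = len(age), len(dummy)
df = n1 + n2 - 2

# Pooled variance: sample variances weighted by their degrees of freedom.
pooled_variance = ((n1 - 1) * variance(age) + (n2 - 1) * variance(dummy)) / df

# t statistic for the difference in means against the hypothesized difference.
standard_error = sqrt(pooled_variance * (1 / n1 + 1 / n2))
t_stat = (mean(age) - mean(dummy) - hypothesized_difference) / standard_error

# Copied from the Excel output (t Critical one-tail), not recomputed here.
T_CRITICAL_ONE_TAIL = 1.703288446
reject_h0 = t_stat > T_CRITICAL_ONE_TAIL

print(f"mean = {mean(age):.4f}, variance = {variance(age):.4f}")
print(f"pooled variance = {pooled_variance:.4f}, df = {df}")
print(f"t Stat = {t_stat:.4f}, reject H0: {reject_h0}")
```
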

 T-Test: Paired Two Sample for Means

Research Problem: Is there sufficient evidence to suggest that the mean time
to exhaustion is greater after chocolate milk than after a carbohydrate
replacement drink? Use a significance level of 0.05. (Use µcm − µcd in the hypothesis
statements.)
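
The t statistic for a paired test is computed from the differences within each pair. The Python sketch below shows the calculation only. The worksheet data for this problem is not reproduced here, so the two short lists are made-up placeholder values, not the study's measurements, and the function name paired_t is an assumption.

```python
from math import sqrt
from statistics import mean, stdev

def paired_t(sample_a, sample_b):
    """Return (t statistic, degrees of freedom) for paired samples."""
    differences = [a - b for a, b in zip(sample_a, sample_b)]
    n = len(differences)
    # Mean difference divided by its standard error.
    t_stat = mean(differences) / (stdev(differences) / sqrt(n))
    return t_stat, n - 1

# Placeholder values for illustration only (not the real data).
chocolate_milk = [11, 12, 13, 14]
carbohydrate_drink = [10, 10, 10, 10]

t_stat, df = paired_t(chocolate_milk, carbohydrate_drink)
print(f"t Stat = {t_stat:.4f}, df = {df}")
```

The resulting t Stat would then be compared with the one-tail critical value for the given df at the 0.05 level.
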
Decision rule
If t Stat > t Critical, we reject H0.
If t Stat < t Critical, we fail to reject H0.

If the p-value > α (0.05), we fail to reject H0.
If the p-value < α (0.05), we reject H0.

SAMPLE PROB. 2
Decision rule
If t Stat > t Critical, we reject H0.
If t Stat < t Critical, we fail to reject H0.

If the p-value > α (0.05), we fail to reject H0.
If the p-value < α (0.05), we reject H0.

SAMPLE PROB. 3
