You are on page 1of 48

Data Analysis

In Microsoft Excel
2

Working with Data


Building data sets and moving around efficiently
Excel for Data
Analysis Formulas & Functions
The Functions you should know

Pivot Tables
Summarizing Large Data Sets

Charts
How to work with them in Excel
Working with Data
How to Build a Data Set
About this Data Set

Randomly generated performance


metrics for a Facebook page and
YouTube channel

Metrics and Dimensions:


• Day
• Views
• Platform
• Number of Likes
• Comments
• Shares
Build Down Not Across
Database Calculator
Working with Data

Treat each row as one Attributes of Row 2:


“piece” of data • Date = 1/1/2017
• Views = 79,762
• Platform = YouTube
• Likes = 8,376
• Comments = 239
• Shares = 3,349
Working with Data
Important!
Section 1: How to Build a Data Set

Each column is either a “Dimension” or “Metric”

Measures / Metrics Vs. Dimensions


• Help quantify video performance • Put performance into context
• Views, Likes, Comments, Shares • Day, Platform
Working with Data
Tips and Tricks
Keyboard Shortcuts*
*For Mac, change CTRL to CMD

Get Moving
• Ctrl + Arrow Keys - Move to the last
piece of data
• Ctrl + Shift + Arrow Keys - Select a
range of data
• Also works when you’re building
formulas
Keyboard Shortcuts*
*For Mac, change CTRL to CMD

The HOT Keys


• Ctrl + 1 - Change the data
type of the selection
• Ctrl + F - Find a piece of
data
• Ctrl + H - Find and Replace
• Ctrl + C - Copy
• Ctrl + X - Cut
• Ctrl + V - Paste
• Ctrl + Z - Undo
Auto filling Patterns &
Formulas

Excel picks up patterns very well,


especially related to dates and simple
arithmetic equations.

This “pattern recognition” becomes


very valuable when you’re auto filling
formulas across large data sets.
Copying & Cutting
Formulas

When you copy a cell to another


cell, Excel shifts the formulas
proportionate to the move
(assuming you’re not using
absolute referencing)

When you cut cells that contain


formulas, Excel changes all cell
references to match the original
relationship
=sum(J7:J13) =sum(N7:N13)
If you delete columns used in a
formula, you’ll get a #REF error
Sort & Filter

Sorting
• Can be done based on values,
colors, conditional formatting
• Most commonly used to sort
data into ascending or
descending order

Filtering
• Can also be used to sort data into
ascending or descending order
• Also gives ability to search based
on numeric or text criteria, ie
“greater than 10,000 views“ or
“Youtube” only
Filter ALL rows Not just one
Tables and Named
Ranges

Properties of Tables:
• Easy to Filter
• Totals Row
• Calculated Columns
• Helpful in refreshing Pivot Tables

Named Ranges:
• Help simplify referencing data in
formulas
Formulas
Where to start

There are over 400 functions in Excel

Depending on how you’re using


Excel, you may never need to use
90%+ of them

We’re going to learn ~20 functions in


Excel that will help power 90% of
your analysis.
The Structure of a Function

Every function is comprised of


arguments.

Depending on the function, some


arguments may be optional. These
are denoted using brackets – “ [ ] “ in
the documentation.

You can call functions by simply


typing them into the formula bar or
by using the formula builder.
Comparison Operators

Used to compare two values in Excel

Result is TRUE or FALSE

Can also be used within formulas,


often with the IF functions (IF,
SUMIF, COUNTIF, etc..)

Can also be used to create helper


columns
Date Functions

In this section we’ll focus on:


• DAY
• MONTH
• YEAR
• WEEKDAY
• EOMONTH

These functions are very useful if


your job at all involves scheduling or
time based reporting.
How do dates work in
Excel?

Dates are calculated in Excel as the


“number of days since January 0,
1900.”

Excel can also help you calculated


the difference in hours, minutes, and
seconds. Each of these calculations
are represented by fractions of a day.
(Example: 12 noon is the same as 0.5
days)
Basic Math Functions

There are SO many Math & Trig


functions. Geometry class should
have featured a section on using
Excel! But we’re going to focus on
SUM, SUMIF, and SUMIFS.

The SUM() function is probably one


of the first functions that Excel users
get to know. But the ability to use
SUMIF and SUMIFS will take your
formula skills to the next level.
Statistical Functions

In this section we’ll focus on


AVERAGE, AVERAGEIFS, COUNT,
COUNTIF, and COUNTIFS.

These functions can help summarize


data, count different types of data
that appear in your data set, and
even set quality assurance checks.
Lookup & Reference

Out of all of these Lookup &


Reference functions, we’re going to
focus on INDEX, MATCH, and
VLOOKUP.

These functions will help you lookup


data from large data tables or join
two data tables together.
Logical

Of the LOGICAL functions, we’ll


focus on IF, AND, and OR.

These functions will help you:


• Check for multiple criteria within a
formula
• Add conditional statements to
your formulas and supercharge
your analysis
Handling Errors

Errors Commonly Found in Excel


• #N/A – Not found.
• #DIV/0 – Dividing by Zero.
• #NAME – A name error occurs when
you’ve misspelled the name of a

#N/A function you’re trying to use.


• #REF – Excel cannot find the cell
you’re referencing.
• #VALUE – The data type you’re
#NAME referencing is likely incorrect.
#VALUE Many times we can use these errors to our
advantage to spot flaws in our analysis or
data set.

We’ll learn the functions IFERROR &


ISERROR to show how these can help.
Pivot Tables
What is a Pivot Table?

A Pivot Table is a summary of large (or


small) data sets. The pivot table here
summarizes over 1,400 rows of data.

Metrics and dimensions summarized


in a fraction of the time as compared
to more complicated formulas
Building a Pivot Table

The most important thing you need


is well structured data!

Headers at the top

Metrics and Dimensions well


structured throughout the set

No need to format the cells of the


data set. That happens later
The Pivot Table Fields
Pane

Data can be added to four sections of


a pivot table:
• Columns
• Rows
• Values
• Filters

Nesting fields is allowed and often


creates a more aesthetic layout

Values can be shown in a variety of


ways, including sum, average, count,
and even a percentage of total.
Grouping Row items

Dates are one of the most common


items you can group together
(example left)

How to group row labels:


• Right click
• “Group”
• For dates, select how many levels
you’re grouping, i.e. months,
quarters, years?
Calculated Field

Calculated Fields are those that could


be calculated easily within the raw
data

Rather than calculating the field in


our raw data, this allows us to
calculate the values on the fly inside
of our pivot table.
Calculated Field

Engagements are calculated behind


the scenes in the pivot table rather
than in your data

Rather than calculating the field in


our raw data, this allows us to
calculate the values on the fly inside
of our pivot table.
Calculated Item

This is often used when we have to


calculated a metric that fits inside
multiple dimension categories
Calculated Item

In this case, the pivot table does the


heavy lifting of calculating the
difference between the worldwide
views for each and the US views for
each day, and generates an entirely
new pivot table column.

Cannot be used on “grouped” pivot


table data
Pivot Table Tricks

You can hide values from your pivot


table using the “Row Labels” and
“Column Labels” dropdowns

Show as a percent of global total,


column total
Final Touch

Refreshing a Pivot Table


• Create a Table vs. “Change Data
Source”

Allows you the ability to quickly add


new weekly, monthly, or quarterly
data to pivot tables you’ve already
set up
Visualizing Data with
Charts
Scatter Plot

Two variable plotted against each


other to reveal possible correlations

Excel can generate the regression


line for the data in addition to the R-
Squared value

Time is not shown on this axis. Only


uploads vs. views.
Line Chart

Most often used to plot data in a


series.

Can be used to study multiple


metrics along the same time period.

We’ll use to plot views and


engagements on the same graph.
Month over month change plotted over time.
https://www.thinkoutsidetheslide.com/column-graphs-vs-bar-charts-
when-to-choose-each-one/
Column Chart

Very helpful in visualizing


proportional relationship between
categories (aka dimensions)

Once you’ve done some initial


analysis into your data, column
charts can help put the data in
perspective.

Our Analysis
• Views and Engagements based on
day of the week
• Month over month change
• Percentage Organic vs. Paid by
Month
Pivot Charts

Pivot charts take the shape and form


of common chart types, but they
allow for more flexibility

Pivot charts add interactivity to you


and to end users, with the ability to
dynamically update
More Tips & Tricks
Conditional Formatting

Conditional formatting can help


distinguish winners from the losers

Can also be used to highlight cells


based on specific criteria
Data validation

Drop down lists ensure that data is


consistent through the data set.

Can also be used to restrict the


type of data you can enter
48

Thank You

©2018

You might also like