You are on page 1of 33

FDS - Module 5

Data Visualization
Data Visualization
• Data visualization is the graphical representation of information and data.
• By using visual elements like charts, graphs and maps, data visualization tools
provide an accessible way to see and understand trends, outliers, and patterns in
data
• Data visualization sits right in the middle of analysis and visual storytelling.
• The advantages and benefits of good data visualization:
o Our eyes are drawn to colors and patterns.
o we quickly see trends and outliers
o It’s storytelling with a purpose
• Effective data visualization is a delicate balancing act between form and function.
• The plainest graph could be too boring to catch any notice or it make tell a
powerful point.
• The most stunning visualization could utterly fail at conveying the right message
or it could speak volumes.
• The data and the visuals need to work together, and there’s an art to combining
great analysis with great storytelling.

History of Tableau
• Tableau was founded by Pat Hanrahan, Christian Chabot, and Chris Stolte from
Stanford University in 2003.
• The main idea behind its creation is to make the database industry interactive and
comprehensive.
• It is a user-friendly tool that can help you in creating graphs, charts, maps, reports

The importance of Tableau in Data science and Data visualization


• The world will generate 50 times the amount of data in 2020 as compared to 2011.
• That’s a dramatic rise in data generation all over the world with time.
• And there are a lot of secrets hidden within this data
• Secrets that could improve technologies, solve many puzzles, and lead to greater
human advancement.
• But for that to happen, we need to understand this data first! Most of the data
generated in this world is messy, complicated, unclear data obtained from
different sources in different forms and it is very difficult to understand.
• So, this data needs to be cleaned, organized and then visualized so that any
patterns and insights in the data can be understood easily.
• And Tableau is a data visualization tool that is helping with that.
• Tableau is used to first clean and shape the data and then convert it into
interactive data visualizations that help companies to obtain insights from data
using data storytelling.
• Tableau also allows you to prepare, clean, and format data of all types and ranges
and then create data visualizations to obtain actionable insights that can be
shared with other users.
• You can use data queries to obtain insights from your visualizations and also
manage metadata using Tableau.
• In fact, it is a lifesaver for many people in Business Intelligence as it allows you to
handle data without having great technical knowledge.
• Many organizations use Tableau such as Amazon, Lenovo, Walmart, Accenture etc
• There are different Tableau products that are aimed at different types of users,
whether they be individuals or organizations.

Advantages of Tableau

1. Great visualizations:
• With Tableau, you can work with more unordered data and create varieties
of visualizations with the help of the in-built features offered by Tableau.
• Moreover, you will be able to achieve great context, several ways of drilling
the data and exploring the data within minutes.

2. Detailed Insights:
• Tableau helps in analyzing future data without any future goals in mind.
• You will be able to explore visualizations and observe data from different
approaches.
• With hypothetical visualizations and a feature of adding components for
comparison and analysis, you can frame ‘what-if’ queries and work on the
data accordingly.
3. User-friendly Approach:
• The user-friendly feature is the major strength of Tableau.
• This feature demonstrates the ability of an individual to work without any
technical or coding knowledge.
• Since Tableau offers most of its features in a drag-and-drop form and each
visualization is built-in and self-depicting, any newbie can work without any
prior set of skills.

4. Working with the different Data Sources:


• The reason why Tableau is required by various enterprises these days is that
data can come from any source in today’s data-driven world.
• This feature makes Tableau more powerful than Business Intelligence and
Analytics tools.
• Tableau allows you to connect different data sources, data warehouses,
cloud files, big data, spreadsheet data, non-relational, and several other
types of data.
• Tableau can blend every kind of data to help organizations produce
attractive visualizations.

5. Adding Data Set is easy: Tableau can add new data sets easily which can be
automatically blended with Tableau using common fields.

Architecture of Tableau
• Tableau Server is designed to connect many data tiers. It can connect clients from
Mobile, Web, and Desktop.
• Tableau Desktop is a powerful data visualization tool. It is very secure and highly
available.
1. Data server:
o The primary component of Tableau Architecture is the Data sources which can
connect to it.
o Tableau can connect with multiple data sources.
It can blend the data from various data sources.
o It can connect to a spreadsheet, a database, and a web application at the same
time. It can also make relationships between different types of data sources.
2. Data connector:
o The Data Connectors provide an interface to connect external data sources
with the Tableau Data Server.
o Tableau has in-built SQL/ODBC connector. This ODBC Connector can be
connected with any databases without using their native connector.
o Tableau desktop has an option to select both extract and live data.
o The user can easily switch between live and extracted data.
o Real-time data or live connection: Tableau can be connected with real data by
linking to the external database directly.
o Extracted or in-memory data: Tableau is an option to extract the data from
external data sources
3. Components of Tableau server: Different types of components of the Tableau
server are:
o Application Server: The application server is used to provide the authorizations
and authentications. It handles the permission and administration for mobile
and web interfaces.
o VizQL server: VizQL server is used to convert the queries from the data source
into visualizations.
o Data server: Data server is used to store and manage the data from external
data sources. It is a central data management system.
4. Gateway:
o The gateway directs the requests from users to Tableau components.
o When the client sends a request, it is forwarded to the external load balancer
for processing.
o The gateway works as a distributor of processes to different components.
o In case of absence of external load balancer, the gateway also works as a load
balancer.
o For single server configuration, one gateway or primary server manages all the
processes.
o For multiple server configurations, one physical system works as a primary
server, and others are used as worker servers.
o Only one machine is used as a primary server in Tableau Server environment.
5. Clients:
o The visualizations and dashboards in Tableau server can be edited and viewed
using clients like a web browser, mobile applications, and Tableau Desktop.
o Web Browser:
▪ Web browsers like Google Chrome, Safari, and Firefox support the
Tableau server.
▪ The visualization and contents in the dashboard can be edited by using
these web browsers.
o Mobile Application:
▪ The dashboard from the server can be interactively visualized using
mobile application and browser.
▪ It is used to edit and view the contents in the workbook.
o Tableau Desktop:
▪ Tableau desktop is a business analytics tool.
▪ It is used to view, create, and publish the dashboard in Tableau server.
▪ Users can access the various data source and build visualization in
Tableau desktop.
Tools of Tableau
1. Tableau Desktop
• Tableau Desktop is a desktop application that is the fundamental offering of
Tableau.
• You can use Tableau Desktop to create interactive data visualizations, perform
unlimited data exploration, create charts and dashboards from the data, etc.
• It also allows connections to the data on the cloud or in local memory, whether
the data is an SQL database, spreadsheet data, Big Data, or data on Google
Analytics, Salesforce, etc.

2. Tableau Public
• Tableau Public is a free software provided by Tableau.
• It is used to create data visualizations and data charts that can be embedded
into blogs, web pages, etc. or transferred using social media.
• You can create visualizations in Tableau Public using the Tableau Desktop Public
Edition.
• However, the downside of Tableau Public is that the data visualizations created
here are accessible by everyone on the internet and cannot be saved privately.
• So, this product is best for students who are still learning Tableau, hobbyists,
journalists, bloggers, etc. who are fine with creating public data visualizations.

3. Tableau Online
• Tableau Online can be used to create data visualizations, storyboards, data
charts, etc. that are fully hosted on the cloud and not on local servers.
• You can create your visualizations and share them with other people online
using a web browser or the Tableau mobile app.
• There is no need to worry about software upgrades or hardware scaling or
server configuration while using Tableau Online as it is a totally online hosted
service.
• So, you can set up Tableau Online in minutes and start reaping the benefits as a
single user or company.

4. Tableau Server
• Tableau Server is a server product for your organization that needs to be
installed on a Windows or Linux server.
• This is widely used in the industrial domain with many IT companies installing
Tableau Server to create interactive data visualizations, perform unlimited
data exploration, create charts and dashboards from the data, etc. while being
sure that their data is safe and secure.
• Tableau Server can integrate with existing security protocols in companies such
as Kerberos, Active Directory, OAuth, etc. to ensure data security while also
providing data insights.

Workbooks and Sheets


• A workbook contains sheets. A sheet can be a worksheet, a dashboard, or a story.
1. Worksheet: A worksheet is where you build views of your data by dragging and
dropping fields onto shelves.
Creating a Worksheet:
Click on the ‘New Worksheet’ button in the bottom left corner of the screen.
2. Dashboard: A dashboard is a collection of views from multiple worksheets.
Creating a Dashboard
i) Click on the ‘New Dashboard’ button to create a new dashboard.
ii) From the Sheets list at left, drag views to your dashboard at right.
iii) Drag and drop a Sheet from the Sheets column in the Dashboard tab to the
canvas.

Create Dashboard Layouts for Different Device Types


Dashboards can include layouts for different types of devices that span a wide
range of screen sizes. Device layouts appear on the Dashboard tab, under Default.
i) Open a Dashboard
ii) On the Dashboard tab on the left, click Device Preview.
iii) In device preview mode, these options appear above the dashboard:
iv) Click through the Device types and Models and explore different screen sizes.
v) Choose a Device type, such as Tablet.
vi) In the upper-right corner, click the Add Layout button for the device type you
selected (for example, Add Tablet Layout).
3. Story: contains a sequence of worksheets or dashboards that work together to
convey information. Create a new Story by clicking on the ‘New Story’ button

Tableau Data types


The data type of a field is identified in the Data pane by the icons shown below:
1. String Data type: The collection of characters give rise to the string data type. A
string is always enclosed within a single or double inverted comma. The samples of
the string are — “Rahul” and “Hi! How are you?” It is of 2 types:
• Char String type: Char data type normally stores alphanumeric data values
having fixed lengths. If the user enters a string value which is greater than the
fixed length of the Char data type, then the system returns an error.
• Varchar String type: Varchar data type also stores alphanumeric data values.
As the name suggests, Varchar stores data values having a variable length. So,
the user can enter as many string values as they want, without facing any
restriction from the system.
2. Numeric Data type:
• This data type consists of both integer type or floating type.
• Out of which users prefer to use integer type over floating type, as it is
difficult to accumulate the decimal point after a certain limit.
• It also contains a function known as the Round() function which can be used
in rounding up float values.
3. Date and Time Data type:
• Tableau supports all forms of date and
o date and time values like dd-mm-yy, or mm-dd-yyyy, etc.
o time date values like a decade, year, month, hour, minutes, seconds, etc.
• Whenever the user enters data and time values, Tableau automatically
registers it under Date data type and Date & Time data value.

4. Boolean Data type:


• As a result of relational calculations, boolean data type values are formed.
The boolean data values are either True or False.
• Many a time the result of a relational calculation is unknown, in this situation
Null data values are used.

5. Geographic Data type:


• All values that are used in maps, comes under geographic data type.
• The example of geographic data values is country name, state name, city,
region, postal codes, etc.

6. Cluster or Mixed Data type:


• Sometimes data set contains values having a mixture of data types.
• Such values are known as cluster group values or mixed data values.
• In such a situation, users have the option either to handle it manually or
allow Tableau to operate on it.
Change the Data Type for a field in the Data Source page
• Sometimes Tableau incorrectly interprets the data type of a field.
• For example, Tableau might interpret a field that contains dates
as an integer data type, rather than a date data type.
1. Click the data type icon for the field
2. Choose a new data type from the drop-down list:

Change the data type for a field in the Data pane


• Click the icon to the left of the field name, and then choose a new
data type from the drop-down list.

\
Aliases in Tableau
• You can create aliases (alternate names) for members in a dimension so that their
labels appear differently in the view.
• Aliases can be created for the members of discrete dimensions only. They cannot
be created for continuous dimensions, dates, or measures
Steps to create an Alias:
1. In the Data Source page, select a dimension, say ‘Segment’
2. Click on the dropdown and select ‘Aliases’
3. A dialog box ‘Edit Aliases’ appears.

Sort fields in the Data pane


• When organizing the Data pane with or without folders, you can have Tableau sort
the items. These Sort by options are also located in the Data pane menu.
• Sort options are only available for relational data sources—not for
multidimensional (cube) data sources.
• You can sort by one of the following options:
o Sort by Name – lists the dimensions and measures in alphabetical order
according to their field aliases.
o Sort by Data Source Order – lists the dimensions and measures in the order
they are listed in the underlying data source.
\

Customize the Data Pane


1. Find fields
• You can search for fields, folders and hierarchies in the
Data pane.
o Click the Find Field icon at the top of the Data
pane (Ctrl + F) and enter the name of the item you
want to search for.
o As you type in the search box, search filters the contents of the Data pane to
show all fields, folders or hierarchies that contain the typed string.
2. Rename Fields
• You can rename fields in the Data pane. For example, you could rename a field
named Customer Segment in the data source to be Business Segment in Tableau.
• Renaming a field does not change the name of the field in the underlying data
source, rather it is given a special name that appears only in Tableau workbooks.
o Click field name in the Data pane and hold the mouse button down until the
field name is shown in an edit box:
o Type the new name for the field and press Enter.

3. Hide or Unhide Fields


• You can selectively hide or show fields in the Data pane. To hide a field, right-click
the field and select Hide.
• When you want to change your fields from hidden to visible, select Show Hidden
Fields on the Data pane menu.
• Select Hide All Unused Fields on the Data pane menu to quickly hide all of the
fields that are not being used in the workbook.
• The hidden fields are then shown in grey in the Data pane. You can then select one
or more hidden fields, right-click and select Unhide.

\
Cards and Shelves
• Every worksheet in Tableau contains shelves and cards, such as Columns, Rows,
Marks, Filters, Pages, Legends, and more
• By placing fields on shelves or cards, you:
o Build the structure of your visualization.
o Increase the level of detail and control the number of marks in the view by
including or excluding data.
o Add context to the visualization by encoding marks with color, size, shape,
text, and detail
1. Marks Card
• The Marks card is a key element for visual analysis in Tableau.
• As you drag fields to different properties in the Marks card, you add context and
detail to the marks in the view.
• You can encode your data with color, size, shape, text, and detail.
• After you add a field to the Marks card, you can click the icon next to the field to
change the property it is using. Example:
i) Colour option: Before, all the circles were blue. After adding ‘Category’ field to
Colour Mark type, we get the circles in different colour, representing the categories.

ii) Size option: Provides additional information (a size) for each mark
iii) Detail Property: Adding a dimension on the Detail marks is useful to breakdown
the data you already have in your chart/table with this dimension too.
iii) Label option: Used to add mark labels or text to the visualization.

iv) Shape option: Used to represent shapes corresponding to various categories.

2. Filters shelf
• The Filters shelf allows you to specify which data to include and exclude.
• You can filter data using measures, dimensions, or both at the same time.
Types of Filters in Tableau
• There are different types of filters in a tableau that can be used to organize data
based on predefined conditions and use them for data visualization like
1. Extract Filters 2. Data Source Filters
3. Context Filter 4. Dimension filter
5. Measure Filters 6. Table Filters
Such ability to filter large data sets helps prepare
o for analysis, including removing irrelevant data records,
o reducing data sizes for faster processing, and more.
• The filters are required to highlight any underlying insights that can be derived
from the data upon visualizing in a readable, actionable format.

1. Extract Filters:
o Tableau allows you to have a data source, either live or an extracted or
copied one.
o Live data source would mean, any changes done in the data source will
almost immediately reflect in Tableau.
o Extract, on the other hand, is a copy of the original data and will not reflect
any changes made to the data source unless you decide to reload the data
once again after the changes to the original data are saved.
o When you’re loading in your data you can choose to extract it, i.e., saving a
snapshot of how it looks in your workbook and ultimately reducing the
number of times Tableau queries the data source.
o Steps to add an Extract Filter:
1. Drag and drop the table into the canvas.
2. Select an Extract Connection. Then click on Edit.
3. Now, click on Add in the Extract Data dialog box.

4. Select the filter on which filtering is to be done.


5. Select the filter. Then, a new dialog box open, to select the data to be
filtered. Select the data, then click OK
6. Now, the the ‘Extract Data’ dialog box looks like this: Click OK

2. Data Source Filter:


o Data Source Filters
▪ reduce the amount of data being fed into Tableau and
▪ restrict what data the viewer sees.
o With certain access rights, the viewer can view all of the underlying data, so
if not done in the data source, Data Source Filters can be used to control
sensitive data.
o One thing that is important to note is that, Extract Filters and Data Source
Filters are not linked.
o So, if you change back to a live connection your Data Source Filters will still
be intact.
o Steps to add a Data Source Level Filter:
1. Drag and drop the table into the canvas.
2. Select a Live Connection.
3. In the canvas, on the right, you will the ‘Add’ beneath the Filter text.

4. Now, the ‘Edit Data Source Filters’ dialog box opens.

5. Click on Add to select the Filter on which filtering is to be done.

6. Select the filter. Then, a new dialog box open, to select the data to be
filtered. Select the data, then click OK
7. Now, the the ‘Edit Data Source Filters’ dialog box looks like this: Click OK
3. Context Filter:
• Context Filters are used to improve performance of views, filters, and
queries run on a data source.
• All filters by default are applied independently of any other filters in
Tableau, which means each filter accesses the entire data set without regard
to any other filters.
• A context filter can be applied to change this behaviour.
• You could have a context filter that is run before any other filters (run first),
and the rest of the filters are applied on top of the data returned after
context filtering.
• Steps to create a Context Filter:
1. Drag and drop a table into the canvas in the Data Source page.
2. Create a new ‘Worksheet’ in the bottom-left corner.
3. Drag a field from the Data pane to the Filters shelf.

4. The Filter dialog box opens so you can define the filter.
5. To create a context filter, select Add to Context from the context menu
of an existing categorical filter.

4. Dimension filter:
o Dimensions in Tableau are fields (columns) that are independent, typically
any field that contains categorical or qualitative information.
o When a dimension is used to filter the data in a worksheet, it is called as
Dimension filter. (Ex: applying Department as "CSE" filter on NHCE database)
o It is a non-aggregated filter (does not support AVG, SUM etc) where a
dimension, group, sets and bin can be added.
o The members present in a dimension can be included or excluded from the
list using this filter.
o Dimension filter can be shown in a sheet or dashboard to change the filter
condition dynamically.
o Steps to create a dimension filter: Steps 1 - 3 in Context Filter

5. Measure Filters:
o Measures are typically fields/columns that contain quantitative data.
o In this filter, you can apply the various operations like SUM, AVG, Median,
Standard Deviation, and other aggregate functions.
o Every time you drag the data you want to filter, you choose a specific setting.
o Steps to create a measure filter:
1. When you drag a measure from the Data pane to the Filters shelf in
Tableau Desktop, the following dialog box appears:

2. Select how you want to aggregate the field, and then click Next.
3. In the next dialog box, we can create four types of quantitative filters:
Range of Values, At Least, At Most, Special. Select a filter and click OK.
6. Table Filters:
o The table filter is used to process is the table calculation that gets executed
once the data view has been rendered.
o With this filter, you can quickly look into the data without any filtering of
the hidden data.
Tableau Data Connections
• There are 2 types of data connections in Tableau.
o LIVE
o EXTRACT (IN-MEMORY)
• Live connection is for high volume data and sends logic to data.
• Extract connection brings data in to memory, i.e., sends data to the logic.
• There are no standard rules to decide on which connection to choose.
• Depending on the situation and resources, we must choose the connection type
that helps to provide responsive report
• In general, high-volume data and frequently changing data may be eligible for
LIVE connection.
• Low volume data and less frequently changing data may be eligible for EXTRACT
connection. Again, it all depends on the available storage, memory, cache and
other resources.

Tableau Calculations
• Calculated fields allow you to create new data from data that already exists in
your data source.
• When you create a calculated field, you are essentially creating a new field (or
column) in your data source, the values or members of which are determined by a
calculation that you control.
• This new calculated field is saved to your data source in Tableau, and can be used
to create more robust visualizations.
• But don't worry: your original data remains untouched.
• You can use calculated fields for many, many reasons. Some examples might
include:
o To segment data
o To aggregate data
o To convert the data type of a field, such as converting a string to a date.
o To filter results
o To calculate ratios
Types of Charts in Tableau
1. Bar Charts
o Bar charts are definitely one of the most, if not the most common data
visualizations across all Business Intelligence (BI) platforms.
o You can quickly highlight differences between categories, show trends and
outliers, and reveal historical highs and lows at a glance.
o Bar charts are simple, yet, effective, especially when you have data that can
be split into many categories.
o Steps:
1. Connect to the data source like an Excel file
2. Drag the Order Date dimension to Columns and drag the Sales measure
to Rows.

3. Tableau uses Line as the automatic mark type because you added a date
dimension.
4. On the Marks card, select Bar from the drop-down list. The view changes to a
bar chart.
o If a dimension is in columns and a measure in rows, then we get vertical bar
graphs. If vice-versa, then horizontal bar graphs.
2. Line Chart
o The line chart, or line graph, is another familiar method for displaying data.
o It connects several distinct data points, presenting them as one continuous
evolution.
o The result is a simple, straightforward way to visualize changes in one value
relative to another.
3. Area Chart
o Area charts represent any quantitative data over various periods of time.
o It is basically a line graph where the area between line and axis is generally
filled with color.
o Steps:
1. Connect to a data-source
2. Navigate to a new worksheet.
3. From the Data pane, drag Order Date to the Columns shelf.
4. From the Data pane, drag Quantity to the Rows shelf.
5. From the Date pane, drag Ship Mode to Color on the Marks card.
6. On the Marks card, click the Mark Type drop-down and select Area.

4. Scatter Plots
o Scatter plots are an effective way to give you a sense of trends,
concentrations, and outliers that facilitate deeper investigations of your
data.
o A scatter plot presents lots of distinct data points on a single chart.
o The chart can then be enhanced with analytics like cluster analysis or trend
lines.
o Scatter plots are used to show if one variable is a good predictor of another,
or if they tend to change independently.
o This type of chart easily lends itself to many types of analysis.
o Steps:
1. Drag the Profit measure to Columns.
2. Drag the Sales measure to Rows.

3. Drag the Category dimension to Color on the Marks card.


4. Drag the Region dimension to Detail on the Marks card.
5. Pie Charts
o Pie charts are powerful for adding detail to other
visualizations.
o The angle of the pie determines the measured value.
o Different colours can be assigned to pie to represent
the members in a dimension (qualitative fields).
o Steps:
1. Drag the Sales measure to Columns
2. Drag the Sub-Category dimension to Rows.
Note: default chart type is a bar chart.
3. Click Show Me on the toolbar
4. Then select the pie chart type.
6. Bubble Charts
o Although bubbles aren’t technically their own type of
visualization, using them as a technique adds great
detail to scatter plots or maps.
o Varying the size and color of circles creates visually
compelling charts that presents large volumes of data
at once.
o Bubbles can add more detail to the traditional two-axis
chart, highlighting the relationship between three or
more variables, without overwhelming the viewer.
o Steps:
1. Drag the Category dimension to Columns.
2. Drag the Sales measure to Rows.
3. Click Show Me on the toolbar, then select the bubbles chart type.
7. Histogram
o Histograms show how your data is distributed across distinct groups.
o By grouping your data into specific categories (also known as “bins”), then
plotting the number of records in a category as a vertical bar, you can
quickly see which bins the majority of your data falls in.
o The histogram is your best option for visualizing how data falls into
categories.
o For example, the number of customers by company size,
student performance on an exam, and frequency of a
product defect.
o Steps:
1. Drag Quantity to Columns.
2. Click Show Me on the toolbar, then select the histogram
chart type.
3. Drag Segment to Color.
7. Heat Maps
o Heat maps are a great way to compare
data across two or more categories using
color.
o Patterns guide viewers around the chart,
quickly showing them where the
intersection of categories is strongest and
weakest.
o Heat maps are best for presenting data sets with lots of categories.
o This chart type can pack hundreds of comparisons into a small area, and still
be easy to understand at a glance.
o For example, sales quota assessment, actual spending vs. budget, and
performance spectrum (great/good/poor).
o Steps:
1. Drag Infant Mortality to the Columns shelf.
2. Drag the Life Expectancy Female to the Rows shelf. Now you have a one-
mark scatter plot.
3. Both Infant Mortality and Life
Expectancy are listed as a Sum,
rather than average. Right click
on both of these measures and
to change Measure(Sum)
to Average.
4. Drag the Country dimension
to Details on the Marks card.
5. On the Marks card, select Density from
the menu to change this scatter plot into
a density chart.
6. Tableau created a density chart by
overlaying marks, called kernels, and
color-coding where those kernels
overlap. The more overlapping data
points, the more intense the color is.
8. Tree Maps
o Treemaps relate different segments
of your data to the whole.
o By nesting rectangles within others,
treemaps show how individual data
points fit in a hierarchy.
o As the name of the chart suggests,
each rectangle is subdivided into
smaller rectangles, or sub-branches, based on its
proportion to the whole.
o They make efficient use of space to show the
percentage for each category.
o When a dataset can be broken down in many
different ways, a treemap might be the best way to
show what categories most of the data falls in.
o For example, storage usage across computer machines, managing the
number and priority of technical support cases and comparing fiscal budgets
between years.
o Steps:
1. Drag the Sub-
Category dimension to Columns.
2. Drag the Sales measure to Rows.
Tableau displays a bar chart
3. Click Show Me on the toolbar,
then select the treemap chart type.
9. Maps
o These are a no-brainer for visualizing any kind of location information,
whether it’s postal codes, state abbreviations, country names, or your own
custom geocoding.
o Maps highlight geographic trends in a format everyone knows and
understands.
o If you have geographic information associated with your data, maps are a
simple and compelling way to show how location correlates with trends in
your data.
o For example, insurance claims by state, product export destinations by
country, car accidents by zip code, and custom sales territories.

10. Pareto Chart


o It is a dual-axis combination chart used to highlight dimension members that
are having the biggest impact to the measure in question.
o A Pareto chart consists of both bar and line graph.
o On the primary axis, bars are used to show the raw quantities for each
dimension member, sorted in descending order.
o On the secondary axis, a line graph is used to show the cumulative total in
percent format.
o The basic purpose of using this Chart is to identify the contribution of
members present in a field.
Automatically Build Views with Ask Data
• Ask Data is Tableau’s NLP feature which
lets you type a question in common
language and instantly get a response right
in Tableau.
• Answers come in the form of automatic
data visualizations
• If the default viz doesn't fully reveal your
data, click the menu at upper right, and
choose from these supported viz types:
Tableau File Extensions
1. Tableau Workbook (.twb)
• A workbook in Tableau is a file that contains sheets, dashboards, etc.
• So, this particular Tableau file type contains information about worksheets
and dashboards present within a workbook.
• All the information regarding the fields, aggregation types, styles,
formatting, filters, etc is present in these files.
• One important thing to note about this Tableau file type is that we can only
create them if we are using live data connections and share them only with
the users having access to the same live data connection.
• The .twb files also contain metadata related to the existing data connection.
However, a .twb file does not contain actual data concerning the workbook.
2. Tableau Packaged Workbook (.twbx)
• The Tableau Packaged Workbook file type has both information about the
constituents of a workbook and the data extracted from the data source.
• The data extract taken from the source is present in the form of a .tde file.
• We can create them only if we are using live data connections
• You can use a .twbx file type in place of a .twb (Tableau Workbook) file when
you wish to share a workbook with a user who does not have access to the
live data connection.

3. Tableau Data Source (.tds)


• The Tableau Data Source files are files that contain all the necessary
information regarding a data connection made in Tableau.
• The Tableau Data Source files contain all the required information on setting
up a data connection along with the metadata of other specific
modifications made by the users.
• However, this Tableau file type only saves the information needed to
establish a connection with a data source but not the actual data.
• The details of the connection used to create the tableau report are stored in
this file.
• In the connection details, it stores the source type (excel/relational/sap, etc.)
as well as the data types of the columns.
4. Tableau Packaged Data source (.tdsx)
• This file is similar to the .tds file with the addition of data along with the
connection details.
• A Tableau Packaged Data Source file is a file that contains information of a
data source connection along with the data extracted from that source.
• The extracted data is saved as a .tde file and the
information on data-source as .tds file
• They are used when we want to share data and information about a data
source with a user who does not have access to the data source.

5. Tableau Data Extract (.tde)


• This file contains the data used in a .twb file in a
highly compressed columnar data format.
• This helps in storage optimization.
• It also saves the aggregated calculations that are applied in the analysis.
• This file should be refreshed to get the updated data from the source.
• These Tableau file types only contain a local copy of the entire or a subset of
data from its source.

6. Tableau Bookmark (.tbm)


• These files contain a single worksheet that is shared easily to be pasted into
other workbooks.
• These Tableau file types are most commonly used to save worksheets and
share them with others so that they can use it in their workbooks without
having to create a new worksheet from scratch.

7. Tableau Preferences (.tps)


• A Tableau Preference file contains all the information related to a
customized colour palette.
• This file stores the color preference used across all the workbooks. It is
mainly used for consistent look and feel across the users.
• You can create a custom colour palette or a theme and save it as a .tps file
so that you can use it all over the workbook uniformly at once.
• The Tableau Preference files have the extension as .tps and exist in XML
format.
8. Tableau Map Source (.tms)
• A Tableau Map Source file contains information about maps and its
elements for use in Tableau.
• As per the default settings, Tableau will fetch map details like background
and other layers from a certain map server or provider.

Tableau Joins
1. To create a join, connect to the relevant data source or sources.
2. Drag the first table to the canvas.

3. Select Open from the menu or double-click the first table to open the join canvas

4. Double-click or drag another table to the join canvas.


5. Click the join icon to configure the join. Add one or more join clauses by selecting a
field from one of the available tables used in the data source, choosing a join
operator, and a field from the added table.

6. When finished, close the join dialog and join canvas.


Join Types
In general, there are four types of joins that you can use in Tableau:

Join Type Result

Inner The result is a table that contains values that have matches in both
tables.
When a value doesn't match across both tables, it is dropped entirely.
Left the result is a table that contains all values from the left table and
corresponding matches from the right table.
When a value in the left table doesn't have a corresponding match in
the right table, you see a null value in the data grid.

Right The result is a table that contains all values from the right table and
corresponding matches from the left table.
When a value in the right table doesn't have a corresponding match in
the left table, you see a null value in the data grid.

Full outer The result is a table that contains all values from both tables.
When a value from either table doesn't have a match with the other
table, you see a null value in the data grid.
Example: Join on Customer ID
Order ID Customer ID Customer ID Customer Name
1 222 222 Rahul
2 777 777 Rohith
3 222 888 Aman
4 555 666 Shivanand
Inner Join
Order ID Customer ID Customer Name
1 222 Rahul
2 777 Rohith
3 222 Rahul
Left Join
Order ID Customer ID Customer Name
1 222 Rahul
2 777 Rohith
3 222 Rahul
4 555 null
Right Join
Order ID Customer ID Customer Name
1 222 Rahul
2 777 Rohith
3 222 Rahul
null 888 Aman
null 666 Shivanand
Full Outer Join
Order ID Customer ID Customer Name
1 222 Rahul
2 777 Rohith
3 222 Rahul
4 555 null
null 888 Aman
null 666 Shivanand

You might also like