You are on page 1of 18

LEARNING PROGRESS

REVIEW
Datalchemist

Classification - Internal
2

MEET US!
ADAM YAZLI PUTRA
AJENG YUNITA​

ADILAH RUMMANAH AFIRA SEPTRIA AGUS NURDANI​

Classification - Internal
OUTLINE
• Data Analytics
• Data Analytics Overview
• Data Analytics Workflow and CRISP-DM
Framework
• Level of Analytics
• Data Analytics Practitioner

• Understanding Excel and Function


• Basic Ms. Excel
• Logical and Basic Function

• Analysis with Excel

Classification - Internal
OVERVIEW

Data analytics is the process of


examining, cleaning,
transforming, and modeling From Data to Wisdom
DATA data with the goal of discovering
useful information: Wisdom

ANALYTICS
Evaluated understanding
• Find trends Knowledge
• Uncover opportunities
Application of data and information;
• Predict actions, triggers, or answers “how” and “why” question
events Information
• Make decisions
Data that are processed to be useful
provide answers to “who”, “what”,
“where” and “when” question

Data
Conceived of symbol, signs

Key Components Of Data Analytics Include:

Data Cleaning Interpretation


Data Data
and Data Analysis and Decision-
Collection Visualization
Preparation Making

Usually sourcing from Data Engineering, Data Data Analytics (Statistical methods, Descriptive & Decision making, based on
Information Warehousing help us Diagnostics Analysis, Predictive Model, etc) help us data, or even better: data-
Technology convert Data to convert Information to Knowledge by visualizing and driven, give us Wisdom
Information from the Knowledge
taking insights
Classification - Internal
DATA ANALYTICS WORKFLOW & CRISP DM FRAMEWORK
The data analytics workflow typically consists of several key stages, each
contributing to the overall process of deriving insights from data.
Common framework includes the following stages:

Data Analytics Workflow

Define Objectives and Set Clear Measurement


Understand what you aim to achieve and what questions
you want to answer through the data by knowing what and
how to measure

Data Collection:
• Determine the sources of data needed for analysis.
• Collect and gather the required data

Data Cleaning and Preprocessing


Identify and handle missing values, outliers, and errors in
the data. Then, convert and format the data as needed this may
need transformation, encoding, etc.

Data Analysis, Interpretation, and Data Visualization


Apply statistical methods or other technique tools
(machine learning algorithms) to test hypotheses, derive
insights or predictive modelling

Communicate Results
Present the findings to relevant stakeholders, providing
insights to support decision-making processes.

Classification - Internal
DATA ANALYTICS WORKFLOW & CRISP DM FRAMEWORK
Which stands for CROSS-INDUSTRY STANDARD PROCESS FOR DATA
MINING, is a widely used process model for data mining and analytics projects.

Define the problem


or goal in business
terms. Objective

It provides a structured
approach to guide
practitioners through
the stages of a data
mining project, from
understanding the
Develop a
business problem to
strategy for
deploying the deploying the final
model into the model.
business
environment.

The CRISP-DM process is


iterative, and it allows
for revisiting and revising
Select appropriate modeling
techniques and evaluate, validate
earlier stages as needed.
models using test data.

Classification - Internal
LEVEL OF ANALYTICS

Presciptive Analytics not only


predicts future outcomes but also
suggests actions to optimize
outcomes (unearthing new
Descriptive analytics : strategy)
involves the examination
of historical data to It provides recommendations on
understand what what actions to take to achieve
happened. desired results.

MethodBasic statistical Example: Recommending


measures, data marketing strategies to maximize
visualization, dashboards. revenue (causality)

Diagnostic analytics goes beyond


describing historical data and aims to Predictive Analytics uses statistical
understand why certain events occurred. algorithms and machine learning techniques
It involves identifying the factors that to identify future outcomes based on
contributed to specific outcomes. historical data.

Methods: Drill-downs, data discovery, root Methods: Regression analysis, machine


cause analysis. learning models, forecasting

Example: Analyzing factors that led to a Example: Sales forecasting (statistical


spike or drop in sales (advances statistics) learning)

Classification - Internal
DATA PRACTITIONER

Machine Learning
Visualization and Analytics
Data Engineering

Data Data Data BI Visualization Business


Data Scientist
Architect Engineer Analyst Analyst Analyst Analyst

• Problem solving
• Business decision strategu
• Analyze data, finding root cause
• Handling large-scale data processing • Utilize statistical analysis and analysis, and recommend business
and ensuring data availability visualization techniques to present decision
• Implement ETL (Extract, Transform, findings in a meaningful way. • Define business strategy in the future.
Load) processes to clean, transform, • Design and create dashboards and • Descriptive, Prescriptive and Predictive
and integrate data. reports that visually represent key
• Set up and manage the infrastructure performance indicators (KPIs) and Tools : SQL, Excel, SAS, Pentaho, Spark,
for data storage and processing other relevant metrics. Hadoop, Domain Knowledge
• Work closely with data scientists, • Focusing in analytic descriptive
analysts, and other stakeholders to and report summary
understand data requirements and • Descriptive, diagnostic
provide the necessary infrastructure
and tools for analysis. Tools : SQL, Excel, R, Python, Tableau,
Power BI, Google Data Studio, Domain
Tools : SQL, Excel, SAS, Pentaho, Spark, Knowledge
Hadoop, Domain Knowledge
Classification - Internal
OVERVIEW
Text Cell formatting
formatting

Column

Understanding
Excel and Row
Cell

Function Worksheets

Basic Function (a few example)


Sum Average Count
• Adds up all the numbers in a • Calculates the average of a • Counts the number of cells that
range. range of numbers. contain numbers in a range.

•Example =SUM(A1:A10) •Example: =AVERAGE(A1:A10) •Example: =COUNT(C1:C8)

IF COUNTIF SUMIF
• Performs a logical test and • Counts the number of cells • Adds up all the numbers in a
returns one value if the test is within a range that meet the range that meet a specified
true and another if false. given condition. condition.

•Example: IF(A1>A10,”Yes”,”No”) •Example: =COUNTIF(C1:C10, •Example: =SUMIF(A1:A10, “>10


“>50”)
Classification - Internal
LOGICAL FUNCTIONS
IF Function: OR

• It performs a logical test and returns one value if • The OR function returns TRUE if at least
the test is true and another value if the test is false. one of the conditions specified is true;
• Syntax: =IF(logical_test, value_if_true, value_if_false) otherwise, it returns FALSE.
• Syntax: =OR(condition1, condition2, ...)
Understanding
• Example: =IF(A1>10, "Yes", "No")
• Example: =OR(A1>10, B1<20)

Excel and IFERROR: AND:


Function • The IFERROR function allows you to handle errors • The AND function returns TRUE if all the
in your formulas. It returns a specified value if conditions specified are true; otherwise, it
the formula results in an error and the result of returns FALSE.
the formula if there is no error. • Syntax: =AND(condition1, condition2, ...)
• Syntax: =IFERROR(formula, value_if_error) • Example: =AND(A1>10, B1<20)
• Example: =IFERROR(A1/B1, "Error in
calculation")

NOT:

• The NOT function returns TRUE if the condition


specified is false and vice versa.
• Syntax: =NOT(logical_test)
• Example: =NOT(A1>10)

Classification - Internal
TEXT FUNCTIONS
CONCATENATE (or CONCAT): LOWER:

• Combines two or more text strings into one. • Converts all letters in a text string to lowercase.
• Syntax: =CONCATENATE(text1, [text2], ...) • Syntax: =LOWER(text)
• Example: =CONCATENATE(A1, " ", B1) • Example: =LOWER(A1)

Understanding
Excel and RIGHT: LEN:

Function • Returns a specified number of characters from the


end of a text string.
• Returns the number of characters in a text
string.
• Syntax: =RIGHT(text, num_chars) • Syntax: =LEN(text)
• Example: =RIGHT(A1, 3) • Example: =LEN(A1)

LEFT: MID:

• Returns a specified number of characters from the • Returns a specific number of characters from a
beginning of a text string. text string, starting at a specified position – as per
• Syntax: =LEFT(text, num_chars) character decide.
• Example: =LEFT(A1, 5) • Syntax: =MID(text, start_num, num_chars)
• Example: =MID(A1, 3, 4)

SUBSTITUTE: TRIM:

• Replaces occurrences of a specified substring with • Removes leading and trailing spaces from a text
another substring in a text string. string and EXCEPT a single space between
• Syntax: =SUBSTITUTE(text, old_text, new_text, words.
[instance_num]) • Syntax: =TRIM(text)
• Example: =SUBSTITUTE(A1, "apple", "orange") • Example: =TRIM(A1)

Classification - Internal
LOGICAL FUNCTIONS

Sum SUMIF

• Adds up all the numbers in a range. • Adds up all the numbers in a range that meet a
Understanding • Example =SUM(A1:A10) specified condition.
• Example: =SUMIF(A1:A10, “>10

Excel and
Function Count COUNTIF

• Counts the number of cells that contain • Counts the number of cells within a range that
numbers in a range. meet the given condition.
• Example: =COUNT(C1:C8) • Example: =COUNTIF(C1:C10, “>50”)

IF Average

• Performs a logical test and returns one value if


the test is true and another if false. • Calculates the average of a range of numbers.
• Example: IF(A1>A10,”Yes”,”No”) • Example: =AVERAGE(A1:A10)

Classification - Internal
Analysis with Excel

LOOKUP FUNCTION

• LOOKUP FUNCTION is one of the basic functions in searching


and referencing in Microsoft Excel

• LOOKUP is used to perform a rough match search in a one-


row or one-column range and return the corresponding value
from another one-row or one-column range.

• The LOOKUP FUNCTIONS that will be used are the HLOOKUP,


VLOOKUP, INDEX AND MATCH.

Classification - Internal
Analysis with VLOOKUP & HLOOKUP
excel
➢ VLOOKUP ➢ HLOOKUP

VLOOKUP is a function in HLOOKUP is a function in


Microsoft Excel that helps Microsoft Excel that helps
you find things in a table or you find things in a table or
a range by row. a range by column.

Classification - Internal
INDEX & MATCH
➢ INDEX ➢ MATCH
Analysis with INDEX function is used to MATCH function is used to
return a value or the reference locate the position of a lookup
excel to a value from within a table or value in a row, column, or
range. There are two ways to table. The function searches for
use the INDEX function: If you a specified item in a range of
want to return the value of a cells, and then returns the
specified cell or array of cells, relative position of that item in
see array form. If you want to the range.
return a reference to specified
cells, see reference form

Classification - Internal
A chart or graph is a visual representation of
Chart And data that helps to convey information in a
clear and concise manner. There are many
Visual different types of charts and graphs, each with
its own strengths and weaknesses.

Some common types of charts and graphs


include:
1. Bar chart
2. Line chart
3. Pie chart
4. Scatter plot
5. Heatmap
6. Treemap

Classification - Internal
VISUALIZATION TYPE 17

BAR CHART PIE CHART


LINE CHART

A bar chart is a graph that represents data with


A line chart is a graph that displays A pie chart is a circular graph that is divided
rectangular bars. The length or height of each
information as a series of data points into slices to represent numerical proportions.
bar is proportional to the value it represents.
connected by straight lines. Line charts are Pie charts are useful for showing how
Bar charts are useful for comparing data across
useful for showing trends over time. different parts make up a whole.
different categories.

SCATTER PLOT
MAP CHART TREEMAP

A Map Chart is a graphical A treemap is a hierarchical chart that displays data as


A scatter plot is a graph that displays the representation of data that displays nested rectangles. Each rectangle represents a
relationship between two variables. Each information on a map. different level in the hierarchy, and the size of
point on the graph represents a pair of values for the rectangle is proportional to the value it
the two variables. Scatter plots are useful for represents. Treemaps are useful for showing how
identifying patterns and trends in data. different parts make up a whole.
Classification - Internal
THANK YOU

Classification - Internal

You might also like