Professional Documents
Culture Documents
REVIEW
Datalchemist
Classification - Internal
2
MEET US!
ADAM YAZLI PUTRA
AJENG YUNITA
Classification - Internal
OUTLINE
• Data Analytics
• Data Analytics Overview
• Data Analytics Workflow and CRISP-DM
Framework
• Level of Analytics
• Data Analytics Practitioner
Classification - Internal
OVERVIEW
ANALYTICS
Evaluated understanding
• Find trends Knowledge
• Uncover opportunities
Application of data and information;
• Predict actions, triggers, or answers “how” and “why” question
events Information
• Make decisions
Data that are processed to be useful
provide answers to “who”, “what”,
“where” and “when” question
Data
Conceived of symbol, signs
Usually sourcing from Data Engineering, Data Data Analytics (Statistical methods, Descriptive & Decision making, based on
Information Warehousing help us Diagnostics Analysis, Predictive Model, etc) help us data, or even better: data-
Technology convert Data to convert Information to Knowledge by visualizing and driven, give us Wisdom
Information from the Knowledge
taking insights
Classification - Internal
DATA ANALYTICS WORKFLOW & CRISP DM FRAMEWORK
The data analytics workflow typically consists of several key stages, each
contributing to the overall process of deriving insights from data.
Common framework includes the following stages:
Data Collection:
• Determine the sources of data needed for analysis.
• Collect and gather the required data
Communicate Results
Present the findings to relevant stakeholders, providing
insights to support decision-making processes.
Classification - Internal
DATA ANALYTICS WORKFLOW & CRISP DM FRAMEWORK
Which stands for CROSS-INDUSTRY STANDARD PROCESS FOR DATA
MINING, is a widely used process model for data mining and analytics projects.
It provides a structured
approach to guide
practitioners through
the stages of a data
mining project, from
understanding the
Develop a
business problem to
strategy for
deploying the deploying the final
model into the model.
business
environment.
Classification - Internal
LEVEL OF ANALYTICS
Classification - Internal
DATA PRACTITIONER
Machine Learning
Visualization and Analytics
Data Engineering
• Problem solving
• Business decision strategu
• Analyze data, finding root cause
• Handling large-scale data processing • Utilize statistical analysis and analysis, and recommend business
and ensuring data availability visualization techniques to present decision
• Implement ETL (Extract, Transform, findings in a meaningful way. • Define business strategy in the future.
Load) processes to clean, transform, • Design and create dashboards and • Descriptive, Prescriptive and Predictive
and integrate data. reports that visually represent key
• Set up and manage the infrastructure performance indicators (KPIs) and Tools : SQL, Excel, SAS, Pentaho, Spark,
for data storage and processing other relevant metrics. Hadoop, Domain Knowledge
• Work closely with data scientists, • Focusing in analytic descriptive
analysts, and other stakeholders to and report summary
understand data requirements and • Descriptive, diagnostic
provide the necessary infrastructure
and tools for analysis. Tools : SQL, Excel, R, Python, Tableau,
Power BI, Google Data Studio, Domain
Tools : SQL, Excel, SAS, Pentaho, Spark, Knowledge
Hadoop, Domain Knowledge
Classification - Internal
OVERVIEW
Text Cell formatting
formatting
Column
Understanding
Excel and Row
Cell
Function Worksheets
IF COUNTIF SUMIF
• Performs a logical test and • Counts the number of cells • Adds up all the numbers in a
returns one value if the test is within a range that meet the range that meet a specified
true and another if false. given condition. condition.
• It performs a logical test and returns one value if • The OR function returns TRUE if at least
the test is true and another value if the test is false. one of the conditions specified is true;
• Syntax: =IF(logical_test, value_if_true, value_if_false) otherwise, it returns FALSE.
• Syntax: =OR(condition1, condition2, ...)
Understanding
• Example: =IF(A1>10, "Yes", "No")
• Example: =OR(A1>10, B1<20)
NOT:
Classification - Internal
TEXT FUNCTIONS
CONCATENATE (or CONCAT): LOWER:
• Combines two or more text strings into one. • Converts all letters in a text string to lowercase.
• Syntax: =CONCATENATE(text1, [text2], ...) • Syntax: =LOWER(text)
• Example: =CONCATENATE(A1, " ", B1) • Example: =LOWER(A1)
Understanding
Excel and RIGHT: LEN:
LEFT: MID:
• Returns a specified number of characters from the • Returns a specific number of characters from a
beginning of a text string. text string, starting at a specified position – as per
• Syntax: =LEFT(text, num_chars) character decide.
• Example: =LEFT(A1, 5) • Syntax: =MID(text, start_num, num_chars)
• Example: =MID(A1, 3, 4)
SUBSTITUTE: TRIM:
• Replaces occurrences of a specified substring with • Removes leading and trailing spaces from a text
another substring in a text string. string and EXCEPT a single space between
• Syntax: =SUBSTITUTE(text, old_text, new_text, words.
[instance_num]) • Syntax: =TRIM(text)
• Example: =SUBSTITUTE(A1, "apple", "orange") • Example: =TRIM(A1)
Classification - Internal
LOGICAL FUNCTIONS
Sum SUMIF
• Adds up all the numbers in a range. • Adds up all the numbers in a range that meet a
Understanding • Example =SUM(A1:A10) specified condition.
• Example: =SUMIF(A1:A10, “>10
Excel and
Function Count COUNTIF
• Counts the number of cells that contain • Counts the number of cells within a range that
numbers in a range. meet the given condition.
• Example: =COUNT(C1:C8) • Example: =COUNTIF(C1:C10, “>50”)
IF Average
Classification - Internal
Analysis with Excel
LOOKUP FUNCTION
Classification - Internal
Analysis with VLOOKUP & HLOOKUP
excel
➢ VLOOKUP ➢ HLOOKUP
Classification - Internal
INDEX & MATCH
➢ INDEX ➢ MATCH
Analysis with INDEX function is used to MATCH function is used to
return a value or the reference locate the position of a lookup
excel to a value from within a table or value in a row, column, or
range. There are two ways to table. The function searches for
use the INDEX function: If you a specified item in a range of
want to return the value of a cells, and then returns the
specified cell or array of cells, relative position of that item in
see array form. If you want to the range.
return a reference to specified
cells, see reference form
Classification - Internal
A chart or graph is a visual representation of
Chart And data that helps to convey information in a
clear and concise manner. There are many
Visual different types of charts and graphs, each with
its own strengths and weaknesses.
Classification - Internal
VISUALIZATION TYPE 17
SCATTER PLOT
MAP CHART TREEMAP
Classification - Internal