Week 1 Unit 2: Introduction to Project Methodologies
Introduction to Project Methodologies
Why should there be a project methodology?

• The data science process must be reliable and repeatable by people with little data science background.

• A project methodology:
‒ Provides a framework for recording experience
‒ Allows projects to be replicated
‒ Provides an aid to project planning and management
‒ Is a “comfort factor” for new adopters
‒ Reduces dependency on “stars”

[Figure: Task 1 to Task 4 laid out along a TIME axis]

© 2016 SAP SE or an SAP affiliate company. All rights reserved. Public 2


Cross-industry standard process for data mining (CRISP-DM)

[Figure: the CRISP-DM cycle. Six phases arranged around the data: Business Understanding, Data Understanding, Data Preparation, Modeling, Evaluation, Deployment]

CRISP-DM – Phase 1: Business Understanding

Tasks (with outputs):
‒ Determine Business Objectives: Background, Business Objectives, Business Success Criteria
‒ Assess Situation: Inventory of Resources, Requirements, Assumptions & Constraints, Risks & Contingencies, Terminology, Costs & Benefits
‒ Determine Data Science Goals: Data Science Goals, Data Science Success Criteria
‒ Produce Project Plan: Project Plan, Initial Assessment of Tools & Techniques

CRISP-DM – Phase 2: Data Understanding

Tasks (with outputs):
‒ Collect Initial Data: Initial Data Collection Report
‒ Describe Data: Data Description Report
‒ Explore Data: Data Exploration Report
‒ Verify Data Quality: Data Quality Report
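
As an illustration of the describe, explore, and verify tasks, the following minimal Python sketch profiles a small in-memory table. The sample records, the column names, and the helpers `describe_column` and `count_duplicates` are hypothetical and are not part of CRISP-DM or of this course; they only show the kind of facts a data description or data quality report might record.

```python
from statistics import mean

# Hypothetical sample records; None marks a missing value.
records = [
    {"customer_id": 1, "age": 34, "churned": "no"},
    {"customer_id": 2, "age": None, "churned": "yes"},
    {"customer_id": 3, "age": 51, "churned": "no"},
    {"customer_id": 3, "age": 51, "churned": "no"},  # exact duplicate row
]


def describe_column(rows, column):
    """Summarise one column: row count, missing values, distinct values."""
    values = [row[column] for row in rows]
    present = [v for v in values if v is not None]
    summary = {
        "rows": len(values),
        "missing": len(values) - len(present),
        "distinct": len(set(present)),
    }
    # Only numeric columns get a mean.
    if present and all(isinstance(v, (int, float)) for v in present):
        summary["mean"] = mean(present)
    return summary


def count_duplicates(rows):
    """Count rows that repeat an earlier row exactly."""
    seen = set()
    duplicates = 0
    for row in rows:
        key = tuple(sorted(row.items()))
        if key in seen:
            duplicates += 1
        seen.add(key)
    return duplicates


# One summary per column, plus a table-level duplicate count.
report = {col: describe_column(records, col) for col in records[0]}
report["duplicate_rows"] = count_duplicates(records)
```

Missing values and duplicate rows found this way would feed the data quality report; how to treat them is decided in the data preparation phase.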

CRISP-DM – Phase 3: Data Preparation

Outputs of the phase as a whole: Dataset, Dataset Description

Tasks (with outputs):
‒ Select Data: Rationale for Inclusion/Exclusion
‒ Clean Data: Data Cleaning Report
‒ Construct Data: Derived Attributes, Generated Records
‒ Integrate Data: Merged Data
‒ Format Data: Reformatted Data
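
The five preparation tasks can be sketched on toy data as follows. This is a minimal illustration under stated assumptions: the `customers` and `orders` tables, the column names, the rule of dropping rows with a missing age, and the derived `is_senior` attribute are all hypothetical choices, not prescriptions from CRISP-DM.

```python
# Hypothetical raw inputs; column names are illustrative only.
customers = [
    {"customer_id": 1, "age": 34, "region": "north"},
    {"customer_id": 2, "age": None, "region": "south"},
    {"customer_id": 3, "age": 51, "region": "north"},
]
orders = [
    {"customer_id": 1, "amount": 120.0},
    {"customer_id": 1, "amount": 80.0},
    {"customer_id": 3, "amount": 45.5},
]

# Select data: keep only the attributes judged relevant (region is excluded here).
selected = [{"customer_id": c["customer_id"], "age": c["age"]} for c in customers]

# Clean data: drop rows with a missing age (one possible rule among many).
cleaned = [row for row in selected if row["age"] is not None]

# Construct data: derive a new attribute from an existing one.
constructed = [{**row, "is_senior": row["age"] >= 50} for row in cleaned]

# Integrate data: total order amount per customer, merged onto the customer rows.
totals = {}
for order in orders:
    cid = order["customer_id"]
    totals[cid] = totals.get(cid, 0.0) + order["amount"]
merged = [{**row, "total_spend": totals.get(row["customer_id"], 0.0)} for row in constructed]

# Format data: emit rows with a fixed column order.
columns = ["customer_id", "age", "is_senior", "total_spend"]
dataset = [[row[col] for col in columns] for row in merged]
```

Each step leaves its input untouched and produces a new list, which makes it easier to record what was included, excluded, or changed at every stage.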

CRISP-DM – Phase 4: Modeling

Tasks (with outputs):
‒ Select Modeling Technique: Modeling Technique, Modeling Assumptions
‒ Generate Test Design: Test Design
‒ Build Model: Parameter Settings, Models, Model Description
‒ Assess Model: Model Assessment, Revised Parameter Settings
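
To make the modeling tasks concrete, here is a deliberately small sketch: a holdout test design, a one-parameter threshold model, and an assessment on the held-out records. The data, the threshold rule, and the candidate parameter values are hypothetical; a real project would choose its technique and test design to suit the data science goals.

```python
# Hypothetical labelled data: (monthly_usage_hours, churned) pairs.
data = [(2, 1), (3, 1), (4, 1), (5, 1), (6, 0),
        (7, 1), (8, 0), (9, 0), (11, 0), (12, 0)]

# Generate test design: hold out every third record for testing.
test = [row for i, row in enumerate(data) if i % 3 == 0]
train = [row for i, row in enumerate(data) if i % 3 != 0]


def predict(threshold, usage):
    """Predict churn (1) when usage is below the threshold, else 0."""
    return 1 if usage < threshold else 0


def accuracy(threshold, rows):
    """Share of rows whose predicted label matches the actual label."""
    hits = sum(predict(threshold, x) == y for x, y in rows)
    return hits / len(rows)


# Build model: the only parameter setting is the threshold.
# Candidate settings are compared on the training records only.
candidates = [4, 6, 8, 10]
best_threshold = max(candidates, key=lambda t: accuracy(t, train))

# Assess model: measure the chosen setting on the held-out records.
test_accuracy = accuracy(best_threshold, test)
```

Trying several candidate values and keeping the best one is a simple stand-in for the loop between model assessment and revised parameter settings.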

CRISP-DM – Phase 5: Evaluation

Tasks (with outputs):
‒ Evaluate Results: Assessment of Data Mining Results, Approved Model
‒ Review Process: Review of Process
‒ Determine Next Steps: List of Possible Actions, Decision
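
One way to picture the evaluation phase is as a check of the modeling results against the business success criteria from Phase 1, followed by a decision. The sketch below assumes two hypothetical criteria and hypothetical result figures; the names `evaluate` and `next_step` are illustrative only.

```python
# Hypothetical business success criteria agreed in Phase 1.
success_criteria = {"min_accuracy": 0.80, "max_cost_per_contact": 2.50}

# Hypothetical results carried over from the modeling phase.
results = {"accuracy": 0.86, "cost_per_contact": 3.10}


def evaluate(results, criteria):
    """Return the list of criteria that the results fail to meet."""
    failures = []
    if results["accuracy"] < criteria["min_accuracy"]:
        failures.append("accuracy below target")
    if results["cost_per_contact"] > criteria["max_cost_per_contact"]:
        failures.append("cost per contact above target")
    return failures


def next_step(failures):
    """Map the evaluation outcome to one of the possible actions."""
    if not failures:
        return "proceed to deployment"
    return "iterate: revisit earlier phases"


failures = evaluate(results, success_criteria)
decision = next_step(failures)
```

In this example the model is accurate enough but too costly to act on, so the decision is to iterate rather than deploy, even though the technical assessment alone looked acceptable.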

CRISP-DM – Phase 6: Deployment

Tasks (with outputs):
‒ Plan Deployment: Deployment Plan
‒ Plan Monitoring & Maintenance: Monitoring & Maintenance Plan
‒ Produce Final Report: Final Report, Final Presentation
‒ Review Project: Experience Documentation

CRISP-DM – Update

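[Figure: the updated CRISP-DM cycle, with a Monitoring phase added. Phases arranged around the data: Business Understanding, Data Understanding, Data Preparation, Modeling, Evaluation, Deployment, Monitoring]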
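
A monitoring phase implies some rule for deciding when a deployed model needs attention. The sketch below shows one simple possibility, assuming the model's accuracy at deployment was recorded as a baseline and that actual outcomes become available later. The function `needs_review`, the 0.05 tolerance, and the sample outcomes are hypothetical.

```python
def needs_review(baseline_accuracy, recent_outcomes, tolerance=0.05):
    """Flag the model when live accuracy falls more than `tolerance` below baseline.

    recent_outcomes is a list of (predicted, actual) pairs.
    """
    if not recent_outcomes:
        return False  # nothing observed yet, so nothing to judge
    correct = sum(predicted == actual for predicted, actual in recent_outcomes)
    live_accuracy = correct / len(recent_outcomes)
    return live_accuracy < baseline_accuracy - tolerance


# Hypothetical recent predictions paired with what actually happened.
recent = [(1, 1), (0, 0), (1, 0), (0, 0), (1, 0)]
flag = needs_review(baseline_accuracy=0.86, recent_outcomes=recent)
```

A raised flag would send the project back around the cycle, for example to check whether the data or the business situation has changed since the model was built.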



Thank you

Contact information:

open@sap.com