Professional Documents
Culture Documents
Science Trajectories
Business Data
Understanding Understanding
Data
Preparation
Deployment
Data
Modelling
Evaluation
CRISP-DM evolution
Legend
KDD Related
6-sigma
Cios et al.
SEMMA (Harry & Schroeder, 1999;
Pyzdek, 2003) (Cios et al., 2000; Cios & CRISP-DM
(SAS Institute, 2015) Related
Kurgan, 2005)
RAMSYS
Human-Centered (Moyle & Jorge, 2001)
Other approach.
(Brachman and D^3M
Anand,1996, Gertosio and (Cao, 2010)
KDD Roadmap
Dussauchoy, 2014)
(Debuse et al., 2001)
HACE
(X. Wu, et al, 2014)
KDD
Cabena et al. CRISP-DM
(Piatetsky-Shapiro, 1991;
(Cabena et al., 1997) (Chapman et al., 2000)
Fayyad et al., 1996b)
ASUM-DM
(IBM, 2015)
CRISP-DM2.0 CASP-DM
Anand & Buchner (Martinez-Plumed et al.,
FMDS
5A's TDSP (IBM, 2015)
Two Crows (SPSS, 1999, de Pisón (Microsoft, 2016)
(Two Crows Corporation, 1999)
Ascacibar, 2003;
SPSS, 2007)
Adapted from G. Mariscal, O. Marban, and C. Fernandez: A survey of datamining and knowledge
discovery process models and methodologies, Knowledge Engineering Review 25(2):137-166, 2010.
Data takes centre stage
Contemporary Data Science starts from the data:
Business Data
Data Source Data Value
Understanding Understanding
Exploration Exploration
Data Acquisition
Data Release
Product Narrative
Evaluation Modelling
Exploration Exploration
Travelling through DST space
Business Data
Data Source Data Value
Understanding Understanding
Exploration Exploration
Data Acquisition
2
0 Goal 1 Data Simulation Data
Deployment Result
Exploration Preparation
Data Architecting Exploration
Data Release 3
5 Product
Evaluation Modelling Narrative
Exploration Exploration
4
A Data Science Trajectory
0 Business 1 Data
Undersanding Understanding
9 6
8 7
Data Source Data Value
Data Acquisition Exploration Exploration
Not all trajectories require Modelling
data ownership;
who profits from the additional value created.
training programmes
conferences, events & organisations
funding opportunities & job openings
technology infrastructure
legal & ethical frameworks
EuADS activities so far
Co-organised the European Conferences on Data
Analysis (ECDA)
Luxembourg 2013, Bremen 2014, Colchester 2015,
Wroclaw 2017, Paderborn 2018, Bayreuth 2019
Organised the European Data Science Conference
(Luxembourg, Nov. 2016).
Guest-edited a special issue on Data Science in Europe
of the Int. J. of Data Science and Analytics (2018).
Organised the first European Summer School on
Explainable Data Science (Luxembourg, Sept. 2019).
EDSC 2016
The EDSC programme consisted of invited plenary
talks, symposia, workshops and panel discussions on:
Navigate to EuADS.org.
Sabine Krolak-Schwerdt, 1958-2017
Recap (5/5)
Whether seen as the science of data, doing science
with data, or applying science to data: data science is
inherently multi-disciplinary and requires sustained
interaction between disciplines.
Tracing its evolution from data mining exposes the
essentially exploratory nature of data science.
The human factor raises important questions, e.g.
regarding ownership of data and results, and where
regulation/legislation is needed.
The EuADS mission is to help articulate, support and
advance these challenges and opportunities.
From Data Mining Processes to Data Science Trajectories | Peter Flach