This document provides an overview of the data mining and machine learning techniques discussed in the first three chapters of a book. Chapter 1 defines the data mining process and contrasts supervised and unsupervised learning. Chapter 2 covers data types and data preparation. Chapter 3 introduces classification and supervised learning algorithms such as Naive Bayes and k-Nearest Neighbours. It explains how Naive Bayes uses probabilities to determine the most likely classification, and how k-Nearest Neighbours uses the classifications of the k closest training instances.
A. CHAPTER 1: This chapter defines data mining, starting with the KDD (Knowledge Discovery in Databases) process: data from various data sources is integrated into a data store, prepared, and then mined for patterns, which are finally interpreted to arrive at the knowledge sought. -Datasets consist of examples (instances), which may be labelled (with a target attribute) or unlabelled (without one). -Within data mining there are two main types of machine learning: supervised learning, which includes classification, regression, and association rules; and unsupervised learning, which includes clustering (e.g. k-means).
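The labelled/unlabelled distinction above can be illustrated with a minimal sketch (the attribute names and values here are invented for illustration, not taken from the book):

```python
# A labelled dataset: each instance pairs attribute values with a target class.
labelled = [
    ({"outlook": "sunny", "temp": 30}, "play"),
    ({"outlook": "rain",  "temp": 18}, "no play"),
]

# An unlabelled dataset: the same instances with no target attribute,
# as used by unsupervised methods such as clustering.
unlabelled = [attrs for attrs, _ in labelled]
```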
B. CHAPTER 2: This chapter shows how the data itself can affect data mining: the types of variable (nominal, binary, ordinal, ...) -It also distinguishes categorical from continuous attributes, which prepares for a very important part, data preparation and data cleaning, including the removal or replacement of missing values. -It introduces the standard formulation of the data input to data mining algorithms that will be assumed throughout the book, goes on to distinguish between different types of variable, and considers issues relating to the preparation of data prior to use, particularly the presence of missing data values and noise. The UCI Repository of datasets is introduced.
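One common way to handle missing values mentioned above is to replace them with the mean of the values that are present. A minimal sketch (the function name `impute_missing` is an invented illustration, not the book's code):

```python
from statistics import mean

def impute_missing(values, missing=None):
    """Replace missing entries with the mean of the present values."""
    present = [v for v in values if v is not missing]
    fill = mean(present)  # mean of the non-missing values
    return [fill if v is missing else v for v in values]
```

For categorical attributes, a common alternative is to substitute the most frequent value instead of the mean.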
C. CHAPTER 3: In this part we discover supervised learning techniques, starting with the meaning of classification: many practical decision-making tasks can be formulated as classification problems, i.e. assigning people or objects to one of a number of categories, for example customers who are likely to buy or not buy a particular product in a supermarket. -The Naïve Bayes algorithm uses probability theory to find the most likely of the possible classifications. Probability appears in everyday life: the probability of an event, e.g. that the 6.30 p.m. train from London to your local station arrives on time, is a number from 0 to 1 inclusive, with 0 indicating 'impossible' and 1 indicating 'certain'. The Naïve Bayes algorithm gives us a way of combining the prior probability and the conditional probabilities in a single formula, which we can use to calculate the probability of each of the possible classifications in turn; having done this, we choose the classification with the largest value. -We also discussed the k-Nearest Neighbours algorithm, which is mainly used when all attribute values are continuous, although it can be modified to deal with categorical attributes. The idea is to estimate the classification of an unseen instance using the classifications of the instance or instances closest to it, in some sense that we need to define: first find the k training instances that are closest to the unseen instance, then take the most commonly occurring classification among these k instances.
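The Naïve Bayes procedure described above (prior probability multiplied by the conditional probability of each attribute value, then pick the largest product) can be sketched as follows. This is an illustrative implementation with an invented toy dataset, not the book's own code:

```python
from collections import Counter

def naive_bayes_classify(train, unseen):
    """train: list of (attribute_dict, label) pairs; unseen: attribute_dict.
    Returns the label maximising prior * product of conditional probabilities."""
    label_counts = Counter(label for _, label in train)
    best_label, best_score = None, -1.0
    for label, count in label_counts.items():
        score = count / len(train)  # prior probability of this classification
        rows = [attrs for attrs, l in train if l == label]
        for attr, value in unseen.items():
            matches = sum(1 for r in rows if r.get(attr) == value)
            score *= matches / count  # conditional probability P(value | label)
        if score > best_score:
            best_label, best_score = label, score
    return best_label

# Toy training data (invented for illustration).
train = [
    ({"outlook": "sunny"}, "no"),
    ({"outlook": "sunny"}, "no"),
    ({"outlook": "rain"}, "yes"),
    ({"outlook": "overcast"}, "yes"),
]
```

In practice a smoothing term is usually added so that a single unseen attribute value does not force a probability of zero; this sketch omits it for clarity.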
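The k-Nearest Neighbours steps above (find the k closest training instances, then take the majority classification) can be sketched as follows, assuming Euclidean distance as the measure of closeness; the dataset is an invented toy example:

```python
import math
from collections import Counter

def knn_classify(train, unseen, k=3):
    """train: list of (feature_tuple, label) pairs; unseen: feature_tuple.
    Finds the k nearest training instances by Euclidean distance and
    returns their most common classification."""
    nearest = sorted(train, key=lambda item: math.dist(item[0], unseen))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]

# Toy training data: two clusters of continuous-valued instances.
points = [
    ((0, 0), "a"), ((0, 1), "a"),
    ((5, 5), "b"), ((6, 5), "b"), ((5, 6), "b"),
]
```

Choosing an odd k helps avoid ties in the majority vote for two-class problems.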