You are on page 1of 26

Introduction to Data Mining

Lect 1
Introduction to Data Mining - Dr. Ismael A. Ali - UoZ - Fall 2019
Outline
● Introducing us. Welcome!
● Course Policy: How is the course? (Course Website)
● Course Introduction:
○ What is Data Mining?
○ Why doing Data Mining?
○ How to do Data Mining?
● Course Content Overview: What you will learn?
Course Content Overview:
What will you learn in this course?

Learning Material: http://moodle.uoz.edu.krd/


General Course Website: http://ismaelali.net/introduction-to-data-mining/
Course Assessment
Task(s) Marks

Presentation 0%

Reports 5 Reports (for 4 Projects) x 2 Points = 10 %

Assignments 3 Python Homework x 2 Points = 6 %

Quizzes 4 Quizzes Over the Course x 1 Point = 4 %

Projects 4 Projects x 5 Points = 20 %

Mid-term examination 10% for Term1 Exam + 10% for Term2 Exam = 20 %

Final examination 40 %

Others 0
Introduction to Data Mining
What is Data Mining?

Why doing Data Mining?

How to do Mine Data?


What is Data Mining?
Gold Mining is the Mining of Gold
Data Mining is Mining insights from Data
Small Dataset = Easy Data Analysis
Simple Example:
Students of which city are smarter than the other, city A or B?
Student # City Grade

1 A 90

2 B 80

3 A 70

4 A 80

5 B 100

6 B 95
Big Dataset, how? Using Data Mining
Some Related Concepts and to DM:
● Machine Learning
● Data Science
● Big Data
● Smart Data
● Data Analytics
● Data Analysis
● Data Warehousing
● Deep Learning
● Statistical Inference
Data is generated everywhere and for different purposes.
Thus, data should be interesting!
● Less limitations than before.
● Gathered data will have value
either for the purpose is has
been collected for or for a
purpose that is not envisioned,
bust still has potential interest.

https://upxacademy.com/beginners-guide-to-big-data/
What is DM, and what DM can do?

● Exploration & analysis, by automatic or semi-automatic means, of


large quantities of data in order to discover meaningful patterns
Why doing Data Mining?
DM Applications: Data Mining can help solving real-world problems.
How to do Data Mining?
Data Mining Steps:
Data Mining Origins and Main Tasks:
- Data Mining has ideas from machine learning/AI, pattern
recognition, statistics, and database systems… etc.
- Main Tasks:
- Prediction Methods: Use some variables to predict
unknown or future values of other variables.
- Description Methods: Find human-interpretable
patterns that describe the data.

From [Fayyad, et.al.] Advances in Knowledge Discovery and Data Mining, 1996
Data Mining Models
Course Content
Course Content Overview:
A Process and Project Based Course.
Course Outline
1. Course Introduction and Introduction to Data Mining
2. Statistics and Algebra: Review
3. Data
a. Find Your Input
b. Data Types
c. Data Gathering, Getting data from data sources
d. Assigning and Discussing Group-Based Projects
4. Data Pre-Processing: Generating input datasets for data mining
algorithms
a. a. Feature Extraction and Selection
b. b. Data Cleaning and Integration (1)
c. Report #1: Data Selection and Preparation
7. Mining Your Data:
a. Pattern Mining
b. Report #2: Pattern Mining Results
c. Classification (Supervised Machine Learning)
d. Report #3: Data Classification Mining Results
e. Cluster Analysis (Un-Supervised Machine Learning)
f. Report #4: Data Clustering Results
g. Graph Data Mining
8. Data Visualization
a. Basic Concepts
9. Report #5: Graph Data Analysis and Visualization Results
10. Real-World Applications: Concepts and Practice
a. Social Networks Mining, SNA: Facebook
b. Text Mining: Newsletter
c. Web-Data Mining: PageRank
d. Time-Series Data Mining: Twitter
Labs
● Weekly in-lab assignments
● You will practice what you will learn in lectures
● 2-in-1 course = Learning Data Mining + Python Programming Language

You might also like