Lect 1
Introduction to Data Mining - Dr. Ismael A. Ali - UoZ - Fall 2019
Outline
● Introducing us. Welcome!
● Course Policy: How does the course work? (Course Website)
● Course Introduction:
○ What is Data Mining?
○ Why do Data Mining?
○ How to do Data Mining?
● Course Content Overview: What will you learn?
Course Content Overview:
What will you learn in this course?
Presentation: 0%
Mid-term examination: 20% (10% for Term 1 exam + 10% for Term 2 exam)
Final examination: 40%
Others: 0%
Introduction to Data Mining
What is Data Mining?
1 A 90
2 B 80
3 A 70
4 A 80
5 B 100
6 B 95
Big Dataset, how? Using Data Mining
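A table of six rows like the one above can still be summarized by hand or with a few lines of code; the question of scale is what motivates data mining. The sketch below is only an illustration of that small-scale case. The slide does not name the columns, so the names used here (record id, group label, score) are assumptions, as is the helper `mean_score_by_group`.

```python
from collections import defaultdict

# The six rows from the slide. The column meanings (record id, group label,
# score) are assumptions for illustration; the slide does not name them.
rows = [
    (1, "A", 90),
    (2, "B", 80),
    (3, "A", 70),
    (4, "A", 80),
    (5, "B", 100),
    (6, "B", 95),
]


def mean_score_by_group(records):
    """Return {group label: mean score} for (id, group, score) records."""
    totals = defaultdict(lambda: [0, 0])  # group -> [sum of scores, count]
    for _record_id, group, score in records:
        totals[group][0] += score
        totals[group][1] += 1
    return {group: total / count for group, (total, count) in totals.items()}


summary = mean_score_by_group(rows)
for group in sorted(summary):
    print(f"{group}: {summary[group]:.2f}")
# prints:
# A: 80.00
# B: 91.67
```

With millions of rows and many columns, this kind of manual summary stops being practical, which is where the mining techniques covered later in the course come in.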
Some Concepts Related to DM:
● Machine Learning
● Data Science
● Big Data
● Smart Data
● Data Analytics
● Data Analysis
● Data Warehousing
● Deep Learning
● Statistical Inference
Data is generated everywhere and for different purposes.
Thus, data should be interesting!
● Fewer limitations than before.
● Gathered data will have value, either for the purpose it was collected for or for a purpose that was not envisioned but still has potential interest.
https://upxacademy.com/beginners-guide-to-big-data/
What is DM, and what can DM do?
From [Fayyad et al.], Advances in Knowledge Discovery and Data Mining, 1996
Data Mining Models
Course Content
Course Content Overview:
A Process- and Project-Based Course.
Course Outline
1. Course Introduction and Introduction to Data Mining
2. Statistics and Algebra: Review
3. Data
a. Find Your Input
b. Data Types
c. Data Gathering: Getting data from data sources
d. Assigning and Discussing Group-Based Projects
4. Data Pre-Processing: Generating input datasets for data mining algorithms
a. Feature Extraction and Selection
b. Data Cleaning and Integration (1)
c. Report #1: Data Selection and Preparation
7. Mining Your Data:
a. Pattern Mining
b. Report #2: Pattern Mining Results
c. Classification (Supervised Machine Learning)
d. Report #3: Data Classification Mining Results
e. Cluster Analysis (Unsupervised Machine Learning)
f. Report #4: Data Clustering Results
g. Graph Data Mining
8. Data Visualization
a. Basic Concepts
9. Report #5: Graph Data Analysis and Visualization Results
10. Real-World Applications: Concepts and Practice
a. Social Networks Mining, SNA: Facebook
b. Text Mining: Newsletter
c. Web-Data Mining: PageRank
d. Time-Series Data Mining: Twitter
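As a preview of the Web-Data Mining topic listed in the outline, here is a minimal sketch of PageRank computed by power iteration. It is not taken from the course material: the function name `pagerank`, the four-page graph `toy_web`, the damping factor of 0.85 (a commonly used value), and the fixed iteration count are all assumptions chosen for illustration.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Power-iteration PageRank over a dict {page: [pages it links to]}.

    Every link target must also appear as a key of `links`.
    """
    pages = list(links)
    n = len(pages)
    rank = {page: 1.0 / n for page in pages}  # start from a uniform distribution
    for _ in range(iterations):
        # Every page receives the same small "teleport" share.
        new_rank = {page: (1.0 - damping) / n for page in pages}
        for page, outgoing in links.items():
            if outgoing:
                # Split this page's damped rank equally among its links.
                share = damping * rank[page] / len(outgoing)
                for target in outgoing:
                    new_rank[target] += share
            else:
                # A page with no outgoing links spreads its rank over all pages.
                share = damping * rank[page] / n
                for target in pages:
                    new_rank[target] += share
        rank = new_rank
    return rank


# Hypothetical toy web: A links to B and C, B and D link to C, C links to A.
toy_web = {"A": ["B", "C"], "B": ["C"], "C": ["A"], "D": ["C"]}
ranks = pagerank(toy_web)
for page in sorted(ranks, key=ranks.get, reverse=True):
    print(page, round(ranks[page], 3))
```

In this toy graph, page C ends up with the highest score because three of the four pages link to it, while D, which no page links to, ends up with the lowest.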
Labs
● Weekly in-lab assignments
● You will practice what you learn in lectures
● 2-in-1 course = Learning Data Mining + Python Programming Language