Professional Documents
Culture Documents
Text Books: 1. J. Han, M. Kamber, Data Mining: Concepts and Techniques, Second Edition, Morgan
Reference Books: Kaufmann Publishers, 2006.
2. Ian H. Witten, Eibe Frank, Data Mining: Practical Machine Learning Tools and
Techniques with Java Implementations. Morgan Kaufmann Publishers, 2005.
3. M. H. Dunham, S. Sridhar, Data Mining Introductory and Advanced Topics, Pearson
Education, 2006.
4. Hastie, Tibshirani, Friedman, The Elements of Statistical Learning. Springer-Verlag,
2001.
Course Data Mining is one of the hottest fields in Information Technology. Data has been
Introduction & accumulating throughout the computer age in many forms, including database systems,
Description: spreadsheets, text files, and recently web pages. Data mining aims to search through
data for hidden relationships and patterns in your data. We will cover advanced topics
such as large-scale data mining, similarity search, mining data streams, mining social
networks, relational data mining, and matrix factorization methods for data mining. This
course will be highly beneficial to students whose research interests are in database, data
mining, bioinformatics, information retrieval, decision science and artificial intelligence,
and also to those who may need to apply data mining to any application.
Course Outcomes: After completing this course, the student should demonstrate the knowledge and ability
to:
Show and understand the various kinds of Data Mining Tasks.
Use data mining to solve real life problems.
Find a research problem in the data mining for further research.
Class Policies: Attendance for lectures is compulsory. Attendance for less than 75% of the lectures
will result in students being barred from taking the Final Exam.
If you are absent from the lecture due to: Sickness – Medical Certificate is required,
in case of emergency – letter of guardian is required.
There will be no makeup quiz.
Make-up for Mid Term will only be given to those with STRONG VALID reason by the
prior approval of the Head of department.
Cheating and Plagiarism will not be tolerated and will be penalized accordingly.
There will be 5-7 assignments besides on class exercises. Assignments need to be
submitted before the deadline. If you have questions or doubts contact us in our
offices during visiting hours or use our email address.
pg. 1
Course Outline:
03 Classification
04 Clustering
07 Similarity Search
12 Tree/Graph Mining
15 Recommender Systems
16 Wrap-up course
Viva of Assignments
Final Examination
pg. 2
Grading Policy:
1 Assignments 10%
2 Quizzes 5%
3 Presentations 10%
3 Mid term 25%
Important notes:
4-5 numbers of quizzes will take place in the class to measure the learning progress of the students. These
quizzes will be announced or unannounced.
Plagiarism Policy:
During this course a strict no tolerance plagiarism policy will be adopted. While collaboration in this course is
highly encouraged, you must ensure that you do claim other people’s work/idea as your own. Plagiarism
occurs when the words, ideas, assertion, theories, figures, images, programming code of others is
presented as your own work. Failing to comply with plagiarism policy will lead to strict penalties including
zero marks in assignments.
_______________________________________________________________________________________
pg. 3