Professional Documents
Culture Documents
IS328 Assignment2 Ver1.0 DrVaniV Online PDF
IS328 Assignment2 Ver1.0 DrVaniV Online PDF
Overview
The goals of this assignment II or project are
To apply the concepts learnt in the chosen scenario using appropriate data mining tools.
As a team of 2 members, propose a project and implement it
For project ideas and data sources, you may visit
www.kdnuggets,com
www.kaggle.com
Make a report in the given format
This assignment is an important part of the course and counts for 15% of your final grade. Grades will be based on the choice of topic,
completeness of your project, quality of the project report and its application.
Plagiarism
For all the Assignment/Project works it’s essential that you avoid plagiarism. Not only do you expose yourself to possibly serious disciplinary
consequences, but you’ll also cheat yourself of a proper understanding of the concepts emphasized in the assignment.
It’s not plagiarism to discuss the assignment with your friends and consider solutions to the problems together. However, it is plagiarism
for you to copy all or part of each other’s solutions.
Project Contents
Abstract
Table of Contents
List of Figures
List of Tables
IS328: Assignment 2 Page 2 of 8 Dr.Vani Vasudevan
Introduction
Motivation
Problem Domain
Aim and Objectives
Data Mining Techniques
Dataset Used
Data Mining Tasks
Data Preprocessing
Methods and Models
Assessment
Result Analysis and Interpretation
Conclusion
Lessons learned
Future Work
References
Important Guidelines:
Abstract
< The abstract conveys the most important messages regarding your project, such as: what you set out to do? How did you do it? What results
were obtained? Where it can be applied? However, for your project proposal, just specify “What you set out to do?” in one paragraph and the
remaining part you can complete in the project report.>
Table of Contents
<List the main topics in the report >
List of Figures
<Include all the figures included under each topic with proper numbering>
IS328: Assignment 2 Page 3 of 8 Dr.Vani Vasudevan
List of Tables
<Include all the tables included under each topic with proper numbering>
Introduction
<Provide a brief overview of data mining. Describe what your proposal is about and the organization of the rest of the proposal. Include whether
you will be performing data mining tasks, implementing a new algorithm in R or Weka or combination of both, or modifying some other system to
incorporate data mining features, etc. Basically, provide the nature of your project. This section should be a page or less in length.>
Motivation
<Write a paragraph describing what made you to zero down to this topic>
Problem Domain
<Describe the problem domain that you have chosen. You can refer some of the sample domains specified in this document to pursue your project.
This section should be about a page or less in length>
Aim and objective
<To apply data mining approach to a problem in a chosen domain such as health, education, science and so forth.>
Domains
Choose any of these but not limited to these
Health
Business
Education
Science
Security/Crimes
Entertainment/Sports
Real Estate
Weather
Data Mining Techniques
Classification
IS328: Assignment 2 Page 5 of 8 Dr.Vani Vasudevan
Prediction
Clustering
Association
Outlier Detection
Choose whichever is appropriate for the chosen problem domain and datasets. Ideally each group should choose different set of data
domain and data mining task. For Example
Classification – Education
Prediction – Health
Association – Supermarket
Outliers – Security
Clustering - Business
Mark
Unsatisfactory Satisfactory Good s % Marks
CBOK
(0%-49%) (50% - 75%) (76% - 100%) Alloc Attained
ated
Data and I. Do not identify I. Identified accurately I. Identified accurately
Information accurately any of the some of the data most of the data quality
Management data quality problems quality problems problems
II. Do not perform all II. Performed most of the II. Performed all the
required tasks correctly and required tasks correctly and required tasks correctly 13
consistently consistently and consistently
After having discussed as group, we recommend the following mark allocation to each group member based on contribution or lack of it
throughout the assignment.
Certification