Final Project Grading Criteria
Compliance with specs (font size, length, required sections, number of references in the literature
survey, etc.) 30 pts
Good discussion of the purpose: why somebody would use the analysis, the expected benefits, and any
caveats or privacy issues. 5 pts
Good discussion of the data and the problem: clearly state the data points, features, label, data source,
and the kind of ML problem you have (regression, classification, or clustering). 10 pts
• If your dataset is unbalanced, you need to add a discussion of how you treat the imbalance to
your analysis.
• If you treat the outliers in a certain way (everyone has to do outlier analysis), you need to
properly explain WHY you do that.
• If you do normalization, you need to discuss the reasoning behind it and show whether it
improves the results or not. How do you prove that using normalization was a good idea?
• If you have categorical data, how do you handle it? Why did you choose that method?
• If you use visualizations (everyone has to check the data visually), you need to explain them properly.
You have to mention what the chart tells you: is it sound, why does it look like that, is it in line with
your expectations, is there anything weird, etc. Do not leave charts without a proper reference in the
report.
• You need to be careful about scales of measurement. Ex: I do not want to see ratio analysis on
your nominal/categorical variables. You cannot use “describe” for categorical variables even if
you convert them into numbers by one-hot encoding etc.
30 pts
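To illustrate some of the checks listed above, here is a minimal sketch in plain Python. It is only one possible way to do these checks, not a required method. The function names (`class_balance`, `iqr_outliers`, `summarize_categorical`) and the toy data are hypothetical, and the 1.5 × IQR rule is just one common outlier convention. Whatever rule you pick, you still have to justify it in the report.

```python
from collections import Counter
from statistics import quantiles


def class_balance(labels):
    """Return each class's share of the labels, to spot an unbalanced dataset."""
    counts = Counter(labels)
    total = len(labels)
    return {cls: n / total for cls, n in counts.items()}


def iqr_outliers(values, k=1.5):
    """Flag values outside [Q1 - k*IQR, Q3 + k*IQR] (assumed rule, not mandated)."""
    q1, _, q3 = quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]


def summarize_categorical(values):
    """Summarize a nominal variable with counts and the mode, never mean or std."""
    counts = Counter(values)
    mode, mode_count = counts.most_common(1)[0]
    return {
        "n_levels": len(counts),
        "mode": mode,
        "mode_share": mode_count / len(values),
    }


# Hypothetical toy data, for illustration only.
labels = ["no"] * 9 + ["yes"]                   # label column
ages = [21, 22, 23, 24, 25, 26, 27, 28, 95]     # ratio-scale feature
colors = ["red", "blue", "red", "green", "red"]  # nominal feature

balance = class_balance(labels)
outliers = iqr_outliers(ages)
color_summary = summarize_categorical(colors)

print(balance)
print(outliers)
print(color_summary)
```

Note that the nominal feature is summarized with frequencies and the mode only. Means, standard deviations, and ratios are reserved for variables measured on an interval or ratio scale, which is the point of the last bullet.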