
APPLIED DATA SCIENCE

PROJECT

Prepare a report on a Data Analytics project that you will undertake. Choose a problem or question that can be
solved or answered using Data Science. You may refer to Module 2 for an example (GINA Case Study). Prepare a
detailed report on each phase of the Data Analytics lifecycle:
• Discovery
  o Formulate hypotheses (general and specific). What do you propose to do?
  o Identify stakeholders. Who will benefit from the information that you will generate? Why is this
    information important?
  o Identify sources of data. Describe the data to be collected and the method of collecting them. Classify
    each source according to its structure.
• Data Preparation (include the Python code)
  o Raw Data
  o Importing Data Method/s
  o Cleaning Data Method/s: the raw data should be dirty to begin with
  o Exploratory Data Analysis
  o Visual Exploratory Data Analysis
• Model Building and Validation (include the Python code)
• Results and Key Findings
• Summary (similar to slide 34 of Module 2)
• Presentation Materials
  o Prepare materials as if the results of this study will be presented to: (a) stakeholders; (b) analysts
    and Python programmers. You will have to prepare different materials for these two groups. Be
    sure to identify which sections must be presented to each group. Attach copies of these
    presentation materials to your written report.
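
As a rough illustration of what the Data Preparation and Model Building and Validation phases listed above might look like in code, here is a minimal sketch. Everything in it is an assumption made for illustration: the tiny dataset, the column names (`student`, `hours_studied`, `score`), and the helper functions `import_data`, `clean_data`, and `fit_line` are hypothetical and are not part of the assignment. It uses only the Python standard library so that it runs anywhere; an actual project would likely use dedicated data analysis libraries and a real dataset, and would add the visual exploratory analysis that is not shown here.

```python
import csv
import io
import statistics

# Hypothetical "dirty" raw data: stray whitespace, a duplicate row,
# a missing value, and inconsistent capitalization.
RAW_CSV = """student,hours_studied,score
 Ana ,2,65
Ben,4,70
Ben,4,70
cara,6,
Dan,8,88
Eve,10,95
"""


def import_data(text):
    """Import: parse CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text)))


def clean_data(rows):
    """Clean: trim whitespace, normalize names, drop incomplete and duplicate rows."""
    cleaned, seen = [], set()
    for row in rows:
        name = row["student"].strip().title()
        hours = row["hours_studied"].strip()
        score = row["score"].strip()
        if not hours or not score:
            continue  # drop rows with missing values
        record = (name, float(hours), float(score))
        if record in seen:
            continue  # drop exact duplicates
        seen.add(record)
        cleaned.append(
            {"student": name, "hours_studied": record[1], "score": record[2]}
        )
    return cleaned


def fit_line(xs, ys):
    """Model: ordinary least squares for y = slope * x + intercept."""
    mean_x, mean_y = statistics.mean(xs), statistics.mean(ys)
    numerator = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    denominator = sum((x - mean_x) ** 2 for x in xs)
    slope = numerator / denominator
    return slope, mean_y - slope * mean_x


rows = clean_data(import_data(RAW_CSV))
hours = [r["hours_studied"] for r in rows]
scores = [r["score"] for r in rows]

# Exploratory data analysis: simple summary statistics.
print("rows after cleaning:", len(rows))
print("mean score:", statistics.mean(scores))

# Validation: fit on all but the last row, then check the held-out row.
slope, intercept = fit_line(hours[:-1], scores[:-1])
predicted = slope * hours[-1] + intercept
print("held-out absolute error:", abs(predicted - scores[-1]))
```

Holding out a single row is only meant to show the idea of validating on data the model has not seen; a real project would need far more data and a proper train/test split or cross-validation.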

Please use letter-size bond paper for the written report. Make sure it is bound and well presented. Prepare one
Jupyter notebook containing all the code used for the project. Submit the notebook through Cardinal Edge. All datasets
used should likewise be uploaded.
