Professional Documents
Culture Documents
Wrangling Reporting
Wrangling Reporting
Introduction
In this project, i have tried to use most of the techniques shown in the datawrangling chapter
i have organized the project into the main steps: wich are: gathering/assessing/cleaning/anlysing/reporting
For the first step: which is gathering and colelcting the data needed for the project. we have used fifferent
techniques for gathering different sources of data: such as JISON files, TSV, web scrabing, Tweeter Api,..etc
3 differents sources proposed in this projects, where inputs and needed variables are scatter in the 3 files.
the role of the student is to collect all.
some of the files was ready made for the projects, others were downloadable directly from the URLs
communicated
For the Second step: after gathering the data. now we have all th inputs to start the project; the role is to
assess teh data, get usefull information
techniques for visual & programatical data assesement being used. many Quality and tidiness issues been
found as per request in project.
In the last step: which is the cleaning phase, i have tired to clear all the issues raised during the assement
phase: where i have used, the drop/dtypes/..also the important feature of Melt/merg
conclusion
The 3 steps, are the foundation for any project; quality of handling the 3 steps will impact the overall project
output, the nalysis might be bais if any failures in the above 3 steps.
Iteration is very important, goign back from the begining, verifying the code, and reviewing the final data
before analysis step.
In [ ]: