Professional Documents
Culture Documents
Problem: Zanzibar Insurance Cooperation (ZIC) seek for a consultant to analyze the payments
of each item of the service they offered as shown in the Microsoft Excel File called Data
Science Mini Project.xlsx
Task: Use the data science pipe line and any technical skills to solve this problem.
Data science pipeline – refers to the process and tools used to gather raw data from multiple sources,
analyze it, and present the results in an understandable format
Data science pipeline flow involves different stages;
Excel
Pivot table
Jupyter notebook
Matplotlib library
Pandas
Data collection
We already have our data is which is clean with no error miscalculation or duplication
Data preparation
We prepared the data by splitting them into different columns which are categories, services_id and
services to make data ready for analysis and visualization
Data analyzation and visualization
From the bar graph and pivot table above, it shows that the services belong to the three categories which
is category 6 offers more services compared to other and category 4 offers less services (only one
services).
From the pie chart above it show that large percent of the services offered in ZIC belong to category 6
which is 91.3% followed by category 5 which is 6.5% and small percent of services belong to category 4
which is 2.2%
Conclusion
The services offered in ZIC belong to three categories in which 91.3% of service offered belong to
category 6, 6.5% of services offered belong to category 5 and 2.2% of services offered by category 4,
which mean category 6 need 91.3% of payment for the services they offered 6.5 for category 5 and since
category 4 offer only one services small percent of payment is needed which is 2.2%
References
https:// www.tutorialpoint.com
https:// www.snowflake.com
https:// www.domo.com
https:// www.pythonspot.com
https:// www.w3schools.com