
LEVEL: BACHELOR OF DATA SCIENCE YEAR 2

MODULE NAME: FUNDAMENTALS OF BIG DATA


LECTURER’S NAME: MR ALLY KHELEF
NATURE OF ASSIGNMENT: GROUP ASSIGNMENT

S/NO NAMES REGISTRATION NO


ASYA HAJI HAJI 1527010162199
HASAM TEMA BAKARI 1517010032199
YASSIR H MOLLEL 1517010272101
MERCY IJAN MVUNGI 1517010052198

Problem: Zanzibar Insurance Cooperation (ZIC) is seeking a consultant to analyze the payments
for each item of the services it offers, as listed in the Microsoft Excel file
Data Science Mini Project.xlsx
Task: Use the data science pipeline and any technical skills to solve this problem.
Data science pipeline: the process and tools used to gather raw data from multiple sources,
analyze it, and present the results in an understandable format.
The data science pipeline flow involves several stages:

• Data engineering (collection, cleansing and preparation, including ETL)
• Machine learning (model learning and model validation)
• Output (model deployment and data visualization)
The tools that we used in this project are:

• Excel
• Pivot table
• Jupyter notebook
• Matplotlib library
• Pandas
Data collection
The data was provided to us already clean, with no errors, miscalculations or duplicates.
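
The cleanliness claim can be checked with a few lines of Pandas. The sketch below is only an illustration: the rows are made-up placeholders (the real values live in the Excel file), and the helper name quality_report is our own assumption, not part of Pandas.

```python
import pandas as pd

# Hypothetical sample rows standing in for the real Excel sheet.
df = pd.DataFrame({
    "categories": [6, 6, 5, 4],
    "services_id": [101, 102, 201, 301],
    "services": ["Service A", "Service B", "Service C", "Service D"],
})


def quality_report(frame: pd.DataFrame) -> dict:
    """Count fully duplicated rows and missing cells in a DataFrame."""
    return {
        "duplicate_rows": int(frame.duplicated().sum()),
        "missing_cells": int(frame.isna().sum().sum()),
    }


report = quality_report(df)
print(report)
```

With the real data, the frame would be loaded with pd.read_excel instead of being typed in by hand; both counts should be zero if the data is clean.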
Data preparation

We prepared the data by splitting it into separate columns (categories, services_id and
services) to make it ready for analysis and visualization.
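
A minimal sketch of such a split in Pandas is shown below. The layout of the raw field is an assumption made for illustration (one text column named item, with the three parts separated by a hyphen), and the service names are placeholders; the actual sheet may be laid out differently.

```python
import pandas as pd

# Hypothetical raw layout: "category-service id-service name" in one column.
raw = pd.DataFrame({
    "item": [
        "6-101-Service A",
        "6-102-Service B",
        "5-201-Service C",
        "4-301-Service D",
    ]
})

# Split on the first two hyphens only, so a hyphen inside a service name survives.
parts = raw["item"].str.split("-", n=2, expand=True)
parts.columns = ["categories", "services_id", "services"]

# Store the two identifier columns as integers rather than text.
parts = parts.astype({"categories": int, "services_id": int})
print(parts)
```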
Data analysis and visualization

The bar graph and pivot table show that the services fall into three categories. Category 6
offers the most services, while category 4 offers the fewest (only one service).
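
The pivot table and bar graph can be reproduced roughly as follows. The counts used here (42, 3 and 1 services) are an assumption: they are inferred from the percentages reported in this document and from category 4 having a single service, not read from the Excel file. The services_id values are placeholders.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend; drop this line inside Jupyter
import pandas as pd

# Assumed number of services per category (inferred, not taken from the file).
counts_by_category = {6: 42, 5: 3, 4: 1}
rows = [
    {"categories": cat, "services_id": f"{cat}-{i}"}
    for cat, n in counts_by_category.items()
    for i in range(1, n + 1)
]
df = pd.DataFrame(rows)

# Pivot table: count the services in each category.
pivot = df.pivot_table(index="categories", values="services_id", aggfunc="count")
pivot = pivot.rename(columns={"services_id": "number_of_services"})
print(pivot)

# Bar graph of the same counts.
ax = pivot["number_of_services"].plot(kind="bar")
ax.set_xlabel("Category")
ax.set_ylabel("Number of services")
ax.set_title("Services per category")
```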
The pie chart shows that the largest share of the services offered by ZIC belongs to category 6
(91.3%), followed by category 5 (6.5%); the smallest share belongs to category 4 (2.2%).
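
The percentages are each category's share of the total number of services. The sketch below reproduces the arithmetic under the assumption that the categories hold 42, 3 and 1 services (46 in total); these counts are consistent with the reported figures but are inferred, not taken from the Excel file.

```python
import pandas as pd

# Assumed number of services per category (inferred from the reported shares).
services_per_category = pd.Series({6: 42, 5: 3, 4: 1}, name="number_of_services")

# Share of each category as a percentage of all services, to one decimal place.
share = (services_per_category / services_per_category.sum() * 100).round(1)
print(share)
```

The resulting values can be passed to Matplotlib's plt.pie, with the category numbers as labels, to draw the pie chart.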
Conclusion
The services offered by ZIC fall into three categories: 91.3% of the services belong to
category 6, 6.5% to category 5 and 2.2% to category 4. This means that category 6 needs 91.3%
of the payments for the services offered and category 5 needs 6.5%; since category 4 offers
only one service, it needs only a small share of the payments, 2.2%.
