You are on page 1of 11

DEPARTMENT OF ARTIFICIAL INTELLIGENCE AND DATA SCIENCE

Main Presentation

Under

Industrial Training Presentation(5CA7-30)

Session-2022-23(Odd)

Presented by Presented to
Vikash Biswas Pankaj Rahi
PGI20CA042 Punit Kumawat
G2
What is Data Science

Data science is the field of study that combines domain expertise, programming
skills, and knowledge of mathematics and statistics to extract meaningful
insights from data.
Here is how you can solve a problem using data science:

Ask the right questions - To understand the business problem.


Explore and collect data - From database, web logs, customer feedback, etc.
Extract the data - Transform the data to a standardized format.
Clean the data - Remove erroneous values from the data.
Find and replace missing values - Check for missing values and replace them with a
suitable value (e.g. an average value).
Normalize data - Scale the values in a practical range (e.g. 140 cm is smaller than 1,8 m.
However, the number 140 is larger than 1,8. - so scaling is important).
Analyze data, find patterns and make future predictions.
Represent the result - Present the result with useful insights in a way the "company" can
understand.
Applications of Data Science

1. Healthcare
2. Gaming
3. Image Recognition
4. Recommendation Systems
5. Logistics
6. Fraud Detection
7. Internet Search
8. Speech recognition
9. Targeted Advertising
10. Airline Route Planning
11. Augmented Reality
Python Libraries for data science

1. Scientific Computing Libraries


Pandas
Numpy

2. Visualization Libraries
Matplotlib

3. Algorithmic Libraries
scikit learn
Diabetes Prediction using -Python

Diabetes is a health condition that affects how your body turns food into
energy. Most of the food you eat is broken down into sugar (also called
glucose) and released into your bloodstream. When your blood sugar goes
up, it signals your pancreas to release insulin.
So that i decide to predict using Machine Learning in Python.

Objectives

1. Predict if person is diabetes patient or not


2. Find most indicative features of diabetes
3. Try different classification methods to find highest accuracy
v
i
Conclusion and Future Plans

Conclusion
Diabetes is one of the ricks during Pregnancy. It has to be treat to avoid
complications.
BMI index can help to avoid complications of diabetes a way before
Diabetes start showing in age of 35 – 40 and increase with person age.

Next steps
There are lots of ways to improve the above model. Here are some
ideas.
•Input feature bucketing should help, i.e. create new variables for blood
pressure in a particular range, glucose levels in a particular range, and so
on.
•We can improve the data cleaning, by replacing 0 values with the mean
value.
•Read a bit about what metrics do doctors rely on the most to diagnose
a diabetic patient, and create new features accordingly.
Digital Certificate
Thanks
&
Query

You might also like