Professional Documents
Culture Documents
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
Why do we need Data Science?
• In the past, we used to have data in a structured
format but now as the volume of the data is increasing,
so the number of structured data becomes very less,
How data science is
effecting our so to handle the massive amount of data we need data
everyday life?
science techniques
• Those data can be used to get the proper business
insights and the hidden trends from them.
• These insights helps the organization to predict the
Future
• Using data science decision making can be faster and
effective
• Helps to reduce the production cost
• Build model based on the data to give the ability to the
machine to predicts on its own
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
What is Data Science?
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
Life cycle of Data Science?
Deploy the model Predict your model accuracy Machine Learning Algorithm
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
Most Popular Programming Languages For Data Science?
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
Introduction to Python
Object oriented
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
History of Python
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
Why should you learn Python?
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
Installing Python
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
Popular IDE for Python: Pycharm
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
Popular IDE for Python: Anaconda
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
Popular IDE for Python: Google colab
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
Statistical measurement on Python user
In recent time it is prominent that Python is one of the most popular language because of it’s simplicity
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
Getting started with Python
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
How to Store Data?
How do I
store data?
“John”
123
TRUE
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
Need of Variables
Student
“John”
0x1098ab
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
Basic syntax for Python
Variable
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
DataTypes in Python
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
Basic syntax for Python
Datatypes
2
Data types represent value which determines what
operations can be performed on that data
3
In python data types is assigned when you assign the
value to the variable
4
We can check the data type of the variable by calling the
type() function
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
Basic syntax for Python Python string
2 Multiline string:
3 String as arrays:
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
Basic syntax for Python Python string
4 String slicing:
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
Python Libraries
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
Data manipulation using Python
Data manipulation is a technique which allows to transform, extract, and filter your data
efficiently with less time.
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
Data manipulation using Numpy
Numpy stands for Numerical Python and it is used to perform mathematical and
logical operations on arrays
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
Data manipulation using Pandas
Pandas is a popular data manipulation and analysis library in python which is based
on Numpy
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
Demo on Numpy and pandas
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
Creating a Dataframe
What is a
dataframe?
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
Dataframe In-Built Functions
head()
shape() describe()
tail()
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
Dropping Columns
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
Dropping Rows
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
Machine Learning to build the Model
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
What is Machine Learning?
Machine learning is a sub-set of artificial intelligence (AI) that allows the system to
automatically learn and improve from experience without being explicitly programmed
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
Traditional Vs Machine Learning
Data Data
Model
Output
Program Output
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
Types Of Machine Learning
Supervised Learning
Unsupervised
Learning
Reinforcement
Learning
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
What is Supervised Learning?
Supervised learning works as a supervisor or teacher. Basically, In supervised learning, we teach or train the
machine with labeled data (that means data is already tagged with some predefined class). Then we test our
model with some unknown new set of data and predict the level for them
▪ Learning from the labelled data and applying the knowledge to predict the
• Linear Regression
• Logistic regression
• Decision Tree
• Random Forest
• Naïve Bayes Classifier
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
What is Logistic Regression?
Logistic regression is also a part of supervised learning classification algorithm. It is used to predict the
probability of a target variable and the nature of target or dependent variable is discrete, so for the
output there will be only two class will be present
▪ The dependent variable is binary in nature so that can be either 1 (stands for
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
Credit Card Fraud Analysis Using Python
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited
Thank You
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited