You are on page 1of 52

Welcome to Python!

PYTHON FOR SPREADSHEET USERS

Chris Cardillo
Data Scientist
What is Python?
General purpose programming language

Has code packages for data science

Object-oriented

PYTHON FOR SPREADSHEET USERS


Object-oriented spreadsheets

PYTHON FOR SPREADSHEET USERS


Object-oriented spreadsheets

PYTHON FOR SPREADSHEET USERS


Object-oriented Python
In Python In Spreadsheets

math.SUM(2,3)

PYTHON FOR SPREADSHEET USERS


The power of the dot
math is our package
math.SUM()
SUM is our function in the package

The dot ( . ) lets us access the function


inside the package

PYTHON FOR SPREADSHEET USERS


Scripts
A Recipe for Bu ered Toast

1. Turn on toaster

2. Put bread in toaster for 1 minute

3. Take bread out of toaster

4. Put butter on toast

PYTHON FOR SPREADSHEET USERS


Importing packages
import math

PYTHON FOR SPREADSHEET USERS


Importing pandas

PYTHON FOR SPREADSHEET USERS


Importing pandas
import pandas

PYTHON FOR SPREADSHEET USERS


Loading Excel data
import pandas

pandas.read_excel('fruit.xlsx')

fruit.xlsx
(the kind of data pandas likes)

PYTHON FOR SPREADSHEET USERS


pandas as pd
before a er

import pandas import pandas as pd

pandas.read_excel('fruit.xlsx') pd.read_excel('fruit.xlsx')

PYTHON FOR SPREADSHEET USERS


Variable assignment
import pandas as pd

fruit = pd.read_excel('fruit.xlsx')

fruit.xlsx

PYTHON FOR SPREADSHEET USERS


Looking at data with print()
import pandas as pd

fruit = pd.read_excel('fruit.xlsx')

print(fruit)

PYTHON FOR SPREADSHEET USERS


Script and console panes in DataCamp
script.py is the script

IPython Shell is the console

PYTHON FOR SPREADSHEET USERS


Your turn!
PYTHON FOR SPREADSHEET USERS
DataFrames and
their methods
PYTHON FOR SPREADSHEET USERS

Chris Cardillo
Data Scientist
Where we left off
import pandas as pd

fruit = pd.read_excel('fruit.xlsx')

print(fruit)

PYTHON FOR SPREADSHEET USERS


Anatomy of a pandas DataFrame

PYTHON FOR SPREADSHEET USERS


Anatomy of a pandas DataFrame

PYTHON FOR SPREADSHEET USERS


Anatomy of a pandas DataFrame

PYTHON FOR SPREADSHEET USERS


Anatomy of a pandas DataFrame

PYTHON FOR SPREADSHEET USERS


Anatomy of a pandas DataFrame

PYTHON FOR SPREADSHEET USERS


Anatomy of a pandas DataFrame

PYTHON FOR SPREADSHEET USERS


Anatomy of a pandas DataFrame

PYTHON FOR SPREADSHEET USERS


DataFrame methods
.head()

.info()

.describe()

.sort_values()

PYTHON FOR SPREADSHEET USERS


The .head() method
import pandas as pd

fruit = pd.read_excel('fruit.xlsx')

print(fruit.head())

PYTHON FOR SPREADSHEET USERS


The .head() method
import pandas as pd

fruit = pd.read_excel('fruit.xlsx')

print(fruit.head(2))

PYTHON FOR SPREADSHEET USERS


The .info() method
import pandas as pd

fruit = pd.read_excel('fruit.xlsx')

print(fruit.info())

PYTHON FOR SPREADSHEET USERS


The .describe() method
import pandas as pd

fruit = pd.read_excel('fruit.xlsx')

print(fruit.describe())

PYTHON FOR SPREADSHEET USERS


The .sort_values() method
import pandas as pd

fruit = pd.read_excel('fruit.xlsx')

fruit = fruit.sort_values('name')
fruit = fruit.reset_index(drop=True)

print(fruit)

PYTHON FOR SPREADSHEET USERS


The .sort_values() method
import pandas as pd

fruit = pd.read_excel('fruit.xlsx')

fruit = fruit.sort_values('price_usd', ascending=False)


fruit = fruit.reset_index(drop=True)

print(fruit.head(3))

PYTHON FOR SPREADSHEET USERS


Your turn!
PYTHON FOR SPREADSHEET USERS
Filtering rows and
creating columns
PYTHON FOR SPREADSHEET USERS

Chris Cardillo
Data Scientist
Filtering in spreadsheets

PYTHON FOR SPREADSHEET USERS


Accessing a single column

PYTHON FOR SPREADSHEET USERS


Accessing a single column
fruit['name']

PYTHON FOR SPREADSHEET USERS


Comparison operators
== (equal to)

!= (not equal to)

< (less than)

> (greater than)

<= (less than or equal to)

>= (greater or equal to)

PYTHON FOR SPREADSHEET USERS


Comparisons
fruit['name'] fruit['name'] == "Apple"

PYTHON FOR SPREADSHEET USERS


Filtering
fruit[fruit['name'] == "Apple"]

PYTHON FOR SPREADSHEET USERS


Filtering
fruit[fruit['price_usd'] > 1]

PYTHON FOR SPREADSHEET USERS


Filtering
fruit[fruit['price_usd'] > 1].reset_index(drop=True)

PYTHON FOR SPREADSHEET USERS


Basic filtering pattern
df[df['this_column'] == 'that value']

PYTHON FOR SPREADSHEET USERS


Basic filtering pattern
In Python In Spreadsheets
fruit[fruit['name'] == "Apple"]

PYTHON FOR SPREADSHEET USERS


Creating a new column

PYTHON FOR SPREADSHEET USERS


Creating a new column

PYTHON FOR SPREADSHEET USERS


Mathematical operators
+ (addition)

- (subtraction)

* (multiplication)

/ (division)

PYTHON FOR SPREADSHEET USERS


Creating a new column
fruit['cost'] = fruit['price_usd'] * 2

PYTHON FOR SPREADSHEET USERS


Creating a new column

PYTHON FOR SPREADSHEET USERS


Creating a new column

PYTHON FOR SPREADSHEET USERS


Creating a new column
fruit['cost'] = fruit['price_usd'] * fruit['quantity']

PYTHON FOR SPREADSHEET USERS


Your turn!
PYTHON FOR SPREADSHEET USERS

You might also like