
Python For Data Science
Data Wrangling in Pandas Cheat Sheet
Learn Data Wrangling online at www.DataCamp.com

> Advanced Indexing    Also see NumPy Arrays


Selecting

>>> df3.loc[:,(df3>1).any()] #Select cols with any vals >1
>>> df3.loc[:,(df3>1).all()] #Select cols with all vals >1
>>> df3.loc[:,df3.isnull().any()] #Select cols with NaN
>>> df3.loc[:,df3.notnull().all()] #Select cols without NaN

Indexing With isin()

>>> df[(df.Country.isin(df2.Type))] #Find same elements
>>> df3.filter(items=["a","b"]) #Filter on values
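For instance, with a small made-up df3, the boolean selectors above keep only the columns whose mask is True (a minimal sketch; this df3 is invented for illustration):

>>> import pandas as pd
>>> df3 = pd.DataFrame({'a': [0.5, 2.0], 'b': [-1.0, 0.3]})
>>> df3.loc[:,(df3>1).any()] #Keeps only column 'a', the one with a value > 1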


>>> df.loc[df.index % 5 == 0] #Select specific elements (df.select is deprecated)

Where

>>> s.where(s > 0) #Subset the data

Query

>>> df6.query('second > first') #Query DataFrame
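query() evaluates a boolean expression against columns, or against named index levels as with df6's MultiIndex below. A self-contained sketch with an invented frame:

>>> dfq = pd.DataFrame({'first': [1, 5], 'second': [4, 2]})
>>> dfq.query('second > first') #Returns only the first row, since 4 > 1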


Setting/Resetting Index

>>> df.set_index('Country') #Set the index
>>> df4 = df.reset_index() #Reset the index
>>> df = df.rename(index=str, #Rename DataFrame
                   columns={"Country":"cntry",
                            "Capital":"cptl",
                            "Population":"ppltn"})
Reindexing

>>> s2 = s.reindex(['a','c','d','e','b'])

Forward Filling
>>> df.reindex(range(4), method='ffill')
   Country    Capital  Population
0  Belgium   Brussels    11190846
1    India  New Delhi  1303171035
2   Brazil   Brasília   207847528
3   Brazil   Brasília   207847528

Backward Filling
>>> s3 = s.reindex(range(5), method='bfill')
0    3
1    3
2    3
3    3
4    3
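reindex() can also fill unknown labels with a constant instead of propagating neighboring values (the s here is a made-up Series):

>>> s = pd.Series([3, -5, 7], index=['a','b','c'])
>>> s.reindex(['c','a','z'], fill_value=0) #Unknown label 'z' gets 0, not NaN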


MultiIndexing

>>> arrays = [np.array([1,2,3]),
              np.array([5,4,3])]
>>> df5 = pd.DataFrame(np.random.rand(3, 2), index=arrays)
>>> tuples = list(zip(*arrays))
>>> index = pd.MultiIndex.from_tuples(tuples,
                                      names=['first', 'second'])
>>> df6 = pd.DataFrame(np.random.rand(3, 2), index=index)
>>> df2.set_index(["Date", "Type"])
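Rows of a MultiIndexed frame can then be selected by label tuple or by level name; two brief examples using df6 from above:

>>> df6.loc[(1, 5)] #The row where first=1 and second=5
>>> df6.xs(5, level='second') #All rows with second == 5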

> Reshaping Data

Pivot

>>> df3 = df2.pivot(index='Date', #Spread rows into columns
                    columns='Type',
                    values='Value')

Pivot Table

>>> df4 = pd.pivot_table(df2, #Spread rows into columns
                         values='Value',
                         index='Date',
                         columns='Type')

Stack / Unstack

>>> stacked = df5.stack() #Pivot a level of column labels
>>> stacked.unstack() #Pivot a level of index labels

Melt

>>> pd.melt(df2, #Gather columns into rows
            id_vars=["Date"],
            value_vars=["Type", "Value"],
            value_name="Observations")

> Combining Data

Merge

>>> pd.merge(data1, data2, how='left', on='X1') #Keep every key in data1
>>> pd.merge(data1, data2, how='right', on='X1') #Keep every key in data2
>>> pd.merge(data1, data2, how='inner', on='X1') #Keep keys found in both
>>> pd.merge(data1, data2, how='outer', on='X1') #Keep all keys

Join

>>> data1.join(data2, how='right')

Concatenate

Vertical
>>> pd.concat([s, s2]) #(Series.append is deprecated)
Horizontal/Vertical
>>> pd.concat([s,s2],axis=1, keys=['One','Two'])
>>> pd.concat([data1, data2], axis=1, join='inner')
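A sketch of how the merge modes differ, on two invented frames sharing the key column X1:

>>> data1 = pd.DataFrame({'X1': ['a','b','c'], 'X2': [11.4, 1.3, 99.9]})
>>> data2 = pd.DataFrame({'X1': ['a','b','d'], 'X3': [20.7, 8.8, 20.7]})
>>> pd.merge(data1, data2, how='inner', on='X1') #Only keys 'a' and 'b' survive
>>> pd.merge(data1, data2, how='outer', on='X1') #'c' and 'd' kept, padded with NaN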

> Dates

>>> df2['Date']= pd.to_datetime(df2['Date'])
>>> df2['Date']= pd.date_range('2000-1-1',
                               periods=6,
                               freq='M')
>>> from datetime import datetime
>>> dates = [datetime(2012,5,1), datetime(2012,5,2)]
>>> index = pd.DatetimeIndex(dates)
>>> index = pd.date_range(datetime(2012,2,1), end, freq='BM')

> Duplicate Data

>>> s3.unique() #Return unique values
>>> df2.duplicated('Type') #Check duplicates
>>> df2.drop_duplicates('Type', keep='last') #Drop duplicates
>>> df.index.duplicated() #Check index duplicates
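For example, with a throwaway df2 (invented here), keep='last' retains the final occurrence of each duplicated key:

>>> df2 = pd.DataFrame({'Type': ['a', 'b', 'a'], 'Value': [1, 2, 3]})
>>> df2.drop_duplicates('Type', keep='last') #Drops row 0; rows 1 and 2 remain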

> Grouping Data

Aggregation

>>> df2.groupby(by=['Date','Type']).mean()
>>> df4.groupby(level=0).sum()
>>> df4.groupby(level=0).agg({'a':lambda x:sum(x)/len(x),
                              'b': np.sum})

Transformation

>>> customSum = lambda x: (x+x%2)
>>> df4.groupby(level=0).transform(customSum)
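Unlike agg, transform returns a result indexed like the original, one value per row. A sketch with an invented df4:

>>> df4 = pd.DataFrame({'a': [1, 2, 3]}, index=[0, 0, 1])
>>> df4.groupby(level=0).transform(customSum) #Column 'a' becomes [2, 2, 4]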


> Iteration

>>> df.items() #(Column name, Series) pairs (formerly iteritems)
>>> df.iterrows() #(Row index, Series) pairs
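A small usage sketch, assuming df still has the sheet's Country column:

>>> for label, row in df.iterrows():
...     print(label, row['Country'])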

> Missing Data

>>> df.dropna() #Drop NaN values
>>> df2.replace("a", "f") #Replace values with others
>>> df3.fillna(df3.mean()) #Fill NaN values with the column means
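For instance, on a one-column invented frame, the mean of the remaining values fills the hole:

>>> df3 = pd.DataFrame({'a': [1.0, None, 3.0]})
>>> df3.fillna(df3.mean()) #The NaN becomes 2.0, the mean of 1.0 and 3.0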


> Visualization    Also see Matplotlib

>>> import matplotlib.pyplot as plt
>>> s.plot() #Plot a Series
>>> plt.show()
>>> df2.plot() #Plot a DataFrame
>>> plt.show()

Learn Data Skills Online at www.DataCamp.com