Analyze automobile dataset with Pandas

Uploaded by

Gregory

Original Title

#1_Skill Builds_Data Analysis with Python

# Import pandas library
import pandas as pd

# Read the online file from the URL provided above, and assign it to the variable "df"
other_path = "https://s3-api.us-geo.objectstorage.softlayer.net/cf-courses-data/CognitiveClass/DA0101EN/auto.csv"
df = pd.read_csv(other_path, header=None)
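The effect of header=None can be seen on a small in-memory example. This is only a sketch: the two sample rows below are hypothetical stand-ins, not the contents of the real file.

```python
import io
import pandas as pd

# Hypothetical two-row sample with no header row (not the real auto.csv)
sample = "3,?,alfa-romero\n1,164,audi\n"

# Default behaviour: the first data row is consumed as the column names
with_default = pd.read_csv(io.StringIO(sample))

# header=None: every row is kept as data and the columns are numbered 0, 1, 2
with_none = pd.read_csv(io.StringIO(sample), header=None)

print(with_default.shape)       # (1, 3)
print(with_none.shape)          # (2, 3)
print(list(with_none.columns))  # [0, 1, 2]
```

Without header=None, one row of data would be silently lost to the column names, which is why the headers are assigned separately afterwards.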

# show the first 5 rows using dataframe.head() method
print("The first 5 rows of the dataframe")
df.head(5)

# show the last 10 rows using dataframe.tail() method
print("The last 10 rows of the dataframe\n")
df.tail(10)

# create headers list
headers = ["symboling", "normalized-losses", "make", "fuel-type", "aspiration", "num-of-doors", "body-style",
           "drive-wheels", "engine-location", "wheel-base", "length", "width", "height", "curb-weight", "engine-type",
           "num-of-cylinders", "engine-size", "fuel-system", "bore", "stroke", "compression-ratio", "horsepower",
           "peak-rpm", "city-mpg", "highway-mpg", "price"]
print("headers\n", headers)

# We replace the headers and recheck our data frame
df.columns = headers
df.head(10)

# We can drop rows with missing values in the "price" column as follows.
# dropna returns a new dataframe; assign the result if you want to keep it.
df.dropna(subset=["price"], axis=0)
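Note that dropna only removes values that pandas recognizes as missing (NaN). If a file marks missing entries with a placeholder string such as "?" (an assumption here, not something the code above checks), the placeholder has to be converted to NaN first. A minimal sketch on a hypothetical three-row frame:

```python
import numpy as np
import pandas as pd

# Hypothetical miniature frame; "?" stands in for a missing price
toy = pd.DataFrame({"make": ["audi", "bmw", "dodge"],
                    "price": ["13950", "?", "6377"]})

# dropna alone keeps all three rows, because "?" is an ordinary string
unchanged = toy.dropna(subset=["price"], axis=0)

# Convert the placeholder to NaN, then drop; assign the result to keep it
cleaned = toy.replace("?", np.nan).dropna(subset=["price"], axis=0)

print(len(unchanged), len(cleaned))  # 3 2
```

The original frame toy still has three rows afterwards, since neither replace nor dropna modifies it in place by default.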

# Write your code below and press Shift+Enter to execute
print(df.columns)

# If you would like to save the dataframe df as automobile.csv to your local machine, you may use the syntax below:
df.to_csv("automobile.csv", index=False)
We can also read and save other file formats using functions similar to pd.read_csv() and
df.to_csv(). These functions are listed in the following table:

Data Format    Read               Save

csv            pd.read_csv()      df.to_csv()
json           pd.read_json()     df.to_json()
excel          pd.read_excel()    df.to_excel()
hdf            pd.read_hdf()      df.to_hdf()
sql            pd.read_sql()      df.to_sql()
...            ...                ...
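As one illustration of a read/save pair from the table, the sketch below round-trips a small hypothetical frame through JSON text in memory, so no file is written. The sample values are made up for the example.

```python
import io
import pandas as pd

# Hypothetical two-row frame used only for this round trip
toy = pd.DataFrame({"make": ["audi", "bmw"], "city-mpg": [24, 23]})

# With no path argument, to_json returns the JSON text as a string
json_text = toy.to_json(orient="records")

# read_json accepts a file-like object, so the string is wrapped in StringIO
restored = pd.read_json(io.StringIO(json_text), orient="records")

print(restored)
```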

Data Types
Data comes in a variety of types.
The main types stored in Pandas dataframes are object, float, int, bool and datetime64. To
understand each attribute better, it helps to know the data type of each
column. In Pandas:

df.dtypes

returns a Series with the data type of each column.

# check the data type of data frame "df" by .dtypes
print(df.dtypes)
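To make the output of .dtypes concrete, here is a small sketch on a hypothetical frame with one text column, one float column and one integer column. The exact name reported for the text column may depend on the pandas version, so it is not asserted here.

```python
import pandas as pd

# Hypothetical frame with three kinds of columns
toy = pd.DataFrame({"make": ["audi", "bmw"],
                    "length": [176.6, 189.0],
                    "city-mpg": [24, 23]})

# .dtypes is a Series indexed by column name
kinds = toy.dtypes
print(kinds)

print(kinds["length"])    # float64
print(kinds["city-mpg"])  # int64
```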

# If we would like to get a statistical summary of each column, such as
# count, column mean value, column standard deviation, etc., we use the
# describe method:
df.describe()

By default, describe() summarizes only the numeric columns. To include the other columns as
well, add the argument include = "all" inside the parentheses. Let's try it again.

# describe all the columns in "df"
df.describe(include = "all")
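The difference between the two calls can be sketched on a hypothetical frame with one text column and one numeric column:

```python
import pandas as pd

# Hypothetical frame: "make" is text, "length" is numeric
toy = pd.DataFrame({"make": ["audi", "bmw", "audi"],
                    "length": [176.6, 189.0, 192.7]})

numeric_only = toy.describe()             # numeric columns only
everything = toy.describe(include="all")  # text columns are summarized too

print(list(numeric_only.columns))     # ['length']
print(list(everything.columns))       # ['make', 'length']
print(everything.loc["top", "make"])  # audi (the most frequent value)
```

With include="all", text columns get rows such as unique, top and freq, while the numeric statistics for those columns are shown as NaN.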

You can select columns of a data frame by listing their names. For example,
you can select three columns as follows:

dataframe[['column 1', 'column 2', 'column 3']]

where each "column" is the name of a column. You can then apply the method ".describe()" to get the
statistics of those columns:

dataframe[['column 1', 'column 2', 'column 3']].describe()

df[['length', 'compression-ratio']].describe()
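The double brackets matter: a list of names returns a dataframe, while a single name in single brackets returns a Series. A short sketch with hypothetical values:

```python
import pandas as pd

# Hypothetical measurements for two cars
toy = pd.DataFrame({"length": [168.8, 171.2],
                    "width": [64.1, 65.5],
                    "compression-ratio": [9.0, 10.0]})

subset = toy[["length", "compression-ratio"]]  # list of names -> DataFrame
single = toy["length"]                         # one name -> Series

summary = subset.describe()
print(type(subset).__name__, type(single).__name__)  # DataFrame Series
print(summary.loc["mean"])
```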

Another method you can use to check your dataset is:

# look at the info of "df"
df.info()
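The info method prints its report rather than returning it. As a sketch, the report can also be captured as text through the buf parameter, shown here on a hypothetical frame with one missing value:

```python
import io
import pandas as pd

# Hypothetical frame; None becomes a missing value in the "price" column
toy = pd.DataFrame({"make": ["audi", "bmw", "dodge"],
                    "price": [13950.0, None, 6377.0]})

# info() writes to the given buffer instead of standard output
buffer = io.StringIO()
toy.info(buf=buffer)
report = buffer.getvalue()

print(report)
```

The report lists each column with its non-null count and data type, which makes columns with missing values easy to spot.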
