Data Representation

Uploaded by

URK20DA1007 RYAN POWELL

Introduction

● The main objective of machine learning is to build models that understand data and find underlying patterns.

● Data must be fed to the model in a form that the computer can interpret.

● To feed data into a model, it must be represented as a table or as a matrix of the required dimensions.

● Converting the data into the correct tabular form is one of the first steps in data preprocessing.
Data Represented in a Table
Data should be arranged in a two-dimensional space made up of rows and columns.

This layout makes it easy to understand the data and pinpoint any problems.

[Figure: CSV data in table format]
To load a CSV file and work on it as a table, we use the pandas library.

The data is loaded into tables called DataFrames.
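
As a minimal sketch of loading CSV data into a DataFrame, the example below reads CSV text from an in-memory buffer instead of a file on disk, so that it runs without any external file. The column names follow the car example used later in these slides; the rows themselves are invented for illustration.

```python
import io

import pandas as pd

# Hypothetical CSV content standing in for a file on disk (values are made up).
csv_text = """Car Model,Car Capacity,Car Brand,Car Price
A1,4,BrandX,20000
B2,5,BrandY,25000
C3,7,BrandZ,32000
"""

# pd.read_csv accepts a file path or any file-like object.
# header=0 tells pandas to use the first row as the column names.
df = pd.read_csv(io.StringIO(csv_text), header=0)

print(type(df).__name__)  # the loaded table is a DataFrame
print(df.head())          # first rows of the table
```

With a real file, the `io.StringIO(csv_text)` argument would be replaced by the path to the CSV file.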

Independent and Target Variables
The DataFrame that we use contains variables, or features, that can be classified into two categories.

Independent Variable (Predictor Variable)

o Used to predict the target variable.

o Independent variables are independent of each other.

Dependent Variable (Target Variable)

o The variable to be predicted; its value depends on the independent variables.

Independent Variables
The independent variables are the features in the DataFrame. Together they have size (m, n), where m is the number of observations and n is the number of features.
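
The size (m, n) can be read from the `shape` attribute of a DataFrame. The sketch below assumes a small, invented feature table with three observations and two features.

```python
import pandas as pd

# Hypothetical feature table: 3 observations (rows) and 2 features (columns).
features = pd.DataFrame({
    "Car Capacity": [4, 5, 7],
    "Car Brand": ["BrandX", "BrandY", "BrandZ"],
})

# shape is a tuple (number of rows, number of columns).
m, n = features.shape
print("observations:", m)
print("features:", n)
```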

Independent variables should ideally be normally distributed and should not contain:

• Missing or Null Values

• Highly categorical data features

• Outliers

• Data on different scales

• Human error

• Multicollinearity (independent variables that are correlated)

• Very large independent feature sets

• Sparse data

• Special characters
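
Several of the issues listed above can be spotted with a few pandas calls. The following is only a rough sketch on an invented table (the column names and values are assumptions): it counts missing values, summarises the scale of each column, and computes pairwise correlations as a first look at multicollinearity.

```python
import numpy as np
import pandas as pd

# Hypothetical numeric features; one missing value and one extreme income value.
df = pd.DataFrame({
    "income": [52000.0, 61000.0, np.nan, 58000.0, 950000.0],
    "house_age": [5.0, 6.1, 7.0, 5.5, 6.3],
    "rooms": [6.0, 7.2, 8.1, 6.4, 7.5],
})

# Missing or null values: count of NaN entries per column.
missing_per_column = df.isnull().sum()

# Different scales and possible outliers: compare min, max, mean and quartiles.
summary = df.describe()

# Multicollinearity: pairwise correlation between the features.
correlations = df.corr()

print(missing_per_column)
print(summary)
print(correlations)
```

These checks only flag candidates for closer inspection; how to treat them depends on the dataset and the model.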
Feature Matrix and Target Vector
A single piece of data is called a scalar.

A group of scalars is called a vector, and a group of vectors is called a matrix.

A matrix is represented in rows and columns.

Feature matrix data is made up of independent columns, and the target vector depends on the feature matrix columns.

Independent variables: Car Model, Car Capacity, Car Brand
Dependent variable: Car Price
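
To make the terms scalar, vector and matrix concrete, here is a small sketch built on the car example. The column names come from the slide; the values are invented for illustration.

```python
import pandas as pd

# Hypothetical car data (values are made up).
cars = pd.DataFrame({
    "Car Model": ["A1", "B2", "C3"],
    "Car Capacity": [4, 5, 7],
    "Car Brand": ["BrandX", "BrandY", "BrandZ"],
    "Car Price": [20000, 25000, 32000],
})

# Scalar: a single piece of data (one cell of the table).
scalar = cars.loc[0, "Car Price"]

# Vector: a group of scalars (one column), here the target vector.
target_vector = cars["Car Price"]

# Matrix: a group of vectors (several columns), here the feature matrix.
feature_matrix = cars[["Car Model", "Car Capacity", "Car Brand"]]

print(scalar)
print(target_vector.ndim, target_vector.shape)
print(feature_matrix.ndim, feature_matrix.shape)
```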
Loading a Sample Dataset and Creating Feature Matrix and Target Matrix

1. Import the pandas library

import pandas as pd

2. Load the dataset into a pandas DataFrame

dataset = "filename"
df = pd.read_csv(dataset, header=0)

3. To print all the columns

df.columns
4. Total number of rows

df.index

For the default index this displays a RangeIndex whose stop value equals the number of rows; len(df.index) returns the count directly.

5. Set the Address column as the index

Syntax: DataFrame.set_index('column name', inplace=True)

df.set_index('Address', inplace=True)

6. Reset the index

df.reset_index(inplace=True)
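
The index operations in steps 4 to 6 can be tried on a small stand-in table. The sketch below assumes an invented DataFrame with an Address column; the addresses and numbers are placeholders, not values from the dataset used in the slides.

```python
import pandas as pd

# Hypothetical stand-in for the housing data (values are made up).
df = pd.DataFrame({
    "Avg. Area Income": [79545.0, 79248.0, 61287.0],
    "Address": ["Addr 1", "Addr 2", "Addr 3"],
    "Price": [1059033.0, 1505890.0, 1058987.0],
})

# Step 4: the index describes the row labels; its length is the row count.
print(df.index)
n_rows = len(df.index)

# Step 5: use the Address column as the index (it stops being a regular column).
df.set_index("Address", inplace=True)
index_name_after_set = df.index.name

# Step 6: restore the default integer index; Address becomes a column again.
df.reset_index(inplace=True)
print(df.head())
```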
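7. Retrieve the first four rows and first three columns

df.iloc[0:4, 0:3]

8. Retrieve the data using labels

df.loc[0:4, ["Avg. Area Income", "Avg. Area House Age"]]

Unlike iloc, a label slice with loc includes its end label, so this returns the rows labelled 0 through 4.

9. Reset the index

df.reset_index(inplace=True)

This is needed only if the index is not already the default; calling it on a default index adds an extra column named index.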
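
The difference between position-based and label-based selection in steps 7 and 8 is easiest to see from the shapes of the results. This sketch assumes an invented six-row table with a default integer index; only the column names are taken from the slides.

```python
import pandas as pd

# Hypothetical stand-in for the housing data (values are made up).
df = pd.DataFrame({
    "Avg. Area Income": [79545.0, 79248.0, 61287.0, 63345.0, 59982.0, 80175.0],
    "Avg. Area House Age": [5.7, 6.0, 5.9, 7.2, 5.0, 4.9],
    "Price": [1059033.0, 1505890.0, 1058987.0, 1260616.0, 630943.0, 1068138.0],
    "Address": ["Addr 1", "Addr 2", "Addr 3", "Addr 4", "Addr 5", "Addr 6"],
})

# iloc selects by integer position; the end of each slice is excluded,
# so 0:4 gives rows 0..3 and 0:3 gives columns 0..2.
by_position = df.iloc[0:4, 0:3]

# loc selects by label; the end label of the slice is included,
# so 0:4 gives the rows labelled 0..4.
by_label = df.loc[0:4, ["Avg. Area Income", "Avg. Area House Age"]]

print(by_position.shape)
print(by_label.shape)
```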
10. Drop the Price column to create the feature matrix

X = df.drop('Price', axis=1)

11. Shape of the feature matrix

X.shape

12. To store the target variable

y = df['Price']
y.head(10)

13. Shape of the new variable

y.shape

[Figure: Created feature and target matrices of a dataset]
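
Steps 10 to 13 can be put together as follows. This is a sketch on an invented four-row table, assuming Price is the target column as in the slides; the values are placeholders.

```python
import pandas as pd

# Hypothetical stand-in for the housing data (values are made up).
df = pd.DataFrame({
    "Avg. Area Income": [79545.0, 79248.0, 61287.0, 63345.0],
    "Avg. Area House Age": [5.7, 6.0, 5.9, 7.2],
    "Price": [1059033.0, 1505890.0, 1058987.0, 1260616.0],
})

# Feature matrix: every column except the target.
# drop returns a new DataFrame; df itself is left unchanged.
X = df.drop("Price", axis=1)

# Target vector: the Price column on its own.
y = df["Price"]

print(X.shape)     # (rows, number of feature columns)
print(y.head(10))  # up to the first ten target values
print(y.shape)     # (rows,)
```

Keeping X and y as separate objects matches the convention of most machine learning libraries, which take the feature matrix and the target vector as separate arguments.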
