Welcome to Scribd!

ML Assignment1 Linear Regression

Uploaded by

0% found this document useful (0 votes)

5 views6 pages

The document shows analysis of a dataset containing employees' years of experience and salary. It loads and inspects the data, plots a scatter plot of experience vs. salary, fits a linear regression model to predict salary from experience, and plots the training and test results. Key steps include splitting the data into train and test sets, fitting a linear regression model to the training set, and using the model to make predictions on both training and test sets.

Original Description:

includes the linear regresssion model

Original Title

ML assignment1 linear regression

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

5 views6 pages

ML Assignment1 Linear Regression

Uploaded by

Dishant kumar yadav mhakhariya

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 6

Search inside document

keyboard_arrow_down DISHANT KUMAR YADAV 2021BCS0136

#DISHANT KUMAR YADAV

import numpy as np
import pandas as pd

df = pd.read_csv('/content/sample_data/Salary_Data.csv')
df
#DISHANT KUMAR YADAV

YearsExperience Salary

0 1.1 39343.0

1 1.3 46205.0

2 1.5 37731.0

3 2.0 43525.0

4 2.2 39891.0

5 2.9 56642.0

6 3.0 60150.0

7 3.2 54445.0

8 3.2 64445.0

9 3.7 57189.0

10 3.9 63218.0

11 4.0 55794.0

12 4.0 56957.0

13 4.1 57081.0

14 4.5 61111.0

15 4.9 67938.0

16 5.1 66029.0

17 5.3 83088.0

18 5.9 81363.0

19 6.0 93940.0

20 6.8 91738.0

21 7.1 98273.0

22 7.9 101302.0

23 8.2 113812.0

24 8.7 109431.0

25 9.0 105582.0

26 9.5 116969.0

27 9.6 112635.0

28 10.3 122391.0

29 10.5 121872.0

#DISHANT KUMAR YADAV

import matplotlib.pyplot as plt

exp = df['YearsExperience']
sal = df['Salary']

plt.scatter(exp,sal)
plt.xlabel('Experience')
plt.ylabel('Salary')
#DISHANT KUMAR YADAV
Text(0, 0.5, 'Salary')

#DISHANT KUMAR YADAV

exp_np = exp.to_numpy()
sal_np = sal.to_numpy()

exp_np.shape, sal_np.shape
#DISHANT KUMAR YADAV

((30,), (30,))

#DISHANT KUMAR YADAV

from sklearn.linear_model import LinearRegression

sklearn_model = LinearRegression().fit(exp_np.reshape((30,1)), sal_np)

sklearn_sal_predictions = sklearn_model.predict(exp_np.reshape((30,1)))
sklearn_sal_predictions.shape
#DISHANT KUMAR YADAV

(30,)

#DISHANT KUMAR YADAV

exp = df['YearsExperience']
sal = df['Salary']

plt.scatter(exp,sal)
plt.xlabel('Experience')
plt.ylabel('Salary')

plt.scatter(exp,sklearn_sal_predictions )
#DISHANT KUMAR YADAV

output <matplotlib.collections.PathCollection at 0x7c7e2822d360>

#DISHANT KUMAR YADAV
predictions_df = pd.DataFrame({'YearsExperience': exp, 'Salary':sal, 'Sklearn salary prediction':sklearn_sal_predictions})

predictions_df
#DISHANT KUMAR YADAV

YearsExperience Salary Sklearn salary prediction

0 1.1 39343.0 36187.158752

1 1.3 46205.0 38077.151217

2 1.5 37731.0 39967.143681

3 2.0 43525.0 44692.124842

4 2.2 39891.0 46582.117306

5 2.9 56642.0 53197.090931

6 3.0 60150.0 54142.087163

7 3.2 54445.0 56032.079627

8 3.2 64445.0 56032.079627

9 3.7 57189.0 60757.060788

10 3.9 63218.0 62647.053252

11 4.0 55794.0 63592.049484

12 4.0 56957.0 63592.049484

13 4.1 57081.0 64537.045717

14 4.5 61111.0 68317.030645

15 4.9 67938.0 72097.015574

16 5.1 66029.0 73987.008038

17 5.3 83088.0 75877.000502

18 5.9 81363.0 81546.977895

19 6.0 93940.0 82491.974127

20 6.8 91738.0 90051.943985

21 7.1 98273.0 92886.932681

22 7.9 101302.0 100446.902538

23 8.2 113812.0 103281.891235

24 8.7 109431.0 108006.872395

25 9.0 105582.0 110841.861092

26 9.5 116969.0 115566.842252

27 9.6 112635.0 116511.838485

28 10.3 122391.0 123126.812110

29 10.5 121872.0 125016.804574

keyboard_arrow_down DISHANT KUMAR YADAV 2021BCS0136
# Step 1: Import the required python packages
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression

# Step 2: Load the dataset

df = pd.read_csv('/content/sample_data/Salary_Data.csv')

# Step 3: Data analysis - distribution plot shows the variation in the data distribution.
exp = df['YearsExperience']
sal = df['Salary']

plt.scatter(exp, sal)
plt.xlabel('Experience')
plt.ylabel('Salary')
plt.title('Distribution of Experience vs. Salary')
plt.show()

output

# Step 4: Split the dataset into dependent/independent variables

X = df[['YearsExperience']]
y = df['Salary']

# Step 5: Split data into Train/Test sets

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Step 6: Train the regression model

regression_model = LinearRegression()
regression_model.fit(X_train, y_train)

▾ LinearRegression
LinearRegression()
# Step 7: Plot the training results
plt.scatter(X_train, y_train, color='blue')
plt.plot(X_train, regression_model.predict(X_train), color='red')
plt.xlabel('Experience')
plt.ylabel('Salary')
plt.title('Training Results: Experience vs. Salary')
plt.show()

# Step 7: Plot the test results

plt.scatter(X_test, y_test, color='blue')
plt.plot(X_train, regression_model.predict(X_train), color='red') # Same line as training for comparison
plt.xlabel('Experience')
plt.ylabel('Salary')
plt.title('Test Results: Experience vs. Salary')
plt.show()

Easy Sudoku Puzzle Book (Printable Version)
From Everand
Easy Sudoku Puzzle Book (Printable Version)
Sheba Blake
No ratings yet
PembelajaranMesin - Ipynb - Colaboratory
Document6 pages
PembelajaranMesin - Ipynb - Colaboratory
Khaerul Rijal
No ratings yet
21 tb1
Document6 pages
21 tb1
lotanna
No ratings yet
Stress Strain 2
Document1 page
Stress Strain 2
Shivam Dixit
No ratings yet
Datos F'C Fy Factor de Reducción (Ø) : 250 kg/cm2 4200 kg/cm2 0.9 2388000 KG-CM
Document50 pages
Datos F'C Fy Factor de Reducción (Ø) : 250 kg/cm2 4200 kg/cm2 0.9 2388000 KG-CM
RobsonOrtiz
No ratings yet
Tarea
Document12 pages
Tarea
Abrahan Macias
No ratings yet
Libro 1
Document7 pages
Libro 1
PIZARRO SUNCION JEAN PIER
No ratings yet
File of Monte Carlo
Document226 pages
File of Monte Carlo
Areeb Nasir Mughal
No ratings yet
Result Thesis
Document6 pages
Result Thesis
Ieqa Haziqah
No ratings yet
Regression Practice File
Document9 pages
Regression Practice File
Fadwa Jdia
No ratings yet
Data Munging - Ipynb - Colaboratory - Yodhi Adhi Sanjaya
Document4 pages
Data Munging - Ipynb - Colaboratory - Yodhi Adhi Sanjaya
adhi
No ratings yet
EkoTek - Part IV
Document15 pages
EkoTek - Part IV
wuri rahardjo
No ratings yet
Spacing 1 Number Trapezoids 10 Area 608.2286: Roy Haggerty
Document6 pages
Spacing 1 Number Trapezoids 10 Area 608.2286: Roy Haggerty
Spoil Bat
No ratings yet
Pruebas de Viscoelasticidad Salchicha
Document34 pages
Pruebas de Viscoelasticidad Salchicha
Catalina Mazo Rivas
No ratings yet
Quiz - MTK Optimasi - Randi Susilo - 1904020005
Document3 pages
Quiz - MTK Optimasi - Randi Susilo - 1904020005
Randi Susilo
No ratings yet
6.8 Functiob: Talde Digamma For Complex Arguments Y Y
Document6 pages
6.8 Functiob: Talde Digamma For Complex Arguments Y Y
tt
No ratings yet
Pca Implementation Notebook
Document4 pages
Pca Implementation Notebook
Walid Sassi
No ratings yet
Chart For Curved Wedges and Long Seam Weld Pipe Inspection v4
Document1 page
Chart For Curved Wedges and Long Seam Weld Pipe Inspection v4
david montilla
No ratings yet
No X T K K Ca CB Ra Fa /ra: Natasya Diwa Milenia 3335170047
Document5 pages
No X T K K Ca CB Ra Fa /ra: Natasya Diwa Milenia 3335170047
afif sena
No ratings yet
Table D.4 Example: Upper Percentage Points of The Distribution
Document2 pages
Table D.4 Example: Upper Percentage Points of The Distribution
Maytavee P. Chunhawutiyanon
No ratings yet
Seismic Checks Asce7 10
Document9 pages
Seismic Checks Asce7 10
طه حلمى
No ratings yet
XY Plot 16 - Velocity
Document4 pages
XY Plot 16 - Velocity
BAGARAGAZA ROMUALD
No ratings yet
From Utility Function To Demand Function and Deman Curve
Document10 pages
From Utility Function To Demand Function and Deman Curve
Julian
No ratings yet
Simple Linear Regression Lab II
Document5 pages
Simple Linear Regression Lab II
Zarfa Masood
No ratings yet
K-Nearest Neighbors Dataset X1 X2 Y
Document2 pages
K-Nearest Neighbors Dataset X1 X2 Y
Weary Case
No ratings yet
Countfwall
Document4 pages
Countfwall
Virendra Dehadrai
No ratings yet
DNV Riso
Document6 pages
DNV Riso
Von A. Damirez
No ratings yet
Prueba - KS
Document17 pages
Prueba - KS
Karen
No ratings yet
CG-3 Technical Note 4
Document3 pages
CG-3 Technical Note 4
Adindra Vickar Ega
No ratings yet
02 ESTRUCTURAS RESERVORIO ELEVADO 230 m3
Document20 pages
02 ESTRUCTURAS RESERVORIO ELEVADO 230 m3
Antonio Pineda Camones
No ratings yet
3rd Order Prob 1
Document102 pages
3rd Order Prob 1
Bonifacio Saut
No ratings yet
Book 1
Document8 pages
Book 1
aprajita roy
No ratings yet
PDD and PBU Test
Document2 pages
PDD and PBU Test
NA
No ratings yet
This Solution Solver Is For Up To 12 X 12 System of Linear Algebraic Equations Only
Document12 pages
This Solution Solver Is For Up To 12 X 12 System of Linear Algebraic Equations Only
bobos
No ratings yet
AND Gamma Function For Complex Arguments: Related
Document11 pages
AND Gamma Function For Complex Arguments: Related
tt
No ratings yet
Appendix C - Msa Manual 3Rd Edition: Values Associated With The Distribution of The Average Range
Document1 page
Appendix C - Msa Manual 3Rd Edition: Values Associated With The Distribution of The Average Range
neerajrdx
No ratings yet
Experiment 2 Lab Data
Document3 pages
Experiment 2 Lab Data
ELOUISE
No ratings yet
Prueba - KS
Document17 pages
Prueba - KS
lucho
No ratings yet
Caso IOSR CARACTERIZACIÓN
Document48 pages
Caso IOSR CARACTERIZACIÓN
sandruka605
No ratings yet
Eric Fluckiger Lab
Document98 pages
Eric Fluckiger Lab
api-302420928
No ratings yet
MATRIZ de Coheficientes A Vector B
Document12 pages
MATRIZ de Coheficientes A Vector B
Jefenandez
No ratings yet
Característica - 1 Servidor No Clientes Hllegada Tserv Tentrelleg Hinicioserv Tcola Hfinserv
Document17 pages
Característica - 1 Servidor No Clientes Hllegada Tserv Tentrelleg Hinicioserv Tcola Hfinserv
YudythVargas
No ratings yet
Laborator 1
Document13 pages
Laborator 1
CMM Intervention
No ratings yet
Tabel Present Value
Document8 pages
Tabel Present Value
Pricilla Putri
No ratings yet
Plantilla
Document9 pages
Plantilla
Viviana Calderon
No ratings yet
Fy 6000 Fy 5000 Fy 4200: Mu/bd Mu/bd Mu/bd Mu/bd Mu/bd Mu/bd Mu/bd Mu/bd Mu/bd
Document5 pages
Fy 6000 Fy 5000 Fy 4200: Mu/bd Mu/bd Mu/bd Mu/bd Mu/bd Mu/bd Mu/bd Mu/bd Mu/bd
Flack Baxter
No ratings yet
X A V V V
Document49 pages
X A V V V
Bima Maulana
No ratings yet
Calculo Final
Document8 pages
Calculo Final
Yesid Camilo Bohorquez Tordecilla
No ratings yet
RT Data
Document1 page
RT Data
Vaibhav Kotnala
No ratings yet
B57861S0303F040 - RT Data
Document1 page
B57861S0303F040 - RT Data
Vaibhav Kotnala
No ratings yet
Realestate Quiz Part1
Document27 pages
Realestate Quiz Part1
Shazeb Lalu
No ratings yet
ANISA Excel
Document16 pages
ANISA Excel
anisaishak
No ratings yet
Holt Winters Multiplicative
Document4 pages
Holt Winters Multiplicative
Akine Mikazuki
No ratings yet
Holt Winters Multiplicative
Document4 pages
Holt Winters Multiplicative
Muhammad Diannor Saputera
No ratings yet
Holt Winters Multiplicative
Document4 pages
Holt Winters Multiplicative
Muhammad Rizaldi
No ratings yet
Eksisting CFD
Document5 pages
Eksisting CFD
Wiranto Banjarnahor
No ratings yet
Power Load Curve 2
Document10 pages
Power Load Curve 2
Adrian M. Barrameda
No ratings yet
Prueba - KS SIMULACION GERENCIAL
Document17 pages
Prueba - KS SIMULACION GERENCIAL
David Vásquez
No ratings yet
MC17 Randdemo T
Document34 pages
MC17 Randdemo T
sivaji_ss
No ratings yet
Tabla de Equivalencias de Valores DCP Vrs CBR
Document1 page
Tabla de Equivalencias de Valores DCP Vrs CBR
Jules
No ratings yet
Class Assignment
Document8 pages
Class Assignment
Dishant kumar yadav mhakhariya
No ratings yet
Decision Tree
Document4 pages
Decision Tree
Dishant kumar yadav mhakhariya
No ratings yet
Lecture 9
Document19 pages
Lecture 9
Dishant kumar yadav mhakhariya
No ratings yet
Lecture 13
Document17 pages
Lecture 13
Dishant kumar yadav mhakhariya
No ratings yet
Lecture 12
Document27 pages
Lecture 12
Dishant kumar yadav mhakhariya
No ratings yet
Lecture 15
Document28 pages
Lecture 15
Dishant kumar yadav mhakhariya
No ratings yet
Lecture 2 - Barriers To Communication
Document11 pages
Lecture 2 - Barriers To Communication
Dishant kumar yadav mhakhariya
No ratings yet
Lecture 14b
Document46 pages
Lecture 14b
Dishant kumar yadav mhakhariya
No ratings yet
Kalasalingam Academy of Research and Education (Deemed To Be University) Anand Nagar, Krishnankoil - 626126
Document60 pages
Kalasalingam Academy of Research and Education (Deemed To Be University) Anand Nagar, Krishnankoil - 626126
Dishant kumar yadav mhakhariya
No ratings yet
Lecture 3 - Non - Verbal Communication
Document26 pages
Lecture 3 - Non - Verbal Communication
Dishant kumar yadav mhakhariya
No ratings yet