Professional Documents
Culture Documents
d’Intelligence Artificielle
ESIEA 3A 2019-2020
Mihir Sarkar
mihir@media.mit.edu
Objectifs du cours
# data manipulation
import pandas
# data visualization
import matplotlib.pyplot as plt
import seaborn
# configuration, don't pay attention
%matplotlib inline
plt.rcParams["figure.figsize"]=20, 10
import warnings
warnings.filterwarnings('ignore')
Import your data
data = pandas.read_csv('input/train.csv')
Import your data
data = pandas.read_csv('input/train.csv’)
data.head()
Import your data
data = pandas.read_csv('input/train.csv’)
data.head()
target = data['SalePrice’]
target.describe()
Import your data
data = pandas.read_csv('input/train.csv’)
data.head()
target = data['SalePrice’]
target.describe()
seaborn.distplot(target)
Import your data
data = pandas.read_csv('input/train.csv’)
data.head()
target = data['SalePrice’]
target.describe()
seaborn.distplot(target)
seaborn.distplot(data['YearBuilt'])
Bivariate analysis
The art of data visualization
seaborn.boxplot(x='YearBuilt', y="SalePrice", data=data)
seaborn.distplot(data['YearBuilt’])
features_data = data[features]
target_data = data[target]
my_first_model = LinearRegression()
my_first_model.fit(features_data, target_data)
Building your first model
Congratulations ! You just trained your first machine learning model !
Let's use it !