Time Series Forecasting Complete Tutorial Part 1

This article provides a tutorial on time-series forecasting. It discusses basics like seasonality, trend and unexpected events. It also covers topics like rolling statistics, additive and multiplicative time series, and exponential smoothing. The document demonstrates these concepts with practical examples in Python, like using moving average and exponential smoothing methods on an electricity consumption dataset.

Uploaded by

Smruti Ranjan Nayak

Time-series Forecasting - Complete Tutorial | Part-1

BEGINNER, DATA SCIENCE, MACHINE LEARNING, PYTHON, TIME SERIES FORECASTING

This article was published as a part of the Data Science Blogathon

Introduction

A time series is a sequence of observations recorded at successive points in time. A simple example is
the temperature recorded day by day over a month. This tutorial explains what time-series data is, which
methods are used to forecast it, and what makes time-series data a special and complex topic in the
field of data science.

Table of Contents

Basics of Time-series Forecasting

Rolling statistics and stationarity in Time series
Additive and Multiplicative Time-series
Exponential Smoothing in Time Series
Practicals with Time-Series data
Exponential Smoothing Practicals
Time series decomposition and stationarity check
End Notes

Basics of Time-Series Forecasting

Time-series forecasting, in simple words, means predicting the future values of a quantity (e.g., a stock
price) over a period of time. There are different approaches to predicting such a value. For example,
suppose company XYZ records its website traffic every hour and now wants to forecast the total traffic
for the coming hour. How would you approach forecasting the upcoming hour's traffic?

Different people can have different perspectives. One might take the mean of all observations, another
the mean of the two most recent observations, another might give more weight to the current observation
and less to past ones, and another might use interpolation. In short, there are many methods to forecast
the values.
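The first three approaches listed above can be sketched in a few lines of plain Python. The hourly traffic numbers and the weights below are made up for illustration; they are assumptions, not data from this article.

```python
# Hypothetical hourly website traffic (illustrative numbers only).
traffic = [120, 135, 128, 150, 160]


def mean_of_all(values):
    """Forecast the next value as the mean of every observation."""
    return sum(values) / len(values)


def mean_of_recent(values, k=2):
    """Forecast the next value as the mean of the last k observations."""
    recent = values[-k:]
    return sum(recent) / len(recent)


def weighted_recent(values, weights=(0.2, 0.3, 0.5)):
    """Weighted mean of the last len(weights) observations.

    The weights are an assumption for this sketch; the last (largest)
    weight applies to the most recent observation.
    """
    recent = values[-len(weights):]
    return sum(w * v for w, v in zip(weights, recent)) / sum(weights)


forecasts = {
    "mean of all": mean_of_all(traffic),
    "mean of last two": mean_of_recent(traffic, k=2),
    "weighted recent": weighted_recent(traffic),
}
for name, value in forecasts.items():
    print(f"{name}: {value:.1f}")
```

Each rule gives a different forecast for the same history, which is why the choice of method matters.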

When forecasting time-series values, 3 important components need to be taken care of, and the main task
of time-series forecasting is to account for these three components.

1) Seasonality

Seasonality means that the values follow a pattern that repeats at fixed intervals: in a particular domain,
some months consistently show a peak compared with other months. For example, if you observe the data of
tours-and-travel companies over the past 3 years, you will see that the values are very high in November
and December because of the holiday and festival season. So while forecasting time-series data, we need
to capture this seasonality.

2) Trend

The trend is another important component. It describes whether the time series is increasing or
decreasing over the long run, for example whether the value of an organization or its sales is rising or
falling over a period of time, apart from the seasonal ups and downs.

3) Unexpected Events

Unexpected events are sudden changes in an organization or in the market that cannot be anticipated from
past data. An example is the pandemic at the time of writing: if you observe the Sensex or Nifty chart,
there is a huge fall in stock prices, which is an unexpected event of this kind.

There are methods and algorithms with which we can capture seasonality and trend, but unexpected events
occur dynamically, so capturing them is very difficult.

Rolling Statistics and Stationarity in Time-series

A stationary time series is one whose mean and variance stay constant over time. Suppose we take the mean
of T1 and T2 and compare it with the mean of T4 and T5: is it the same, and if not, how large is the
difference? A constant mean implies that this difference should be small, and the same holds for the variance.
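The comparison described above can be tried on two small made-up series. This is only an informal check under assumed numbers, not a statistical test: it splits each series in half and compares the mean and variance of the two halves.

```python
from statistics import mean, pvariance


def compare_halves(values):
    """Return (mean, variance) for the first and the second half of a series."""
    mid = len(values) // 2
    first, second = values[:mid], values[mid:]
    return (mean(first), pvariance(first)), (mean(second), pvariance(second))


# Illustrative series (made-up numbers).
flat = [10, 12, 9, 11, 10, 12, 9, 11]   # fluctuates around a fixed level
trending = [1, 2, 3, 4, 5, 6, 7, 8]     # the mean keeps increasing

for name, series in [("flat", flat), ("trending", trending)]:
    (m1, v1), (m2, v2) = compare_halves(series)
    print(f"{name}: mean {m1:.2f} -> {m2:.2f}, variance {v1:.2f} -> {v2:.2f}")
```

For the flat series both halves have the same mean, while for the trending series the mean shifts between the halves, which is the sign of a non-constant mean.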

If the time series is not stationary, we have to make it stationary and then proceed with modelling.
Rolling statistics help us here: a rolling (moving) average smooths the series, and plotting the rolling
mean and variance shows whether they drift over time. To calculate the moving average we need to define
the window size, which is basically how many past values are considered.

For example, with a window of 2, the moving average at point T1 is blank (there is no earlier value), at
point T2 it is the mean of T1 and T2, at point T3 it is the mean of T2 and T3, and so on. If you then plot
the calculated moving averages on top of the actual values, you will see that the moving-average line is
smoother.
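The window-of-2 calculation described above can be written out as a small plain-Python sketch. The five observations are made-up stand-ins for T1 to T5.

```python
def moving_average(values, window):
    """Trailing moving average; None where fewer than `window` values exist."""
    result = []
    for t in range(len(values)):
        if t + 1 < window:
            result.append(None)  # not enough history yet, so the entry is blank
        else:
            chunk = values[t + 1 - window : t + 1]
            result.append(sum(chunk) / window)
    return result


observations = [4, 8, 6, 10, 12]  # stand-ins for T1..T5 (made-up numbers)
print(moving_average(observations, window=2))
# prints [None, 6.0, 7.0, 8.0, 11.0]
```

A larger window averages over more history and gives a smoother line, at the cost of more blank entries at the start.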

This is one method that helps in working towards a stationary series. There are other methods as well,
such as exponential smoothing, which we study next.

Additive and Multiplicative Time series

In the real world, we meet different kinds of time-series data. To understand exponential smoothing, we
first need to study the two types of time series: additive and multiplicative. As we saw, there are
3 components we need to capture: Trend (T), Seasonality (S), and Irregularity (I).

An additive time series is the sum of trend, seasonality, and irregularity (Y = T + S + I), while a
multiplicative time series is the product of these three terms (Y = T * S * I).
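To see the practical difference, the sketch below builds one additive and one multiplicative series from the same made-up trend. All numbers are assumptions for illustration, and the irregular component is set to its neutral value (0 when adding, 1 when multiplying) to keep the arithmetic easy to follow.

```python
# Made-up components over 8 periods with a seasonal cycle of length 4.
trend = [10, 11, 12, 13, 14, 15, 16, 17]
additive_season = [2, -1, -2, 1] * 2              # added to the trend
multiplicative_season = [1.2, 0.9, 0.8, 1.1] * 2  # scales the trend
irregular_add = [0] * 8                           # neutral element for addition
irregular_mul = [1] * 8                           # neutral element for multiplication

additive = [t + s + i for t, s, i in zip(trend, additive_season, irregular_add)]
multiplicative = [
    t * s * i for t, s, i in zip(trend, multiplicative_season, irregular_mul)
]


def swing(series, start, cycle=4):
    """Peak minus trough within one seasonal cycle starting at `start`."""
    values = series[start : start + cycle]
    return max(values) - min(values)


print("additive swing:      ", swing(additive, 0), swing(additive, 4))
print("multiplicative swing:", swing(multiplicative, 0), swing(multiplicative, 4))
```

In the additive series the seasonal swing stays the same size as the level rises, while in the multiplicative series the swing grows with the level. Looking for this in a plot is a common informal way to choose between the two models.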

Time series Exponential Smoothing

Exponential smoothing computes a weighted moving average over all past values, with weights that decay
with age: the most recent observation gets the most weight and older observations get progressively
less, so that the prediction follows recent behaviour. The formula of exponential smoothing can be
defined as

$$y_t = \alpha x_t + (1 - \alpha) y_{t-1}$$

where x_t is the observation at time t and y_t is the smoothed value.

Alpha (α), a value between 0 and 1, is a hyperparameter that defines how much weight is given to the most
recent observation. This is known as simple exponential smoothing. But we also need to capture the trend
and seasonality components, so there is double exponential smoothing, which is used to capture the trend
component. It needs only a small modification of the equation above.

$$y_t = \alpha x_t + (1 - \alpha)(y_{t-1} + b_{t-1})$$

where the trend component is

$$b_t = \beta (y_t - y_{t-1}) + (1 - \beta) b_{t-1}$$

Here b_t is the smoothed trend estimate and beta (β) is its smoothing parameter. The trend update uses
the change between two consecutive smoothed values, y_t and y_{t-1}, blended with the trend estimate
from the previous step, so this equation gives us the trend factor.
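The simple and double exponential smoothing recursions can be sketched in plain Python as follows. The starting values (first observation as the initial level, first difference as the initial trend) are common but arbitrary choices assumed for this sketch; libraries such as statsmodels may initialise differently.

```python
def simple_exp_smoothing(x, alpha):
    """y_t = alpha * x_t + (1 - alpha) * y_{t-1}, started at y_0 = x_0."""
    y = [x[0]]
    for t in range(1, len(x)):
        y.append(alpha * x[t] + (1 - alpha) * y[-1])
    return y


def double_exp_smoothing(x, alpha, beta):
    """Level and trend recursions of double exponential smoothing.

    Assumed initial values: level = x_0 and trend = x_1 - x_0.
    Returns the list of levels and the list of trend estimates.
    """
    level, trend = x[0], x[1] - x[0]
    levels, trends = [level], [trend]
    for t in range(1, len(x)):
        new_level = alpha * x[t] + (1 - alpha) * (level + trend)
        trend = beta * (new_level - level) + (1 - beta) * trend
        level = new_level
        levels.append(level)
        trends.append(trend)
    return levels, trends


series = [1, 3, 5, 7, 9]  # made-up series with a constant slope of 2
print(simple_exp_smoothing(series, alpha=0.5))
print(double_exp_smoothing(series, alpha=0.5, beta=0.5))
```

On this steadily rising series the simple version lags behind the data, while the double version tracks it and keeps a trend estimate of 2, which is the reason for adding the trend term.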

If we need to capture both trend and seasonality, the method is known as triple exponential smoothing,
which adds a seasonal layer on top of double exponential smoothing.

$$y_t = \alpha \frac{x_t}{c_{t-L}} + (1 - \alpha)(y_{t-1} + b_{t-1})$$

where the seasonal component is

$$c_t = \gamma \frac{x_t}{y_t} + (1 - \gamma) c_{t-L}$$

Here L is the number of periods in one seasonal cycle, c_t is the seasonal factor, gamma (γ) is its
smoothing parameter, and the trend b_t is updated as in double exponential smoothing.

With these equations we capture the trend as well as the seasonality. Using smoothing we can decompose
our time-series data into its components, which makes it easier to work with. Working with real-world
time series is a complex task, so you have to adopt such methods to make the process smooth.
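A minimal plain-Python sketch of triple exponential smoothing with a multiplicative seasonal factor is given below. It assumes strictly positive data, an initial level equal to the mean of the first cycle, an initial trend of zero, and initial seasonal factors taken from the first cycle. These initial values and the function name are assumptions of this sketch, not a description of any library's method.

```python
def triple_exp_smoothing(x, season_length, alpha, beta, gamma):
    """One-step-ahead fitted values for x[season_length:].

    Multiplicative seasonality; x must be strictly positive.
    """
    L = season_length
    level = sum(x[:L]) / L                       # assumed initial level
    trend = 0.0                                  # assumed initial trend
    seasonal = [x[i] / level for i in range(L)]  # assumed initial factors
    fitted = []
    for t in range(L, len(x)):
        # forecast made before seeing x[t]
        fitted.append((level + trend) * seasonal[t - L])
        # level: de-seasonalised observation blended with the previous forecast
        new_level = alpha * (x[t] / seasonal[t - L]) + (1 - alpha) * (level + trend)
        trend = beta * (new_level - level) + (1 - beta) * trend
        level = new_level
        # seasonal factor for this position in the cycle
        seasonal.append(gamma * (x[t] / level) + (1 - gamma) * seasonal[t - L])
    return fitted


demand = [5, 10, 15] * 4  # made-up series: level 10 with a repeating pattern
print(triple_exp_smoothing(demand, season_length=3, alpha=0.3, beta=0.1, gamma=0.2))
```

For a perfectly periodic series like this one, the fitted values reproduce the observations after the first cycle, because the initial seasonal factors already match the pattern.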

Practicals with Time series forecasting

It's time to get our hands dirty by implementing the concepts we have learned so far. We will implement
the moving average and exponential smoothing methods and compare them with the original data.

Exponential smoothing practicals

The dataset we are using is electricity consumption time-series data, which you can easily find on Kaggle.

Step-1) Load the data first

import numpy as np  # linear algebra
import pandas as pd
import matplotlib.pyplot as plt
from statsmodels.tsa.api import ExponentialSmoothing, SimpleExpSmoothing, Holt
from pylab import rcParams

rcParams["figure.figsize"] = 20, 5
df = pd.read_csv("Electric_Production.csv", header=0, index_col=0)

# plot a slice of the series (rows 1 to 49) to keep the figure readable
plt.plot(df[1:50]["Value"])
plt.xticks(rotation=30)
plt.show()

Step-2) Moving Average method

We have seen how to calculate a moving average using a window. The same applies to our dataset: we build
the rolling window and take its mean. If we then plot the result, you can see how much smoother the graph
is than the original.

rollingseries = df[1:50].rolling(window=5)
rollingmean = rollingseries.mean()  # we can compute any statistical measure
# print(rollingmean.head(10))
rollingmean.plot(color="red")
plt.show()

Step-3) Simple Exponential Smoothing

As we have seen, simple exponential smoothing has a parameter known as alpha, which defines how much
weight we want to give to the most recent observation. We will fit 2 models, one with a low value of
alpha and one with a high value, and compare both.

data = df[1:50]
fit1 = SimpleExpSmoothing(data).fit(smoothing_level=0.2, optimized=False)
fit2 = SimpleExpSmoothing(data).fit(smoothing_level=0.8, optimized=False)

plt.figure(figsize=(18, 8))
plt.plot(df[1:50], marker='o', color="black")  # original data
plt.plot(fit1.fittedvalues, marker="o", color="b")  # alpha = 0.2
plt.plot(fit2.fittedvalues, marker="o", color="r")  # alpha = 0.8
plt.xticks(rotation="vertical")
plt.show()

Step-4) Holt method for exponential smoothing

Holt's method is a popular method for exponential smoothing and is also known as linear exponential
smoothing. It forecasts data that has a trend. It works on three separate equations (level, trend, and
forecast) that work together to generate the final forecast. Let us apply this to our data and see the
changes. In the first fit, we assume a linear trend in the data, and in the second fit, an exponential
trend.

fit1 = Holt(data).fit()  # linear trend
fit2 = Holt(data, exponential=True).fit()  # exponential trend

plt.plot(data, marker='o', color='black')  # original data
plt.plot(fit1.fittedvalues, marker='o', color='b')  # linear trend fit
plt.plot(fit2.fittedvalues, marker='o', color='r')  # exponential trend fit
plt.xticks(rotation="vertical")
plt.show()

You can observe that the blue plot, the linear-trend fit, does not follow the original plot well, whereas
the red plot is the exponential-trend fit. This is a simple smoothing with Holt's method; we can also add
parameters like alpha, a trend component, and a seasonality component.

Decomposition and stationarity check practicals

Now we will check which type of time-series data we have, whether it is additive or multiplicative.
We will use a different dataset from the one above, known as the drug sales data, which is available
for download.

Step-1) Load dataset

from statsmodels.tsa.seasonal import seasonal_decompose
from dateutil.parser import parse
import pandas as pd
import matplotlib.pyplot as plt

DrugSalesData = pd.read_csv('TimeSeries.csv', parse_dates=['Date'], index_col='Date')
DrugSalesData.reset_index(inplace=True)

plt.rcParams.update({'figure.figsize': (10, 6)})
plt.plot(DrugSalesData['Value'])
plt.show()

If you observe the resulting plot, you can see an upward trend in the data, but no obvious seasonality.

Step-2) Decomposition of time-series data

Now we will decompose the time-series data with both the additive and the multiplicative model and
visualize the seasonal and trend components that each one extracts.

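# Note: period=1 means no seasonal cycle is modelled. To extract seasonality,
# set period to the cycle length (e.g. 12 for monthly data with a yearly pattern).

# Additive Decomposition
add_result = seasonal_decompose(DrugSalesData['Value'], model='additive', period=1)

# Multiplicative Decomposition
mul_result = seasonal_decompose(DrugSalesData['Value'], model='multiplicative', period=1)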

We imported the seasonal_decompose function from statsmodels and called it with both the additive and
the multiplicative model. Now let us visualize the result of each model one by one. First, plot the
results of the additive decomposition.

add_result.plot().suptitle('\nAdditive Decompose', fontsize=12)
plt.show()

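If you observe the output you will get 4 plots: the observed series, the trend, the seasonality, and the
residual. We can see that a trend is present with both methods, while the seasonal component is zero.
Note that this is a direct consequence of period=1: with a cycle length of one there is no seasonal
pattern to estimate, so a flat seasonal component here does not by itself show that the data has no
seasonality.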
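The effect of the period can be illustrated with a small plain-Python sketch of a classical additive decomposition: a centered moving average for the trend, then per-position averages of the detrended values for the seasonality. It is written in the spirit of moving-average decomposition and is not claimed to reproduce the statsmodels implementation; the function names and the synthetic series are assumptions.

```python
def centered_moving_average(x, period):
    """Centered moving average; None where the window does not fit."""
    if period % 2 == 1:
        weights = [1.0 / period] * period
    else:
        # even period: half weight on the two end points of a period + 1 window
        weights = [0.5 / period] + [1.0 / period] * (period - 1) + [0.5 / period]
    half = len(weights) // 2
    trend = [None] * len(x)
    for t in range(half, len(x) - half):
        window = x[t - half : t + half + 1]
        trend[t] = sum(w * v for w, v in zip(weights, window))
    return trend


def additive_decompose(x, period):
    """Return (trend, seasonal, residual) lists for an additive model."""
    trend = centered_moving_average(x, period)
    detrended = [None if tr is None else v - tr for v, tr in zip(x, trend)]
    # average the detrended values at each position within the cycle
    means = []
    for pos in range(period):
        vals = [d for i, d in enumerate(detrended) if i % period == pos and d is not None]
        means.append(sum(vals) / len(vals))
    center = sum(means) / period  # make the seasonal effects sum to zero
    seasonal = [means[i % period] - center for i in range(len(x))]
    resid = [None if tr is None else v - tr - s for v, tr, s in zip(x, trend, seasonal)]
    return trend, seasonal, resid


pattern = [3, -1, -3, 1]                                # made-up seasonal pattern
series = [2 * t + pattern[t % 4] for t in range(24)]    # linear trend + pattern

_, seasonal_p1, _ = additive_decompose(series, period=1)
_, seasonal_p4, _ = additive_decompose(series, period=4)
print("period=1:", seasonal_p1[:4])
print("period=4:", seasonal_p4[:4])
```

With period=1 the trend is the series itself and the seasonal component is zero everywhere, while with period=4 the sketch recovers the repeating pattern that was built into the series.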

Now we also want to see the actual values of the trend and seasonality that were calculated. So we will
prepare a dataframe of four columns, with one column for each plot. Let us build it for the additive
model; you can try the multiplicative one in the same way.

new_df_add = pd.concat(
    [add_result.seasonal, add_result.trend, add_result.resid, add_result.observed],
    axis=1,
)
new_df_add.columns = ['seasonality', 'trend', 'residual', 'actual_values']
new_df_add.head()

Step-3) ADfuller test for stationarity

A stationary series has a constant mean and constant variance. The Augmented Dickey-Fuller (ADF) test,
adfuller in statsmodels, is a hypothesis test that tells us whether a time series is stationary. The null
hypothesis is that the time series is non-stationary. If the p-value is less than 5 percent, we reject
the null hypothesis and treat the series as stationary; otherwise we fail to reject the null
hypothesis.
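The decision rule above can be written as a tiny helper. The function name and the wording of the messages are made up for this sketch; only the 5 percent threshold comes from the text.

```python
def adf_decision(p_value, significance=0.05):
    """Apply the ADF decision rule to a p-value.

    Null hypothesis (H0): the series is non-stationary.
    """
    if p_value < significance:
        return "reject H0: the series can be treated as stationary"
    return "fail to reject H0: the series is treated as non-stationary"


for p in (0.01, 0.30):  # illustrative p-values, not results from the dataset
    print(f"p-value {p}: {adf_decision(p)}")
```

The p-value printed by the adfuller call can be passed to such a helper directly.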

from statsmodels.tsa.stattools import adfuller

adfuller_result = adfuller(DrugSalesData.Value.values, autolag='AIC')
print(f'ADF Statistic: {adfuller_result[0]}')
print(f'p-value: {adfuller_result[1]}')
print('Critical Values:')
for key, value in adfuller_result[4].items():
    print(f'   {key}, {value}')

The p-value is greater than 5 percent, which means the series is non-stationary. We should not build a
model on non-stationary data, so we have to make the time series stationary. There are different methods
to do this; these, along with autoregression, ACF, PACF, etc., are covered in the second part of this
article.

End Notes

We have seen what time-series data is and what makes time-series analysis a special and complex task in
machine learning. We also worked through practicals on how to start working with time-series data,
perform various analyses, and draw inferences from it. In the upcoming part, we will discuss various
methods to make a time series stationary, and we will also discuss classical time-series models like
ARIMA, SARIMA, etc.

I hope it was easy to follow till the end. I know time-series data is a little complex to handle, but I
hope that after going through this article you have some understanding of it and the confidence that you
can handle time-series data. If you have any queries, please post them in the comment section below.

About the Author

Raghav Agrawal

I am pursuing my bachelor's in computer science. I am very fond of data science and big data. I love to
work with data and learn new technologies. Please feel free to connect with me on LinkedIn.

If you like my article, please have a look at my other articles.

The media shown in this article are not owned by Analytics Vidhya and are used at the Author's
discretion.

Article Url - https://www.analyticsvidhya.com/blog/2021/07/time-series-forecasting-complete-tutorial-part-1/
