Professional Documents
Culture Documents
Institutional Affiliation
Course Code
Date
Forecasting Analysis for Beer
Executive Summary
Beer Here has been approached to sponsor a local event. The business case scenario in
this analysis involves forecasting the number of beer units to supply to the local music festival
event. The overall aim is to ensure there is sufficient beer at the festival, make the event a
success, and ensure the retailer is invited to sponsor the event again in the subsequent year.
Detailed data analysis and forecasting techniques are required to assist in understanding the
required supply data.
The data used for this forecasting was retrieved from the Lubbock Chamber of
Commerce for the period beginning 1st January 2001, to 20th December 2007. This was provided
in terms of weekly data on sales, units, and transactions in dollars. The available datasets indicate
the seasonality, magnitude of sales, and independent trends. The forecasting will use particular
models designed for particular datasets with their unique characteristics. The forecast will help
the management team in strategizing for the music festival to ensure effective inventory
management.
Description of Data
Plot chart for keystone light showing monthly seasonality per year
Table Showing Anticipated Forecasting Models
Pg. 29. Additive seasonality due to polynomial 3 being the highest R. values in different seasons
vary in constant amounts
The table below shows the actual Value Vs forecasted Values per year
Autocorrelation Charts above are statistical tools that are used to analyze the correlation
between a time series and the lagged values.
The Chart comprises of lag that is plotted on the x-axis and the autocorrelation coefficient
plotted on the y-axis. The autocorrelation coefficient helps to measure the correlation between a
time series and the lagged values. When determining the level of correlation, value of 1 shows a
perfect positive correlation, 0 shows no correlation, and -1 indicates a perfect negative
correlation.
The autocorrelation plot helps in pattern identification in a time series analysis, such as
seasonality or trends, by indicating the strength and significance of the correlation between a
time series and its past values. It also helps to determine the best lag length for time series
models, such as ARIMA models.
Partial Autocorrelation Charts
The partial autocorrelation chart above is also known as PACF plot. It is a statistical tool
that helps in analyzing the correlation between a time series and its lagged values while
controlling the effects of the intermediate lags.
Because the coefficient of AR(1) is not close to or equal to 1, we can conclude there are
systematic components in the data set that are predictable and therefore can use our best model
(Holts Winter Multiplicative) to predict future values.
The following table compares the past year's unit sales (also representing a Naive seasonal
forecast benchmark) vs our Holts Winters Multiplicative model to project next year's unit sales.
The Results of the Analysis shows that there is an expectation of 5.74% annual growth in
Keystone Light unit sales.
Based on the model analysis, I recommend that Holts Winters Multiplicative model
should be used to forecast the yearly sales, in order to make the management of the inventory
more effective, efficient and to get reliable insight for better decision making. I also recommend
that external factors that may have an effect on the sales should be monitored, these external
factors will help in interpreting and answering the results of the model.
Technical Summary
The aim of the project was to forecast the sales based on historical sales data. The dataset
includes weekly sales data recorded for a period of seven years, that is from January 2001 to
December 2007.
Data Preparation
The data was cleaned by removing the missing values, outliers, and seasonality. Seasonal
Decomposition of Time Series (STL) method was used to decompose the time series into its
trend, seasonal, and residual components. To stabilize the variance of the time series, the Box-
Cox transformation method was used.
Forecasting Methods
Several forecasting methods were used, namely, Naïve Seasonal model, Non Naïve
Seasonal, LSTM neural network model, Holt Winter Multiplicative, Holt Winter Additive
Moving Average etc.
ARIIMA Model
Autoregressive integrated moving average (ARIMA) model was fitted to the
preprocessed data. The ARIMA model was validated on a holdout dataset using the root mean
squared error (RMSE) as the evaluation metric.
Conclusion
The Holt winters model achieved the best performance based on forecasting, accuracy
and efficiency in computational, while the LSTM model showed good performance in capturing
the nonlinear patterns and dependencies in the sales data. The results of the study help in
drawing valuable insights which can be used to make informed decision and allocation of
resources for the retail store chains.