You are on page 1of 8

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/327225380

An Intelligent Approach to Demand Forecasting

Conference Paper · October 2017

CITATION READS
1 11,948

5 authors, including:

Teja Silaparasetti Nimai Chand Das Adhikari


National Institute of Technology Rourkela Microsoft
3 PUBLICATIONS   6 CITATIONS    13 PUBLICATIONS   69 CITATIONS   

SEE PROFILE SEE PROFILE

Rajat Garg

4 PUBLICATIONS   20 CITATIONS   

SEE PROFILE

Some of the authors of this publication are also working on these related projects:

Time Series Forecasting View project

Extreme Learning Machines View project

All content following this page was uploaded by Teja Silaparasetti on 25 August 2018.

The user has requested enhancement of the downloaded file.


Proceedings of the 2nd International Conference on Inventive Computation Technologies ( ICICT 2017)
IEEE Xplore Compliant - Part Number:CFP17K52-ART, ISBN:978-1-5090-6697-1

An Intelligent Approach to Demand Forecasting


Nimai Chand Das Adhikari∗ , Nishanth Domakonda∗ , Chinmaya Chandan∗
Gaurav Gupta∗ , Rajat Garg∗ , S Teja∗ , Lalit Das∗ , Dr. Ashutosh Misra†
∗ Advanced Analytics Team, SS Supply Chain Solutions Pvt. Ltd., Bangalore
† Enterprise Information Management, Philips Lighting, Bangalore

Abstract—Demand Forecasting, undeniably, is the single most and Operations Planning (S&OP) is an integration process
important component of any organizations Supply Chain. It used in business organization to ensure efficient coordina-
determines the estimated demand for the future and sets the level tion among cross functional units to align company strategy
of preparedness that is required on the supply side to match the
demand. It goes without saying that if an organization doesnt get with Supply Chain planning. Globalization, new opportunities
its forecasting accurate to a reasonable level, the whole supply and Supply Chain Management differentiation compel the
chain gets affected. Understandably, Over/Under forecasting has organization even struggle further as the integration process
deteriorating impact on any organizations Supply Chain and among suppliers, market, and stakeholders become complex
thereby on P and L. Having ascertained the importance of De- and data intensive. Various forecasting models are used to
mand Forecasting, it is only fair to discuss about the forecasting
techniques which are used to predict the future values of demand. predict what the demands on the system will be in the future
The input that goes in and the modeling engine which it goes so that appropriate designs and operating plans can be devised.
through are equally important in generating the correct forecasts Ironically, the basic premise behind generating forecasts is
and determining the Forecast Accuracy. Here, we present a very that they are mostly wrong. Forecasts must include analysis of
unique model that not only pre-processes the input data, but their potential errors. The longer the forecast horizon, the less
also ensembles the output of two parallel advanced forecasting
engines which uses state-of-the-art Machine Learning algorithms accurate the forecast will generally be. Benefits can often be
and Time-Series algorithms to generate future forecasts. Our obtained from transforming a forecasted quantity into a known
technique uses data-driven statistical techniques to clean the data quantity. The information systems play a key role mainly in
of any potential errors or outliers and impute missing values if operative planning and management. It is the requirement
any. Once the forecast is generated, it is post processed with of almost all the comapanies to start run their business
Seasonality and Trend corrections, if required.Since the final
forecast is the result of statistically pre-validated ensemble of without having relevant information from ERP (Enterprise
multiple models, the forecasts are stable and accuracy variation Resource Planning) systems. In consequence of more efficient
is very minimal across periods and forecast horizons. Hence it information use through ERP, the partial planning methods
is better at estimating the future demand than the conventional as MRP II (Manufacturing Resource Planning), Sales and
techniques. Operations Planning (S&OP) or APS (Advanced Planning
Index Terms—Supply Chain, Demand Forecasting, Time Se- and Scheduling) are developed. To maximize the effect of
ries, Machine Learning, Ensemble, Optimization, Cognitive Anal- accessible methods for any internal company processes, the
ysis, Outliers
company must be build on an objective and evaluated demand
forecasts. The choice of optimum forecasting procedures and
I. I NTRODUCTION following use of obtained forecasts may become a competitive
Demand Forecasting is the key activity which more or less advantage. Together with other modern methods it accelerates
controls all other activities of Supply Chain Management. It other company processes, reduces the costs and increases
is the key driving factor in planning and decision making in the value for the customer. The demand forecast determines
Supply Chain Management as well as Enterprise level. All ma- the volume of products, place and time horizon in which
jor firms truly depend on the efficiency of demand forecasting they will be needed. In relation with the demand forecast
to take major decisions such as capacity building, resource it is necessary to deal not only with the quantitative aspect
allocation, expansion and forward or backward integration, of the needs (the volume demanded by customers) but also
etc. Forecasting is a prediction or an estimation of an actual their qualitative aspect (the type of customer’s needs). The
value in a future time period .It is usual that there might be accurate demand forecast is thus important for the produc-
forecast error as actual results differ from the projected value, tion and distribution management but also for e. g. areas
long time horizon has chance of more error. This is important of marketing (distribution of sales forces, communication,
to measure the forecast error for adapting corrective action promotion and planning of new products), finance (current
plan. The development of advanced technology enables the need of money, budgets and calculations), investment designs
Supply Chain Management stakeholders to share real time data (production facilities, workshops and warehouses), research
and information across the network which helps dual benefit and development (innovations) and human resources (structure
to inventory and customer service. This underlying result of and labour force volume planning, training). Demand planning
the process is accuracy of forecast which firmly ensures the represents a set of methodologies and information technologies
successful and sustainable business operations. This is called for the use of demand forecasts in the process of planning.
Collaborative Planning, Forecast, and Replenishment. Sales The aim is to accelerate the flow of raw materials, materials
978-1-5090-6697-1/17/$31.00 ©2017 IEEE 1105
Proceedings of the 2nd International Conference on Inventive Computation Technologies ( ICICT 2017)
IEEE Xplore Compliant - Part Number:CFP17K52-ART, ISBN:978-1-5090-6697-1

and services beginning with the suppliers through transforming • Spinning:manipulating forecasts to obtain the most fa-
to products in the company and to their distribution to their vorable reaction from individuals or departments in the
final consumers. The demand planning process is done to help organization
the business understand its profit potential. Indirectly it sets Demand plans should reflect what we really think what we
the stage for capacity, financing, and stakeholder confidence will sell in the future.
. The implementation of the demand planning enables to
determine the closest possible forecast to the planning horizon
B. Statistical Forecast:
and decide the volume of production, stock and sources
capacity distribution among particular products to maximize The ambition of the business in using the statistical
the profits of the whole company. The key requirement for forecast is to minimize the sales planner work load and
efficient company management is sharing the mutual forecast. in the meantime give a good forecast that will benefit the
However, the research carried in production companies showed organization. Statistical forecasting might not be suitable
individual departments of the company in some cases draw up for every product, but it is proven successful for products
forecasts on their own and thus they base their planning on with a fairly stable sales history and a large number of
different figures. This provokes conflicts among the resulting lighting products fit this description. Statistical forecasting
activities of in-company plans . This kind of situation happens significantly reduces the number of products that will have
in case when the company prefers to approve the financial plan to be touched by every individual sales planner every
which doesn’t correspond with the updated forecast results. week/month.
The forecasting should always be the process which is essen-
tial and determining for other company processes, including • Data pre-processing: Outlier alerts are used as input for
financial planning. The financial plan often represents the the stat forecast manager to do history cleaning.
main motivation source for the company managers as this All individual alerts will be listed, which means that if a
reflects the requirements of the company top management and certain product has more than one outlier, all these events
company’s strategic goals. will be specified. A value is considered an outlier if:
A variety of modeling techniques are available for produc- Value is higher than average +3 ∗ standarddeviation
ing forecasts. Based on data patterns, forecasting horizon, data Value is lower than average 3 ∗ standarddeviation
availability and business requirements the choice of technique • Algorithms: Based on the chosen statistical model, the
differs [20]. Through appropriate combining of the forecasting existing business model will calculate a forecast. Avail-
techniques it is possible to estimate quantitative influences able statistical forecast models are listed as:
of the identified factors and set the demand forecast [14]. 1) Moving Average: The moving average model is used
The most frequently used statistical forecasting method is the to exclude irregularities in the history pattern. The
time series technique. It uses historical data sequenced by model calculates the average of all historical values.
time and projects future demand by the same time sequence All values are weighted equally. It does not take
[24]. POS data is rich in information for building forecast leading zero-values into account.
models. Building a good forecasting model with POS data is 2) SES Model:The SES model will also calculate an
demonstrated in many case studies It is important to realize average, but instead of weighting all historical val-
that demand planning used in practice is not a mere creation ues equally, the SES model will give more weight
of a perfect system to carry out the demand and sales forecast. to the more recent history.Within the SES model
The objective of forecasting is to predict demand whilst the the smoothing factor (alpha) will determine how
aim of demand planners is to shape the demand and produce quickly the forecast reacts to a change in the history
a resource requirements plan [21]. Without the right forecast - pattern. The higher the alpha factor, the more weight
planning system integration it is not possible to use efficiently will be assigned to the most recent history, the
the information provided in the forecasts. quicker the forecast reacts to the recent history.
3) Croston Model: The Croston model is used for
II. E XISTING P ROCEDURE :
products with intermittent demand. It consists of
A. Demand Planning Process: exponential smoothing calculation, together with an
The objective of Demand planning is to make reliable average interval between demands. These 2 inputs
demand plans as base for the S&OP process. Game playing, will then be used in a form of a constant model to
the intentional manipulation of the forecasting process to gain predict future demand.In case no zero values exist in
personal, group, or corporate advantage, is clearly not allowed. the history pattern, Croston model will have exactly
Examples of such games: the same result as the SES model (in case of an
• Sandbagging:underestimating sales in order to set expec- equal alpha factor )
tations lower than the demand actually anticipated 4) Seasonal Linear Regression: Seasonal linear re-
• Hedging:overestimating sales in order to secure addi- gression will automatically determine trend values
tional product or production capacity and seasonal indices. The system will first try to
• Enforcing:maintaining a higher forecast than the antic- find a seasonal pattern. In case the system cannot
ipated sales, in order to keep forecasts in line with the find a seasonal pattern, it will calculate a forecast
organizations sales or financial goals using linear regression.In case it can find a seasonal
978-1-5090-6697-1/17/$31.00 ©2017 IEEE 1106
Proceedings of the 2nd International Conference on Inventive Computation Technologies ( ICICT 2017)
IEEE Xplore Compliant - Part Number:CFP17K52-ART, ISBN:978-1-5090-6697-1

pattern, it will first correct the historical values using C. Data Shaping
calculated seasonal indices. A linear regression will The Data should be aggregated or dis-aggregated to the
calculate a forecast using these corrected historical level in which the forecast should be generated. This helps
values. Last step is to apply the seasonal indices to in making the generalization capability of the forecasting to a
the calculated linear regression. better level. The cost function for the time-series forecasting
5) Double exponential smoothing: Double exponential in this domain differs for different organization.
smoothing uses the same technique as the single Some common cost functions used are:
exponential smoothing, but will also apply a trend. 1. PN P3
As in the single exponential smoothing, an alpha i=1 | j=1 (yp − ya )|
factor will determine the weight to determine how Accuracy = 1 − P3
quickly the forecast reacts to a change in recent j=1 yp

history. The beta will determine how quickly the 2. PN P


forecast picks up a trend in history. i=1| (yp − ya )|
Accuracy = 1 − P3
• Forecast Corrections: j=1 ya
1) A manually maintained seasonality can be applied 3. PN P
to SES and moving average models. The system i=1| (yp − ya )|
Accuracy = 1 −
will first de-seasonalize the history using seasonality P3
j=1 yp
factors as put in by the regional planning manager.
It will then calculate a forecast using SES or moving where N is the total SKU’s present for a market and yp
average. The result of the forecast will be re- and ya are the predicted forecast for a month and actual for
seasonalized again using the seasonality factors. that month respectively. j varies for the forecasted done for
2) A workday correction will be applied to all stat the consecutive months.The first equation is to evaluate the
forecasts that do not use the manual seasonality or accuracy in the three months of forecast against the actuals
statistical forecasts that are using SLR. and the last two for evalating the accuracy for one month.
3) A manually maintained trend can be applied to SES
and moving average models. After calculating a D. Data Quality Assurance
forecast using SES or moving average, it will apply 1) Once the CBH data with Links is prepared as per
the trend values to the result. markets requirement, the CBH data is now required to
be analyzed and the quality of the data is to be ensured
III. D EMAND F ORECASTING P ROCEDURE before it is used for forecasting demand. The demand
The following set of procedures describes the demand fore- forecasting for each market is done from different stand
casting process, quality control and exceptions management points, as per market needs. Some of the examples are
process for forecasts. sales region wise, SKU wise, Channel level and material
level.
2) It is mandate to validate the data sets used for the
A. Extraction of Corrected Baseline History (CBH) with Links
purpose of generating Baseline Statistics Forecast. If
1) The Data in the real world has the life-cycle which some garbage data is put into the model the result will
varies a lot. The correct linking to the new product be garbage in itself.
introduction, sister SKU and de-clining SKU is very 3) The Data complete description is made which makes it
important in the forecasting methodology. important and easy to validate the data which input and
2) The Data which the model forecasts is CBH (Corrected output from the model.
Baseline History), which has the correct linking with 4) The Data needs to be validated by the market demand
the sister SKU or correction of the sales taking out the planners to find if the forecast follows the trend, pattern
effect of the external driving factors like promotions, and level along with the actual. In case of any discrep-
stockouts etc. If the corrections of the data is not done ancy, the forecast will be modified with the intelligence
or maintained, then the model will correct the data as of the market.
required using the outliers correction and missing value
imputation. E. Models used for generating the forecast:
Once the Data Quality is assured then the data is uploaded
B. Validation: in the forecasting model prepared on R and Python.
The first step of the validation should be done by checking 1) Outliers Detection, Data Treatment and Data Pre-
for the availability of the lowest Characteristics Vector Code processing:: The most important part for any data analysis
(CVC) elements. For each Market, comparison of the sum for is the data-pressesing. If there are ambiguities in the data,
last three months with that of the Month-wise Sum for the the forecast will also be ambiguous and un-useful. Outlier
data downloaded in the previous month should be done. This detection and removal or correction of the data are a part of
is done to have a validation on the actual which is downloaded the model. [23] [1] [4] Each Market requires the forecast at
is matching the figures that is going into the model for forecast. different level.
978-1-5090-6697-1/17/$31.00 ©2017 IEEE 1107
Proceedings of the 2nd International Conference on Inventive Computation Technologies ( ICICT 2017)
IEEE Xplore Compliant - Part Number:CFP17K52-ART, ISBN:978-1-5090-6697-1

2) Model Selection:: The forecast done for each SKU, is as the outlier and is either brought down/up to
done using the time series and machine learning algorithms. the mean/median based on maximizing the cost
Then each of the SKU is matched with the results given by function of the organization. This basically means
time-series , machine learning and an model with the ensemble that the each SKU will have a different set of the
of the outputs of time-series and machine learning. outlier treatment. The missing value is nothing but
1) Time-Series Algorithms: The actual data is fed into to findout whether the ”sales zero” in a particular
the model consisting of the time-series algorithms. For data is actually the case or not. If the zero sales
different SKU’s different algorithms are selected based is not possible for the SKU, then the missing
on the mse in the validation or testing data. Before value is imputed with mean of the actual data. For
passing the data into the model, it is pre-processed. [2] getting the seasonal pattern of the data, the data is
tranformed into the lags of the months for finding
• Data Pre-processing: The actual sales do ave great
a relationship between the current actual sales. The
impact with the promotions,stockouts and dumping
problem of forecasting is a regression problem. The
etc.These behave as the outlier in the system and
regressors are the lags of the actual data. Here to
need to be removed or treated. The values which
find a seasonality pattern in the data, each yearly
are more than 40% or less than 10% for a particular
data is taken as a lag. The two important constraint
month in a range of 12-month are replaced with
are taken into the consideration:
the mean of the actual sales data or if the sales
for a particular month is above the particular limit – Time Constraint: It takes a lot of time to find the
Inter Quartile Range is treated and imputed with correct hyper-parameter settings.
the accepted sales for that month. Particular limit – Sesonality Pattern: It was found that if the lag of
varies SKU to SKU. Also, the sparse SKU i.e the 12 for monthly data and 53 for weekly data was
actual data containing more number of zeros are taken into the consideration, the accuracy [cost
either passed into croston model [17] [5] else the function] increased.
zero sales data in a month is replaced with mean of • Algorithms: There is a lot of constraints for the
the data. forecasting using machine learning.
• Algorithms: The algorithms which are used in – The hyper-parameters for each SKU for different
the time-series algorithms are Weighted moving algorithms will be different all the time. This
Average, Simple moving average, Autoregressive takes a lot of time to optimized to the required
integrated moving average (ARIMA) and Holts parameters settings.
Winter [22] [19] [9] [7] [25]. For Weighted Moving – Seasonality capturing is very less when used
Average 0.5, 0.3 and 0.2 weights are given starting machine learning algorithms.
from the most recent observation to the 3rd last
The algorithms that are used are Support Vector
observation whereas equal weights are assign to the
Regression [11] [13] [8], Decision Trees Regression
last n (no of dependent variables) observations to
[10], Linear Regression, Ridge Regression [6] and
find out the new forecast. On the basis of residuals
Random Forest Regression [16]. Random Forest
we are finding out the value of n. To find out the
Regression is used for the selection of important
order of the Arima model we have used Arima
features out of the lag training set [12]. Other feature
optimization function ML and CSS and then using
selection techniques can be used which is the future
residuals RMSE and MSE.To find out the value of
implementation and research for our work. As the
α, β and γ of Holt’s Winter, the optimization func-
dataset is a time series dataset, the sequence of
tions L-BFGS-B and Nelder-Mead and for finding
the data or the training examples is important. The
the residuals RMSE and MSE is used. In case the
training and testing sets are divided into the 70,30
model gives 0 in the middle of the forecast [18]
% using the hold-out method. For the optimizing
value is replaced with the mean of the whole vector.
the hyper-parameters, Grid Search and Random
In the validation set, the algorithm which gives the
search methods are used. Bayesian Search and other
minimum mse is selected for the future forecast [3].
evolutionary optimization search methods are the
2) Regression Based Algorithms: There is a bit of differ- future work. The best algorithm which results in
ence in the data that is fed into the models containing the minimum mse in the validation set is selected
machine learning regression algorithms. This regression for the future prediction of the sku. Out of all the
based model contains the following steps to be done to algorithms the one which gives the minimum mse is
take care of the forecasting of sales data. matched again with the ensemble of the algorithms.
• Data Processing: The sales of a particular period Ensemble is done with the averaging of the error in
(day, week, month or year) is matched with the ac- the validation set.
tual data and the values or the sales which contribute
to more than 40% or less than 10% of the total sales model = [algorithm1 , algorithm2 , ..., algorithmn ]
for 7 days(if the data is weekly), 53 weeks(data is
weekly) or 12 months(data is monthly) is treated errori = |yp − ya |
978-1-5090-6697-1/17/$31.00 ©2017 IEEE 1108
Proceedings of the 2nd International Conference on Inventive Computation Technologies ( ICICT 2017)
IEEE Xplore Compliant - Part Number:CFP17K52-ART, ISBN:978-1-5090-6697-1

where yp is the predicted value and ya is actual. used averaging of the time series and regression based
Also,the weights assigned to each model/algorithm model results to give an ensemble forecast which brings
[algoritmi ] is given by: the deviation near to zero or minimum of the deviation
1 of time series output and regression based model output.
weighti = Pnerrori1 Hence, this more enhances the generalization capability
i=1 errori of the model. The weights that is calculated for the two
where i(1, n), n varies in the model set. kinds of models is by calculating the deviation from the
• Future Forecast Generation:Once the forecast is validation period of the data.
done, then the forecast results are validated and
different methodology are taken into the account to model = [modeltime−series , modelmachine−learning ]
see which all process can be taken into the account
for improving the accuracy. The comparison is
1
required to be done at the level at which the forecast
is generated. Identify the Models[algorithm with weighti = Pkdeviation1 i
i=1 deviationi
hyper-parameter settings] producing SKU wise best
results and separate the SKUs as per the models
here, k varies in the model dataset. The model that is
giving best results consistently in the validation
selected in case of the ensembling is through the greedy-
phase. Once the Model for specific variable is
selection. Here the algorithms that gives better results
identified, initiate the run to forecast for future 18
consistently is selected to used in the ensembling of the
months. Seasonality factor is applied to the model
results [15].
if required. This depends on the pattern in which
4) Market Intelligence or Cognitive Approach: After the
the result is produced.
results are generated, the forecasts are plotted against the
history actual and level, trend and seasonality pattern of
the SKU is observed. If the forecast for the SKU doesn’t
Algorithm for the Proposed Model:
follow any pattern with the history sales, it is changed
or tweaked a bit. The following are some of the market
– Input the Data intelligent inputs that are implemented in the model:
– for sku 1:N, where N = total number of SKU for
• Zero Forecast: If the actual sales for a SKU is zero
a market
for the consecutive 2 months, its converted into zero
∗ Data Preprocess: Outlier treatment, Missing forecast for the forecasting periods. This case will
value treatment not be valid for the SKU having sparse data.
∗ Dataset Creation: Convert the Actual data to • Trend Imbalance: If the trend of the forecast doesn’t
12-lag dataset follow the trend of the history of the sales, the trend
∗ Train, Test subset: Holdout method is used and is corrected. If the mean of the initial forecast value
time sequence for the data is maintained [As is 4 times or 0.25 time of the mean of the last 6
the data is time series] months then the forecast value is replaced with the
∗ Feature Selection: Transform the dataset to the last 3 months weighted moving average.
most correlated dataset • Level Correction: There will be some SKU’s which
∗ Training the Algorithms: Pass the Data through are going to have a change in the level due to some
different algorithms. promotional impacts and other external conditions.
∗ Algorithm Selection: Based on the MSE in the This information which is externally present is input
validation/test period the model which results to the model and the level is shifted for the future
in the minimum error is selected. forecast for that many amount of periods which the
∗ Future Prediction: Generate the future forecast market Demand Planner have informed.
3) Ensemble of the Algorithms: The results/forecast gen- • Seasonality Correction: Some of the months will
erated by the time-series and regression based algorithms have lower peaks or depths in the forecast data than
can either be over-forecasting or under-forecasting. To expected. This can be externally corrected before
achieve higher accuracy 100%, the forecast should be the forecast is submitted.
very much near to the actual sales or the error in the • PIPO:Phase-In and Phase-Out SKU’s are clustered
forecasting should minimize. Here, the term that defines and the forecast is changed in that manner. The
the error is deviation. Phase-In products are basically the NPI or New-
Xk Products Introduced and will have a higher or lower
deviation = |yp − ya | sales which are difficult to predict. In case of those
j=1 SKU’s having only one data point, are either used
where k is the time period for the calculation of the Naive Forecast or done the same result as the fore-
deviation. Thus, if the deviation is minimum, the gen- casting output. Mostly Market Demand Planner’s
eralization capability of the model increases. Here, we intelligence helps a lot to take care of this.
978-1-5090-6697-1/17/$31.00 ©2017 IEEE 1109
Proceedings of the 2nd International Conference on Inventive Computation Technologies ( ICICT 2017)
IEEE Xplore Compliant - Part Number:CFP17K52-ART, ISBN:978-1-5090-6697-1

IV. R ESULTS has been achieved by Incumbent Business Model is 49%. In


The model comprising of the methods discussed above is checking the market-wise accuracy comparison, our model is
used to generate the forecast for the different markets of better than Incumbent Business Model in 5 markets out of
Philips and the results are compared with the tool which 7 markets. This comparison is the overall comparison of the
is mostly used in the forecasting for different organization, total SKU’s for each market in the month of May-2017.
Incumbent Business Model(I-B-M). For generating the fore-
C. Forecast Comparison for the month June-2017 using the
casts two different platforms are used R (Version 1.0.136) and
actual data till April-2017
Python (Version 2.7). The results are generated for different
markets and for different consecutive months to check the Market-Name total SKU’s Model FACC I-B-M
validity and the stability of the models. Two different termi- Market-1 3480 55% 38%
nology which are used are ”t-1” and ”t-5” forecast accuracy. Market-2 1391 39% 33%
”t-1” is the accuracy generated after two months of the current Market-3 2685 48% 41%
month the forecast is generated and ”t-5” is after 5 months of Market-4 480 58% 68%
the current month. Market-5 3262 58% 59%
Dataset Description:This data set contains the information Market-6 3028 49% 50%
of 7 markets. Due to confidentiality, actual names of the Market-7 2549 54% 56%
markets are masked with market − ”no.”. In this section,
first the comparison of the I-B-M is done with the developed Total number of SKU’s that has been forecasted is 16875.
model with different statistical measures followed by the Using the model developed by us, the overall average
result improvement comparison of ensemble method over the accuracy that has been achieved is 51.57% and the overall
Regression based model and time-series methods. accuracy that has been achieved by Incumbent Business
Model is 49.28%. This comparison is the overall comparison
A. Forecast Comparison for the month April-2017 using the of the total SKU’s for each market in the month of June-2017.
actual data till February-2017 If we check the above two table, we can find that our model
is consistently beating Incumbent Business Model by 3%.
Market-Name total SKU’s Model FACC I-B-M
Market-1 3256 53% 43% If we take the above three results, our model is stable
Market-2 982 36% 28% and giving good results. In case of the April-2017, the results
Market-3 1806 29% 31% of our model is comparible to that of the Incumbent Business
Market-4 923 64% 62% Model.
Market-5 1078 30% 51.1%
Market-6 2620 48% 48% Below are the results for the comparison of the individual
Market-7 2301 49% 50% models for different markets.

Total number of SKU’s that has been forecasted is 12966. D. Forecast Comparison for different algorithms in the model
Using the model developed by us, the overall average accuracy for different markets:
that has been achieved is 44.14% and the overall accuracy that Market-1:
has been achieved by Incumbent Business Model is 44.72%.
In checking the market-wise accuracy comparison, our model Month Time Series Regression Models Ensemble
is equivalent to Incumbent Business Model. This comparison
is the overall comparison of the total SKU’s for each market April’17 66% 63% 66%
in the month of April-2017. May’17 68% 64% 69%
June’17 65% 63% 66%
B. Forecast Comparison for the month May-2017 using the Market-5:
actual data till March-2017
Market-Name total SKU’s Model FACC I-B-M Month Time Series Regression Models Ensemble
Market-1 2999 54% 45% April’17 60% 58% 62%
Market-2 1164 49% 49% May’17 62% 59% 64%
Market-3 2007 49% 39% June’17 60% 59% 62%
Market-4 1446 56% 58%
Market-5 3265 58% 56% Market-3:
Market-6 3026 47% 43%
Market-7 2515 51% 53% Month Time Series Regression Models Ensemble

Total number of SKU’s that has been forecasted is 16422. April’17 53% 48% 53%
Using the model developed by us, the overall average accuracy May’17 45% 46% 49%
that has been achieved is 52% and the overall accuracy that June’17 49% 47% 51%

978-1-5090-6697-1/17/$31.00 ©2017 IEEE 1110


Proceedings of the 2nd International Conference on Inventive Computation Technologies ( ICICT 2017)
IEEE Xplore Compliant - Part Number:CFP17K52-ART, ISBN:978-1-5090-6697-1

Market-7: [2] Ke-Lin Du and MNS Swamy. Fundamentals of machine learning. In


Neural Networks and Statistical Learning, pages 15–65. Springer, 2014.
[3] Peter Filzmoser, Bettina Liebmann, and Kurt Varmuza. Repeated double
Month Time Series Regression Models Ensemble cross validation. Journal of Chemometrics, 23(4):160–171, 2009.
[4] Anthony J Fox. Outliers in time series. Journal of the Royal Statistical
April’17 49% 53% 54% Society. Series B (Methodological), pages 350–363, 1972.
May’17 41% 38% 42% [5] Adel A Ghobbar and Chris H Friend. Evaluation of forecasting methods
June’17 55% 50% 55% for intermittent parts demand in the field of aviation: a predictive model.
Computers & Operations Research, 30(14):2097–2114, 2003.
The above table tells about the comparison of the different [6] Gene H Golub, Michael Heath, and Grace Wahba. Generalized cross-
algorithms that we are using in out model. We can see that validation as a method for choosing a good ridge parameter. Techno-
metrics, 21(2):215–223, 1979.
the ensemble of the results produces better results than the
[7] Clive WJ Granger and Roselyne Joyeux. An introduction to long-
individual algorithms in consecutive three months of April’17, memory time series models and fractional differencing. Journal of time
May’17 and June’17. The weights for the ensemble is found series analysis, 1(1):15–29, 1980.
out from the previous months and the average weight is [8] Wenwu He, Zhizhong Wang, and Hui Jiang. Model optimizing and
assigned for the final month. feature selecting for support vector regression in time series forecasting.
Neurocomputing, 72(1):600–611, 2008.
[9] Michael Hibon and Spyros Makridakis. Arma models and the box–
V. C ONCLUSION jenkins methodology. 1997.
The model which we have presented has all the state-of- [10] Robert K Lai, Chin-Yuan Fan, Wei-Hsiu Huang, and Pei-Chann Chang.
art statistical methods used in the demand forecasting fields. Evolving and clustering fuzzy decision tree for financial time series data
forecasting. Expert Systems with Applications, 36(2):3761–3773, 2009.
In the results above we see that ensemble of the results of
[11] Chi-Jie Lu, Tian-Shyug Lee, and Chih-Chou Chiu. Financial time series
time-series model and regression based model gives a better forecasting using independent component analysis and support vector
result due to the fact of nullifying the over-forecasting and regression. Decision Support Systems, 47(2):115–125, 2009.
under-forecasting and bringing the forecast values near to the [12] Bjoern H Menze, B Michael Kelm, Ralf Masuch, Uwe Himmelreich,
actual. These results are far better than considering individual Peter Bachert, Wolfgang Petrich, and Fred A Hamprecht. A comparison
of random forest and its gini importance with standard chemometric
algorithms used in the two models. methods for the feature selection and classification of spectral data. BMC
Accurate forecast are very important for the demand planning bioinformatics, 10(1):213, 2009.
team. The data used in this research and building the model [13] K-R Müller, Alexander J Smola, Gunnar Rätsch, Bernhard Schölkopf,
is using the sales-in data for different markets. The important Jens Kohlmorgen, and Vladimir Vapnik. Predicting time series with
support vector machines. In International Conference on Artificial
factor to be considered is the stability of the model and remov- Neural Networks, pages 999–1004. Springer, 1997.
ing the game-playing. Two open-source platforms are used to [14] Gregory Daniel Noble. Application of modern principles to demand
build the model. Time-series model is developed in R-Studio forecasting for electronics, domestic appliances and accessories. 2009.
and Regression Based model using the data mining algorithms [15] David W Opitz and Richard Maclin. Popular ensemble methods: An
developed in Python. After the results are generated, Ensemble empirical study. J. Artif. Intell. Res.(JAIR), 11:169–198, 1999.
of results is validated and generated using the Microsoft Excel. [16] Carl Edward Rasmussen and Christopher KI Williams. Gaussian
processes for machine learning, volume 1. MIT press Cambridge, 2006.
In the future work, different techniques will be considered and
[17] Lydia Shenstone and Rob J Hyndman. Stochastic models underlying
researched. Time-Series and Machine Learning to be built in croston’s method for intermittent demand forecasting. Journal of
one platform and check how the minimization of mse produces Forecasting, 24(6):389–402, 2005.
the forecast. [18] Richard J Tersine and Richard J Tersine. Principles of inventory and
materials management. 1994.
VI. ACKNOWLEDGEMENT [19] Mohammad Valipour, Mohammad Ebrahim Banihabib, and Seyyed
Mahmood Reza Behbahani. Comparison of the arma, arima, and
The complete list of the team which has worked in this the autoregressive artificial neural network models in forecasting the
project: monthly inflow of dez dam reservoir. Journal of hydrology, 476:433–
441, 2013.
Rajiv Ranjan∗ , Arashdeep Singh∗ , Shaivya Datt∗ , Aamir
[20] Vladimira Vlckova and Michal Patak. Role of demand planning in
Ahmed Khan∗ , Mitav Kulshrestha∗ , Pawan Kulkarni∗ , business process management. In The 6th International Scientific
Anubhav Rustogi∗ , Aditya Iskande∗ , Mandar Shirsavakar∗ , Conference” Business and Management 2010”. Selected paper, pages
Rabiya Gill∗ , Heine van der Lende† , Srinivas Deshpande† , 1119–1126, 2010.
Srijan A† [21] Vladimira Vlckova and Michal Patak. Barriers of demand planning
implementation. Economics & Management, 16:1000–1005, 2011.
∗ [22] Neal Wagner, Zbigniew Michalewicz, Sven Schellenberg, Constantin
Advanced Analytics Team, SS Supply Chain Solutions Pvt. Chiriac, and Arvind Mohais. Intelligent techniques for forecasting
Ltd., Bangalore multiple time series in real-world systems. International Journal of
† Enterprise Information Management, Philips Lighting, Intelligent Computing and Cybernetics, 4(3):284–310, 2011.
Bangalore [23] SM Watson, M Tight, S Clark, and E Redfern. Detection of outliers in
time series. 1991.
[24] Mengdi Wei and Yang Liu. Key factors and key obstacles in global
R EFERENCES supply chain management: A study in demand planning process, 2013.
[1] Vic Barnett, Toby Lewis, et al. Outliers in statistical data, volume 3. [25] G Peter Zhang. Time series forecasting using a hybrid arima and neural
Wiley New York, 1994. network model. Neurocomputing, 50:159–175, 2003.

978-1-5090-6697-1/17/$31.00 ©2017 IEEE 1111

View publication stats

You might also like