Professional Documents
Culture Documents
PROBLEM Developing an Analytics based model for big retail chains to help
IDENTIFIED STANDARD NEEDS
STATEMENT them know what items to make available, in which stores & quantity
Ensure accurate demand forecasting & inventory control system to Real time
OBJECTIVE Product Bundling Sales Forecasting
costs & productivity and profits. Inventory tracking
FOCUS Account factors leading to variation in demand & forecast to prevent
A) Excessive stocks (Blocked capital, labour & insurance costs)
B) Stock outs (Sales lost, reduced customer satisfaction & loyalty)
Multi store Predictive
Integration analytics
Global retail market will reach $25.038 trillion in 2019, an increase of 4.5%
SURVEY OF 200 MAJOR RETAILERS
Visited 5 major retailer Analyzed predictive On the basis of our WHAT WE AIM TO PROVIDE
to understand needs, models globally to primary & secondary
factors affecting compare best in research –we Setting up products Reduce Shrinkage
demand & technology business strategies & designed our
currently in use understand nuances analytics model
Establishing
Using Automation
RETAILERS – WE VISITED Networks
Forecasting
Creating flowcharts
Demand
Team PHOENIXES - FMS Delhi Capgemini L’INNOVATEUR 2.0
CASE IN POINTS
1 2
To enhance fill rate, efficiency, freshness to the food & grocery segment They wanted to predict customer’s future buying habits by historical data
Partnered with Perpule- moved data to the cloud, ensuring Partnered with Celect- to gain a more accurate view of
real-time access inventory in real time. Moving the data demand and help them exceed customer expectations by
helps multiple stores manage inventory in real time using machine learning on their past transactions record
2017 2018
The fulfilment centres get the real-time data which This will help them deepen relationships with customers
made stock replenishment process more efficient by providing them likely demand, delivery timing etc
Cloud computing not only eases access but also enhances TAKEAWAY : Consumer behavior or review can have substantial effects on
TAKEAWAY : TAKEAWAY :
computing speed for big data to run – Open source processing demand- to account that- sentiment analysis on Facebook
3 4
To learn about customers shopping pattern, needs & expectations To minimize their error value b/w predefined demand value & actual
demand through employing regression methods
Now Tesco is leveraging the power of big data & Machine
learning. They plan to improve their use of data in order to Multiple time series models to ensure all relevant
deal with constantly changing customer buying patterns 2016 parameters, such as channel and category, are considered 2018
for forecasting
This is helping them by showing how customers actually Analytics will help them enhance decision making, develop
shop, so they can adapt selling efforts to customer need & new business opportunities & help solve problems like
expectations. customer insights, pricing, ranging
DATA ENRICHMENT
DATA DATA FROM BIG DATA
COLLECTION CLEANING REPLENISHMENT
DATA MART ON
FIND FREQUENT HADOOP / SPARK
DUPLICATE INCORRECT ITEMS
DATA COMPUTATION
POS data, sellers experience, H2O library : Open source big data AI
Primary research & customers Open source data processing
platform to implement DL algorithms on engine- for storage & stream
feedback for real-time, subjectivity Hadoop Spark big data environments.
ERRONEOUS MISSING data, trends and graph processing
VALUE VALUES
Apriori algorithm : The frequent item
sets determined will be used to
determine trends in the database.
Team PHOENIXES - FMS Delhi Capgemini L’INNOVATEUR 2.0
ENSEMBLE STRATEGY : An ensemble system is composed of two parts: ensemble generation and ensemble integration In
ensemble generation part, a diverse set of base prediction models is generated by using different methods or samples. In
integration part, the predictions of all models are combined by using an integration strategy EXEC
ensemble system
FEATURE EXTRACTION ALGO
To present and analyze the
most significant features
Ensemble generation ensemble integration
Principal Component Analysis
(PCA) : Works on feature DEEP LEARNING MODEL
elimination & feature extraction BOOSTING STRATEGY
Decision integration approach
TIME SERIES MODEL combines strengths of
DIMENSIONALITY CLUSTERING different algorithms into a
REDUCTION SUPPORT VECTOR
single collaborated method
philosophy. Weighted
REGRESSION MODEL
CLUSTER FORMATION combination of the results
Segregation of groups with
similar traits and assign them
into clusters TIME SERIES MODEL DEEP LEARNING MODEL SVR MODEL
K-means will be employed- greedy Predict future demand based Multilayer feedforward Support Vector Regression
& fast clustering algo that partition on historical time series data artificial neural network tries to fit the best line within
samples into k clusters each sample which is a sequence of data (MLFANN) – Implementation a predefined or threshold
is near to cluster center points measured at of artificial neural networks error value; to predict a
successive intervals in time to analyze historical & real- continuous variable
time data
1. DL consumes time & leads computing lack which made us to opt for clustering.
POTENTIAL
POTENTIAL
1. Reduce out of stock products by 18-20% CHALLENGE
IMPACT