Professional Documents
Culture Documents
Opportunity Analysis
(using R programming language)
Submitted by
Mr. Abhijit Bag
(Reg.No:161541810001 & Roll No:15499016029)
On 7th MAY,2018
under supervision of
Mr. Subhajit Adhikari
I hereby declare that this thesis contents original research work done
by me, as part of master of computer science studies. All information
in this document has been obtained and presented in accordance
with the academic rules and ethical conduct.
I also declare that, as required by these rules and conduct I have
fully cited and referenced all the materials.
…………………………………….
Signature & Date
2|P a ge
Customer Segmentation & Opportunity Analysis 2018
Acknowledgement
I would like to express my sincere, heart-felt gratitude to my
respected guide Assistant Professor Mr.Subhajit Adhikari,
Department of Computer Science in DAITM under MAKAUT, for this
unfailing guidance, prolific encouragement, constructive suggestion
and continuous involvement during each and every phase of this
work.
Last but not the least; I would like to thank all my classmates of
MSc(CS) batch 2016-18 for their friendly co-operation and support.
3|P a ge
Customer Segmentation & Opportunity Analysis 2018
Certificate of Approval
…………………………………………….
Signature of examiner
Date:
4|P a ge
Customer Segmentation & Opportunity Analysis 2018
……………………………………………………..
Project Guide
Mr. Subhajit Adhikari
Assistant Professor,
Dinabandhu Andrews Institute Of Technology & Management,
Kolkata
……………………………………………………..
Forwarded by
Ms. Paramita Ray
HOD, Dept. of Computer Science,
Dinabandhu Andrews Institute Of Technology & Management
Kolkata.
5|P a ge
Customer Segmentation & Opportunity Analysis 2018
6|P a ge
Customer Segmentation & Opportunity Analysis 2018
7|P a ge
Customer Segmentation & Opportunity Analysis 2018
1. Improving your whole product: Having a clear idea of who wants to buy
your product and what they need it for will help you differentiate your
company as the best solution for their individual needs. The result will be
increased satisfaction and better performance against competitors. The
benefits also extend beyond your core product offering, since any insights
into your best customers will allow your organization to offer better
customer support, professional services, and any other offerings that
make up their whole product experience.
2. Focusing your marketing message: In parallel with improvements to the
product, conducting a customer segmentation project can help you
develop more focused marketing messages that are customized to each of
your best segments, resulting in higher quality inbound interest in your
product.
3. Allowing your sales organization to pursue higher percentage
opportunities: By spending less time on less lucrative opportunities and
more on your most successful segments, your sales team will be able to
increase its win rate, cover more ground, and ultimately increase
revenues.
4. Getting higher quality revenues: Not all revenue dollars are created
equal. Sales into the wrong segment can be more expensive to sell and
maintain, and may have a higher churn rate or lower upsell potential after
the initial purchase has been made. Staying away from these types of
customers and focusing on better ones will increase your margins and
promote the stability of your customer base.
8|P a ge
Customer Segmentation & Opportunity Analysis 2018
9|P a ge
Customer Segmentation & Opportunity Analysis 2018
Algorithm
10 | P a g e
Customer Segmentation & Opportunity Analysis 2018
Next, we’ll get the data into a usable format, typical of an SQL query from
an ERP database. The following code merges the customers, products and
orders data frames using the dplyr package.
11 | P a g e
Customer Segmentation & Opportunity Analysis 2018
Next, we need to convert the unit price to categorical high/low variables. One
way to do this is with the cut2() function from the Hmisc package. We’ll
segment the price into high/low by median price. Selecting g = 2 divides the
unit prices into two halves using the median as the split point.
12 | P a g e
Customer Segmentation & Opportunity Analysis 2018
K-Means Clustering
Now we are ready to perform k-means clustering to segment our customer-
base. Think of clusters as groups in the customer-base. Prior to starting we
will need to choose the number of customer groups, kk, that are to be
detected. The best way to do this is to think about the customer-base and our
hypothesis. We believe that there are most likely to be at least four customer
groups because of mountain bike vs road bike and premium vs affordable
preferences. We also believe there could be more as some customers may
not care about price but may still prefer a specific bike category. However,
we’ll limit the clusters to eight as more is likely to overfit the segments.
13 | P a g e
Customer Segmentation & Opportunity Analysis 2018
14 | P a g e
Customer Segmentation & Opportunity Analysis 2018
attaches the k-means centroids to the bike models and categories for trend
inspection.
Now, on to cluster inspection.
CLUSTER 1
We’ll order by cluster 1’s top ten bike models in descending order. We can
quickly see that the top 10 models purchased are predominantly high-end
and mountain. The all but one model has a carbon frame.
CLUSTER 2
Next, we’ll inspect cluster 2. We can see that the top models are all low-
end/affordable models. There’s a mix of road and mountain for the primary
category and a mix of frame material as well.
15 | P a g e
Customer Segmentation & Opportunity Analysis 2018
CLUSTERS 3, 4 & 5
Inspecting clusters 3, 4 and 5 produce interesting results. For brevity, we
won’t display the tables. Here’s the results:
Reviewing Results
Once the clustering is finished, it’s a good idea to take a step back and
review what the algorithm is saying. For our analysis, we got clear trends for
four of five groups, but two groups (clusters 2 and 4) are very similar.
Because of this, it may make sense to combine these two groups or to switch
from kk = 5 to kk = 4 results.
16 | P a g e
Customer Segmentation & Opportunity Analysis 2018
Additionally, you cannot force feed this process on your business. If the key
stakeholders that will be impacted by the best current customers
segmentation process do not fully buy-in, then the outputs produced from it
will be relatively meaningless.
17 | P a g e
Customer Segmentation & Opportunity Analysis 2018
REFERENCES
Dr. R. Gardener “The Essential R Reference” (2014),
Concepts of customer segmentation http://www.business-science.io
https://labs.openviewpartners.com/customer-segmentation/
Source data related to our analysis has been collected from
https://github.com/mdancho84/orderSimulatoR/tree/master/data
https://www.kaggle.com/
https://www.r-project.org/
https://www.rstudio.com/
18 | P a g e