Professional Documents
Culture Documents
Marketing Analytics Project: Hanoi University of Science and Technology
Marketing Analytics Project: Hanoi University of Science and Technology
GROUP 6
Contents
PART 0: INTRODUCTION.....................................................................................................................
PART I: MARKETING ANALYTICS BY SIMULATION WITH RAND()......................................
I. INTRODUCTION....................................................................................................................................
II. ANALYSIS............................................................................................................................................
III. SUMARY............................................................................................................................................
PART 2: MARKETING ANALYTICS BY SIMULATION WITH CRYSTAL BALL..................
I. Problem...............................................................................................................................11
II. Solve the problem...........................................................................................................11
1. Finding optimal order using the uniform distribution..............................................................11
2.Finding optimal order using the triangular distribution.............................................................17
3. Finding optimal order using the normal distribution......................................................21
4. Is the optimum number of chairs to order the average demand? Why or why not?.....23
5. Why is it not sufficient to simply calculate the average of the simulation demand to
validate the simulation?.............................................................................................................24
PART 3: MARKETING ANALYSIS....................................................................................................
I. About data set......................................................................................................................................
1. Insight of analyze the dataset.............................................................................................24
III. Visual Exploratory Data Analysis.................................................................................28
1. Diploma distribution and income level.............................................................................28
2. Diploma distribution by marital situation........................................................................29
1. Graphical methods...................................................................................................................31
2. Statistical methods...................................................................................................................32
Question 1: Is the correlation between the annual income and the amount of spending
statistically significant?................................................................................................................33
1. Hypothesis statement...............................................................................................................33
2. Analysis plan formulation: Rank Correlation Tests...................................................................33
Question 2: Is there correlation between the diploma owned and the marital status?................34
1. Hypothesis statement...............................................................................................................34
2. Analysis plan formulation: Rank Correlation Tests...................................................................34
V. Customer Segmentation: RFM Analysis and two-step cluster analysis algorithm using SPSS:
35
1.RFM Analysis(Recency – Frequency – Monetary Analysis)......................................................35
2
a. Define the Multivariate Technique to be used....................................................................35
2. Two-step cluster analysis algorithm:...................................................................................40
Reference:................................................................................................................................................
PART 0: INTRODUCTION
Throughout the 2023-1 semester at HUST, we delved into the crucial role of marketing
analytics in the big data era. We learned how analyzing customer and competitive data
enhances business decision-making through many tools such as excel, crystal ball, spss. These
are all very good tools and are often used in statistics, especially in marketing analysis.
However, managing and analyzing the ever-growing volume of data, both structured and
unstructured, presents substantial challenges.
Marketing Analytics bridges this gap by applying data science to marketing problems.
We explored customer data analysis techniques and their underlying principles, equipping
ourselves with crucial analytical skills for real-world scenarios.
Through three project assignments, we honed our skills in fundamental tools like Excel,
Crystal Ball, and SPSS.
The following sections will delve deeper into the methodologies and solutions we
employed for each problem, leveraging our newly acquired marketing analytics knowledge.
3
PART I: MARKETING ANALYTICS BY SIMULATION WITH RAND()
I. INTRODUCTION
Camden Electronics Inc. is a small electronics firm that produces a variety of special-
purpose analog-to-digital converters, which are used primarily for process control. Its
business has grown to a sales level of $250,000,000 per year.
Ms. Greene is a purchasing agent, she has been able to purchase batches of components that
have been classified as rejects by the Components Division of Cynctron Manufacturing
Company.
A study made at Cynctron has shown that it is not economically feasible for them to
test and sort the batch in the Products Division. Cynctron always sells these rejects in the
same way. A list is kept of purchasing agents who will buy parts with specific characteristics;
these purchasing agents are emailed and informed when a batch of potential interest is
available. If interested, each purchasing agent makes a sealed bid and Cynctron sells the batch
of components to the winner. Ms. Greene has recently been informed of the availability of a
batch of 100 components identified by Cynctron as MATS314Q.
Over the past 2 years, Ms. Greene has bid on 85 previous batches of these components
and has saved the winning unit bids.
Table 1: History bidding data
Unit bid per Unit bid per Unit bid per
Contract est. good Contract est. good Contract est. good
unit unit unit
1 334.10 30 317.60 59 324.30
2 282.60 31 282.70 60 267.40
3 286.80 32 326.70 61 309.60
4 263.80 33 327.30 62 286.10
5 269.00 34 270.50 63 275.40
6 323.00 35 260.20 64 317.60
7 335.00 36 328.00 65 260.80
8 281.50 37 264.40 66 305.90
4
9 279.00 38 334.60 67 286.90
10 281.70 39 326.70 68 310.70
11 330.00 40 268.30 69 260.10
12 270.20 41 289.30 70 309.00
13 294.10 42 318.50 71 299.40
14 270.10 43 303.20 72 294.60
15 278.70 44 321.30 73 323.10
16 261.60 45 311.80 74 313.30
17 307.40 46 300.00 75 272.40
18 273.50 47 269.80 76 270.10
19 311.30 48 299.00 77 338.70
20 316.20 49 313.20 78 264.50
21 309.60 50 306.20 79 337.20
22 301.80 51 303.80 80 318.40
23 300.20 52 332.80 81 271.80
24 296.00 53 324.50 82 273.50
25 309.70 54 332.40 83 266.20
26 275.50 55 290.40 84 335.00
27 307.10 56 278.50 85 337.70
28 331.50 57 280.50
29 327.00 58 277.60
With the information from Cynctron on the characteristics of item MATS314Q, the
engineering department has informed Ms. Greene that Camden can use the item now. They
have received a contract to supply 25 pressure sensing and control devices, each of which
would require four of these components.
For Camden to manufacture the same circuit by regular methods, the cost would be
$7500 for setup plus a cost of $310 per unit produced. Cynctron’s price for tested and
guaranteed MATS314Q’s in quantities of 200 or less is $550 each.
With this information and history data given, Ms. Greene is about to make her analysis
and determine what to bid. She estimated that other bidders are bidding about $300 per good
unit.
5
This study aims to determine the best winning price which the Camden Electronics
Inc. can win the bid.
II. ANALYSIS
Step 1 : Calculating the Frequency, Probability and Cumulative Probability
Base on the history data, we will divide 85 contracts into 8 group distance 10 units
from $ 270 to $340. After that we continue to calculate the Frequency, Probability and
Cumulative Probability of each group.
Table 2: The function to calculate Frequency
6
Table 4: The Frequency, Probability and Cumulative Probability result
7
Step 3: Building the cost structure
As history data, the total cost would be $7500 for setup plus variable cost. Here is the fomular
of variable cost:
Variable cost = Variable production cost * No of pressure sensing
Table 7: The function to calculate the cost structure
Step 5: Calculating the P(Win/Bid), Slope and Intercept to Generate Random Bids for
Simulation
ECDF = Rand()
Slope = ΔBid/ΔECDF
Intercept = Bid −Slope * ECDF
Table 9: The P(Win/Bid), Slope and Intercept Random Bids result
8
Step 6: Simulation with 1000 Trial
To calcutlate the competitor’s bid, we will use VLOOKUP function.
Table 10: The function to calculate Competitor’s bid
Finding the Status. If Ms. Green’s bidding cost > Competitors’ bidding, she will win.
The result is shown in the table 11.
Table 11: The function to calcu late Competitor’s bid
9
III. SUMARY
The study demonstrate how to convert data to solve managerial decisions. Real
historical (empirical) data does not necessarily fit a known distribution, yet these data
frequencies and rankings can be used to estimate the appropriate empirical probability
distribution. Subsequently, the empirical distributions are used in decision trees and
simulations to make optimum managerial decisions. Base on history data bid given and
compare with the expected price of competitors, Camden Electronics Inc. should offer a price
$340/unit to win the bidding.
10
PART 2: MARKETING ANALYTICS BY SIMULATION WITH
CRYSTAL BALL
I. Problem
A retailer orders chair at a cost of $175 each and will sell them at $250. They forecast that
the demand will be about 8000 but in the range of 7000 to 9000. If they can’t sell all chairs at
the end of the season, they must sell the rest at half of the initial price. We need to find the
optimal order number.
II. Solve the problem.
The retailer won to retain in the inventory because of many reasons:
- The chairs might not be in good condition after being stored, or their designs may have
become outdated and no longer align with modern trends.
- As retailers, they aim to avoid incurring inventory costs, which would entail a substantial
investment for renting storage space.
a. Rand() function:
To simulate, we will set up 10,000 trials by using the series function. We utilize the
Rand () function which is an approximately uniform distribution to simulate the demand for
chairs in this season. The discrete uniform random variable function in Excel is:
11
To calculate the profit/loss of each demand, we multiply the number of chairs sold
during the season by the profit of each chair, and plus the multiple of the number of chairs for
sale multiply the loss of each chair when sold off and minus costing (= buying price multiple
number of chairs order)
To calculate the profit/loss of each demand, we multiply the number of chairs sold
during the season by the profit of each chair, and plus the multiple of the number of chairs for
sale multiply the loss of each chair when sold off and minus costing (= buying price multiple
number of chairs order)
12
To find the optimal order number, we will find the average profit so we can use Solver
parameters to optimize profit, bringing the highest revenue to the seller.
With objective is average profit, by changing value cells is Order (this number at first
I took as 8000, the number is between 7000 and 9000, it can change the seller's profit because
it is the deciding factor in the store's sales volume), constraints are: order >=demand min,
order <= demand max and order is an integer.
After running the model, we get the optimal order number = 8216 and average profit
= 570,708.25
13
To have the highest profit, the store should order 8,216 chairs, to avoid inventory or
shortages, affecting profits.
To check if this is uniform distribution, I find the bin of the number of seats that the
restaurant orders, starting at 7000, then I move forward by 100 seats, until it reaches 9000
then stops, to calculate the frequency of them being placed. order number of times, I use the
Frequency function, then find their probability by taking the frequency/number of seats
ordered. And I highlight the bin column and hold down the ctrl key, highlight the probability
column, then draw a column chart to prove that it is evenly distributed.
Probability
0.06
0.05
0.04
0.03
0.02
0.01
0
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90
b. Crystal ball
If order < demand, all ordered chairs are sold, there is no inventory, so no.chairs sold
during season will be equal to number chairs ordered, but order > demand, seats are not sold
out, there will be a surplus equal to order - demand, so no.chairs sale off is calculated by
order-demand and profit = no.chairs sell off * recycling price + no.chairs sold during season *
selling price –( no.chairs sell off + no.chairs sold during season) * buying price.
To be able to use crystal ball to optimize costs so that the store can have the highest
profit, we will have to set up variables.
14
We define the assumption variables( Demand), decision variables(Number chairs of
order) and forecast(profit/loss)
15
Set up conditions: objectives: maximize the Mean of Profit/Loss, decision variable:
Order
16
We get the results of average profit = $574,704 and the optimal order number = 8219
chairs
c. Compare the results of optimize order using Excel and Crystall ball
Method 1 Method 2
No.chairs ordered 8,216 8,266
Profits $570,708.25 $570,366.59
The two results of the two methods have relatively similar values. The rand() function
is a simple way to generate random numbers. However, this function may not generate
exactly distributed random values. Crystal Ball is a more powerful piece of software that can
generate more complex random distributions, so the Crystal Ball method can generate demand
values with greater precision than the rand() function method.
a. Rand function
This is Triangular distribution, so we use the following formula:
17
This function needs to be inverse to use in Excel for simulation, we have:
And unit sold = min(demand, most likely demand), Inventory = if(most likely demand
> demand, most likely demand – demand, 0), profit = buying price * unit sold + Recycling
price * inventory – buying price * most likely demand
18
To find the optimal order number, we will find the average profit so we can use Solver
parameters to optimize profit, bringing the highest revenue to the seller.
With objective is average profit, by changing value cells is Most likely demand (this
number at first I took as 8000, it can change the seller's profit because it is the deciding factor
in the store's sales volume), constraints are: Most likely demand >=demand min, Most likely
demand <= demand max and Most likely demand is an integer.
To achieve the highest possible profit, the analysis recommends that the store order
precisely 8698 chairs. This figure navigates the delicate balance between satisfying customer
demand and preventing either an excess of unsold inventory or a shortage of available chairs,
both of which would hinder profitability. In essence, the analysis has pinpointed the optimal
order quantity that maximizes earnings while ensuring a seamless alignment between supply
and demand.
The chart to visually illustrate and validate the even distribution pattern.
19
Triangle distribution
0.12
0.1
0.08
0.06
0.04
0.02
0
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90
b. Crystal Ball
c. Compare the results of optimize order using Excel and Crystall ball
20
Method 1 Method 2
No.chairs ordered 8,698 8,222
Profits $570,708.25 $573,472.57
The results of optimizing the number of seat orders using Excel and Crystal Ball
exhibit a minor difference, but this variance is not deemed significant. Possible factors
contributing to this difference include the chosen approach, uncertainties in demand, and
sensitivity to profit.
21
Unit sold, inventory, profit all have the same calculation formula as uniform
distribution. And then I use Solver to optimize profits, find out how many chairs to order will
bring the highest profit. We see that, the result is the max average profit = 595,243 and the
sellers should be order 8026 chairs.
Normal distribution
0.16
0.14
0.12
0.1
0.08
0.06
0.04
0.02
0
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90
22
The result show us number of chairs seller’s should be order 8176 chairs and
maximize profit is 572,228
c. Compare the results of optimize order using Excel and Crystall ball
Method 1 Method 2
No.chairs ordered 8,026 8,176
Profits $595,243 $572,288.31
The results of optimizing order quantities using Excel and Crystal Ball differ slightly,
but the difference is not significant. This is because both methods are only simulations, and
they can only account for a portion of the uncertainty in demand. In reality, there are many
other factors that can affect demand, such as market trends, competitor actions, and supply
chain disruptions. Therefore, it is important to consider all of these factors when making
business decisions.
4. Is the optimum number of chairs to order the average demand? Why or why
not?
The optimum number of chairs to order is not the average demans. The optimum
number of chairs to order may not align precisely with the average demand due to the inherent
variability in demand. In this particular situation involving Adirondack chairs, the demand is
uncertain and can fluctuate within a range of 7000 to 9000. Determining the optimal order
quantity involves considering multiple factors such as costs, carrying or holding costs,
shortage costs, and the uncertainty associated with demand. Relying solely on average
23
demand might not adequately account for potential variations or spikes in demand, risking the
possibility of stockouts or excessive inventory.
5. Why is it not sufficient to simply calculate the average of the simulation demand
to validate the simulation?
The data set contains information on 2205 Ifood corporate consumers, including:
• Customer profiles
• Product preferences
• Campaign successes/failures
• Channel performance
The management wishes to identify the persona of a potential client who exhibits
favorable behavior toward the company's market marketing effort. Throughout particular, we
shall examine the following questions throughout the article:
· Who are the customers who spend the most and are the most loyal?
· Which customers are most likely to leave the company?
· Who are the new customers and how can they increase the likelihood of recurring purchases?
24
· Which customers could potentially convert to higher spending customers?
· Which customers have left the brand and are least likely to return?
· Which customers need to be cared for and retained before becoming churn customers?
· What different customer segments require different care, incentives and marketing activities?
2. Analyze orientation
This essay identifies customers using the RFM method, then explores customers who have
accepted offers in the campaign to find customers who have positive behavior with the
marketing campaign and are likely to make large purchases.
The data contains 2,205 observations and 39 columns. Dataset has 5 categorical variables
and 23 numerical variables. After reviewing data columns and comparing them to the dataset
description, looking for missing values, checking column types and assessing unique values,
we can conclude that there is no missing value
The data set consists of 2205 customers of Ifood company with data on:
Customer profiles
25
· Education_PhD = Customer's education level is PhD (21 years)
Has_child
Channel performance
Campaign successes/failures
26
· Educational_years = Number of education year for each diploma
· Marital_Situation = 0 means “Alone”- Divorced, Single, Widow, 1 means “In relationship”
· Has_child = 0 means “No child” and 1 means “Has child”
· Spending = Total of amount spent on products.
· AcceptedCmpOverall = Count how many times customer accepted the offer in campaigns.
Statistic summary
27
Channel Performance
According to the Channel Performance chart, clients mostly purchase items online
(9042 purchases) and in-store (11768 purchases). The high incidence of in-store transactions
might be attributed to the company's fresh items such as meat, fish, etc.
Last month's website visits were also extremely high: 11,768 visits out of a total of 2,205
consumers. Each user visits the website 5 to 6 times each month on average.
28
PhD holders had the highest average salary, at $56,161.
Basic degree holders had the lowest median earnings of $20,306 More than half of our
consumers hold a graduate degree.
According to the chart of average income by graduation level, the higher the diploma,
the greater the average wage. People with a basic education, on the other hand, earn half as
much as those with two cycles.
In this section, we explore whether the average wage of persons with a PhD differs
statistically from that of those with a Master's degree.
The degree distribution appears to be same in the "Coupled" and "Alone" populations.
In both plots, the graduation level continues to dominate with 50%. PhD, Master's, and
2nd cycle follow.
3. Spending by income
We will eventually use a statistical test to see whether the association between annual
income and quantity spent is statistically significant.
My group has classified the customer's spending level into 39 levels (from 0 to over
2000, with a 50-point difference between the ranges).
Customers with children have an extraordinarily high purchasing level of less than $50
USD. Customers with children spend more than those without children in the $50-1200 range.
People without children, however, spend more after 1200.
30
IV. Statistical Hypothesis Testing
1. Is the average salary of PhD owners statistically different from Master owners?
2. Is the correlation between the annual income and the amount of spending statistically significant?
3. Is there correlation between the diploma owned and the marital status?
Firstly, normality tests must be done, if data are Normal distribution, parametric statistical
methods will be used. On the contrary, nonparametric statistical methods should be used.
1. Graphical methods
31
From the graph, we can immediately see which variables seem to be Normal or Gaussian-like:
2. Statistical methods
The tests assume that that the sample was drawn from a Normal distribution
32
The following, we answer 2 business problems by using Hypothesis Testing:
Question 1: Is the correlation between the annual income and the amount of spending
statistically significant?
1. Hypothesis statement
Spearman rank correlation is a non-parametric test that is used to measure the degree of
association between two variables.
Our second question was to find if there is a statistically significant correlation between the
income and the spending amount.
33
So we can conclude the positive monotonic association between “Income” and
“Spending”.
Question 2: Is there correlation between the diploma owned and the marital status?
Our third question was to find if there is a statistically significant correlation between the
diploma and the marital situation.
Marital situation
Alone Incouple Total
Basic 20 34 54
2n cycle 62 136 198
Graduation 401 712 1113
Master 125 239 364
PhD 175 301 476
783 1422
1. Hypothesis statement
34
The p-value in above test is larger than 0.05 so we can not reject H_0: Education and
Marital_Situation are independent.
When doing and reading research, understanding the different types of variables is
important which are mainly 4 types:
1. Independent variable: a variable thought to be the cause of some effect. This term is
usually used in experimental research to describe a variable that the experimenter has
manipulated.
2. Dependent variable: a variable thought to be affected by changes in an independent
variable. You can think of this variable as an outcome.
3. Predictor variable: a variable thought to predict an outcome variable. This term is basically
another way of saying ‘independent variable’.
35
4. Outcome variable: a variable thought to change as a function of changes in a predictor
variable. The term is also synonymous with ‘dependent variable’.
a. Variables can be split into categorical and continuous, and within these types there are
different levels of measurement:
b. Categorical (entities are divided into distinct categories)
i. Binary variable There are only two categories (e.g., dead or alive).
ii. Nominal variable: There are more than two categories (e.g., whether someone is an omnivore,
vegetarian, vegan, or fruitarian).
iii. Ordinal variable: The same as a nominal variable but the categories have a logical order (e.g.,
whether people got a fail, a pass, a merit or a distinction in their exam).
i. Interval variable: Equal intervals on the variable represent equal differences in the property
being measured (e.g., the difference between 6 and 8 is equivalent to the difference between
13 and 15).
ii. Ratio variable: The same as an interval variable, but the ratios of scores on the scale must also
make sense (e.g., a score of 16 on an anxiety scale means that the person is, in reality, twice
as anxious as someone scoring 8). For this to be true, the scale must have a meaningful zero
point.
b. Dataset
After Cleaning the data, we got 2205 transactions. We focused on consumers from the
ABC company only, where a transaction represents one customer, and each transaction
contains a set of items purchased by the customer. It is useful to cluster the customers so that
customers with similar buying patterns are in a cluster. It is useful for:
2. Targeted Marketing.
36
Figure 1: The clean data set and variables’ view in spss.
With the above data, to gain a deeper understanding of customer behavior and
painpoint the most valuable segments within our dataset, we're employing RFM analysis prior
to clustering. This technique leverages three key metrics—Recency, Frequency, and Monetary
value—to create more informative variables based on customers' past purchase patterns. By
identifying the most profitable customer segments, RFM analysis will enable us to tailor our
segmentation strategy effectively.
Figure 2: RFM analysis for Customer Segmentation using SPSS: Direct Marketing> RFM
Analysis function.
37
Figure 3: RFM analysis: choosing format
38
Frequency_Score: it shows how often a customer makes repeat purchases, with higher
numbers meaning they buy more often
Monetary_Score: it shows how much a customer spends, with higher numbers meaning they
spend more
RFM_Score: it shows the total RFM score, customers with high RFM scores, indicating
frequent purchases, recent activity, and significant spending, tend to be the most vocal
advocates for your brand
Because I chose "Assign ties to the same bin" in the binning section, which means that
duplicate values will be assigned to the same bin, my results are as follows:
Output: The heat map of mean monetary distribution shows the average monetary
value for categories defined by recency and frequency scores. Darker areas indicate a higher
average monetary value. In other words, customers with recency and frequency scores in the
darker areas tend to spend more on average than those with recency and frequency scores in
the lighter areas.
On the vertical axis, 'Recency' scores tell us how long it's been since a customer's last
visit. Higher scores indicate recent visits, while lower scores mean longer gaps between
purchases
Darker blue implies bigger spender.
The horizontal axis tracks 'Frequency,' with lower values reflecting infrequent visits
and higher values signifying frequent market visit.
39
So, we can see that customer persona is someone who shops frequently, spends
generously, and visits the market regularly. This insight guides our segmentation strategy
towards those with higher overall 'RFM scores.
Our next phase in the case design involves implementing the chosen multivariate
analysis. We've opted for the two-step cluster algorithm in SPSS, and we will elaborate on
this in the following section.
Two-step cluster analysis offers distinct advantages over K-means clustering for
identifying meaningful groups within data. It leverages a dual-stage approach, first employing
pre-clustering techniques and then refining results through hierarchical methods. This hybrid
strategy excels at uncovering naturally occurring segments, reflecting real-world patterns.
Additionally, it can uncover subtle relationships between individuals who share multiple
distinct characteristics. Notably, it can handle both categorical and numerical variables
concurrently without requiring standardization, a common prerequisite for K-means. Finally,
it effectively manages large datasets, making it well-suited for contemporary data-rich
environments.
40
Figure 7: Silhouette measure of cohesion and separation
41
Figure 9: Predictor Importance
42
43
Results:
In this section we will conclude that our final results and readings from the cluster segments.
Cluster 1 (30.5%): This is the largest customer group, accounting for 30.5% of the total
customer base. This group has low activity, average value, average loyalty, and low risk. This
group could be new customers or customers who are in the early stages of their relationship.
Cluster 4 (23.1%): This is a medium-sized customer group, accounting for 23.1% of the total
customer base. This group has medium activity, low value, high loyalty, and high risk. This
group could be loyal customers but do not generate much value for the business.
Cluster 5 (19.5%): This is a medium-sized customer group, accounting for 19.5% of the total
customer base. This group has medium activity, high value, average loyalty, and high risk.
This group could be potential customers with the potential to generate a lot of value for the
business.
Cluster 2 (17.7%): This is a medium-sized customer group, accounting for 17.7% of the total
customer base. This group has high activity, average value, low loyalty, and medium risk.
This group could be new customers with the potential to generate a lot of value for the
business.
Cluster 3 (9.2%): This is the smallest customer group, accounting for 9.2% of the total
customer base. This group has high activity, low value, high loyalty, and high risk. This group
could be loyal customers but do not generate much value for the business.
Summary
For the group of new customers or customers who are in the early stages of their relationship.
Strategies:
+ Send attractive promotions or offers to attract this customer group to shop more.
+ Optimize the online shopping experience to make it easier for customers to find the
products or services they are interested in.
+ Strengthen customer service to solve customer problems and build relationships with them.
For the group of loyal customers but do not generate much value for the business.
Strategies:
44
+ Develop new marketing programs to attract potential customers.
For the group of potential customers with the potential to generate a lot of value for the
business.
Strategies:
+ Build a loyalty program to encourage this customer group to shop more and receive special
benefits.
+ Strengthen customer service to make customers feel cared for and valued.
+ Send marketing messages that are relevant to the needs and preferences of customers in this
cluster.
Conclusion:
RFM analysis, a data-driven method, serves as both a customer segmentation technique and a
basis for strategic marketing decisions. Our research employs RFM values (recency,
frequency, and monetary) to segment customers into cohesive groups, allowing us to tailor
marketing strategies based on their purchasing behaviors.
45
Reference:
46