Professional Documents
Culture Documents
O ur main objective in this paper is to measure the value of customers acquired from Google search adver-
tising accounting for two factors that have been overlooked in the conventional method widely adopted in
the industry: (1) the spillover effect of search advertising on customer acquisition and sales in off-line channels
and (2) the lifetime value of acquired customers. By merging Web traffic and sales data from a small-sized U.S.
firm, we create an individual customer-level panel that tracks all repeated purchases, both online and off-line,
and tracks whether or not these purchases were referred from Google search advertising.
To estimate the customer lifetime value, we apply the methodology in the customer relationship management
literature by developing an integrated model of customer lifetime, transaction rate, and gross profit margin,
allowing for individual heterogeneity and a full correlation of the three processes. Results show that customers
acquired through Google search advertising in our data have a higher transaction rate than customers acquired
from other channels. After accounting for future purchases and spillover to off-line channels, the calculated
value of new customers using our approach is much higher than the value obtained using conventional method.
The approach used in our study provides a practical framework for firms to evaluate the long-term profit impact
of their search advertising investment in a multichannel setting.
Key words: customer lifetime value; multiple-channel shopping; customer acquisition; sponsored search
advertising
History: Received: August 8, 2010; accepted: April 25, 2011; Eric Bradlow and then Preyas Desai served as the
editor-in-chief and Peter Fader served as associate editor for this article. Published online in Articles in
Advance August 25, 2011.
837
Chan, Wu, and Xie: Measuring the Lifetime Value of Customers Acquired from Google Search Ads
838 Marketing Science 30(5), pp. 837–850, © 2011 INFORMS
important factors in the profit calculation. First, multi- over time. The predicted CLV from new customers
channel distribution has become more prevalent with after accounting for future purchases and acquisi-
the widespread use of Internet. By focusing only on tion and sales spillover to the off-line channel is
online transactions, we are not able to account for the much higher than the CLV obtained when we use
potential cross-channel spillover effects. According to the conventional method.3 Assuming all customers
ComScore, a substantive portion of customers ends acquired from Google would not be acquired from
up making purchases off-line even though they are other channels, the breakeven CPC across keywords,
made aware of the retailer through search advertis- above which the firm’s expected returns are negative,
ing; for example, only 25% of purchases in the jewelry is $10.22, significantly higher than the current average
category generated from search advertising were con- CPC at $0.80 in data. By contrast, the breakeven CPC
verted online (Ryan 2006). The conventional method calculated from the conventional method is $0.37,
does not capture such positive spillovers from search much lower than the current CPC. These results show
advertising and therefore might lead to an underesti- that the firm’s management would be seriously mis-
mate of the profit impact. led on their investment on search advertising without
Second, with the shift from product-centered think- accounting for sales spillovers and future purchases
ing to customer-centered thinking in marketing from newly acquired customers.
research and practice, customer lifetime value has The contribution of this paper is twofold. From the
been widely used in many industries as key market- academic perspective, to the best of our knowledge,
ing asset metrics. As firms increasingly target mar- our empirical study is the first one to apply CRM
keting expenditures on the maximization of these models to investigate the long-term value of cus-
metrics, profit measures that only reflect short-term tomers acquired from sponsored search advertising in
returns are often prejudicial against marketing expen- a multichannel context. Managerially, we demonstrate
ditures (Rust et al. 2004). With a trusted relationship that, by making use of the data sources available to
with firms, returning customers may spend more and advertisers, a better measurement of the value of cus-
purchase more frequently in the future. Because the tomers acquired from search advertising can be con-
conventional method only considers immediate pur- structed to help firms evaluate the true profitability
chases of customers acquired from Google, it over- of their investment.
looks the value from the same customers whose
future purchases are no longer referred from search 1.1. Literature Review
advertising. Marketing researchers have developed a number of
The main objective of this paper is to develop an models to measure the customer lifetime value. One
empirical method to estimate the lifetime value of of the earliest and most impactful models is the
customers who clicked on Google search advertise- Pareto/NBD model proposed by Schmittlein et al.
ments prior to their first-time purchases with the (1987). Their model has been widely adopted and
firm, accounting for the cross-channel spillover effect. has become the building block for many later models
Building on the current methodology in the cus- of customer lifetime value (including Schmittlein and
tomer relationship management (CRM) literature, we Peterson 1994, Reinartz and Kumar 2000, Fader et al.
develop an integrated model of customer lifetime, 2005a, and our study here). More recently, Fader et al.
transaction rate, and gross profit margin, allowing for (2005b) proposed an alternative beta-geometric/NBD
individual customer heterogeneity and a full correla- model to reduce the estimation burden. Donkers
tion of the three processes, to estimate the customer et al. (2007) compared the performances of competing
lifetime value (CLV). The model is estimated using models for CLV calculation. Venkatesan et al. (2007)
the well-established hierarchical Bayesian method. and Glady et al. (2009) relaxed the assumption of
We apply the model to an individual customer-level independence between transaction rate and transac-
panel we obtained from a small-sized U.S. firm that tion amount and modeled these two processes jointly.
has spent a substantial proportion of its marketing Abe (2009) extended the original Pareto/NBD model
budget on purchasing search keywords from Google to incorporate richer customer heterogeneity in a hier-
in recent years. We view our effort to establish a cor- archal Bayesian fashion. Singh et al. (2009) proposed a
rect measurement of the customer value as a crucial similar Markov chain Monte Carlo (MCMC) approach
step to evaluate the returns of investment on Google for estimating an extended range of models. In this
search advertising. paper, we follow this rich tradition and develop an
Our results show that, on average, customers
acquired from Google have a higher CLV, mainly 3
We note that the estimated value may not be equivalent to the
because they purchase more frequently relative to cus- profitability of the firm’s investment, because some of these cus-
tomers acquired through other channels. Returning tomers could have been acquired later from other methods had
customers tend to increase their purchase quantities Google search advertising not been used.
Chan, Wu, and Xie: Measuring the Lifetime Value of Customers Acquired from Google Search Ads
Marketing Science 30(5), pp. 837–850, © 2011 INFORMS 839
integrated model for customer lifetime, transaction unique customer panel data set that tracks the search
rate, and gross margin. By incorporating individual- and purchase behavior of individual customers in a
level customer heterogeneity and full intercorrelations multichannel context. As a result, we can build a
between the three stochastic processes, our model is model based on the well-established CRM modeling
less restrictive than the original Pareto/NBD model, literature and apply it to our empirical context.
and therefore it is complementary to the existing CLV The rest of this paper is organized as follows.
modeling literature. Section 2 describes the data and explains how we
Several studies have empirically examined the construct the customer-level panel data. Section 3
cross-channel effect in a dual-channel (i.e., online describes in detail how we model and estimate the
and off-line) setting. Biyalogorsky and Naik (2003) customer lifetime value. Section 4 reports the esti-
showed that online sales did not significantly canni- mation results and the value of customer acquisi-
balize retail sales. Deleersnyder et al. (2002) found tion through Google search advertising. Finally, §5
that cannibalization was most likely to occur when concludes.
the online channel closely mimics the off-line set-
ting. Ansari et al. (2008) found a negative impact 2. Data
of Internet usage on long-term purchase incidences. We obtained data from a small-sized firm in a
Verhoef and Donkers (2005) explored how retention Midwest city in the United States that has been
rates and cross-selling opportunities varied across dif- in business for about 20 years. It specializes in
ferent acquisition channels. In this study, we quan- providing biomedical and chemical lab supplies to
tify the impact of search advertising on customer the science community primarily inside the United
acquisition in the dual-channel setting and examine States. Its clients can be divided into two categories:
how customer lifetime, transaction rate, and gross research customers including colleges, universities,
margin differ among customers depending on acqui- and research labs; and commercial customers includ-
sition methods, the first-time transaction channel, and ing small pharmaceutical and biomedical companies.
other observed customer characteristics. The firm has a dual-channel structure for business—
Search advertising, as one of the newest forms of clients can either place orders on its website (online
advertising, has received increased interest in aca- channel) or by phone or fax (off-line channel). The
demic research in recent years. The majority of the firm has traditionally relied on word of mouth to
theoretical literature focuses on advertisers’ bidding reach potential customers; however, since late 2003,
strategy for keywords and optimal mechanism design the owner has started to actively use Google search
for search websites. Examples include Edelman et al. advertising to acquire new customers it could not
(2007) and Katona and Sarvary (2010). Empirical reach by traditional methods.
research on search advertising, on the other hand, The sample period of our data is from January
has focused on exploring the impact of search adver- 2004 to August 2007. The data come from three
tising on advertisers’ click-through and conversion sources: (i) keyword performance records from
rates. Ghose and Yang (2009) modeled the relation- Google AdWords, (ii) log files from the firm’s web-
ship between click-through rates, conversion rates, site, and (iii) customer transaction records, which are
CPC, and ad ranks using a simultaneous equations typical data sets often maintained by firms that run
model. Rutz and Bucklin (2011) examined potential online business and use sponsored search advertising.
spillover effects between generic and branded key- Therefore we believe that the data-merging methods
words and found that generic keyword searches affect and the empirical model we propose below can be
branded keyword searches, but the reverse effect adopted by these firms. The Google keyword perfor-
is not significant. A few recent studies have struc- mance data record the firm’s bid and CPC for each
turally modeled the competition among advertisers keyword, daily average rankings of its sponsored ads,
for search keywords. Yao and Mela (2011) developed a and number of daily impressions and clicks. The firm
dynamic model of advertisers’ bidding strategy. Chan typically bids for the generic chemical names of its
and Park (2010) used the method of moment inequali- top-selling products, the majority (71.6%) of which
ties to estimate the advertisers’ profitability generated are placed at the first or second position at Google-
from consumers’ click-through of sponsored search sponsored links.4 The CPC charged to the firm ranges
ads. To our knowledge, empirical works on search from $0.01 to $3.00 in the data, with a mean value at
advertising to date have only focused on the short- $0.53 and median at $0.37. Some summary statistics
term profit impact of search advertising on online are provided in Table 1. The firm has become more
sales, which might lead to a serious underestimate
of the true profit impact. Our purpose of calculat- 4
The top five keywords of the firm remain active throughout the
ing the long-term value of new customers acquired entire data period. They generated 65% of the impressions and 82%
from search advertising is achieved by constructing a of the clicks, and they accounted for 93% of the costs.
Chan, Wu, and Xie: Measuring the Lifetime Value of Customers Acquired from Google Search Ads
840 Marketing Science 30(5), pp. 837–850, © 2011 INFORMS
Table 1 Usage of Google Search Advertising Table 2 Trends in New Customer Transactions
Period 2004 2005 2006 2007 (Jan.–Aug.) Period 2004 2005 2006 2007 (Jan.–Aug.)
Total amount paid to 493 386 21414 31810 Number of new customers 80 126 131 71
Google ($) Number of transactions 97 231 337 255
Number of keywords bid 90 122 182 208 generated by new customers
Number of Google 11888 21473 41480 41767 Gross margin from new 12.4 39.8 64.2 67.4
keyword referrals customers (in thousand $)
Average CPC ($) 0.26 0.16 0.54 0.80
the same ISP name. In this case we use the time prox- Table 3 Conversion Rates
imity of the Web browsing session of the unique IP
Purchases from Purchases from
address and the transaction date of each individual online channel off-line channel
customer in the organization as the criteria for further Conversion rates only (%) only (%) All purchases (%)
matching.
Potential customers 0033 0068 1002
Returning customers 9098 34052 44050
2.2. Conversion Rate and Purchase Behaviors All customers 3006 10022 13028
Based on the browsing history from the merged data
set, we classify a customer as acquired by Google
search advertising if prior to the first-time purchase To gain a better understanding of the customer pur-
chase behaviors, we divide the customers acquired
he or she has clicked into the firm’s website through
during the sample period into cohorts along two
one of the advertised keywords on Google.7 There are
dimensions. The first dimension is the customer
altogether 67 new customers referred from Google in
acquisition method, where we divide customers into
the data and another 16 from other search engines
Google and non-Google customers. Non-Google cus-
(e.g., MSN, Yahoo!, etc.). Customers acquired from
tomers are mainly acquired from word of mouth;
Google made 181 transactions, contributing approxi-
only a few are from other search engines such
mately 20% of the total number of transactions and
as Yahoo! and MSN. The second dimension is the
18% of gross margin generated from new customers.
channel—online and off-line—where customers made
This merged data set also enables us to differenti- their first-time transactions. Table 4 shows the num-
ate website visits from existing customers and from ber of customers, the yearly transaction rate,10 and
potential customers.8 Among the 6,405 transactions the average dollar gross margin per transaction in
made by existing customers, approximately 18% use each cohort. Two-thirds of customers made first-time
search advertising prior to purchasing from the firm. transactions off-line. Their yearly transaction rate and
Conversion rate from search advertising in industry gross margin are significantly higher than customers
practice is typically calculated as the ratio of observed who made first-time transactions online (0.76 ver-
online transactions referred by search engine to the sus 0.58 for transaction rate and $238 versus $146
total number of visits referred by search engine. This for gross margin, respectively). Customers acquired
calculation overlooks the purchases made through through Google (67 out of a total of 408) tend to
the off-line channel. Table 3 shows that more than have a higher transaction rate and gross margin than
three-quarters of the purchases with prior clicks from customers acquired from other methods (1.10 ver-
sponsored links are off-line purchases—the conver- sus 0.63 for transaction rate and $240 versus $203
sion rates for online and off-line are 3.1% and 10.2%, for gross margin, respectively). Based on these obser-
respectively. This could be due to the following rea- vations, we consider acquisition methods and first-
sons: (i) the off-line channel provides consumers an time transaction channels to be important factors that
opportunity to negotiate prices and to obtain more may explain the variance in the customer long-term
information on products and delivery services; and profitability in our model. We also investigate some
(ii) personal conversation over the phone may help potential dynamics in customer purchase behaviors
to enhance business-to-business relationships. Table 3 from data. The only significant finding, based on sim-
also shows that the conversion rate of returning cus- ple regressions, is that the gross margin of each trans-
tomers, combining both online and off-line channels, action is positively correlated with the length of time
is more than 40 times higher than potential customers since customers were acquired, implying that cus-
(44.5% versus 1.0%, respectively), perhaps because it tomers tend to be more profitable if a longer relation-
is difficult for the firm, as a small unknown supplier, ship is maintained.
to gain trust from the latter group of customers.9
3. Modeling the Customer
7
A customer is not counted as acquired by Google search adver- Lifetime Value
tising if he or she is referred from Google using search phrases For the purpose of this study, we focus on mod-
containing the whole or part of the name of the firm.
eling the value of the 67 new customers acquired
8
Website visits from potential customers are defined as click- from Google during the sample period, and we com-
throughs from search engines that do not match with the IP address
of any existing customer.
pare it with the value of the 341 customers acquired
9
Alternatively, it may be because potential customers are less likely
10
to find the products they need from the firm. We believe such an The yearly transaction rate is calculated by dividing the total
explanation does not apply in our context because the keywords in number of transactions by the number of years after acquisition.
our study are all specifically related to generic products that (except Although the number is subject to a right-censored bias, it is only
for prices) are almost homogeneous among suppliers. shown for illustration purposes.
Chan, Wu, and Xie: Measuring the Lifetime Value of Customers Acquired from Google Search Ads
842 Marketing Science 30(5), pp. 837–850, © 2011 INFORMS
Table 4 Number of Customers, Transaction Rate, and Gross Margin respectively. Fader and Hardie (2005a) derive the like-
Across Customer Segments lihood for 8xi 1 tix 1 Ti 9 as
x
First-time transaction i i i −4i +i 5tix
channel L4i 1 i xi 1 tix 1 Ti 5 = e
i + i
Online Off-line Total x +1
i i
Acquired from Google search advertising
+ e−4i +i 5Ti 0 (1)
i + i
Number of customers 22 45 67
Yearly transaction rate 0086 1021 1010 The main difference between modeling the cus-
Average gross margin 13500 29104 24001 tomer gross margin and customer lifetime and trans-
per transaction ($) action rate is that we allow for dynamic changes in
Acquired from other methods the former process. We utilize the panel data structure
Number of customers 105 236 341 and develop a random effect linear model as
Yearly transaction rate 0052 0068 0063
Average gross margin 14802 22706 20302 ln zij = bi + · ln4dij 5 + ij 1 ij ∼ N401 2 51 (2)
per transaction ($)
where zij is the gross margin for the jth transac-
tion conditional on the customer being alive, bi the
customer-specific random effect, and dij the length of
from other channels. We follow the Pareto/NBD time from customer i’s acquisition to his or her jth
model developed in Schmittlein et al. (1987) to model transaction with the firm. The coefficient captures
customer lifetime and transaction rate. Conditional the dynamics in customer purchase behavior we dis-
on the occurrence of a transaction, we then model cussed before: we expect that customers tend to pur-
the gross margin for each transaction.11 Our model chase larger quantities the longer they have been in
also incorporates observed and unobserved customer business with the firm. Finally, ij is the idiosyncratic
heterogeneity in all three processes. Specifically, we error term that is assumed to be normally distributed.
assume that at any time there is a time-invariant Let individual parameters i ≡ 4ln i ln i bi 50
hazard function that a current customer i termi- represent the customer heterogeneity in the three pro-
nates his or her relationship with the firm. This cesses. We model the parameters as jointly deter-
probability function is assumed to be exponentially mined by a vector of covariates Xi as follows:
distributed with a hazard rate i . Conditional on i = G0 Xi + i 1 i ∼ N401 è51 (3)
being alive, the customer makes transactions accord-
ing to a time-invariant Poisson process with param- where G is a matrix of parameters and
eter i .12 Based on these two assumptions, previous
2
b
studies (e.g., Schmittlein et al. 1987) show that suf- 2
ficient statistics for customer lifetime and transac- è= b
tion rate are 8xi 1 tix 1 Ti 9, which represent the number b b b2
of repeated transactions (there are altogether xi + 1
is the variance–covariance matrix capturing the inter-
observed transactions for each individual i, includ-
dependence among customer lifetime, transaction
ing the first-time transaction), the time of the last rate, and gross margin. The covariates Xi include the
transaction and the total length of observation period, following four dummy variables: google (which equals
1 if the customer is acquired through Google search
11
In this study, we choose to use the gross margin per transaction
advertising), online (which equals 1 if the customer’s
instead of dollar purchase amount in the estimation. The main rea- first-time transaction is made from the online chan-
son is that, from the firm’s perspective, it is more important to nel), research (which equals 1 if the customer is from
understand the profitability per transaction that is better captured a research organization), and late-period (which equals
by the gross margin. 1 if the customer is acquired after 2006). We include
12
To test whether the time-invariant hazard rate and transaction the last variable to investigate whether or not the cus-
rate assumptions are reasonable, we estimate two alternative mod-
tomer value in the later period, during which our
els. The first model uses a more flexible Weibull distribution
assumption for customer lifetime, in which the hazard rate depends focal firm faced more intense competition for Google
on the length of a customer’s relationship with the firm. The second search keywords such that new customers might be
model assumes a nonhomogeneous Poisson process for the transac- more difficult to acquire, is systematically different
tion process, in which the transaction rate depends on the length of from that in the early period. Abe (2009) used a sim-
time since a customer’s last transaction. Model performance mea-
ilar model but only considers customer lifetime and
sured by either marginal likelihoods (Chib 1995) or Bayes factors
favors our proposed model over the alternative models. Detailed transaction rate. By explicitly allowing gross margin
estimation and model comparison results of the alternative models to be dynamically changing over time, and a full cor-
are available from the authors upon request. relation between customer lifetime, transaction rate,
Chan, Wu, and Xie: Measuring the Lifetime Value of Customers Acquired from Google Search Ads
Marketing Science 30(5), pp. 837–850, © 2011 INFORMS 843
and gross margin, our model can be viewed as an Similar to standard CLV models in previous stud-
extension of the standard CLV models. ies, customer lifetime probability derived from our
The model is estimated using the MCMC method. model has a “memoryless” property; i.e., P 4i >
We run 50,000 iterations with the first 10,000 iterations Ti + s5 = P 4i > Ti 5e−i ·s . Also, the probability of trans-
as burn-ins, and the last 40,000 iterations are used action at any time is fixed at the transaction rate i ;
to summarize the posterior distribution for parame- therefore i 4Ti + s5 = i in the Equation (6). By plug-
ters. The convergence is monitored visually and also ging in the equation for zi1 Ti +s and integrating out the
tested formally using the Gelman and Rubin (1992) stochastic component , we have
method. Details of the estimation are described in the
E6vi 4Ti + s57 = P 4i > Ti 5e−i ·s i
electronic companion, available as part of the online
2
version that can be found at http://mktsci.pubs · ebi +·ln4Ti +s5+ /2 e−r·4Ti +s5 0 (7)
.informs.org/.
Given the observation yi = 4xi 1 tix 1 Ti 5, the probabil-
3.1. Calculating CLV ity the customer remains to be active with the firm
The expected CLV can be calculated directly from the after the observed tenure length Ti in data, P 4i > Ti 5,
estimate of individual customer parameters i . Let r is derived in Schmittlein et al. (1987):
(we use 0.0043) be the continuous discount rate,13 and 1
let S be the length of time in the calculation of the P 4i > Ti xi 1tix 1Ti 5 = 0
1+4i /4i +i 556e4i +i 54Ti −tix 5 −17
customer value, where S = + is typically assumed
(8)
in the literature. We normalize the time when the cus-
tomer is acquired to 0. Let the discounted customer Finally, we integrate out the CLV for customer i
value till the projected period S be CVi 4S5. Also, let from Ti to the end period S as follows:
the discounted total value of (observed) transactions, 2
Intercept −5092∗ (−7.23, −4.97) −4044∗ (−4.89, −3.98) 4060∗ (4.28, 4.91)
Google −0011 (−1.66, 1.32) 0051∗ (0.08, 0.95) 0014 (−0.14, 0.42)
Online −0015 (−1.63, 1.25) −0044∗ (−0.85, −0.05) −0025∗ (−0.48, −0.01)
Research −0073 (−2.14, 0.48) −0058∗ (−0.89, −0.26) −0032∗ (−0.57, −0.05)
Late-period −0054 (−1.95, 0.74) −0030 (−0.65, 0.04) −0004 (−0.25, 0.17)
ln d — — 00038∗ (0.002, 0.074)
2 — — 0066 (0.51, 0.86)
Variance–covariance parameters:
2
1051 (0.59, 2.86)
2
−0005 (−0.81, 0.69) 1034 (1.00, 1.70)
2 0021 (−0.47, 0.79) 0023∗ (0.04, 0.42) 0067 (0.52, 0.83)
or regional market. Customers who used the online To further check the goodness of fit of our model,
channel for the first-time transaction have a signifi- we plot diagnostic graphs for cumulative number of
cantly lower transaction rate and gross margin than repeated transactions and cumulative gross margin
customers who used the off-line channel for the first- in Figures 1 and 2. We use each individual’s pos-
time transaction. We also find that research-type cus- terior distribution of i to project his or her future
tomers tend to have a lower transaction rate and gross transactions after acquisition, and we compare with
margin than commercial-type customers. The coeffi- the actual transaction data to check the model fit.
cient for late-period is not statistically significant in Other than the 44 months of data we used as a cali-
any of the three processes, suggesting that there is no bration sample for model estimation, we also use an
systematic difference between the customers acquired additional 28 months out-sample data for validation
before 2006 and those acquired after that.16 Finally, test. Similar plots for model diagnostics are used in
the significantly positive coefficient for ln4dij 5 implies Fader and Hardie (2005b) and Abe (2009). We split
that customers tend to increase the purchase quantity customers into two cohorts: customers acquired from
with the length of tenure, perhaps because of their Google search advertising and from other methods.
increased trust in the firm. Figure 1 compares the model fit of the cumulative
The last three rows in Table 5 are estimates for the number of repeated transactions, and Figure 2 com-
variance–covariance matrix. The correlation between pares the model fit of the cumulative margin of repeat
the stochastic components in customer lifetime and transactions across these two customer cohorts. In
transaction rate is insignificant, which is consis- both figures, the solid line represents the actual data,
tent with the independence assumption made in the dashed line represents the mean model predic-
Schmittlein et al. (1987) and the result in Abe (2009). tion, and the dotted line represents the 95% confi-
However, we find that the correlation between trans- dence intervals of the model prediction. Both figures
action rate and gross margin is positive and signif- suggest that our model predictions fit with data rea-
icant, indicating that in our data high-valued cus- sonably well in both in-sample (before August 2007)
tomers who purchase more frequently also tend to and out-sample (after August 2007) periods.
buy more each time. We also estimated an alterna- We also plot the time-tracking graphs for the
tive model assuming no correlation between the gross monthly number of repeated transactions and gross
margin and the other two processes. The log marginal margin for in- and out-sample periods on a per-
likelihood of the integrated model is −3,305.38 com- customer basis in Figures 3 and 4. Because the
pared with the alternative model at −3,307.42. The estimation of the customer lifetime value and transac-
Bayes factor calculated from these two likelihoods is tion rate is based on the cumulative numbers rather
7.69, implying that our model is moderately favored than the monthly numbers, and because of demand
over the latter in terms of data fit (Jeffreys 1961). fluctuations that may be due to seasonality, the fit
between model predictions and the actual data is
16
Higher competition for Google search advertising in the later
expected to be worse than that of the cumulative
period may have a more significant impact on the acquisition rate, plots. However, our model still captures the trends
which is not modeled in this paper. of repeated transactions and gross margin in the two
Chan, Wu, and Xie: Measuring the Lifetime Value of Customers Acquired from Google Search Ads
Marketing Science 30(5), pp. 837–850, © 2011 INFORMS 845
Figure 1 Time-Series Tracking Plot for the Cumulative Number of Repeated Transactions
Actual
Model
800 95% CI
600
Count
400
200
Jan. 2004 Jan. 2005 Jan. 2006 Jan. 2007 Jan. 2008 Jan. 2009 Jan. 2004 Jan. 2005 Jan. 2006 Jan. 2007 Jan. 2008 Jan. 2009 Jan. 2010
Google customers Non-Google customers
Figure 2 Time-Series Tracking Plot for the Cumulative Gross Margin of Repeated Transactions
Actual
200 Model
95% CI
Gross margin (000$)
150
100
50
Jan. 2004 Jan. 2005 Jan. 2006 Jan. 2007 Jan. 2008 Jan. 2009 Jan. 2004 Jan. 2005 Jan. 2006 Jan. 2007 Jan. 2008 Jan. 2009 Jan. 2010
Google customers Non-Google customers
figures. Note that the decline of repeated transac- sample, the correlation is high at 0.92 for the number
tions and gross margin in out-sample periods is due of repeated transactions and 0.86 for the gross margin.
to customer dropouts, as customers acquired after The correlations are also reasonably high for the vali-
August 2007 are not accounted for in this exercise. dation sample. We also use time-series mean absolute
Both model predictions and actual data suggest that,
on a per-customer basis, Google-acquired customers
Table 6 Model Fit Statistics
make more repeated transactions and also generate
higher gross margins than customers acquired from Number of Gross margin of
other methods. transactions transactions
Table 6 presents sample fit statistics at the disaggre- Disaggregate measure: Correlation
gate and aggregate levels for the number of repeated Calibration period 0092 0086
transactions and gross margin amount. These statis- Validation period 0073 0058
tics have also been used in the previous literature Total period 0088 0073
to check the model fit (see Abe 2009, for example). Aggregate measure: Time-series MAPE
The correlation between predicted and actual trans- Calibration period (%) 2102 6208
actions across individual customers is used for the Validation period (%) 208 202
Total period (%) 1307 3702
disaggregate measure of sample fit. For the calibration
Chan, Wu, and Xie: Measuring the Lifetime Value of Customers Acquired from Google Search Ads
846 Marketing Science 30(5), pp. 837–850, © 2011 INFORMS
Figure 3 Time-Series Tracking Plot for the Average Monthly Number of Repeated Transactions per Customer
0.10
0.05
0.00
Jan. 2004 Jan. 2005 Jan. 2006 Jan. 2007 Jan. 2008 Jan. 2009 Jan. 2004 Jan. 2005 Jan. 2006 Jan. 2007 Jan. 2008 Jan. 2009 Jan. 2010
Google customers Non-Google customers
Figure 4 Time-Series Tracking Plot for the Average Monthly Gross Margin of Repeated Transactions per Customer
40
30
20
10
Jan. 2004 Jan. 2005 Jan. 2006 Jan. 2007 Jan. 2008 Jan. 2009 Jan. 2004 Jan. 2005 Jan. 2006 Jan. 2007 Jan. 2008 Jan. 2009 Jan. 2010
Google customers Non-Google customers
percentage errors (MAPE) to check the sample fit at from Google on average have a higher lifetime value
the aggregate level. The measure for either repeated (mean CLV at $1,002) than customers acquired from
transactions or gross margin amount in the validation other channels (mean CLV at $808). The difference
period is smaller than that in the calibration period. is even larger for those whose first-time purchase
This is because we have few observations in the very
early period; hence the prediction errors are larger in
the earlier periods than in later periods (see the mis- Table 7 Median and Mean of Projected CLV Across Customer Cohorts
match of predictions and observed data on the left (In Dollars)
side of Figures 3 and 4). First-time transaction channel
Note. These values are calculated for a “typical” customer who is a commercial customer acquired in the first two
years and who used the off-line channel for his or her first-time transaction with the firm; numbers in parentheses
indicate the 2.5 and 97.5 percentiles.
was off-line (mean CLV at $1,226 versus $959, respec- Table 9 Decomposition of the Google Value
tively), implying that the customer value would be
Expected customer lifetime value
underestimated had we only focused on online trans-
actions. Finally, we notice that the mean CLV is Gross margin ($) % Increase
always much larger than the median, implying that Non-Google 11126 (597, 1,989) —
the distribution of CLV is right-skewed, which is con- Google—Lifetime 11150 (471, 2,162) 2 (−43, 59)
sistent with observations in many industries. Google—Transaction rate 11919 (871, 3,759) 70 (7, 156)
Google—Gross margin 11300 (651, 2,400) 15 (−13, 52)
To understand the implications of the coefficients
Interaction effect — 11 (−43, 75)
for google in Table 5, we assume a typical commercial- Google 21227 (853, 4,609) 98 (2, 230)
type customer who was acquired before 2006 from
Note. These values are calculated for a “typical” customer who is a com-
Google search advertising (Google customer) and
mercial customer acquired in the first two years and who used the off-line
made her first-time transaction off-line. We simulate channel for his or her first-time transaction with the firm; numbers in paren-
her transactions for 10 years in the future17 and com- theses indicate the 2.5 and 97.5 percentiles.
pare the result with another typical commercial-type
customer acquired before 2006 through other meth-
Based on the simulation result, we project the dif-
ods and made first-time transaction off-line (non-
ference in CLV between the above Google customer
Google customer). The comparison is made along and non-Google customer to be $1,101.18 We fur-
the following three dimensions: probability of being ther explore the factors driving this Google value by
“alive,” expected number of repeat transactions, and decomposing such incremental value into four parts,
the expected gross margin of transactions, in each the added value resulting from (1) a longer lifetime,
year. The result is summarized in Table 8. For illus- (2) a higher transaction rate, (3) an increased gross
tration, the expected transaction rate and gross mar- margin, and (4) the interaction between the above
gin are not conditional on being alive; hence both three effects (e.g., a higher transaction rate from the
measurements decline over time. In the first year, the Google customer leads to a larger gross margin).
probability of being alive is approximately 80% for To calculate the first effect, we assume a “Google-
both. But the Google customer purchases about 1.8 lifetime” condition in which only the parameter value
times, whereas the non-Google customer purchases of for a hypothetical customer is the same as the
only once. As a result, the expected gross margin Google customer, whereas the value of other param-
in the first year for the Google customer is 94% eters remain the same as the non-Google customer.
higher than the non-Google customer. The probabil- We calculate the second and the third effects using a
ity of remaining as a customer decreases over time. similar method. The last effect is calculated by sub-
In the 10th year, our model predicts the probabilities tracting the sum of the first three effects from the dif-
of being alive for the two segments are 35% and 31%. ference in CLV between the Google and non-Google
The projected gross margin of the Google customer customers ($1,101). Table 9 reports the results. The
is approximately 108% higher than the non-Google majority of the Google value comes from a higher
customer.
18
In this analysis we use the posterior parameter estimates for a
17
We utilize the last 10,000 MCMC draws in the model estimation commercial customer. As commercial customers tend to have larger
to simulate future transactions. Results in Table 7 are based on the values than research customers, the number reported here is also
posterior means from those draws. larger than the pooled mean reported in Table 7.
Chan, Wu, and Xie: Measuring the Lifetime Value of Customers Acquired from Google Search Ads
848 Marketing Science 30(5), pp. 837–850, © 2011 INFORMS
VCA and breakeven CPC ($) Total VCA Average VCA Breakeven CPC Total VCA Average VCA Breakeven CPC
Notes. Numbers in parentheses indicate the 2.5 and 97.5 percentiles. Because the one-time transaction value is directly summarized over observed data, the
corresponding estimates do not have confidence intervals.
transaction rate (70%). The longer lifetime contributes for search keywords at Google despite the increasing
2%, the larger gross margin contributes another 15%, costs and the economic downturn.
and the remaining 11% comes from the interaction. One implicit assumption in our calculation is
that customers acquired from Google would not be
4.3. Value of Customer Acquisition from acquired from other channels if the firm did not invest
Google Search Advertising in Google search advertising. We think this assump-
Under the assumptions that we will further discuss, tion may not be unreasonable because of several
we can now compute the value of customer acqui- observations from the data. First, the firm has only
sition (VCA) as the difference between the CLV and spent a small amount on other marketing promotions
the overall search advertising cost incurred by the and traditionally relied on word of mouth. Google
company. The advertising cost can be divided into advertising is likely to help the firm reach very dif-
the acquisition cost, which is the cost incurred by ferent customers. For example, we find from data
acquiring a new customer through Google, and the that the proportion of research-type customers rela-
retention cost, which in our empirical context is the tive to commercial-type acquired from Google is sig-
cost incurred by future click-throughs from customers nificantly higher than that from other methods (70%
after acquisition. The latter cost is negligible in our versus 45%, respectively). Second, potential customers
data: after matching the transaction records of exist- are unlikely to find the firm’s website from organic
ing customers with Google referrals, we find that the search results, as we find that its website is consis-
average click-throughs per transaction for customers tently placed beyond 20 Google organic search result
acquired from Google in their subsequent transactions pages for all the keywords it bid. This implies that
is 2.2. As the average CPC is $0.53, this implies an any substitution between sponsored search advertis-
average retention cost of $1.2 per future transaction ing and organic results is negligible. Finally, we find
compared with the per-transaction gross margin of from data that the number of customers generated
$200.19 Thus, in the VCA calculation we only consider from other methods has been increasing every year,
the acquisition cost. This is calculated as the average implying that search adverting may not have can-
CPC divided by the acquisition rate among potential nibalized customer acquisition from other methods.
customers, which is 1.02% including both online and Even if this assumption is incorrect and we have over-
off-line purchases in our data (see Table 3). estimated the true VCA, we believe our policy evalu-
To address the concern of increasing CPC faced by ations will remain valid based on the high estimates
the owner of the firm, we calculate the corresponding presented previously.
profit breakeven CPC for each of the above scenarios.
Using the conventional method, the breakeven CPC is
only $0.37, far lower than the CPC at $0.80 in the last 5. Conclusions, Limitations, and
year of the data. However, when we account for both Future Research
the cross-channel spillover effect and the long-term We argue in this paper that the conventional method
effect, the breakeven CPC would increase to $10.22. of measuring the profit impact of search advertising
The two estimates offer very different policy impli- in the industry may be seriously biased for two rea-
cations. In contrast to the conventional method, our sons. First, by focusing only on online transactions,
results imply that investing in Google search adver- we ignore the potential cross-channel sales spillover
tising has been very profitable to the firm, which from search advertising to off-line channels. Second,
the owner of the firm seems to agree. He clearly the long-term profit impact of new customers has not
stated that he would continue his aggressive bidding been considered. Our goal in this study is to develop
an empirical method to estimate the value of cus-
19
This implies that the cost of “reacquiring” existing customers at tomers acquired from search advertising by explicitly
Google is very small, at least in our empirical context. accounting for these two factors.
Chan, Wu, and Xie: Measuring the Lifetime Value of Customers Acquired from Google Search Ads
Marketing Science 30(5), pp. 837–850, © 2011 INFORMS 849
Katona, Z., M. Sarvary. 2010. The race for sponsored links: Bidding Schmittlein, D. C., R. A. Peterson. 1994. Customer base analysis:
patterns for search advertising. Marketing Sci. 29(2) 199–215. An industrial purchase process application. Marketing Sci. 13(1)
Performics. 2007. DoubleClick Performics 50 search trend report Q4 41–67.
2006. Press release (March 21), http://www.performics.com/ Schmittlein, D. C., D. G. Morrison, R. Colombo. 1987. Counting
news-room/press-releases/doubleclick-performics-50-search your customers: Who are they and what will they do next?
-trend/411. Management Sci. 33(1) 1–24.
Reinartz, W. J., V. Kumar. 2000. On the profitability of long-life cus- Singh, S. S., S. Borle, D. C. Jain. 2009. A generalized framework
tomers in a noncontractual settings: An empirical investigation for estimating customer lifetime value when customer lifetimes
and implications for marketing. J. Marketing 64(4) 17–35. are not observed. Quant. Marketing Econom. 7(2) 181–205.
Rust, R. T., T. Ambler, G. S. Carpenter, V. Kumar, R. K. Srivastava. Venkatesan, R., V. Kumar, T. Bohling. 2007. Optimal customer re-
2004. Measuring marketing productivity: Current knowledge lationship management using Bayesian decision theory: An
and future directions. J. Marketing 68(4) 76–89. application for customer selection. J. Marketing Res. 44(4)
Rutz, O. J., R. E. Bucklin. 2011. From generic to branded: A model 579–594.
of spillover dynamics in paid search advertising. J. Marketing Verhoef, P. C., B. Donkers. 2005. The effect of acquisition channels
Res. 48(1) 87–102. on customer loyalty and cross-buying. J. Interactive Marketing
Ryan, K. M. 2006. Search’s role in the buying cycle. iMedia 19(2) 31–43.
Connection (March 29), http://www.imediaconnection.com/ Yao, S., C. F. Mela. 2009. A dynamic model of sponsored search
printpage/printpage.aspx?id=8850. advertising. Marketing Sci. 30(4) 702–716.
Copyright 2011, by INFORMS, all rights reserved. Copyright of Marketing Science is the property of
INFORMS: Institute for Operations Research and its content may not be copied or emailed to multiple sites or
posted to a listserv without the copyright holder's express written permission. However, users may print,
download, or email articles for individual use.