Professional Documents
Culture Documents
customer segmentation and customised services prediction model", International Journal of Retail &
Distribution Management, Vol. 46 Issue: 4, pp.406-420, https://doi.org/10.1108/IJRDM-06-2017-0130
Permanent link to this document:
https://doi.org/10.1108/IJRDM-06-2017-0130
Downloaded on: 22 July 2018, At: 03:50 (PT)
References: this document contains references to 46 other documents.
To copy this document: permissions@emeraldinsight.com
The fulltext of this document has been downloaded 405 times since 2018*
Users who downloaded this article also downloaded:
(2015),"Consumer attitudes towards online shopping: The effects of trust, perceived benefits, and
perceived web quality", Internet Research, Vol. 25 Iss 5 pp. 707-733 <a href="https://doi.org/10.1108/
IntR-05-2014-0146">https://doi.org/10.1108/IntR-05-2014-0146</a>
(2014),"Moderating effects of online shopping experience on customer satisfaction and repurchase
intentions", International Journal of Retail & Distribution Management, Vol. 42 Iss 3 pp.
187-204 <a href="https://doi.org/10.1108/IJRDM-03-2012-0034">https://doi.org/10.1108/
IJRDM-03-2012-0034</a>
Access to this document was granted through an Emerald subscription provided by emerald-
srm:320271 []
For Authors
If you would like to write for this, or any other Emerald publication, then please use our Emerald
for Authors service information about how to choose which publication to write for and submission
guidelines are available for all. Please visit www.emeraldinsight.com/authors for more information.
About Emerald www.emeraldinsight.com
Emerald is a global publisher linking research and practice to the benefit of society. The company
manages a portfolio of more than 290 journals and over 2,350 books and book series volumes, as
well as providing an extensive range of online products and additional customer resources and
services.
Emerald is both COUNTER 4 and TRANSFER compliant. The organization is a partner of the
Committee on Publication Ethics (COPE) and also works with Portico and the LOCKSS initiative for
digital archive preservation.
IJRDM
46,4 Customer online shopping
experience data analytics
Integrated customer segmentation and
406 customised services prediction model
Received 9 June 2017
Eugene Wong and Yan Wei
Revised 28 September 2017 Department of Supply Chain and Information Management,
29 January 2018
Hang Seng Management College, Hong Kong
Downloaded by UNIVERSITY OF NEW ENGLAND (AUS) At 03:50 22 July 2018 (PT)
Abstract
Purpose – The purpose of this paper is to develop a customer online behaviour analysis tool, segment
high-value customers, analyse their online purchasing behaviour and predict their next purchases from an
online air travel corporation.
Design/methodology/approach – An operations review of the customer online shopping process of an
online travel agency (OTA) is conducted. A customer online shopping behaviour analysis tool is developed.
The tool integrates competitors’ pricing data mining, customer segmentation and predictive analysis.
The impacts of competitors’ price changes on customer purchasing decisions regarding the OTA’s products
are evaluated. The integrated model for mining pricing data, identifying potential customers and predicting
their next purchases helps the OTA recommend tailored product packages to its individual customers with
reference to their travel patterns.
Findings – In the customer segmentation analysis, 110,840 customers are identified and segmented based on
their purchasing behaviour. The relationship between the purchasing behaviour in an OTA and the price
changes of different OTAs are analysed. There is a significant relationship between the flight duration time
and the purchase lead time. The next travel destinations of segmented high-value customers are predicted
with reference to their travel patterns and the significance of the relationships between destination pairs.
Practical implications – The developed model contributes to pricing evaluation, customer segmentation
and package customization for online customers.
Originality/value – This study provides novel method and insights into customer behaviour towards OTAs
through an integrated model of customer segmentation, customer behaviour and prediction analysis.
Keywords Customer segmentation, Data mining, Big data analytics, Customer purchase behaviour,
Online travel agencies, Recency frequency and monetary
Paper type Research paper
1. Introduction
Online shopping is increasing continuously, with 1.61 billion people worldwide
purchasing online, amounting to USD1.9 trillion in 2016. The number of people
purchasing goods and services online is expected to increase to over two billion in 2019
and the global e-retail sales amount is expected to be more than double in 2020 (Statistia,
2017). Global online travel sales have also increased in recent years. Ostdick (2017)
reported that over 148 million travel bookings are made on the internet every year and
that 57 per cent of travel reservations are made online. Travel e-commerce sites now offer
diverse products with flight, hotel and car rental packages. These online packages
are offered by various parties, such as airlines (e.g. Lufthansa and Cathay Pacific) and
online travel agencies (OTAs) (e.g. Expedia, CTrip and Zuji). Statistia (2017) indicated
that online travel sales worldwide would increase from USD564.9 billion in 2016 to
USD817.5 billion by 2020. Travel companies, especially OTAs facing severe competition
International Journal of Retail &
Distribution Management from emerging OTAs in terms of pricing, quality and cost, are actively exploring
Vol. 46 No. 4, 2018
pp. 406-420
opportunities in the online travel business. TrekkSoft (2016) revealed that tour and
© Emerald Publishing Limited
0959-0552
activity providers are implementing online booking systems, as the online channel tends
DOI 10.1108/IJRDM-06-2017-0130 to receive 30-40 per cent more bookings after implementing such systems.
With the increasing number of travellers reviewing travel websites and booking tickets Customer
online, OTAs are seeking to leverage big data analytics to increase their marketing and sales online
by providing targeted customers with customised travel packages in a more proactive shopping
manner. Using massive data sets to identify high-valued customers and analyse their online
shopping behaviour is critical for OTAs to increase their market share. Determining potential experience
customers, understanding their online purchasing behaviour and predicting their next
purchases are the key factors for developing a successful online travel business. Empirical 407
studies have been carried out to understand the attributes and determinants in the selection of
OTAs (Ha and Ha, 2016; Kim et al., 2007a, b; Li and Suomi, 2007). Other studies have carried
out hypothesis to examine the motivations of online purchasing (Huang et al., 2017; Bilgihan
et al., 2016; Amaro and Duarte, 2015). To help OTAs identify and segment customers
Downloaded by UNIVERSITY OF NEW ENGLAND (AUS) At 03:50 22 July 2018 (PT)
according to their online purchasing behaviour, various data mining models and rules have
been developed to systematically profile profitable customers. These models include the
recency, frequency and monetary (RFM) model (Dursun and Caber, 2016; Mohammadian and
Makhani, 2016; Lumsden et al., 2008), customer lifetime value (CLV) (Wang, 2015; Kumar and
Rajan, 2009), the analytic hierarchy process (Lin et al., 2016; Chiang, 2014) and association
rules (Schiaffino and Amandi, 2009; Leung et al., 2006).
Transforming the traditional business of travel agencies, OTAs are continuously
enhancing the perceived values of products by allowing customers to access and purchase
broad assortments of travel packages online in a user-friendly, low-cost and customised way.
This is achieved via the use of data mining tools to obtain massive competitors’ pricing data
and customers’ online purchasing usage data, and via the use of big data analytics tools to
analyse customers’ behaviour and predict their next purchases (Wang and Wang, 2010;
Demangeot and Broderick, 2007; Kamarulzaman, 2007; Wong et al., 2006). (Kim et al., 2007a, b)
indicated pricing as the most critical attribute for customers in purchasing air flight tickets,
followed by security. Hao et al. (2015) demonstrated the importance of customer satisfaction
on OTA websites. Big data analytics is the process of examining massive data sets and
revealing hidden patterns, unknown correlations, market trends and customer preferences
(White, 2012). Thus, it is important for online retail businesses with huge petabytes of
information about transactions, customer behaviour and purchasing record to collect and
analyse such data every hour (Bradlow et al., 2017). In this study, an integrated data mining
and analytics model is developed; thereafter an operations review is conducted in the Hong
Kong office of an international OTA. The online flight ticket pricing figures of the OTA’s
competitors are crawled in the model for real-time data analysis. Customer data are profiled
and segmented for passenger travel behaviour analysis. Ten travel destinations are selected
for review. The ticket prices of the selected OTA and five of its rival OTAs are obtained
through Web data crawling every day for five months. Overall, 110,843 customer profiles and
purchasing values in 2015 and 2016 are attained and segmented using the RFM and CLV
models. High-value and profitable customers are identified. The developed system predicts the
customers’ next destinations to facilitate the promotion of customised packages to potential
clients. The model’s framework is shown in Figure 1.
408
Customer
segmentation
via RFM and CLV
Downloaded by UNIVERSITY OF NEW ENGLAND (AUS) At 03:50 22 July 2018 (PT)
Predictive analysis
indicated that RFM values should be firm-specific and based on the nature of the products.
Customer behaviour can be quantified through the RFM model using the customer quintile
method (Miglautsch, 2000) or behaviour quintile method (McCarty and Hastak, 2007).
As a quintile is a group of numbers representing 20 per cent of the values in a larger dataset,
it has the advantage of having equal numbers of customers in each segment. A customer
quintile follows this principle in segmenting customers equally by comparing them with
designated RFM parameters and scores. The customers with the highest RFM scores are
generally categorised as the most profitable (Hosseini et al., 2010). However, segmentation via
the quintile method may be biased if a high percentage of customers has only purchased once in
the frequency measurement. This may result in two or three quintiles having identical
behaviour; that is, the customers in frequency scores 2 and 3 behave identically to those in score
1. The behaviour quintile method sorts customers by generating arbitrary cutoffs on percentage
behaviour for solving the sensitivity problem of the customer quintile method (Dursun and
Caber, 2016). It has the advantage of clustering customers with similar behaviour.
Overall, 110,843 customers from the transaction data are obtained and segmented
according to the customer quintile method. A recency scoring scheme is established by scoring
one of the five intervals from 1 to 5 for each customer identification number based on the
customers’ travel booking dates in decreasing order. For example, a customer with a booking
date of yesterday would have a score of 5. Table I shows the quintile scoring of recency.
by OTA x. The price data of each competitor are collected online by a data crawling
programme at the three internet rush hour times (i.e. 12:00 a.m., 10:00 a.m. and 2:00 p.m.).
The departure dates of the crawled tickets are two weeks after the crawling dates. The data
collected above are from 23 November 2016 to 31 December 2016. These dates are indexed
as the 0th to 38th days.
This analysis uses a linear function to determine the relationship between the ticket price
fluctuations of OTA x’s five competitors and the OTA x purchase number. The notations are
as follows:
• Ytm: the purchase number at OTA x on the tth day for destination m, t ¼ 1, …, 38,
m ¼ 1, …, 10.
• Pitm: the average price of competitor i on the tth day for destination m, calculated as
the average of the three prices recorded at 12:00 a.m., 10:00 a.m. and 2:00 p.m., i ¼ 1,
…, 5; t ¼ 1, …, 38; m ¼ 1, …, 10.
• ΔPitm: the price fluctuation of competitor i on the tth day for destination m, which
equals the difference in the average prices on the tth day and on the t−1th day, i.e.
ΔPitm ¼ Pitm−Pit−1m, i ¼ 1, …, 5; t ¼ 1, …, 38; m ¼ 1, …, 10.
Then the regression model can be represented as follows:
X
5
Y tm ¼ bm þ aim DP itm þetm t ¼ 1; . . .; 38; m ¼ 1; . . .; 10 (1)
i¼1
where αim and βm are the coefficient of the ith competitor’s price fluctuation and the
intercept term of the regression model for destination m, respectively, εtm is the error term of
the tth observation of destination m.
The results of the regression models for the ten destinations are shown in Tables II
and III. The p-values of the coefficients in Table III are all greater than 0.05. This shows that
there is no significant relationship between OTA x’s customer purchasing behaviour and its
competitors’ ticket price fluctuations against those of the previous days. It could be
observed that insignificant results would be the number of purchases obtained on the
purchasing date in the dataset is focused on the traditional analysis of data crawling on
two weeks before the departure date. Further development can be carried out on the
regression analysis of the dataset to extend the purchasing dates on which for each
departure date of t, the average price of competitor i for destination m, Pitm, is collected for
t ¼ t14; . . .; t1.
• ΔPit: the price fluctuation of OTA x on tth day for destination i, which equals
the difference in the average prices on the tth day and on the t−1th i.e. ΔPit
¼ Pit − Pit−1; i ¼ 1, …,10; t ¼ 1, …, 50.
Then the regression model can be represented as follows:
X
10
Y tm ¼ bm þ aim DP it þetm t ¼ 1; . . .; 50; m ¼ 1; . . .; 10 (2)
i¼1
where αim and βm are the coefficient of the ith destination’s price fluctuation within OTA x
and the intercept term of the regression model for destination m, respectively, εtm is the error
term of the tth observation of the regression model for destination m.
significant relationship between the purchase behaviour for the ith destination and the price
fluctuations of the other destinations of OTA x.
m Des. α1m α2m α3m α4m α5m α6m α7m α8m α9m α10m βm
1 SIN 0.0004 −0.0008 0.0000 0.0001 0.0000 0.0030 0.0003 0.0008 −0.0001 0.0000 5.9
2 SHA −0.0007 −0.0007 0.0022 0.0000 0.0026 −0.0011 −0.0005 −0.0006 −0.0003 0.0001 5.3
3 BKK 0.0008 0.0007 0.0016 0.0003 0.0002 0.0053 0.0010 0.0025 −0.0003 −0.0006 17.6
4 TYO 0.0002 −0.0036 0.0068 −0.0013 0.0007 0.0108 0.0015 0.0001 −0.0001 −0.0002 11.2
5 TPE 0.0041 0.0122 −0.0021 0.0031 −0.0010 0.0071 0.0010 −0.0062 0.0011 0.0001 47.9
Table IV. 6 KHH 0.0000 0.0012 −0.0001 0.0000 0.0011 0.0015 0.0001 0.0005 −0.0003 0.0001 4.0
Values of the 7 OSA 0.0001 −0.0005 −0.0012 0.0002 0.0019 0.0031 −0.0009 0.0020 −0.0007 −0.0002 7.1
coefficients and 8 SEL 0.0004 0.0041 −0.0052 0.0002 −0.0008 0.0030 0.0003 0.0023 0.0006 0.0001 11.4
intercept of each 9 MNL 0.0002 0.0002 0.0035 0.0004 −0.0020 0.0031 0.0008 −0.0022 0.0015 0.0000 4.0
destination 10 LHR −0.0001 0.0009 0.0006 0.0002 0.0012 −0.0023 −0.0001 −0.0010 0.0002 0.0000 1.8
m Des. α1m α2m α3m α4m α5m α6m α7m α8m α9m α10m βm
1 SIN 0.643 0.587 0.991 0.926 0.993 0.414 0.779 0.534 0.883 0.999 0.000
2 SHA 0.243 0.543 0.228 0.986 0.320 0.695 0.528 0.548 0.708 0.282 0.000
3 BKK 0.666 0.839 0.767 0.850 0.982 0.534 0.631 0.425 0.913 0.102 0.000
4 TYO 0.952 0.437 0.346 0.592 0.947 0.346 0.604 0.985 0.969 0.747 0.000
5 TPE 0.547 0.335 0.912 0.631 0.971 0.818 0.892 0.582 0.896 0.950 0.000
Table V. 6 KHH 0.959 0.313 0.947 0.999 0.647 0.602 0.934 0.617 0.718 0.352 0.000
p-values of the 7 OSA 0.960 0.832 0.748 0.845 0.717 0.607 0.574 0.361 0.670 0.343 0.000
coefficients and 8 SEL 0.755 0.127 0.214 0.865 0.894 0.641 0.875 0.342 0.716 0.649 0.000
intercept of each 9 MNL 0.762 0.827 0.039 0.457 0.404 0.241 0.202 0.022 0.042 0.897 0.000
destination 10 LHR 0.865 0.278 0.644 0.555 0.492 0.228 0.762 0.173 0.753 0.545 0.000
To investigate the efficiency of different channels for OTA x, the conversion rates of Customer
various channels are studied with the use of Google Analytics engine (Table VI). online
The conversion rate is the percentage of visits that result in an e-commerce transaction. shopping
Conversion rates are calculated by dividing the number of conversions by the number of
total ad clicks that can be tracked to a conversion during the same period. If one is tracking experience
more than one conversion action or choosing to count every conversion, the conversion rate
obtained may be over 100 per cent, as more than one conversion may be counted for each 413
click, similar to the results in Table VI. The top three conversion rates are 58.59 per cent,
36.5 per cent and 29.75 per cent accessed via direct, organic and paid search, respectively.
The relatively high organic search rate means that people searching for OTA x actually
click on its website from the listing, suggesting that adding keywords, such as “OTA x” to
Downloaded by UNIVERSITY OF NEW ENGLAND (AUS) At 03:50 22 July 2018 (PT)
the relevant AdWords campaigns may help the agency attract more customers.
90
Average date difference
414 80
70
60
Downloaded by UNIVERSITY OF NEW ENGLAND (AUS) At 03:50 22 July 2018 (PT)
50
40
30
20
Figure 2. 0 2 4 6 8 10 12 14 16
Scatter plot Time(hr)
of the data R2
Note: The line indicates the predictions and the points are the actual data
Holidays Des. 1 Des. 2 Des. 3 Des. 4 Des. 5 Des. 6 Des. 7 Des. 8 Des. 9 Des. 10
Python programme, pricing data in each captured timeframe are stored in the database
along with the customer data. Customer profiles are segmented using the weighted RFM
model, and the customer purchasing behaviour in response to competitors’ price changes is
further analysed. The next destinations of identified high-value customers are predicted
using a priori association rules (Borgelt and Kruse, 2002) and customised packages are
promoted to targeted customer segments.
Des. Des. Des. Des. Des. Des. Des. Des. Des. Des.
Holidays 1 2 3 4 5 6 7 8 9 10 Average
Lunar New Year 22.8 21.4 25.0 22.4 22.9 26.4 24.2 23.5 21.6 19.6 23.0
Easter 40.0 39.1 37.2 41.2 37.0 39.2 42.3 41.1 39.7 41.9 39.9
Ching Ming Festival 38.3 36.8 36.3 45.7 38.2 35.2 41.1 37.7 39.7 52.1 40.4
Labour Day 39.2 39.6 49.1 32.6 37.6 37.7 38.2 42.6 33.7 52.9 40.3
Birthday of Buddha 33.2 31.3 39.8 40.9 41.5 45.1 45.0 49.5 32.7 62.9 42.2
Tuen Ng Festival 48.2 34.6 45.7 44.3 44.3 57.0 54.9 55.4 39.0 66.5 49.0
HKSAR Establishment
Day 50.8 37.1 53.6 50.8 45.5 65.4 56.6 72.1 41.3 61.1 53.4
Mid-Autumn Festival 61.2 44.2 58.8 54.9 51.0 57.4 42.7 63.1 45.8 61.8 54.1 Table VIII.
National Day 58.9 42.7 50.3 48.8 61.0 45.9 39.8 57.6 43.6 60.2 50.9 Average purchase
Chung Yeung Festival 61.2 47.1 61.3 43.8 57.0 47.0 63.9 69.9 41.2 59.8 55.2 lead time (days) for
Christmas 72.7 74.9 74.5 73.7 75.7 77.7 67.6 108.0 71.2 76.6 77.3 holiday seasons
An association rule in the form of X⇒Y (X ⊆ I,Y ⊆ I ) indicates that if a customer buys X, he
or she is also likely to buy Y. There are three main measures in the association rules
analysis, including support, confidence and lift:
• Support (X, D). This indicates the popularity of X in database D, calculated as the
proportion of the transactions in which X appears:
Occurrence of X
Support ðX ; DÞ ¼
m
• Support (X⇒Y ). The proportion of transactions in which both X and Y appear:
Confidence(X⇒Y ) ¼ Support(X∪Y,D)/Support(X,D)
• Lift(X⇒Y ). This value indicates the enhanced degree of a targeted marketing
strategy based on X⇒Y against a random choice targeted marketing strategy.
The lift of the rule also represents the probability of purchasing Y when X is
purchased, but with the consideration of Y’s popularity:
Support ðX [Y ; DÞ
LiftðX ) Y Þ ¼
Support ðX ; DÞ Support ðY ; DÞ
A lift value greater than 1 shows the degree to which the occurrence of X and Y are associated
to each other and makes the rule X⇒Y potentially useful for the market prediction.
Customer
data
OTA website
. . .
Web Data
Web data Database
Figure 3. server processing Reports
OTA website crawling storage
System flow model tools
(Python) (access) (VBA)
. . .
when the oriented group is the customers who have bought tickets to BKK. The effect of
sending promotion packages for NRT to customers who have bought tickets to ICN would be
greater than sending promotion packages for MNL. These customers would have a higher
chance in further purchase tickets to NRT. Similarly, for the 2016 transaction database, as
shown in Table XI, only one association rule simultaneously satisfies the minimum support
and minimum confidence. As this association rule has a lift value of 3, the promotion based on
this rule would be promoting tickets to NRT to customers who have bought tickets to LAX.
This is due to the attraction of using NRT as the customers’ transshipment choice when they
are travelling to LAX. The promotion strategy based on the mining results would have an
obvious enhanced customer response rate compared to that of a random promotion.
With the identified high-valued customers and the integrated system including pricing,
customer profile and behaviour and service prediction modules, the results of predicting
subsequent destinations have great implications for travel agencies. Online customer behaviour is
captured and analysed, facilitating the formation of promotion strategies, especially for OTAs
with limited resources for promoting packages to millions of consumers. The promotion packages
and advertisements that reach each customer are expected to be tailored and customised in the
era of overwhelming customer promotion information, especially for high-valued customers.
5. Conclusion
Online shopping demands, including travel tickets and packages, are continuously
increasing. Travel companies such as OTAs are actively exploring how customers navigate
OTAs and make their purchase decisions. A process analysis of customer online navigation
and purchasing on OTA x is evaluated in this study. Customer relationship management is
reviewed, and customer segmentation is proposed using weighted RFM and CLV models.
Overall, 110,843 customers are segmented into different customer groups and 0.1 per cent
customers are identified as high-value customers based on their RFM purchasing behaviour.
Rule ID Association rules Support (%) Confidence (%) Lift Table XI.
Profile of the 2016 4.6-5
1 LAX⇒NRT 10.9 60.0 3.00 CLV customer group
IJRDM To further analyse customer purchasing behaviour, transaction data are analysed from
46,4 the perspectives of the price change and lead time of the purchase. Two regression models
are implemented to analyse purchasing behaviour against ticket price change. The first
regression is conducted between OTA x’s customer purchasing behaviour and its
competitors’ ticket price fluctuation against the previous day. The second regression is
conducted on the price changes to various destinations in OTA x against its customers’
418 purchase decisions. The results reveal that the number of purchases in the tested panel data
could be extended from the traditional analysis of two weeks before the departure date to
the daily purchase number over the two weeks before the departure date. With each
departure date, panel data with 14 observations are formulated to analyse and obtain
further results on the relationship between purchase behaviour and price change.
Downloaded by UNIVERSITY OF NEW ENGLAND (AUS) At 03:50 22 July 2018 (PT)
In reviewing the efficiency of different channels for OTA x, the conversion rates
of various channels are evaluated. The top three conversion rates are direct search
(58.59 per cent), organic search (36.5 per cent) and paid search (29.75 per cent). This suggests
that adding keywords of OTA x to relevant AdWords campaigns may help the agency
attract more customers. In the analysis of customer behaviour on purchasing lead time,
two analyses are conducted: one on the relationship between the lead time and flight
duration time and one on the relationship between the lead time and public holidays.
Recommendations have been provided to OTA x that the time in promoting the holiday
flights to the customers should be adjusted. For example, the promotion of Christmas flights
can start as early as the beginning of October. The numeric relationship between the lead
time and flight duration time for OTA x is lead time ¼ 40.11 + 4.409 × flight duration time.
To enhance customer segmentation and behaviour prediction for customising the
product packages, a novel integrated data analytics model on customer segmentation,
competitors’ price change data mining and customer behaviour prediction is developed.
The model is applied in OTA x and insights are provided to the company on its customers’
reactions to the price changes of its ten competitors on the ten most popular travel
destinations. The model can assist OTAs with customer segmentation and future product
package promotion strategies. With the designed model, a COSBA system with competitor
pricing, customer segmentation and customised service prediction modules is developed.
The next destinations of the high-value customers identified are predicted (e.g. ICN and
MNL, ICN and NRT, YVR and ICN, BKK and SIN and LAX and NRT). Future analysis
can be extended using more customer demographic information, behaviour on online
user-interface navigation and the time prediction of customer purchasing. The historical
performance of customers can also be added to enhance the projection accuracy.
References
Adjei, M.T., Noble, S.M. and Noble, C.H. (2009), “The influence of C2C communications in online brand
communities on customer purchase behavior”, Journal of the Academy of Marketing Science,
Vol. 38 No. 5, pp. 634-653.
Amaro, S. and Duarte, P. (2015), “An integrative model of consumers’ intentions to purchase travel
online”, Tourism management, Vol. 46, pp. 64-79.
Ascarza, E., Neslin, S., Netzer, O., Anderson, Z., Fader, P., Gupta, S. and Provost, F. (2018), “In pursuit of
enhanced customer retention management”, Customer Needs and Solutions, Vol. 5 Nos 1/2, pp. 65-81.
Bilgihan, A., Barreda, A., Okumus, F. and Nusair, K. (2016), “Consumer perception of knowledge-sharing
in travel-related online social networks”, Tourism Management, Vol. 52, pp. 287-296.
Borgelt, C. and Kruse, R. (2002), “Induction of association rules: a priori implementation”, Compstat,
Physica-Verlag HD, pp. 395-400.
Bradlow, E.T., Gangwar, M., Kopalle, P. and Voleti, S. (2017), “The role of big data and predictive
analytics in retailing”, Journal of Retailing, Vol. 93 No. 1, pp. 79-95.
Chiang, W.Y. (2014), “Mining high value travelers using a new model designed for online CRM Customer
systems: a case study in Taiwan”, Aviation, Vol. 18 No. 3, pp. 157-160. online
Demangeot, C. and Broderick, A.J. (2007), “Conceptualising consumer behaviour in online shopping shopping
environments”, International Journal of Retail & Distribution Management, Vol. 35 No. 11, pp. 878-894.
Dursun, A. and Caber, M. (2016), “Using data mining techniques for profiling profitable hotel customers:
experience
an application of RFM analysis”, Tourism Management Perspectives, Vol. 18, pp. 153-160.
Fan, Y.W. and Ku, E. (2010), “Customer focus, service process fit and customer relationship 419
management profitability: the effect of knowledge sharing”, The Service Industries Journal,
Vol. 30 No. 2, pp. 203-223.
Government of the Hong Kong Special Administrative Region (2017), Hong Kong in Figures, Hong Kong.
Greenspan, R. (2004), “Internet high on travel destinations”, available at: www.clickz.com/stats/
Downloaded by UNIVERSITY OF NEW ENGLAND (AUS) At 03:50 22 July 2018 (PT)
segmentation approaches based on RFM and demographics analysis”, Kybernetes, Vol. 45 No. 7,
pp. 1129-1157.
Schiaffino, S. and Amandi, A. (2009), “Building an expert travel agent as a software agent”, Expert
Systems with Applications, Vol. 36 No. 2, pp. 1291-1299.
Statistia (2017), “E-commerce worldwide – statistics & facts”, available at: www.statista.com/topics/
871/online-shopping (accessed 10 June 2017).
TrekkSoft (2016), “Online booking and management software for tour operators and activity
providers”, available at: www.trekksoft.com (accessed 10 June 2017).
Uncles, M.D., Dowling, G.R. and Hammond, K. (2003), “Customer loyalty and customer loyalty
programs”, Journal of Consumer Marketing, Vol. 20 No. 4, pp. 294-316.
Venkatesan, R. and Kumar, V. (2004), “A customer lifetime value framework for customer selection and
resource allocation strategy”, Journal of Marketing, Vol. 68 No. 4, pp. 106-125.
Verhoef, P.C. and Lemon, K.N. (2013), “Successful customer value management: key lessons and
emerging trends”, European Management Journal, Vol. 31 No. 1, pp. 1-15.
Visa (2015), “Visa survey: Hong Kong people spend more than half of their holidays travelling abroad”,
Hong Kong.
Wang, H. and Wang, S. (2010), “Predicting mobile hotel reservation adoption: insight from a perceived
value standpoint”, International Journal of Hospitality Management, Vol. 29 No. 4, pp. 598-608.
Wang, J.F. (2015), “Customer lifetime value incorporating negative behavior: measurement and
comparison”, in Zhang, Z., Shen, Z.M., Zhang, J. and Zhang, R. (Eds), LISS, Springer, Berlin and
Heidelberg, pp. 897-901.
White, T. (2012), Hadoop: The Definitive Guide, O’Reilly Media, Inc.
Wong, J.Y., Chen, H.J., Chung, P.H. and Kao, N.C. (2006), “Identifying valuable travelers and their next
foreign destination by the application of data mining techniques”, Asia Pacific Journal of
Tourism Research, Vol. 11 No. 4, pp. 355-373.
Further reading
Gupta, S., Hanssens, D., Hardie, B., Kahn, W., Kumar, V., Lin, N., Ravishanker, N. and Sriram, S. (2006),
“Modeling customer lifetime value”, Journal of Service Research, Vol. 9 No. 2, pp. 139-155.
Wei, J.T., Lin, S.Y. and Wu, H.H. (2010), “A review of the application of RFM model”, African Journal of
Business Management, Vol. 4 No. 19, pp. 4199-4206.
Corresponding author
Eugene Wong can be contacted at: eugenewong@hsmc.edu.hk
For instructions on how to order reprints of this article, please visit our website:
www.emeraldgrouppublishing.com/licensing/reprints.htm
Or contact us for further details: permissions@emeraldinsight.com