You are on page 1of 18

This article was downloaded by: [129.186.138.

35] On: 31 January 2019, At: 00:32


Publisher: Institute for Operations Research and the Management Sciences (INFORMS)
INFORMS is located in Maryland, USA

INFORMS Journal on Computing


Publication details, including instructions for authors and subscription information:
http://pubsonline.informs.org

A New Approach to Real-Time Bidding in Online


Advertisements: Auto Pricing Strategy
Shalinda Adikari, Kaushik Dutta

To cite this article:


Shalinda Adikari, Kaushik Dutta (2019) A New Approach to Real-Time Bidding in Online Advertisements: Auto Pricing Strategy.
INFORMS Journal on Computing

Published online in Articles in Advance 21 Jan 2019

. https://doi.org/10.1287/ijoc.2018.0812

Full terms and conditions of use: https://pubsonline.informs.org/page/terms-and-conditions

This article may be used only for the purposes of research, teaching, and/or private study. Commercial use
or systematic downloading (by robots or other automatic processes) is prohibited without explicit Publisher
approval, unless otherwise noted. For more information, contact permissions@informs.org.

The Publisher does not warrant or guarantee the article’s accuracy, completeness, merchantability, fitness
for a particular purpose, or non-infringement. Descriptions of, or references to, products or publications, or
inclusion of an advertisement in this article, neither constitutes nor implies a guarantee, endorsement, or
support of claims made of that product, publication, or service.

Copyright © 2019, INFORMS

Please scroll down for article—it is on subsequent pages

INFORMS is the largest professional society in the world for professionals in the fields of operations research, management
science, and analytics.
For more information on INFORMS, its publications, membership, or meetings visit http://www.informs.org
INFORMS JOURNAL ON COMPUTING
Articles in Advance, pp. 1–17
http://pubsonline.informs.org/journal/ijoc/ ISSN 1091-9856 (print), ISSN 1526-5528 (online)

A New Approach to Real-Time Bidding in Online Advertisements:


Auto Pricing Strategy
Shalinda Adikari,a Kaushik Duttab
a Department of Information Technology, University of Moratuwa, Moratuwa, Sri Lanka; b Muma College of Business, University of
South Florida, Tampa, Florida 33647
Contact: sadikari@uom.lk, http://orcid.org/0000-0001-6255-0604 (SA); duttak@usf.edu, http://orcid.org/0000-0001-8076-1472 (KD)

Received: September 18, 2015 Abstract. Real-time bidding (RTB) for digital advertising is becoming the norm for
Revised: August 29, 2016; May 29, 2017; improving advertisers’ campaigns. Unlike traditional advertising practices, in the process
January 4, 2018 of RTB, the advertisement slots of a mobile application or a website are mapped to a partic-
Accepted: February 5, 2018 ular advertiser through a real-time auction. The auction is triggered and is held for a few
Published Online in Articles in Advance: milliseconds after an application is launched. As one of the key components of the RTB
January 21, 2019
ecosystem, the demand-side platform gives the advertisers a full pledge window to bid
https://doi.org/10.1287/ijoc.2018.0812 for available impressions. Because of the fast-growing market of mobile applications and
websites, the selection of the most pertinent target audience for a particular advertiser is
Copyright: © 2019 INFORMS not a simple human-mediated process. The real-time programmatic approach has become
popular instead. To address the complexity and dynamic nature of the RTB process, we
propose an auto pricing strategy (APS) approach to determine the applications to bid for
and their respective bid prices from the advertising agencies’ perspective. We apply the
APS to actual RTB data and demonstrate how it outperforms the existing RTB approaches
with a higher conversion rate for a lower target spend.
History: Accepted by Ram Ramesh, Area Editor for Knowledge Management and Machine Learning.
Supplemental Material: A video abstract is available at https://doi.org/10.1287/ijoc.2018.0812.

Keywords: real-time bidding • demand-side platform • bid price • bid request • target audience • dynamic programming • winning rate

1. Introduction (Ghosh et al. 2009). But in the RTB process, the advertis-
With advances in digital advertising, making decisions ers can target not only the context but also the specific
using real-time data is now the norm. One such advance users, which improved their return on investment (ROI)
is autonomous and algorithm-driven real-time bidding (Kohno et al. 2005) with more conversions (numbers of
(RTB), which completes a full transaction in millisec- clicks). With the increased ROI, advertisers with smaller
onds on preset parameters (IAB 2014). In online adver- budgets also have the opportunity to access more appo-
tising, RTB helps publishers sell and advertisers buy site and higher quality impressions. The RTB ecosys-
impressions via a programmatic instantaneous auc- tem largely consists of three components: a demand-
tion. RTB allows advertisers to launch their advertis- side platform (DSP), a seller-side platform (SSP), and an
ing campaigns via multiple ad networks, which enables RTB exchange (or ad exchange). Each component has its
advertisers to buy inventory of advertisement slots in own techniques in the bidding process (IAB 2014). RTB
a cost-effective manner and serve ads to the right per- allows purchase of individual impressions through a
son in the right context at the right time (Lee et al.
bid engine that unfolds within milliseconds when an
2013). According to Econsultancy’s recently published
application is launched by a consumer. As per the work-
“Online Advertisers Survey Report 2013,” 62% of the
flow of publishers and ad agency communication, DSP
650 advertisers surveyed see improved performance as
aids the agencies by planning and executing the ad cam-
the main advantage of RTB. The survey also reported
that the trading desk spend on RTB globally stands at paigns and analyzing the best possible investments on
40% of global digital advertisement spend. Mainly, RTB bidding, to improve the ROI for advertisers (Yahalom
helps reduce the time and effort of manual interven- and Stopel 2013, IAB 2014). The main goal of the DSP is
tion in running a digital advertisement campaign; it also to manage ad campaigns and optimize their RTB activ-
facilitates better targeted marketing strategies in real ities (Zhang et al. 2014), such that the advertisers’ con-
time. straints are satisfied with the optimal price to bid for
In traditional online advertising techniques like each ad call, while maximizing the ad campaign per-
sponsored search or contextual advertising, advertis- formance. An advertising agency is seen as any third
ers enter into an agreement called a guaranteed con- party or in-house team that works on behalf of advertis-
tract with publishers, and the publishers pledge to offer ers to broadcast their advertising performance. An ad
the requested number of impressions to the advertisers exchange is responsible for deciding the winning crite-
1
Adikari and Dutta: New Approach to Real-Time Bidding in Online Advertisements
2 INFORMS Journal on Computing, Articles in Advance, pp. 1–17, © 2019 INFORMS

ria and delivering the winning notification to the rele- target” denotes the advertisers’ target spend, target
vant advertisers through a DSP. audience, the target number of conversions, and the
One of the most frequently cited and employed strate- campaign duration altogether. The “target spend” is
gies for real-time ad allocations is Google’s Adwords the total dollar value an advertiser can spend to buy
(http://www.google.com/adwords), which sacrifices impressions during a particular campaign. The target
the buyer’s ability to control ad placement. In Google audience of an application can be determined based on
Adwords, publishers specify a set of relevant key- the characteristics of users, such as age, income, eth-
words, and the agencies look for impressions that nicity, languages, children, gender, education, etc. The
match their target keywords. However, Google has conversions are also called “actions” and reflect how
complete control over where the advertisement will be users interact with the advertisements, such as clicks,
placed; the advertising agency has no say. Recently, calls, SMS (short message service), views, etc. The cam-
a number of digital advertisement organizations (e.g., paign period is the total duration of an ad campaign. For
inMobi (Inmobi.com), Smaato (Smaato.com), Mopub example, a DSP runs an ad campaign for a day target-
(Mopub.com), and OpenX (Openx.com)) have evolved, ing a Unilever Shampoo product called “Dove” toward
following a more open standard (Open RTB (IAB 2014)) a female audience. Here Unilever is the advertiser and
for advertisements, particularly in the context of the Dove is the product. In this instance, Unilever will set
mobile ecosystem, in which Google holds only 47% of the target spend as $1,000, the campaign period as a
the total market share (Emarketer.com 2014). A num- day, the target audience as 100% Female, and the tar-
ber of algorithms, including plain greedy algorithms, get number of conversions as 2,000 clicks (Adikari and
are followed by DSPs. We discuss a number of such Dutta 2015b).
algorithms in detail in the related work section. All of According to the RTB ecosystem (IAB 2014), the DSP
these are based on the fundamental assumption that needs to determine the best bid price that can win the
future impressions and bidding patterns will follow impression. If the DSP is not interested in winning the
past behavior. impression, then the bid price is recorded as zero in
However, through a real-life data set, Adikari and the bid response. The key problem that we address
Dutta (2015a) observed that in the case of the RTB sys- in this research is how to determine this bid price while
tem, it is difficult to predict a future pattern based on achieving the advertiser target. In relation to the previous
historical data. They demonstrated that forecasting the example, the DSP needs to decide the most appropriate
number of bid requests and the winning bid price based bid prices for a selected set of bid requests that enable
on static historical data has very low accuracy. Any Unilever to achieve the maximum return from the cam-
approach that is solely based on static modeling and paign. The return is the number of actions (clicks) by
one-time optimization derived from longitudinal his- the target audience at a given target spend.
torical data will produce suboptimal results. We have We have made a few clear contributions in this study.
demonstrated that the effectiveness of historical data First, we explain the problem of bid price determi-
in determining future bids diminishes by 400% as the nation from the DSP perspective in an RTB system.
length of historical data is increased from the last 5 min- Second, we present a dynamic programming (DP)-
utes to the last hour. Therefore, this study recommends:
based approach to adjust the bid price of applications.
a novel approach where the bid price and the deci-
Third, we compare the proposed approach with exist-
sion to bid on an impression are decided in real time
ing approaches and demonstrate that the proposed
through constant adjustment of the decision process
approach has better performance in achieving a higher
within very short time periods (5 minutes), based on
number of conversions for the same target spend.
a more recent (last 5 minutes) stream of impressions
The rest of the paper is organized as follows. Sec-
and their bidding results. We have demonstrated our
tion 2 provides an overview of the prior research on
strategy on real mobile RTB campaign data. Through
RTB strategies. In Sections 3–5, we describe the DP
an extensive evaluation, we have demonstrated that
approach in detail, including the problem formation
our approach outperforms existing approaches with
and the proposed algorithm. In Section 6, the bench-
reasonable effectiveness. Our approach improves the
mark approaches are elaborated briefly. Section 7 pro-
conversions (number of clicks) by 19%, compared to a
vides a summary of the data set used to test the
recent benchmark approach with the same campaign
spending. proposed models and describes the analysis of the DP
Among the different types of ad platforms that prac- model, and its performance is compared to the bench-
tice RTB strategies, the main two streams are web and mark approaches. Section 8 discusses future work and
mobile. Almost all RTB systems facilitate both plat- the conclusion for the study.
forms in parallel without much differentiation. Conse-
quently, in this paper, we use the term “application” 2. Related Works
to address any mobile applications or websites that Buying impressions for and allocating them to the
are incorporated with an ad platform. The “advertiser advertisers is the most critical process of DSP. In terms
Adikari and Dutta: New Approach to Real-Time Bidding in Online Advertisements
INFORMS Journal on Computing, Articles in Advance, pp. 1–17, © 2019 INFORMS 3

of selling advertisements, online advertising consists according to target spend and fine-grained impression
of two main dimensions, either “sold in spot” through valuation. Also, Sun et al. (2016) have studied infor-
an auction mechanism, which is called RTB, or in mation revelation design and policies in terms of the
advance via guaranteed contracts (Chen et al. 2014). ad exchange perspective. They discussed one-call and
The research conducted by Ghosh et al. (2009) to two-call approaches. Under the one-call approach, the
allocate impressions randomized to contractual online ad exchange makes one call to all bidders (DSPs) at the
advertising campaigns can be easily applied to the RTB beginning of an auction; under the two-call approach,
context. Ghosh et al. (2009) have developed an adaptive in addition to the one call, there is a second call to
bidding agent for RTB exchanges to learn the distribu- the winning bidder (DSP) at the end of the auction to
tion of bids and then bid according to that. However, match the right advertiser for the impression.
this cannot be practically implemented in the DSP con- In many proposed RTB strategies, optimization plays
text because only partial data are accessible (King and a key role. Zhang et al. (2014) proposed the opti-
Mercer 1985). Chen et al. (2014) developed a mathemat- mal RTB strategy based on the nonlinear relationship
ical model to allocate impressions between real-time between the optimal bid and the conversion rate. In the
auctions and guaranteed contracts; therefore, applica- research on sponsored search auctions (Karande et al.
tion of such a model only in an RTB context will not 2013), in selecting the impressions, DSPs optimized the
be effective. Our proposed strategy focuses only on the advertisers’ ROI and quality of ads. Li and Guan (2014)
RTB context. predicted the bid value while acquiring an impression
Some of the RTB-related studies looked at ad allo- at the lowest cost. Their strategy predicted the win
cation in general from either advertiser or publisher rate and the winning price based on a logistic regres-
perspectives. Rogers et al. (2007) proposed a proba- sion model and then derived the bidding strategy
bilistic model of behaviors of the users (advertisement from the resulting model. Lee et al. (2013) emphasized
viewers) and the advertisers. The authors tested their the challenge of reaching multiple targeted users with
model through simulated comparisons with alterna- a restricted budget. Therefore, they propose a logis-
tive allocation tools, with the objective of identifying
tic regression to select high-quality impressions and
the most effective bid value for each auction and max-
adjust the bid price based on the prior performance.
imizing the number of impressions in a given time
The abovementioned strategies were all based on
frame. Hegeman et al. (2013) emphasize the key criteria
historical data and on the number of bidders, impres-
for building a bidding strategy based on the historical
sions, bid values, and winning prices (Chaitanya and
value of the impression, the time or date of the impres-
Narahari 2012). However, in the RTB process from the
sion, total allocated budget, available budget, the iden-
DSP perspective, the system is dynamic. The param-
tity of the entity requesting, the predicted likelihood
eters used in RTB decisions (such as the number of
that the ad will be selected, the presence of social func-
tionality, total number of impressions of the ad, the bid requests received from each application, the differ-
remaining number of impressions to be achieved, etc. ent types of active applications in a particular period,
Yuan et al. (2013) posit that temporal behaviors—the winning bid price of each application, and number of
frequency and recency of ad displays—would be non- advertisers) are constantly changing. For example, in
trivial attributes when developing an RTB strategy. our data set, we noted that some mobile applications
In addition to the general discussions on bidding are highly active on a particular day with a larger num-
strategies, there are studies on aspects of the online ber of incoming bid requests, but on the next day, some
RTB ecosystem, such as DSP, SSP, and RTB exchanges. of them did not even appear, and others had far fewer
Among the discussions, Chakraborty et al. (2010) have bid requests. For this reason, applying any predictive
come up with a joint optimization framework through logic could be futile: Adikari and Dutta (2015a) tried
online algorithms and stochastic modeling to opti- to forecast bid prices and the number of bid requests
mize the allocation and solicitations in RTB exchanges. for each week of a particular month, based on past per-
Their solution is an online recurrent Bayesian decision formance data from a DSP. According to their results,
framework with Bandwidth-type constraints. Con- both the bid price and the bid request forecasts have
versely, many researchers have attempted to find bet- very low accuracies (15%–66% error in the case of the
ter strategies for the advertisers from the DSP per- bid price and 25%–79% error in the case of the bid
spective. For example, through the greedy approach request count), with a considerable difference between
and linear programming, one bidding strategy opti- the weeks’ results.
mizes both the budget and the bid price and guar- Perlich et al. (2012) used supervised learning algo-
antees the convergence at a locally envy-free equilib- rithms, among other optimization-related techniques,
rium (Chaitanya and Narahari 2012). Chaitanya and for the bid price prediction from the DSP perspec-
Narahari (2012) have theoretically shown that under tive. But the result is not exclusively dependent on
linear programming primal-dual formulation, an RTB the supervised learning algorithm. Analyzing conver-
algorithm can be implemented to adjust bid values sion rates, Yuan et al. (2013) indicate the need for an
Adikari and Dutta: New Approach to Real-Time Bidding in Online Advertisements
4 INFORMS Journal on Computing, Articles in Advance, pp. 1–17, © 2019 INFORMS

optimization algorithm that incorporates factors that We use the following symbols in the previous model.
earlier studies have not adequately addressed, such as
the frequency and recency of the ad displays. Conse-
Decision Variable:
quently, on behalf of advertisers, DSPs need to estab-
K̂ jt 0-1 decision variable, 1 if application
lish bid prices based on the combination of advertisers’
target spend, target audience, and behavior of the pre- j is selected to bid at time t,
vious bid period, which includes winning rates, the 0 otherwise
number of bid requests, and conversion rates. Indices:
Additionally, unlike past studies, we consider some j Index for applications
practical constraints of designing a bidding strategy for t Current instant (time)
DSP that are important for real-life applicability. First, Parameters:
compared to other auction systems, in the RTB process, T Total campaign period
each DSP receives only its own winning bid prices, so Ĉ jt Conversion rate for application j
it does not have similar data for the other DSPs. RTB at time t
exchanges also publish the winning bid average (WBA) V̂jt Number of impressions won in
for each application. However, the WBA is computed application j at time t
at a lower frequency (such as every 24 hours) and is Budget Total campaign budget
based on a longer duration of data (such as a week). P̂ jt Bid price for application j at time t
These data do not add much value other than aiding
the current approach of DSP bidding, bidding higher
The conversion rate, number of impressions won,
than the published WBA of the desired application.
and the bid price for applications are time-variant vari-
Second, the RTB exchange does not send all the bid
ables. However, these parameters are dependent on
requests to every DSP. The bid requests distribution is
many external factors, such as how the RTB engine
based on the agreement between the DSP and the RTB
is distributing the app impressions to various DSPS,
exchange. For these reasons, the global view of the RTB
exchange is unavailable to DSPs. As a result, we cannot how the popularity of an app is changing, and how an
develop a solution that considers the global view of the app is being used. So it is impossible to find a math-
data. Any solution we design should be bound to the ematical representation of a stochastic distribution to
local view of the data, i.e., from a particular DSP per- fit these parameters. The distributions of these param-
spective only, and will be run within the DSP system. eters are unknown, and it is not possible to predict
Furthermore, when an ad campaign is launched, there the future distribution of these parameters to solve
is not enough time to run a learning phase that can the above model. There exist additional complexities:
track down the dynamism entrenched in the bidding (1) the campaign spending needs to be as uniformly
process. Because earlier research does not admit the distributed over the total campaign duration as possi-
dynamism that is embedded in the RTB process, the ble, and (2) the bid price needs to be determined by
existing solutions do not bind with the rapid real-time the DSP. To solve the problem addressing the prior
decision making, whereas the dynamism can be imple- complexities, we resort to a heuristic approach—auto
mented through the proposed approach. Last, this is pricing strategy.
one of the few papers on RTB that uses a real-life data
set, which is described later in the paper.
4. Auto Pricing Strategy
In this section, we present a heuristic, auto pricing
3. Problem Description
Our main objective in the research is to find the appli- strategy (APS), to solve the previous model. The APS
cations to bid at any time (decision variable K̂ jt ) so that helps the DSP maximize the number of conversions by
we maximize the total number of conversions (number making the following decisions:
of clicks, Ĉ jt V̂jt ) in the campaign for the given cam- • Determine how to distribute the budget over the
paign budget (Budget). Thus, we present the model of total campaign period
the problem as follows: • Determine the bid price of each application
• Determine the number of impressions to bid for
T X
Maximize
X
Ĉ jt V̂jt K̂ jt each application.
t0 ∀ j Before describing the approach, we present a list
of notations that are used throughout the paper in
subject to
Table 1.
T X
X To use the full campaign period properly and achieve
P̂ jt V̂jt K̂ jt ≤ Budget the advertiser target, we divide the campaign period
t0 ∀ j
into equal multiple intervals, which are called bid peri-
K̂ jt ∈ {0, 1} ∀ j, t. ods, and denote them as t from this section onward.
Adikari and Dutta: New Approach to Real-Time Bidding in Online Advertisements
INFORMS Journal on Computing, Articles in Advance, pp. 1–17, © 2019 INFORMS 5

Table 1. Notation for the Model Descriptions exogenous factors need to be predetermined by the
DSPs. For example, to distribute and allocate the tar-
Decision variables
get spend for the next bid period and to determine
K j, t Number of impressions selected to bid from
application j at bid period t the bid prices to reach the target winning rate in the
Indices next bid period, DSPs should predetermine the vari-
j Index for applications j  1, . . . , J ations in target spend and bid prices, respectively.
b Index for the selected applications for Hence, we define two other supportive strategies: a
bidding b  1, . . . , B
x Index for application’s target audience
budget allocation strategy and a bid price adjustment
characteristics x  1, . . . , X strategy.
d Index for advertiser’s desired target
audience characteristics d  1, . . . , D 4.1. Budget Allocation Strategy
t Index for a bid period t  1, . . . , T When a new bid period is starting, we need to allo-
Parameters
St Remaining budget of an advertiser at bid
cate the target spend for each remaining bid period,
period t depending on the previous periods’ total number of
I j, t Total number of impressions available for bid requests received and the remaining target spend.
application j at bid period t As RTB takes place in a rapidly changing environment,
P j, t Bid price for application j at bid period t
the number of bid requests a DSP receives can fluctu-
C j, t Conversion rate for application j at bid
period t ate in adjacent bid periods capriciously. To enforce this
Yx, j Target audience options for characteristic x dynamism, we calculate the moving average value of
on application j the total bid requests for each bid period until the last
Yd0 Advertiser’s target audience option for executed bid period:
characteristic d
J Total number of available applications to
j I j, t × t
P P
t
bid Mt  P . (1)
X Total number of target audience t
characteristics
T Total number bid periods performed during According to the Equation (1), we consider the index
an ad campaign period of the bid period as the weight of each period’s total
B Total number of applications that have been
selected to bid
bid request. As a result, the current period has the
Aj Accessible target audience for application j highest weight compared to the previous bid peri-
Wb, t Target winning rate for application j at bid ods. Using the moving average value of the current
period t period (t) and the previous period (t − 1), we compute
Pb, t Bid price for application b at bid period t the moving average ratio (MAR) for the next period, as
S0t Budget allocation for bid period t
Mt Moving average on total bid requests at bid shown in Equation (2). This will help apply the recent
period t changes of the received bid requests in the last bid
MARt Moving average ratio at bid period t period in the model with respect to the previous bid
Vb, t Number of impressions won in application periods:
b at bid period t
Mt
∗ Set of zero and all positive integers MARt+1  , ∀ t. (2)
Mt−1

Note that the discrete time (t) in Section 3 is being mod- When we have MAR for the next period, we compute
ified to represent bid period (t) in this and subsequent the budget for the next period, as shown in Equa-
sections in the paper. The DSP will decide the duration tion (3):
of a bid period.
St × MARt+1
As explained in Figure 1, when the campaign is , MARt+1 < T − t



T−t

running, based on the previous bid period’s data, the


S0t+1  S , MARt+1 ≥ T − t , ∀ t. (3)
proposed model will determine the target applications  t

and the number of bid requests per app. This process S ,

t T
 t
is considered the application selection strategy. The
application selection strategy depicts an autonomous According to Equation (3), we allocate the budget for
and dynamic bidding strategy for the whole campaign the next period based on the number of remaining
and facilitates identifying the best possible applica- bid periods and the number of bid requests received
tions. Such applications can optimize the advertiser in the past bid periods. During our analysis, we have
target by maximizing the advertiser’s utility value with deduced that in most cases MAR is between 0 and 2.
respect to exogenous factors such as the accessible tar- Therefore, in any given bid period, after allocating the
get audience, the conversion rate, the remaining tar- budget for the next period, the algorithm can preserve
get spend, and the bid price. The changes in these the target spend for the remaining bid periods.
Adikari and Dutta: New Approach to Real-Time Bidding in Online Advertisements
6 INFORMS Journal on Computing, Articles in Advance, pp. 1–17, © 2019 INFORMS

Figure 1. (Color online) Three Steps of the Auto Pricing Strategy

Input Output

Historical winning
Target applications
bid price

Budget allocation Bid price App selection Number of bid


Target audience strategy adjustment strategy strategy requests to target
per app

Historical
Target bid price
conversion rate

Bid period (t) Model Bid period (t + 1)

4.2. Bid Price Adjustment Strategy The new target bid price for the next bid period is
In RTB, the winning bid price has a dynamic behavior computed as Equation (5):
that varies frequently. Thus, the winning bid price that  Pb, t × α, Wb, t < 0.5, 1 ≤ α ≤ 2
is predicted for a particular bid period might not be 

 Pb, t

the optimal bid price in the following period. If we bid

Pb, (t+1)  , 1 > Wb, t ≥ 0.5 (5)
higher than the optimal bid price, we will pay more 
 Wb, t
than required for the desired goal, so we would not be
 P × β,

Wb, t  1, 0.5 ≤ β ≤ 1.
 b, t
able to achieve a higher number of conversions because
In Equation (5), we have defined two constants, α
of the restricted target spend; in contrast, if we bid for a and β, as thresholds to limit the scope of bid price
lower bid price, then also we would not win enough to adjustment. α is defined to adjust the bid price when
achieve a higher number of conversions because of the the current bid period’s winning rate is 0 for a particu-
limited duration of the campaign. As a result, in this lar selected app. Its boundary is based on the concept
step, we adjust the bid price with regard to the actual of doubling the price to increase the winning proba-
winning rate, which is defined in Equation (4): bility (Perlich et al. 2012). β is defined to adjust the
bid price of the apps that won all the bid requests
Vb, t
Wb, t  , K bt > 0, b ∈ J, ∀ b. (4) that were selected to bid in the current period. Based
K b, t on the concept followed in α, we define the bound-
ary of β to see the winning rate when the bid price
In our model, the actual winning rate is calculated at
is half of the current bid period’s bid price. Through
the end of each bid period (i.e., merely the beginning
these experiments (Section 6.2), we have identified the
of the next period) and is used to adjust the bid price
best possible values for the α and β as 1.5 and 0.8,
for the next period. Mainly, the bid price adjustment
respectively. Apart from the bid price adjustment on
strategy is based on the idea of maintaining a higher the selected applications, the system also should keep
winning rate for a lower bid price. For instance, in a track of all the applications and their details that gen-
particular application, in a particular bid period, if the erated the bid requests in the current period. We apply
number of actual winning bid requests is equal to the the price adjustment strategy to all the targeted appli-
number of expected winning bid requests, it is possible cations during the current bid period. Therefore, to
to reduce the bid price. Winning all bids in a bid period condense the effect from applications that were not tar-
indicates that the bid price may have been higher than geted in the current period, we have updated their total
that required to win the bid, indicating paying more, number of bid requests in the current period as 1. Even
and thus there is a possibility of reducing the bid price. if the model selects such applications to bid in the next
However, in many situations, the actual winning period, it can expect only one bid request to be bid.
rate is lower than the expected winning rate. Hence, As a result, if such bid requests could not win in the
to increase the winning rate for the subsequent bid bid period (t + 1), then the model will adjust their bid
period, we increase the current period bid price with prices for the next bid period (t + 2) and increase the
respect to the actual winning rate for the current probability of winning for the next period. Equation (6)
period. Furthermore, if the actual winning rate for the defines this constraint. This strategy has been exper-
current period is zero for a particular application, then imentally evaluated, and the results are explained in
we increase its bid price for the next period by a con- Section 7.2
stant value multiplication. I j, t  1 j < B, ∀ j. (6)
Adikari and Dutta: New Approach to Real-Time Bidding in Online Advertisements
INFORMS Journal on Computing, Articles in Advance, pp. 1–17, © 2019 INFORMS 7

4.3. App Selection Strategy The policy K πj, t (Θt )∀ j in our problem is obtained by
Here we do not know what the conversion rate and maximizing the utility function across all apps for the
the number of impressions of an app would be in the next bid period (t + 1) based on the values of exoge-
future. Moreover, as these parameters (conversion rate nous factors received from the outcome in the past bid
and the number of impressions) of an app are depen- period (t) (Equation (9)). This is equivalent to the look-
dent on so many external factors (such as how the RTB ahead policy defined by Powell (2014), with one time
engine distributes the app impressions to various DSPs, period as the horizon.
how the popularity of an app is changing, or how the Next, we describe the estimation of parameters for
users are interacting with an app), we decided it is not the utility function and the maximizing problem to
prudent to fit stochastic distributions to these parame- define the policy of transition from one bid period (t)
ters and develop an approach based on these distribu- to the next bid period (t + 1).
tions. Instead, we followed a dynamic approach based
on the simple rule that the “near past is the best predic- 4.3.2. Optimization Strategy. In the RTB process, the
tor of immediate future.” In this approach, we develop advertiser is looking for higher returns through a
a mathematical optimization problem for app selec- higher number of conversions for his investments. The
tion based on maximizing a utility function of the app goal of the model is therefore to increase the adver-
in the problem. The input of the optimization prob- tisers’ ROI of buying impressions by optimizing the
lem is the performance and impression data of apps advertisers’ target in all the available listed applications
from the previous time period (t − 1); the output of at a certain bid period. In the process of establishing
the optimization problem is the list of selected apps to the model, we can define a utility value to demonstrate
bid in the next bid period (t). Subsequently, based on the effectiveness of achieving conversions with regard
newer performance and impression data, the optimiza- to the advertiser target. To define the utility value of
tion model is again run after the next bid period (t) to placing an advertisement in an app, we first define the
decide the list of selected apps for the following time accessible target audience.
period (t + 1). This continues until the campaign budget Accessible target audience. In most cases, publishers
has been exhausted or the campaign duration has come provide the target audience for their applications to the
to an end. Below we present the dynamic programming DSPs based on user characteristics, such as the percent-
representation of the above approach. age of males using the app, the percentage of different
4.3.1. Dynamic Programming Model. For our problem age groups using the app, etc. With such information
following the DP (Lew and Mauch 2006) approach, we and the advertiser’s preferences, the following process
define the contribution function R( · ) of the DP model defines the strategy to identify the accessible target
as in Equation (7): audience. For example, if an advertiser requests target-
R(Θt , K πj, t (Θt ))  U j, (t+1) (Θt , K πj, t (Θt )), ing of his advertising campaign toward female users,
X
∀j
j the best impressions to be published for such adver-
where Θt  (C jt , Vj, t , P j, t ). (7) tisements are those that belong to an application with
a target audience gender of 100% female, and the worst
In Equation (7), a particular state is defined with Θt case scenario is an application with a 100% male target
and its outcome depends on the previous bid period’s audience. The same would hold for an advertiser that
exogenous factors C j, t , Vj, t , and P j, t . We define the requires targeting of his ad campaign for females who
contribution function R( · ) as a reward using the util- are under the age of 30. If there is a particular appli-
ity function U j, t ( · ), which is derived in Section 4.3.2. cation with 60% female and 80% under the age of 30
K πj, t (Θt ) defines the policy of selecting the apps based as the target audience, then the accessible target audi-
on the utility value at the end of each period. There- ences will be computed by the product of female and
fore, to maximize the number of conversions obtained age group percentages (48%). Hence, the accessible tar-
for a lower target spend throughout all the bid peri- get audience for a particular application is captured
ods (T) of the campaign, we maximize the contribution based on the product of relevant target audience char-
function, as given in Equation (8). The notation Ɛπ in acteristics. It is defined as Equation (10):
Equation (8) defines the possibility that the exogenous Y
factors (conversion rate, winning rate, and bid prices) Aj  Yx, j , Yx, j ∈ Yd0 , x ∈ d, ∀ j. (10)
depend on the outcome of the previous bid period. x
 
Utility function. The utility value can be determined
Maximize Ɛπ U j, (t+1) (Θt , K πj, t (Θt )) ,
XX
∀j (8)
j, (0<t≤T) t j
based on the return rate. By multiplying the con-
π version rate and accessible target audience, we can
Ɛ : K πj, t (St )∀ j
X calculate how many conversions of the required tar-
 Maximize U j, (t+1) (Θt , C j, t , Vj, t , P j, t ), ∀ j (9) get audience can be obtained for a particular bid
j value of an impression. Then, to identify the exposure
Adikari and Dutta: New Approach to Real-Time Bidding in Online Advertisements
8 INFORMS Journal on Computing, Articles in Advance, pp. 1–17, © 2019 INFORMS

or return per dollar, the return rate is estimated as Based on the previous three constraints, the aggre-
Equation (11): gated utility value is maximized to find the most
suitable applications that could optimize the adver-
Return rate RR jt+1  C j, t A j /P j, t , ∀ j, t. (11) tiser’s expectations. We define Equation (16) to maxi-
mize the sum of the utility value across all applications
Since we have the return rate per impression, the value at a particular bid period (t) while bonding through
of an effective number of conversions per dollar can the constraints:
also be computed. This value reflects the utility value
for an advertiser. Equation (12) defines the utility value X X C j, t A j K j, (t+1)
Maximize U j, (t+1) → Maximize ,
for the advertiser on a particular application j: j j P j, (t+1)

C j, t A j K j, (t+1) ∀ t. (16)
Utility value U j, (t+1)  , ∀ j, t. (12)
P j, t According to the value of K j , we find the actual number
of impressions to bid for each application. The appli-
Optimization policy. In transitioning from state Θt to
cations whose impressions are not selected will not be
Θt+1 (bid period (t) to (t + 1)), we apply the policy of
considered for the bid price adjustment of the next bid
maximizing the aggregated utility value (Equation (9))
period, but, as explained earlier, their number of bid
for all the apps for bid period (t + 1) based on the state
requests will be changed.
at the end of bid period t. When maximizing the utility
In summary, the budget allocation strategy allocates the
value, we can increase the return rate and an effective
relevant amount of target spend for the bid period;
number of conversions that can be attained for a lower
the bid price adjustment strategy adjusts the bid prices,
bid price; in simple terms, when maximizing the util-
according to each app’s current bid period’s winning
ity value we can access to a higher conversion rate,
rate; and the app selection strategy selects the apps and
a higher accessible target audience, a lower bidding
calculates the number of impressions to bid for the next
price, and a higher number of selected impressions.
period.
However, when identifying the highest utility value
among all the applications’ utility values, we have to
endure the following constraints of the advertiser. 5. APS Algorithm
The first constraint—Equation (13), the number of The complete approach is presented in algorithm form
impressions chosen from a particular application—is in Algorithm 1.
limited to its total number of available impressions. Algorithm 1 (APS algorithm)
Per the results of the earlier section, we consider the
predicted value of the number of available impres- 1. Initialize S0 , T, distanceVector  0, bidReqTotal  0,
j, t0 I j, t0  0, S1  S0 /T, M0  1
P 0
sions for the next bid period. Therefore, when selecting
impressions to bid, the total number of impressions 2. Source state: t  0, which is the most recent historic
that can be selected is limited to the number of avail- campaign data, is used to compute the
able impressions of that particular bid period. This application set B and K b in the initial state: t  1
should be true for all the listed applications. 3. for ∀ t, until t ≤ T or St > 0 do
4. bidResCount j  0, clickCount j  0, ∀ j
K j, (t+1) ≤ I j, t , ∀ j, t. (13) 5. for ∀ bid requests, until the end of the bid
period t do
The next constraint, Equation (14) is that, when select- 6. I j, t  I j, t + 1
ing applications, spending for all the impressions 7. if (j ∈ B and S0t > 0 and K jt ≥ bidResCount j )
should be less than or equal to the target spend for 8. send bid response with P j, t
the period. The mathematical formulation for this con- 9. bidResCount j  bidResCount j + 1
straint is defined as follows: 10. if (win note receives)
X 11. S0t  S0t − winPrice
S0t+1 ≥ P j, (t+1) K j, (t+1) , ∀ t. (14) 12. Vj, t  Vj, t + 1
j
13. If (conversion note receives)
However, since impressions are selected only from the 14. clickCount j  clickCount j + 1
applications that have maximum utility value, for the 15. end if
remaining applications, the number of selected impres- 16. end if
sions could be zero. Therefore, the constraint, Equa- 17. end if
tion (15), should hold: 18. end for // the end of the bid period t
P
19. calculate j I j, t
K j, (t+1) ∈ ∗ , ∀ j, t. (15) 20. calculate C j, t  clickCount j /Vj, t
Adikari and Dutta: New Approach to Real-Time Bidding in Online Advertisements
INFORMS Journal on Computing, Articles in Advance, pp. 1–17, © 2019 INFORMS 9

21. St  St−1 − S0t allocation for the bid period or the expected number of
22. distanceVector  distanceVector + 1 impressions to bid. We keep track of each responded
bidReqTotal  bidReqTotal + j I j, t bid request (bidResCount j ). When a winning notifica-
P
23.
24. Mt  bidReqTotal/distanceVector tion is received, the winning price of the winning bid
25. Calculate MARt+1  Mt /Mt−1 is subtracted from the target spend allocation for the
26. if ((t  T) or (MARt+1 ≥ T − t)) bid period. Also, the number of winning bids Vj, t is
27. S0t+1  St counted. Subsequently, if there is a click for the win-
28. else if (MARt+1 < T − t) ning impression, then that is (clickCount j ) counted as
S × MARt+1 well. At the end of the bid period, but before running
29. Calculate S0t+1  t the proposed model, we calculate the total number of
T−t
30. end if bid requests received (line 19), the conversion rate for
31. for j  1 until j ≤ J do each application C j, t (line 20), and the remaining target
32. if ( j ∈ B and K j, t > 0) spend St (line 21).
Vj, t Lines 22–26 discuss the budget-allocation strategy.
33. Calculate W j, t 
K j, t Based on the distanceVector and bidReqTotal, the Mt is
34. if (W j, t < 0.5) calculated. Afterward, MAR is computed for the next
35. P j, (t+1)  P j, t × α bid period using the current period and the previous
36. else if (W j, t < 1 and W j, t > 0.5) period Mt . Since we have both St and MARt+1 , we are
37. P j, (t+1)  P j, t /W j, t able to calculate the target spend allocation for the next
38. else period (lines 26–30). Again from Algorithm lines 31
39. P j, (t+1)  P j, t × β to 45, we compute the bid price adjustment strategy.
40. end if As shown in line 31, we iterate through all the applica-
41. else tions and calculate the winning rate for each bid appli-
42. I j, t  1 cation of the current period. Then, based on the win-
ning rate, we adjust the bid price as depicted in lines
43. end if
34–40. As explained in Equation (6), in line 42, we have
44. calculate A j
changed the total bid request count to 1, for an appli-
45. end for
cation that was not selected for the current period. At
P C j, t A j K j, (t+1) line 44 we compute the accessible target audience of
46. compute Maximize j , w.r.t.
P j, (t+1) each application based on the advertiser target audi-
Equations (13), (14), and (15) ence (see Equation (10)). Line 46 depicts the optimiza-
47. output K j, (t+1) where j ∈ B (selected apps for the tion strategy with regard to the constraints discussed
next period) in Equations (13)–(15). The output of the first iteration
48. end for // the end of the campaign period of Algorithm 1 will be utilized as the input to the next
Algorithm 1 is established on the bid periods, and bid period.
each bid period will define a new state. By iterating As Algorithm 1 elaborated, in each iteration the
through the bid periods, the algorithm strives to opti- model optimizes for a higher number of conversions
mize the advertiser return (maximizing the total con- for a lower target spend. It implies that the algorithm
versions as discussed in Section 3). Before starting the achieves the local optima at each bid period. As we
algorithm, at the initial step (Algorithm 1: line 1), we show later in the analysis section (Section 6.4), even
initialize a set of values including the advertiser target though each bid period is solved individually for the
spend S0 . Then at line 2 we initialize the sources state local optima, the final output of the model is not
(t  0), which is used to find the set of B and K b in the same as the global optimal solution obtained by
the initial state in the bid period one. Lines 3–44 define the exact approached described in Section 5.1. The
the complete set of functions that the APS carried out solution obtained by the exact approach is impossi-
in a particular bid period. In each bid period (line 4) ble to achieve without knowing the future. The final
we define a set of constants to track the number of bid solution is in close approximation to the exact optimal
requests that are bid bidResCount j and the number of solution.
clicks that are obtained in clickCount j with respect to
the each application. As depicted in line 5, we consider 6. The Benchmark Approaches
each incoming bid request and track the relevant bid In this section, we try to evaluate and understand
request count from each application (line 6), until the the possible gains of the proposed bidding approach.
bid period ends. According to lines 7–18, from each For that, we propose two brute force approaches:
incoming bid request, the bid requests that are relevant the greedy approach and the exact approach. More-
to the selected applications B will be selected to bid. over, we have compared our approach with the three
However, this is conditioned on either the target spend latest approaches, which are selected based on the
Adikari and Dutta: New Approach to Real-Time Bidding in Online Advertisements
10 INFORMS Journal on Computing, Articles in Advance, pp. 1–17, © 2019 INFORMS

attributes that are available in our data set. There are 6.2. The Greedy Approach
other approaches as discussed in Related Works sec- One of the simplest and most commonly practiced bid-
tion; however, they require data items from the ad ding approaches in many DSPs is the greedy approach.
network (SSP) or ad exchanges, data which are not When the DSP receives bid requests, it will bid with
possible in a realistic DSP scenario. These approaches the highest possible bid price for all the bid requests
were satisfactorily implemented using our data and that have the expected target audience. As a result, it
provided a complete empirical comparison to our pro- will achieve a certain number of conversions within
posed approach using factual real-time data. a shorter period of time for the whole target spend.
Even though the bid price is set at the highest value,
6.1. The Exact Approach the winning bid price is decided by the second price
The exact approach can be implemented only when auction, i.e., the second highest bid price. For example,
all the future information about the RTB process (such if the DSP is interested in a particular incoming bid
as target audience, winning price, and conversion out- request, then it can bid for $1, which is a really high
come) is given, because the bidding is made only for price, but the DSP may have to pay only $0.1 as the win-
the bids whose conversion outcomes are positive and ning bid price, attributable to the second price auction.
have the lowest winning price and correct target audi- This can be considered as a benchmark approach for
ence. Therefore, in practicality, it is impossible to imple- comparison purposes. Here no intelligence is applied
ment the exact approach. However, for benchmarking in determining the bid price of the application. The
purposes, we can implement the exact approach based algorithmic description of the approach is as follows.
on the historical campaign data and compare it with See Table 2 for the notation described.
our APS. Following the Algorithm 2 model provides
some insights into the exact approach. Here we provide Algorithm 3 (The greedy algorithm)
a list of bid requests as an input and attain a number of Initialize z to a timestamp during the campaign.
target conversions and a list of selected bid requests as Initialize Z as the end time of the campaign
outputs. See Table 2 for the notation described. While (z ≤ Z)
Algorithm 2 (The exact algorithm) If (S ≥ 0)
If (Yxr ∈ Yd0 and x ∈ d)
Initialize Lin and Lout to empty, where Lin and
S  S − W Rr
Lout are lists
If (CONr  1)
Add all r . . . R into L in with ascending order of WP r
N  N + CONr
While (Lin length > 0)
End if
If (S ≥ 0)
End if
If (WP r > 0 and Yxr ∈ Yd0 and x ∈ d and
Else
CONr  1)
Exit While
S  S − WP r
End if
N  N + CONr
End While
Remove r from Lin
Output: N
Add r into L out
End if Note that here the advertiser will end up paying
Else more because he is not pricing the bid per his require-
Exit While ment or utilizing the full supply source. For example,
End if if an app is expected to receive 100,000 bid requests in
End While a day, and the advertiser requires winning only 1,000
Output: N, L out impressions, the greedy approach will select the first
1,000, regardless of the price. But if the bid price can be
Table 2. Notation for the Exact Approach and the Greedy predicted, then the system should select the cheapest
Approach 1,000 bid requests.

Indices 6.3. Optimal Real-Time Bidding


r Index for the bid requests r  1, . . . , R Zhang et al. (2014) demonstrated that the optimal
Parameters bid has a nonlinear relationship with the conversion
S Total budget of an advertiser
CON r Conversion value for bid request r either 1
rate. Using real-world data, they have shown that the
(converted) or 0 (not converted) winning rate consistently has an (approximately) con-
R Total number of bid requests received cave shape. Therefore, compared to the higher valued
WP r Actual winning price for bid b impressions, the lower valued impressions are more
Decision variables cost effective and have a higher winning rate. In accor-
N Total number of conversions achieved
dance with this rationale, a DSP is able to bid on more
Adikari and Dutta: New Approach to Real-Time Bidding in Online Advertisements
INFORMS Journal on Computing, Articles in Advance, pp. 1–17, © 2019 INFORMS 11

impressions rather than focus on a small set of high of 0.05 (the resulting significance is 0.005), we reject
valued impressions. To further simplify the proposed the null hypothesis of no difference between each cam-
solution and to calculate the bid price using conversion paign data set. As a result, we can state that the data
rate, they have applied the Lagrangian function. selected for each campaign differ, and this supports our
approach in different types of RTB campaigns.
6.4. Bidding Below Max eCPC As we discussed in Section 4, the problem has
Per Lee et al. (2012), the conversion rate is dependent been formulated using mathematical modeling, and
on the past performance data. Lee et al. demonstrated we have developed a DP algorithm (Bertsekas 1995)
that when the advertiser’s goal of max eCPC is the to test the model. Since the model is developed such
upper bound of cost per click, the bid price on an that each bid period’s input is set based on the pre-
impression can be obtained by multiplying max eCPC vious bid period’s output, this can be implemented
and pCTR. Zhang et al. (2014) have followed and com- incrementally while adjusting the bid prices and tar-
pared their solution while calculating the max eCPC get spend dynamically during the algorithm execution.
for each campaign by dividing its cost and achieving The algorithm was coded using Java programming lan-
a number of conversions in the training data. To com- guage, and the solution to the optimization problem is
pare this with our APS approach, we followed the same implemented using the existing free and open-source
strategy to predict the bid price and compute the num- Java library called Java Optimization Modeler. It offers
ber of conversions using our data set. a full-fledged platform to model Java programs and
solve optimization problems.
6.5. Linear-Form Bidding of pCTR To evaluate the efficiency and effectiveness of the
Per Perlich et al. (2012), the bid value is linearly propor- model, we defined the metric target spend per conver-
tional to the pCTR. Perlich et al. estimated the pCTR sion (TSPC), defined by Equation (17):
by a supervised learning algorithm. The approach has
also been applied by Zhang et al. (2014) for the perfor- Target spend per campaign
mance comparison of their proposed approach. TSPC  . (17)
Total number of conversions
The TSPC allows us to normalize the number of con-
7. Analysis and Results versions based on the dollar spend in a campaign. It is
7.1. Data Set also a metric that has the same unit as the cost per con-
To develop a model and test it empirically, we pre- version (CPC), frequently used in digital advertising.
served data from an ongoing RTB process of a leading
mobile DSP (Mobilewalla.com). Compared to previ- 7.2. Analysis of APS
ous studies, where the model is examined with syn- According to the metric we formulated, the objective of
thetic data, the factual data give proper insight into the our analysis is to demonstrate that the proposed model
model. The data set includes the data for three cam- can accomplish lower TSPC while achieving a higher
paigns, each of 10 days’ duration, run by the DSP in number of conversions. According to Table 4, TSPC val-
August 2014. It includes 6,317,443 bid requests spread ues for each target spend increase when the bid period
across the month of August 2014. Table 3 depicts is increased. We demonstrate this scenario in Figure 2.
some details of these three campaigns. To determine Comparing Figure 2 (a) and (b), we can determine that,
whether the three campaigns’ data differ significantly, when the bid period duration is small (this means a
we have computed the difference between related sam- higher number of bid periods), a higher number of
ples using Friedman’s two-way analysis of variance, conversions can be achieved for a lower TSPC value.
based on the number of bid requests for each cam- However, when a particular bid period is considered,
paign. Since the Friedman test is nonparametric and TSPC increases with respect to the target spend. That is
distribution free, it can seek differences between each because, when we allocate a higher budget for a cam-
campaign data set (Friedman 1937, Mack and Skillings paign, the allocation for a particular period is also high.
1980). According to the result, at the significance level Since the model has a lower granularity to select appli-
cations, it will also bid for the applications with higher
Table 3. Data Difference Among Three Days bid prices to maximize the number of conversions.
Per the previous scenario, we can demonstrate that,
Campaign X Campaign Y Campaign Z when the bid period is smaller, the model accuracy
Total number of 2,209,864 2,113,487 1,994,092
will be increased. Nevertheless, when the target spend
bid requests is high, it will try to achieve more conversions from
Total distinctive 240 205 160 the applications with the higher bid prices. The insight
applications from this analysis is that when an advertising cam-
Winning bid average for 1.06 1.15 1.08 paign is running in RTB, it needs to maintain the bid
the whole campaign $
period at a minimum level.
Adikari and Dutta: New Approach to Real-Time Bidding in Online Advertisements
12 INFORMS Journal on Computing, Articles in Advance, pp. 1–17, © 2019 INFORMS

Table 4. Model Behavior at Different Bid Periods

5 minutes 15 minutes 30 minutes 60 minutes

Campaign X TSPC Conversions TSPC Conversions TSPC Conversions TSPC Conversions

Target spend
$ 1,000 0.14 6,621 0.21 4,555 0.28 3,879 0.46 1,988
$ 2,000 0.19 8,961 0.23 6,878 0.31 5,006 0.51 2,731
$ 3,000 0.22 9,877 0.27 7,543 0.33 5,766 0.54 3,104

To validate the model with Equation (6), we pri- model’s performance; the first technique is to assign
marily look at how the model behavior would change the number of bid requests to 1 for all the inactive
based on the number of bid requests of the inactive applications in the previous bid period, and the second
applications, during the previous bid period. As a technique is to increase the bid price by 1.5 times for
result, we test the model based on the last updated the unsuccessful applications in the previous period.
bid requests and updating them to single-bid requests In Figures 3 and 4, we illustrate how the performance
and 100-bid requests. The analysis is performed for of our model varies based on the two parameters α
three campaigns when the bid period is 5 minutes. The and β in Equation (5). We vary α between 1 and 2
results are listed in Table 5. in steps of 0.2. We vary β between 0.5 and 1 in steps
As depicted in Table 5, we can get a better outcome of 0.1. For three campaigns, we plot the TSPC values
when the number of bid requests adjusts to 1 for all with α in Figure 3(a) and the number of conversions
the inactive applications during the previous period. achieved with α in Figure 3(b). As shown in Figure 3,
This provides an added advantage: if the applications the TSPC and the number of conversions follow the
with a single bid request are selected by the optimiza- opposite pattern (concave in the case of TSPC versus
tion strategy, then, based on their winning outcomes, convex in the case of number of conversions) when
the model can further adjust their bid prices and opti- α is changed. The TSPC is the lowest at α  1.5, the
mize the winning outcome for the next bid period. In same point at which the number of conversions is also
parallel, when the bid price is increased by 1.5 times the highest. This shows that when the winning rate is
for an application that could not win any impressions lower than 50%, the price to bid needs to be increased
during the previous period, the model provides two by 50%.
advantages. The first advantage is that its return rate Similar behavior can be seen for β in Figure 4. With
will be reduced according to Equation (11). This will the increase of β, the TSPC values follow a concave
increase the probability of bidding for another appli- graph, whereas the number of conversions follows a
cation with a higher return rate. Second, if it is still convex graph. The TSPC value is the lowest at β  0.8,
selected by the optimization strategy, then there is a at which point the number of conversions is also the
high likelihood of winning its impressions because of highest. Thus, per Equation (5), when the winning rate
the high bid price. The analysis provides insight in is 1 (i.e., the DSP has won all its bids), the bid price
understanding two techniques that help increase the needs to be reduced by 20%.

Figure 2. (Color online) Conversions and TSPC Values


(b) Conversions achieved vs. Bid periods
(a) TSPC vs. Bid periods in X, Y, and Z campaigns
0.8 7

0.7 X Y Z
6
Conversions achieved ('000)

0.6
5
0.5
4
TSPC

0.4
3
0.3
2
0.2

0.1 1

0 0
5 15 30 60 5 15 30 60
Bid periods (minutes) Bid periods (minutes)
Adikari and Dutta: New Approach to Real-Time Bidding in Online Advertisements
INFORMS Journal on Computing, Articles in Advance, pp. 1–17, © 2019 INFORMS 13

Table 5. Model Behavior Based on the Number of Bid Requests of the Applications That Are Not Active
During the Previous Period

Based on last updated bid requests Based on single bid request Based on 100 bid requests

TSPC # conversions TSPC # conversions TSPC # conversions

Campaign X 0.18 4,883 0.14 6,621 0.19 4,412


Campaign Y 0.21 4,411 0.17 6,103 0.28 3,631
Campaign Z 0.27 3,867 0.25 5,789 0.32 3,347

7.3. Performance Analysis As mentioned earlier, since it is difficult to calculate


7.3.1. Complexity Calculation. To understand the the total number of steps in the optimization strategy
computational efficiency of our proposed algorithm, of the algorithm, the complexity is given as a function
we compute the algorithmic complexity. We define f (Opt) of relevant parameters:
complexity as a numerical function, time versus the
input size, and estimate the efficiency of the algorithm O( f (Opt))  f (J, I j , P j , S).
asymptotically. We measure time as the number of ele-
Hence, the total complexity of the algorithm can be
mentary “steps.” The complexity of the algorithm is
defined as follows, indicating linear complexity with a
given as a ballpark figure, as it is complex to define a
number of apps:
correct complexity value, for the optimization strategy,
because complexity analysis of linear programming O(J) + O(J) + O(J) + O( f (Opt)).
(LP) presolving involves some heuristic implementa-
tion mixed with the mathematical background. Based 7.3.2. Runtime Performance of APS. As explained
on the notations of Table 1, the following complexities earlier, we execute the APS at the end of each bid
are computed. period. Once APS is executed it selects the applica-
The budget allocation strategy mainly depends on tions with the number of requests to be bid and the
the number of applications. Therefore the complexity relevant bid price. Then, according to the resulting val-
is O(J). ues, bidding is conducted during the bid period until
The bid price adjustment strategy has two key tasks: the algorithm is executed at the end of the next bid
calculating the win rate and changing the bid price for period. Therefore, the algorithm does not compute the
the selected application and changing the bid requests’ bid price in real time for each received bid request. But,
count for a nonselected application. Thus, we can state as demonstrated in the previous section, when the bid
that, for this strategy, the complexity depends on the period is smaller, the actual gain for the advertiser is
total number of applications O(J). increased. To reduce the bid period, we need to under-
The app allocation strategy also has two tasks: cal- stand the runtime performance of APS with respect
culation of the target audience for each application to the number of bid requests available to bid and a
and app allocation via an optimization algorithm. The number of applications available to select.
target audience calculation complexity can be written As illustrated in Figure 5, in both cases the perfor-
as O(J). mance is linear to the number of bid requests and the

Figure 3. (Color online) (a) TSPC vs. α Value and (b) Conversions Achieved vs. α Value in X, Y, and Z Campaigns

(a) (b)
0.30 7.0

0.28 6.8
Conversions achieved ('000)

0.26 6.6

0.24 Campaign X 6.4

0.22 Campaign Y 6.2


TSPC

0.20 Campaign Z 6.0

0.18 5.8

0.16 5.6

0.14 5.4

0.12 5.2

0.10 5.0
1.0 1.2 1.4 1.6 1.8 2.0 1.0 1.2 1.4 1.6 1.8 2.0
 value  value
Adikari and Dutta: New Approach to Real-Time Bidding in Online Advertisements
14 INFORMS Journal on Computing, Articles in Advance, pp. 1–17, © 2019 INFORMS

Figure 4. (Color online) (a) TSPC vs. β Value and (b) Conversions Achieved vs. β Value in X, Y, and Z Campaigns
(a) (b)
0.30 7.0
0.28 6.8

Conversions achieved ('000)


0.26 6.6
0.24 6.4
Campaign X
0.22 6.2
Campaign Y
TSPC

0.20 Campaign Z 6.0


0.18 5.8
0.16 5.6
0.14 5.4
0.12 5.2
0.10 5.0
0.5 0.6 0.7 0.8 0.9 1.0 0.5 0.6 0.7 0.8 0.9 1.0
 value  value

number of applications, which makes the APS algo- number of conversions obtained with a $1,000 target
rithms runnable at much higher frequency, e.g., 5 min- spend for the three campaigns in Table 6. For APS the
utes. This allows us to have shorter bid periods in our bid period is set for 5 minutes. As shown in Table 6,
approach, which enables higher accuracy in bid price based on the average number of conversions obtained
determination. across three campaigns, our APS approach performs
approximately 19%, 66%, 37%, and 84% better than the
7.4. Comparison with Benchmark Approaches ORTB, Mcpc, Lin, and the greedy approaches, respec-
We implemented three benchmark approaches— tively. In contrast, according to the exact approach
optimal real-time bidding (ORTB), bidding below max (which performed 70% better than our approach), we
eCPC (Mcpc), and linear-form bidding of pCTR (Lin)— can say that APS has considered 30% of bid requests
and applied these approaches in running the three out of the most cost-effective bid requests (because the
campaigns X, Y, and Z. In implementing the bench- exact approach always bid only for such bid requests)
mark approaches, we derived all model parameters to gain its conversions.
following the methodologies given in the respective Next, we plotted how the average TSPC values
papers. Additionally, we implemented the exact and across three campaigns varied with respect to target
greedy approach for comparison purposes. In this sec- spend in Figure 6. From Figure 6, we note that the
tion, we compare the results of all five approaches to TSPC values for APS are lower than ORTB, Mcpc, Lin,
our proposed APS approach. and the greedy approach. Additionally, we also ob-
We have compared the outcome of the APS and all serve that, with the increased target spend, the differ-
the other benchmark approaches with respect to the ence between APS and other approaches is widened.

Figure 5. (Color online) (a) Number of Bid Requests vs. Algorithm Performance Time and (b) Number of Applications vs.
Algorithm Performance Time
(a) (b)
80 80

70 70
Performance time (milliseconds)

Performance time (milliseconds)

60 60

50 50

40 40

30 30

20 20

10 10

0 0
0 0.5 1.0 1.5 2.0 2.5 0 50 100 150 200 250
Number of bid requests (millions) Number of applications
Adikari and Dutta: New Approach to Real-Time Bidding in Online Advertisements
INFORMS Journal on Computing, Articles in Advance, pp. 1–17, © 2019 INFORMS 15

Table 6. Comparison of the Obtained Conversions Between APS and Other Approaches

Average number of
Campaign X Campaign Y Campaign Z conversions per campaign

APS 6,621 6,103 5,789 6,171


ORTB 5,194 4,807 4,988 4,996
Mcpc 2,290 2,167 1,893 2,117
Lin 3,871 4,184 3,608 3,888
The greedy approach 1,165 870 967 1,001
The exact approach 25,431 18,055 19,464 20,983

Nevertheless, compared to the exact approach, APS $1.00 for a $3,000 target spend. This demonstrates that,
has a slightly high TSPC, but the difference between when the target spend is increased, APS is bidding
the two approaches is less than 0.15. Therefore, we at higher bid prices to achieve more conversions with
can conclude that the proposed APS approach achieves higher winning bid prices. However, at a lower target
a higher conversion rate with reduced spending for spend, the bid price in the case of APS is decreased
advertisers compared to other approaches. significantly to achieve better outcomes (higher conver-
Next, to better understand why the APS approach sions). This dynamic behavior cannot be seen in other
performs better than other approaches, we compared approaches. Furthermore, as shown in Figure 7, for dif-
the bidding behavior of each approach in Figures 7 ferent target spends, APS always bids at a much lower
and 8. In Figures 7 and 8, the X axis demonstrates the bid price than other approaches. This can be further
bid prices in $ and the Y axis demonstrates the number demonstrated by looking at Figure 8, where APS over-
of bid requests selected to bid by each approach. The laps with the exact approach graph, which always bids
graphs (a)–(c) in both Figures 7 and 8 illustrate the bid- at the best bid prices for the advertiser. The main rea-
ding behavior in campaign X when the target spend is son for this achievement is that the APS dynamically
$1,000, $2,000, and $3,000 ,respectively. When the target adopts the most recent bidding behavior by inspecting
spend increases, the number of bid requests selected to the most recent bid period and adjusts the bid price
bid is increased. For example, in APS, it has increased to enact the optimal bid price for the period. There-
from 14,917 for a $1,000 target spend to 24,125 for s fore, combining the insights from Table 6 and Figures 7
$3,000 target spend. For ORTB the corresponding val- and 8, we can conclude that APS provides the impro-
ues have increased from 7,827 to 15,944. However, the visation to identify and bid the bid requests at a lower
distribution of bid prices has not changed significantly. bid price but achieve higher conversion rates.
The bid prices in ORTB range from $3.50 to $3.75; for We have further analyzed the bidding behavior with
Mcpc the bid prices are around $4.00; for Lin bids respect to apps that have a higher numbers of bid
they are around $4.50; and for the exact approach, requests and apps that have lower average winning bid
they are around $0.25. For APS, the average bid prices prices. For that, we followed two techniques—Type (1)
have increased from $0.25 for a $1,000 target spend to and Type (2)—and plotted the graphs in Figure 9.
Type (1): Sort the apps in descending order based
Figure 6. (Color online) TSPC vs. Target Spend in Different
on the number of bid requests received and select the
Approaches top 50 apps that have higher numbers of bid requests.
Then, based on these selected 50 apps:
1.0
Figure 9(a). Compute the percentage of the bid
0.9 requests selected to bid, out of the total number of bid
0.8
requests selected to bid in each approach.
APS The exact approach
Figure 9(b). Compute the percentage of conver-
0.7 ORTB Mcpc
sions that occurred, from the total number of conver-
Lin The greedy approach
0.6 sions occurring in each approach.
TSPC

0.5
Type (2): Sort the apps in ascending order based on
the average winning bid price and select the top 50
0.4
apps that have lower average winning bid prices. Then,
0.3 based on these selected 50 apps:
0.2
Figure 9(c). Compute the percentage of the bid
requests selected to bid, from the total number of bid
0.1 requests selected to bid in each approach.
0 Figure 9(d). Compute the percentage of conver-
1,000 1,500 2,000 2,500 3,000 sions that occurred, from the total number of conver-
Target spend ($) sions occurring in each approach.
Adikari and Dutta: New Approach to Real-Time Bidding in Online Advertisements
16 INFORMS Journal on Computing, Articles in Advance, pp. 1–17, © 2019 INFORMS

Figure 7. (Color online) Number of Bid Requests Selected by APS, ORTB, Lin, and Mcpc, to Bid for Different Bid Prices
(a) $1,000 Target spend (b) $2,000 Target spend (c) $3,000 Target spend
12 12 12
APS ORTB
# of bid requests selected

10 Mcpc 10 10
Lin

8 8 8
to bid '000

6 6 6

4 4 4

2 2 2

0 0 0
0 2 4 6 8 10 0 2 4 6 8 10 0 2 4 6 8 10
Bid price ($)

Figure 8. (Color online) Number of Bid Requests Selected by APS, the Exact Approach, and the Greedy Approach to Bid for
Different Bid Prices
(a) $1,000 Target spend (b) $2,000 Target spend (c) $3,000 Target spend
50 50 50
45 APS 45 45
# of bid requests selected

40 The exact approach 40 40


35 The greedy approach 35 35
to bid '000

30 30 30
25 25 25
20 20 20
15 15 15
10 10 10
5 5 5
0 0 0
0 2 4 6 8 10 0 2 4 6 8 10 0 2 4 6 8 10
Bid price ($)

Per Figure 9(a) and (b), in the midst of all the related approach, APS performs much better in both types
work approaches (ORTB, Lin, and Mcpc), there is no of apps; because the greedy approach does not have
significant percentage difference in the total selection any selection criteria for apps, it bids for bid requests
of bids and conversions occuring in Type (1) apps. based on the first-in-first-served basis. According to
However, when the apps are selected from Type (2) this, when the apps are selected to bid, APS gives
apps, as shown in Figure 9(c), APS bids for a higher higher priority to the apps with lower winning bid
number of bid requests than other approaches. Accord- prices rather than the apps with a higher number of
ing to Figure 9(d), APS also achieves a higher number bid requests. In this process, the bidding constraints
of conversions from the Type (2) apps. APS will specif- such as lower winning bid prices, higher conversion
ically select the bid requests only from the apps that rates, and the relevant target audience are optimized
have a lower average winning bid price and a higher meticulously to achieve the target goals. In summary,
number of bid requests. Compared to the greedy we can emphasize that to achieve target goals, the APS

Figure 9. (Color online) Target Spend vs. the Percentage of the Number of Bid Requests Selected to Bid (a) and (c) and Target
Spend vs. the Percentage of the Number of Conversions Which Occurred (b) and (d)
(a) (b) (c) (d)
100 100 100 100
No. of bid requests

No. of bid requests

No. of conversions
No. of conversions
selected to bid (%)

90 90 90 90
selected to bid (%)

occured (%)
occured (%)

80 80 80 80
70 70 70 70
60 60 60 60
50 50 50 50
40 40 40 40
1,000 1,500 2,000 2,500 3,000 1,000 1,500 2,000 2,500 3,000 1,000 1,500 2,000 2,500 3,000 1,000 1,500 2,000 2,500 3,000
Target spend ($) Target spend ($) Target spend ($) Target spend ($)

APS ORTB Mcpc Lin The exact approach The greedy approach
Adikari and Dutta: New Approach to Real-Time Bidding in Online Advertisements
INFORMS Journal on Computing, Articles in Advance, pp. 1–17, © 2019 INFORMS 17

preferably selects higher-quality apps compared to the Emarketer.com (2014) Driven by Facebook and Google, Mobile
apps with a higher quantity of bid requests. Ad Market Soars 105% in 2013. Accessed July 5, 2015, http://
www.emarketer.com/Article/Driven-by-Facebook-Google-Mobile
-Ad-Market-Soars-10537-2013/1010690#sthash.WWVavkBo.dpuf.
8. Future Work and Conclusion Friedman M (1937) The use of ranks to avoid the assumption of
normality implicit in the analysis of variance. J. Amer. Statist.
Even though the APS approach outperforms the exist-
Assoc. 32(200):675–701.
ing approaches, it has a few limitations as well. First, it Ghosh A, McAfee P, Papineni K, Vassilvitskii S (2009) Bidding for
is primarily developed following the characteristics of representative allocations for display advertising. Internat. Work-
the target audience; it should be enhanced so that it can shop Internet Network Econom. (Springer, Berlin), 208–219.
Hegeman J, Yan R, Badros GJ (2013) U.S. Patent Application No.
observe other vital characteristics like device type, type 13/294,094.
of network connection, location, mobile platform, etc. IAB (2014) Openrtb api specification version 2.2. Accessed
Second, the proposed solution does not emphasize any August 5, 2015, http://www.iab.net/media/file/OpenRTBAPI
mechanism to increase the conversions from the mobile SpecificationVersion2_2.pdf.
Inmobi (2016) Inmobi. Accessed August 5, 2015, http://www
user perspective, where it only facilitates selection of .inmobi.com/.
the best applications that have higher conversion rates. Karande C, Mehta A, Srikant R (2013) Optimizing budget con-
Since this is one of the crucial issues that online adver- strained spend in search advertising. Proc. Sixth ACM Internat.
tising is facing now (Lee et al. 2013), there is a need for Conf. Web Search Data Mining (ACM, New York), 697–706.
King M, Mercer A (1985) Problems in determining bidding strate-
more focused research on bridging this gap. gies. J. Oper. Res. Soc. 36(10):915–923.
In conclusion, this study not only created a model Kohno T, Broido A, Claffy KC (2005) Remote physical device finger-
that successfully attains a higher number of conver- printing. Dependable Secure Comput., IEEE Trans. 2(2):93–108.
sions for a lower target spend; it has also provided a Lee KC, Jalali A, Dasdan A (2013) Real time bid optimization with
smooth budget delivery in online advertising. Proc. Seventh Inter-
theoretical lens that has proved empirically that using nat. Workshop Data Mining Online Advertising (ACM, New York),
the DP approach would provide a simple but effec- Article 1.
tive mechanism to achieve higher efficiency in a bid- Lee KC, Orten B, Dasdan A, Li W (2012) Estimating conversion rate
in display advertising from past performance data. Proc. 18th
ding process. The comparison with other approaches
ACM SIGKDD Internat. Conf. Knowledge Discovery Data Mining
provides clear insight into the proposed approach’s (ACM, New York), 768–776.
performance, and we have demonstrated how the Lew A, Mauch H (2006) Dynamic Programming: A Computational Tool,
parameters such as bid period and target spend can vol. 38 (Springer, Berlin).
Li X, Guan D (2014) Programmatic buying bidding strategies with
affect the performance of the approach. Additionally, win rate and winning price estimation in real time mobile
because of the outstanding app selection strategy and advertising. Pacific-Asia Conf. Knowledge Discovery Data Min-
cost-effective budget allocation and bid price adjust- ing (Springer International Publishing, Cham, Switzerland),
ment strategies, APS outperforms other benchmark 447–460.
Mack GA, Skillings JH (1980) A Friedman-type rank test for main
approaches. To sum up, the proposed DP approach effects in a two-factor ANOVA. J. Amer. Statist. Assoc. 75(372):
addresses the dynamism inherited in the RTB process 947–951.
with a novel and compelling solution, while embed- Mobilewalla.com (2016) Mobilewalla. Accessed May 3, 2016, https://
ding autonomous bidding decisions into the RTB in www.mobilewalla.com/.
Mopub (2016) MoPub—Powerful app monetization. Accessed May 3,
advertising. 2016, http://www.mopub.com/.
OpenX (2016) OpenX: Programmatic advertising. Accessed May 3,
References 2016, http://openx.com/.
Adikari P, Dutta K (2015a) Auto pricing strategy for real time bid- Perlich C, Dalessandro B, Hook R, Stitelman O, Raeder T, Provost F
ding in advertising: A dynamic programming approach. Inter- (2012) Bid optimizing and inventory scoring in targeted online
nat. Conf. Inform. Systems (ICIS), Dallas. advertising. Proc. 18th ACM SIGKDD Internat. Conf. Knowledge
Adikari S, Dutta K (2015b) Real time bidding in online digital adver- Discovery Data Mining (ACM, New York), 804–812.
tisement. Internat. Conf. Design Sci. Res. Inform. Systems (Springer Powell WB (2014) Clearing the jungle of stochastic optimiza-
International Publishing), 19–38. tion. INFORMS TutORials in Operations Research (INFORMS,
AdWords G (2015) Google ads. Accessed March 5, 2015, https:// Catonsville, MD), 109–137.
www.google.com.sg/adwords/. Rogers A, David E, Payne TR, Jennings NR (2007) An advanced bid-
Bertsekas DP (1995) Dynamic Programming and Optimal Control, vol. 1 ding agent for advertisement selection on public displays. Proc.
(Athena Scientific, Belmont, MA). 6th Internat. Joint Conf. Autonomous Agents Multiagent Systems
Chaitanya N, Narahari Y (2012) Optimal equilibrium bidding strate- (ACM, New York), 51.
gies for budget constrained bidders in sponsored search auc- Smaato (2016) Smaato. Accessed May 3, 2016, https://www
tions. Oper. Res. 12(3):317–343. .smaato.com/.
Chakraborty T, Even-Dar E, Guha S, Mansour Y, Muthukrishnan S Sun Z, Dawande M, Janakiraman G, Mookerjee VS (2016) The mak-
(2010) Selective call out and real time bidding. Internat. Workshop ing of a good impression: Information hiding in ad exchanges.
Internet Network Econom. (Springer, Berlin), 145–157. MIS Quart. 40(3):717–739.
Chen B, Yuan S, Wang J (2014) A dynamic pricing model for unify- Yahalom S, Stopel D (2013) U.S. Patent Application No. 13/282,489.
ing programmatic guarantee and real-time bidding in display Yuan S, Wang J, Zhao X (2013) Real-time bidding for online advertis-
advertising. Proc. Eighth Internat. Workshop Data Mining Online ing: Measurement and analysis. Proc. Seventh Internat. Workshop
Advertising (ACM, New York), 1–9. Data Mining Online Advertising (ACM New York), Article 3.
Econsultancy (2013) Online advertising survey 2013. Accessed July Zhang W, Yuan S, Wang J (2014) Optimal real-time bidding for dis-
5, 2015, https://econsultancy.com/reports/online-advertising play advertising. Proc. 20th ACM SIGKDD Internat. Conf. Knowl-
-survey. edge Discovery Data Mining (ACM, New York), 1077–1086.

You might also like