Professional Documents
Culture Documents
a r t i c l e i n f o a b s t r a c t
Article history: We consider the portfolio selection problem for a multiperiod investor who seeks to maximize her utility
Received 21 July 2016 of intermediate consumption facing multiple risky-assets and proportional transaction costs in the pres-
Accepted 23 July 2018
ence of return predictability. With the presence of transaction costs, this problem is very difficult to solve
Available online 27 July 2018
even numerically due to the curse of dimensionality. In this paper, we propose first several suboptimal
JEL classification: rebalancing policies that are based on optimizing simple quadratic programs for a mean-variance investor
G11 who faces proportional transaction costs. Then, we propose some feasible rebalancing and consumption
policies that can be easily computed even for many risky assets, for an investor with power utility, based
Keywords: on the proposed suboptimal policies. Finally, we show how to compute upper bounds and use them to
Investment analysis
study how the certainty equivalent losses of consumption, associated when using the approximate poli-
Portfolio optimization
cies, depend on different problem parameters.
Dynamic portfolio choice
Information relaxations © 2018 Published by Elsevier B.V.
https://doi.org/10.1016/j.jbankfin.2018.07.012
0378-4266/© 2018 Published by Elsevier B.V.
132 X. Mei, F.J. Nogales / Journal of Banking and Finance 94 (2018) 131–151
When we add predictability, the case with multiple-risky as- ity. Specifically, these approximate trading strategies are proposed
sets and proportional transaction costs is much more difficult to using the tractable mean-variance framework by G&P and can be
solve, and only a small number of papers deals with this prob- conveniently computed by solving simple quadratic programs.
lem. Balduzzi and Lynch (1999) study the impact of return pre- Our second contribution is to show how to adapt the strate-
dictability on the utility cost and obtain the optimal rebalancing gies to a framework based on a CRRA power utility. To do this,
rules for a single-risky asset case. They show the costs of ignor- we consider an investor who wishes to maximize her CRRA util-
ing predictability can be substantial for a CRRA investor with a ity of intermediate consumption with predictable returns, in the
finite life. Lynch and Tan (2010) analyze numerically the model presence of proportional transaction costs. We compute numeri-
with two risky assets and predictable returns for a multiperiod cally the corresponding upper bounds to the certainty equivalent
CRRA investor who maximizes her power utility of intermediate of the investor and show that the certainty equivalent losses from
consumption. Using numerical dynamic programming, they show using these approximate policies are reasonably small.
that for each state, there is a quadrilateral-shaped no-trade re- Finally, we show that the multiperiod portfolio selection prob-
gion that confines the transaction. Their numerical methods are lem with multiple risky assets in the presence of predictability
based on a grid discretization of the state space, and then their ap- and proportional transaction costs can be tackled through the use
proach could run into the curse of dimensionality with more risky of the duality method developed in Brown et al. (2010) based
assets. Brown and Smith (2011) provide several heuristic trading on information relaxation. This method can be used to compute
strategies for a finite-horizon discrete-time investor facing propor- dual bounds on the optimal value function by introducing proper
tional transaction costs and multiple risky assets in the presence penalty functions. We show these dual bounds can improve signif-
of return predictability. They evaluate the optimality of the pro- icantly the bounds computed when no penalty function is consid-
posed heuristics based on upper bounds obtained through a dual ered.
approach. The dual method based on information-relaxation is ini- Our work is related to Brown and Smith (2011) and
tially developed in Brown et al. (2010) and it provides a technique Mei et al. (2016). Like (Brown and Smith, 2011), we propose some
to construct valid dual bounds for any approximated solution. approximate trading strategies for a multiperiod investor with
The aforementioned papers show that, for a CRRA power util- CRRA power utility, but instead of approximating the dynamic pro-
ity investor facing a small number of risky assets (up to two), the gramming recursion (the continuation value functions) of the pri-
model that incorporates return predictability with transaction costs mal problem, we approximate the primal problem for each period
generally admits only a numerical solution. With more risky assets, with a quadratic program that can handle problems with more
only an approximate solution can be obtained due to the curse of risky assets. In addition, we consider an investor that maximizes
dimensionality. Gârleanu and Pedersen (2013), hereafter G&P, con- her power utility of intermediate consumption while (Brown and
sider a more analytically tractable framework that allows them to Smith, 2011) consider an investor who maximizes her utility of ter-
achieve a closed-form solution for the optimal portfolio policy in minal wealth. Mei et al. (2016) consider a mean-variance investor
the presence of quadratic transaction costs. Specifically, their in- who faces general transaction costs and constant investment op-
vestor maximizes the present value of the mean-variance utility portunity set. For the case with proportional transaction costs, they
of her wealth changes at multiple time periods, has access to un- give closed-form expressions for the no-trade region. In this pa-
constrained borrowing, and faces multiple risky assets with pre- per, we propose the approximate trading strategies based on their
dictable price changes. With the quadratic utility, quadratic trans- analysis on no-trade regions, but we consider a more realistic case
action costs and no portfolio constraints, the model is formulated where there is predictability.
as a linear quadratic control problem which is straightforward to The remainder of this paper is organized as follows. In
solve. However, the quadratic form in G&P model is not able to Section 2, we introduce the dynamic portfolio selection frame-
characterize the trading strategies of an investor who incurs pro- work in the presence of proportional transaction costs and pre-
portional transaction costs. Meanwhile, the mean-variance frame- dictability. In Section 3, we describe our approximate trading poli-
work also ignores consumption, which is a key element in practice. cies for a mean-variance investor and evaluate these approximate
In this paper, we consider a dynamic portfolio-selection prob- strategies under the mean-variance framework. Section 4 describes
lem in a discrete-time, finite-horizon setting. In our model, how to adapt these approximate strategies to a CRRA power utility
the investor maximizes her expected CRRA utility of interme- framework and evaluates numerically these strategies. The evalua-
diate consumption. We further assume that she faces mul- tion is based on information relaxations that allow to obtain dual
tiple risky assets with predictable returns and without con- bounds. Section 5 concludes. Appendix A contains the figures and
straints on borrowing, and incurs proportional transaction costs. tables. Appendix B contains the derivation of the aim portfolio,
We propose several approximate trading strategies that are based which is a weighted average of the current Markowitz portfolio
on solving simple quadratic programs and evaluate the sub- and the expected future Markowitz portfolios. Appendix C contains
optimality of these strategies through the dual approach proposed the derivation of the penalty function. In Appendix D we describe
by Brown et al. (2010). In order to propose these approximate how to approximate the consumption for at each period for the
strategies, we first approximate our model for a CRRA power util- model with transaction costs.
ity investor with the same mean-variance problem considered in
G&P. But instead of considering a model with an infinite invest-
ment horizon and quadratic transaction costs, we considered a 2. General framework
more realistic framework with a finite investment horizon and pro-
portional transaction costs. We then find some approximate solu- We now describe the basic portfolio selection problem of an
tions that induce a lower utility loss for the mean-variance prob- investor with a finite life of T periods who needs to decide the
lem. Finally, we adapt these approximate solutions to the CRRA portfolio weights for N risky assets. Assume time is discrete and
framework. Our numerical experiments suggest that these adapted indexed as t = 0, 1, . . . , T with t = 0 being the initial time and
approximate strategies perform reasonably well in practice. t = T being the terminal time. There is also a risk-free asset be-
We make three contributions to the dynamic portfolio-choice ing traded in the market and the risk-free rate Rf is assumed to
and transaction cost literature. Our first contribution is to provide be constant over time. Let Pt, i be the price of asset i at time t,
several approximate trading strategies for a mean-variance utility from time t − 1 to t, the risky asset returns are stochastic and de-
Pt,i
investor who faces proportional transaction costs and predictabil- noted by Rt = [Rt,1 , . . . , Rt,N ], where Rt,i = Pt−1,i ≥ 0 is the gross re-
X. Mei, F.J. Nogales / Journal of Banking and Finance 94 (2018) 131–151 133
turn on asset i. Based on the asset returns up to t, the investor risky-asset weight is the ratio of these two, the transaction costs
then determines the decision vector xt = [xt,1 , . . . , xt,N ], where xt, i then becomes
is the weight of the ith asset held in period t. Throughout this
xt−1,+ − νt
manuscript, we will use xt to denote the sequence of decision vari- T˜Ct =
K xt −
.
(3)
ables [x0 , . . . , xt−1 ] and x to denote the full sequence of decisions 1 − ct − Lc,t
1
[x0 , . . . , xT −1 ].
We assume there is no margin requirements imposed for bor- We set τi,c = κi so that the cost of liquidating the entire holding of
rowing on margin or shorting. Moreover, we will focus on the spe- risky asset i equals the cost of rebalancing the risky-asset holding
cial case where transaction cost for each period is proportional to to zero. In this sense, our τ i, c choice balances the cost of liquidat-
the amount of trade. This type of transaction cost is realistic to ing the risky assets with the cost of rebalancing.
model small trades, where the transaction costs come from the Let Ct denote the total consumption, that is Ct = ct Wt . We as-
bid-ask spread and other brokerage fees. Let xt,+ be the vector of sume that the investor’s objective is to maximize the expected util-
allocation to the risky assets inherited from period t, that is ity of the intermediate consumption over all the periods:
xt · Rt+1
T −1
xt,+ = , max E0 ρ t+1Ut (Ct ) , (4)
R p,t+1 {ct ,xt }t=0
T −1
t=0
where · denotes the component-wise product and R p,t+1 is the
portfolio return defined as where ρ ∈ (0, 1) is the discount factor and Ut is the power utility
function
R p,t+1 = xt Rt+1 + [1 − xt e − K (xt − xt−1,+ )1 ]R f ,
1 −γ
Ct −1
with e a vector of ones with length N. The term K (xt − xt−1,+ )1 Ut = , (5)
is the proportional transaction costs that the investor incurs. 1−γ
Here K is a N × N diagonal matrix with elements in the di- with relative risk-aversion parameter γ ≥ 1. In (4), E0 ( · ) denotes
agonal diag(K ) = [κ1 , · · · , κN ]. Each κ i denotes the proportional the conditioned expectation on the information at the beginning
transaction-cost rate parameter for asset i. Following Lynch and of initial period.
Tan (2010), we assume the following law of motion for investor’s Moreover, let {Ft }t=0 T −1
denote the filtration generated by the
wealth in the presence of transaction costs: risky asset returns and other state variables. We let each Ft rep-
Wt+1 = Wt (1 − ct )R p,t+1 , (1) resents the set of events that describes the investor’s state of in-
formation at the beginning of period t, and further we require
where Wt is investor’s wealth at t, and ct is the fraction of Ft ⊆ Ft+1 for all t < T so the investor does not ignore the past.
wealth consumed at t. Similar to Balduzzi and Lynch (1999); Let P (u ) ⊆ P0 × · · · × PT −1 denote the set of all feasible action
Lynch (2001) and Lynch and Tan (2010), the above law of mo- sequences x = [x0 , · · · , xT −1 ] for any given scenario of state vari-
tion indicates that the transaction cost at t is Wt (1 − ct )K (xt − able u. We assume each xt satisfies the nonanticipativity constraints
xt−1,+ )1 and is paid by costlessly liquidating the risk-free asset. such that the decision choice xt for each period t must depend only
Moreover, it also implicitly assumes that consumption at each pe- on the information known at the beginning of t, and we denote
riod is obtained by liquidating the ith risky assets and the risk-free by PF the set of nonanticipative feasible policies. With all of these
asset in the proportions xi,t−1,+ and (1 − xt−1
,+
e ).2 definitions, we can now introduce the proposed dynamic portfolio-
When rebalancing rules may be sensitive to the cost differen- choice problem:
tial associated when using the risky assets rather than the risk-
less one for consumption, we can introduce the proportional cost
T −1 1 −γ
Ct
of τ i, c to liquidate the risky asset i for consumption, following the max E0 ρ t+1 (6)
(c,x )∈PF 1−γ
procedure described in Lynch and Balduzzi (20 0 0). Letting Lc, t be t=0
R˜ p,t+1 = xt Rt+1 + (1 − xt e − T˜Ct − Lc,t )R f , (2) In the presence of liquidation costs for consumption, we replace
N R p,t+1 with R˜ p,t+1 in (7).
where the liquidation costs for consumption Lc,t = i=1 τi,c νi,t , and Note in the proposed model, the investor has access to uncon-
ν i, t ≥ 0 is the component of consumption at t obtained by liquidat- strained borrowing to facilitate the adaption of the approximate
ing the risky asset i. Let νt = [ν1,t , · · · , νN,t ] be the vector formed strategies from mean-variance.3 Several comments are in order.
by the component of consumption at t. Since we assume the liqui- When we allow the presence of predictability, the proposed opti-
dation cost can be paid costlessly from the money market account, mization problem can be formulated as a stochastic dynamic pro-
then νi,t ≤ xt−1,+ . Note that the pre-rebalancing portfolio risky- gram where the state variables include the current wealth level,
asset weights at t now depend on the consumption decision at t. the portfolio position and the market state variables. Note that for
After consumption, the holding of risky assets can be expressed by a power utility, the wealth level Wt can be factored out for each
the vector xt−1,+ − νt and the portfolio value is 1 − ct − Lc,t , each period and if St denotes the vector of state variables, the Bellman
expressed as a fraction of wealth Wt . Because the pre-rebalancing equation for each period faced by the investor can be written as:
2
The liquidation for consumption may be costless given the availability of money
3
market bank accounts and given that equities pay dividends. For the case when the We can also consider the case when the investor faces constraints on borrowing
sum of risky assets dividends exceeds the consumption out of the risky asset, a or when margin requirements are imposed for borrowing on margin or shorting.
dividend reinvestment plan can be used to costlessly reinvest the excess dividend In this case, we simply need to add the corresponding margin constraints to the
in the risky asset. optimization model.
134 X. Mei, F.J. Nogales / Journal of Banking and Finance 94 (2018) 131–151
and from (13), we can define the value functions for previous pe- opportunity set, Mei et al. (2016) show the optimal policy is to
riods recursively according to the Bellman equation: trade to an aim portfolio at each period, similar to the result
γ derived in Gârleanu and Pedersen (2013).
Vt (xt−1 , μt ) = max ρ t+1 (xt μt − xt xt ) With proportional transaction costs, both Calafiore (2009) and
xt ∈Pt (xt−1 ) 2
Skaf and Boyd (2010) use linear decision rules to address dynamic
−ρ t K (xt − xt−1 )1 + Et [Vt+1 (xt , μt+1 )], portfolio choice problems without return predictability. They show
for t = 0, · · · , T − 1. Although the numerical solution is also diffi- through examples that, imposing affine restrictions on the decision
cult to obtain when there are more than two risky assets, we only variables, they can largely improve the flexibility and can reduce
need to track the wealth change at each period instead of tracking the complexity of the solution, at the cost of a reasonable loss of
the evolution of total wealth under the G&P framework. Moreover, optimality. With predictability, Moallemi and Sağlam (2017) pro-
unlike the model with power utility, the focus on price changes pose a class of linear rebalancing rules for dynamic portfolio op-
implies that there is no need to track the risky-asset price evo- timization problems that allow for various types of transaction
lution. Hence, instead of considering the power utility framework, costs and also trading constraints. Besides computational tractabil-
in this section, we propose trading strategies based on the G&P ity, they show through an example of portfolio execution that the
framework, and later, in Section 4 we propose to adapt them back linear rebalancing policies can achieve near optimal performance.
to the CRRA utility framework. For those reasons, with proportional transaction costs, we pro-
pose to consider a linear rebalancing approximation, similar to that
3.2. Approximate trading strategies of the model with quadratic transaction costs: the investor has an
aim portfolio at each period, and she chooses to trade partially to
To avoid the difficulties we may find when solving the port- this aim portfolio, following the linear policy in Gârleanu and Ped-
folio optimization model with predictability and transaction costs, ersen (2013). Specifically, we define the aim portfolio for period
we propose several trading strategies to approximate the optimal t < T as
trading strategy in (12). These strategies provide a first step ap- aimt = z Markowitzt + (1 − z )Et (aimt+1 ), (15)
proximation for the suboptimal trading strategies of model (6)–(8). √
For simplicity, and following G&P framework, we assume that the γ −γ ρ −λ(1−ρ )+ (γ ρ +λ(1−ρ ))2 +4λγ ρ 2
where z = γ +c and c = 2ρ . At t =
dynamic of price changes follows the model specified in (10) and
T − 1, the aim portfolio is the optimal portfolio in the absence of
(11). Analogously, the same procedure can be applied to derive the
transaction cost, that is,
approximate strategies for the case with a different predictability
model. aimT −1 = (γ )−1 μT −1 .
Finally, the proposed linear policy is:
3.2.1. Simple policy
First, we consider a deterministic approximation that ignores c c
xt = (1 − )xt−1 + aimt . (16)
model predictability and simply follows the optimal trading strat- λ λ
egy recommended by a deterministic model. In this model, the In Gârleanu and Pedersen (2013), λ denotes the quadratic transac-
investor ignores the innovations in the predictability model. The tion cost parameter. Since we use (16) as an approximation to the
model becomes a deterministic problem which can be solved us- model with proportional transaction costs, we propose to calibrate
ing quadratic programming: λ such that the total utility is maximized. With the dynamics of
T −1
price changes specified in (10) and (11), the aim portfolio for each
γ
max ρ t+1 (xt μ˜ t − xt xt ) − ρ t K (xt − xt−1 )1 , (14) period is
{xt }t=0
T −1 2
aimt = (γ )−1 Ar + Br ad gt (bd ) + bTd −1−t dt
t=0
where r̄ p,c and σ̄ 2 are the mean and variance of portfolio return, Table A1
Certainty-equivalent Returns - The Base Case. This table shows the
respectively, net of proportional transaction costs of each sample
certainty-equivalent returns for an investor with objective (12) based on
path. Moreover, we also compute the portfolio turnover to get a the proposed approximate policies. Column 2–5 show the returns ob-
sense of the amount of trading required to implement each port- tained based on the simple policy(S-P), the linear policy(L-P), the no-trade
folio strategy. The turnover is defined as the average sum of the region policy(NTR-P) and the rolling optimize-and-hold policy(ROH-P) re-
absolute value of the trades across the N available assets: spectively. The last column denotes the upper bounds to the return ob-
tained based on perfect hindsight solution. The values of the returns are
T −1 N
1 reported in the first row and the second row shows the correspond-
Turnover = (|xi,t − xi,t−1 | ). ing gaps respect to the upper bounds (in %). The gaps are computed by
T (Ubounds − Uheuristic )/Ubounds . The Third row gives the turnover of each pol-
t=0 i=1
icy.
We assume the predictability models for dividend yield and
Policies S-P L-P NTR-P ROH-P Bounds
price changes are given by (10) and (11) and that the initial div-
idend yield is neutral (i.e., d1 = 0). Finally, we assume that the in- CEQ 0.0083 0.0086 0.0089 0.0089 0.0091
vestor has an initial wealth of 1 dollar. With these assumptions, CEQ Loss (in %) 7.95 3.89 0.92 1.49
Turnover 0.0461 0.0867 0.0505 0.0504 0.0427
the absolute risk-aversion parameter under the mean-variance
framework, described in Section 3.1, is equivalent to the relative
risk-aversion parameter under the power utility framework.
As an illustrative example, we consider a model with two risky Finally, for the no-trade region policy and the rolling optimize-
assets,6 following the same example in Lynch and Tan (2010), and-hold policy, the center of the no-trade region can be calculated
where we consider the 12-month dividend yield on the value- based on the conditional mean at each period.
weighted New York Stock Exchange (NYSE) index as a proxy for For the base case scenario, the certainty-equivalent returns for
the predictive variable D. For the first risky asset, we consider the feasible policies and the associated upper bounds are reported
the monthly rate of return on the value-weighted NYSE index, in Table A.1. It gives the certainty-equivalent returns obtained
while for the second risky asset we consider the high BM port- based on the simple policy (S–P), the linear policy (L–P), the no-
folio which is formed from the 6 value-weighted portfolios SL, trade region policy (NTR-P) and the rolling optimize-and-hold policy
SM, SH, BL, BM and BH.7 The parameters for the predictabil- (ROH-P), respectively. The last column of this table denotes the up-
ity model in (10) and (11) are estimated using ordinary least per bounds to the certainty-equivalent returns obtained based on
squares (OLS) with Ar = [0.83; 0.54], Br = [0.47; 0.30], ad = 0, bd = a perfect hindsight solution. The values of the certainty-equivalent
0.98, the state covariance matrix for both risky assets is = returns, reported in the first and second rows, show the corre-
[0.0 054 0.0 037; 0.0 037 0.0 030] and volatilities for both assets are sponding gaps (certainty-equivalent return losses) which are com-
σ1 = 7.37% and σ2 = 5.52%, respectively. The risk-free rate is esti- puted by (C Ebounds − C Eheuristic )/C Ebounds . We observe that the return
mated by mean the of 1-month T-bill rate over this period, which loss associated when adopting the simple policy (that is, the rela-
gives a value of 0.056%. tive difference between the certainty equivalent return of the sim-
ple policy and the upper bound obtained from perfect hindsight
3.3.1. The base case scenario solution (22)) is as much as 7.95%. This return loss is relatively
For our base case scenario, we consider a time horizon of high because for the simple policy, the existence of predictabil-
6 months, that is T = 7. We assume an absolute risk-aversion ity is completely ignored in the model, hence the trading strategy
parameter γ = 5, a matrix of proportional transaction cost K = at each period lacks information updated from previous periods,
[0.0 050 0; 0 0.0 050], an annual discount rate ρ = R1 , and that the which results in a high return loss.
f
Regarding the linear policy, which relies on both the previous
investor starts with holding zero shares in both assets.
stage portfolio and an aim portfolio, we find that the certainty-
For the simple policy, the expectation is μ ˜ t = E0 (μt ) = Ar +
equivalent loss is over 3.8%, which is also high compared with
Br bt−1
d
d1 , where the innovation term t+1 has zero mean and vari-
NTR-P and ROH-P. The result indicates that for the linear policy,
ance 1 − b2d . Note that the higher bd , the lower volatility in dt .
the strategy penalizes transactions in terms of a quadratic form.
For the linear policy, the value of λ is calibrated such that it pro-
With proportional transaction costs, the investor is motivated to
vides the maximum value of total utility. Fig. A.1 depicts the total
make more transactions compared to the other strategies, which
utility of the linear policy for different values of δ when the other
can also be verified by the high turnover given in Table A.1.
model parameters are fixed. In Gârleanu and Pedersen (2013), the
The no-trade region policy, on the other hand, outperforms the
value of λ is calibrated for each asset, which turns out to be
other proposed strategies with an associated return loss less than
around 10−7 . In our example, we change the value of λ from 0 to
0.92%. This result indicates that, compared with the simple policy,
10, with the parameters specified above, and hence the total utility
the no-trade region policy takes the aim portfolio as the center
varies from 4.53 × 10−3 to 5.19 × 10−3 , and reaches its maximum
of the no-trade region, using updated information from previous
around λ = 5.28.8
period; while compared with the linear policy, the existence of a
no-trade region confines the amount of transaction, implying then
6
Our approximate trading strategies can also be applied to the case with many a more conservative trading strategy.
risky assets using a similar procedure: evaluate the approximate strategies by dis- Finally, the return loss associated when adopting the rolling
cretizing the state variable space for each of the assets. An example with more risky
optimize-and-hold policy is 1.19%, which also shows a low loss com-
assets is given in Section 4.3.3.
7
Following Lynch and Tan (2010), the low and high BM portfolios are formed
pared to the simple policy and the linear one, and a slightly higher
from the 6 value-weighted portfolios SL, SM, SH, BL, BM, and BH from Fama and loss than that of the no-trade region policy. This shows that, con-
French (1993) and Davis et al. (20 0 0). The notation S(B) indicates that the firms in fining transactions with a no-trade region that updates periodically
the portfolio are smaller (larger) than 50% of NYSE stocks. The notation L indicates helps to correct the over-trading in the linear policy. Nevertheless,
that the firms in the portfolio have BM ratios that place them in the bottom 3
assuming no trades until the last period may cause large deviation
deciles for all stocks; analogously, notation M indicates the middle 4 deciles and H
indicates the top 3 deciles. The high BM portfolio is an equal-weighted portfolio of
SH and BH.
8
We consider monthly asset returns with unconditional mean [0.83%; 0.54%]. around 10−3 over all the sample paths. Alternatively, if we annualize the parame-
Starting with [0; 0], this policy indicates a movement towards the aim portfolio ters, the corresponding total utility changes from 0.112 to 0.132, indicating a more
at each period which does not vary significantly, resulting in an average total utility significant change.
138 X. Mei, F.J. Nogales / Journal of Banking and Finance 94 (2018) 131–151
Fig. A1. Utility of Linear Policy depending on λ This figure shows the utility of the linear policy for different values of λ. In this figure, we consider proportional transaction
costs parameter κ = 0.005, a discount factor R1 , a risk-aversion parameter γ = 5, and an investment horizon T = 7.
f
from the true center of the no-trade region, this strategy obtain First, we find that regardless of the value of bd , the certainty-
higher losses than those in NTR-P. equivalent returns decrease monotonically for all policies as κ in-
Fig. A.2 shows the trading trajectories of the proposed strate- creases. This is intuitive since the same amount of transaction may
gies and the perfect insight policy. It shows that the simple policy result in lower return when κ is higher. Moreover, we find that
deviates highly from the rest of the policies due to the lack of in- the relative return loss associated with all the approximate poli-
formation update. The linear policy tends to trade closely to the cies does not depend on the risk-aversion parameters.
aim portfolio aimt at each period, resulting in a higher turnover Regarding each policy, we observe the certainty-equivalent
compared to other strategies. Finally, the no-trade region policy and return loss associated with ignoring uncertainty in the predictabil-
the rolling optimize-and-hold policy are confined by the same no- ity model (i.e., the simple policy) is high for all the considered
trade region in the last period, with the latter being slightly more parameters especially when bd is small (i.e., high volatility in
conservative in terms of turnover. dividend yield). Taking into account that a higher bd indicates a
lower volatility in the dividend yield, we observe the return loss
in the simple policy decreases dramatically when we increase
3.3.2. Comparative statics for other scenarios
bd from 0.75 to 0.98. For example, for an investor with γ = 5
We analyze empirically how the certainty-equivalent return
and κ = 0.0050, the return loss reduces from 22.62% to 7.95%.
loss associated with all the proposed policies depend on the
The main reason is that the conditional mean μ ˜ t in the simple
transaction-cost parameter, the risk-aversion parameter and the
policy can approximate future means better when there is lower
slope in the predictability model for the dividend yield. Table A.2
volatility in the dividend yield. Moreover, we also find the return
reports the return loss considering an investing horizon of T = 6
loss associated with the simple policy decreases as κ increases.
months and Table A.3 shows the corresponding turnover.
X. Mei, F.J. Nogales / Journal of Banking and Finance 94 (2018) 131–151 139
Table A2
Certainty-equivalent Returns: mean-variance utility. This table shows the certainty-
equivalent returns of an investor with objective function (12) based on the proposed
heuristic policies. The first three columns give the values of bd , the risk-aversion pa-
rameter γ and the proportional transaction costs parameter κ . Columns 4–7 show
the returns obtained based on the simple policy(S-P), the linear policy(L-P), the no-
trade region policy(NTR-P) and the rolling optimize-and-hold policy(ROH-P), respec-
tively. The last one column shows the upper bounds to the return obtained based
on perfect hindsight. In each row, the value of CEQ returns and the corresponding
gap (in %) with respect to the upper bound is reported.
0.75 3 0.0020 0.0895 0.2017 0.1639 0.1439 0.1430 4.1.1. Approximating consumptions
0.0050 0.0844 0.1479 0.1231 0.1155 0.1340 We consider two proposals to approximate the consumption.
0.0100 0.0758 0.1470 0.0920 0.0863 0.1211
The first proposal assumes that the investor’s consumption to
5 0.0020 0.0537 0.1210 0.0984 0.0864 0.0858
0.0050 0.0506 0.1008 0.0763 0.0693 0.0804 wealth ratio for each period, ct , is given by the model without
0.0100 0.0455 0.1007 0.0572 0.0518 0.0727 transaction costs. Note that in the presence of transaction costs,
7 0.0020 0.0384 0.0865 0.0703 0.0618 0.0613 the optimal consumption is no longer affine in total wealth, which
0.0050 0.0362 0.0788 0.0557 0.0495 0.0574
is in contrast to the model without transaction cost. However,
0.0100 0.0325 0.0787 0.0418 0.0370 0.0519
0.98 3 0.0020 0.0951 0.1530 0.0991 0.0986 0.1066
Sefton and Champonnois (2015) show that the investor consumes
0.0050 0.0900 0.1420 0.0836 0.0833 0.0984 at a constant rate along the optimal path in the limit case.9 More-
0.0100 0.0814 0.1252 0.0719 0.0719 0.0864 over, given risky asset return dynamics in (10) and (11), the opti-
5 0.0020 0.0571 0.0918 0.0595 0.0592 0.0639 mal consumption in the absence of transaction costs can be easily
0.0050 0.0540 0.0852 0.0502 0.0500 0.0591
computed numerically using dynamic programming by discretizing
0.0100 0.0488 0.0798 0.0432 0.0431 0.0518
7 0.0020 0.0408 0.0656 0.0425 0.0423 0.0457 the market state variables.
0.0050 0.0386 0.0609 0.0359 0.0358 0.0422 Note that this approximation is different from the idea of con-
0.0100 0.0349 0.0590 0.0309 0.0309 0.0370 sidering a simple investment policy approximation, where the op-
timal portfolio weights are taken from the problem without trans-
action costs. First, the simple trading strategy often does not per-
3. For the no-trade region policy, it performs constantly well for form well since it leads to continuously rebalancing which re-
all the cases, where the associated return losses are relatively sults in high transaction costs. Second, there is extensive literature
low. The associated return losses for all cases is relatively low. showing the portfolio weight of each asset belongs to a no-trade
It can be used as a robust approximation for the optimal solu- region policy in the presence of proportional transaction costs,
tion in the presence of proportional transaction cost and pre- but no such region applies to investor’s optimal consumption. Fi-
dictability. The gap between its certainty-equivalent return and nally, as pointed out by Liu (2004), the optimal consumption for
the upper bound is small for all the cases. the model with proportional transaction costs of an investor with
4. For the rolling optimize-and-hold policy, it is again a robust ap- CARA preference is affine in the dollar amount invested in the
proximation for the optimal solution in the presence of propor- risk-free asset and nonlinear in the dollar amounts in risky as-
tional transaction costs and predictability. It has lower turnover sets. Hence, considering a simple consumption policy is equivalent
compared to the no-trade region policy, and the gap between to approximating a nonlinear function with a linear one within a
its certainty-equivalent return and the corresponding upper small interval. That implies, the assumed constant consumption to
bound is again small for all the cases. wealth ratio is still a conservative and reasonable choice.
The second proposal to approximate the optimal consumption
is based on the Bellman equation an investor has to solve at each
4. The CRRA framework
period:
We now adapt each of the approximate trading strategies, pro-
posed for the mean-variance problem in Section 3, to the frame- φt (St , xt−1,+ )1−γ
work with CRRA power utility. Campbell and Viceira (2003) show 1−γ
that when the risky asset returns are lognormal, the portfolio c 1 −γ (1 − ct )1−γ
1−γ 1−γ
choices resulting from the power utility and mean-variance frame- = max t
+ E φt+1 St+1 , xt,+ R p,t+1 .
(ct ,xt )∈Pt (xt−1 ) 1 − γ 1−γ
works are consistent. In the absence of transaction costs, the in-
vestor trades off mean against variance for a single period in both (23)
cases. In the presence of transaction costs, Mei et al. (2016) show
that the certainty equivalent loss from adapting the mean-variance For a discretized state variable space at each period, the approx-
framework is typically smaller than 0.5% for the case with a con- imate investment strategies for each sample path can be com-
stant investment opportunity set. puted and stored. Starting from the last
period where φT −1 = 1,we
In this section, we consider an investor who maximizes her
1−γ 1−γ
compute the expectation term bt : E φt+1 St+1 , xt,+
ˆ ˆ Rˆ at p,t+1
CRRA utility of intermediate consumption by investing in a risk-
free asset and N risky assets in the presence of predictability, and each node based on the discrete approximation, where φˆ and
subject to proportional transaction costs (i.e, with preferences (6)– Rˆ p,t+1 denote the values obtained based on the approximate in-
(8)). In Section 4.1, we obtain several approximate solutions for vestment policy. Consequently, we propose to use an approxima-
the portfolio optimization problem (6)–(8) by adapting the pro- tion of the optimal consumption by simply maximizing the follow-
posed feasible policies proposed based on the G&P framework. We
also propose alternative suboptimal consumptions in Section 4.1.1.
9
By aggregating the proposed approximations for consumption and They consider a continuous dynamic portfolio problem of an investor who max-
imizes her welfare from a consumption stream for the case where risky assets re-
trading strategies together, we can then evaluate the performance turns are predictable in the presence of transaction costs. In the limit case, they
of these strategies through upper bounds based on an information show that an investor will consume at rate (R f − ρ )/γ where γ is the absolute
relaxation procedure, introduced in Section 4.2. risk aversion coefficient.
X. Mei, F.J. Nogales / Journal of Banking and Finance 94 (2018) 131–151 141
ing univariate function at each node for every period: account that it is very difficult to solve numerically the original
c 1 −γ (1 − ct )1−γ
model, we complement these proposed feasible policies with up-
max t
+ bˆ t (24) per bounds on the certainty equivalent based on the dual approach
ct 1−γ 1−γ which will be explained in the next section.
which implies
1 4.2. Obtaining upper bounds
cˆt = 1 /γ
. (25)
1 + bˆ t
Therefore, we can obtain the approximated optimal consumptions Given the suboptimal policies for the portfolio optimization
based on a “pseudo-dynamic programming”approach at each node problem (6)–(8), we can evaluate their performance using simu-
for every period. Note when values of state variables are not on lation by generating different sample paths and averaging utilities.
the grid, the consumptions can be obtained using interpolation. These averages provide unbiased lower bounds to the optimal util-
ity U∗ . When considering these suboptimal policies, it would be
4.1.2. Wealth dynamics helpful to know how much better we could possibly do. In other
To simplify the framework adaptation, we assume that the in- words, we will use the optimal utility U∗ to evaluate the optimality
vestor has an initial wealth of $1 invested in the risk-free as- of the suboptimal policies. However, because the optimal utility U∗
set. With this assumption, the absolute risk aversion parameter in is unavailable, we propose to develop an upper bound to U∗ .
model (12) under the mean-variance framework equals the relative One simple way to obtain an upper bound is to consider the
risk aversion parameter in (6)-(8) for the power utility framework. same problem (6)–(8) but to ignore the existence of transaction
f
Hence, for each period, the amount of money invested in risk-free costs. That is, if Vt denotes the optimal value function at t for
asset is problem (6)–(8) in the absence of transaction costs, then
ˆ t (1 − cˆt ) − W
Wt f = W ˆ t (1 − cˆt )K (xˆt − xˆt−1,+ )1 ,
ˆ t (1 − cˆt )xˆt Pt − W
Vt∗ (ct−1 , xt−1 ) ≤ Vt f (ct−1 , xt−1 ),
where cˆt is the approximated consumption to wealth ratio in
where Vt∗ (ct−1 , xt−1 ) denotes the optimal value function at t. How-
Section 4.1.1 and xˆt = xt ./Pt is the number of shares that the in-
ever, Vt∗ generally provides a weak upper bound for U∗ , especially
vestor can hold when the price is Pt instead of $1. Here ./ refers to
when transaction costs are substantial.
the component-wise division of two vectors. The investor’s wealth
ˆ t+1 in each period is the sum of the total holding across the risky Alternatively, if we consider an information relaxation, where G
W
is a relaxation of filtration F defined in Section 2, such that the
assets and risk-free asset, i.e.,
decision maker knows future values of the state variables under G,
W ˆ t (1 − cˆt )xˆt Pt+1 ,
ˆ t+1 = W f R f + W then it can be easily shown that
t
Finally, note that given each feasible policy provides an approxi- where Gt is a relaxation of filtration Ft such that Ft ⊆ Gt in each
mation to the optimal solution for the portfolio optimization prob- period. If Gt denotes perfect information relaxation, the investor
lem with CRRA power utility, the corresponding certainty equiva- determines the actions with full knowledge of future state vari-
lent is clearly a lower bound to that of the true model. Taking into able values. However, also note that it is not practical to obtain Vt∗
142 X. Mei, F.J. Nogales / Journal of Banking and Finance 94 (2018) 131–151
in (29). A natural idea is to approximate the above penalty func- where ∇ I˜t (y˜ t∗ ) denotes the gradient of the reward function for each
tion with a proper Vˆt such that: period with respect to the decision vector ct and xt . Consequently,
T −1
if we solve the following deterministic problem
πˆ (x ) = Vˆt+1 (xt ) − E Vˆt+1 (xt )|Ft . (30) T −1
1 −γ
Ct
t=0 max ρ t+1 − ∇ I˜t (y˜ t∗ )(yt − y˜ t∗ ) , (35)
c,x 1−γ
In this case, the upper bound to the optimal value function V1∗ is t=0
then given by
s.t. Wt+1 = Wt (1 − ct )R p,t+1 ,
(36)
T −1
Vub := E0 max Ut (xt ) − πˆ (c, x ) . (31) ct ≥ 0
x∈P ( u )
t=0 p
in each sample path and let U(m ) be the corresponding optimal
Then, to obtain an unbiased estimate for the previous expecta- M p
m=1 U(m )
tion, we simulate M sample paths of the state variables and dis- objective function for m = 1, 2, · · · , M and U p = M , then U p
turbances, and solve the following maximization problem by as- will be a convenient upperbound to the
utility of heuristic policy
suming the investor knows the future values of state variables in U h compared to Vf and E0 T −1
U ( x ) . Finally, if we consider the
t=0 t t
each sample path,
certainty equivalent
T −1
1
max Ut (xt ) + Et Vˆt+1 (xt ) − Vˆt+1 (xt ) . (32) CE (U p ) = ((1 − γ )U p ) 1−γ ,
x
t=0
then, it is clearly a convenient upper bound to the certainty equiv-
The expected value in (31) is then estimated by taking the average alent provided by the approximate policies.
of the optimal objective function value of problem (32) over all
sample paths. 4.3. Numerical results
Note that in (31), each Ut (xt ) term is generally concave in the
decision variables x, and it leads to convexity in the original DP de- In this section, we analyze empirically the certainty equivalent
fined by (9). However, objective function (32) may not be concave losses associated when adapting the proposed policies, as well as
after taking into account the penalty function, and consequently, how these losses depend on the model parameters. Because we
it cannot be solved conveniently in practice. A solution is to find consider a “two-folded approximation” for the model with inter-
an approximation of Vˆt+1 to guarantee the convexity of (31). For mediate consumption, we evaluate the performance of the pro-
instance, Brown and Smith (2014) consider penalties based on the posed investment rule and suboptimal consumption strategy sepa-
first-order approximation of Vˆt (xt−1 ) to have a convex structure. rately. We first assess the certainty equivalent losses of the subop-
To summarize the procedure, a convenient penalty function can timal investment strategies by considering an investor with CRRA
be constructed by a convex approximation of the approximated utility of terminal wealth. Specifically, the upper bound for the
value function Vˆt . In this paper, we propose a penalty function certainty equivalent with terminal wealth is computed based on
πˆ (c, x ) in (31) such that at each period, the following reward func- the procedure similar to the one described in Section 4.2. Then,
tion is considered: to show how much better each of the heuristic strategies perform
1 −γ compared to the policy that completely ignores transaction costs,
Ct
I˜t (ct , xt ) = ρ t+1 , (33) we also report the certainty-equivalent returns of terminal wealth
1−γ
of an investor considering the frictionless model, but deducting
subject to the law of motion for investor’s wealth which is defined transaction costs from the wealth at each period. Finally, we com-
in (1) but without transaction costs. The reason is that, first, the pare the associated certainty equivalent losses of the heuristic in-
associated utility for the model with reward function I˜t (ct , xt ) is vestment policies with the suboptimal consumption strategy based
greater than that for the original model (that is, it is a relaxation on the model with intermediate consumption.
of the original model). And second, if Vˆt (ct−1 , xt−1 ) is the opti- In all cases, we begin by solving the dynamic program for the
mal value function corresponding to the proposed reward func- model without transaction costs. After repeatedly generating ran-
tions, then it is also an approximation to the optimal function dom sequences of market states and returns, we determine se-
Vt∗ (ct−1 , xt−1 ) of the original model. For those reasons, we can de- quence of trades selected by the proposed heuristics and further
fine the penalty function as adapt them to the CRRA framework in each trial. For purpose of
T −1
simplification, note that the sequence of consumption is also de-
π ( c, x ) = Vˆt+1 (ct , xt ) − Et Vˆt+1 (ct , xt ) . termined based on the model without both transaction costs and
t=0 liquidation costs. We also solve the inner problem (35) in the
To guarantee the convexity, and following Brown and same scenario and calculate the corresponding certainty equivalent
Smith (2014), we consider the first-order linear approxima- of each heuristic. We repeat the simulation process 10 0 0 times.
tion of Vˆt so that the resultant penalty function is linear in the Moreover, we consider two risky assets with the same model pa-
decision variables. If y˜t∗ = (c˜t∗ , x˜t∗ ) denotes the optimal policy for rameters used in Section 3.3 for monthly returns in the base case.
the no-transaction costs problem with reward function (33) and To gauge the robustness of the proposed trading strategies, we
∗
W˜ t the corresponding wealth at each period, we can then obtain then evaluate the performance of those strategies using models
the gradient penalty as follows10 , with longer investment horizon and more risky assets. Finally, to
T −1 assess the impact of liquidation costs for consumption, we com-
π˜ (c, x) = ∇ I˜t (y˜ t∗ )(yt − y˜ t∗ ), (34) pare separately the certainty equivalent consumption and level
t=0 of consumption for the model with and without liquidation cost
when fixing a trading strategy.
10
When we take first order approximation of the approximate value function 4.3.1. The base case scenario
T
Vˆt+1 (ct , xt ), the term which is constant in actions t=1 Vt+1 (y
ˆ ˜t∗ ) − Et Vˆt+1 (y˜t∗ ) is
omitted from the penalty function π (c, x). When we calculate the upper bound for
For our base case scenario, we consider the same parameters
each scenario, we add the realized values for this term after (34), which serves as used in Section 3.3.1. That is, the investor has a relative risk-
a control variate, see Brown and Smith (2014). aversion parameter γ = 5, an initial wealth of $1 invested in the
X. Mei, F.J. Nogales / Journal of Banking and Finance 94 (2018) 131–151 143
Fig. A3. Comparison of heuristic and boundsThis figure shows the comparison of utilities of an investor following a frictionless strategy (the strategy that ignores transaction
costs), a no-trade region approximate trading strategy, and the upper bound obtained based on the gradient-based penalty function. The parameters are the ones specified
in the base-case scenario. .
risk-free asset, faces proportional transaction costs of 50 basis which is consistent with the conclusion in Mei et al. (2016) for the
points for both assets, rebalances her portfolio once per month, model without predictability.
and has an investment horizon of T = 7. Note that with cT = 1, we Regarding the loss associated when considering simultaneously
only need to solve a six period problem for the feasible policies to the suboptimal trading strategies and the linear consumption pol-
evaluate a T = 7 problem. icy, we observe that similar to the model with utility in terminal
For the model with utility of terminal wealth, we find that wealth, the no-trade region policy outperforms the other policies
the certainty equivalent losses associated when adapting the sim- with associated loss around 3.6%, indicating an increased loss in
ple policy (S-P), the linear policy (L-P), the no-trade region pol- percentage caused by the suboptimal consumption policy.11
icy (NTR-P) and the rolling optimize-and-hold policy (ROH-P) are
4.51%, 3.01%, 2.30% and 2.57%, respectively, whereas that of the 4.3.2. Comparative statics for other scenarios
strategy that ignores transaction costs is as high as 7.79%. Fig. A.3 In this section, we analyze empirically how the certainty equiv-
shows the certainty-equivalent comparison between the no-trade alent losses associated when ignoring volatility in the dividend
region policy and the lower and upper bounds for each trial of yield (i.e., with the simple policy), trading linearly (i.e, with the lin-
the simulation. Note the big gap in losses implies the benefits of ear policy), the no-trade region policy and the rolling optimize-and-
considering the heuristic trading strategies rather than completely hold policy depend on the proportional transaction costs rate κ , the
ignoring the existence of transaction costs. risk-aversion parameter γ and the slope of dividend bd .
Moreover, note that these losses can be decomposed in two We examine first the losses associated when adapting subopti-
parts: the loss from considering suboptimal trading strategies and mal investment policies for an investor who maximizes her CRRA
that from adapting mean-variance framework to the CRRA utility utility of terminal wealth. As expected, Table A.4 shows that there
framework. For the first part, note that the insights are similar to are big gaps in certainty equivalent between the strategy based
those in Section 3.3: the no-trade region policy is the best approx- on ignoring transaction costs and the proposed investment strate-
imate feasible solution to the original model among all the poli-
cies, followed by the rolling optimize-and-hold policy. However,
11
the second part of the loss does not increase proportionally to the We also evaluate the certainty equivalent losses of a heuristic-based consump-
first part, indicating a policy with high certainty equivalent loss in tion approximation, resulting in higher losses compared to the consumption policy
recommended by the model in the absence of transaction costs in all cases (be-
mean-variance framework may perform better under power utility.
cause the former policy relies on the proposed approximate strategies, which will
Additionally, the extra loss caused by adapting the mean-variance typically result in a large gap especially when the approximate strategies do not
framework is relatively small when we consider a shorter horizon, perform well in practice.)
144 X. Mei, F.J. Nogales / Journal of Banking and Finance 94 (2018) 131–151
Table A4
Certainty-equivalent wealth: CRRA utility of terminal wealthThis table shows the
certainty-equivalent annualized wealth of an investor with CRRA utility of terminal
wealth based on the proposed heuristic policies for different values of bd , the risk-
aversion parameter γ , and the proportional transaction cost rate κ , which are given in
the first three columns. Columns 4–7 show the certainty equivalent based on the sim-
ple policy(S-P), the linear policy(L-P), the no-trade region policy(NTR-P) and the rolling
optimize-and-hold policy(ROH-P), respectively. The last two columns shows the lower
and upper bounds to the return obtained based on frictionless model and perfect hind-
sight. In each row, the value of the CEQ returns and the corresponding gap with respect
to the upper bound (in %) is reported.
gies. Basically, the former results in very small net returns espe- policy into the model with CRRA utility of intermediate consump-
cially when κ is high due to higher transaction costs compared to tion. Note that the losses in this table can be decomposed into
the suboptimal investment strategies. Similar to the case with the three parts that are not additive: the loss from choosing the sub-
mean-variance utility, there are higher losses associated with the optimal policy under the mean-variance framework, the loss from
trading strategies when the assets returns are more predictable. adapting the mean-variance framework to CRRA utility with ter-
In addition, the certainty equivalent losses associated when using minal wealth, and that from augmenting the suboptimal consump-
NTR-P and ROH-P generally increase as proportional transaction tion policy. When looking at the gap between the certainty equiv-
cost rates get higher. However, unlike the mean-variance frame- alent of upper bounds, compared to the model with zero-penalty,
work, the certainty equivalent return losses decrease as the in- the upper bound is significantly improved after considering the
vestor gets more averse of risk. This is intuitive taking into account gradient-based penalty in all cases.
that the investor prefers to trade less, and meanwhile to invest Similar to the model with mean-variance utility, a higher pro-
more into the risk-free asset as she gets more risk averse. More- portional transaction cost rate κ is always associated with a higher
over, note that the return losses of the proposed trading strate- certainty equivalent loss for both NTR-P and ROH-P. There are
gies under the mean-variance framework may differ from that un- mainly two reasons for this performance. First, the loss in mean-
der the power utility framework due to the nonadditivity of the variance utility increases with κ due to the high rate of decrease
adaptation process. Finally, the extra losses obtained when adapt- in turnover, and second, the procedure we have used to augment
ing the no-trade region policy to the power utility framework is the proposed policy to finance intermediate consumption requires
no more than 3.4% percent, which is in line with the conclusion a large amount of trading on risky assets, and thus large transac-
in Mei et al. (2016), who show a less than 0.5% loss for a model tion costs.
with i.i.d. returns. Moreover, we find the losses decrease as risk aversion pa-
Table A.5 shows the certainty equivalent consumptions and as- rameter γ increases. As it plays no role in the certainty equiv-
sociated losses after incorporating the approximate consumption alent loss of the proposed approximate strategies under the
X. Mei, F.J. Nogales / Journal of Banking and Finance 94 (2018) 131–151 145
Table A5
Certainty-equivalent consumption: CRRA utility of intermediate consumption. This table
shows the certainty-equivalent consumption of an investor with CRRA utility of intermedi-
ate consumption based on the heuristic policies for different bd , γ , and proportional trans-
action cost rate κ , which are given in the first three columns. Columns 4–7 show the cer-
tainty equivalent based on the simple policy(S-P), the linear policy(L-P), the no-trade region
policy(NTR-P) and the rolling optimize-and-hold policy(ROH-P), respectively. The last two
columns show the upper bounds to the certainty equivalent obtained based on the perfect
information relaxation with and without penalty function. In each row, the value of the CEQ
consumption and the corresponding gap with respect to the upper bound with penalty func-
tion (in %) is reported.
mean-variance framework, it is then intuitive that the optimal comparing them with the upper bound, which is typically greater
amount of rebalancing decreases given that a more risk averse than that of the optimal strategy.
investor chooses to trade less and put more weights in riskless Overall, our results show that the certainty equivalent losses as-
assets. sociated with the proposed feasible policies are quite acceptable,
Finally, we find that the simple policy performs the worst with the no-trade region policy outperforming the other approxi-
among the proposed heuristic policies, especially when there is mate strategies for most of the cases. In addition to the reasonable
more volatility in the predictive variables. On the other hand, the time needed to determine the proposed strategies, the approxi-
NTR-P beats the other policies in most of the cases, with the high- mate strategies based on the mean-variance framework allow us
est certainty equivalent loss being no more than 8.4%, followed by to deal with many risky assets simultaneously.
the ROH-P. The superior performance of NTR-P and ROH-P relies
on the fact that there is a no-trade region confining the invest-
4.3.3. The investment horizon and the number of assets
ment at each period, which is consistent with the spirit showed
To analyze the robustness and applicability of the proposed ap-
in Lynch and Tan (2010) and Muthuraman and Kumar (2006), who
proximate trading strategies against different parameter settings,
demonstrate that the optimal investment is constrained by a no-
we consider a more general case when the investor has a longer
trade region for an investor who maximizes CRRA utility of inter-
investment horizon and more risky assets to invest. We first en-
mediate consumption. However, in contrast to the structure of the
large the investment horizon to 12, 24 and 48 months. Then we
no-trade region in NTR-P and ROH-P, Lynch and Tan (2010) and
examine the certainty equivalents of the policies when there are
(Muthuraman and Kumar, 2006) show that the no-trade region
three risky assets, including the G&P type predictability model. To
may not be centered around an aim portfolio, which further ex-
illustrate the main insights, in all cases, we focus on the base-case
plain this part of the losses in both strategies. In addition, note that
scenario where an investor has a risk averse parameter γ = 5 and
for all the cases, the certainty equivalent losses are evaluated by
a proportional transaction cost rate of 50 basis points.
146 X. Mei, F.J. Nogales / Journal of Banking and Finance 94 (2018) 131–151
Table A6
Computing times required to evaluate approximate strategies and upper bounds. This table shows the
computing times(in seconds) required to evaluate the performance of proposed approximate trading
strategies and the associated upper bounds depending on different investment horizons. Specifically,
the first column specifies the horizons, columns 2–3 report the run time needed for frictionless mod-
els, columns 4–7 show the computing times of heuristic strategies and the last two column report
time required to obtain the upper bounds.
Table A7
Certainty equivalents with different investment horizons: policies and bounds. This ta-
ble reports the certainty-equivalent annualized wealth and consumption of the proposed
heuristic policies and upper bounds depending on different investment horizon, T. The
first column specifies the investment horizons. Columns 3–6 give the values of cer-
tainty equivalents of the simple policy(S-P), the linear policy(L-P), the no-trade region
policy(NTR-P) and the rolling optimize-and-hold policy(ROH-P), respectively. The last col-
umn reports that of the upper bound. In each row, the value of the CEQ and the corre-
sponding gap with respect to the upper bound with penalty function (in %) is reported.
Fig. A4. Certainty equivalents with the investment horizon. This figure shows the comparison between certainty equivalents of the no-trade region policy and that of the
upper and lower bounds (obtained based on frictionless model) depending on the investment horizon T. The left panel shows the certainty-equivalent annualized wealth of
the no-trade region policy against that of the lower and upper bounds. The right panel compares the certainty-equivalent consumption of the no-trade region policy with
the upper bounds obtained with zero penalty function (dark line) and gradient-based penalty function (red line).. (For interpretation of the references to color in this figure
legend, the reader is referred to the web version of this article.)
Table A.8 specifies the computational times required to evaluate losses both in terms of annualized terminal wealth and interme-
the performance of the proposed strategies and the corresponding diate consumption, especially for the linear policy and the no-trade
certainty equivalents. We find that there is a reasonable increase in region one. Taking into account that both the linear policy and the
computational time except for the linear policy and upper bounds. no-trade region one choose to trade partially to the aim portfolio
For the linear policy, it requires more time to calibrate the parame- at each period, the increase in losses can partly be explained by
ter λ when there are more predictors. For upper bounds, note that the presence of multiple predictors, which causes the aim portfo-
the additional predictor produces more discretization nodes in the lio deviates more from its true counterpart at each period. Finally,
approximation, implying more optimization problems to solve at the absolute loss of the rolling optimize-and-hold policy in termi-
each node. We also find a small increase in certainty equivalent nal wealth is less than 0.05$, and that in intermediate consump-
148 X. Mei, F.J. Nogales / Journal of Banking and Finance 94 (2018) 131–151
Table A8
Certainty equivalents for three-assets case: policies and bounds. This table reports the certainty-
equivalent annualized wealth and consumption of the proposed heuristic policies and upper
bounds when the number of risky assets N = 3. The first column specifies the predictabil-
ity model. Columns 3–6 give the values of performance measures of the simple policy(S-P),
the linear policy(L-P), the no-trade region policy(NTR-P) and the rolling optimize-and-hold
policy(ROH-P), respectively. The last column reports that of the upper bound.
tion is 0.006, suggesting a very small loss from considering such not only consumption at each period, but also the liquidation costs
policy. to support consumption.
5. Conclusion
4.3.4. Liquidation costs for consumption
Note that in previous analysis we evaluate the performance of We consider a multiperiod CRRA investor who faces transaction
the proposed trading and the consumption strategies in the ab- costs and has access to multiple risky assets in the presence of pre-
sence of liquidation costs. The reason is twofold. First, the pres- dictability. We propose some feasible trading strategies for the as-
ence of liquidation costs has little impact on the mean-variance sociated multiperiod portfolio selection problem with proportional
framework which we rely on for the heuristic trading strategies. transaction costs, and develop lower and upper bounds for the
Second, the consumption strategy cannot be solved efficiently for certainty-equivalent consumptions of these policies. In particular,
the model with liquidation costs when there are more than two we base our feasible strategies on the framework in Gârleanu and
risky assets and no transaction costs. However, transaction cost in Pedersen (2013) that allows us to obtain portfolio weights through
liquidating the risky assets to buy the consumption good may also a quadratic optimization problem. Our empirical analysis shows
have impact on the investor’s consumption strategy. that there is a very small mean-variance certainty equivalent loss,
To analyse such impact, we proceed in two steps. First, consider suggesting some of the proposed strategies are nearly optimal in
a given consumption strategy, i.e., the strategy derived from the the Gârleanu and Pedersen (2013) framework. With a power util-
frictionless model, and a heuristic trading strategy. For the 10 0 0 ity, we find that the certainty equivalent losses are also low and
trials, we assess the average certainty equivalent consumption of practical, and in addition, we can consider many risky assets given
the model with only transaction costs and that of the model with that the predictability model admits a discretization approxima-
both transaction costs and liquidation costs based on the given tion. For both cases, we have performed some comparative statics
consumption and trading strategy. Second, for a fixed trading strat- and robustness checks to better understand the losses associated
egy, we perform a “pseudo-dynamic programming” approach to when considering the proposed approximate strategies. Finally, we
calculate the fraction of consumption at each period for the model have shown how the upper bounds of the certainty equivalent
with and without liquidation costs, respectively. Note that the first consumption can be significantly improved using duality methods
step is to gauge the impact of liquidation costs on certainty equiv- based on a information relaxation.
alent returns, and the second step is assess the impact of liquida-
tion costs on consumption. To demonstrate the main insights, we Acknowledgement
consider the parameter setting as in the base-case scenario.
Panel A of Table A.9 shows how the liquidation costs of con- The authors gratefully acknowledge financial support from
sumption affect the certainty-equivalent consumption. For each the Spanish government through projects MTM2013- 44902-P
trading strategy, there is a slight difference between the certainty- and MTM2017-88979-P, and from the Chinese Fundamental Re-
equivalent consumption of the models before and after taking search Funds for the Central Universities #20720171057 and
into account liquidation costs. The small difference may be ex- #20720181004.
plained by the fact that deriving trading and consumption strate-
gies from the model without liquidation costs may result in Appendix A. Figures and Tables
over-consumption. To further confirm this conjecture, Panel B of
Table A.9 reports the average level of consumption at each period .
over 10 0 0 trials when different heuristic trading strategies are con-
sidered. Compared to the model that ignores liquidation costs for Appendix B. Derivation of heuristic policies
consumption, we find that the level of consumption at each pe-
riod is reduced slightly after directly taking into account the liq- Aim portfolio in linear policy
uidation costs in the model, regardless of the trading strategy. In Starting from the last period, we know the aim portfolio for an
other words, the decrease in the level of consumption may be in- investor is just Markowitz strategy
terpreted by the fact that liquidation costs decrease consumption,
since the gains from liquidating part of the portfolio need to fund MarkowitzT −1 = (γ )−1 μT −1 .
X. Mei, F.J. Nogales / Journal of Banking and Finance 94 (2018) 131–151 149
Table A9
Certainty Equivalents and Level of Consumption: with and without liquidation costs. This table
reports the certainty-equivalent consumption and level of consumption for different trading
strategies with and without liquidation costs for consumption respectively. Panel A reports
the Certainty Equivalents when we consider the frictionless-model consumption strategy to-
gether with the four heuristic trading strategy in the presence of transaction costs. Panel B
specifies the average level of consumption at each period when different trading strategies
are predetermined. All results are based on the same parameter setting as in base-case sce-
nario.
Given the model dynamics specified in (10) and (11), the condi- ≡ max ρ T −1 (xT −2 [μT −2 + ρμT −1→T −2 ]
tional mean for each period is μt = Ar + Br dt . The aim portfolio for
xT −1
γ (1 + ρ )
the second last period is
− xT −2 xT −2
2
aimT −2 = zMarkowitzT −2 + (1 − z )ET −2 (aimT −1 )
−ρ T −2 K (xT −1 − xT −2 )1 . (B.6)
= zMarkowitzT −2 + (1 − z )ET −2 (MarkowitzT −1 )
In Mei et al. (2016), they show problem (B.6) is equivalent to the
1 z
= −1 (Ar + Br bd dT −2 ) + −1 Br dT −2 (1 − bd ). (B.1) following constrained optimization problem
γ γ
min (xT −2 − xT −3 ) (xT −2 − xT −3 )
Analogously, the aim portfolio for period t can be derived as fol- xT −2
lows 1
s.t. K −1 (xT −2 − xCT −2 )∞ ≤ ,
aimt = zMarkowitzt + (1 − z )Et (aimt+1 ) ργT −2
= (γ ) −1
Ar + Br ad gt (bd ) + bTd −1−t dt where xCT −2 = γ 1 −1 μT −2 , γT −2 = (1 + ρ )γ and μT −2 = μT −2 +
T −2
(1 − ρ )μT −1|T −2 . Analogously, assume trading only occurs in pe-
+z(γ )−1 Br [dt ft (bd ) − ad ht (bd )], (B.2)
riod t, then each period the investor selects her portfolio by solving
where ft (bd ) is a polynomial of bd changing along period with ex- optimization problem (18)-(19).
pression
Appendix C. Derivation of penalty function
ft (bd ) = (1 − z )bd ft+1 (bd ) + 1 − bTd −1−t (B.3)
A brief review about duality based on information relaxation
−t−2
T
has been made in Haugh and Wang (2014). This approach has been
ht (bd ) = bid + bd (1 − z )ht+1 (bd ) (B.4) explained in detail in Brown et al. (2010) as well as in Brown and
i=0
Smith (2014). The following we are going to derive the expression
gt (bd ) = bTd −t−2 + gt+1 (bd ) (B.5) for penalty function based on approximate reward functions (33).
Given y˜t∗ to be the optimal solution to the problem without trans-
and fT −1 = 0, hT −1 = 0, gT −1 = 0. action costs, for each period the derivative for reward function is
Derivation of rolling optimize-and-optimize policy
∂ I˜t ∂ I˜t ∂ I˜t ∂ I˜t
Starting with the period before the last t = T − 2, and assuming ∇ I˜t (y˜t∗ ) = ( , ,··· , , )| ∗ , (C.1)
the investor does not trade at t = T − 1 (which means xT −1 = xT −2 ), ∂ c1 ∂ x1 ∂ ct ∂ xt y=y˜t
the value function for the last period is where
γ ∂ I˜t 1 −γ 1−γ R p, j+1
VT∗−1 (xT −2 , μT −1 ) = ρ T (x
T −2 μT −1 − T −2 xT −2 ).
x = −ρ t ct Wt , for j = 1, · · · , t − 1
2 ∂cj W j+1
The optimal strategy at t = T − 2 is the solution to the following ∂ I˜t 1 −γ 1 −γ
(1 − c j )Rej+1
problem = ρ t ct Wt , for j = 1, · · · , t − 1
∂xj W j+1
γ
VT −2 (xT −3 , μT −2 ) = max ρ T −1 (xT −2 μT −2 − T −2 xT −2 )
x ∂ I˜t −γ −γ
xT −2 2 = ρ t ct Wt−1 Wt
∂ ct
−ρ T −2 K (xT −2 − xT −3 )1
∂ I˜t
+ET −2 [VT∗−1 (μT −1 , xT −2 )] =0 (C.2)
∂ xt
150 X. Mei, F.J. Nogales / Journal of Banking and Finance 94 (2018) 131–151
Table D.10
Sample Statistics, VAR coefficients and Quadrature Approximation: High and Low Book-to-Market Portfolios.This table repro-
duces the quadrature method based on the estimated VAR model given in (10) and (11). The information indicated under the
table ‘DATA’ is the estimation from the sample, while the values under the table ‘QUADRATURE’ are the results obtained from
quadrature approximations. It reports the moments and parameters for the high and low book-to-market portfolios adapted
from Lynch and Tan (2010) and calculated for the quadrature approximation based on the VAR model that uses log dividend
yield as the only state variable. High and low book-to-market portfolios are denoted as High BM and Low BM respectively.
Panel A reports unconditional means, VAR slopes and the corresponding coefficients of determination for the data and the
quadrature approximation. Panel B reported the unconditional covariance matrix for the data and for the quadrature ap-
proximation. All results are for continuously compounded returns. Returns are expressed per month and in percentages. The
coefficients of determination are expressed in percentages.
DATA QUADRATURE
2
Asset/Variable Uncond. Mean Slope R Asset/Variable Uncond. Mean Slope R2
with Rte the risky asset returns excess of risk-free asset. When we and the covariance matrix for log returns at all grid points of
solve problem (35) for each sample path, y˜t∗ is computed through the predictive variable as well as the unconditional volatilities
dynamic programming by discretizing the state variable space. Tak- of the predictive variables. We choose 19 grid points for the
ing into account the fact that y˜t∗ does not involve transaction costs, dividend yield and 3 grid points for each of the stock-return
it can be solved in short time. innovations since Balduzzi and Lynch (1999) find that the resulting
approximation is able to capture important dimensions of the
Appendix D. Approximate Consumption for the Model with return predictability in the data. In Table D.10, we reproduce the
Transaction Costs quadrature method based on the estimated VAR model given
in Lynch (2001). In this table, all the results are for continuously
When we adapt the heuristic policies to power utility frame- compounded returns. Returns are expressed per month and in
work, we let the consumption for each period to be the one for percentages. The coefficients of determination are expressed in
the model without transaction costs. We discretize first the state percentages.
variable space in order to implement dynamic programming pro- Besides, the Bellman equation corresponding to prob-
cedure. lem (4) with utility function (5), without transaction costs
The VAR model is discretized using a variation of Tauchen and is
Hussey (1991) Gaussian quadrature method, which has been de-
scribed in Balduzzi and Lynch (1999). The variation is designed to
Vt (dt )
ensure that d is the only state variable to predict the risky assets
1−γ
returns. Specifically, for an VAR specified in (8)-(9), it assumes that c 1 −γ
any return predictability is fully captured by dt . Without specify- (1 − ct )1−γ
1 −γ
= max t
+ρ E xt Rt+1 + (1 − xt e )R f Vt+1 (dt+1 ) .
ct ,xt 1−γ 1−γ
ing that the residual terms in (8) and (9) are uncorrelated, this
VAR model also implies that the following expression for stock re- (D.2)
turns:
Note that the above equation does not depend on previous stage
rt+1 = Ar + Br dt + ηt+1 + ut+1 (D.1)
position. Starting from the last period, it costs no effort to solve
where η is N × 1 vector of coefficients from a regression of et+1 on for the optimal consumption of each period at each node point by
t+1 , and ut+1 is a i.i.d. disturbance vector with zero mean and co- backward iteration. In numerical experiments, the corresponding
variance matrix u , and is uncorrelated with t+1 . With this speci- consumption which is not on the grid can be computed by linear
fication, we can first discretize dt using a variation of the Gaussian interpolation.
quadrature method described by Tauchen and Hussey (1991) for a
first-order autoregressive process. Then, we can calculate a discrete References
distribution for rt+1 for each {dt+1 , dt } pair from the transforma-
tion of dt , since (D.1) implies Akian, M., Menaldi, J.L., Sulem, A., 1996. On an investment-consumption model with
transaction costs. SIAM J. Control Optim. 34 (1), 329–364.
rt+1 = Ar + Br dt + η (dt+1 − ad − bd dt ) + ut+1 , Balduzzi, P., Lynch, A.W., 1999. Transaction costs and predictability: some utility cost
calculations. J. Financ. Econ. 52 (1), 47–78.
given the model followed by dt in (9). Follow- Brown, D.B., Smith, J.E., 2011. Dynamic portfolio optimization with transaction costs:
heuristics and dual bounds. Manage Sci. 57 (10), 1752–1770.
ing Lynch (2001) and Lynch and Tan (2010), the discretization Brown, D.B., Smith, J.E., 2014. Information relaxations, duality, and convex stochastic
is implemented so as to match both the conditional mean vector dynamic programs. Oper. Res. 62 (6), 1394–1415.
X. Mei, F.J. Nogales / Journal of Banking and Finance 94 (2018) 131–151 151
Brown, D.B., Smith, J.E., Sun, P., 2010. Information relaxations and duality in stochas- Lynch, A.W., 2001. Portfolio choice and equity characteristics: characterizing the
tic dynamic programs. Oper. Res. 58 (4-part-1), 785–801. hedging demands induced by return predictability. J. Financ. Econ. 62 (1),
Calafiore, G.C., 2009. An affine control method for optimal dynamic asset allocation 67–130.
with transaction costs. SIAM J. Control Optim. 48 (4), 2254–2274. Lynch, A.W., Balduzzi, P., 20 0 0. Predictability and transaction costs: the impact on
Campbell, J.Y., Viceira, L.M., 1999. Consumption and portfolio decisions when ex- rebalancing rules and behavior. J. Finance 55 (5), 2285–2309.
pected returns are time varying. Q. J. Econ. 114 (2), 433–495. doi:10.1162/ Lynch, A.W., Tan, S., 2010. Multiple risky assets, transaction costs, and return pre-
003355399556043. dictability: allocation rules and implications for u.s. investors. J. Financ. Qunati-
Campbell, J.Y., Viceira, L.M., 2003. Strategic asset allocation: Portfolio choice for tative Anal. 1015–1053.
long-term investors. Oxford University Press. Magill, M.J., Constantinides, G.M., 1976. Portfolio selection with transactions costs. J.
Constantinides, G.M., 1979. Multiperiod consumption and investment behavior with Econ. Theory 13 (2), 245–263.
convex transactions costs. Manage Sci. 25 (11), 1127–1137. Mei, X., DeMiguel, V., Nogales, F.J., 2016. Multiperiod portfolio optimization with
Constantinides, G.M., 1986. Capital market equilibrium with transaction costs. J. multiple risky assets and general transaction costs. J. Banking Finance 69, 108–
Polit. Econ. 842–862. 120. https://doi.org/10.1016/j.jbankfin.2016.04.002.
Davis, J.L., Fama, E.F., French, K.R., 20 0 0. Characteristics, covariances, and average Merton, R.C., 1971. Optimum consumption and portfolio rules in a continuous-time
returns: 1929 to 1997. J. Finance 55 (1), 389–406. model. J. Econ. Theory 3 (4), 373–413.
Davis, M.H., Norman, A.R., 1990. Portfolio selection with transaction costs. Math. Moallemi, C.C., Sağlam, M., 2017. Dynamic portfolio choice with linear rebalancing
Oper. Res. 15 (4), 676–713. rules. J. Financ. Quantitative Anal. 52 (3), 1247–1278.
DeMiguel, V., Garlappi, L., Uppal, R., 2007. Optimal versus naive diversification: how Muthuraman, K., 2007. A computational scheme for optimal investment–consump-
inefficient is the 1/n portfolio strategy? Rev. Financ. Stud. 22 (5), 1915–1953. tion with proportional transaction costs. J. Econ. Dyn.Control 31 (4), 1132–1159.
Fama, E.F., French, K.R., 1993. Common risk factors in the returns on stocks and Muthuraman, K., Kumar, S., 2006. Multidimensional portfolio optimization with pro-
bonds. J. Financ. Econ. 33 (1), 3–56. portional transaction costs. Math. Fin. 16 (2), 301–335.
Gârleanu, N., Pedersen, L.H., 2013. Dynamic trading with predictable returns and Sefton, J. A., Champonnois, S., 2015. Managing transaction costs in a dynamic trad-
transaction costs. J. Finance 68 (6), 2309–2340. ing strategy.
Haugh, M., Iyengar, G., Wang, C., 2016. Tax-aware dynamic asset allocation. Oper. Skaf, J., Boyd, S.P., 2010. Design of affine controllers via convex optimization. IEEE
Res. 64 (4), 849–866. Trans. Automat. Contr. 55 (11), 2476–2487.
Haugh, M., Wang, C., 2014. Dynamic portfolio execution and information relaxations. Tauchen, G., Hussey, R., 1991. Quadrature-based methods for obtaining approximate
SIAM J. Financ. Math. 5 (1), 316–359. solutions to nonlinear asset pricing models. Econometrica 371–396.
Liu, H., 2004. Optimal consumption and investment with transaction costs and mul-
tiple risky assets. J. Finance 59 (1), 289–338.