Professional Documents
Culture Documents
Abstract. We assess empirically the intertemporal hedging for assets with mo-
mentum (we term it “intertemporal momentum” or IM). Consistent with the dy-
namic portfolio theory, we show that (1) IM significantly forecasts stock returns
at both market level and firm level over long horizons and complements standard
myopic momentum (MM); (2) IM strategies produce returns with a slightly lower
mean (due to the cost of hedging), much lower volatility, higher skewness (due to
the heavy penalty for very negative returns), and hence much higher Sharpe ratio
(due to the investment target), comparing with MM; (3) because our IM strategies
and static augmented MM strategies manage different risks, a simple combination
of them not only generates low volatility as our intertemporal strategies but also
high mean as static strategies, more than quadrupling Sharpe ratios of MM; (4)
the strong performance of our strategies over long investment horizons reflects the
fact that momentum depends heavily on horizons.
1. Introduction
Momentum is one of the most prominent financial market phenomena and has
been extensively documented for a wide variety of assets.1 Most studies on momen-
tum rely on the myopic strategy that explores momentum alone and is silent on
the intertemporal hedging of Merton (1971). In this paper, we empirically study
hedging momentum that helps multiperiod investors to manage reinvestment (or
long-term) risks associated with momentum. The reinvestment risks significantly
affect momentum strategies that exhibit heavy horizon dependence resulting from
their inherent path dependence.
We base our empirical analysis on the optimal dynamic momentum portfolio the-
ory developed by Li and Liu (2018a) (LL). More specifically, LL demonstrate that a
multiperiod investor needs to explore a new momentum variable, in addition to the
standard myopic momentum (MM). The new variable provides a hedge for reinvest-
ment risk and we term it intertemporal momentum (IM). It is a weighted average
of historical returns over the look-back period with more recent returns receiving
higher weights, reflecting the fact that more recent returns predict more future re-
turns in a finite horizon for an asset with MM. The optimal dynamic momentum
portfolio reduces to the standard MM strategy, or mean-variance strategy, when
investment horizon is one time period. The momentum model of LL that specifies
instantaneous return of stocks implies that stock return over long horizons can be
forecasted by a variable very close to IM. We show that IM predicts stock returns
well at both market level and firm level and contains complementary information to
MM when forecasting long-run future returns.
A major issue affects the empirical assessment of optimal portfolio theories is
estimation risk. The literature shows that the out-of-sample (OOS) performance of
the optimal portfolio is severely affected by the estimation errors. On the one hand,
it is widely documented that the mean-variance framework of Markowitz (1952)
has very poor OOS performance due to estimation errors.2 On the other hand, Lan
(2015) shows that the estimation risks can even cause the optimal dynamic portfolio
strategies of Merton (1971) to underperform the myopic strategies on OOS.3 We
1Jegadeesh and Titman (1993) document momentum for individual U.S. stocks. This evidence
has been extended to stocks in other countries (Rouwenhorst, 1998), industry portfolios (Moskowitz
and Grinblatt, 1999), country indices (Asness, Liew and Stevens, 1997), currencies (Okunev and
White, 2003), commodities (Gorton, Hayashi and Rouwenhorst, 2013), corporate bonds (Jostova,
Nikolova, Philipov and Stahel, 2013), and exchange traded futures contracts (Moskowitz, Ooi and
Pedersen, 2012; Asness, Moskowitz and Pedersen, 2013).
2E.g., Jobson and Korkie (1980), Michaud (1989), Kan and Zhou (2007), DeMiguel, Garlappi
portfolios. For example, the nonparametric approach of Brandt (1999) and the semiparametric
approach of Aı̈t-Sahalia and Brandt (2001) directly study the dependence of portfolio weights on
state variables to deal with the estimation risks in implementing the optimal dynamic strategies.
However, these approaches are difficult to implement in a setting of many stocks, which is the case
studied in our paper.
decreases from about 5 years in 1980 to 5 months in 2009. Therefore, the multiperiod
investment problems are more important in practice than the one with a one period
horizon. Short investment horizon also suffers higher transaction costs and price
impact, especially for institutional traders.
Our paper is related to Da, Gurun and Warachka (2014) who show that exploring
historical price paths can significantly improve the standard momentum strategies
because investors are less attentive to “continuous information” than to “discrete
information” that is reflected in the price paths. Cooper, Gutierrez and Hameed
(2004) also document that momentum profitability depends on market states mea-
sured by historical market returns. Different from these studies, we explore the
historical price paths via IM by following the optimal momentum portfolio theory
of LL that is derived based merely on the assumption that stocks exhibit MM.
Our IM variable places different weights on different historical returns in the look-
back period, featuring both path dependence and horizon dependence of momentum.
Our paper differs from Novy-Marx (2012) who shows that different past returns in
the look-back period have different predictive ability for future returns. First, our
strategies are based on the empirical regularities of MM that the past 12-month
return is a positive predictor of next month return, rather than the “echo” effect
documented in Novy-Marx (2012). The weights on historical returns in our paper
are optimal for a mutiperiod momentum investor. Second, our strategies capture the
intertemporal hedging for a mutiperiod investment problem; however, Novy-Marx
(2012) studies a single-period investment problem.
The remainder of paper is organized as follows. Section 2 illustrates the intuition
of hedging momentum. Section 3 discusses hedging momentum by summarizing
the optimal dynamic momentum portfolio theory of LL and studies the return pre-
dictability by IM. Section 4 develops intertemporal momentum strategies to explore
the intertemporal effect of momentum, Section 5 compares IM with MM, Section 6
studies dynamic rebalancing strategies that dynamically rebalance portfolio weights
over the investment horizon, Section 7 provides further analysis, and Section 8 con-
cludes. Appendices include predictive regressions at market level and an optimal
dynamic momentum portfolio theory for the cross-section of stock returns.
equal forecasting power for next moenth return. Similarly, the future return at
month 2 can be forecasted by returns at months {−10, −9, · · · , 0, 1}; · · · ; and the
future return at month 12 can be forecasted by returns at months {0, 1, · · · , 10, 11}.
Notice that the more recent historical returns can forecast more future returns. For
example, the return at month -11 (a distant historical return) can be used to forecast
the future return only at month 1, while the return at month 0 (a recent historical
return) is useful for forecasting all future returns over the 12-month horizon. As
a result, more recent historical returns play more important roles when forecasting
future returns over long horizons. Fig. 1 illustrates the returns used to forecast
future returns.
Therefore, a multiperiod investor should use a new momentum variable that relies
more heavily on more recent historical returns to manage the reinvestment risk. In
other words, the intertemporal hedging demand of Merton (1971) should depend on
such a new variable. This is the intuition behind our development of new momentum
variables that hedge changes in future investment opportunities over long horizons.
The illustration above is based on a single asset. We will formalize the intuition
using the theory of LL in next section, and demonstrate that the results can be
simply extended to the cross-section of stock returns in Appendix B.
on T given by5
ω̂v = ωv /ω̄, (3.2)
where
(v − t + τ )/τ 2 , v ∈ [t − τ, t − τ + T ], Z t
ωv = and ω̄ = ωv dv, (3.3)
T /τ 2 , v ∈ [t − τ + T, t], t−τ
Figure 2 also compares the weights placed on historical returns by the myopic mo-
mentum mt (the left panel) and intertemporal momentum mH 12,t (the middle panel).
It is worth noting that both variables are independent of other parameters in
the momentum model of LL, such as volatility, unconditional expected return, and
the coefficient of momentum. Further, it follows from (3.3) that different slopes of
the linear function over v ∈ [t − τ, t − τ + T ] (the decaying rate of the weights on
past returns) only change mH T,t multiplicatively, and hence they do not affect either
the predictive regressions or the zero-investment long-short portfolios to be studied
below. We standardize the sum of the weights ω̂ to 1 by scaling by ω̄. Therefore,
the new variable IM (mH 12,t ) does not suffer estimation risk.
5Corollary3.5 of LL shows that in a limiting case with large CRRA coefficient γ the weight
ωv of historical return dSv /Sv in the hedging demand is given by (3.3). This equation provides
a very close approximation for the general case. As shown in Section 3 of LL, the weight ωv is
approximately a linear function of v for v ∈ [t − τ, t − τ + T ] and is a constant for v ∈ [t − τ + T, t].
6We refer readers to LL for more discussions on the economics of the weights.
section, we use IM alone to sort stocks and form the long-short portfolios. We will
explore IM and MM jointly later in Section 5. There are three major reasons for using
IM alone. First, this variable reflects the key feature of the optimal portfolio weights
of LL that relies more heavily on recent versus distant historical returns. This
variable explores not only momentum but also intertemporal hedging. In fact, IM is
highly correlated with MM (with correlation of 0.91 at the market level and 0.74 at
the firm level), and hence it captures both momentum and intertemporal hedging.
Exploring hedging simultaneously exploits momentum as well. Second, mH 12,t can
reflect the average effect of dynamic portfolio rebalancing in the sense that the
weights it placed on past returns are proportional to the sum of all IMs with horizon
varying from 0 to its upper limit τ . To further capture the dynamic portfolio choice
effects, we also examine portfolio returns over the entire investment horizons. Third,
as shown in Table 1, IM can better forecast future returns over long horizons than
does MM; hence, we expect that the intertemporal momentum strategies involving
IM outperform the standard myopic momentum strategies involving MM. Section
5 further shows that the single sort studied is sufficiently to explore both MM and
IM.
Our portfolio differs form the standard momentum portfolio in Jegadeesh and
Titman (1993) in two ways. First, we use the IM mH i,T,t rather than the standard
momentum mi,t to sort the stocks. Second, we study the portfolio returns over its
entire investment horizon rather than studying the average returns across all “active”
portfolios at each month, in order to reflect the intertemporal hedging.7 Especially,
when investment horizon is one time period (i.e., one month in our paper), our
portfolio reduces to the standard myopic momentum strategy, or mean-variance
strategy.
Throughout this paper, we construct intertemporal momentum strategies with
different horizons using the same IM variable mH 12,t by choosing T = 12 months
in (3.1). In fact, all IM variables with different horizons can capture the common
feature of optimal portfolio weights of LL: putting more weights on more recent
7Economically, the dynamic portfolio is not optimal until the investment horizon. This is also
consistent with the practice of the fund industry. Many funds charge high fees when clients
withdraw money from the funds before maturity. Econometrically, Jegadeesh and Titman (1993)
report portfolio returns with overlapping holding periods in order to increase the power of their
tests. We report the arithmetic average return over the horizon to be comparable with the literature
of standard momentum that reports the arithmetic average return across all “active” portfolios.
In untabulated results, we find that alternatively reporting cumulative returns over the horizon
leads to higher mean than the sum of simple returns but does not significantly alter volatility.
historical returns. This choice facilitates the comparisons among different portfo-
lios.8 Also notice that, like the standard momentum variable, mH 12,t involves only
one parameter, the look-back period τ .
Table 2 reports the profits from the single-sorted intertemporal momentum strate-
gies. For comparison, we also report the profits for the standard myopic momentum
strategy that explores momentum alone and reflects the underlying model assump-
tion that MM predicts linearly and positively the next month return. We will study
later the standard momentum strategies with longer holding periods that also par-
tially reflect hedging (even not in an optimal way).
Several observations follow Table 2. First, the intertemporal momentum strategies
generate large Sharpe ratios for multiple holding periods, consistent with the optimal
dynamic portfolio theories showing that maximizing the expected CRRA utility is
equivalent to maximizing sum of squared Sharpe ratios.9 The annualized Sharpe
ratios of the intertemporal momentum strategies with holding periods of six to
twelve months are over 0.60, double Sharpe ratio of myopic momentum strategy
that explores momentum alone.
Second, there is a low cost of hedging in the sense that exploring hedging slightly
decreases portfolio return. The factor-risk-adjusted returns for the intertemporal
momentum strategies are comparable to that for the myopic momentum strategy.10
Although both the alphas and Sharpe ratios are considered as risk-adjusted returns,
the former is adjusted by the risks characterized by factors (i.e., cross firms) and the
latter features IM that explores reinvestment risk (i.e., cross time). So our results
complement current literature on cross-sectional strategies that mainly explore static
factors rather than dynamic effect.
Our results show that hedging is not to time market (hence not significantly
improves mean), but it works as momentum insurance so as to significantly reduce
return volatility and maximize Sharpe ratios. Further, the first series correlation
of returns and squared returns are 0.77 and 0.76 respectively for our intertemporal
momentum with 12-month horizon, comparing with 0.03 and 0.16 respectively for
myopic momentum with 12-month holding period. This shows that, by effectively
exploring hedging, our intertemporal strategy avoids both large losses and large
profits, leading to return persistence. This is consistent with the dynamic portfolio
theory which shows that the optimal strategy smooths portfolio returns and hence
8Weexpect that it would generate better portfolio performance to use the horizon-adjusted IM
variables for holding periods other than 12 months given that these variables allows extra flexibility.
9LL show that the value function for the optimal dynamic momentum strategies depends only
on the Sharpe ratios of the portfolio returns when the investor has CRRA utility. Under certain
probability measure, the value function equals the expected exponential function of the sum of
squared Sharpe ratios over the investment horizon.
10The US Fama-French three and five factors are downloaded from Ken French Data Library.
reduces volatility. Further, Hendricks, Patel and Zeckhauser (1993) show that there
is short-run persistence in mutual fund performance, which is probably related to
the long average holding period of stocks for institutional traders (varying from 5
months to 7 years), e.g., Sirri and Tufano (1998) and Bogle (2000).
Third, exploring intertemporal hedging slightly increases return skewness, even
though the skewness is still negative. According to the optimal portfolio theory, the
CRRA utility as used by LL gives heavy penalty for strongly negative returns in
very bad state while the utility gain at a good state is relative smaller. Therefore,
our intertemporal strategies tend to prevent big losses and hence increase return
skewness.
Fourth, our intertemporal momentum strategies do not revere in the long-run. In
contrast, the myopic momentum strategies are widely documented to yield negative
returns for long holding periods (over one years).11 This result is intuitive: the
profits for long holding periods are due to intertemporal hedging. Ideally, an optimal
dynamic momentum strategy always yields positive risk-adjusted expected returns
for any investment horizons. The IM with 12-month investment horizon used to
construct prtfolios reflects the common feature of IMs with different horizons and
explores intertemporal hedging, generating positive returns over long horizons.
In all, our intertemporal momentum strategies emphasize the importance of both
intertemporal hedging and investment horizons, on which the myopic strategies are
silent.
where we do not normalize the sum of weights to one because normalization does not
affect the long-short portfolios. The weights are also illustrated in the right panel
of Figure 2. Like MM and IM, we exclude the last month return prior to portfolio
formation for IM⊥ . IM⊥ is “orthogonal” to MM in the sense that their correlation is
only 0.007 at the firm level. Because the optimal momentum portfolio of LL depends
linearly on MM and IM, it also depends linearly on MM and IM⊥ . Also IM⊥ does
not affect the significance of the cross-sectional regressions (3.6)-(3.7). Therefore,
we can explore both momentum and intertemporal hedging by alternatively using
MM and IM⊥ . Interestingly, IM⊥ is opposite to the echo effect of Novy-Marx (2012)
that weighs more on returns 12 to seven months prior to portfolio formation and
weighs less on more recent returns.
To explore the joint effect of both momentum, we form double-sorted portfolios
sequentially that first condition on MM, then IM⊥ . Specifically, we sort stocks
into quintiles according to their MM and then subdivide these quintiles into IM⊥
subportfolios (into 2 groups).12 Post-formation returns of longing the stocks with
highest MM and IM⊥ and shorting the stocks with lowest MM and IM⊥ over the next
three months to five years are then computed. Again, we study portfolio returns
over its entire horizon.
Table 5 Panel A reports the double-sorted portfolio returns. The double-sort
portfolios have very similar performance to the corresponding single-sort portfolios
documented in Section 4.1.13 As a robustness test, Panel B reports the momentum
profits from independent double-sorts on MM and IM⊥ . These results suggest that
IM in the single sort in Section 4.1 can explore both momentum and hedging at
the same time, and the two variables in the double sorts explores momentum and
hedging respectively.
In general, double sorts are employed to control/mitigate the effect of the single-
sort variable on the other variable. Because the first-sort variable is orthogonal to
the second-sort variable in our case (with correlation of only 0.007), the single sort
studied in Section 4.1 is therefore sufficiently to explore both MM and IM. Further,
both the double sorts and the standard momentum strategies involve MM; thus, the
12In
untabulated results, we find very similar results when we sort stocks into quintiles according
to their MM and then subdivide these quintiles into 5 groups of IM⊥ subportfolios, or sort stocks
into two groups according to MM and then subdivide them into 5 groups of IM⊥ subportfolios.
Similarly, we find that our single-sort results also do not alter when we sort stocks into 25 groups.
13We also examine our double-sort strategy adjusted by volatility using (7.1)-(7.2), where r
IM
is the double-sort momentum return. Table 3 Panel B reports the results that are also similar to
the results for the augmented single-sort portfolios studied in Section 7.1.
where ORl and ORs are the overlapping ratios for the long legs and short legs
respectively and M V OL is the monthly market volatility. They show that the ratio
for the long legs decreases with market volatility but the ratio for the short legs
increases with market volatility. Therefore, as market volatility increases, the two
strategies tend to short the same stocks but long very different stocks.
Similarly, the following regressions also show the different patterns in both legs:
ORtl = 0.28 − 0.0063 × SEN Tt + SEN
t
T,l
, ORts = 0.28 + 0.0162 × SEN Tt + SEN
t
T,s
,
(−1.13) (2.54)
ORtl = 0.27 + 0.0130 × ST ATt + ST
t
AT,l
, ORts = 0.27 − 0.0216 × ST ATt + ST
t
AT,s
,
(1.67) (−3.10)
where SEN T is the investor sentiment of Baker and Wurgler (2006) and ST AT is
the market state defined as past three-year market return by following Cooper et al.
(2004).
M Mi + ωj IMi⊥ ,
where MMi and IM⊥i represent the standard momentum and adjusted intertemporal
momentum for firm i respectively and ωj is the assigned number to quintile j (j =
1, 2, 3, 4, 5), to which stock i belongs. We form a portfolio by longing the top decile
stocks with the highest adjusted momentum and shorting the bottom decile stocks.
This reflects the fact that the relative weight of hedging demand to myopic demand in
the optimal portfolio depends on the level of momentum. This portfolio’s monthly
return is realized at month t + 1. At month t + 1 we do not use the portfolio
constructed at month t but construct a new portfolio using the same method at
month t except that ω = (ω1 , · · · , ω5 ) is replaced by ω ∗ 11/12. This portfolio’s
monthly return is realized at month t + 2. Similarly, at time t + 2 we construct a
new portfolio with ω ∗ 10/12; · · · ; and at month t + 11 we construct a new portfolio
with ω ∗ 1/12 realizing return at month t + 12. Therefore, over the holding period
of 12 months, we have 12 monthly portfolio returns. The average of the 12 returns
is just the return to the strategy starts at month t. Similarly, we have one strategy
starting at each month and all these strategies lead to a time series of returns. The
rebalancing strategy features the dynamic portfolio choice effect in the sense that the
portfolio is continuously rebalanced over the investment horizon. As time increases
approaching the investment horizon, the portfolio weights converge to the myopic
strategy (e.g., Merton, 1973).
Table 7 reports the returns to the rebalancing strategies. We first study four ω’s
with equal weights. This is equivalent to assume that the relative level of hedging
demand to the myopic demand is independent of firms’ momentum level. Several
observations follows from the comparison with the results for sequential double-sorts
involving MM and IM⊥ with 12-month horizon reported in Table 5 Panel A. First,
the rebalancing strategies double the mean, slightly reduce the volatility, and hence
more than double the Sharpe ratio. Second, rebalance also significantly increases
skewness. The skewness becomes higher for higher weight ω. The skewness increases
from -1.92 for the double-sort strategy to 1.43 for the rebalancing strategy with
ω = (1, 1, 1, 1, 1). The results are robust to different weighting schemes.
We also choose a weighting scheme ω = (0.5, 0.4, 0.3, 0.2, 0.1). That is, we assign
more weight to groups with higher momentum. Table 7 shows that rebalance further
improves the Sharpe ratios. The Sharpe ratio becomes 2.01 and the skewness is -
0.00.
Panels B, C, and D report the risk-adjusted returns according to the three-factor
model of Fama and French (1993), the five-factor model of Fama and French (2015),
and the myopic momentum (UMD). The rebalancing strategies not only generate
significantly positive alpha according to the five-factor model of Fama and French
(2015), but also are immune to the five factors. They not only generate much lower
return volatility than the myopic momentum strategy, but also lead to significantly
positive adjusted returns relative to the myopic momentum measured by UMD.
7. Further Analysis
7.1. Augmented Intertemporal Momentum Strategies. Our intertemporal
momentum strategies manage the risk in the long-run (i.e., cross time), rather than
the “static” risk studied in most of the literature (i.e., cross firms), e.g., factor mod-
els. So our results complement current literature on cross-sectional strategies that
only explore static factors rather than dynamic effect. This further implies that
a combination of our strategies and existing static strategies should significantly
improve the portfolio performance because they manage different risks.
Recently, Daniel and Moskowitz (2016) document momentum crashes that make
the momentum strategies less appealing to investors. An important development
of recent momentum literature is to eliminate the momentum crashes e.g., Barroso
and Santa-Clara (2015), Daniel and Moskowitz (2016), Gulen and Petkova (2018),
Daniel, Jagannathan and Kim (2019), and Huang, Zhang, Zhou and Zhu (2019),
among others. We do not study the combination of our strategies with each of
these augmented momentum strategies. Instead, we focus on the volatility-scaled
strategy developed in Barroso and Santa-Clara (2015) who show that the risk of
myopic momentum is highly predictable and simply scaling the long-short portfolio
by its realized volatility can eliminate momentum crashes.
Following Barroso and Santa-Clara (2015), we study volatility-scaled intertempo-
ral momentum strategies. Let {rIM,t }Tt=1 be the monthly returns of intertemporal
momentum, and {rIM,d }D T
d=1 and {dt }t=1 be the daily returns and the time series
of the dates of the last trading sessions of each month respectively. The variance
forecast of the intertemporal momentum strategy at month t is given by
125
X
2
σ̂IM,t = 21 rIM,dt−1 −j /126, (7.1)
j=0
by using daily returns in the previous six months. We use the forecasted variance
to scale the returns:
σtarget
rIM ∗ ,t = 2 rIM,,t , (7.2)
σ̂IM,t
where rIM,,t is our (unscaled) intertemporal momentum, rIM ∗ ,t is the scaled in-
tertemporal momentum, and σtarget is a constant corresponding to the target level
of volatility. We use a target corresponding to an annualized volatility of 12%.
The above volatility-scaled method follows exactly Barroso and Santa-Clara (2015)
except that we replace the standard momentum by our intertemporal momentum.
We first study the single-sort portfolio. That is, rIM in (7.1)-(7.2) is the single-sort
intertemporal momentum return documented in Subsection 4.1. Table 3 Panel A
reports the results. First, scaling volatility further increases the skewness of portfolio
returns. For example, the skewness increases from -3.35 for the standard momentum
to -1.64 for our unscaled intertemporal strategy with 12-month horizon, and further
increases to 0.30 after scaling volatility. The skewness becomes even higher for large
horizons. Therefore, the augmented strategies can significantly alleviate momentum
crashes, consistent with Barroso and Santa-Clara (2015). However, our strategies
generate much higher skewness than Barroso and Santa-Clara (2015)’s volatility-
scaled strategy that leads to return skewness of -0.42.
Second, the combined strategies realize much higher returns. For example, the
mean increases from 0.61% for unscaled intertemporal strategy with 12-month hori-
zon to 3.43% for the scaled strategy. Although scaling volatility also increases return
volatility, the volatilities are still lower than that of the standard myopic strategies.
As a result, the combined strategies generate even higher Sharpe ratios, more than
quadrupling the Sharpe ratio of the standard momentum and double the Sharpe
ratio of the volatility-scaled strategy in Barroso and Santa-Clara (2015).
Therefore, the combined strategies feature both hedging that generates large
Sharpe ratios and “static” risk managing that increases mean and skewness. The
results further confirm that our intertemporal momentum strategies and the static
augmented momentum strategies explore very different risks, and hence they com-
plement each other.
alpha for the strategy over the 12-month horizon is 8.40 that is much higher than
the t-statistic of alpha for the strategy return in each month. Therefore, the port-
folio returns over the entire horizon generate extra risk-adjusted returns relative to
the portfolios returns in each month within the investment horizon. In fact, our
strategies explore the reinvestment risk associated with momentum, leading to ab-
normal risk-adjusted returns that reflect the path dependence of firm returns over
long horizons. The 12-month returns as a whole captures more information during
the 12-month investment horizon.
Panel A also shows that our strategies significantly load on the market and val-
ue factors but do not significantly load on the other three factors. However, the
strategy return in each month is insensitive to the market factor in Panel B. This
suggests again that our intertemporal momentum strategy is beyond myopic mo-
mentum by also exploring the horizon dependence of momentum. The returns over
multiple periods also capture the serial correlations of the returns. In this sense, our
strategy is not exploring the static market factor itself but to some extent exploring
the structure of it, i.e., the dynamics of the market factor. As shown in Appen-
dix A, although the myopic momentum effect is insignificant for the market, the
intertemporal momentum does significantly predict market returns in the long-run.
The term structure of the loadings on the profitability and investment factors over
the horizon also shows that the 12 monthly portfolio returns as a whole is different
from the return in each month. In untabulated results, we find the same patterns
for 3-, 6-, and 9-month investment horizons.
Theoretically, many factor models have justified as empirical applications of the
ICAPM of Merton (1973), e.g., Fama (1991) and Fama and French (1993, 2015). In
the ICAPM, the state variables relate to shifts in the investment opportunity set
and their predictive ability results from “hedging” risk factors (reinvestment risk). If
those state variables are Markovian, then Merton (1973) shows that the cross-section
of returns is completely characterized by these “original state variables” independent
of investment horizons. However, momentum involves historical information and is
non-Markovian. As horizon increases, the path-dependence of momentum generates
new state variables (hedging) that characterize the agents’ optimal portfolio and
hence the equilibrium, as demonstrated by Li and Liu (2018a, 2018b). Therefore,
if (standard) momentum is a state variable of the ICAPM framework, or a risk
factor, then stock returns will be characterized by different sets of state variables
(i.e., hedging) for different horizons. In other words, the cross-section of returns
with long horizons will be characterized by intertemporal momentum, even though
it can be still characterized by other Markovian factors.
8. Conclusion
While momentum has been extensively documented in the literature, most s-
tudies rely on the myopic strategy that explores momentum alone and is silent on
the intertemporal hedging of Merton (1971). In this paper, we develop a practical
method of hedging momentum by directly exploring the intertemporal momentum
14The alternative averaging suffers look-ahead bias when dealing with missing data because, in
order to be comparable with our “across time” averaging, the averaging weights when calculating
long/short leg return depend on the number of months data availability in next T months.
Press (CRSP) and the S&P 500 data prior to January 1927 from Robert Shiller’s Web site. The
risk-free rate from 1920 to 2017 is the Treasury-bill rate and the risk-free rate prior to the 1920s
is estimated using the commercial paper rates by following Welch and Goyal (2008). The excess
return is the difference between the log return on the S&P 500 (dividend included) and the log
return on a risk-free bill.
16Although the predictability for excess returns of traditional macroeconomic variables has been
found to be stronger at longer horizons (see, among many others, Campbell and Shiller, 1988a,
1988b, Cochrane, 1992, and Ang and Bekaert, 2007), MM or IM variable as a single predictor has
insignificant forecasting power for long-run excess returns. Therefore, the strong predictability at
More importantly, by comparing Panels A-B with Panel C, for any given horizon
T , the R2 statistic for model (A.3) exploring both momentum variables is greater
than the sum of the R2 statistics for models (A.1) and (A.2) which use MM variable
alone and hedging momentum variable alone respectively. This observation holds
for both in-sample and our-of-sample R2 s, and becomes more significant for longer
horizons. This indicates that both MM and IM variables essentially contain comple-
mentary information and must be explored simultaneously for a long-run investor.
In addition, the coefficients on both momentum are much larger in the bivariate re-
gression than in the univariate regressions, suggesting that the univariate regression
with longer horizons suffers from an omitted variable bias that lowers the marginal
impact of both momentum on expected excess returns.17
Both myopic and intertemporal momentum variables are highly correlated with
correlations greater than 0.9. However, they capture different sets of historical
information, as shown in Table A.1 and the cross-sectional results in the body. In
addition, multicollinearity is not an issue for model (A.3) because both regression
coefficients are significant and have the opposite signs, implying very different roles
of the two variables.
where the interval [t − τ, t] is the look-back period for momentum and τ > 0 is the
length of the look-back period. We assume that dBi,t is independent of dBj,t for
i 6= j.
The predictive model of (B.1) is an extension of the single-asset momentum model
of LL. It reduces to LL’s model for N = 1. It is also commonly used in the literature
(e.g., Grinblatt and Moskowitz, 2004; Heston and Sadka, 2008; and Lewellen, 2015;
among others) that documents that the coefficient β is significantly positive even
after controlling for other factors. This model is also supported by the regression
results reported in Table 1 Panel A.
Now we derive the optimal dynamic trading strategy for an investor who maxi-
mizes the expected utility over terminal wealth WT . We assume that the riskless
rate is a constant r. The optimization problem of the investor is given by
1−γ
WT
max E0 , (B.3)
{φi,t }t∈[0,T ] 1−γ
where φi,t is the portfolio weight of stock i at time t, T is the investment horizon,
γ > 0 is a constant relative risk aversion (CRRA) coefficient, and the wealth Wt
satisfies
dWt 0
= φt (α + βmt − r) + r dt + φ0t ΣdBt ,
(B.4)
Wt
where φt = (φ1,t , φ2,t , · · · , φN,t )0 , mt = (m1,t , m2,t , · · · , mN,t )0 , and Bt = (dB1,t , dB2,t , · · · , dBN,t )0
are N × 1 vectors and Σ = diag(σi ) is a N × N diagonal matrix. The optimal port-
folio weight of stock i is, according to Proposition 3.4 of LL, given by
φ∗i,t = φM H
i,t + φi,t . (B.5)
β α−r
φM
i,t = m +
2 i,t
(B.6)
γσi γσi2
that depends on momentum mi,t . The second is the intertemporal hedging demand
given by
β(1 − γ) H
φH
i,t = mi,T,t + ci , (B.7)
γσi2
where mH i,T,t is a weighted average of historical returns with the form mi,T,t =
H
Rt
ω dRi,v and ci is a constant. Applying the Cox-Huang (1989) approach, Propo-
t−τ i,v
sition 3.4 of LL derives the closed-form solutions of both the weight ωi,v and ci . To
save space, we do not specify them in this paper. LL show that mH i,T,t places more
weights on more recent historical returns and the weights rely on parameters γ, α,
β, and σi . Especially, to the leading order of 1/γ, mH
i,T,t reduces to the intertemporal
where the weight ω̂v is given by (3.2). The proof of (B.8) follows Corollary 3.5 of
LL.
Equations (B.5)-(B.8) show that, different from the myopic demand that depends
on momentum, the hedging demand however depends on IM. It is induced by in-
tertemporal hedging of Merton (1971) and provides a hedge for reinvestment risk
over long horizons. The intertemporal momentum strategies examined in our paper
explore this variable.
The new state variable IM reflects the path dependence that is inherent with
momentum. In contrast, for the optimal portfolio problems with Markovian state
variables, the original state variables that characterize the stock prices can consti-
tute a sufficient statistic of the indirect utility and hence can fully characterize the
optimal dynamic portfolios. This is the key difference between path-dependent s-
tate variables, such as momentum, and Markovian state variables, and underlies our
intertemporal factors for long investment horizons.
References
Aı̈t-Sahalia, Y. and Brandt, M. (2001), ‘Variable selection for portfolio choice’, Journal of Finance
56, 1297–1351.
Ang, A. and Bekaert, G. (2007), ‘Stock return predictability: Is it there?’, Review of Financial
Studies 20, 651–707.
Asness, C., Liew, J. and Stevens, R. (1997), ‘Parallels between the cross-sectional predictability of
stock and country returns’, Journal of Portfolio Management 23, 79–87.
Asness, C., Moskowitz, T. and Pedersen, L. (2013), ‘Value and momentum everywhere’, Journal
of Finance 68, 929–985.
Atkins, A. and Dyl, E. (1997), ‘Transactions costs and holding periods for common stocks’, Journal
of Finance 52, 309–325.
Baker, M. and Wurgler, J. (2006), ‘Investor sentiment and the cross-section of stock returns’,
Journal of Finance 57, 1645–1679.
Barroso, P. and Santa-Clara, P. (2015), ‘Momentum has its moments’, Journal of Financial Eco-
nomics 116, 111–120.
Bogle, J. (2000), ‘Restoring faith in financial markets’, The Wall Street Journal .
Brandt, M. (1999), ‘Estimating portfolio and consumption choice: A conditional Euler equations
approach’, Journal of Finance 54, 1609–1645.
Campbell, J. and Shiller, R. (1988a), ‘The dividend-price ratio and expectations of future dividends
and discount factors’, Review of Financial Studies 1, 195–228.
Campbell, J. and Shiller, R. (1988b), ‘Stock prices, earnings, and expected dividends’, Journal of
Finance 43, 661–676.
Campbell, J. and Thompson, S. (2008), ‘Predicting excess stock returns out of sample: Can
anything beat the historical average?’, Review of Financial Studies 21, 1509–1531.
Carhart, M. (1997), ‘On persistence in mutual fund performance’, Journal of Finance 52, 57–82.
Cochrane, J. (1992), ‘Explaining the variance of price-dividend ratios’, Review of Financial Studies
5, 243–280.
Cochrane, J. (2011), ‘Discount rates’, Journal of Finance 66, 1047–1108.
Conrad, J. and Yavuz, M. (2017), ‘Momentum and reversal: Does what goes up always come
down?’, Review of Finance 21, 555–581.
Cooper, M., Gutierrez, R. and Hameed, A. (2004), ‘Market states and momentum’, Journal of
Finance 59, 1345–1365.
Cox, J. and Huang, C. (1989), ‘Optimal consumption and portfolio policies when asset process
follow a diffusion process’, Journal of Economic Theory 49, 33–83.
Da, Z., Gurun, U. and Warachka, M. (2014), ‘Frog in the pan: Continuous information and
momentum’, Review of Financial Studies 27, 2171–2218.
Daniel, K., Jagannathan, R. and Kim, S. (2019), ‘A hidden Markov model of momentum’, Working
paper .
Daniel, K. and Moskowitz, T. (2016), ‘Momentum crashes’, Journal of Financial Economics
122, 221–247.
DeMiguel, V., Garlappi, L. and Uppal, R. (2009), ‘Optimal versus naive diversification: How
inefficient is the 1/N portfolio strategy?’, Review of Financial Studies 22, 1915–1953.
Fama, E. (1991), ‘Efficient capital markets: II’, Journal of Finance 46, 1575–1615.
Fama, E. and French, K. (1992), ‘The cross section of expected stock returns’, Journal of Finance
47, 427–465.
Fama, E. and French, K. (1993), ‘Common risk factors in the returns on stocks and bonds’, Journal
of Financial Economics 33, 3–56.
Fama, E. and French, K. (2015), ‘A five-factor asset pricing model’, Journal of Financial Economics
116, 1–22.
Fama, E. and MacBeth, J. (1973), ‘Risk, return, and equilibrium: Empirical tests’, Journal of
Political Economy 81, 607–636.
Gorton, G., Hayashi, F. and Rouwenhorst, K. (2013), ‘The fundamentals of commodity futures
returns’, Review of Finance 17, 35–105.
Grinblatt, M. and Moskowitz, T. (2004), ‘Predicting stock price movements from past returns:
The role of consistency and tax-loss selling’, Journal of Financial Economics 71, 541–579.
Gulen, H. and Petkova, R. (2018), Absolute strength: Exploring momentum in stock returns,
working paper, Purdue University.
Hendricks, D., Patel, J. and Zeckhauser, R. (1993), ‘Hot hands in mutual funds: Short-run persis-
tence of performance’, Journal of Finance 48, 93–130.
Heston, S. and Sadka, R. (2008), ‘Seasonality in the cross-section of stock returns’, Journal of
Financial Economics 87, 418–445.
Huang, D., Zhang, H., Zhou, G. and Zhu, Y. (2019), Twin momentum: Fundamental trends
matter, working paper, Washington University in St. Louis.
Jegadeesh, N. and Titman, S. (1993), ‘Returns to buying winners and selling losers: Implications
for stock market efficiency’, Journal of Finance 48, 65–91.
Jobson, J. and Korkie, B. (1980), ‘Estimation for Markowitz efficient portfolios’, Journal of the
American Statistical Association 75, 544–554.
Jostova, G., Nikolova, S., Philipov, A. and Stahel, C. (2013), ‘Momentum in corporate bond
returns’, Review of Financial Studies 26, 1649–1693.
Kan, R. and Zhou, G. (2007), ‘Optimal portfolio choice with parameter uncertainty’, Journal of
Financial and Quantitative Analysis 42, 621–656.
Lan, C. (2015), ‘An out-of-sample evaluation of dynamic portfolio strategies’, Review of Finance
19, 2359–2399.
Lewellen, J. (2015), ‘The cross-section of expected stock returns’, Critical Finance Review 4, 1–44.
Li, K. and Liu, J. (2018a), Optimal dynamic momentum strategies, working paper, SSRN.
https://ssrn.com/abstract=2746561.
Li, K. and Liu, J. (2018b), Portfolio selection under time delays: A piecewise dynamic programming
approach, working paper, Available at SSRN: https://ssrn.com/abstract=2916481.
Markowitz, H. (1952), ‘Portfolio selection’, Journal of Finance 7, 77–91.
Merton, R. (1971), ‘Optimum consumption and portfolio rules in a continuous-time model’, Journal
of Economic Theory 3, 373–413.
Merton, R. (1973), ‘An intertemporal capital asset pricing model’, Econometrica 41, 867–887.
Michaud, R. (1989), ‘The Markowitz optimization enigma: Is ‘optimized’ optimal?’, Financial
Analysts Journal 45, 31–42.
Moskowitz, T. and Grinblatt, M. (1999), ‘Do industries explain momentum?’, Journal of Finance
54, 1249–1290.
Moskowitz, T., Ooi, Y. and Pedersen, L. (2012), ‘Time series momentum’, Journal of Financial
Economics 104, 228–250.
Newey, W. and West, K. (1987), ‘A simple positive semi-definite, heteroskedasticity and autocor-
relation consistent covariance matrix’, Econometrica 55, 703–708.
Figure 1. The figure illustrates the returns used to forecast the future
return rt at month t, for t = 1, 2, · · · , 12. The future return r1 is forecasted
by returns {r−11 , r−10 , · · · , r0 }; r2 is forecasted by returns {r−10 , r−9 , · · · , r1 };
· · · ; and r12 is forecasted by returns {r0 , r1 , · · · , r11 }. It shows that more
recent historical returns predict more future returns. So a multiperiod
investor should use a new momentum variable that places more weights
on more recent historical returns to manage reinvestment risk.
Figure 3. The upper panel and middle panel illustrate the smoothed
overlapping ratios (moving average of 10 observations before, current
observation and 10 observations after) between our double-sort
portfolio and the myopic momentum portfolio for the long legs and
short legs respectively. The lower panel plots the smoothed overlapping
ratios for portfolios (the average of the ratios for long legs and short
legs) and the market index.
Panel B: ri,t:t+T = aH + bH mH H
i,T,t + i
3-month 9.26 1.97
(8.90)
6-month 7.89 1.78
(12.40)
9-month 6.27 1.62
(13.02)
12-month 4.61 1.40
(11.39)
Panel B: “breaking down” returns for the IM strategy with 12-month horizon
1st-month 2.31 (3.08) -0.16 (-0.86) -0.30 (-0.98) -1.03 (-2.50) 1.52 (2.75) 1.73 (1.96)
2nd-month 1.81 (3.44) -0.12 (-0.92) -0.19 (-0.84) -0.91 (-2.80) 0.88 (2.21) 1.26 (2.07)
3rd-month 1.31 (3.31) -0.04 (-0.37) -0.14 (-0.78) -0.64 (-2.40) 0.66 (2.22) 0.65 (1.46)
4th-month 1.18 (3.55) -0.02 (-0.23) -0.15 (-0.96) -0.64 (-2.80) 0.41 (1.62) 0.45 (1.25)
Panel C: rt:t+T = a + b1 mt + b2 mH
T,t + t+1
3-month -2.39 2.76 1.30 -0.32
(-1.45) (1.73)
6-month -2.91 3.46 2.36 1.57
(-2.15) (2.84)
9-month -2.73 3.26 2.00 1.49
(-1.86) (3.00)
12-month -3.58 3.52 1.82 1.21
(-2.49) (3.71)