Professional Documents
Culture Documents
Chapter
12
INTRODUCTION
This chapter provides readers with volume techniques for algorithmic
trading. This includes methods to forecast monthly average daily volumes
(ADVs) and daily volumes. We also provide insight into weekly and
intraday volume patterns. The monthly ADV model incorporates previous
volume levels, momentum, and market volatility. The daily volume fore-
casting model is based on an autoregressive moving average (ARMA)
time series using a historical median measure combined with a day of
week effect adjustment factor.
X 1þa4 !
n
I xk
MIbp ðxk Þ ¼ b1 $ $ þ ð1 b1 Þ$I
k¼1
X xk þ vk
where
Size ¼ order size expressed in terms of ADV as a decimal:
X
Size ¼
ADV
a ¼ trading rate
X
a¼ ; a0
Vt
X 1þa4 !
n
I xk
MIbp ðxk Þ ¼ b1 $ $ þ ð1 b1 Þ$I
k¼1
X xk þ vk
Average Daily Volume 303
Techniques to estimate average daily monthly volume (ADV) and daily vol-
ume (Vt) with a day of week effect are provided below.
Methodology
Period: 19 years of data: Jan-2000 through Dec-2018.
Universe: ADVs for large cap (SP500) and small cap (R2000) stocks by
month.
Definitions
V(t) ¼ ADV across all stocks per day in corresponding market cap
category
s(t) ¼ average stock volatility in the month
SPX ¼ SP500 index value on last day in month
DV(t) ¼ log change in ADV (month over month [MOM]). For example,
in January, this represents the log change in ADV from December to
January:
DVðtÞ ¼ lnfVðtÞg lnfVðt 1Þg
DV(t 12) ¼ log change in daily volume (MOM) 1 year ago to incorpo-
rate a monthly pattern. For example, in January, this represents the log
change in ADV from December to January in the previous year. This
is a proxy for a seasonal effect:
DVðt 12Þ ¼ lnfVðt 12Þg lnfVðt 13Þg
304 CHAPTER 12 Volume Forecasting Techniques
Fig. 12.1 shows the ADV per stock in each month for large cap and small cap
stocks over the period Jan-2000 through Dec-2018 (18 years of monthly
data). For example, in December 2018 the ADV for a large cap stock was
5,947,593 per day and the ADV for a small cap stock was 983,149 per
day. It is important to note that historical volume levels will change based
on stock splits and corporate actions.
Beginning in 2014, large cap stock ADV was just slightly above 4 million
shares per day. There has been a recent spike in large cap volume in 4Q-
2018. Over the same period, small cap ADV has increased dramatically
from about 500,000 shares per day to about 850,000 shares per day. This
represents approximately a 70% increase in ADV for small cap stocks.
Analysis
The monthly volume forecasting analysis is to determine an appropriate
relationship to predict the expected change in monthly volume levels. Since
the number of trading days will differ in each month due to weekdays,
holidays, etc., it is important that we adjust the number of trading days to
make a fair comparison across time. Our analysis included 1- and
12-month autoregressive terms, the change in monthly volatility levels for
Average Daily Volume 305
May-07
May-18
Aug-04
Dec-00
Dec-11
Aug-15
Nov-01
Sep-03
Nov-12
Mar-09
Feb-10
Sep-14
Apr-08
Jan-00
Jan-11
Jun-06
Oct-13
Jun-17
Oct-02
Jul-05
Jul-16
(B) Small Cap ADV
2000 - 2018
1,200,000
1,000,000
800,000
600,000
400,000
200,000
0
May-2007
May-2018
Nov-2001
Sep-2003
Nov-2012
Sep-2014
Dec-2000
Aug-2004
Feb-2010
Dec-2011
Aug-2015
Jan-2000
Jun-2006
Jan-2011
Jun-2017
Mar-2009
Oct-2013
Oct-2002
Apr-2008
Jul-2005
Jul-2016
each market cap category, and the MOM change in SP500 index for both
large and small cap stocks.
We estimated our regression coefficients for large and small cap stocks for
three different periods to help uncover trends and to evaluate the stability of
these relationships. These periods are:
19 years: 2000e18.
10 years: 2014e18.
5 years: 2016e18.
306 CHAPTER 12 Volume Forecasting Techniques
Regression Results
The result of our regression study is shown in Table 12.1. In total, there are
six scenarios that were analyseddthree for large cap and three for small cap.
The results show the estimated betas, corresponding standard errors and
t-stat, and the R2 statistic. Overall our regression model had a very strong
fit. The model did explain a larger percentage of the variation for
large cap stocks than for small cap stocks (as is expected due to the trading
stability of the smaller companies).
Beta 0.003 0.324 0.210 0.359 0.793 Beta 0.003 0.208 0.199 0.309 0.350
SE 0.007 0.050 0.050 0.039 0.211 SE 0.007 0.060 0.061 0.044 0.194
t-Stat 0.478 6.515 4.204 9.134 3.755 t-Stat 0.475 3.482 3.274 6.961 1.807
R2 0.51 R2 0.27
will purchase fewer shares in a rising market but more shares in a fall-
ing market. Redemption of a fixed dollar amount will require fewer
shares to be traded in a rising market and more shares to be traded
in a declining market. We expect that this trend will stay constant
and may additionally become negative for small cap volumes until
investor sentiment and overall market confidence increase.
o Small cap stock volume is positively related to prices. As the market
price increases, small cap volume increases and vice versa. This is
likely due to high investor sentiment during times of increasing
market levels. Investors will put more in small cap stocks in a rising
market hoping to earn higher returns, but will trade small stocks less
often in a decreasing market. This relationship, however, was not
found to be statistically significant in more recent years. There no
longer appears to be any relationship between price level and small
cap volumes.
The monthly volume forecasting models for large and small cap stocks
using all 19years of data are:
n Small cap stocks. Small cap stock volume was found to be statistically
related to three factors: (1) previous month volume change, (2) change
in volatility, and (3) price level. Small cap stocks were found to have
a negative relationship with previous volume change. That is, if the
volumes were up in the previous month, they were likely to decrease
in the current month. If the volumes were down in the previous month,
they were likely to be up in the current month. Volumes continue to be
related to volatility (although almost insignificantly). Thus, change in
volatility level is still somewhat an indication for the potential to earn
a short-term profit. Volumes are negatively related to price level. This
negative relationship is likely due to funds having a fixed dollar quantity
that they will invest in the market. Thus, as prices decrease, portfolio
managers will purchase more shares and as prices increase, portfolio
managers will purchase fewer shares.
Our preferred monthly volume forecasting model for large and small
cap stocks based on the previous 3 years of data is shown below. We do
recommend performing an updated analysis of market volumes to test for
the statistical significance of each of the factors. Our results for all the
factors are shown below (not all these factors were found to be statistically
significant):
Variable Notation
V(t) ¼ actual volume on day t
b
VðtÞ ¼ forecasted volume for day t
MDV(n) ¼ MDV computed using previous n-trading days
ADV(n) ¼ ADV computed using previous n-trading days
Day Of Week(t) ¼ percentage of weekly volume that typically trades on
the given weekday
b ¼ autoregressive sensitivity parameterdestimated via ordinary least
b
squares regression analysis
e(t) ¼ forecast error on day t
where V t ðnÞ is either the n-day moving ADV or n-day moving MDV, and
e(t1) is the previous day’s volume forecast error (actual minus estimate).
That is:
eðt 1Þ ¼ Vðt 1Þ V t1 ðnÞ $ DayOfWeekðt 1Þ
The error term above is calculated as the difference between actual volume
on the day and estimated volume only using the day of week adjustment
factor. The theoretical ARMA model will cause persistence of the error
term because it includes the previous day’s error in the forecast,
e.g., e(t 2). Our analysis has found that we could achieve more accurate
estimates defining the error term only as shown above. Additionally,
computation of daily volume estimates is also made easier since we do not
need to maintain a series of forecast errors.
Analysis Goal
The goal of our daily volume forecasting analysis is to determine:
n Which is better: ADV or MDV?
n What is the appropriate number of historical days?
n Day of week adjustment factor
n Autoregressive volume term
312 CHAPTER 12 Volume Forecasting Techniques
Conclusion #1
n We conclude that the MDV using a historical period of 10 days, e.g.,
MDV(10), has the lowest forecast error across stocks and market cap
during our analysis period.
Forecasting Daily Volumes 313
45%
40%
ADV Median
35%
1 3 5 7 9 11 13 15 17 19 21 23 25 27 29
Historical Time Period
75%
70%
65%
60%
55%
50% ADV Median
45%
1 3 5 7 9 11 13 15 17 19 21 23 25 27 29
Historical Time Period
Authors note: As shown above, the ADV measure will more often be higher
than the actual volume due to the positive skew of the volume distribution.
Volume distributions tend to be more above average than below average
outliers. This will result in actual costs being higher than the predicted
cost. For example, if we trade 200,000 shares out of a total ADV of
1,000,000 shares, we may be tempted to state that a full-day strategy corre-
sponds to a trading rate of 20%. However, if the actual volume on the day is
only 800,000 shares, the actual trading rate will be 25%, resulting in higher
than predicted costs. The market impact forecasting error will be biased (to
the high side) when using the ADV measure to predict daily volume. In our
sample, we found the ADV to be higher than the actual volume on the day
65% of the time.
314 CHAPTER 12 Volume Forecasting Techniques
Conclusion #2
n Stock trading patterns exhibit a cyclical weekly pattern.
n The cyclicality pattern is different for large cap and small cap stocks.
Forecasting Daily Volumes 315
n FIGURE 12.3 Day of Week Effect. (A) Large Cap Stocks Day of
Week Effect, (B) Small Cap Stocks Day of Week Effect.
b
Step 3. Estimate the Autoregressive Parameter b
The autoregressive parameter is used to correct for persistence of volume over
consecutive days. We found above average volume days were more likely to
be followed by above average volume days and below average volume days
were more likely to be followed by below average volume days. But the rela-
tionship was much more significant for above average volume days than for
the below average volume days. This process is as follows:
n Estimate volume for the day based on the 10-day median plus the day of
week adjustment.
n Compute the forecast error term as the difference between the actual
volume on the day and the estimated volume. This difference is:
εðtÞ ¼ VðtÞ MedianðtÞ$DayOfWeekðtÞ
316 CHAPTER 12 Volume Forecasting Techniques
n Run a regression of the error term on its 1-day lagged term, that is:
εðtÞ ¼ a þ b$εðt 1Þ
o Compute the slope term b for large and small cap stock.
Large cap stocks had a much larger degree of autocorrelation than small cap
stocks. The average correlation of errors was blarge ¼ 0.409 for large
cap stocks and bsmall ¼ 0.237 for small cap stocks. After correcting for auto-
correlation there was still a very slight amount of autocorrelation present,
negligible to the effect on our forecasts, due to the constant term in our
regression model.
Forecast Improvements
We next compared the results from our preferred ARMA model (shown
above) to a simple 30-day ADV measure (e.g., ADV30) to determine the
extent of the forecasting improvement. The preferred ARMA model
reduced forecast error 23.2% for large cap stocks and reduced forecast error
23.9% for small cap stocks (Table 12.2).
Conclusion #3
n Statistical evidence shows mean reversion trend in monthly volumes.
n Forecasts can be improved through incorporation of an autoregressive
term.
Author’s note:
n In theory, the beta term in an ARMA model is often shown to be fore-
casted without the constant term alpha, but since we are using a moving
median it is not guaranteed that the mean error will be zero and thus a
constant term is needed.
n The ARMA forecast with the “beta” autoregressive terms can be
computed both with and without special event days. Since it is important
that this technique be continuous, unlike the day of week adjustment, we
need to include all days. As an adjustment, we can (1) treat the special
event day and day after the special event as any other day and include
an adjustment for the previous day’s forecasted error, (2) define the
forecast error to be zero on a special event day (this way it will not
be included in the next day’s forecast), or (3) use a dummy variable
for special event days.
n Our analysis calculated an autoregressive term across all stocks in each
market cap category. Users may also prefer to use a stock-specific
autoregressive term instead. We did not find statistical evidence that a
stock-specific beta is more accurate for large cap stocks but there was
some evidence supporting the need for a stock-specific beta for the small
cap stocks. Readers are encouraged to experiment with stock-specific
forecasts to determine what works best for their specific needs
(Table 12.3).
they need to trade faster or slower, and the cost ramifications of doing so.
Knowledge of intraday trading patterns also helps investors hedge timing
risk and structure optimized trading strategies to balance the tradeoff
between market impact and timing risk.
Stock intraday volumes have traditionally followed a U-shaped pattern
where more volume is traded at the open and close during midday.
However, more recently, and over the previous 5 years there has been a shift
to the intraday volume profiles and these curves now more resemble a
J-shaped pattern and a U-shaped pattern. Now, while there is still more
volume traded at the open than midday, there is much more volume traded
at the close than the open.
This shift in more trading volume at the close is due to less market trans-
parency at the open and a lack of proper price discovery mechanism in place
to help investors determine the appropriate fair value price for stocks.
As discussed in previous chapters, specialists and market-makers provided
investors with a very valuable price discovery process where there was con-
fidence in the opening stock prices on the day. But now, in an environment
without specialists and market-makers, investors must rely on trading
algorithms executing 100 share lots to determine the fair value stock price.
While algorithms do get the price discovery process right and do determine
the fair value stock price, it could take from 15 to 45 min to determine a
stable fair value price. Because this morning period has more volatility
and less price certainty, some investors have elected not to trade at the
open and they start trading their orders at 10 a.m. or after. Thus they will
need to execute more shares at the close to complete their orders. Historical-
ly, specialists and market-makers were able to determine a fair value opening
price because they were provided with the complete order size for numerous
investors and they maintained an order book with buy and sell shares and
corresponding prices. Thus by gauging the buyesell imbalance at the
opening, they could specify an accurate opening price. Nowadays, algo-
rithms do not have access to orders from any other investors, and thus are
at a disadvantage when it comes to specifying a fair value opening price.
Small cap stock volume spikes more at the close than large cap stocks. This
higher end of day spike for small cap stocks is primarily caused because
small cap stocks exhibit more uncertainty in period volumes and if investors
miss a trading opportunity during the day they will need to make up for
the missed opportunity by trading more volume at the close. Otherwise,
the order may not complete by the end of the day.
Forecasting Daily Volumes 319
Fig. 12.4A illustrates the intraday trading patterns for large cap and small cap
stocks. As shown in the figure, both large cap and small cap stock trading
volume spikes toward the close and closing volumes are much higher than
opening volumes. Intraday volume profiles are used by algorithms to deter-
mine how many shares of the order to execute in each trading interval.
Fig. 12.4B illustrates the cumulative intraday volume pattern for stocks.
Notice the increase in cumulative volume for small cap stocks. This is
caused by the increase in trading volumes into the close. Cumulative
intraday volume profiles are used by algorithms to determine the percentage
of the order that is to be completed at a specified point in time and to forecast
the remaining volume on the day.
14:15
9:30
9:45
10:00
10:15
10:30
10:45
11:00
11:15
11:30
11:45
12:15
12:30
12:45
13:00
13:15
13:30
13:45
14:00
14:30
14:45
15:00
15:15
15:30
15:45
LC SC
LC SC
Daily Volumes
SP500 100% 104% 119% 184% 124% 107% 105% 92% 33%
R2000 100% 104% 145% 192% 285% 109% 106% 96% 48%
Intraday Volume
SP500 100% 105% 97% 186% 106% 103% 100% 91% 32%
R2000 100% 105% 110% 193% 183% 103% 102% 95% 47%
Market on Close %
SP500 3.3% 2.9% 10.2% 4.4% 20.8% 7.8% 7.8% 3.7% 0.9%
R2000 4.0% 3.1% 11.6% 6.1% 110.7% 10.3% 8.4% 4.5% 2.1%
Market on Open %
SP500 0.7% 0.7% 15.6% 1.3% 0.7% 0.7% 0.6% 0.8% 0.6%
R2000 1.3% 1.1% 29.8% 2.5% 1.3% 1.1% 0.9% 1.4% 1.4%
Source: Journal of Trading/The Science of Algorithmic Trading and Portfolio Management.
322 CHAPTER 12 Volume Forecasting Techniques
where
cdf(t) ¼ cumulative intraday volume percentage at time t
ADV ¼ stock’s ADV
Actual Volume(t) ¼ actual volume traded on the day from the open
through the current time