Professional Documents
Culture Documents
Preprocessing
Before performing the actual sentiment analysis, there are several prepro-
cessing steps as follows:
1. Filtering. We apply the following filter rules: first, each announcement must
have at least 50 words. Second, we focus only on ad hoc press releases from
German companies which are written in the English language. Our final
corpus consists of 14,427 ad hoc announcements. To study stock market re-
action, we use the daily stock market returns of the corresponding company,
originating from Thomson Reuters Datastream. We only include business
days and take the first message of each day; yielding a total of 1,892 observa-
tions. In addition, we adjust the publication days of ad hoc announcements
according to the opening times of the stock exchange. This is achieved by
assigning all disclosures filed after 8 p. m. to the next day.
2. Tokenization. Corpus entries are split into single words named tokens [1].
4. Stop word removal. Words without a deeper meaning, such as the, is, of,
etc. are named stop words [5] and can be removed. We use a list of 571 stop
words proposed in [6].
One of the most common ways to study stock price reactions from news
disclosures is to use the event study methodology [15]. We now utilize this ap-
proach to measure the stock price reaction caused by ad hoc announcements.
Abnormal returns are defined as the difference between the actual and the nor-
mal return of a security at time τ. The actual return R(τ) is measured by the
price change of the security, while the normal return originates from the market
model. Then, the abnormal return AR(τ) at a news disclosure τ is calculated
via
AR(τ) = R(τ) − E[R(τ) | ¬X τ ], (C.1)
where E[R(τ) | ¬X τ ] represents the expected return in the absence of an event
X τ.
In our study, we calculate the normal return, i. e. the expected return in
the absence of a news release, based on the market model [15]. The market
model assumes a stable linear relation between market return and normal re-
turn. Thereby, we model the market return using a reference market index,
namely, the CDAX. Our estimation period is set to 20 trading days prior to the
event window. By definition, we set the abnormal return to zero for the index
itself. This might be relevant when we analyze strategies that decide between
individual stocks and an index.
Altogether, we then can expect that the abnormal return following the dis-
closure of new and relevant information is mostly due to the news release. In
other words, the abnormal return can be regarded as some kind of excess return
caused by the news release.
with a time span of δ days. In short, momentum denotes the difference be-
tween today’s closing price and the closing price N days ago, thus referring to
prices continuing to trend. In comparison, the rate-of-change RoCi represents
the relative change as a fraction, i. e.
1: Initialize stock s ← ⊥.
2: for t in T do
pi,t−1 − pi,t−1−δ
3: Compute RoCi,t−1 ← for all stocks i.
pi,t−1−δ
4: if s = ⊥ then
5: Buy or short-sell stock s ← arg max |RoCi |.
i
6: else if RoCs,t−1 < θRoC then
7: Remove investment in stock s.
8: Buy or short-sell stock s ← arg max |RoCi |.
i
9: end if
10: end for
The portfolio strategy builds upon the previous momentum trading. But
instead of choosing always one stock, it updates the selection of stocks on a daily
basis. The sole criterion for the decision is rate-of-change. For that purpose, this
strategy picks a total of N stocks s = 1, . . . , N by maximizing maxi RoCs,t−1 . By
utilizing a larger portfolio of stocks, this strategy adjusts to the relative riskiness
by spreading the potential risk of defaults.
Appendix E. Pseudocode
The results of both benchmarks, namely, the CDAX stock market index and
momentum trading, are presented in Fig. F.1 where we see different perfor-
mance patterns. In other words, this figure shows how the value of an invest-
ment portfolio evolves over time when starting with 1 monetary unit. After a
highly volatile beginning (with also negative valuations), the value of the CDAX
increases gradually over time, while momentum trading faces a sharp drop in
the beginning. It only later recovers in early 2005, followed by a substantial
rise. The final valuation is more substantial in the case of momentum trading
in comparison to the CDAX.
Figures G.2 and G.3 provide a comparison of risk and (abnormal) returns.
60
-30
Jan 2004 Apr 2004 Jul 2004 Oct 2004 Jan 2005 Apr 2005 Jul 2005
Figure F.1: Cumulative returns of both the CDAX index and the momentum trading strategy
compared across the first 400 business days.
Supervised Learning
Average Daily Abnormal Return (in %)
Figure G.3: Comparison of abnormal returns versus risks across different trading strategies. Con-
sistent with the literature, risk is measured in terms of volatility.
1.2 Supervised Learning
Reinf. Learning
0.9
Average Daily Return (in %)
0.6
0.3
Figure G.2: Comparison of returns versus risks across different trading strategies. Consistent
with the literature, risk is measured in terms of volatility.
References
[8] B. Pang, L. Lee, Opinion Mining and Sentiment Analysis, Foundations and
Trends in Information Retrieval 2 (2008) 1–135.
[9] M.-A. Mittermayer, G. F. Knolmayer, Text Mining Systems for Market Re-
sponse to News: A Survey, 2006.
[14] E. Henry, Are Investors Influenced By How Earnings Press Releases Are
Written?, Journal of Business Communication 45 (2008) 363–407.
[19] K.-J. Kim, Financial Time Series Forecasting using Support Vector Ma-
chines, Neurocomputing 55 (2003) 307–319.