Professional Documents
Culture Documents
Traditional and Emotional Stylometric An
Traditional and Emotional Stylometric An
257
@ 1996Kluwer Academic Publishers. Printed in the Netherlands.
Cynthia Whissell
Psychology Department, Laurentian University of Sudbury, Sudbury, Ontario P3E 2C6, Canada
Email: cwhissel@Nickel.Laurentian.Ca
Abstract
Traditional stylometric measures such as word usage, word length, and word repetition were paired with six
new measures that described word emotionality in terms of a word's pleasantness, its activation level, and the
combination of these factors. All measurements were applied to the songs composed by Beatles Paul McCartney
and John Lennon between 1962 and 1970. Stylistic and emotional differences between composers and across years
were found to be in agreement with observations made by critics and analysts of Beatles' songs, suggesting that
emotional stylometry is a valid instrument for the analysis of text. Lennon was the less pleasant and the sadder
lyricist, and the Lennon-McCartney lyrics became less pleasant, less active, and less cheerful over time. Several
other differences were observed and reported. A technique for summary emotional description (the emotion clock)
was also introduced.
Computer-driven stylometric analysis capitalizes on which studies words without studying their meanings
the computer's ability to count large numbers of (Sams, 471). On the other hand, stylometry has been
words speedily and accurately. A question frequently lauded for its relative objectivity (Merriam, 5). This
addressed by early stylometrists was that of authorship report describes a new stylometric technique - one
or of differences among authors. Discussions taking which adds some degree of meaning to word-counting
place in the Shakespeare Newsletter of 1984 (4, 5) analyses - and it applies this new technique, in con-
suggest that this question was of interest a dozen years junction with some standard stylometry, to the lyrics of
ago. It continues to be of interest to researchers such songs written by Beatles' composers Paul McCartney
as Smith ("Sir Thomas Moore, Pericles, and Stylom- and John Lennon.
etry" was published in 1994). A second common line In his 1969 article, Charles Osgood (194-199)
of inquiryhas addressed the ontogenesis or develop- distinguished between the two types of meaning that
ment of style. Development comes into question when may be carried by a word. The first is denotative
works of a single author (or those in a single category) or descriptive meaning, and the second connotative
are spread across a considerable span of time, during or emotional meaning. Denotative meaning is both
which they might have been expected to change. In complex and multidimensional. It is therefore very
his note on "'Edmund Ironside' and Stylometry," difficult to quantify. Connotative meaning, however,
Sams (470) recognizes the importance of changing may be quantified in terms of two or three straight-
styles, though he criticizes the assumptions that Smith forward dimensions. The quantification of connotative
used to describe Shakespeare's different modes of meaning was pioneered by Osg0od with the help of
writing. the semantic differential rating technique. According
Stylometry has been criticized for being a cold to Osgood, three emotional dimensions (evaluation,
and impoverished method of textual analysis - one potency, activation, or E, P, and A) were sufficient to
258
explain most of the variability in semantic differen- t" 1567-1570). Bestgen, Hogenraad, and their col-
tial ratings. Other researchers have reported that two leagues, as well as Anderson and Masters have also
dimensions (evaluation, activation) were sufficient to addressed the issue of emotional stylometry, and have
explain some 80% of differences among words rated on provided convincing evidence for the contribution
scales such as good-bad, fast-slow, blue-yellow, nice- which it can make to the analysis of texts such as
nasty, and so on (Whissell, "Pleasure and Activation stories, titles, and sentences (Bestgen, 21-36; Hogen-
Revisited" 871-874; Russell 1152-1168). raad, Bestgen, & Durieux 455-478; Anderson &
In terms of denotative meaning, there is little simi- McMaster 3-9).
larity between an ice cream cone and a smile yet as Although emotional textual analysis suffers from
researchers who describe connotative meaning have some of the same limitations as traditional stylometry
noted, there is considerable similarity in the emotional (the principle limitation being the paucity of mean-
meaning of these two words because both are ing carried by single words) it brings the analyst one
pleasant (positively evaluated) and moderately active. step closer to the complex judgments of text made
Similarly, nightmares and broken bones are both by readers on an ongoing basis, and it accomplishes
unpleasant and active, while sleeping and rocking this while utilizing relatively objective and perfectly
a rocking-chair are both pleasant and passive, and replicable assessment techniques.
failure and loneliness are both unpleasant and passive. Any new measurement technique must be vetted
Whissell and her colleagues (Whissell, Fournier, in terms of its reliability and its validity (its consis-
Pelland, Weir, & Makarec, 877) have established a tency of measurement, and its ability to measure what
dictionary of connotative meaning by asking people to it claims to measure) before it can pass into general
rate several thousand words in terms of the two main use (Anastasi describes the requirements of accept-
dimensions of emotion-activation and pleasantness (or able measurement instruments in chapters 5 and 6 of
evaluation). Where standard dictionaries provide a her book on psychological testing). Where emotional
description of the denotative meaning of individual stylometry is concerned, reliability is not problem-
words, the Dictionary of Affect provides two ratings atic. The same passage or text will generate exactly
that represent connotative meaning. For example, the the same emotional measures each time it is assessed
Dictionary of Affect reports ratings of 4.2 (activation) because measures depend on ratings of the emotional
and 6.4 (pleasantness) for the word "delighted" and dimensions as these apply to the words in the passage.
ratings of 2.4 (activation) and 3.2 (pleasantness) for A computer matches the passage to the Dictionary
the word "gloomy." Given that both ratings were made of Affect, and summarizes the resulting data in terms
with reference to a seven-point scale which had a stan- of passage means or percentage usage of various
dard deviation of one unit, it becomes evident that categories of emotional word. Reliability across
"delighted" carries a positive and moderately active ° passages but within author has also been demon-
connotation, while "gloomy" carries an unpleasant, strated by Whissell ("A Computer Program" 820-21).
inactive one. The two dimensions are unrelated or Passages written by different authors can be correctly
orthogonal: a high score on one will in no way pre- classified with considerable success on the basis of a
dict a high or low score on the other. combination of traditional and emotional stylometric
In her 1994 article dealing with the writing of measures.
Hemingway, Galsworthy, and Faulkner, Whissell ("A A reliable or consistent measure is sure to be
Computer Program" 818) has suggested that whole measuring something, but there is no guarantee that it
passages may be quantified in terms of the emotional is measuring what it claims to measure. The question
or connotative meaning of their component words, which remains to be answered addresses the validity
and has further demonstrated that a combination of what is being measured, and this question is much
of stylometric measures with emotional measures more difficult to answer in absolute terms. One species
provides an improved method of text description which of validity discussed by Anastasi is convergent validity
comes closer to representing the complexity of criti- (156-158). Emotional stylometric measures could be
cal commentaries that describe authors' styles than do evaluated against a criterion of critical commentary in
techniques which do not quantify emotion. Whissell a search for convergence or agreement among method-
has also used a combined stylometry (with both emo- ologies. For example, Whissell has demonstrated
tional and traditional measures) to examine writing a species of validity for emotional and traditional
styles in popular fiction ("Objective Analysis of Tex- stylometry by pointing out the consistency between
259
critics' comments and measurements of text ("A Com- especially by the third stage (Norman, 292; Camp-
puter Program" 823). bell & Murphy, xxiii). The most noticeable change,
The opus of Beatles' composers John Lennon and in the view of the critics, was a decreased cheerful-
Paul McCartney is both limited and readily available ness and a move away from love songs towards songs
for analysis in a standard copyrighted form (Campbell of self-discovery. According to this observation, both
and Murphy list all Beatles' compositions in alphabet- the pleasantness and the activation level of Beatles'
ical order). The two musicians composed 155 songs songs should have decreased across the four stages.
between 1962 and 1970, and also wrote the lyrics for A subsidiary question addresses the stage at which
their music. Songs composed after the group disbanded the greatest change occurred: according to a some
were excluded from consideration. Although almost observers, the Beatles had changed when Sgt Pepper
all the songs written by McCartney and Lennon was released but according to others, the changes noted
were copyrighted jointly under the names Lennon- (decreased cheerfulness, represented in emotional
McCartney, Compton (126) has been able to establish terms by decreased pleasantness and decreased activa-
that most of the songs were authored by one or the tion) were already evident in the second stage, specifi-
other Beatle. This is especially true for songs written cally in the songs released for the albums Revolver and
after 1964. Compton also notes that each composer Rubber Soul. Emotional stylometry will confirm, if
generally provided both the words and the music possible, the change in cheerfulness of Beatles' lyrics.
for their individual compositions, though McCartney An attempt will also be made to identify the stage at
generally composed the music first, and Lennon which this change took place.
frequently created lyrics before composing music. This Several traditional stylometric measures were
report limits its study to the Lennon-McCartney lyrics incorporated into the analysis of Beatles' lyrics. These
which are treated as verbal texts and submitted to were: word length, word frequency, use of personal
emotional and traditional stylometric analysis. pronouns (first, second, and third person); use of punc-
The first question which will be addressed by the tuation; type to token ratio within songs, song length,
analyses in this report is the question of emotional and the use of ten different specific types of words.
differences between the two composers. Lennon is The computer program used in this analyses counted
known to have been the sadder of the two individ- negatives (no, not, -n't), definite articles (the), indef-
uals. This sadness is evident in his poetry, and it was inite articles (a, an), conjunctions (and), forms of the
commented upon by several critics of Beatles music word "love" (loved, loving, loves, etc), forms of the
(Schaffner notes this on page 148 of his book, and word "girl," forms of the word "dead", and whoops,
Davies on page 110 of his). By contrast, McCartney hollers, and nonsense syllables (such as nah, ya, doo,
was regarded as the more cheerful and upbeat group OOO).
member (Davies, 281). If emotional stylometry is These stylometric measures allowed for the testing
sensitive to such differences, it will be assumed that of several observations made by Beatles' critics,
some evidence for convergent validity of the technique namely that Lennon was more preoccupied with
has been provided. death (Schaffner, 148), that he was more negative
The second question which will be addressed is than McCartney, that Lennon played with words and
that of development across the nine years during which language (Davies, 106; Schaffner, 27, 41, 114), that
McCartney and Lennon composed Beatles' music. The McCartney wrote more love songs (Compton, 121),
Beatles' career may be broken down into four stages: that McCartney was fond of whoops and other non-
1962-1964 (from first recordings to world-wide popu- sense sounds in his music (Howlett in Beatles Live at
larity, including the group's visit to the Ed Sullivan the BBC; Schaffner, 151), that the later Beatles' songs
show, and the production of A Hard Day's Night, the were not addressed to girls, and that the later songs
first Beatles film); 1965-1966 (world tours, the second were less personal in nature (Campbell & Murphy,
film Help.t, and the long-playing albums Revolver, and xxiv).
Rubber Soul); 1967-68 (a cessation of touring, Sgt Analyses at the level of counting individual words
Pepper's Lonely Hearts Club Band, Magical Mystery were also performed in order to compare the vocabu-
Tour, the double white album); 1969-1970 (the final laries of Lennon and McCartney, and in order to
albums Let It Be and Abbey Road, and the break-up establish a listing of words most commonly used in
of the group). It is generally agreed that the emotional the Lennon-McCartney compositions.
tone of Beatles music changed across these stages,
260
Table 1. Differences between composers and/or among stages for the measures of the study: trends, percentages or
medians, and significance levels
@ where the difference between composers was significant, Len (Lennon) or McC (McCartney) indicates which
composer had the higher percentage or median.
* "linear": indicates the presence of a significant linear trend across years for the variable under consideration; "a"
indicates that this trend was ascending, "d" that it was descending across stages. "nonlinear" indicates the presence of
a significant non-linear relationship.
262
Table 2. Words used more than fifty times in Lennon-McCartney songs that were used in significantly
different proportions by the two lyricists
stages, and words became longer. F r o m a stylometric dropped from 15% to 11% between stages one and
point of view, the difference between the earlier and two, and remained relatively stable thereafter. When
later Lennon-McCartney compositions was one differences between stages were examined, it became
characterized by depersonalization (fewer first and evident that the primary inflection point lay between
second person pronouns), and a move away from the first and second stages (eight times, when ties are
love songs to songs with a less pleasant and less excluded from consideration), followed by the second
active emotional tone which made less use of the word and third stages (five times), and finally the third and
"love." This finding agrees with the observations of fourth stages (twice).
Davies (262) and Campbell & Murphy (xxiv). As indi- The emotional inflection point for Beatles' lyrics
cated by the rise in repetition and in the category lies between stages one and two, where pleasantness,
of nonsense syllables and whoops, later songs were cheerfulness, and activation all drop, or between stages
frequently embellished with repetitive nonsense three and four where nastiness and softness rise, but
refrains: two examples o f this embellishment are found not between stages two and three. Given the values
in McCartney's Hey Jude and Lennon's I Am the of the differences between stages, and the significance
Walrus. of linear trends, changes in Beatles' music may be
Although many trends across stages were linear, deemed to have taken place throughout the group's
it was possible, by consulting the data in Table 1, to nine active years, or across all four stages, with most
identify inflection points (points of greatest change) changes taking place between 1962-1964 and 1965-
for several measures. For example, the percentage of 1966, and fewest changes characterizing the compar-
pleasant words in Lennon-McCartney compositions ison of 1967-1968 with 1969-1970.
263
admiring
A word-by-word comparison of the songs of Paul
McCartney to those of John Lennon resulted in
3.0 fri~f1" "~~rful
lists of a several thousand words for each composer
(14,491 tokens with 1338 types for Lennon and 12,179
2.0
peaceful// "~brave
tokens with 1370 types for McCartney). When a chi 1.0
squared for goodness of fit was employed to compare
~~larmed
the proportional usage of individual words by each 0
purpose. Once the bona tides of the technique have Norman, Philip. Shout.t The Beatles in Their Generation. New York:
been established (once its reliability and validity have Simon & Schuster, 1981.
Osgood, Charles E. "On the Why's and Wherefore's of E, P, and
been demonstrated), emotional stylometry may be used
A.'" Journal of Personality and Social Psychology 12 (1969),
to investigate texts even in the absence of critical com- 194--199.
mentary. Russell, J. A. "Evidence of Convergent Validity of Affective Mean-
ing Systems." Journal of Personality and Social Psychology 36
(1978), 1152-1168.
Sams, Eric. "Edmund Ironside and 'Stylometry'." Notes and Queries
References 4 (1994), 469-472.
Schaffner, Nicholas. The Beatles Forever. New York: McGraw-Hill,
Anastasi, Anne. Psychological Testing. 6th Edition. New York: 1977.
Macmillan, 1988. Shakespeare Newsletter 34 (1984), 4-5.
Anderson, Cliff W. and M. McMaster. "Modeling Emotional Siegel, Sidney. Nonparametric Statistics for the Behavioral
Tone in Stories Using Tension Levels and Categorical States." Sciences. New York: McGraw-Hill, 1956.
Computers and the Humanities 20 (1986), 3-9. Smith, M. W. A. "Sir Thomas Moore, Pericles, and Stylometry."
The Beatles Live at the BBC. (Compact Disc and Booklet), Parlo- Notes and Queries 4 (1994), 55-58.
phone, 1994. Whissell, Cynthia. "Pleasure and Activation Revisited: Dimensions
Bestgen, Yves. "Can Emotional Valence in Stories be Determined Underlying Semantic Responses to Fifty Randomly Selected
from Words?" Cognition and Emotion 8 (1994), 2t-36. 'Emotional' Words." Perceptual and Motor Skills 53 (1981),
Campbell, Colin and A. Murphy. Things We Said Today: The 871-874.
Complete Lyrics and Concordance to the Beatles' Songs 1962- Whissell, Cynthia. "A Computer Program for the Objective Analysis
1970. Ann Arbour, MI: Pierian Press, 1980. of Style and Emotional Connotations of Prose: Hemingway,
Compton, Todd. "McCartney or Lennon?: Beatle Myths and the Galsworthy, and Faulkner Compared." Perceptual and Motor
Composing of the Lennon-McCartney Songs." Journal of Skills 79 (1994), 815-824.
Popular Culture 22.2 (1988), 99-131. Whissell, Cynthia. "Objective Analysis of Text: I. A Comparison of
Davies, Hunter. The Beatles. Revised. New York: McGraw-Hill, Adventure and Romance Novels." Perceptual and Motor Skills
1978. 79 (1994), 1567-1570.
Hogenraad, Robert, Yves Bestgen and Jean-Franqois Durieux. Whissell, Cynthia, Michael Fournier, Ren6 Pelland, Deborah Weir
"Psychology as Literature." Genetic, Social and General and K. Makarec. "A Dictionary of Affect in Language: IV.
Psychology Monographs 118 (1992), 455-478. Reliability, Validity, and Applications." Perceptual and Motor
Merriam, Tom. "Morton vs Smith: An Objective Analysis." Shake- Skills 62 (1986), 875-888.
speare Newsletter 34 (1984), 5.