You are on page 1of 6

Neural Networks 32 (2012) 159–164

Contents lists available at SciVerse ScienceDirect

Neural Networks
journal homepage: www.elsevier.com/locate/neunet

2012 Special Issue

Assessing cognitive alignment in different types of dialog by means of a


network model✩
Alexander Mehler a,∗ , Andy Lücking a , Peter Menke b
a
Goethe-University Frankfurt am Main, Text Technology Group, Germany
b
Bielefeld University, CRC 673 ‘‘Alignment in Communication’’, Germany

article info abstract


Keywords: We present a network model of dialog lexica, called TiTAN (Two-layer Time-Aligned Network) series. TiTAN
Alignment
series capture the formation and structure of dialog lexica in terms of serialized graph representations. The
Task-oriented dialog
dynamic update of TiTAN series is driven by the dialog-inherent timing of turn-taking. The model provides
Spontaneous dialog
Networks a link between neural, connectionist underpinnings of dialog lexica on the one hand and observable
Similarity symbolic behavior on the other. On the neural side, priming and spreading activation are modeled in
terms of TiTAN networking. On the symbolic side, TiTAN series account for cognitive alignment in terms of
the structural coupling of the linguistic representations of dialog partners. This structural stance allows us
to apply TiTAN in machine learning of data of dialogical alignment. In previous studies, it has been shown
that aligned dialogs can be distinguished from non-aligned ones by means of TiTAN -based modeling. Now,
we simultaneously apply this model to two types of dialog: task-oriented, experimentally controlled
dialogs on the one hand and more spontaneous, direction giving dialogs on the other. We ask whether
it is possible to separate aligned dialogs from non-aligned ones in a type-crossing way. Starting from a
recent experiment (Mehler, Lücking, & Menke, 2011a), we show that such a type-crossing classification
is indeed possible. This hints at a structural fingerprint left by alignment in networks of linguistic items
that are routinely co-activated during conversation.
© 2012 Elsevier Ltd. All rights reserved.

1. Background: alignment, neural representations, and dialog Rather, being an idiomatic phrase, learn by heart is mentally treated
types as a single item (Jackendoff, 2002, pp. 65–67), which is in toto as-
sociated with a meaning. Thus, the meaning of a formulaic expres-
In spite of language’s often stressed creativity, human conver- sion does not need to be constructed online in a conversation, but
sations are surprisingly formulaic: speakers use a lot of fixed ex- is lexically stored (Jackendoff, 2002, Chap. 6). Being lexicalized, for-
pressions like idioms (Čermák, 2007), turn-taking management mulas are at immediate disposal for production (Kuiper, 1996). For
of interlocutors draws on adjacency pairs (Sacks, Schegloff, & Jef- this reason, such ‘‘prefabricated components’’ of language provide
ferson, 1974), proverbs (Norrick, 2007) are used to express com- a short circuit between production and comprehension. Due to this
monplaces, amongst others, (frozen) metaphors convey similes special status, the use of formulaic expressions is comparatively
(Kövecses, 2002), and institutionalized speech provides com- resource-free. The usage of lexicalized phrases allows speakers to
municative ‘‘ready-mades’’ for official communication situations reduce memory loads and is especially tailored to routine conver-
(Miller, 1994) (see also Bazzanella, 1996; Tannen, 2007). The differ- sation tasks. As a result, formulaic speech is highly fluent and in-
ent sorts of formulaic speech have in common that they are learned terpretable without effort (Kuiper, 2000).
and represented holistically. This is because the meaning, for in- However, speakers are not confined to take only the lexicalized
stance, of an idiom like learn by heart is not composed of the mean- phrasal expressions offered by their native language. Rather, they
ings of the words involved in terms of compositional semantics. interactively create their own, mutually shared routines within
their conversations by automatically aligning their linguistic
representations. Recently, the diverse findings on alignment in
✩ Support for this work by the German Federal Ministry of Education and
dialog on the phonetic/phonological (Giles & Powesland, 1975),
Research (Linguistic Networks project) and the German Research Foundation (CRC the lexical (Clark & Wilkes-Gibbs, 1986), the syntactic (Branigan,
673 Alignment in Communication) is gratefully acknowledged.
∗ Corresponding author. Pickering, & Cleland, 2000), and the semantic (Garrod & Anderson,
E-mail addresses: mehler@em.uni-frankfurt.de (A. Mehler),
1987) level have been amalgamated by Pickering and Garrod
luecking@em.uni-frankfurt.de (A. Lücking), peter.menke@uni-bielefeld.de (2004) into the Interactive Alignment Model (IAM). One central
(P. Menke). claim of the IAM is that dyads (i.e., pairs of dialog partners) build
0893-6080/$ – see front matter © 2012 Elsevier Ltd. All rights reserved.
doi:10.1016/j.neunet.2012.02.013
160 A. Mehler et al. / Neural Networks 32 (2012) 159–164

Furthermore, a difference in the degree of alignment has been


reported between instances of different types of dialog, namely
between task-oriented and more spontaneous dialogs (Reitter,
Keller, & Moore, 2006; Reitter, Moore, & Keller, 2006). These
authors investigated syntactic priming, that is, speakers’ tendency
to repeat previously used syntactic patterns, by means of text
data from the Switchboard (Godfrey, Holliman, & McDaniel, 1992)
(spontaneous) and the MapTask (Anderson et al., 1991) (task-
oriented) corpus. They found that both kinds of dialog show
nearly the same degree of within-speaker priming while between-
speaker priming was stronger in the task-oriented corpus than in
the spontaneous one.
While these studies do not employ a control condition, Healey,
Fig. 1. From priming to alignment. A linguistic form /a/ activates its representation Howes, and Purver (2010) and Howes, Healey, and Purver (2010)
a and co-activates representation b connected to a. Both sorts of priming occur tested for syntactic priming against a random baseline. The authors
in the speaker and the hearer, leading to activated representations shared among
used dialog data from the Diachronic Corpus of Present-Day Spoken
interlocutors. If the same activation patterns are triggered repeatedly, the synaptic
connections involved become routinized. English (DCPSE). Repetitions of syntactic patterns were compared
between original dialogs and ‘‘fake’’ dialogs that were created by
up a set of expressions specific to their dialog: a ‘‘dialog lexicon’’. interleaving turns from the whole corpus. The authors found that
(Pickering & Garrod, 2004, p. 184). The dialog lexicon contains ‘‘people’s level of structural (syntactic) matching is no different
routinized expressions which are repeatedly used within a dyad from chance’’. (Healey et al., 2010, p. 3). The asymmetry of the
extent of alignment as a result of switching the dialog type could
with a fixed function or meaning. ‘‘Such routinized expressions
not be replicated either (Howes et al., 2010, p. 2006). A related
are similar to stock phrases and idioms [14], except that they
result has been reported by Hadelich, Pickering, Branigan, and
only ‘‘live’’ for the particular interaction. Routinization greatly
Crocker (2004) concerning alignment on the lexical level. They
simplifies the production process [15] and gets around problems
concluded that ‘‘subsequent descriptions in the verbal feedback
of ambiguity resolution in comprehension’’. (Garrod & Pickering,
conditions do not necessarily have to be driven by an automatic
2004, p. 10) ([14] is a reference to Jackendoff, 2002 and [15] to tendency to align on a name, but could also show effects of this
Kuiper, 1996). Thus, expressions of the dialog lexicon that are collaboration’’. (Hadelich et al., 2004, pp. 39–40).
routinely used, have a cognitive status analogous to formulas. These findings pose the question whether assessments on
According to contemporary neuroscience, lexicalized items are alignment can be generalized from one type of dialog to another.
cortically represented as distributed neuron webs (Pulvermüller, More specifically, they question whether alignment, at least on
2002, Chap. 4). Connections between items in memory are in- the lexical and syntactic level, plays a role in dialog as important
stantiated by spreading activation (Jackendoff, 2002, p. 65). The as claimed by the IAM. From this perspective, lexical or syntactic
cortical realization of linguistic representations allows for ground- alignment may be seen as artifacts of the operative dialog type,
ing the alignment process physically in terms of priming. Here, rather than being affected by neural processes of routinization.
two notions of priming have to be distinguished, namely ac- In other words, alignment could be a result of some function of
tivation and co-activation: a linguistic item /a/ immediately the type of dialog (e.g., ‘‘collaboration’’), rather than being a result
activates its corresponding mental representation a within the of common processes of spreading activation and their neural
neuron web of a speaker; a linguistic item /a/ co-activates a rep- reinforcement.
resentation b that is connected to a mediated by the activation of In order to make progress in deciding on this issue, a methodic
a. To illustrate priming and the activation/co-activation distinction improvement has to be made. As we argued elsewhere (Mehler,
in terms of a linguistic example: the word form /bird/ activates Lücking, & Weiß, 2010), simply counting repetitions of items in a
the lexical item bird. Additionally, the representations ostrich and dialog misses the structural aspect of linking between these items.
worm may be co-activated. Ostrich is connected to bird semanti- Accordingly, in Mehler et al. (2010) the TiTAN model for assessing
cally, since ostriches are a special kind of bird (viz. ratite). The form alignment in dialog has been developed that goes beyond counting
/bird/ primes worm associatively, because it is the early bird that repeated items by accounting for the co-activation of associated
gets the worm. Fig. 1 illustrates these matters. Since under nor- items and the formation of routines over time. In previous machine
learning studies, TiTAN modeling has been successfully applied
mal conditions occurrences of word forms are perceived by the
to task-oriented data of the Jigsaw Map Game Corpus (Weiß,
speaker (via self-monitoring) and by the hearer, priming processes
Pfeiffer, Schaffranietz, & Rickheit, 2008) and on more spontaneous,
triggered by these occurrences happen within both interlocutors.
direction giving dialogs from the SaGA corpus (Lücking, Bergmann,
From this perspective, routines can be understood as synaptic con-
Hahn, Kopp, & Rieser, 2010). In both cases, dialogs that manifest
nections that ‘‘emerge’’ as stable sub-configurations within neu- alignment could be separated from less or non-aligned ones
ron nets. In this sense, reinforcement of connections in a neural net (Mehler et al., 2011a; Mehler, Lücking, & Menke, 2011b). In this
provides an explanans of long-term memorizing of dialog routines. article, we apply the TiTAN model in an experiment on classifying
Note that there are additional sources of alignment (Garrod & dialogs of different type. That is, we investigate whether the class
Pickering, 2007, Section 26.2). Higher-level representations as, for of aligned dialogs can be identified across different types of dialog.
example, partner models and processes of explicit negotiation also A successful classification can be seen as a hint at a structural
play a role in aligning mental representations. However, priming fingerprint left by alignment, independent of the underlying type
as a neural motor (of activation and co-activation) is a key factor. of dialog. Conversely, in the case that our experiment replicates
Despite its rather automatic mechanism, alignment of repre- the afore-mentioned findings even by means of the improved TiTAN
sentations is a matter of degree. The range of both extremes (full method, it would give us one more reason to rethink the status of
alignment versus no alignment) spans a continuum depending on alignment in dialog.
how ‘‘many aspects of the representation[s] will be shared’’ (Pick- The article is organized as follows. In Section 2, we recapitulate
ering & Garrod, 2004, p. 172). This scalar nature makes the degree the TiTAN model. In Section 3, we describe experimental dialog data
of dialogical alignment accessible to empirical measurements. In that we use for our type-crossing classification. The classification
the past, there have been such measurements on the level of lexi- experiment is described and discussed in Section 4. Finally,
cal (Church, 2000) and syntactic priming (Branigan et al., 2000). Section 5 gives a conclusion and a prospect on future work.
A. Mehler et al. / Neural Networks 32 (2012) 159–164 161

2. Two-layer Time-Aligned Network (TiTAN) series

In this section, we describe the TiTAN (Two-layer Time-Aligned


Network) model. It was introduced in Mehler et al. (2010) as a
model of lexical alignment in dyadic conversations and empirically
tested using experimentally controlled and more spontaneous alignment channel
dialogs (Mehler et al., 2011a, 2011b). A TiTAN is a bipartite graph
GAB that models the dialog lexicon of a dyadic conversation. GAB
is partitioned into the sub-graphs A and B that capture the sub-
lexica of the interlocutors involved. Networking within TiTAN is
done turn-wise. This results in a TiTAN series where at each time
point t a graph is emitted that models the operative lexicon at
time t. Formally speaking, a TiTAN is a graph GAB = (V , E ) whose
vertex set V is partitioned into non-empty disjunct subsets VA and
VB that collect the words used by interlocutor A and B, respectively.
Further, the set of edges E is sub-divided into three non-empty
disjunct subsets EAB , EA , EB so that all edges {v, w} ∈ EAB end at Fig. 2. TiTAN series. The initially unlinked lexica of interlocutors A and B
vertices v ∈ A, w ∈ B, while all other edges {x, y} ∈ EX end at are networked turn-wise. The terminal dialog lexicon is linked by intra- and
interpersonal edges. Interpersonal links represent the ‘‘alignment channel’’.
vertices x, y ∈ VX , X ∈ {A, B}. EA and EB capture intrapersonal
priming relations, while EAB models relations of interpersonal
I (X ; Y )
priming. GX = (VX , EX ), X ∈ {A, B}, is called the X -layer of GAB = D(X , Y ) = 1 − ∈ [0, 1]. (1)
max(H (X ), H (Y ))
(V , E ) and is denoted by the projection function πX (GAB ) = GX .
In order to account for priming as a gradual process, TiTAN s are The measure follows the idea that if X and Y are statistically
further developed as weighted labeled graphs Lt = (V , Et , νt , LV ), independent, then X and Y are maximally distant. Utilizing this
where t ∈ N is a time stamp indicating the dialog turn at which scheme, Mehler et al. (2010) developed a measure of the similarity
Lt was created. The vertices v ∈ V are labeled by the surjective of sub-lexica of dialog lexica, that is, a graph similarity measure.
The TiTAN model gives an intuitive account of extreme values
function L : V → LV . In the case of lexical alignment, LV consists
of alignment in communication. The one extreme of complete
of lemmas, but in principle any kind of linguistic items can be
alignment is given by a conversation in which both interlocutors
considered. Thus, πA (Lt ) = LAt = (VA , EAt , νAt , LA ) and πB (Lt ) =
(unrealistically) use the same words always in the same way so
LBt = (VB , EBt , νBt , LB ) are the sub-lexica of interlocutor A and B,
that their dialog lexica are finally identically structured around
respectively. Networking within a TiTAN is done according to the the same vocabulary. If, conversely, the interlocutors always use
following construction rules Mehler et al. (2010, p. 1453): different words or the same words always in different ways, their
dialog lexica get maximally dissimilar. The technical details of this
• Intrapersonal links: if at time t, agent X ∈ {A, B} uses a model are given in (Mehler et al., 2010, Section 4.2) together with
form belonging to lemma l to express the current turn’s topic a range of random models for which it is shown that they do not
T = T (t ), we generate intrapersonal links between vX ∈ VX , account for the structure of dialog lexica as captured by the TiTAN
LX (vX ) = l, and all other vertices labeled by items that X has model. The interested reader is asked to refer to this publication for
used in the same round or in the preceding round on the a formal account of TiTAN series, a range of their random baselines
same topic T . If these links already exist, their weights are and their exemplary build-up.
incremented by 1. A round is defined as a series of consecutive As we deal with a type-crossing classification of dialogs we
turns on the same topic. need to take care not to characterize dialog lexica according to the
• Interpersonal links: if at time t, agent X ∈ {A, B} uses l ∈ Vt distinction of their underlying dialog types. More specifically, we
to express topic T = T (t ) that has been expressed by Y ̸= X deal with two types of dialog: strictly task-oriented, experimen-
in the same or preceding round on the same topic by the same tally controlled dialogs on the one hand and more spontaneous,
item, we span an interpersonal link e = {vA , wB } ∈ Et between direction giving dialogs on the other. Obviously, these dialogs can
be distinguished by the experimental setting that underlies the
vA ∈ VA and wB ∈ VB for which LA (vA ) = LB (wB ) = l, given
task-oriented dialogs, which guarantees that, other than in the case
that e does not already exist. Otherwise, its weight νt (e) is
of more spontaneous dialogs, turns are more evenly distributed
incremented by 1.
among interlocutors. Having this in mind, we depart from Mehler
These construction rules implement priming as a short-term et al. (2011a) by disregarding all network indices that focus on sub-
effect. However, this procedure can easily be adapted to account for lexica of single interlocutors in isolation. Further, we also disregard
the graph similarity measure of the TiTAN model as it assumes a
more delayed priming effects. Fig. 2 gives a visualization of a TiTAN
special position of reference vertices that holds dialog-wide—such
series. Initially, the lexica of both interlocutors are completely
a special position cannot be assumed for more spontaneous dialogs
disconnected. The update mechanism defined by the construction
whose turn-taking and topic selection are not controlled experi-
rules connects these lexica turn-wise by intra- and interpersonal
mentally. Therefore, we recalculated the quantitative profiles of all
edges so that the resulting TiTAN LAB maps the patterns of short- dialogs of both types of dialog anew. The underlying dialog data of
term lexical priming as manifested by the underlying dialog. this calculation is described in the next section.
TiTAN comes with an original measure of the degree of
structural coupling of the dialog lexica of interlocutors. The 3. Data: quantitative comparison of both corpora
measure is derived from a distance measure D(X ; Y ) of Kraskov
and Grassberger (2008) that is based on the mutual information This section summarizes and compares quantitative character-
I (X ; Y ) of two random variables X and Y : istics of both corpora under consideration: (i) the Jigsaw Map Game
162 A. Mehler et al. / Neural Networks 32 (2012) 159–164

Table 1 Table 2
Quantitative characteristics of the JMG corpus and the SaGA corpus. Summary of the results of QNA reported in (Mehler et al., 2011a, 2011b). The last
column denotes the number of features output by the genetic search.
Level JMG corpus SaGA corpus
No. Corpus Procedure F -score Selection
Dialogs 64 25
Turns/utterances 28,380 6413 1 JMG QNA[Mahalanobis, complete] 0.92284 47/103
Words 93,120 45,048 2 JMG Random baseline known-partition 0.75073
Length (s) 821.2 719.4 3 JMG Random baseline equi-partition 0.61788
Size (turns) 441.4 249.6 4 SaGA QNA[Mahalanobis, complete] 0.96057 16/50
Size (words) 1418.8 1801.9 5 SaGA Random baseline known-partition 0.58668
Turns/min. 32.3 20.8 6 SaGA Random baseline equi-partition 0.58583
Words/min. 103.7 150.3

Table 3
(JMG) corpus as a collection of strictly task-oriented, experimen- Summary of the results of re-performing QNA separately for the JMG and SaGA data
tally controlled dialogs (Weiß et al., 2008); (ii) the Speech and Ges- by using a subset of 60 topological indices.
ture Alignment (SaGA) corpus as a collection of more spontaneous, No. Corpus Procedure F -score Selection
direction giving dialogs (Lücking et al., 2010). Both corpora have 1 JMG QNA[Mahalanobis, complete] 0.86675 26/60
been collected and assembled by the Collaborative Research Centre 2 SaGA QNA[Mahalanobis, complete] 0.91651 22/60
673 Alignment in Communication (www.sfb673.org). Table 1 shows
characteristics for both corpora. The JMG corpus consists of 64 di-
alogs that have been transcribed on the level of utterances and do not. Generally speaking, QNA learns classes of networks by
words. In addition, events and event phases of the underlying game their distinctive topological features. Starting from a classification
have been annotated as well as morphological analyses and repairs of a set of networks, QNA seeks structural features that best
of critical words. The SaGA corpus contains 25 dialogs. In addition distinguish these networks according to this classification. This
to the transcription of turns, topics and words, a detailed anno- involves three steps of modeling (Mehler et al., 2011a): (i) Graph
tation of gestures has been done. The concepts of turns and utter- modeling: firstly, each target graph is represented by a vector of
ances refer to the same structure, namely, to contiguous sequences topological indices that model its structure. Here, these indices
of words (and word-like elements like hesitations and turn keeping operate on the final state networks of the TiTAN series, that is, on
signals). networks spanned for completed dialogs. (ii) Feature selection: a
Average values for several measures of dialog size are summa- genetic search is performed to find those topological features that
rized in Table 1. While the average lengths of dialogs in both cor- best separate the target classes. This search utilizes the F -measure
pora are rather similar, both corpora diverge in the number of turns as the harmonic mean of precision and recall. (iii) Classification:
per dialog as well as in the number of words per dialog: on aver- based on the selected feature vectors, a hierarchical agglomerative
age, the SaGA corpus contains fewer turns and more words than clustering is performed together with a subsequent partitioning.
the JMG corpus. There is an obvious explanation for this difference We use complete linkage and the Mahalanobis distance to perform
between the two types of dialog: in the JMG setting, participants this step. For more details on QNA see Mehler et al. (2010).
switch roles (instructor and constructor) on a regular and frequent In the present study, we considered 60 topological indices per
basis. In the SaGA setting, one person has the role of description TiTAN to model the topological characteristics of the underlying
giver throughout the dialog, while the other participant does not dialog. This includes, amongst others, graph entropy measures
necessarily have to produce turns on a regular basis—a common (Dehmer, 2008), graph centrality measures and the spectrum of
strategy in these dialogs was for the description giver to elaborate measures studied in the theory of complex networks (Newman,
on landmarks and objects in lengthy turns, with one or few con- 2003). The topological characteristics of TiTAN s that are captured
cluding feedback signals by the follower. There is also a difference by these indices do not linearly separate the target classes. Rather,
in the number of turns or words per minute, respectively: since the lexical alignment is a latent variable that asks for a multitude of
average dialog length is similar, there are fewer turns per minute indices to be learnt appropriately.
and more words per minute in the SaGA corpus. Using this range of 60 topological indices we start with re-
calculating the classification of aligned effective (aligned) versus
4. Classifying dialogs in a type-crossing experiment ineffective (not aligned) dialogs separately for both types of
dialog. Table 2 repeats our earlier findings for these two types
As the types of dialog corpora described in Section 3 vary of dialog from Mehler et al. (2011a, 2011b). When recalculating
tremendously in terms of their underlying experimental setting the classification using the novel set of 60 topological indices
and, thus, by their linguistic structure, one can ask for a classifier that was used to guarantee comparability among both types of
that nevertheless separates the class of aligned dialogs in both dialog (see Section 2), we get the results reported in Table 3.
corpora from their non-aligned counterparts. In the JMG corpus, Obviously, the goodness of fit of the former experiment is more
we observed 47 dialogs that manifest alignment, while 8 dialogs or less approximated by the latter; however, we also observe a
were judged to manifest no alignment. Conversely, in the SaGA decrease of the F -score. This means that by using the indices of
corpus we observed 8 successful directions in contrast to 17 the latter experiment to perform the type-crossing classification,
incorrect ones. Based on that, we build a type-crossing corpus it is somehow handicapped compared to its non-type-crossing
of 80 dialogs out of which 55 dialogs manifest alignment (or counterpart performed in Mehler et al. (2011a, 2011b).
effective communication), while 25 dialogs manifest no alignment Table 4 shows the results of the final type-crossing classifica-
(or ineffective communication). After having represented each of tion of effective versus ineffective dialogs of both types of dialog
these dialogs by a separate TiTAN series, we seek a classifier that considered here. As the best performing type-crossing classifica-
learns this separation from the TiTAN models of the corresponding tion performs with an F -score of around 83%, we can speak of a
dialog lexica. This is done with the help of Quantitative Network successful classification. Note that the second and third best per-
Analysis, which implements a semi-supervised classifier that is forming classifications result in similar F -scores. Finally, if we com-
informed about the number of target classes—in our case, these pute the corresponding random baselines (one of which assumes
are two classes: dialogs manifesting alignment and dialogs that an equipartition of the target classes, while the other is informed
A. Mehler et al. / Neural Networks 32 (2012) 159–164 163

Table 4
Summary of the results of performing QNA in a type-crossing manner for the JMG and SaGA data by
using a subset of 60 topological indices.
No. Corpus Procedure F -score Selection

1 JMG ∪ SaGA QNA[Mahalanobis, complete] 0.84174 14/60


2 JMG ∪ SaGA QNA[Mahalanobis, complete] 0.83887 23/60
3 JMG ∪ SaGA QNA[Mahalanobis, complete] 0.83171 21/60
4 JMG ∪ SaGA Average over non-random results 0.83744 19.3̄/60
5 JMG ∪ SaGA Random baseline known-partition 0.57068
6 JMG ∪ SaGA Random baseline equi-partition 0.5601

about the exact cardinality of these classes), we see that both base- representations of communication units as input to machine
lines are outperformed by our classification. Note also that the dis- learning. Our starting point was the Interactive Alignment Model
tance between the baselines and the target classification is even (IAM) of Pickering and Garrod (2004). By focusing on priming as
higher than in the case of the single-type classification based on a neural motor of short-term alignment, we related the process of
the JMG corpus reported in Table 2 (the opposite is true for the lexical alignment to its neural counterpart in terms of spreading
SaGA data). From this perspective, we can hardly state that our activation and reinforcement learning. In order to show that this
type-crossing classification was not successful. Rather, we get first alignment model is also operative in a type-crossing way, we
classificatory evidence that it is possible to separate aligned dialogs investigated two corpora that collect dialogs of two different
from non-aligned ones in a type-crossing way. This hints at a struc- types. The reason for this was to shed light on the objection that
tural, though weak fingerprint left by alignment in networks of lin- alignment is an artifact of the dialogical setting rather than a
guistic items that are routinely co-activated during conversation. general characteristic of human communication. Thus, we focused
on the question whether it is possible to separate aligned dialogs
4.1. Discussion from non-aligned ones in a type-crossing way. For this task,
we built a corpus that collects effective and ineffective dialogs
Our study provides a dialog type-crossing classification of align- of different provenance, whose status of manifesting alignment
ment in communication. It does so in a purely structural way by or not is conditioned by completely different experimental
exploring networks of syntagmatic associations in dialog lexica. settings. Even starting from such a heterogeneous corpus, the
Except for transcriptional work, this procedure is completely au-
classification that we performed by means of the TiTAN model
tomatic. The results provide evidence for alignment being a gen-
still outperformed its corresponding baseline scenarios. This is
eral (cognitive) effect, rather than just being a function of dialog
a first hint at a structural fingerprint that might be left behind
types—cf. the argument in Section 1. Thus, until any future falsifica-
by alignment. In future work, we plan to consider additional
tion, we can claim that TiTAN series provide a representational for-
dialogical data. However, as such data is hardly accessible due to
mat for the effective classification of alignment in communication.
the enormous effort of manual annotations, we plan to additionally
However, we need to take care not to overestimate this finding.
Generally speaking, there are two reference points of assessing explore discussions in Wikipedia since this data is also dialog-like
TiTAN series, a methodical and an empirical one, where the focus is (by including turn-taking, adjacency pairs and multiple threads
on the former: (i) Supposing that alignment is effective in a dialog of discussion). Finally, we plan to perform experiments that
type-crossing way, we show that the TiTAN approach is not falsified investigate the exact nature of the structural fingerprints of
in detecting it. (ii) Along this line of reasoning, we may even state cognitive alignment.
that until any future falsification, we can retain the hypothesis that
the traces left behind by alignment are not restricted to a certain References
class of dialogs. However, we do not yet know whether there are
independent reasons for the success of our classification that are Anderson, A. H., Bader, M., Bard, E. G., Boyle, E., Doherty, G., Garrod, S., et al. (1991).
The HCRC Map Task corpus. Language and Speech, 34(4), 351–366.
independent of what is called alignment in the IAM. In other words,
Bazzanella, C. (Ed.) (1996). Repetition in dialogue. Tübingen: Niemeyer.
we do not give an explanation of the underlying cognitive pro- Branigan, H. P., Pickering, M. J., & Cleland, A. A. (2000). Syntactic co-ordination in
cesses but only hint at a certain separability of their semiotic out- dialogue. Cognition, 25, B13–B25.
put. Further, as this study focuses on lexical alignment it leaves out Čermák, F. (2007). Idioms and morphology. In H. Burger, D. Dobrovolskij, P. Kühn,
& N. R. Norrick (Eds.), Phraseology. An international handbook of contemporary
other levels of linguistic resolution that may also provide evidence research (pp. 20–26). Berlin, New York: Walter de Gruyter.
of alignment, say, on the phrasal or, more generally, syntactic level. Church, K. W. (2000). Empirical estimates of adaptation: the chance of two noriegas
Following Section 1, we expect that phrasal, formulaic expressions is closer to p/2 than p2 . In Proceedings of coling 2000 (pp. 180–186) Saarbrücken.
tend to be dialog-type specific (Tannen, 2007, cf.). Thus, we expect Clark, H. H., & Wilkes-Gibbs, D. (1986). Referring as a collaborative process.
Cognition, 22, 1–39.
a smaller contribution by syntagmatic alignment in type-crossing Dehmer, M. (2008). Information processing in complex networks: graph entropy
scenarios. In order to further clarify the classificatory power of and information functionals. Applied Mathematics and Computation, 201, 82–94.
the TiTAN model in this area, we need to take into account larger Garrod, S., & Anderson, A. (1987). Saying what you mean in dialogue: a study in
amounts of multiresolutional data of a wider range of dialog types. conceptual and semantic co-ordination. Cognition, 27, 181–218.
Garrod, S., & Pickering, M. J. (2004). Why is conversation so easy? Trends in Cognitive
However, the availability of such data is restricted by the enormous Sciences, 8(1), 8–11.
effort of its preprocessing (resp. annotation). In any event, with the Garrod, S., & Pickering, M. J. (2007). Alignment in dialogue. In G. Gaskell (Ed.), Oxford
advent of interactive interfaces the automatic processing of dialog- handbook of psycholinguistics (pp. 443–451) Oxford, UK: Oxford University
Press, (Chapter 26).
ical data comes into the fore of computer science and related dis- Giles, H., & Powesland, P. F. (1975). Speech styles and social evaluation. London:
ciplines. Our study contributes to this kind of research. Academic Press.
Godfrey, J. J., Holliman, E. C., & McDaniel, J. (1992). SWITCHBOARD: telephone
5. Conclusion speech corpus for research and development. In Proc. of IEEE ICASSP-92. Vol. 1
(pp. 517–520).
Hadelich, K., Pickering, M. J., Branigan, H. P., & Crocker, M. W. (2004). Alignment
We presented the TiTAN model to represent dialog data as in dialogue: effects of visual versus verbal-feedback. In Proc. of catalog’04
networks of lexical items. This has been done to get graph (pp. 35–40).
164 A. Mehler et al. / Neural Networks 32 (2012) 159–164

Healey, P. G. T., Howes, C., & Purver, M. (2010). Does structural priming occur in Mehler, A., Lücking, A., & Weiß, P. (2010). A network model of interpersonal
ordinary conversation? In Proc. of conference on linguistic evidence. Tübingen, alignment in dialogue. Entropy, 12(6), 1440–1483.
Germany. Miller, G. (1994). Towards ethnographies of institutional discourse: proposals and
Howes, C., Healey, P. G. T., & Purver, M. (2010). Tracking lexical and syntactic suggestions. Journal of Contemporary Ethnography, 23(3), 280–306.
alignment in conversation. In Proc. of CogSci’10 (pp. 2004–2009). Newman, M. E. J. (2003). The structure and function of complex networks. SIAM
Jackendoff, R. (2002). Foundations of language. Brain, meaning, grammar, evolution. Review, 45, 167–256.
Oxford, UK: Oxford University Press. Norrick, N. R. (2007). Proverbs as set phrases. In H. Burger, D. Dobrovolskij, P. Kühn,
Kövecses, Z. (2002). Metaphor: a practical introduction. Cary: Oxford University & N. R. Norrick (Eds.), Phraseology. An international handbook of contemporary
Press. research (pp. 381–393). Berlin, New York: Walter de Gruyter.
Kraskov, A., & Grassberger, P. (2008). MIC: mutual information based hierarchical Pickering, M. J., & Garrod, S. (2004). Toward a mechanistic psychology of dialogue.
clustering. In F. Emmert-Streib, & M. Dehmer (Eds.), Information theory and Journal of Behavioral and Brain Science, 27(2), 169–190.
statistical learning (pp. 101–123). New York: Springer.
Pulvermüller, F. (2002). The neuroscience of language: on brain circuits of words and
Kuiper, K. (1996). Smooth talkers: the linguistic performance of auctioneers and
serial order. Cambridge, UK: Cambridge University Press.
sportscasters. Mahwah, New Jersey: Lawrence Earlbaum Associates, Inc..
Reitter, D., Keller, F., & Moore, J. D. (2006). Computational modelling of structural
Kuiper, K. (2000). On the linguistic properties of formulaic speech. Oral Tradition,
priming in dialogue. In Proc. of NAACL-Short’06. ACL (pp. 121–124).
15(2), 279–305.
Lücking, A., Bergmann, K., Hahn, F., Kopp, S., & Rieser, H. (2010). The Bielefeld Reitter, D., Moore, J. D., & Keller, F. (2006). Priming of syntactic rules in
speech and gesture alignment corpus (SaGA). In Multimodal corpora: advances task-oriented dialogue and spontaneous conversation. In Proc. of CogSci’06
in capturing, coding and analyzing multimodality. LREC 2010. Malta. Vol. 5 (pp. 685–690).
(pp. 92–98). Sacks, H., Schegloff, E. A., & Jefferson, G. (1974). A simplest systematics for the
Mehler, A., Lücking, A., & Menke, P. (2011a). From neural activation to symbolic organization of turn-taking for conversation. Language, 50(4), 696–735.
alignment: a network-based approach to the formation of dialogue lexica. In Tannen, D. (2007). Talking voices: repetition, dialogue, and imagery in conversational
Proc. of IJCNN 2011. San Jose. Vol. 8 (pp. 527–536). discourse (2nd ed.). Cambridge, UK: Cambridge University Press.
Mehler, A., Lücking, A., & Menke, P. (2011b). Modelling lexical alignment in Weiß, P., Pfeiffer, T., Schaffranietz, G., & Rickheit, G. (2008). Coordination in dialog.
spontaneous direction dialogue data by means of a lexicon network model. In In Proc. of 8th Annual Meeting of the Cognitive Science Society of Germany.
Proc. of CICLing 2011. Tokyo. Vol. 2. Saarbrücken. Vol. 4 (pp. 1–17).

You might also like