Professional Documents
Culture Documents
Audiovisual Analysis of Relations Between Laughter
Audiovisual Analysis of Relations Between Laughter
ha (85)
lax (41)
hu (55)
nasalized (66)
giggle (106)
schwa (172)
guffaw (69)
alternated (479)
breathy (128)
80 80 40
60 60 30
40 40 20
20 20 10
0 0 0 Production Vowel quality Style
Head motion
100% others
90%
80% nod
70% tilt
Head motion 60%
left/right
25 50%
40%
Occurrence rate (%)
20 30% up-down
15 20% down
10%
10 0% up
no
breathy (83)
ha (43)
schwa (80)
lax (23)
nasalized (37)
hu (26)
guffaw (18)
alternated (315)
giggle (30)
5
0
807
Upper-body motion Cheeks
100% others 100%
90% turn 90%
80% 80%
70% tilt 70%
60% left/right 60% not raised
50% 50%
40% downward 40% raised
30% upward 30%
20% 20%
10% backward 10%
0% frontward 0%
breathy (90)
ha (59)
lax (28)
hu (39)
nasalized (44)
schwa (116)
alternated (316)
giggle (37)
guffaw (36)
no
bitter (72)
depreciatory (57)
inviting (145)
funny1 (157)
funny2 (130)
dumbfounded (40)
social (135)
soften (91)
contageous (112)
funny (127)
self-conscious (152)
Production Vowel quality Style
bitter (40)
social (74)
contageous (51)
funny (61)
depreciatory (24)
self-conscious (69)
dumbfounded (25)
inviting (67)
soften (48)
funny1 (82)
funny2 (62)
closing in giggle and guffaw laughter styles. In guffaw
laughter, all laughter events were accompanied by some body
motion, from where the occurrence rate of backward motion
was relatively higher.
Regarding the vowel quality, by comparing the
distributions of “ha” and “hu”, it can be observed that in “hu” Upper-body motion
100% others
the occurrence rate of head down and body frontward motion, 90%
80% round
while in “ha”, head up motion occurs with relatively high rate. 70% tilt
60%
Regarding the production type, “breathy” and “lax” 50% left/right
40% downward
production types show higher occurrence of “no” motion for 30%
both head and body motion, compared to the “alternated” 20% upward
10%
0% backward
pattern. frontward
bitter (28)
social (52)
contageous (46)
funny (52)
depreciatory (19)
self-conscious (64)
inviting (53)
soften (38)
funny1 (53)
funny2 (56)
dumbfounded (8)
inviting (138)
funny1 (150)
funny2 (121)
soften (88)
dumbfounded (40)
social (129)
contageous (108)
funny (122)
self-conscious (142)
808
Regarding the body motion categories, it can be observed 4. Conclusions
that the occurrence rates of front, back and up-down motions
increase, as the laughter intensity increases. The results for In the present work, we analyzed audiovisual properties of
intensity level 4 shows slightly different results, but this is laughter events in face-to-face dialogue interactions.
probably because of the small number of occurrences (around Analysis results of laughter events revealed relationships
20, for 8 categories). between laughter motions (facial expressions, head and body
From the results for head motion, it can be observed that motions) and laughter type, laughter function and laughter
the occurrence rates of nods decrease, as the laughter intensity intensity. Firstly, it was found that giggle and guffaw laughing
increases. Since nods usually appear for expressing agreement, styles are almost always accompanied by smiling facial
consent or sympathy, they are thought to be easier to appear in expressions and head or body motion. Artificially produced
low intensity laughter. laughter (such as social, bitter, dumbfounded and softening
laughter) tends to be accompanied by less motion compared to
Eyelids spontaneous laughter (such as funny and contagious laughter).
100%
90% Finally, it was found that the occurrence of smiling faces
80%
70% open (Duchenne smiles) and body motion increase, and the
60%
50% narrowed occurrence of nods decrease, as the laughter intensity increases.
40%
30% closed Future works include evaluation of acoustic features for
20% automatic detection and classification of laughter events, and
10%
0%
applications to laughter motion generation in humanoid robots.
1 (379) 2 (221) 3 (103) 4 (23)
5. Acknowledgements
Cheeks
100%
90% This study was supported by JST/ERATO. We thank Mika
80% Morita, Kyoko Nakanishi and Megumi Taniguchi for
70%
60% not raised contributions in the annotations and data analyses.
50%
40% raised
30%
20%
6. References
10%
0% [1] Ishi, C., Liu, C., Ishiguro, H. and Hagita, N. (2012). “Evaluation
1 (402) 2 (229) 3 (109) 4 (24) of a formant-based speech-driven lip motion generation,” In 13th
Annual Conference of the International Speech Communication
Lip corners Association (Interspeech 2012), Portland, Oregon, pp. P1a.04,
100%
90% September, 2012.
80% [2] C.T. Ishi, C. Liu, H. Ishiguro, and N. Hagita. “Head motion
70% lowered during dialogue speech and nod timing control in humanoid
60%
50% straight robots,” Proc. of 5th ACM/IEEE International Conference on
40%
30% raised Human-Robot Interaction (HRI 2010), pp. 293-300, 2010.
20% [3] C. Liu, C. Ishi, H. Ishiguro, and N. Hagita. Generation of
10% nodding, head tilting and gazing for human-robot speech
0%
interaction. International Journal of Humanoid Robotics (IJHR),
1 (377) 2 (219) 3 (107) 4 (24)
vol. 10, no. 1, January 2013.
Head motion no
[4] S. Kurima, C. Ishi, T. Minato, and H. Ishiguro. Online Speech-
100% Driven Head Motion Generating System and Evaluation on a
90% others Tele-Operated Robot, IEEE International Symposium on Robot
80%
70% nod and Human Interactive Communication (ROMAN 2015), pp.
60% tilt 529-534, 2015.
50%
40% left/right [5] Ishi, C., Minato, T., Ishiguro, H. (2015) "Investigation of motion
30% generation in android robots during laughing speech," Intl.
20% up-down
Workshop on Speech Robotics (IWSR 2015), Sep. 2015.
10% down
0% [6] Devillers, L. & Vidrascu, L., “Positive and negative emotional
1 (401) 2 (228) 3 (105) 4 (23) up states behind the laughs in spontaneous spoken dialogs”, Proc. of
Interdisciplinary Workshop on The Phonetics of Laughter, 37-40,
Body motion no 2007.
100%
90% others [7] Szameitat, D. P., Darwin, C. J., Szameitat, A. J., Wildgruber, D.,
80% & Alter, K. Formant characteristics of human laughter. J Voice,
70% turn
60%
25, 32-37, 2011.
50%
tilt [8] Campbell, N., “Whom we laugh with affects how we laugh”,
40% left/right Proc. of Interdisciplinary Workshop on The Phonetics of
30%
20% up-down Laughter, 61-65, 2007.
10% [9] Tanaka, H. & Campbell, N., “Acoustic features of four types of
0% back
laughter in natural conversational speech”, Proc. of ICPhS XVII,
1 (362) 2 (183) 3 (93) 4 (19) front
1958-1961, 2011.
[10] Ishi, C., Hatano, H., Hagita, N. (2014) "Analysis of laughter
Figure 4. Distributions of eyelid, cheek, lip corner,
events in real science classes by using multiple environment
head and body motion categories, for different sensor data," Proc. of 15th Annual Conference of the
categories of laughter intensity (1 to 4). The total International Speech Communication Association (Interspeech
number of utterances is shown within brackets. 2014), pp. 1043-1047, Sep. 2014.
809
[11] H. Yehia, T. Kuratate, and E. Vatikiotis-Bateson, Using speech
acoustics to drive facial motion, Proc. of the 14th International
Congress of Phonetic Sciences (ICPhS99), 1, pp. 631-634, 1999.
[12] R. Niewiadomski, M. Mancini, Y. Ding, C. Pelachaud, and G.
Volpe (2014). “Rhythmic Body Movements of Laughter,” In
Proc. of the 16th International Conference on Multimodal
Interaction (ICMI '14). ACM, New York, NY, USA, 299-306.
[13] Niewiadomski, R, Ding, Y., Mancini, M., Pelachaud, C., Volpe,
G., Camurri, A. “Perception of intensity incongruence in
synthesized multimodal expressions of laughter,” The sixth
International Conference on Affective Computing and Intelligent
Interaction (ACII2015), 2015
[14] P. Ekman, R.J. Davidson, W.V. Friesen. The Duchenne smile:
Emotional expression and brain physiology II. Journal of
Personality and Social Psychology, Vol. 58(2), 342-353, 1990.
810