Professional Documents
Culture Documents
Sai 2005
Sai 2005
Increasing evidence suggests that newborn infants can rapidly learn about faces.
Sai (1990), Bushnell et al. (1989), Field et al. (1984) and Pascalis et al. (1995) found
statistically reliable preference for the mother’s face, as opposed to a female
Copyright # 2004 John Wiley & Sons, Ltd. Inf. Child Dev. 14: 29–50 (2005)
The Role of the Mother’s Voice 31
may not only play a role in determining the mother’s face recognition, but is
likely to lay ground to early cross-modal learning.
Research has now clearly revealed many interesting intermodal capabilities
in infancy (for review see, Bushnell and Boudreau, 1991, 1993; Lewkowicz
and Lickliter, 1995; Rose and Ruff, 1987). Some of the intermodal relations
are amodal and redundant across the senses. For example, the synchrony
of voice and mouth when a person speaks. Other cues are modality-specific and
arbitrary-related across the senses, such as the pairing of a face and the sound of
the voice.
From birth, newborn infants perceive a wide range of invariant amodal
relations. For example, they reliably co-ordinate visual and auditory space, in
that they turn their heads and eyes in the direction of a sound source, suggesting
that knowledge about spatial location is provided by both visual and auditory
information (e.g. Wertheimer, 1961; Muir and Field, 1979; Clifton et al., 1981;
Butterworth, 1983; Muir and Clifton, 1985). At 5 weeks, infants can recognize a
visual shape (a pacifier) that had been experienced tactually by sucking,
suggesting that the pacifier is coded both tactually and visually (Meltzoff and
Borton, 1979). Twenty week old infants show a sensitivity to temporal synchrony
specified intermodally in that they detect the common rhythm and duration of
tones and flashing lights (Lewkowicz, 1986). Around this age, infants can also
match appropriately the sounds made either by a single unitary element or by a
cluster of smaller elements (Bahrick, 1987, 1988). Together, these findings indicate
that infants from birth perceive amodal relations.
On the other hand, recent evidence suggests that arbitrary intermodal
relations are detected and learned in infancy too. Two-day-old infants can learn
arbitrary visual–auditory associations when familiarized to two alternating
visual stimuli differing in colour and orientation, and when each association
was accompanied by its appropriate sound (Slater et al., 1997). By 14 weeks,
infants were able to learn to associate the sound of their mother’s voice
with the sight of her face (Spelke and Owsley, 1979), and 24 week olds
learned the association between the colour of a container and the taste of the food
it contained (Reardon and Bushnell, 1988). Prior to 12 weeks, infants were not
able to detect the arbitrary pitch colour/shape relations (Bahrick et al., 1996).
Only infants of 28 weeks demonstrated learning of the arbitrary relations
(Bahrick, 1994). If young infants are able to learn the synchrony of voice and mouth
when the person speaks (amodal information), and can learn the pairing of a
stimulus and the sound that stimulus produces (arbitrary information), then learning
about faces would be most likely the outcome of an early intermodal processing.
It is, however, possible that there exists a specific mechanism for learning the
visual features of the mother’s face. If this were the case, then newborn infants
would still prefer their mother’s face to that of a female stranger even when the
mother’s voice is not available to the infants from birth through testing. On the
other hand, preference for the mother’s face might disappear with the
unavailability of a postnatal auditory experience to the mother’s voice. In this
case, the conclusion that would be drawn is that face preference does not solely
depend on visual processing. Instead, we would assume that face preference is
the product of intermodal learning, especially since 2-day-old newborn infants
are able to learn arbitrary visual–auditory associations (Slater et al., 1997) as
mentioned earlier. However, we do not have evidence concerning whether
infants are able to perceive the mother’s face–voice associations and later use
such information in the recognition test. The experiments described here were
designed to determine the role of the mother’s voice in facilitating preference for
Copyright # 2004 John Wiley & Sons, Ltd. Inf. Child Dev. 14: 29–50 (2005)
32 F. Z. Sai
the mother’s face at birth. The infants were, therefore, not exposed to their
mother’s voice from birth through testing.
Some clarification of our understanding of infants’ learning about faces, and
the role of the mother’s face in developing facial processing is likely to emerge
when newborn infants are tested using the same procedures as in our previous
studies (Sai, 1990; Bushnel et al., 1989), with a visual preference task and paired
rather than sequential presentations of the comparison faces. This paper
describes four studies which used non-Caucasian participants.
EXPERIMENT 1
The purpose of the present experiments was to determine the role of experience
with the mother’s voice in facilitating face recognition in the neonatal period.
However, since our previous studies (Sai and Bushnell, 1987; Bushnell et al., 1989;
Sai, 1990) demonstrating recognition of the mother’s face in the neonatal period
tested only Caucasian neonates from the area of Glasgow, Scotland (UK), and the
participants in this research were Malaysians, it was seen advisable to first
replicate these earlier findings on Malaysian neonates.
Before presenting details about the methodology and procedures followed in
these experiments, it is worth pointing out that the newborn infants were
retained on the ward at all times, in bassinets located adjacent to their mother’s
bed. The mothers had continuous responsibility for the care of their infants.
Although the total interactive contact hours involved within each mother–infant
dyad was not assessed, some provisional observations (Sai, 1990) showed that
there was a fairly substantial period of contact within the first 24 h. However,
there was a considerable variability across dyads.
Since our earlier findings had indicated no significant relationship between age
and extent of preference, age was not considered as a potential factor, and was
therefore not included. Similarly, sex factor was not incorporated for the same
reason (see Sai, 1990).
Method
Participants
Fourteen newborn infants, seven males and seven females (M ¼ 3:00 h,
range ¼ 224 h; M birth weight ¼ 2:75 kg, range ¼ 2:7022:90) participated in this
study. They all had normal Apgar scores after 5 min (M ¼ 9, range ¼ 8210), and
throughout testing they remained in the behavioural state of alert inactivity
(Ashton, 1973). All the infants in this and subsequent studies were normal,
healthy and full-term infants (37 weeks gestation) with no complications during
delivery. The infants were primarily from middle class and diverse ethnic
backgrounds of Malaysia (Malay, Chinese and Hindou). They were volunteered
by their mothers on the wards of the University Hospital, Kuala Lumpur where
they were born. The data of ten additional infants were discarded from the study:
four for side bias in their looking behaviour (see procedure section for details
regarding criteria for side bias), three for excessive fussiness, two for persistent
sleeping and one because the mother made sounds during the testing.
Copyright # 2004 John Wiley & Sons, Ltd. Inf. Child Dev. 14: 29–50 (2005)
The Role of the Mother’s Voice 33
from the same maternity ward or, another young woman. To ensure that
preference for the mother’s face was not being forced by an inadvertent bias in
terms of overall brightness of the face or the hair colour, a control over brightness
of the face and hair colour was incorporated. In Field et al.’s study (1984), hair
colour and face brightness cues were randomly varied across participants so that
sometimes the mother’s face should have been brighter and sometimes less
brighter than the stranger’s, and sometimes the hair colour should have been
lighter and sometimes darker. Yet, it could be argued that there might still have
been a systematic bias along these dimensions in favour of the mothers’ faces
since Field et al., used the same small set of comparison faces throughout the
study, and this might have resulted in the demonstrated preferences. Therefore, it
was seen advisable to match each mother on the wards with another female
stranger who was judged to be broadly comparable in terms of hair colour, hair
shape and facial complexion.
The mother and a volunteer female stranger were asked to sit behind a large
white screen (2 2.5 m) into which at head height were cut two apertures
(30 25 cm), one at either side of mid-line and separated by 12 cm. These allowed
a good view of the hair and face of the stimuli from the neonate’s position, with
lighting being provided by two fluorescent tubes above and in front of the faces.
In order to prevent any contamination of the data due to differences in the
clothing of the stimulus faces, they each had white sheet draped around their
necks to effectively mask their clothing. The visible background, a curtain
backcloth, was the same for both stimulus faces (see diagram).
Through a hole, which was made in the middle of this panel, between the two
apertures, a video camera recorded the infant’s face during the whole testing and
Copyright # 2004 John Wiley & Sons, Ltd. Inf. Child Dev. 14: 29–50 (2005)
34 F. Z. Sai
Copyright # 2004 John Wiley & Sons, Ltd. Inf. Child Dev. 14: 29–50 (2005)
The Role of the Mother’s Voice 35
infant, the total fixation time towards either face was based on the live recording
of the central observer, and was then compared to the video recording. For the
test trials of eight infants, the inter-observer reliability was high (Pearson
r ¼ 0:86).
All Malay infants ðn ¼ 12Þ heard the voice of their father’s call for prayer. It
was delivered far from their mothers to prevent any association of voice cues
with the mother’s face. In the Malay culture, this message is given to the
newborn baby soon after birth for 2–3 min to ensure that it will always be heard
and answered.
The data were expressed in terms of percentage of total looking time (PTLT)
infants spent fixating the mother’s face on each trial. Looking percentage for both
trials were averaged for each participant (Table 1). Each newborn infant gave
data for two trials, giving 28 trials for all the 14 participants. From the findings
with both trials included, 12 of the 14 infants looked longer at their mother’s face.
The total visual fixations to the mother and the female stranger accumulated,
respectively, 58.53 and 41.47%. The means preference for the mother on trials 1
and 2 were, respectively, 59.21% (S:D: ¼ 13:3) and 57.85% (S:D: ¼ 9:72). These
preferences are given in Figure 1.
To address the main research question, whether neonates showed significant
preference for the mother’s face over the stranger’s, single-sample t tests were
conducted. The results indicated that the newborn infants showed significant
preference for the mother’s face on trial 1, ðtð13Þ ¼ 2:59, p50:02), on trial 2,
(tð13Þ ¼ 3:02, p50:010) and on the combined trials (tð13Þ ¼ 3:83, p50:002).
The data replicate and extend the earlier findings that though few hours old,
Malaysian newborn infants are able to recognize their mother’s face from that of
a female stranger even when matched closely in terms of hair colour, hair length
and hair form; and when a control over olfactory and auditory cues was
implemented during the testing. However, since the control over the mother’s
Table 1. Experiment 1: Infants’ preference for the mother’s face across trials 1 and 2 and
the combined trials
Infant Sex Trial 1 (T1) Trial 2 (T2) Combined trials (COMTs)
1 M 65 50 57.5
2 M 75 70 72.5
3 M 30 48 39
4 M 50 78 64
5 M 65 70 67.5
6 M 55 59 57
7 M 60 50 55
8 F 65 61 63
9 F 85 48 66.5
10 F 45 64 54.5
11 F 60 51 55.5
12 F 65 60 62.5
13 F 50 52 51
14 F 59 49 54
Mean 59.21 57.85 58.53
S.D. 13.3 9.72 8.34
Copyright # 2004 John Wiley & Sons, Ltd. Inf. Child Dev. 14: 29–50 (2005)
36 F. Z. Sai
Figure 1. Percentage preference for the mother’s face on trials 1, 2 and on the combined
trials.
voice was limited to the testing period, experience with the mother’s voice prior
testing might have facilitated learning of the mother’s visual features.
EXPERIMENT 2
The second experiment was designed to tackle the question of auditory cues by
preventing the infants from hearing their mother’s voice from birth through testing.
All other information was available to the participants from birth to 5 to 15 min
prior testing. The mothers were encouraged to visually interact and play with their
infants without talking or making any kind of sounds. Otherwise a similar design
and test procedures to those used in the first experiment were adopted.
Method
Participants
Participants were 14, six males and eight females (M ¼ 7:35 h, range¼2 –12 h;
M birth weight¼2.91 kg, range¼2.61–3.64; with normal Apgar scores at 5 min
(M ¼ 9, range 7–10), born at the Maternity Department, University Hospital,
Kuala Lumpur, where they were tested. The data of 16 additional infants (nine
males and seven females) were discarded from the final sample because of side
bias in their looking behaviour ðn ¼ 8Þ, or the infants fussed or fell asleep before
the testing was over ðn ¼ 6Þ, or there were technical failures of various kinds
ðn ¼ 4Þ. The subjects were volunteered by their parents who agreed to participate
in the control of voice cues.
Copyright # 2004 John Wiley & Sons, Ltd. Inf. Child Dev. 14: 29–50 (2005)
The Role of the Mother’s Voice 37
Table 2. Infants’ preference for the mother’s face across Trials in the no voice condition
Infant Sex Age (h) Trial 1 Trial 2 Combined trials
1 M 5 40 60 50
2 M 5.3 80 60 70
3 M 6.55 30 5 17.5
4 M 8 50 55 72.5
5 M 7.3 65 65 65
6 M 6 30 90 60
7 F 5 40 45 42.5
8 F 5.2 60 40 50
9 F 5 60 40 50
10 F 4.27 45 65 55
11 F 7 20 50 35
12 F 3.17 45 65 30
13 F 5.54 10 50 72.5
14 F 12 65 80 72.5
Mean 6.09 45.71 55 53.03
S.D. 2.12 19.3 20.19 17.26
Copyright # 2004 John Wiley & Sons, Ltd. Inf. Child Dev. 14: 29–50 (2005)
38 F. Z. Sai
The data were expressed in terms of PTLT infants spent fixating the mother’s
face on each trial. Looking percentages for both trials were also averaged for each
participant (Table 2).
To assess the role of experience with the mother’s voice in facilitating
recognition of the mother’s face, single-sample t test were conducted on the
PTLTs to the mother’s face on trials 1, 2 and on the combined trials (Trials 1 and
2). As a group, the newborn infants failed to significantly discriminate and
recognize the face of the mother on trial 1 (tð13Þ ¼ 0:83, p ¼ ns), trial 2
(tð13Þ ¼ 0:92, p ¼ ns), and on the combined trials (tð13Þ ¼ 0:65, p ¼ ns). Figure 2
displays the means percentage preference for the mother’s face across trials.
Experience with the mother’s voice prior testing seems necessary for mother’s
face recognition. The question how much experience with the mother’s voice–
face is needed for such learning to develop remains however, unanswered.
Further analyses were conducted to determine whether preference for the
mother differed as a function of condition and position of the mother’s face. The
performance of the infants in Experiment 1 was compared to that of infants in
Experiment 2. A mixed factorial design with condition (voice vs no voice) and
side preferences (position of the mother on the right side vs left side) as main
factors revealed no main effects of condition (Fð1; 26Þ ¼ 3:09, p ¼ ns), side
preferences (Fð1; 26Þ ¼ 0:001, p ¼ ns) or interaction, condition X side preferences
(Fð1; 26Þ ¼ 0:099, p ¼ ns). The results indicated that although the newborn infants
showed no significant differences in their preferential behaviour across
conditions, a slight preference for the mother was demonstrated in the voice
condition (Table 3). Further, the infants demonstrated a slight preference for the
mother’s face shown on the left side in the voice experiment. This pattern was
however, reversed in the no-voice experiment. (Figure 3). Perhaps when infants
failed to discriminate between the two faces, they tended to look more to the right
side. This is consistent with early reports suggesting significant head turns
towards the right side in 3 month-old infants (Gesell and Ames, 1947; Turkewitz
et al., 1965). In a study by Bahrick et al. (1996) side preferences was however not
significant at 5 or 8 months.
An interview was conducted with the mother and the female stranger at the
end of the testing of each infant in the no-voice experiment. We asked whether
Figure 2. Percentage preference for the mother’s face across trials in the no voice
condition.
Copyright # 2004 John Wiley & Sons, Ltd. Inf. Child Dev. 14: 29–50 (2005)
The Role of the Mother’s Voice 39
Figure 3. Percentage preference for the mother’s face across conditions (voice–no voice).
the infant was attracted to the stimulus faces. For the test trials of eight infants the
agreement that infants were not attracted to the faces was high (Pearson,
r ¼ 0:87).
To conclude, the findings from Experiment 2 indicate that newborn infants are
not able to recognize their mother’s face without a previous experience with the
mother’s voice. The development of face perception in the neonatal period seems
to be, therefore, contingent on a prior exposure to the mother’s voice. It is worth
noting that compared to our previous testing including that of Expt 1, neonates
showed only few head turns towards the stimulus faces in the no-voice
condition. Nevertheless evidence suggests that defense and orientation responses
are present even in uterus. Moreover, quieting characterized by a decreased fetal
heart rate (FHR) and body movements including head orientation are signs of
attention and, perhaps, fetal cognition (Kisilevsky et al., 1998). Thus, since
orientation response is present in the term foetus, then it should be present at
birth. In the present study, the removal of the mother’s voice, as a familiar cue,
Copyright # 2004 John Wiley & Sons, Ltd. Inf. Child Dev. 14: 29–50 (2005)
40 F. Z. Sai
probably did not only hinder the neonates’ ability to learn the amodal relations,
but it also depressed their attention to stimuli around them. Accordingly, a study
designed to investigate the number of infants’ head turns across both the voice
and the no-voice conditions would certainly clarify the role of the mother’s voice
in facilitating recognition of the mother’s face at birth.
EXPERIMENT 3
The role of the mother’s voice in facilitating early mother’s face preference was
further investigated using head turn as a response measure in Experiments 3 and
4. In our previous testing (Sai, 1990; Bushnell et al., 1989; Sai and Bushnell, 1988),
the newborn infants showed a number of head turns towards the stimulus faces.
Such activity disappeared along with the control of the mother’s voice in Exp. 2.
It is however well established that sounds trigger head turns on the part of the
newborn infants. They can shift their eyes and heads in the direction of a sound
quickly and accurately (Field et al., 1980), and that they will make consistent head
turns towards a sound source even when there is no visible stimulus at the sound
locus (Muir and Field, 1979). Moreover, young infants can orient appropriately if
there is a visual pattern at the source of a sound (Castillo and Butterworth, 1981).
However, we do not know whether infants who have never heard their mother’s
voice will orient towards faces, or whether neonates will orient to silent faces
even if one of them is the mother’s face.
Method
Participants
Ten neonates, five males and five females (Mean age¼7.5 h, range¼4–10; Mean
birth weight¼3.40 kg, range 2.80–4.30), with normal Apgar Scores after 5 min
(M ¼ 9:20, range 7–10), born at the Maternity Department, Larbaa Hospital,
Algeria. The data of eight additional infants (five males, three females) were
collected and rejected from the study: four for side bias in their looking
behaviour, three for falling asleep and one for experimenter error.
Copyright # 2004 John Wiley & Sons, Ltd. Inf. Child Dev. 14: 29–50 (2005)
The Role of the Mother’s Voice 41
Table 4. Number of head turns towards the mother and the stranger across trials in the
voice condition
Infant Sex Age (h) MT1 MT2 MCTS SCTs
1 M 10 3 2 5 2 1
2 M 7 5 4 9 4 2
3 M 4 2 2 4 5 3
4 M 9 4 7 11 6 2
5 M 7 2 4 6 5 3
6 F 6 4 6 10 9 5
7 F 7 5 4 9 8 4
8 F 8 4 3 7 5 2
9 F 9 6 4 10 6 4
10 F 8 3 5 8 4 2
Figure 4. Number of head turns towards the mother and the stranger across trials in the
voice condition.
Copyright # 2004 John Wiley & Sons, Ltd. Inf. Child Dev. 14: 29–50 (2005)
42 F. Z. Sai
stranger). Infants showed more head turns towards their mother’s face
(M ¼ 3:95) than towards the female stranger’s face (M ¼ 2:7) and this difference
was significant (Fðl; 9Þ ¼ 15:4, p ¼ 0:003). Also, neonates turned their heads more
towards the mother on trial 2 (M ¼ 4:1) than on Trial 1 (M ¼ 3:8), and this
difference was significant (Fð1; 9Þ ¼ 0:018, p ¼ ns), (Table 4). The interaction of
face x trial failed to reach significance (Fðl; 9Þ ¼ 0:78, p ¼ ns). Preferences for the
mother and the stranger are given in Figure 4.
From the findings with both trials included, newborn infants demonstrated
more head turns towards their mother’s face (M ¼ 7:9) than towards the female
stranger (M ¼ 5:4). Further, head movements were observed but were not
considered as full head turns, as defined above. Preference for the mother’s face
was further replicated in the present study with non-Caucasian faces, and with
head orientation as a response measure. A possible interpretation of the findings
is that since both the mother’s voice and visual cues were available from birth to
the time of testing, neonates might have demonstrated a long-term memory effect
for their mother’s voice–face relations in the recognition test.
EXPERIMENT 4
The purpose of this study was to determine whether the absence of the mother’s
voice, from birth through testing, would result in less head turns towards the
mother’s face.
Method
Participants
There were 15 participants (eight males and seven females) successfully tested
in this experiment while a further 10 infants were eliminated from the sample
because of side bias in their looking behaviour ðn ¼ 4Þ or fell asleep ðn ¼ 6Þ. The
mean age of the sample was 6.31 h (range 2–10), the mean birth weight was
3.30 kg (range 2.98–3.60), the mean Apgar score after 5 min was 9.50 range 7–10).
They were born and tested at the Maternity Department; Larbaa Hospital,
Algeria.
Head orientations towards the mother and the stranger were calculated as before,
and this information is set out in Table 5. The subjects had almost equal
preferences for the faces of the mother (M ¼ 1:72) and that of the female stranger
Copyright # 2004 John Wiley & Sons, Ltd. Inf. Child Dev. 14: 29–50 (2005)
The Role of the Mother’s Voice 43
Table 5. Number of head turns towards the mother and the female stranger across trials
Infant Sex Age (h) Mt1 Mt2 MCTs St1 St2 SCTs
1 M 6 1 4 5 2 4 6
2 M 9 3 2 5 3 4 7
3 M 6 3 1 4 2 2 4
4 M 7 1 2 3 1 3 4
5 M 7 1 3 4 3 1 4
6 M 5 2 1 3 1 1 2
7 M 10 1 3 4 1 4 5
8 M 7 2 2 4 1 2 3
9 F 8 2 2 4 3 0 3
10 F 9 0 3 3 2 2 4
11 F 5 2 2 4 3 2 5
12 F 6 2 1 3 2 1 3
13 F 8 0 0 0 2 2 4
14 F 4 1 2 3 3 0 3
15 F 2 2 1 3 1 2 3
16 F 2 1 2 3 1 1 2
Figure 5. Number of head turns towards the mother and the stranger across trials in the
no-voice condition.
Copyright # 2004 John Wiley & Sons, Ltd. Inf. Child Dev. 14: 29–50 (2005)
44 F. Z. Sai
Figure 6. Number of total head turns to the mother and the stranger across voice–no-voice
conditions.
Table 6. Number of total head turns towards the mother and the stranger across voice–
no-voice conditions
Voice condition No-voice condition
Infant MCTS SCTS Infant MCTS SCTS
1 5 2 1 5 6
2 9 4 2 5 7
3 4 5 3 4 4
4 11 6 4 3 4
5 6 5 5 4 4
6 10 9 6 3 2
7 9 8 7 4 5
8 7 5 8 4 3
9 10 6 9 4 3
10 8 4 10 3 4
11 4 5
12 3 3
13 0 4
14 3 3
15 3 3
16 3 2
Mean 7.9 5.4 3.43 3.87
S.D. 2.33 2.01 1.15 1.36
Copyright # 2004 John Wiley & Sons, Ltd. Inf. Child Dev. 14: 29–50 (2005)
The Role of the Mother’s Voice 45
for the mother’s face (Fðl; 24Þ ¼ 0:208, p ¼ ns). The occurrences of head turns
toward the mother’s face in the voice experiment were more frequent than in the
no-voice experiment regardless of the position of the mother (Figure 6). The
number of head turns towards the mother and the stranger were calculated as
before and are displayed in Table 6.
GENERAL DISCUSSION
The findings from these studies show that neonates can visually recognize their
mother’s face only if a postnatal exposure to the mother’s voice–face combination
was available. A common expressed interpretation of the presence of the
mother’s face recognition in the neonatal period is in terms of an innate
perceptual mechanism that detects and responds specifically to faces (Johnson
and Morton, 1991; Meltzoff, 1995; Walton et al., 1997). The present results do not
support this hypothesis. If the CONSPEC mechanism, were the only mediator for
face preference at birth, then infants would have been able to recognize their
mother’s face even when no postnatal exposure to the mother’s voice was
available. As well as showing that at birth there is a preference for some spatial
frequencies that are shared by other stimuli other than face-like pattern, would be
insufficient to claim that there is an innate mechanism specific for faces. The
present findings indicate that preference for the mother’s face is not exclusively
determined by the unique structure of the face. Rather, intermodal structural
properties, which other stimuli possess, would be crucial in determining
newborns’ preferences for the mother’s face.
Thus, the model may be able to explain face preference in older infants,
but it cannot account for the data from studies with real faces which
had been previously associated with voice cues such as the mother’s
voice. Nor can it explain the neonates’ failure to recognize the mother’s
half-profile face where information about the internal configuration was still
available (Sai, 1994).
Given that the infant is born in a multimodal stimulation unit, it is very absurd
to constrain preference for the mother’s face within visual parameters.
Alternatively, an explanation, which takes into account both the auditory and
visual information, would be plausible, especially that young infants are adept
perceivers of multimodal stimulation (Lewkowicz and Lickliter, 1994). They
rapidly turn their heads and eyes in the direction of a sound source (Wertheimer,
1961; Butterworth, 1983; Muir and Clifton, 1985), and they quickly learn arbitrary
visual–auditory associations (Slater et al., 1997). Thus, the amodal information of
temporal synchrony of voice and lips seems likely to facilitate learning the
mother’s voice–face combination. The multimodal stimulation of the mother’s
face–voice could be more powerful than Conspec in attracting newborns’ eye and
head turns.
Learning the mother’s voice–face association might be therefore, initially
triggered by the mother’s voice, a highly familiar stimulus from the time of
uterus. It is then strengthened with exposure to the mother’s voice and face
association. Once this latter has been encoded, the sight of the mother’s face
alone might elicit a visual recognition of the mother’s face as demonstrated in
Exps. 1 and 3.
The particular attractiveness of the mother’s voice has been documented by
several authors (see Andre-Thomas, 1966; Brazelton, 1978). The argument
accounting for this phenomenon could be a postnatal association of the mother’s
Copyright # 2004 John Wiley & Sons, Ltd. Inf. Child Dev. 14: 29–50 (2005)
46 F. Z. Sai
voice with positive reinforcers such as the sight of the mother. Conversely,
preference for the mother’s face is likely to be rooted in foetal exposure to the
mother’s speech (Decasper and Spence, 1991; Turkewitz, 1988; Richards et al.,
1992). A growing body of evidence has demonstrated the functioning of foetal
sensory systems during gestation, as well the sequential functional development
of sensory systems (for review, see Lecanuet and Schaal, 1996). The auditory
system develops much earlier than the visual system (Gottlieb, 1971), and healthy
full-term human foetuses respond to a wide variety of sounds and vibrations
originating from outside the womb (Birnholz and Benaceraf, 1983; Kisilevsky and
Muir, 1991; Lecanuet et al., 1986; Serafini and Lindsay, 1984). As well, FHR and
movements to vibroacoustic stimulation were reported in foetuses as early as 26
weeks gestation (Kisilevsky et al., 1992). Data from studies on uterus speech
transmission performed with a hydrophone near the foetal head during delivery,
have revealed a significant better transmission of the maternal voice than of the
external voice (Querleu et al., 1988). We also know that the mother’s voice is the
only auditory stimulus that is transmitted internally via body tissues and bones
(Petitjean, 1989); it also involves diaphragmatic movements generating pressure
changes in the fluid medium of the uterus. Also, prenates of mothers who
listened to ‘soap opera’ theme music during pregnancy showed a preference for
the same music during the postnatal period (Hepper, 1988). The longer the
prenatal exposure of human neonates, living in the Osaka airport neighbour-
hood, to airplane noises, the better they slept compared to babies whose mothers
had lived in the area of the airport for shorter times during pregnancy (Ando and
Hattori, 1977). The work of DeCasper and Fifer (1980), DeCasper and Spence
(1986), DeCasper and Prescott (1984) reveals a complexity of bonding and
language processing during the prenatal period. Given that the presence of
potential sensory stimulation in the foetal milieu as well the presence of foetal
sensory sensitivity to acoustic (e.g. Lecanuet et al., 1995) and vibroacoustic (e.g.
Kisilevsky, 1995) stimuli, and the continuity of foetal and newborn response to
sound and vibration, it is, therefore, possible that neonates are born with a long-
term memory for the mother’s voice.
However, the question whether the neonates rapidly associate the mother’s
voice–face at birth, or demonstrate a long-term memory effect for the mother’s
voice learned while the foetus was still in the mother’s womb remains
unanswered. Nevertheless, data from one of our studies in which newborn
infants were exposed to both the mother’s voice and face, among others, show
that mother preference was stronger with shorter delays than with longer delays
between last exposure to the mother and preference testing (Sai, 1990), indicating
that learning the mother’s face is very stable and established as a long-term
memory. The mother’s voice–face association seems, therefore, to take place at
least shortly after the infant’s birth.
These findings replicate and extend our previous research suggesting a rapid
learning of the mother’s face at birth. Such learning, however, vanished when a
previous exposure to the mother’s voice–face combinations (Exps 2 and 4) was
made unavailable to neonates from birth through testing. A prior exposure to the
mother’s voice–face combination remains, therefore, a significant predictor of the
mother’s face recognition at least within the first days of birth. We still do not
know precisely what visual features, the newborns associated to the mother’s
voice in order to make their preferential choices. Nevertheless, the results
strongly suggest an intermodal learning at birth, probably facilitated by a long-
term memory for the mother’s voice learned in the confines of the womb. One
interesting direction for future research concerns the amount of exposure to the
Copyright # 2004 John Wiley & Sons, Ltd. Inf. Child Dev. 14: 29–50 (2005)
The Role of the Mother’s Voice 47
ACKNOWLEDGEMENTS
REFERENCES
Ando Y, Hattori H. 1977. Effects of noise on sleep of babies. Journal of the Acoustical Society
of America 47: 1128–1130.
Andre-Thomas AS. 1966. Locomotion from prenatal life. Spatic Society. Heinemann:
London.
Ashton R. 1973. The state variable in neonatal research: A review. Merrill-Palmer Quarterly
19: 3–20.
Aslin RN. 1987. Visual and auditory development in infancy. In Handbook of Infant
Development (2nd edn), Osofsky JD (ed.). Wiley: New York.
Bahrick LE. 1987. Infants’ intermodal perception of the two levels of temporal structure in
natural events. Infant Behavior and Development 10: 387–416.
Bahrick LE. 1988. Intermodal learning in infancy: learning on the basis of two kinds of
invariant relations in audible and visible events. Child Development 59: 197–209.
Bahrick LE. 1994. The development of infants’ sensitivity to arbitrary intermodal relations.
Ecological Psychology 6: 111–123.
Bahrick LE, Moss L, Fodil C. 1996. The development of visual self-recognition in infancy.
Ecological Psychology 8: 189–208.
Birnholz JC, Benaceraf BR. 1983. The development of human fetal hearing. Science 222:
516–518.
Brazelton TB. 1978. The remarquebale talents of the newborn. Birth and the Family Journal 5:
4–10.
Bushnell EW, Boudreau JP. 1991. The development of Haptic perception during infancy. In
The Psychology of Touch, Heller MA, Schiff W (eds). Erlbaum: Hillsdale, NJ.
Bushnell EW, Boudreau JP. 1993. Motor development and the mind: The potential role of
motor abilities as a determinant of aspects of perceptual development. Child Development
64: 1005–1021.
Bushnell IWR, Sai FZ, Mullin J. 1989. Neonatal recognition of the mother’s face. British
Journal of Developmental Psychology 7: 3–15.
Butterworth G. 1983. Structure of the mind in human infancy. In Advances in Infancy
Research, Lipsitt LP, Rovee-Collier CK (eds), vol. 2. Ablex: Norwood, NJ; 1–19.
Copyright # 2004 John Wiley & Sons, Ltd. Inf. Child Dev. 14: 29–50 (2005)
48 F. Z. Sai
Caron AJ, Caron RF, MacLean DJ. 1988. Infant discrimination of naturalistic emotional
expressions: the role of face and voice. Child Development 59: 604–616.
Castillo M, Butterworth GE. 1981. Neonatal localisation of a sound in visual space.
Perception 10: 331–338.
Clifton RK, Morrongiello BA, Kulig JW, Dowd JM. 1981. Newborns’orientation towards
sound: possible implications for cortical development. Child Development 53: 833–838.
DeCasper AJ, Fifer W. 1980. Of human bonding: newborns prefer their mothers’ voices.
Science 208: 1174–1176.
DeCasper AJ, Prescott PA. 1984. Human newborns’ perception of male voices: preference,
discrimination, and reinforcing value. Developmental Psychobiology 17: 481–491.
DeCasper AJ, Spence MJ. 1986. Prenatal maternal speech influences newborns’ perception
of speech sounds. Infant Behavior and Development 9: 133–150.
DeCasper AJ, Spence MJ. 1991. Auditory mediated behavior during the prenatal period: a
cognitive view. In Newborn Attention Biological Constraints and the Influence of Experience,
Weiss MJS, Zealot PR (eds). Ablex: Norwood, NJ; 142–176.
Dziurawiec S. 1987. Neonates’ attention to faces. In Developmental Aspects of Face
Recognition, Bruce V, Ellis H, Young A (eds). Unpublished report on ESRC workshop,
Grange-over-sands.
Fantz RL. 1966. Pattern discrimination and selective attention as determinants of
perceptual development from birth. In Perceptual Development in Children, Kidd AH,
Rivoire JL (eds). International Universities Press: New York.
Fearon I, Kisilevsky BS. 1994. Fetal responsiveness to vibroacoustic stimulation: influence
of vibrator characteristics. Poster presented at the 9th International Conference on Infant
Studies, Paris.
Field J, Muir D, Pilon R, Sinclair M, Dodwell P. 1980. Infants’ orientation to lateral sounds
from birth to three months. Child Development 51: 295–298.
Field TM, Cohen D, Garcia R, Greenberg R. 1984. Mother–stranger face discrimination by
the newborn. Infant Behavior and Development 7: 19–25.
Gesell A, Ames LB. 1947. The development of handedness. Genetic Psychology 70: 155–176.
Gibson EJ. 1983. Development of knowledge about intermodal unity. Two views. In Piaget
and the Foundations of Knowledge, Liben LS (ed.). Erlbaum: Hillsdale, NJ; 19–41.
Goren CC, Sarty M, Wu PYK. 1975. Visual following and pattern discrimination of face-like
stimuli by newborn infants. Pediatrics 56: 544–548.
Gottlieb G. 1971. Development of Species Identification in Birds. University of Chicago Press:
Chicago.
Gottlieb G. 1978. Development of species identification in ducklings. IV. Changes in
specific perception caused by auditory deprivation. Journal of Comparative Physiological
Psychology 92: 375–387.
Gottlieb G. 1991. Epigenetic systems view of human development. Developmental
Psychology 27: 33–34.
Hepper PG. 1988. Fetal ‘soap’ addiction. The Lancet 1: 1147–1148.
Johnson MH, Morton J. 1991. Biology and Cognitive Development. Blackwell: Oxford, UK.
Kisilevsky BS. 1995. The influence of stimulus and subject variables on human fetal
responses to sound and vibration. In Fetal Development: A Psychobiological Perspective,
Lecanuet J-P, Fifer WP, Kransnegor NA, Smotherman WP (eds). Lawrence Erlbaum:
Hillsdale, NJ; 263–278.
Kisilevsky BS, Fearon I, Muir DW. 1998. Fetuses differentiate vibroacoustic stimuli. Infant
Behavior and Development 1(1): 25–46.
Kisilevsky BS, Muir DW. 1991. Human fetal and subsequent newborn responses to sound
and vibration. Infant Behavior and Development 14: 1–26.
Kisilevsky BS, Muir DW, Low JA. 1992. Maturation of human fetal responses to
vibroacoustic stimulation. Child Development 63: 1497–1508.
Kuhl PK, Meltzoff AN. 1982. The bimodal perception of speech in infancy. Science 218:
1138–1141.
Kuhl PK, Meltzoff AN. 1984. The intermodal representation of speech in infants. Infant
Behavior and Development 7: 361–381.
Lecanuet J-P, Grannier-Deferre C, Busnel M-C. 1995. Human fetal auditory perception. In
Fetal Development: A Psychological Perspective, Lecanuet J-P, Fifer WP, Krasnegor NA,
Smotherman WP (eds). Lawrence Erlbaum: Hillsdale, NJ; 239–262.
Copyright # 2004 John Wiley & Sons, Ltd. Inf. Child Dev. 14: 29–50 (2005)
The Role of the Mother’s Voice 49
Copyright # 2004 John Wiley & Sons, Ltd. Inf. Child Dev. 14: 29–50 (2005)
50 F. Z. Sai
Sai FZ. 2000. Neonatal recognition of the mother’s face. Evidence for intermodal
perception. Paper presented at the XII Biennial International Conference on Infant
Studies (ICIS), Brighton, UK.
Sai FZ, Bushnell IWR. 1987. Neonatal recognition of the mother’s face. University of
Glasgow Psychological Report. No. 87/1: Glasgow University.
Sai FZ, Bushnell IWR. 1988. The perception of faces in different poses by 1-month olds.
British Journal of Developmental Psychology 6: 35–41.
Scibetta JJ, Rosen MG, Hochbbberg CJ, Chik L. 1971. Human fetal brain response to sound
during labor. American Journal of Obstetrics and Gynecology 109: 82–85.
Serafini P, Lindsay MBJ, Nagey DA, Pupkin MJ, Tseng P, Crenshaw C Jr. 1984. Antepartum
fetal heart rate response to sound stimulation: the acoustic stimulation test. American
Journal of Obstetrics and Gynecology 148: 41–45.
Slater A, Brown E, Badenoch M. 1997. Intermodal perception at birth: newborn infants’
memory for arbitrary auditory–visual pairings. Early Development and Parenting 6:
99–104.
Slater A, Butterworth G, Samuels C. 1998. Newborn infants prefer attractive faces. Infant
Behavior and Development 21(2): 345–354.
Spelke B, Owsley CJ. 1979. Intermodal exploration and knowledge in infancy. Infant
Behaviour and Development 2: 13–24.
Turati C, Simion F, Milani I, Umila C. 2002. Newborns’ preference for faces: what is crucial.
Developmental Psychology 38(6): 875–882.
Turkewitz G. 1988. A prenatal source for the development of hemispheric specialization. In
Brain Lateralization in Children, Molfese DL, Segalowitz J (eds). Guilford Press: New York;
73–83.
Turkewitz G, Gordon EW, Birch HG. 1965. Head turning in the human neonate:
spontaneous pattern. The Journal of Genetic Psychology 107: 143–158.
Walker-Andrews AS. l997. Infants’ perception of expressive behaviors: differentiation of
multimodal information. The Psychological Bulletin 12(3): 437–456.
Walton GE, Amstrong ES, Bower TGR. 1997. Faces as forms in the world of the newborn.
Infant Behavior and Development 20: 537–543.
Walton GE, Bower NNJA, Bower TGR. 1992. Recognition of familiar faces by newborns.
Infant Behavior and Development 15: 265–269.
Walton GE, Bower TGR. 1993. Newborns form ‘prototypes’ in less than 1 minute.
Psychological Science 4: 203–205.
Wertheimer M. 1961. Psychomotor coordination of auditory and visual space at birth.
Science 134: 1692.
Copyright # 2004 John Wiley & Sons, Ltd. Inf. Child Dev. 14: 29–50 (2005)