Embodied Cognitive Robotics and The Learning of Sensorimotor Schemes

Special Issue: Spotlight on 4E Cognition Research in Mexico
Adaptive Behavior
1–14
Ó The Author(s) 2018
Embodied Cognitive Robotics and the Reprints and permissions:
sagepub.co.uk/journalsPermissions.nav
learning of sensorimotor schemes DOI: 10.1177/1059712318780679
journals.sagepub.com/home/adb
Bruno Lara1 , Dadai Astorga1, Emmanuel Mendoza-Bock2,

Manuel Pardo1, Esaú Escobar1 and Alejandra Ciria1,3
Abstract
Embodied Cognitive Robotics focuses its attention on the design of artificial agents capable of performing cognitive tasks
autonomously. A central issue in this consists in studying process by which agents learn through interaction with their
environment. Embodied Cognitive Robotics aims to implement models of cognitive processes coming from Cognitive
Sciences. The guidelines in this research area are a direct response to the shortcomings of Classical Artificial
Intelligence, where high-level tasks and behaviors were studied. This article describes the work carried out in the
Cognitive Robotics Laboratory at the Universidad Autónoma del Estado de Morelos (UAEM). Our work is based on the
concept of low-level sensorimotor schemes coded by Internal Models, thus falling as a matter of course within the tenets
of Embodied Cognition, particularly with the idea that cognition must be understood as occurring in agents that have a
body with which they interact in a specific environment. It is through this interaction that learning emerges laying the
ground for cognitive processes. Our research includes theoretical work laying the foundations of Embodied Cognitive
Robotics, as well as work with artificial and with natural agents.
Keywords
Embodied Cognitive Robotics, Internal Models, sensorimotor schemes, Embodied Cognition
Handling Editor: Ximena González-Grandon, National Autonomous University of Mexico, Mexico
1. Introduction of success is one of the main reasons why critics of

Classical AI proposed the need to design artificial
Artificial Intelligence (AI), now known as Classical or agents situated in the world, emphasizing the funda-
Traditional AI emerged as a discipline aiming to repro- mental role the body plays during the interaction with
duce certain aspects of human intelligence in machines. the environment in producing and learning intelligent
Its hypothesis was that if all aspects of learning or any behavior (Dreyfus, 1967). The most natural platforms
other intelligence capability could be described with to study this interaction are artificial agents (Asada
precision, then it should be possible to be reproduced in et al., 2009; Moravec, 1984).
machines (McCarthy, Minsky, Rochester, & Shannon, Perception and sensorimotor abilities have come to
2006). Classical AI was strongly influenced by the be the really hard problems to solve. However, accord-
school of cognitivism, a paradigm affirming that human ing to this approach, once an agent has basic sensing
cognition is based on the manipulation of symbolic
representations through a set of rules (Newell & Simon,
1
1976). However, the idea that intelligent behavior can Cognitive Robotics Laboratory, Center for Science Research,
Universidad Autónoma del Estado de Morelos (UAEM), Cuernavaca,
be captured using only symbolic representations has México
been widely criticized. 2
Instituto de Investigaciones Filosóficas, UNAM, Mexico City, Mexico
After decades of research, even though AI has man- 3
Facultad de Psicologı́a, UNAM, Mexico City, Mexico
aged significant success in specific domains such as the
generation of algorithms capable of solving abstract Corresponding author:
Bruno Lara, Cognitive Robotics Laboratory, Center for Science Research,
problems, it has not been able to produce artificial sys- Universidad Autónoma del Estado de Morelos (UAEM), Av. Universidad
tems imitating basic human abilities such as sensorimo- No. 1001, Cuernavaca CP 62100, México.
tor processes or perception (Mingers, 2001). This lack Email: bruno.lara@uaem.mx
2 Adaptive Behavior
and moving capabilities, higher order abilities should Internal Models have been proposed to code the sen-
become easier to implement. High-order abilities sorimotor schemes that emerge from the interaction of
include problem solving and language expert knowl- agents with their environment. Originating in control
edge, among others (Brooks, 1991). Following these theory (Miall & Wolpert, 1996), these models have been
lines, a fraction of researchers in AI turned to the use used to explain a wide range of cognitive phenomena
of robots as situated agents in what became known as (for a review of the topic, see Schillaci, Hafner, and
Embodied Cognitive Robotics (Clark & Grush, 1999; Lara, 2016). The core of the research reported here is
Pfeifer, Lungarella, & Iida, 2007; Pfeifer & Scheier, Internal Models and the multimodal associations that
2001). Embodied Cognitive Robotics is in accordance are necessary for them to emerge.
with the tenets of the embodied and embedded frame- This article is organized as follows: the next section
works, which highlight the importance of the interac- attempts to lay out the links between AI and Embodied
tion of agents with their environment for the Cognition, positioning the work reported here in the
development of cognitive processes (Pfeifer & Scheier, wide 4E cognition framework (cognition as embodied,
2001). embedded, extended, and enactive) and clarifying the
Development is about change throughout the life- reasons to frame it mainly within Embodied Cognition.
span of agents and this, without a doubt, requires learn- The section contains a subsection on Internal Models,
ing and adaptive capabilities. Movement is perhaps the which are at the core of the research carried out in our
most pervasive and fundamental of all psychological lab. The next section ‘‘Embodied sensorimotor
activities providing a highly significant source of per- schemes,’’ presents an overview of the work carried out
ceptual experiences (Adolph & Berger, 2006). Humans in our laboratory. First, it outlines some of the theore-
are endowed genetically with the capability and propen- tical discussions on artificial agents and intelligence,
sity to learn sensorimotor behavioral structures also then it presents work carried out on mobile agents,
referred to as sensorimotor schemes (Harvey, 2018). humanoids, and finally natural agents. The experiments
Human fetal motility starts at 7.5 weeks of gestational are ordered according to the complexity of the architec-
age with isolated stereotyped movements of arms, legs, tures used as well as the different cognitive processes
and head, leading to more varied and complex patterns they model. In the final section, we draw conclusions
of movement as early as at 9 weeks of gestational age and discuss further research.
(Lacquaniti, Ivanenko, & Zago, 2012; Lüchinger,
Hadders-Algra, Van Kan, & de Vries, 2008). Later on,
the newborn child is prepared to interact with the envi- 2. AI and Embodied Cognition
ronment and adapt to it, displaying functional goal-
directed, prospective and flexible movements (von AI aims to create tools or agents that are capable of
Hofsten, 2003, 2009). exhibiting intelligent behaviors. Among other things,
Movement cannot rely on sensorimotor feedback the AI community managed to create a machine that
because of the delays in efferent and afferent pathways successfully plays chess against the best human players.
and the corresponding processing delays. These delays However, to date there is still not a machine capable of
are known to vary across modalities in the order of physically moving the pieces on a chess board as skill-
approximately 200 ms (Desmurget & Grafton, 2000; fully as a child. As a consequence, the idea of having a
Miall & King, 2008; Miall, Weir, Wolpert, & Stein, body in order to act in the real world and give meaning
1993; von Hofsten, 2009). In order to overcome this to the internal processing produced a major shift in AI
control problem, agents progressively and successfully research. Having a body is necessary for cognition and
predict the consequences of behavior by learning the thus a prerequisit for intelligent behavior (e.g. Pfeifer
associations between actions and their consequences and Bongard, 2006). The implementation of these ideas
during the body–environment interaction. This interac- in AI became known as Embodied Cognitive Robotics.
tion leads to the continuous learning of rules and regu- Importantly, the Embodied Cognitive Robotics agenda
larities of events, further building more complex and its theoretical foundations acquired more solid
sensorimotor schemes (von Hofsten, 2007, 2009). ground as Cognitive Sciences suffered a major shift of
Moreover, a constant recalibration of learned sensori- paradigm, namely, the conception of the Embodied
motor schemes, in addition to the acquisition of new Cognition framework.
ones, and the development of the physical body enable The 4E cognition has been characterized, in general
behavioral patterns to become more complex, increas- terms, as the claim that cognition is embodied,
ing the possibilities for interaction with the world ‘‘for embedded, extended, and enactive. The core of this
further development of intelligence to proceed in an view is that motor skills (the body), the material and
incremental fashion’’ (Harvey, 2018, p. 26). Thus, sociocultural world (the environment), and their combi-
‘‘from a functional perspective, cognitive development nation in specific contexts (the body–environment
has to do with expanding the prospective control of interaction) are indispensable components of cognition
actions’’ (von Hofsten, 2007, p. 58). (Anderson, 2003; Chiel & Beer, 1997; Clark, 1999;
Lara et al. 3
Lakoff & Johnson, 1999; Noë, 2004; Pezzulo et al., regularities in sensory input enhancing internal infor-
2011; Pfeifer et al., 2007; Robbins & Aydede, 2009; mation processing, and from these regularities cognitive
Shapiro, 2010; Varela, Thompson, & Rosch, 2017; behavior can emerge. It is suggested that the associa-
Wilson, 2002). These four claims of cognition represent tion of sensory and motor information, known as sen-
a family of individual explanatory frameworks, which, sorimotor schemes, could be the ideal unit of analysis
in a strict sense, cannot be reduced to four labels. At for the study of cognition. Again, we believe that beha-
the same time, there are also frameworks that include vior is directly affected by two elements, the ecological
the affective approach, sometimes presented as 4EA niche in which the agent is immersed and the morphol-
cognition (see Kiverstein and Clark, 2009; Ward and ogy of the agent (location of its sensors and effectors).
Stapleton, 2012). Cognitive Sciences literature has Furthermore, Anderson (2003) suggests that beha-
emphasized the fact that there is no homogeneity vior represents emergent phenomena that can be attrib-
between the different 4E cognition frameworks, mainly uted to the physical foundations that have evolved over
because there are significant differences in the models time. Therefore, the foundations of cognitive processes
and explanations for the subject of study. It is thought lie in the different bodily modalities of the sensorimotor
that an attempt to characterize and generalize their system. Cognition must be understood in terms of how
interests might lead to neglect peculiar aspects that are it functions under the pressures of real-time interaction
particular to the explanatory frameworks for each of with the environment (Wilson, 2002, p. 626). When cog-
the claims (Anderson, 2003; Rowlands, 2010; Shapiro, nition takes place in a context of inputs and outputs rel-
2010). Also, placing the labels under the same general evant to a specific task, it means cognition is situated.
scheme risks losing both the individual nuances and the That is to say, during cognitive processing, perceptual
real disagreements between the different proposals (Di information continues to affect processing, and at the
Paolo, 2009; Kiverstein and Clark, 2009; Menary, same time motor activity affects the environment (on-
2010). line cognition).
The work conducted in our laboratory follows Given that not all cognitive agents are capable of
mainly the ideas of Embodied Cognition, based on the carrying out certain tasks, it is essential to bear in mind
notion that cognition is determined not only by brain that it is body morphology which allows agents to
processes but also by a combination of these, together diversify their interaction with a constantly changing
with the body morphology and capabilities particular environment. At the same time, it is the environment
to the agents situated in an environment (Rowlands, which allows an agent to evolve particular cognitive
2010). Specifically, our work takes, from the notion of capabilities (Pfeifer et al., 2007; Varela et al., 2017;
‘‘embodiment,’’ the principles of processing of sensori- Wheeler & Clark, 2008).
motor schemes, simulation models, and multimodal During development, agents learn sensorimotor
associations. In doing so, we distance ourselves from schemes; these provide predictions of sensory informa-
the controversies arising when attempting to bring all tion to efficiently interact with the environment
the E’s together under one umbrella. (Harvey, 2018). Sensorimotor schemes associate differ-
The bodily skills to respond to the environment are ent sensory modalities (e.g. exteroceptive and interocep-
as constitutive to cognition as internal processing. This tive) which reflect the knowledge of an agent about the
statement implies that in order to understand cognition regularities of the environment as consequences of its
it is necessary to understand the complex and dynamic actions. Moreover, the knowledge about sensory conse-
interaction between the brain, body, and environment quences of self-generated actions provides the feeling of
(Clark & Grush, 1999; Van Gelder & Port, 1995). It agency and, as a result, a subjective point of view of the
has been suggested that some behaviors that can be world. Importantly, the sensorimotor schemes provide
classified as cognitive arise from the dynamic interac- a link to Gibsonian affordances, which are particularly
tion between the basic physical abilities of the agents appealing for Embodied Cognitive Robotics (Nasuto &
and their environment. Thus, the physiology, the evolu- Hayashi, 2016).
tionary history, the practical activity of the agent, and Traditionally, cognitive processing implies the inte-
the situation it is immerse in play the role of modeling, gration of what has been known as high- and low-level
limiting, and grounding the projects of Embodied processes (Koeinig, Ktihnberger, & Kietzmann, 2013).
Cognition (Anderson, 2003). However, rather than insisting on this distinction, we
The physical structure of the body sets limits, gives assume that sensorimotor schemes underlie all types of
precise indications, and offers opportunities for neural cognitive processes.
control and cognitive processes (Chiel & Beer, 1997; A telling example is off-line cognition. Humans can
Rowlands, 2010; Shapiro, 2007). Pfeifer et al. (2007) have processes that are not necessarily situated. If
argue that the continuous and dynamic interaction of situatedness is a condition for cognition and it is not
the agent with its environment (physical processes and fulfilled, then it seems that these processes cannot be
incoming sensory information) induces statistical considered as cognitive. This intuitive idea is
4 Adaptive Behavior
contradictory since humans perform cognitive pro- 2.1. Internal Models

cesses that can be carried out in the absence of inputs Cognitive processing depends on the sensorimotor cap-
and outputs relevant to the task at hand. Humans are
abilities an agent possesses, based on its body, situated
capable of cognitive processing that is not in direct
in a specific environment. Our main research interest is
interaction with the environment (off-line cognition;
centered on the understanding of how agents acquire,
Wilson, 2002). For Embodied Cognition, non-situated
modify, and improve their sensorimotor schemes dur-
processes are based on mechanisms that evolved for
ing interaction with the environment to accurately pre-
agents to interact with their envorinment while
dict the sensory consequences of self-generated actions.
immersed in it. Thus, off-line cognition is based on sen-
Internal Models have been shown to be a good strat-
sory and motor control mechanisms (Wilson, 2002, p.
egy to acquire the regularities of the environment as
626). It is possible that human and non-human agents
they are constructed by learned multimodal associa-
may invoke previously situated modalities. In non-
tions. Stretching the capabilities of Internal Models, the
situated processes, cognition can resort to a simulation
authors followed the ideas of Dewey and Heidegger
of bodily processes or ‘‘simulated situations’’ that oper-
and presented arguments linking a version of ‘‘inten-
ate under the same physical and situational limitations.
tionality’’ with the notion of Internal Models as the
In the simulated situation, the central nervous system
core structure for coding basic sensorimotor schemes
performs a reactivation of sensory information from
(Lara Guzmán, Hermosillo Valadez, & Baeza Mariscal,
different sources (e.g. visual, proprioceptive) and pre-
dicted consequences of states (Barsalou, 2008). The 2017). The proposed notion of ‘‘intentionality’’ is
multimodal associations, used during simulated off-line understood in a pragmatic sense, that is, as a basic-level
situations, rely on previously learned schemes and on- agent–environment interaction.
line experiences. In other words, simulations are a reac- The notion of Internal Models implies that any
tivation of the representations of a relevant learned epi- model for motor adaptation knows something about its
sode and the bodily resources associated with motor apparatus and the environment (Kawato, 1999).
sensorimotor strategies of the past (Pezzulo et al., In the study of Internal Models, forward and inverse
2011). Barsalou (2008) argues that such a system allows models have been proposed (Jordan & Rumelhart,
the brain to admit different kinds of simulation in dif- 1992), and studied in many other disciplines, such as
ferent cognitive processes, such as high-level percep- neuroscience (Kawato, 1999; Wolpert, Ghahramani, &
tion, working memory, implicit memory, long-term Flanagan, 2001), biology (Webb, 2004), and philosophy
memory, and conceptual knowledge. (Karniel, 2002; Kiverstein, 2007). Each of these disci-
Finally, we come to an important question, namely, plines presents different ontological commitments to
how can cognitive abilities be defined or, more pre- the notion of Internal Models. It is even possible that
cisely, what does a cognitive process involve. In the different disciplines understand the terms related to
Cognitive Sciences, there exists a discussion on the the notion in a different and perhaps incompatible
notion of cognition. The issue in these disputes lies in manner, which means that this can present epistemolo-
defining cognition in different terms and therefore in a gical repercussions (Mendoza Bock, 2013). However, in
disagreement about future experiments that would be Embodied Cognitive Robotics, care has been taken to
essential to arbitrating their differences (Buckner & use only their core capabilities, namely, their ability to
Fridland, 2017). fuse multimodal information and provide predictions.
We agree with the suggestion of Newen (2017) that An inverse model, or controller, generates a motor
to define cognition it is necessary to first characterize a command to the motor plant and a forward model pre-
cognitive process using an example-based approach. dicts the output of the system, that is, the sensory con-
Newen (2017) proposed the comparator mechanism as sequences based on the motor command (Karniel,
a typical cognitive process. This mechanism is based on 2002). A forward model incorporates knowledge about
Internal Models and has been widely studied in the sensory changes that will be produced by self-
Embodied Cognitive Robotics (Schillaci et al., 2016). generated actions of an agent. In other words, the for-
Rodrı́guez (2012) presents a general discussion of the ward model predicts the sensory situation St+ 1 given a
relationship between Embodied Cognition and motor command Mt applied to an initial sensory situa-
Embodied Cognitive Robotics, presenting the concept tion St (see Figure 1). Thus, forward models play a cen-
of Internal Models as a suitable methodological tool to tral role in cognitive agents as they provide predictions
test the ideas of Embodied Cognition. of the sensory consequences of motor commands
Namely, that having a body is necessary for intelli- (Maye & Engel, 2012; Schillaci et al., 2016; Zambelli &
gent behavior and the interaction of body and environ- Demiris, 2015).
ment is the fundament for the development of cognitive The essence of Internal Models or the process of the
abilities. Embodied Cognitive Robotics conceives intel- comparator mechanism, as the name suggests, is the
ligent behavior as the actions that occur during the situ- comparison between expected sensory output from the
ated body–environment interaction. model and the actual sensory input generated by the
Lara et al. 5
these representations are linked to the external world

remains elusive. In response, Embodied Cognitive
Robotics has proposed that these representations arise
from the motor and sensorial capabilities of the agents,
where the characteristics of the perceived data are
grouped into general categories. Such categorical repre-
Figure 1. Given a current sensory situation St and a desired
sensory situation, St + 1 , the inverse model suggests a motor sentations act as concepts that allow agents to select
command, Mt . The forward model uses the current sensory referents of the manipulated symbols (Rodrı́guez,
situation and the efferent copy of the motor command and Hermosillo, & Lara, 2012).
predicts the next sensory situation, St + 1 . The physical grounding framework holds that the
unit of analysis to understand cognition is a sensorimo-
tor cycle (Pfeifer et al., 2007). Here, the development of
execution of a motor command. The comparison may cognitive skills demands a body that is situated and
result in either a match or a specific error, which in the acts, situated actions, under specific contexts, in the
literature has become known as prediction error. This real world. These tenets fall within the framework of
mechanism allows an online goal-directed adjustment Embodied Cognition, as mentioned earlier, particularly
of the motor command required to achieve the desired in what relates to the processing of low-level sensori-
sensory consequences during a specific task situated motor schemes, internal simulations, and multimodal
in a context. Extensive experimental and theoretical associations. Still, it has been argued that, in Embodied
arguments support the hypothesis that the brain uses Cognitive Robotics, there might be the need to take
Internal Models, for motor planning and execution into account behaviors related to interoceptive infor-
(Wolpert, Ghahramani, & Jordan, 1995; Wolpert, mation. Particularly, Hernandez Ochoa (2017) argued
Goodbody, & Husain, 1998a), distinguishing that agents must respond in the lowest levels to emo-
between self and others (Casile & Giese, 2006; tions by satisfying basic body needs and drives that
Knoblich & Flach, 2001), and feeling of agency due allow them survival. Artificial emotions represent an
to a sensory attenuation sensation of self-generated incipient line of research, highly neglected so far in the
actions (Blakemore, Frith, & Wolpert, 1999; Blakemore, area.
Smith, Steel, Johnstone, & Frith, 2000a, Blakemore,
Wolpert, & Frith, 2000b; Frith, Blakemore, & Wolpert,
2000; Wolpert et al., 1998a), among other tasks.
3.1. Implementations in artificial agents
Conceiving the comparator mechanism as a typical To put the concept of Internal Models to the test, dif-
cognitive process, we argue that cognition relates to ferent implementations have been carried out. These
sensorimotor interactions leading to constant learning implementations are presented here in the order of the
and adjustment of sensorimotor schemes which can be complexity of the models and the type of behaviors they
potentially reactivated during simulated interactions. try to address.
Knowledge arises through this interaction and forms
the basis of the cognitive abilities that shape the mind
3.1.1. Navigation. One of the early implementations is
(Lara, Ciria, Escobar, Gaona, & Hermosillo, 2018).
presented by Escobar, Hermosillo, and Lara (2012),
The morphology of an agent (e.g. the distance
where an internal model provided an agent with basic
between its ears or eyes) affects and modifies the
capabilities for navigation by means of self body-map-
perception–action cycle. Cognition depends on the type
ping. Using a stereo camera mounted on a Pioneer 3-
of body and sensory and motor abilities that have epi-
DX robot, the disparity value coming from the two
genetically and ontogenetically developed for specific
images was used as St . The value of Mt was kept con-
environments (Shapiro, 2004). Thus, our body is the
stant, and the output from the model was the disparity
only one with the abilities to perceive as we do (Nagel,
values for t + 1 as well as the predicted bumper
1974; Noë, 2004; Shapiro, 2004). It is for all these rea-
states. The agent was let to perform straight trajec-
sons that betting on sensorimotor schemes, multimodal
tories colliding with obstacles. The forward model
associations, and prediction seems like a plausible
learned the associations between the observed world
model to understand some of the mechanisms that
and the tactile consequences of moving forward. A
underlie cognition.
forward model was coded using standard Multi-layer
Perceptron Networks (Rumelhart, Hinton, &
Williams, 1985) trained with resilient Error Back-pro-
3. Embodied sensorimotor schemes
pagation (Riedmiller & Braun, 1993). Once the model
An open problem in Robotics and AI is how autono- was trained, the tactile prediction was active in the
mous agents can generate internal representations that parts of the field of view where collisions would occur
allow them to manipulate symbols. Furthermore, how allowing a second system to drive the robot to areas
6 Adaptive Behavior
where there were no obstacles. It is worth noting that

in this work, the calculation of the disparity values is
a geometrical solution to the problem of 3D percep-
tion. To avoid using such solution, a system was
designed to learn the distance to obstacles in relation
to its movement in the world (Gaona, 2014; Gaona,
Hermosillo, & Lara, 2012). Here, the authors used a
system that takes as input the two images coming
from the stereo camera and one of 3 different motor
commands. The output of the forward model is the
visual prediction as well as the tactile situation. The Figure 2. The use of St + 1 as the input to the forward model.
tactile information is an array of the same dimensions Given a current sensory situation, St , and a desired sensory
as the images entirely filled with 1’s when there is a situation, St + 1 , the inverse model suggests a motor command,
Mt . The forward model uses the current sensory situation and
collision and 0’s otherwise. The system is composed
the efferent copy of the motor command and predicts the next
of an array of Multi-layer Perceptron Networks that
sensory situation, St + 1 .
function as local predictors. Each of these networks
takes as input a small window of each of the input
images and the motor command. The output of each
(Moons, 1998; Tsai, 1987). This is also true in robotics,
network is a smaller window of the predicted images
where distances are necessary to, among other things,
and a small window of the tactile situation. Together,
navigate safely or grasp objects in reach. In the systems
the networks are capable of predicting the two full
presented here, the agent experiences the affordance of
images and the tactile situation for the same area as
‘‘distance to objects’’ through direct interaction with its
the field of view of the three-dimensional (3D) scene.
surroundings. The affordance learned is based on the
An essential feature of the forward model concept is
motor and sensor capabilities particular to the agent.
that it can be used to perform long-term predictions
The objects are located x number of motor commands,
(LTPs). That is, the predicted output can be fed back to
the agent does not acquire an obstacle notion, and the
the system, and a new prediction is made, which can be
only knowledge it acquires is: ‘‘if I were to execute x
done as long as the error permits, as this accumulates
number of motor commands, I will experience a change
over time (see Figure 2). In Gaona (2014), the author
in sensory situation.’’ It is worth noting that the change
reported reliable predictions of up to 12 steps, allowing
in sensory situation becoming relevant for action is the
the agent to plan collision-free trajectories before mov-
tactile situation, which is learned through interaction.
ing. Taking the initial two images, a tree of visual and
In similar work, Möller and Schenck (2008) let a
tactile predictions is constructed, and the agent can then
robot learn an inverse and forward model to predict
take the path where the tactile predictions are lower.
the visual consequences of executed motor commands.
As in all previous works, the decision of which
They use a set of Internal Models, one for every possi-
motor command to execute, or which prediction branch
ble motor command, which creates conceptual prob-
to follow, is based on the predictions of the tactile sen-
lems on the need of a new pair of models for every new
sory situation. This prediction is not part of the input
motor command, deeming the system very inflexible.
to the system but it is learned from the sensorimotor
Hoffmann (2007) presents a forward model to predict
associations during the interaction of the agent with its
changes in visual information due to motor commands.
environment. However, the motor command to execute
The main difference with our work is that we use multi-
comes from a separate decision-making mechanism not
modal associations, which deliver predictions not only
related to learning or the environment. One could argue
in the visual space but also in the tactile space. These
that it is a non-grounded decision. To address this
predictions contain no credit or value per se, but can be
issue, Gaona, Escobar, Hermosillo, and Lara (2014)
further used by a separate system to aid the navigation
used LTPs to ground the concept of collisions. The sys-
process.
tem was developed in two stages. In the first, a forward
model was trained with disparity values and motor
commands as inputs and disparity values and bumper 3.1.2. Self and others. Bearing in mind that the context
predictions as outputs. Once trained, the forward with which an agent interacts is not limited to the con-
model, using LTP (off-line), associates negatively high ditions and physical characteristics of their environ-
bumper values to the motor commands producing ment, but also to interact with other agents, Arceo,
them. Finally, when in use, the system uses as input Escobar, Hermosillo, and Lara (2013) used sensorimo-
only disparity values and executes motor commands tor learning as the basis for the recognition and imita-
associated with low bumper values. tion of motor actions of another agent. It has been
In traditional artificial vision research, methods to proposed that Internal Models can be used to explain
estimate distance are based on geometric models the functioning of the mirror neuron system (Miall,
Lara et al. 7
2003). Through interaction with their environment, problem (Ince et al., 2009; Schmidt et al., 2016), because
agents acquire a repertoire of sensorimotor schemes. of the non-stationary nature of the ego-noise generated
The observed sensory consequences of an action allow by the autonomous agents (Ince et al., 2010). In an
a conversion by the inverse model into potential motor early bio-inspired approximation, Ince et al. (2009)
commands so that the observed movement can be pre- showed that it is possible to suppress this noise by let-
dicted by the forward model enabling imitation. ting the agent perform predictions about the sound pro-
The authors adhere to the possibility that mirror duced when executing a motor command and then
neurons in premotor areas could function as predictors subtracting that prediction from the total sound input.
of the observed behaviors from an associative learning Following these ideas, Pico et al. (2017) propose that
(Heyes, 2010). These associations are based on the sen- it is possible to not only solve this problem using pre-
sorimotor experiences of the agent and coded by dictions and forward models, but more importantly the
Internal Models. Similar work was performed by ego-noise encodes useful information about the agent
Oztop, Kawato, and Arbib (2006) in a simulated robot and its environment. To test this, Pico et al. (2017)
arm and by Demiris and Hayes (2002) on a developed a two-wheeled differential agent, capable of
demonstrator–imitator simulated environment. synchronizing motor commands and sensory data using
Using mobile agents, this idea was put to the test two controllers. A low-level controller is in charge of
(Arceo et al., 2013). Here, the authors let an agent asso- the sensors, and a high-level controller is in charge of
ciate the visual consequences of small movements to the making predictions and pre-processing the audio.
right and left. The visual information is the coordinates Using this agent and forward models, implemented
of a fiduciary marker in its field of view. The forward as a Multi-layer Perceptron Network, Pico et al. (2017)
model takes as input visual information and motor run two different experiments: In the first experiment
commands and delivers as output the changes in visual named ‘‘ego-noise classification,’’ the authors try to
information. The model is coded with a Multi-Layer classify the ego-noise of the agent as a consequence of
Perceptron network. The inverse model is a function its own motor commands using predictions. The motor
that finds a vector in a pool containing all collected tri- commands correspond to the velocity profiles of the
plets of visual information for time t, denoted by Vt , two motors in the agent. In total, 11 forward models
motor command, Mt , and the corresponding change in were trained, each corresponding to a velocity profile
visual information at time t + 1, denoted by vt + 1 . and predicting the ego-noise produced. Once the mod-
Once trained, the agent observes movements of the els are trained, the agent executes motor profiles and
marker mounted on a second agent. Using the observed the ego-noise produced is compared to the predictions
vt and vt + 1 , it uses the inverse model containing its of all the models. Finding the smallest Euclidean dis-
motor repertoire, to find the motor commands that best tance between the two signals, the motor profile caus-
bring about the visual changes. These motor commands ing the noise is found. The results of this experiment
are passed to the forward model, together with the ini- demonstrate that forward models can, in fact, code the
tial visual situation to perform a set of predicted visual relation between a motor command and the sound that
situations. The system then compares the suggested this produces.
visual situations with the observed ones in order to exe- In a second experiment ‘‘Motor babbling and ego-
cute that motor command in the triplet with the smal- noise prediction,’’ the focus was on the detection of
lest error in visual information. changes in the environment of the agent. A forward
A final work using mobile agents adheres to the model was trained in a flat arena to predict the sounds
claim that ownership of actions and its consequences in produced by the robot using random velocities for the
the world are fundamental components of the cognitive motors. Once the model is trained, the robot is tested
process of agency (Tsakiris, Schütz-Bosbach, & in the original arena and in a second one, containing a
Gallagher, 2007; Weiss, Tsakiris, Haggard, & Schütz- ramp. When the robot moves on the ramp, the error
Bosbach, 2014). The association between actions and between the ego-noise and the prediction increases sig-
sensory consequences in the world is at the core of the nificantly, suggesting that this type of implementation
notion of forward models and has been put to the test can not only give information about the agent, but also
by Pico, Schillaci, Hafner, and Lara (2017). The detect changes in its environment.
authors attempt to tackle the ego-noise problem in an
autonomous agent. The main issue here is for the agent
to suppress the sounds produced itself while moving 3.1.3. Humanoids. While all of the previous works have
around to increase its performance on the processing of been carried out using mobile agents, research has also
external sound sources, for tasks such as speech recog- been conducted using forward models in humanoid
nition or localization of other sound sources. platforms, namely, the NAO robot from Aldebaran.
Traditional methods for audio processing, such as The agent has been let to learn its own body capabilities
static noise reduction techniques or the use of different by performing motor babbling (Demiris & Dearden,
microphone arrays’ geometries, fail to address this 2005). The NAO robot learned the visual consequences
8 Adaptive Behavior
of moving its arm around and how these changed when absent without affecting its performance. Different cog-
the use of a tool modified its motor capabilities nitive processes have been modeled using SOIMA,
(Schillaci, Hafner, & Lara, 2012a). including coding the reaching space of the NAO’s arm,
Forward models were also used to let the agent learn saccade movements of the eyes, foveation of a point in
different actions by associating the motor commands to space and then reaching it with the arm (see Figure 3),
execute them with the visual consequences while execut- and the characterization of shorthand trajectories
ing those actions (Schillaci, Lara, & Hafner, 2012b). (Escobar Juárez, 2016). As can be seen from the exam-
Once trained, the system was capable of correctly recog- ple, the SOIMA can propagate information in any
nizing observed actions. The same principle is extended direction depending of the needs of the task, depending
to let the agent know whether an observed hand trajec- on the availble sensory or motor information.
tory is performed itself or by another agent based on We believe that this type of knowledge acquisition,
the amount of prediction error processed (Schillaci, based on a Piagetian motor babbling strategy (Harvey,
Hafner, Lara, & Grosjean, 2013). 2018), is not only in accordance with the principles of
In all the previously described implementations, the Embodied Cognition but also serves as a proof of con-
forward models were coded using Multi-layer cept. Namely, an agent can acquire the most primitive
Perceptron Networks, a straigthforward strategy that tools to interact with its environment by first forming a
allows the designer to concentrate on the modeled cog- basic body image based on its sensorimotor capabilities
nitive processes as well as the cognitive architecture. and then use this to further explore the world incremen-
However, the nature of these tools presents some short- tally acquiring more complex and adaptive behaviors.
comings such as the impossibility of adding knowledge
after training and more importantly does not work cor-
rectly when there is information missing.
3.2. Natural agents
The case of missing information is of particular To further test the concept of Internal Models (Wolpert
interest given that, as an example, when an agent learns et al., 1995; Wolpert, Miall, & Kawato, 1998b) as expla-
the reaching space of its arm, it does so by performing a natory mechanisms, several experiments have been car-
multimodal association. This association can include ried out with natural agents. As stated before, Internal
motor, tactile, and visual information. However, when Models are thought to be involved in learning and
an agent observes an arm moving, it is capable of recog- motor control; at the same time, they are supposed to
nizing the movement and reproduce the action even be at the center of action and perception. We have car-
when the agent does not have access to first-person ried out several experiments with humans related to
motor or visual information. Among other reasons, this tasks that involve the sense of agency, recognition of
led to a change of machine learning paradigm. actions, and the prediction of external events.
To tackle some of these issues, Escobar Juárez, A core element in the proposal of Internal Models is
Schillaci, Hermosillo Valadez, and Lara Guzmán their ability to explain the sense of agency understood
(2016) presented a Self-Organizing Internal Models as the sense of control over our actions and their effects
Architecture (SOIMA) based on Hebbian learning and (Haggard & Tsakiris, 2009). The sense of agency in the
self-organizing maps (SOMs; Kohonen, 1998). This framework of Internal Models arises as the result of the
architecture is inspired by the work of Morse, Greef, ability of the sensorimotor system to predict the conse-
Belpaeme, and Cangelosi (2010) where the iCub huma- quences of our actions. The predicted consequences
noid robot associates different modalities in a similar and the incoming sensory information are then com-
fashion but without the use of predictions based on pared in low-level sensorimotor processes (Haggard &
Internal Models. Chambon, 2012). However, alternative proposals sug-
In SOIMA, the regularities of a sensory modality are gest that the agency experience is the result of high-level
coded in a SOM so that there can be as many SOMs as cognitive processes, specifically causal inferences about
sensory modalities. The modalities are then connected the relationships between thoughts and actions
using Hebbian learning, specifically between a sensory (Wegner & Wheatley, 1999). Therefore, from this per-
situation, a motor change, and the sensory conse- spective, our experience of agency, like any other causal
quences of this. The connections are organized in what inference, lays its foundations around contingencies
we call a Multimodal Representation Map (sensorimo- that allow us to infer, in this particular case, an appar-
tor schemes). The result is a modular architecture that ent authorship.
codes the forward and the inverse model under the Taking this panorama as a reference, Astorga (2016)
same structure. Furthermore, SOMs and in particular investigated the effect of perceptual differences between
dynamic self-organizing maps (DSOMs; Rougier & judging the onset of a voluntary action and the onset of
Boniface, 2011) can keep learning, and new schemes its sensory consequence (sense of agency) and the judg-
are added to the knowledge of the agent as this inter- ments related to the onset of two contingent events
acts with its environment. When the architecture is in (causal inference). This work was carried out using the
use, information of one or more modalities can be paradigm of temporal binding proposed by Haggard,
Lara et al. 9
Figure 3. The connections in this SOIMA can be used by the NAO humanoid robot for foveation of a stimulus in its field of view
and then to bring its arm to point at it. The bottom maps are modal maps organizing the regularities of sensory information. The top
maps join events together as multimodal representations. So, for instance, an initial and a final (goal) position in the vision map are
connected to a unit in the Saccadic MMR, and this unit is connected to a motor command for the head which will bring the visual
stimulus to the center of the image. The right section of the SOIMA will bring the head and the arm to absolute positions in space,
with the arm pointing to the stimulus.
Clark, and Kalogeras (2002). Astorga (2016) found a 4. Conclusion and further research
temporal binding between events that involves agency
The research carried out in our laboratory follows the
significantly greater than when related events are judged
principles of the 4E cognition framework, specifically the
contingently. The result is consistent with Haggard and
ideas of Embodied Cognition. Interaction of agents with
Chambon (2012), in that the sense of agency is the result
their environment and sensorimotor schemes that emerge
of comparisons made by sensorimotor low-level cogni-
tive processes. The experimental results suggested that through this interaction have been the backbone of our
when there is a sensory consequence associated with a research. It is our conviction that Internal Models are
voluntary action (motor command), and because this the natural candidates for coding the aforementioned
information is highly relevant for the organism, priority schemes, as the capabilities to perform sensorimotor pre-
is given to this association, strengthening it through a dictions is an essential feature of cognitive agents.
temporal link between both events. As previously described, instead of imitating high-
Under the hypothesis of Internal Models and their level cognitive tasks in order to understand cognition, a
relationship with the sense of agency, Rojas (2014) fraction of the AI community turned to study what
worked on the issue of action recognition. This become known as low-level cognitive processes. We
research was based on the assumption that individu- believe that the study of sensorimotor schemes and the
als process perceptual information together with processes and capabilities that emerge from these repre-
motor information. The result of this processing is sent the fundaments for further research in Embodied
the acquisition of sensorimotor schemes, particular to Cognitive Robotics. The quest is not over yet; however,
the individual. When recognizing an action, these important steps have been taken.
schemes are used in simulation to solve the task suc- The experiments summarized here present results
cessfully. A fundamental element in action recogni- that support the main tenets of Embodied Cognition
tion comes from the discrimination between an action and the 4E cognition approaches in general, namely,
recognized as our own, or as executed by someone the fact that cognition must be understood as occurring
else (Knoblich, Seigerschmidt, Flach, & Prinz, 2002). in agents that have a body, and with this, they interact
Rojas (2014) found that individuals recognized faster in a specific environment. The use of Internal Models
between actions (take a rubber ball, hit it with the in our work serves as a proof of concept of different
fist, or make it roll) when these were previously exe- strategies for the acquisition of basic cognitive tools for
cuted by them. These findings can be seen as a result agents, laying the ground for further development of
of perceptual anticipation, a phenomenon that exem- more complex and adaptive behaviors. The implemen-
plifies the processing and preparation of actions. tations of Internal Models can be read as agents acquir-
Recognition of actions is carried out through simula- ing the capability of experiencing affordances of the
tions using as a reference the sensorimotor informa- environment they interact with. These affordances are
tion coded in the Internal Models of the individual coded using sensorimotor schemes particular to the
(Wilson & Knoblich, 2005; Wolpert et al., 1998b). agents, their bodily properties, and their environment.
10 Adaptive Behavior
Another significant contribution comes from the Clark, 2014). In the IFM, motor commands are
experiments centered on prediction, and the study of replaced by top-down sensory predictions issued by the
the capabilities agents acquire for successful interaction forward model, removing any distinction between
with their environment when they become predicting motor and sensory processing and also the need of an
agents. It has been shown that using Internal Models inverse model. Thus, IFMs posit a more complex (dis-
to code sensorimotor schemes in order to predict the tributed) forward model, mapping prior beliefs about
sensory consequences of actions allows different types the desired trajectories to sensory consequences
of behaviors to emerge. These behaviors are in accor- (Pickering & Clark, 2014, p. 456). However, in classic
dance with the principles of the Embodied Cognition control theory, the inverse model has played a signifi-
framework; particularly, we have addressed acquisition cant role despite learning difficulties inherent to the
of affordances, agency, recognition and imitation of model. Rejecting the idea and need of an inverse model
actions, and simulations. brings interesting issues but should be considered care-
Our experiments in natural agents have allowed us fully. As stated earlier, SOIMA intrinsically codes
to investigate important cognitive processes, such as inverse and forward models, so the relevance of the
action recognition and agency, in the light of Internal inverse model could be further studied. Second, predic-
Models. At the same time, these experiments provide tive processing suggests the concept of precision
valuable insights for the implementation of Internal weights as an attentional mechanism which modulates
Models in artificial agents. the incoming sensory information relevant to a context.
Following the latest paradigm in the Cognitive This attentional mechanism has generated many con-
Sciences, current research in our laboratory also focuses troversies and demands further research and under-
on modeling of perception and action in the framework standing. We argue that given the modular architecture
of predictive processing (Clark, 2013, 2015a). Predictive of the SOIMA, SOMs could be added to potentially
processing represents one of the most radical views code context to control the flow and processing of rele-
about the flow of sensory information processing dur- vant sensory information. Such a structure could be
ing perception, asserting that it is primarily a top-down considered as a precision weighting mechanism.
process, rather than a bottom-up process. However, the Furthermore, the effects of prediction error on the flow
suggestion of an inverted structure of the perceptual of upcoming information has not been widely modeled
system requires agents to first learn forward (genera- and explored. In the SOIMA, the flow of information
tive) models which capture the statistical structure of is not restricted to one direction. The bidirectional flow
the incoming sensory information by tracking the of information allows studying the effects of prediction
causes of that specific structure (Clark, 2013). Cognitive error on Hebbian learning processes and the way infor-
agents are thought to be actively predicting the stream mation propagates.
of sensory stimulation before it occurs, and only unex- Considering the core results of our work so far, we
pected sensed deviations from the predicted state (pre- are convinced that using Internal Models to learn and
diction error) are further processed bottom-up (Clark, recalibrate sensorimotor schemes leads to more com-
2015a, 2015b). The perceptual and motor systems work plex behavioral patterns for further development of
together to predict incoming sensory information in all intelligence. In this framework, we hold that intelligence
modalities: exteroceptive, interoceptive, and, in the case can be concieved as a behavior which leads an agent to
of the motor system, proprioceptive (Adams, Shipp, & the succesful execution of a task situated in a certain
Friston, 2013). In addition, in this framework, percep- context. Consequently, we claim that Internal Models
tion and action constitute ‘‘active inference’’ (Friston, are valuable tools for research which can significantly
Daunizeau, Kilner, & Kiebel, 2010) or ‘‘action-oriented contribute to the a better understanding of cognition.
predictive processing’’ (Clark, 2013). Through actions,
cognitive agents minimize prediction error about Acknowledgements
incoming sensory information, evidencing the intimate
The authors would like to thank the current and former
relationship between perception and action (Clark,
members and close collaborators of the Cognitive Robotics
2013; Friston, Kilner, & Harrison, 2006). Laboratory, particularly Guido Schillaci, Jorge Hermosillo,
In this line of research, we are focusing on the fol- Verena Hafner, Esau Escobar, Wilmer Gaona, Karla Baeza,
lowing open questions and future implementations. Diana Valenzo, Jorge Hernandez, Abraham Martinez, Javier
First, active inference suggests that there is no need for Vargas, David Castillo, Dairon Rodriguez, Eduardo Rojas
an inverse model or an optimal control mechanism, as and, Antonio Pico.
motor commands are conceived as proprioceptive pre-
dictions that drive action (Clark, 2015a; Pickering & Declaration of conflicting interests
Clark, 2014). In this framework, an integral forward The authors declared no potential conflicts of interest con-
model (IFM) is a forward (generative) model and the cerning the research, authorship, and publication of this
core machinery of perception and action (Pickering & article.
Lara et al. 11
Funding Clark, A. (2015a). Embodied prediction. In: T. Metzinger, &

The author(s) received no financial support for the research, J. M. Windt (Eds.). Open mind (pp 1–21). Frankfurt Am
authorship, and/or publication of this article. Main, Germany: Mind Group.
Clark, A. (2015b). Surfing uncertainty: Prediction, action, and
ORCID iD the embodied mind. Oxford, UK: Oxford University Press.
Clark, A., & Grush, R. (1999). Towards a cognitive robotics.
Bruno Lara https://orcid.org/0000-0002-9844-6435 Adaptive Behavior, 7, 5–16.
Demiris, J., & Hayes, G. M. (2002). Imitation in animals and
References artifacts. In C. L. Nehaniv, K. Dautenhahn (Eds.), Imita-
tion as a dual-route process featuring predictive and learning
Adams, R. A., Shipp, S., & Friston, K. J. (2013). Predictions components: A biologically plausible computational model
not commands: Active inference in the motor system. (pp. 327–361). Cambridge, MA: The MIT Press.
Brain Structure & Function, 218, 611–643. Demiris, Y., & Dearden, A. (2005). From motor babbling to
Adolph, K., & Berger, S. (2006). Motor development. In hierarchical learning by imitation: A robot developmental
W. Damon, R. Lerner, D. Kuhn, & R. S. Siegler (Eds.), pathway. Retrieved from http://cogprints.org/4961/1/
Handbook of child psychology: Cognition, perception, and demiris.pdf
language (vol. 2, pp. 161–213, 6th ed.). New York, NY: Desmurget, M., & Grafton, S. (2000). Forward modeling
John Wiley & Sons. allows feedback control for fast reaching movements.
Anderson, M. L. (2003). Embodied cognition: A field guide. Trends in Cognitive Sciences, 4, 423–431.
Artificial Intelligence, 149, 91–130. Di Paolo, E. (2009). Extended life. Topoi, 28(1), 9–21.
Arceo, D. C., Escobar, E., Hermosillo, J., & Lara, B. (2013). Dreyfus, H. L. (1967). Why computers must have bodies in order
Modelado de un sistema de neuronas espejo en un agente to be intelligent. The Review of Metaphysics, 21(1), 13–32.
autónomo artificial model of a mirror neuron system in an Escobar, E., Hermosillo, J., & Lara, B. (2012). Self body
artificial autonomous agent. Nova Scientia, 5, 51–72. mapping in mobile robots using vision and forward mod-
Asada, M., Hosoda, K., Kuniyoshi, Y., Ishiguro, H., Inui, T., els. In Proceedings of the 9th IEEE Electronics, Robotics
Yoshikawa, Y., & Yoshida, C. (2009). Cognitive develop- and Automotive Mechanics Conference (CERMA) (pp.
mental robotics: A survey. IEEE Transactions on Autono- 72–77). New York, NY: IEEE Computer Society.
mous Mental Development, 1, 12–34. Escobar Juárez, E. (2016) Modelado de una arquitectura de
Astorga, D. (2016) Efecto de Vinculación Temporal: un efecto modelos internos para la generación de esquemas sensorimo-
dependiente de la agencia o de la contigüidad y predictibili- trices (PhD thesis). Instituto de Investigación en Ciencias
dad de los eventos? (Master’s thesis). Centro de Investiga- Básicas y Aplicadas, Cuernavaca, Mexico.
ción en Ciencias Cognitivas, Cuernavaca, Mexico. Escobar Juárez, E., Schillaci, G., Hermosillo Valadez, J., &
Barsalou, L. W. (2008). Grounded cognition. Annual Review Lara Guzmán, B. (2016). A self-organized internal models
of Psychology, 59, 617–645. architecture for coding sensory–motor schemes. Frontiers
Blakemore, S. J., Frith, C. D., & Wolpert, D. M. (1999). Spa- in Robotics and AI, 3, Article 22.
tio-temporal prediction modulates the perception of self- Friston, K., Kilner, J., & Harrison, L. (2006). A free energy
produced stimuli. Journal of Cognitive Neuroscience, 11, principle for the brain. Journal of Physiology-Paris, 100,
551–559. 70–87.
Blakemore, S. J., Smith, J., Steel, R., Johnstone, E., & Frith, Friston, K. J., Daunizeau, J., Kilner, J., & Kiebel, S. J. (2010).
C. D. (2000a). The perception of self-produced sensory sti- Action and behavior: A free-energy formulation. Biological
muli in patients with auditory hallucinations and passivity Cybernetics, 102, 227–260.
experiences: Evidence for a breakdown in self-monitoring. Frith, C. D., Blakemore, S. J., & Wolpert, D. M. (2000).
Psychological Medicine, 30, 1131–1139. Explaining the symptoms of schizophrenia: Abnormalities
Blakemore, S. J., Wolpert, D., & Frith, C. (2000b). Why can’t in the awareness of action. Brain Research Reviews, 31,
you tickle yourself?Neuroreport, 11, R11–R16. 357–363.
Brooks, R. A. (1991). Intelligence without representation. Gaona, W. (2014) Percepción de Distancia en un Agente Artifi-
Artificial Intelligence, 47, 139–159. cial a trave´s de la Simulación de Esquemas Sensorimotrices
Buckner, C., & Fridland, E. (2017). What is cognition? Angsty (PhD thesis). Facultad de Ciencias, Montevideo, Uruguay.
monism, permissive pluralism(s), and the future of cogni- Gaona, W., Escobar, E., Hermosillo, J., & Lara, B. (2015).
tive science. Synthese, 194, 4191–4195. Anticipation by multi-modal association through an artifi-
Casile, A., & Giese, M. A. (2006). Nonvisual motor training cial mental imagery process. Connection Science, 27, 68–88.
influences biological motion perception. Current Biology, Gaona, W., Hermosillo, J., & Lara, B. (2012). Distance per-
16, 69–74. ception in mobile robots as an emergent consequence of
Chiel, H. J., & Beer, R. D. (1997). The brain has a body: visuo-motor cycles using forward models. In Proceedings
Adaptive behavior emerges from interactions of nervous of the IEEE 9th Electronics, Robotics and Automotive
system, body and environment. Trends in Neurosciences, Mechanics Conference (pp. 42–47). New York, NY: IEEE
20, 553–557. Computer Society.
Clark, A. (1999). An embodied cognitive science?Trends in Haggard, P., & Chambon, V. (2012). Sense of agency. Current
Cognitive Sciences, 3, 345–351. Biology, 22, R390–R392.
Clark, A. (2013). Whatever next? Predictive brains, situated Haggard, P., Clark, S., & Kalogeras, J. (2002). Voluntary
agents, and the future of cognitive science. Behavioral and action and conscious awareness. Nature Neuroscience, 5,
Brain Sciences, 36, 181–204. 382–385.
Haggard, P., & Tsakiris, M. (2009). The experience of agency: robotics in Mechatronics (pp. 321–348). Berlin, Germany:
Feelings, judgments, and responsibility. Current Directions Springer International Publishing AG.
in Psychological Science, 18, 242–246. Lara Guzmán, B., Hermosillo Valadez, J., & Baeza Mariscal,
Harvey, K. (2018). An open-ended approach to Piagetian K. (2017). Intentionality and internal models in artificial
development of adaptive behavior. Open Access Library agents. Pragmatics & Cognition, 23, 209–237.
Journal, 5, e4434. Lüchinger, A. B., Hadders-Algra, M., Van Kan, C. M., & de
Hernandez Ochoa, J. L. (2017) Robotica Cognitiva y Emo- Vries, J. I. (2008). Fetal onset of general movements.
ciones (Master’s thesis). Centro de Investigación en Cien- Pediatric Research, 63, 191–195.
cias Cognitivas, Cuernavaca, Mexico. McCarthy, J., Minsky, M. L., Rochester, N., & Shannon, C.
Heyes, C. (2010). Where do mirror neurons come from?Neur- E. (2006). A proposal for the Dartmouth summer research
oscience and Biobehavioral Reviews, 34, 575–583. project on artificial intelligence. AI Magazine, 27, 12.
Hoffmann, H. (2007). Perception through visuomotor antici- Maye, A., & Engel, A. K. (2012). Using sensorimotor contin-
pation in a mobile robot. Neural Networks, 20, 22–33. gencies for prediction and action planning. In T. Ziemke,
Ince, G., Nakadai, K., Rodemann, T., Hasegawa, Y., Tsujino, C. Balkenius, & J. Hallam (Eds.), Proceedings of the Inter-
H., & Imura, J. I. (2009). Ego noise suppression of a robot national Conference on Simulation of Adaptive Behavior
using template subtraction. In Proceedings of the IEEE/ (pp. 106–116). Berlin, Germany: Springer.
RSJ International Conference on Intelligent Robots and Menary, R. (2010). Introduction to the special issue on 4E
Systems (IROS) (pp. 199–204). New York, NY: IEEE. cognition. Phenomenology and the Cognitive Sciences, 9,
Ince, G., Nakadai, K., Rodemann, T., Hasegawa, Y., Tsujino, 459–463.
H., & Imura, J. I. (2010). A hybrid framework for ego Mendoza Bock, E. (2013) Los términos nómadas de las Cien-
noise cancellation of a robot. In N. Xi, & W. R. Hamel cias Cognitivas: el caso de los Modelos Directos (Master’s
(Eds.), Proceedings of the IEEE International Conference thesis). Facultad de Humanidades, Montevideo, Uruguay.
on Robotics and Automation (pp. 3623–3628). New York, Miall, R. C. (2003). Connecting mirror neurons and forward
NY: IEEE. models. Neuroreport, 14, 2135–2137.
Jordan, M. I., & Rumelhart, D. E. (1992). Forward models: Miall, R. C., & King, D. (2008). State estimation in the cere-
Supervised learning with a distal teacher. Cognitive Sci- bellum. The Cerebellum, 7, 572–576.
ence, 16, 307–354. Miall, R. C., & Wolpert, D. M. (1996). Forward models for
Karniel, A. (2002). Three creatures named ‘‘forward model.’’ physiological motor control. Neural Networks, 9,
Neural Networks, 15, 305–307. 1265–1279.
Kawato, M. (1999). Internal models for motor control and Miall, R., Weir, D. J., Wolpert, D. M., & Stein, J. (1993). Is
trajectory planning. Current Opinion in Neurobiology, 9, the cerebellum a smith predictor? Journal of Motor Beha-
718–727. vior, 25, 203–216.
Kiverstein, J. (2007). Could a robot have a subjective point of Mingers, J. (2001). Embodying information systems: The con-
view?Journal of Consciousness Studies, 14, 127–139. tribution of phenomenology. Information and Organiza-
Kiverstein, J., & Clark, A. (2009). Introduction: Mind embo- tion, 11, 103–128.
died, embedded, enacted: One church or many?Topoi, 28, Möller, R., & Schenck, W. (2008). Bootstrapping cognition
1–7. from behavior a computerized thought experiment. Cogni-
Knoblich, G., & Flach, R. (2001). Predicting the effects of tive Science, 32, 504–542.
actions: Interactions of perception and action. Psychologi- Moons, T. (1998). Lecture notes in computer science. In
cal Science, 12, 467–472. R. Koch, & L. V. Gool (Eds.), 3D structure from multiple
Knoblich, G., Seigerschmidt, E., Flach, R., & Prinz, W. images of large-scale environments (Chapter: A guided tour
(2002). Authorship effects in the prediction of handwriting through multi view relations) (vol. 1506, pp. 304–346). Ber-
strokes: Evidence for action simulation during action per- lin, Germany: Springer.
ception. The Quarterly Journal of Experimental Psychol- Moravec, H. (1984) Locomotion, vision and intelligence. In
ogy: Section A, 55, 1027–1046. M. Brady, & R. Paul (Eds.), Robotics Research (pp. 215–
Koeinig, P., Ktihnberger, K. U., & Kietzmann, T. C. (2013). 224). Cambridge, MA: The MIT Press.
A unifying approach to high-and low-level cognition. In Morse, A. F., Greef, J. D., Belpaeme, T., & Cangelosi, A.
U. Gähde (Ed.), Models, simulations, and the reduction of (2010). Epigenetic robotics architecture (era). IEEE Trans-
complexity (vol. 4, pp. 117–139). Berlin, Germany: De actions on Autonomous Mental Development, 2, 325–339.
Gruyter. Nagel, T. (1974). What is it like to be a bat? The Philosophical
Kohonen, T. (1998). The self-organizing map. Neurocomput- Review, 83, 435–450.
ing, 21, 1–6. Nasuto, S. J., & Hayashi, Y. (2016). Anticipation: Beyond
Lacquaniti, F., Ivanenko, Y. P., & Zago, M. (2012). Develop- synthetic biology and cognitive robotics. Biosystems, 148,
ment of human locomotion. Current Opinion in Neurobiol- 22–31.
ogy, 22, 822–828. Newell, A., & Simon, H. A. (1976). Computer science as
Lakoff, G., & Johnson, M. (1999). Philosophy in the Flesh empirical inquiry: Symbols and search. Communications of
(vol. 4). New York, NY: Basic books. the ACM, 19, 113–126.
Lara, B., Ciria, A., Escobar, E., Gaona, W., & Hermosillo, J. Newen, A. (2017). What are cognitive processes? An example-
(2018). Cognitive robotics: The new challenges in artificial based approach. Synthese, 194, 4251–4268.
intelligence. In O. O. S. Villegas, M. Nandayapa, & I. Soto Noë, A. (2004). Action in perception. Cambridge, MA: The
(Eds.), Advanced topics on computer vision, control and MIT Press.
Lara et al. 13
Oztop, E., Kawato, M., & Arbib, M. (2006). Mirror neurons distinction in robotics. In J. Crandall, & S. Sabanovic,
and imitation: A computationally guided review. Neural (Eds.), Proceedings of the 8th ACM/IEEE International
Networks, 19, 254–271. doi:10.1016/j.neunet.2006.02.002. Conference on Human-Robot Interaction (pp. 223–224).
Pezzulo, G., Barsalou, L. W., Cangelosi, A., Fischer, M. H., New York, NY: IEEE.
McRae, K., & Spivey, M. J. (2011). The mechanics of Schillaci, G., Lara, B., & Hafner, V. (2012b). Internal simula-
embodiment: A dialog on embodiment and computational tions for behaviour selection and recognition. In A. Salah,
modeling. Frontiers in Psychology, 2, 5. J. Ruiz-del Solar, C. Mericli, & P. Y. Oudeyer (Eds.), Human
Pfeifer, R., & Bongard, J. (2006). How the body shapes the behavior understanding (Lecture notes in computer science)
way we think: A new view of intelligence. Cambridge, MA: (vol. 7559, pp. 148–160). Berlin, Germany: Springer.
The MIT Press. Schmidt, A., Deleforge, A., Kellermann, W., Schmidt, A.,
Pfeifer, R., Lungarella, M., & Iida, F. (2007). Self-organiza- Deleforge, A., Kellermann, W., & Kellermann, W. (2016).
tion, embodiment, and biologically inspired robotics. Sci- Ego-noise reduction using a motor data-guided multichan-
ence, 318, 1088–1093. nel dictionary. In W. Burgard (Ed.) Proceedings of the
Pfeifer, R., & Scheier, C. (2001). Understanding intelligence. IEEE/RSJ International Conference on Intelligent Robots
Cambridge, MA: The MIT Press. and Systems (IROS) (pp. 1281–1286). New York, NY:
Pickering, M. J., & Clark, A. (2014). Getting ahead: Forward IEEE.
models and their place in cognitive architecture. Trends in Shapiro, L. (2007). The embodied cognition research pro-
Cognitive Sciences, 18, 451–456. gramme. Philosophy Compass, 2, 338–346.
Pico, A., Schillaci, G., Hafner, V. V., & Lara, B. (2017). How Shapiro, L. (2010). Embodied cognition. London, England:
do I sound like? Forward models for robot ego-noise pre- Routledge.
diction. In J. Santos - Victor, & G. Sandini, (Eds.), Pro- Shapiro, L. A. (2004). The mind incarnate. Cambridge, MA:
ceedings of the Joint IEEE International Conference on The MIT Press.
Development and Learning and Epigenetic Robotics (pp. Tsai, R. Y. (1987). A versatile camera calibration technique
246–251). New York, NY: IEEE. for high-accuracy 3D machine vision metrology using off-
Riedmiller, M., & Braun, H. (1993). A direct adaptive method the-shelf TV cameras and lenses. Journal of Robotics and
for faster backpropagation learning: The RPROP algo- Automation, 3, 323–344.
rithm. In Proceedings of the IEEE International Conference Tsakiris, M., Schütz-Bosbach, S., & Gallagher, S. (2007). On
on Neural Networks (pp. 586–591). New York, NY: IEEE. agency and body-ownership: Phenomenological and neu-
Robbins, P., & Aydede, M. (2009). The Cambridge handbook rocognitive reflections. Consciousness and Cognition, 16,
of situated cognition. Cambridge, UK: Cambridge Univer- 645–660.
sity Press. Van Gelder, T., & Port, R. F. (1995). It’s about time: An over-
Rodrı́guez, D. (2012) Análisis de la Cognición Corporizada view of the dynamical approach to cognition. In R. Port,
desde la perspectiva de la fenomenologı´a (Master’s thesis). & T. van Gelder (Eds.), Mind as motion: Explorations in
Facultad de Humanidades, Montevideo, Uruguay. the dynamics of cognition (pp. 1–43). Cambridge, MA: The
Rodrı́guez, D., Hermosillo, J., & Lara, B. (2012). Meaning in MIT Press.
artificial agents: The symbol grounding problem revisited. Varela, F. J., Thompson, E., & Rosch, E. (2017). The embo-
Minds and Machines, 22, 25–34. died mind: Cognitive science and human experience. Cam-
Rojas, E. (2014) Estudio sobre la caracterización de acciones bridge, MA: The MIT press.
basado en modelos internos (Master’s thesis). Facultad de von Hofsten, C. (2003). On the development of perception and
Humanidades, Montevideo, Uruguay. action. London, England: SAGE.
Rougier, N., & Boniface, Y. (2011). Dynamic self-organising von Hofsten, C. (2007). Action in development. Developmen-
map. Neurocomputing, 74, 1840–1847. tal Science, 10, 54–60.
Rowlands, M. (2010). The new science of the mind: From von Hofsten, C. (2009). Action, the foundation for cognitive
extended mind to embodied phenomenology. Cambridge, development. Scandinavian Journal of Psychology, 50,
MA: The MIT Press. 617–623.
Rumelhart, D. E., Hinton, G. E., & Williams, R. J. (1985). Ward, D., & Stapleton, M. (2012). Es are good. In F. Paglieri
Learning internal representations by error propagation (Ed.), Consciousness in interaction: The role of the natural
(Technical Report). California Univ San Diego La Jolla and social context in shaping consciousness (pp. 86–89).
Inst for Cognitive Science. Retrieved from http:// Amsterdam, The Netherlands: John Benjamins Publishing.
www.dtic.mil/dtic/tr/fulltext/u2/a164453.pdf Webb, B. (2004). Neural mechanisms for prediction: Do insects
Schillaci, G., Hafner, V. V., & Lara, B. (2012a). Coupled have forward models? Trends in Neurosciences, 27, 278–282.
inverse-forward models for action execution leading to Wegner, D. M., & Wheatley, T. (1999). Apparent mental cau-
tool-use in a humanoid robot. In J. Han, A. Tapus, & J. sation: Sources of the experience of will. American Psychol-
Crandall. (Eds.), Proceedings of the 7th ACM/IEEE Inter- ogist, 54, 480–492.
national Conference on Human-Robot Interaction (pp. 231– Weiss, C., Tsakiris, M., Haggard, P., & Schütz-Bosbach, S.
232). New York, NY: IEEE. (2014). Agency in the sensorimotor system and its relation
Schillaci, G., Hafner, V. V., & Lara, B. (2016). Exploration to explicit action awareness. Neuropsychologia, 52, 82–92.
behaviors, body representations, and simulation processes Wheeler, M., & Clark, A. (2008). Culture, embodiment and
for the development of cognition in artificial agents. Fron- genes: Unravelling the triple helix. Philosophical Transactions
tiers in Robotics and AI, 3, 39. of the Royal Society B: Biological Sciences, 363, 3563–3575.
Schillaci, G., Hafner, V. V., Lara, B., & Grosjean, M. (2013). Wilson, M. (2002). Six views of embodied cognition. Psycho-
Is that me? Sensorimotor learning and self-other nomic Bulletin & Review, 9, 625–636.
Wilson, M., & Knoblich, G. (2005). The case for motor invol- human superior parietal lobe. Nature Neuroscience, 1,
vement in perceiving conspecifics. Psychological Bulletin, 529–533.
131, 460–473. Wolpert, D. M., Miall, R. C., & Kawato, M. (1998b). Inter-
Wolpert, D. M., Ghahramani, Z., & Flanagan, J. R. (2001). nal models in the cerebellum. Trends in Cognitive Sciences,
Perspectives and problems in motor learning. Trends in 2, 338–347.
Cognitive Sciences, 5, 487–494. Zambelli, M., & Demiris, Y. (2015). Online ensemble learning
Wolpert, D. M., Ghahramani, Z., & Jordan, M. I. (1995). An of sensorimotor contingencies. In R. Téllez, G. Alenyà, C.
internal model for sensorimotor integration. Science, 269, Angulo, & J. Kevin O’Regan, (Eds.) Proceedings of the
1880–1882. Sensorimotor Contingencies for Robotics Workshop at
Wolpert, D. M., Goodbody, S. J., & Husain, M. (1998a). IROS. Berlin, Germany: Springer.
Maintaining internal representations: The role of the
About the Authors

Bruno Lara is a Professor at the Universidad Autónoma del Estado de Morelos (UAEM) and
the Head of the Cognitive Robotics Lab at the Science Research Center at the UAEM since
2005. He holds a PhD in Mechatronics from King’s College London. He did a postdoc in the
TheoriLabor in the University of Jena, working on evolutionary robotics, and then a postdoc in
the Max Planck Institute for Psychology Research in Munich, focusing on research on Cognitive
Robotics. In 2011, he spent a sabbatical stay in the Cognitive Robotics Lab at the Humboldt-
Universität zu Berlin. He is an Alexander von Humboldt Fellow. His research interests include
Internal Models, sensorimotor representations, and evolutionary robotics and learning.
Dadai Astorga obtained a Degree in Psychology from the UAEM and a Master’s degree in
Cognitive Sciences by the same university. She is an Active Member of the Cognitive Robotics
Laboratory. Her research is focused on Cognitive Sciences, specifically predictive coding, cogni-
tive robotics, agency, and autism.
Emmanuel Mendoza-Bock obtained a Master’s degree in Cognitive Sciences from the UAEM.
He is currently doing a PhD in Philosophy of Sciences at the National Autonomous University
of Mexico (UNAM), in the area of Philosophy of Cognitive Sciences. He is an Associate Student
of the Institute of Philosophical Investigations of the UNAM (EA-IIFs UNAM). His research is
focused on the philosophical analysis of the epistemic, theoretical, and methodological assump-
tions that underlie the postulates of Embodied Cognitive Sciences.
Manuel Pardo obtained a Degree in Philosophy and a Master’s degree in Cognitive Sciences
from the UAEM. He is an Active Member of the Cognitive Robotics Lab. His research interests
are focused on evolutionary robotics, Cognitive Sciences, and the philosophy of science.
Esaú Escobar received his BSc degree in the area of Computer Sciences from the Sciences
Faculty at UAEM in 2011 and PhD in Computational Modeling and Scientific Computing from
the Science Research Center at UAEM in 2017. He is currently a part-time Professor of
Computer Sciences in the Science Research Center at UAEM. He is an Active Member of the
Cognitive Robotics Lab. His research is focused on the study, development, and application of
Artificial Intelligence and machine learning in the field of cognitive robotics.
Alejandra Ciria obtained a Degree in Psychology from the Universidad de las Américas Puebla
(UDLAP). She obtained her Master’s degree in Cognitive Sciences from the UAEM. Currently,
she is about to obtain a PhD in Psychology, in the area of Experimental Psychology of
Behavior, from the Faculty of Psychology at the UNAM. She is an Active Member of the
Cognitive Robotics Lab. Her current research focuses on the study of Cognitive Sciences, specifi-
cally time perception, visual cognition, predictive processing, cognitive robotics, Internal
Models, and the philosophy of Artificial Intelligence.

Embodied Cognitive Robotics and The Learning of Sensorimotor Schemes

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Embodied Cognitive Robotics and The Learning of Sensorimotor Schemes

Uploaded by

Copyright:

Available Formats

Special Issue: Spotlight on 4E Cognition Research in Mexico

Bruno Lara1 , Dadai Astorga1, Emmanuel Mendoza-Bock2,

Handling Editor: Ximena González-Grandon, National Autonomous University of Mexico, Mexico

1. Introduction of success is one of the main reasons why critics of

contradictory since humans perform cognitive pro- 2.1. Internal Models

these representations are linked to the external world

where there were no obstacles. It is worth noting that

Funding Clark, A. (2015a). Embodied prediction. In: T. Metzinger, &

About the Authors

You might also like