Professional Documents
Culture Documents
Adaptive Behavior
1–14
Ó The Author(s) 2018
Embodied Cognitive Robotics and the Reprints and permissions:
sagepub.co.uk/journalsPermissions.nav
learning of sensorimotor schemes DOI: 10.1177/1059712318780679
journals.sagepub.com/home/adb
Abstract
Embodied Cognitive Robotics focuses its attention on the design of artificial agents capable of performing cognitive tasks
autonomously. A central issue in this consists in studying process by which agents learn through interaction with their
environment. Embodied Cognitive Robotics aims to implement models of cognitive processes coming from Cognitive
Sciences. The guidelines in this research area are a direct response to the shortcomings of Classical Artificial
Intelligence, where high-level tasks and behaviors were studied. This article describes the work carried out in the
Cognitive Robotics Laboratory at the Universidad Autónoma del Estado de Morelos (UAEM). Our work is based on the
concept of low-level sensorimotor schemes coded by Internal Models, thus falling as a matter of course within the tenets
of Embodied Cognition, particularly with the idea that cognition must be understood as occurring in agents that have a
body with which they interact in a specific environment. It is through this interaction that learning emerges laying the
ground for cognitive processes. Our research includes theoretical work laying the foundations of Embodied Cognitive
Robotics, as well as work with artificial and with natural agents.
Keywords
Embodied Cognitive Robotics, Internal Models, sensorimotor schemes, Embodied Cognition
and moving capabilities, higher order abilities should Internal Models have been proposed to code the sen-
become easier to implement. High-order abilities sorimotor schemes that emerge from the interaction of
include problem solving and language expert knowl- agents with their environment. Originating in control
edge, among others (Brooks, 1991). Following these theory (Miall & Wolpert, 1996), these models have been
lines, a fraction of researchers in AI turned to the use used to explain a wide range of cognitive phenomena
of robots as situated agents in what became known as (for a review of the topic, see Schillaci, Hafner, and
Embodied Cognitive Robotics (Clark & Grush, 1999; Lara, 2016). The core of the research reported here is
Pfeifer, Lungarella, & Iida, 2007; Pfeifer & Scheier, Internal Models and the multimodal associations that
2001). Embodied Cognitive Robotics is in accordance are necessary for them to emerge.
with the tenets of the embodied and embedded frame- This article is organized as follows: the next section
works, which highlight the importance of the interac- attempts to lay out the links between AI and Embodied
tion of agents with their environment for the Cognition, positioning the work reported here in the
development of cognitive processes (Pfeifer & Scheier, wide 4E cognition framework (cognition as embodied,
2001). embedded, extended, and enactive) and clarifying the
Development is about change throughout the life- reasons to frame it mainly within Embodied Cognition.
span of agents and this, without a doubt, requires learn- The section contains a subsection on Internal Models,
ing and adaptive capabilities. Movement is perhaps the which are at the core of the research carried out in our
most pervasive and fundamental of all psychological lab. The next section ‘‘Embodied sensorimotor
activities providing a highly significant source of per- schemes,’’ presents an overview of the work carried out
ceptual experiences (Adolph & Berger, 2006). Humans in our laboratory. First, it outlines some of the theore-
are endowed genetically with the capability and propen- tical discussions on artificial agents and intelligence,
sity to learn sensorimotor behavioral structures also then it presents work carried out on mobile agents,
referred to as sensorimotor schemes (Harvey, 2018). humanoids, and finally natural agents. The experiments
Human fetal motility starts at 7.5 weeks of gestational are ordered according to the complexity of the architec-
age with isolated stereotyped movements of arms, legs, tures used as well as the different cognitive processes
and head, leading to more varied and complex patterns they model. In the final section, we draw conclusions
of movement as early as at 9 weeks of gestational age and discuss further research.
(Lacquaniti, Ivanenko, & Zago, 2012; Lüchinger,
Hadders-Algra, Van Kan, & de Vries, 2008). Later on,
the newborn child is prepared to interact with the envi- 2. AI and Embodied Cognition
ronment and adapt to it, displaying functional goal-
directed, prospective and flexible movements (von AI aims to create tools or agents that are capable of
Hofsten, 2003, 2009). exhibiting intelligent behaviors. Among other things,
Movement cannot rely on sensorimotor feedback the AI community managed to create a machine that
because of the delays in efferent and afferent pathways successfully plays chess against the best human players.
and the corresponding processing delays. These delays However, to date there is still not a machine capable of
are known to vary across modalities in the order of physically moving the pieces on a chess board as skill-
approximately 200 ms (Desmurget & Grafton, 2000; fully as a child. As a consequence, the idea of having a
Miall & King, 2008; Miall, Weir, Wolpert, & Stein, body in order to act in the real world and give meaning
1993; von Hofsten, 2009). In order to overcome this to the internal processing produced a major shift in AI
control problem, agents progressively and successfully research. Having a body is necessary for cognition and
predict the consequences of behavior by learning the thus a prerequisit for intelligent behavior (e.g. Pfeifer
associations between actions and their consequences and Bongard, 2006). The implementation of these ideas
during the body–environment interaction. This interac- in AI became known as Embodied Cognitive Robotics.
tion leads to the continuous learning of rules and regu- Importantly, the Embodied Cognitive Robotics agenda
larities of events, further building more complex and its theoretical foundations acquired more solid
sensorimotor schemes (von Hofsten, 2007, 2009). ground as Cognitive Sciences suffered a major shift of
Moreover, a constant recalibration of learned sensori- paradigm, namely, the conception of the Embodied
motor schemes, in addition to the acquisition of new Cognition framework.
ones, and the development of the physical body enable The 4E cognition has been characterized, in general
behavioral patterns to become more complex, increas- terms, as the claim that cognition is embodied,
ing the possibilities for interaction with the world ‘‘for embedded, extended, and enactive. The core of this
further development of intelligence to proceed in an view is that motor skills (the body), the material and
incremental fashion’’ (Harvey, 2018, p. 26). Thus, sociocultural world (the environment), and their combi-
‘‘from a functional perspective, cognitive development nation in specific contexts (the body–environment
has to do with expanding the prospective control of interaction) are indispensable components of cognition
actions’’ (von Hofsten, 2007, p. 58). (Anderson, 2003; Chiel & Beer, 1997; Clark, 1999;
Lara et al. 3
Lakoff & Johnson, 1999; Noë, 2004; Pezzulo et al., regularities in sensory input enhancing internal infor-
2011; Pfeifer et al., 2007; Robbins & Aydede, 2009; mation processing, and from these regularities cognitive
Shapiro, 2010; Varela, Thompson, & Rosch, 2017; behavior can emerge. It is suggested that the associa-
Wilson, 2002). These four claims of cognition represent tion of sensory and motor information, known as sen-
a family of individual explanatory frameworks, which, sorimotor schemes, could be the ideal unit of analysis
in a strict sense, cannot be reduced to four labels. At for the study of cognition. Again, we believe that beha-
the same time, there are also frameworks that include vior is directly affected by two elements, the ecological
the affective approach, sometimes presented as 4EA niche in which the agent is immersed and the morphol-
cognition (see Kiverstein and Clark, 2009; Ward and ogy of the agent (location of its sensors and effectors).
Stapleton, 2012). Cognitive Sciences literature has Furthermore, Anderson (2003) suggests that beha-
emphasized the fact that there is no homogeneity vior represents emergent phenomena that can be attrib-
between the different 4E cognition frameworks, mainly uted to the physical foundations that have evolved over
because there are significant differences in the models time. Therefore, the foundations of cognitive processes
and explanations for the subject of study. It is thought lie in the different bodily modalities of the sensorimotor
that an attempt to characterize and generalize their system. Cognition must be understood in terms of how
interests might lead to neglect peculiar aspects that are it functions under the pressures of real-time interaction
particular to the explanatory frameworks for each of with the environment (Wilson, 2002, p. 626). When cog-
the claims (Anderson, 2003; Rowlands, 2010; Shapiro, nition takes place in a context of inputs and outputs rel-
2010). Also, placing the labels under the same general evant to a specific task, it means cognition is situated.
scheme risks losing both the individual nuances and the That is to say, during cognitive processing, perceptual
real disagreements between the different proposals (Di information continues to affect processing, and at the
Paolo, 2009; Kiverstein and Clark, 2009; Menary, same time motor activity affects the environment (on-
2010). line cognition).
The work conducted in our laboratory follows Given that not all cognitive agents are capable of
mainly the ideas of Embodied Cognition, based on the carrying out certain tasks, it is essential to bear in mind
notion that cognition is determined not only by brain that it is body morphology which allows agents to
processes but also by a combination of these, together diversify their interaction with a constantly changing
with the body morphology and capabilities particular environment. At the same time, it is the environment
to the agents situated in an environment (Rowlands, which allows an agent to evolve particular cognitive
2010). Specifically, our work takes, from the notion of capabilities (Pfeifer et al., 2007; Varela et al., 2017;
‘‘embodiment,’’ the principles of processing of sensori- Wheeler & Clark, 2008).
motor schemes, simulation models, and multimodal During development, agents learn sensorimotor
associations. In doing so, we distance ourselves from schemes; these provide predictions of sensory informa-
the controversies arising when attempting to bring all tion to efficiently interact with the environment
the E’s together under one umbrella. (Harvey, 2018). Sensorimotor schemes associate differ-
The bodily skills to respond to the environment are ent sensory modalities (e.g. exteroceptive and interocep-
as constitutive to cognition as internal processing. This tive) which reflect the knowledge of an agent about the
statement implies that in order to understand cognition regularities of the environment as consequences of its
it is necessary to understand the complex and dynamic actions. Moreover, the knowledge about sensory conse-
interaction between the brain, body, and environment quences of self-generated actions provides the feeling of
(Clark & Grush, 1999; Van Gelder & Port, 1995). It agency and, as a result, a subjective point of view of the
has been suggested that some behaviors that can be world. Importantly, the sensorimotor schemes provide
classified as cognitive arise from the dynamic interac- a link to Gibsonian affordances, which are particularly
tion between the basic physical abilities of the agents appealing for Embodied Cognitive Robotics (Nasuto &
and their environment. Thus, the physiology, the evolu- Hayashi, 2016).
tionary history, the practical activity of the agent, and Traditionally, cognitive processing implies the inte-
the situation it is immerse in play the role of modeling, gration of what has been known as high- and low-level
limiting, and grounding the projects of Embodied processes (Koeinig, Ktihnberger, & Kietzmann, 2013).
Cognition (Anderson, 2003). However, rather than insisting on this distinction, we
The physical structure of the body sets limits, gives assume that sensorimotor schemes underlie all types of
precise indications, and offers opportunities for neural cognitive processes.
control and cognitive processes (Chiel & Beer, 1997; A telling example is off-line cognition. Humans can
Rowlands, 2010; Shapiro, 2007). Pfeifer et al. (2007) have processes that are not necessarily situated. If
argue that the continuous and dynamic interaction of situatedness is a condition for cognition and it is not
the agent with its environment (physical processes and fulfilled, then it seems that these processes cannot be
incoming sensory information) induces statistical considered as cognitive. This intuitive idea is
4 Adaptive Behavior
2003). Through interaction with their environment, problem (Ince et al., 2009; Schmidt et al., 2016), because
agents acquire a repertoire of sensorimotor schemes. of the non-stationary nature of the ego-noise generated
The observed sensory consequences of an action allow by the autonomous agents (Ince et al., 2010). In an
a conversion by the inverse model into potential motor early bio-inspired approximation, Ince et al. (2009)
commands so that the observed movement can be pre- showed that it is possible to suppress this noise by let-
dicted by the forward model enabling imitation. ting the agent perform predictions about the sound pro-
The authors adhere to the possibility that mirror duced when executing a motor command and then
neurons in premotor areas could function as predictors subtracting that prediction from the total sound input.
of the observed behaviors from an associative learning Following these ideas, Pico et al. (2017) propose that
(Heyes, 2010). These associations are based on the sen- it is possible to not only solve this problem using pre-
sorimotor experiences of the agent and coded by dictions and forward models, but more importantly the
Internal Models. Similar work was performed by ego-noise encodes useful information about the agent
Oztop, Kawato, and Arbib (2006) in a simulated robot and its environment. To test this, Pico et al. (2017)
arm and by Demiris and Hayes (2002) on a developed a two-wheeled differential agent, capable of
demonstrator–imitator simulated environment. synchronizing motor commands and sensory data using
Using mobile agents, this idea was put to the test two controllers. A low-level controller is in charge of
(Arceo et al., 2013). Here, the authors let an agent asso- the sensors, and a high-level controller is in charge of
ciate the visual consequences of small movements to the making predictions and pre-processing the audio.
right and left. The visual information is the coordinates Using this agent and forward models, implemented
of a fiduciary marker in its field of view. The forward as a Multi-layer Perceptron Network, Pico et al. (2017)
model takes as input visual information and motor run two different experiments: In the first experiment
commands and delivers as output the changes in visual named ‘‘ego-noise classification,’’ the authors try to
information. The model is coded with a Multi-Layer classify the ego-noise of the agent as a consequence of
Perceptron network. The inverse model is a function its own motor commands using predictions. The motor
that finds a vector in a pool containing all collected tri- commands correspond to the velocity profiles of the
plets of visual information for time t, denoted by Vt , two motors in the agent. In total, 11 forward models
motor command, Mt , and the corresponding change in were trained, each corresponding to a velocity profile
visual information at time t + 1, denoted by vt + 1 . and predicting the ego-noise produced. Once the mod-
Once trained, the agent observes movements of the els are trained, the agent executes motor profiles and
marker mounted on a second agent. Using the observed the ego-noise produced is compared to the predictions
vt and vt + 1 , it uses the inverse model containing its of all the models. Finding the smallest Euclidean dis-
motor repertoire, to find the motor commands that best tance between the two signals, the motor profile caus-
bring about the visual changes. These motor commands ing the noise is found. The results of this experiment
are passed to the forward model, together with the ini- demonstrate that forward models can, in fact, code the
tial visual situation to perform a set of predicted visual relation between a motor command and the sound that
situations. The system then compares the suggested this produces.
visual situations with the observed ones in order to exe- In a second experiment ‘‘Motor babbling and ego-
cute that motor command in the triplet with the smal- noise prediction,’’ the focus was on the detection of
lest error in visual information. changes in the environment of the agent. A forward
A final work using mobile agents adheres to the model was trained in a flat arena to predict the sounds
claim that ownership of actions and its consequences in produced by the robot using random velocities for the
the world are fundamental components of the cognitive motors. Once the model is trained, the robot is tested
process of agency (Tsakiris, Schütz-Bosbach, & in the original arena and in a second one, containing a
Gallagher, 2007; Weiss, Tsakiris, Haggard, & Schütz- ramp. When the robot moves on the ramp, the error
Bosbach, 2014). The association between actions and between the ego-noise and the prediction increases sig-
sensory consequences in the world is at the core of the nificantly, suggesting that this type of implementation
notion of forward models and has been put to the test can not only give information about the agent, but also
by Pico, Schillaci, Hafner, and Lara (2017). The detect changes in its environment.
authors attempt to tackle the ego-noise problem in an
autonomous agent. The main issue here is for the agent
to suppress the sounds produced itself while moving 3.1.3. Humanoids. While all of the previous works have
around to increase its performance on the processing of been carried out using mobile agents, research has also
external sound sources, for tasks such as speech recog- been conducted using forward models in humanoid
nition or localization of other sound sources. platforms, namely, the NAO robot from Aldebaran.
Traditional methods for audio processing, such as The agent has been let to learn its own body capabilities
static noise reduction techniques or the use of different by performing motor babbling (Demiris & Dearden,
microphone arrays’ geometries, fail to address this 2005). The NAO robot learned the visual consequences
8 Adaptive Behavior
of moving its arm around and how these changed when absent without affecting its performance. Different cog-
the use of a tool modified its motor capabilities nitive processes have been modeled using SOIMA,
(Schillaci, Hafner, & Lara, 2012a). including coding the reaching space of the NAO’s arm,
Forward models were also used to let the agent learn saccade movements of the eyes, foveation of a point in
different actions by associating the motor commands to space and then reaching it with the arm (see Figure 3),
execute them with the visual consequences while execut- and the characterization of shorthand trajectories
ing those actions (Schillaci, Lara, & Hafner, 2012b). (Escobar Juárez, 2016). As can be seen from the exam-
Once trained, the system was capable of correctly recog- ple, the SOIMA can propagate information in any
nizing observed actions. The same principle is extended direction depending of the needs of the task, depending
to let the agent know whether an observed hand trajec- on the availble sensory or motor information.
tory is performed itself or by another agent based on We believe that this type of knowledge acquisition,
the amount of prediction error processed (Schillaci, based on a Piagetian motor babbling strategy (Harvey,
Hafner, Lara, & Grosjean, 2013). 2018), is not only in accordance with the principles of
In all the previously described implementations, the Embodied Cognition but also serves as a proof of con-
forward models were coded using Multi-layer cept. Namely, an agent can acquire the most primitive
Perceptron Networks, a straigthforward strategy that tools to interact with its environment by first forming a
allows the designer to concentrate on the modeled cog- basic body image based on its sensorimotor capabilities
nitive processes as well as the cognitive architecture. and then use this to further explore the world incremen-
However, the nature of these tools presents some short- tally acquiring more complex and adaptive behaviors.
comings such as the impossibility of adding knowledge
after training and more importantly does not work cor-
rectly when there is information missing.
3.2. Natural agents
The case of missing information is of particular To further test the concept of Internal Models (Wolpert
interest given that, as an example, when an agent learns et al., 1995; Wolpert, Miall, & Kawato, 1998b) as expla-
the reaching space of its arm, it does so by performing a natory mechanisms, several experiments have been car-
multimodal association. This association can include ried out with natural agents. As stated before, Internal
motor, tactile, and visual information. However, when Models are thought to be involved in learning and
an agent observes an arm moving, it is capable of recog- motor control; at the same time, they are supposed to
nizing the movement and reproduce the action even be at the center of action and perception. We have car-
when the agent does not have access to first-person ried out several experiments with humans related to
motor or visual information. Among other reasons, this tasks that involve the sense of agency, recognition of
led to a change of machine learning paradigm. actions, and the prediction of external events.
To tackle some of these issues, Escobar Juárez, A core element in the proposal of Internal Models is
Schillaci, Hermosillo Valadez, and Lara Guzmán their ability to explain the sense of agency understood
(2016) presented a Self-Organizing Internal Models as the sense of control over our actions and their effects
Architecture (SOIMA) based on Hebbian learning and (Haggard & Tsakiris, 2009). The sense of agency in the
self-organizing maps (SOMs; Kohonen, 1998). This framework of Internal Models arises as the result of the
architecture is inspired by the work of Morse, Greef, ability of the sensorimotor system to predict the conse-
Belpaeme, and Cangelosi (2010) where the iCub huma- quences of our actions. The predicted consequences
noid robot associates different modalities in a similar and the incoming sensory information are then com-
fashion but without the use of predictions based on pared in low-level sensorimotor processes (Haggard &
Internal Models. Chambon, 2012). However, alternative proposals sug-
In SOIMA, the regularities of a sensory modality are gest that the agency experience is the result of high-level
coded in a SOM so that there can be as many SOMs as cognitive processes, specifically causal inferences about
sensory modalities. The modalities are then connected the relationships between thoughts and actions
using Hebbian learning, specifically between a sensory (Wegner & Wheatley, 1999). Therefore, from this per-
situation, a motor change, and the sensory conse- spective, our experience of agency, like any other causal
quences of this. The connections are organized in what inference, lays its foundations around contingencies
we call a Multimodal Representation Map (sensorimo- that allow us to infer, in this particular case, an appar-
tor schemes). The result is a modular architecture that ent authorship.
codes the forward and the inverse model under the Taking this panorama as a reference, Astorga (2016)
same structure. Furthermore, SOMs and in particular investigated the effect of perceptual differences between
dynamic self-organizing maps (DSOMs; Rougier & judging the onset of a voluntary action and the onset of
Boniface, 2011) can keep learning, and new schemes its sensory consequence (sense of agency) and the judg-
are added to the knowledge of the agent as this inter- ments related to the onset of two contingent events
acts with its environment. When the architecture is in (causal inference). This work was carried out using the
use, information of one or more modalities can be paradigm of temporal binding proposed by Haggard,
Lara et al. 9
Figure 3. The connections in this SOIMA can be used by the NAO humanoid robot for foveation of a stimulus in its field of view
and then to bring its arm to point at it. The bottom maps are modal maps organizing the regularities of sensory information. The top
maps join events together as multimodal representations. So, for instance, an initial and a final (goal) position in the vision map are
connected to a unit in the Saccadic MMR, and this unit is connected to a motor command for the head which will bring the visual
stimulus to the center of the image. The right section of the SOIMA will bring the head and the arm to absolute positions in space,
with the arm pointing to the stimulus.
Clark, and Kalogeras (2002). Astorga (2016) found a 4. Conclusion and further research
temporal binding between events that involves agency
The research carried out in our laboratory follows the
significantly greater than when related events are judged
principles of the 4E cognition framework, specifically the
contingently. The result is consistent with Haggard and
ideas of Embodied Cognition. Interaction of agents with
Chambon (2012), in that the sense of agency is the result
their environment and sensorimotor schemes that emerge
of comparisons made by sensorimotor low-level cogni-
tive processes. The experimental results suggested that through this interaction have been the backbone of our
when there is a sensory consequence associated with a research. It is our conviction that Internal Models are
voluntary action (motor command), and because this the natural candidates for coding the aforementioned
information is highly relevant for the organism, priority schemes, as the capabilities to perform sensorimotor pre-
is given to this association, strengthening it through a dictions is an essential feature of cognitive agents.
temporal link between both events. As previously described, instead of imitating high-
Under the hypothesis of Internal Models and their level cognitive tasks in order to understand cognition, a
relationship with the sense of agency, Rojas (2014) fraction of the AI community turned to study what
worked on the issue of action recognition. This become known as low-level cognitive processes. We
research was based on the assumption that individu- believe that the study of sensorimotor schemes and the
als process perceptual information together with processes and capabilities that emerge from these repre-
motor information. The result of this processing is sent the fundaments for further research in Embodied
the acquisition of sensorimotor schemes, particular to Cognitive Robotics. The quest is not over yet; however,
the individual. When recognizing an action, these important steps have been taken.
schemes are used in simulation to solve the task suc- The experiments summarized here present results
cessfully. A fundamental element in action recogni- that support the main tenets of Embodied Cognition
tion comes from the discrimination between an action and the 4E cognition approaches in general, namely,
recognized as our own, or as executed by someone the fact that cognition must be understood as occurring
else (Knoblich, Seigerschmidt, Flach, & Prinz, 2002). in agents that have a body, and with this, they interact
Rojas (2014) found that individuals recognized faster in a specific environment. The use of Internal Models
between actions (take a rubber ball, hit it with the in our work serves as a proof of concept of different
fist, or make it roll) when these were previously exe- strategies for the acquisition of basic cognitive tools for
cuted by them. These findings can be seen as a result agents, laying the ground for further development of
of perceptual anticipation, a phenomenon that exem- more complex and adaptive behaviors. The implemen-
plifies the processing and preparation of actions. tations of Internal Models can be read as agents acquir-
Recognition of actions is carried out through simula- ing the capability of experiencing affordances of the
tions using as a reference the sensorimotor informa- environment they interact with. These affordances are
tion coded in the Internal Models of the individual coded using sensorimotor schemes particular to the
(Wilson & Knoblich, 2005; Wolpert et al., 1998b). agents, their bodily properties, and their environment.
10 Adaptive Behavior
Another significant contribution comes from the Clark, 2014). In the IFM, motor commands are
experiments centered on prediction, and the study of replaced by top-down sensory predictions issued by the
the capabilities agents acquire for successful interaction forward model, removing any distinction between
with their environment when they become predicting motor and sensory processing and also the need of an
agents. It has been shown that using Internal Models inverse model. Thus, IFMs posit a more complex (dis-
to code sensorimotor schemes in order to predict the tributed) forward model, mapping prior beliefs about
sensory consequences of actions allows different types the desired trajectories to sensory consequences
of behaviors to emerge. These behaviors are in accor- (Pickering & Clark, 2014, p. 456). However, in classic
dance with the principles of the Embodied Cognition control theory, the inverse model has played a signifi-
framework; particularly, we have addressed acquisition cant role despite learning difficulties inherent to the
of affordances, agency, recognition and imitation of model. Rejecting the idea and need of an inverse model
actions, and simulations. brings interesting issues but should be considered care-
Our experiments in natural agents have allowed us fully. As stated earlier, SOIMA intrinsically codes
to investigate important cognitive processes, such as inverse and forward models, so the relevance of the
action recognition and agency, in the light of Internal inverse model could be further studied. Second, predic-
Models. At the same time, these experiments provide tive processing suggests the concept of precision
valuable insights for the implementation of Internal weights as an attentional mechanism which modulates
Models in artificial agents. the incoming sensory information relevant to a context.
Following the latest paradigm in the Cognitive This attentional mechanism has generated many con-
Sciences, current research in our laboratory also focuses troversies and demands further research and under-
on modeling of perception and action in the framework standing. We argue that given the modular architecture
of predictive processing (Clark, 2013, 2015a). Predictive of the SOIMA, SOMs could be added to potentially
processing represents one of the most radical views code context to control the flow and processing of rele-
about the flow of sensory information processing dur- vant sensory information. Such a structure could be
ing perception, asserting that it is primarily a top-down considered as a precision weighting mechanism.
process, rather than a bottom-up process. However, the Furthermore, the effects of prediction error on the flow
suggestion of an inverted structure of the perceptual of upcoming information has not been widely modeled
system requires agents to first learn forward (genera- and explored. In the SOIMA, the flow of information
tive) models which capture the statistical structure of is not restricted to one direction. The bidirectional flow
the incoming sensory information by tracking the of information allows studying the effects of prediction
causes of that specific structure (Clark, 2013). Cognitive error on Hebbian learning processes and the way infor-
agents are thought to be actively predicting the stream mation propagates.
of sensory stimulation before it occurs, and only unex- Considering the core results of our work so far, we
pected sensed deviations from the predicted state (pre- are convinced that using Internal Models to learn and
diction error) are further processed bottom-up (Clark, recalibrate sensorimotor schemes leads to more com-
2015a, 2015b). The perceptual and motor systems work plex behavioral patterns for further development of
together to predict incoming sensory information in all intelligence. In this framework, we hold that intelligence
modalities: exteroceptive, interoceptive, and, in the case can be concieved as a behavior which leads an agent to
of the motor system, proprioceptive (Adams, Shipp, & the succesful execution of a task situated in a certain
Friston, 2013). In addition, in this framework, percep- context. Consequently, we claim that Internal Models
tion and action constitute ‘‘active inference’’ (Friston, are valuable tools for research which can significantly
Daunizeau, Kilner, & Kiebel, 2010) or ‘‘action-oriented contribute to the a better understanding of cognition.
predictive processing’’ (Clark, 2013). Through actions,
cognitive agents minimize prediction error about Acknowledgements
incoming sensory information, evidencing the intimate
The authors would like to thank the current and former
relationship between perception and action (Clark,
members and close collaborators of the Cognitive Robotics
2013; Friston, Kilner, & Harrison, 2006). Laboratory, particularly Guido Schillaci, Jorge Hermosillo,
In this line of research, we are focusing on the fol- Verena Hafner, Esau Escobar, Wilmer Gaona, Karla Baeza,
lowing open questions and future implementations. Diana Valenzo, Jorge Hernandez, Abraham Martinez, Javier
First, active inference suggests that there is no need for Vargas, David Castillo, Dairon Rodriguez, Eduardo Rojas
an inverse model or an optimal control mechanism, as and, Antonio Pico.
motor commands are conceived as proprioceptive pre-
dictions that drive action (Clark, 2015a; Pickering & Declaration of conflicting interests
Clark, 2014). In this framework, an integral forward The authors declared no potential conflicts of interest con-
model (IFM) is a forward (generative) model and the cerning the research, authorship, and publication of this
core machinery of perception and action (Pickering & article.
Lara et al. 11
Haggard, P., & Tsakiris, M. (2009). The experience of agency: robotics in Mechatronics (pp. 321–348). Berlin, Germany:
Feelings, judgments, and responsibility. Current Directions Springer International Publishing AG.
in Psychological Science, 18, 242–246. Lara Guzmán, B., Hermosillo Valadez, J., & Baeza Mariscal,
Harvey, K. (2018). An open-ended approach to Piagetian K. (2017). Intentionality and internal models in artificial
development of adaptive behavior. Open Access Library agents. Pragmatics & Cognition, 23, 209–237.
Journal, 5, e4434. Lüchinger, A. B., Hadders-Algra, M., Van Kan, C. M., & de
Hernandez Ochoa, J. L. (2017) Robotica Cognitiva y Emo- Vries, J. I. (2008). Fetal onset of general movements.
ciones (Master’s thesis). Centro de Investigación en Cien- Pediatric Research, 63, 191–195.
cias Cognitivas, Cuernavaca, Mexico. McCarthy, J., Minsky, M. L., Rochester, N., & Shannon, C.
Heyes, C. (2010). Where do mirror neurons come from?Neur- E. (2006). A proposal for the Dartmouth summer research
oscience and Biobehavioral Reviews, 34, 575–583. project on artificial intelligence. AI Magazine, 27, 12.
Hoffmann, H. (2007). Perception through visuomotor antici- Maye, A., & Engel, A. K. (2012). Using sensorimotor contin-
pation in a mobile robot. Neural Networks, 20, 22–33. gencies for prediction and action planning. In T. Ziemke,
Ince, G., Nakadai, K., Rodemann, T., Hasegawa, Y., Tsujino, C. Balkenius, & J. Hallam (Eds.), Proceedings of the Inter-
H., & Imura, J. I. (2009). Ego noise suppression of a robot national Conference on Simulation of Adaptive Behavior
using template subtraction. In Proceedings of the IEEE/ (pp. 106–116). Berlin, Germany: Springer.
RSJ International Conference on Intelligent Robots and Menary, R. (2010). Introduction to the special issue on 4E
Systems (IROS) (pp. 199–204). New York, NY: IEEE. cognition. Phenomenology and the Cognitive Sciences, 9,
Ince, G., Nakadai, K., Rodemann, T., Hasegawa, Y., Tsujino, 459–463.
H., & Imura, J. I. (2010). A hybrid framework for ego Mendoza Bock, E. (2013) Los términos nómadas de las Cien-
noise cancellation of a robot. In N. Xi, & W. R. Hamel cias Cognitivas: el caso de los Modelos Directos (Master’s
(Eds.), Proceedings of the IEEE International Conference thesis). Facultad de Humanidades, Montevideo, Uruguay.
on Robotics and Automation (pp. 3623–3628). New York, Miall, R. C. (2003). Connecting mirror neurons and forward
NY: IEEE. models. Neuroreport, 14, 2135–2137.
Jordan, M. I., & Rumelhart, D. E. (1992). Forward models: Miall, R. C., & King, D. (2008). State estimation in the cere-
Supervised learning with a distal teacher. Cognitive Sci- bellum. The Cerebellum, 7, 572–576.
ence, 16, 307–354. Miall, R. C., & Wolpert, D. M. (1996). Forward models for
Karniel, A. (2002). Three creatures named ‘‘forward model.’’ physiological motor control. Neural Networks, 9,
Neural Networks, 15, 305–307. 1265–1279.
Kawato, M. (1999). Internal models for motor control and Miall, R., Weir, D. J., Wolpert, D. M., & Stein, J. (1993). Is
trajectory planning. Current Opinion in Neurobiology, 9, the cerebellum a smith predictor? Journal of Motor Beha-
718–727. vior, 25, 203–216.
Kiverstein, J. (2007). Could a robot have a subjective point of Mingers, J. (2001). Embodying information systems: The con-
view?Journal of Consciousness Studies, 14, 127–139. tribution of phenomenology. Information and Organiza-
Kiverstein, J., & Clark, A. (2009). Introduction: Mind embo- tion, 11, 103–128.
died, embedded, enacted: One church or many?Topoi, 28, Möller, R., & Schenck, W. (2008). Bootstrapping cognition
1–7. from behavior a computerized thought experiment. Cogni-
Knoblich, G., & Flach, R. (2001). Predicting the effects of tive Science, 32, 504–542.
actions: Interactions of perception and action. Psychologi- Moons, T. (1998). Lecture notes in computer science. In
cal Science, 12, 467–472. R. Koch, & L. V. Gool (Eds.), 3D structure from multiple
Knoblich, G., Seigerschmidt, E., Flach, R., & Prinz, W. images of large-scale environments (Chapter: A guided tour
(2002). Authorship effects in the prediction of handwriting through multi view relations) (vol. 1506, pp. 304–346). Ber-
strokes: Evidence for action simulation during action per- lin, Germany: Springer.
ception. The Quarterly Journal of Experimental Psychol- Moravec, H. (1984) Locomotion, vision and intelligence. In
ogy: Section A, 55, 1027–1046. M. Brady, & R. Paul (Eds.), Robotics Research (pp. 215–
Koeinig, P., Ktihnberger, K. U., & Kietzmann, T. C. (2013). 224). Cambridge, MA: The MIT Press.
A unifying approach to high-and low-level cognition. In Morse, A. F., Greef, J. D., Belpaeme, T., & Cangelosi, A.
U. Gähde (Ed.), Models, simulations, and the reduction of (2010). Epigenetic robotics architecture (era). IEEE Trans-
complexity (vol. 4, pp. 117–139). Berlin, Germany: De actions on Autonomous Mental Development, 2, 325–339.
Gruyter. Nagel, T. (1974). What is it like to be a bat? The Philosophical
Kohonen, T. (1998). The self-organizing map. Neurocomput- Review, 83, 435–450.
ing, 21, 1–6. Nasuto, S. J., & Hayashi, Y. (2016). Anticipation: Beyond
Lacquaniti, F., Ivanenko, Y. P., & Zago, M. (2012). Develop- synthetic biology and cognitive robotics. Biosystems, 148,
ment of human locomotion. Current Opinion in Neurobiol- 22–31.
ogy, 22, 822–828. Newell, A., & Simon, H. A. (1976). Computer science as
Lakoff, G., & Johnson, M. (1999). Philosophy in the Flesh empirical inquiry: Symbols and search. Communications of
(vol. 4). New York, NY: Basic books. the ACM, 19, 113–126.
Lara, B., Ciria, A., Escobar, E., Gaona, W., & Hermosillo, J. Newen, A. (2017). What are cognitive processes? An example-
(2018). Cognitive robotics: The new challenges in artificial based approach. Synthese, 194, 4251–4268.
intelligence. In O. O. S. Villegas, M. Nandayapa, & I. Soto Noë, A. (2004). Action in perception. Cambridge, MA: The
(Eds.), Advanced topics on computer vision, control and MIT Press.
Lara et al. 13
Oztop, E., Kawato, M., & Arbib, M. (2006). Mirror neurons distinction in robotics. In J. Crandall, & S. Sabanovic,
and imitation: A computationally guided review. Neural (Eds.), Proceedings of the 8th ACM/IEEE International
Networks, 19, 254–271. doi:10.1016/j.neunet.2006.02.002. Conference on Human-Robot Interaction (pp. 223–224).
Pezzulo, G., Barsalou, L. W., Cangelosi, A., Fischer, M. H., New York, NY: IEEE.
McRae, K., & Spivey, M. J. (2011). The mechanics of Schillaci, G., Lara, B., & Hafner, V. (2012b). Internal simula-
embodiment: A dialog on embodiment and computational tions for behaviour selection and recognition. In A. Salah,
modeling. Frontiers in Psychology, 2, 5. J. Ruiz-del Solar, C. Mericli, & P. Y. Oudeyer (Eds.), Human
Pfeifer, R., & Bongard, J. (2006). How the body shapes the behavior understanding (Lecture notes in computer science)
way we think: A new view of intelligence. Cambridge, MA: (vol. 7559, pp. 148–160). Berlin, Germany: Springer.
The MIT Press. Schmidt, A., Deleforge, A., Kellermann, W., Schmidt, A.,
Pfeifer, R., Lungarella, M., & Iida, F. (2007). Self-organiza- Deleforge, A., Kellermann, W., & Kellermann, W. (2016).
tion, embodiment, and biologically inspired robotics. Sci- Ego-noise reduction using a motor data-guided multichan-
ence, 318, 1088–1093. nel dictionary. In W. Burgard (Ed.) Proceedings of the
Pfeifer, R., & Scheier, C. (2001). Understanding intelligence. IEEE/RSJ International Conference on Intelligent Robots
Cambridge, MA: The MIT Press. and Systems (IROS) (pp. 1281–1286). New York, NY:
Pickering, M. J., & Clark, A. (2014). Getting ahead: Forward IEEE.
models and their place in cognitive architecture. Trends in Shapiro, L. (2007). The embodied cognition research pro-
Cognitive Sciences, 18, 451–456. gramme. Philosophy Compass, 2, 338–346.
Pico, A., Schillaci, G., Hafner, V. V., & Lara, B. (2017). How Shapiro, L. (2010). Embodied cognition. London, England:
do I sound like? Forward models for robot ego-noise pre- Routledge.
diction. In J. Santos - Victor, & G. Sandini, (Eds.), Pro- Shapiro, L. A. (2004). The mind incarnate. Cambridge, MA:
ceedings of the Joint IEEE International Conference on The MIT Press.
Development and Learning and Epigenetic Robotics (pp. Tsai, R. Y. (1987). A versatile camera calibration technique
246–251). New York, NY: IEEE. for high-accuracy 3D machine vision metrology using off-
Riedmiller, M., & Braun, H. (1993). A direct adaptive method the-shelf TV cameras and lenses. Journal of Robotics and
for faster backpropagation learning: The RPROP algo- Automation, 3, 323–344.
rithm. In Proceedings of the IEEE International Conference Tsakiris, M., Schütz-Bosbach, S., & Gallagher, S. (2007). On
on Neural Networks (pp. 586–591). New York, NY: IEEE. agency and body-ownership: Phenomenological and neu-
Robbins, P., & Aydede, M. (2009). The Cambridge handbook rocognitive reflections. Consciousness and Cognition, 16,
of situated cognition. Cambridge, UK: Cambridge Univer- 645–660.
sity Press. Van Gelder, T., & Port, R. F. (1995). It’s about time: An over-
Rodrı́guez, D. (2012) Análisis de la Cognición Corporizada view of the dynamical approach to cognition. In R. Port,
desde la perspectiva de la fenomenologı´a (Master’s thesis). & T. van Gelder (Eds.), Mind as motion: Explorations in
Facultad de Humanidades, Montevideo, Uruguay. the dynamics of cognition (pp. 1–43). Cambridge, MA: The
Rodrı́guez, D., Hermosillo, J., & Lara, B. (2012). Meaning in MIT Press.
artificial agents: The symbol grounding problem revisited. Varela, F. J., Thompson, E., & Rosch, E. (2017). The embo-
Minds and Machines, 22, 25–34. died mind: Cognitive science and human experience. Cam-
Rojas, E. (2014) Estudio sobre la caracterización de acciones bridge, MA: The MIT press.
basado en modelos internos (Master’s thesis). Facultad de von Hofsten, C. (2003). On the development of perception and
Humanidades, Montevideo, Uruguay. action. London, England: SAGE.
Rougier, N., & Boniface, Y. (2011). Dynamic self-organising von Hofsten, C. (2007). Action in development. Developmen-
map. Neurocomputing, 74, 1840–1847. tal Science, 10, 54–60.
Rowlands, M. (2010). The new science of the mind: From von Hofsten, C. (2009). Action, the foundation for cognitive
extended mind to embodied phenomenology. Cambridge, development. Scandinavian Journal of Psychology, 50,
MA: The MIT Press. 617–623.
Rumelhart, D. E., Hinton, G. E., & Williams, R. J. (1985). Ward, D., & Stapleton, M. (2012). Es are good. In F. Paglieri
Learning internal representations by error propagation (Ed.), Consciousness in interaction: The role of the natural
(Technical Report). California Univ San Diego La Jolla and social context in shaping consciousness (pp. 86–89).
Inst for Cognitive Science. Retrieved from http:// Amsterdam, The Netherlands: John Benjamins Publishing.
www.dtic.mil/dtic/tr/fulltext/u2/a164453.pdf Webb, B. (2004). Neural mechanisms for prediction: Do insects
Schillaci, G., Hafner, V. V., & Lara, B. (2012a). Coupled have forward models? Trends in Neurosciences, 27, 278–282.
inverse-forward models for action execution leading to Wegner, D. M., & Wheatley, T. (1999). Apparent mental cau-
tool-use in a humanoid robot. In J. Han, A. Tapus, & J. sation: Sources of the experience of will. American Psychol-
Crandall. (Eds.), Proceedings of the 7th ACM/IEEE Inter- ogist, 54, 480–492.
national Conference on Human-Robot Interaction (pp. 231– Weiss, C., Tsakiris, M., Haggard, P., & Schütz-Bosbach, S.
232). New York, NY: IEEE. (2014). Agency in the sensorimotor system and its relation
Schillaci, G., Hafner, V. V., & Lara, B. (2016). Exploration to explicit action awareness. Neuropsychologia, 52, 82–92.
behaviors, body representations, and simulation processes Wheeler, M., & Clark, A. (2008). Culture, embodiment and
for the development of cognition in artificial agents. Fron- genes: Unravelling the triple helix. Philosophical Transactions
tiers in Robotics and AI, 3, 39. of the Royal Society B: Biological Sciences, 363, 3563–3575.
Schillaci, G., Hafner, V. V., Lara, B., & Grosjean, M. (2013). Wilson, M. (2002). Six views of embodied cognition. Psycho-
Is that me? Sensorimotor learning and self-other nomic Bulletin & Review, 9, 625–636.
14 Adaptive Behavior
Wilson, M., & Knoblich, G. (2005). The case for motor invol- human superior parietal lobe. Nature Neuroscience, 1,
vement in perceiving conspecifics. Psychological Bulletin, 529–533.
131, 460–473. Wolpert, D. M., Miall, R. C., & Kawato, M. (1998b). Inter-
Wolpert, D. M., Ghahramani, Z., & Flanagan, J. R. (2001). nal models in the cerebellum. Trends in Cognitive Sciences,
Perspectives and problems in motor learning. Trends in 2, 338–347.
Cognitive Sciences, 5, 487–494. Zambelli, M., & Demiris, Y. (2015). Online ensemble learning
Wolpert, D. M., Ghahramani, Z., & Jordan, M. I. (1995). An of sensorimotor contingencies. In R. Téllez, G. Alenyà, C.
internal model for sensorimotor integration. Science, 269, Angulo, & J. Kevin O’Regan, (Eds.) Proceedings of the
1880–1882. Sensorimotor Contingencies for Robotics Workshop at
Wolpert, D. M., Goodbody, S. J., & Husain, M. (1998a). IROS. Berlin, Germany: Springer.
Maintaining internal representations: The role of the
Dadai Astorga obtained a Degree in Psychology from the UAEM and a Master’s degree in
Cognitive Sciences by the same university. She is an Active Member of the Cognitive Robotics
Laboratory. Her research is focused on Cognitive Sciences, specifically predictive coding, cogni-
tive robotics, agency, and autism.
Emmanuel Mendoza-Bock obtained a Master’s degree in Cognitive Sciences from the UAEM.
He is currently doing a PhD in Philosophy of Sciences at the National Autonomous University
of Mexico (UNAM), in the area of Philosophy of Cognitive Sciences. He is an Associate Student
of the Institute of Philosophical Investigations of the UNAM (EA-IIFs UNAM). His research is
focused on the philosophical analysis of the epistemic, theoretical, and methodological assump-
tions that underlie the postulates of Embodied Cognitive Sciences.
Manuel Pardo obtained a Degree in Philosophy and a Master’s degree in Cognitive Sciences
from the UAEM. He is an Active Member of the Cognitive Robotics Lab. His research interests
are focused on evolutionary robotics, Cognitive Sciences, and the philosophy of science.
Esaú Escobar received his BSc degree in the area of Computer Sciences from the Sciences
Faculty at UAEM in 2011 and PhD in Computational Modeling and Scientific Computing from
the Science Research Center at UAEM in 2017. He is currently a part-time Professor of
Computer Sciences in the Science Research Center at UAEM. He is an Active Member of the
Cognitive Robotics Lab. His research is focused on the study, development, and application of
Artificial Intelligence and machine learning in the field of cognitive robotics.
Alejandra Ciria obtained a Degree in Psychology from the Universidad de las Américas Puebla
(UDLAP). She obtained her Master’s degree in Cognitive Sciences from the UAEM. Currently,
she is about to obtain a PhD in Psychology, in the area of Experimental Psychology of
Behavior, from the Faculty of Psychology at the UNAM. She is an Active Member of the
Cognitive Robotics Lab. Her current research focuses on the study of Cognitive Sciences, specifi-
cally time perception, visual cognition, predictive processing, cognitive robotics, Internal
Models, and the philosophy of Artificial Intelligence.