You are on page 1of 46

Appendix A

INNOVATIVE METHOD FOR MOTOR LEARNING AND

NEUROLOGICAL REHABILITATION

By Prof. Avi Ohry

August 2008

In the last five years, “Bionics” company has developed a method for motor learning, applied by
means of a unique robotic technology, thus shortening the time needed for motor learning by
patients with neurological injuries by half, as compared to task exercising performed freely, or
aided by existing treatment means.

Existing methods applied to aid motor learning in general, and facilitate neurological
rehabilitation in particular, are based on verbal or physical guidance, requiring capabilities of cognitive
pre-processing in understanding mistakes or “corrective” actions by the patient. The method
behind the innovative technology introduced by “Bionics” is based on a primary and basic
learning phenomenon inherent in animals and humans that takes place during adjustment to a new
environment of forces. In practicing this method, while performing motor task exercises, forces
proportional to the patient’s mistakes are applied to him, enhancing his mistake. These forces
stimulate a primal instinct to adjust and oppose the “mistake enhancing” forces while applying
“corrective” forces, which are updated in the motor memory just like any exercise experience. The
primary advantage of this method is in stimulating automatic response with no preliminary cognitive
processing, which is necessary in traditional motor learning methods and specifically in neurological
rehabilitation, and which is not necessarily intact in patients with neurological damage.

This innovative technology, which combines computerized robotics with virtual reality for optimal
motor challenge, will lead the developing trend in rehabilitative robotics, which has been characterize d,
so far, by automation of existing methods used in rehabilitation, while allowing for independence and
multiple practice cycles. The innovative method and technology adapts itself, automatically or by
design of the rehabilitative system, to the perception and optimal gradual progress of patients
with various types of impairments.

Clinical trials were carried out at “Reut” Medical Center on a group of patients with
neurological damage, mostly following a stroke. The research findings demonstrated improvement
in motor learning and reduction of learning time by more than half the time compared with the
standard rehabilitative process, convenient and efficient interfacing to the method and the system,
and quick and easy integration in the comprehensive rehabilitation process.

Developing the method and the system, combined with findings of the clinical research, give rise
to a scientific breakthrough with ramifications of almost unprecedented scale, constituting a
revolutionary step in neurological rehabilitation in particular, and in motor learning in general, for
many varied fields combining motor learning and motor skills, such as motor development in children,
various sports fields, as well as simulators for fields such as aviation, security and art.
Appendix B

Motivating Rehabilitation by Distorting Reality


Motivating Rehabilitation by Distorting Reality*
James Patton, Yejun Wei, & Robert V. Kenyon
Chris Robert Scheidt
Scharver
Sensory Motor Performance Program Electronic Visualization Laboratory Dept. of Biomedical
(SMPP) (EVL) Engineering
Marquette University University of Illinois at Chicago Rehabilitation Institute of Chicago
Milwaukee, Wisconsin, USA Chicago, Illinois, USA Chicago, Illinois USA 60611
j-patton@northwestern.edu

Abstract – We have found, through a series of recent capabilities of a robot certainly allow for massed practice
experiments, encouraging evidence that the neuro- while simultaneously logging progress.
motor system is motivated to change motor patterns Moreover, the human brain and spinal cord remain
when exposed to visuo-motor tasks. We have also modifiable, even in the adult, and even following many
shown that the learning of these tasks can be brain injuries [16-20]. This neuroplasticity indicates that
heightened with forces and/or visual distortions that the structure and function of the brain can be altered
appropriately manipulate the error. This process does continuously in response to sensory stimulation and
not require intense concentration and it is often changing physical environments. Plasticity is a pivotal
considered a game. We describe the next generation of element of neuroscience and rehabilitation, since it is
dimensional three large-workspace, robotic likely to be the primary mechanism that underlies recovery
haptics/graphics systems for rehabilitation. from chronic neurological illness. Devices that encourage
and facilitate plasticity can also be used with drugs that
Index Terms – learning, adaptation, rehabilitation, further enhance the effects. Thus, it makes good sense to
human, stroke. study new and more efficient treatment involving
technology, robots, and virtual reality.
I. INTRODUCTION
The emergence of new robotic devices designed to
interface with humans has led to great strides in both
fundamental and clinical research on the sensory motor
system. Research has recently answered questions relevant
to rehabilitation, haptics (the study of artificially rendering
touch), motor control, and human-machine interactions.
Most importantly these devices have shown how humans
adapt under altered environmental conditions [1-8]. Here
we focus on experiments and technology to harness the
adaptive process for rehabilitation.
The recovering nervous system, such as in an individual
who has suffered a stroke, is an excellent candidate for
such adaptive training. The surviving stroke population in
the US is over 3 million and growing [9], and roughly one-
third of all individuals who experience a stroke will have
some residual impairment of the upper extremity [10].
Labor costs for rehabilitation comprise roughly 60 to 70%
of what the U.S. spends – about $30 billion per year [11].
If new technology could remove just 5% of the labor costs
on 10% of only the largest population (stroke survivors,
about 30%), the savings would be $300 million. Although
Medicare's 2001 incentives are for shortening the length of
stays and of therapy in hospitals, recent studies support
intensive therapy or “massed practice” for stroke survivors
[12, 13] and the constraint of the less-effected limb [14,
15]. It would appear that the tireless, precise, and swift
Fig. 1. Planar manipulandum robot. Forces are monitored with a load cell
at the handle (ATI F/T Gamma30/100) and encoders record position
(Teledyne Gurley 25/045-NB17-TA-PPA-QAR1S). Motors (PMI
* This work is partially supported by American Heart Association JR24M4CH) render forces at the subject’s hand.
0330411Z, NIH R24 HD39627, NIH 5 RO1 NS 35673, NIH
F32HD08658, NSF BES0238442 and the Falk Trust.
New technology has made possible many new and
imaginative possibilities for promoting adaptation. Robotic
Baseline
systems can be programmed to go far beyond the initial
idea of limb guidance or making the physical system easier
to manage (though these programs are important as well).
Recent research suggests that making conditions more
difficult can trigger functional recovery [21-26] and can
“trick” the nervous system into certain behaviors by giving
altered sensory feedback [27-33]. Interestingly, this
adaptive process appears to bypass conventional learning
mechanisms that require intense concentration -- results
are the same if there is conversation or background music, Early training
and it is often considered a game.
The sections that follow present two examples that have
shown promise for engaging and motivating recovery of
function in individuals that have suffered a neurological
injury.

II. ERROR AUGMENTATION


Several recent robot experiments on both healthy and on
stroke survivors have revealed the encouraging result that After-effects
After-effects
improvements occurred when the training forces tended to
magnify errors but not when the training forces reduced
the errors or when the forces were not present at all [17].
This led us to further investigate by custom designing a
force that was proportional to the error the subjects
initially made [23, 34]. During training, the force amplified
their initial error, but resulted in beneficial outcome (Fig.
2). A few (3 of the 13) subjects did not preserve their Final Final
beneficial after-effects to end of the experiment, (i.e., they
de-adapted much like healthy people do in such
experiments), but the remaining majority of subjects
preserved their benefit for 75 more movements, much
longer than healthy people typically de-adapt (Fig. 3). We
are currently working on a follow up study that involves
repeated visits to determine retention and incremental
gains.
An another experiment in healthy subjects focused
specifically on the type of on the error augmentation
strategy [35]. This revealed new insights for robotic Fig. 2. Motions to one of the targets of a stroke patient in successive
teaching. Four groups of subjects that each trained on the critical phases of an experiment. The thick lines represent average
planar robot (Fig. 1) with different types of error motion; thin lines represent individual reaching motion paths. Shaded
augmentation. Trajectory error from the ideal straight-lined areas are 95% confidence intervals; and dotted lines indicate ideal
trajectories. Training forces (green arrows) were specially designed to
movement were amplified on the visual display with a gain cause a beneficial after effect. Although these forces were turned off for
of *2, by *3.1 or by an “offset” -- a shift in their trajectory 120 movements after training (about 15 minutes) – 8 of the 11 stroke
that did not depend on the current error. subjects retained the benefits (in the Final Phase) much longer than a
We found that error-augmentation improved the rate healthy subject would have retained any adaptation effect. [Adapted from
[34]].
and extent of motor learning of the visuomotor rotation
(Fig 3). Furthermore, our results suggest that both error In summary, distortions that reshape the visual (via a
amplification and offset-augmentation may facilitate display) and mechanical (via a robot) experience can be
neuro-rehabilitation strategies that restore function in brain designed to amplify error, and they result in desired
injuries such as stroke. Interestingly, increasing the amount changes in the motor learning process. These results led us
of error augmentation so that it is too large appears to to a new family of technology that takes these “testbed”
diminish the benefits (*3.1 in Fig. 3). There appears to be experiments on the simple haptic/graphic display system to
several ways that error augmentation is successful in a more to functionally relevant, large workspace, three
speeding up learning. More experiments are needed to train can that system haptic/graphic dimensional
identify optimal conditions that capitalize on this individuals on everyday tasks.
phenomenon.
Time constant of error reduction world. Through adjusting the relative lighting levels under
100
the mirror, subjects are able to view their own limb and the
actual environment, with only the artificial virtual elements
that are needed [50]. Special design attention is given to
brightness (luminosity), field-of-view, and resolution. A
cinema-quality digital projector (Christie Mirage 3000
DLP) displays the images over five-foot-wide 1280x1024
Control (*1) )

*3.1
pixel image resulting in a 110º viewing angle. Infra-red
Movements

emitters synchronize the display of separate left and right


eye images through LCD shutter glasses.
The VRROOM system also integrates an Ascension
Flock of BirdsTM magnetic tracking system that tracks
*2

head position so that the visual display is rendered with the

Offset
appropriate viewer-centered perspective. The magnetic
tracking system currently uses two sensors to track other
body segments with continuous position and orientation
0
Fig. 3. Time constants of error reduction in healthy subjects (mean of
information. We propose to purchase two more sensors so
subjects) for different types of error augmentation. The *2 & Offset that head, back/trunk, shoulder, upper, and lower arms
groups learned in half the time. Error bars indicate the 95% confidence segments can all be tracked. It is important to note our tests
interval for all subjects in the group. have shown that neither the aluminum parts of the PARIS
system nor the electromagnetic radiation from the motors
III. AUGMENTED REALITY TECHNOLOGY THAT of the PHANToM distort the readings of the magnetic
ENGAGES THE PATIENT tracking system.
Much of this research has been constrained by the The VRROOM system also integrates several robotic
limitations of available technologies. Most systems are arms that suit different needs for generating end-effector
small with one or two degrees of freedom and hence do not forces or motions on varying scales. Two PHANToM
allow the complex behavior seen in everyday tasks. They robots (the Omni or the larger 3.0) provide a workspace
involve a visual display that often does not realistically measuring up to 900 x 900 x 300 mm with a maximum
overlay the actual motion. Recent research also supports continuous force of 3 Newtons (N) with transient peaks of
“task-specific activity for rehabilitation,” in which motions runs controller hardware-resident The N. 22
relevant to activities of daily living should be part of asynchronously with the computer, assuring stable,
recovery [16, 36]. In order to achieve significant advances uninterrupted control. The WAM (Barrett Technologies)
in the diverse fields, the next generation of human- can be used for strong impedance control applications that
interface robots must be strong, large, three dimensional, require precisely controlled forces and torques. Finally, the
safe, backdrivable (i.e., allow the user to easily push back) Haptic Master (FCS technologies) can be used for strong
and have an accompanying three-dimensional visual admittance applications that require precisely controlled
interface. Our current development work focuses on such a motions.
system [37].
The Virtual Reality and Robotic Optical Operations IV. SUMMARY, DISCUSSION AND CONCLUSIONS
Machine (VRROOM). VROOM is an integrated system This paper discusses adaptive training to teach
combining virtual reality graphics environment, haptic movements that does not require explicit instruction or a
robotic force feedback, and tracing of limb segments using large amount of attention, and can provide motivation
a magnetic tracking system (Fig. 4). The system’s primary simply by heightening the error and providing an
component is the visual display system, the Personal immersive and engaging experience. Our experimental
(PARIS), System Immersive Reality Augmented results all point to a single unifying theory: the judicious
developed in the Electronic Visualization Lab at the manipulation of error (through forces and/or visual
University of Illinois at Chicago. PARIS is currently the distortions) can lead to lasting desired changes by inducing
highest quality see-through augmented display system adaptation. Interestingly, this process appears to bypass
available. Most virtual reality displays are computationally conventional learning mechanisms that require intense
burdened by rendering an environment with objects that in concentration -- results are the same if there is
the end often do not look that real. Consequently a display conversation or background music, and it is often
that is slow with long latencies can hamper performance considered a game. Based on ours and others’ inspirational
even in the healthy [38-43] and cause motion sickness studies, these systems inevitably should lead the way to
[44]. Furthermore, when one also is controlling a haptic new clinical practices and commercialization.
robotic device, delays can lead to catastrophic instabilities There are several possible causes for why not all
[45, 46]. Our focus along with others [47-49] is on patients appear to retain the benefits of adaptive training.
reducing the amount of processing. PARIS projects First, it is possible that there are secondary, chronic
stereographic images onto a half-silvered mirror, allowing contractures in the peripheral passive tissues, common in
users to view virtual objects superimposed onto the real chronic stroke survivors. Such causes cannot be attributed
to faulty motor programming and hence cannot be learning process [19, 59-62]. Hence, going beyond virtual
manipulated using adaptation. Second, shifts in movement reality to distorted reality such as error augmentation is
patterns may have been “buried in the noise” of motor currently of great interest to our group [37].
variability because higher variability is common for stroke
survivors [17, 51-54]. ACKNOWLEDGMENT
Thanks to Sandro Mussa-Ivaldi for his insights and
participation in some of the studies mentioned. The authors
thank Xun Lou for his assistance in some of the computer
science and William Townsend of Barrett Technologies for
some of his input on robotic theory.

REFERENCES
[1] M. A. Conditt, F. Gandolfo, and F. A. Mussa-Ivaldi, "The motor
system does not learn the dynamics of the arm by rote memorization
of past experience," Journal of Neurophysiology, vol. 78, pp. 554-
560, 1997.
[2] R. Shadmehr and F. A. Mussa-Ivaldi, "Adaptive representation of
dynamics during learning of a motor task," Journal of Neuroscience,
vol. 14, pp. 3208-3224, 1994.
[3] R. Shadmehr and Z. M. Moussavi, "Spatial generalization from
learning dynamics of reaching movements," Journal of
Neuroscience, vol. 20, pp. 7807-15, 2000.
[4] D. M. Wolpert, Z. Ghahramani, and M. I. Jordan, "Are arm
trajectories planned in kinematic or dynamic coordinates? An
adaptation study," Experimental Brain Research, vol. 103, pp. 460-
70, 1995.
[5] R. Osu, E. Burdet, D. W. Franklin, T. E. Milner, and M. Kawato,
"Different mechanisms involved in adaptation to stable and unstable
dynamics," J Neurophysiol, vol. 90, pp. 3255-69, 2003.
[6] K. E. Novak, L. E. Miller, and J. C. Houk, "Features of motor
performance that drive adaptation in rapid hand movements,"
Experimental Brain Research, vol. 148, pp. 388-400, 2003.
[7] T. E. Milner, "Adaptation to destabilizing dynamics by means of
muscle cocontraction," Experimental Brain Research, vol. 143, pp.
406-16, 2002.
[8] D. W. Franklin and T. E. Milner, "Adaptive control of stiffness to
stabilize hand position with large loads," Exp Brain Res, vol. 152,
pp. 211-20, 2003.
[9] J. Broderick, S. Phillips, J. Whisnant, W. O'Fallon, and E.
IV. Bergstralh, "Incidence of rates of stroke in the eighties: the end of
the decline in stroke," Stroke, vol. 20, pp. 577-582, 1989.
DISCUSSION [10] C. Gray, J. French, D. Bates, N. Cartilidge, O. James, and G.
Fig. 4 Design concept of the VRROOM system, and the actual system Venables, "Motor recovery following acute stroke," Age Aging, vol.
being used. The subject should be able to either stand or sit in front of a 19, pp. 179-184, 1990.
large-workspace, 3-D robotic device and an accompanying 3d display that [11] D. B. Matchar, P. W. Duncan, G. P. Samsa, J. P. Whisnant, G. H.
allows the user to also see their own limb. DeFriese, D. J. Ballard, J. E. Paul, D. M. Witter, Jr., and J. P.
Mitchell, "The Stroke Prevention Patient Outcomes Research Team.
Goals and methods," Stroke, vol. 24, pp. 2135-42, 1993.
The reasons why adaptive training shows promise are [12] J. Sivenius, K. Pyorala, O. Heinonen, J. Salonen, and R. P, "The
not yet clear. One possible reason is that stroke survivors significance of intensity of rehabilitation of stroke - a controlled
have fewer remaining motor pathways, and their new trial," Stroke, vol. 16, pp. 928-31, 1985.
descending motor command signals are only a subset of [13] E. Taub, G. Uswatte, and R. Pidikiti, "Constraint-Induced
Movement Therapy: a new family of techniques with broad
their pre-injury signals, and are therefore inappropriate. application to physical rehabilitation--a clinical review. [see
Adaptive training may bring about a “motor epiphany,” comments]," Journal of Rehabilitation Research & Development,
much like how a coach gets an athlete to try the proper vol. 36, pp. 237-51, 1999.
strategy. It is also possible that spastic activity can be [14] E. Taub, N. Miller, T. Novack, E. d. Cook, W. Fleming, C.
Nepomuceno, J. Connell, and J. Crago, "Technique to improve
reduced by such repeated training. Another possible reason chronic motor deficit after stroke," Archives of physical medicine
may be that such learning is implicit, bypassing the areas and rehabilitation, vol. 74, pp. 347-354, 1993.
of the brain that are affected by the injury. Implicit [15] R. J. Nudo, "Recovery after damage to motor cortical areas,"
learning involves more primitive neural pathways [55-57], Current Opinion in Neurobiology, vol. 9, pp. 740-7, 1999.
[16] R. J. Nudo and K. M. Friel, "Cortical plasticity after stroke:
with more automatic recall [58]. Finally, the impaired implications for rehabilitation," Revue Neurologique, vol. 155, pp.
nervous system may require larger errors to begin to 713-7, 1999.
change, and adaptive training may “wake up” the learning [17] J. L. Patton, M. E. Phillips-Stoykov, M. Stojakovich, and F. A.
process. Intensifying error also leads to larger signal-to- Mussa-Ivaldi, "Evaluation of robotic training forces that either
enhance or reduce error in chronic hemiparetic stroke survivors,"
noise ratios for sensory feedback and self-evaluation. In Experimental Brain Research, vol. In press, 2005.
many artificial neural networks, the error signal drives the
Appendix C

Evaluation of robotic training forces that either


enhance or reduce error
in chronic hemiparetic stroke survivors
Exp Brain Res (2006) 168: 368–383
DOI 10.1007/s00221-005-0097-8

RESEA RCH ARTICLE

James L. Patton Æ Mary Ellen Stoykov Æ Mark Kovic


Ferdinando A. Mussa-Ivaldi

Evaluation of robotic training forces that either enhance or reduce error


in chronic hemiparetic stroke survivors

Received: 9 March 2004 / Accepted: 21 June 2005 / Published online: 26 October 2005
Springer-Verlag 2005

Abstract This investigation is one in a series of studies could adapt, as evidenced by significant after-effects.
that address the possibility of stroke rehabilitation using After-effects were not correlated with the clinical scores
robotic devices to facilitate ‘‘adaptive training.’’ Healthy that we used for measuring motor impairment. Further
subjects, after training in the presence of systematically examination revealed that significant improvements oc-
applied forces, typically exhibit a predictable ‘‘after-ef- curred only when the training forces magnified the ori-
fect.’’ A critical question is whether this adaptive char- ginal errors, and not when the training forces reduced
acteristic is preserved following stroke so that it might be the errors or were zero. Within this constrained experi-
exploited for restoring function. Another important mental task we found that error-enhancing therapy (as
question is whether subjects benefit more from training opposed to guiding the limb closer to the correct path)
forces that enhance their errors than from forces that to be more effective than therapy that assisted the sub-
reduce their errors. We exposed hemiparetic stroke ject.
survivors and healthy age-matched controls to a pattern
of disturbing forces that have been found by previous Keywords Human Æ Motor learning Æ Adaptation Æ
studies to induce a dramatic adaptation in healthy Control Æ Force fields Æ Robots Æ Haptics Æ Human–
individuals. Eighteen stroke survivors made 834 move- machine interface Æ Teaching Æ Rehabilitation Æ Stroke Æ
ments in the presence of a robot-generated force field Hemiparesis Æ Impairment Æ Lesion Æ Cortex
that pushed their hands proportional to its speed and
perpendicular to its direction of motion — either
clockwise or counterclockwise. We found that subjects Introduction

J. L. Patton (&)
It is appealing to consider using machines for the reha-
Sensory Motor Performance Program, Rehabilitation bilitation of brain-injured patients. Machine-assisted
Institute of Chicago, Physical Medicine & Rehabilitation, training can be highly accurate, can be sustained for very
Mechanical and Biomedical Engineering, long periods of time, can measure progress automati-
Northwestern University, 345 East Superior St., Room 1406, cally, and can produce a wide range of forces or mo-
Chicago, IL 60611, USA
E-mail: j-patton@northwestern.edu tions. Repetitive practice of the impaired limb has been
Tel.: +1-312-2381232 shown to be beneficial in improving functional ability
Fax: +1-312-2381232 (Wolf et al. 1989; Taub et al. 1993, 1999; Taub 2000).
Beyond the recommended therapy that strengthens and
M. E. Stoykov Æ M. Kovic
Sensory Motor Performance Program, Rehabilitation
stretches (Delisa and Gans 1993) lies the possibility of
Institute of Chicago, Occupational Therapy, neurofacilitation, or neuromuscular re-education
University of Illinois at Chicago, Chicago, IL, USA through techniques that incorporate our knowledge of
E-mail: maryp606132001@yahoo.com the neural circuitry. An important question is which
E-mail: sookovic@sbcglobal.net method, among the repertoire of possibilities, is best for
F. A. Mussa-Ivaldi motor recovery? Encouraging research (Patton et al.
Sensory Motor Performance Program, Rehabilitation 2001a, b; Dancausea et al. 2002; Patton and Mussa-
Institute of Chicago, Physiology, Physical Ivaldi 2003; Takahashi and Reinkensmeyer 2003) sug-
Medicine & Rehabilitation, Institute for Neuroscience, gests one method may be adaptive training, in which the
Mechanical and Biomedical Engineering,
Northwestern University, Chicago, IL, USA
natural adaptive tendencies of the nervous system are
E-mail: sandro@northwestern.edu used to facilitate motor recovery. This paper investigates
a critical question of the feasibility of adaptive training: error is likely to be a driving signal for adaptation and
Do stroke survivors preserve their ability to adapt? We learning (Rumelhart et al. 1986; Lisberger 1988; Ka-
also address the question of which type of training forces wato 1990; Dancausea et al. 2002). In a walking study,
are best and whether the effects of adaptation last after subjects significantly reduced the time required to
the forces are removed. predict the applied force field by approximately 26%
There have been a few promising preliminary studies when the field was transiently amplified (Emken and
on neurorehabilitation using mechatronic and robotic Reinkensmeyer 2005). Others have also emphasized
devices. A two-degree-of-freedom (DOF) robot manip- augmented or amplified error in the therapeutic pro-
ulator (similar to the one used in this study) was used to cess. (Winstein et al. 1999; Brewer et al. 2004; Emken
train stroke survivors in shoulder and elbow movement and Reinkensmeyer 2005). However, for such an er-
by moving the hand and forearm of the patient in the ror-enhancing, adaptive technique to work, the pa-
horizontal plane (Krebs et al. 1998b, 2000; Volpe et al. tient’s ability to adapt must be preserved following the
2000, 2001). This is an assistive form of therapy that injury.
guides the arm along the desired path and is different Recent studies of motor adaptation in healthy
from the strategy presented in this paper. Clinical testing individuals have demonstrated the excellent potential
of assistive training has been underway for several years, of the natural adaptive process in altering motor
and results have shown improved patient performance patterns. When people are repeatedly exposed to a
(Krebs et al. 1998b) with benefits lasting more than robot-generated force field applied to the hand (forces
3 years (Krebs et al. 1999b). This training has led to as a function of hand position and/or hand velocity)
increased clinical scores and greater gains in proximal that systematically disturbs limb motion, they are able
arm strength and greater recovery of functional inde- to recover their original kinematic patterns over a short
pendence (Volpe et al. 2000). period of practice (Shadmehr and Mussa-Ivaldi 1994).
An industrial robot (Puma 560) was also used to Subjects do this by cancelling the disturbance with an
apply forces to the paretic limbs of stroke survivors appropriate preplanned pattern of forces. This is a
through a customized forearm attachment (Burgar et al. form of feedforward control that is revealed by char-
2000; Lum et al. 2002). The robot could move the limb acteristic after-effects: when the disturbing force field is
to a target, applying spring-like forces toward the target unexpectedly removed, subjects make erroneous
or mirror the contralateral limb movements. These re- movements in directions opposite to the perturbing
sults provide convincing evidence that supplemental forces. Adaptation and its related after-effects have
robotic therapy can improve recovery. They do not, been demonstrated for different types of force fields,
however, indicate which type of robotic treatment offers ranging from simple position-, velocity-, and accelera-
the greatest advantage. Previous approaches have at- tion-dependent force fields (Bock 1990; Flash and
tempted to mimic the actions of the therapist by using a Gurevitch 1992; Shadmehr and Mussa-Ivaldi 1994;
robot to apply error-decreasing, assisting forces. In fact, Gandolfo et al. 1996; Conditt et al. 1997) to Coriolis
Kahn and colleagues (Kahn et al. 2001) suggested that forces caused by moving in a rotating room (Lackner
reducing errors during reaching training with a robotic and DiZio 1994) to skew-symmetric ‘‘curl’’ fields that
device does not provide any added benefit compared to produce forces in a direction perpendicular to the
repetitive reaching training where errors were allowed. velocity of the hand (Gandolfo et al. 1996). Similar
This paper tests an alternative approach that enhances results have also been observed after manipulations of
error and can only be implemented by a computer- visual perception that altered the visual feedback of
controlled device. movement (Held and Freedman 1963; Miall et al. 1993;
Interestingly, several theories have been proposed Pine et al. 1996; Krakauer et al. 1999).
for clinical treatment. Some sources suggest that pro- More recent studies support the view that subjects
viding manual guidance during reaching may facilitate adapt by learning the appropriate internal model of the
rehabilitation (Bobath 1978). Other theories advocate perturbing force field rather than learning an appropri-
using a component of resistance in a direction oppo- ate temporal sequence of muscle activations (Gandolfo
site to movement during diagonal reaching patterns et al. 1996; Conditt et al. 1997). Using this internal
(Voss et al. 1985). Although these approaches are in model, subjects are able to predict the effects of the
some ways mutually exclusive, their efficacy has not external field along a desired movement and use a
been tested objectively, and the most effective reha- feedforward control strategy (also called anticipatory
bilitation algorithm(s) have yet to be determined. New control) (Hemami and Stokes 1982; Ghez 1991). Mod-
techniques are currently being explored. For example, eling techniques have been successful in predicting both
one possible technique is to provide assistance by how the arm is disturbed by a force field and the after-
guiding (pulling) the hand toward the desired trajec- effects of training (Shadmehr and Mussa-Ivaldi 1994;
tory (Volpe et al. 1999; Lum et al. 2002). Another Kawato and Wolpert 1998; Bhushan and Shadmehr
possible technique is to provide resistance by either 1999). If this is true, one possible method for rehabili-
opposing the hand as it moves (Stein et al. 2004), or tation may be to use models to design the appropriate
by imposing forces that amplify the error. The latter force field that will result in beneficial after-effects.
method is justified by the observation that movement Again, this would only work if stroke survivors can
Table 1 Subject characteristicsVertical bars on the left side indicate the subjects that made repeat visits to the lab and participated in zero-
force experiments

adapt. Furthermore, after-effects would also have to be effects is preserved following cortical stroke. Reaching in
permanently retained for this approach to have reha- individuals with stroke is characterized by errors that
bilitative significance. reflect their poor ability to manage the interaction tor-
There are several recent studies providing encourag- ques (Beer et al. 2000). Aspects of reaching in cortical
ing evidence that the ability to adapt and exhibit after- stroke survivors resembles that of cerebellar patients
(Bastian et al. 1996). Adaptation and after-effects in Furthermore, we found that final improvements were
stroke survivors can be observed in the oculomotor most evident when the training forces magnified rather
(Weiner et al. 1983) and limb-motor systems (Raasch than reduced the original error. This study provides
et al. 1997; Dancausea et al. 2002). In fact, prism encouraging evidence that adaptive training could pro-
adaptation has been shown to trigger the recovery from vide an effective supplement to conventional therapy.
hemispatial neglect following stroke (Rossetti et al. This research was presented in preliminary form at the
1998). Stroke-related damage in the sensorimotor areas Society for Neuroscience meeting, 2001 (Patton et al.
appears to effect the processes underlying the control 2001b).
and execution of motor skills but not the learning of
those skills (Winstein et al. 1999). Recently, stroke sur-
vivors showed the ability to adapt to small elbow mo- Methods
tions that were disturbed by a spring-like force
disturbance (Dancausea et al. 2002). However, severely Experiments
affected individuals used atypical correction strategies.
Another recent study evaluated the ability to adapt to a Research was approved by the Northwestern University
sideways force during forward motions and found that Internal Review Board to conform to ethical standards
the after-effects were less evident in stroke survivors with laid down in the 1964 Declaration of Helsinki and fed-
more severe impairment (Takahashi and Reinkensmeyer eral mandates that protect research subjects. Before
2003). Furthermore, preliminary studies in our labora- beginning, each subject signed a consent form that
tory on stroke survivors have revealed that the after- conformed to these Northwestern University guidelines.
effects may persist longer when the after-effects resemble Twenty-seven stroke survivors, aged 30–72 years (mean
healthy unperturbed movements (Raasch et al. 1997; age 51), and four healthy controls, aged 32–61 years
Patton et al. 2001a, b). (mean age 47) volunteered to participate (Table 1). All
While conventional skill learning, such as learning to stroke participants were in the chronic stage, having
play a musical instrument, requires more conscious suffered a stroke 16–173 months prior to the experiment.
attention in order to achieve a goal, neuromotor adap- Our exclusion criteria were: 1) bilateral impairment, 2)
tation has been argued to be closely related to proce- severe sensory deficits in the limb, 3) aphasia, cognitive
dural learning and thus to be a form of implicit learning impairment or affective dysfunction that would influence
(Krebs et al. 2001). Hence, these learning mechanisms the ability to comprehend or to perform the experiment,
may offer an effective alternative to conventional 4) inability to provide an informed consent, and 5) other
methods of rehabilitation. Implicit learning takes place current severe medical problems.
without awareness of what has been taught (Squire Eight additional subjects were recruited whose data
1986), and often does not require complete conscious did not reach final analysis (not shown in Table 1). Of
attention. One example is procedural motor learning of these, one chose to abort the experiment due to his own
a motor sequence that is embedded in a seemingly ran- frustration (SA15); one healthy subject chose to quit
dom set of movements (Fitts 1964; Squire 1986; Gomez because of time constraints (SA23); two healthy controls
Beldarrain et al. 1999; Seidler et al. 2002). Another and three stroke patients had lost data due to technical
example is sensory-motor adaptation observed in force problems with the robotic device or data collection
field paradigms (Krebs et al. 2001). Following unilateral (SA26, SA30, SA31, SA39, and SA 44); one stroke
stroke, recent evidence suggests that learning is facili- subject had such a poor elbow movement that she was
tated by providing explicit information about the task not capable of completing the experiment (SA41).
that can enhance implicit motor learning (Boyd and Subjects held the free-extremity (here referred to as
Winstein 2001, 2003). If one could demonstrate that the ‘‘endpoint’’) of a DOF robot (Fig. 1) described
stroke subjects readily adapt to force training, and that elsewhere (Conditt et al. 1997; Scheidt et al. 2000).
beneficial after-effects persist, a new family of rehabili- Endpoint forces and torques were monitored with a six-
tation strategies would emerge. degree-of-freedom load cell fixed to the handle of the
To begin exploring the possibility of exploiting im- robot (Assurance Technologies Inc., model F/T Gamma
plicit learning mechanisms for poststroke rehabilitation, 30/100). The robot was equipped with position encoders
this paper explores the features of motor adaptation in that were used to record the angular position of the two
chronic stroke survivors during the execution of planar robotic joints with a resolution exceeding 20 arc/s of
multijoint movements that are disturbed by a force field. rotation (Teledyne Gurley, model 25/045-NB17-TA-
We studied six movement directions using a two-joint PPA-QAR1S). The position, velocity and acceleration of
planar robotic device, exposing hemiparetic stroke sur- the handle were derived from these two signals. Two
vivors and healthy age-matched controls to a force field torque motors were used to apply programmed forces to
that is commonly known to induce unequivocal adap- the hands of the subjects (PMI Motor Technologies,
tation in healthy individuals. We found that stroke model JR24M4CH).
survivors do adapt, albeit at a diminished level com- Subjects were seated so that the center of the range of
pared to healthy controls, and this capacity to adapt was targets — lying approximately at the center of their
not related to clinical scores of motor impairment. reachable workspace — was aligned with the shoulder,
in the proximal-distal direction (y-axis) (Fig. 1, right). where F is the vector of forces, x_ is the vector of
The experiment involved only the hemiparetic limbs of instantaneous velocity, and A is the gain. With this type
the stroke subjects, and this corresponded to nondomi- of field, the forces are always orthogonal to the velocity
nant limbs in 17 of the 27 subjects (See Table 1). Sub- of the hand and form either a clockwise or counter-
jects were asked to reach visual targets so that they made clockwise circulating pattern in the space of hand
a series of random 10 cm movements to the vertices of a velocities. This phase of 372 trials was subdivided into a
triangle. If subjects had difficulty in reaching the vertices block of 30 trials (five in each direction) at the beginning
of the triangle, we adjusted their chair position. To avoid and end of training for statistical analyses. After a sec-
fatigue, their elbow and forearm rested on a lightweight ond series of 240 trials, there was a rest period of
frictionless linkage (Fig. 1, left), and they could choose approximately 1 min while data collection equipment
to rest between movements (subjects rarely rested longer was reset, followed by a block of 72 trials and then the
than a few seconds every hundred movements). We final 30 trials (see Figs. 3 and 4).
controlled for a peak speed of 0.288 m/s by giving sub- During the learning phase, half of the subjects expe-
jects feedback at the end of each movement using col- rienced A = 20 NÆs/m (corresponding to clockwise for-
ored dots and auditory tones to let subjects know if they ces), and half experienced A = 20 NÆs/m
were going too fast, too slow, or within a range of (corresponding to counterclockwise forces) (c.f. Ta-
±0.05 m/s. Consequently, subjects’ speeds remained ble 1).
roughly constant throughout the entire experiment. Some of these subjects also served as their own con-
All subjects performed a total of 834 movements trols as subjects in the zero-force group (Table 1). Prior
(trials), broken down into the following experimental to experiencing any force field, some visited the lab and
phases: performed the same experiment without any forces ap-
plied. These subjects were used to determine if practicing
– Unperturbed familiarization: 60 movements (approxi-
the motion alone (without any forces) led to a beneficial
mately 5 min) to become familiar with the system and
outcome.
with the task of moving the manipulandum.
– Unperturbed baseline: 30 movements (approximately – After-effects: 240 movements (approximately 16 min),
2 min) to establish a baseline pattern of reaching with random, intermittent removal of the force field
movements. for one in eight of the trials (catch trials) to determine
– Learning: 372 movements (approximately 25 min) the after-effects.
with constant exposure to the ‘‘curl’’ force field, – Training refresher: Twelve movements (approxi-
governed by: mately 1 min), identical to the learning Phase.
– Washout: 120 movements (approximately 8 minutes),
0 A all without forces.
F ¼ x_
A 0

Fig. 1 Subjects positioning and


the experimental apparatus.
Two brushed DC torque
motors (PMI model
JR24M4CH, Kolmorgen
Motion Technologies, NY,
USA) control forces at a handle
via a 4-bar linkage. Rotational
digital encoders (model 25/045-
NB17-TA-PPA-QAR1S,
Teledyne-Gurley, Troy, NY,
USA) report absolute angular
position, and a 6-axis force/
torque sensor reports the
interface kinetics (Assurance
Technologies, Inc., TI F/T
Gamma 30/10, NC, USA). A
PC acquires the signals and
controls torque
a. Unperturbed baseline b. Early training c. Fi nal training

d. After-effects e. Final washout

(Stroke subject sa38)

Fig. 2 Movement paths for a mildly impaired stroke subject in intermittently in catch trials to test for after-effects (d). Finally,
successive phase of the experiment. For clarity, the starting points subject moves for 120 movements without forces, and the after-
of the triangle pattern in Fig. 1 were shifted to display all starting effects ‘‘wash out.’’ Results from the final 15 movements of the
points at the center. The ideal trajectories are the bold dotted lines, washout phase are shown in (e). Arrows indicate the movement
the average trajectories are represented as bold solid lines, and direction that showed the largest error in the baseline trials. This
individual trajectories are thin lines. The subject first performs movement error was amplified by the force field during training,
without force disturbances (a), and then experiences a prolonged but resulted in a reduction of error following training (d) that was
training (b and c). The training forces are then turned off sustained until the end of the experiment (e)

Subjects were also required to take breaks (approxi- often make excessively large corrections later in their
mately 1–2 min) after movements 307 and 650 so that movements that may depend on earlier errors (Krebs
our data collection equipment could be reset. The et al. 1999a; Beer et al. 2000). Second, we were primarily
movements in each direction were divided equally in interested in the early phase of the movement that best
each phase. At all times during the experiment, an reflects the operation of a feedforward controller based
additional set of ‘‘background’’ torques was generated on an internal model of the arm/environment dynamics.
to remove the inertial effects of the robot arm linkage, Our measure, the initial direction error, reflected this
resulting in the feeling of movement on a slippery sur- early phase of movement by forming a vector from the
face when the force field was not present. Motion and start point to 25% of the distance to the target (2.5 cm).
force data were collected at 100 Hz. This corresponded to approximately the first 200–
300 ms of movement that, if there were no error cor-
rections at the end of the movement, would last about
Analysis 1.1 s. Positive error corresponded to a counterclockwise
rotation from the actual trajectory to the desired tra-
We restricted our focus in this study to the earliest parts jectory, and zero corresponded to a straight-line to the
of movements for two reasons. First, stroke survivors target. Initial direction error was used for testing our
Fig. 3 Group results for stroke survivors showing error between lines represent 95% confidence intervals and means, respectively,
the actual trajectory and a straight-line movement. Each block of for the group. Small symbols and vertical thin lines show the
data along the horizontal axis represents a successive phases of the individual subject means and 95% confidence intervals. All phases
experiment. The training phase in which the force field was applied of the experiment that underwent statistical analyses contained 30
to the subject are indicated by the light shading in the background. trials (5 in each direction). A rest period (approximately 1–2 min)
Baseline is compared to after-effects to test our hypothesis that the occurred during the learning phase after a second series of 240 trials
method shifts trajectories toward the desired, resulting in a while data collection equipment was reset
significant reduction in error. Dashed lines shaded areas and dashed

hypotheses on the feedforward controller and also was direction from baseline to after-effects and for the ten-
found to be highly correlated with the perpendicular dency of the after-effects to disappear in the washout
distance measure used in other adaptation studies phase.
(Conditt et al. 1997; Scheidt and Rymer 2000; Thor-
oughman and Shadmehr 2000).
To quantify adaptation, we established the Adapta- Results
tion capacity, defined as the average shift in initial
direction from unperturbed baseline trials to the after- We found that both the healthy and the stroke subjects
effects catch trials. All hypotheses were tested using an demonstrated a clear ability to adapt when these subjects
alpha level of 0.05. We tested for a shift in initial moved their hands in the force field. The force field
Fig. 4 Group results for the same experimental conditions as in Fig. 3 but performed on a group of healthy subjects

significantly disturbed hand movement (Fig. 2b). Initial After training, the disturbance was unexpectedly re-
direction error did not significantly decrease after 330 moved and the initial direction error shifted signifi-
movements of practice (Fig. 2; compare b to c). After- cantly, exhibiting a clear after-effect of adaptation
effects were evident when the forces were removed (Fig. 2d). Both stroke and healthy subjects showed a
(Fig. 2d). These results were evident as a group as well marked shift in their initial direction that was opposite
(Fig. 3), with a significant shift in initial direction error to the direction seen when they were initially exposed to
from baseline to after-effects. As anticipated, we de- the force field (compare Figs. 3 and 4; compare phases
tected no significant change in any subjects’ movement 2–6). On average, the stroke survivors’ limb movements
speeds across the phases of the experiment, although were initially perturbed by the same amount (Fig. 3,
peak speeds were slightly but significantly below target Phase 2) as the healthy controls (Fig. 4, Phase 2). In
for stroke subjects (average of 0.218 m/s, P<0.05) but contrast, the after-effect of the stroke subjects was sig-
not for healthy subjects (average of 0.279 m/s). Speeds nificant (Fig. 3, Phase 6) but it was also significantly
for some movements were as high as twice the target, smaller than the healthy subjects by about 26% (Fig. 4,
resulting in peak forces in these extreme cases as high as Phase 6). All of the healthy subjects’ after-effects were
10 N. The maximum speeds 5th, 50th, 95th, and 99th significantly shifted from baseline, and while stroke
percentiles for stroke patients were: 0.101, 0.201, 0.361, subjects’ shifts were all in the direction one would expect
and 0.469 m/s, respectively, and for healthy they were as an after-effect, only 10 of the 18 had shifts that were
0.091, 0.255, 0.437, and 0.469 m/s, respectively. statistically significant (P<0.05 in individual t tests).
Stroke subjects as a group, however, showed a signifi- While Figs. 3 and 5 indicate a clear and positive an-
cant shift (P<0.05 in a paired t test of subject averages). swer to the question about whether stroke survivors can
Note that stroke survivors’ errors during baseline adapt, these data do not reveal how the training forces
appear similar to those of the healthy subjects (compare might restore function. In an initial attempt to shed
Figs. 3 and 4, Phase 1) because these figures display the some light on this question, our analysis program
average overall movement directions. Stroke survivors’ identified each subject’s directions of largest error — the
larger errors are revealed only by the larger error bars. directions that could be improved upon the most. Dur-
Stroke subjects were also more variable within move- ing training, these movement errors were either magni-
ment directions. Trial-to-trial standard deviations of the fied or reduced by the forces (depending on how the
initial direction error during the baseline phases was force field happened to be pushing for each subject and
nearly three times higher for the stroke subjects (the each direction). Movement directions of largest errors
average of the stroke subject’s standard deviations was (e.g., movements indicated by the arrows in Fig. 2) were
16.2° compared to 5.8° for healthy; P<0.001). It is im- only selected if they had significant error to begin with.
portant to stress that the goal of this study was to merely For each subject, our software selected and analyzed up
test for adaptation and not to correct the trajectories to two movement directions. The first was the direction
with a specially designed force field. Therefore, some that showed the largest initial error that was reduced by
after-effects were in the wrong direction (i.e., the errors the forces, while the second was the direction that
were amplified), and some were shifted beyond a straight showed the largest initial error that was amplified by the
path to the target. forces. If no significant error was present to begin with,
The observed shift in direction from baseline the movement was not considered. To determine the
(unperturbed) to after-effects (also unperturbed) is an amount of error amplification/reduction, we calculated
excellent indication of motor adaptation. Absence of the dot product between the average training force
adaptation would result in a zero shift. In contrast, direction and the average movement error direction
adaptation capacity, as quantified by the amount of this (horizontal axis on Fig. 6). Positive values of this dot
shift, was well above and significantly greater than zero product indicated that the training forces tended to
for stroke survivors (in Fig. 5, the wings indicate a 95% magnify the error, while negative values indicated that
confidence interval). Nevertheless, adaptation capacity the training forces tended to reduce the error.
was slightly but significantly larger for healthy subjects By the end of the experiment (five final movements in
(Fig. 5, wings). We detected no difference in adaptation each direction), after the forces had been turned off for
capacity between the subjects that received clockwise 120 movements (final washout phase), we found that the
and the subjects that received counterclockwise forces. relationship between error magnification and the
Stroke survivors’ changes in performance were not as improvement was significant (vertical axis on Fig. 6).
large as that of the healthy subjects for the same amount Significant improvements occurred only when the
of practice (compare Figs. 3 and 4), suggesting that training forces magnified the original errors, and not
longer training might also result in more persistent when the training forces reduced the errors or they were
adaptation in patients. zero [F(1,13)=4.29, P<0.001]. Because three of the
To further explore the possibility that individuals points were further than two standard deviations from
with stroke might have a decreased learning rate, we fit the mean and appeared as outliers (Fig. 6, indicated by
each subject’s learning phase data to a simple expo- small horizontal arrows), we reran the analysis without
nential model, them, but obtained the same results — error-amplifica-
tion training resulted in significant improvement and
A þ Be t/C error-reduction training resulted in significant detriment
where A is an offset, B the amount of learning (the by the end of the experiment. In summary, by restricting
change of the trajectory errors), C the time constant for our attention only to trials that had significant error to
the error to decrease (inversely related to rate of learn- begin with, and then by separating movement directions
ing), and t is the movement number. However, we found into error-reducing or error-enhancing training, the
no significant differences in either the time constant or evidence suggests that the error-enhancing forces may be
the amount of learning between the healthy subjects and more effective than the error-reducing forces for cor-
individuals with stroke. recting the initial movement direction of the hand.
Adaptation capacity was not found to correlate well
with the clinical measures of Elbow Modified Ashworth
(r2=0.0013876, P>0.8), Chedoke (r2=2.7943e-006, Discussion
P>0.9), and the upper extremity portion of the Fugl–
Meyer (r2=0.0040592, P>0.8) (Fig. 5b). Hence, it may This study provides new evidence that stroke survivors
be difficult to predict a person’s ability to adapt based on retain their ability to adapt their arm movements when
these clinical scores. They also fail to show any evidence they are exposed to an altered mechanical environment
that the severely impaired individuals lack the ability to (a force field), although at a somewhat diminished level.
adapt. This is evidenced by the Adaptation Capacity measure
b

Fig. 5 a Adaptation capacity, in degrees, for the healthy and the intervals of the group. Stroke survivors show a large capacity to
stroke survivor groups, determined by calculating each subject’s adapt, but not as strong as the healthy group. b Correlation of the
average change in initial direction error from the baseline phase adaptation capacity (horizontal axes) with three clinical scores that
and the after-effects phase. Wings represent 95% confidence were measured in this study. Each stroke survivor represents a dot

(Fig. 5). Moreover, for movement directions that begin ours in that brain-injured individuals needed more trials
with significant errors, significant improvement occurred to diminish errors and their movements were more
only when the training forces magnified the original er- variable.
rors (Fig. 6). Takahashi and Reinkensmeyer have also recently
With regard to the question of whether stroke sur- published new results on adaptation after stroke
vivors can adapt, this study confirms earlier studies (Takahashi and Reinkensmeyer 2003). Although their
conducted by others. Raasch and colleagues (Raasch study found that the ability to adapt was somewhat
et al. 1997) performed a preliminary study that provided diminished in stroke survivors, they found that the
encouraging evidence for adaptation in stroke survivors. ability to adapt was loosely correlated to clinical scores
In that study, a force was imposed during hand motion such as the Chedoke Mc-Master score. We found no
that pushed subjects’ hands away from the desired path such correlation in our group of subjects. Several
with a magnitude proportional to hand speed, a proto- important differences between our study and theirs may
col similar to that used in this study. A single-joint study account for the discrepancies in our results. (1) Our
by Dancausea and colleagues (Dancausea et al. 2002) study investigated mostly shorter movements. There-
demonstrated the ability of stroke survivors to adapt to fore, a larger proportion of the movement we observed
a spring-like force. Their results were consistent with was likely controlled by early feedforward (and hence
SA Subjects

Performance Improvement vs Enhancement of error


40

20
Improvement in initial direction (degrees)

Training Forces
Magnified Errors

20

Zero
Forces

Training Forces
Reduced Errors
40
1 0.5 0 0.5 1
Direction of Average error dotted with Direction of Average Training Force

Fig. 6 Cross plot of the performance improvement versus error the training forces tended to reduce the error. Boxes with horizontal
magnification caused by the force field for different movement centerlines represent the mean and 95% confidence intervals of the
directions on the stroke survivors. Performance improvement was three distinct groups: (1) the group in which the error was
calculated by measuring the reduction initial direction error from magnified during training (right), (2) the control group in which
the baseline phase to the final phase of the experiment. Positive error was unchanged because no forces were applied (center), and
represents an improvement in performance. Error magnification (3) the group in which error was reduced during training (left).
was determined by calculating the dot product between the average Vertical whiskers extending from the box plots indicate 2-standard
training force direction and the average movement error direction. deviations from the mean. The diagonal line represents linear least-
Positive values of this dot product indicated that the training forces squares regression fit of the data shown
tended to magnify the error, while negative values indicated that

internal-model driven) components of control. Takah- degree by factors such as contractures and spasticity that
ashi and Reinkensmeyer allowed the movement lengths are unrelated to the presence or absence of internal
to vary from subject to subject. Furthermore, in many models. We attempted to minimize these factors in our
cases the movements were close to the edge of the sub- study by targeting the midregion of the arm’s work-
jects’ achievable workspace. These longer and more space. (2) While Takahashi and Reinkensmeyer com-
distal movements may have been effected to a larger pared movement adaptation between the paretic and the
nonparetic limb, we compare adaptation in paretic impossible to keep the three groups of subjects fully
subjects to adaptation in healthy and age-matched balanced in all ways. For example, it is possible that the
controls. Indeed, individuals who have suffered a stroke clockwise force group had subjects with slightly higher
do not perform as well with their unimpaired arm as spasticity (see Table 1).
age-matched healthy subjects, but they tend to show the Another limitation is that we did not control for the
same abilities to learn (Pohl and Winstein 1999). (3) resting time. In order to prevent fatigue, intimidation,
Takahashi and Reinkensmeyer also did not support the and attentional loss due to boredom, subjects were free
arm against gravity. As those authors suggested, the to take rests. It remains to be seen whether rests may
multiaxis force generation constraints that come into play a critical part in the the adaptive process.
play when generating large forces (perhaps even if those An additional limitation to this study may be that the
large forces are required in only one direction — against focus was limited to straightness of the hand path.
gravity), or the inability to produce force at a high en- Making straighter and smoother movements need not to
ough rate, may limit adaptation in the most weakened be the only goal or the principal goal of a therapy.
subjects. Strength has been shown to be dramatically Optimal functional recovery for these individuals may
influenced by elevating the arm against gravity (Beer be something other than healthy-looking movements.
et al. 1999). The subjects in the present study did not Related to this issue is the breakdown of predictive
have to overcome this large gravitational force require- feedforward control and corrective feedback control.
ment, and thus were operating in a wholly different re- While our measure, initial direction error, does a good
gion in force space. (4)Takahashi and Reinkensmeyer job of characterizing the path errors early in each
used a maximum force of 3.5 N, while our robotic forces movement (first 25% of the distance to target), it does
were typically much stronger, reaching amplitudes as not directly address the time interval typically associated
high as 15 N. (5) While our study used perturbing forces with human feedforward control and the launching of a
that depended upon hand velocity, Takahashi and movement. The time that the subjects crossed the 25%
Reinkensmeyer used a time-dependent force with a mark was 0.17±0.04 s (average ± standard deviation),
constant direction (to the side). Nevertheless, it is pos- which was significantly different for the healthy subjects
sible that their forces may have led to a similar adap- at 0.14±0.03 s (two-tailed t test, P<0.05). While these
tation as ours because when subjects adapt to a values are still within the range of feedforward control
time-dependent force, they tend to build an internal that occurs prior to supraspinal-feedback corrections
representation that is not dependent on time (Conditt (0.12 – 0.18 s) (Schmidt 1988), they do not rule out
and Mussa-Ivaldi 1999). (6) Finally, the most provoca- elements of spinal reflex feedback that can be as fast as
tive difference is that our protocol enabled us to look at 0.03 s (Dewhurst 1967). Therefore, initial direction error
how improvement in performance is influenced by the should not be fully interpreted as a direct measure of
direction in which the training forces are acting. Al- feedforward control error.
though this study did not specifically intend to reduce or The most important limitation of this study is that
magnify error, our preliminary evidence suggests that only a proper prolonged clinical research study with
error-magnifying forces may be most effective. separate groups of stroke survivors (one group receiving
Our study also has some important limitations. First, prolonged error-enhancing forces, one group receiving
the apparatus did not allow motion out of the horizontal error-reducing forces and one group receiving no forces)
plane, and the effects of gravity were minimized at all is the only appropriate method for evaluating error-
times by an arm support (Fig. 1). A key motor deficit enhancement. What is presented here is only compelling
seen in stroke is the inability of the nervous system to preliminary evidence supporting such a strategy.
counteract gravity while still making targeted move- An issue that was not addressed by this study is the
ments (Dewald and Beer 2001). An important extension likely relation between the neural structures damaged by
of our approach would be to test adaptation in the the stroke and the adaptive performance. Lesion site
context of 3D activities that require supporting the arm information was not available for all subjects (Table 1),
against gravity. and in this initial study the data are still too sparse to
A second limitation is that the size and simplicity of draw conclusions on the location, extent, and severity of
the movements may not allow us to extend our conclu- the strokes and how they may have influenced each
sions to unconstrained and functionally relevant mo- individual’s results. Adaptability may also be influenced
tions. Larger 3D motions, consistent with the activities by other factors such as lesion hemisphere, the time since
of daily living would likely be more relevant to the stroke, and the type and dosage of rehabilitation that the
recovery of the most important motor functions. We subjects received. More data are constantly being ac-
expect that these shortcomings will be addressed by quired, but it may take large numbers of subjects before
using stronger full-dimensional robotic systems com- these factors show any effect. Nevertheless, adaptation
bined with more advanced visual display. The present was statistically significant in 10 of the 18 individuals
study also uses some subjects as their own controls. with stroke that we tested.
While we did not detect any difference in these subjects, While forces that amplified error appeared to benefit,
their increased exposure to the device might have biased forces that reduced error led to the opposite — after-
the data. Yet another potential problem was that it was effects that increased the initial direction error (Fig. 6).
Hence, errors can be decreased or increased with the meyer et al. 2003). It remains to be seen whether stroke
right transient perturbation. It could be that the nervous subjects tend to learn more slowly because they are more
system is simply less concerned with the movement variable or for some other reason.
straightness following stroke. Indeed, Krebs and (Krebs Nevertheless, several other studies agree with our
et al. 1999a) colleagues have shown that stroke subjects results that error augmentation leads to enhanced
exhibit jerky and multisegmented movements that tend learning. Learning how to counteract a force distur-
to coalesce into smoother, straighter movements as bance in a in a walking study increased by approxi-
recovery progresses. mately 26% when the field was transiently amplified
It is also important to speculate on the mechanisms (Emken and Reinkensmeyer 2005). Artificially giving
that might allow a stroke subject to decrease directional smaller feedback on force production has caused sub-
errors with this paradigm that cannot decrease by simple jects to apply larger forces to compensate (Brewer et al.
practice alone. Sensory feedback systems may need to 2005). Several studies have shown how the nervous
detect a stimulus with a magnitude that is large enough system can be ‘‘tricked’’ by giving altered sensory feed-
to trigger the recovery process. Such distorting inter- back (Flanagan and Rao 1995; Srinivasan and LaMotte
ventions that trigger recovery have been shown to be 1995; Robles-De-La-Torre and Hayward 2001; Ernst
promising in individuals with stroke that suffer from and Banks 2002; Sainburg et al. 2003; Brewer et al. 2004;
hemispatial neglect (Rossetti et al. 1998). Smaller errors Kording and Wolpert 2004a). However, augmented
may be imperceptible or considered less important than feedback on practice conditions has not always proven
other aspects of the movement such as getting to the therapeutically beneficial in stroke (Winstein et al. 1999).
target, conserving energy, or minimizing discomfort. It may be that there are limits to the amount of error
Another possibility is that the nervous system is trying augmentation that is useful (Kording and Wolpert
to use motor pathways that are no longer intact, and the 2004b; Wei et al. 2005).
learning is a way to trick the nervous system into trying The implications of successful implicit learning are
a new and nonintuitive pathway that it would otherwise that one can learn at a nearly subconscious level with
not ever consider. Such questions will require further minimal attention and with less motivation than more
study using imaging, transcranial magnetic stimulation, explicit types of practice, like pattern tracing. Training
or implants. typically requires a balance of repetitive practice,
A final question not answered in this study is whether strengthening, expert guidance, and appropriate feed-
beneficial after-effects persist beyond the final 120 back. We believe that the type of implicit learning
movements in which the forces were absent. Preliminary demonstrated in this study — one that augments errors
studies in our laboratory on stroke survivors have re- — may provide an excellent rehabilitation tool to en-
vealed that after-effects persist when these resemble hance performance. In fact, explicit information has
normal, unperturbed movements (Raasch et al. 1997). In been shown to disrupt the acquisition of motor skills in
fact, after-effects may become permanent if they are participants with stroke but not in healthy controls
perceived by the subject to be an improvement with re- (Boyd and Winstein 2003; Boyd and Winstein 2004).
spect to the initial behavior. However, not all patients One might suggest that the changes seen in stroke
may benefit from this type of procedure. Subjects who patients are due to a reduction in tone (spasticity).
show poor ability to adapt, such as cerebellar stroke Evidence has suggested that repeated muscle stretches
survivors, may have great difficulty dealing with resistive may temporarily reduce spastic hypertonia (Schmit et al.
techniques (Weiner et al. 1983; Sanes et al. 1990; Bastian 2000) (although we are not aware of this being shown
et al. 1996). Moreover, chronic stroke survivors suffering during voluntary movement). Even though a reduction
from long-term changes in their muscle systems (i.e., in tone cannot directly explain a benefit from error-
atrophy and tissue shortening) also may not benefit from augmenting forces (they would reduce the amount of
such techniques. stretch to any muscle, not increase it), one possibility
While our force fields shifted the central tendencies of exists where the result of Schmit and colleagues might
the subjects, they did nothing to change motor vari- apply. Since spasticity is velocity-dependent and higher
ability, which is known to be larger in stroke survivors velocities occur well after a movement has been laun-
(Fisk and Goodale 1988). Our study revealed that stroke ched, anticipating a spastic muscle action might cause a
survivors were more variable both within trials, from person to learn to precompensate with a shift early in the
movement to movement, and across subjects. It is logical movement so that the spastic response carries the limb to
that the nervous system would naturally reduce its the target. For example, a person might launch their
learning rate when sensory or motor inconsistencies and outward movements more laterally in anticipation of a
uncertainties caused by stroke make it difficult to form later biceps spasm. In this case, exposure to error-aug-
an exact internal model. Such is the case in artificial menting forces would further stretch and increase the
systems that learn (Rumelhart and McClelland 1986). excitation of the muscle. Repeating such activity might
Nevertheless, several studies suggest that increasing reduce the spastic hypertonia as was seen in isometric
trial-to-trial variability with externally applied forces elbow flexors (Schmit et al. 2000) once the forces stop,
does not impair motor learning rate in healthy subjects making it easier for the subject to aim more directly at
(Scheidt et al. 2001; Takahashi et al. 2001; Reinkens- the target. However, this explanation only works for
some of the outcomes of our study. Subjects that deviate
in the opposite direction to this example should have had increased initial direction error as an
after-effect, which was not the case. Furthermore, errors during planar movements after stroke
are believed to be con- sistent with a lack of feedforward compensation for interaction torques,
not an expression of spastic stretch reflexes (Beer et al. 2000). Hence a ‘‘stretching expla- nation’’
appears to be unlikely for our results, although
it remains to be seen whether spastic reflexes can be reduced for muscle stretches that take place
during re- peated voluntary movements.
The results of this study have possible implications in rehabilitation because a properly induced
adaptive process might be exploited to assist in the restoration of function. Robotic devices,
combined with sophisticated and precise computer programs, provide new possibili- ties for
improving and accelerating recovery. One can envision the possibility to custom-designed subject-
spe- cific force fields generated by a model of the patient’s biomechanics and motor impairments
(Mussa-Ivaldi and Patton 2000; Patton and Mussa-Ivaldi 2001; Patton and Mussa-Ivaldi 2003).
Preliminary testing of this ap- proach has proved successful in some stroke survivors (Patton et al.
2001a). Other objectives such as extending the range of motion would require other approaches.
What is clear is that opportunities for recovery after stroke are possible by extending intensive
therapy be- yond present inpatient rehabilitation stays, and robotic therapy may be one way to
economically accomplish this (Fasoli et al. 2004). While error-enhancing forces are useful for
inducing adaptive responses, they are likely to be most effective if combined with other rehabilitation
strategies.
Studies using robotics for rehabilitation, assessment, and training have had some success (Krebs et
al. 1998a; Lum et al. 1999). The many paradigms associated with adaptive training may add to the
repertoire of possible strategies for rehabilitation. In the search for the most optimal method of
training among the many possibili- ties, this preliminary encouraging evidence on error-
augmentation points to future studies that exploit the natural adaptive tendencies in the nervous
system for restoring function.

Acknowledgements Supported by AHA 0330411Z, NIH 1 R24


HD39627-01, NINDS R01 NS35673.
Appendix D

A Real-Time Haptic/Graphic Demonstration of how


Error Augmentation can Enhance Learning
A Real-Time Haptic/Graphic Demonstration of how
Error Augmentation can Enhance Learning *
Yejun Wei and James Patton Preeti Bajaj Robert Scheidt
Sensory Motor Performance Program Johns Hopkins University
Dept. Biomedical Engineering
Rehabilitation Institute of Chicago Biomedical Engineering
Marquette University
Northwestern University Baltimore, MD 21210
Milwaukee, WI
Chicago, IL 60611 preeti.bajaj@jhu.edu
scheidt@ieee.org
y-wei2@northwestern.edu
j-patton@northwestern.edu

Abstract – We developed a real-time controller for a 2 of skill in human movement [7, 8]. Secondly, larger errors
degree-of-freedom robotic system using xPC Target. Th is are likely to heighten motivation to learn by making the
system was used to investigate how different methods of consequence even small errors seem large. It also makes
performance error feedback can lead to faster and more errors more noticeable to the senses and hence may trigger
complete motor learning in individuals asked to compensate responses that would otherwise be lacking. Error
for a novel visuo-motor transformation (a 30 degree rotation).
augmentation may lead to larger changes in performance.
Four groups of human subjects were asked to reach with their
unseen arm to visual targets surrounding a central starting Finally, intensifying error can also lead to larger signal-to-
location. A cursor tracking hand motion was provided during noise ratios for sensory feedback and self- evaluation.
each reach. For one group of subjects, deviations from the One issue is clear from adaptive control and learning
“ideal” compensatory hand movement (i.e. trajectory errors) models, however –learning may become unstable if gains
were amplified with a gain of 2 whereas another group was are too high. Motor variability, sensor inaccuracies and
provided visual feedback with a gain of 3.1. Yet another group other uncertainties can cause endless over corrections that
was provided cursor feedback wherein the cursor was rotated do not converge to satisfactory performance. We
by an additional (constant) offset angle. We compared the rates hypothesized in this study that there was some optimal
at which the hand paths converged to the steady -state
trajectories. Our results demonstrate that error- augmentation
amount error augmentation.
can improve the rate and extent of motor learning of Recently we have shown that enhancing error by
visuomotor rotations in healthy subjects. Furthermore, our pushing the arm farther from its intended target can
results suggest that both error amplification and offset- facilitate re-learning of motor commands required to make
augmentation may facilitate neuro- rehabilitation strategies smooth and straight reaching movements [5]. In that study,
that restore function in brain injuries such as stroke. stroke survivors experienced training forces that either
amplified or reduced their hand path errors. Significant
Index Terms – neuro- robotics, error augmentation, xPC trajectory improvements occurred only when the training
Target, motor learning, and visual distortion forces magnified the original errors, and not when the
training forces reduced the errors or were absent. Hence
I. INTRODUCTION error-enhancing training may be an effective way to
promote functional motor recovery for brain injured
In recent years, experiments that alter the sensory and
individuals.
motor environment of an individual have explored new and
exciting possibilities for tele-assistive teaching and Sensory- motor adaptation has been observed when
robotically-enhanced rehabilitation techniques. For example, there is a distortion in the mechanical realm [9-11], but is
robotic devices can be programmed to provide precise also observed when there is a distortion in the visuomotor
forces that restore a brain injured individual’s movement realm [12-15]. In fact, visuomotor adaptation can even
patterns to a healthier pattern [1-4]. However, the use of trigger recovery of sensory disorders such as hemispatial
robotics to promote physical rehabilitation is still in a neglect secondary to stroke [16]. Both mechanical and
formative stage, and initial attempts to exploit the intrinsic visuomotor adaptation appear to involve similar neural
adaptive capacity of the human sensory- motor system for mechanisms [17]. Hence, we restrict our focus in this initial
rehabilitative purposes are ongoing. In a promising study visuomotor easily- implemented more the to study
using specially-designed training forces, stroke survivors distortions and healthy adult subjects.
could make movements they previously could not [5]. This While our preliminary results using error-amplification are
paper presents an initial exploration into the possibilities of encouraging, there are a variety of ways to augment or
a complimentary technique -- error-augmentation -- for intensify error. Among these the most obvious are linear
affine distortions of gain and offset. The first, gain, is the
facilitating sensory- motor learning.
most obvious way to augment error. If subjects are
Several lines of reasoning suggest that augmenting error
instructed to move in a straight line to a target, a gain of 2
may enhance motor learning. First, many models and
augmentation would mean that any deviation from the
artificial learning systems such as neural networks suggest
straight line would be displayed 2 times that distance from
that error drives learning, so that one can learn more quickly
if error is larger [6]. Such error-driven learning processes the line. However, recent work on motor learning suggests
that there may be a practical limit to gain augmentation.
are believed to be central to adaptation and the acquisition
*
Supported by American Heart Association 0330411Z , NIH R24 HD39627, NIH 5 RO1 NS 35673, NIH F32HD08658, Whitaker RG010157, NSF
BES0238442 and the Falk Trust
Scheidt and colleagues [18] have found that when force is and Apex, NC, USA) reported the interface kinetics. In this
used to disturb motions, subjects incrementally updated their experiment, we only used the motors to remove the modeled
behavior from one movement to the next based on the error inertial effects of the robot, rendering a nearly impedance-
they experienced on most recent att empts. This update was free movement of the handle.
best represented by a transfer function that correspond ed to While seated in front of the robot and holding the robot
lead-lag compensator , in which the average value for the handle, subjects were instructed to make reaching
pole was 0.322. Inverting this transfer function suggests that movements by following cues presented from an LCD
a gain of 3.1 is the approximate limit to which gain could be projector on a horizontal projection plane (Fig. 1). Vision of
amplified in order to obtain rapid learning without leading the subject’s arm was obscured by the projection plane, and
to instability. Since there is recent evidence to show that hence a wide variety of visual distortions were possible,
vision and force distortions are linked [17, 19], we tested the including the rotational distortion used in the experiments
limiting gain of 3.1 as well as a more moderate gain of 2 in described below. Previous developments with this robot
this experiment even though we focused on visual rather were controlled by a PC running DOS to acquire the signals
than force distortions. Specifically, we explored the and control torques. For this experiment, a real-time control
gain error and rate learning between relationship system was developed using MathWorks xPC target TM.
augmentation using these two candidates. The control schematic, shown in Fig. 2, illustrates the two
An alternative approach, error-offset augmentation PCs: a ‘Host’ running with MATLAB-Simulink and MS
accentuates error by adding a constant “expected error” to C++ compiler, and an ‘xPC-Target’ real-time kernel. Each
the visual feedback of hand path. Hence, if a subject’s error separate element (referred to as a model) was developed
is 2 cm to the right, they might train with a visual feedback using MATLAB-Simulink. A low- level Target model (Fig.
that has a 2 cm bias to the right. Offset-augmentation may 2, bottom) was first compiled on the Host PC and then
prove to be superior to gain fo r several reasons. passed to the Target. It consisted of position, I/O, and torque
Magnification using offset is more stable than that of gain blocks and was dedicated to real- time control at 200 Hz. It
because the augmented error display does not grow larger also receives commands from and broadcasts motion and
with error. Moreover, in contrast to gain- augmentation, the force data to the Host. The Host model (Fig. 2, top) issued
offset is independent of the size of errors made later in executive commands to the Target, managed the experiment,
training when the subject is closer to the desired goal. displayed visual feedback to the user via a calibrated
Therefore, offset-augmentation continues to present large overhead projector, and collected and stored data (100 Hz).
errors that continue to motivate learning. One potential
problem is that offset error augmentation does not know
when to stop -- one can over- learn beyond the desired goal..
While these theoretical assertions could be made about these
candidates for error-augmentation, only experimental tests
will truly support their validity.
This paper evaluates several of these candidate
strategies for error-augmentation on healthy subjects. The
magnitudes of error magnification studied in this project
involved gain factors of 1 (normal conditions), gain of 2,
offset, and a gain of 3.1. The goal of the experiment was to
best condition error-augmentation which determine
facilitated the learning of a visuo-motor distortion. We
hypothesized that: 1) Subjects in all groups could adapt to
the visual distortion; 2) Error enhancement would be most
evident in the case of offset error augmentation; 3) The
groups differ in how they are able to generalize what they
learned to unpracticed directions of movement. Our results
showed encouraging evidence for the use of error
augmentation in haptic/graphic systems for robotic teaching,
telemanipulation, and rehabilitation. Figure 1. Robotic manipulandum and display apparatus used.
Subjects’ view of their own arm was blocked by a platform
II. M ETHODS where artificial visual feedback was projected.

A. Experimental Apparatus The communication between the Target PC and the


The experiment was carried out on a planar Host PC was achieved through UDP (User Datagram
Manipulandum Robot (Fig. 1), which is consisted of two Protocol), which is a transport protocol that was layered on
brushed DC torque motors (PMI model JR24M4CH, top of the Internet Protocol (IP). UDP is characterized by its
Kolmorgen Motion Technologies, NY, USA). The motors unencumbering nature because it uses a ‘send-and- forget’
are capable of delivering forces at the handle via a Four bar strategy that can ensure reliable real-time control of the
Linkage. Rotational digital encoders (model 25/045-NB17- robot even when information fails to arrive within a single
TA-PPA-QAR1S, Teledyne-Gurley, Troy, NY, USA) sampling period.
reported absolute angular position, and a 6-axis force/torque
sensor (Assurance Technologies, Inc., TI F/T Gamma 30/10, B. Subjects
Sixteen neurologically normal adults (22-30 years old)
volunteered. The subjects were divided evenly and the subject performs a movement along the path of the
randomly into four groups. All subjects gave informed average initial error Phase 4. However, the gain (*2) and
written consent in accordance with the ethics committee offset strategies differ dramatically at other locations. An
(Internal Review Board at Northwestern University). Each extreme example is when the subject performs the ideal
subject only participated in one protocol to prevent cross- trajectory. Then error is zero, so the subject experiencing the
over effects. gain (*2 or *3.1 ) will see their trajectory match their desired.
However, the subject experiencing the offset will still
C. Experimental Protocol
perceive an error. Hence, an offset error augmentation does
Subjects were requested to make successive outward
not decrease with learning like the gain error augmentation
reach-and-stop movements to visually displayed targets.
does.
Targets were spread evenly along a circle with radius 0.1m.
Return movements to the center point were not analyzed.
We controlled for a speed of 0.45 m/s by giving subjects B A
feedback at the end of each movement using colored dots
and auditory tones to let subjects know if they were going
too fast, too slow, or within a range of ±0.05 m/s.
Consequently, subjects’ speeds remained roughly constant 2*e e+eo
across the entire experiment. e e

Fig. 3 Illustration of the error- augmentation strategies. The ideal


trajectory, approp riate for the rotated environment is indicated as a
dotted line. The trajectory that the subject actually moves along is
represented by the thin line. Each instant, the cursor (large dot) is
displayed by calculating the current error and either multiplying that
error (A ) or by adding a constant e0 to that error (B), resulting in the
trajectory that the subject sees (thick lines).

For all subjects, the goal was to learn to perform


movements to the targets within the allowed range of speeds.
All subjects had to do this in the presence of a visual
distortion and three of the four groups were subjected to
error augmentation. Implicitly they all made movements that
were as straight as possible to the target. The first group of
subjects was a control that experienced the visual rotation
only with no error augmentation (essentially a gain of 1).
The second group (*2) experienced a gain of 2 as shown in
Fig 3B. The third group (Offset Group) experienced an
offset as shown in Fig 3B. The fourth group (*3.1 Group)
Fig. 2. Control schematics for control using xPC. Two
experienced a gain of 3.1.
computers operate the robot and display. One manages the Each protocol entailed 12 phases of experimentation
experiment, renders a feedback display, and stores data (Host that varied only in the values of the gain and offset factors,
PC); the other is dedicated to reliable control of robotic forces
(Target PC).
described below:
Familiarization: 15 movements; 5 to each target, to 1.
All four types of error augmentation in this study were become familiar with the system.
Baselin e, 15 movements; 5 to each target, with no visual 2.
derived from a simple affine transformation. That is, the
rotation or error enhancement. This established a
cursor location was moved either by a multiple of the baseline pattern.
current error vector (a gain, Fig. 3A), or shifted by a Rotated baselin e, 15 movements; 5 each to target that 3.
constant eo. (an offset, Fig 3B). A perpendicular vector from were thirty degrees away from t those in Phase 2.
the ideal straight-lined movement was used to characterize Initial Exposure: 120 movements. Here, one movement 4.
the current error, and that vector was used to alter the in eight (totally 15; 5 each to target) was with a 30º
position of the cursor for error magnification. The constant rotation of the visual field. There was no error
eo, was the average initial error, determined for each subject in augmentation. The average of these 15 ‘initial exposure’
movements is recorded to eo as a function of distance
each of the three possible directions of movement at the
from the starting point.
beginning of the experiment in (Phase 4, described below). Early , Intermed iate, and Late Learning: I n thes e trials 5- 7.
To determine eo, we intermittently exposed each subject to (390 movements in all) the four groups experienced the
the visual rotation early in the experiment. same visual rotation of thirty degrees, but movements
Note that the two examples in Fig. 3 appear the same. also included error augmentation, dependent upon the
The final location of the cursor appears in the same spot if group descriptions above. Also during this phase, all of
the four groups experienced ‘catch-trials’ that were Quite interestingly, our error augmentation approaches
randomly presented once every eight movements. During were found to successfully enhance learning in several
these catch trials their respective error augmentation was aspects. First, the Offset group proved to learn significantly
removed. Hence for all subjects, these catch trials were more than all other groups (Figure 6, top) (p<0.002).
the same (30º rotation of the visual field with no error
Moreover, error augmentation sped up learning for two of
augmentation, occurring at the same movement number).
These catch trials were used to monitor and compare the three groups -- Learning for Gain *2 and Offset Groups
learning across all groups. was both significantly faster than the other groups (Fig. 5,
Evaluation: In all 15 of these trials all subjects 8. middle two tracings, and Fig. 6, bottom figure) (p<0.006).
experienced the same visual rotation of th irty degrees Overall in this experiment, the Offset Group learned best in
with no error augmentation. This consisted of 5 terms of magnitude and speed of learning.
movements to each target.
9- 12. Early, rotated , middle, and late ‘washout.’ Here , all
visual rotations and error enhancements were removed to
study how the nervous system de-adapts back to a
normal behavior. Phase 9 was composed of 10
movements to each target. Phase 10 consisted of 5
movements to each target, but the targets corresponded
to the same target locations in phase 3. Phases 11 and 1 2
consisted of 40 movements to each target.

D. Data Analysis
The measure of interest of this study was the change of
the trajectory error compared with an ideal, straight line
movement to the target. This ideal closely represented the
movements of subjects under normal conditions when there
is no distortion or error-augmentation (Fig 1, figures on left
side), as observed in previous studies [20, 21]. The
trajectory error was defined as the maximum distance (also
often called the infinity norm or Chebychev norm) between
the actual trajectory and the desired trajectory described
above. Other error measures yield similar results.
We made four key different comparisons between the
results of four groups of subjects: The amount and rate of
adaptation, the amount and rate of washout. We also
identified the change after the first catch trial among the
groups; and the extent generalization.
In assessment of the learning rate of the adaptation to Fig. 4. Representative trajectories of the hand. Each row of plots displays
the rotated visual field, the trajectory errors were grouped data from a typical subject from each group. Each column represents a
into 5 trials a block and fit into an exponential curve, critical phase of the experiment. Only the catch trials are shown for the
−t/ C Learning Phase. Red lines indicate the path the subjects should have
A + Be reached to successfully complete the task.
where A is an offset, B is the amount of learning (the
change of the trajectory errors), and C is the rate of learning All subjects de-adapted in about 70 movements after
(time constant for the error to decrease). Data for this training when the visual distortion was removed. However,
analysis was restricted to the catch trials during learning we found no significant differences in the magnitude or in
Phases 5-7 and another fit to the data from the washout rate of de-adaptation (Figure 6, teal bars).
phases (9, 11 and 12). We also tested a gain of 3.1 to see if this large amount
Significance for all statistics was assessed using of gain might lead to more complete learning after a single
ANOVAs and Tukey post-hoc Comparisons at α=0.05. trial of exposure. We looked at the trials immediately
following the first exposure to 3.1, which was designed to
III. RESULTS be a catch trial, but we found no significant differences
among the groups on the improvements following that
Subjects of all four types of error augmentation showed single initial catch trial.
evidence of learning. Subjects made curved trajectories Finally, all groups were ab le to generalize their learning
when first exposed to visual distortion (Fig 4, second skills well to unpracticed targets, but there was no indication
column of plots), but recovered their ability to produce a from our data that any one of the groups differed from the
straight line at the end of training (Fig 4, third column of others.
plots). When they were returned to normal (un-distorted)
conditions, they displayed the characteristic after-effects IV. D ISCUSSION
that curved opposite to initial exposure phase, and these
after-effects gradually washed out (Fig 4, rightmost column This paper evaluated several candidate strategies
of plots). Subjects in all the four groups presented large regarding error augmentation to investigate how healthy
after-effects of adaptation, which was strong evidence that subjects learn. The goal of the experiment was to determine
adaptation had taken place. which error-augmentation condition is optimal for learning a
visuo- motor distortion. The smaller time constants for gain
Control

*2

Offset

*3.1

Fig. 5. Learning curves for representative subjects in each group. Small dots represent the trajectory error for a movement, and the bold dots represent the
mean trajectory error for 5 successive movements in combination. Learning (5-7) and washout (9- 12) phases were each fit to exponential curves (lines). For
the learning phases (5-7), only the catch trials are shown because these trials were used for the regression lin es that characterize the rate of change and amount
of error reduction. For these catch trials, the conditions were the same for all groups (30º rotation of the visual field with no error augmentation, occurring at
the same movement number). Conditions were the normal for all groups during the washout phases (9-12), and hence the regression used all trials (as shown).
of 2 and offset demonstrate that error augmentation can lead to learning beyond the goal, which occur in some trials
increase the rate of learning. Moreover, the Offset Group in in this experiment. However, the advantage of Offset is that
learned significantly more the other groups. it may overcome the problem of diminishing returns due to
The offset condition, while a less intuitive, appears to small errors tha t are often seen at the end of the learning
allow subjects to adapt to the visually rotated environment process. Therefore a more intelligent implementation may
more efficiently and accurately than the other methods of be a ‘scheduled’ mixture of offset and gain, in which the
augmentation tested. As stated previously, the difference offset factor is extinguished when the subject learns beyond
between the offset and gain error augmentation condition is the goal, may be optimal.
that there is a constant error adding to movements in the Our results also demonstrate limits on the effectiveness
Offset Group that does not decrease with improvement. of a gain augmentation strategy. The gain 3.1 in the
Offset offers one other advantage – it is more stable than the experiment did no better than the control (gain 1) and worse
gain-based approach, which can have an unwieldy display than gain 2, possibly because the larger gain may have
of errors when the subject makes a large mistake. decreased the relative stability of the adaptation process
There are several undesired effects from offset beyond that which subjects were comfortable, thus causing
condition as well. The Offset Group’s larger amount of them to down- regulate their internal feedback gain so that
learning may be due to the fact that they were required, in the overall gain approached “normal”. Had they not done so,
effect, to learn a rotation of as large as 60 degrees. The noise and sensorimotor uncertainty could possibly lead to
offset condition delivers visual feedback that always learning. unstable consequently and overcorrections
deviates from proprioceptive feedback the same amount -- Consequently, there is likely an optimal gain between 1 and
30 degrees in this experiment. This means that offset can 3.1. Because there is some recent evidence to show that
vision and force distortions are linked [17, 19], our results REFERENCES
may not be true for all tasks and contexts. Moreover, error [1] H. I. Krebs, N. Hogan, M. L. Aisen, and B. T. Volpe, "Robot -aided
augmentation using sensory feedback such as proprioceptive neurorehabilitation," IEEE Transactions on Rehabilitation Eng ineering,
or haptic forces may be more effective at different gains. vol. 6, pp. 75-87, 1998.
Offset, which was not scaled in this experiment, could also [2] P. Lum, C. Burgar, P. Shor, M. Majmundar, and M. Van der Loos,
"Robot- assisted movement training compared with conventional
have an optimal value. Moreover, some combination of gain therapy techniques for the rehabilitation of upper limb motor function
and offset strategies may lead to the best possible learning following stroke," Archives of Physical Medicine and Rehabilitation,
pattern. vol. 83, pp. 952-959, 2002.
[3] D. J. Reinkensmeyer, L. E. Kahn, M. Averbuch, A. McKenna- Cole, B.
D. Schmit, and W. Z. Rymer, "Understanding and treating arm
movement impairment after chronic brain injury: pro gress with the
ARM guide," Journal of Rehabilitation Research & Development , vol.
37, pp. 653- 62, 2000.
[4] J. Patton, F. Mussa-Ivaldi, and W. Rymer, "Altering Movement
Patterns in Healthy and Brain- Injured Subjects Via Custom Designed
Robotic Forces," presented at EMBC2001, the 23rd Annual
International Conference of the IEEE Engineering in Medicine and
Biology Society (EMBS), Istanbul, Turkey, 2001.
[5] J. L. Patton, M. E. Phillips- Stoykov, M. Stojakovich, and F. A. Mussa-
Ivaldi, "Evaluation of robotic training forces that either enhance or
reduce error in chronic hemiparetic stroke survivors," Experimental
Brain Research , vol. Accepted pending revisions, 2004.
[6] D. E. Rumelhart, G. E. Hinton, and R. J. Williams, "Learning
representations by back-propagating errors," Nature (London), vol. 323,
pp. 533- 536, 1986.
[7] M. Kawato, "Feedback-error -learning neural network for supervised
learning," in Advanced neural computers, R. Eckmiller, Ed.
Amsterdam: North-Holland, 1990, pp. 365- 372.
[8] D. M. Wolpert, Z. Ghahramani, and M. I. Jordan, "An internal model
for sensorimotor integration," Science, vol. 269, pp. 1880-2, 1995.
[9] O. Bock, "Load compensation in human goal-directed arm
movements," Behavioral Brain Research , vol. 41, pp. 167-177, 1990.
[10] J. R. Lackner and P. DiZio, "Rapid adaptation to Coriolis force
perturbations of arm trajectories," Journal of Neurophysiology, vol. 72,
pp. 299- 313, 1994.
[11] R. Shadmehr and F. A. Mussa- Ivaldi, "Adaptive representation of
dynamics during learning of a motor tas k," Journal of Neuroscience,
vol. 14, pp. 3208-3224, 1994.
[12] F. A. Miles and B. B. Eighmy, "Long -term adaptive changes in
primate vestibuloocular reflex I: behavioral Observations," Journal of
Neurophysiology, vol. 43, pp. 1406-1425, 1980.
[13] J. R. Flanagan and A. K. Rao, "Trajectory adaptation to a nonlinear
visuomotor transformation: evidence of motion planning in visually
perceived space," Journal of Neurophysiology, vol. 74, pp. 2174 -8,
Fig. 6. The amount (top) and time constant (bottom) of error decay during 1995.
learning (green) and washout (teal) for the [14] Z. M. Pine, J. W. Krakauer, J. Gordon, and C. Ghez, "Learning of
four groups. Horizontal lines indicate significant difference between the scaling factors and reference axes for reaching movements,"
two groups beneath their tips. Neuroreport, vol. 7, pp. 2357- 61, 1996.
[15] H. Imamizu, T. Kuroda, S. Miyauchi, T. Yoshioka, and M. Kawato,
Finally, while our results were on healthy subjects, they "Modular organization of internal models of tools in the human
cerebellum," Proceedings of the National Academy of Science, vol.
have excellent implications for robotic neurorehabilitation.
100, pp. 5461-5466, 2003.
The results of the experiment bring new possibilities to [16] Y. Rossetti, G. Rode, L. Pisella, A. Farne, L. Li, D. Boisson, and M. T.
rehabilitation methods which employ robotics. Increased Perenin, "Prism adaptation to a rightward optical deviation
rates of learning via error augmentation would be quite rehabilitates left hemispatial neglect," Nature, vol. 395, pp. 166- 9,
1998.
valuable to therapists. The results found in this experiment [17] C. Tong, D. M. Wolpert, and J. R. Flanagan, "Kinematics and
support and expand upon those found in a recent study [5] dynamics are not represented independently in motor working
that reported when brain injured individuals were subjected memory: evidence from an interference study," Journal of
to error augmenting vs. reducing forces, error augmenting Neuroscience, vol. 22, pp. 1108-13, 2002.
[18] R. A. Scheidt, J. B. Dingwell, and F. A. Mussa- Ivaldi, "Learning to
better healthier movement patterns. The experiment affirms move amid uncertainty," J Neurophysiol, vol. 86, pp. 971 -85, 2001.
that increasing the error perceived by the subject increases [19] Y. Wei and J. L. Patton, "Forces that Supplement Visuomotor
the rate of learning. Error augmentation may ‘wake up’ an Learning: A 'Sensory Crossover' Experiment ," Experimental Brain
inattentive nervous system and trigger the recovery process Research, conditionally accepted pending revisions, 2004.
[20] P. Morasso, "Spatial control of arm movements," Experimental Brain
by supplying heightened, magnified sensory feedback about Research, vol. 42, pp. 223- 227, 1981.
a persons’ motor deficit. [21] T. Flash and N. Hogan, "The coordination of arm movement s: An
experimentally confirmed mathematical model.," Journal of
ACKNOWLEDGMENTS Neuroscience, vol. 5, pp. 1688- 1703, 1985.
.

Appendix E

Haptic Identification of Surfaces as Fields of Force


J Neurophysiol 95: 1068 –1077, 2006.
First published October 5, 2005; doi:10.1152/jn.00610.2005.

Haptic Identification of Surfaces as Fields of Force

Vikram S. Chib,1,2,3 James L. Patton,1,2 Kevin M. Lynch,3 and Ferdinando A. Mussa-Ivaldi1,2,4


1
Sensory Motor Performance Program, Rehabilitation Institute of Chicago; and 2Department of Biomedical Engineering,
3
Laboratory for Intelligent Mechanical Systems, Department of Mechanical Engineering, and 4Department of Physiology,
Northwestern University, Chicago, Illinois
Submitted 13 June 2005; accepted in final form 29 September 2005

Chib, Vikram S., James L. Patton, Kevin M. Lynch, and Ferdi- limb matching procedure, in which subjects adjusted the stiff-
nando A. Mussa-Ivaldi. Haptic identification of surfaces as fields of ness of a motor connected to one arm until it was perceived to
force. J Neurophysiol 95: 1068 –1077, 2006. First published October be the same as that connected to the other arm. These studies
5, 2005; doi:10.1152/jn.00610.2005. The ability to discriminate an concluded that the sensitivity of stiffness discrimination was
object’s shape and mechanical properties from touch is one of the
much worse than would be expected by combining the sensi-
most fundamental somatosensory functions. When exploring physical
properties of an object, such as stiffness and curvature, humans probe tivities for force and displacement discrimination.
the object’s surface and obtain information from the many sensory Rigid objects, such as walls and table tops, are characterized
receptors in their upper limbs. This sensory information is critical for by high-impedance boundaries. When the hand comes in con-
the guidance of actions. We studied how humans acquire an internal tact with these objects, there is minimal or negligible penetra-
representation of the shape and mechanical properties of surfaces and tion inside the boundary, regardless of the force applied to the
how this information affects the execution of trajectories over the object boundary. Other objects, such as pillows and computer
surface. Experiments involved subjects executing trajectories while keyboards, respond to applied forces with larger displace-
holding a planar manipulandum that renders planar virtual objects ments. In mechanical terms, these different behaviors are
with variable shape and mechanical properties. Subjects were in- captured by describing objects as fields of position-dependent
structed to make reaching movements with the hand between points
forces. Stiffness is the factor that characterizes how an object
on the boundary of a curved virtual disk of varying stiffness and
curvature. The results suggest two classifications of adaptive re- responds to a given displacement at its surface. In this study,
sponses: force perturbations and object boundaries. In the first case, a we investigate interactions between the hand and objects as
rectilinear hand movement is enforced by opposing the interaction interactions of the hand with external force fields.
forces. In the second case, the trajectory conforms to the object Unperturbed free reaching movements have been studied
boundary so as to reduce interaction forces. While this dichotomy is extensively (Flash and Hogan 1985; Hogan 1984; Morasso
evident for very rigid and very soft objects, the likelihood of an object 1981). More recently, there has been a growing body of
boundary classification depended, in a smooth and monotonic way, on investigation concerning the effect of deterministic force fields
the average force experienced during the initial movements. Further- on free reaching movements of the arm (Flash and Gurevich
more, the observed response across a variety of stiffness values lead 1992; Gandolfo et al. 1996; Matsuoka 1998; Shadmehr and
to a constant average interaction force after adaptation. This suggests
Mussa-Ivaldi 1994; Thoroughman and Shadmehr 2000). How-
that the nervous system may select from the two responses through a
mechanism that attempts to establish a constant interaction force. ever, the idea of using similar experimental paradigms for
studying the interaction with object boundaries has not yet
been explored. Studies of free reaching movements in a field of
INTRODUCTION forces have highlighted the existence of adaptive mechanisms
that tend to restore straight-line movements and the kinematics
Studies have been performed to understand how humans of the unperturbed hand motion. This makes sense if one
perceive shape through active touch. Kappers et al. (1994) indeed considers the external forces as a perturbation to be
found that we are capable of learning and distinguishing slight rejected. However, the same mechanism would not be appro-
differences in the curvature of various surfaces. Further study priate when the hand encounters an unexpected hard surface in
of actively touched curved surfaces has shown that adaptation a dark room. In this case, “fighting the field” would be
and after effects are present after haptic exploration (Vogels et pointless and painful. One would rather comply and modify the
al. 1996). These after effects are manifested as flat surfaces trajectory to avoid the wall or to move smoothly over its
being judged as convex after touching of a concave surface and surface. While these considerations are self-evident, a question
flat surfaces being judged as concave after touching of a arises as to what mechanical conditions would lead the motor
convex surface. Haptic after effects increase with the time of system to react to a force field as a disturbance to overcome or
contact with the curved surface and decrease with the time as a boundary to comply with. Here, we address this question
elapsed between the touching of two different surfaces (Vogels together with some corollary issues. Is the dichotomy between
et al. 2001). the representation of fields as disturbances or boundaries the
The stiffness, or degree of rigidity, of an object is critically outcome of an adaptive process? Is the dichotomy itself char-
important for manipulation. Psychophysical studies have been acterized by a sharp, “two state” transition, like the Necker-
performed to determine thresholds for stiffness discrimination cube illusion and other perceptual dichotomies? Is there a
(Jones and Hunter 1990). These studies used a contralateral
The costs of publication of this article were defrayed in part by the payment
Address for reprint requests and other correspondence: V. S. Chib, 345 East of page charges. The article must therefore be hereby marked “advertisement”
Superior St., Suite 1406, Chicago, IL 60611 (E-mail: v-chib@northwestern.edu). in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
critical value in the local variation of the force field (i.e., the was instrumented with joint encoders that report the joint angles at a
“stiffness”) that the motor system uses to discriminate bound- frequency of 100 Hz. Position and velocity of the manipulandum
aries from disturbances? We have addressed these questions by handle were computed from these encoder signals. The manipulan-
observing the effect on hand movements of force fields that dum was also equipped with two torque motors that generated the
force fields corresponding to the virtual object. Endpoint forces were
emulated a circular planar object of variable stiffness and
acquired using a six-degrees-of-freedom load cell fixed to the handle
curvature. of the robot. Visual feedback was given to subjects through a projec-
Our findings show the existence of both compensatory and tion system. This system displayed a cursor registered to the move-
compliant responses occurring at different values of surface ment of the manipulandum handle, as well as start and goal positions,
stiffness. The lowest values of stiffness lead to compensatory to prompt subjects’ movements. The cursor and visual cues were
responses in which a free space trajectory is recovered, presented through an LCD projector projecting on a horizontal plane.
whereas the highest values lead to compliant responses. How- Vision of the subjects’ arm was obscured by the projection plane.
ever, the experiments also revealed a continuum of motor
responses, rather than a sharp transition. A smooth transition Force fields
from compensation to compliance appeared across a contin-
uum of stiffness levels. Objects with different stiffness in- The force fields experienced by subjects were computed in real time
duced, as expected, different levels of interaction forces. How- using the formula
ever, a striking effect of practice was a strong tendency toward K R r) + br r<R
a fixed average interaction force that did not depend on the f (1)
0 r R
object’s stiffness. Thus we observed either compensatory re-
sponses or compliant adaptive responses depending on the This expression defines a circular, viscoelastic force field in polar
coordinates; where f is the magnitude of interface force produced by
relation of the initial interaction force to this final fixed level of
the robot, R is the radius of the virtual disk, r is the distance of the
interaction force. Taken together, these studies reveal a mech- handle from the center of the disk, K is a spring constant, and b is a
anism of adaptation that may subserve the implicit learning of damping constant. The interface force is always directed radially away
object shapes through repeated mechanical interactions. from the center of the disk. Damping is added to alleviate instabilities
encountered at higher stiffnesses.
METHODS

Experimental apparatus Subjects


Experiments were performed using a two-degrees-of-freedom pla- Twenty-two naive, healthy, volunteers (age range, 18 –35 yr) par-
nar manipulandum as seen in Fig. 1. Subjects made goal-directed ticipated in this study after giving informed consent in accordance
movements in the plane of the manipulandum while grasping its with the standards of the Institutional Review Board of Northwestern
handle. The manipulandum was similar to those previously described University. All subjects were right-handed and had normal vision or
(Mussa-Ivaldi and Bizzi 2000; Shadmehr and Mussa-Ivaldi 1994). It vision that was corrected to normal. Subjects were divided into two
groups. The first experimental group (low-curvature group), consist-
ing of nine subjects, was prompted to make reaching movements in
the presence of a virtual surface of curvature 15 m 1 (R 6.5 cm).
The second group of nine subjects (high-curvature group) was
prompted to make movements in the presence of a virtual surface of
curvature 20 m 1 (R 5.0 cm). To evaluate memory effects related to
order of stiffness presentation, a group of four subjects was used
(memory effect group). These subjects were presented surfaces in
order of descending stiffness, as opposed to ascending order of
stiffness presentation in the case of the low- and high-curvature
groups.

Experimental protocol
Subjects made goal-directed reaching movements from a start
target to a goal target. Their arms were supported against gravity and
constrained to the plane of movement of the manipulandum by a
low-inertia arm support. During a trial, a target was projected onto the
subject’s workspace, and the subject was asked to make one contin-
uous movement to place a cursor registered to the manipulandum
handle within the target, while achieving a desired maximum velocity.
The next target appeared after the subject held the cursor at the prior
target for 1 s. Subjects were given feedback as to whether they moved
faster or slower than the desired maximum velocity. The optimal
speed of movement was specified before each experiment. When
subjects achieved a maximum velocity >5% faster than the desired
velocity, the target turned green. If the target was reached with a
maximum velocity >5% slower than the desired velocity, the target
turned blue. When the target was reached within the desired maximum
FIG. 1. Schematic of manipulandum and dimensions of the position-depen- velocity range, the target was animated to “explode,” and a distinctive
dent force fields. sound was presented to reinforce the perception of a successful
movement. These feedback cues allowed subjects to achieve a con- AVERAGE INTERFACE FORCE. The measure of interface force was
sistent maximum speed of movement. used to evaluate the forces imposed by the virtual object during
Before the introduction of force fields, subjects practiced making subjects’ movements. Force measures were calculated using subjects’
point-to-point movements under the required velocity constraints, in position and velocity signals and Eq. 1. The calculated force values
the absence of a virtual object, for 60 movements. To assess the were integrated over the duration of the movement to acquire a
typical performance of subject, undisturbed in free space, objects were resulting force cost (Eq. 3) for an entire reaching movement. This
not introduced during this baseline unperturbed phase. This phase of measure expresses the forces imposed by the environment and not the
the experiment allowed subjects to familiarize themselves with the forces produced by the subject
passive dynamics of the manipulandum.
After the baseline unperturbed phase, virtual objects were presented
to the subject. Subjects were only provided with a haptic rendering of
the virtual object: visual information regarding the geometry of the
Jti
tf
U f Udi
F (3)
object was not presented. The dimensions of the virtual objects are if ii
shown schematically in Fig. 1. A testing phase consisted of the subject
moving between targets located on the boundary of the virtual object. Psychomeiric funciion
Subjects made 100 reaching movements between the presented start
and goal positions. The first 50 movements of a testing phase served A common means of quantifying a subject’s performance of a
as a practice period for the subject to acquire information about the psychophysical task is the fitting of a psychometric function (Wich-
virtual surface. During the final 50 movements of the testing phase, mann and Hill 2001a). The psychometric function relates an observ-
catch trials—movements during which, unexpectedly, no force field er’s performance of a psychophysical task to some physical aspect of
was present—were introduced pseudorandomly for 12.5% of the stimulus. For these experiments, the performance metric used was the
movements. These catch trials were introduced to reveal any adapta- sign of the ARD. A two-alternative paradigm was implemented for the
tions of the feedforward motor command that may have occurred after catch trials performed at each stiffness level. Catch trials having a
training with the virtual object (Shadmehr and Mussa-Ivaldi 1994). negative ARD were classified as perception of a field, whereas those
After completion of the phase consisting of 100 movements with having a positive ARD were classified as perception of an object
the virtual object, a “wash-out” phase, consisting of 50 movements in boundary or surface. Subjects’ results were compiled into a single
a null field, was introduced. This phase allowed for deadaptation and group measure for each stiffness level. This measure was expressed as
unlearning of the field encountered during the previous phase. Six the proportion of positive surface responses at each stiffness level.
different stiffness levels were tested (K 200, 400, 800, 1,200, 1,600, The general form of the psychometric function is
2,000 N/m). The stiffness levels were presented in order of increasing
magnitude. One group of subjects (low-curvature group) was exposed (x,a, ,y,X) y + (1 y X)F(x:a, ) (4)
to the various stiffness levels with a virtual disk of curvature 15 m 1
The shape of the curve is determined by the parameters (a, , X) and
(R 6.5 cm), whereas a second group (high-curvature group) was the choice of a two-parameter function F, which is typically a sigmoid
exposed to the same stiffness levels with a virtual disk of curvature 20 function. For these experiments, a cumulative Gaussian was used for
m 1 (R 5 cm). The experimental protocol and instructions were F. From the defined range, it follows that the parameter y gives the
exactly the same for both groups: the only difference between the lower bound of x, which can be interpreted as the base rate of
experiments was the curvature of the fields presented to subjects. performance in the absence of a signal. The upper bound of the curve,
Field stiffness was presented in ascending order after preliminary the performance level for an arbitrarily large stimulus level, is given
studies showed a marked memory effect when surfaces of high as 1 X. For these experiments, X 0, because it was found that as
stiffness were presented before those of lower stiffness. When sub- stiffness increased subjects had a greater propensity to discriminate a
jects were presented with a high stiffness field that clearly revealed the surface. Between the two bounds, the shape of the curve is determined
shape of the boundary, they showed a marked tendency to identify the by a and .
boundaries of subsequent low stiffness fields. That is, the identifica- Bootstrapping was used to generate 95% CIs for the resulting
tion of a rigid boundary tended to persist in lower stiffness fields. This psychometric functions. These CIs represent the variability in sub-
finding is presented at the end of RESULTS. Because the main interest jects’ probability of perceiving a surface given the independent
of this study was to find the minimum stiffness that would lead to variable of surface stiffness. The bootstrap method is a Monte Carlo
identification of a boundary and to determine whether such a value has resampling technique relying on a large number of simulated repeti-
the property of a threshold leading to an abrupt change in behavior, we tions of the original experiment. Simulated repetitions of the experi-
gradually increased the presented stiffness levels in search of such a ment are obtained by repetitively resampling subsamples of the data.
threshold. Bootstrap methods are especially well suited for analysis of psycho-
physical data because their accuracy does not rely on a large number
Trajeciory analysis or trials, as do methods from conventional statistical asymptotic
theory (Wichmann and Hill 2001b). To construct 95% CIs for psy-
Two different measures were used to quantify subjects’ response to
chometric functions obtained during this study, random combinations
virtual objects and their subsequent learning.
of subject’s psychometric functions were sampled with replacement
AREA REACHING DEVIATION. The measure of area reaching devia- 1,000 times.
tion (ARD) was used to evaluate a subjects’ deviation from a straight
line path. This measure was defined as the signed area between the
RESULTS
trial path and a reference straight-line path between the start and goal
positions. Paths to the right of the reference straight-line path yielded Adapiaiion io viriual objecis of varying siiffness
positive ARD, whereas those to the left yield negative ARD. If the
trajectory is monotonic in y, the signed area reaching deviation can be A typical set of movement trajectories for a subject in the
computed by low-curvature group, with different field stiffness and at dif-
ferent stages of learning (early exposure, mid exposure, and
A J yf
x dy (2) late exposure), is shown in Fig. 2. At all levels of stiffness,
during the early exposure phase, the effect of the virtual object
yi
posure, subjects produced movements that followed the virtual
surface.
Data from the washout periods after force field learning
indicate that the effects of force field adaptation were com-
pletely suppressed by the end of the 50 trial washout periods
(Fig. 3). Washout blocks were presented between the presen-
tations of fields of different stiffness. During washout blocks,
subjects’ level of ARD converged to zero, which is consistent
with the production of straight line trajectories after move-
ments in the null field. This finding does not exclude the
possibility that subjects maintained some memory trace of
earlier exposure and that this memory effect could lead to a
slowly changing perception of stiffness. Furthermore, even a
more prolonged period of rest would not be adequate to
extinguish that effect, because some memory of experienced
force fields have been documented to persist for weeks (Shad-
mehr and Brashers-Krug 1997). However, if present, this
memory did not have an impact on the baseline movements
before each block of trials.
These qualitative observations of subjects’ adaptations to
virtual objects of varying stiffness are quantified and summa-
rized for all subjects in Fig. 4A. As previously described, with
low stiffness values (200 and 400 N/m), after adaptation,
subjects produced straight-line movements through the field
(Fig. 2). This result was captured by the ARD before and at the
end of practice. ARD measures the area spanned by a subject’s
FIG. 2. Low-curvature group trajectories. The 1st 6 trajectories from vari- deviation from a straight path (see METHODS). ARD for virtual
ous stages of adaptation for a representative subject. Green squares represent
the start position: red circles represent the goal position.
surfaces of 200 and 400 N/m were significantly different (P <
0.05: Fig. 4A). At these low-stiffness values, ARD was reduced
on the hand trajectory was quite significant. The time-course of to nearly zero, indicating that subjects attempted—and suc-
movements during the early exposure phase can be divided into ceeded—to produce straight-line movements. At the high stiff-
two parts. During the first part, the hand was driven off course ness levels, ARD was not significantly reduced after learning.
by the field and forced away from a straight-line trajectory. The data in Fig. 4A show that the measure of ARD increases as
During the second part of the movement, after the force field of the stiffness level increases. Thus as stiffness is increased,
subjects began to produce trajectories that conformed to the
the virtual object had caused the hand to veer off course from
boundary of the virtual object, as opposed to actively counter-
the target, subjects made a second movement back toward the
acting the forces generated by the virtual object.
target. At low stiffness values (200 and 400 N/m), after The shapes of the adapted trajectories varied with the levels
adaptation, subjects produced straight-line movements through of stiffness (Fig. 2, late field exposure). This was also reflected
the field (Fig. 2). by the pattern of ARD (Fig. 4A). However, a unifying feature
Repeated practice of movements with high-stiffness virtual among the adapted movements is the average interface force.
disks (K 800, 1,200, 1,600, and 2,000 N/m) resulted in a Through the continuum of stiffness levels, subjects tended to
markedly different adaptation. As with low stiffness disks, produce similar average interface forces after adaptation (Fig.
initial exposure to the field resulted in a two-segment move- 4B). A two-factor ANOVA did not find a significant difference
ment, with the first portion corresponding to the hand being among average interface force for the six stiffness levels for
perturbed by the force field and forced away from the boundary subjects in the low-curvature group after adaptation had oc-
of the virtual object and the second portion corresponding to a curred (F6,5 0.187: P 0.966).
recovery to the goal position (Fig. 2). This qualitative pattern After effects from adaptation to the low-stiffness field were
of initial responses was remarkably similar at all field observed during catch trials at the end on training. During
strengths. However, unlike adaptation to low stiffness fields, catch trials, the virtual disk was unexpectedly removed from
adaptation to high stiffness fields did not result in subjects the path, and subjects made movements in an unperturbed
recovering straight-line movements. During mid and late ex- environment. These after effects are mirror images of the

FIG. 3. Learning curve for the low-curvature


group. White blocks represent null field presentation,
dark gray blocks represent force field presentation,
and light gray blocks represent a phase of pseudoran-
dom catch trials.
FIG. 4. Learning data for the low-curvature
group. Measures shown are an average of the 1st
and last 5 trials averaged across all subjects. A:
area reaching deviation (ARD). B: interface force.

responses to the initial field exposure (Fig. 2). This suggests probability of perceiving the field as an object boundary, or
that, in the presence of low-stiffness position-dependent force surface, progressively increased as the level of stiffness in-
fields, subjects adapted by developing an internal representa- creased, reaching chance (0.5) in the stiffness range of 1,000 –
tion of the field. The internal representation predicted and 1,300 N/m.
canceled the forces of the virtual disk. At higher stiffness
levels, this was not the case. Once a stiffness threshold was Influence of surface curvaiure
exceeded, subjects’ after effects appeared in the direction of
the applied forces and following the profile of the virtual To assess the effect surface curvature has on the adaptive
surface. The amount by which the after effects appeared in the interaction with an object, an additional subject group engaged
direction of the applied forces of the virtual surface and away in the previously described experimental protocol. This group,
from a straight line trajectory increased as the stiffness of the the high-curvature group, interacted with a virtual surface of
surface increased. increased curvature (20 m 1).
Group results of ARD for catch trials showed a gradual For the high-curvature group, we did not observe force field
transition between negative after effects at low stiffness, which adaptation with recovery of straight-line movements at low
indicates that subjects produced a compensatory response op- stiffness values. Instead, after training with the low stiffness
posing the forces associated with the virtual disk, and positive surface, subjects exhibited movements that complied with and
after effects at high stiffness, which indicates that subjects conformed to the circular disk boundary (Fig. 6). This result
produced a response conforming to the boundary of the virtual was manifested in subjects catch trial movements. During these
surface (Fig. 5A). We used the measure of ARD to derive a movements, subjects increased ARD at lower stiffness levels
binary classification of force fields as either objects or distur- after learning (Fig. 7A). Furthermore, after effects from this
bances. Specifically, we used catch trials leading to a positive higher curvature level remained positive and gradually in-
ARD to classify subjects’ perception of a virtual disk of a given creased across all stiffness levels, indicating that subjects
stiffness as being a surface. This allowed us to fit a psycho- produced after effects in the direction of the applied forces and
metric function to the response (Fig. 5B). The psychometric approximately conforming to the profile of the virtual bound-
function, consistent with the underlying set of ARDs, showed ary. The high-curvature group showed a monotonic increase of
a smooth pattern of classification across stiffness levels. The ARD for catch trials (Fig. 8A) and probability of perceiving a

A −3
B
x 10
3 1
Probability of Perceiving a Surface

2
Area Reaching Deviation (m 2)

0.75

0 0.5

−1

0.25 Psychometric Function


Bootstrap 95% CI
−2

−3 0
0 200 400 800 1200 1600 2000 0 200 400 800 1200 1600 2000

Stiffness (N/m) Stiffness (N/m)

FIG. 5. Low-curvature group catch trial data and psychometric function. A: ARD for catch trials of subjects interacting with a surface of curvature 15 m 1.
measure of ARD. This result suggests that an intended inter-
face force is being regulated by the control system during
haptic exploration.

Regulaiion of average inierface force


Our findings suggest that the unifying theme across these
stiffness and curvature levels was a subject’s tendency to
generate a constant level of interface force regardless of object
mechanics. A regression of interface force before learning
versus ARD after learning (Fig. 10A) showed very similar
trends in both curvature groups. It is important to note that the
forces presented in these regressions are those produced solely
by the environment and do not include forces generated by the
subject. Considering the similarity in these regressions, we are
led to consider a single model to describe the experimental
data. Indeed, using a single model to account for the data is
consistent with the more general Occam’s razor principle of
minimizing the number of parameters (i.e., the complexity of
the model or the “size” of the hypothesis space) needed to
account for the data. This single model suggests the existence
FIG. 6. High-curvature group trajectories. The 1st 6 trajectories from var- of a common adaptation strategy for the low-curvature and
ious stages of adaptation for a representative subject. Green squares represent high-curvature groups, whose outcome depends on the amount
the start position: red circles represent the goal position. of interface force experienced in the early interactions with the
surface as a boundary (Fig. 8B) as the stiffness level of the field/surface. Accordingly, the level of initial interface force
virtual surface increased. Again, the learning curves from this experienced may be the key factor in subjects’ perception of a
protocol show after-effect washout after 50 movements in a field as a surface. A gradual trend was seen between subjects’
null field (Fig. 9). level of interface force before learning and their probability of
As in the case of the surface experienced by the low- perceiving a surface (Fig. 10B). The threshold of preadaptation
curvature group, the learned behavior followed a gradual trend interface force at which subjects began to perceive surfaces
across stiffness levels. Another common feature between the greater than chance occurs at 1.0 N. The fact that interface
adaptations to fields of different curvatures was the average force, rather than stiffness or curvature, is the best predictor of
interface force experienced after learning. Results showed that, the final classification suggests (see DISCUSSION) that the feed-
in the case of the higher curvature surface, subjects again forward command plays a central role in haptic perception.
achieved an invariant level of interface force (Fig. 9B). While
the level of force was larger than that found for the low- Responses io decreasing siiffness: a memory effeci
curvature group, it remained relatively constant across all
stiffness levels. A two-factor ANOVA without replication did We found a strong memory effect when surfaces of high
not find a significant difference among the six stiffness levels stiffness were presented before those of lower stiffness. Four
(F6,5 1.272: P 0.307) after learning. For both groups, high subjects were presented with field stiffness in descending order
curvature and low curvature, the relative excursion of average and were asked to execute the reaching task in the presence of
interface force was smaller than the excursion in the kinematic a field with the same geometry as presented to the low-
A B
−3
x 10
10 1
Probability of Perceiving a Surface

8
Area Reaching Deviation (m 2)

4 0.75

Psychometric Function
Bootstrap 95% CI
0

−2 0.5
0 200 400 800 1200 1600 2000 0 200 400 800 1200 1600 2000

Stiffness (N/m) Stiffness (N/m)

FIG. 7. High-curvature group catch trial data and psychometric function. A: ARD for catch trials of subjects interacting with a surface of curvature 20 m 1.
Colored lines are data from a single subject: bold black line is a group average. B: a psychometric function computed for all subjects.
FIG. 8. Learning curve for the high-curvature
group. White blocks represent null field presentation,
dark gray blocks represent force field presentation,
and light gray blocks represent a phase of pseudoran-
dom catch trials.

curvature group. We observed that, as the stiffness of the field (Fig. 4B). Furthermore, at low stiffness levels (200 and 400
was reduced, starting from a level that clearly revealed the N/m), we found a significant reduction in lateral deviation after
shape of the boundary, subjects had a marked tendency to practice. The same subjects generated after effects with devi-
move along the boundaries of subsequent low stiffness sur- ation opposite to the direction of the force. This is consistent
faces. Thus the identification of a rigid boundary persisted in with the hypothesis that, at low stiffness and curvature, sub-
lower stiffness fields. The data in Fig. 11 show ARD for catch jects developed an internal representation of the forces gener-
trials from these subjects. It is apparent that the after effects ated by the virtual object. This result is in holding with
were in the direction of the boundary (i.e., positive ARD) even observations from previous force field adaptation experiments,
at the lowest stiffness values, suggesting that subjects were where the ability to recover a straight-line movement after
biased by their initial perception of surface boundaries during adaptation was considered to result from an internal model of
initial high stiffness interactions. This is clearly in contrast the perturbing field (Shadmehr and Mussa-Ivaldi 1994: Thor-
with the trend shown in Fig. 5A. Because of this memory oughman and Shadmehr 2000).
effect, we decided to present the subsequent boundaries in At the higher stiffness and curvature levels, subjects aban-
ascending order, which would have allowed us to detect a doned the goal of reducing straight line deviations. Instead,
transition in behavior if a critical value of stiffness delimited they reduced the interface force by complying with the shape
the classification of a force field as either a disturbance or an of the object boundary. This trend is shown by increases in
object. lateral deviation and reductions in interface force after learn-
ing. This suggests that, at high stiffness, subjects formed an
DISCUSSION internal representation of the virtual surface based on the
reduction of the interface force to a given level.
We found that the adaptation of movements to a virtual After effects from catch trials at higher stiffness levels
surface is dependent on both surface stiffness and surface further show the difference in adaptation between low stiffness
curvature. For a given surface curvature, when subjects exceed and high stiffness fields (Fig. 5A). At lower stiffness levels, the
a threshold stiffness level, they learned to produce a smooth area reaching deviation in catch trials was negative, indicating
trajectory on the boundary of the surface. In contrast, at lower an adaptive mechanism that compensated for the field dynam-
stiffness, they adapted by recovering the unperturbed kinemat- ics and resulted in straight-line movements in the presence of
ics of hand movements in free space. While these distinctly the field. At higher stiffness and curvature levels (Fig. 8A), area
different adaptation strategies were clearly evident at the ex- reaching deviation for after effects became positive, consistent
tremes of tested stiffness, we did not observe a distinct dichot- with after effects in the direction of the applied forces and
omy but a smooth transition through a continuum of stiffness approximately following the profile of the virtual boundary.
levels. Furthermore, across these stiffness levels subjects pro- The magnitude of this positive after effect increased with
duced an invariant level of average interface force, suggesting increasing stiffness. Below a threshold value (K 1,200 N/m:
a common underlying strategy of interaction. K 15 m 1), the magnitude of the negative after effect
At the lowest stiffness (200 N/m) and low curvature, sub- increased with decreasing stiffness. In the case of high curva-
jects produced a higher average interface force after adaptation ture interactions, subjects positive after effects persisted
compared with interface forces observed before adaptation throughout the continuum of stiffness levels. While after ef-

FIG. 9. Learning data for the high-curvature


group. Measures shown are an average of the 1st
and last 5 trials averaged across all subjects. A:
ARD. B: interface force.
FIG. 10. Unified curvature results. A:
ARD after learning vs. interface force before
learning for both the low-curvature and high-
curvature groups. B: unified psychometric
function of perception of surfaces for an
average interface force before learning.

fects were always positive during these interactions, the mag- with a discrete switching of responses when the field stiffness
nitude of the after effects increased as surface stiffness in- reaches a perceptual threshold. However, our results are un-
creased. equivocally dismissing such a possibility. Instead, we find a
Taken together, these data suggest that subjects may have gradual transition of after effects, from compensatory to com-
learned to respond to the perturbation in two ways: 1) by pliant responses. This modification suggests that subjects alter
enforcing a nominal trajectory (the straight line from start to their desired trajectory of movement in the face of environ-
end target) at low stiffness and 2) by modifying the nominal mental conditions. In the case of low stiffness, the desired
trajectory, so as to comply with the perceived object boundary. trajectory is a straight line, whereas at higher stiffness, the
The two responses are evident at the extremes of the tested desired trajectory is one that traces the object boundary. As
stiffness values: the first response appears at low stiffness, surface stiffness increases, and interaction forces begin to
whereas the second appears at high stiffness. Interestingly, increase, subjects may modify their desired trajectory of move-
however, we did not observe a sharp transition between these ment to accommodate the increasing forces of interaction.
responses, but a smooth change from 1 to 2. We wish to stress Rather than fighting the interactions forces through the contin-
that, in the in the context of force field adaptation, this is a uum of surface stiffness, subjects develop a desired trajectory
rather paradoxical finding. that maintains a constant level of interaction force and con-
Based on earlier studies of adaptation, one would expect to forms to the object boundary.
observe increasingly larger after effects as the intensity of the The gradual transition of responses suggests that subjects
perturbing field increases. In contrast, a smooth transition of may execute hand movements by combining multiple modules
responses, such as the one observed in this study (Fig. 5A), of control. This is consistent with the view that movements are
corresponds to a progressive reduction of the after effects as generated by a combination of time-varying force fields (Bizzi
the force field stiffness increases. The tendency to larger after et al. 1991, 1995: d’ Avella and Bizzi 1998). The control
effect with increasing stiffness would have been consistent module that dominates during interaction with low stiffness
−3
x 10
3

Descending Order of Stiffness Presentation

2
Area Reaching Deviation (m 2)

FIG. 11. ARD for catch trials of subjects interacting with a


0 surface of curvature 15 m 1. Surfaces were presented in order
of descending stiffness. Colored lines are data from single
subjects: bold black line is the group average.

−1

−2

−3
0 200 400 800 1200 1600 2000
Stiffness (N/m)
surfaces is one that strives to resist the perturbation of the field lower force contact and be suppressed and turn off once a
and recover a straight motion of the hand across the external threshold of force has been reached (Johansson and Westling
field. At higher stiffness levels, the prevailing control module 1991). Collectively, a network of receptors distributed on the
is one that modifies the nominal trajectory to comply with the surface of the hand allow for a narrow range of detection of
object boundary. Through a continuum of force fields, the force (pressure) and its geometric distribution on the hand.
weighting of these two modules would result in the observed During a high level of contact force, a large number of
smooth transition of learned behavior. receptors in the network may be overly excited and become
While the concept of two control modules being combined inactive, resulting in a deficit in sensory information. Con-
to form field/object representations is plausible, this is not the versely, if too low a level of force is applied a smaller number
only plausible explanation of subjects’ behavior. Rather than a of receptors may be activated, again resulting in a deficit in
linear interpolation between two control schemes, subjects’ sensory information. Therefore there may be an optimal con-
adaptation may also by described by a simple continuous
tact force that allows the task to be completed while maximiz-
representation that estimates the external state as a function of
ing activation of the receptor network and thus maximizing
experienced dynamics. This idea of a single continuous repre-
sentation is supported by the linear relationship between pre- sensory and haptic feedback about the surface in contact.
adaptation interface force and postadaptation after effects (Fig. While the two adaptive behaviors are present (i.e., restora-
10A). tion of an unperturbed motion and a compliant response), there
The invariance of average interface force after learning, seems to be a rather gradual transition between them. This
together with the consistent relation between initial interface finding is in contrast with our own initial prediction and with
force and final adapted response, is in holding with other some known perceptual transitions (e.g., the Necker cube
findings that suggest that there may be very simple mecha- illusion). Furthermore, we found that the observed response
nisms for scaling muscle activations and coordination patterns across a variety of stiffness values leads to a constant average
to produce consistently low amounts of net fingertip force interaction force after adaptation. This gives support to the
(Valero-Cuevas 2000). Subjects may choose control policies hypothesis that the adaptation response may be mediated by a
that result in the nominal interface force they are willing to mechanism that attempts to enforce a constant interaction force
sustain. This level of interface force remains constant across (rather than a nominal trajectory). However, further research is
stiffness levels within experiments with the same curvature. needed to understand the control mechanisms the CNS uses
The idea of constancy of interaction force has also been when developing internal representations of the objects with
reported in another a recent haptic contact experiment, in which we interact.
which subjects explore a virtual environment of varying stiff-
ness using an instrumented stylus (Walker and Hong 2004). It GRANTS
has also been shown that there may be very simple mechanisms
for scaling muscle activations and coordination patterns to This work was supported by National Institute of Neurological Disorders
and Stroke Grants R01-NS-35673 and F31-NS-49795.
produce consistently low amounts of net force.
The idea that the nervous system uses an invariant average
interface force as a cue for the mediation of control policies is REFERENCES
further suggested by the similarity in behaviors seen for sur- Bizzi E, Giszter S, Loeb E, Mussa-Ivaldi F, and Saltiel P. Modular
faces of different curvature. It is likely that with a surface of organization of motor behavior in the frog’s spinal cord. Trends Neurosci
higher curvature, subjects experience larger interface forces 18: 442– 446, 1995.
caused by an initial increased level of surface penetration. This Bizzi E, Giszter S, and Mussa-Ivaldi F. Computations underlying the
execution of movement: a novel biological perspective. Science 253: 287–
may explain why they tend to comply with the surface curva- 291, 1991.
ture starting from the lowest stiffness levels. Colgate J and Brown J. Factors affecting the z-width of a haptic display. In:
One should observe that the interface force is not, per se, a IEEE Iniernaiional Conference on Roboiics and Auiomaiion. San Diego,
good classifier of the rigidity of a boundary. Different interface CA, 1994, p. 3205–3210.
d’ Avella A and Bizzi E. Low dimensionality of suprasinally induced force
forces against a given object can be generated by hand move- fields. Proc Nail Acad Sci USA 95: 7711–7714, 1998.
ments driven by different motor commands. One can deliber- Fasse E, Hogan N, Kay B, and Mussa-Ivaldi F. Haptic interaction with
ately push against a constraint or generate a light touch. virtual objects. Biol Cybern 82: 69 – 83, 2000.
However, given an invariant motor command, boundaries with Flash T and Gurevich I. Arm stiffness and movement adaptation to external
different degrees of rigidity will induce different levels of loads. In: Annual Conference on Engineering in Medicine and Biology,
Orlando, FL, 1992, p. 885– 886. (Medical Biology Conference 13)
interface force. The variable pattern of contact forces in the Flash T and Hogan N. The coordination of arm movements: an experimen-
initial phase of each experiment (Figs. 4B and 7B) indicates tally confirmed model. J Neurosci 5: 1688 –1703, 1985.
that subjects produced a relatively stereotyped motor com- Gandolfo F, Mussa-Ivaldi F, and Bizzi E. Motor learning by field approxi-
mand, based on the assumption that the hand was to move in mation. Proc Nail Acad Sci USA 93: 3843–3846, 1996.
Hogan N. An organizing principle for a class of voluntary movements.
free space. Under this condition, a given level of contact force J Neurosci 4: 2745–2754, 1984.
can be taken as an indicator of the rigidity of the encountered Hogan N, Kay B, Fasse E, and Mussa-Ivaldi F. Haptic illusions: experiments
boundary and can therefore be used for classification (Fig. on human manipulation and perception of “virtual objects”. In: Cold Spring
10B). Harbor Symposium on Quaniiiaiive Biology. Cold Spring Harbor, NY: Cold
A physiological account for the detection and regulation of Spring Harbor Laboratory Press, 1990, p. 925–931.
Johansson R and Westling G. Afferent signals during manipulative tasks in
interface force may result from the manner in which tactile man. In: Informaiion Processing in ihe Somaiosensory Sysiem, edited by
afferents respond to touch. During grip tasks, FA I mechano- Frazen O and Westman J. London, UK: Macmillan Press Ltd, 1991, p.
receptors in glabrous skin tend to initiate a response and fire at 25– 47.
‫הפתעה מלאכותית‬
‫גלילאו‪ ,‬יום שני‪ 28 ,‬ביולי ‪2008‬‬

‫מהי הפתעה‪ ,‬מה חשיבותה‪ ,‬מודלים קוגנטיביים ורגשיים‪ ,‬ואיך עולם התוכנה משתלב בתהליך‬

‫הרובוט דומו‬

‫הפתעה היא תגובה למצב שלא צפינו אותו מראש‪ .‬אם אין לך כל ציפייה לגבי‬
‫העתיד‪ ,‬לא תוכל להיות מופתע‪ ,‬אבל קשה לדמיין קיום כזה‪ .‬לצפות איך יתפתח‬
‫המצב בעתיד‪ ,‬לתכנן לפי ציפיות אלו‪ ,‬לבדוק אם הציפיות התממשו ולתקן את‬
‫התכניות בהתאם ‪ -‬תיאור זה ממצה מגוון רחב של התנהגויות של חיות ושל‬
‫בני‪-‬אדם‪ ,‬וכן של פעולות של מערכות מכניות ואלקטרוניות‪.‬‬

‫בלא ניבוי המצב העתידי לא נוכל להתאים את תכניותינו למצבים שבהם סביר‬
‫שנמצא את עצמנו‪ .‬הצורך להשוות בין התוצאה שנצפתה לבין התוצאה שקרתה‬
‫בפועל נובע מכך שהידע שלנו אינו מושלם‪ ,‬ולכן גם הניבויים שלנו אינם יכולים‬
‫להיות מושלמים‪ .‬למעשה‪ ,‬יש סיבות תיאורטיות לחשוב שניבויים אינם יכולים כלל להיות מושלמים ‪ -‬אם כתוצאה‬
‫מהאקראיות של פיזיקת הקוואנטים ואם כתוצאה מהמתמטיקה של מערכות כאוטיות‪ ,‬שמופיעה גם בפיזיקה‬
‫הקלאסית‪.‬‬

‫החשיבות שבגורם ההפתעה‬

‫התנהגות של ניבוי ותיקון אינה חייבת להיות מסובכת‪ :‬לדוגמה‪ ,‬נניח כי שלב מסוים בתהליך הייצור דורש מרובוט‬
‫תעשייתי להזיז את הזרוע ‪ 25‬מעלות ימינה ‪.‬התוכנה תחשב כי כדי להגיע לתזוזה כזו‪ ,‬יש להפעיל את המנוע‬
‫המתאים בסדרה של ‪ 1,500‬אותות ספרתיים‪ ,‬שכל אחד מהם מקדם את הזרוע בשיעור קטן בכיוון הנכון‪.‬‬

‫מתכנני הרובוט לא יסתפקו בכך‪ :‬הרובוט כולל גם חיישן אחד לפחות המודד את מיקום הזרוע בפועל‪ .‬תוך כדי‬
‫התנועה‪ ,‬התוכנה תשווה את המקום שבו צפויה להיות הזרוע )לפי האותות שנשלחו למנוע( עם המקום שבו‬
‫נמצאת הזרוע לפי הדיווח מהחיישן‪ .‬ייתכן שיידרש תיקון קל‪ ,‬כתוצאה מכך שאף אחד ממרכיבי המערכת אינו‬
‫מושלם‪ .‬מנגנון תיקון כזה הוא מנגנון משוב קלאסי‪.‬‬

‫נניח שאכן‪ ,‬כתוצאה מהמשוב‪ ,‬יופעל המנוע ב‪ 1,498-‬אותות במקום ב‪ 1,500-‬עד להגעת הזרוע למיקום הנדרש‪.‬‬
‫אם "טעות" כזו )של רק מעט יותר מעשירית האחוז )היא בתוך תחומי התכנון המקורי‪ ,‬סביר שרוב האנשים‬
‫יסכימו כי זוהי פעולה תקינה שלא תפתיע את מתכנני הרובוט ‪ -‬או את הרובוט עצמו‪ ,‬אילו היתה לו יכולת להיות‬
‫מופתע‪.‬‬

‫אבל ייתכן גם מצב אחר‪ :‬לאחר שעברה שליש מהדרך‪ ,‬הזרוע עוצרת ואותות נוספים אינם גורמים לה להמשיך‬
‫בדרכה‪ .‬זהו מצב שאינו צריך לקרות‪ .‬מה היינו רוצים שהרובוט יעשה במצב כזה? האם על התוכנה להמשיך‬
‫לנסות להפעיל את מנועי הרובוט כדי להביא את הזרוע ליעדה? נראה שעדיף לעצור את התהליך‪ ,‬ואף להסיט‬
‫חזרה את הזרוע‪ ,‬כפי שקורה כאשר דלתות המעלית מתחילות להיסגר ונתקלות במכשול‪ ,‬וכפי שקורה לנו ‪-‬‬
‫הרבה יותר מדי פעמים ‪ -‬עם מערכת ההפעלה במחשב ‪.‬אפשר גם לדמיין רובוט משוכלל מפנה מצלמות אל‬
‫האזור הבעייתי כדי לנסות לפענח מה גרם לעצירה‪ .‬אילו היה זה אדם שהיה עוצר פעולה שגרתית כאשר זו לא‬
‫התפתחה בצורה הנורמלית‪ ,‬מסלק את ידיו מהמקום שבו היו ומפנה את כל תשומת לבו אל אותה פעולה‪ ,‬היינו‬
‫מפרשים את התנהגותו כגילוי של הפתעה‪.‬‬

‫אם כך‪ ,‬אפשר לראות את ההפתעה כמשרתת צורך חשוב‪ :‬הצורך לגלות כי קרה משהו השונה מהותית מכל מה‬
‫שציפינו לו; להגיב תגובה מהירה עוד בטרם פענחנו את השוני ואת הסיבה ‪ -‬מכיוון שייתכן כי שוני זה מהווה סיכון‬
‫לנו או להצלחת המשימות שאנו מנסים לבצע; ולגייס משאבים כדי להבין את המצב וליצור תכנית פעולה חדשה‪.‬‬
‫משאבים אלו עשויים להיות קוגניטיביים )הפניית קשב(‪ ,‬סנסוריים( הפניית חיישנים(‪ ,‬מוטוריים )הפעלת תנועות‬
‫מתאימות( או אנרגטיים‪ .‬ברור שהפניה כזו של משאבים משפיעה על פעולות אחרות המתבצעות באותו הזמן‪,‬‬
‫ועשויה להגביר את צריכת האנרגיה או ליצור סיכונים נוספים )למשל‪ :‬כאשר הצורך במידע נוסף מחייב את החיה‬
‫הניצודה להוציא את ראשה מהמחבוא(‪ .‬לכן‪ ,‬ההפתעה גוררת תגובה של עוררות כללית המשפיעה על כל מרכיבי‬
‫ההתנהגות‪ ,‬ומביאה פעילות שגרתית אל "אור הזרקורים" של המודעּות והתכנון‪.‬‬

‫מודלים קוגניטיביים של הפתעה‬

‫בכנס העשרים של ‪ IJCAI (International Joint Conference on Artificial Intelligence),‬הכנס הבינלאומי‬


‫לבינה מלאכותית‪ ,‬שנערך בהודו בינואר ‪, 2007‬הוצג מאמר בשם "הפתעה כקיצור‪-‬דרך לציפייה"‪ .‬המאמר‪ ,‬מאת‬
‫מישל פיונטי ‪ (Piunti),‬כריסטיאנו קסטלפרנצ'י )‪ (Castelfranchi‬ורינו פלקונה )‪ (Falcone‬מהמכון למדעים‬
‫וטכנולוגיות קוגניטיביים באיטליה‪ ,‬מציע מודל התנהגותי שבו ההתנהגות אינה מבוססת רק על למידה‬
‫סטטיסטית‪ ,‬אלא גם על ייצוג של "אמונות" ‪",‬מטרות" ו"ציפיות‪".‬‬

‫למידה סטטיסטית מקשרת בין פריטי מידע כך שאפשר להשתמש בקישור כזה לניבוי‪ :‬אם ב‪ 80%-‬מהמקרים‬
‫שבהם רובוט עבר באזור ‪ X‬הוא מצא שם חפצים מהסוג שהוטל עליו לאסוף‪ ,‬כאשר הרובוט יבחן פעולות שונות‬
‫הוא יוכל לדרג את הפעולה" תנועה לכיוון "‪ X‬כבעלת עדיפות גבוהה‪ ,‬כתוצאה מהערכה כי יש סבירות גבוהה‬
‫שיוכל למצוא שם את הפריטים שהוא מחפש‪ .‬ההבדל בין למידה כזו לבין יצירת" אמונה )‪" (belief‬הוא עדין אך‬
‫משמעותי‪ :‬קשר סטטיסטי יכול להיות בעל חוזק כלשהו‪ ,‬וכל פריט מידע נוסף יכול להקטין או להגדיל את חוזק‬
‫הקשר‪ .‬לכן קשה להגדיר איזה מידע חדש יכול להיות מוגדר כהפתעה‪ .‬לעומת זאת‪" ,‬אמונה" מעלה את הקשר‬
‫לדרגת ניבוי‪ ,‬כך שכל פריט מידע הסותר את הניבוי יהווה הפתעה‪ .‬מובן שלא כל אמונה חזקה באותה מידה‪,‬‬
‫וככל שהאמונה "חזקה" יותר‪ ,‬מידע הסותר אותה צריך להוות הפתעה גדולה יותר‪.‬‬

‫לפי המודל הפסיכו‪-‬אבולוציוני של וולף מאייר ‪ (Meyer),‬המהווה חלק מהתשתית התיאורטית של מחקר זה‪,‬‬
‫אירועים הגורמים להפתעה מפעילים את התהליך הזה‪ :‬ראשית‪ ,‬המידע מהחושים או מהחיישנים מזוהה כמהווה‬
‫הפתעה מעבר לסף מסוים‪ .‬כתוצאה מכך‪ ,‬מופסקים או מעוכבים תהליכים קוגניטיביים אחרים‪ ,‬כדי להפנות‬
‫משאבים לחקירת האירוע‪ .‬במקביל לחקירת האירוע‪ ,‬ננקטות גם פעולות מיידיות המכוונות לאיסוף מידע נוסף‬
‫)כמו זקיפת אפרכסות האוזניים בחיות רבות( או להיערכות למשמעויות אפשריות של ההפתעה )לדוגמה‪ ,‬יצורים‬
‫שבהם התגובה הראשונה לרעש מפתיע היא היצמדות לקרקע וקפיאה במקום(‪ .‬בהמשך ‪,‬כתוצאה מחקירת‬
‫המידע הראשוני והמידע הנוסף שהתקבל‪ ,‬מתעדכנות האמונות וכתוצאה מכך נוצרות ציפיות חדשות ומטרות‬
‫חדשות‪.‬‬

‫מודלים רגשיים של הפתעה‬

‫המודל של שילוב הרגשות בתהליך קבלת ההחלטות קיבל תימוכין גם מעבודתם של חוקר המוח אנטוניו דמסיו‬
‫)‪(Damasio‬וקבוצתו‪ .‬הם טוענים כי המוח קורא את התגובות הרגשיות )כמו דופק והזעה( ואז מגיע להחלטה‪,‬‬
‫שהיא בדרך‪-‬כלל נכונה ‪.‬אם מערכת קריאה ואינטגרציה זו‪ ,‬האונה הקדם‪-‬מצחית‪ ,‬נפגעת‪ ,‬האדם יודע להסיק‬
‫מסקנות תיאורטיות נכונות‪ ,‬אך נוטה לטעויות קשות ביישום של המסקנות לגבי עצמו‪.‬‬

‫בעשור האחרון נוצרו בסיסים תיאורטיים וניסוייים להבנת התפקיד של רגשות בקבלת החלטות וביצירת ציפיות‪.‬‬
‫מחקרים בנוירו‪-‬פסיכולוגיה ובתחום החדש של נוירו‪-‬כלכלה ‪ ( Neuroeconomics -‬המפגש של מדעי הכלכלה‬
‫עם מדעי המוח( מראים כי רגשות יכולים לשמש כמנגנון המפעיל עוררות ועוזר לעבור במהירות בין התנהגויות‪,‬‬
‫כמו למשל בין איסוף מידע‪ ,‬איסוף מזון או תגובות הגנתיות‪ .‬זוהי הכוונה בביטוי "קיצור‪-‬דרך" בכותרת המאמר‪:‬‬
‫כפי שהתברר לכלכלנים בעשורים האחרונים‪ ,‬גם במקרה הלא‪-‬מציאותי של קיום מידע מושלם‪ ,‬מגבלות של זמן‬
‫ומשאבים קוגניטיביים אינן מאפשרות ניתוח מלא והסקת כל המסקנות ה"מתחבאות "באותו מידע‪ .‬מצבים‬
‫רגשיים כמו הפתעה משמשים כאסטרטגיה לניהול יעיל של חלוקת הזמן והמשאבים בין ניתוח לבין פעולה‪ ,‬כאשר‬
‫המידע אינו שלם‪ ,‬ואפילו המידע שכבר הושג‪ ,‬ניתוחו עדיין לא הושלם במלואו‪.‬‬

‫הסוכן והמאורה‬
‫כדי לבחון את האפקטיביות של מודלים אלה‪ ,‬החליטו פיונטי ועמיתיו למחקר להתחיל בהגדרה של סביבה‬
‫פשוטה ושל "סוכנים" פשוטים הפועלים בתוכה‪" .‬סוכן ‪",‬בהקשר זה‪ ,‬הוא עצם ‪ -‬וירטואלי או פיזי ‪ -‬שיש לו יכולת‬
‫לאסוף מידע‪ ,‬לפעול על פי המידע שקיבל וליצור אינטראקציה עם סביבתו‪ .‬החוקרים יצרו תוכנה המדמה סביבה‬
‫וירטואלית המיוצגת במפה דו‪-‬ממדית‪ ,‬שבה קירות ומכשולים מגבילים את מסלולי התנועה האפשריים‪ .‬בשלושה‬
‫אתרים בתוך סביבה זו מופיעים שלושה סוגי" מזון"‪ ,‬בתדירות שונה‪ .‬לכל סוג מזון יש טיב שונה עבור הסוכן‪.‬‬
‫מטרת הסוכן היא לנוע על המפה‪ ,‬למצוא מזון ולהביא אותו למקום מוגדר )שאפשר לחשוב עליו בתור‬
‫"המאורה"(‪ .‬כאשר המזון מגיע למאורה‪ ,‬הסוכן מרוויח אנרגיה שכמותה תלויה בכמות המזון‪ ,‬בטיב המזון ובזמן‬
‫שעבר מרגע האיסוף עד רגע הגעתו למאורה‪.‬‬

‫הסביבה כוללת גם סכנות‪ ,‬שבמקרה זה מוגדרות כ"מדורות"‪ .‬מדורות מופיעות בתחילה כ"עשן"‪ ,‬המשמש‬
‫כאזהרה‪ ,‬ואז מתפתחות ל"להבה" היכולה להזיק לסוכן בכך שהיא מפחיתה מהאנרגיה שלו‪ .‬מדורות נפוצות יותר‬
‫בחלקים מסוימים של המפה מאשר באחרים‪ .‬מדורה שהופיעה במפה יכולה גם לנוע ולהגיע למקומות אחרים‪.‬‬

‫בכל רגע‪ ,‬הסוכן יכול להחליט באיזו מהירות לנוע וכמה אנרגיה להפנות לחיישניו‪ .‬ככל שהחיישנים מקבלים‬
‫אנרגיה גבוהה יותר‪ ,‬הם יכולים לחוש מוקדם יותר בסכנות ובמזון‪ .‬מובן שגם תנועה מהירה צורכת אנרגיה רבה‬
‫יותר מאשר תנועה אטית‪ .‬כאמור‪ ,‬תנועה מהירה עשויה להביא את המזון מהר יותר למאורה ולכן להגדיל את‬
‫רווח האנרגיה‪ .‬אפשר להסיק‪ ,‬אם כן‪ ,‬שאף שהסביבה שתוארה כאן פשוטה לאין שיעור מסביבות "אמיתיות"‪ ,‬היא‬
‫מספיק מורכבת כדי ליצור אתגרים קשים לתכנון ולהתנהגות‪.‬‬

‫סוכן בעל מצבים רגשיים‬

‫המאמר מציג השוואה בין שני סוכנים‪ .‬הסוכן הראשון קיבל את השם ‪SEU (Subjective Expected Utility),‬‬
‫משום שבכל רגע הוא בוחר מבין הפעולות האפשריות )תנועה‪ ,‬שינוי מהירות‪ ,‬שינוי האנרגיה המוקצית לחיישנים‪,‬‬
‫איסוף מזון וכו'( אותה פעולה הצפויה להביא לתועלת )‪ (utility‬הגבוהה ביותר‪ ,‬לפי הידע החלקי והסובייקטיבי של‬
‫אותו סוכן‪.‬‬

‫הסוכן האחר קיבל את השם )‪ MS (Mental States‬כי נוסף על מנגנון ה ‪-SEU,‬הוא כולל גם "מצבים נפשיים"‪.‬‬
‫המצבים האפשריים של סוכן זה הם "נורמלי" ‪",‬משועמם"‪" ,‬מרוגש"‪" ,‬זהיר" ו"סקרן"‪ .‬כל מצב משפיע בצורה‬
‫שונה על חישוב התועלת הצפויה מהפעולות האפשריות‪ ,‬ולכן מוביל להתנהגות שונה‪ .‬הגורם המשפיע על מעבר‬
‫בין מצבים אלה הוא הופעתם של אירועים המהווים הפתעות חיוביות או שליליות‪ .‬סדרה של הפתעות חיוביות‬
‫תשפיע על הסוכן לעבור למצב "מרוגש"‪ ,‬שבו הערכת התועלת של פעילויות המביאות לאיסוף מהיר של "מזון"‬
‫תהיה גבוהה יותר מההערכה הנגזרת ממנגנון ה ‪-SEU,‬ולכן עשויה לגבור על הערכת מרכיבי הסיכון של אותן‬
‫פעולות‪ .‬בדומה לכך‪ ,‬הצטברות של הפתעות שליליות תוביל למצב רגשי" זהיר"‪ ,‬שבו הדגש הוא על הימנעות‬
‫מסיכונים‪.‬‬

‫אין להסיק מתיאור זה כי אותו "סוכן" ‪ -‬תוכנת מחשב הפועלת בתוך סימולציה ממוחשבת של סביבת קיום‬
‫פשוטה ‪ -‬הוא באמת בעל רגשות בני‪-‬השוואה לרגשות אנושיים‪ .‬אילו היה הדבר כך‪ ,‬היינו אולי צריכים להסס‬
‫בטרם נכבה את המחשב ‪,‬או "נדליק" להבות המאיימות לשרוף את הסוכן שלנו‪ .‬כאן‪ ,‬המונח "מצב רגשי "משמש‬
‫רק כדי להיעזר באנלוגיה למודלים קוגניטיביים ופסיכו‪-‬אבולוציוניים‪.‬‬

‫עם זאת‪ ,‬האנלוגיה מעניינת מספיק כדי שנוכל לתהות‪ ,‬בצורה דומה לשאלות עבור בני‪-‬אדם‪ :‬מה התועלת‬
‫בהטיית השיפוט על‪-‬ידי המצב הרגשי? האם בחינה רציונלית וחסרת פניות של מיטב המידע העומד לרשותנו‬
‫אינה עדיפה על החלטה" רגשית?"‬

‫ניצחון הרגש על הרציונל‬

‫מתברר שההטיה הרגשית היא אכן המנגנון העדיף‪ ,‬לפחות על פי תוצאותיו של מחקר זה‪ .‬עבור סביבות‬
‫"בטוחות" )עם מספר קטן של להבות(‪ ,‬סוכן ‪ MS‬פעל בצורה" סקרנית" יותר‪ ,‬וכתוצאה חקר את סביבתו וניצל את‬
‫מקורות המזון בצורה יעילה יותר‪ .‬עבור סביבות "מסוכנות"‪ ,‬סוכן ‪ MS‬פעל רוב הזמן במצב הרגשי "זהיר ‪",‬וכך‬
‫נמנע מהנזק שספג סוכן‪SEU.‬‬
‫אפשר להקשות כאן‪ :‬אם המצב הרגשי מתבטא בהתנהגות המתאימה את עצמה למאפייני הסביבה )כמו שכיחות‬
‫הסכנות ושכיחות מקורות המזון(‪ ,‬האם לא היה נכון יותר לשפר את התוכנה של הסוכן כך שתכלול חישובים לגבי‬
‫מאפייני סביבה אלה‪ ,‬תלמד אותם ותשתמש בהם כדי להגיע לאותן החלטות יעילות מבלי להזדקק לרעיון של‬
‫"מצבים רגשיים?"‬

‫המצבים הרגשיים כפי שהוגדרו במחקר זה הם בהכרח פחות מדויקים‪ ,‬מכיוון שאין להם עוצמה או שילוב ‪ -‬הסוכן‬
‫אינו יכול להיות ‪ 20%‬סקרן ו‪ 30%-‬מרוגש ‪.‬כותבי המאמר אינם מתייחסים לכך‪ ,‬אבל נראה לי כי מודל כזה‬
‫ללמידה רציונלית של מאפייני הסביבה אינו נכון‪ ,‬גם מכיוון שהוא אינו כללי אלא דורש פיתוח ספציפי עבור כל‬
‫שינוי בהתנהגות הסביבה‪ ,‬בחיישנים וכו'; וגם כי בעולם האמיתי יש עלות גבוהה לשיפורים קוגניטיביים‪ ,‬במונחים‬
‫של צריכת אנרגיה ושל אדפטציות מיוחדות )כמו המאפיינים הייחודיים של האדם הקשורים לגודל הראש" ‪).‬קיצור‬
‫הדרך" של השימוש ברגשות הוא תחליף יעיל לשיפורים כאלה‪.‬‬

‫תוכנה שמבינה מתי אדם מופתע‬

‫הסוכן שתואר לעיל הוא תוכנה שאין לה כל אינטראקציה עם בני‪-‬אדם‪ ,‬אבל רוב התוכנות פותחו כדי לשרת צרכים‬
‫אנושיים ולתקשר עם משתמשים‪ .‬תוכנות כאלה יכולות לתקשר בצורה יעילה יותר אם יכללו מודלים קוגניטיביים‬
‫שיוכלו לחזות מה יפתיע את המשתמש‪.‬‬

‫המגזין רב‪-‬ההשפעה ‪ Technology Review ,‬המוצא לאור על‪-‬ידי( ‪MIT‬המכון הטכנולוגי של מסצ'וסטס(‪,‬‬
‫מפרסם מדי שנה דו"ח מיוחד המציין עשר טכנולוגיות חדשות הצפויות להשפיע על העולם‪ .‬בין הטכנולוגיות‬
‫שנבחרו עבור שנת ‪ 2008‬נמצא גם הרעיון של יצירת מודלים להפתעה‪ .‬הדוגמה שמביאים כותבי הדו"ח היא‬
‫חיזוי תנועה‪.‬‬

‫הרעיון עצמו פשוט‪ ,‬אם כי קשה למימוש‪ :‬אם נאסוף מידע רב המתאר את מהירות התנועה בחלקים גדולים‬
‫מרשת הכבישים והרחובות‪ ,‬כמה פעמים בכל שעה‪ ,‬במשך שנה ויותר‪ ,‬נוכל לענות על שאלות כמו "כמה זמן ייקח‬
‫לי להגיע מביתי למרכז העיר ביום רביעי הבא אם אצא בשש בערב?"‪ .‬התשובה אינה בהכרח אותה תשובה‬
‫שתתקבל עבור כל יום חול באותה שעה‪ :‬ייתכן שליום רביעי יש פרופיל שונה מאשר לימי שבוע אחרים‪ ,‬או אולי‬
‫יום רביעי הקרוב הוא היום האחרון לפני סוף חודש ‪,‬וכו'‪ .‬זהו שימוש אופייני‪ ,‬אם כי מאתגר‪ ,‬לטכנולוגיות של‬
‫"כריית מידע ‪",‬שהמשותף להן הוא סקירה של כמויות גדולות של מידע כדי לזהות תבניות אופייניות ולהסיק‬
‫מסקנות‪ .‬טכנולוגיות אלה משתמשות לעתים קרובות ברעיונות ובאלגוריתמים מתחום הבינה המלאכותית‪ .‬כמה‬
‫חברות מסחריות מספקות תוכנה כזו עבור חיזוי תנועה‪.‬‬

‫לפחות חברה אחת ‪ - Inrix -‬שואפת לספק למשתמשיה מידע שיעזור להם יותר ‪.‬מפתחי התוכנה‪ ,‬שהחלה את‬
‫דרכה כפרויקט פנימי של מיקרוסופט‪ ,‬הגיעו למסקנה כי כאשר תושב מקומי רוצה לדעת מה מצב התנועה‪ ,‬יש‬
‫דברים רבים שהוא כבר יודע ‪.‬אם השאלה נשאלת בשעות העומס‪ ,‬אין טעם לספק רשימה ארוכה של דרכים‬
‫שכצפוי התנועה בהן עמוסה ואטית‪ .‬הרבה יותר שימושי לאותו נהג לשמוע על דרכים שבצורה מפתיעה צפויות‬
‫להיות נוחות לנהיגה בשעה הקרובה‪ .‬לשם כך יש צורך במודל קוגניטיבי של הפתעה‪ :‬מה צפוי אדם לדעת‪ ,‬ומה‬
‫גודל הפער בין הציפייה לבין המציאות שדי בו להוות הפתעה? התשובות לשתי השאלות שונות מאדם לאדם ‪,‬‬
‫ולכן התוכנה מאפשרת לכל משתמש להתאים את התנהגות התוכנה לפי הידע שלו ולפי העדפותיו האישיות‪.‬‬

‫כותבי הדו"ח מסכימים עם מפתחי ‪ Inrix‬כי גישה זו היא כללית וכי יש לה פוטנציאל לתרומה משמעותית לצורה‬
‫שבה נעבוד עם מחשבים בעתיד‪ .‬אם נשתמש במנוע חיפוש כדי ללמוד על נושא כלשהו‪ ,‬ואם אותו מנוע חיפוש‬
‫כבר למד מספיק עלינו כדי לנחש בצורה מושכלת מה אנחנו כבר יודעים על אותו נושא‪ ,‬אנו נעדיף לקבל רק את‬
‫המידע המפתיע‪ .‬חשוב להדגיש כי מידע מפתיע אינו רק כזה שאינו ידוע לנו‪ ,‬אלא גם כזה שהוא מנוגד לציפיות‬
‫שלנו‪ .‬יכולת כזו היא כנראה עדיין רחוקה‪ ,‬אבל שימושים קרובים הרבה יותר עשויים להיות בתחומים הקלאסיים‬
‫של עיבוד וניתוח מידע‪ :‬מכל הגרפים והטבלאות המספריות המגיעים לשולחנו של חוקר מודיעין‪ ,‬או משקיע‬
‫בבורסה‪ ,‬או מנהל שיווק‪ ,‬מהו המידע הלא‪-‬צפוי‪ ,‬המפתיע ‪,‬הרומז כי משהו חדש קורה ומזמין לבדוק אם נוצרו‬
‫סיכונים או הזדמנויות חדשות? ייתכן שבקרוב נוכל לצפות מהתוכנה שלנו לשלוף פריטים כאלה ולהסב אליהם‬
‫את תשומת לבנו‪.‬‬
‫תוכנה שמפתיעה את מפתחיה‬

‫כל מתכנת מופתע מפעם לפעם מהתנהגות התוכנה שכתב‪ ,‬אבל בדרך‪-‬כלל זוהי תוצאה של טעות בתכנות‬
‫)"באג"(‪ .‬כשפרופ' מייקל ליטמן )‪ (Littman‬מאוניברסיטת ראטגרס בניו‪-‬ג'רזי הופתע מהתנהגות הרובוט שבנה‪,‬‬
‫הסיבה לכך היתה שונה‪ ,‬כפי שמתועד בסרטון שהעלה לאתר ‪ youtube.‬סרטון זה זכה במקום הראשון‬
‫בקטגוריית" וידיאו קצר"‪ ,‬בתחרות שנערכה במסגרת כנס האגודה האמריקנית לבינה מלאכותית בשנת ‪.2007‬‬

‫מטרתו של ליטמן היתה ליצור רובוט לומד‪ .‬הרובוט היה ‪ AIBO,‬הכלבלב הידוע של חברת סוני‪ ,‬והתוכנה ששלטה‬
‫בו ניסתה למצוא עבור הרובוט דרך החוצה מתוך חדרון סגור וחשוך‪ .‬בחדרון היה מתג שלחיצה עליו פתחה את‬
‫הדלת ואפשרה את היציאה‪ ,‬אבל לא היה אפשר לזהות את המתג בחשכה‪ .‬מתג אחר היה מואר‪ ,‬ולחיצה עליו‬
‫הדליקה את האור בחדרון כך שעיניו של הרובוט יכלו לגלות את מתג היציאה‪.‬‬

‫הרובוט הושם בחדר החשוך כמה פעמים‪ ,‬ובכל פעם הועמד בכיוון שונה ובמקום שונה‪ .‬לאחר ניסיונות אלו‪,‬‬
‫התוכנה הצליחה ללמוד דרך מהירה לצאת מהחדר‪ ,‬אבל זו לא היתה הדרך שלה ציפה ליטמן ‪ -‬הרובוט מצא‬
‫שיטה מהירה יותר מאשר הליכה אל מתג התאורה‪ ,‬לחיצה עליו ואז הליכה אל מתג פתיחת הדלת‪ .‬הקוראים ירצו‬
‫אולי לעצור לרגע ולחשוב מהי אותה שיטה מהירה‪.‬‬

‫לפני התשובה‪ ,‬הנה רמז‪ :‬מה היה קורה אילו כל ניסיון היה מתחיל כאשר הרובוט היה נמצא במקום ובכיוון‬
‫קבועים? מה היה יכול הרובוט ללמוד אז?‬

‫והתשובה‪ :‬הכלבלב האלקטרוני למד לכוון את עיניו אל המתג המואר‪ ,‬למקם את גופו בזווית הנכונה יחסית לאותו‬
‫מתג‪ ,‬וללכת אחורה עד שחלקו האחורי לחץ על מתג פתיחת הדלת‪.‬‬

‫טור זה עסק בהפתעה כמנגנון חסכוני להחלטה מהירה בתנאי אי‪-‬ודאות; בהפתעה כמנגנון לתקשורת יעילה;‬
‫ובהפתעה כתוצאה בלתי נמנעת של מערכות מורכבות ודינמיות‪ .‬אולי אין זה צריך להפתיע אותנו כאשר אנו‬
‫מוצאים את מושג ההפתעה קשור בצורה כה עמוקה לבינה מלאכותית ‪ -‬הרי זה כה אנושי להיות מודעים לכך‬
‫שהמידע שבידינו הוא תמיד חלקי ולא‪-‬מדויק‪ ,‬ולחיות עם מודעות זו על‪-‬ידי איזון בין הימנעות מהפתעות לא‪-‬‬
‫נעימות לבין משיכה אל החדש והמפתיע‪.‬‬
‫‪Appendix F‬‬

‫הפתעה מלאכותית‬

You might also like