Professional Documents
Culture Documents
ADAPTIVE NEURO-CONTROL
FOR SPACECRAFT ATIlTuDE CONTROL
K. KrishnaKumar
Department of Aerospace Engineering
The University of Alabama, Tuscaloosa, AL 35487-0280, USA.
e-mail:kkrishna@ual vm.ua.edu
.- ~ .-
advantages m using neuro-controllers for spacecraft attitude Now, consider the single neuron in the network
control. The Space Station (SS) is chosen as the ideal st"sh0wnmFigure 1. An individualneuronbas many
example for demonstrating these advantages. In what inputs depending on the number of connections. Each
follows, we first present the fundamentals of neuro-control connection to the neumn has a weight associated with it.
relevant to this study and outline a procedure for design and After the net input is calculated, it is converted into an
impkmentation of robust, adaptive neurocontrollers. Next, activation value through a functional relationship. The
we present the non-linear S S model used and show the power of ANN lies ultbin this transfer function. Two types
capabilities of neurocontrol in controlling the pitch attitude of transfer functions are used in this study. These are:
of the SS.These caprabihties are: (a) synthesis of robust non- (a) Sigmoidal function (for all hidden neurons):
liuear concr~llaws using neural networks; and @) adaptively
modifying neuro-controller characteristics using a memory
filter for the back-propagation of error.
2.0 ADAF"IvE NEURO-CONTROL CONCEPTS @) Ramp Function (for all input and output neurons):
2.1 Artifcial Neural Networks 2.2 Supervised learning using backpropagation (BP)of
The ANN structure used in this study is shown in error
Figure 1. It is assumed that every neuron takes connections The most important concept that underlies the ANN
from any or a l l neurons to the left of it. The degree of design is the come@of learning from experience. There are
connectivity depends on the number of connections within several useful learning paradigms for ANN, but the most
the structure. A fully forwardconnected network takes all widely used paradigm is the back-propagation (BP)
possible connections. This ANN configuration provides a algorithm. In BP, the ANN learns by repeated exposure to
network that can have up to "h" (number of hidden neurons) a set of training examples. The leaming takes place through
hidden layers. It is known that as the number of layers a first-order reduction in the output error quantity.
increases the accuracy of the mapping increases. One Backpopgation has been successfully used for mapping of
drawback of increasing the number of layers is that the non-linear functions.
number of parameters to be optimized increases and thus
affects the generalization capabilities of the networks (see 2.3 Neuro-control Techniques
hhelevich et al. [l?). A technique is outlined in reference Two neuroant1-01 techniques used in this study are
15 that optimizes the connectivity pattern as part of the outlined below.
supervised leaming. This technique has the flexibility to off-linesumised controller synthesis using batch learning
maintain the desired accuracy as well as reduce the number Supervised neurocontrol is s& to the indirect
of parameters m the network. Also, the network optimization control used in the adaptive control literature. In traditional
can potentially have up to 'h" neurons in the first layer. It adaptive control, a hear system model is identified and
has been shown before that this is important for using this model the control is adapted. Similarly, in
approximation capabilities of the neural network. As shown supervised neurocontrol, first the system is copied using a
later,tbe opcimauy connected network provides a foundation neural network. This system is called the neuro-model. For
for synthesizing robust neurocontrollers. robust control applications, it is important that the NNM is
general enough to accommodate uncertainties in the system
t modeling. The technique used in this study to arrive at a
U
I
{ Y robust NNM is outlined in Reference 15. (Henceforth,
neural networks whose connections axe pruned to provide
robust mapping are referenced as optimallv connected
networks.) Next, the controller weights are tuned using the
back-propagation of error through the neuro-model. Once
again, it is desired to train a NNC to be general enough to
accommodatemodeling errors. The role of the neuro-model
Figure 1. Artificial Neural Network Structure is to relate the output error to the controller error. It should
1154
be noted that the error definition for the model and the p t control actio-. One approximate, but effective way to
controller are different as shown in Figure 2. The error in simulate this correlation in real-time is to introduce a
the case of the controller is the error between the actual memory filter between the NNM and the error definition (as
system trajectory and a desired trajectory generated by a shown in Figure 3). The bandwidth of the memory filter is
reference model. set less than or equal to the bandwidth of the system. In this
manner, the BP of error "remembers" the past error and
tunes the weights, on-line, proportional to both present and
- - past errors. In this study, a first order filter of the form
I, U(K.1)
~ YAK'
J ~ + ~ = ~ J is~Used.
+ Eh ~ this equation, is the error
S Y m
i+r
CONTROL
For on-line adaptive control, the NNM is updated TORQUE
each time step and the NNC weights are tuned using the
updated NNM. In general, the NNM is updated at a
frequency higher than the NNC updates. For the NNC
1 ' INNER LOOP
-
ANCiLLa AND
RATES I
daptation, as statecl earlier, BTT is inadequate for real-time MOMENTUM
applications. A controller action at the present time OUTER
MAh'ACER
influences the future response of the system. The time LOOP U
horizon of this response is directly related to the bandwidth
of the system that is being controlled. BTT essentially Figure 4. Inner and outer loops for Space Station control.
correlates this timedependent error characteristics with the
1155
3.1 Space Station Model
Reference 1 presents the complete non-linear
model of the SS. Assumpt~onsof small rolvyaw attitude
..* .. e,+a,e,=o
generator
ed+a2ed+al
of the form
- . i
where €$is the relative pitch angle with respect to the Local-
U
D
Q
.
.
-
-
__-
Descred Output
Opiimally Connected N e i v o r k Output
V d d and Local-Horizontal (LVLH)ecphibrium point, n -2.00 -
is the orbital angular velocity, I,,,, are the principal
of inertia, and 4 is the control torque.
n = 0.0011 d s e ~
I, =5O.28x1O6 slug f?
4= lO.8OxlO6 slug f?
&=58.57~106slugftz 1 w 2.00 3 00 4.3c 5 oc
No. oi Orbits
1156
It was originally believed that it would be necessary connected networks, were tested for their robustness to
to train the controller for many random excitations to uzlcertaioees inthemoment of iner&ia of the pitch axis of the
gene& the controller characteristics. h interesting SS. Figures 7 and 8 SUrmDarize the tracking error
dkoveay was ideotified zeganimg the training of a NNC for performances and control energy utilized for control,
control. It WBS onty necessary to train the NNC for a single respectively. It is interesting to note that the optimally
distuhance magnitude and duration. Other disturbance COIlDected network outperforms the fully connected network.
magnitudes and time durations could be controlled without Also, beyond 15 % inertia variation, the controller
having to train the NNC for many random combinations. performances are unacceptable. This is where we need to
Concisely, the NNC had "learned" the dynamics of the adapt the NNM and the NNC to effectively realize an
model and to control the response to any disturbance. Figure adapwe controuer. Figures 7 and 8 also present dah for the
6 shows the response comparison between the neuro- adaptive controller perfonnance for the varying inertia
controller and the c o d e d output for an initial condition characteristics. The adapive control problem is discussed
disturbance. next.
600.00 13.00
j dYVIWWAdoptrve neu-a-control v
--_
d -3
'="-1300 '
:
.
/' - Commended trajectory
Non-adoptive neuro-contro;
Adaptive neuro-control
E -30 03 '
U
0
'
. 0
No. of Oroits
0
i
0
_-- - i
2 -60.00 1
e
- Non odopt ive neuro-con! rol
Adoptive neuro-control
0"c -a0 oo
OOC
3
*
0.50 1.00 1.50 2.30 2.50 3.00 3.53 2.00
No. of Orbiis
IO
2ercent Voriciion in Pitch Inertia
1157
case. For the BP of error, a first order memory filter is [3] Byun, K. W.,Wie, B., Geller, D., and Sunkel, J. W., 'Robust
H-Infinity Control Design for the Space Station with Structured
introduced between the NNM and the error as shown in
Parameter Uncertainty,' presented at ALAA Guidance, Navigation and
Figure 3. For the first order filter, a time constant of lo00 Control Conference, Portland, OR, August 2@22, 1990.
seconds was used in this study. During the NNC weight 141 Balas, G., hckard, A., and Harduvel, J., 'Applications of
updates, NNM connection weights remain unchanged. Mu-Synthesis Techniques to Momentum Management and AtLitude
Figures 9a and 9b show controlled responses using adaptwe Cor~mlof the Space Station,' presented at AlAA Guidance, Navigation
and Control Conference, January 1991.
a d " - a d a p e -trol. Although the pitch response [SI parks, Akzander G, and Sunkel, J. W., "Adaptive Anitude Control
looks identical, the controller response (uJ for the non- aod Momentum MIM~CUIC~I for Large-Angle Spacecraft Maneuvers, "
adaptive case is unacceptable. For adaptive neuro-control, Jouranl of Guidance, Control, and Dynamics, Vol. ISr, no. 4, July -
the contml response initially starts out oscillatory and 8s August 1992.
[6] Vadali, S. R. and H.S. Oh, "Space Station Attitude Control and
adaptation progresses, the control response improves M o m e m " Management: A nonlincar Look,' Journal of Guidance,
dramatically. Control, and Dynamics, May-June 1992.
4. CONCLUSIONS [7] Bossart, Theodore C. and Singh, Sahjendra N., "Invertibility of
Map, Zero Dynamics and Nonlinear Control of Space Station,' AIAA-
This study demonstrated the applications of neuro- 91-2663-CP.
[8] Parlos, A. G., Atiya, .". F., and Sunkel, I. W., 'Parameter
wotrol to adapve non-Iinear control of Space Station. Two Estimation in Space Systems using Recumnt Neural Networks,' paper
capabilities of neuro-controllers were demonstrated using a no. 91-2716, AIAA Guidance, Navigation, and Control Conference,
non-hear model of the Phase I Space Station Freedom. August 199I .
These capabilities are: (a) synthesis of robust non-linear [9] Chipman, R., Lam, Q., and Sunkel, J. W., 'Mass Property
Identification: A Comparison Between Extended Kalman Filter and
control laws using neural networks; and (b) adaptively
Neurof3ter Approach,' paper no. 91-2664. AIAA Guidance,
"gneurocontrollercharacteristics for varying inertia Navigation, and Control Conference, August 1991.
characteristics using a memory filter. [lo] Kumar, R. R., Seywald, H,Dcshpande, S. M., and Rahman, Z.,
The "-controller techniques presented seem to 'Artificial Nwral Networks in Space Station Optimal Attitude Control, "
be robust based on the successful results obtained using prrsentcd at the World Space Congress, Washington, DC,Aug 28-Sep
5,1992.
straight forward applications of these techniques. To build [ I l l Narendra, K. S., "Adaptive Control Using Neural Networks".
upon the ideas preseded, "-controllers for the three-axis Neural Networks for ControUeditcd by W. Thomas Miller IJI, Richard
w e neuro-controller synthesis is currently in progress. S. Sunon, and Paul J. Werbos. MTT Press, p. 287, Cambridge, MA,
This study examined decentralized neuro-control to 1990.
(121 Werbos, Paul J. "Rack-propagationThrough Time:What It Does
demoostrate the leaming and adaptation capabilities of ANN and How To Do It,.' Proceedings of the EEE, August, 1990.
based on direct ouput fedback. For space applications, the [13] Werbos, P. J. 'A Menu of Designs for Reinfo"t Learning
fixed software/hardware environment provided by ANN Over Ti,' Neural Networks for Control, edited by W. Thomas Miller
synthesis makes it easy to mod@ the controller 111, Richard S. Sumn, and Paul J. Werbos. MIT Ress, p. 67,
characteristics from a remote site, if there is a need. Any Cambridge, MA, 1990.
[14] Barto, A. G., Sumn, R. S., and Anderson, C. W. 'Neuron-like
other controller design could easily be copied using ANN Adaptive elemelll~that can solve difficult control problems", IEEE
a d can take advantage of having a fixed softwarefbardware Trans, SMC., SMC-13, p. 834-846, 1983.
structure for the neuro-controller. [15] KrishnaKumar, K., 'Optimization of the neural net connectivity
Further research in this area needs to examine the pancm using a back-propagation algorithm," Journal of
Neumomputing, 1993.
stabllay and performance robustness of these techniques. [I61 Qin, Si-Zhao, Su, Hong-Te, McAvoy, Thomas J., "Comparison
Specifically, it will be interesting to examine the theoretical of Four Neural Net Learning Methods for Dynamic System
aspects of the optimally connected networks in providing Identification', IEEE Transactions on Neural Networks, Vol. 3, No.
robust control capabilities. Also, the feasibility of using the 1, pp. 122-130, January, 1992.
"oxy fiber concept for dynamic systems with high modal
[I71 V. V. Anshelevich et al., On the ability of neural networks to
perform generalimion by induction, Biological Cybernetics 61, 125-
densities need to be examined. 128, 1989.
ACKNOWLEDGEMENT
This material is based upon work partly supported
by the National Science Foundation under Grant No. ECS-
9113283.
REFERENCES
[ I ] Wie, B., Byun, K. W., Wamn, W., Geller, D., Long, D., and
Sunkel, J. W., 'New Approach to AttitudelMomentumControl of the
Space Station,' Journal of Guidance, Control, and Dynamics, vol 12,
no. 5 , pp. 714-722, 1989.
121 Yeichner, J., Lee, J., and Barmws, O., 'Overview of Space
Station Attitude Control Syacm with Active Momentum
Management,' AAS paper 88-044, February 1988.
1158