Professional Documents
Culture Documents
Editors
Alejandro Gómez, Sarah Henao, and O. Lucia Quintero
On behalf of the School of Sciences of Universidad EAFIT, we would like to thank all those who made the XVII Latin
American Conference on Automatic Control and this proceedings book possible.
Of course as a conference, the objectives to share technical discussions and experiences, allowed many students to
learn how to construct a real society of Control Engineers. Nothing is perfect but we are completely sure that this
conference will lead us to improve and enhance our skills for research. Academic friction is essential to grow but we
need to look ahead and work on the top research areas to build a new era of Automatic Control, without forget the
basics.
The successful completion of a project like this requires the help and cooperation of many people. First of all, we
would like express our sincere thanks the authors that contributed to the conference and submitted their papers. They
trusted us and received our comments and improved their work. The papers submitted from all around Latin America
and Europe were the core of the project and the level of their work has raised the level of the School of Sciences and
the university. We greatly appreciate the authors for giving us the opportunity in putting together this book and
joined the conference. The work of each one of the authors was crucial in creating this book and in the success of the
conference.
It was a great pleasure to work along side with our colleagues from different countries. We would also like to express
our appreciation to the reviewers that participated in this project. Their dedication to the execution of the conference
is truly noticed. Each paper had three reviewers in the submission/acceptation process and after the conference a
new review from the chair team. I feel to acknowledge a deep sense of gratitude to our colleagues because of their
support and hard work during the evaluations post conference and the conference courses (pre -during -post) in which
the students had the possibility to learn from real experts about several applied and theoretical topics.
The successful execution of the conference is also due to the hard work and availability of the chair committee who has
been irreplaceable. The chairmen who participated in this project had the responsibility of choosing the top papers
of this conference in which we sent to the RIELAC Journal.
Furthermore, we would like to share our gratitude for Juan Fernando Duque Cardona from the CEC (Centro de Ed-
ucación Continua) for his undeniable patience. This project would not have been possible without the cooperation of
Alejandro Gómez Montoya, Luz Elena Giraldo, Lina Marcela Duque and Sarah Henao Gallego on their commitment
on the project “XVII Latin American Conference on Automatic Control”.
As student use this book to learn about how to progressively contribute in several fields from the theoretical and
practical point of view, both simulation and experimental works will illustrate the capabilities of the people who work
in our countries improving the knowledge by teaching the basic steps to the new generation of Control Engineers.
Lucı́a
CONTENTS
2 ARTIFICIAL INTELLIGENCE 1
2.1 Automatic Face Recognition in Thermal Images Using Deep Convolutional Neural Networks . . . . . . 1
2.2 Classification of emotions by Artificial Neural Networks: a comparative study . . . . . . . . . . . . . . 7
2.3 Detection and Diagnosis of Breast Tumors using Deep Convolutional Neural Networks . . . . . . . . . 11
2.4 Dynamic clustering for process supervision . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
3 AUTOMATION 26
3.1 Coloured Petri Nets for Implementation of Safety Instrumented Systems in Critical Production Systems 26
3.2 Comparación del Lenguaje VHDL-AMS con los Métodos de Simulación OrcadPspice y MatlabSimulink
a través de un Sistema de Distribución de Energı́a Eléctrica con Carga no Lineal . . . . . . . . . . . . 34
3.3 Modeling of a Variable-BVR Rotary Valve Free Piston Expander/Compressor . . . . . . . . . . . . . . 42
3.4 Models for Planning and Supervisory Control for the feeding raw material in cement production . . . 49
4 BIOMEDICAL - BIOENGINEERING 57
4.1 Desarrollo de la etapa tobillo-pie de un sistema de rehabilitación de marcha para niños con PCI . . . . 57
4.2 Drive System Development for Gait Rehabilitation Exoskeleton . . . . . . . . . . . . . . . . . . . . . . 66
4.3 Output-Feedback Model Predictive Control for Tight Glycaemic Control in Patients at the Intensive
Care Unit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72
4.4 Sistema vestible para detección de estados fisiológicos y emocionales en entornos industriales . . . . . 78
5 BIOPROCESSES 82
5.1 Advanced Control of a fed-batch reaction system to increase the productivity in the polyhydroxyalka-
noates production process . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82
5.2 Dissolved Oxygen Dynamic Model for Endospore-Forming Bacteria batch bioprocess . . . . . . . . . . 89
5.3 Nonlinear State Estimation for Batch Process with Delayed Measurements . . . . . . . . . . . . . . . . 95
5.4 Output-Feedback Model Predictive Control for Dissolved Oxygen Control in a Biological Wastewater
Treatment Plant . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 101
8.10 PID Optimal Controller with Filtered Derivative Part for Unstable First Order Plus Time Delay Systems195
8.11 SISO Pole Placement Algorithm: A Linear Transformation Approach . . . . . . . . . . . . . . . . . . . 201
12 OBSERVERS 333
12.1 A Soft Sensor for Biomass in a Batch Process with Delayed Measurements . . . . . . . . . . . . . . . . 333
12.2 Control y estimación de par en un motor Diésel con turbocompresor y recirculación de gases de escape 340
12.3 Fixed-Time Convergent Unknown Input Observer for LTI Systems . . . . . . . . . . . . . . . . . . . . 348
12.4 Observer Designs for a Turbocharger System of a Diesel Engine . . . . . . . . . . . . . . . . . . . . . . 354
13 OPTIMIZATION 362
13.1 Evolutionary selection of optimal weighting matrices for LQR controllers and parameters of robust PID
on benchmark plants . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 362
13.2 Evolutive Extension: A biological approach to heuristic algorithms . . . . . . . . . . . . . . . . . . . . 373
13.3 Parameter Optimization of Sliding Mode Observer-based Controller for 2 DOF Stewart Platform . . . 379
13.4 Synthesis of Four-bar Mechanisms for Trajectory Control Using the Modified Brainstorm Optimization
Algorithm and Linkage Normalization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 385
16 ROBOTICS 534
16.1 Consenso en la dinámica de los estados de robots móviles tipo (3,0) . . . . . . . . . . . . . . . . . . . . 534
16.2 Control de Seguimiento de Trayectorias para un AR.Drone 2.0 Utilizando Observadores de Estados . . 542
16.3 Control Robusto QFT Sobre un Robot Móvil de Autobalance Basado en Péndulo Invertido Empleando
un Sistema Embebido . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 548
16.4 Control Servovisual No Calibrado De Robots Planares Sin Modelo Dinámico . . . . . . . . . . . . . . 557
16.5 Diseño de trayectorias caóticas en robots móviles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 563
16.6 Formacı́on en grupos de robots móviles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 570
16.7 Generalized Proportional Integral Control for Aperiodic Gait Stabilization of a Bipedal Robot with
Seven Degrees of Freedom . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 576
16.8 Método de optimización para la sintonización del control PD de un robot móvil . . . . . . . . . . . . . 582
16.9 Sistema para el control de trayectoria de un robot diferencial . . . . . . . . . . . . . . . . . . . . . . . 588
16.10Utilización de un sistema embebido para la teleoperación de un manipulador movil, utilizando un control
discreto . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 596
Julián Barreiro
Candidato a Doctor en Ingenierı́a
Facultad de Ingenierı́a
Universidad de los Andes
Colombia
ARTIFICIAL INTELLIGENCE
CHAPTER 1. ARTIFICIAL INTELLIGENCE
Abstract: Recently, the use of infrared images has shown be a feasible technique for addressing
problems as illumination dependency and facial expressions in face recognition applications.
Due the recent approaches that use deep learning methodologies for image analysis, which have
had remarkable performances, a deep learning strategy for facial recognition in thermographic
images is proposed. A convolutional neural network is designed and evaluated for recognizing
different people in an experimental home-made database. The use of convolutional networks
avoids to implement preprocessing and feature extraction algorithms, which is one of the
important parts in image classification. Results show significant improvements compared to
others works reported in the literature, which demonstrates robustness and effectiveness of the
proposed approach.
Keywords: Face recognition, Thermal images, Deep learning, Convolutional neural networks
2
CHAPTER 1. ARTIFICIAL INTELLIGENCE
Deep learning is the concept that involves a set of machine 2.3 Softmax
learning approaches that use complex architectures of non-
linear transformations to representing relevant information Softmax function is an activation function, which is a
in non-structured data with high levels of abstraction generalization of a logistic function that maps a K -
(Schmidhuber, 2015). These approaches appear a few years dimensional vector Z of real values to a K -dimensional
ago due to the development of new processors with the vector σ(Z) of real values in the range 0, 1. This func-
capacity of executing different processes using parallel tion allows to handle multiclass in the output layer; it is
programming strategies. From the proposed Deep learning computed one unique output when there are several units
approaches, convolutional neural networks have recently in the output layer. Due that Softmax function mapping
been widely adopted due that it is easier to train and is now considered as scores with unnormalized log proba-
generalized much better than other architectures. In this bilities for each class, the cost function correction named
paper, a convolutional neural network is proposed to Cross entropy loss can be defined by Equation 3.
solve the problem of identification of people in thermal
images (Hassairi et al., 2015). efyi
Li = −Log( Pj=1 ) (3)
J efj
2.1 Convolutional Neural Networks (CNN)
with fj being the j − th element of class vector scores f .
CNN is a feed-forward neural network with a particular Likewise, the Softmax activation function is defined by
architecture that allows to process an entire image without equation 4.
preprocessing requirements. It consists of a set of convolu-
tional and subsampling layers followed by fully connected ezj
sof tmax(z) = Pk=1 f orj = 1, ..., K. (4)
layers. Each convolutional layer accepts a feature map (in ezk
K
this case an image) and transforms it to another feature
map (another image) through a differentiable function with z being an arbitrary vector with real values to be
(i.e., kernels). Neurons in the same feature map share the scaled into a zero-to-one values.
same kernel, which allows to go deeper between every
convolution for the transformation of the images Gong 2.4 Dropout Regularization
et al. (2015). Because CNNs can learn from raw data
automating the process of feature extraction (Ji et al., Dropout regularization is used for preventing co-adaption
2013), these are known as end-to-end methods i.e., they among units. This novel technique detailed in (Srivastava
can compute feature maps using spatial information of et al., 2014) avoids overfitting by dropping out units
pictures. The CNN algorithms have demonstrated high randomly in hidden layers during training. for doing so,
performance in pattern recognition tasks over the past few outputs of units with a probability of 0.5 in hidden
years, especially in computer vision where CNNs outper- layers are set to zero during the forward pass, as it is
form conventional classifiers, for instance at the ImageNet explained in Krizhevsky et al. (2012). The ”Dropped out”
Classification challenge, CNNs have been used by the bet- neurons not participate in backpropagation. So, every
ter proposals in the last years (Krizhevsky et al., 2012), algorithm execution changes the net architecture, but
(Simonyan and Zisserman, 2014) (Szegedy et al., 2015). neurons also share its weights. This characteristic reduces
co-adaptations of the net since every neuron does not
2.2 Rectified linear Units (ReLU) depend on other neurons in the net. Therefore, the main
goal of dropout is to add stochastically noise in the
Rectified Linear Units (ReLU) is an activation function, activation states of certain hidden units (Srivastava et al.,
which is the most commonly deployed activation function 2014).
for the outputs of the CNN neurons. This function was
developed as an alternative to classical way of to determine 2.5 Adaptive Moment Estimation (Adam Optimizer)
the output of a neuron x in a net, it is given by Equation 1.
F (x) = tanh(x) = (1 − e−x )−1 (1) Adam is a stochastic optimization algorithm introduced
in Kingma and Ba (2014), which employs first order gra-
According to Glorot et al. (2011), the main advantage of dients as updating mechanism, it uses exponential moving
ReLU activation is that non-linear units are not saturated averages denoted by mt and squared gradients vt . Param-
compared to other non-linear activation functions (such as eters updating is performed defining an objective function
Logistic or Tanh units). This characteristic implies expen- f (θ), which is tuned by evaluating random subsets known
sive processing, doing that networks with saturated neu- as mini batches.
rons be slower than those using ReLU function. Although
ReLU is not symmetric, this property can be achieved by 3. MATERIALS AND METHODS
combining two units Glorot et al. (2011), and sometimes
this function is replaced with a smooth version named
3.1 The thermal Image Database
softplus.
In a general form, the output of a neuron with ReLU An image database was acquired using a FLIR A655SC
activation function can be computed by Equation 2. thermographic camera, configured to a 6.3 Hz sampling
frequency. A group of 21 people with ages between 22 and
3
CHAPTER 1. ARTIFICIAL INTELLIGENCE
45 years was involved. For each persona thermographic The number of input layers was varied according to the
video with approximately 80 images of 640×480 pixels was number of pixels in the input images. So, input layers with
recorded. During recording, each person was instructed to 76800, 19200 and 4800 neurons were evaluated (for 320 ×
rotate his/her head in four directions (up, down, right 240, 160×120 and 80×60 pixels, respectively). This means
and left). Resulting videos were converted to gray-scale, that each neuron in the first layer possesses information of
with pixel values distribution remaining those of infrared each pixel in the image. Size image variations also modify
radiation for each frame. From each video, a set of images the size of the image in the last convolutional layer but did
were manually selected, for doing so it was considered that not change operations in others layer sizes because these
heads were completely forward or slightly tilted. In this do not depend on the size of the input vector.
sense; the database was composed of 588 images, i.e., 21
people and 28 images per person. 4. EXPERIMENTAL RESULTS
Discarding the implementation of any preprocessing al-
gorithm (this means that every input for the network 4.1 People recognition using CNN
corresponds to every pixel of the input image to be clas-
sified for the network), the use of original sized images Implementation of the convolutional neural network was
represented a high computational cost to training stages. performed using TensorFlow framework Abadi et al.
So, images were conveniently scaled using a typical bicubic (2015); proposed approach was evaluated using a 10-fold
interpolation. Preserving spectral relationship, We evalu- cross validation strategy . For picking the training samples,
ate three different scales, i.e. 320 × 240, 160 × 120 and it was considered to keep a stratified probability function
80 × 60 pixels. Although this down-scaling could means by selecting the 10% of data per class for validation test.
losing information; the loss is not considerable enough to Along with the 10-fold cross validation, the data were
prevent the net to recognize all people. In Fig. 1 could be randomly permuted before of evaluating the model. This
observed some examples of the subsets of images involved also guaranteed that each experiment had different input
in this study. data for training and validation stages; and demonstrates
independence of them for learning task.
Figure 3 shows the average accuracy in the validation stage
for different image size evaluated (320×240, 160×120 and
80 × 60), when number of iterations was varied between 10
and 100. In the three cases, perfect recognition i.e., a 100%
of accuracy is obtained before of 30 iterations, although it
is faster for larger images (around of 10 iterations), which
was expected because this kind of image preserves more
information than the small one.
With the purpose of developing a biometric system capable (32 cores) and 32GB RAM inside. Tables 1 and 2 show
of identifying every person according to the experimen- the time consuming in the GPU and CPU configurations.
tal database described in subsection 3.1; a convolutional Interation time refers to processing time (in seconds) for
neural network was implemented. The main goal of using each execution cycle of the algorithm; training time is
CNN was to reach that without to apply any preprocessing the time (in seconds) taken for training the overall net,
technique. The Architecture of the proposed net is illus- when accuracy of 100% is accomplished; Validation time
trated in Figure 2. It consists of 6 convolutional layers is the cost of classifying a subject after training process
followed by 2 densely-connected layers that were randomly and memory consumption corresponds to RAM memory
weighted. Convolving layers implemented kernels using the required to execute the training of the network.
RelU function activation for image transformations. Ev- Table 1. Processing times for GPU Implemen-
ery convolutional layer was connected to the immediately tation
next convolutional layer so that there were no connections
between nonconsecutive layers. The last convolving layer Iteration Training Validation Memory
was connected to two densely-connected layers, which were Size Time Time Time consumption
composed of 200 and 100 output neurons, respectively. (s) (s) (s) (MB)
320x240 8.6211 283.927 0.210511 2440
Last of them uses Softmax and dropout functions for
160x120 2.29826 47.9025 0.0594253 2360
handling multiple classes in the output layer and avoiding 80x60 0.615124 24.1448 0.017618 2298
overfitting.
4
CHAPTER 1. ARTIFICIAL INTELLIGENCE
5
CHAPTER 1. ARTIFICIAL INTELLIGENCE
6
CHAPTER 1. ARTIFICIAL INTELLIGENCE
Fernando J. Muñoz*, Emanuel B. Tello*, Daniel Patiño**, Flavio Roberti**, Elisa Pérez*, Natalia M. López*
*Universidad Nacional de San Juan. Facultad de Ingeniería. Gabinete de Tecnología Médica - San Juan, Argentina.
**Universidad Nacional de San Juan. Facultad de Ingeniería. Instituto de Automática - San Juan, Argentina.
Abstract: Every emotion evidences a biological sign while predisposes the body to a different kind of
response. In Human-Computer interaction area, the voice recognition techniques are widely used in text
engines. In this context, the automatic emotion recognition of the speech aims at identifying the
emotional or physical condition of a human being from his voice. Both emotional and physical states of a
speaker are included in so-called paralinguistic aspects. In this work we used a speech database
containing 7 different emotions in which 4 emotions were selected and 12 features were extracted.
Emotions were classified by different types of neural networks in order to compare the efficiency of them
to discriminate different moods.
Keywords: Classifiers, Speech recognition, Neural-Network Models, Emotions, Audio Features.
1. INTRODUCTION
evidence the performance of classifiers and the capacity of
Emotions govern almost all modes of human communication: description of the features set used.
facial expressions, gesture, posture, voice tone, words,
breathing, body temperature, etc. Human perception of
2. MATERIALS AND METHODS
emotions is about 55 % from facial expressions, and 38 %
from 7% speech from text (Busso et al., 2004). Although the 2.1 Database
emotional state does not alter the linguistic content, this is an
important human communication factor as it provides The Berlin database was used, containing about 500 speech
feedback information for the development of applications in a audio files performed by 10 different actors, expressing
wide spectrum, such as assistive technologies, psychiatric different emotions: happiness, anger, sadness, fear, boredom
diagnosis and lie detection (Nasr and Ouf, 2012). and disgust, as well as a neutral expression (serene). There
Emotion recognition methods require the extraction of certain are a total of ten different phrases. Four emotions of the
characteristics of speech (Espinosa and Reyes García, 2010). database were selected: happiness, sadness, fear and neutral.
Some are based on acoustic features such as Mel Frequency The choice was based on the regionalization proposed in
Cepstrum Coefficients (MFCC) and the Fundamental Russell circumplex model (Figure 1) in which each selected
Frequency to detect emotional signs. Others use prosodic emotion characterizes one quadrant model (Russell, 1980).
features in speech to achieve higher classification accuracy.
Many researches focus on the classification using artificial
neural networks (ANN) (Pérez-Gaspar et al., 2015). In this
work different topologies of ANN based on naive Bayes
(NB), Support Vector Machines (SVM), Multi-layer
Perceptron (MLP) and Radial Basis Function (RBF)
classifiers were designed in order to compare and classify a
set of audio signals from Berlin database, which show
opposite emotional states. A total of 12 features were taken
from the audio signals. The performance of the classifiers
was evaluated through 2 stages. In the first stage, three
emotions (happiness, sadness, and neutral) were selected. In
the second stage, a fourth emotion (fear) was added. The
ANNs were adjusted to achieve optimal classification.
Finally the comparison between the different hit rates
obtained by the classifiers was established in order to
Fig 1. Russell Circumplex model (Russell, 1980)
7
CHAPTER 1. ARTIFICIAL INTELLIGENCE
8
CHAPTER 1. ARTIFICIAL INTELLIGENCE
examples, if they are separable or quasi-separable, or in a database, 15 for each emotion (happiness, sadness and
transformed space (feature space), if the examples are not neutral). The 12 features of the samples were extracted,
linearly separable in the original space. In these cases, the normalized and introduced in the different topologies. The
search for the hyperplane will be done implicitly using kernel estimated outputs were compared with the expected values.
functions (Chavan and Gohokar, 2012). The percentages of successes and overall performance for
emotions classified in different network models are shown in
In this work, a total of three SVM classifiers (Figure 4) were
Table 1.
designed based on combinations of the emotions selected:
Happiness/Sadness; Happiness/Neutral; Sadness/Neutral. In the second stage, 35 samples were added to the training
After training, different kernels of inner product were used set, corresponding with the “fear” emotion and 15 samples
(Linear, Quadratic, Polynomial, Radial Basis Function and were used for simulation. The estimated outputs were
Multilayer Perceptron) to analyse which of them provided a compared with the expected values. The percentage of hits
better performance in data classification. Then, three new and overall performance are shown in Table 2.
classifiers were designed to include the emotion “Fear”:
Happiness/Fear; Neutral/Fear; Sadness/Fear.
Table 1. Hit rate and overall performance, with three
emotions.
ANN Happiness Sadness Neutral Overall Performance
MLP 73.33% 66.67% 66.67% 68.89%
RBF 46.67% 80% 33.33% 53.33%
SVM-L 73.33% 86.66% 93.33% 84.44%
SVM-Q 46.67% 66.67% 53.33% 55.56%
SVM-P 73.33% 60% 40% 57.77%
SVM-RBF 80% 93.33% 46.67% 73.33%
SVM-MLP 73.33% 66.67% 60% 66.67%
NB 86.67% 93.33% 80% 86.67%
9
CHAPTER 1. ARTIFICIAL INTELLIGENCE
10
CHAPTER 1. ARTIFICIAL INTELLIGENCE
Abstract: We present an application of Deep Convolutional Neural Networks (CNN) for the
detection and diagnosis of breast tumors. The images used in this study have been extracted
from the mini-MIAS database of mammograms. The proposed system has been implemented in
three stages: (a) crop, rotation and resize of the original mammogram; (b) feature extraction
using a pretrained CNN model (AlexNet and VGG); (c) training of a Support Vector Machine
(SVM) at the classification task using the previously extracted features. In this research, the
goal of the system is to distinguish between three classes of patients: those with benign, malign
or without tumor. Experiments show that feature extraction using pretrained models provides
satisfactory results, achieving a 64.52% test accuracy. It is worth noting the impact of the data
augmentation process and the balance of the number of examples per class on the performance
of the system.
11
CHAPTER 1. ARTIFICIAL INTELLIGENCE
transformation of local modulus maxima in conjunction trained using both dropout and max-norm regularization
with the region growing algorithm were used as edge seg- techniques.
mentation method. Finally, five textural features and five
Jiao et al. (2016) obtained an accuracy of 96.7% classifying
morphological features were extracted from the resulting
the breast masses between benign and malign from the
image and these were used at the classification task. The
DDSM database using a CNN as feature extractor and a
ELM classifier exhibited better performance than the SVM
SVM as classifier. The images were previously normalized
classifier.
and whitened. On the other hand, the CNN was trained
Dheeba et al. (2014) obtained an accuracy of 93.67% with a subset of ImageNet, dataset produced by Rus-
classifying between normal and abnormal tissues with an sakovsky et al. (2015), and the features to perform the
optimized neural network using Particle Swarm Optimiza- classification were extracted from two different layers of
tion. The experiment was carried out with their private the CNN.
database of mammograms and the classification was done
Abdel-Zaher and Eldeib (2016) developed a classifier using
with the Laws Texture Energy Measures extracted from a
the weights of a previously trained deep belief network as
ROI of dimension 15 ˆ 15 pixels.
the initial parameters for a neural network with Lieben-
Peng et al. (2016) obtained an accuracy of 96% using an berg Marquardt learning function. This model was tested
artificial neural network to classify the mammograms from on the Wisconsin Breast Cancer Dataset, obtaining an
MIAS database. They defined three different categories accuracy of 99.68%.
to carry out the experiment: normal, with presence of a
benign tumor and with presence of a malign tumor. A
median filter and the seeded region growing algorithm were
used to remove the noise of the original images. Then, they 3. DEEP LEARNING FOR IMAGE CLASSIFICATION
extracted 16 features related to the texture properties of
the images and five of them were selected. The feature This section is based on the works from Guo et al. (2015)
selection algorithm, which is based on the rough-set theory, and LeCun et al. (2015).
was developed by the authors.
Around 2006, the results obtained by a group of re-
Mahersia et al. (2016) achieved recognition rates of 97.08% searchers working together in parallel projects in the
and 95.42% on the MIAS database using a neural net- Canadian Institute for Advanced Research renovated the
work with a Bayesian back-propagation algorithm and interest of the community for the deep neural networks.
an ANFIS system as classifiers, respectively. The breasts The main four works Bengio et al. (2006); Hinton (2005);
were classified into two categories: normal and cancerous. Hinton et al. (2006); Marc’Aurelio Ranzato et al. (2006),
The mammograms from this database were first enhanced, introduced unsupervised learning procedures to pure su-
removing the noise and details that may interfere with the pervised learning procedures. The objective of each layer in
recognition of the tumors. Then a generalized Gaussian the neural network was to learn the inputs of the previous
density model for wavelet coefficients was used as feature layer, as stated by LeCun et al. (2015). This approach
extractor. performed well in comparison with the existent artificial
intelligence techniques in tasks such as recognizing hand-
2.2 Analysis of Breast Cancer using Deep Learning written digits, specially when the amount of labeled data
was limited, as mentioned by Sermanet et al. (2013).
Ertosun and Rubin (2015) used three different architec- Since the rise of Deep Learning, the CNN model outper-
tures of CNNs to locate masses in mammography images. formed the fully connected neural networks in tasks related
They selected 2420 images from the DDSM dataset and to natural image classification. However, this approach was
divided these images into training, validation and test sets, not seriously used at classification problems until 2012.
containing 80%, 10% and 10% of the images, respectively. During six years in which CNNs were laid aside, the meth-
They also used cropping, translation, rotation, flipping and ods based on the Bag of Visual Words model, that were
scaling techniques to get an augmented training set, in or- the state of the art techniques for image classification, were
der to improve the generalization ability of the system. The improved by Lazebnik et al. (2006) with the incorporation
experiment was divided into two stages: the first consisted of spatial geometry, through the use of spatial pyramids.
in the classification of a mammography as containing or
not masses and the second in the localization of masses in The turning point for image classification was 2012. In this
the images. year, AlexNet, a CNN with five convolutional layers and
three fully connected layers developed by Krizhevsky et al.
Arevalo et al. (2015) obtained 86% of area under the (2012), outperformed the existing methodologies and won
Receiver Operating Characteristic (ROC) curve by classi- the ImageNet Large Scale Visual Recognition Challenge
fying mammography mass lesions using a CNN as feature (ILSVRC) in 2012, almost halving the error rate of the
extractor and a SVM as classifier. The data to carry out model in second place from Russakovsky et al. (2015).
the experiment was the BCDR-F03 dataset, which is part According to LeCun et al. (2015), this success reflected
of the BCDR database. This data was composed by 736 the new developments in graphic hardware and algorithms:
images, 426 containing benign mass lesions and the rest the increased chip processing abilities (GPU units), the
containing malignant lesions. The data augmentation was use of Rectified Linear Unit (ReLU) as neural activation
achieved by flipping and rotating the images. In addi- functions, a novel regularization technique developed by
tion, the mammography images were normalized by the Srivastava et al. (2014) and the advances in algorithms for
use of global and contrast normalization. The CNN was data augmentation.
12
CHAPTER 1. ARTIFICIAL INTELLIGENCE
Since the success of AlexNet in 2012, several improvements is generally strictly smaller than the dimensions of the
of this model have been performed. Zeiler and Fergus image, this leads to a local connectivity structure among
(2014) established a technique to analyze the responses the neurons. Each of this convolutional volumes has an
of intermediate layers, what enabled them to implement additional hyper-parameter, S, which corresponds with
Clarifai, winning the ILSVRC. the stride that the filter is going to slide spatially in the
image.
In 2014, deeper architectures were finally used. VGG from
Simonyan and Zisserman (2015) and GoogLeNet from Let’s denote a particular training example as XHˆW ˆD
Szegedy et al. (2015) networks obtained the second and and a convolution filter WF ˆF ˆD . As it is familiar from
first place in ILSVRC, respectively. The VGG network the usual Multi-Layer Perceptron, it is customary to add
from had 13-16 convolutional layers, while GoogLeNet had a bias term b to each of the linear combinations formed.
21 convolutional layers. Finally, a (commonly non-linear) activation function, for
example ReLU, is applied to the convolution between the
He et al. (2015b) proposed a model that surpassed for the input image and the kernels, which yields an activation
first time human-level performance on the ImageNet 2012 map A of the dimensions 1 ` N S´F ˆ 1 ` N S´F ˆ 1:
test dataset, with a network with the same architecture
of VGG. In addition, He et al. (2015a) also established a A “ f pX ˇ W ` bq
new framework to train deeper networks called the residual where ˇ represents the valid convolution between the
learning. They developed ResNet, a 152-layers network operands and f is the activation function.
and won ILSVRC.
Appending the activation maps found by applying K
For Guo et al. (2015), researchers are focusing in three diferent kernels to the input example, an activation volume
main aspects to further improve the performance of Deep of dimensions 1` N S´F ˆ1` N S´F ˆK is obtained. Note that
Learning models: (a) the implementation of larger net- depending on the dimensions of the image, the filter and
works: ResNet, GoogLeNet and VGG models have shown the size of the stride, the resulting activation volume may
that the networks with a larger number of layers out- reduce its spatial dimensions very quickly. An alternative
perform the simpler ones; (b) the use of multiple net- to control this situation in advance is the use of padding
works, where every network can execute all the process techniques to the original image.
independently, so the responses of all the networks are
combined in order to obtain the final result; and (c) the Finally, in order to perform dimensionality reduction di-
introduction of external information from other resources rectly on the data, pooling layers are applied to an acti-
and the use of shallow structures. In this aspect, one of vation volume or even the input image itself. These layers
the most important developments is Regions with CNN subsample its inputs, typically with mean or max pooling,
Features method by Girshick et al. (2014), in which the over contiguous regions of size P ˆ P .
features extracted from a CNN feed a SVM. Figure 1 shows an example of a typical architecture for a
Other research projects have focused their efforts on get- CNN in which two convolutional and two pooling layers are
ting a further understanding of what deep neural networks applied to the original image. In this case, the extracted
learn, addressing the problem from both a theoretical and features obtained as are fed into a fully connected layer to
a empirical perspective. For instance, Li et al. (2016) have perform the classification task. Note that it is possible to
recently studied convergent learning, aiming to analyze change the classifier set up at the end of the network with,
cases in which different neural networks learn similar rep- for example, a SVM or a softmax classifier.
resentations. In this work, they propose a method for
quantifying the similarity between deep neural networks 4.2 Back-propagation Algorithm
and showed that there exist basic features which are
learned by multiple networks with the same architectures The summary presented in this section is heavily based
but different random initialization. on the Unsupervised Feature Learning and Deep Learning
Tutorial from Ng et al. (2010). For simplicity, we will
4. THEORETICAL BACKGROUND illustrate the algorithm assuming that we have a CNN
with the input layer followed by a convolutional volume, a
4.1 Convolutional Neural Networks pooling layer and finally a fully connected layer.
CNNs are a type of biologically-inspired feed-forward Let’s denote by δ pl`1q the error term in the pl ` 1q-th
networks characterized by a sparse local connectivity and layer in the network with labeled training data px, yq,
weight sharing among its neurons. A CNN can also be seen parameters pW, bq and cost function JpW, b; x, yq. If the
as a sequence of convolutional and subsampling layers in l-th layer is densely connected to the former, the error for
which the input is a set of H ˆ W ˆ D images, where this layer can be computed by:
¨
´ ¯
H is the height, W is the width and D is the number of δ plq “ pW plq qt δ pl`1q f 1 pz plq q
channels which, in the case of RGB images corresponds to
D “ 3. where ¨ represents element-wise multiplication and f is
the activation function.
Following Ng et al. (2010), a typical convolutional layer
(volume) is formed by K filters (kernels) of size F ˆ The gradients are:
F ˆ D, where F ď H and F ď W . These filters are
∇W plq JpW, b; x, yq “ δ pl`1q paplq qt
usually randomly initialized and are the parameters to be
tuned in the training process. Since the size of the filter ∇bplq JpW, b; x, yq “ δ pl`1q
13
CHAPTER 1. ARTIFICIAL INTELLIGENCE
If the l-th layer is a convolutional and subsampling layer, Table 1. Confusion Matrix
then the error is propagated through as:
¨
´ ¯ Predicted Class
plq plq pl`1q plq
δk “ upsample pWk qt δk f 1 pzk q P N
Actual Class
True False
where k indexes the filter number and the upsample P
Positives Negatives
function propagates the error through the pooling layer
by calculating the error related to each input unit. False True
N
Positives Negatives
Finally, the gradient for each filter map can be found by:
ÿ
m
plq pl`1q
∇W plq JpW, b; x, yq “ pai q ˇ rot90pδk , 2q Fawcett (2006) defines a confusion matrix as a tool that
k
i“1 allows to visualize the performance of a classifier in a
ÿ ´ pl`1q ¯ supervised learning problem. By means of this matrix it
∇bplq JpW, b; x, yq “ δk is possible to asses whether the system is commonly con-
k a,b
a,b fusing pairs of classes. In the aforementioned problem, the
where aplq is the input to the l-th layer and rot90pA, kq confusion matrix summarizes the four possible outcomes
rotates the input array A counterclockwise by k ˚ 90 from the classifier.
degrees.
5. MAMMOGRAMS CLASSIFICATION
4.3 Linear Support Vector Machines
5.1 Data
Suppose we are given a training data set of size n examples
of the form: The mammograms used for the commitment of this work
were retrieved from the database of the MIAS, collected
tpX1 , y1 q, pX1 , y1 q, ..., pXn , yn qu
by Suckling et al. (1994), which is known as mini-MIAS
where each yi is either 1 or ´1 and each Xi is a p- since the images of the original MIAS database has been
dimensional vector. Thus, assuming that the data is lin- reduced to 200 micron pixel edge and the dimension of the
early separable, we want to find the hyperplane that sep- mammograms has been fixed to 1024 ˆ 1024 pixels. This
arates the group of tXi u for which yi “ 1 from those for database contains 322 mammograms and the intensity
which yi “ ´1 so that the distance between the hyperplane of every pixel is between 0 and 255. This database also
and the nearest point from either group is maximized. For includes information about the class and the severity of
that reason, it is also called a maximum-margin classifier. abnormalities that may be present in the mammograms, as
This can be formally expressed as: well as the coordinates of the center of these abnormalities.
1 It must be mentioned that we only used the mammogram
minw‰0,b ||w||2 images and the required information to divide the mammo-
2
t
s.t. yi pw Xi ` bq ě 1 pi “ 1, 2, ..., nq grams into three categories: patients with benign, malign
b or without tumor.
. Recall that ||w|| represents the separation of the hyper-
plane from the origin along the normal vector w when the 5.2 Data Preprocessing
hyperplane is expressed as wX ´ b “ 0.
Mammogram Cropping Mammograms contain black
4.4 Confusion Matrix zones in the borders which may difficult the classification
task. For this reason, we designed an algorithm to elimi-
Consider a classification problem with only two classes: nate these black zones based on the sum of the pixels over
positive (P) and negative (N). For every training example, the column. The algorithm finds the first column, say Cl ,
there are only four possible outcomes. If the training on the left of the mammogram in which the sum of the
example is positive and the prediction is positive, we call it pixels exceeds a given threshold P . Now, from this point,
a true positive; and if the prediction is negative, it is called the algorithm finds the first column, Cr , in which the sum
a false negative. On the other hand, if the training example of the pixels is not greater than P . Then, the new image is
is negative and it is classified as negative, it is called a true the one enclosed between Cl and Cr . This algorithm was
negative; otherwise, it is a false positive. Table 1 displays applied to every mammogram of the 322 retrieved from the
an example of a confusion matrix for a two-class problem. aforementioned database, taking P “ 500. An example of
14
CHAPTER 1. ARTIFICIAL INTELLIGENCE
5.5 Results
Chatfield et al. (2014). The features selected to perform Malign 27.39 56.12 16.49 56.12
the classification of mammograms were the activations of Normal 31.34 56.29 12.36 12.36
the last convolutional layer of the CNN. Then, in both
Total 38.35 34.96 27.97 35.01
cases, 4096 features have been extracted for each image.
In order to feed both pretrained CNNs with the cropped
images, it was necessary to convert every mammogram into
a three channel image by repeating the single channel three
times. Then, the resulting image was resized depending In Tables 3 and 4 the confusion matrices corresponding
on the input dimension of the CNN (227 ˆ 227 pixels to the response of the system when AlexNet and VGG-F
for AlexNet and 224 ˆ 224 pixels for VGG-F). Finally, CNNs are used as feature extractors in conjunction with a
the average image (which is included with the tuned SVM as classifier are exhibited. Table 3 shows the accuracy
parameters of the pretrained models used) was subtracted of the system on the test set after performing the data
from the resized image. augmentation when the CNN used is AlexNet.
15
CHAPTER 1. ARTIFICIAL INTELLIGENCE
Table 3. Confusion matrix for Aug- ‚ To test other classifier structures: neural networks,
mented MIAS test set predictions and fuzzy inference systems or clustering techniques.
feature extraction using AlexNet.
Target
REFERENCES
Benign Malign Normal Total
Benign 61.79 20.33 17.87 61.79 Abdel-Zaher, A.M. and Eldeib, A.M. (2016). Breast
Output
Malign 18.79 61.75 19.46 61.75 cancer classification using deep belief networks. Ex-
Normal 22.88 20.67 56.46 56.46 pert Systems with Applications, 46, 139 – 144. doi:
Total 59.73 60.10 60.20 60.01
http://dx.doi.org/10.1016/j.eswa.2015.10.015.
Alolfe, M.A., Mohamed, W.A., Youssef, A.B.M., Mo-
hamed, A.S., and Kadah, Y.M. (2009). Computer aided
On the other side, Table 4 shows the response of the diagnosis in digital mammography using combined sup-
system when VGG-F is the feature extractor. It can be port vector machine and linear discriminant analyasis
noted that the performance of the system has dramatically classification. In 2009 16th IEEE International Con-
increased after artificially augmenting the dataset: from ference on Image Processing (ICIP), 2609–2612. doi:
35% to 60.01% and 64.52% using AlexNet and VGG-F, 10.1109/ICIP.2009.5413992.
respectively. Arevalo, J., González, F.A., Ramos-Pollán, R., Oliveira,
J.L., and Lopez, M.A.G. (2015). Convolutional neu-
Table 4. Confusion matrix for Aug- ral networks for mammography mass lesion classifica-
mented MIAS test set predictions and tion. In 2015 37th Annual International Conference of
feature extraction using VGG. the IEEE Engineering in Medicine and Biology Society
(EMBC), 797–800. doi:10.1109/EMBC.2015.7318482.
Target Bengio, Y., Lamblin, P., Popovici, D., and Larochelle,
Benign Malign Normal Total H. (2006). Greedy Layer-Wise Training of Deep Net-
Benign 63.63 18.45 17.92 63.63 works. Advances in neural information processing sys-
Output
16
CHAPTER 1. ARTIFICIAL INTELLIGENCE
Hinton, G.E., Osindero, S., and Teh, Y.W. (2006). A fast Suckling, J., Parker, J., Dance, D., Astley, S., Hutt, I.,
learning algorithm for deep belief nets. Neural computa- Boggis, C., Ricketts, I., Stamatakis, E., Cerneaz, N.,
tion, 18(7), 1527–54. doi:10.1162/neco.2006.18.7.1527. Kok, S., et al. (1994). The mammographic image analy-
Jiao, Z., Gao, X., Wang, Y., and Li, J. (2016). A sis society digital mammogram database. In Exerpta
deep feature based framework for breast masses clas- Medica. International Congress Series, volume 1069,
sification. Neurocomputing, 197, 221 – 231. doi: 375–378.
http://dx.doi.org/10.1016/j.neucom.2016.02.060. Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S.,
Krizhevsky, A., Sutskever, I., and Hinton, G.E. Anguelov, D., Erhan, D., Vanhoucke, V., and Rabi-
(2012). ImageNet Classification with Deep novich, A. (2015). Going deeper with convolutions.
Convolutional Neural Networks. Advances In In Proceedings of the IEEE Conference on Computer
Neural Information Processing Systems, 1–9. doi: Vision and Pattern Recognition, 1–9.
http://dx.doi.org/10.1016/j.protcy.2014.09.007. Wang, Z., Yu, G., Kang, Y., Zhao, Y., and Qu,
Lazebnik, S., Schmid, C., and Ponce, J. (2006). Be- Q. (2014). Breast tumor detection in digi-
yond bags of features: Spatial pyramid matching for tal mammography based on extreme learning ma-
recognizing natural scene categories. In Proceedings of chine. Neurocomputing, 128, 175 – 184. doi:
the IEEE Computer Society Conference on Computer http://dx.doi.org/10.1016/j.neucom.2013.05.053.
Vision and Pattern Recognition, volume 2, 2169–2178. World Health Organization (2016). Breast
doi:10.1109/CVPR.2006.68. cancer: prevention and control. URL
LeCun, Y., Bengio, Y., and Hinton, G. (2015). http://www.who.int/cancer/detection/breastcancer/en/.
Deep learning. Nature, 521(7553), 436–444. doi: [Accessed: 19- May- 2016].
10.1038/nature14539. Zeiler, M.D. and Fergus, R. (2014). Visualizing and
Li, Y., Yosinski, J., Clune, J., Lipson, H., and Hopcroft, understanding convolutional networks. In Proceedings
J. (2016). Convergent Learning: Do different neural of the ECCV International Workshop on Statistical
networks learn the same representations? In ICLR, 1– Learning in Computer Vision, 818–833. Springer.
21.
LISA Lab (2016). My LeNet. Retrieved 2016-5-26.
Mahersia, H., Boulehmi, H., and Hamrouni, K. (2016). De-
velopment of intelligent systems based on Bayesian reg-
ularization network and neuro-fuzzy models for mass de-
tection in mammograms: A comparative analysis . Com-
puter Methods and Programs in Biomedicine, 126, 46 –
62. doi:http://dx.doi.org/10.1016/j.cmpb.2015.10.017.
Marc’Aurelio Ranzato, C.P., Chopra, S., and LeCun, Y.
(2006). Efficient learning of sparse representations
with an energy-based model. In Advances in Neural
Information Processing Systems, volume 19, 1137–1144.
Ng, A., Ngiam, J., Foo, C.Y., Mai, Y.,
and Suen, C. (2010). UFLDL tutorial.
http://ufldl.stanford.edu/wiki/index.php/UFLDL Tutorial.
Peng, W., Mayorga, R., and Hussein, E. (2016).
An automated confirmatory system for analysis
of mammograms. Computer Methods and Pro-
grams in Biomedicine, 125, 134 – 144. doi:
http://dx.doi.org/10.1016/j.cmpb.2015.09.019.
Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S.,
Ma, S., Huang, Z., Karpathy, A., Khosla, A., Bernstein,
M., Berg, A.C., and Fei-Fei, L. (2015). ImageNet
Large Scale Visual Recognition Challenge. International
Journal of Computer Vision (IJCV), 115(3), 211–252.
doi:10.1007/s11263-015-0816-y.
Sermanet, P., Kavukcuoglu, K., Chintala, S., and LeCun,
Y. (2013). Pedestrian detection with unsupervised
multi-stage feature learning. In Proceedings of the
IEEE Conference on Computer Vision and Pattern
Recognition, 3626–3633.
Simonyan, K. and Zisserman, A. (2015). Very Deep
Convolutional Networks for Large-Scale Image Recog-
nition. Proceedings of the ICLR, 1–14. doi:
10.1016/j.infsof.2008.09.005.
Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I.,
and Salakhutdinov, R. (2014). Dropout: A simple way to
prevent neural networks from overfitting. The Journal
of Machine Learning Research, 15(1), 1929–1958.
17
CHAPTER 1. ARTIFICIAL INTELLIGENCE
Abstract: In the context of data-driven monitoring, evolving environments challenge researchers with
non-stationary data flows where the concepts (or states) being tracked can change over time. This issue
is common in real industrial environments that suffer wear over time, implying for instance that the
”normal” state undergoes drift. This requires monitoring algorithms suited to represent the evolution of
the system behavior and in real industrial environments, also suited to represent time dependent features.
This paper proposes a unified clustering approach to monitoring evolving environments using a two-
stages distance-based and density-based algorithm. In this approach the system measurements trends,
found on the fly, are characterized and then used as input to the clustering algorithm that provides as
output clusters representing the system current state. Due to a forgetting process, clusters may emerge,
drift, merge, split or disappear, hence following the evolution of the system. The dynamic clustering
algorithm shows good outlier rejection capability when tested on an industrial benchmark suffering
faults with varying magnitude.
1. INTRODUCTION does not adapt dynamically (its structure and sometimes even
its parameters).
Technological advances of past decades have resulted in sys- The classification techniques can establish a model of the
tems highly adaptive to a constantly changing environment. In system functional states by extracting knowledge from various
addition, they have changed the way enterprises get information attributes. This knowledge is related to a particular behavior,
about the state of their systems. Huge amounts of data, arising without being represented by a set of analytic relations. The
from various sources, are generally collected and they are avail- modifications of these characteristics enable the detection of
able for further analysis. These two facts have promoted the abnormal operations (Isaza et al., 2009).
success of machine learning approaches for diagnostics tasks,
although they must face new challenges, in particular building In the context when new objects submitted to the classifier
efficient classification algorithms that adapt the targeted model during the recognition stage do not imply novelty detection
– or classifier – to the evolution of the system and that scale to nor a change of the classifier the classification is called static
big data. classification ((Joentgen et al., 1999)). If the classifier changes
in time (dynamically), following the system, the classification
Many researchers have used pattern recognition, neural net- task becomes a dynamic classification problem.
works and clustering techniques for fault diagnosis (Maurya
et al., 2010; Hedjazi et al., 2010; Eduardo Mendel et al., 2011). Dynamism in the classifier is achieved when not only the
To carry out diagnosis using a pattern recognition method, there parameters but the classifier structure changes according to
are two principal stages: training (or learning) and recognition. input data in an automatic way. Abrupt changes in the data can
One known disadvantage of classic data-driven techniques is be captured by cluster creation or elimination. Smooth changes
that they often address only anticipated fault conditions (Vacht- are usually reflected as cluster drifts and less frequently as
sevanos et al., 2007), missing new or unknown data of which cluster merging and splitting.
it was not aware of during training. Other disadvantage is that Among the techniques that have been used for dynamic classifi-
training examples usually follow static distributions that remain cation, we can mention: evolving clustering ((Angelov, 2011)),
unchanged over time. If the performance of the algorithm de- Self-Adaptive feed-forward neural network (SAFN) ((Li et al.,
creases below a given threshold it should be re-learned, but it 2011)), LAMDA (Learning Algorithm for Multivariate Data
Analysis) ((Kempowsky et al., 2006)), Growing Gaussian Mix-
? This work was supported in part by the Colombian administrative depart- ture Models (2G2M) ((Bouchachia and Vanaret, 2011)), CluS-
ment of science, technology and innovation COLCIENCIAS, the Universidad tream (Aggarwal et al., 2003), ClusTree (Kranen et al., 2011)
Nacional de Colombia and the Laboratory for Analysis and Architecture of and DenStream (Cao et al., 2006). Some of these alternatives
Systems LAAS-CNRS
18
CHAPTER 1. ARTIFICIAL INTELLIGENCE
are really complex and hence not suited to handle online large distance density final
amounts of process data, such as data arriving in a stream. Their based
clustering
based
requirements in terms of memory and processor power are too Clustering Clustering
high. Two-stages clustering (online/offline) has emerged as an
alternative to deal with large amounts of data arriving at fast
rates. Examples of this approach can be found in (Aggarwal
et al., 2003; Kranen et al., 2011; Cao et al., 2006).
Qualitative trends (QT) is a user friendly representation of fea-
tures that have been successfully applied in the fields of pro- Fig. 1. Principle of DyClee
cess monitoring and fault diagnosis (Dash et al., 2004; Maurya Episodes are defined by three elements: a trend context T C, a
et al., 2007, 2010; Gamero et al., 2014). This non model-based set of auxiliary variables AV and a time interval Ti leading to
technique exploits historical data of a process to characterize (1):
its behavior using a qualitative language. This representation e(xi ) = {T C, AV, Ti } (1)
reduces the complexity of system states by allowing only a
finite set of descriptors (Gamero et al., 2014). Polynomial fit- As introduced before, to find the trend context polynomial fit
based methods have been used to extract time-series trends can be used. Instead of using an entirely qualitative represen-
owing to its shorter computational time and higher robustness tation using an alphabet of primitives like Maurya et al. (2007)
to noise. The main advantages of the QT representation are its or Gamero et al. (2014), we use the polynomial coefficients as
interpretability, the complexity reduction and the robustness in trend context.
presence of noise. On the contrary, one main drawback is the
lack of differentiation of episodes following the same qualita- When data arrives, the preprocessing stage performs a polyno-
tive trend, i.e. the concept of magnitude is completely lost. mial fit in order to find the underlying trend. If the polynomial
representation is considered as good, i.e. the polynomial fitting
In (Barbosa et al., 2015) we proposed a monitoring algorithm error is lesser than the signal noise variance, the polynomial
which couples a dynamic clustering method with an on-line coefficients are used as T C and the polynomial start and end
trend extraction algorithm that works incrementally on the in- time-stamps define Ti .
coming data. The algorithm was used to achieve on-line di-
agnosis on the continuous stirred tank heater (CSTH) model The distance-based clustering stage creates µ-clusters that are
developed by (Thornhill et al., 2008). In this paper we improve summarized representations of the data set made by using some
our previous work by: (1) introducing a global and local density statistical and temporal information. Formally, a µ-cluster is
analyses that improves the system tracking specially in pres- a hyper box representing a group of data points close in all
ence of novel unknown behavior. (2) Introducing a variable dimensions and whose information is summarized in the tuple:
representation that allows quick detection by an operator. (3) µCk = (nk , LSk , SSk , tlk , tsk , Dk , Classk ) (2)
Proposing several functions to represent the forgetting pro-
cess. This Dynamic Clustering algorithm for tracking Evolving where nk is the number of objects in the µ-cluster k, LSk ∈ <d
Environments is called DyClee. We especially want to address is the vector containing the linear sum of each feature over the
the outlier rejection and the evolution characterization capabil- nk objects, SSk ∈ <d is the square sum of feature over the nk
ities of our algorithm both in a local and global sense. objects, tlk is the time when the last object was assigned to that
µ-cluster, tsk is the time when the µ-cluster was created, Dk is
This paper is organized as follows: Section 2 introduces our
the µ-cluster density and Classk is the µ-cluster label if known.
algorithm including the dual global and local density analysis.
In order to maintain an up-to-date structure allowing to track
Section 3 presents DyClee forgetting process including the new
system evolution, µ-clusters are weighted with a forgetting
proposed functions. Section 4 presents outlier detection and
function.
its application to detect unknown system behaviors. Section 5
shows some capabilities of our algorithm in toy examples and The density-based stage analyses the distribution of those µ-
Section 6 shows the same capabilities in the CSTH benchmark. clusters whose density is considered as medium or high and
Finally the conclusions are presented in section 7 creates the final clusters by a density based approach, that is,
dense µ-clusters that are close enough (connected) are said
to belong to the same cluster. A µ-cluster is qualified as one
2. DYNAMIC CLUSTERING ALGORITHM of three options: dense µ-cluster (DµC), semi-dense µ-cluster
(SµC) or low density (outlier) Oµ-cluster (OµC).
This paper uses the distance- and density-based clustering In our previous work the dense character of a µ-cluster was
approach introduced in (Barbosa et al., 2015) improved to be found using a user specified parameter named α. Specifically,
able to detect local and global outliers and to follow different being K the total number of µ-clusters, DµCs are the µ-clusters
evolution dynamics. The algorithm description is shown in with Dj ≥ α avg (D1 · · · DK ), SµC are the µ-clusters with
figure 1. Dj ≥ α2 avg (D1 · · · DK ), and OµC are the µ-clusters with
densities lower than that. The problem with this approach is
System data is considered to arrive in stream. Data streams that it demands user knowledge about the density distribution
take the form of time series providing the values of the signals of the clusters and it does not guarantee cluster homogeneity.
measured on a given process at each sampled time. In order to
extract trend information, this work proposes to process each In this paper we implements two different automatic ap-
time series xi into episodes, to generate an abstraction of the proaches to establish the dense character of the µ-clusters,
original signal into a simpler qualitative-like, yet quantitative, named global-density approach and local-density approach.
representation. The former approach allows to detect clusters with similar
19
CHAPTER 1. ARTIFICIAL INTELLIGENCE
densities while the later allows the detection of clusters with Unlike the previously named approaches, DyClee analyses the
varied densities. DyClee global- and local-density approaches dense character of each µ-cluster regarding the density of the
are further explained in the following subsections. These ap- other µ-clusters in the same group. This approach allows what
proaches are the first contribution of this paper. is called multi-density clustering (Mitra et al., 2003).
As in the global approach the average and the median are
2.1 Global-density analysis chosen as thresholds, but they are applied recursively in the
µ-clusters groups. In other words, for each group Gk , the µ-
In the global-density approach density is considered as a µ- clusters having their density higher than or equal to the average
cluster characteristic regarding all the µ-clusters. In this sense, density of the group (avg(DGk )) and higher than or equal to the
two measures are considered as representative of the µ-clusters median density of the group (median(DGk )) are considered as
in a global sense, the average of µ-cluster’s density and the dense. µ-clusters having a density higher than or equal to only
median. These measures will work as thresholds for establish- one of those measures (either average or median) are considered
ing the dense character of a µ-cluster. The intuition behind as SµCs and those with density below both measures are
the selection of these measures is that the median and average considered as OµCs. Summarizing:
densities of an heterogeneous group are significantly different,
DµC ⇔ Dz ≥ median DGk ∧ Dz ≥ avg DGk , (6)
although, if the group is uniformly dense, these two quantities
are equal. SµC ⇔ Dz ≥ median DGk ∨ Dz ≥ avg DGk , (7)
Formally, a µ-cluster µCz is said to be dense at time t if its OµC ⇔ Dz < median DGk ∧ Dz < avg DGk . (8)
density is greater than or equal to both global measures, i.e. the
median and the average. On the contrary if its density is bigger The µ-clusters group shown in Figure 2 is analyzed in this
or equal to one of the two measures and lower than the other the manner: First, the groups of µ-clusters are found and then, for
µ-cluster is said to be semi-dense. Finally if the µ-cluster µCk each group, a cluster is formed with the denser connected µ-
have a density bellow both thresholds it is said to be an OµC. clusters as shown at the left of Figure 3. Once formed, the rest
These conditions are represented in inequalities 3 to 5, where of the group is analyzed looking for the denser µ-clusters (with
Di is the density of the µ-cluster i and K is the total number of respect to the remaining elements of the group). If no dense
µ-clusters. region is found the next group in analyzed following the same
DµC ⇔ Dz ≥ median (D1 · · · DK ) ∧ Dz ≥ avg (D1 · · · DK ) (3) method until all groups are analyzed. The final classification
is shown at the right of Figure 3. The group or groups with
SµC ⇔ Dz ≥ median (D1 · · · DK ) ∨ Dz ≥ avg (D1 · · · DK ) (4) the lower density are taken as outliers. It is worth noting that
OµC ⇔ Dz < median (D1 · · · DK ) ∧ Dz < avg (D1 · · · DK ) (5) global-density approaches are unable to detect the green cluster
since the densities of the µ-clusters in that group are low, with
As stated before, in order to find the final clusters, the dense respect to the others clusters.
character of the µ-clusters and its connections are analyzed. A
set of connected µ-clusters is said to be a group. Groups of
µ-clusters are analyzed recursively in order to find the clusters
within. A cluster is created if every inside µ-cluster of the group
is a DµC and every border µ-cluster is either a DµC or an SµC.
20
CHAPTER 1. ARTIFICIAL INTELLIGENCE
Forgetting factor
The simplest forgetting function corresponds to a linear decay exponential
0.6
as given in equation (12). The function slope m could be trapezoidal
inversely proportional to the time it takes to the function to go 0.4
from one to zero, m = 1/tw=0 . This function with tw=0 =
0.2
6000 is plotted in blue in Figure 4. A linear decay has been
used above all in biological and physical systems. 0.0
( 0 2000 4000 6000 8000 10000
1 − m (t − tlk ) t − tlk ≤ tw=0 Time (secs)
f (t, tlk ) = (12)
0 t − tlk > tw=0 Fig. 4. Decay functions used to emulate data forgetting
If a non-forgetting time range is appended to a linear decay, others. Most of classification monitoring techniques focus on
we get a trapezoidal decay profile. This type of function is reject the abnormal observations derived from noised data.
used as profile in electronic applications. The trapezoidal decay In general, outliers indicate noise, damage or errors. Never-
function is given in equation (13) where ta represents the no theless, outliers can also be an indication of the occurrence of
forgetting time, and tw=0 the time when the function reaches unknown events or unexpected patterns resulting from system
zero. This function is represented in red in Figure 4. evolution or reconfiguration.
1 t − tlk ≤ ta Most commonly used outlier detection approaches are those
m−t
f (t, tlk ) = ta ≤ t − tlk ≤ tw=0 (13) classified as distribution-based, distance-based (Knorr et al.,
m − ta 2000), density-based (Breunig et al., 2000) or angle-based
0 (Kriegel et al., 2008). If the probability density function (or
t − tlk > tw=0
functions) is (are) known and alleged to follow a normal distri-
Statistical processes use overall an exponential decay function. bution, the simplest approach to outlier detection is to compute
This function is shown in equation (14), where λd is a positive the probability of a sample to belong to a class (or classes),
rate known as exponential decay constant. The function is plot- and then use a threshold to establish its outlier character. This
ted in magenta in Figure 4. This type of functions is the most approach is equivalent to finding the distance of the sample to
widely used to model decay since it has applications in all fields class means and threshold on the basis of how many standard
of science. A generalization of exponential decay is shown in deviations away the sample is (Markou and Singh, 2003).
equation (15). It is the decay function used by (Aggarwal et al., Unfortunately, in most real world applications the data distri-
2003) and (Kranen et al., 2011) and a graphical representation bution is not known a priori, as a result, non-parametric meth-
can be found in green in Figure 4. The change in the base from ods are generally preferred. Non parametric approaches like
e to any value β gives interesting properties. For example, if distance-based, density-based or angle-based make no assump-
β is chosen to be β = 2ψ, then the time at which half of the tion about the statistic properties of the data which makes them
data is forgotten, is ψλ
1
d
. This function is known as the half life more flexible than parametric methods. In this paper we use the
function and is widely used in biological processes. clustering distance and density framework to detect abnormal
f (t, tlk ) = e−λd (t−tlk ) (14) data which may or may not represent the evolution of a known
−λd (t−tlk )
state or the emergence of a novel behavior.
f (t, tlk ) = β (15)
Since no formal unique definition of outlier exists, the notion of
Figure 4 shows the shape for the named functions in the case outlier may greatly differ from one outlier detection technique
where tw=0 = 6000, ta = 2000. For simplicity in the figure tlk to another. In this work, the outlierness or not of a sample
is set to zero. As previously mentioned, DyClee implements all is related to the outlierness of the µ-cluster that contains its
the functions shown in equations (12) to (15), allowing proper information. Explicitly, for DyClee, an outlier sample is defined
adaptation to all kind of process evolutions. In DyClee the as: A sample contained in an OµC and an OµC is defined
forgetting process impacts clusters density. As explained in the as a representation of samples that, due to their position and
previous section, each tglobal period the density of all µ-clusters density, do not correspond to any of the identified clusters nor
is recalculated. Density change implies µ-cluster type change, represents relevant novel behavior.
its decrease makes Dµ-clusters become Sµ-clusters and Sµ-
The distance and density analysis that DyClee performs allows
clusters become Oµ-clusters, and vice-versa.
it to detect both, global and local multivariate outliers, the
former in all configurations and the latter when multi-density
4. OUTLIER DETECTION AND NOVELTY DETECTION
clustering is used. The outlier character of a sample is binary,
that is, a sample can only be considered as outlier or not outlier.
Outlier detection has been a widely investigated problem in sev- Nevertheless, the outlierness of a µ-cluster may be associated
eral disciplines, including statistics, data mining and machine with its density which provides a score about how much outlier
learning and several approaches have been proposed to deal a µ-cluster is. In other words, the set of all Oµ-clusters can
with the outliers rejection problem ((Hawkins, 1980),(Markou exhibit different densities although their densities remain close
and Singh, 2003),(Zimek et al., 2012)). Outlier detection meth- to each other, that is, in the same relative level with respect to
ods can be classified according to several characteristics: the the clusters densities (Sµ-clusters and Dµ-clusters).
use of training (pre-labeled) data, the assumption of a standard
statistical distribution, the type of data set, dimension of de- If global density is used, DyClee detects only global outliers. As
tected outliers, type of detected outliers (global/local), among said in the previous section, in general, a µ-cluster is considered
21
CHAPTER 1. ARTIFICIAL INTELLIGENCE
as outlier if its density is lower than the median and the average
of all clusters densities, as shown in equation 5. If local-density
analysis is used, the equation 8 shows that an µ-cluster is
considered as outlier iff its density is lower than the group
median density and the group average density, being its group
the set of all the µ-clusters connected (directly or indirectly) to
it.
Using the local-density analysis allow low density populations
(as faults) to be represented as well as high density populations
(as is usually the case of normal behavior). In addition, the
local-density analysis allows the detection of novelty behavior
in its early stages when only a few objects giving evidence of Fig. 6. At left, Clusters of varied density. At right DyClee
this evolution are present. While it is deemed desirable to detect clustering results
clusters of multiple densities, it is also important to maintain the
since faulty states are uncommon samples from those states will
ability to reject outliers. DyClee’s solution to outlier rejection
be also rare.
is based on the exclusion of the µ-clusters found to have low
density in each analyzed group. The toy example chose to test this feature is the multi-density
set used in (Fahim et al., 2010). In this set four different
clusters exhibit densities varying in a wide range. Two of these
5. TESTING IN TOY EXAMPLES
classes present also overlapping. The clustering results of our
algorithm are shown in Figure 6. We can see that the four
This section will illustrate how DyClee can be used to achieve classes are correctly recognized.
some important features in supervision. The first test aims to
show the capability of tracking the system through different Another highly desired feature comes from the fact that, in most
operation points, even if these points are relatively close to each of the real world applications, non-stationarity is typical and
other. To this end, we select the R15 dataset from (Veenman data is expected to evolve over time. In this example our algo-
et al., 2002). This dataset of 600 points is generated by 15 rithm is confronted with slowly time changing distributions, or
similar 2D Gaussian distributions. DyClee deals with high as is usually called in the online learning scenario, concept drift
overlapped data distributions from its conception. Figure 5 (Gama et al., 2014).
shows DyClee clustering results compared to those achieved in
These drifts can come, for example, from physical wear of some
(Veenman et al., 2002). The 15 classes are correctly recognized.
mechanical parts, by the sensitivity loss of some sensors or
by the addition of a new source of noise. In order to shown
DyClee performance over this issue, a synthetic set was created.
Three clearly differentiated distributions are used to form three
clusters. Two of this groups drift in time until shift positions as
can be seen in Figure 7. Synthetic data are generated changing
the center of the distribution each 100 samples.
4
k=0 4
k=1500
3 3
2 2
1 1
0 0
−1 −1
−2 −2
−3 −3
−4 −4
−10 −5 0 5 10 −10 −5 0 5 10
4
k=2900 4
k=5300
1 1
−1 −1
another.
Fig. 7. Concept Drift toy example
As a second test we explore the ability of our algorithm to
cluster multi-density distributions. As stated before, the ability Dynamic data cause several clusterers to fail in finding clusters
to detect distributions evidencing different amount of samples distribution change since they cannot cope with evolution,
or concentrations is a desirable feature in supervision since it losing hence, the tracking over the system state. Snapshots
can improves the detection and characterization of unknown be- showing the distribution of µ-clusters in several time instants
haviors. Multi-density situation are very common in industrial are depicted in Figure 8. It can be seen how clusters evolution
environments, where the measures coming from the process in is followed thanks to the drift of some of the existent µ-clusters
normal operation mode(s) are wide more abundant that those and to the creation of new µ-clusters. Growth in the amount of
coming from start-up or maintenance routines. Even more, clusters can be seen between snapshots one and two, and again
22
CHAPTER 1. ARTIFICIAL INTELLIGENCE
0.0
0.0 0.2 0.4 0.6 0.8 1.0
0.0
0.0 0.2 0.4 0.6 0.8 1.0 considered faults, tank leakage and valve stuck are found.
Unclass class 1 class 2 class 3 Unclass class 1 class 2 class 3
0.8 0.8
6.1 Scenario 1:Tracking state drift
0.6 0.6
One common problem in industrial applications is that states
0.4 0.4
might drift when the physical parts of the system are exposed
0.2 0.2 to wearing processes. In the case of the CSTH we simulate the
0.0
0.0 0.2 0.4 0.6 0.8 1.0
0.0
0.0 0.2 0.4 0.6 0.8 1.0
evolution of OP 1 when residues accumulate in the border of
Unclass class 1 class 2 class 3 Unclass class 1 class 2 class 3
the output pipe causing a drop in the maximal output flow.
The process measurements for this scenario can be seen as
Fig. 8. µ-clusters following system evolution continuous lines in Figure 10. This figure also shows that our
clustering algorithm is capable of following this evolution. The
In the next section these features help in the diagnosis of the polynomial fit is also show in the figure, represented as dashed
CSTH benchmark introduced in (Thornhill et al., 2008). lines.
6. DIAGNOSIS CASE STUDY: THE CSTH FlowCW TankTemp Level_trend
TankLevel Flow_trend Temp_trend
The CSTH is a benchmark of a stirred tank in which hot 12.5
(50◦ ) and cold (24◦ ) water are mixed and further heated using
steam; the final mix is then drained using a long pipe. The 12.0
System measures (mA)
23
CHAPTER 1. ARTIFICIAL INTELLIGENCE
FlowCW FlowCW
0.65 0.65
0.60 0.60
0.55 0.55
0.50 0.50
0.45 0.45
0.40 0.40
t= 250 t= 2500
16
14
12
10 14 16 18
10 12
6 8
8
6
0 500000 1000000 1500000 2000000
Time (seconds)
10
Cluster \#
8
6
4 TankLevel TankTemp
2
0
Clus 1 Clus 3 Clus 5 Clus 7 Clus 9
Clus 2 Clus 4 Clus 6 Clus 8 Clus 10
Fig. 12. DyClee clustering results and process measurements
for scenario 2 Fig. 14. Radar clusters’ representation for scenario 2
Since DyClee works under a non-supervised learning paradigm,
only the system measures are necessary as input for the al- 7. CONCLUSIONS
gorithm. Unlike most of the tracking algorithms that can only
track known behaviors, DyClee start of not behavior at all and
built his knowledge when new behaviors are recognized. This In this paper, we propose a multi-density improvement to dy-
can be seen in Figure 12 where DyClee can successfully track namic clustering that follows system evolution by adapting its
online the process and its evolution. Ten different behaviors forgetting process. This work illustrates with practical exam-
are recognized. The label associated to these clusters is re- ples, how data-based clustering under non supervised learning
lated to their order of apparition, and zero represents the non- paradigm can help in process supervision. The proposed algo-
representative behavior caused by extremely noised samples or rithm is used to monitor industrial processes and has proved
by transition states. to be capable to detect different types of faults including those
with time varying dynamics on the selected benchmark. The
In order to illustrate DyClee structural evolution a snapshot of algorithm shows good performance in presence of disturbances
the µ-clusters distribution in t between 405000 and 900000 and the results follow the evolution of the system. Tests have
seconds is presented in Figure 13. For this experiment the shown performance improvement when local-density analyses
sampling time were Ts = 0.5s. The algorithm start window size are used together with the fast distance-based clustering.
24
CHAPTER 1. ARTIFICIAL INTELLIGENCE
From a technical point of view, it would be interesting to com- the diagnosis of complex systems. In Fault Detection,
pare the proposed algorithm with systems of different nature Supervision and Safety of Technical Processes, 1132–1137.
and include categorical features, which may consolidate the Joentgen, A., Mikenina, L., Weber, R., and Zimmermann, H.
proposed methodology. (1999). Dynamic fuzzy data analysis based on similarity
between functions. Fuzzy Sets and Systems, 105(1), 81–90.
Kempowsky, T., Subias, A., and Aguilar-Martin, J. (2006).
REFERENCES Process situation assessment: From a fuzzy partition to a
finite state machine. Engineering Applications of Artificial
Aggarwal, C.C., Han, J., Wang, J., and Yu, P.S. (2003). A Intelligence, 19(5), 461–477.
framework for clustering evolving data streams. In Proceed- Knorr, E.M., Ng, R.T., and Tucakov, V. (2000). Distance-based
ings of the 29th international conference on Very large data outliers: algorithms and applications. The VLDB Journal –
bases-Volume 29, 81–92. VLDB Endowment. The International Journal on Very Large Data Bases, 8(3-4),
Angelov, P. (2011). Fuzzily connected multimodel systems 237–253.
evolving autonomously from data streams. Systems, Man, Kranen, P., Assent, I., Baldauf, C., and Seidl, T. (2011). The
and Cybernetics, Part B: Cybernetics, IEEE Transactions on, ClusTree: indexing micro-clusters for any stream mining.
41(4), 898–910. Knoledge and information systems, 29(2), 249–272.
Barbosa, N., Massuyes, L.T., and Grisales, V.H. (2015). A data- Kriegel, H.P., Shubert, M., and Zimek, A. (2008). Angle-based
based dynamic classification technique: A two-stage density outlier detection in high-dimensional data. In Proceedings of
approach. In SAFEPROCESS 2015, Proceedings of the 9th the 14th ACM SIGKDD International Conference on Knowl-
IFAC Symposium on Fault Detection, Supervision and Safety edge Discovery and Data Mining, KDD ’08, 444–452. ACM,
for Technical Processes, 1224–1231. IFAC. New York, NY, USA.
Bouchachia, A. and Vanaret, C. (2011). Incremental learning Li, K., Yao, F., and Liu, R. (2011). An online clustering algo-
based on growing gaussian mixture models. In Machine rithm. In Fuzzy Systems and Knowledge Discovery (FSKD),
Learning and Applications and Workshops (ICMLA), 2011 2011 Eighth International Conference on, volume 2, 1104–
10th International Conference on, volume 2, 47–52. IEEE, 1108. IEEE.
Elsevier. Markou, M. and Singh, S. (2003). Novelty detection: a review–
Breunig, M.M., Kriegel, H.P., Ng, R.T., and Sander, J. (2000). part 1: statistical approaches and part 2: neural network based
Lof: Identifying density-based local outliers. SIGMOD Rec., approaches. Signal processing, 83(12), 2481–2497.
29(2), 93–104. Maurya, M.R., Paritosh, P.K., Rengaswamy, R., and Venkata-
Cao, F., Ester, M., Qian, W., and Zhou, A. (2006). Density- subramanian, V. (2010). A framework for on-line trend
based clustering over an evolving data stream with noise. In extraction and fault diagnosis. Engineering Applications of
SDM, 326–337. Artificial Intelligence, 23(6), 950 – 960.
Dash, S., Maurya, M.R., Venkatasubramanian, V., and Ren- Maurya, M.R., Rengaswamy, R., and Venkatasubramanian, V.
gaswamy, R. (2004). A novel interval-halving framework for (2007). Fault diagnosis using dynamic trend analysis: A
automated identification of process trends. AIChE journal, review and recent developments. Engineering Applications
50(1), 149–162. of Artificial Intelligence, 20(2), 133 – 146.
Eduardo Mendel, F.M.V., Rauber, T.W., and Batista, R.J. Mitra, P., Pal, S.K., and Siddiqi, M.A. (2003). Non-convex
(2011). Condition monitoring based on kernel classifier clustering using expectation maximization algorithm with
ensembles. In 9th IEEE International Conference on Indus- rough set initialization. Pattern Recognition Letters, 24(6),
trial Informatics (INDIN), 81 –85. doi:10.1109/INDIN.2011. 863 – 873.
6034841. Thornhill, N. (web resource). The csth web
Ester, M., Kriegel, H.P., Sander, J., and Xu, X. (1996). A site. Accessed: 2015-02-12. http://personal-
density-based algorithm for discovering clusters in large pages.ps.ic.ac.uk/˜nina/CSTHSimulation/index.htm.
spatial databdata with noise. In KDD, 226–231. Thornhill, N.F., Patwardhan, S.C., and Shah, S.L. (2008). A
Fahim, A., Salem, A.E., Torkey, F., Ramadan, M., Saake, G., continuous stirred tank heater simulation model with appli-
et al. (2010). Scalable varied density clustering algorithm cations. Journal of Process Control, 18(3), 347–360.
for large datasets. Journal of Software Engineering and Vachtsevanos, G., Lewis, F., Roemer, M., Hess, A., and Wu,
Applications, 3(06), 593. B. (2007). Intelligent Fault Diagnosis and Prognosis for
Gama, J., Žliobaitė, I., Bifet, A., Pechenizkiy, M., and Engineering Systems. John Wiley & Sons, Inc.
Bouchachia, A. (2014). A survey on concept drift adaptation. Veenman, C.J., Reinders, M.J., and Backer, E. (2002). A
ACM Computing Surveys (CSUR), 46(4), 44. maximum variance cluster algorithm. Pattern Analysis and
Gamero, F.I., Melndez, J., and Colomer, J. (2014). Process Machine Intelligence, IEEE Transactions on, 24(9), 1273–
diagnosis based on qualitative trend similarities using a se- 1280.
quence matching algorithm. Journal of Process Control, Zimek, A., Schubert, E., and Kriegel, H.P. (2012). A survey on
24(9), 1412 – 1424. unsupervised outlier detection in high-dimensional numeri-
Hawkins, D.M. (1980). Identification of outliers, volume 11. cal data. Statistical Analysis and Data Mining, 5(5), 363–
Springer. 387.
Hedjazi, L., Kempowsky-Hamon, T., Despnes, L., Le Lann,
M.V., Elgue, S., and Aguilar-Martin, J. (2010). Sensor
placement and fault detection using an efficient fuzzy feature
selection approach. In Decision and Control (CDC), 2010
49th IEEE Conference on, 6827 –6832.
Isaza, C.V., Orantes, A., Kempowsky-Hamon, T., and Lann,
M.V.L. (2009). Contribution of fuzzy classification for
25
CHAPTER 2
AUTOMATION
CHAPTER 2. AUTOMATION
Abstract: The inherent complexity of critical production systems, inserted in a context of sustainability
policies and environmental preservation and protection of the people, are the motivation for the design of
safety control systems, which aim to reduce the risk, inherent of any production system, to a safe level.
According to the experts, the concept of Safety Instrumented Systems (SIS) is a solution in that promotes
the reduction of risk through hierarchical layers to prevent and / or mitigate faults. Standards such as IEC
61508/61511 refer to the performance requirements, but do not mention implementation methods. Whereas
the occurrence of a fault may be associated with the occurrence of an event, Petri nets become an efficient
tool for design the detection and treatment of the effects of the occurrence of faults. This paper proposes a
method for implementing SIS in critical production systems from risk analysis techniques, fuzzy logic and
Coloured Petri Nets, considering the integration of the prevention SIS and the mitigation SIS.
Keywords: Critical Production Systems, Faults, Safety Instrumented System, Coloured Petri Nets.
complexity of the SPs, serious risks to the physical integrity of
1. INTRODUCTION
people, the environment and economic losses resulting from
In this first decade of the century XXI many studies have damage to the equipment itself (Sallak, et al., 2008). Although
indicated that automation processes are undergoing many studies have been presented both for diagnosis and for
transformations that have been strongly influenced by the the treatment of a particular class of faults (Morales et al.,
advance of technology and computing resources, becoming 2007) (Ru & Hadjicostis, 2008) (Wang et al., 2008) (Zhang &
increasingly complex due to their dynamic and needed to Jiang, 2008) (Summers & Raney, 1999) (Sallak, et al., 2008),
address issues such as global market competitive production accidents continue to occur.
and technology used, among other factors (Chen & Dai, 2004),
In this context, according to specialists, the use of Safety
(Santos Filho, 2000), (Wu, et al., 2008). Given this new
Instrumented Systems - SIS is a solution to this problem by
scenario, industrial processes and their control are becoming
inserting successive risk reduction layers to prevent and / or
more and more complex. Additionally, organizations have
mitigate the effects of occurrence of a fault in a SP. SIS are
focused on policies to achieve and to demonstrate people’s
referenced in standards such as IEC 61508 and IEC 61511, that
safety and health, environmental management system, and the
lists the performance requirements and the life cycle for a
capability in risk management.
design of a SIS. However, the standards make no mention of
In a globalized and competitive environment in which methods for SIS implementation.
organizations are inserted, it is essential to adopt strategic
According to the relevance of the subject, several works have
plans and operational practices that ensure the ability to adapt
been published. (Squillante JR, 2013) proposes a method for
rapidly and consequent change of the systems-productive but
implementing the SIS prevention, in that the risk of an SP is
hitherto conceived. The expectation is that in addition to result
evaluated by a HAZOP (Hazard and Operability) study. The
in a process with effective cost reduction, high product quality
results are refined by Bayesian networks (BN), and the control
and flexibility of production lines, and reduction of new
algorithm for the detection, diagnosis and treatment of a fault
products and delivery development times (Santos Filho, 2000)
are modelled by classical Petri nets. (Souza, 2014) proposes a
(Chen & Dai, 2004) (Wu et al., 2008), also causes the
system for the mitigation of SIS, in which the critical elements
reduction of environmental impacts of the process. The results
of the SP are defined from risk analysis techniques, and actions
of this new scenario are Productive Systems (SPs) that perform
of mitigation are determined by fuzzy logic. The both studies
highly complex processes (Sampaio, 2011) (Ferreira et al.,
resulted in graphs of high complexity due to the
2014) that might not be achievable by conventional production
interrelationship of arcs of the PN net. Another factor was the
methods (Mazzolini, et al., 2011). Because of this complexity
limited integration of prevention and mitigation models,
inherent in any modern production system, some states, though
revealing gaps for the actual implementation of a SIS.
undesirable, can be achieved, it could be mentioned: the fault
states of components, design flaws, or operational errors, This paper proposes a method for the implementation of SIS
including intentional, and environmental events that involve critical SPs with reference to IEC 61508 and IEC 61511,
the system. Such occurrences could result, depending on the considering the integration of prevention and mitigation SIS,
27
CHAPTER 2. AUTOMATION
from the use of high-level Petri nets - HLPN for formal logic components and / or operational human errors that may
modelling of control algorithms. be associated with the occurrence of the top event. The
analysis is done from the construction of a logical tree. In this
2. FUNDAMENTAL CONCEPTS
way, one obtains a graph which can be used to identify all
2.1 Safety Instrumented Systems possible causes for the occurrence of a fault (Modarres, et al.,
2010). The graph enables an analysis of the "top-down", which
Safety Instrumented System (SIS) is a safety control system
results in understanding how the event occurred. In the
which aims to reduce the risks associated with SPs. In general, analysis "bottom-up" it has been "why" of the event. The
the role of a SIS is to monitor through security sensors, critical advantage of FTA on the FMEA is that one can have a
events in the industrial process and indicate alarms or perform
combination of several elements or multiple failure modes, the
preset actions through security actuators, for the prevention of
graph connected by logic elements such as "and" and "or".
accidents or mitigation of the consequences generated by these
events (Goble, 1998). The study of operability and risks, or HAZOP (HAZard and
OPerability studies) defined in IEC 61882 (IEC, 2001) was
According to IEC 61508, an SIS consists of independent layers developed for efficient and detailed examination of the
and risk reduction successive, which can be implemented for variables of a process having a strong resemblance to the
safety control systems that operate independently of the Basic
FMEA. Hazop identifies the ways in which the process
System Process Control - BPCS. The structure of an SIS with
equipment may fail or be improperly operated. It is developed
Reference to IEC 61508/61511 standards is illustrated in by a multidisciplinary team, being guided by the application of
Figure 1. specific words - words guide - each process variable. Thus, to
generate the deviation of operational standards, which are
analyzed in relation to their causes and consequences.
2.3 Petri nets
Petri net (PN) as a graphical tool and mathematics provides a
uniform way for model, analysis and design of Discrete Event
Systems - SEDs (Adam et al, 1998; Nassar, et al. 2008;
Zurawski & Zhou, 1994), being effective as a description of
technical and specification processes (Hamadi & Benatallah,
2003; Morales et al, 2007; Yoo, et al, 2010). It provides a
representation that can be used both as a conceptual model and
functional model of a system that can analyze and validate the
operation of the system at each stage of its development cycle.
The PN can also be used as a design tool, allowing for easy
interpretation and identification of processes and their
dynamic behavior and / or systems being modelled (Nassar et
al., 2008). The models based on NP can be used for qualitative
and quantitative assessment involving the analysis of the
behavioral properties and performance measure, respectively.
Moreover, with the development of software simulators
(Zurawski & Zhou, 1994), has provided tools for editing and
Fig. 1. Risk reduction layers (extracted from IEC 61511-1) analysis of these models. Enables the representation of the
system dynamics and structure at various levels of abstraction,
according to the with-complexity system (Nassar, et al., 2008).
2.2 Risk Analysis Techniques It is able to model synchronization process, the occurrence of
asynchronous events, competitors and conflict operations, or
This section introduces the fundamental concepts of some risk resource sharing (Adam et al, 1998; Nassar, et al, 2008).
analysis techniques, such as FMEA, FTA, HAZOP and What-
If technique. Among the extensions of PN can mention the high-level Petri
nets - HLPN (Smith, 1998) (ISO / IEC 15909, 2000) and the
The FMEA technique, described in IEC 60812 (IEC 2006), Coloured Petri nets - CPN (Jensen, 1997), in which the
consists of a detailed and systematic study of possible failures graphical representation the model is reduced, without losing
of the components of a system. The failure modes of each the formalism of classical PN, increasing the level of
component are identified, and a severity level is associated abstraction and the power of process modelling, in which the
with its effect, and assess the likelihood of their occurrence. network places have individual marks and separated by type,
The FMEA also discusses actions to eliminate, mitigate and declared the identity of each variable. The arcs have markings
control the causes and consequences of failures. (Lewis, 1995). and filters that select the type of mark that can flow through
Another technique used is the Fault Tree - FTA, deductive this arc and the transition that occur in different modes
reasoning methodology described in IEC 62025 (2008) that confined to the locality rule their pre-set and post-set. The
part of a top event, which is the occurrence of a specific fault dynamic graph composed is the result, so the whole graph,
in a system, which aims to determine the relationship fault registration and declaration.
28
CHAPTER 2. AUTOMATION
29
CHAPTER 2. AUTOMATION
3.3 Detection of effects of critical element´s faults Table 2. Cause - effect matrix for SIF 1
For each fault mode, a specific sensor links every critical
component to the mitigation PLC. According to IEC 61508,
such sensors must be independent of the BPCS. Redundant
architectures (Squillante Jr, et al., 2011), such as the criteria
voting 2oo3 (two of three), avoid spurious faults and sensor
reading errors.
For the results of the FMEA and the FTA of previous step,
thermocouples were inserted with the respective Tags TET-
211A to TET 211D, respectively to each of the four
compressor units. The cause-effect matrix for mitigation of SIF 1 consists of all
actions foreseen in the SIS prevention for SIF1, in addition to
3.4 Prevention and mitigation actions fast and pressurized stop with carbon dioxide from the
compressor units and closing stock valves with their tags: XV-
For each effect of a critical fault, detected by the SIS mitigation 10A, XV-11A, 10B-XV, XV-11B, 10C-XV, XV-11C, 10D
sensors, SIS mitigation actuators implement a mitigation XV-XV-11D, 25A-XV, XV-26A, 25B-XV, XV-26B, XV-
action, controlled by the SIS mitigation control layer, aiming 25C, XV-26C, XV-25D, XV-26D, XV-16A, XV-16B, XV-
to preserve people, environment and equipment. 16C, XV-16D. The result is an array of 4 lines with the same
initiators events SIF1 prevention, but with a total of 36
To determine de mitigation actions will make use of What-If columns.
technique, based on human knowledge and records of
occurrence of faults, its effects and the actions proposed to The analysis of the whole process resulted in 12 SIFs, 72
mitigate its effects. initializers events (causes), and 108 actions (effects).
To mitigate the effects caused by the occurrence of a fault in 3.6 Control Algorithm Models - PN
the compressor, beside the action of shutdown from the The next step is the control algorithm modelling to detect the
prevention layer, a suggested action to mitigate the effects is initiator event each SIF, represented by the signs of the
the forced cooling, in case of preventive layer is not sufficient respective sensors, the confirmation, and treatment of fault,
or if the temporal variation of temperature proves too high. represented by their prevention and mitigation actions, based
on the results obtained by the team of experts in the process.
Their prevention and mitigation actions are performed until all
sensors indicate the safe state of their respective variables in a
logical "and". After this check, the actions of prevention /
mitigation are ceased, and the process should, according to
IEC 61508, undergo a thorough survey process for possible
fixes and maintenance for the process to be restarted.
This sequence of actions can be classified as a sequence of
discrete events, so can be modelled by Petri nets. For each SIF
is generated a model of prevention and mitigation, and
initializers events may be common to models of prevention
Fig. 4. P&ID of carbon dioxide cylinders of station and mitigation, and prevention and mitigation may be common
After this study and compilation of mitigation actions, will to different SIFs.
determine which actuators required for each mitigation action. For each model generated, the properties of bounding, liveness
and safety are checked, with the aid of computational tools as
3.5 Cause-effect matrix and identifying SIFs
PIPE2 and HPSim.
For each initiator event, represented by the sensor signals, The results were high extended and complex graphs,
determines their prevention and mitigation, the representation considering that the process resulted in a total of 40 PN
is making in an array, in which the causes are listed in columns models, 72 initializers events and 57 actuators. Figures 5 and
and the effects associated with mitigation actions (actuators) 6 (annex) illustrates the PN model for the prevention and
in the respective lines. mitigation of SIF1, respectively.
Initializers common events can be grouped, representing,
3.7 Parameter determination – fuzzy logic
according to IEC 61508, the instrumented safety functions -
SIFs. Mitigation actions can also be common to different SIFs. From the initiators events and their prevention and mitigation
The cause-and-effect matrix for SIF 1, represented by the developed qualitatively in the cause-effect matrix, the next
signals from the sensors TET211 is illustrated in Table 3. step is the analysis of these signals. From the results of What-
If technical team of experts, fuzzy logic can be used to
associate their actions of actuators for the ranges of process
30
CHAPTER 2. AUTOMATION
variables, or variables associated with any failure of a Applying the fuzzy logic to the whole process was obtained a
component parameter. total of 180 rules, resulting in a greatly compromised
understanding due to the interrelationship of common
To illustrate the algorithm, the expert reports, to a value
actuators to different rules.
between 10% and 30% above the set point temperature of
TET211 shutdown must be performed. Longer a value above 3.8 Integration of PN and fuzzy models – Coloured Petri nets
50% would be unacceptable, which requires a mitigation
action beyond the proposed shutdown. Table 7 shows this From the results obtained in fuzzy logic, the next step is the
result in a fuzzy membership function implemented in the use of high-level Petri nets, in particular the Coloured Petri
fuzzy MatLab® toolbox. Note that we have gaps in which two nets for the formal modelling of the control algorithm,
actions may be occurring, with different proportions. considering the integration of the models in classical PN of
Table 7. Membership functions for temperature prevention and mitigation SIFs.
Coloured Petri nets, due to its high power of representation,
Temperature Action
resulted in simplified graphs, compared with the classical PN,
Set Point BPCS without losing the power of representation. Another factor that
contributed to the simplification was the use of hierarchical
Above 30% of SP Prevention models for all SIFs, in which the covering properties and
liveliness can be checked from the use of own elements in
Above 50% of SP Mitigate hierarchical transitions. Figure 12 illustrates the model in CPN
of the SIS with reference to IEC 61508/61511.
1 BPCS CLOSED
31
CHAPTER 2. AUTOMATION
the SIF1. Other SIFs has isomorphic models. applying risk analysis techniques to determine the critical
elements. The next step consists in determine the control
signals (sensors) that can be associated with the occurrence of
faults of that critical elements which may or may not be
associated with process variables. Fuzzy logic is used to
determine the intervals of the respective sensor signals to
indicate the need to promote the shutdown and enable their
mitigation actions. Petri nets are used for the for the control
algorithm and formal model verification. However, the use of
classical PN can result in high complexity graphs, in fact
verified results. The high-level Petri nets were then used,
resulting in a better understanding of graphs, without losing
the complexity of the classical model and the results of fuzzy
Fig. 13. CPN model for prevention – SIF1
logic. Another advantage is the generation of hierarchical
models, the models for the prevention and mitigation can be
modelled in a hierarchical manner for the different SIFS
process, greatly facilitating the generation of the final control
algorithm SIS.
REFERENCES
Bell, R., 2005. “Introduction to IEC 61508”. In Proceedings of ACS
Workshop on Tools and Standards. Sydney, Australia.
Chen, C. and Dai, J., 2004. Design and high-level synthesis of hybrid
controller. In Proc. of IEEE Conference.
IEC, I.E.C., “Programmable Controllers IEC 61131-7: Fuzzy Control
programming”, 2000.
IEC, I.E.C., 2003a. “Functional safety - safety instrumented systems
Fig. 14. CPN model for mitigation – SIF 1 for the process industry sector - part 1 (IEC 61511)”
IEC, I.E.C., 2010. “Functional safety of electrical / electronic /
3.9 Control codes generation based on IEC 61131-7 programmable electronic safety-related systems (IEC 61508)”.
ISO/IEC 15.909, “High Level Nets: Concepts, Definitions and
The control codes, implementable in commercial controllers, Graphical Notations”, Final Draft, v. 4.7.1, 2000.
can be written from the results of fuzzy logic, based on the Jensen, K., “Coloured Petri Nets. Basic Concepts, Analysis Methods
dynamical evolution of the CPN models. The IEC 61131-7 and Paractical Use.”, Springer-Verlag, 1997.
reference to conversion of fuzzy logic in structured text Lee, C. C.: “Fuzzy logic in control system: fuzzy logic controller Part
1”. In IEEE Transactions n System, Man and Cybernetic, Vol
language. Figure 15 show an example of control code, based
20, n⁰2, p. 404-418, 1990.
on fuzzy logic. Lewis, E.E., 1995. Introduction to Reliability Engineering. 2⁰ Ed.,
John Wiley&Sons.
Modarres, M., Kaminskiy, M., Krivstov, V., 2010. Reliability
Engineering and Risk Analysis:a practical guide.,CRC Press.
Popa, D. D., Craciunescu, A, Kreindler, L.: “A PI-Fuzzy controller
designated for industrial motor control applications”. In ISIE
IEEE International Symposium on Applications, Industrial
Eletronics, 2008.
Sallak, M.; Simon, C.; Aubry, J., A fuzzy probabilistic approach for
determining safety integrity level, IEEE Transaction on Fuzzy
Systems, vol 16, n. 1, pp. 239-248, 2008.
Smith, E.; “Principles of High Level Petri Nets”, LNCS 1491, pp.
174-210.
Souza, J.A.L., “Mitigação de Falhas Críticas em Sistemas
Produtivos”. Dissertação de Mestrado. Escola Politécnica da
Universidade de São Paulo. 2014.
Squillante Jr, R.; Santos Fo, D.J.; Souza, J.A.L.; Junqueira, F,;
Myiagi, P.E. “Safety in Supervisory Control for Critical
Systems”. IFIP International Federation for Information
Fig. 15. Control code, based on IEC 61131-7 Processing (DoCEIS 2013), vol 394, pp. 261-270,
Summers, A.; Raney, G. Common cause and common sense,
designing failure out of your safety instrumented systems (SIS).
In: ISA Transactions, vol 38, pp. 291-299, 1999.
4. CONCLUSIONS Zadeh, L. A.: Fuzzy Sets. Information and Control, Vol 8, p. 338-353,
1965.
The paper presents a method for the implementation of SIS to
Zhang, Y; Jiang, J. Bibliographical review on reconfigurable fault-
mitigate faults in critical SPs with reference to IEC tolerant control systems, Annual Reviews in Control, vol 32, pp.
61508/61511. A team of experts in the process assist in 229-252, 2008.
32
CHAPTER 2. AUTOMATION
Appendix A. PN model for SIF 1 Prevention Appendix B. PN model for SIF 1 Mitigation
33
CHAPTER 2. AUTOMATION
Resumen: Este artículo presenta una plataforma de modelado y simulación de un sistema de distribución
de energía eléctrica con una carga trifásica no lineal, utilizando el lenguaje VHDL-AMS (lenguaje de
descripción de hardware de altísima velocidad que incluye extensiones digitales, analógicas y de señal
mixta) y, además, empleando dos métodos de simulación ampliamente conocidos: Orcad/Pspice y
Matlab/Simulink, con miras a establecer una comparación cuantitativa entre los dos métodos de
simulación y el lenguaje VHDL-AMS. El caso de estudio usado en esta investigación consiste en un
alimentador de distribución de 23 nodos, considerando un escenario real de distribución de energía
eléctrica radial típico brasileño, en el cual fue aplicada una herramienta de concentración y reducción de
cargas eléctricas. Los resultados de las simulaciones comparativas demuestran la eficacia y exactitud de
la plataforma de simulación desarrollada en VHDL-AMS, así como también se evidencia que el método
de simulación Orcad/Pspice presenta mayores similitudes con los resultados de las simulaciones en
VHDL-AMS, en comparación con los resultados obtenidos en Matlab/Simulink. Los modelos en VHDL-
AMS viabilizan el desarrollo de una futura arquitectura de simulación en tiempo real y control para el
alimentador de distribución de energía eléctrica, utilizando los lenguajes de descripción de hardware
VHDL-AMS y VHDL.
Palabras clave: Cargas Eléctricas No Lineales, Lenguajes de Descripción de Hardware,
Matlab/Simulink, Orcad/Pspice, Sistemas de Distribución de Energía Eléctrica, VHDL-AMS.
Por lo anterior, en este trabajo, además de proponer una
1. INTRODUCCIÓN
herramienta que utiliza el lenguaje VHDL-AMS para la
De acuerdo a Oliveira et al. (2011), la evolución tecnológica simulación completa de una red de distribución de
de los dispositivos electrónicos ha ocasionado un incremento alimentación primaria con carga no lineal, considerando la
en el número de cargas no lineales, las cuales hoy por hoy existencia de elementos de compensación y regulación de
aún están creciendo en todos los sectores, especialmente en el tensión, compara cuantitativamente los métodos de
sector industrial, afectando la seguridad y la continuidad del simulación Orcad/Pspice y Matlab/Simulink con el lenguaje
servicio y por otra parte exigiendo un mayor esfuerzo a las VHDL-AMS. Es importante mencionar que esta herramienta
tecnologías de compensación y regulación convencionales, propuesta es fundamental para el desarrollo de un simulador
existentes en los Sistemas de Distribución de Energía en tiempo real para sistemas de distribución de energía
Eléctrica (SDEE). eléctrica, en el cual, basados en los resultados comparativos
Por otra parte, McGranaghan (2007) considera que las de este trabajo (al ratificarse la eficacia del lenguaje VHDL-
empresas de distribución de electricidad son responsables de AMS vs Orcad/Pspice y Matlab/Simulink), se utilizará al
las condiciones del suministro de la energía eléctrica, estas lenguaje de descripción de hardware VHDL-AMS como
compañías necesitan herramientas de ingeniería y plataforma de modelación y simulación de un sistema
conocimiento científico para interpretar y aplicar la eléctrico y posteriormente se utilizará al lenguaje VHDL (el
información recibida, de tal manera que puedan tomar cual es perfectamente compatible con VHDL-AMS), para el
decisiones técnicas acertadas. En este contexto, el diseño del sistema de gerenciamiento y control de la red de
procesamiento y análisis de los indicadores de calidad de la distribución, cumpliendo así con los objetivos de gestión de
energía eléctrica (CEE) pasa por un proceso de investigación los indicadores de la CEE y el control en tiempo real de los
de los fenómenos en régimen permanente y transitorio de los SDEE.
sistemas de distribución.
34
CHAPTER 2. AUTOMATION
35
CHAPTER 2. AUTOMATION
36
CHAPTER 2. AUTOMATION
4 4
x 10 x 10
1,5 1,5
1 1
0,5 0,5
Tensión (V)
Tensión (V)
0 0
-0,5 -0,5
-1 -1
-1,5 -1,5
0,5 0,505 0,51 0,515 0,52 0,525 0,5 0,505 0,51 0,515 0,52 0,525
Tiempo (s) Tiempo (s)
Va VHDL-AMS Va SIMULINK Va PSPICE
Va VHDL-AMS
Vb VHDL-AMS Vb SIMULINK Vb PSPICE
Vb VHDL-AMS
Vc VHDL-AMS Vc SIMULINK Vc PSPICE
Vc VHDL-AMS
Fig. 3. Tensiones de Fase de la Línea que Alimenta a la Fig. 6. Tensiones de Fase de la Línea que Alimenta a la
Carga con la Peor Regulación de Tensión (LINHA_PRI- Única Carga No Lineal Desequilibrada (LINHA_PRI-
X539727_1605338). VHDL-AMS vs SIMULINK. X28676063_4782056). VHDL-AMS vs PSPICE.
150 4
x 10
1,5
100
1
50
Corriente (A)
0,5
Tensión (V)
0
0
-50 -0,5
-100 -1
-150 -1,5
0,5 0,505 0,51 0,515 0,52 0,525 0,5 0,505 0,51 0,515 0,52 0,525
Tiempo (s) Tiempo (s)
Fig. 4. Corrientes de la Línea que Alimenta a la Carga con la Fig. 7. Tensiones de Fase de la Línea que Alimenta a la
Peor Regulación de Tensión (LINHA_PRI- Única Carga No Lineal Desequilibrada (LINHA_PRI-
X539727_1605338). VHDL-AMS vs PSPICE. X28676063_4782056). VHDL-AMS vs SIMULINK.
150 200
150
100
100
50
Corriente (A)
Corriente (A)
50
0 0
-50
-50
-100
-100 -150
-150 -200
0,5 0,505 0,51 0,515 0,52 0,525 0,5 0,505 0,51 0,515 0,52 0,525
Tiempo (s) Tiempo (s)
Ia VHDL-AMS Ia SIMULINK Ia VHDL-AMS Ia PSPICE
Ib VHDL-AMS Ib SIMULINK Ib VHDL-AMS Ib PSPICE
Ic VHDL-AMS Ic SIMULINK Ic VHDL-AMS Ic PSPICE
Fig. 5. Corrientes de la Línea que Alimenta a la Carga con la Fig. 8. Corrientes de la Línea que Alimenta a la Única Carga
Peor Regulación de Tensión (LINHA_PRI- No Lineal Desequilibrada (LINHA_PRI-
X539727_1605338). VHDL-AMS vs SIMULINK. X28676063_4782056). VHDL-AMS vs PSPICE.
37
CHAPTER 2. AUTOMATION
200 600
150
400
100
200
Corriente (A)
Corriente (A)
50
0 0
-50
-200
-100
-150 -400
-200
0,5 0,505 0,51 0,515 0,52 0,525 -600
Tiempo (s) 0,5 0,505 0,51 0,515 0,52 0,525
Tiempo (s)
Ia VHDL-AMS Ia SIMULINK
Ib VHDL-AMS Ib SIMULINK Ia VHDL-AMS Ia PSPICE
Ic VHDL-AMS Ic SIMULINK Ib VHDL-AMS Ib PSPICE
Ic VHDL-AMS Ic PSPICE
Fig. 9. Corrientes de la Línea que Alimenta a la Única Carga
No Lineal Desequilibrada (LINHA_PRI- Fig. 12. Corrientes a la Entrada del Regulador de Tensión
X28676063_4782056). VHDL-AMS vs SIMULINK. (LINHA_PRIX28760949_960319). VHDL-AMS vs PSPICE.
4 600
x 10
1,5
400
1
200
Corriente (A)
0,5
Tensión (V)
0
0
-200
-0,5
-400
-1
-600
-1,5 0,5 0,505 0,51 0,515 0,52 0,525
0,5 0,505 0,51 0,515 0,52 0,525 Tiempo (s)
Tiempo (s)
Ia VHDL-AMS Ia SIMULINK
Va VHDL-AMS Va PSPICE Ib VHDL-AMS Ib SIMULINK
Vb VHDL-AMS Vb PSPICE Ic VHDL-AMS Ic SIMULINK
Vc VHDL-AMS Vc PSPICE
Fig. 13. Corrientes a la Entrada del Regulador de Tensión
Fig. 10. Tensiones de Fase a la Entrada del Regulador de (LINHA_PRIX28760949_960319). VHDL-AMS vs
Tensión (LINHA_PRIX28760949_960319). VHDL-AMS vs SIMULINK.
PSPICE. 4
x 10
4 1,5
x 10
1,5
1
1
0,5
Tensión (V)
0,5
Tensión (V)
0
0
-0,5
-0,5
-1
-1
-1,5 -1,5
0,5 0,505 0,51 0,515 0,52 0,525
0,5 0,505 0,51 0,515 0,52 0,525
Tiempo (s) Tiempo (s)
Va VHDL-AMS Va SIMULINK Va VHDL-AMS Va PSPICE
Vb VHDL-AMS Vb SIMULINK Vb VHDL-AMS Vb PSPICE
Vc VHDL-AMS Vc SIMULINK Vc VHDL-AMS Vc PSPICE
Fig. 11. Tensiones de Fase a la Entrada del Regulador de Fig. 14. Tensiones de Fase a la Salida del Regulador de
Tensión (LINHA_PRIX28760949_960319). VHDL-AMS vs Tensión (LINHA_PRIX9603201_960320). VHDL-AMS vs
SIMULINK. PSPICE.
38
CHAPTER 2. AUTOMATION
4
x 10
1,5
Las tablas 1 hasta 8 presentan una descripción cuantitativa
1
comparativa para los resultados de las simulaciones
presentados en las Fig. 2 hasta Fig. 17. En estas tablas el
0,5
valor eficaz (RMS) es calculado para el estado estacionario,
Tensión (V)
Fase
Fase
0,5 0,505 0,51 0,515 0,52 0,525 estacionario (V) Métodos de estacionario (A) Métodos de
Tiempo (s) VHDL-AMS PSPICE Simulación VHDL-AMS PSPICE Simulación
Va VHDL-AMS Va SIMULINK A 7.320,20 7.321,80 0,022% A 72,88 73,25 0,508%
Vb VHDL-AMS Vb SIMULINK B 7.364,30 7.363,40 0,012% B 72,47 72,84 0,507%
Vc VHDL-AMS Vc SIMULINK C 7.356,20 7.361,50 0,072% C 72,63 73,04 0,563%
Fig. 15. Tensiones de Fase a la Salida del Regulador de
Tensión (LINHA_PRIX9603201_960320). VHDL-AMS vs
SIMULINK. Tabla 2. Tensiones de fase y corrientes de la línea que
600 alimenta a la carga con la peor regulación de tensión para
VHDL-AMS Y SIMULINK
400
Valor RMS de estado %Error Valor RMS de estado %Error
Fase
Fase
estacionario (V) Métodos de estacionario (A) Métodos de
200 VHDL-AMS SIMULINK Simulación VHDL-AMS SIMULINK Simulación
Corriente (A)
Fase
Fig. 16. Corrientes a la Salida del Regulador de Tensión B 7.377,00 7.374,10 0,039% B 105,05 105,22 0,163%
(LINHA_PRIX9603201_960320). VHDL-AMS vs PSPICE. C 7.369,00 7.372,60 0,049% C 98,78 98,86 0,085%
600
Tabla 4. Tensiones de Fase y Corrientes de la Línea que
400 Alimenta a la Única Carga No Lineal Desequilibrada
para VHDL-AMS y SIMULINK
200
Corriente (A)
Fase
Fase
39
CHAPTER 2. AUTOMATION
Tabla 6. Tensiones de Fase y Corrientes a la Entrada del Se debe resaltar que los resultados obtenidos en este artículo
Regulador de Tensión para VHDL-AMS y SIMULINK servirán como base para el desarrollo de una Arquitectura de
Valor RMS de estado %Error Valor RMS de estado %Error
Simulación en Tiempo Real y Control (ASTR&C) para el
alimentador de distribución de energía eléctrica, con el fin de
Fase
Fase
estacionario (V) Métodos de estacionario (A) Métodos de
VHDL-AMS PSPICE Simulación VHDL-AMS PSPICE Simulación analizar la calidad de la energía y mejorar las acciones de
A 8.239,90 8.238,60 0,016% A 403,98 405,29 0,324% control en los sistemas de distribución, buscando de esta
B 8.232,70 8.228,20 0,055% B 405,64 406,98 0,329% forma aumentar la confiabilidad y sostenibilidad del sistema
C 8.280,50 8.283,40 0,035% C 400,08 401,49 0,351% eléctrico de potencia. Esta arquitectura utilizará los lenguajes
de descripción de hardware VHDL-AMS, para la simulación
en tiempo real, y VHDL para el desarrollo de un sistema de
Tabla 7. Tensiones de Fase y Corrientes a la Salida del gerenciamiento de la distribución y control.
Regulador de Tensión para VHDL-AMS y PSPICE
Valor RMS de estado %Error Valor RMS de estado %Error Ambos lenguajes VHDL y VHDL-AMS, junto con la
Fase
Fase
estacionario (V) Métodos de estacionario (A) Métodos de información del sistema de distribución de energía eléctrica,
VHDL-AMS PSPICE Simulación VHDL-AMS PSPICE Simulación permitirán la simulación en tiempo real y el control de los
A 7.414,50 7.414,20 0,004% A 403,98 405,29 0,324% alimentadores de distribución de energía eléctrica.
B 7.459,40 7.456,50 0,039% B 405,64 406,98 0,329%
C 7.451,00 7.454,60 0,048% C 400,08 401,49 0,351% Desde el punto de vista de la docencia e investigación, este
trabajo se constituye en un enlace entre los sistemas
eléctricos de potencia (redes de distribución de energía
Tabla 8. Tensiones de Fase y Corrientes a la Salida del
eléctrica) y los sistemas electrónicos que utilizan lenguajes de
Regulador de Tensión para VHDL-AMS y SIMULINK
descripción de hardware (lenguaje VHDL-AMS). De esta
Valor RMS de estado %Error Valor RMS de estado %Error forma, se utilizó una herramienta de modelado y simulación,
Fase
Fase
estacionario (V) Métodos de estacionario (A) Métodos de diseñada originalmente para describir sistemas analógicos,
VHDL-AMS SIMULINK Simulación VHDL-AMS SIMULINK Simulación digitales y de señal mixta (VHDL-AMS), para una aplicación
A 7.414,50 7.414,50 0,000% A 403,98 400,09 0,972%
focalizada a los sistemas eléctricos de distribución.
B 7.459,40 7.457,89 0,020% B 405,64 401,67 0,988%
C 7.451,00 7.436,54 0,194% C 400,08 395,91 1,052% AGRADECIMIENTOS
40
CHAPTER 2. AUTOMATION
41
CHAPTER 2. AUTOMATION
1. INTRODUCTION
42
CHAPTER 2. AUTOMATION
Primary section
Secondary section
Fig. 2. Variable volume ratio
Fig. 3. Standard linear motor
front view
Secondary section
Primary section
Primary section
V2 1
BV R = = , (1)
V1 χ
43
CHAPTER 2. AUTOMATION
3.2 Rotation
Clearance volume
working chamber. The general equation for torque balance
Inlet closed
0.6 at the motor shaft can be written for the prototype as
follows:
Tel = Tj + Tf ric , (4)
0.4 where Tel electromagnetic torque, Tj - inertia torque and
Tf ric - friction torque. The moment of inertia defining
0.2 the inertia torque is the sum of moments of inertia of all
components in the rotation train: the servo motor, the
0.0 piston and shafts. These can be calculated using the sizes
0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 of the components used. These components rotate about
×10−3 the same axis and are have cylindric shape. The moment
Volume, m3 of inertia of a hollow cylinder is:
Fig. 5. pV-diagram 1 2 2
J = m(Dout + Din ), (5)
8
where m is the mass of the rotating component, Dout -
the outer diameter, Din - the inner one which is equal to
6 zero for shafts and the servo motor rotor. The shaft mass
and the diameter is relatively small and therefore can be
5 neglected.
44
CHAPTER 2. AUTOMATION
45
CHAPTER 2. AUTOMATION
. position, v = 0 m/s
Piston at the extreme left Piston at the extreme. left position, ω = ω0
no φ > k4 yes
φ - rotation angle,
x - displacement
Tem = 0 Tem = −Tnom
Tem = Tnom
Fem = −Fdelp + Ff ric Fem = Fnom
Tem = −Tnom
Fem = −Fnom ∗ k3 Fem = Fnom
no ω < ω0 yes
Piston at the extreme right position, v = 0 m/s
Tem = −Tf ric
Fig. 8. Piston linear movement algorithm
4.4 Rotation control
Piston at the extreme right position, ω = ω0
Intake: While the piston starts to move away from
its extreme left position, the rotation velocity is ω0 . By
applying a decelerating torque of −Tnom at the moment Fig. 9. Piston rotation algorithm
defined by k4, the rotation decreases, ideally down to
zero (k5 = 0), when the skirt opening is aligned with Deceleration After the inlet is closed (φ > βport +βopen ),
the inlet opening. The resulting speed can be adjusted by the piston rotation speed needs to be reduced until it
the coefficient k5 if the rotation should not be completely reaches ω0 , then the piston rotates with a constant speed.
stopped but just reduced to allow a higher piston response. The torque applied from the servo motor is equal to the
This is an open loop control since a high positioning one caused by friction. At the end of the stroke, a PI-action
accuracy is not required. can be applied for a better accuracy.
The piston travels to the right without rotation. When The resulting velocity profile vs. time is shown on Fig.12.
the displacement reaches a certain value defined by k6, It can be seen that the higher the inlet pressure, the faster
the rotation is accelerated with a torque of Tnom in the piston reaches its maximal translation speed, so a lower
order to close the inlet port when it is dictated by the brake force is needed to decelerate it until the extreme
thermodynamic model. right position.
46
CHAPTER 2. AUTOMATION
×106 7
1.0 Inlet pressure 0.6 MPa Inlet pressure 0.6 MPa
Inlet pressure 1.0 MPa 6 Inlet pressure 1.0 MPa
Clearance volume
0.8 Inlet closed 5
Pressure, Pa
Speed, m/s
0.6 4
3
0.4 Swept volume
2
0.2 1
0.0 0
0.0 0.2 0.4 0.6 0.8 1.0 1.2 0.00 0.02 0.04 0.06 0.08 0.10 0.12
Volume, m3 ×10−3 Time, s
Fig. 10. Piston velocity vs. displacement Fig. 12. Piston velocity vs. displacement
π
7 2
Inlet pressure 0.6 MPa Rotation accelerates
6 Inlet pressure 1.0 MPa 3 Rotation decelerates
8π
Rotationangle ϕ, rad
1 Leading edge
4 4π
3 1
8π
Trailing edge
2 Inlet port
0
1
0 − 18 π
0.00 0.05 0.10 0.15 0.20 0.25 0.00 0.05 0.10 0.15 0.20 0.25
Displacement, m Displacement, m
Fig. 11. Piston velocity vs. displacement Fig. 13. Piston velocity vs. displacement
Two inlet pressures 1.0 MPa and 0.6 MPa are compared
and the configuring factors are identified (Fig. 10).
π
By performing such simulations within the expected inlet 2
Inlet pressure 0.6 MPa
pressure range with a certain step, a matrix of these Inlet pressure 1.0 MPa
Rotation angle ϕ, rad
1
4π
4.5 Parameters description
47
CHAPTER 2. AUTOMATION
5. RESULTS AND DISCUSSIONS Imran, M., Usman, M., Park, B.S., and Lee, D.H. (2016).
Volumetric expanders for low-grade and waste heat re-
The developed model is based on parameters of the se- covery applications. Renewable and Sustainable Energy
lected linear motor and the servomotor and allows to Reviews, 57, 1090–1109.
synchronize the rotation with the linear movement of the Lemort, V., Quoilin, S., Cuevas, C., and Lebrun, J. (2009).
expander. By decreasing the linear speed during the intake Testing and modeling a scroll expander integrated into
phase, the pressure loss at the inlet can be reduced. How- an organic rankine cycle. Applied Thermal Engineering.,
ever, reduced translation speed means lower frequency and 29, 3094–3102.
the mass flow rate through the expander, so the optimum Mikalsen, R. and Roskilly, A. (2008). The design and sim-
has to be found (Fig. 12). ulation of a two-stroke free-piston compression ignition
engine for electrical power generation. Applied Thermal
The rotation of the piston can be increased or decreased
Engineering, 28(56), 589–600.
when necessary for a better fit of the inlet. A moderate
Petrichenko, D., Tatarnikov, A., and Papkin, I. (2015).
rotating acceleration/deceleration is achievable in combi-
Approach to electromagnetic control of the extreme
nation with the piston deceleration during the intake. The
positions of a piston in a free piston generator. Modern
final deceleration must be adjusted so the leading edge of
Applied Science, 9(1), 119–128.
the skirt opening reaches the outlet port at the end of
Tran, X.B. and Yanada, H. (2013). Dynamic friction
the stroke. In this case this angle is π/2 (Fig. 13). Two
behaviors of pneumatic cylinders. Intelligent Control
rotation/translation profiles for different inlet pressures
and Automation, 4(2).
are shown on Fig. 14
The shape of the inlet port needs to be adjusted to Appendix A. NOMENCLATURE
”follow” an optimal intake profile in order to keep the
inlet intersection area around its maximum during the A.1 Latin characters
intake phase. Otherwise higher pressure ratios will require
high dynamics from electromagnetic train or will cause D diameter (m)
relatively high intake losses. F force (N)
L length (m)
The presented model is focused on the inlet control and m mass (kg)
therefore simplified by setting the outlet pressure as con- n number of ports (-)
stant. p pressure (Pa)
S intersection area (m2 )
6. CONCLUSIONS
t time (s)
V volume (m3 )
The dynamics of the system depend on the linear generator
x displacement (m)
used. Industrial linear motors are characterized by large
moving masses. It is possible to reduce the weight of
the piston if magnets are directly attached to it. A high A.2 Greek characters
dynamics of the piston movement is necessary to increase
β port angle (rad)
the resulting frequency and the volumetric flow rate of the
∆ difference (-)
machine. A relatively high static force is required to keep
υ speed (m/s)
the piston under control at its extreme positions. The use
φ angle of rotation (rad)
of a position encoder ensures the high accuracy of the inlet
ϕ filling factor (-)
timing, which is crucial for the system efficiency.
χ relative displacement (-)
The proposed system contains no bouncing devices such
as gas- or mechanical springs, which are typically used A.3 Subscript
in free piston machines. Instead, the piston movement
is fully controlled, so its velocity becomes zero at both cyl cylinder
extreme positions. A higher system efficiency is expected dis discharge
since mechanical wear or thermodynamic irreversibilities el electromagnetic
are avoided. fr friction
max maximal
REFERENCES min minimal
Bell, I.H., Wronski, J., Quoilin, S., and Lemort, V. (2014). open opening
Pure and pseudo-pure fluid thermophysical property port port
evaluation and the open-source thermophysical property rot rotation
library coolprop. Industrial & engineering chemistry su supply
research, 53(6), 2498–2508.
Gusev, S., Ziviani, D., De Viaene, J., Derammelaere, S.,
and van den Broek, M. (2016). Modelling and prelimi-
nary design of a variable-bvr rotary valve expander with
an integrated linear generator. In Proceedings of the
17th International Refrigeration and Air Conditioning
Conference at Purdue.
48
CHAPTER 2. AUTOMATION
Abstract:
The clinker production process is mainly affected by the quality of the raw material. The
chemical composition required for the formation of clinker is giving by the mixing of different
limestone sources, clay, and iron ore. The optimal combination may be determined in principle
by linear programming techniques, but the variation of the quality of limestone mines is high, so
it is necessary to adjust the mix online, by using transport mechanisms which are discrete and
sharing other resources. The feed to the mill is divided into two processes : crushing and forming
stacks ( pre- homogenization ) and the mixture preparation online in the mill for forming flour
( homogenization ).
The goal is to design mechanisms that allow the proper proportions of minerals to the furnace
inlet through programming online resources for pre–homogenization : the limestone quarries,
trucks, crusher and transport mechanism of the raw material to deposit. Supervisory control
techniques are used, and models are constructed as discrete event systems to ensure that
the mixture is as homogeneous as possible over time. A particular architecture (Holonic
architecture), it is used in the solution; which allows an easy and effective implementation of
an on-line supervisor. Supervisor uses the material existence and cost knowledge. A reference is
made to an application that is a Discrete Event System model interpreter for the implementation
of the supervisor.
1. INTRODUCCIÓN raw mill and at this stage a first mixture of stone from
different sources is performed to obtain a limestone
The cement industry is critical in the development of con- (CaO) pre - homogenizing. The mixture is made to
temporary society to be basic raw material in the construc- comply with predetermined percentages of limestone.
tion industry . The central element in the manufacture of (3) Homogenizing. It makes a mixture of limestone (pre-
cement is the clinker production, obtained by calcining a homogenizing) with iron ore and clay to achieve the
mixture mainly of: Calcium Oxide (CaO), silicon dioxide ideal blend, and in some cases corrective limestone is
(SiO2 ), Aluminum Oxide (Al2 03 ), and iron oxide (F e2 03 ) added. A stage of homogenization process is the raw
found in limestone, clay and the iron ore. The process for milling , where minerals are reduced to a diameter of
Portland cement production is shown in Figure 1, and it millimeters, so that the mixture is given particle level.
is divided into the following stages: Then, this milled mixture is carried deposits where
homogenization of the mixture is complete, and then
(1) Conveying limestone; it is transported to warehouses fed to the furnace. The product obtained is the raw
in the cement plant and placed in cells having the meal, which feeds the oven.
same quality. (4) Clinkering, consisting in transforming the mixture (
(2) Crushing and pre-homogenizing, which reduces the raw meal ) in clinker by a cooking process; and finally,
size of the limestone rocks that can be used by the (5) Cement production. By grinding the clinker and
adding other products like gypsum, limestone, poz-
1 Thanks to the Senescyt of Ecuador who partially funded the
zolan, among others, that give texture and resistance
project and Cementos Guapán in Azogues, Ecuador for the infor-
mation provided.
49
CHAPTER 2. AUTOMATION
to use. The product obtained is distributed in bags quality obtained in the crushing process; supervisory con-
(50 kg), or in bulk. trol mechanism associated with the coordination of the
Limestone
stages.
Limestone
Clay Clay
Quarring raw
Iron ore
Processing Raw Iron ore Preparation of raw meal
(grinding −
The paper is organized as follows: the first section presents
materials Materials (Crushing)
homogenization)
roughly the methodology to be used ( This has been proven
in commercial applications for companies in Merida,
Raw meal
Gypsum Venezuela and Medellin, Colombia ) for the behavioral
model of the process, in particular the model of the prod-
Cement production
Packaging / Distribution
cement
Clinker
Rotary Kiln
(clinkerization)
uct routing. In the second section, the functional units
(Grinding)
and the associated IDEF diagrams are defined. In a third
section the product route and Petri nets models associated
Fig. 1. General Process flow for the Production of cement. with it and the functional units are shown, and as the
Modified from Huntzinger and Eatmon [2009] model of behavior of the process is determined. A fourth
unit shows how the pattern of behavior found is the model
When having the homogenized mixture, the mixture qual- used for planning, programming, monitoring the process.
ity will determine its performance in the clinkering process. Finally results and future work are presented. The results
The oven has a continuous operation, so the input product were found using a software tool developed by Janus Sys-
must maintain a constant flow. The flow is determined by tems Merida.
the oven capacity product quality at its input.
Input limestone comes from different quarries and has a 2. PRODUCTION SYSTEMS MODELING USING
high variability in their chemical composition and moisture DISCRETE EVENT SYSTEMS TECHNIQUES
levels , not only for its origin from different quarries, but
also by the geological formation of the quarries. Limestone To obtain a useful model production we require that serves
has an economic value according to their origin. The issue to identify the model; in our case the models are used
is to maintain an ideal choice for a flow that is determined for: Planning, Programming, Supervision, Coordination,
by the furnace capacity to process raw meal mix at the Monitoring of the production process. So far the useful
lowest cost. The cost includes the cost of raw material and descriptions thresholds used to detect conditions of the
processing cost in the oven. The cost of processing furnace system; or the occurrence of events that can initiate,
varies according to the quality of the raw meal. continue, enable, stop a process. This leads us naturally
An ideal mixing quality for the oven, which is feasible to the use of a description based on discrete event system,
with available feedstock is established. It seeks to get that regardless of the nature or form of processing that has
quality over time constantly mixing raw material from each of the entities making up the production process. A
different quarries through dynamic allocation of resources Dynamic Discrete Event System is a system whose dy-
and routes for forming pre-homogenizing limestone crush- namics is defined by changes in discrete variables, and this
ing corrective clay and preparation of corrective limestone, dynamic is driven by the occurrence of events. Within the
in amounts sufficient to maintain the constant flow of formal techniques for the representation of Discrete Event
feeding to the furnace. The second step is feeding the mill Systems to find those based on Finite State Automata
uses a shared resource that is the transport system inputs, Ramadge and Wonham [1989], Wonham [2014] and Petri
it goes from the tank pre-homogenized until the hoppers nets DAVID and ALLA [2001], David and Alla [2005].
at the entrance to the mill. Discrete Event Systems allow modeling the behavior of
systems at different levels of the plant, from the plant floor
Planning, programming, monitoring and even control pro- allowing applications for PLC programming Uzam et al.
duction processes, can be obtained from models that re- [1996], Fabian and Hellgren [1998], Zhou and Twiss [1998];
flect the behavior of the system in each of the above at the supervisory level, in the description and supervisory
situations. Everything based on the basic objective is to control of batch processes Andreu et al. [1994, 1995],
produce, and which focuses on a request from a customer Viswanathan et al. [1998], Tittus and Åkesson [1999b,a],
that accrues on, either in a delivery in warehouse or a Tittus and Lennartson [1999], supervision of hybrid sys-
production order in production or both. In most cases, tems Antsaklis et al. [1998], Lemmon et al. [1999]; for
when planning in a production process it is assumed for levels of management, modeling and implementation of
convenience a normal operation behavior; if required to do business processes van der Aalst [1998], van Der Aalst
planning online, then you need to know the state of the and Van Hee [2004], van Der Aalst et al. [2003a,b]; and
process in order to generate an acceptable plan and, this integration between different levels Vernadat [2014], ISA-
is nothing more than take the product model (product 95 [2000], Chacón et al. [2008].
route) and map on the set of available process equip-
ment, incorporating physical constraints, considerations As we see, there are many authors who use DES as natural
interconnection delays, which sets the schedule of when models to describe the production process (logical part), it
an order should start and completion date. Similarly, it is necessary to establish a production process configuration
requires a model for supervisory control schemes, whether (physical part) so it can be executed. Figure 2 shows
coordination between units or supervision in each unit the procedure for establishing a configuration and run
(resource). It is necessary to establish a planning mech- production activities, which is similar to the procedure
anism for determining the amount of principal limestone, proposed in Covanich and McFarlane [2009]. From the
clay and limestone corrective to be produced; the online point of view of modeling, we see that is the model of
selection of the limestone quarries that is based on the behavior (route product) which is the generator of the
50
CHAPTER 2. AUTOMATION
entire sequence shown in Figure 2, and therefore of the belonging to the unit. This set of structured models can
Planning, Programming, and Execution. be grouped to form holarchies . A holarchy shows a model
Planing Schedulling Execution
in principle overall ( useful) of the functional unit to
accomplish a target.
This core set of universal behavior patterns in the UF
configuration
Alternatives
alternatives
Supervisor
generation
definition
execution
Holarchy
Resource
liberation
selection
Product
Building
*
Precedence
1 Raw Material Stage StageTransicion
* -time interval * next
* 2.3 Modelling by Petri Nets
0..1 * 1
*
implemented by
Petri Nets can describe the different processes that occur
1..* *
Ability Process Output Event
in a production system in an easy way. We use the
same model for planning and programming activities and
to monitor the actual process. The network locations
Fig. 3. Product Model, Production Model in UML represent us process states, or the resources used, as shown
in Figure 5.
2.2 Description of the behavior of the Functional Unit
Raw material Nill process
51
CHAPTER 2. AUTOMATION
Production
coordination
The behavior of an elementary process can be expressed goal (Amount of material to be processed)
52
CHAPTER 2. AUTOMATION
given in Figure 7. The figure shows the information and determined by the optimization system by using figures in
product flows shown for the two processes analyzed. It the table 3. The crusher works 8 hours/day, 5 days/week
can be realized that pre - homogenization part has a and must process at least the amount of product that will
feeding system that is discret and, the the second part be consumed by the furnace daily taking into account the
(homogenization) has a feeding systems that is by batch. performance of the raw material in the clinkering process.
The output of the whole process should be continuous, and The amount of production is known by a weighbridge
this is achieved by the use of a temporary storage system system. The information and products flows are shown in
between stages. the IDEF Figure 9
Quality parameters information product goal
Production
coordination Schedulling
product quality
goal (Amount of material to be processed)
Mixing bed
Principal limestone
Halls Transport Crushing
Corrective limestone
Corrective clay Crushing Mixing bed
Truck routing construction
limestone A materials
Limestone B corrective limestone
limestone C limestone
materials
sand Raw Material
Truck Crusher Storage
Homogeneization
Truck scale
Homogeneization &
Transport Mixing Milling
Storage
raw meal
transporting belt
Fig. 9. IDEF0 for homogenisation process
Fig. 7. Pre - homogenisation and Homogenization pro- Preparation of the main limestone The objective in the
cesses in the production of Clinker first step is to mix the various limestone quarries for a pre-
homogenised limestone with a percentage of CaO over 70
% (Titration 100).
3.3 Functional units in the first part: Crushing process and
Pre-homogenization processes This process gets a product that, in addition to calcium
oxide, has silicon, aluminum, iron and magnesium with the
The Functional Units in the pre - homogenisation are: values given in Table 3. The product is stored in stacks of
Quarries, Transport Unit, Grinding Unit and Storage Unit. layers of material.
The process should produce three products: the main
SiO2 (%) Al2 03 (%) F e2 O3 (%) CaO
limestone, corrective limestone and, clay corrective. These
products are three of the four inputs to the raw mill. Min 7 1 1 70
The main resources that are used at this stage are the Max 10 4.5 4 75
crusher and trucks. This stage consists of three processes: Table 3. Expected range of chemical compo-
the transport of materials , mixing crushing of materials nents of the limestone mixing bed
and creating layered stacks . The three processes are given
separately and are the result of a plan that specifies the The properties of the materials, according to their origin,
amount of material that has been produced and the quality are given in the table 4.
expected as the IDEF shown in Figure 8.
Cantera SiO2 (%) Al2 03 (%) F e2 O3 (%) CaO (%)
Indicators Production
A 13.1 1.7 0.8 45.2
Homogenization
management goals B 6.9 0.5 0.2 50.7
A1.1
C 2.5 0.7 2.8 45.5
Arcilla 43.3 27.4 11.5 0.5
Main limestone
Expected
Table 4. Chemical composition of limestone in
Crrective limestone
Raw Crushing
Iron ore
Grinding feed
blending
each quarry
Clay
materials
A1.2 A0
Volquetas
Triturador
Recurso colector transporting belt
The estimated amount of %CaO in the limestone pre -
Humano
homogenized is done by the following calculation:
Raw meal non
%CaO1 × M1 + %CaO2 × M2
homogenized
Raw meal
%CaO = (1)
Grinding Homogenization M1 + M2
Moisture and the presence of clay affect the grinding
A1.4 A1.5
Homogeneizador
process, so that when selecting the sources of limestone,
humidity is a new restriction on the crusher. Deficiencies
Fig. 8. Products and information flows in the process of of other chemicals are resolved in the process of feeding
homogenization raw flour mill.
The crusher is the primary constraint and is used to form For the mixing bed in formation, the expected amount
the stacks of main limestone, limestone and corrective of CaO is calculated for each truck using the following
corrective clay. The amount of each of the materials is equation:
53
CHAPTER 2. AUTOMATION
%CaOack−1 M assack−1 + %CaOk M assk Figure 11 shows the behavioral model in Petri Nets for the
%CaOack = (2) production process of the main limestone.
M assack−1 + M assk
limestone A
mill. limestone B
n2
t−duration
m
Principal limestone
Preparation of corrective limestone Corrective limestone Fig. 11. Product Model (Behavior of the production pro-
is limestone with a high percentage of CaO. It will be cess)
used in the raw mill to ensure the required percentages of The behavior model production process limestone is shown
calcium carbonate. If the pre-homogenization process was in Figure 12. We can see that this is just the instantiation
efficient, the amount of corrective limestone must tend to (in detail) of the product model on the equipment, in
zero. order to have the physical process, including capabilities,
restrictions, etc.
4. BEHAVIORAL MODELS OF FUNCTIONAL UNITS
AND PRODUCT IN PETRI NETS
Crushing discharge
management
A1.2.1
Pre-homogenized
Processed Hall
raw material
A1.2.4
Halls
weight measurement
A3
Crushing
A1.2.3
Transportation
Trucks
Weigth Servicios
Fig. 12. Production process that implements the steps of
A1.2.2 service Generales
production Product Model
A3.1 A4
54
CHAPTER 2. AUTOMATION
Corrective limestone
zation maintaining current existence of materials, use of
equipment, hours worked, etc.
The modeling process is relatively easy, and the possibility
Supervisor
Corrective clay
55
CHAPTER 2. AUTOMATION
zación integrada de procesos de producción basada en Vernadat, F. (2014). Enterprise modeling in the context
el enfoque holónico. In XIII CLCA/VI CAC. of enterprise engineering: State of the art and outlook.
Chacón, E., Besembel, I., Rivero, M., and Cardillo, J. International Journal of Production Management and
(2008). Advances in Robotics, Automation and Control, Engineering, 2(2), 57–73.
chapter The holonic production unit: an approach for an Viswanathan, S., Johnsson, C., Srinivasan, R., Venkata-
architecture of embedded production process, 301–314. subramanian, V., and Ärzen, K.E. (1998). Automating
ITech ISA. operating procedure synthesis for batch processes: Part
Covanich, W. and McFarlane, D. (2009). Assessing ease i. knowledge representation and planning framework.
of reconfiguration of conventional and holonic manufac- Computers & chemical engineering, 22(11), 1673–1685.
turing systems: Approach and case study. Engineering Wonham, W. (2014). SUPERVISORY CONTROL OF
Applications of Artificial Intelligence, 22, 1015 – 1024. DISCRETE-EVENT SYSTEMS. University of Toronto.
DAVID, R. and ALLA, H. (2001). On hybrid petri Zhou, M. and Twiss, E. (1998). Design of industrial
nets. Discrete Event Dynamic Systems: Theory and automated systems via relay ladder logic programming
Applications, 11, 9–40. and petri nets. Systems, Man, and Cybernetics, Part C:
David, R. and Alla, H. (2005). Discrete, Continuous, and Applications and Reviews, IEEE Transactions on, 28(1),
Hybrid Petri Nets. Springer. 137–150.
Fabian, M. and Hellgren, A. (1998). Plc-based implemen-
tation of supervisory control for discrete event systems.
In Decision and Control, 1998. Proceedings of the 37th
IEEE Conference on, volume 3, 3305–3310. IEEE.
Huntzinger, D.N. and Eatmon, T.D. (2009). A life-cycle
assessment of portland cement manufacturing: compar-
ing the traditional process with alternative technologies.
Journal of Cleaner Production, 17(7), 668–675.
IDEF (????). Integrated definition models.
http://www.idef.com/.
ISA-95 (2000). Ansi/isa–s95.00.01–2000, enterprise-
control system integration. part 1: Models and termi-
nology. Technical report, ISA.
Johnsson, C. (1997). Recipe-based batch control using
high-level grafchart. Technical report, Department of
Automatic Control, Lund Institute of Technology.
Lemmon, M., He, K., and Markovsky, I. (1999). Super-
visory hybrid systems. Control Systems, IEEE, 19(4),
42–55.
Ramadge, P.J. and Wonham, W.M. (1989). The control of
discrete event systems. Proceedings of the IEEE, 77(1),
81–98.
Tittus, M. and Åkesson, K. (1999a). Deadlock avoidance in
batch processes. In Proc. of 14th IFAC World Congress,
Beijing, PR China, 397–402. Citeseer.
Tittus, M. and Åkesson, K. (1999b). Petri net models in
batch control. Mathematical and Computer Modelling
of Dynamical Systems, 5(2), 113–132.
Tittus, M. and Lennartson, B. (1999). Hierarchical su-
pervisory control for batch processes. Control Systems
Technology, IEEE Transactions on, 7(5), 542–554.
Uzam, M., Jones, A., and Ajlouni, N. (1996). Conver-
sion of petri net controllers for manufacturing systems
into ladder logic diagrams. In Emerging Technologies
and Factory Automation, 1996. EFTA’96. Proceedings.,
1996 IEEE Conference on, volume 2, 649–655. IEEE.
van Der Aalst, W. and Van Hee, K.M. (2004). Workflow
management: models, methods, and systems. MIT Press.
van der Aalst, W.M. (1998). The application of petri nets
to workflow management. Journal of circuits, systems,
and computers, 8(1), 21–66.
van Der Aalst, W.M., Ter Hofstede, A.H., Kiepuszewski,
B., and Barros, A.P. (2003a). Workflow patterns. Dis-
tributed and parallel databases, 14(1), 5–51.
van Der Aalst, W.M., Ter Hofstede, A.H., and Weske, M.
(2003b). Business process management: A survey. In
Business process management, 1–12. Springer.
56
CHAPTER 3
BIOMEDICAL - BIOENGINEERING
CHAPTER 3. BIOMEDICAL - BIOENGINEERING
Abstract: Se presenta en este artículo el diseño del concepto de un sistema para rehabilitación de la marcha en
infantes con parálisis cerebral, PCI y la metodología para el diseño de la primera etapa: tobillo-pie. La cual involucra
el análisis del miembro inferior para obtener un modelo matemático del mismo, determinando su cinemática y la
trayectoria que sigue el tobillo durante el ciclo de marcha, para encontrar un mecanismo que pueda emular su
movimiento, en este caso fue un mecanismo de cuatro barras. La metodología incluye el planteamiento de un
problema de optimización numérica para encontrar la síntesis dimensional del mecanismo utilizando un algoritmo
metaheurístico. La mejor solución obtenida se implementó en CAD, el análisis de funcionamiento mediante
simulación permitió evaluar su funcionalidad como dispositivo de rehabilitación y determinar la factibilidad de
construcción del sistema. Adicionalmente, el modelo en CAD se enlazo con un software de análisis matemático para
realizar un control de posición para evaluar su operación y desempeño.
58
CHAPTER 3. BIOMEDICAL - BIOENGINEERING
En los grandes Centros de Rehabilitación en América Latina, del tobillo; este mecanismo está unido a un sistema de
ya se utilizan equipos de rehabilitación automatizados como sujeción de rodilla para impedir que el paciente rote de
los mencionados, sin embargo se ha detectado la necesidad manera incorrecta la rodilla al momento de realizar la terapia
de desarrollar sistemas propios, especialmente para la y emular su movimiento real al caminar.
rehabilitación en pacientes con edades entre 2‐12 años, que
El sistema tiene un soporte de peso corporal, el cual además
dadas sus características, su rehabilitación no siempre es
permite a la cadera del paciente realizar la trayectoria
eficaz utilizando los equipos actuales y sus adecuaciones.
correcta en el plano transversal al caminar (movimiento
arriba y bajo). También tiene un dispositivo en forma de
Tomando en cuenta esta necesidad, el grupo de investigación
pechera para mantener al paciente erguido, junto con un
que presenta este artículo, lleva tres años trabajando en este
mecanismo que apoya al paciente para que realice el balanceo
proyecto. La primera parte del desarrollo, fue la
correcto de los brazos y así llevar a cabo el movimiento
investigación inicial del problema a partir de la información
correspondiente al tronco durante el proceso de marcha. Los
recopilada de diferentes fuentes; pediatras, ortopedistas,
mecanismos de cadera, soporte de peso corporal, balanceo de
terapistas y pacientes, se realizó el diseño del concepto del
brazos y pechera están unidos a una estructura de soporte
sistema de rehabilitación, el cual se presenta en Figueroa et
común.
al. (2014). Inicialmente, se desarrolló la etapa que
corresponde al tobillo-pie. Para ello se obtiene la cinemática
directa del miembro inferior, luego la trayectoria que sigue
el tobillo a partir del desplazamiento angular de las
articulaciones de rodilla y cadera en el plano sagital y de los
parámetros estructurales de la extremidad inferior, y con
base en esta trayectoria se elige un mecanismo capaz de
reproducirla. En Figueroa (2016), se presentó un primer
diseño a partir de un robot manipulador del tipo PPR, la base
es un robot cartesiano y el grado de libertad rotacional
corresponde al efector final de tipo placa para el pie. Se
realizaron varias pruebas sobre este prototipo y se optó por
evaluar otras opciones para reducir el número de actuadores
y suavizar la trayectoria a seguir, utilizando el diseño
concurrente para lograrlo.
En este artículo se presenta el nivel de desarrollo que se ha
alcanzado al considerar un nuevo mecanismo para esta
etapa, se propuso el uso de un mecanismo de cuatro barras,
debido a que su topología permite la generación de
trayectorias que corresponden al movimiento de interés para
la rehabilitación del tobillo y solo utiliza un actuador, luego Figura 1. Diseño del Concepto, Sistema de Rehabilitación
se presenta la síntesis dimensional del mecanismo, que se para Infantes
realiza a partir de una metodología basada en el diseño
óptimo. El problema de diseño se plantea como un problema La metodología propuesta para el desarrollo del Sistema de
de optimización numérica y se resuelve mediante el uso de Rehabilitación de Marcha se presenta en el diagrama de flujo
técnicas heurísticas; los resultados obtenidos determinan las de la Figura. 2. Se basa en el diseño concurrente, donde las
dimensiones del mecanismo seleccionado, se presenta el etapas de diseño interactúan entre sí, de tal forma que las
diseño en CAD del mecanismo, la evaluación de su limitaciones del funcionamiento de un sistema son
operación y su esquema básico de control. determinadas tanto por sus componentes, como por sus
interconexiones, Portilla (2006). Así, una representación
En la sección 2 se presentan las generalidades del diseño del matemática (cinemática y dinámica) del sistema que contenga
concepto del sistema de rehabilitación. En la sección 3 la toda la información relevante, permite definir el proceso de
metodología para el diseño de la estructura de la etapa síntesis. El enfoque concurrente integra varias estrategias de
tobillo-pie. En la sección 4, se evalúa el diseño a partir de un diseño, incluyendo el diseño mediante uno o varios
modelo en CAD que se enlaza con un sistema de control de problemas de optimización numérica, que son resueltos a
posición desarrollado en Simulink de MatLab. Las partir de diversas técnicas heurísticas, dada la complejidad de
conclusiones del trabajo se presentan en la sección 5. su solución. Así como verificar el diseño realizado, utilizando
prototipos virtuales y simulación en tiempo real, a partir de
2. SISTEMA DE REHABILITACIÓN DE MARCHA diferentes configuraciones como “software-in-the-loop”,
Isermann, (2008).
Una generalización del diseño del concepto del sistema de
rehabilitación de marcha, desarrollado en Figueroa (2016), se La etapa correspondiente a tobillo-pie, se diseñó con la
sintetiza en la Figura 2. El sistema está compuesto por un metodología presentada y tiene la finalidad de reproducir el
mecanismo para cada pie, que le proporciona al paciente la movimiento natural del miembro inferior durante la
capacidad de realizar el seguimiento de la trayectoria natural locomoción en el plano sagital, tomando como referencia el
59
CHAPTER 3. BIOMEDICAL - BIOENGINEERING
tobillo. A partir de la cinemática del miembro inferior, se dónde l1 y l2 son los distancias cadera-rodilla y rodilla-
obtuvo la trayectoria que el tobillo sigue en el plano sagital y tobillo, respectivamente y q1 y q2 son los ángulos de rotación
con base en esta se seleccionaron los mecanismos idóneos respecto al eje Z, de la cadera y la rodilla respectivamente.
para seguirla. Con técnicas de optimización numérica se
encontraron las dimensiones adecuadas del mecanismo que se 2.2 Trayectoria que sigue el tobillo.
diseñó en CAD, como se presenta a continuación.
La marcha es una sucesión de movimientos que se realiza de
manera repetitiva en el denominado ciclo de marcha, dicho
ciclo se genera en tres fases: 1) fase de apoyo, que representa
el 60% del ciclo y en la cual el pie de referencia está tocando
el suelo mediante el talón; 2) fase de balanceo, que representa
un 40% del ciclo y en donde el pie de referencia no toca el
suelo, mientras que el pie contralateral está oscilando; y 3)
fase de doble apoyo, donde los dos pies tocan el suelo, es
decir; el pie de referencia está en contacto inicial del talón
mientras que el pie contralateral está en fase de despeje,
representando el 10% de la fase de apoyo y el 10% de la fase
de balanceo, como se presenta en Benedetti, et al. (1998).
Con el fin de generar una trayectoria como la que realiza el
tobillo durante la locomoción humana proyectada en el plano
sagital, tomando como datos de base los ángulos formados
por las articulaciones de cadera y rodilla durante un ciclo de
marcha estándar reportados en Benedetti, et al. (1998), y
utilizando la herramienta interpolación shape- preserving
Interpolat, que está incluida en el software Matlab R2013b®,
se generaron los polinomios con los cuales es posible calcular
los ángulos de movimiento de la articulación de la cadera (q1)
y de la rodilla (q2) al realizar una marcha normal. Los
polinomios están parametrizados a partir del porcentaje del
ciclo de marcha, x, lo que permite encontrar el valor de los
Figura 2. Metodología de diseño utilizada para el desarrollo ángulos (q1) y (q2) en cualquier etapa del ciclo de marcha,
del sistema de rehabilitación de marcha como se observa en (3) y (4).
60
CHAPTER 3. BIOMEDICAL - BIOENGINEERING
61
CHAPTER 3. BIOMEDICAL - BIOENGINEERING
D2 l2 sin(2 2 ) (22)
D3 r2 sin(2 ) l3 sin(3 3 ) (23)
D4 r1 sin(1 ) l4 sin(4 ) (24)
El Lagrangiano se reescribe entonces como,
1
L (2 ) 22 mi gDi (25)
2
62
CHAPTER 3. BIOMEDICAL - BIOENGINEERING
63
CHAPTER 3. BIOMEDICAL - BIOENGINEERING
Figura A1. Vista Superior Figura A1. Vista Lateral Figura A1. Frontal
64
CHAPTER 3. BIOMEDICAL - BIOENGINEERING
65
CHAPTER 3. BIOMEDICAL - BIOENGINEERING
* School of Electrical and Electronic Engineering, Universidad del Valle, Cali, Colombia
(Tel: +57-2-4028239; e-mail: sergey.gonzalez@correounivalle.edu.co).
** School of Electrical and Electronic Engineering, Universidad del Valle, Cali, Colombia
(e-mail: jose.ramirez@correounivalle.edu.co)
Abstract: The research considers an assisted rehabilitation platform with a lower limbs exoskeleton, it
has a crane to support the weight of patient and exoskeleton, and also, it includes a control system to
synchronize the patient´s position on treadmill. This paper focuses on development of drive system for
trajectory tracking control based on angular positions in a lower limb exoskeleton walking on a treadmill.
It describes the logic states for servo-controllers algorithm and human-machine interface development to
manage the drive system. It is presented experimental results under different operating conditions and
performance analysis of the system.
Keywords: Assisted, drive, exoskeleton, gait, human-machine interface, joint, platform, rehabilitation,
tracking, trajectory.
66
CHAPTER 3. BIOMEDICAL - BIOENGINEERING
67
CHAPTER 3. BIOMEDICAL - BIOENGINEERING
In communication frame, three data are sent to the servo- The program into servo-controllers has a structure of state
controller: the angle value Array[i], the position in the array machine, Fig. 5.
to store the data, i, and the amount of data in the trajectory
array, length(Array). With this information into servo- The states are activated by the action on the button panel of
controller, the algorithm can reconstruct and store in a new engineering interface. Following, the states are described:
array the path that was sent. Then, the servo-controller of
right knee is specified as master and the other slaves. The
master role is to lead the synchronization of movement of the
other servo-motors, since, the slaves servo-controllers begin
to read the trajectories arrays and to apply the angles values
when the master indicates.
In the routine for reading trajectories arrays, the program
looks up the angle value in the array and replace it in Eq. (1),
then it executes the movement, Angulomotor , and moves to the
next angle of the array. To this process, it is configured a
runtime called sampling time or reading, tm .
100
Angulomotor [i ] AngularGain Array i * Offset (1)
360
Fig. 5. State machine for servo-controllers program.
The relationship between i and gait cycle time, t gait , is
given by Eq. (2). Software initialized: The servo-controllers are turned on and
the human-machine interface is running. It is loaded the
t gait (tm )i (2) operation settings of the servo-controllers, such as digital
inputs and outputs, parameters velocity and acceleration,
AngularGain, Offset and sample time are control parameters home position, etc. In this state, when servo-controllers are
that can be modified from interface and then are sent to the turned on, the angular position of joints is calibrated to zero
servo-controllers via RS-485 communication. The and thus, servo-motors are positioned in an initial condition
AngularGain parameter varies the step length, the Offset before turn. Also, it is allowed to activate the command
corrects the posture angle of the patient and sample time Trajectory load.
governs the cycle gait.
Angular trajectory loaded: When the control command,
The time of cycle gait, t gait , is Eq. (3). Trajectory load, is executed, the angular trajectories are sent
from the interface to the servo-controllers via RS-485. The
2 Lp arrays are reconstructed by servo-controller´s program and is
t gait (3) waited the full send of trajectories. It is allowed to activate
Vm the commands Trajectory load and Start walk cycle.
Where, L p is the step length and Vm is gait speed and in Exoskeleton is walking: When executing the command, Start
walk cycle, the exoskeleton performs walking cycles and only
compliance with limits, tm , it has Eq. (4). in this state, the user can send control parameters: walking
time, walk cycles, offsets and angular gains.
68
CHAPTER 3. BIOMEDICAL - BIOENGINEERING
To perform gait movements, the Fig. 6 shows the algorithm right knee and the others are slaves. The master handles a
executed inside of servo-controllers. digital output and slaves handle a digital input, the
connection is shown in Fig. 3.
The first three steps in algorithm, the servo-motors move to
an initial position; this is necessary for smooth movement The synchronization is done through a digital pin and its
and ramp up without generating the follow error (Harmonic status is evaluated: the master and slave load angle value in a
Drive AG 2010b) in the servo-controller, and avoid strong motion buffer (Harmonic Drive AG 2010b); then, the master
changes in the angular acceleration. This generates the gait generates a high state to be read by slaves and thus apply
initial step in the exoskeleton. It is allowed to activate the movement. In this way, it is achieved that the servo motors
commands Finish walk cycle, Pause and Stop. move synchronously due to management master.
69
CHAPTER 3. BIOMEDICAL - BIOENGINEERING
and walk cycles to generate the movements; the upper left Rigth hip angle
data for hip and knee joints, and with a sampling time of
-30
15 (ms) . With different sampling times, the tracking
trajectory of each servo-motor with the reference is -40
10[ms] - Sample Time
compared. The Fig. 9 and Fig. 10 show that angular -50 20[ms]
trajectories tracking without load by each servo-motor is 25[ms]
30[ms]
successful. -60
35[ms]
Target values
Left hip angle -70
0 0.5 1 1.5 2 2.5 3 3.5
Angular position [Deg]
70
CHAPTER 3. BIOMEDICAL - BIOENGINEERING
7. REFERENCES
Duschau-Wicke, A. et al., 2010. Path Control: A Method for
Patient-Cooperative Robot-Aided Gait Rehabilitation.
IEEE Transactions on Neural Systems and
Rehabilitation Engineering, 18(1), pp.38–48.
Emken, J.L. et al., 2008. Feasibility of Manual Teach-and-
Replay and Continuous Impedance Shaping for Robotic
Locomotor Training Following Spinal Cord Injury. IEEE
Transactions on Biomedical Engineering, 55(1), pp.322–
334.
González-Mejía, S., 2014. Diseño e implementación de una
estrategia de control para equilibrio en posición bípeda
de un exoesqueleto de miembros inferiores. Master
thesis, Universidad del Valle.
González-Mejía, S. & Ramírez-Scarpetta, J.M., 2014. Design
and Implementation of a control strategy for static
balance of a lower limbs exoskeleton. Ingeniería y
Competitividad, 16(1), pp.229 – 238.
Harmonic Drive AG, 2010a. Product Documentation SC-610
Series Servo Controllers. Available at:
http://www.harmonicdrive.de/english/products/servoprod
ucts/servo-controller/sc-610/product-
documentation.html.
Harmonic Drive AG, 2010b. Set-up-Software Work Bench
v5 for SC-610-Series Servo Controllers. , pp.1 – 35.
Available at:
http://www.treffer.com.br/produtos/harmonicdrive/PDF/
setup_software_sc_610.pdf [Accessed June 1, 2016].
Harmonic Drive AG, 2010c. The CHA Series digital AC
hollow-shaft servo actuator. Available at:
http://www.harmonicdrive.de/english/products/servoprod
ucts/servoactuators/cha/ [Accessed February 8, 2014].
Hussain, S., Xie, S.Q. & Jamwal, P.K., 2013. Robust
Nonlinear Control of an Intrinsically Compliant Robotic
71
CHAPTER 3. BIOMEDICAL - BIOENGINEERING
Abstract: the glucose homoeostasis is responsible for regulating the blood glucose concentration
to stay around of 5.56 mmol/L. However, this regulatory mechanism may be affected by Diabetes
Mellitus (Type 1, Type 2, and gestational) or as a side effect of a critical condition especially
in patients who are at an intensive care unit (ICU). From the control engineering point of view,
there is an approach to deal with pathological conditions leading to the glucose impairment
and is based on the automatic control of insulin infusion in order to avoid dangerous conditions
as hyperglycaemia and hypoglycaemia. This paper describes the design and simulation of a
model predictive controller combined with an Extended Kalman filter for tight glycemic control
of patients at the ICU. From the simulations performed, the controller is able to deal with
the enteral feeding delivered to the patient, which is an unavoidable disturbance in a realistic
scenario. The paper concludes that care must be paid to perform the controller tuning, especially
the matrix related to the state penalty. Finally, future would be directed to the design of
nonlinear model predictive controllers in order to obtain better controller performance.
Keywords: Model predictive control, Intensive care patients, Extended Kalman filter,
output-feedback, tight glycemic control, Diabetes Mellitus.
72
CHAPTER 3. BIOMEDICAL - BIOENGINEERING
73
CHAPTER 3. BIOMEDICAL - BIOENGINEERING
74
CHAPTER 3. BIOMEDICAL - BIOENGINEERING
criterion. The weighting matrices allowing the most ac- The reader is advised about the notation abuse since xk
ceptable filter performance are presented below: is used for both the nonlinear and linear models. In this
sense, it worth to point out that the notation depends on
the context. For instance, xk in the linear model stands
100 0 0 0 for the state in deviation variables.
0 100 0 0
Qe,k = (25)
0 0 100 0
0 0 0 100 Objective function: in the literature of MPC three
cost functions appear typically, i.e., GPC (Generalized
Re,k =1 (26) Predictive Control), LQPC (Linear Quadratic Predictive
Control) and zone control (Van den Boom and Backx,
2010). In this case study the following LQPC cost function
100 0 0 0
was chosen
+ 0 100 0 0
Pe,0 = (27)
0 0 100 0 Np −1
X Nc −1
X
0 0 0 100 ˜ x0 ) = x̂T Pc x̂N +
J(∆u, x̂T ∆uT
N k Qc x̂k + k Rc ∆uk (35)
k=0 k=0
with Fk the state matrix, Gk is the input matrix, Bd is State constraints: the only state constraint considered is
the matrix relating the unknown disturbance to the model, that every state should be greater than zero.
and Hk is the output matrix. The numerical values of the
latter matrices are xk ≥ 0
T
where xk = [BG,k IG,k Qk Ik P1,k P2,k ] .
0.9758 0 −0.007 −0.00 0 0.0005
0.009 0.990 −0.00 −0.00 0 0
Input constraints: as it was shown before, the control
0 0 0.994 0.002 0 0
Fk = (31)
0 0 0.002 0.814 0 0 input corresponds to uex , the exogenous insulin. In this
0 0 0 0 0.965 0 sense, the smaller amount of insulin to be infused is
0 0 0 0 0.034 0.993 zero and the largest amount is provided by the physical
constraint of the catheter.
−0.0000
−0.0000 0 6 uex,k 6 20mU/min
0.0004
Gk = (32)
0.2870 Initial conditions and controller tuning: the MPC
0
was designed by setting a control horizon Nc = 50, and a
0 prediction horizon Np = 100. Regarding to the setting up
0 of the weighting matrices, a greater weight was assigned for
0 states BG,k and IG,k because the purpose of the MPC is to
0 control glucose levels. The tuning of the matrices Pc , Qc ,
Bd = (33)
0 and Rc was performed by a trial-and-error procedure such
0.9828 that an acceptable performance of the MPC is reached.
0.0171 The matrices are presented below
Hk = [0 1 0 0 0 0] (34)
75
CHAPTER 3. BIOMEDICAL - BIOENGINEERING
10000 0 0 0 0 0
Interstitial glucose
0 10000 0 0 0 0 5.002
Estimated state
(mmol/L)
0 0 1 0 0 0 5 Real state
Qc = (37)
0 0 0 1 0 0 4.998
0 0 0 0 1 0 4.996
0 0 0 00 1 0 200 400 600 800 1000
5.005
Blood glucose
10000 0 0 0 0 0
(mmol/L)
0 10000 0 0 0 0 5
0 0 1 0 0 0
Pc = (38)
0 0 0 1 0 0 4.995
0 0 0 0 1 0
0 200 400 600 800 1000
Interstitial insulin
10.775
0 0 0 00 1
(mU/L)
Rc = 1 (39) 10.77
10.765
5. THE COUPLING OF THE EKF AND THE MPC 0 200 400 600 800 1000
Time (minutes)
The separation principle states that if a stable observer is (a) Estimation of IG,k ,BG,k and Qk
combined with a stable controller, the resulting coupled
system is stable and functional. In this case, an EKF
Plasma insulin
is tuned to work with the MPC to perform an output
(mU/L)
20.01
feedback to control the blood glucose concentration of a 20.005
Estimated state
patient at the ICU. The simulation of the MPC coupled Real state
with EKF was performed in the software Matlabr version 20
0 200 400 600 800 1000
2016a using the Runge Kutta numerical method with a
stomach (mmol)
22.5
Glucose in the
6
Real State
5.5
(mmol/L)
Estimated state
First, according to the evidence from the simulations, it is
5
concluded that the design of combined control and estima-
tion schemes becomes a reachable technology to be used 4.5
0 200 400 600 800 1000
in health care applications as the presented. In this sense, 20
input (mU/min)
76
CHAPTER 3. BIOMEDICAL - BIOENGINEERING
art of the strategy and hence to propose control schemes Appendix A. MATHEMATICAL MODEL
tailored to the specific case study. PARAMETERS
77
CHAPTER 3. BIOMEDICAL - BIOENGINEERING
Abstract: This paper presents the design and implementation of a wearable system for monitoring
physiologic variables. The system is composed by a master module, responsible for receiving the data
and the Wi-Fi transmission. Also, a set of node sensors (or slaves) were developed, allowing the acquisi-
tion of multiple biological signals in a modular and versatile arrangement. The nodes are connected by
Low Energy Blue-tooth to the Master. In this paper we present the nodes for pulse oximetry, temperature,
electrodermal activity and voice acquisition. The results are encouraging and the entire system has an
adequate performance and versatility.
Keywords: wearable sensor, wireless, low energy, electrodermal activity, pulse oximetry
78
CHAPTER 3. BIOMEDICAL - BIOENGINEERING
79
CHAPTER 3. BIOMEDICAL - BIOENGINEERING
T= -0.1921266499 (°C)⁄mV*VTAO+211.2190436 °C
90
Hbpm
(3)
80
70
La anterior ecuación se obtuvo a través de mediciones 1 2 3 4 5 6 7 8 9 10
80
CHAPTER 3. BIOMEDICAL - BIOENGINEERING
valor es consecuente con el objetivo de este trabajo, ya que se El diseño implementado con nodos sensores comunicándose
esperan detectar pequeñas variaciones de temperatura en forma inalámbrica con el módulo maestro posibilitó la
corporal. eliminación de conexiones cableadas en la plataforma de
La EDA [8] se testeó colocando el sensor sobre la muñeca de sensores vestibles, lo que disminuyó las fallas en el sistema
los voluntarios y realizando actividad física, conformando la debido a movimientos bruscos de los usuarios. Además la
curva de registro que se aprecia en la Figura 6 a modo de utilización de WiFi para la conexión a la red WLAN
ejemplo. Se observa el cambio de conductividad a posibilitó el monitoreo tanto en aplicaciones hogareñas como
consecuencia de la sudoración, como una modificación de la industriales. De esta manera, se pueden realizar ampliaciones
tensión medida en el sensor. La señal obtenida con el de la red de datos y visualizaciones a mayores distancias, y
micrófono conserva los contenidos frecuenciales y de calidad desde cualquier dispositivo con conexión WiFi (PCs, tablets,
deseados, ya que dependen del micrófono comercial usado y smartphones, etc). Este es un desarrollo preliminar, y en
no es parte del diseño específico para este proyecto. próximos trabajos se agregarán sensores inerciales y de
2.8
electromiografía a fin de ampliar las variables registradas.
Son necesarias pruebas en un ambiente controlado con
2.6 inducción de emociones o de alteraciones fisiológicas y
2.4
ambientales a fin de validar el desarrollo y obtener los datos
para un análisis multimodal de las señales. Sin embargo, la
2.2 construcción de un modelo de comportamiento debe ser
EDA
2
personalizado. Esto requiere la recolección de datos previos,
y un modelo dinámico de ajuste y aprendizaje para cada
1.8 operario.
1.6 REFERENCIAS
1.4
11 11.5 12 12.5 13 13.5
[1] Teng X-F, Zhang Y-T, Poon CCY, Bonato P (2008): Wearable medical
Time (hs) systems for p-Health. IEEE Reviews in Biomedical Engineering, 1:62-
Figura 6: Respuesta del sensor de sudoración en función el tiempo, 74.
registrando la tensión proporcional a la conductividad galvánica de la piel. [2] Bonato P (2010): Wearable sensors and systems. From enabling
technology to clinical applications. IEEE Eng Med Biol Mag, 29:25-36.
[3] Anzaldo D (2015): Wearable sport technology – Market landscape and
La interfaz gráfica permite visualizar los módulos diseñados compute SoC trends. IEEE Conference Publications: International SoC
hasta el momento con claridad y en forma amigable. Design Conference: 217-218.
[4] Chen B, Patel S, Buckley T, Rednic R, McClure D, Shih L, Tarsy D,
Welsh M and Bonato P (2011): A Web-Based system for home
4. CONCLUSIONES monitoring of patients with Parkinson´s disease using wearable
sensors. IEEE Transactions on Biomedical Engineering, Vol. 58, No
El sistema cumple con las expectativas de desempeño, 3:831-836.
precisión, y repetibilidad de un dispositivo vestible, como [5] https://www.bluetooth.com.
[6] Asada HH, Shaltis P, Reisner A, Rhee S, Hutchinson RC(2003):
también condiciones de autonomía, bajo peso y costo Mobile monitoring with wearable photoplethysmographic biosensors.
requeridos en estos equipos. Los valores obtenidos de SpO2, IEEE Eng Med Biol Mag , 22:28-40.
Hbpm, Temperatura y EDA se encuentran en el rango [7] http://en-us.fluke.com/products/thermometers/thermometer.html.
esperado y sin errores de medición en las pruebas efectuadas. [8] Piccinini D, Andino N, Ponce S, Roberti M, López N.(2015):
Wearable system for acquisition and monitoring of biological signals.
Journal of Physics: Conference Series.Bristol: IOP Publishing ,Institute
of Physics.
81
CHAPTER 4
BIOPROCESSES
CHAPTER 4. BIOPROCESSES
In this work, the optimizing control of the fed-batch process for the production of Polyhydroxyalkanoates
is carried out by formulating and solving a dynamic optimization problem in order to maximize the
process productivity. The optimization problem is subject to constraints on the feed flow rates, the final
volume and the maximum concentrations able to be reached on the substrate and nitrogen-source in order
to avoid inhibition. For solving the dynamic optimization problem, different parameterization strategies
of the control vector were used in order to compare their effect on the dynamic behavior of the biological
variables, and therefore, on the process productivity. Results have shown that sinusoidal parameterization
of the control profiles lead to higher productivity values while avoiding abrupt changes on the
microorganism environment. Furthermore, it is shown that coupling the optimizing control to a neural
network soft-sensor developed for predicting the number average molecular weight of the biopolymer is
a good strategy in order to fulfill the end-product specifications. In this case, the optimal solution leads to
a productivity over 300 g, while keeping the number average molecular weight at common reported
values for important applications (between 4x105 - 2x106 g/mol).
Keywords: Advanced Control, Optimization, State Estimation, Biotechnology, Simulation.
developed for predicting the Number Average Molecular
1. INTRODUCTION
Weight (Mn).
Polyhydroxyalkanoates (PHAs) are polymers from biological
2. MODELLING OF THE PHA’s PROCESS
origins, which are currently claimed to be an environmentally
friendly option for replacing petroleum based plastic The developed model is based on the work by (Shahhosseini
materials in a wide number of applications. However, 2004), (Khanna & Srivastava 2006), (Amicarelli et al. 2008)
production costs of these plastic materials are still higher than and (Chatzidoukasa, Penlogloub & Kiparissides 2013). The
the petroleum based ones, which has prevented the expansion model is described by equations (1-8), where the dynamic
of the biopolymer industry, despite the fact of its innumerable equations describing the behavior of the biomass (X),
environmental advantages. Therefore, in the last years, substrate (S), biopolymer (P), nitrogen-source (N), dissolved
scientists have put many effort in improving the technical and oxygen (O2L) and dissolved carbon dioxide (CO2)
economic feasibility of the process. Some of these works concentrations are given. The specific growth rate (μ)
focused on using alternative low cost substrates (Lee & Na depends on the glucose concentration (S), nitrogen-source
2013), (Dietrich, Illman & Crooks 2013). Other works have concentration (N) and Dissolved Oxygen concentration
focused on using tools from the Process Systems Engineering (O2L), following a sigmoidal relationship. F1, F2 and F3
(PSE), in order to address the optimization and control of the correspond to the feed flow rates of the substrate, nitrogen-
process, towards increasing its productivity (Keshavarz & source and oxygen, respectively. Sin, Nin and Oin correspond
Roy 2010), (Khanna & Srivastava 2006) and (López , Bucalá to the concentrations of substrate, nitrogen-source and
& Villar 2010). oxygen in the feed. Finally, V is the fermentation volume.
In this work, it is proposed to maximize the productivity of 𝑁
( )
𝑁 𝑛𝑘
𝑂2 𝐿
the PHA’s production process by applying an optimizing 𝜇 = 𝜇𝑚 ( 𝑁
𝑆
) ∗ (1 − ( 𝑆
) ) ∗ (( )) (1)
( )+𝐾𝑠𝑟 𝑆𝑚 𝐾𝑜𝑥 ∗𝑋+𝑂2 𝐿
𝑆
control strategy based on a first principle model containing
𝐹1 +𝐹2
the most relevant dynamics for the process (substrates, 𝑋̇ = μX − X (2)
𝑉
biomass, polymer, dissolved Carbon Dioxide and Dissolved
Oxygen). It is also expected to include characteristics of the
desired product such as molecular weight distribution 𝑆̇ = − ((𝐶𝑠𝑥 μ 𝑋) + (𝑅𝑐𝑠𝑥 𝑋) + 𝐶𝑠𝑝 ((𝐾1 μ𝑋) + (𝐾2 𝑋))) +
(MWD), the Number Average Molecular Weight (Mn), the
𝐹1 𝐹1 +𝐹2
Weight Average Molecular Weight (Mw) and/or the 𝑉
𝑆𝑖𝑛 −
𝑉
S (3)
Polydispersity Index (PDI). Due to the importance of
assuring some of these characteristics, the optimizing control 𝐹 +𝐹
strategy proposed in this work, is coupled to a soft-sensor 𝑃̇ = (𝐾1 μ𝑋) + (𝐾2 𝑋) − 1 2 𝑃 (4)
𝑉
83
CHAPTER 4. BIOPROCESSES
̇ 𝐹 𝐹 +𝐹
𝑁̇ = −((𝐶𝑛𝑥 μ 𝑋) + (𝑅𝑐𝑛𝑥 𝑋)) + 2 𝑁𝑖𝑛 − 1 2 N (5) 3. SOFT–SENSOR DEVELOPMENT FOR
𝑉 𝑉
MOLECULAR WEIGHT DISTRIBUTION
𝑂2̇ 𝐿 = ((𝐾𝐿(𝑂2𝐿𝑒𝑞 − 𝑂2 𝐿)) − (𝐾3 μ𝑋) − ((𝐾4 𝐾1 μ𝑋) + (𝐾4 𝐾2 𝑋)))) + In the case of polymer production, end-product properties are
𝐹3
𝑂𝑖𝑛 −
𝐹1 +𝐹2
𝑂2 𝐿 (6) highly related to the Molecular Weight Distribution (MWD)
𝑉 𝑉
achieved during the process, (Sudesh, Abe & Doi 2000).
Therefore, it is important to control the MWD during the
𝐹 +𝐹
𝐶𝑂̇ 2 = (𝛼1 𝜇 + 𝛼2 )𝑋 + 𝛼3 − 1 2 𝐶𝑂2 (7) process operation. However, it has to be considered that one
𝑉
of the main limitations for controlling is the difficulty of
getting cheap and reliable on-line measurements of the
𝑉̇ = 𝐹1 + 𝐹2 (8)
MWD. Previous works by (Penloglou et al. 2010) and
The model contains 19 parameters. For identifying those (Penloglou et al. 2012) proposed the combination of a
parameters, a hybrid strategy combining the simulated polymerization and a macroscopic model in order to
annealing and the interior point method was used. The determine the MWD in the PHA production. However, the
objective function for parameter identification is given by (9). mentioned works don´t take into account any control strategy
in order to achieve a desired final molecular weight
∆ij distribution. Based on the mentioned works, a structured
SSWR = ∑ni=1 ∑m
j=1 (9)
W2j macroscopic/polymerization kinetic model was simulated in
order to obtain enough “in silico data” for building an
Where SSWR represents the sum of the square weighed Artificial Neural Network (ANN) capable of predicting the
residuals, n and ‘m’ are the total number of experimental data number average molecular weight (Mn). Such ANN is used
points and variables, respectively. Wj is a normalization as a soft-sensor inside the optimizing control strategy, in
factor for each variable. Δij is the difference between the order to monitor the Mn during the process operation and to
predicted and experimental data. Experimental data for drive the optimization towards finding the conditions for
identification and validation was taken from (Khanna & assuring maximal productivity while keeping the Mn at a pre-
Srivastava 2005). For finding an optimal set of parameters, established range (i.e. required for the biopolymer to fulfill
the methodology by (Wu et al. 2013) was applied. Results certain mechanical/performance properties)
showed that only 7 parameters are identifiable. Therefore, a
re-optimization strategy was performed for finding a better For simulating the macroscopic/polymerization kinetic model
value for these sensitive parameters while keeping fixed the reported by (Penloglou et al. 2010), (Kumar & Ramkrishna
remaining 12 parameters. Figure 1 shows the model results 1996) and (Saliakas et al. 2007), a mathematical technique
after parameter identification. It can be observed that model called fixed pivot was used in order to discretize the set of
predictions are in good consent with experimental data. ordinary differential equations that describe the model. It is
important to notice that the integration between the
40 15 macroscopic and the polymerization models is the most
Model Validation
Experimental Data
Model Validation important partof the MWD-prediction.. The polymerization
Experimental Data
30 model allows predicting Mn as a function of the monomer
10
production rate (𝐽𝑀 (𝑡)). Furthermore, 𝐽𝑀 (𝑡) is determined by
X (g/L)
P (g/L)
84
CHAPTER 4. BIOPROCESSES
Table 1. Parameters of the polymerization model 0.999. Figure 4 compares the ANN predictions against the
validation data.
Parameters Reported by Adjusted ANN Performance (Red) Val Data (Blue)
(Penloglou et al. 2012) Parameters 1
ki (h-1) 0.62 ±9 x 104 0.64x104
kP(h-1) 0.46±5x105 0.44x105 0.8
𝑘𝑡∗ (h-1) 0.14±1x101 0.101x101
Mn (g/mol)
∗
𝑘𝑚1 (h-1) 0.11±2x10-3 0.114x10-3 0.6
km2(l/mol/h) 0.85±15x107 0.75x107
𝑘𝑑∗ (h-1) 0.83±6x102 0.25x102 0.4
𝑌𝑀 0.35±2x10-2 0.23x10-2
𝐹
0.2
3 6
Model Validation
Experimental Data 0
0 100 200 300 400
2 4 Time (h)
P (g/l)
S (g/l)
85
CHAPTER 4. BIOPROCESSES
pulse, iii) piecewise constant, and iv) sinusoidal The dynamic optimization problem expressed in (15) was
parameterization. . solved at two different scenarios, in order to compare and to
show the importance of including end-product specifications
For the piecewise constant parameterization, the feed flow as part of the control problem during PHA production. The
rates are described by equations (10-11). scenarios are, i) Optimizing control without constraints on
Mn, and ii) Optimizing control with constraints on Mn.
𝐹1,2 = ∑𝑚
𝑗=1 𝑎𝑖𝑜𝑘 𝜑(𝑡𝑖−1 , 𝑡𝑖 )(𝑢𝑚𝑎𝑥 − 𝑢𝑚𝑖𝑛 ) + 𝑢𝑚𝑖𝑛 (10)
4.1 Optimizing control without constraints on Mn
0, 𝑡 < 𝑡𝑖−1
𝜑(𝑡𝑖−1 , 𝑡𝑖 )= {1, 𝑡𝑖−1 ≤ 𝑡 < 𝑡𝑖 (11) Figures 6 and 7 show the optimal F1 and F2 feeding profiles
0, 𝑡 ≥ 𝑡𝑖 for each parameterization case, obtained by solving the
dynamic optimization problem given by equation (15) where
Where m=12 is the number of steps. umax=0.3 and umin=0 no constraints on Mn are used.
correspond to the maximum and minimum values for each
step. The 𝑎𝑖𝑜𝑘 is the parameter that defines the control vector 0.18
Constant
profile, and is therefore the decision variable of the dynamic 0.16
Pulse
optimization problem. For this specific case, this type of Piecew ise
0.14
parameterization uses 12 parameters for each flow rate. Siusoidal
0.12
On the other hand, the sinusoidal parameterization of the feed
F1 (L/h)
0.1
flow rates was implemented as described by equation (12)
(Ochoa 2016). 0.08
0.06
𝑡−𝑡0
𝐹1,2 = 𝑎𝑜 + 𝑎1 𝐶𝑜𝑠 (𝑤1 ( ) + ∅1 ) +
𝑡𝑓 −𝑡0 0.04
𝑡−𝑡0 0.02
𝑎2 𝐶𝑜𝑠 (𝑤2 ( ) + ∅2 ) (12)
𝑡𝑓 −𝑡0
0
0 10 20 30 40 50
Time (h)
Where w1, w2 are the frequency, and ∅1 and ∅2 are the phase
angle of the sinusoidal profile. For this specific case, this type Figure 5. Optimal Feeding profiles for substrate (F1)
of parameterization uses seven parameters for each flow rate.
𝐿
0 ≤ 𝐹2 ≤ 2 ( ) (15b)
ℎ
0.06
𝑔
max (𝑆(𝑡)) ≤ 90.11 ( ) (15c)
𝐿 0.04
𝑔
max(𝑁(𝑡)) ≤ 10.11 ( ) (15d)
𝐿 0.02
𝑔
𝑆𝑖𝑛 ≤ 800 ( ) (15e) 0
𝐿 0 10 20 30 40 50
𝑔 Time (h)
𝑁𝑖𝑛 ≤ 70 ( ) (15f) Figure 6. Optimal feeding profiles for nitrogen-source (F2)
𝐿
𝑉 ≤ 10𝐿 (15g)
As it can be observed, the sinusoidal parameterization is a
smooth strategy (i.e. avoids abrupt and sudden changes in the
Where tf is the duration of the process. Constraints (15a-15b) microorganisms environment), which helps reducing
take care of the maximum and minimum values allowable for inhibitory effects on the microorganism. Inhibitory effects are
F1 and F2, respectively. Max S and max N, are the maximum more pronounced when the microorganism experiences a
substrate and nitrogen-source concentrations allowed during rapid increase on the substrate/ nutrient concentrations. Such
the fermentation (i.e. for avoiding inhibition). Sin and Nin are rapid scenarios are more prompted to take place when step
the substrate and nitrogen-source concentrations in the feed type policies (as for example, when the pulse or piecewise
flows F1 and F2, respectively. V is the fermentation volume. constant feeding policies are followed) are used.
Table 2 shows the productivity reached by applying the four
kinds of profiles to the fed-batch PHA production.
86
CHAPTER 4. BIOPROCESSES
Furthermore, the required nitrogen-source and substrate feed constraints on the Mn which is a low value for industrial and
concentrations for keeping that productivity are shown. The commercial applications. Therefore, in the next section,
constant and pulse feeding policies are usually used in implementation of the optimizing control including
practice due to the easiness of implementation. However, as constraints on Mn is addressed.
results show in Table 2, these policies resulted in very low
productivity values, when compared against the sinusoidal 4.2 Optimizing control coupled with constraints on Mn
and piecewise constant policies. The sinusoidal feeding
policy resulted in the higher productivity, followed by the Taking into account that the sinusoidal parameterization
piecewise constant. Furthermore, the computational time for showed the best results in the previous case, such
solving the dynamic optimization by the sinusoidal parameterization was used in the present scenario. The
parameterization was lower for the sinusoidal than for the dynamic optimization problem is the same as in equation (15-
piecewise constant (results not shown). This is due to the fact 15g) but including the following constraint on Mn (Gonzales
that the sinusoidal approach has a lower number of decision García et al. 2013)
variables, which of course impacts the time for reaching an
optimal solution. The lowest computational time was reached 𝑔 𝑔
400000 ( ) ≤ 𝑀𝑛(𝑡) ≤ 2000000 ( ) (16)
for the constant feeding policy, and the pulse strategy, 𝑚𝑜𝑙 𝑚𝑜𝑙
Mn (g/mol)
productivity
Feeding Productivity Nin (g/L) Sin(g/L) 1
Strategy (g) in F2 in F1
Constant 364.45 48.66 495.93 0.5
Pulse 183.61 38.44 790.59
Piecewise 402.90 43.37 448.52 0
0 10 20 30 40 50
Sinusoidal 405.17 42.35 463.61 Time (h)
As mentioned, the piecewise and sinusoidal feeding strategies Figure 8. Optimizing control results for Mn: without and
reached higher productivity values by using a similar with constraint on Mn.
tendency in the feed rates. However, the abrupt changes on
the feed flows calculated by the piecewise strategy could The objective is to keep Mn inside the desired range in order
generate cellular stress due to the strong and rapid variations to obtain a polymer with adequate thermoplastic properties.
that take place on the microorganism environment, which is Figure 9 show the feeding profiles that keep Mn at the
reflected on the substrate/nutrients concentrations on the desired range shown in Figure 8. It is important to notice that
culture media as shown Figure 7. Such stress will definitively the final Mn is 8.07x105 g/mol, which corresponds to values
affect the performance of the microorganism. reported in the literature for PHA applications. Finally, the
productivity reached was 369.13 g which is lower than in the
100
Constant
previous case, but still fulfilling the required constraint on
Pulse Mn.
80 Pisecew ise
Sinusoidal
0.1
F1 Sinusoidal Profile
60 F2 Sinusoidal Profile
0.08
S (g/L)
40
F1 ,F2(L/h)
0.06
20
0.04
0
0 10 20 30 40 50 0.02
Time (h)
Figure 7. Substrate behavior due to the profiles F1 0
0 10 20 30 40 50
Time (h)
Figure 8 shows the predicted number average molecular
Figure 9. Carbon Source Profile F1 and Nitrogen Source
weight distribution (Mn) by the neural network (described in
Profile F2 with constrint in Mn
section 3), when the sinusoidal parameterization is used for
the input flows F1 and F2 (profiles shown in figures 5 and 6)).
As it is shown, the predicted Mn results in a finalvalue of
1x105 if the optimizing control problem doesn´t consider
87
CHAPTER 4. BIOPROCESSES
88
CHAPTER 4. BIOPROCESSES
Abstract: This paper extends and generalizes a model of Dissolved Oxygen (DO) dynamic developed for
the δ - endotoxins of Bacillus thuringiensis (Bt) batch production process to a DO dynamical model that
can be used for a broad class aerobic Endospore-forming Bacteria (AEFB). The most significant feature
of this type of microorganism is the endospores generation in their life cycle. The generalization of the
model allows to use the same parametric model structure for process control and state estimation
purposes in a considerable variety of AEFB batch bioprocesses by selecting an appropriate set of
parameters for each specific bioprocess.
Keywords: Bioprocess Model, Aerobic Endospore-Forming Bacteria (AEFB), Batch Process, Control
Process, State Estimation.
called endospore. Bacillus is a large group, including Bacillus
1. INTRODUCTION
cereus, Bacillus clausii, Bacillus halodenitrificans, Bacillus
In a previous work (Amicarelli et al., 2010) the authors subtilis, Bacillus thuringiensis, among others. Bacillus
developed a dissolved oxygen (DO) dynamics model to spores, also called endospores, are resistant to harsh chemical
complete a pre-existing model (Atehortúa et al., 2007) for the and physical conditions. Clostridium, Sporolactobacillus are
δ - endotoxins of Bacillus thuringiensis (Bt) batch production group of anaerobic spore forming bacteria. Clostridium genus
process. The inclusion of this dynamic is useful for control consists of more than a hundred known species such as
purposes since the dissolved oxygen in the culture medium is Clostridium perfringens, Clostridium botulinum, Clostridium
a key variable to maintain the aerobic microorganism difficile, Clostridium tetani and Clostridium sordellii.
population in adequate levels and conditions for the Clostridium thermocellum are used commercially to produce
effectiveness of the fermentation process. This mentioned ethanol, Clostridium acetobutylicum to produce acetone, and
model was successfully considered in subsequent works to Clostridium diolis to convert fatty acids to yeasts and
propose different DO control strategies through different propanediol, among others. The processes that involve AEFB
approaches i) control based on Lyapunov theory have two classes of cells: vegetative cells and sporulated
(Amicarelli et al., 2010), ii) control based on nonlinear cells. The proposed dissolved oxygen model (Amicarelli et
dynamic inversion (Rómoli et al., 2016), iii) Predictive al., 2010) allows estimating or quantifying how much oxygen
Control Based Model (CPBM) (Alzate et al., 2016) and is consumed for vegetative growth and for cell maintenance
classical controllers as PID, PI. Is important also to mention for Bt fermentations. When the EFB are in the vegetative
that dissolved oxygen concentration can also be used for state, their growth, as well as their substrate utilization and
estimating the microorganisms concentration in the culture decay processes are almost the same as for common bacteria.
medium when the oxygen consumption for the When the culture medium exhibit nutritional limitation, the
microorganism is known (Amicarelli et al., 2015; A EFB sporulate (Atehortúa et al., 2006; Errington, 2003;
Amicarelli et al., 2014; di Sciascio & Amicarelli, 2008; Hoch, 1993; Sonenshein, 2000). In general, the bioprocess
Rómoli et al., 2016). As was mentioned early, model was models available in literature normally include cellular
initially developed for Bt batch fermentation but is possible to dynamics: the main substrate and generation of product
extend the use of this model for a more general class of dynamics. However, dissolved oxygen is not considered as a
microorganism named: Aerobic Endospore-forming Bacteria part of the model. Nevertheless, it is feasible to include this
(AEFB). The Endospore-forming Bacteria (EFB) are dynamic since it can be easily measured online. Furthermore,
microorganism as Bacillus and Clostridium that can survive this is a useful variable for the manipulation as control action
in hostile environments by producing endospores and then of bioprocess.
rapidly germinating to vegetative cells and growth when they
In this work, the authors extend the use of the DO model for
encounter favorable environmental conditions. Bacillus genus
control and estimation purposes, to other batch bioprocess
are Gram-positive, aerobic or facultative Endospore-forming
with similar characteristics with the aim to achieve other
Bacteria, that is, Bacillus are organism capable of growth in
products. Example of bioprocess with other finalities can be
the presence of oxygen, and forms a type of resting cell
89
CHAPTER 4. BIOPROCESSES
the production of biosurfactant by Bacillus subtilis (Davis et where K air is an oxygen consumption parameter by growth
al., 2001; Yeh et al., 2006). The genus bacillus have general
(constant for each fermentation), and Fair is the inlet air flow
features that include the degradation of substrates derived
from vegetal and animal sources including cellulose, starch, rate that enters the bio-reactor. For a given bioreactor
pectin, proteins, agar, hydrocarbons, and others; antibiotic configuration, Fair is mainly a function of the impeller
production; nitrification; denitrification; nitrogen fixation; agitation speed. Based on experimental evidence (Atehortúa
facultative lithotrophy; autotrophy; acidophily; alkaliphily; et al, 2007), it is assumed that the oxygen consumption rate
psychrophily; thermophily; and parasitism (Todar, 2006). (OUR) depends on the total cells X, that is, both vegetative
Three microorganisms are currently in use for industrial and sporulated cells consume oxygen at different rates,
riboflavin production. The hemiascomycetes Ashbya gossypii, therefore:
a lamentous fungus, and Candida famata are naturally
producers of this vitamin (Stahmann et al., 2000). Vitamins, OUR(t ) qO2 X (t )
(3)
C, B2 (riboflavin), vitamin B12, linolenic acid, and ergosterol
are produced by the use of microorganisms. Gram positive Because dissolved oxygen is considered as a second substrate
bacteria, producing inhibitory substances like cyclic peptides qO2 is:
and bacteriocins, with a broad antimicrobial spectrum and a
history of safe use in food (Cleveland et al., 2001; Holzapfel X X
qO2 mO2
et al., 1995). A relevant related paper is the work of Park, YX / O2 YXmax
/ O2
Rittmann, and Bae (Park et al., 2009) when it is developed a (4)
model called Life-Cycle Kinetic Model for Endospore – Where YX / O2 the observed biomass yield is based on oxygen
Forming Bacteria. This model includes Germination, max
Sporulation and dissolved oxygen dynamics. However, the consumed, YX / O2 is the true biomass yield based on oxygen
expression and balances are impractical for control an used for growth, mO is the oxygen consumption coefficient
2
estimation purposes due to their complexity and the
for respiration maintenance, and X is the net specific
difficulties to measure each involved variable online.
Motivated in this fact, in the present work, the authors report growth rate of total cells.
the use of a simple parametric dynamic model for these X
objectives allowing better control of dissolved oxygen OUR(t ) qO2 X (t ) X (t ) mO2 X (t )
concentration in the culture medium of this class of YXmax
/ O2
O consumption
2
bioprocesses. O consumption for cell maintenance
2
for biomass growth
(5)
This paper is organized as follows: Section 2 presents the
With the aim to define X , from the total cells balance:
model and it generalization for AEFB. Section 3 presents
validations model through experimental data and model X (t ) X v (t ) X s (t )
comparisons. Finally, in Section 4 conclusions are stated. (6)
where X v is the vegetative cells concentration, X s is the
2. GENERALIZATION OF THE PROPOSED DISSOLVED sporulated cells concentration and the vegetative cell balance:
OXYGEN DYNAMIC MODEL
dX v (t )
X v (t ) kd X v (t ) ks X v (t )
The standard model for dissolved oxygen balance is: dt
Growth rate of vegetative cell death rate of sporulation rate of
vegetative cell vegetative cell
(7)
dCDO (t )
K L a Csat CDO (t ) qO X (t ) OTR(t ) OUR(t ) where is the gross specific growth rate of vegetative cells,
dt 2
(1)
kd is the relative death rate of vegetative cells, k s is a
Where X is the total cells concentration, CDO is the
kinetic constant representing the spore formation rate, and
dissolved oxygen concentration, Csat the oxygen saturation v is the net specific growth rate of vegetative cells.
in equilibrium with the oxygen partial pressure of the gaseous
phase, K L a is the volumetric oxygen mass transfer rate, and v kd ks (8)
qO2 is the net specific oxygen uptake rate. The first term in Operating with (6) and (7):
dX (t ) dX v (t ) dX s (t )
the second member is the rate of aeration or OTR (oxygen kd X v (t )
transfer rate from air bubble to liquid phase), and the second dt dt dt
X (t )
term is the rate of oxygen consumption by cells or OUR kd v X (t ) kd f v X (t ) X X (t )
X (t )
(oxygen uptake rate of cells per volume of broth).
fv
(9)
The aeration term (OTR) can be written as follows where fv is a fraction of vegetative cells, then:
(Amicarelli et al., 2010):
OTR(t ) Kair Fair Csat CDO (t ) X
1 dX (t ) d [log X (t )]
kd f v
(2) X (t ) dt dt
(10)
90
CHAPTER 4. BIOPROCESSES
v
For Bacillus thuringiensis qO2 is positive because the Table 1: Variables in the Dissolved Oxygen model
Symbol Description
inequality / O2 mO2 max
0 kd YXmax always holds,
X Total cell concentration g L1
therefore qO2 is also positive.
YXmax
/ O2 263.15
CDO Dissolved oxygen concentration g L1
mO2 0.00073 kd YXmax
/ O mO 0.092
KO Oxygen saturation constant g oxygen L1
2 2
kd 0.1
0.1 0.192
1 dX (t )
OUR(t) mO2 X (t ) YX / O2 Observed biomass growth yield based on
YXmax
/ O2 dt
(14)
This form of implementation of the oxygen consumption oxygen consumed g cells g oxygen1
model has been widely reported in the literature; see the
review by Garcia-Ochoa et al, 2010, p. 296, Equation (12) 1 Oxygen consumption constant by growth
(Garcia-Ochoa et al., 2010), and references therein. This
Y max
X / O2
[dimensionless] g oxygen g cells 1
model form has also been specifically applied to modelling
91
CHAPTER 4. BIOPROCESSES
K air Ventilation constant L1 Bt and Sb respectively. The collected data from the Sb
fermentations is a set of biomass measurements and in the
work of Park (Park et al., 2009) there are not reported
Volumetric oxygen mass transfer rate h
KL a 1
experimental DO data in correspondence with the biomass
concentration, despite of this, we compare the DO obtained
by the model of Park and our model (see Figure 4).
qO2 Net specific oxygen uptake Experimental Total Cells Concentration for Bt
12
L h1 2
0
0 2 4 6 8 10 12 14 16 18
Time [hr]
OTR Oxygen transfer rate g oxygen L h 1 1
0.2
0.15
Table 2: Parameters in the model for Bt fermentations
0.1
Symbol Value 0.05
Csat 0.00745
0
0 2 4 6 8 10 12 14 16 18
mO2 0.729 10-3 Time [hr]
92
CHAPTER 4. BIOPROCESSES
DO Concentration [g/L]
6
tk k Ts , 1 k N , 5
3
Notice that (19) can be written compactly in the following 2
form: 1
yˆ (tk ) 1 1(tk ) 2 2 (tk ) 3 3 (tk ) T (tk )
0 2 4 6 8 10 12 14 16 18
(20) Time [hr]
This last equation (20) represents a discrete-time linearly Fig. 3. DO Concentration Approximation Model for Bacillus
parameterized model where: thuringiensis fermentation.
DO Concentration [g/L]
(tk ) 2 (tk ) X (tk ) X (tk 1) ,
3 (tk )
0.006
X (tk )
1 0.004
1
1 1 0.002
2 max (21) DO output Model
1 K air Fin TS
DO output Park Model
YX / O2
3
0
0 2 4 6 8 10 12 14 16 18
mO2 TS
Time[hr]
A column vector featuring the unknown parameters are Fig. 4. DO Concentration Approximation Model for Bacillus
subtiles fermentation.
the parameters vector, whereas (tk ) is the regression
vector formed by known signals. From a set of N input- With reference to Fig. 4, it can be observed that our DO
output experimental data, sampled at equally-spaced time model output adequately represent the dynamic behavior
intervals Ts , it is possible to estimate the parameters vector according to the considered biomass data. Oxygen
1 consumption is evident during the first hours of fermentation
and, there from the physical parameters estimates max , (approximately 8 h) since the process is on the
YX / O2
microorganism exponential growth stage. Next, the oxygen
mO2 , and K air (see (21)). Then, the estimated parameters of requirements decrease in the sporulation stage. Park’s model
the model are obtained directly from (21): predicts a DO level recovery two hours before (see Fig. 6).
This is may be due to the affection of the others terms
2 involved in such model. See, equation A.9 in (Park et al,
1 2009). It can be concluded that for practical aims and control
Y max 1
X / O2 purposes for the DO variable for this process. Our proposed
K mO2 3
model (16) satisfactory represents the behavior of Bt and Sb
1 TS batch fermentations. Moreover, the model is based on the
K air
1 biomass values only. This is clear advantage when is
1
compared with more complex models that require more
Fair in TS 1 variables, which are in many cases hard to obtain. Biomass
(22)
data that can be sensed provided the specific sensors, o by a
3.3 Model Results proper estimators design as proposed and validated in the
previous papers: (di Sciascio & Amicarelli, 2008), (Adriana
The results obtained with the proposed model for the Bt and Amicarelli et al., 2014), (Amicarelli et al., 2015), (Rómoli et
Sb fermentations are shown in Fig. 3 and 4 respectively. al., 2016).
Results for Bt fermentations were previously reported and
discussed in (Amicarelli et al., 2010). However, an example 4. CONCLUSIONS
of the behavior represented by the model in (16) for Bacillus This paper presented an extension or generalization of a
thuringiensis can be seen in Fig. 1. In the case of Bacillus dissolved oxygen dynamic model for batch fermentations.
subtiles, we considered the experimental results reported by The generalization was performed from a particular previous
(Park et al, 2009). With this information, we obtained the DO model for Bacillus thuringiensis (Amicarelli et al., 2010) and
output values with our proposed model and we finally was now extended for a more general class of aerobics
microorganism named Aerobic Endospore-Forming Bacteria
93
CHAPTER 4. BIOPROCESSES
(AEFB). This model generalization allows the scientific di Sciascio, F., & Amicarelli, A. N. (2008). Biomass
community to use the same model for control and state estimation in batch biotechnological processes by
estimation purposes not only in Bt but also in a considerable Bayesian Gaussian process regression. Computers &
variety of bioprocesses. The model was validated with Chemical Engineering, 32, 3264-3273.
experimental data of batch fermentations with Bacillus Errington, J. (2003). Regulation of endospore formation in
Subtiles and Bacillus thuringiensis. The model was also Bacillus subtilis. Nature Reviews Microbiology, 1,
compared against more complex obtaining similar results, 117-126.
making easy its practical implementation for related Garcia-Ochoa, F., Gomez, E., Santos, V. E., & Merchuk, J.
applications. C. (2010). Oxygen uptake rate in microbial
ACKNOWLEDGMENT processes: an overview. Biochemical engineering
journal, 49, 289-307.
The authors wish to thank the following organizations who Hoch, J. A. (1993). The phosphorelay signal transduction
contributed to the completion of this work: Universidad pathway in the initiation of Bacillus subtilis
Nacional de San Juan (UNSJ, Argentina), and Consejo sporulation. Journal of cellular biochemistry, 51, 55-
Nacional de Investigaciones Científicas y Técnicas 61.
(CONICET, Argentina). Holzapfel, W., Geisen, R., & Schillinger, U. (1995).
REFERENCES Biological preservation of foods with reference to
protective cultures, bacteriocins and food-grade
Alzate, A., Amicarelli, A., Gómez, L., & di Sciascio, F. enzymes. International journal of food
(2016). Métodos de Quasi-Monte Carlo para el microbiology, 24, 343-362.
Cálculo de Conjuntos en Teoría de Conjuntos en Ollis, D. F. (1983). A Simple Batch Fermentation Model:
Control. In Argencon 2016. Buenos Aires Theme and Variations. Annals of the New York
Argentina: IEEE Argentina. Academy of Sciences, 413, 144-156.
Amicarelli, A., di Sciascio, F., Toibero, J., & Alvarez, H. Park, S., Rittmann, B. E., & Bae, W. (2009). Life‐cycle
(2010). Including dissolved oxygen dynamics into kinetic model for endospore‐forming bacteria,
the Bt δ-endotoxins production process model and including germination and sporulation.
its application to process control. Brazilian Journal Biotechnology and Bioengineering, 104, 1012-1024.
of Chemical Engineering, 27, 41-62. Rómoli, S., Amicarelli, A., Ortiz, O., Scaglia, G., & di
Amicarelli, A., Montoya, L. Q., & di Sciascio, F. (2015). Sciascio, F. (2016). Nonlinear Control of the
Substrate Feeding Strategy Integrated with a Dissolved Oxygen Concentration Integrated With a
Biomass Bayesian Estimator for a Biotechnological Biomass Estimator for Production of Bacillus
Process. International Journal of Chemical Reactor Thuringiensis δ-Endotoxins. Computer & Chemical
Engineering, Accepted for publication. Engineering, Accepted for publication.
Amicarelli, A., Quintero, O., & Sciascio, F. (2014). Behavior Rowe, G. E., Margaritis, A., & Wei, N. (2003). Specific
comparison for biomass observers in batch Oxygen Uptake Rate Variations during Batch
processes. Asia-Pacific Journal of Chemical Fermentation of Bacillus thuringiensis Subspecies
Engineering, 9, 81-92. kurstaki HD‐1. Biotechnology progress, 19, 1439-
Amicarelli, A., Quintero, O., & Sciascio, F. (2014). Behavior 1443.
comparison for biomass observers in batch Ryder, D., & Sinclair, C. (1972). Model for the growth of
processes. Asia‐Pacific Journal of Chemical aerobic microorganisms under oxygen limiting
Engineering, 9, 81-92. conditions. Biotechnology and Bioengineering, 14,
Atehortúa, P., Álvarez, H., & Orduz, S. (2006). Comments 787-798.
on: “A Sporulation Kinetic Model for Batch Growth Sonenshein, A. L. (2000). Control of sporulation initiation in
of B. thuringiensis”. The Canadian Journal of Bacillus subtilis. Current opinion in microbiology, 3,
Chemical Engineering, 84, 386-388. 561-566.
Atehortúa, P., Álvarez, H., & Orduz, S. (2007). Modeling of Stahmann, K.-P., Revuelta, J., & Seulberger, H. (2000).
growth and sporulation of Bacillus thuringiensis in Three biotechnical processes using Ashbya gossypii,
an intermittent fed batch culture with total cell Candida famata, or Bacillus subtilis compete with
retention. Bioprocess and Biosystems Engineering, chemical riboflavin production. Applied
30, 447-456. Microbiology and Biotechnology, 53, 509-516.
Cleveland, J., Montville, T. J., Nes, I. F., & Chikindas, M. L. Todar, K. (2006). Todar's online textbook of bacteriology:
(2001). Bacteriocins: safe, natural antimicrobials for University of Wisconsin-Madison Department of
food preservation. International journal of food Bacteriology.
microbiology, 71, 1-20. Yeh, M.-S., Wei, Y.-H., & Chang, J.-S. (2006). Bioreactor
Davis, D., Lynch, H., & Varley, J. (2001). The application of design for enhanced carrier-assisted surfactin
foaming for the recovery of surfactin from B. production with Bacillus subtilis. Process
subtilis ATCC 21332 cultures. Enzyme and Biochemistry, 41, 1799-1805.
microbial technology, 28, 346-354.
94
CHAPTER 4. BIOPROCESSES
Abstract: This paper presents a nonlinear state estimation subject to delayed measurement for
the biomass in a batch bioprocess. The estimator scheme is based on an Extended Kalman Filter
with state augmentation method to incorporate delayed measurements. A methodology to use
the sample-state augmentation method is described. The proposed estimator is applied in the
δ-endotoxins production of Bacillus thuringiensis. Simulation results show the feasibility of the
proposed estimator.
Keywords: Batch Process, Extended Kalman Filter, Fixed Lag Smoothing, Delayed
Measurements.
95
CHAPTER 4. BIOPROCESSES
Major Instance No
Minor Instance
Moreover, to incorporate the different measurements on On the other side, the augmented state method is
state estimation techniques two moments are presented: based on increasing the state space with information of
major instance and minor instance. Major instance offline measurement and subsequently extended model
refers to the moment in which both measures are is incorporated into the state estimation technique. As
available (online and offline). Minor instance refers to shown in the algorithm of Figure 2, in these methods is
moment in which only available online measurement. only modified the model and the representation of the
When they used information sources with different state space is conserved. Because of this, these methods
characteristics, collateral problems such as: multi- become favorable for its extension to different estimation
sampling or asynchronism, missing and spurious data, and control techniques. In (Anderson and Moore, 2005)
redundant information, among others are presented (Guo state augmentation is defined for fixed-lag smoothing
and Huang, 2015; Guo et al., 2014). method. In the Fixed-Lag Smoothing method, the Ns past
states are smoothed based on online measurements of the
In this sense, there are different methods to manage minor instances. When offline and delayed measurement
and incorporate nonuniform and delayed information in is obtained, both measurements (offline and online) are
stochastic state estimation techniques. The methods can used to smooth out the state between s and s + Ns .
be classified into two types: fusing measurements and Methods based on augmentation state, retains the state
96
CHAPTER 4. BIOPROCESSES
97
CHAPTER 4. BIOPROCESSES
98
CHAPTER 4. BIOPROCESSES
35 −3 −3
x 10 x 10
7
30 8
sp measured
6.5
sp real 7
25
sp estimated
sp smoothed 6
20 6
5
15 5.5
4
10 3
od real 5
od estimated
5 2
od smoothed
arrival time 4.5
1
0 sampling time
0 5 10 15
time [h] 0
0 5 10 15 6 8 10
time [h] time [h]
Figure 4. Substrate concentration sp .
Figure 6. Dissolved oxygen concentration od .
12
12 7
10
6
Total biomass concentration [g/L]
10
xv real
Vegetative cells concentration [g/L]
4
xT real 3
x measured 4
T
xT estimated
2 2
x smoothed
T
2
sampling time
0 1
0 5 10 15
time [h] 0
0 5 10 15 0 2 4
time [h] time [h]
Figure 5. Total biomass concentration xT .
Figure 7. Vegetative cells concentration xv .
In Figure 4 is possible to observe the correct estimation
of the substrate concentration sp , but this variable is not 5. CONCLUSIONS
important to estimate because can be measured online.
However, it is possible to notice the effect of filtering In this paper was presented a nonlinear state estimation
performed by the estimator in the variable sp . In Figure subject to delayed measurement for the biomass in
5, the EKF without off-line measurements produces an a batch bioprocess including the dissolved oxygen
estimate that deviates from the actual value since the dynamic. It was described the methodology for to
beginning, however, the EKF with offline measurements use sample-state augmentation method. The proposed
converges quickly when such measurement arrives. This estimator was applied to the δ-endotoxins production of
tendency to converge continuing with each new additional Bacillus thuringiensis and the incorporating of delayed
off-line measurement. Additionally, Figures 6, 7 and 8 measurements to EKF improved the performance of the
show a similar behavior is observed to the other estimated estimate. Simulations show the feasibility of the proposed
variables. In these figures, arrival times and sampling estimator.
time of off-line measurement are detailed in order to
visualize the effect of the off-line measurement in the EKF REFERENCES
improved.
Alexander, H.L. (1991). State estimation for distributed
Finally, it was observed that it is possible to estimate the systems with sensing delay. Data Structures and
concentrations online in a batch process that includes the Target Classification, SPIE, 1470, 103–111. doi:
dynamic of dissolved oxygen concentration. The results 10.1117/12.44843.
can be used for monitoring, control or fault detection in Amicarelli, a., Di Sciascio, F., Toibero, J.M., and Alvarez,
batch processes. H. (2010). Including dissolved oxygen dynamics into
99
CHAPTER 4. BIOPROCESSES
100
CHAPTER 4. BIOPROCESSES
Abstract: The biological wastewater treatment has become a major operation to maintain
appropriate levels of organic matter in the wastewater to be discharged. This reduction of
organic matter is performed by microorganisms, which require oxygen concentrations adequate
to survive. Therefore, the oxygen dissolved control is a critical operation in the biological
wastewater treatment plants. In this paper, the design of a coupled estimation and control
strategy is presented for a biological wastewater treatment plant. The coupled estimation and
control strategy is composed by a Kalman filter and a Model Predictive Controller (MPC). The
results obtained showed that for disturbances nearby to the operating point, the estimated state
converges to the actual state and the controller maintains the dissolved oxygen levels within a
narrow range between 5.3 and 6.7 mg L approximately.
Keywords: Dissolved oxygen, Kalman filter, model predictive control, biological wastewater
treatment.
101
CHAPTER 4. BIOPROCESSES
Sometimes, there is no inhibition by oxygen excess, but is maintained at an equilibrium, and the remainder effluent
this operation represents cost overruns. In processes where is discarded as waste sludge Moltzer (2008), Nejjari et al.
obtaining a metabolite from cell growth, also death and (1999).
inhibition phenomena are present due to absence or excess
of oxygen, respectively. Moreover, production of secondary
products (non-desirable products) have been reported,
owing to limitations in oxygen mass transfer Åmand et al.
(2013).
Due to of the importance of dissolved oxygen in the
biological wastewater treatment plants and the difficulty
of monitoring the main process variables a state estimator
(Kalman filter) and a model-based control are presented
in order to perform an output-feedback loop, as it is shown
in Figure 1.
Fig. 2. Process flow diagram for a biological wastewater
treatment plant. Adapted from Moltzer (2008)
The model is based on the following assumptions:
• The aerated treatment tank is considered to be per-
fectly mixed so that the concentration of each com-
ponent is spacially homogeneous.
• Bioreaction does not take place in the clarifier.
• The sludge (biomass) is the only recycled component
into the aerated treatment tank.
The model is represented for the Equations (1)-(4). These
equations represent the balances of biomass (X), substrate
(S), dissolved oxygen (Co ), and recycled biomass (Xr ).
dX(t)
Fig. 1. Estimation and control scheme = µX(t) − D(t)(1 + r)X(t) + rD(t)Xr (t) (1)
dt
The rest of the paper is as follows. In Section 2, materials
and methods are presented in this section the mathema- dS(t) µ
tical model, state estimator design, and design of model = − X(t) − D(t)(1 + r)S(t) + D(t)Sin (2)
dt Y
predictive controller are presented. In Section 3 the re-
sults and discussion are presented. Final comments are
presented in Section 4. dCo (t) µ
= −ko X(t) − D(t)(1 + r)C(t) + D(t)Cin
dt Y (3)
2. MATERIALS AND METHODS +KL a(Cs − C(t))
102
CHAPTER 4. BIOPROCESSES
103
CHAPTER 4. BIOPROCESSES
The tuning matrices associated with the model uncertainty 0.02h−1 ≤D ≤ 0.15h−1
and the measurement noise, Qk−1 and Rk , respectively, (21)
were set as 0h−1 ≤w ≤ 300h−1
104
CHAPTER 4. BIOPROCESSES
changes between upper and lower restriction, as occurs Figures 5 and 7 shown the state behavior with a dis-
with the dilution rate (D). turbance of 300 mg mg
L and 150 L , respectively. The first
figure presents the biomass behavior, the second figure
the substrate concentration behavior, the third figure the
250
dissolved oxygen concentration behavior, and the final
X (mg/L)
50 300
X (mg/L)
0 100 200 300 400 500 State estimation
8 200 Controlled state
Co(mg/L)
S (mg/L)
2
0 100 200 300 400 500 100
500
X (mg/L)
50
400 0 100 200 300 400 500
8
r
Co (mg/L)
300
0 100 200 300 400 500 6
Time (hours) 4
2
0 100 200 300 400 500
600
Fig. 3. State behavior with a disturbance of in the 250 mg
Xr (mg/L)
L 500
substrate concentration in the feed stream at the time
400
50h
300
0 100 200 300 400 500
Time (hours)
0.2
0.1 L in the
substrate concentration in the feed stream at the time
50h
0
0 100 200 300 400 500
400
w (h )
−1
200 0.2
D (h−1)
0 0.1
0 100 200 300 400 500
260 0
0 50 100 150 200 250 300 350 400 450 500
S (mg/L)
240
220 Disturbance 400
in
180 200
0 100 200 300 400 500
Time (hours)
0
0 50 100 150 200 250 300 350 400 450 500
250
The behaviors of the states with the different disturbances Disturbance
200 Disturbance estimation
applied are similar (Figures 5 - 8). The biomass in the 150
reactor and in the recycle stream in open-loop present a 0 50 100 150 200 250 300 350 400 450 500
Time (seconds)
steady state value above than the steady state value of the
closed-loop system. The substrate state presented a steady
state value in open-loop below than the presented by the Fig. 6. Control actions and estimation disturbance
controlled system. The behavior of the state substrate in (300 mg
L )
the closed-loop is due to that is presents lower production
of biomass and recycled biomass, so that there is an insu-
fficient concentration of microorganisms to be responsible In Figures 7 and 8, the estimator converges faster if
for degrading the substrate. compared to previous scenarios.
105
CHAPTER 4. BIOPROCESSES
Controlled state
200
Open loop state
strategy could improve the oscillations in the control
100
actions, causing these have a softer behavior.
0 100 200 300 400 500
100
REFERENCES
S (mg/L)
50
Åmand, L. (2011). Control of aeration systems in activated
0 sludge processes–a review. IVL Swedish Environmental
0 100 200 300 400 500 Research Institute/Department of Information Technol-
8
ogy, Uppsala University, Uppsala, Sweden.
Co (mg/L)
6
Åmand, L., Olsson, G., and Carlsson, B. (2013). Aeration
4 control - A review. Water Science and Technology, 67,
2
0 100 200 300 400 500 2374–2398. doi:10.2166/wst.2013.139.
400 Amicarelli, A., Di Sciascio, F., Toibero, J.M., and Alvarez,
X (mg/L)
300
H. (2010). Including dissolved oxygen dynamics into
the Bt δ -Endotoxins Production process model and
r
200
0 100 200 300 400 500
its application to process control. Brazilian Journal of
Time (hours) Chemical Engineering, 27(01), 41–62.
Atia, D., Fahmy, F., Ahmed, N., and Dorrah, H. (2011).
Design and Control Strategy of Diffused Air Aeration
Fig. 7. State behavior with a disturbance of 150 mg
L in the System. World Academy of Science, Engineering and
substrate concentration in the feed stream at the time Technology, (3), 666–670.
50h Davison, E. and Smith, H. (1971). Pole assignment in lin-
ear time-invariant multivariable systems with constant
disturbances. Automatica, 7(4), 489–498.
0.2 Garcia, C.E., Prett, D.M., and Morari, M. (1989). Model
predictive control: theory and practice—a survey. Au-
D (h−1)
100
Simon, D. (2006). Optimal state estimation: Kalman, H
0 100 200 300 400 500 infinity, and nonlinear approaches. John Wiley & Sons.
Time (hours)
FINAL COMMENTS
106
CHAPTER 5
Abstract: This paper presents the design and implementation of an automated control system
for a hydroponic testbed for “indoor domestic environments”. This testbed is an indoor research
system in which automatic control techniques, low-cost computational micro-controllers, and
monitoring technologies are being explored for possible applications and deployments in low
research budget and limited technology access. The goal of the system is to use automation
techniques to improve crop productivity in scenarios in which even water is a scarce resource.
The experimental low-cost facility at this point is focusing on control variables such as irrigation
time, luminosity, temperature, and relative humidity.
108
CHAPTER 5. GREEN PROCESS CONTROL
Micro-controllers are used to recollect data from the sen- lack of oxygen is called anoxia, this causes that the root
sors, process information (using pre-programmed func- development begins a premature degradation of the roots,
tions) and provide an output to perform an action (e.g. reaching, in extreme cases, the death of plants (Mustroph
actuators). In this work, sensors provide information about and Albrecht (2003)).
the environment in which the crops are, this allows to
collect data about the different crop phases of experimen- 2.2 Description of the AILCG
tation and provide an action through the actuators.
The proposed AILCG considers a “Light-Emitting Diode The Figure 1 shows physical design scheme of the built
(LED)” control, the sun photo-period, different irrigation AILCG as it can be seen, the structure is divided into
techniques control, water aeration control, and climate five different tray-levels, each of them has a purpose,
condition control to provide the needed tools to perform function and/or different works. The tray-levels are Level
research and hence obtain a healthy plant growing. 0, Level 1, Level 2, Level 3 and Level 4. The AILCG is
composed of a rack where there are two different types
This paper is organized as follows, Section 2 is about the of hydroponic crops and a piping circuit for recirculating
AILCG as the whole, in Section 3 the control system the nutrient solution through the system. At the Level 0
is described, Section 4 results are shown and finally,
conclusions are stated.
109
CHAPTER 5. GREEN PROCESS CONTROL
Aeroponic Cultivation Bed. The cultivation beds, at The operated sensors in the control system are: “DHT11
tray-levels 2 and 3, have ten nebulizers each of one, which and DHT22 (DHTXX)” which provides relative humid-
are responsible for spraying the solution to the plant roots. ity and temperature in the environment, DS18B20 is a
The solution flows through the pipes and is delivered to waterproof sensor to provide water solution temperature.
the crop. Also, there is an outlet to recirculate the solution. DS18B20 and DHTXX sensors use the 1-wire protocol,
The aeroponic cultivation bed was made of styrofoam which is a communication protocol designed by Dallas
(expanded polystyrene). A sketch of aeroponic cultivation Semiconductor, it is based on a bus, a master and several
bed is shown in Figure 2. slaves one data line (Zhang et al. (2010)).
The 1-Wire protocol specifies a master and one or more
Hydroponic Cultivation Bed. This cultivation bed, at
slaves which send information to a single data bus, a
Level 1, is designed to get the solution of the aeroponic
resistor is connected to +5V DC to “pull up” the signal.
crops (tray-levels 2 and 3) that they did not use. The
nutrient solution which leaves from the cultivation bed
is renovated in the reservoir where it will be oxygenated 3.1 Light Control System
again, hence the anoxia in crops is avoided. This kind of hy-
droponic system is based on water culture system, so cul- LED lights are handled by a microcontroller, the input
tivation beds are floating on a base of Styrofoam allowing power of the LEDs is logged into a database, for this
that plant roots are in contact with the nutrient solution we decided to set an on/off control using a Pulse Width
every moment. Level 1 it has a transparent polypropylene Modulation (PWM). The light period may be adjusted by
container. A sketch of hydroponic cultivation bed is shown time or by using a photo-resistance. The time-based light
in Figure 3. period might be set up from 7:00am to 7:00pm or any other
time range, as shown in Figure 5.
2.4 AILCG Design On the other hand, given the analog value of the photo-
resistance, the microcontroller decides whether to turn on
The rack has four metal shelves panels and four metal or off the LEDs, this strategy allows the microcontroller
beams, the metal beams forms two upright frames which perceive the sun photo-period by using a photo-resistance
are joint to the shelves using screws. to receive any light beam.
110
CHAPTER 5. GREEN PROCESS CONTROL
111
CHAPTER 5. GREEN PROCESS CONTROL
112
CHAPTER 5. GREEN PROCESS CONTROL
equipment, reducing the humidity between 10:00am and evaluated in hydroponic lettuce production. Computers
6:00pm. and Electronics in Agriculture, 84, 53–61.
Harper, C. and Siller, M. (2015). OpenAG: A globally
5. CONCLUSION distributed network of food computing. IEEE Pervasive
Comput., 14(4), 24–27.
A contribution of this article is the automation of a Low- Jr., J.B.J. (2004). Hydroponics: A Practical Guide for the
Cost Indoor Greenhouse. The plants growing in the present Soilless Grower. CRC Press.
AILCG have been more efficient by providing an adequate Lieth, J.H. and Oki, L.R. (2008). Irrigation In Soilles
environment using micro-controllers, respect to a manual Production. In Soilless Culture, 117–156. Elsevier BV.
control. Also, we have identified and worked with four Mohr, H. and Schopfer, P. (2010). Plant Physiology.
basic components that influences the plant development, Springer.
which are: Muñoz, P., Montero, J., Antón, A., and Giuffrida, F.
(1999). Effect of insect-proof screens and roof open-
• Irrigation control. ings on greenhouse ventilation. Journal of Agricultural
• Lighting control. Engineering Research, 73(2), 171–178.
• Water condition control. Mustroph, A. and Albrecht, G. (2003). Tolerance of crop
• Climate condition control. plants to oxygen deficiency stress: fermentative activity
From the experimental point of view, the data gathered and photosynthetic capacity of entire seedlings under
from sensors and actuators allows to measure, make obser- hypoxia and anoxia. Physiol Plant, 117(4), 508–520.
vations and make decisions. Finally, this AILCG provides Parker, R. (2009). Plant and Soil Science: Fundamentals
a testbed to perform research, allowing to design new and and Applications. DELMAR.
better control strategies to indoor greenhouses. Putter, E. and Gouws, J. (1996). An automatic con-
troller for a greenhouse using a supervisory expert sys-
5.1 Future Work tem. In Proceedings of 8th Mediterranean Electrotech-
nical Conference on Industrial Applications in Power
Systems, Computer Science and Telecommunications
Today we are working with different ICT architectures,
(MELECON 96). IEEE.
which are not part of the current article, once these
Schempf, H., Graham, T., Fuchs, R., and Gasior, C.
architectures are validated they are going to be published
(2001). Automated container-handling system for con-
on a next paper.
tainer production nurseries. In Proceedings 2001 ICRA.
The next step is to implement an Internet of Plant (IoP) IEEE International Conference on Robotics and Au-
node consisting of a Personal Food Computer (PFC) tomation (Cat. No.01CH37164). IEEE.
(Harper and Siller (2015)), with the aim to contribute and Son, K.H. and Oh, M.M. (2013). Leaf shape, growth, and
share knowledge and experience to its open forum and antioxidant phenolic compounds of two lettuce cultivars
community. grown under various combinations of blue and red light-
emitting diodes. HortScience, 48.
REFERENCES Thompson, H., Langhans, R.W., Both, A.J., and Albright,
L.D. (1998). Shoot and root temperature effects on
Badamasi, Y.A. (2014). The working principle of an lettuce growth in a floating hydroponic system. Journal
arduino. In 2014 11th International Conference on of the American Society for Horticultural Science.
Electronics, Computer and Computation (ICECCO). UN (2004). World population to 2300. Technical report,
IEEE. United Nations.
Bhutada, S., Shetty, S., Malye, R., Sharma, V., Menon, Zhang, C., Feng, X., and Li, L. (2010). The key tech-
S., and Ramamoorthy, R. (2005). Implementation of a nologies of a distributed temperature monitoring system
fully automated greenhouse using SCADA tool like Lab- based on 1-Wire bus. In 2010 8th World Congress on
VIEW. In Proceedings, 2005 IEEE/ASME International Intelligent Control and Automation. IEEE.
Conference on Advanced Intelligent Mechatronics., 741–
746. IEEE.
Boulard, T. and Draoui, B. (1995). Natural ventilation of
a greenhouse with continuous roof vents: Measurements
and data analysis. Journal of Agricultural Engineering
Research, 61(1), 27–35.
Casas-Carrillo, R., Begovich, O., Ruiz-Leon, J., and Siller,
M. (2015). Characterizing the behavior of “greenhouse
climate”: a LabVIEWTM application. In 2015 12th In-
ternational Conference on Electrical Engineering, Com-
puting Science and Automatic Control (CCE). IEEE.
Chang, C.L. and Chang, K.P. (2014). The growth response
of leaf lettuce at different stages to multiplewavelength-
band light-emitting diode lighting. Scientia Horticul-
turae, 179, 78–84.
Domingues, D.S., Takahashi, H.W., Camara, C.A., and
Nixdorf, S.L. (2012). Automated system developed
to control pH and concentration of nutrient solution
113
CHAPTER 5. GREEN PROCESS CONTROL
Abstract
This work deals with the modeling and climate control of greenhouses. In order to achieve
these objectives, this paper extends previously reported greenhouse modeling methodologies
and proposes a control strategy. More precisely, the model methodology is based on a modular
technique using timed continuous Petri nets, and the proposed discrete control law is based
on the contribution degree of controllable transitions. The model and control methodology
aim to contribute by simplifying the manipulation of greenhouse variables. Thus, the possible
human errors introduced during the modeling stage or operation stage are reduced. A simulation
example is presented in order to illustrate the modelling methodology and the control design
and its application.
Keywords: Hybrid systems, Timed Continuous Petri Net, Temperature control, Greenhouse,
Modeling, Control, Product semantic, Contribution degree.
114
CHAPTER 5. GREEN PROCESS CONTROL
mal systems, (Júlvez and Boel, 2010) for traffic systems enabling degree depends on the continuous P N semantic.
and (Mahulea et al., 2012) for manufacturing systems. The firing of the enabled transition ti in a certain amount
αi ≤ enab(ti , m) leads to a new marking m0 = m +
In this paper, in order to represent the internal envi- αi C[P, ti ], where C is the incidence matrix, defined by
ronment (climate variables), the continuous part of the C = P ost − P re.
greenhouse is modeled by T CP N using product semantics
(T CP N − P S). This semantic has been widely used to If m is reachable from m0 by firing each enabled transition
model biological systems in (Heiner et al., 2008), (Silva ti in a certain amount σi ≤ enab(ti , m), then m = m0 +
and Recalde, 2002), (Baldan et al., 2011), (Tovany et al., C→−σ and it is named the fundamental equation where
2013), where the nonlinear part is introduced by the join →
−σ ∈ {R+ ∪ 0}|T | is the firing count vector.
transitions (Silva and Recalde, 2004), (Garcia-Malacara Definition 2. A P -invariant (P -semiflow) of a net N is a
et al., 2013). For these reasons, the incremental model rational-valued solution Y of the equation Y T · C = 0.
methodology from (Tovany et al., 2013) is used to rep-
Proposition 3. Let (N, m0 ) be a system, and let I be a
resent the temperature and humidity behaviour adding a ∗
thermal actuator to the greenhouse model. On the other P -invariant of N . If m0 → m0 , then I · m = I · m0 (Dessel
hand, an On/Off local control law is presented. In (Ross- and Esparza, 1995).
León et al., 2014) the contribution degree is defined as the Definition 4. A T -invariant (T -semiflow) of a net N is a
dot product between the marking error, i.e. the difference rational-valued solution of the equation C · X = 0.
between the required and the actual markings, and the Proposition 5. Let σ be a finite sequence of transitions
columns of the incidence matrix. Thus, if the result is a of a net N which is enabled at marking m. Then the
positive scalar value, the firing of these transitions con- → σ
P arikh vector σ is a T -invariant iff m → m (i.e., iff the
tributes to reduce the marking error for T CP N models occurrence of σ reproduces the marking m) (Dessel and
with big populations and all transitions are controllable. Esparza, 1995).
Based on this, the control strategy proposed by (Ross-
León et al., 2014) is used and applied to T CP N − P S The set of all reachable markings from m0 is called the
model with controllable and uncontrollable transitions. A reachability set and it is denoted by RS(N, m0 ).
simulation example is presented to illustrate the modelling
methodology, the control design and its application. A ContP N is bounded when every place is bounded, i.e.
∀p ∈ P, ∃bp ∈ R s.t m[p] ≤ bp at every reachable marking
The paper is organized as follows. In Section 2, basic con- m, and it is live when every transition is live (it can
cepts related to Timed Continuous Petri Nets (T CP N 0 s) ultimately be fired from every reachable marking).
systems and controllability concepts are presented. Section
Definition 6. A Timed Continuous Petri Net (T CP N )
3 presents a Petri net modeling procedure for greenhouses.
is a time-driven continuous-state system described by
Section 4 deals with the problem of reaching a required
the 3-tuple (N, λ, m0 ) where (N, m0 ) is a continuous
state and synthesizes a Lyapunov function for solving
this problem. Finally, Section 5 presents an example of a P N and the vector λ ∈ {R+ ∪ 0}|T | represents the
greenhouse model as a case of study and the temperature transition rates that determine the temporal evolution of
control using the proposed control law. the system. Transitions fire according to certain speed,
which generally is a function of the degree rates and
the instantaneous marking. Such function depends on the
2. FUNDAMENTALS semantics associated to the transitions. Under infinite
server semantics the flow (the transitions firing speed,
In this section, we provide the basic background on Petri denoted as f (m)) through a transition ti is defined as the
nets (P N ) and Timed Continuous Petri Nets (T CP N ) un- product of the rate, λi , and enab(ti , m), the instantaneous
der product server semantics (P S). Notice that T CP N 0 s enabling of the transition, i.e., fi (m) = λi enab(ti , m) =
under P S belong to the class of positive Nonlinear Systems m[p ]
(Murata, 1989), (Silva and Recalde, 2002). λi minp∈• ti P re[pjj,ti ] .
115
CHAPTER 5. GREEN PROCESS CONTROL
Tg
Heating Pvar Pvar
Pipes Pd Pd
116
CHAPTER 5. GREEN PROCESS CONTROL
117
CHAPTER 5. GREEN PROCESS CONTROL
a control law Ic (τ ); 0 ≤ τ ≤ τf feasible in the T CP N such Theorem 11. (Ross-León et al., 2014) Let (N, m0 , λ) be a
that m(τss ) = mr and Ic (τss ) = Icr , ∀τss ≥ τf . live and bounded T CP N system where all transitions are
controllable. Then, the control law
The greenhouse control problem is stated as a RCP ,
where the equilibrium point is given by target indoor
temperature and the computed Ic . Since the set Eg = 1 if Ψi > 0
Ick = (10)
{a1 , a¯1 , ..., a¯n } is representing a set of events, then the 0 otherwise
control value of Ic in Equation (6) must be zero or one.
leads the initial marking m0 = m(0) to the required
Notice that the modules in the modeling methodology are marking mr ∈ int(TR ).
of type balance, generation or consumption module. Gen-
eration modules (solar radiation, condensation of moisture Proof. Let V (•) be the function
on walls and roof) increase the indoor temperature, while
consumption modules reduce it. For fixed environmental 1 T
conditions if the greenhouse heaters are turned on and V (e) = e (τ )e(τ ). (11)
2
coolers are turned off, then the indoor temperature will
reach a maximum value, named Tmax ; otherwise if the We claim that V (e) is a Lyapunov function, i.e. it is
greenhouse heaters are turned off and coolers are turned positive definite, its derivative is negative definite and it
on, then the indoor temperature will reach a minimum fulfils V̇ (x) = 0 only for x = 0, as shown below.
value, named Tmin . Thus the indoor temperature can be Clearly, (11) is positive definite. Now, the differentiate of
controlled in the range TR = [Tmin , Tmax ]. V (e) is
The control law solving the RCP problem will be obtained
based on the results from (Ross-León et al., 2014), which V̇ (e) = eT (τ )ė = −eT (τ )CIc f (τ ). (12)
considers the control problem in live and bounded T CP N ,
with controllable transitions and infinite server semantic. Since mr ∈ int(TR ), then for every time τ there exists a
In this work, the implementation of the control strategy to transition firing vector σ(τ ) > 0 such that e(τ ) = mr −
nonlinear hybrid systems is presented. For this reason, the m(τ ) = Cσ(τ ). Thus
product semantics is used to represent the non linearity of
the system.
eT (τ )e(τ ) = eT (τ )Cσ(τ ) > 0 (13)
Definition 8. Let (N, m0 , λ) be a live and bounded T CP N
system and mr ∈ int(TR ) be the required target marking. for e(τ ) 6= 0. Rewriting (13) we have
Then the marking error is computed as
e(τ ) = mr − m(τ ). (8) eT (τ )Cσ(τ ) = eT (τ )[σ1 c1 + σ2 c2 + · · · + σ|T | c|T | ]
(14)
Definition 9. Let (N, m0 , λ) be a live and bounded T CP N = σ1 eT (τ )c1 + σ2 eT (τ )c2 + · · · + σ|T | c|T | > 0.
system. The Contribution Degree Ψk (τ ) of a transition Also σi depends on τ . Since ∀i, σi ≥ 0, then there exists
tk of N is defined as at least one ck such that σk eT (τ )ck > 0 for e(τ ) 6= 0.
Notice that σk > 0 is strictly positive, thus eT (τ )ck > 0
is also strictly positive. Then from (10) we have that
Ψk (τ ) = eT (τ )ck (9) Ic (k) = 1. Substituting these values in (12), then V̇ (e)
could be zero or negative. If Equation (12) is negative,
where ck is the k-th column of the incidence matrix C. then the derivative of the Lyapunov function is negative,
decreasing the system error.
Notice that the contribution degree is representing the pro-
jection (dot product) of how the transition removes/adds However, Equation (12) could be zero when the error is
tokens (column ck ) over the error. Thus, it provides infor- different from zero iff f (τ ) = 0 or Ic f (τ ) = 0 or Ic f (τ ) 6= 0
mation about how much a transition is capable of decreas- is in the kernel of C or CIc f (τ ) is in the kernel of eT . Now,
ing the error marking. we will show that these cases are not possible.
Lemma 10. (Ross-León et al., 2014) Consider the defini- In the first case if f (τ ) = 0 means that the T CP N system
tion of the marking error and let (N, m0 , λ) be a live and is blocked, then it is not live; a contradiction.
bounded T CP N system and v a T -semiflow of N . Then
P
tk ∈|v| vk Ψk (τ ) = 0.
In the second case Ic f (τ ) = 0, then the flow of transitions
is being blocked by Ic since its contribution degree is
Proof. The result follows from negative or the contribution degree is positive and Ic is not
blocking, but the flow of these transitions is zero. Since the
X T CP N is live and bounded, then by Lemma 10 there are
vk Ψk (τ ) = eT (τ )Cv = 0. positive and negative contribution degrees. Then there is
tk ∈|v|
an Ic (k) = 1 with its corresponding flow fk = 0. Thus, at
least one input place to tk has zero tokens, meaning that
From Lemma 10 we have that the sum of every contribu- the marking m is in the border, i.e. it is not in int(TR ), and
tion degree in a given T -semifow is zero. Therefore, there V̇ = 0. Then N is not evolving at all. Since the error is not
must be both transitions with positive and transitions with zero, there exists a transition tj1 such that Ic (τ )(j1 ) = 1,
negative contribution degrees in it. and tj1 cannot be fired, thus at least one input place pk1 to
118
CHAPTER 5. GREEN PROCESS CONTROL
Temperature (°K)
positive and Ic f (τ ) cannot form a T -semiflow at time τ . 310
A contradiction. 305
In the fourth case CIc f (τ ) is in the kernel of eT , but this 300
is not possible because Ic was chosen to enforce (14) to
be positive by selecting only the positive terms, thus even 295
if fk is not equal to σk , it is positive and (12) cannot be
290
zero.
285
Thus V̇ (e) = −eT (τ )CIc f (τ ) < 0 in all the cases, 0 2 4 6 8
time (hrs)
therefore (11) is a Lyapunov function. Then the error
is asymptotically stable and the control (10) leads the
marking from m0 to mr . Figure 4. Temperature Tg indoor greenhouse without con-
trol.
5. CASE OF STUDY obtaining an optimal environment for crops production in
greenhouses depends on the ability to control the temper-
In this section the case of study is presented. We use ature inside; therefore, we will show that the previously
greenhouse parameters values from (Van Straten et al., proposed control law is able to control the temperature
2010) as in Table 2 and Table 3, but also, parameters can Tg .
be estimated by a regression algorithm (Pérez-González Now, in order to apply the control law strategy, it is
et al., 2014). The greenhouse model is constructed with considered that the windows and fans are activated with
the following actuators: windows, fans, humidifier and pipe the same control signal Ic1 , in the case of the humidifier,
system (see Fig. 1). Using these actuators and parameters the control signal was Ic2 , similarly the control signal Ic3
the T CP N − P S greenhouse model is built according to activate the pipe system. The system is simulated using
Section 3. Notice that, the windows, fans and pipe system the control strategy and establishing a set point for the
are represent by balance T CP N modules. However, the temperature Tg (marking required m1 = 320o K) and
humidifier is represented by a consumption T CP N mod- it is considered that the other climate variables are not
ule. restricted.
This modeling methodology was implemented in a Graph- Fig. 5 presents the temperature Tg behavior inside the
ical User Interface (GU I) programed in Matlab
. c Each greenhouse using the proposed control strategy. Notice
elementary Petri net module is represented by an element that the simulation results show that with the control
(icon) in the interface (for instance, actuators, perturba- strategy the requirements are reached. Fig. 6 shows the
tions) that can be added in a modular way while the evolution of the proposed control law for the fans and
T CP N model is built. Therefore, the greenhouse-GU I windows, humidifier and pipe system.
provides facilities to operate the greenhouse system; any
operator can easily manage the greenhouse, just by select-
6. CONCLUSIONS
ing the actuator or sensor icon in the interface and link it
with another element.
This work addressed the problem of modeling and reach-
The temperature Tg inside the greenhouse is depicted in ing a required temperature environment in a greenhouse
Fig. 4; the system is simulated in a period of 8 hours. The system. The modeling methodology presented provides a
figure shows that the temperature is increasing during the graphical representation of climate variables which allows
day and decreasing at sunset. However, the problem of an easy understanding of their interaction; moreover, the
119
CHAPTER 5. GREEN PROCESS CONTROL
1
Systems, Man and Cybernetics, Part A: Systems and
I
0
0 2 4 6 8
Humans, IEEE Transactions on, 40(4), 686–697.
time (hrs) Mahulea, C., Seatzu, C., Cabasino, M., and Silva, M.
2 (2012). Fault diagnosis of discrete-event systems using
Pipe system continuous petri nets. Systems, Man and Cybernetics,
c3
1
I
120
CHAPTER 6
INTELLIGENT CONTROL
CHAPTER 6. INTELLIGENT CONTROL
Abstract: One of the main focuses of humanoid robotics is bipedal locomotion which requires a series of
continuous phases, called gait cycle. Here is essential to maintain balance, compensate the dynamic of
the whole multivariable and nonlinear system. Despite good bipedal gait algorithms have been achieved
in various researches, performance is still not enough to be compared to biological systems. Several
research projects have tried to compensate this through methods in machine learning, to autonomously
generate gaits and stabilize robots. In this paper is firstly presented how a neuroevolution algorithm
achieves the stability of a biped robot simulation in the sagittal plane that starts in unstable position. Then
three different methods are proposed and analyzed in order to generate full body motions in any state
perceived by the robot, complying with the specified task by means of the motor skills learned through
neuroevolution together with the implementation of another feedforward neural network.
Keywords: Artificial Intelligence, Neuroevolution, Neural Network, Genetic Algorithm, Biped
Locomotion, Stability, Nonlinear Control, Inverted Pendulum.
biped robot in a phase of the gait cycle, restricted by the
1. INTRODUCTION AND BACKGROUND
sagittal plane.
Humanoid robotics has made great strides in the last 30 years While there are multiple methods in machine learning,
regarding the general control of all the robot limbs that allow neuroevolution algorithms are promising in this kind of
them to perform complex tasks. Research has focused on problems (Gomez & Miikkulainen, 2003). Research in these
human bipedal locomotion, because a machine with this algorithms has resulted in different methods in their
capability easily works in human environments, rather than a application as it is NEAT (Stanley & Miikkulainen, 2002a),
conventional robot equipped with wheels. HyperNEAT (Stanley et al., 2009) or TWEANNS (Stanley &
Miikkulainen, 2002b), among others. These modify the
However this presents great control challenges, caused by the topology of the neural network as it is produced the evolution
large number of variables involved to achieve gaits without of the individuals (Back et al., 1997).
falling in a non-ideal environment. Maintaining balance is a
fundamental and necessary requirement in implementing These methods have shown very good results in benchmarks
such systems (Siciliano & Khatib, 2008). such as the simultaneous stabilization of two inverted
pendulums, exceeding in performance to other alternatives in
Various approaches have been studied to generate
unsupervised learning, e.g. reinforcement learning (Gomez &
dynamically stable legged locomotion, such as the ZMP Miikkulainen, 1999).
(Zero Moment Point) (Vukobratović & Borovac, 2004).
Nevertheless, it is still difficult to find a robot that However, the classic implementation of neuroevolution
outperforms a biological system, leading this to research in algorithms, where is evolved a neural network of fixed
self-teaching machines. In this way, many projects have topology through genetic algorithms, can also have
focused on generating gaits in different kinds of robots using satisfactory results in several tasks related to robotics
machine learning (Tedrake et al., 2005; Manoonpong et al., (Siciliano & Khatib, 2008).
2007; Allen & Faloutsos, 2009; Clune et al., 2009; Yosinski
Every individual in the genetic algorithm starts in an initial
et al., 2011).
state and then when each iteration occurs it jumps to a new
Though it is essential to generate whole body motions, state, which will be the result of actions taken in the previous.
learned by the own robot, the focus of this paper is to
Thus the neural network acts as the agent of a reinforcement
understand that is also important the implementation of some
learning algorithm, in which for each state, is executed a
motor skills to achieve robot stabilization, in this case of a
particular action. However, while reinforcement learning
122
CHAPTER 6. INTELLIGENT CONTROL
knows the actions to be made in every state, neuroevolution extension of knee, legs, arms and the ankle of the supporting
provides an overview of the problem and consequently the foot (see Fig. 1).
neural network is not learned for the state, but for the
ultimate goal of the task.
When the optimum individual is found for the current task, a
series of states and actions occurs, but the neural network is
only functional for the evaluated situation and the states seen
by the agent, i.e. the neural network might not be useful for
other initial conditions or states (Whiteson, 2012).
In this regard, problems with large number of variables, such
as humanoid robots (Benbrahim & Franklin, 1997), generate
numerous states for which the neural network found, should
work. Therefore if the goal is to make a robot control,
Fig. 1. Sagittal plane and joints of the robot simulation for the
capable to respond to certain situations properly, even
unknown, it is required the generalization of what was implementation of stability control. The red lines represent
learned by the trained neural network. the links of the robot.
123
CHAPTER 6. INTELLIGENT CONTROL
124
CHAPTER 6. INTELLIGENT CONTROL
Fig. 5. Simulation of the bipedal robot in the sagittal plane, This method seeks to train a second neural network by
executing one stabilization process. The red lines represent backpropagation, its inputs are the 12 variables that define
the robot links and the blue line represents the inverted the states, and the outputs are the 267 weights and biases that
pendulum. As can be seen, the movement of the robot determine the neural network that ultimately generates the
maintains it vertical. actions for the robot. Therefore, it is intended that under a
known initial condition, a specific neural network executes
the right set of actions that achieve robot balance.
The neural network is made of 186 neurons in the hidden
layer, chosen arbitrarily, requiring with this an updating
process of 52347 weights and biases, leading to a slow
learning rate and high computational cost.
Furthermore, is not guaranteed for unknown initial conditions
that the neural network output defines correctly the 267
weights and biases for a neural network that achieves
stability, this is because the successful solutions of the
problems already executed, might be formed by very
different weights, even for similar cases.
Because of the high computational cost and the other reasons
exposed, this method was not tested.
Fig. 6. Change of angle θ of the inverted pendulum
representation. Here the reference is -180°, i.e. when the 4.3 Training of Neural Network with State-Action Data
pendulum is vertical, moreover the actions of the neural
network begin 10 iterations after the simulation is started, in When simulation is executed, the robot begins in an initial
this way the robot should recover from falling. state and through the actions made inside the environment,
passes to another state. In this way, although the neural
network is learned according to the overview of the genetic
The mere implementation of neuroevolution in a real robot is
algorithm, for each state executes an action that determines a
unpractical, because the learning process with the genetic
subsequent state and its sequence will define the success rate
algorithm, for a particular case, requires a large number of
of the process in the corresponding task.
tests, and the solution could be only useful for that situation.
Specifically, each simulation is executed for 3 seconds, in
In this sense, an offline learning of different cases in
300 iterations, thus having 300 different states and actions for
conjunction with the generalization of the motor skills and
every case evaluated. Therefore, in this method is intended to
combining it with some online learning could be very useful
train a second neural network with sets of state-action. A
in solving different control tasks on real robots. Therefore, to
diagram of this method is shown in Fig. 7.
increase neural network robustness and in order to achieve an
overall stability control, three approaches presented below This neural network, called neural network for robot control,
were proposed. is similarly formed to the executed in neuroevolution, with 12
input variables and 7 outputs, but now with 70 neurons in the
4.1 Retraining of the Neural Network hidden layer (see Fig. 8). Because the network is trained by
backpropagation with the Levenberg–Marquardt algorithm,
One of the main causes of the low robustness is because the based on the state-actions sets obtained in the neuroevolution
neural network was only trained for a particular case, process, the possibility of losing usefulness for different
therefore a possible solution is to retrain it again with the cases is reduced, being with this a data fitting problem.
genetic algorithm, but starting with an initial population
125
CHAPTER 6. INTELLIGENT CONTROL
126
CHAPTER 6. INTELLIGENT CONTROL
In order to generate a control useful for different states and Gomez, F. J., & Miikkulainen, R. (1999, July). Solving non-
then being able to produce general motor skills for the task, Markovian control tasks with neuroevolution. In IJCAI
three approaches were proposed. (Vol. 99, pp. 1356-1361).
Gomez, F. J., & Miikkulainen, R. (2003). Robust non-linear
The first method, which consisted in retraining the neural
control through neuroevolution. Computer Science
network by the neuroevolution algorithm based on the
Department, University of Texas at Austin.
previous case individuals, did not achieve good results,
Landau, L., & Lifshitz, E. (1988). urso de f sica te rica.
because the retraining process modifies the weights of the
Barcelona etc. evert .
network and is not functional again for the previous case.
Lenz, I., Lee, H., & Saxena, A. (2015). Deep learning for
The second method, in which was intended to get at the detecting robotic grasps. The International Journal of
output of a feedforward neural network the individuals that Robotics Research, 34(4-5), 705-724.
configured the neural network, that ultimately would give the Manoonpong, P., Geng, T., Kulvicius, T., Porr, B., &
actions, was not carried out due to the high computational Wörgötter, F. (2007). Adaptive, fast walking in a biped
cost and the limited guarantees that this method could find robot under neuronal control and learning. PLoS Comput
right individuals to perform the task. Biol, 3(7), e134.
Marion, . (2000). in mica de las part culas y sistemas.
In the third method, a neural network was trained by
Barcelona ditorial evert .
backpropagation with state-action datasets, obtained in the
Pastor, P., Hoffmann, H., Asfour, T., & Schaal, S. (2009,
tests made through neuroevolution, had promising results
May). Learning and generalization of motor skills by
being capable to output the right actions according to the
learning from demonstration. In Robotics and
state seen by the robot, even without being previously Automation, 2009. ICRA'09. IEEE International
learned. Conference on (pp. 763-768). IEEE.
It is concluded that the third method is the most appropriate Pastor, P., Kalakrishnan, M., Righetti, L., & Schaal, S. (2012,
approach to generate the control of this system and other kind November). Towards associative skill memories. In
of tasks through full body motions. Humanoid Robots (Humanoids), 2012 12th IEEE-RAS
International Conference on (pp. 309-315). IEEE.
This last approach allows the system from a simulation, learn Peters, J. R. (2007). Machine learning of motor skills for
by itself according to its surrounding environment, being able robotics (Doctoral dissertation, University of Southern
later to generalize a proper behavior to achieve a specific California).
task, facilitating its implementation on the real robot. Schaal, S. (1999). Is imitation learning the route to humanoid
In this way, research will continue around a larger number of robots?. Trends in cognitive sciences, 3(6), 233-242.
tests to strengthen control and then assess its applicability in Siciliano, B., & Khatib, O. (2008). Biped Robots in the ZMP
other types of robotic systems. Scheme. Handbook of robotics, Springer. ISBN: 978-3-
540-23957-4.
In addition the main focus of this research is to develop such Stanley, K. O., & Miikkulainen, R. (2002a). Evolving neural
controls in more complex simulations, with the system’s networks through augmenting topologies. Evolutionary
behavior in a three dimensional environment and with the full computation, 10(2), 99-127.
dynamics of the robot, and thus being able to be later Stanley, K. O., & Miikkulainen, R. (2002b). Efficient
extrapolated to a real robot. evolution of neural network topologies. In Evolutionary
Computation, 2002. CEC'02. Proceedings of the 2002
REFERENCES Congress on (Vol. 2, pp. 1757-1762). IEEE.
Stanley, K. O., D'Ambrosio, D. B., & Gauci, J. (2009). A
Allen, B. F., & Faloutsos, P. (2009). Evolved controllers for hypercube-based encoding for evolving large-scale
simulated locomotion. In Motion in games (pp. 219- neural networks. Artificial life, 15(2), 185-212.
230). Springer Berlin Heidelberg. Tedrake, R., Zhang, T. W., & Seung, H. S. (2005, June).
Back, T., Fogel, D. B., & Michalewicz, Z. (1997). Handbook Learning to walk in 20 minutes. In Proceedings of the
of evolutionary computation. IOP Publishing Ltd.. Fourteenth Yale Workshop on Adaptive and Learning
Benbrahim, H., & Franklin, J. A. (1997). Biped dynamic Systems (Vol. 95585).
walking using reinforcement learning. Robotics and Vukobratović, M., & Borovac, B. (2004). Zero-moment
Autonomous Systems, 22(3), 283-302. point—thirty five years of its life. International Journal
Clune, J., Beckmann, B. E., Ofria, C., & Pennock, R. T. of Humanoid Robotics, 1(01), 157-173.
(2009, May). Evolving coordinated quadruped gaits with Whiteson, S. (2012). Evolutionary computation for
the HyperNEAT generative encoding. In Evolutionary reinforcement learning. In Reinforcement Learning (pp.
Computation, 2009. CEC'09. IEEE Congress on (pp. 325-355). Springer Berlin Heidelberg.
2764-2771). IEEE. Yosinski, J., Clune, J., Hidalgo, D., Nguyen, S., Zagal, J., &
Goldberg, D. E., & Deb, K. (1991). A comparative analysis Lipson, H. (2011). Evolving robot gaits in hardware: the
of selection schemes used in genetic algorithms. HyperNEAT generative encoding vs. parameter
Foundations of genetic algorithms, 1, 69-93. optimization. In Proceedings of the 20th European
Goldberg, D. E., & Holland, J. H. (1988). Genetic algorithms Conference on Artificial Life (pp. 890-897).
and machine learning. Machine learning, 3(2), 95-99.
127
CHAPTER 6. INTELLIGENT CONTROL
Palabras Clave: Control Difuso, Ósmosis Inversa, Planta Multivariable, Educación en Control.
128
CHAPTER 6. INTELLIGENT CONTROL
controlador PI. Los resultados son simulaciones con uso multimedia para eliminar sólidos en suspensión, bacterias,
de un software basado en DMC llamado ONLINE que virus y coloides. Asimismo, se lleva a cabo el ajuste de pH,
actuó como caja negra, donde se variaron parámetros cuyo fin es la precipitación de carbonatos en el agua de
como los horizontes de predicción y control, ası́ como los alimentación y también la reducción de la conductividad
pesos del controlador. Otro trabajo importante propone del agua permeada. La etapa de presurización se lleva a
Assef (Assef et al., 1997), quien desarrolla un controlador cabo cuando el agua pre-tratada fluye a través de la bomba
predictivo basado en modelo (CMPC) considerando un de alta presión; mediante la variación de la velocidad del
sistema MIMO para una planta de OI de 4 salidas (flujo, motor de la bomba es posible variar la presión del agua
conductividad, presión en la membrana y pH de entrada) y que fluye a través de ella. La separación de iones se realiza
2 entradas (flujo de agua salobre rechazada y flujo de ácido dentro de las membranas de OI.
de entrada). La identificación fue realizada a partir de un
Estas membranas inhiben el paso de sales, pero permiten
estado estacionario, aplicando escalones a cada entrada y
el paso de agua y como resultado se tiene un flujo de agua
registrando las respuestas en cada una de las salidas. Para
la implementación del control también se trabajó con un permeada y otro de salmuera. Dado que las membranas no
sistema de tipo caja negra, el cual permitı́a agregar restric- ofrecen un rechazo de sales de 100 %, es usual que el agua
ciones a las variables de entrada. Los resultados mostraron permeada tenga algunas sales disueltas. La última etapa
un incremento del rendimiento en flujo desalinizado; sin es la de almacenamiento y consiste en el acopio del agua
embargo, el software usado solo permite realizar cambios producto y de la salmuera en tanques.
en los pesos y horizontes.
En la actualidad, la creciente demanda en consumo de
agua desalinizada, plantea el requerimiento de sistemas de
control para unidades de OI que permitan maximizar el
rendimiento en términos de cantidad de flujo y calidad
del agua permeada, ası́ como minimizar el consumo de
energı́a. En este caso, resultarı́a conveniente el uso de un
sistema de control con el cual se logren los requerimientos
mencionados y que en la práctica resulte de fácil diseño
e implementación. En este trabajo se propone una meto-
dologı́a, la cual permite diseñar e implementar el sistema
de control para una unidad de OI empleando el control
difuso como una alternativa de aplicación práctica que no
está basada en modelos matemáticos.
En la sección 2, se describe una unidad experimental de OI.
La metodologı́a de diseño del sistema con control difuso Fig. 1. Unidad de OI del Laboratorio de Control y Auto-
propuesta es desarrollada en la sección 3, donde también matización de la PUCP.
se describe el diseño del sistema con control predictivo
GPC. En la sección 4, el desempeño del sistema de control Las variables controladas en este proceso son el flujo (F ) y
con controladores difusos propuesto se comparará con la conductividad (C ) del permeado. Las variables de entra-
los alcanzados por otros controladores (GPC y PI). Las da del proceso son aquellas que mediante su manipulación
conclusiones son descritas en la sección 5. se pueda alcanzar los valores deseados de las variables de
salida. En este proceso las variables de entrada son la
2. DESCRIPCIÓN DE UNA PLANTA PILOTO DE presión (P ), con la que el agua salina ingresa a la mem-
DESALINIZACIÓN POR OI brana y que es manipulada con el variador de frecuencia
de la bomba y, el potencial de hidrógeno (pH ) del agua de
La planta de desalinización por OI objeto de estudio es una alimentación, que es manipulada variando la cantidad de
planta piloto que se encuentra situada en el Laboratorio aditivo dosificado en la etapa de pre-tratamiento. La plan-
de Control y Automatización de la PUCP (ver Fig. 1), en ta piloto cuenta con diversos instrumentos como sensores
esta se pueden realizar ensayos experimentales de labora- de P , pH , F , C e incluso sensores de temperatura. Un
torio aplicados a la investigación. Esta planta cuenta con diagrama funcional del proceso descrito se puede observar
componentes básicos que permiten la desmineralización en la Fig. 2.
y separación de sólidos disueltos del agua aplicando una
presión superior y en sentido inverso a la presión osmótica. Mediante ensayos experimentales en estado estacionario en
Los cuatro procesos fundamentales que se utilizan son: esta unidad de OI, se determinaron los rangos lineales de
pre-tratamiento, presurización, separación por membrana las variables, los cuales se muestran en la Tabla 1.
y almacenamiento. Es decir, su construcción se basó en
etapas en las cuales se eliminan los componentes orgánicos Tabla 1. Rango de operación lineal de las
y las sales presentes en el agua de alimentación, utilizando variables del bastidor de OI.
la tecnologı́a de OI.
Variable Rango Lineal
El pre-tratamiento tiene como objetivo la preparación Flujo, gpm 0.85-1.25
del agua de alimentación (extraı́da de un tanque) de tal Presión, psi 800-1000
manera que sea compatible con la membrana de OI. Esto Conductividad, µS/cm 400-450
es, la aplicación de filtros UV, filtros de carbón y filtros pH 6-7
129
CHAPTER 6. INTELLIGENT CONTROL
3.1 Diseño con Control Difuso Fig. 3. Sistema de Control con Control difuso propuesto
para una unidad de OI.
El uso del control difuso se presenta como alternativa in-
teresante en el diseño del sistema de control de un bastidor
de OI de una planta desalinizadora, debido a que este tipo respecto del tiempo (deF /dt). Posteriormente se definen
de control presenta la caracterı́stica principal de no utilizar 3 funciones de membresı́a sobre el rango de variación del
un modelo del proceso durante el diseño del mismo, sino flujo volumétrico del permeado, su razón de cambio y la
parte de conocimiento experimental previo (experiencia variación de la presión del agua de alimentación. Estas
del operador) para desarrollar una base de reglas con la funciones de membresı́a se refieren al grado de pertenencia
cual se pueda mantener las variables de control en las con respecto a un valor lingüı́stico. La cantidad de las mis-
referencias deseadas. La estrategia de control propuesta mas fue suficiente pues la variable de flujo se caracteriza
usa dos bloques difusos, donde la variable elaborada por por ser una variable rápida cuando se controla mediante la
el primer controlador difuso es una de las entradas del se- presión. De acuerdo a la demanda de la planta objeto de
gundo controlador difuso (ver Fig. 3). Con esta estrategia, estudio los rangos determinados fueron los siguientes: ±0.1
gpm para el error en el flujo volumétrico del permeado,
es posible mantener la variable de conductividad eléctrica
±0.5 gpm/s para la razón de cambio y ±5 psi para la
del permeado (C ) en la referencia deseada tomando en
presión de alimentación.
consideración el efecto que tiene la manipulación de la
variable presión en el agua de alimentación, durante el El segundo controlador se diseña considerando además la
control de la variable flujo volumétrico del permeado (F ). influencia de la presión en el agua de alimentación, elabo-
Las señales de entrada del controlador son los errores en rada por el primer controlador, como variable de entrada
el flujo y conductividad (eF y eC ) producidos luego de adicional. Se definen, entonces, 5 funciones de pertenencia
la comparación de las variables medidas con los valores para las variables de entrada de este controlador. El rango
de consigna. Por otro lado, las señales de salida del de variación para la variable de entrada fue de ±10 µS/cm
controlador son las variaciones en la presión (∆P ) y el para la conductividad, y ±150 psi para la variable adicio-
pH (∆pH ) necesarios para alcanzar las referencias. nal. Sin embargo, debido a la fuerte interacción de esta
última, se definieron 7 particiones en variable de pH con
En un primer paso, se realiza el diseño del primer contro- un rango de variación de ±0.6. Seguidamente, se desarrolla
lador difuso. Para ello se establece como entrada adicional la base de reglas para mantener la variación en el flujo
la razón de cambio del flujo volumétrico del permeado con volumétrico del permeado (ver Tabla 2).
130
CHAPTER 6. INTELLIGENT CONTROL
131
CHAPTER 6. INTELLIGENT CONTROL
4. COMPARACIÓN DEL DESEMPEÑO DEL menor tiempo en un caso (flujo) con uso del GPC y en
CONTROL DIFUSO, PREDICTIVO Y PI otro (conductividad) con uso del controlador difuso.
Para evaluar el desempeño de los controladores frente
A continuación, se muestran las pruebas de simulación al seguimiento de una referencia, se realizó un cambio
del sistema de control aplicando el controlador difuso, abrupto en la referencia del flujo de permeado de 1.2
el controlador GPC, ası́ como el controlador PI, con el gpm a 1.0 gpm, interpretándose como un instante donde
objeto de evaluar la efectividad de los mismos frente la producción es mı́nima. Este escenario implica la dis-
a cambios de referencia y presencia de perturbaciones minución de la energı́a entregada por la bomba de alta
durante condiciones reales de operación. Los parámetros presión, la que debe ser sopesada mediante el ajuste del
de los controladores fueron establecidos de tal manera valor del pH operando la bomba dosificadora. En la Fig. 7
que garanticen una rápida respuesta y un error en estado se observa las respuestas temporales, donde el PI presenta
estacionario nulo. un mayor sobreimpulso, mientras que con el controlador
Las ganancias integrativas del controlador difuso son: difuso disminuye. A su vez, el controlador GPC logra en
Ki =50 (lazo de control de flujo) y Ki =-0.0002 (lazo de menor tiempo alcanzar la señal de referencia, a costa de
control de conductividad). Para el caso del controlador un mayor esfuerzo de control.
GPC, los parámetros de diseño fueron: horizonte de control 1.25
Nu =3, horizonte de predicción N2 =10, R = diag(3·105 ,2), Referencia
Flujo (gpm)
Q = diag(0.5,103 ). El controlador PI con ganancias del 1.2 PI
Fuzzy
lazo de control de flujo de permeado: Kp =536 y ti =13.8, 1.15 GPC
ası́ mismo, un Kp =-0.05 y ti =108.6 para el lazo de control
de conductividad del permeado. 1.1
460
Referencia 408
440 PI
Fuzzy Referencia
420 GPC 406 PI
Fuzzy
400 GPC
404
0 50 100 150 200 250 300 0 100 200 300 400 500 600 700 800 900
Tiempo (seg) Tiempo (seg)
Fig. 6. Comparación de respuestas temporales para el Fig. 8. Comparación de respuestas temporales en la varia-
caso de variación simultánea en ambas señales de ble de conductividad eléctrica frente a una perturba-
referencia. ción tipo escalón en el flujo de -0.05 gpm.
132
CHAPTER 6. INTELLIGENT CONTROL
133
CHAPTER 6. INTELLIGENT CONTROL
Abstract: The modeling and autonomous control of truck-trailer mobile robots for trajectory following
are addressed. The robot kinematical model is analyzed and used for designing a positioning control
system based on linear controllers integrated in a fuzzy-logic approach. The design takes into account
both positioning performance and jack-knife avoidance. The results of robot positioning control are
extended to trajectory following for which a novel strategy is proposed applicable to general shape
desired trajectories. The effectiveness of the proposed methods are verified for linear, circular and
sinusoidal trajectories where the mobile robot converges to the desired trajectories, avoiding jack-knife
positions and with bounded values of input steering angle.
Keywords: Mobile robot, Fuzzy control, Linear-fuzzy integration, Trailer-type robot, Trajectory
following
controllers based on feedback linearization and backstepping
1. INTRODUCTION
techniques for positioning and path tracking control. The
Autonomous truck-trailer mobile robots are used in diverse differentially flat structure of mobile robots has been used for
fields in industry given their advantages in delivery and designing controllers in Rouchon et al, (1993) and Michalek
transportation applications. They can accomplish et al. (2012).
transportation tasks in a faster and cheaper way compared to
Fuzzy logic have been used in Cheng et al. (2009), Tanaka et
multiple individual mobile robots. Their transportation
al. (1994) and Kong et al. (1992) to propose diverse control
capacity increases with the number of trailers pulled or
strategies based on human driver experience expressed
pushed by a truck moving forward or backward.
through linguistic rules. Neural networks have been applied
However, truck-trailer mobile robots configure a complex, in Nguyen et al. (1989) and Moran (2004) for training
nonlinear, unstable, underactuated and nonholonomic system connectionist controllers based on static or dynamic learning
difficult to control, especially when moving backwards, algorithms. Other techniques based on genetic algorithms and
which have led to an intensive research work for analyzing their integration with neural networks and fuzzy systems,
their motion characteristics and autonomous control. The have been proposed in Kinjo et al. (2000). The control
most of work have been based on robot kinematical model schemes have been applied to robot positioning, backing up,
valid when the robot moves at low speeds without wheels linear and nonlinear trajectory following, path planning,
side-slipping. In this condition, the robot motion is parallel parking, jack-knife avoidance, robots formation
determined only by geometrical considerations independent among other control objectives.
of masses, inertias and road friction forces.
Diverse control strategies have been proposed to make the
truck-trailer robot autonomously moves describing desired
2. PROBLEM DEFINITION AND CONTROL STRATEGY
trajectories in complex environments. Approximate
linearization and feedback linearization of kinematic model The problem to be solved is the designing of an autonomous
equations were used in David et al. (2014), Altafini et al. control system for the positioning and trajectory following of
(2001) and Laumond et al. (1998) for designing stabilizing a truck-trailer mobile robot. The positioning control problem
controllers for robot positioning applicable to a limited range is shown in Figure 1: the mobile robot, starting from arbitrary
of operating conditions. Chained representation of robot initial positions, should achieve the desired position without
kinematical equations have been used in Sordalen et al. colliding with obstacles around the goal position.
(1993) and Fierro et al. (1995) for designing nonlinear
134
CHAPTER 6. INTELLIGENT CONTROL
Y Axis
Trailer (L 2)
Obstacle
12 2
Initial Position
1
Desired
Position
Truck (L 1)
Obstacle
X Axis
1 ……………………… (3)
Traction
wheels
Trailer 2 ……………………… (4)
Fig. 2. Truck-trailer mobile robot. and from equations (3), (4) and (5) the equation of 12 is
obtained:
12 ………………. (6)
Assuming that left and right wheels move in a similar
pattern, the truck-trailer robot can be modeled as a two Considering that the traction wheels moves at constant
articulated bars as it is shown in Figure 3. Coordinates backward speed =constant and defining the state vector
represent the position of trailer rear wheel, and and control vector as:
are the angles of truck and trailer respect to X axis,
is the angle of truck respect to trailer, is the …………………….. (7)
steering angle, and and are the lengths of truck
and trailer, respectively. Counter clockwise angles are ……………………. (8)
positive.
135
CHAPTER 6. INTELLIGENT CONTROL
equations (2), (4) and (6) can be represented by the following Zero:
affine nonlinear state-space equation: IF -90o -60o MF = 0
IF -60o 0o MF = /60 + 1
………………………. (9) IF 0o 60o MF = - /60 + 1
IF 60o 90o MF = 0
Positive Big:
The no inclusion of coordinate in state vector simplifies IF -90o 0o MF = 0
the controller design process without affecting the robot IF 0o 60o MF = /60
positioning and trajectory following capacity as far as it
moves only forward or only backward. IF 60o 90o MF = 1
..……… (10) As it was stated, control law (11) is valid when =Zero,
A full-state stabilizing control law for the linear system is but it does not apply for other linguistic values of . Then,
given by: different controllers will be designed for the other two
partitions (Negative Big and Positive Big).
…………… (11)
From equation (6), it is noted that there is an inverse
where coefficients , and are properly chosen so that relationship between 12 and : if increases,
the closed-loop linear system is stable. This control law is 12 decreases and vice versa. Also, it is clear that one of the
only valid around =0 and it is not guaranteed it will positioning control objectives is to keep truck-trailer angle
stabilize the mobile robot for other angles in the range at small values in order to avoid jack-knife positions. To
-90o to +90o, where the extreme values correspond to jack- do that, the following fuzzy reasoning is applied: if is
knife positions. To solve this problem, a rule-based fuzzy Positive Big, 12 should be negative for bringing toward
control will be applied: the range of variation of will be zero and, to achieve that, should be positive.
partitioned in three parts and a simple linear controller is Similarly, when is Negative Big, 12 should be positive
designed for each part. Afterwards, the three controllers are for bringing toward zero and, to achieve that,
integrated in a fuzzy-logic approach as a weighted sum of should be negative. This reasoning is summarized in the
their outputs (weights given by the membership values of following fuzzy rules:
each partition).
Figure 4 shows the partitions and membership functions of IF Positive Big THEN max (positive)
angle in the range -90o to +90o (Negative Big, Zero, IF Zero THEN Equation (11)
Positive Big). Membership functions (MF) are described by
IF Negative Big THEN min (negative)
the following equations:
……………………………(12)
Negative Big:
IF -90o -60o MF = 1 It is important to point out that the proposed control strategy
o
IF -60 0o MF = - /60 does not only focus on attainment of the desired position but
o o
IF 0 90 MF = 0 also on the avoidance of jack-knife positions (angle close
to +90o or -90o). This fuzzy control law was applied to the
136
CHAPTER 6. INTELLIGENT CONTROL
3. TRAJECTORY FOLLOWING
It is important to note that, since velocity v is constant, when 3.1.1 Linear desired trajectory
the robot converges to line y=y* and moves along it toward
the goal position, the value of equals to v. It is equivalent to Figure 7 shows the linear trajectory control problem. Point P
state that coordinate is proportional to time and behaves as represents the robot instantaneous position given by
an independent variable. coordinates and line AB, with inclination angle ,
represents the trajectory to be followed. The equation of line
AB is:
137
CHAPTER 6. INTELLIGENT CONTROL
40 B
y* (a)
2* =
Y 12* = 0
Axis
P: Present Mobile
y position robot
2*
y*
P*: Instantaneous
desired position
Initial
Position
A
-40
0 x 80
X Axis
(b)
* ………………………… (15)
138
CHAPTER 6. INTELLIGENT CONTROL
….……… (18)
(b)
The inclination angle of the tangent to the circular
trajectory at point P* represents the desired instantaneous
inclination angle of the trailer and can be determined by
geometrical relationships as: Initial
Position
..………………. (19)
……….………… (20)
Fig. 10. Trajectories of mobile robot from two different
It is noted that the value of is constant and does not initial positions converging into the circular desired
depend on robot coordinates or angles. This result is expected trajectory.
considering the truck and trailer relative position required to Initial position (a) =15, =-30, =-45o, =0o
(b) =25, =-5, =-135o, =0o
139
CHAPTER 6. INTELLIGENT CONTROL
, trailer inclination angle , truck-trailer angle , and
steering angle corresponding to the trajectories showed in
Figure 12. It is noted that the response for controller Fuzzy 1
Fig. 11. Partitions and membership functions of truck- converges faster than controller Fuzzy 2 at the expense of
trailer angle for controllers Fuzzy 1 y Fuzzy 2. higher values of truck-trailer angle and steering angle .
In both cases jack-knife positions are avoided. From these
results it is concluded that smaller ranges of the central
partition Zero result in lower values of angle . These
Membership functions (MF) for controller Fuzzy 1 are results validate the coherence of the proposed control
described by the following equations: strategy.
Negative Big:
IF -90o -80o MF = 1
o
IF -80 0o MF = - /80
o o
IF 0 90 MF = 0
Zero: Fuzzy 2
IF -90o -80o MF = 0
IF -80o 0o MF = /80 + 1 Fuzzy 1
o
IF 0 80o MF = - /80 + 1
Initial
IF 80o 90o MF = 0 position
Positive Big:
IF -90o 0o MF = 0 Desired
o
IF 0 80o MF = /80 position
o o
IF 80 90 MF = 1
140
CHAPTER 6. INTELLIGENT CONTROL
6. REFERENCES
141
CHAPTER 7
LINEAR SYSTEMS
CHAPTER 7. LINEAR SYSTEMS
Abstract: In this paper a sliding-mode observer for linear time-invariant systems is proposed.
The observer is based on integral sliding modes and the equivalent control method. In order
to induce a sliding mode in the output error, a second order sliding mode algorithm is used.
Convergence proofs of the proposed observer are presented. In order to expose the features of this
proposal, a design example over a DC motor model is exposed, noiseless and noisy measurements
cases are considered. For this case, the simulation shows the high performance of the integral
observer.
143
CHAPTER 7. LINEAR SYSTEMS
eliminating the reaching phase and ensuring robustness where x ∈ Rn is the state vector, u ∈ Rm is the
from the initial time. input vector, y ∈ Rk is the output vector, A ∈ Rn×n
is the transition matrix, B ∈ Rn×m is the input-state
Consequently, in this paper an integral sliding mode-
distribution matrix and C ∈ Rk×n is the output matrix,
based observer for linear systems is proposed. The observer
which will be assumed to have full row rank so the
structure is similar to the observer presented on Drakunov
measured outputs are independent. Additionally, it will
(1992), but using integral SM. In addition, a step by step
be assumed that the pair (A, C) is observable.
design of the proposed observer is provided along with a
design example over a DC motor model. This paper deals with the case when the measured output
is a part of the state. In this case, the system (4) can be
The following sections are organized as follows: Section 2 rewritten as:
presents the preliminaries for the observer. In Section 3,
the integral SM observer is presented. A design example ẋ1 = A11 x1 + A12 x2 + B1 u
is analyzed in Section 4. In Section 5 the simulation ẋ2 = A21 x1 + A22 x2 + B2 u (5)
results are shown. Finally, the conclusions of this paper y = x1 ,
are presented in Section 6.
where A11 ∈ Rk×k , A12 ∈ Rk×(n−k) , A21 ∈ R(n−k)×k ,
2. MATHEMATICAL PRELIMINARIES A22 ∈ R(n−k)×(n−k) , B1 ∈ Rk×m , B2 ∈ R(n−k)×m , are
partitions of the matrices A and B, such that:
This section presents the previous results needed for the
proposed observer. A11 A12 B1
A= , B= ;
A21 A22 B2
2.1 The Super-Twisting Algorithm y = x1 ∈ Rk is the measured part of the state vector and
x2 ∈ R(n−k) is the unmeasured part of the state vector.
Consider the first order perturbed system
Many linear systems can be directly expressed in the form
ξ˙ = −u + ∆, (1)
described by (5) (i.e., the measured output is a part of the
where ξ, ∆, u ∈ R. state vector). If not, under the assumption that C is full
The super-twisting controller u = ST (ξ) (Levant, 1993), rank, there is always a linear transformation which allows
is defined as to express the system (4) in the form (5), as described in
1 (Utkin, 1992). For instance, assuming the output vector y
ST (ξ) = α1 |ξ| 2 sign(ξ) + w may be represented as:
(2)
ẇ = α2 sign(ξ),
y = K1 x1 + K2 x2 , xT = [x1 x2 ]T , x1 ∈ Rk , x2 ∈ R(n−k) ,
with sign(x) = 1 for x > 0, sign(x) = −1 for x < 0 and consider a coordinate transformation x 7→ T x associated
sign(0) ∈ {−1, 1}. with the invertible matrix
K1 K2
For the system (1), the controller (2) is applied, yielding T =
Ik 0
the closed loop system:
1
ξ˙ = −α1 |ξ| 2 sign(ξ) + q Applying the change of coordinates x →7 T x, the triplet
(3) (A, B, C) has the form:
q̇ = −α2 sign(ξ) + ∆,˙
A11 A12 B1
where q = w + ∆. T AT −1 = , TB = , CT −1 = [Ik 0] .
A21 A22 B2
˙
Assuming that ∆ < δ, the super-twisting gains are
1
selected as: α1 = 1.5δ 2 and α2 = 1.1δ. Therefore, a sliding 3. INTEGRAL SLIDING MODE OBSERVER
mode is induced on the manifold (ξ, q) = (0, 0) in a finite-
time tq > 0 (Moreno and Osorio, 2008). Thus, from (3), 3.1 Observer Scheme
the term w in (2) becomes equal to −∆.
Now, consider the multi-variable case, with ξ = Based on (5), the following state observer is proposed:
[ξ1 . . . ξp ]T , ∆ =[∆1 . . . ∆p ]T , u = [u1 . . . u p ]T ∈
˙ ˙ x̂˙ 1 = A11 x̂1 + A12 x̂2 + B1 u + v0 + v1
Rp . Assuming ∆ < δ, it can be shown that ∆ i <
x̂˙ 2 = A21 x̂1 + A22 x̂2 + B2 u + L2 v1
δi ∀i ∈ 1, . . . , p. In this case, define u = ST (ξ) = v0 = L1 x̃1 (6)
[ST (ξ1 ) . . . ST (ξp )] and note that this multi-variable
v1 = ST {σ}
case is simply the same as having p (1)-like scalar systems.
σ = x̃1 + z,
2.2 Linear Systems
where x̂1 and x̂2 are the estimates of x1 and x2 ,
Consider the following time-invariant linear system respectively; x̃1 = x1 − x̂1 is the estimation error variable;
represented by the following state space equation: v0 ∈ Rk and v1 ∈ Rk are the observer input injections;
σ ∈ Rk is the sliding variable and z ∈ Rk is an integral
ẋ = Ax + Bu
(4) variable to be defined thereafter. Finally, L1 ∈ Rk×k and
y = Cx, L2 ∈ R(n−k)×k are the observer gains.
144
CHAPTER 7. LINEAR SYSTEMS
145
CHAPTER 7. LINEAR SYSTEMS
32 45
31
40
30
35
current [A]
current [A]
29
x1
30
x1 estimated
28
25
27
x1 measured
20 x1 estimated
26
x1
25 15
0 5 10 15 20 25 30 35 40 45 0 5 10 15 20 25 30 35 40 45
time[s] time[s]
Figure 1. Armature current (i) (actual and estimated). Figure 3. Armature current (i) (measured, estimated and
actual).
260
252
250 250
260
248
240
250 246
speed [rad/s]
speed [rad/s]
230 244
240
speed [rad/s]
242
x2 220
230 240
x2 estimated
210 238
220 x2
x2 estimated 236
200
210 0 10 20 30 40 20 40
time[s] time[s]
200
0 5 10 15 20 25 30 35 40 45 Figure 4. Shaft speed (ω) (actual and estimated).
time[s]
Figure 2. Shaft speed (ω) (actual and estimated). Fig. 3 and Fig. 4 show the comparison between the
actual and estimated variables corresponding to the
armature current i and shaft speed ω respectively, for noisy
measurements.
Fig. 1 and Fig. 2 show the comparison between the actual Based on the presented figures, it can be observed a
and estimated variables corresponding to the armature good performance of the proposed observer. Under noisy
current i and shaft speed ω respectively, for noiseless conditions the armature current estimation î is much
measurements. closer to its actual value than its measurement (Fig 3).
In addition, a correct estimation of ω using the integral
sliding mode observer is achieved (Figs. 2, 4) making
the proposed observer suitable for observer-based control
applications.
5.2 Noisy Measurements
6. CONCLUSION
In this subsection, it is assumed that the current
measurements were corrupted by a normally distributed In this paper an integral sliding mode observer for linear
random signal with zero mean and a variance of 10. This time-invariant systems is proposed. The convergence of the
assumed variance corresponds to a current sensor with an estimation errors to zero for the proposed observer was
accuracy of ±9.5 A. This large variance was assumed to see proved. A step by step design of the proposed observer
significant variations in the simulation due to the noise and was provided along with a design example over a DC motor
verify the filtering capabilities of the proposed observer. model. The simulation results of the example shown the
The following results were obtained: filtering capabilities of the proposed observer.
146
CHAPTER 7. LINEAR SYSTEMS
147
CHAPTER 7. LINEAR SYSTEMS
148
CHAPTER 7. LINEAR SYSTEMS
mapean el intervalo [−h, 0) a Rn , equipado con la norma Observación 4. La matriz fundamental K(t) presenta una
de convergencia uniforme kϕkh = supθ∈[−h,0) kϕ(θ)k . discontinuidad de tipo salto en t = 0 dada por
Dada una función inicial ϕ ∈ PC ([−h, 0) , Rn ), existe una ∆K(0) , K(0) − K(−0) = I − K0 − (−K0 ) = I.
solución única x(t, ϕ) de (1) la cual está definida para todo Por otro lado, la matriz de Lyapunov U (τ ) es continua
t ∈ [−h, ∞) Melchor-Aguilar et al. (2010). Está solución para todo τ ∈ [−h, h].
es continua para todo t > 0 y en t = 0 presenta una
discontinuidad de tipo salto dada por También se ha demostrado un enunciado converso al Teo-
rema 2 en Melchor-Aguilar et al. (2010). De manera más
∆x(0, ϕ) = x(0, ϕ) − x(−0, ϕ) precisa, bajo la suposición de la estabilidad exponencial de
Z 0 (1) y definiendo en PC ([−h, 0) , Rn ) la funcional
=F ϕ(θ)dθ − ϕ(−0). Z 0
−h w(ϕ) = ϕT (−h)W0 ϕ(−h) + ϕT (θ)W1 ϕ(θ)dθ,
Definición 1. Melchor-Aguilar et al. (2010) El sistema (1) −h
se dice ser exponencialmente estable si existen constantes donde W0 y W1 son matrices definidas positivas, la fun-
µ ≥ 1 y α > 0 tales que toda solución de (1) satisface la cional de tipo completo correspondiente
desigualdad Z 0 T Z 0
kx(t, ϕ)k ≤ µe−αt kϕkh , ∀t ≥ 0. v(xt ) = F x(t + θ)dθ U (0) F x(t + θ)dθ
−h −h
Para presentar las condiciones de Lyapunov-Krasovskii Z 0 T Z 0
para la estabilidad exponencial de (1) dadas en Melchor- −2 F x(t + θ)dθ U (−h − θ)F x(t + θ)dθ
−h −h
Aguilar et al. (2010) se introducirá un poco de termi- Z 0 Z 0
nologı́a. Como es habitual, se define el estado natural de + xT (t + θ1 )F T U (θ1 − θ2 )F x(t + θ2 )dθ2 dθ1
(1) por −h −h
Z 0
xt (θ, ϕ) , x(t + θ, ϕ), θ ∈ [−h, 0) .
− xT (t + θ1 )F T K0T W ×
Debido a la discontinuidad de tipo salto de la solución −h
"Z Z ! #
θ1 −θ2
en t = 0, se sigue que xt (θ, ϕ) ∈ PC ([−h, 0) , Rn ) para 0
t ∈ [0, h) , mientras que xt (θ, ϕ) ∈ C ([−h, 0) , Rn ) para × K(ξ)dξ F x(t + θ2 )dθ2 dθ1
−h −h−θ2
t ≥ h. Como consecuencia de ello, en el enfoque de Z 0
Lyapunov-Krasovskii, las funcionales deben estar definidas
+ xT (t + θ) [W0 + (θ + h) W1 ] x(t + θ)dθ, (3)
en el espacio infinito dimensional PC ([−h, 0) , Rn ). −h
Por simplicidad de la notación, se escribe xt (ϕ) en lugar de donde U (·) es la matriz de Lyapunov del sistema (1)
xt (θ, ϕ), θ ∈ [−h, 0). Asimismo, cuando la función inicial asociada con la matriz W = W0 + hW1 , satisface la
es irrelevante en el contexto, simplemente se escribe x(t) ecuación
y xt en lugar de x(t, ϕ) y xt (ϕ). d
v(xt ) = −w(xt ), t ≥ 0.
Teorema 2. Melchor-Aguilar et al. (2010) El sistema (1) es dt
exponencialmente estable si existe una funcional continua Se muestra en Melchor-Aguilar et al. (2010) que si el sis-
v : PC ([−h, 0) , Rm ) → R tal que t → v(xt (ϕ)) es tema (1) es exponencialmente estable entonces la funcional
diferenciable y se cumplen las condiciones siguientes: (3) satisface las condiciones del Teorema 2.
R0 2 R0 2 Se sigue de (3) que el cálculo de la matriz U (τ ) es
(1) α1 −h kϕ(θ)k dθ ≤ v(ϕ) ≤ α2 −h kϕ(θ)k dθ, para
constantes 0 < α1R ≤ α2 , fundamental para construir la funcional v(xt ). En Melchor-
d 0 2 Aguilar et al. (2010) se demostró que U (τ ) satisface las
(2) dt v(xt (ϕ)) ≤ −β −h kx(t + θ, ϕ)k dθ, para una con- propiedades siguientes.
stante β > 0.
Lema 5. La matriz de Lyapunov U (τ ) satisface la ecuación
Sea K(t) la solución de la ecuación matricial dinámica
Z 0 Z 0
K(t) = K(t + θ)dθ F, U (τ ) = U (τ + θ)dθ F, τ ≥ 0. (4)
−h
−h
con función inicial K(t) = −K0 , t ∈ [−h, 0) , donde Lema 6. Sea W = W T . Entonces la matriz de Lyapunov
−1
K0 = (I − hF ) . La matriz K(t) se conoce como la U (τ ) satisface
Z τ
matriz fundamental del sistema (1), ver Melchor-Aguilar T
U (τ ) = K0 W K(ξ)dξ + U T (−τ ), τ ∈ [0, h]. (5)
et al. (2010). 0
Suponga que (1) es exponencialmente estable. Dada W = Lema 7. La matriz de Lyapunov U (τ ) satisface
W T se define
Z la matriz
∞ T
U (τ ) , K T (t)W K(t + τ )dt, τ ∈ [−h, h] . (2) −K(0)W K(0) = [U (0)F − U (−h)F ]
0 + [U (0)F − U (−h)F ] . (6)
Note que la estabilidad exponencial del sistema (1) garan-
tiza la existencia de la integral impropia en (2). Las condiciones (4), (5) y (6) son llamadas respectiva-
Definición 3. La matriz (2) es la matriz de Lyapunov del mente la propiedad dinámica, la propiedad de simetrı́a y
sistema (1) asociada a la matriz W . la propiedad algebraica. Claramente, estas tres propiedades
149
CHAPTER 7. LINEAR SYSTEMS
proveen una alternativa más práctica de calcular la matriz Entonces, de acuerdo con el método en Garcia-Lozano and
de Lyapunov que la integral impropia definida en (2). Kharitonov (2006), se comparan U (jr) y U ((j + 1)r) para
obtener
La propiedad dinámica define U (τ ) como una solución
de la ecuación (4). Para calcular dicha solución se nece-
sita conocer su correspondiente función inicial. La función Uj+1 − Uj = U ((j + 1)r) − U (jr)
Z r
inicial no esta dada explı́citamente. Por otro lado, la
propiedad de simetrı́a (5) junto con la propiedad alge- = (U (jr + θ) − Φ((j − N )r + θ)) dθ F.
0
braica (6) proveen información implı́cita de la función
inicial desconocida. Sustituyendo las matrices U (jr +θ) y Φ((j −N )r +θ) bajo
la integral por sus aproximaciones lineales a pedazos (8) y
Como se mencionó en la introducción, existen algunos (7) y usando la propiedad de simetrı́a (5) en los puntos de
métodos para calcular matrices de Lyapunov de sistemas la partición
diferenciales con retardo Kharitonov (2013) pero, sin em-
bargo, no pueden ser aplicados directamente al cálculo Uj = ΦTj + K0T W Vj , j = 0, 1, . . . , N, (9)
de matrices de Lyapunov de sistemas integrales con re- donde Z jr
tardo. El problema principal es que para aplicar dichos
métodos se necesita derivar la ecuación dinámica (1) y Vj = K(ξ)dξ, (10)
0
esto dará lugar a una ecuación matricial diferencial con se llega al conjunto de N ecuaciones lineales expresadas en
retardo inestable, ver Melchor-Aguilar et al. (2010) para términos de las matrices desconocidas Φj , j = 0, 1, . . . , N −
más detalles. 1 siguiente:
Por otro lado, parece natural analizar la posibilidad de r r
aplicar, no los métodos para sistemas diferenciales, pero r
F + I ΦTj + F − I ΦTj+1 − (ΦN −j +
si las ideas principales detrás de ellos al caso de los sis- 2 2 2
temas integrales con retardo. Ası́, en la siguiente sección, r
+ΦN −j−1 ) F = K0T W Vj+1 − Vj − (Vj + Vj+1 ) (11).
se aplicarán las ideas principales en Garcia-Lozano and 2
Kharitonov (2006) para la construcción numérica de ma- Agregando a este conjunto la ecuación algebraica (6) en
trices de Lyapunov para sistemas integrales con retardo. los puntos de la partición
T
3. APLICACIÓN DEL MÉTODO NUMÉRICO DE (Φ0 − ΦN ) F + F T (Φ0 − ΦN ) = −K T (0)W K(0) (12)
SISTEMAS DIFERENCIALES CON RETARDO se llega a un sistema de N + 1 ecuaciones matriciales para
N + 1 matrices desconocidas Φj , j = 0, 1, . . . , N .
Siguiendo las ideas expuestas en Garcia-Lozano and
Kharitonov (2006) definimos la función matricial Φ (τ ), En principio, la solución del sistema de ecuaciones (11)-
τ ∈ [−h, 0], como la función inicial desconocida para (12) proporciona las matrices Φj y la formula (7) da la
la ecuación dinámica (4) y dividimos el intervalo [−h, 0] aproximación deseada de la función inicial.
en N segmentos de la misma longitud [−(j + 1)r, −jr] Considere el caso escalar y dos particiones del intervalo
h [−h, 0], es decir. F ∈ R y N = 2 lo que lleva a r = h2 . En
,j = 0, 1, 2, . . . , N − 1, donde r = .
N este caso, el sistema lineal de ecuaciones (11)-(12) puede
Ahora, introducimos N + 1 matrices desconocidas Φj = ser escrito como AX = B, donde
Φ(−jr), j = 0, 1, 2, . . . , N, y definimos la aproximación r
lineal a pedazos continua de la función inicial Φ(τ ) como r
F + 1 −1 − F
sigue: 2 r 2
r T
s + jr s + jr A= F 1 − 1 , X = [Φ0 Φ1 Φ2 ] ,
Φ̂(s) = 1 + Φj + − Φj+1 , (7) 2 2
r r 2F 0 −2F
donde s ∈ [−(j + 1)r, −jr], j = 0, 1, 2, . . . , N − 1. r
K0 W V1 − V1 F
Sea U (τ ) = U (τ, Φ) la solución de (4) correspondiente a 2
r .
la función inicial Φ. Note que para la construcción de la B=
funcional (3), U (τ ) solamente se necesita conocer para los K0 W (V1 − V1 ) − (V1 + V2 ) F
2
valores de τ ∈ [0, h]. En consecuencia, también dividimos 0
el intervalo [0, h] en N segmentos de la misma longitud Se tiene que
[jr, (j + 1)r], j = 0, 1, 2, . . . , N − 1, introducimos N + 1
matrices desconocidas Uj = U (jr), j = 0, 1, 2, . . . , N, y det(A) = −rF 2 − 2F − rF 2 + 2F + rF 2 + rF 2 = 0
definimos la aproximación lineal a pedazos de U (τ ) como lo cual implica que la matriz A es singular para todos los
sigue:
valores de F ∈ R y h > 0.
s − jr s − jr Se sigue que el sistema de ecuaciones (11)-(12) no es
Û (s) = 1 − Uj + Uj+1 , (8)
r r consistente ya que no hay una solución única de AX = B
donde s ∈ [jr, (j + 1)r], j = 0, 1, 2, · · · , N − 1. y, por lo tanto, la metodologı́a expuesta en Garcia-Lozano
and Kharitonov (2006) para sistemas diferenciales con
Para τ = jr se tiene retardo no provee una solución apropiada al problema del
Z 0
calculo de matrices de Lyapunov para sistemas integrales
U (jr) = U (jr + θ) dθ F.
−h con retardo.
150
CHAPTER 7. LINEAR SYSTEMS
j−1 Z
X
4. APROXIMACIÓN LINEAL A PEDAZOS (j−k)r
n̂j = Û (ξ)dξ, j = 1, 2, · · · , N, (14)
k=0 (j−k−1)r
El análisis en la sección 3 muestra que se requiere un nuevo
método para calcular soluciones de la ecuación dinámica n̂0 = 0
(4), satisfaciendo las condiciones (5) y (6) para calcular donde n̂j denota la aproximación de nj .
matrices de Lyapunov de sistemas integrales con retardo.
En esta sección, se propone dicho método. Sustituyendo Û (ξ) en la integral del lado derecho de la
expresión para n̂j por su aproximación lineal a pedazos
(7), un término de la integral satisface
4.1 Función inicial aproximada
Z (j−k)r
Considerar las mismas particiones de los intervalos [−h, 0] Û (ξ)dξ =
y [0, h] en N segmentos iguales definidos en la sección 3 (j−k−1)r
y las aproximaciones lineales a pedazos continuas de la Z (j−k)r
ξ − (j − k − 1)r
función inicial Φ̂(s) y la matriz Û (s) definidas respectiva- = 1− Uj−k−1 +
(j−k−1)r r
mente por (7) y (8).
ξ − (j − k − 1)r
Ahora, para τ = jr se obtiene + Uj−k dξ.
Z 0 Z ! r
jr
U (jr) = U (jr + θ)dθ F = U (ξ)dξ F. Mediante cálculos directos se obtiene
−h (j−N )r
j−1
rX
Dado que (j − N )r ≤ 0, j = 0, 1, . . . , N , se puede escribir n̂j = (Uj−k−1 + Uj−k ) , j = 1, · · · , N, (15)
la ecuación anterior como 2
k=0
n̂0 = 0.
Z 0 Z jr Ahora, notar que si N es suficientemente grande entonces
de (13) se sigue que
U (jr) = Φ(ξ)dξ + U (ξ)dξ
F. (13)
Uj = (m̂j + n̂j ) F, j = 0, 1, . . . , N.
(j−N )r | 0
{z }
| {z }
mj nj A partir de esta expresión, (14), (15) y la propiedad de
simetrı́a en los puntos de la partición (9) se llega a las
Considerar el término mj en (13). Reescribiendo la inte- ecuaciones matriciales siguientes:
gral en suma de integrales en intervalos de longitud r y
sustituyendo Φ(ξ) por su aproximación Φ̂(ξ) se obtiene • Para j = 0
N −j−1
r X
−j−1 Z −(N −j−k−1)r
NX Φ0 − (ΦN −j−k−1 + ΦN −j−k ) F = 0. (16)
2
m̂j = Φ̂(ξ)dξ, j = 0, 1, · · · , N − 1, k=0
k=0 −(N −j−k)r • Para j = 1, 2, · · · , N − 1
m̂N = 0, "N −j−1
r X
donde m̂j denota la aproximación del término mj . ΦTj − (ΦN −j−k−1 + ΦN −j−k ) +
2
k=0
Sustituyendo Φ̂(ξ) en la integral del lado derecho de la j−1
#
X
expresión para m̂j por su aproximación lineal a pedazos T T
+ Φj−k−1 + Φj−k F =
(7), un término de la integral satisface k=0
j−1
!
Z −(N −j−k−1)r rX
= K0T W (Vj−k−1 + Vj−k ) F − Vj . (17)
Φ̂(ξ)dξ = 2
−(N −j−k)r k=0
Z −(N −j−k−1)r • Para j = N
ξ + (N − j − k − 1)r
= 1+ j−1
r rX T
−(N −j−k)r
ΦTN − Φj−k−1 + ΦTj−k F =
ξ + (N − j − k − 1)r 2
k=0
×ΦN −j−k−1 + − ΦN −j−k dξ. !
r j−1
rX
Cálculos directos de la ecuación anterior conducen a = K0T W (Vj−k−1 + Vj−k ) F − VN .(18)
2
k=0
N −j−1
r X Notar que las ecuaciones (16)-(18) describen un sistema
m̂j = (ΦN −j−k−1 + ΦN −j−k ) , j = 0, 1, · · · , N − 1, de N + 1 ecuaciones matriciales para N + 1 matrices
2
k=0 incógnitas Φj , j = 0, 1, · · · , N . La solución de este sistema
m̂N = 0. de ecuaciones provee las matrices Φj , j = 0, 1, 2, . . . , N y la
Considerar el término nj en (13). De manera similar, ree- formula (7) permite el calculo de la aproximación deseada
scribiendo la integral como suma de integrales en intervalos de la función inicial matricial.
de longitud r y sustituyendo U (ξ) por su aproximación A fin de mostrar que el sistema de ecuaciones matriciales
Û (ξ) se obtiene (16)-(18) no presenta el mismo problema inconsistente
151
CHAPTER 7. LINEAR SYSTEMS
solución para todos los sistemas integrales escalares con donde φj = vec(Φj ), j = 1, 2, . . . , N.
retardo exponencialmente estables.
Observación 8. Notar que el sistema de ecuaciones ma- 4.3 Matriz de Lyapunov aproximada
triciales (16)-(18) está bien definido sin involucrar la
propiedad algebraica (6) a diferencia del método propuesto Con la aproximación lineal a pedazos de la función inicial
en Garcia-Lozano and Kharitonov (2006) el cual requiere calculada, se plantea el problema de buscar la solución
la correspondiente propiedad algebraica para matrices de correspondiente de la ecuación dinámica (4). De hecho, el
Lyapunov de sistemas diferenciales con retardo. problema puede ser formulado como el problema de valor
inicial general siguiente:
4.2 Forma vectorial Z 0
U (τ ) = U (τ + θ)dθ F, τ ≥ 0, (19)
Para encontrar una solución del sistema de ecuaciones −h
(16)-(18) es conveniente escribirlo en forma vectorial por U (τ ) = Φ(τ ), τ ∈ [−h, 0]. (20)
2
medio de la operación vector vec(Q) = q, donde q ∈ Rn Notar que el problema de valor inicial (19)-(20) no se puede
se obtiene apilando las columnas de Q ∈ Rn×n . resolver por medio del método paso a paso conocido para
construir soluciones de sistemas diferenciales con retardo
La vectorización de la matriz C = AXB es vec(C) =
Bellman and Cooke (1963).
(A ⊗ B) vec(X), donde
A continuación, se propone un método para resolver el
b11 A b21 A · · · bn1 A
b12 A b22 A · · · bn2 A problema de valor inicial (19)-(20) el cual está inspirado
A⊗B = ... .. .. .. en el método paso a paso para sistemas diferenciales con
. . . retardo y, en realidad, este puede ser usado para construir
b1n A b2n A · · · bnn A soluciones numéricas de sistemas integrales con retardo de
es el producto de Kronecker de las matrices A y B. Por la forma (1).
otro lado, la vectorización de la matriz D = AX T B es De (19) se tiene para τ ∈ [0, h]
vec(D) = (A ◦ B) vec(X), donde A ◦ B está definida por Z τ
A1 B1T A2 B1T · · · An B1T U (τ ) = W (τ ) + U (ξ)dξ F, (21)
A1 B T A2 B T · · · An B T 0
2 2 2 donde Z
A◦B = . .. .. .. , 0
.. . . . W (τ ) = Φ(ξ)dξ F.
A1 BnT A2 BnT · · · An BnT τ −h
denotando con Aj , Bj , j = 1, 2, . . . , N, los vectores Se sigue que el problema de valor inicial (19)-(20) es
columna de A y B, respectivamente. equivalente al problema de encontrar una solución de la
ecuación integral (21).
Entonces, el sistema de ecuaciones matriciales (16)-(18)
puede ser escrito en forma vectorial como sigue: Sean U0 (τ ) = W (τ ) y Uj (τ ), j = 1, 2, 3, . . . , sucesiones
descritas como sigue:
Z τ
Uj (τ ) = W (τ ) + Uj−1 (ξ)dξ F, τ ∈ [0, h].
0
152
CHAPTER 7. LINEAR SYSTEMS
0.8 0.15
K11 (t)
0.6 K21 (t) 0.1
U (τ )
K(t)
0
-0.05
-0.2
-0.1 U11 (τ )
-0.4 U21 (τ )
-0.15 U12 (τ )
-0.6
U22 (τ )
-0.8 -0.2
-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6 0.8 1 -1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6 0.8 1
t τ
153
CHAPTER 7. LINEAR SYSTEMS
154
CHAPTER 7. LINEAR SYSTEMS
Vázquez Rosas C.D,*, del Muro Cuéllar B,*, Márquez Rubio J.F,*
*Escuela Superior de Ingeniería Mecánica y Eléctrica, Unidad Culhuacán, Instituto
Politécnico Nacional, Santa Ana 1000 México D.F, 04430
E-mail:carlos_daniel_vaz@hotmail.com, bdelmuro@yahoo.com, jfcomr23@yahoo.com.mx
155
CHAPTER 7. LINEAR SYSTEMS
(Novella et al., 2012) propone una modificación a la Para el control proporcional, C(s)=k aplicado al
estructura original del predictor de Smith para sistema de la Ec (1) la respuesta del sistema en lazo
afrontar el caso de sistemas inestables que contienen abierto es:
un cero de fase mínima, pero esta estrategia se limita
a tener un tiempo de retardo menor a la constante de 𝑁(𝑠) −𝜏𝑠
tiempo inestable del sistema. En (Lee et al., 2010) 𝑄(𝑠) = 𝐶(𝑠)𝐺(𝑠) = 𝑘 𝑒 (2)
𝐷(𝑠)
se muestran las condiciones de estabilidad para La ecuación a lazo cerrado del sistema de la Ec (2)
diversos tipos de sistemas que tienen ceros, es:
estabilizándolos con controladores tipo P, PI, PID.
En (Vazquez et al., 2012) se presentan las 𝑌(𝑠) 𝑘N(s)𝑒 −𝜏𝑠
condiciones necesarias y suficientes para sistemas = (3)
𝑅(𝑠) 𝐷(𝑠) + 𝑘𝑁(𝑠)𝑒 −𝜏𝑠
de alto orden que contienen un cero de fase mínima,
además de presentar una estrategia de control para Donde R(s) es una entrada de referencia y el término
afrontar grandes retardos. Cabe señalar que los “k” es la ganancia del control proporcional. Se
trabajos anteriores solo toman en cuenta el caso que puede notar que el término 𝑒 −𝜏𝑠 está localizado en
exista un solo cero de fase mínima en la dinámica la ecuación característica del sistema generando un
del sistema. sistema que tiene un infinito de polos lo cual
dificulta su análisis de estabilidad.
Este trabajo presenta de manera explícita las
condiciones suficientes de estabilidad para un tipo
3. RESULTADOS PREELIMINARES.
de sistemas con retardo, inestables de alto orden,
con un polo inestable, “n-1” estables y “n” ceros de
Se presentan los resultados preliminares que fueron
fase mínima utilizando como herramienta de
utilizados para la obtención del resultado principal
análisis el criterio de estabilidad de Nyquist.
de este trabajo.
El trabajo se organiza de la siguiente forma: en la
(Xiang et al., 2007) Lema 1. Dada la función de
Sección 2 se presentan el planteamiento del
transferencia a lazo abierto mostrada en (2) con 𝑃 +
problema. En la Sección 3 se muestran resultados
polos inestables, el sistema a lazo cerrado es estable
preliminares que fueron utilizados para la obtención
si la gráfica de Nyquist de (3) encierra el punto
de los resultados de este trabajo, posteriormente en
crítico (-1,0), 𝑃 + veces en sentido anti-horario.
la Sección 4 se presentan los resultados principales
de este trabajo. En la Sección 5 se presenta un
(Xiang et al., 2007) Lema 2. Para el sistema a lazo
ejemplo acompañado de una simulación numérica
abierto descrito en (2)
para verificar las condiciones de estabilidad
obtenidas y finalmente en la Sección 6 se presentan
las conclusiones. lim │G(j𝜔)│ < 1
⍵→∞
156
CHAPTER 7. LINEAR SYSTEMS
𝑁(𝑠) −𝜏𝑠
𝑄(𝑠) = 𝐶(𝑠)𝐺(𝑠) = 𝑘 𝑒 Un requerimiento para obtener la trayectoria
𝐷(𝑠)
(𝑠 + 𝑏1 )(𝑠 + 𝑏2 ) … (𝑠 + 𝑏𝑛 ) −𝜏𝑠 deseada en la gráfica de Nyquist, es
𝑑∠𝐺(𝑗𝜔)
⃒⍵→0 >
=𝑘 𝑒 (4) 𝑑𝜔
(𝑠 − 𝑎1 )(𝑠 + 𝑎2 ) … (𝑠 + 𝑐𝑛 ) 0
Donde 𝑏1, 𝑏2,… 𝑏𝑛 >0, 𝑎1 , 𝑎2 ,… 𝑎𝑛 >0 y 𝜏>0.
𝑑∠𝐺(𝜔) 1 1 1 1
Teorema 1. Existe un control proporcional que ⃒⍵→0 = −𝜏 + + + … +
𝑑𝜔 𝑎1 𝑏1 𝑏2 𝑏𝑛
estabiliza el sistema descrito en (4) si se cumplen las
1 1
tres relaciones siguientes: − …− < 0 (10)
𝑎2 𝑎𝑛
1 1 1
i) 𝜏 < 𝑎 + ∑𝑛𝑖=1 𝑏 − ∑𝑛𝑖=2 𝑎 De modo que si se cumple la condición (10) existe
1 𝑖 𝑖
1 1 1 1 una frecuencia 𝜔∗ > 0 tal que ∠𝐺(𝜔∗ ) > −180. La
ii) 𝑏1 2
+𝑏 2+ ⋯𝑏 2 < 𝑎12
+ fase toma un valor mayor a -180, después se
2 𝑛
1 1 requiere que la fase decrezca nuevamente a -180
𝑎2 2
…𝑎 2
𝑛 efectuando la intersección con el eje real negativo
iii) 𝑏1 𝑏2 … 𝑏𝑛 > 𝑎1 𝑎2 … 𝑎𝑛 con ∠𝐺(𝜔𝑐 ) = −180 para alguna frecuencia
positiva 𝜔𝑐 .
Demostración: i) Considere el sistema de la Ec (4) Otra manera de encontrar las condiciones de retardo
en el dominio de la frecuencia. descritas en el Teorema 1 es con una aproximación
lineal local a partir de una serie de Taylor para los
𝑄(𝑗𝜔) términos que contengan tan-1 y ver cómo se
(𝑗𝜔 + 𝑏1 )(𝑗⍵ + 𝑏2 ) … (𝑗⍵ + 𝑏𝑛 ) −𝜏𝑗⍵ comporta a valores muy pequeños del argumento
= 𝑒 (5)
(𝑗𝜔 − 𝑎1 )(𝑗𝜔 + 𝑎2 ) … (𝑗𝜔 + 𝑎𝑛 ) (teniendo frecuencias muy bajas) con la condición:
La derivada de la ecuación (7) es: ii) Para el control proporcional, C(s)=k, la respuesta
en frecuencia de lazo abierto es:
157
CHAPTER 7. LINEAR SYSTEMS
5. EJEMPLOS.
1 1 1 1
+ (𝜔 2+𝑏 + ⋯ (𝜔 2+𝑏 < (𝜔 2+𝑎 + Ejemplo 1. Considere el sistema con retardo de
(𝜔 2+𝑏12) 2
2 )
2
𝑛 )
2
1 )
1 1 quinto orden con 5 ceros de fase mínima en su
(𝜔 2+𝑎2 2)
+ ⋯ (𝜔 2+𝑎 2
(18) dinámica dado por la siguiente función de
𝑛 )
transferencia:
A partir del Lema 1,es necesario tener 𝑀𝑄 (𝜔∗ ) >
1 para alguna frecuencia y así encerrar el punto 𝑌(𝑠)
crítico (-1,0) obteniendo la siguiente condición: 𝑈(𝑠)
(𝑠 + 1.2)(𝑠 + 1.8)(𝑠 + 2.4)(𝑠 + 3.3)(𝑠 + 7) −τ𝑠
= 𝑒 (23)
(𝑠 − 1)(𝑠 + 1.5)(𝑠 + 2)(𝑠 + 3)(𝑠 + 7.3)
(𝜔 2 + 𝑏1 2 )(𝜔 2 + 𝑏2 2 )
𝑘√ >1 (19)
(𝜔 2 + 𝑎1 2 )(𝜔 2 + 𝑎2 2 ) A partir del Teorema 1 podemos concluir que el
rango de retardos bajo el cual el sistema descrito en
Evaluando la condición (18) para frecuencias ⍵ → (23) puede ser estabilizado con una
0 se obtiene: retroalimentación estática de salida es: 0<τ<1.6145.
158
CHAPTER 7. LINEAR SYSTEMS
estabilidad del sistema en lazo cerrado. A partir de Eng. Commun. 133(1995) 11-30
la ecuación (22) se calcula una ganancia K=0.57
para asegurar la estabilidad a lazo cerrado del Lee S.C., Wang Q.G., Stabilization conditions for a
sistema. class of unstable delay processes of higher order,
Journal of the Taiwan Institute of Chemical
Engineers, 41 (2010) 440-445
159
CHAPTER 7. LINEAR SYSTEMS
160
CHAPTER 7. LINEAR SYSTEMS
aerodinámico, justificando como su control a distancia Por lo que la ecuación que describe el movimiento del
produce un retardo de tiempo en la señal de control. En sistema resulta,
la Sección 3 se diseña un predictor–observador al utilizar
un modelo en coordenadas adelantadas. En la Sección 4 Jb + mℓ2 θ̈ + mgℓ sin θ = u
se muestra una solución al problema de seguimiento de o bien,
trayectorias mediante estados estimados futuros. En la
Sección 5 muestra la evaluación numérica y experimental. mgℓ u
Finalmente, en la Sección 6 se muestran las conclusiones θ̈ + 2
sin θ = .
Jb + mℓ Jb + mℓ2
del trabajo.
Para obtener un modelo en el espacio de estados, tome en
2. ACTUADOR AERODINÁMICO cuenta las variables x1 = θ y x2 = θ̇. Entonces se tiene,
Considere el sistema barra-péndulo simple actuado por ẋ1 = x2
una hélice, como se muestra en la Figura 1. Donde θ es mgℓ u (1)
ẋ2 = − sin (x1 ) + .
el ángulo con respecto a la lı́nea vertical, ℓ es la distancia Jb + mℓ2 Jb + mℓ2
del punto de rotación hasta el actuador aerodinámico, F
es la fuerza que se genera en el actuador aerodinámico, 2.1 Consideración de tiempos de retardo
xi y zi son los ejes inerciales, y u es el momento externo
alrededor del eje de rotación yi . Asumiendo que el sistema de control se encuentra en una
posición remota y que existe un canal de comunicación
para el envı́o de la señal de control y la recepción de señales
medidas como se muestra en la Figura 2. El sistema (1)
puede modelarse como un sistema que incluye un retardo
de tiempo τ en la señal de entrada (tiempos muertos) que
puede incluir también los posibles tiempos de cómputo.
0 Bajo estas condiciones, el sistema (1) se reescribe en la
forma,
u xi ẋ1 = x2
ẋ2 = a sin (x1 ) + bu (t − τ ) . (2)
θ ℓ donde,
mgℓ 1
a=− 2
y b= . (3)
F Jb + mℓ Jb + mℓ2
Nótese que en general los retardos asociados con el canal
de comunicación resultan variantes en el tiempo, una
mg estrategia comúnmente utilizada es tomar el valor máximo
del retardo en el canal de comunicación y asumir el retardo
zi τ como constante.
Figura 1. Actuador aerodinámico. Envı́o señal de Control
El modelo del actuador aerodinámico barra-hélice se ob-
tendrá mediante la metodologı́a de Euler-Lagrange. Para
tal efecto, las energı́as cinética y potencial están dadas por,
Sistema Canal
de Actuador
1 1 1 de Aerodinámico
T = Jb θ̇2 + mℓ2 θ̇2 = θ̇2 Jb + mℓ 2
Control Comunicación
2 2 2
V = mgℓ (1 − cos θ)
1
donde Jb = 12 mb ℓ2b es el momento de inercia de la barra
con respecto a su centro de gravedad. El Lagrangiano del Mediciones del Sistema
sistema resulta entonces,
Figura 2. Sistema con retardo en el canal de comunicación.
L=T −V
1
= θ̇2 Jb + mℓ2 − mgℓ (1 − cos θ) . 3. PREDICTOR-OBSERVADOR PROPUESTO
2
El modelo se obtiene a partir de la ecuación de Euler- El sistema (2), puede reescribirse como un sistema libre de
Lagrange, retardos al considerar las variables adelantadas,
w1 (t) = x1 (t + τ ) , w2 (t) = x2 (t + τ ) . (4)
d ∂L ∂L
− = u. Bajo estas nuevas coordenadas, se obtiene la nueva repre-
dt ∂ θ̇ ∂θ
Nótese que, sentación,
ẇ1 = w2
ẇ2 = a sin (w1 ) + bu (5)
∂L d ∂L
= Jb + mℓ2 θ̇, = Jb + mℓ2 θ̈ en donde el retardo de tiempo ya no está presente.
∂ θ̇ dt ∂ θ̇
∂L Nótese que las señales w1 y w2 corresponden a señales
= −mgℓ sin θ. futuras no medibles, por lo que es necesario diseñar un
∂θ
161
CHAPTER 7. LINEAR SYSTEMS
predictor–observador para poder estimarlas. Considerando Observación 1. Definiendo los errores, edx1 = x1 − x1d y
el sistema en adelanto (5), es posible proponer el siguiente edx2 = x2 − x2d , el error de seguimiento puede escribirse
predictor-observador, alternativamente en la forma,
ŵ˙ 1 = ŵ2 + λ1 [w1 (t − τ ) − ŵ1 (t − τ )] ėdx1 (t) = edx2 (t)
(6)
ŵ˙ 2 = a sin (ŵ1 ) + bu + λ0 [w1 (t − τ ) − ŵ1 (t − τ )] . ėdx2 (t) = −k1 edx1 (t) − k2 edx2 (t)
+k1 ew1 (t − τ ) + k2 ew2 (t − τ ) + f (·)
Nótese que el predictor-observador anterior puede reescri-
e (t−τ ) e (t−τ )
birse tomando la señal medible x1 (t) en la forma, donde, f (·) = 2a sin w1 2 cos edx1 − w1 2 + x1d .
ŵ˙ 1 = ŵ2 + λ1 [x1 (t) − ŵ1 (t − τ )]
(7) La dinámica de lazo cerrado (13) en coordenadas de los
ŵ˙ 2 = a sin (ŵ1 ) + bu + λ0 [x1 (t) − ŵ1 (t − τ )] . errores de predicción-observación y seguimiento, se puede
expresar en forma vectorial como,
La dinámica de los errores de predicción puede analizarse
al definir las señales de error, ėw1 0 1 0 0 ew1
ėw2 0 0 0 0 ew2
ew1 = w1 − ŵ1 , ew2 = w2 − ŵ2 . (8) ė = 0 0 0 1 ed1
d1
ėd2 k1 k2−k 1 −k2 ed2
La dinámica del error de predicción resulta,
−λ1 0 0 0 ew1 (t − τ ) 0
ėw1 = ew2 − λ1 ew1 (t − τ ) −λ 0 0 0 e (t − τ ) f (ew , ed , wd )
+ 0 0 0 0 0 w2 +
ėw2 = −λ0 ew1 (t − τ ) + a [sin (w1 ) − sin (ŵ1 )] . ed1 (t − τ ) 0
que puede reescribirse como, 0 0 0 0 ed2 (t − τ ) f (ew , ed , wd )
(15)
ėw1 = ew2 − λ1 ew1 (t − τ ) con f (ew , ed , wd ) definida en (14).
e (9)
ėw2 = −λ0 ew1 (t − τ ) + 2a sin w21 cos w1 − 2
ŵ1
.
Alternativamente, definiendo e = [ew1 , ew2 , ed1 , ed2 ]⊤ y
considerando la estructura de la ecuación anterior, el
4. PROBLEMA DE SEGUIMIENTO DE sistema (15) puede reescribirse en la forma,
TRAYECTORIAS
ė = A0 e + A1 e (t − τ ) + f¯ (e, wd ) . (16)
A continuación se presentará una solución al problema de ¯
con A0 , A1 , f definidas apropiadamente.
seguimiento de trayectorias asociado con el sistema (2),
mediante la sı́ntesis de una retroalimentación basada en Nótese ahora que f (e, wd ), puede acotarse por arriba como
estados futuros estimados por el predictor-observador (7). sigue,
Supóngase que se desea seguir una trayectoria definida por kf (e, wd ) k = k2a sin ew1
2 cos ed1 − ew1
2 + w1d k
w1d (t) y w2d (t). Defı́nase los errores de seguimiento, ≤ k2a sin ew1 kk cos ed1 − ew1
2 2 + w1d k
ed1 (t) = w1 (t) − w1d (t) , ed2 (t) = w2 (t) − w2d (t) . (10) ≤ a|ew1 |
por lo tanto, es posible concluir que,
Considere entonces la siguiente retroalimentación basada
en estados futuros predecidos, kf¯ (e, wd ) k ≤ ρkek (17)
1 para una constante ρ positiva.
u (t) = b [−a sin (ŵ1 ) − k2 (ŵ2 − w2d ) (11)
−k1 (ŵ1 − w1d ) + ẇ2d ] 4.1 Estabilidad en lazo cerrado
Por otra parte, de los errores de observación, (8), se tiene
que, ŵ1 = w1 − ew1 , ŵ2 = w2 − ew2 . Entonces, la dinámica A continuación la estabilidad del sistema en el error (16)
de error de seguimiento resulta, se establecerá en dos etapas. Nótese inicialmente que
f¯ (e, wd ) corresponde a una perturbación acotada (17)
ėd1 = ed2 para el sistema (16). Considere entonces el sistema (16)
ėd2 = −k1 ed1 − k2 ed2 + k1 ew1 + k2 ew2 (12) libre de perturbaciones, esto es, el sistema,
e
+2a sin w21 cos w1 + 2
ŵ1
. ė = A0 e + A1 e(t − τ )
(18)
e(θ) = ϕ(θ), θ ∈ [−τ, 0]
Nótese también que,
w +w −e ew1
con la condición inicial ϕ(θ), θ ∈ [−τ, 0] y ϕ ∈
w1 +ŵ1
2 = 1 21 w1 = w1 − 2 PC [−τ, 0] , R4 al espacio de las funciones continuas a
e
= ed1 + w1d − w21 . pedazos en R4 definidas en [−τ, 0].
Finalmente, la dinámica del error en lazo cerrado (error Dado que los parámetros de las matrices A0 , A1 se pueden
de predicción y error de seguimiento) queda representada elegir libremente (dependen del predictor (7) y la retroali-
en la forma, mentación (11)), es siempre posible estabilizar asintótica-
mente el sistema (18), para un valor constante, acotado del
ėw1 = ew2 − λ1 ew1 (t − τ ) retardo τ . Esto se puede hacer, por ejemplo, al considerar
ėw2 = −λ0 ew1 (t − τ ) + f (ew , ed , wd ) la Proposición 5.2.2, Capı́tulo 5 de Gu et al. (2003). Basado
ėd1 = ed2 (13) en lo anterior, para demostrar la estabilidad del sistema
ėd2 = −k1 ed1 − k2 ed2 + k1 ew1 + k2 ew2 perturbado (16) se considerará, sin pérdida de generali-
+f (ew , ed , wd ) . dad, que existen constantes positivas λ0 , λ1 , k1 , k2 que
donde, estabilizan el sistema (18) para un valor τ > 0.
e ew
f (ew , ed , wd ) = 2a sin
w1
cos ed1 − 1 + w1d . Suponga entonces que el sistema (18) es asintóticamente
2 2 estable (es posible mostrar que para esta clase de sistemas
(14) la estabilidad resulta del tipo exponencial, Bellman and
162
CHAPTER 7. LINEAR SYSTEMS
(20) −0.4
La matriz de Lyapunov en (19) (Kharitonov (2013)),
Z ∞ −0.5
163
CHAPTER 7. LINEAR SYSTEMS
0.3
ed1(t)
0.2 ed2(t)
0.1
ed1(t) [rad], ed2(t) [rad/s]
−0.1
−0.2
−0.3
−0.4
−0.5
−0.6
−0.7
0 5 10 15 20
Tiempo [s]
0.1 0.03
0.08 0.02
u(t) [N m]
ew1(t−τ), [rad]
0.06 0.01
0.04 0
0.02 −0.01
0 −0.02
−0.02 −0.03
−0.04 −0.04
0 5 10 15 20
Tiempo [s] −0.05
0 20 40 60 80 100
Tiempo [s]
Figura 5. Señal de control: u(t).
Figura 7. Error de observación: ew1 (t).
5.2 Resultados experimentales
control u (t) aplicada al sistema y las trayectorias deseada
El esquema del predictor–observador (7) y la retroalimen- y real. Es importante mencionar, que el seguimiento de la
tación propuesta (11) se implementaron en la plataforma trayectoria se ve afectado debido a perturbaciones externas
experimental mostrada en la Figura 6. La plataforma se no modeladas, lo que hace que el sistema presente un
compone de una estructura principal que soporta una error en estado estacionario. Para verificar esta hipótesis,
barra que actúa como péndulo, un motor tipo brushless se realiza una simulación sumando un término constan-
U2216 KV900 con una hélice, necesarios para generar la te δ en la ecuación de la aceleración angular (2). Este
fuerza para levantar la barra. Un inclinómetro X3Q-T de término constante representa perturbaciones externas no
US Digital para determinar el desplazamiento angular que modeladas. En la Figura 11, se muestra la trayectoria que
experimenta la barra, y una tarjeta de desarrollo eZdsp sigue el sistema en presencia de una perturbación externa
TMS320F28335 de Spectrum Digital Inc. para implemen- δ = −0.05.
tar el esquema propuesto.
6. CONCLUSIONES
Los parámetros utilizados en el predictor observador (7) y
en la retroalimentación (11) se muestran en la Tabla 2. En
Se presenta una solución al problema de seguimiento de
Tabla 2. Parámetros para el experimento. trayectorias asociado a un actuador aerodinámico afectado
por retardos de tiempo en la señal de control. La solución
Parámetro Valor Parámetro Valor propuesta se basa en estados estimados futuros obtenidos
k1 0.15 λ0 16 con un predictor observador que es capaz de proveer
k2 0.015 λ1 10 los estados futuros necesarios. Se muestra formalmente
τ 0.1 s la convergencia de los errores de predicción ası́ como la
convergencia de los errores de seguimiento. El esquema
la Figura 7 se muestra el error de observación ew1 (t − τ ), de control es evaluado mediante simulaciones numéricas
y como se puede apreciar, el error se mantiene cercano a y con experimentos en tiempo real en una plataforma
cero. En la Figura 8 se presenta el error de seguimiento de laboratorio. Se deja como extensión a este trabajo la
ed1 (t). En las Figuras 9 y 10 se muestran la señal de compensación o cancelación de perturbaciones externas.
164
CHAPTER 7. LINEAR SYSTEMS
0.2 0.35
ed1(t) x (t)
1d
0.15 0.3 x (t)
1
0.25
0.1
0.2
0.15
0
0.1
−0.05
0.05
−0.1
0
−0.15
−0.05
−0.2 −0.1
0 20 40 60 80 100 0 5 10 15 20
Tiempo [s] Tiempo [s]
Figura 8. Error de seguimiento: ed1 (t). Figura 11. Trayectoria de referencia x1d (t) y real x1 (t) con
una perturbación δ = −0.05.
0.2
u(t)
2399 vol.3. doi:10.1109/IJCNN.2003.1223787.
Fridman, E. (2014). Introduction to Time-Delay Systems:
0.15 Analysis and Control. Birkhäuser Basel, Switzerland.
Gu, K., Chen, J., and Kharitonov, V.L. (2003). Stability of
time-delay systems. Springer Science & Business Media.
0.1 Hale, J.K. and Lunel, S.M.V. (2013). Introduction to
u(t) [N m]
0.4
of linear systems with multiple input delays. Applied
Mathematics and Computation, 244, 303 – 311.
0.3
1
0.2
of nonlinear systems. In Decision and Control, 2000.
Proceedings of the 39th IEEE Conference on, volume 5,
0.1
4428–4429. IEEE.
0
Münz, U., Ebenbauer, C., Haag, T., and Allgöwer, F.
(2009). Stability analysis of time-delay systems with
−0.1
incommensurate delays using positive polynomials. Au-
0 20 40 60 80 100 tomatic Control, IEEE Transactions on, 54(5), 1019–
Tiempo [s]
1024.
Niculescu, S.I. (2001). Delay effects on stability: a robust
Figura 10. Trayectoria de referencia x1d (t) y real x1 (t). control approach, volume 269. Springer Science &
REFERENCIAS Business Media.
Roh, Y.H. and Oh, J.H. (1999). Sliding mode control
Bellman, R.E. and Cooke, K.L. (1963). Differential- with delay compensation for uncertain input-delay sys-
difference equations. Rand Corporation. tems. In American Control Conference, 1999. Pro-
Chiang, C.C. and Tung, T.C. (2003). Robust tracking ceedings of the 1999, volume 1, 309–313 vol.1. doi:
control of uncertain nonlinear systems with an input 10.1109/ACC.1999.782790.
time delay. In Neural Networks, 2003. Proceedings of Smith, O.J. (1959). A controller to overcome dead time.
the International Joint Conference on, volume 3, 2394– iSA journal, 6(2), 28–33.
165
CHAPTER 7. LINEAR SYSTEMS
Abstract:
In this work we provide a design method of preserving order observer-based dynamic output-
feedback controllers for a family of discrete-time linear systems. This design considers the
absence and/or presence of disturbances/uncertainties in the discrete-time systems. The control
problem is focused in two purposes: (i) the stabilization of closed-loop systems by means of upper
and/or lower estimations, and (ii) the state estimation preserves the partial-order with respect
to the real state trajectory at each instant time. The proposed method studies the properties of
Lyapunov stability and cooperativeness. The first property determines the exponential stability
of the closed loop system (with controller) in combination with exponential convergence to zero
of the observation errors, when there are no uncertain terms, while the second one ensures
the partial ordering between upper/lower estimations with state. The observers gain can be
computed through a solution of a Bilinear Matrix Inequality (BMI) and a Linear Matrix
Inequality (LMI).
166
CHAPTER 7. LINEAR SYSTEMS
the stability in the sense of Lyapunov and cooperativeness. In the following paragraphs, we present the characteri-
Moreover, the computational implementation is considered zation of discrete-time systems. For more details to see
by means of a BMI and a LMI. (Mazenc et al., 2014; Hirsch and Smith, 2005).
This paper is organized as follows. The notations and Proposition 2. (Hirsch and Smith (2005)). ΓNL is a discrete-
prelimiaries are presented in Section 2. The design method time cooperative system if and only if the following condi-
of controllers based on preserving partial-order (interval) tions are satisfied,
observers, for nominal and perturbed discrete-time sys-
tems, is proposed in Section 3. Several practical aspects (a). Jf (x) 0, (b). Jf (u) 0, (c). Jh(x) 0. 2
are mentioned in Section 4. Concluding remarks are es-
tablished in Section 5. The linear cooperative systems represent a particular case
of the definition 3 ; its characterization is given in the next
2. PRELIMINARIES paragraph.
Proposition 3. The Linear Time-Invariant (LTI) system
For a better understanding, we provide the following described by
descriptions.
x(k + 1) = Ax(k) + Bu(k) , x (k0 ) = xk0 ,
2.1 Notations ΓL : (2)
y(k) = Cx(k) ,
M
M 0 : M is a Metzler matrix, if and only if where (x(k), u(k), y(k)) ∈ Rn × Rm × Rp are the state, the
Mij ≥ 0, ∀i 6= j, ∀i ∈ 1, ..., n, ∀j ∈ 1, ..., n. input and the measurement output vectors, respectively.
A 0 : A is a nonnegative matrix, sii Aij ≥ 0, ΓL is a cooperative system if and only if
∀i, j = 1, ..., n.
A ≥ 0 : A is a positive semi-definite matrix. (i). A 0, (ii). B 0 , (iii). C 0
A > 0 : A is a positive definite matrix.
B ≤ 0 : B represents a negative semi-definite matrix. For the family of discrete-time linear systems, the coopera-
B < 0 : B represents a negative semi-definite matrix. tiveness is analog to positivity property, which states that
x 0 : x is a nonnegative vector, iff xi ≥ 0, ∀i = 1, ..., n. all variables: input, output and state, are nonnegative at
x z : represents the partial-order between two vectors, all instant times.
iff xi − zi ≥ 0, ∀i = 1, ..., n.
kxk : is the Euclidean norm. 2.3 Stability of discrete-time linear systems
Jf (x) : denotes the Jacobian matrix of the nonlinear
functionf (t, x, u) with respect to the variable x In this subsection, the stability of the linear discrete-time
Jf (u) : is the Jacobian matrix of the nonlinear function systems will be analyzed.
f (t, x, u) w.r.t the variable u.
Definition 4. (Khalil (2002)). We consider the system of
the form
2.2 Systems that preserver partial-order
x(k + 1) = Ax(k), x(k0 ) = xk0 , (3)
The cooperativeness is a systemic property that ensures where A ∈ Rn×n and k ∈ N, if there exist the constants
the partial order between the state trajectories and output, α > 0 and ρ > 0 such that
depending on the partial order between the initial states kx(k, xk0 )k ≤ αkxk0 kexp(−ρk) (4)
and inputs. is satisfied for all k ∈ N, xk0 ∈ Rn , then the origin is a
Definition 1. Consider the time-discrete nonlinear system globally exponentially stable equilibrium point.
x (k + 1) = f (k, x(k), u(k)) , x(k0 ) = xk0 , Definition 5. (Jiang and Wang (2001)). The discrete-time
ΓNL : (1) linear system affected by the presence of additive exoge-
y (k) = h (k, x(k), u(k)) ,
nous inputs, is given by
where (x(k), u(k), y(k)) ∈ Rn × Rm × Rp is the state, the x(k + 1) = Ax(k) + b(k), x(k0 ) = xk0 (5)
input and the output vectors of ΓNL , respectively, at each is (globally) Input-State Stable (ISS) w.r.t b(k), if there
instant time k ∈ N. We assume that f : N×Rn ×Rm → Rn exist a function β of class KL, and a function γ of class K
and h : N × Rn × Rm → Rp are continuous functions. such that, for each initial state x(k0 ) ∈ Rn and each input
The system ΓNL is cooperative if there exists an partial b(k), then the solution x(t) exist for all t ≥ t0 , it holds
ordering between two initial states and two inputs (if they that
exist), that is; x10 x20 , u1 (k) u2 (k) , ∀k. Then, kx(k, xk0 )k ≤ β(kxk0 k, k) + γ(kb(k)) (6)
the state and output trajectories will preserve the partial-
order for all instant times: where γ is known as an ISS-gain for ΓL .
167
CHAPTER 7. LINEAR SYSTEMS
∀ x(k) ∈ Rn , for the family of Linear time-discrete S1. The pair (A, Bu ) is stabilizable, and
nominal systems in (3). The expression ∆V is given as S2. The pair (A, C) is detectable.
the Lyapunov difference. For the disturbed system in (5),
an additional condition is required: 3.1.1 Preserving partial-order observers
∆V = V (x(k + 1)) − V (x(k)) ≤ −α3 (||x(k)||) + γ(||b(k)||), We propose an Luenberger state observer for the family of
(10) discrete-time systems ΠN in order to estimate the unknown
∀b(k) ∈ Rn . state variables,
The characterizations of both definitions 5 and 6, can b(k + 1) = Ab
x y (k) − y(k)) + Bu u(k),
x(k) + L(b
ΠO :
be expressed in the ambient of Linear Matrix Inequalities yb(k) = C x
b(k), x (k0 ) = xk0
(LMI’s). (13)
Lema 6. Consider the nominal system in (3). If there where xb(k) is the estimated state. L ∈ Rn×q is the observer
exists a positive symmetric matrix P = P T > 0 and the gain associated to the output injection.
positive constant > 0, such that
The estimation error has been defined as e(k) , x b(k) −
[AT P A − P + P ] ≤ 0 (11) x(k), and the output error is ye(k) , yb(k) − y(k). Consis-
is satisfied, then x(k) = 0 is an globally exponentially stable tently, the dynamics of estimation error are given by
equilibrium point. 2
e(k + 1) = AL e(k), e(k0 ) = ek0 ,
Now, we present the characterization of practical stability ΠE : (14)
ye(k) = Ce(k),
for the system ΓL , when an exogenous input b(k) appears
in the dynamics. In this sense, we analyze the property by where the matrix AL , A + LC. From ΠO , the sufficient
means of the input-to-state stability (ISS) condition for ΓL conditions are established to define the preserving partial-
in the following Lemma. order observer for the nominal systems ΠN .
Lema 7. Consider the perturbed system in (5). If condi- In the following paragraph we establish the cooperative-
tions from Lemma 6 are satisfied, then the system in (5) ness conditions for the state observer ΠO .
is ISS with respect to the input b(k).
Definition 8. (Avilés and Moreno (2014)). The observer ΠO
Proof. Initially the demonstration of Lemma 6 is pre- is said an upper/lower preserving partial-order observer if
sented. If the Lyapunov function candidate V (x(k)) = (i). The estimation error system ΠE is cooperative, that
xT (k)P x(k) is defined for the system in (3), its Lyapunov is, given e0 0 (−e0 0), then e(t) 0 (−e(t)
difference along the trajectory of (3) is given as 0), leading to, if xb0 x0 (x0 x b0 ) → x
b (t)
∆V = V (x(k+1))−V (x(k)) = xT (AT P A−P )x ≤ −xT P x x (t) (x(t) x
b(t)), ∀t ≥ 0.
Now, the same Lyapunov function candidate is proposed
for the perturbed system, then its derivative is bounded (ii). The estimation error converge asymptotically to zero,
by that is kb
x(k) − x(k)k → 0 as k → ∞. 2
∆V ≤ − (1 − θ) V (x(k)) − θV (x(k)) + bT (k)P b(k) The first condition (i) is ensured, if
+ 2xT (k)AT P b(k) AL 0 (15)
is a nonnegative matrix, which is provide by the appli-
Completing squares, the property (6) holds. cation of the cooperative property in the estimation error
system (see Proposition 2 ). Thus, the condition guarantees
3. PRESERVING ORDER OBSERVER-BASED the partial ordering between estimation and state trajec-
CONTROL DESIGN tories for all instate times.
In this section, we present the main results of the de- An upper and a lower preserving partial-order observer
sign of preserving order observer-based dynamic output- form an interval observer, whose characteristic behavior
feedback controllers for discrete-time systems. Firstly, the guarantees that the estimations always stay above and
controller design conditions are studied for the nominal below the true state trajectory.
case, then we extend the notions to deal the presence of
disturbances/uncertainties in the dynamical system. 3.1.2 Control design method
Adding an output feedback control to the system ΠO ,
3.1 Nominal System we provide a control based on a lower/upper preserving
partial-order observer,
Consider the family of nominal systems given by (
b(k + 1) = Ab
x y (k) − y(k)) + Bu u(k),
x(k) + L(b
ΠU : yb(k) = C x
b(k), x (k0 ) = xk0 ,
x(k + 1) = Ax(k) + Bu u(k), x (k0 ) = xk0 , u(k) = −K x b(k),
ΠN : (12)
y(k) = Cx(k), (16)
where K ∈ Rn×m is the controller gain. Consequently, the
where x(k) ∈ Rn is the state, y(k) ∈ Rp is the mea- closed-loop system ΠU is combined with the estimation
surement output, u ∈ Rm is the control input. A ∈ error system, which can be described by
Rn×n , Bu ∈ Rn×m , C ∈ Rp×n , are constant matrices
such that are satisfied the next suppositions:
168
CHAPTER 7. LINEAR SYSTEMS
x(k + 1) AK −Bu K x(k) where π(t, x) represents the disturbances and/or paramet-
ΠLC : = (17) ric uncertainty affecting to ΠP . This term satisfies the
e(k + 1) 0 AL e(k)
inequality
where the matrix AK = A − Bu K. In the Theorem 9
is shown the stability condition of the augmented system
(17). π + (k, y(k)) π(k, x(k)) π − (k, y(k)) (21)
Theorem 9. Consider the nominal system in (17). Suppose which implies that the disturbance π(t, x) is bounded by
that the conditions (S1) and (S2) are satisfied. The sys- intervals, where π + (t, y) and π − (t, y) are known Lipschitz
tem ΠLC is globally exponentially stable in the sense of functions.
Lyapunov, if there exist the matrices K, L, P1 = P1T > 0,
P2 = P2T > 0 and the constants 1 > 0, 2 > 0, such that
3.2.2 Preserving partial-order observers for disturbed sys-
tems
ATK P1 AK − P1 + 1 P1 −ATK P1 Bu K
≤0 (18)
−K T BuT P1 AK Υ In order to estimate the unknown state of ΠP , two
Luenberger observers are proposed, given by
is fulfilled, where Υ = K T Bu P1 Bu K+ATL P2 AL −P2 +2 P2 .
+
Proof. xb (k + 1) = Abx+ (k) + Bu u(k) + π + (t, y)
For the augmented system in (17), a Lyapunov function ΦO + : +L yb+ (k) − y(k) , x
+
b+
b+ (k0 ) = x k0 ,
+ +
candidate is proposed as yb (k) = C x
b (k),
V (x(k), e(k)) = xT (k)P1 x(k) + eT (k)P2 e(k), (19) (22)
and its Lyapunov difference along the trajectories of the −
system ΠLC , ∆V (x, e), is given by xb (k + 1) = Ab x− (k) + Bu u(k) + π − (t, y)
T T ΦO − : +L yb− (k) − y(k) , x
−
b−
b− (k0 ) = x k0 ,
x AK P1 AK − P1 −ATK P1 Bu K x − −
yb (k) = C xb (k),
e −K T BuT P1 AK Υa e
T T
(23)
≤ −1 x P1 x − 2 e P2 e where xb+ (k) and xb− (k) are the estimate states of ΠO+
and ΠO− , respectively. The observer gains are L+ ∈ Rn×q ,
where Υa = K T Bu P1 Bu K + ATL P2 AL − P2 . Then, the
L− ∈ Rn×q .
origin, x(k) = 0, e(k) = 0, is globally exponentially
equilibrium point of the augmented system ΠLC in (17). The estimation errors are defined as e+ (k) , x b+ (k)−x(k),
e (k) , x(k) − x
− −
b (k), and the output estimate errors as
A main result of this work is presented in the following ye+ (k) , yb+ (k) − y(k), ye− (k) , y − yb− (k). Therefore, the
Theorem, which considers the design of a preserving order error dynamics are given by
observer-based control. +
e (k + 1) = A+ + +
L e (k) + w (k), e+ (k0 ) = e+
k0
Theorem 10. Consider the nominal system ΠN . Suppose ΦE + : + +
ye (k) = Ce (k)
that conditions (S1) and (S2) are satisfied. If conditions
w+ (k) = π + (k, y(k)) − π(k, x(k))
• Cooperativeness in (15) (24)
• Stability in (18)
−
are satisfied, then the augmented closed-loop system ΠLC e (k + 1) = A− − −
e− (k0 ) = e−
L e (k) + w (k), k0
is globally exponentially stable. Moreover, ΠU is a preserv- ΦE − : − −
ye (k) = Ce (k)
ing order observer.
w− (k) = π(k, x(k)) − π − (k, y(k))
Remark 11. Note that the design method presented in (25)
Theorem 10, establishes the stabilization of ΠN by means where w+ (k) and w− (k) are errors associated to the
of output feedback of ΠU . Moreover, the Theorem 10 disturbance, which represent the exogenous inputs for the
ensures that an upper or a lower estimation bound to the systems ΦE+ and ΦE− .
real state trajectory for all times. It is possible to eject
simultaneously a pair of preserving partial-order observers An extension of definition is presented for the family of
depending on an upper and an lower initial state, implying linear disturbed systems.
that the estimations form an interval, which contains the Definition 12. The estimator ΦO+ (ΦO+ ) is an upper
state trajectory. This is the typical notion of the interval (lower) preserving partial-order observer for the disturbed
observers. system ΦP , if satisfies the following conditions:
(i). The error system estimation ΦE+ (ΦE− ) is cooper-
3.2 Disturbed systems
b+
ative: if x k0 xk0 (xk0 x b− b+ (k)
k0 ), then x
x (k) (x(k) xb (k)), ∀k ∈ N.
−
Consider the discrete-time linear system
169
CHAPTER 7. LINEAR SYSTEMS
Note that item (i) of the definition 12, describes the cha- A− +
L 0, AL 0
racteristics of partial ordering between the state estima- For the second condition, its proposed a candidate Lya-
tions and trajectories. This is provide because the system punov function for the closed-loop system ΦLC ,
ΦE + (ΦE − ) is cooperative, implying that conditions
T T
A+ (A− V x, e+ , e− = xT P1 x + e+ P2 e+ + e− P3 e− (28)
L 0 L 0)
are satisfied, if the uncertain input w+ (k) 0 (w− (k) The Lyapunov difference along the trajectory of ΦLC ,
0) ∀k ∈ N, and the initial state e+ − ∆V (x, e+ , e− ) is given by the expression
k0 0 (ek0 0).
170
CHAPTER 7. LINEAR SYSTEMS
It is shown that the resulting augmented system, which Hirsch, M. and Smith, H. (2005). Monotone maps: a re-
combines the closed-loop systems with observer error dy- view. Journal of Difference Equations and Applications,
namics, is globally (ISS) exponentially stable using the 11(4-5), 379–398.
upper/lower estimations, depending on the (presence) ab- Hirsch, M. and Smith, H. (2004). Monotone Dynamical
sence of disturbances. Additionally, it is ensured the par- Systems. Handbook of differential equations: ordinary
tial ordering between estimations and state trajectories. differential equations. Vol. II, Elsevier B. V., Amster-
The observers gain can be obtained by solving a Bilinear dam 239-357.
Matrix Inequality (BMI) and a Linear Matrix Inequality Jiang, Z.P. and Wang, Y. (2001). Input-to-state stability
(LMI). for discrete-time nonlinear systems. Automatica, 37(6),
857–869.
ACKNOWLEDGEMENTS Khalil, H.K. (2002). Nonlinear Systems. Prentice Hall,
New York, USA, third edition.
J.D. Avilés acknowledge the support of CONACyT Mex- Mazenc, F., Dinh, T., and Niculescu, S. (2013). Robust
ico in the frame of National Posdoctorals Stays (CVU interval observers for discrete time systems of luenberger
208496). J. Moreno gratefully acknowledge the financial type. ACC, Washington USA, 2484–2489.
support from Fondo de Colaboración del II-FI, UNAM, Mazenc, F., Dinh, T., and Niculescu, S. (2014). Interval
project IISGBAS-122-2014, PAPIIT-UNAM under grant observers for discrete time systems. International Jour-
IN113614, and CONACyT project 241171. nal of Robust and Nonlinear Control, (17), 2867–2890.
REFERENCES Raı̈ssi, T., Efimov, D., and Zolghadri, A. (2012). Interval
state estimation for a class of nonlinear systems. IEEE
Alcaraz-Gonzalez, V., Harmand, J., Rapaport, A., Steyer, Trans. on Aut. Control, 57(1), 260–265.
J., Gonzalez-Alvarez, V., and Pelayo-Ortiz, C. (2002).
Software sensors for highly uncertain wwtps: a new
approach based on interval observers. Water Res,
36(2515).
Angeli, D. and Sontag, D. (2003). Monotone control
systems. IEEE Transactions Automatic, 48(10), 1684–
1698.
Avilés, J. and Moreno, J. (2009). Cooperative observers
for nonlinear systems. Joint 48th IEEE Conference
on Decision and Control and 28th Chinese Control
Conference, Shanghai, China, 6125–6130.
Avilés, J. and Moreno, J. (2013). Preserving order ob-
servers for nonlinear systems. International Journal of
Robust and Nonlinear Control, doi: 10.1002/rnc.2975.
Avilés, J. and Moreno, J. (2014). Diseño de observadores
que preservan el orden para sistemas no lineales usando
transformación de coordenadas. Congreso Latinoameri-
cano de Control Automático.
Avilés, J. and Moreno, J. (2015a). Método para disenar
controladores basados en observadores que preservan el
orden parcial para sistemas lineales. Congreso Nacional
del AMCA, Cuernavaca, Morelos.
Avilés, J. and Moreno, J. (2015b). Observadores que
preservan el orden parcial para sistemas en tiempo
discreto. Congreso Nacional del AMCA, Cuernavaca,
Morelos.
Bernard, O. and Gouze, J. (2004). Closed loop observers
bundle for uncertain biotechnological models. Journal
of Process Control, 14(3), 765–774.
Efimov, D., Perruquetti, W., Raı̈ssi, T., and Zolghadri, A.
(2013a). Interval observers for time-varying discrete-
time systems. IEEE Transactions on Automatic Con-
trol, 58(12), 3218–3224.
Efimov, D., Raissi, T., Perruquetti, W., and Zolghadri, A.
(2015). Design of interval observers for estimation and
stabilization of discrete-time lpv systems. IMA Journal
of Mathematical Control and Information, dnv023.
Efimov, D., Raissi, T., and Zolghadri, A. (2013b). Control
of nonlinear and lpv systems: interval observer-based
framework. IEEE Transactions on Automatic Control.
Gouze, J.L., Rapaport, A., and Hadj-Sadok, M.Z. (2000).
Interval observers for uncertain biological systems. Ecol
Modelling, 133(1-2), 45–56.
171
CHAPTER 7. LINEAR SYSTEMS
Abstract: In this work, we use coates determinant to obtain transfer functions from graphs
and in this way analyze robustness behaviour of networks against perturbations. Furthermore,
we make a comparison between obtained results and values of Sinai-Kolmogorov and Loop
Entropies to observe network performance in terms of noise resilience.
1. INTRODUCTION
172
CHAPTER 7. LINEAR SYSTEMS
ogy, Microbiology, Ecology, Sociology and other fields of In addition to Gibbs and Von Neumann Entropy ap-
Engineering through graph analysis. In next sections we proaches, Sinai-Kolmogorov Entropy is implemented in
are going to show one of this applications, specifically in Demetrius and Manke (2005) to measure how much skill
network robustness. has a network to reject fluctuations. Its value is calculated
from the largest Adjacency matrix eigenvalue (λ) through
Complex Networks and Information Theory Graph and the following expression:
Complex Networks theories have had high relevance in H = log(λ). (1)
last years in Engineering, specifically in applications such
as social and sensor Networks, Internet, and smart grids, According to Demetrius et al. (2004), network Entropy (H)
to model and design them. On the other hand, there is directly correlated with the fluctuation decay rate (R),
exists a raising interest to involve Information theory which is defined as:
definitions, specially Entropy and Mutual Information, to 1
analyze topology, robustness and security from a static R = lim − logP (t) ,
t→∞ t
point of view, i.e, graphs whose behaviour is not described
by spectral analysis but through algebraic mechanisms where P (t) is the probability that the sample mean 1
(Rashevsky, 1955; Solé R. V., 2004). As a result, some new deviates more than from its unperturbed value at time
perspectives have appeared to include not only Shannon t. Therefore, for high values of R, network fluctuations
Entropy but also physical and statistical definitions such tend to be short, whereas for low values, those tend to
as Von Neumann and Kolmogorov-Sinai entropies, to be longer. Consequently, since H and R are positively
explore network behaviour by means of Laplacian spectral correlated, Entropy is a measure of network robustness
2
analysis to study dynamics and evolution. The following .
two sections describe the most important works which On the other hand, a novel definition, named loop Entropy,
relates Entropy, Complex Networks and Graph theories, has been established in De Badyn et al. (2016), which is
developed both in statical and dynamical environments. given by the following expression:
Xn
SG = logλi ,
1.1.2.1. A static perspective As we mentioned before, i=2
Entropy requires a probability distribution to be calcu-
lated, and in this regard, some methods have appeared. where λ0i s
are the eigenvalues of the graph Laplacian LG
First, Rashevsky (1955) and Trucco (1956) use node de- and n is the number of nodes. The sum begins from 2
pendence or automorphism concept to split the graph in because λ1 = 0. There is a set of non negative eigenvalues
{n−1}
vertex sets, then, dividing the number of nodes in each set associated to a sub-matrix LG (Dirichlet Matrix) of
by the total amount of nodes in the graph, it is possible to LG , which is obtained by arbitrary elimination of the i
assign a probability for every one. In contrast, in Dehmer row and the i column of LG . In this sense it is possible
(2008) the probability distribution is obtained attributing to obtain an Entropy expression in terms of this Dirichlet
a probability to each node instead of a set of them, which Matix as follows:
avoids difficult algorithms necessary to obtain graph au- n−1
X {n−1}
tomorphisms or vertex partitioning. Other related works SG = logλi (LG ) + log(n),
have used degree distributions of network topologies such i=1
as Scale Free or Random networks to find Entropy values, which results in:
meanly to describe graph heterogeneity, which is inversely SG = log (nτ (G)) , (2)
correlated with network symmetry (Xiao et al., 2008).
where τ (G) is the number of spanning trees in the graph.
The above result shows a direct relation between Entropy
1.1.2.2. A dynamic perspective A different research have and the number of spanning trees of a graph, which means
shown ways to obtain Shannon Entropy using spectral that the network robustness increases with the number of
graph analysis instead of probability distributions. Some of spanning trees.
them have emerged to find similarities between Quantum
mechanics concepts (Gibbs and Von Neumann Entropies) 1.2 Coates Determinant
and Shannon theory, meanly aiming to employ MaxEnt
Following the procedure in Desoer (1960), it is possible
principle in other research areas such as Ecology, Biology
to obtain a transfer function for every graph turning it
and Physics. First, Passerini and Severini (2008) compute
into a flow graph, which is a directed graph where each
Von Neumann Entropy (an extension of Gibbs Entropy)
node has a self-loop. This process is based in the Coates
through a probability distribution taken from the nor-
Determinant, which is given by the next expression:
malized graph Laplacian, demonstrating that graphs with
long paths (less quantity of short paths), connected com- X
ponents and regular shapes have higher Entropy values. ∆c = (−1)n (−1)Lρ C(Go )ρ,
Secondly, Anand and Bianconi (2009) calculate Gibbs En- p
tropy through graph adjacency matrix and a probability 1 Sample mean indicates that R is taken in a process where data
distribution of having connected two nodes. This work collection is performed.
shows a way to find a “canonical network ensemble” which 2 Results in Demetrius and Manke (2005) shows that also in this
satisfies structural constraints as link or node number, i.e, case, Entropy and short paths are inversely proportional, therefore
a set of network configurations that can accomplish the a Scale Free network is more robust than a Regular or Erdös-Renyi
same function using the same resources. case.
173
CHAPTER 7. LINEAR SYSTEMS
where Lρ is the number of directed loops in the ρt h where every self-loop of the flow graph has a weight
connection, Go is the flow graph G with the source node of s+LGii . Therefore, following the above procedure and
0 deleted and C(Go) is the summation of the connection using (3) to find Numerator and Denominator expressions
gains (Desoer, 1960). as is exposed in Desoer (1960), we obtain the transfer
function for the graph of Fig. 2, which is given by
The next example explains this procedure(De Badyn et al.,
2016). s3 + 6s2 + 10s + 4
T (s) = 4 .
Example: Initially we have the graph in Fig. 2. s + 8s3 + 20s2 + 16s
2. METHODOLOGY
174
CHAPTER 7. LINEAR SYSTEMS
1
Table 2. Loop-Entropy values for each graph
0
Graph Loop-Entropy
4 nodes, 1 spanning tree 1.3863 -1
Number 2
input signal
ouput for 3 spanning trees transfer function
1.5
As we mentioned in previous sections, entropy and ro-
bustness have a direct relation. In this section we probe 1
175
CHAPTER 7. LINEAR SYSTEMS
-0.5
-1
-1.5
0 20 40 60 80 100
REFERENCES
Anand, K. and Bianconi, G. (2009). Entropy measures
for networks: Toward an information theory of complex
topologies. Physical Review E, 80(4), 045102. doi:
10.1103/PhysRevE.80.045102.
De Badyn, M.H., Chapman, A., and Mesbahi, M.
(2016). Network entropy: A system-theoretic perspec-
tive. In Proceedings of the IEEE Conference on Deci-
sion and Control, volume 2016-Febru, 5512–5517. Insti-
tute of Electrical and Electronics Engineers Inc. doi:
10.1109/CDC.2015.7403083.
Dehmer, M. (2008). A NOVEL METHOD FOR MEA-
SURING THE STRUCTURAL INFORMATION CON-
TENT OF NETWORKS. Cybernetics and Systems,
39(8), 825–842. doi:10.1080/01969720802435925.
Demetrius, L., Gundlach, V.M., and Ochs, G. (2004).
Complexity and demographic stability in population
models. Theoretical population biology, 65(3), 211–25.
doi:10.1016/j.tpb.2003.12.002.
Demetrius, L. and Manke, T. (2005). Robustness and
network evolution?an entropic principle. Physica A:
Statistical Mechanics and its Applications, 346(3-4),
682–696. doi:10.1016/j.physa.2004.07.011.
176
CHAPTER 7. LINEAR SYSTEMS
Abstract: In this work it is addressed the problem of the stabilization and control of a specific
class of high-order unstable linear systems with time delay. The system under consideration has
one unstable pole, q stable poles and a minimum phase zero. Sufficient conditions to guarantee
the stability of the closed loop system by a modified P I scheme are provided. The proposed
strategy consist of a modified version of the traditional P I controller, which include a first order
filter and conduces to the, here called, P If controller. This new scheme allows the improvement
of the existing results when a traditional P I is used for high-order systems with time delay. The
proposed result is illustrated by its application to a numerical example: control of a Continuously
Stirred Tank Reactor (CSTR) linear model.
1. INTRODUCTION only has a time delay but also it is unstable. For this
reason, the interest of dealing with unstable processes
Time delays often arise in control systems, either from containing a delay term has been growing in the control
delays in the process itself or from delays during the community Lee et al. (2010), Sipahi et al. (2011) and Gu
processing of sensed signals. Industrial processes often et al. (2003) recently.
presents time delays introduced by the finite time that
The control problem of time delay systems has been
material takes to flow through pipes. In the same way time
studied by different perspectives. The simplest approach
delay could be produced due to heat and mass transfer
consists on the approximation of the delay operator by
in chemical industries, heavy computations and hardware
means of Taylor or Pade series expansions which leads to
restrictions of computational systems, high inertia in sys-
a nonminimum- phase models with rational transfer func-
tems with heavy machinery and communications delays in
tion representation Munz et al. (2009). Another approach
space craft and remote operation.
is to compensate the effect of time delays by removing the
Typical examples of systems exhibiting time delays are exponential term from the characteristic equation of the
chemical processes Richard (2003), transportation sys- process. This technique was introduced by Smith (1957)
tems, communication and power systems Franklin et al. and it is well-known as Smith Predictor (SP). This tech-
(1995); Wang et al. (1999). In some others cases, delays nique does not have a stabilization step which restricts
are introduced by sensors and actuators devices Xian et al. its application to open-loop stable plants. To deal with
(2005), Liu et al. (2005). In most cases this phenomenon this disadvantage, some modifications of the original SP
is one of the main causes of instability and poor perfor- structure have been proposed (for instance Palmor (1996),
mance and it produces, in general, unwanted behaviors Seshagiri et al. (2007), Kawnish and Choudhury (2012)).
in dynamical systems. So, it is necessary to take special
Another solution to control delayed systems is to use a
attention to the stability analysis and controller design to
Proportional-Integral- Derivative (P ID) controllers. P ID
handle systems with time delays. Thus, there exist great
controller is widely used in the control of industrial pro-
motivation for the study of effects causing a time delay
cesses due to its simple structure in many practical pro-
in dynamic systems Hu and Lin (2001), Trentelman et al.
cesses (Silva et al. (2005)). Also, they are frequently used
(2001) and Shamsuzzoha et al. (2009). In addition, the
to stabilize unstable time-delay processes. However, sta-
problem becomes more complicated when the system not
bility conditions for such processes are a very challenging
? M. Velasco-Villa is on sabbatical stay supported by Conacyt (No. topic. Huang and Chen (1997), use root locus diagrams
260936) from Mechatronic Section of the Electrical Engineering to study the stabilizability problem of unstable delay pro-
Department.
177
CHAPTER 7. LINEAR SYSTEMS
2. PROBLEM STATEMENT Proof. Suppose that condition (4) is satisfied. Due to the
freedom selecting the parameter φ of the P If controller
Consider the following class of single input single output given by (2), it is possible to obtain a cancellation of the
(SISO) linear time invariant systems (LTI) with delay in zero of the system (1) by φ = β. Therefore, the open-loop
the input-output path given by, response is obtained as,
178
CHAPTER 7. LINEAR SYSTEMS
Therefore, from the decreasing and increasing properties
(jω
(jω)2 + (kf + ki + φ)jω + ki φ +
β)
Qq −jωτ stated in (10) and (11), and after some computations, the
Q(jω) = kp α
(jω
+
e . (5)
jω (jω − γ) φ) m=1
(jω + δm ) set of stabilizing k¯f values are given by,
¯
kp = kp (kf + ki + φ)
When the integral part k¯i is considered to be different from
zero, the phase expression 6 Q(jω) and magnitude expres-
k¯f = kf +k1 i +φ
(6) sion MQ(jω) are represented by (13) and (14), respectively
k¯ = ki φ .
i kf +ki +φ ω
6 Q(jω) = − π − arctan − ωτ +
a
From (6), the open-loop response (5) can be rewriten as, X q
k¯i ω
arctan k¯f ω − − arctan (13)
ω m=1
bm
¯
(jω)2 + ( k1¯ )jω + kk¯i
Q(jω) = k¯p k¯f α Qqf e−jωτ . (7)
f
X
q Taking into account k¯i = 0 the magnitude expressions
ω ω
6 Q(jω) = − π − arctan −ωτ +arctan k¯f ω − arctan (14) and (9) are equivalents. In the same way, it can
γ δm
m=1 also be noted that (13) is equivalent to (8) when k¯i = 0.
(8) Considering this fact, it is possible to follow the principle
of argument continuity for k¯i , this is, it is always possible
to choose a small enough k¯i gain such that conditions (10)
v and (11) are fulfilled. Thus, once an stabilizing k¯f gain is
u 2
u 1 + k¯f ω 2 found, it is always possible to choose a small enough k¯i
MQ(jω) = k αt
¯ p Qq . (9)
(ω + γ ) m=1 ω 2 + δm 2
2 2 gain such that the Nyquist stability criterion is satisfied
and the system (7) can be stabilized by a P If controller.
Taking into account the Nyquist stability criterion, system From this fact, if the phase and magnitude condition are
(7) will be stable if and only if N + P = 0, where P the satisfied, then there exists a adequate kp > 0 such that the
number of poles in the right half complex plane and N the Nyquist diagram will start in the correct position and with
number of rotations to the point (−1, 0j). In this case, as a counterclockwise rotation, then the Nyquist stability
P = 1, it is required to have a counterclockwise rotation criterion will be satisfied. The range of k¯p values is given
to the point (−1; 0) in the Nyquist diagram. by,
Therefore, since the phase expression has an initial value
of −π and assuming that condition (4) is satisfied, to
have a counterclockwise direction in the Nyquist diagram k¯p (ωc1 ) < k¯p < k¯p (ωc2 )
the magnitude expression MQ(jω) must be an strictly
decreasing function of ω and the phase expression 6 Q(jω) with ωc1 < ωc2 being the first two phase crossover frequen-
must be an increasing function of ω. From this fact, it is cies solutions of,
possible to prove based on (4) that,
ω X
q ω
ci k¯i ci
2 arctan −ωc τ +arctan k¯f ωci − − arctan = 0.
d MQ γ ωci δm
< 0. (10) m=1
dω k¯ α
2 2 (15)
p ω=0
and where ωci=1,2 are crossover frequencies and k¯p (ωci ) are
given by
d
(6 Q(jω)) >0 (11) v
dω ω=0 u 2 Qq 2
1 u ωci + γ 2 2
m=1 ωci + δm
¯
kp (ωci ) = u 2 . (16)
Consequently, if (4) is satisfied, there exist gains k¯p , k¯f and α1 t ¯
k¯i such that the system (1) is stable in the closed-loop. 1 + k¯f ω ci − ωkci
i
179
CHAPTER 7. LINEAR SYSTEMS
Thus, if condition (4) is is fulfilled, there always exist k¯f , Table 1. CSTR parameters.
k¯i and k¯p gains which are able to stabilize systems of the
class (1) in close-loop by using a P If control structure. Description Variables Values Unit
Remark 2. It is worth noting that the proposed P If con- Volume of the reactor V 85 f t3
troller, in this work, not only maintains the basic proper- Activation energy
Heat transfer coefficient
−∆E
U
32, 400
75
Btu/lbmol
btu/hr◦ F
ties of a convencional P I controller regarding disturbance Heat of Reaction −∆H 39, 000 Btu/lbmolP O
rejection and reference tracking of step type signals. Be- Heat transfer area A 88 f t2
Frequency factor k0 16.96x1012 hr−1
sides, it presents two advantages over P/P I controller; Ideal gas constant R 1.987 Btu/lbmol◦ F
as
q the P stability condition is improved adding the term
Volume cooling liquid in the chamber Vj 21.25 f t3
1 q 1
γ2 + m=1 δm 2 , i.e., the P If allows stabilizing systems
Heat capacity on reactor with density of the reagent pcp 53.25 Btu/f t3 ◦ F
with larger delays than the ones allowed by the conven-
Density of the cooling liquid with heat capacity of the
tional P/P I controller. The second advantage is that the cooling liquid
pj cpj 55.6 Btu/f t3 ◦ F
o o
(CAf , CA , F o , Fjf
o
, T o , Tjo , Tfo , Tjf
o
)=
(0.132, 0.066, 340, 28.75, 101.1, 55, 60, 50).
o
where CA is the concentration of the product A, F o
Fig. 2. Unstable continuously stirred tank reactor (CSTR). o
is the concentration of the product A, Fjf is the flow
o
cooling liquid (reactor jacket), T is the temperature in
The purpose of measuring the temperature in a reactant the Reactor, Tjo is the temperature in the cooling liquid
A is to manipulate the speed of the chemical reaction and,
thus, to improve the selectivity of the reaction system. To (reactor jacket), Tfo is the Input temperature product A
o
remove the heat of the reaction, the reactor is surrounded and Tjf is the cooling liquid temperature at the input-
by a jacket (coolant jacket) through which flows a coolant. liquid.
Often, the heat transfer fluid is pumped through agitation The linear representation around the considered operating
nozzles that make the fluid circulate through the jacket at point it is obtained as,
high velocity as you can see in Fig. 2.
From Bequette (2003) system parameters are listed in Tj (s) 0.8714s + 6.963
Table 1. = 2 . (20)
Tjf (s) s + 2.848s − 1.132
Differential equations of the process shown in Fig. 2 are
Notice that the linear system (20) is of the form given
characterized as follows:
in equation (1) with τ = 0 since it has an unstable pole,
(1) The balance of the mass on component A is, a stable pole and a zero . Using high quality sensors to
determine the temperature (Tj ) in CSRT reactors usually
dCA means high costs. This disadvantage in the sensor gener-
V = F CAf − F CA − VrA
dt ates a dead-time in the measurement of the temperature,
with which creates a problem for the control strategy design,
−∆E
that increases its difficulty when the system is unstable.
rA = k0 e RT CA For that reason, it is possible to consider this effect by
180
CHAPTER 7. LINEAR SYSTEMS
Tj (oF)
0.8
Taking into account equation (1), the system parameters
0.6
(21) are γ = 0.35, δ = −3.20, β = 7.97 and τ = 0.3. From
the stability condition shown in Theorem 1 and assuming
0.4
that m = 1,
0.2
r
1 1 1 1
τ = 0.3 < − + 2
+ 2 = 5.3 0
γ δ γ δ 0 50 100 150 200
Time (hours)
therefore, system (21) can be stabilized by a P If con- Fig. 4. CSTR response by using a P If .
troller. To get the controller, the first step considers the
free pole φ given by the P If which is located on the same 0.05
position as zero β in order to cancel this dynamic. Then, CSTR nominal response
φ = 7.97. From equation (12), the range of values for the CSTR response with delay uncertainty up to 28%.
stabilizing gain k¯f is −2.21 < k¯f < 2.84. 0
0.023 0.51 −0.2
P If = 0.23 1 + + . 0 50 100 150 200
s s + 7.97 Time (hours)
Fig. 5. Control signal.
In order to illustrate that the Nyquist stability criterion
has been satisfied, Fig. 3 shows the existence of counter- 5. CONCLUSION
clockwise rotation to the point (−1, 0j) in the Nyquist
diagram. This paper presents a sufficient condition for the stabi-
lization of a high order unstable linear systme with time
delay and one “stable” zero by using a modified-P I con-
troller (P If ). Additionally, the procedure to determine the
parameters for the stabilizing gains kp , ki and kf are given
1 in order to provide an accurate P If controller tuning. The
k =2
p proposed scheme not only maintains the basic properties
0.5
of a convencional P I controller, but also, the stability con-
Imaginary Axis
181
CHAPTER 7. LINEAR SYSTEMS
182
CHAPTER 7. LINEAR SYSTEMS
1. INTRODUCTION tuning gains and the fixed stabilization time, making this
time hard to tune.
Several applications are characterized for requiring hard To overcome the above, a class of first-order dynamical
time response constraints. In order to deal with those systems with the minimum upper bound of the fixed
requirements, various developments concerning to concept stabilization time equal to their only tuning gain has been
of finite-time stability have been carried out (see for studied (Sánchez-Torres et al., 2014; Sánchez-Torres et al.,
example: Roxin (1966); Weiss and Infante (1967); Michel 2015). It is said that these systems exhibit the property of
and Porter (1972); Haimo (1986); Utkin (1992); Bhat and predefined-time stability.
Bernstein (2000); Moulay and Perruquetti (2005, 2006)).
Nevertheless, usually this finite time is an unbounded In this sense, this paper introduces the concept of non-
function of the initial conditions of the system. singular predefined-time stable manifolds. Similarly to
Jiménez-Rodrı́guez et al. (2016), the proposed scheme
With the aim to eliminate this boundlessness, the notion allows to define second-order predefined-time stable
of fixed-time stability have been studied in Andrieu systems as a nested application of first-order predefined-
et al. (2008); Cruz-Zavala et al. (2010); Polyakov (2012); time stabilizing functions, with the difference that such
Fraguela et al. (2012); Polyakov and Fridman (2014). function which defines the manifold and its derivative
Fixed-time stability represents a significant advantage along the system trajectories are continuous, therefore no
over finite-time stability due to its desired feature of the singularities are presented for the system evolution.
convergence time, as a function of the initial conditions, is
bounded. That makes the fixed-time stability a valuable Finally, this idea is used to solve the problem of predefined-
feature in estimation and optimization problems. time exact tracking in fully actuated mechanical systems,
assuming the availability of the state and the desired
For the most of the proposed fixed-time stable system, trajectory and its two first derivatives measurements.
there are problems related with the convergence time.
First, the bounds of the fixed stabilization time found In the following, Section 2 presents the mathematical
by Lyapunov analysis constitute usually conservative preliminaries needed to introduce the proposed results.
estimations, i.e. they are much larger than the true Section 3 exposes the main result of this paper, which is
fixed stabilization time (see for example Cruz-Zavala the non-singular predefined-time stable manifold design.
et al. (2011), where the upper bound estimation is Section 4 presents a non-singular second-order predefined-
approximately 100 times larger than the actual true fixed time tracking controller for fully actuated mechanical
stabilization time). Second, and as consequence, it is often systems. Section 5 describes the model of a planar two-link
complicated to find a direct relationship between the manipulator, where the proposed controller is applied. The
183
CHAPTER 7. LINEAR SYSTEMS
of the settling time function for the system (1), i.e., will denote the k × k matrix defined as
T = {Tmax > 0 ∶ ∀x0 ∈ Rn ∶ T (x0 ) ≤ Tmax } . ⎡ v1 0 ⋯ 0 ⎤
⎢ ⎥
⎢ 0 v2 ⋯ 0 ⎥
(2)
diag(v) = ⎢⎢ ⎥.
⎥
⎢ ⋮ ⋮ ⋱ ⋮ ⎥
The minimum bound of the settling-time function Tf , is
defined as: ⎢ 0 0 ⋯ vk ⎥
Tf = inf T = sup T (x0 ). (3) ⎣ ⎦
x0 ∈Rn Remark 2.5. The properties (i), (ii), (iii), (iv) and (vi) of
Remark 2.2. In a strict sense, the time Tf can be Remark 2.3 remain the same. For v ∈ Rk , the derivatives
considered as the true fixed-time in which the system (1) of the functions ∣⋅∣ , ⌊⋅⌉ and Φp (⋅; Tc ) are:
h h
stabilizes.
∂ ∣v∣
h
Definition 2.4. (Sánchez-Torres et al., 2014; Sánchez- = diag [h ⌊v1 ⌉ h ⌊vk ⌉ ] = hdiag ⌊v⌉
h−1 h−1 h−1
... ,
Torres et al., 2015) For the case of fixed time stability when ∂v
∂ ⌊v⌉
the time Tf defined in (3) can be tuned by a particular h
selection of the parameters ρ of the system (1), it is said = diag [h ∣v1 ∣ . . . h ∣vk ∣ ] = hdiag ∣v∣
h−1 h−1 h−1
that the origin of the system (1) is predefined-time stable. ∂v
Definition 2.5. Let h ≥ 0. For x ∈ R, define the function and
∂Φp (v; Tc ) dΦp (v1 ; Tc ) dΦp (vk ; Tc ))
⌊x⌉ = ∣x∣ sign(x), = diag [ ],
h h
...
with sign(x) = 1 for x > 0, sign(x) = −1 for x < 0 and
∂v dv1 dvk
sign(0) ∈ [−1, 1]. respectively.
Remark 2.3. For x ∈ R, some properties of the function Remark 2.6. It is important to note that if k = 1, all
⌊⋅⌉h are: the extensions reduce to the scalar case considered by
Definition 2.5, Remark 2.3 and Definition 2.6.
⌊x⌉ is continuous for h > 0.
h
(i)
⌊x⌉ = sign(x).
0 From the Definition 2.6 of the stabilizing function, the
(ii)
⌊x⌉ = ⌊x⌉ = x, following Lemma presents a dynamical system with the
1
(iii)
⌊0⌉ = 0 for h > 0.
h predefined-time stability property.
(iv)
Lemma 2.1. (Sánchez-Torres et al., 2014; Sánchez-Torres
= h ⌊x⌉ = h ∣x∣ .
h h
h−1 h−1
(v) d∣x∣ and d⌊x⌉ et al., 2015) The origin of the system
h1 , h2 ∈ R, it follows:
dx dx
ẋ = −Φr (x; Tc )
(vi) For
h +h
⋅ ∣x∣ 1 ∣x∣ 2 = ∣x∣ 1 2
h h (6)
h +h with Tc > 0, and 0 < r < 1 is predefined-time stable with
⋅ ⌊x⌉ ∣x∣ = ∣x∣ 1 ⌊x⌉ 2 = ⌊x⌉ 1 2 Tf = Tc . That is, x(t) = 0 for t > Tc in spite of the x(0)
h1 h2 h h
h1 +h2
⋅ ⌊x⌉ ⌊x⌉ = ∣x∣
h1 h2
value.
184
CHAPTER 7. LINEAR SYSTEMS
where ⌊Φp (x1 ; Tc1 )⌉ 1−p = [ T1c p ] exp ( 1−p ∣x1 ∣p ) ⌊x1 ⌉ with
1
1 1−p 1 A common problem in mechanical systems control is
to track a desired time-dependent trajectory described
0 < p < 12 . by the triplet (qd (t), q̇d (t), q̈d (t)) of desired position
185
CHAPTER 7. LINEAR SYSTEMS
qd (t) = [qd1 (t) ⋯ qdn (t)]T ∈ Rn , velocity q̇d (t) = 5. EXAMPLE: TRAJECTORY TRACKING FOR A
[q̇d1 (t) ⋯ q̇dn (t)]T ∈ Rn and acceleration q̈d (t) = TWO-LINK MANIPULATOR
[q̈d1 (t) ⋯ q̈dn (t)]T ∈ Rn , which are all assumed to be
known. Consider a planar, two-link manipulator with revolute
joints as the one exposed in Utkin et al. (2009) (see Fig.
To be consequent with the state space notation, the desired 1). The manipulator link lengths are L1 and L2 , the link
position and velocity vectors are redefined as x1,d = qd masses (concentrated in the end of each link) are M1 and
and x2,d = q̇d = ẋ1,d , respectively. Then, defining the error M2 . The manipulator is operated in the plane, such that
variables as e1 = x1 −x1,d (position error) and e2 = x2 −x2,d the gravity acts along the z−axis.
(velocity error), the error dynamics are:
ė1 = e2
Examining the geometry, it can be seen that the end-
effector (the end of the second link, where the mass M2 is
ė2 = f (x1 , x2 ) + B(x1 , x2 )u − ẍ1,d ,
(17)
concentrated) position (xw , yw ) is given by
with initial conditions e1 (0) = e1,0 = x1,0 − x1,d (0), e2 (0) = xw = L1 cos(q1 ) + L2 cos(q1 + q2 )
e2,0 = x2,0 − x2,d (0). yw = L1 sin(q1 ) + L2 sin(q1 + q2 ),
The task is to design a state-feedback, second-order, where q1 and q2 are the joint positions (angular positions).
predefined-time controller to track the desired trajectory. y
In other words, the error variables e1 and e2 are to be
stabilized in predefined time with available measurements
of x1 , x2 , x1,d , x2,d = ẋ1,d and ẍ1,d .
M2
yw
c11 = −hq̇2
Hence, for the system (19) the following controller is
proposed:
c12 = −h(q̇1 + q̇2 )
u = −B −1 (x1 , x2 )[f (x1 , x2 ) − ẍ1,d + c21 = hq̇1
c22 = 0,
diag ∣e2 ∣ p−1 [Ψ(s1 )e2 + Φr (s2 ; Tc2 )] ], (20)
p
186
CHAPTER 7. LINEAR SYSTEMS
Control signals
The desired circular trajectory in the joint coordinates −20
Time [s]
12
Figure 4. Control signal. First component (gray and solid)
and second component (black and solid).
10
8
Variable s2
6 0.3
0.2
4 Start of desired trajectory
2 0.1
0 0
y
−2 −0.1
0 0.5 1 1.5 2 2.5 3 3.5 4
Start of actual trajectory
Time [s]
−0.2
1
Note that σ2 (t) = 0 for t ≥ 0.47 s < Tc2 = 0.5 s (Fig. 2).
0.5 Once the error variables slide over the manifold σ2 = 0,
this motion is governed by the reduced order system
ė1 = e2 = −Φp1 (e1 ; Tc1 ).
0
Error variables
−0.5
1st comp. e1(t) [rad] This imply that the error variables are exactly zero for
t > Tc1 + Tc2 = 1.5 s. In fact, from Fig. 3, it can be seen
−1 2nd comp. e (t) [rad]
1
that e1 (t) = e2 (t) = 0 for t ≥ 0.74 s < Tc1 +Tc2 = 1.5 s. Fig. 4
1st comp. e2(t) [rad/s]
−1.5
2nd comp. e2(t) [rad/s]
−2
shows the control signal (torque) versus time. Finally, from
Fig. 5, it can be seen the reference tracking in rectangular
−2.5
coordinates.
−3
Table 1. Parameters of the two-link manipula-
−3.5
0 0.5 1 1.5 2 2.5 3 3.5 4 tor model.
Time [s]
Parameter Values Unit
Figure 3. Error variables. First component of e1 (dark M1 0.2 kg
gray and thick), second component of e1 (black and M2 0.2 kg
dashed), first component of e2 (light gray and solid) L1 0.2 m
and second component of e2 (black and solid).
L2 0.2 m
187
CHAPTER 7. LINEAR SYSTEMS
188
CHAPTER 7. LINEAR SYSTEMS
AbstractIn this paper the problem of stable estimation for linear parameter varying (LPV) systems in
finite-time setting is considered. In order to bound the states and the estimation errors, a parameter-
dependent filter is proposed. Under the assumption that the parameters variation are sufficiently small,
the filter existence and synthesis is characterized by linear matrix inequalities (LMI) conditions. A
numerical example is provided to illustrate the proposed technique.
Keywords: Filter design, Disturbance signals, Uncertain linear systems, Linear parameter varying
systems, Finite time stability, Finite time boundedness, Linear matrix inequality.
189
CHAPTER 7. LINEAR SYSTEMS
( )
Therefore, it is interesting the search for solutions that uses less N
computational effort than the DLMIs. Thus, in this paper we ∆N = θ ∈ R : ∑ θi = 1, θi ≥ 0 .
N
aim achieving this goal by using only the LMI framework. i=0
Under the assumption that the parameters of the plant are To account the information given by the parameter α, parameter-
sufficiently slow time-varying, a new synthesis condition for a dependent matrices are used in the dynamics of a full order
homogeneous polynomially parameter-dependent FTB filter for proper filter, precisely:
continuous-time LPV systems is derived in this paper. Those ẋ f (t) = A f (α)x f (t) + B f (α)y(t),
systems are an indexed collection of linear systems in which (4)
z f (t) = C f (α)x f (t),
the indexing parameter is independent of the state (Shamma,
2012). Depending on the scenario, this indexing parameter where x f (t) ∈ Rn is the filter state and z f (t) ∈ R p the estimated
can be seen as a parametric uncertainty of the model or a signal. Coupling the filter to the plant, the equations that de-
measurable parameter possibly read in real time, which can scribe the augmented system dynamics are given by
be used in the design of a controller or a filter that accounts ς̇ (t) = Ā(α)ς (t) + B̄(α)w(t),
for all possible variations of this parameter. Furthermore, the (5)
e(t) = C̄(α)ς (t),
proposed approach also considers that for a particular case
where the LMIs depend on a parameter α in the unit simplex, where ς (t) = [x(t)0 x f (t)0 ]0 , e(t) = z(t) − z f (t), and
homogeneous polynomial structures can be used in the search A(α) 0 B(α)
for less conservative sets of design conditions, as done in Ā(α) = , B̄(α) = ,
B f (α)Cy (α) A f (α) B f (α)D(α)
Oliveira and Peres (2007).
C̄(α) = [Cz (α) −C f (α)] .
This paper is organized as follows. Detailing of the problem and
auxiliary lemmas are presented in Section 2. The main theorem, It is desirable that both the state of the plant and the error
where LMI conditions are derived for the synthesis of a filter between it and the state of the filter are bounded during a finite
that solves the FTB problem for LPV systems, is proved in time horizon. This motivates Definition 1.
Section 3. A numerical example is given in Section 4 to illus- Definition 1. Given three positive scalars c1 , c2 and T , with
trate the application of the technique. Finally, the conclusion is c2 > c1 , positive definite matrices R p ∈ Rn×n and Re ∈ Rn×n
presented in Section 5. and the class of signals Wd , the LPV system
In the sequel the following notation will be used: The symbol ς̇ (t) = Ā(α)ς (t) + B̄(α)w(t) (6)
(0 ) indicates the transpose of a matrix; P > 0 means that P is is FTB with respect to (c1 , c2 , Wd , T, R p , Re ), if
symmetric positive definite. R represents the set of real num- 0
bers, Z+ = {0, 1, 2, . . .} the set of nonnegative integers. card (·) x(0) Rp 0 x(0)
≤ c1
denotes the cardinality of a set. λmax (·) and λmin (·) indicate, x(0) − x f (0) 0 Re x(0) − x f (0)
respectively, the maximum and the minimum eigenvalue of the implies that
argument. The term (?) indicates symmetric terms in the LMIs 0
x(t) Rp 0 x(t)
and I and 0 are the identity and the zero matrices of suitable ≤ c2
x(t) − x f (t) 0 Re x(t) − x f (t)
dimensions.
for all w ∈ Wd and for all t ∈ [0, T ].
2. PROBLEM STATEMENT AND PRELIMINARY Remark 1. It should be noted that the definition of a FTB
RESULTS system presented here is a specialization for LPV systems of
the definition presented in Amato et al. (2001) with R chosen
Consider a LPV system with t ∈ [0, T ] and as:
R + Re −Re
ẋ (t) = A (α (t)) x (t) + B (α (t)) w (t) , (2) R= p . (7)
−Re Re
y (t) = Cy (α (t)) x (t) + D (α (t)) w (t) , The matrices R p and Re can be seen as weighting matrices that
z (t) = Cz (α (t)) x (t) , set the importance between bounding the states of the plant and
where x(t) ∈ Rn is the state space vector, y(t) ∈ Rq is the the error between it and the states of the filter. In contrast to
measured output, z(t) ∈ R p is the signal to be estimated and a classical scenario in which the Lyapunov stability of x − x f
w(t) ∈ Rr is the noise input with bounded L2 norm. The pa- implies the Lyapunov stability of z − z f using a Luenberger
rameter α(t) is assumed to be available online and is continu- observer as a filter, constraining the error x − x f in a region
ous with respect to its time dependence—which, to lighten the during a finite time does not necessarily imply that z − z f
notation, will be omitted wherever there is no ambiguity. satisfies the same constraining. In fact, z − z f can be bounded
independently of x − x f , motivating the next definition.
All matrices are real, with appropriate dimensions and belongs
to the polytope P Definition 2. Given a symmetric positive definite matrix Ω, the
filter (4) is said to be Ω-bounded in finite time T if its estimation
A(α) B(α) N Ai Bi error e (t) satisfies
Cy (α) D(α) = ∑ αi Cyi Di . (3)
e0 (t)e(t) < ς 0 (t)Ω−1 ς (t), ∀t ∈ [0, T ]. (8)
Cz (α) − i=1 Czi −
Taking into account the above discussion, the FTB filtering
For all t ∈ [0, T ], the system matrices are given by the convex problem to be solved in this work is formally stated as follows.
combination of the known vertices of the polytope P.
Problem 1. Assuming that α ∈ ∆N is available online for every
The vector of time-varying parameters α ∈ RN belongs for all t ∈ [0, T ] and its variation is sufficiently small, find matrices
t ∈ [0, T ] to the unit N-simplex ∆N , that is: A f (α), B f (α) and C f (α) in (4), such that the augmented
190
CHAPTER 7. LINEAR SYSTEMS
system (5) is FTB with respect to (c1 , c2 , Wd , T, R p , Re ) and the together with the following change of variables
estimation error is Ω-bounded in finite time T . M(α) = −KA f (α)U (α) Z (α) , (13a)
L(α) = −KB f (α), (13b)
The subsequent lemma, from Amato et al. (2001) with an
extended proof in Borges et al. (2013) considering a wider class F(α) = C f (α)Γ22U (α) Z (α) , (13c)
of noises, presents a sufficient condition to analyze if a LPV W (α) = Q−1
2 (α) , (13d)
system is FTB and it is used in the solution of Problem 1.
where X (α), Y (α) and Q−1
2 (α) are chosen such that
Lemma 1. For a sufficiently slow varying parameter α, sys-
tem (6) is FTB with respect to (c1 , c2 , Wd , T, R p , Re ), if, for Z(α) = X −1 (α) ,
all α ∈ ∆N , there exist positive definite symmetric matrices W (α) = Q−1 2 (α) ,
Q1 (α) ∈ R2n×2n , Q2 (α) ∈ Rr×r and a positive scalar β such K = Y (α) − Z(α).
that
By multiplying the LMI (11a) on the left by H̄ 0 (α) and on the
Ā(α)Q̃1 (α) + Q̃1 (α)Ā0 (α) − β Q̃1 (α) B̄(α)Q2 (α) right by H̄ (α), and multiplying the result on the left by H̃ 0 (α)
< 0,
(?) −β Q2 (α) and on the right by H̃ (α), with
(9) −1
c1 d c2 e−β T N (α) 0 H (α) 0
H̄ (α) = , H̃ (α) = ,
+ < , (10) ? I ? I
λmin [Q1 (α)] λmin [Q2 (α)] λmax [Q1 (α)]
I 0
in which Q̃1 (α) = R−1/2 Q1 (α)R−1/2 , with R given by (7). N (α) = ,
0 X (α)
3. MAIN RESULTS the LMI (9) is obtained. Moreover, it is easy to see that
LMI (10) is satisfied if the conditions
The following theorem presents sufficient conditions, in terms c1 µ1 + dµ3 < c2 e−β T µ2 , (14)
of a parameter-dependent LMI, for the synthesis of matrices
that solves Problem 1. Q−1
2 (α) < µ3 I, (15)
Theorem 1. Given a LPV continuous-time system (2), param- µ2 I < Q1 (α) < µ1 I,
−1
(16)
eters (c1 , c2 , d, T, R p , Re ) and a fixed scalar parameter β , if, are guaranteed.
for each α ∈ ∆N , there exist symmetric positive definite ma- Inequalities (14) and (15) are LMIs (11b) and (11c), respec-
trices K ∈ Rn×n , W (α) ∈ Rr×r and Z(α) ∈ Rn×n ; matrices tively. By multiplying inequality (16) on the left and on the right
L(α) ∈ Rn×q , M(α) ∈ Rn×n and F(α) ∈ R p×n and positive by R1/2 , with R given by (7), one has
real scalars µ1 , µ2 and µ3 , such that
" # 1 (α) < µ1 R.
µ2 R < Q̃−1 (17)
M11 (α) M12 (α) KB(α) + L(α)D(α)
(?) M22 (α) Z(α)B(α) < 0, Knowing that the identity Q̃1 (α) Q̃−1
1 (α) = I gives the equa-
(?) (?) −βW (α) tions
(11a)
M11 (α) = −β K − M(α) − M 0 (α), X (α)Y (α) +U 0 (α)V (α) = I,
M12 (α) = KA(α) + L(α)Cy (α) + M(α),
A0 (α)Z(α) + Z(α)A(α) − β Z(α), X (α)V 0 (α) +U 0 (α) Ŷ (α) = 0,
one has for U (α) = X (α) that
c1 µ1 + dµ3 < c2 e−β T µ2 , (11b)
V (α) = −K,
W (α) < µ3 I, (11c)
µ2 R p < Z(α) < µ1 R p , (11d) Ŷ (α) = K.
µ2 Re < K < µ1 Re , (11e) Consequently, inequality (17) is satisfied if, and only if
K 0 F 0 (α) K + Z(α) −K
µ2 R < < µ1 R. (18)
(?) Z(α) Cz0 (α) − F 0 (α) > 0 , (11f) −K K
(?) (?) I
Left-multiplying LMI (18) by G0 and right-multiplying by G,
then for a sufficiently slow varying parameter α there exists with
a filter in the form (4), such that the augmented system (5) is
I 0
FTB with respect to (c1 , c2 , Wd , T, R p , Re ) and the filter is also G=
I I
Ω-bounded in finite time T for Ω = ΓQ̃1 Γ0 , Γ = diag(I, Γ22 ), one can see that inequality (18) is equivalent to LMIs (11d) and
with Γ22 non-singular. A realization of the filter is given by the (11e). At last, by multiplying LMI (11f) on the left by H̄ 0 (α)
matrices: and on the right by H̄ (α), multiplying the result on the left by
A f (α) = −K −1 M(α),
J˜0 (α) and on the right by J˜(α), with
B f (α) = −K −1 L(α), (12) −1
J (α) Γ−1 22 0 , J (α) =
I X (α)
J˜(α) =
C f (α) = F(α)Γ−1
22 . 0 I 0 X (α)
and then applying Schur complement in the resulting matrix,
Proof. As presented in Chilali and Gahinet (1996) in the con-
one has
text of pole placement, consider the partitioned matrices
e0 (t)e(t) < ς 0 (t)Ω−1 ς (t),
X (α) U 0 (α) Y (α) V 0 (α) which guarantees the constraint in the estimation error.
Q̃1 (α) = , Q̃−1
1 (α) = ,
U (α) X̂ (α) V (α) Ŷ (α)
By the choice of U(α), one has that U(α)Z(α) = I and that the
Y (α) I filter matrices A f (α), B f (α) and C f (α) can be recovered from
H (α) = ,
V (α) 0 the change of variables in (13).
191
CHAPTER 7. LINEAR SYSTEMS
Remark 2. It should be remarked that if β is not fixed, the pro- be used to automatically carry this relaxation (Agulhari et al.,
posed conditions are not longer linear for a fixed α. Actually, 2012).
they are not even bilinear due to (11b). Aside from that, a binary
It is important to note that for the particular case that g = 0, the
search for a suitable β can be guided by balancing (11a) and
recuperation of the filter from the LMI variables using (12) is
(11b) and should not be a computational burden since β is just
free from the parameter α, and consequently, the assumption
a scalar variable.
that α can be read in real time is no longer required; the filter
In the definition of Problem 1, Ω is a design parameter that is robust in the sense that the parameter can be considered
must be adjusted for a suitable weighting between the size of uncertain. The reason why is used degrees greater than zero
output estimation error and the size of the filter and plant states. is that a sequence of less conservative LMI relaxations may
It should be observed that there is no loss of generality to write be obtained in the conditions of Theorem 1 by increasing the
Ω = ΓQ̃1 Γ0 and to consider the parameter Γ as invertible (Ω, degree g, as will be clearer in the next theorems.
Γ, Q̃1 are invertible). The choice of Γ = diag(I, Γ22 ) allows to Theorem 2. For given ḡ and µ̄2 , let c∗2 (ḡ) be the optimal solu-
directly adjust the matrix C f with a scale factor given by Γ22 . tion of
Consequently, the quality of the realization {A f , B f ,C f } of the min c2
filter can be improved without deteriorating the estimate error.
such that (11) holds with g = ḡ and µ2 = µ̄2 .
Theorem 1 leads to a LMI feasibility problem that must be
Then, c∗2 (ḡ + 1) ≤ c∗2 (ḡ).
satisfied for all parameters α ∈ ∆N . Although this is an infinite
dimension problem in the parameter α, the fact that it lies in the Proof. If there exist scalars µ1 ,µ̄2 and µ3 ; matrix K ∈ Rn×n ;
unit N-simplex can be used to find sufficient LMI conditions and matrices W , Z, L, M, F ∈ H(g) such that (11) holds, then
written only in terms of the vertices of the polytope (Bliman µ1 , µ̄2 , µ3 , K and the following matrices
et al., 2006). ! ! ! ! !
N N N N N
In fact, using the relaxation proposed in Oliveira and Peres ∑ αi W, ∑ αi Z, ∑ αi L, ∑ αi M, ∑ αi F,
(2007) one can write the parameter-dependent LMIs in The- i=1 i=1 i=1 i=1 i=1
orem 1 as LMIs that are independent of the parameter α. As belonging to H(g+1) , are a particular solution to (11), since
the level of relaxation increases, it is possible to achieve less α ∈ ∆N . Hence, the minimization of c2 subject to (11) for ḡ + 1
conservative sets of conditions and tending to necessary and produces at least the same optimal value obtained with ḡ, which
sufficient conditions. implies that c∗2 (ḡ + 1) ≤ c∗2 (ḡ).
For this purpose, the next Definition 3 generalizes the linear Theorem 3. For given ḡ, µ̄1 and µ̄3 , let c∗1 (ḡ) and d ∗ (ḡ) be the
dependence on the parameter α to a homogeneous polynomial optimal solution of problems
dependence.
max c1
Definition 3. A matrix M (α) is homogeneous polynomially such that (11) holds with g = ḡ and µ1 = µ̄1 ,
parameter-dependent (HPPD) on α ∈ ∆N with degree g if it can
be expressed as max d
M (α) = ∑ α1k1 α2k2 · · · αNkN Mk , (19) such that (11) holds with g = ḡ and µ3 = µ̄3 ,
k∈Sg
respectively. Then c∗1 (ḡ) ≤ c∗1 (ḡ + 1) and d ∗ (ḡ) ≤ d (ḡ + 1).
with ( )
N
N Proof. Similar to Theorem 2.
Sg = k ∈ Z+ : ∑ ki = g .
i=1 The optimization problems in Theorem 2 and Theorem 3 can
Mk are the matrices coefficients of the monomials of M (α), be seen as optimum filtering problems within FTB context. For
where example, to design a filter which rejects the maximum possible
(N + g − 1)! types of disturbances, one may try to maximize d.
card (Sg ) = .
g! (N − 1)! The computational complexity of the LMIs is estimated by the
number of scalar variables V and the number of LMI scalar
The set of HPPD of α ∈ ∆N with degree g matrices is denoted
rows L. For Theorem (1),
by H(g) and the subset corresponding to the matrices with order
m × n is denoted by Hm×n . q (q + 1)
(g) V = n (p + q + n) card (Sg ) + n (n + 1) + + 3, (20)
2
The relaxation proposed in Oliveira and Peres (2007) can be L = (2n + r) card Sg+ f +1 + (4n + p + r) card Sg+ f + n + 1.
used in the parameter-dependent LMIs of Theorem 1 by forcing (21)
a homogeneous polynomial structure in the LMI variables
L(α), M(α) and F(α), turning them into HPPD matrices. By increasing the degree g, the number of decision variables
The relaxed condition is given by the LMIs stemming from is also increased and in consequence, the complexity of the
the matrices coefficients of the HPPD matrices, and the LMI LMIs also raises. However, by using an extension of Pólya’s
variables are the matrix coefficients of the monomials of the theorem (Oliveira and Peres, 2005, 2007), and based on the
HPPD matrices. Whilst this procedure is systematic, it can fact that the time-varying parameters α belong to the unit N-
become very complex as the degree g of the HPPD matrices simplex, the conditions of Theorem (1) may also be improved
increases. Nevertheless, the specialized parser ROLMIP 1 can using a sufficiently large positive integer f with no increase in
1 Available for download at http://www.dt.fee.unicamp.br/~agulhari/
the number of variables for a given degree g by multiplying the
f
rolmip/rolmip.htm. LMIs (11) by the factor ∑Ni=1 αi .
192
CHAPTER 7. LINEAR SYSTEMS
4. NUMERICAL EXAMPLE Figure 1. Sum of the weighted quadratic norm of the states of
the plant and observation error.
The numerical example was performed using the solver Se-
DuMi (Sturm, 1999) and the parsers YALMIP (Löfberg, 2004)
and ROLMIP (Agulhari et al., 2012) within Matlab environ- 2.5
e0 (t)e(t)
ment. ς 0 (t)Ω−1 ς (t)
2
Consider the system (2) with matrices in polytope (3) with the
following vertices 1.5
−1.0 2.0 −1.0 2.0
A1 = , A2 = ,
−3.0 −2.0 −3.0 −1.0 1
−2.0 2.0 −2.0 2.0
A3 = , A4 = , 0.5
−3.0 −2.0 −3.0 −1.0
−0.5 −0.1 −0.5 −0.1 0
B1 = , B2 = , B3 = , B4 = ,
0.1 0.1 0.5 0.5 0 0.5 1 1.5
Time (s)
Cy1 = [1.0 0.5] ,Cy2 = [1.2 0.5] ,
Cy3 = [1.0 0.6] ,Cy4 = [1.2 0.6] , Figure 2. Time simulation comparing the norm of the error with
the bound imposed by Ω.
Cz1 = [0.6 1.0] ,Cz2 = [1.0 1.0] ,
Cz3 = [0.6 1.2] ,Cz4 = [1.0 1.2] , Consider now system (2) with the varying parameter (22) with
D1 = D2 = 0.1, D3 = D4 = 0.2 ω = 0.001. We wish to check if the FTB filter obtained by
and the slowly varying parameter Theorem 2 with µ̄2 = 1, g = 3 and f = 1 satisfies the FTB
condition with respect to (0.1, 3.06, W1 , 1.5, 4I, 4I) and the Ω-
α(t) = 12 sin2 (ωt) , 12 sin2 (ωt) , 21 cos2 (ωt) , 21 cos2 (ωt) , bound condition defined in (8) considering a disturbance given
(22) by the step function
with ω sufficiently small. It is easy to check that (22) belongs
to ∆4 for all t ≥ 0. 0, if t ≤ 1.1s,
w(t) =
0.9, if t > 1.1s,
Theorems 2 and 3 along with Pólya’s relaxation are applied
with µ̄1 = 1, µ̄2 = 1, µ̄3 = 1 and β = 0.6 in order to investigate which represents the worst type of signal belonging to the class
the effect of increasing g and f in the search of minimum upper W1 . Since this filter is obtained using the assumption that α
bounds of c2 attained by the conditions of Theorem 2 and also is a slowly varying parameter, one must verify if α̇ is really
in the search of maximum lower bounds of c1 and d attained sufficiently small by time-domain simulations. Considering
by the conditions of Theorem 3. The chosen FTB parameters zero initial conditions, a time-simulation was performed in the
for this example are c1 = 0.1, c2 = 2, d = 1, T = 1.5, Re = time interval t ∈ [0, 1.5s].
4I and R p = 4I. The results of the optimization problem are
As shown in Figure 1, the designed filter satisfies the FTB
summarized in Table 1 (when the parameter is the value being
condition and
optimized the corresponding value of the parameter should be 0
ignored). x(t) Rp 0 x(t)
max = 0.06,
t∈[0,1.5s] x(t) − x f (t) 0 Re x(t) − x f (t)
As can be seen in Table 1, the conditions of Theorem 3 are not
able to provide a robust filter nor a LPV filter with a c1 > 0. which is approximately 2% of the value of c2 = 3.0653. More-
Also, by using degree g = 3 and f = 1 it was possible to over, as can be seen in Figure 2, the estimation error of the filter
obtain an upper bound to c2 approximately 13.35% smaller also satisfies the Ω-bound condition defined in (8). The tracking
than the robust filter corresponding to g = 0 and f = 0. Finally, error is shown in Figure 3. Although the choice of Γ22 did not
as illustrated by the maximum lower bounds obtained by d, it nullify the error between z and z f , it ensured a maximum error
may happen that the gain obtained increasing g and f is not of 0.0764. The difficulty of having a null estimation error in a
appreciable. In this case, it would be better use the robust filter finite time horizon is due to the small time that the filter has to
corresponding to g = 0 if existent. dynamically estimate the output z.
193
CHAPTER 7. LINEAR SYSTEMS
194
CHAPTER 7. LINEAR SYSTEMS
PID Optimal Controller with Filtered Derivative Part for Unstable First Order
Plus Time Delay Systems
J. A. Cárdenas-Valderrama*, J. F. Marquez-Rubio*, O. Jiménez-Ramírez*
Abstract: A typical problem in the derivative part of the PID controllers is its practical implementation.
This work avoids to assume high values for the filter coefficient instead, a stability analysis of a proposed
filtered PID controller to the case of systems with time delay, particularly the Unstable First Order plus
Time Delay (UFOPTD) systems is presented. This analysis becomes interesting since analytical results
have not been presented in literature. The aim of the obtained results is to illustrate how the stability
conditions are modified when different values of the filter coefficient are chosen. In fact, the parametric
stabilization region is obtained based in previous literature results. As a second goal of the present
research is to obtain the optimal parameters of the proposed filtered PID controller such that the energy
of control input and system states be minimized. An optimization methodology based on the second
method of Lyapunov is obtained.
Keywords: PID controller, Stabilization, Time-delay, Unstable Processes, Optimization.
provided a complete parametrization of the stabilizing P and
1. INTRODUCTION
PI controllers in the case 𝜃 < 𝜃𝑢𝑛 and the stabilizing PID
Time-delay of control inputs is a common phenomenon in controllers for the case 𝜃 < 2𝜃𝑢𝑛 . Recently, Marquez et al.,
diverse application fields, including chemical processes, (2014) consider the stabilization of UFOPTD system by
hydraulic systems and servomechanisms. Recently, some using a traditional Proportional-Derivative (PD) controller,
systems are remotely controlled generating a time delay showing that the derivative term minimizes the effects of the
between the controlled system and the control stage. The time-delay. In Lee et al., (2010) some results about the
control problem of time delay systems arises mainly by the stabilization of a delayed system with one unstable pole and
induced transcendental term in the characteristic equation, several stable poles by using P, PI, PD and PID controllers
which gives as result a characteristic equation having an are provided. An important issue in the derivative part of PID
infinite number of poles. Due to the complexity of the controllers is its practical implementation. The solution of
problem, several researchers have devoted its efforts for using a filter to the derivative action has been widely
designing control strategies that provide an adequate proposed in the literature. However, just high values of the
performance of the system; see for instance Seshagiri, et al., filter coefficient is suggested in order to recuperate the
(2007). To the case of open-loop stable process, the well- inherent derivative action of the controller. The task of this
known Smith Predictor Compensator (SPC) has been used as work is to analyze the stability properties when a filtered PID
a traditional control structure Seshagiri, et al., (2007). controller is considered to the case of UFOPTD system and
Considering a similar approach, some works have been showing how the stability conditions of the closed-loop
reported in order to deal with unstable processes, see for system can be modified for different values of the filter
instance, Liu et al., (2005), Seshagiri and Chidambaram coefficient. As we mentioned before, the control strategies
(2005). As a first attempt to analyse a generalized class of previously cited only provides stability results and there are
delayed systems, the case of first order delayed system has not result related to the performance of the controlled
been widely studied by using the basis provided by the SPC, variable. It is known that when a control strategy is applied to
see for instance Seshagiri, et al., (2007), Michiels et al., a system it is desired to obtain optimal performance of the
(2002), Marquez et al., (2012). With a simple and different system with respect to specific variables. Also, in this work a
perspective, some authors have regarded to analyse the case simple and effective methodology to obtain the optimal gains
when the system is controlled by a Proportional (P), 𝑘𝑝 , 𝑘𝑖 and 𝑘𝑑 gains is proposed in order to minimize the
Proportional-Integral (PI) and Proportional-Integral- energy at the control input and the system states. The
Derivative (PID). Nesimioglu and Soylemez Nesimioglu and proposed optimization methodology is based on an
controllers. Hwang and Hwang (2004) used the D-partition approximation of the delay term as well as on the second
technique to estimate the stabilization limits of PID method of Lyapunov. It is important to note that the proposed
compensation, showing that an UFOPTD system can be optimization strategy assumes that the stabilizing region of
stabilized if 𝜃 < 𝜃𝑢𝑛 , where 𝜃 is the time-delay and 𝜃𝑢𝑛 is the control parameters is known, and such stabilizing regions
the unstable constant-time. Silva and Bhattacharyya (2005) are obtained from previous literature results such as Lee et
195
CHAPTER 7. LINEAR SYSTEMS
196
CHAPTER 7. LINEAR SYSTEMS
197
CHAPTER 7. LINEAR SYSTEMS
1 𝑘𝑑 1 2
0.3 + < < √1 + ( ) ,
𝑁 𝑘𝑝 𝑁
𝑘𝑑 𝑘𝑖 1 𝜔
𝑎𝑡𝑔(𝜔) + 𝑎𝑡𝑔 ( 𝜔− ) − 𝑎𝑡𝑔 ( ) − 1.3𝜔 > 0,
𝑘𝑝 𝑘𝑝 𝜔 𝑁
Fig. 2. Permissible region for N. 𝑘
For positive values of 𝜔 such that 𝑘 𝑖 ≥ 0, and
𝑝
Fig 2. is a plot of 𝜃-𝑁 that shows the maximum value that
2 2
could be 𝜃 for a set of values of 𝑁. It can be seen that for 2 )(1+𝜔𝑐1 )
(1+𝜔𝑐1 2 )(1+𝜔𝑐2 )
(1+𝜔𝑐2
2 𝑁 2 𝑁
small values of 𝑁, the maximum value of 𝜃 is close to 1 and 1.3√ 2 < 𝑘𝑝 < 1.3√ 2 .
𝑘 𝑘 1 𝑘 𝑘 1
for big values of 𝑁, the maximum value of 𝜃 converges to 2. 1+(𝑘𝑑 𝜔𝑐1−𝑘 𝑖 𝜔 ) 1+(𝑘𝑑 𝜔𝑐2−𝑘 𝑖 𝜔 )
𝑝 𝑝 𝑐1 𝑝 𝑝 𝑐2
Notice that for 𝑁 = 50, the maximum value of 𝜃.is close to
2.
198
CHAPTER 7. LINEAR SYSTEMS
199
CHAPTER 7. LINEAR SYSTEMS
200
CHAPTER 7. LINEAR SYSTEMS
Abstract: In this paper, an algorithm for SISO Pole Placement based on linear algebra
concepts it’s developed. This algorithm uses the knowledge of the degrees of certain polynomials
associated to the Internal Model Principle and Stable Zero-Pole cancellations involved in the
equation of the closed loop and it’s coefficients, generating a linear system of equations for the
desired closed loop poles in a systematic way.
201
CHAPTER 7. LINEAR SYSTEMS
that the reference or disturbance generating polynomial (o The closed loop polynomial Acl (s) ≡ Acl is given by the
simply generating polynomial ) must be in the denominator following Diophantine equation
of K(s) (Goodwin et al., 2001). This can also be achieved A L + B P = Acl
by solving the Diophantine equation (1). Sometimes it is where
desirable to force the controller to cancel a subset of stable
poles and/or zeros of the plant model, this is also taking deg{Acl } = deg{A} + deg{L} = 2n − 1
into account by this systematization, which arises to an and so
algorithm to solve this problem in an automatic way. This Acl = c2n−1 s2n−1 + c2n−2 s2n−2 + · · · + c1 s + c0
approach can be used for design adaptive controllers, or Let Vl , Vp , W be finite-dimensional vector spaces over R
simply synthesize PID controllers. such as
Vl = span{sn−1 , sn−2 , . . . , s, 1}
2. LINEAR TRANSFORMATION APPROACH
Vp = span{sn−1 , sn−2 , . . . , s, 1}
2.1 Notation W = span{s2n−1 , s2n−2 , . . . , s, 1}
Notice that L ∈ Vl , P ∈ Vp and Acl ∈ W. Although
Let X(s) = xn sn + xn−1 sn−1 + · · · + x1 s + x0 be a
in this case Vl is exactly the same space as Vp , we keep
polynomial with real coefficients.
the subscripts for the sake of clarity. Let Vl ⊕V ˜ p be the
Notation 1. The set of all coefficients of X(s) (in decreas- external direct sum of Vl and Vp . Let V and W be a basis
ing power order) is denoted by CX = {xn , xn−1 , · · · , x1 , x0 }
for Vl ⊕V
˜ p and W respectively, such as
Notation 2. The degree of X(s) is denoted by deg{X(s)} [
V = {(sn−1 , 0), (sn−2 , 0), . . . , (s, 0), (1, 0)} (2)
Let V, W be finite-dimensional vector spaces over a field
K and choose bases V = {v1 , . . . , vm } for V and W =
{w1 , . . . , wn } for W. {(0, sn−1 ), (0, sn−2 ), . . . , (0, s), (0, 1)}
Notation 3. The dimension of V is denoted by dim {V}.
W = {s2n−1 , s2n−2 , . . . , s, 1} (3)
Notation 4. Let v ∗ ∈ V. The coordinates of v ∗ in the basis
V are denoted by (v ∗ )V ∈ Rm . Define the linear transformation Φ as it follows
Φ : Vl ⊕V
˜ p −→ W
Notation 5. Let T : V −→ W be a linear transformation
from V to W. The matrix associated to T choosing bases Φ{(l, p)} 7→ A l + B p
V and W is denoted by TV W . The construction of the matrix associated to the linear
Definition 1. The external direct sum of V and W, de- transformation Φ in the bases V and W starts by comput-
noted by V⊕W ˜ is defined as the set of all ordered pairs ing the transformation of every vector of V
(v, w) with v ∈ V and w ∈ W. Scalar multiplication is Φ{(sn−1 , 0)} 7→ A sn−1 = an s2n−1 + · · · + a0 sn−1
defined by c(v, w) = (cv, cw) with c ∈ K, and addition is Φ{(sn−2 , 0)} 7→ A sn−2 = an s2n−2 + · · · + a0 sn−2
defined by (v, w) + (v 0 , w0 ) = (v + v 0 , w + w0 ). One checks .. ..
the other classical axioms for a vector space. . .
Φ{(s, 0)} 7→ As = an sn+1 + · · · + a0 s
Note that the external direct sum of V and W can be Φ{(1, 0)} 7 → A = an sn + · · · + a0
expressed as the internal direct sum of (V, 0) and (0, W). Φ{(0, sn−1 )} 7→ B sn−1 = bm sm+n−1 + · · · + b0 sn−1
A basis for V⊕W
˜ is given by Φ{(0, sn−2 )} 7 B sn−2
→ = bm sm+n−2 + · · · + b0 sn−2
[ .. ..
{(vi , 0)} {(0, wj )} . .
Φ{(0, s)} →
7 B s = bm sm+1 + · · · + b0 s
Φ{(0, 1)} → 7 B = bm sm + · · · + b0
2.2 Pole Placement (4)
Getting coordinates in basis W yields
Given the control loop of one degree of freedom as in
Figure 1. Let G0 (s) be the process nominal model and (A sn−1 )W = (CA , 0, 0, . . . . . . . . . . . . , 0, 0, 0)
K(s) the biproper controller defined as (A sn−2 )W = (0, CA , 0, . . . . . . . . . . . . , 0, 0, 0)
B(s) B P (s) P ..
G0 (s) = ≡ K(s) = ≡ .
A(s) A L(s) L
where (A s)W = (0, 0, 0, . . . . . . . . . . . . , 0, CA , 0)
(A)W = (0, 0, 0, . . . . . . . . . . . . , 0, 0, CA )
A = an sn + an−1 sn−1 + · · · + a1 s + a0
n−m
z }| { (5)
B = bm sm + bm−1 sm−1 + · · · + b1 s + b0
(B sn−1 )W = (0, . . . , 0, CB , 0, 0, . . . , 0, 0, 0)
P = pn−1 sn−1 + pn−2 sn−2 + · · · + p1 s + p0
(B sn−2 )W = (0, . . . , 0, 0, CB , 0, . . . , 0, 0, 0)
L = ln−1 sn−1 + ln−2 sn−2 + · · · + l1 s + l0 ..
The degrees of the polynomials are as it follows .
deg{A} = n (B s)W = (0, . . . , 0, 0, 0, 0, . . . , 0, CB , 0)
deg{B} = m m≤n (B)W = (0, . . . , 0, 0, 0, 0, . . . , 0, 0, CB )
deg{P } = n − 1 Notice that every vector in R2n defined above it’s a shift of
deg{L} = n − 1 the coefficients of A and B polynomials respectively. Define
202
CHAPTER 7. LINEAR SYSTEMS
203
CHAPTER 7. LINEAR SYSTEMS
Where CAcl = {c6 , c5 , c4 , c3 , c2 , c1 , c0 } are the coefficients Z
ξà = (à sn−z−1 )TZ W · · · (Ã)TZ W
of the desired closed loop polynomial Acl . If we choose | {z }
Acl = (s + 3)6 , then the controller transfer function is dim{Z Vl̃ }=n−z
given by Z
ξB̃ = (B̃ sn−w−1 )TZ W · · · (B̃)TZ W
45s4 + 209s3 + 482s2 + 853s + 729 | {z }
K(s) = (8) dim{Z Vp̃ }=n−w
s(s2 + 1)(s − 31)
The steady state response of the closed loop of Figure 1 Then, the matrix associated to the transformation in the
is indeed the one chosen, but the transient response is bases Z V and Z W
affected by the dynamics of the zeros of the controller Z
ΦZ V Z W = Z ξà Z ξB̃
and the nominal model. The behavior of this zeros is
undesirable if they are located at the right side of the
Where Z ΦZ V Z W ∈ R2n−z−w × 2n−z−w is a Sylvester Matrix
poles in the left semi-plane of the complex plane (Seron
et al., 1997). This can be sometimes avoided if we propose associated to the polynomials à and B̃; and |Z ΦZ V Z W | =
6 0
Stable Zero-Pole cancellations between the controller and because à and B̃ are coprime.
the plant, as we explain in the following section.
3. DEVELOPMENT OF THE ALGORITHM
2.4 Stable Zero-Pole cancellations
Combining the criterion developed in Section 2.3 and
In addition to the implementation of the Internal Model Section 2.4 one can construct a linear transformation
Principle, it’s from interest to obtain a systematic way that takes into account the Internal Model Principle and
to perform Stable Zero-Pole cancellations. To achieve that Stable Zero-Pole cancellations at the same time, providing
goal, the controller denominator(numerator) must include a systematic way to obtain the matrix involved in the
the pole(zero) dynamics to cancel. Suppose that the stable determination of the coefficients of the desired closed loop
dynamics to cancel are represented by two polynomials polynomial. Choosing
α(s) ≡ α (poles) and β(s) ≡ β (zeros) such that L = Γ βL∗
A = α Ã P = αP∗ (10)
B = β B̃ Acl = α βA∗cl
where The corresponding Diophantine equation remains as fol-
deg{α} = w lows
deg{β} = z A∗ L∗ + B ∗ P ∗ = A∗cl
The Diophantine equation associated to the closed loop where
A L + B P = Acl −→ α Ã L + β B̃ P = Acl (9) Γ 1
A∗ = A and B ∗ = B (11)
Choosing L = β L̃ and P = α P̃ the equation (9) can be α β
expressed as The corresponding degrees are
à L̃ + B̃ P̃ = Ãcl deg{L∗ } = n − z − 1
deg{P ∗ } = n + q − w − 1
with Acl = α β Ãcl so that the remaining closed loop
poles after the cancellations (Ãcl ) can be chose arbitrarily. deg{A∗cl } = 2n + q − z − w − 1
The corresponding degrees using the same design criterion Let ∗ Vl∗ , ∗ Vp∗ , ∗ W be vector spaces over R such as
(biproper controller of one degree less than the plant) ∗
Vl∗ = span{sn−z−1 , sn−z−2 , . . . , s, 1}
remains as follows ∗
Vp∗ = span{sn+q−w−1 , sn+q−w−2 , . . . , s, 1}
deg{L̃} = n − z − 1
∗
W = span{s2n+q−z−w−1 , s2n+q−z−w−2 , . . . , s, 1}
deg{P̃ } = n − w − 1
So that L∗ ∈ ∗ Vl∗ , P ∗ ∈ ∗ Vp∗ and A∗cl ∈ ∗ W. Let
deg{Ãcl } = 2n − z − w − 1
Vl ⊕ Vp∗ be the external direct sum of ∗ Vl∗ and ∗ Vp∗ .
∗ ∗ ˜∗
Let Z Vl̃ , Z Vp̃ , Z W be vector spaces over R such as Construct bases ∗ V and ∗ W in the same way as in (2) and
Z
Vl̃ = span{sn−z−1 , sn−z−2 , . . . , s, 1} (3). Define the linear transformation ∗ Φ as it follows
∗
Z
Vp̃ = span{sn−w−1 , sn−w−2 , . . . , s, 1} Φ : ∗ Vl ∗ ⊕
˜ ∗ Vp∗ −→ ∗ W
∗
Z Φ{(l , p )} 7→ A∗ l∗ + B ∗ p∗
∗ ∗
W = span{s2n−z−w−1 , s2n−z−w−2 , . . . , s, 1}
Using the same criterion as in (4) and (5) with bases ∗ V
So that L̃ ∈ Z Vl̃ , P̃ ∈ Z Vp̃ and Ãcl ∈ Z W. Let Z Vl̃ ⊕
˜ Z Vp̃ be
and ∗ W respectively, the construction of the submatrices
the external direct sum of Vl̃ and Vp̃ . Construct bases
Z Z ∗
ξA∗ ∈ R2n+q−z−w × n−z and ∗ ξB ∗ ∈ R2n+q−z−w × n+q−w
Z
V and Z W in the same way as in (2) and (3). Define the (in columns)
linear transformation Z Φ as it follows ∗
Z ξA∗ = (A∗ sn−z−1 )T∗ W · · · (A∗ )T∗ W
Φ : Z Vl̃ ⊕
˜ Z Vp̃ −→ Z W | {z }
dim{∗ Vl∗ }=n−z
Z
Φ{(˜l, p̃)} 7→ Ã ˜l + B̃ p̃
∗
(12)
Using the same criterion as in (4) and (5) with bases Z V ξB ∗ = (B ∗ sn+q−w−1 )T∗ W · · · (B ∗ )T∗ W
and Z W respectively, the submatrices Z ξà ∈ R2n−z−w × n−z | {z }
dim{∗ Vp∗ }=n+q−w
and Z ξB̃ ∈ R2n−z−w × n−w (in columns)
204
CHAPTER 7. LINEAR SYSTEMS
Then, the matrix associated to the transformation in the In Figure 2 we show the step response from reference input
bases ∗ V and ∗ W to output using the controller developed in (8) and using
∗
Φ∗ V ∗ W = [ ∗ ξA∗ ∗ ξB ∗ ] the controller with zero-pole cancellations (13). An output
disturbance d0 (t) = sin(t) was injected at t = 5[sec.].
Where Φ∗V W ∈ R2n+q−z−w × 2n+q−z−w is a Sylvester Ma-
trix associated to the polynomials A∗ and B ∗ .
2.5
In synthesis, the algorithm can be summarized in the without cancellation
following simple steps with cancellation
205
CHAPTER 8
MOBILE ROBOTS
CHAPTER 8. MOBILE ROBOTS
Abstract: The application of metrics quality of control (QoC) in mobile robotics could be a
difficult task because not always is possible access the data of embedded control units of the
robot effectors. This paper presents an heuristic geometric-based altrernative for performance
evaluation of embedded controllers. Geometric characteristics of simple circular and linear
paths described by the robot are considered for performance evaluation of the controllers.
For experimental evaluation of our approach, two PI embedded controllers were designed and
implemented in a two wheeled mobile robot (2WMR). Results of conducted experiments showed
that our approach constitutes an alternative of performance evaluation of embedded controllers
when it is not possible to access data of the control units.
207
CHAPTER 8. MOBILE ROBOTS
208
CHAPTER 8. MOBILE ROBOTS
considering the forces and torques involved. For a 2WMR experimental validation of our approach. The block dia-
the Kinematic model allows to express its velocities as gram of Carlitos is shown in Fig. ??. Actuation system is
functions of velocities of its wheels and its geometric composed of two DC motors with coupled gearboxes and a
parameters [?, ?]. dual H-bridge motor driver for PWM control signals am-
plification. The proprioceptive sensory system integrates
For 2WMRs a pure rolling motion without slipping is
quadrature encoders mounted over the axes of both DC
obtained when the robot rotates around an external point
motors, and a CMPS03 digital compass for estimation of
located over the common axis of both driving wheels. This
robot head orientation. Encoders provide 1125 counts per
point is known as the instantaneous center of curvature
revolution.
(ICC) or instantaneous center of rotation (ICR). The
ICC will move by changing velocities of the two driving The exteroceptive system is made up of a sonars ring
wheels, allowing the robot carry out different paths. At with twelve SRF02 devices for distance measurement to
each time instant, right and left wheels follow paths around surrounding obstacles. The top side of sonars ring has
the ICC at the same angular velocity ω = dψ/dt as is a 802.15.4 radio for wireless command reception and
shown in Figure ??. Table ?? shows the symbols used in data transmission. Processing unit correspond to a 32-bit
this section, with L = 22.1 cm. The linear velocities of the microcontroller capable of operate up 50 MHz. We use the
driving wheels are given by Freescale MCF51QE128 microcontroller to process pulses
L from encoders and generate PWM control signal for each
vL = ω(R + ). (3) motor. Control algorithms were coded in C language and
2
programmed in microcontroller’s internal flash memory.
L Carlitos robot was conceived and designed having in ac-
vR = ω(R − ). (4) count modularity and scalability criteria.
2
where R is the distance between the ICC and the middle
point CR between the two wheels. The angular velocity of
the robot is obtained subtracting (??) from (??) and is
expressed by
v − vR
ω= L . (5)
L
By adding (??) and (??) and substituting (??) in the result
we obtain
v + vR L
R= L · . (6)
vL − v R 2
Equations (??) and (??) allow to establish the angular
velocity of the robot and the instantaneous radius of
curvature as functions of linear velocities of the wheels
and their separation L. Finally, the linear velocity of the
midpoint CR is given by the average of the wheel velocities
v + vR
vCR = ωR = L . (7) Fig. 4. Block diagram and a picture of the experimental
2
robot
To take advantage of the robot’s resources, a C language
algorithm was developed to support the effectors modelling
task. Initially the robot was placed in an obstacles free
indoor environment awaiting the start command sent from
a PC via a 802.15.4 radio link. As is shown in Figure
??, once received the start order, an step PWM signal
with a duty cycle (DC) of 50% it is generated by the
microcontroller to drive both motors. This signal was
maintained for 1.8 seconds. Subsequently, the duty cycle
of the PWM signal was incremented to 60%. This step
was also maintained by 1.8 seconds. During the total
Table 1. Kinematic model variables
Parameter Description
L Distance between support points of wheels (cm)
Fig. 3. Differential drive motion of a mobile robot R Instantaneous radius of curvature (cm)
CR Midpoint between driver wheels (cm)
ψ Robot orientation angle (rad)
Effectors Modeling In a previous work [?] a teaching ω Angular velocity of 2WMR (rad · s−1 )
and research Car-like Autonomous vehicle (Carlitos) was vL , vR Velocities of wheels along the ground (cm · s−1 )
designed and constructed. This 2WRM is used here for vCR Linear velocity of CR (cm · s−1 )
209
CHAPTER 8. MOBILE ROBOTS
160
Left Motor 0.8 0.608 0.568
Right Motor
140 0.9 0.594 0.506
1.0 0.558 0.443
120
For discrete-time systems θd is calculated as
100
0 10 20 30
a) nT (T=0.1 s) T 0.1
θd = θ + = 0.1159 + = 0.1659. (14)
60 PWM 50 -> 60% 2 2
PWM duty cycle (%)
210
CHAPTER 8. MOBILE ROBOTS
τ θ 5. VALIDATION EXPERIMENTS
Kc = (0.9 + )
Kθ 12τ
0.2222 0.1659 Two simple trajectories and its geometrical characteristics
= (0.9 + ) were considered for controllers performance evaluation.
2.63 × 0.1659 12 × 0.2222
Equations (??) and (??) shows how to synthesize the two
= 0.49. (19)
paths of interest for this study:
i. Circular path. The path is a circle of radius R with
θ(30τ + 3θ)
τi = center in ICC. For locate the ICC outside the line
9τ + 20θ segment that connects both wheels, must meet |R| =
0.1659(30 × 0.2222 + 3 × 0.1659) L/2 condition. The robot moves clockwise when vL >
=
9 × 0.2222 + 20 × 0.1659 vR .
= 0.2235. (20) ii. Linear path. The robot moves straight line when
ω = 0, this is met for vL = vR .
As in the above method q0 and q1 are calculated by Should be noted that the actual trajectories of the robot
differ from the theoretical because of uncertainty in the
T measurements of the sensors and uncertainty in control
q0 = Kc (1 + )
2τi commands.
0.1
= 0.49(1 + ) = 0.5996. (21) Circular displacement. For validation of CT E metric
2 × 0.2235
defined by Equation ??, three circular paths C1 , C2 and C3
were conducted for each controller. The reference velocities
T for each path were established as, C1 : vL = 80, vR = 60,
q1 = −Kc (1 − )
2τi C2 : vL = 100, vR = 80, and C3 : vL = 120, vR = 100.
0.1 Table ?? presents the experimental results obtained for
= −0.49(1 − ) = −0.3804. (22) the Ciancone-Marlin controller and table ?? for Cohen-
2 × 0.2235
Coon controller.
Figure ?? shows the response of the two controllers to a Table 3. Results of CT E metric for Ciancone-
setpoint speed of 100 pp/T . Note that the wheel speeds Marlin controller
are expressed in pulses per sampling time (pp/T ). At the
top, the response of Ciancone-Marlin controller and, at the Path φr (cm) φt (cm) CT E (cm)
bottom, the response of Cohen-Coon controller. A C lan- C1 153 148.4 4.6
C2 193.4 190.8 2.6
guage program for effectors control, data acquisition and
C3 235 233.5 1.5
its wireless transmission, was developed and programmed
Mean 2.90
in the internal flash memory of the robot microcontroller.
Figure ?? was drawn with the data received. The difference
equation for PI controller implementation is given by Table 4. Results of CT E metric for Cohen-
Coon controller
m(k) = q0 e(k) + q1 e(k − 1) + m(k − 1). (23) Path φr (cm) φt (cm) CT E (cm)
C1 151.5 148.4 3.1
C2 192.6 190.8 1.8
C3 234.7 233.5 1.2
Ciancone-Marlin Mean 2.03
Pulses per samplin time (pp/T)
100
100
50
trollers
Metric Ciancone-Marlin Cohen-Coon
CAD 55.6 50.3
0
0 50 100 150 200
nT, T=0.1 s For comparative purposes, two metrics for quality of
control (QoC) evaluation were considered, these are: IAE
and ISE. These two metrics express QoC in terms of the
Fig. 6. Step responses of controllers, Ciancone-Marlin error e(t), which is defined as the difference between the
(top), and Cohen-Coon (bottom) set-point r(t) and the system output y(t). Continuous-time
and discrete-time forms of IAE and ISE are given by
211
CHAPTER 8. MOBILE ROBOTS
212
CHAPTER 8. MOBILE ROBOTS
CINVESTAV-IPN
Departamento de Ingenierı́a Eléctrica , Sección de Mecatrónica
Av. I.P.N. No. 2508, Col. San Pedro Zacatenco 07360, México, D.F.
(e-mail: rdcruz,aangeles,velasco@cinvestav.mx).
Abstract: The main objective of this paper is to maintain a leader follower formation with
a time gap separation between the mobile robots, with this control, the longitudinal distance
variation between robots only depends on the leader velocity. The formation problem is solved
using an observer that serves as a virtual reference to the follower robot that, correspond to
the leader trajectory delayed τ units of time. The strategy is theoretically formally proven, and
numerical simulations are presented.
213
CHAPTER 8. MOBILE ROBOTS
Consider a leader robot describing any feasible trajectory ew1 (t) = w1 (t) − ŵ1 (t)
on the X − Y plane. It is desired that a follower robot
ew2 (t) = w2 (t) − ŵ2 (t) (7)
tracks the trajectory described by the leader robot delayed
τ units of time, where τ represents the desired time gap ew3 (t) = w3 (t) − ŵ3 (t)
separation between the leader and the follower robot. are the observation error variables, and λ1 , λ2 , λ3 are
The delayed leader trajectory is obtained by considering constant positive design gains.
an input-delay observer based on the actual measurements
of the leader robot. Under these conditions, it is intended 3.1 Observation error analysis
that the follower robot tracks the delayed states provided
by the observer. This time gap separation, based on a To analyze the observation error, a change of coordinates
leader time delayed trajectory, is depicted in Fig. 2. is done in the form,
" # " #" #
3. OBSERVER DESIGN e1 cos(w3 ) sin(w3 ) 0 ew1
e2 = − sin(w3 ) cos(w3 ) 0 ew2 (8)
The development of the mentioned observer will be de- e3 0 0 1 ew3
scribed taking into account the following assumptions. this is,
214
CHAPTER 8. MOBILE ROBOTS
e1 = ew1 cos(w3 ) + ew2 sin(w3 ) exponential convergence of the observation errors e1 (t),
e2 (t) depend on the perturbation-free system (13) this is,
e2 = −ew1 sin(w3 ) + ew2 cos(w3 ) (9)
on the system,
e3 = ew3 . ˙ = Āē(t)
ē(t) (17)
The observation error dynamics is obtained by taking the A candidate Lyapunov function of the form,
time derivative of (9). Taking the first line, and defining 1 1
λ1 = λ2 = λ, V = e21 + e22 (18)
2 2
produces
ė1 = ėw1 cos(w3 ) − ew1 ẇ3 sin(w3 ) + ėw2 sin(w3 )
+ew2 ẇ3 cos(w3 ) V̇ = e1 ė1 + e2 ė2
e
3
= vL (t − τ )2 sin2 − λe1 + e2 wL (t − τ ). = e1 (−λe1 + e2 wL (t − τ )) + e2 (−λe2 − e1 wL (t − τ ))
2
Now with the second line of system (9), = −λe21 − λe22
1 1
= −2λ( e21 + e22 )
ė2 = −ėw1 sin(w3 ) − ew1 ẇ3 cos(w3 ) + ėw2 cos(w3 ) 2 2
= −2λV (19)
−ew2 ẇ3 sin(w3 )
This concludes the proof.
= vL (t − τ ) sin(e3 ) − λe2 − e1 wL (t − τ ) In what follows, the estimated state ŵ(t) will be used
finally, as a desired trajectory for the follower robot in order to
solve the leader-follower formation problem. Notice that τ
ė3 = wL (t − τ ) − wL (t − τ ) − λ3 ew3 represents the time gap between the vehicles.
= −λ3 e3 .
4. TIME GAP TRACKING CONTROL STRATEGY
The observation error dynamics is,
e Consider the follower robot dynamics,
3
ė1 = −λe1 + vL (t − τ )2 sin2 + e2 wL (t − τ ) "
2 #
ẋF (t) cos(θF )vF (t)
ė2 = −λe2 + vL (t − τ ) sin(e3 ) − e1 wL (t − τ ) (10) ẏF (t) = sin(θF )vF (t) (20)
ė3 = −λ3 e3 . θ̇F (t) wF (t)
The observation convergence properties can be formally and take into account the reference trayectory of system
stated in the next lemma. (6). From system (20) it is possible to initially define,
Lemma 3. Suppose that Assumptions 1 and 2 are satisfied
and that λ1 , λ2 , λ3 > 0. Then, the states defined by the ẋF = vF cos(θF ) = ξ1
observer given in (6) exponentially converge to the leader
robot trajectory delayed τ units of time. ẏF = vF sin(θF ) = ξ2 (21)
θ̇F = wF = ξ3
Proof. Notice first that for λ3 > 0,
ė3 (t) = −λ3 e3 (t) (11) for
is exponentially stable, so, now the problem reduces to
demonstrate that e1 and e2 also converge to the origin. ξ1 = ŵ˙ 1 − k1 x̃
With this aim, define, ξ2 = ŵ˙ 2 − k2 ỹ (22)
T
ē(t) = [ e1 (t) e2 (t) ] (12) ξ3 = ŵ˙ 3 − k3 θ̃
then, the dynamics of e1 (t), e2 (t) can be expressed in the with
form,
˙ = Āē(t) + F̄ (e3 , ŵ)vL (t − τ )
ē(t) (13) x̃ = xF − ŵ1
where,
ỹ = yF − ŵ2 (23)
−λ1 wL (t − τ ) θ̃ = θF − ŵ3
Ā = (14)
−wL (t − τ ) −λ2
" e # From (21) it is obtained,
3
2 sin2
F̄ (e3 , ŵ) = 2 . (15) cos2 (θF )vF + sin2 (θF )vF = ξ1 cos(θF ) + ξ2 sin(θF )(24)
sin(e3 )
To show the convergence to the origin of ē(t), notice that, so the control signals are,
F̄ (e3 , ŵ)vL (t − τ ) ≤ F̄ (e3 , ŵ) |vL | ≤ β |e3 | v̄L (16)
vF = (ŵ˙ 1 − k1 x̃) cos(θF ) + (ŵ˙ 2 − k2 ỹ) sin(θF ) (25)
with β > 0. Therefore, F̄ (e3 , ŵ) tends to zero as e3 (t)
converges to the origin, regardless of the evolution of e1 (t) wF = ŵ˙ 3 − k3 θ̃. (26)
and e2 (t). Therefore, F̄ (e3 , ŵ)vL (t−τ ) is a vanishing signal To analyze the tracking error, consider now the new error
for the system given by (13), this fact implies that the signals,
215
CHAPTER 8. MOBILE ROBOTS
" # " # x̃ Remark 4. Note that the variable m1 (t) depend on the
es1 cos(θF ) sin(θF ) 0
es2 = − sin(θF ) cos(θF ) 0 ỹ . (27) observation errors e1 (t), e2 (t) and on the tracking error
es3 0 0 1 θ̃ es3 (t), that have previously been proven that exponentially
converge to the origin. Therefore m1 (t) → 0 as t → ∞ .
The dynamic of the tracking error can be obtained by
taking the time derivative of system (27). Notice that, Since the functions m1 is independent of es1 (t), es2 (t), it
can be considered as an exogenous input signal. This fact
ės1 = x̃˙ cos(θF ) − x̃θ̇F sin(θF ) + ỹ˙ sin(θF ) + ỹ θ̇F cos(θF ) allows to establish the stability of the closed loop system
given by system (30) in terms of the free-perturbation
= vF − vL (t − τ ) cos(es3 ) + wF (es2 ) − λ1 ew1 cos(θF ) system (31).
−λ2 ew2 sin(θF ), Lemma 5. Suppose that k, k3 > 0 and that the angular
The second line of (27) produces, input velocity of the follower robot satisfies for all t that
wF (t) 6= 0. Then, the closed loop tracking error dynamic
ės2 = −x̃˙ sin(θF ) − x̃θ̇F cos(θF ) + ỹ˙ cos(θF ) − ỹ θ̇F sin(θF ) given by Eqn. (30) is asymptotically stable. Consequently,
the observer-based feedback (29) asimptotically solves the
= vL (t − τ ) sin(es3 ) − wF es1 + λ1 ew1 sin(θF ) time-gap tracking problem associated with the leader and
−λ2 ew2 cos(θF ). follower robots.
Finally, Proof. The statement of the lemma is equivalent to state
the stability of system (31). With this aim consider the
˙ candidate Lyapunov function,
ės3 = θ̃
1 1
= wF − ŵ˙ 3 . V = e2s1 (t) + e2s2 (t) (32)
2 2
The tracking error dynamic takes the form, therefore,
ės1 = vF − vL (t − τ ) cos(es3 ) + es2 wF − λ1 ew1 cos(θF ) V̇ = es1 (t)ės1 (t) + es2 (t)ės2 (t)
−λ2 ew2 sin(θF ) = −ke2s1 (t). (33)
ės2 = vL (t − τ ) sin(es3 ) − es1 wF + λ1 ew1 sin(θF ) (28) Under this condition, it is clear that,
−λ2 ew2 cos(θF ) V̇ ≤ 0.
ės3 = wF − k3 θ̃. Therefore, the states of the system given by Eqn. (31)
are stable. To show that the states converge to the origin,
Notice that the control signals, in the new coordinates (27) notice that V̇ is uniformily continuous since V̈ is bounded.
with k1 = k2 = k, take the form, Invoquing Barbalat’s lemma Slotine et al. [1991], V̇ → 0
and consequently, from the first line of (31) it is obtained
wF = ŵ˙ 3 − k3 es3
vF = −kes1 vL (t − τ ) cos(es3 ) + λ2 ew2 sin(θF ) (29) 0 = es2 ŵ˙ 3 − k3 es2 es3 (34)
+λ1 ew1 cos(θF ) this is,
4.1 Controlled system with observed states (ŵ˙ 3 − k3 es3 )es2 = 0 (35)
˙
Considering the assumption that wF 6= 0, thus ŵ3 6= 0, and
System (28) in closed loop with the control signal (29), taking into account that es3 → 0, it is clear that es2 → 0
produces the system, as t → ∞.
is a time varying term that do no explicitly depend on the x = 0.2(a + b) cos(npt) cos(pt)
tracking errors es1 and es2 . Notice from (30) that es3 is y = 0.2(a + b) cos(npt) sin(pt) (36)
exponentially stable, and remember that e1 , e2 and e3 are
exponentially stable, so the stability of (30) can be stated where n = 3 corresponds to the number of petals, a =
by considering the subsystem, 13, b = 7, p = (2π/20). Equation (36) allows to define the
input signals for the leader robot as,
ės1 = −kes1 + es2 ŵ˙ 3 − k3 es2 es3 p (ÿ ẋ − ẍẏ)
vL = ẋ2 + ẏ 2 , wL = 2 (37)
ės = −es ŵ˙ 3 + k3 es es
2 1 1 3 (31) (ẋ + ẏ 2 )
216
CHAPTER 8. MOBILE ROBOTS
Table 1.
es1 [m]
1
Robot x[m] y[m] θ[rad]
0
Leader 4 0 π/2
Follower 4 0 π/2 0 5 10 15 20 25 30
Observer 3 -0.5 0 Time [s]
es2 [m]
0
-0.5
-1
4 -1.5
0 5 10 15 20 25 30
LEADER
Time [s]
3 OBSERVER
0
es3 [rad]
FOLLOWER -0.5
2 -1
-1.5
0 5 10 15 20 25 30
1 Time [s]
Y [m]
0
Fig. 5. Position tracking errors
-1
5
-2
Vi [m/s]
0
-3
LEADER
FOLLOWER
-4 -5
-3 -2 -1 0 1 2 3 4 5 0 5 10 15 20 25 30
X [m] Time [s]
2
0
3
ew1 -2
ew2 0 5 10 15 20 25 30
2 ew3 Time [s]
217
CHAPTER 8. MOBILE ROBOTS
IEEE, 2007.
1
Siddharth Swaminathan, Mike Phillips, and Maxim
Likhachev. Planning for multi-agent teams with leader
switching. In Robotics and Automation (ICRA), 2015
0.5 IEEE International Conference on, pages 5403–5410.
IEEE, 2015.
Herbert G Tanner, George J Pappas, and Vijay Kumar.
0 Leader-to-formation stability. Robotics and Automation,
0 5 10 15 20 25 30 IEEE Transactions on, 20(3):443–455, 2004.
Time [s]
REFERENCES
Tucker Balch and Ronald C Arkin. Behavior-based for-
mation control for multirobot teams. Robotics and Au-
tomation, IEEE Transactions on, 14(6):926–939, 1998.
Zevi Bareket, Paul S Fancher, Huei Peng, Kangwon Lee,
and Charbel A Assaf. Methodology for assessing adap-
tive cruise control behavior. Intelligent Transportation
Systems, IEEE Transactions on, 4(3):123–131, 2003.
DE Canudas, Bruno Siciliano, and Georges Bastin. Theory
of robot control. GBR, 1996.
Luca Consolini, Fabio Morbidi, Domenico Prattichizzo,
and Mario Tosques. Leader–follower formation control
of nonholonomic mobile robots with input constraints.
Automatica, 44(5):1343–1349, 2008.
Jaydev P Desai, Jim Ostrowski, and Vijay Kumar. Con-
trolling formations of multiple mobile robots. In
Robotics and Automation, 1998. Proceedings. 1998
IEEE International Conference on, volume 4, pages
2864–2869. IEEE, 1998.
218
CHAPTER 8. MOBILE ROBOTS
Abstract: This paper presents a kinematic control for an omnidirectional mobile manipulator
using quaternions notation for the orientation of the end-effector. We describe the Denavit-
Hartenberg parameters for the robot and the sensor. The inverse kinematics for the robot is
calculated by using the pseudo-inverse of the geometric Jacobian. In order to obtain a desired
position for the end-effector a RGB-D camera is attached to the robot. An algorithm to extract
specific object color from a white background in an image is implemented using the RGB-D
sensor. The algorithm is able to find the object’s contour and centroid. Experimental results
shows that the kinematic control assisted by a RGB-D camera allows to reach and grasp objects
of interest.
219
CHAPTER 8. MOBILE ROBOTS
for the characterization. Moreover in Sharma et al. (2012) 3. FORWARD KINEMATICS
a unified inverse kinematic for Kuka youBot is established
allowing to resolve efficiently the singularities of the whole The Kuka youBot’s mobile base has four Mecanum wheels
kinematic model. However the author does not use the that allow free displacement in the Cartesian plane. How-
Denavit-Hartenberg parameters for the kinematic model ever in this kind of system it is necesary to consider the
and can not control the pose for the youBot’s end-effector. rollers disposition inside the Mecanum wheel in order to
obtain a better kinematic model. According to Nagatani
A well documented work is presented in Keiser (2013), et al. (2000), the kinematic configuration of an omnidirec-
where the author applied a torque control at the youBot tional mobile robot is given by the set of equations (1).
arm in order to pickup objects from the ground. The
objects identification is acomplish by using an RGB-D
sensor and the Point Cloud Library. The RBG-D sensor 1
ẋ = vl = (dwb1 + dwb2 + dwb3 + dwb4 )
is fixed in a supporting structure located on the back of 4
the mobile platform, allowing the objects appear inside 1
ẏ = vt = (−dwb1 + dwb2 + dwb3 − dwb4 ) tan(αb ) (1)
the sensor’s field of view. However the arm control was 4
implemented as an open loop, it means that the arm’s ˙ 1
θb = va = ω = (dwb1 + dwb2 + dwb3 + dwb4 )β
trajectory is computed offline. 4
In Dogar et al. (2014) a group of Kuka youBots is used where dwbi (for i = 1, . . . , 4) represents the linear velocity
to manipulate forniture parts to assembly a table. In this of each wheel of the platform. The variables vt , vl and
work the robots are supervised by an external camera va are the transversal, longitudinal and angular velocities
array that allows the system to know the element’s posi- respectively. The αb and β parameters are computed
tion at any time. Moreover some robots have a camera in experimentally because they depend on the roller’s angle
hand configuration, resulting into an extremelly controlled inside the Mecanum wheel.
experimental environment.
The mobile platform movement is given by the combi-
In Schoen and Rus (2013) four youBots are used, two of nation of the wheel’s linear velocities. For example, the
them are the suppliers and the rest serve as assemblers. lateral traslation and rotation movements are depicted in
The proposal uses adaptive planning, where even if there Figure 1, where the faded platform shows where the robot
are missing pieces, the robots are able to adjust the will be at a certain time.
planning task to assemble the final object. Another aspect
of the article is that it handles subtasks times, so it can get
a parameterization of all homework time and measure the
efficiency of the proposed algorithm. The vision system
is composed by several cameras that allows to track the
robots and the assembly parts, and with that information,
the authors are able to hand over parts from one robot to
another without resting the pieces on the surface.
Another recent work related to assembly parts is described
in Wang et al. (2013), where the Kuka youBot locates a
furniture piece that is randomly placed on the floor. To
achieve accurate part identification and the robot location, Fig. 1. Mobile platform movement according
the authors use an array of radio frequency identifiers in to the individual rotation combination of the
the robot to compute its location. four mecanum wheels.
220
CHAPTER 8. MOBILE ROBOTS
T8w = T1w T21 . . . T87 (3)
The end-effector position is given by the first three terms
of the last column in the homogeneous matrix T8w .
3.1 Quaternions
221
CHAPTER 8. MOBILE ROBOTS
and the D-H parameters shown before. The geometric they can be programmed using open source codes such as
Jacobian is constructed as follows: OpenNI and OpenCV.
" #
j pi · · · j pn
Jg = ··· (16)
joi · · · jpn
Where jpi have two values according to the robot mod-
elling show in Figure 3:
• For the prismatic joints:
jpi = Zi−1 (17) Fig. 4. RGB-D sensor characteristics.
w
Where Zi−1 is the third column of Ri−1 .
• For the revolute joints: The procedure to extract an object of interest from an
jpi = Zi−1 × (pi − pi−1 ) (18) RGB image is to convert the image from the RGB color
Where pe are the first three elements of the fourth space to HSV color space. The conversion is because the
column of Tiw and pi−1 are the first three elements of HSV color space is more robust to illumination variations
the fourth column of Ti−1w . than the RGB color space. The next step is to process the
image in order to find all the blue pixels in the scene, then
The end-effector’s orientation can be described into the all the pixels found are gathered into a contour, so that the
geometric Jacobian as: centroid of this pixels group can be computed (see Figure
5). The object’s centroid is paired with the pointcloud in
−Quat(2) −Quat(3) −Quat(4) order to obtain its 3D position with respect to the camera.
Quat(1) Quat(4) −Quat(3)
Jgo = (19) The object’s position must be in world’s coordinates. The
−Quat(4) Quat(1) Quat(2)
homogeneous transformation between the inertial frame w
Quat(3) −Quat(2) Quat(1)
and the camera frame allows to pass the object’s position
With equation (19) a temporal Jacobian can be con- from camera to the world. The first three parameters of
structed. the Table 1 and the D-H parameters for the camera are
considered to calculate the homogeneous transformation.
I3x3 0
Jtemp = (20)
0 J go Tcw = T1w T21 T32 Tc3 (26)
The Jacobian matriz using quaternions to describe the With the transformation Tcw the object’s centroid (Pc ) can
end-effector orientation is computed using the temporal be projected into the inertial frame as follows:
Jacobian and the geometric Jacobian from equation (16).
Pw = Tcw [Pc , 1]T (27)
J = Jtemp Jg (21)
Where the first three terms of Pw are the coordinates of
The Jacobian can be splitted in two matrix, one for the the object in the inertial frame.
position and the other for orientation.
222
CHAPTER 8. MOBILE ROBOTS
6. CONTROL LAW
The control law proposed in this paper is a PD control for
the youBot’s end-effector. The desired pose and position
for the end-effector are the centroid of the object that are
measured by the RGB-D sensor and can be rewritten as
follows:
EEposd =[xd , yd , zd ] = Pw (1 : 3)
EEposd
V ecq =
|EEposd |
angled angled
Quatd = cos , V ecq sin (28)
2 2
Finally, the end-effector desired position is constructed as: Fig. 6. Experimental set, consists of a Kuka
youBot with an asus xtion pro mounted on a
fixed structure and a blue object placed on a
P osd = [EEposd , Quatd ]T (29) white table at a random position.
The control law proposed is:
223
CHAPTER 8. MOBILE ROBOTS
Hvilshj, M., Bgh, S., Madsen, O., and Kristiansen, M.
(2009). The mobile robot “little helper”: Concepts, ideas
and working principles. In Emerging Technologies Fac-
tory Automation, 2009. ETFA 2009. IEEE Conference
on, 1–4.
Keiser, B. (2013). Torque Control of a KUKA youBot
Arm. Master’s thesis, Robotics and Perception Group,
University of Zurich.
Madsen, O., Bøgh, S., Schou, C., Andersen, R.S.,
Damgaard, J.S., Pedersen, M.R., and Krüger, V. (2015).
Integration of mobile manipulators in an industrial pro-
duction. Industrial Robot: An International Journal,
42(1), 11–18.
Nagatani, K., Tachibana, S., Sofne, M., and Tanaka, Y.
Fig. 9. Object’s position for the experiment. (2000). Improvement of odometry for omnidirectional
vehicle using optical flow information. In Intelligent
Robots and Systems, 2000. (IROS 2000). Proceedings.
2000 IEEE/RSJ International Conference on, volume 1,
468–473 vol.1.
Schoen, T. and Rus, D. (2013). Decentralized robotic
assembly with physical ordering and timing con-
straints. In Intelligent Robots and Systems (IROS), 2013
IEEE/RSJ International Conference on, 5764–5771.
Sharma, S., Kraetzschmar, G.K., Scheurer, C., and
Bischoff, R. (2012). Unified closed form inverse kine-
matics for the kuka youbot. In Robotics; Proceedings of
ROBOTIK 2012; 7th German Conference on, 1–6.
Wang, J., Adib, F., Knepper, R., Katabi, D., and Rus, D.
(2013). Rf-compass: Robot object manipulation using
rfids. In Proceedings of the 19th Annual International
Conference on Mobile Computing & Networking,
Fig. 10. End effector pose and position errors. MobiCom ’13, 3–14. ACM, New York, NY, USA.
Zhang, L. and Zhou, C. (2013). Kuka youbot arm shortest
detection algorithm is required because the illumination path planning based on geodesics. In Robotics and
variations affect the object detection rate. The stability Biomimetics (ROBIO), 2013 IEEE International Con-
proof must be done in order to tune the control gains and ference on, 2317–2321.
to demonstrate mathematically the convergency for this
kind of control.
REFERENCES
Bischoff, R., Huggenberger, U., and Prassler, E. (2011).
Kuka youbot - a mobile manipulator for research and
education. In Robotics and Automation (ICRA), 2011
IEEE International Conference on, 1–4.
Cho, Y.C. and Jeon, J.W. (2008). Remote robot control
system based on dtmf of mobile phone. In 2008 6th
IEEE International Conference on Industrial Informat-
ics, 1441–1446. doi:10.1109/INDIN.2008.4618331.
Corke, P. (1996). Visual control of robots: high-
performance visual servoing. Robotics and mechatronics
series. Research Studies Press.
Dogar, M., Knepper, R.A., Spielberg, A., Choi, C., Chris-
tensen, H.I., and Rus, D. (2014). Towards coordinated
precision assembly with robot teams. Proceedings of the
International Symposium of Experimental Robotics.
Dwiputra, R., Zakharov, A., Chakirov, R., and Prassler,
E. (2014). Modelica model for the youbot manipula-
tor. In Proceedings of the 10th International Modelica
Conference, March 10-12, 2014, Lund, Sweden, 1205 –
1212.
Hutchinson, S., Hager, G., and Corke, P. (1996). A tutorial
on visual servo control. Robotics and Automation, IEEE
Transactions on, 12(5), 651–670.
224
CHAPTER 8. MOBILE ROBOTS
Navigation Assistance System for the Visually Impaired People Using the
Modified Fictitious Force Algorithm
Sánchez B. David. Ortega H. Christian.
Sotomayor O. Nelson, Chávez G. Danilo
Escuela Politécnica Nacional
Departamento de Automatización y Control Industrial, Ladrón de Guevara E11-253,
Quito, Ecuador
(e-mail: david.sanchez@epn.edu.ec, christian.ortega01@epn.edu.ec,
nelson.sotomayor@epn.edu.ec, danilo.chavez@epn.edu.ec)
Abstract: This work is focused on the implementation of a prototype that assists, complementary to the
white cane, in the navigation of blind people by detecting obstacles with artificial vision. The navigation
system was developed based on the Kinect sensor coupled to a helmet, which in addition to providing a
color image of the environment, it is also capable of delivering a depth image of the same; this
information was processed in a computer and based on the modified fictitious force algorithm by Scaglia
et al. (2009), an obstacle free route was determined, which is transmitted to the user through audible
messages; furthermore a complementary vibration system was implemented and mounted on a vest,
which will serve as backup in order to alert the person about near obstacles. The prototype also has an
identification and decoding system for QR codes that helps for a better orientation of the visually
impaired person within the environment in which they manage themselves.
Keywords: Visual disability, vision, fictitious force, depth, navigation.
1. INTRODUCTION The work is divided as stated below: Section 2 presents the
main characteristics of the used sensor, section 3 describes
Nowadays people deprived partial or totally of the sense of
the implemented prototype, section 4 shows the acquisition
sight are limited in their mobilization for the development of
and processing of the delivered image by the Kinect sensor to
several important aspects of their lives. Mobilizing involves a
set the initial navigation parameters, section 5 details how the
risk because of the obstacles that stand in their way, therefore
modified fictitious force algorithm was implemented using
several devices have been developed that help these people in
artificial vision for obstacle avoidance, section 6 explains
their navigation; especially in places where there are
about the QR codes detection and their functionality within
suspended objects, as these cannot be detected by the white
navigation, section 7 is about the generation and emission of
cane. Unfortunately, access to these devices is limited due to
audible instructions, while in section 8 system operation and
their high cost and lack of information about the existence of
interaction vibration button are explained. In Section 9
them.
experimental results for people with and without visual
INEC (2011) underline that the limited visual ability of impairment are shown.
individuals influences their lives in several important aspects 2. MICROSOFT KINECT SENSOR
such as social, academic and occupational. Navigation for
these people either in work or in a domestic environment The Microsoft Kinect sensor for windows was used in the
implies a big challenge because of the many obstacles that color image acquisition, which is used to find defined pattern
threaten their physical integrity, and the inadequacy of the matching within the entire image that determine the existence
cities for the blind; so, most of them should be guided by a of a two-dimensional bar code (QR Code); as well as the
walking stick, a guide dog, a relative or a friend. depth image, which delivers depth information to digitized
objects within the image and based on this, perform the
Through this prototype and using the Microsoft Kinect tool, necessary calculations of angles and required distances for the
an assistance system for the blind was developed to development of the implemented algorithm.
complement the white cane in order to allow them to safely
navigate through the detection and avoidance of obstacles, The Fig. 1, show the most important technical specifications
which they could collide with; especially objects that are not about the Kinect for Windows for example, the minimum or
at floor level. In addition, more information on the maximum distance visible into the range the navigation or the
environment that surrounds the visually impaired person is horizontal and vertical field of view, some of the most
provided by detecting certain visual messages that could be important characteristics used into the algorithm.
located at specific points of importance, such as an
information desk or an emergency exit.
225
CHAPTER 8. MOBILE ROBOTS
226
CHAPTER 8. MOBILE ROBOTS
communication transmits information to a microcontroller in navigate without the possibility of getting impacted by any
order to enable or disable vibration motors; as well as receive objects around them. To fulfill this purpose, there are
and transmit the actions performed by the user via an required the following input data to the system: The amount
interaction button. This button enables or disables the voice of digitized objects in the sensor’s image processing and the
instructions issued by the developed system or issues the distances that these are from the person.
system’s use instructions.
For the algorithm implementation it is necessary to establish a
Kinect sensor initialization was performed using the toolkit coordinate axes system that is considered for the
'Microsoft Kinect API v1.0' and its own LabVIEW library, mathematical and geometric algorithm development, as well
where the inclination angle, that should have the sensor based as generate the repulsive force emitted by the objects in the
on the user's height, is calculated. environment. It is also necessary to establish an attractive
force that is always attached to the visually impaired person
In order to obtain the maximum depth value that the Kinect within their navigation and finally, these forces will relate
sensor can visualize a triangle between points ABC is formed with a vector sum, generating a final vector containing the
as shown in Fig. 5, it is also used to estimate a user's height, rotation direction necessary to avoid the impact of the person
which corresponds to a leg of the triangle in mention. with the obstacles, as Ribeiro (2005) indicate.
Furthermore, it is established through testing and error that
the proper distance as the second leg should be approximately
2 m, the calculation of an ideal depth (D_IDEAL) is 5.1 Coordinate axes System.
performed using the Pythagorean Theorem.
The established coordinate axes system will remain invariant
This value of ideal depth is not applicative since it shows the during the development of the algorithm throughout the
ground as an obstacle to avoid when that's not the truth, thus mathematical processing of the navigation system and it is
it was decided to find a desired depth value (D_WISHED) attached to the visually impaired person as shown in Fig. 6.
that will show the obstacles to be avoided limited to a
minimum height (h '). Based on several tests and because it is
a complementary system to the white cane, it is determined
that the minimum height of the obstacle to be avoided is 35%
of the approximate user’s height value, which is calculated by
the system. To find the desired depth value, a similarity
between the A'BC' and ABC is established achieving to
eliminate the safety depth measure (D_SEC), the resulting
expression is shown in (1)
(1)
Based on (1) it is determined that desired depth is 65% of the Fig. 6. Coordinate axes System.
ideal depth. Establishing this depth as the maximum threshold
value (THR_MÁX) that the Kinect sensor can visualize in its
5.2 Repulsive Force Generation.
environment.
Kean et al. (2011) underline that the minimum established In order to generate a repulsive force emitted by the
threshold in the Kinect sensor is specified by its limited obstacles, force shall be understood as a vector and therefore
permissible depth acquisition, which is 45cm in near mode, it has characteristics such as a direction (incidence angle) and
therefore the value of THR_MIN is equal to this distance. a module, which is inversely proportional to the distance
between the person and the obstacle’s center of mass. The
mathematical expression showing the magnitude calculation
for the repulsive force |⃗⃗⃗⃗ | is (2).
|⃗⃗⃗⃗ | { (2)
Where:
Fig. 5. Getting the maximum depth of vision. Pmáx = Maximum vision depth set in the Kinect sensor.
6. ALGORITHM IMPLEMENTATION OF FICTITIOUS Pmed = Measured depth between the obstacle and the
MODIFIED FORCE BY ARTIFICIAL VISION visually impaired person.
The modified fictitious force algorithm will determine an Pmín = Minimum vision depth set in the Kinect sensor.
obstacle free route where the visually impaired person can
227
CHAPTER 8. MOBILE ROBOTS
As it can be seen in (3), the repulsive force magnitude is null Adapting this definition for the visually impaired person,
when objects are outside the range of vision, as well as an such force is mentioned as a vector that is attached to the
increase in module is seen as the obstacle gets closer to the user, with a direction in the y positive axis and with a
person. Finally, magnitude is greater than 1 when the distance constant magnitude equal to 1. This force is responsible for
to the obstacle is less than the depth of investment forces generating the movement towards in the person’s trajectory.
which warrants immediate evasion action. Fig. 9 shows the attraction force (⃗⃗⃗⃗ ) that the user generates
on their route.
|⃗⃗⃗⃗ | {( ] (3)
Where:
Pinv = Depth set for the investment of the final evasion
force.
The angle theta (θ) shown in Fig. 7, is defined by (4),
mathematical expression that relates the data of horizontal
vision angle of the Kinect sensor, the missing value of 61° to
complete the reference to the x positive axis and the x
coordinate of each object’s center of mass. An example of the Fig. 9. Attractive force generated in the user.
above is shown in Fig. 8. It should be noted that the value of
0.0890625 is the relation between pixel and maximum 5.4 Final Rotation Force.
horizontal vision angle of the Kinect sensor.
The attraction and repulsion forces obtained above are related
in order to calculate a specific evasion angle, which represent
the direction of an obstacle free route within the digitized
environment by the Kinect sensor.
For obtaining this angle, vector addition is applied by the
polygon method, in this case vectors are all fictitious forces
generated by both the obstacles and the user, thus a final
resultant force (⃗⃗⃗⃗ ) is obtained with a module and angle
value, referred to the x positive axis.
Fig. 7. Generation of a fictitious repulsive force from the
Based on the critical depths established as the Kinect sensor
obstacle to the visually impaired person.
vision, the following consideration can be seen: if the
obstacle is within the region between the investment depth
forces ( ) and the Kinect’s minimum vision depth
( ), the vector’s magnitude will have a greater value than
the unit. To calculate the final repulsive force magnitude
when two or more objects are digitized by the Kinect sensor
in the environment, (2) is used, with the difference that the
measured depth ( ) is set as the average of all obstacles
distances calculated from the visually impaired person, also
leaving this magnitude value escaladed as shown in (3) and
Fig. 10.
(( )( )) (4)
Where:
x = Horizontal position of the digitized obstacle’s center in
the navigation environment.
An attractive force from mobile robots algorithms is one that Fig. 10. Final force obtaining in an environment with the
drives the robot to reach a certain point in its trajectory called presence of two obstacles, vector sum by the polygon method
‘goal’, as mentioned by Ribeiro (2005). for repulsive vectors and subsequent vector sum by the
polygon method with the attraction force.
228
CHAPTER 8. MOBILE ROBOTS
8. PROTOTYPE DESCRIPTION
Fig. 11. QR code reading. The prototype consists of six main parts:
7. TRANSMISSION OF INSTRUCTIONS AND VEST.
A white industrial helmet with all necessary adaptations
7. 1 Audible messages generation as control instructions. to incorporate the Kinect sensor.
Audible messages are the control actions that are delivered to Sensor Kinect, which is responsible for depth image
the visually impaired people, which will be emitted by the acquisition.
main processor (computer) via a bluetooth headset.
Mini Headset KCH-850, is the device by which the user
As shown in Fig. 12, control actions will be issued to the will hear all audible messages issued by the system.
visually impaired person according to the region where the
final rotational force is oriented. If the evasion angle is from Control, communication and power supply regulation
85° to 95°, an instruction to keep forward without altering module; which is mainly the system responsible of
their direction will be emitted. If the angle is between 260° communication between the computer and the
and 280°, it means that the obstacle is in front of the user and microcontroller Atmega8.
they are forced to stop in their trajectory, then they must
evaluate the obstacles existence with a head movement so Power supply, which is a LiPo battery in charge of
they can know which route to be heading. supplying power to the entire system.
Fig. 12. Evasion angle interpretation for control instructions. 9. EXPERIMENTAL RESULTS
7.2. Switching and vibration interaction system Tests on the prototype with visually impaired people were
This system consists of vibration motors and a button, both performed using the system with and without the white cane;
placed in the user's vest. Motors’ objective shown in Fig. 13 as the prototype is a complementary system.
229
CHAPTER 8. MOBILE ROBOTS
9.1 Obstacle detection Ranges. Tests with the white cane were performed by 3 people with
total visual impairment, and these were held in different
These tests were performed in order to validate the calibrated structured and unstructured environments both indoors and
distances as a range of maximum and minimum depth outdoors. Halls, a parking lot, a backyard and an auditorium
necessary for obstacles visualization within a given space as were chosen.
shown in Fig. 7. The minimum depth that the sensor can
visualize is 45 [cm] due to Kinect’s self-limitation, the System’s operation was verified in narrow roads such as
maximum depth will be set by the algorithm that calculates corridors, where the system tends to make the blind person
the inclination and the sensor’s maximum depth based on the moves in a oscillating way because evasion depends entirely
user’s approximate height. Table 1 shows a sampling for on the interpretation of the instruction by the user and the
different heights and depth ranges established by the Kinect inclination taken by them. Fig. 14 shows an example of a
sensor. person with total acquired visual disability moving in an
oscillating way.
Table 1. Maximum and minimum depth ranges The usefulness of vibration backup system was reflected, as
when the system allowed the user to get too close to the walls
(less than 1m), vibration alerted him to stop or lower his
speed until another voice instruction is emitted by the system.
9.3 Results with visually impaired people The implementation of the modified fictitious force
algorithm helps to despise supposed obstacles that the
These tests were performed by 6 people (4 women and 2 camera vision manifests in the user’s trajectory, such as
men) of different ages in the range of 30 to 56 years old and the ground, where any evasion action should be emitted.
with three different types of visual impairment:
Detection and especially instruction for obstacles
Low vision. avoidance in the environment depend largely on the speed
with which the user walks. The system has a higher
Total blindness acquired. efficiency when the person moves at a slower pace as it
allows the system to end a voice instruction before
Total blindness of birth. moving on to the next.
230
CHAPTER 8. MOBILE ROBOTS
REFERENCES
INEC. (2011). 2010 Census Results. [Online]. Available:
www.ecuadorencifras.gob.ec/resultados/.
Kean, S., Hall, J., and Perry, P. (2011). Meet the Kinect, New
York: Apress.
Microsoft. (2014). Coordinate Spaces. Microsoft. [Online].
Available: https://msdn.microsoft.com/en-
us/library/hh973078.aspx.
Córdova, F. (2013). Theft/abandonment detection of indoors
objects using depth cameras. [Online]. Available:
http://arantxa.ii.uam.es/~jms/pfcsteleco/lecturas/2012121
2FabricioACordovaLucero.pdf.
Scaglia, G. (2009). Mobile robot navigation in unstructured
environments using linear algebra. Madrid: RIAI.
Ribeiro, M. Isabel (2005). Obstacle Avoidance. [Online].
Available:http:user.isr.ist.utl/~mir/pub/ObstacleAvoidanc
e.pdf.
Salvatore, J., Osio, J., Morales, M (2014). Object detection
using Kinect sensor. LACCEI.
231
CHAPTER 8. MOBILE ROBOTS
Abstract: Sharing the workspace or interacting directly with people in a social way is currently
a key challenge in the design of mobile robot applications. An interesting field of study is
related with the so-called hidden dimension, which relates the sense of security that a human
feels when interacting with another one. Therefore, the researchers have been interested in
developing human-friendly applications by abstracting social cues for the robotic design. One
of these is the social zone, which acts like a repulsive potential field, and if it is not respected,
an uncomfortable situation for humans is produced. In robotics, many approaches have been
proposed to define and to avoid these fields, which contributes to improve the human comfort
during interactions at least from a qualitative perspective. This paper proposes a novel null-
space-based (NSB) algorithm for a non-holonomic mobile robot platform, which is programmed
to escort a human in a behavior-based paradigm. The emphasis is posed in the evasion of other
humans in the environment by considering them as elliptical potential fields with non-holonomic
motion. Finally, simulation results are presented to show the performance of the proposed control
algorithm.
232
CHAPTER 8. MOBILE ROBOTS
tasks, and it should be chosen a behavior that balances elliptical Gaussian form, and b is the minor-axis length
the sub-objectives. The performance of the control system (see Fig. 1).
depends on how these controllers are fused. In Antonelli
Let’s define a shape-variant potential field by considering
et al. (2005) an experimental comparison among three
the major-axis as time dependent. It is based on a social
approaches are analyzed: the layered control system, the
heuristic that considers that a human needs a long and
motor scheme control and null-space behavioral control
clear space to move while walking (Chi-Pang et al., 2011).
(Chiaverini, 1997). The main difference between them is
the way they manage the sub-task outputs. Specifically,
the null-space based approach derives from an inverse kine-
matics solution by exploiting the kinematic redundancy, in
which there are more degrees of freedom than necessary
to fulfil the task. From a practical point of view, the
kinematic redundancy is not strictly present in the robotic
system, but the task can give redundancy to the system.
In robotic manipulators, the simplest method to define
the null-space is based on the inverse Jacobian (Whitney,
1969). This theory for redundant robotic manipulators has
been proposed for the execution of different formation-
control missions with multi-mobile-robots systems (An-
tonelli et al., 2009), whose collective task make the system
redundant.
This paper proposes to use the null-space based (NSB)
approach to establish an cooperative leader-follower con-
trol that considers multiple kinematic tasks during the
interaction with humans. The first one consists in keep-
ing out of social zones of humans, which are defined by
considering an elliptical potential fields moving with a Fig. 1. Schematic description of the social zone.
non-holonomic nature. Additionally, two secondary tasks
are proposed in the NSB paradigm, which are related to In consequence, the time-derivative of this field is
keep the orientation and distance of the formation. The x˙∗
designed control is simulated to test its performance. It y˙∗
has been made emphasis in the evasion of humans with dVh ∂Vh ∂Vh ∂V ∂Vh ∂Vh
x˙∗ .
= , ∗, ∗, ∗,
the above mentioned social zones. dt ∂x ∂y ∂xh ∂yh ∂a ˙∗
∗ h
yh
In this way, in Section 2 an elliptical potential field is ȧ
proposed, and, the Jacobian, that relates the potential Solving this expression through algebraic steps, it results
variation with the motion of the robot is established. Later, that
in Section 3 the avoidance of this field is defined through dVh
the minimum norm solution, i.e. by using the pseudo- = − 2Vh (x∗ − x∗h )/a2 , (y ∗ − yh∗ )/b2 ẋ∗ + · · ·
inverse of the Jacobian matrix, where additional secondary dt
tasks to keep the formation with the human leader are + 2Vh (x∗ − x∗h )/a2 , (y ∗ − yh∗ )/b2 ẋ∗h + · · ·
included as part of a null-space based design. Finally, the (x∗ − x∗h )2
performance of the algorithm is tested through simulation + 2Vh ȧ, (1)
a3
in Section 4. Also, the paper conclusions are presented in T T
Section 5. where ẋ∗ = [ẋ∗ , ẏ ∗ ] , ẋ∗h = [ẋ∗h , ẏh∗ ] . It can be compactly
expressed like
233
CHAPTER 8. MOBILE ROBOTS
− sin θh x + cos θh y The desired velocities ẋd could be applied directly to
where p := . Similarly for the human certain kind of robots, however it is usual to define the
− cos θh x − sin θh y
position, i-tasks through position errors, therefore it is used the so-
called closed-loop inverse-kinematics (CLIK) version of the
ẋh
cos θ sin θ − sin θ x + cos θ y
h h ẏ algorithm for each task, namely
ẋ∗h = h h h h
h ,
− sin θh cos θh − cos θh xh − sin θh yh ẋdi = J†i (x) (q̇di + Ki q̃i ) . (8)
θ̇h
where Ki is a positive defined design matrix, and, q̃i is the
ẋh
task error defined as q̃i := qdi − q. i is the task number
= J∗h ẏh .
which are organized by priority. In this way, according to
θ̇h
(Baillieul et al., 1984; Chiaverini, 1997), a non-minimun
If additionally it is considered a non-holonomic motion for norm solution of CLIK for hthree tasks
can be written
i as
the human gait (Arechavaleta et al., 2006; Leica et al.,
ẋd = ẋd1 + I − J†1 J1 ẋd2 + I − J†2 J2 ẋd3 . (9)
2014), i.e,
" #
ẋh cos θh 0 Each task velocity is computed individually and the overall
ẏh = sin θh 0 νh = Jh νh , NSB control action always fulfills the highest priority task
ωh ωh at non-singular configurations. However the fulfillment of
θ̇h 0 1
the lower priority tasks should be discussed in a case-by-
then it results, case basis.
νh
ẋ∗h = J∗h Jh , (4)
ωh 3.1 Stability analysis
∗ 1 − sin θh xh + cos θh yh
where Jh Jh := . By multiplying each member of (9) by J1 under the
0 − cos θh xh − sin θh yh
assumption that it is a full-range matrix, and noting that
In this way, by substituting (3) and (4) in (2), results that J1 (I − J†1 J1 ) = 0, then
∗ ν q̇1 = q̇d1 + kq̃1 . (10)
V̇h = Js Rẋ + Js pωh − Js Jh Jh h + Jf ȧ,
ωh
˙
This expression can be expressed as q̃ = −kq̃1 . If it is also
:= Jo ẋ + g. defined a Lyapunov candidate function with the form
Therefore, the total repulsive effect over each agent i in a 1
position xi := (xi , yi ) is calculated as the sum of all the V = q̃T1 q̃1 , (11)
2
repulsive effects Vhj generated by n human obstacles in then the time derivative of this function results
the shared scenario, i.e.
Xn V̇ = −q̃1 kq̃1 ,
Vi = Vhj , where k as a positive defined matrix makes the system
j=1 asymptotically stable, i.e. q̃1 → 0 with t → ∞.
and, in consequence To establish the stability for secondary tasks, it is sup-
n
X n
X n
X posed that the secondary task does not affect the primary
V̇i = V̇hj = Joj ẋi + gj . (5) task and the two tasks are in consequence compatible.
j=1 j=1 j=1
It is expressed by the linearly independence of the sub-
3. NULL-SPACE BASED CONTROL spaces N (J1 ) and N (J2 ), namely
N ⊥ (J1 ) ∩ N ⊥ (J2 ) = {0},
Let q ∈ Rm be the task variables that represent the when
states of the system to be controlled. Also, let x := N ⊥ (J1 ) ≡ N (J2 )
[x1 , x2 , · · · , xn , yn ]T ∈ R2n be the array with the positions
(xn , yn ) of the n members of the formation. Then, the i.e, the subspaces N ⊥ (J1 ) and N ⊥ (J2 ) are orthogonal. It
relation between q and x is expressed like can be expressed as
q = f (x). R(J†2 ) ⊆ N (J1 ), (12)
By considering its temporal derivative or well,
df (x) J2 J†1 = 0.
q̇ = ẋ = J(x)ẋ, (6) In this way, if (9) is pre-multiplied by J2 then it results
dx
where J := J(x) ∈ Rm×2n is the Jacobian matrix that
associated to the task. In this way, the redundancy of this q̇2 = q̇d2 + kq̃2 ,
system is given by the condition m < 2n, where 2n is which analogously to (10) by considering the Lyapunov
interpreted as the degrees of freedom of the system. In this candidate function (11) results to be asymptotically stable.
way, the problem is focused on defining control actions for Similarly for the third task,
each member to carry out the formation control objectives.
A solution to this problem is typically given by considering R(J†3 ) ⊆ N (J1 ) ∩ N (J2 ). (13)
the minimum norm solution, i.e., by using the pseudo- In this way, following a similar procedure for the second
inverse of the Jacobian matrix J† := J† (x) = JT (JJT )−1 task, the third task is demonstrated to be also asymptot-
as follows ically stable, provided that the third task does not affect
ẋd = J† q̇. (7) the second and the first tasks.
234
CHAPTER 8. MOBILE ROBOTS
Let us consider a formation of two agents of which one is Let’s q2 := [γ, xf , yf ]T be the formation posture variables,
a human. For this, it must be guaranteed two facts: the where
first one is related to the mutual coordination between y1 − y2
γ = atan , xf = x2 , yf = y2 , (16)
formation members, and, the second one related to the x1 − x2
presence of another humans in the environment. with γ the orientation of the formation and (xf , yf ) the
To guarantee the stability of the first problem, it must be position of the formation. In consequence,
designed a control in which the human acts like a leader, q2 = f (x).
i.e., he is capable to move freely, while the formation is Note that the position of the formation is centered on the
moving accordingly to maintain a pre-established forma- human leader. It is given to guarantee that the human
tion posture or shape. In this way, the formation is named is free to move in whatever direction he wants; therefore,
as human-centered because it is not possible to govern the the formation is capable to keep the formation posture by
human actions. itself.
On the other hand, for the second fact, the presence of By considering its time derivative, it results
humans and its avoidance must be established as primary
goal. These two design qualities are established in the q̇2 = J2 ẋ (17)
following Sections. where,
y1 − y2 x1 − x2 y1 − y2 x1 − x2
First, let’s x := [x1 , y1 , x2 , y2 ]T be a vector with the − −
d2 d2 d2 d2
Cartesian positions (xi , yi ), i = {1, 2} of both agents, J2 := , (18)
where the first agent is a robot follower and the second 0 0 1 0
agent a human leader. Later, the variables for each task 0 0 0 1
are defined as follows. and, its pseudo-inverse is expressed like
J†2 =
3.3 Principal task: Social zone avoidance (y1 − y2 )2 (x1 − x2 )(y1 − y2 )
y1 − y2 −
d2 d2
2
The meddling avoidance of the robot into human social −(x − x ) − (x1 − x2 )(y1 − y2 ) (x1 − x2 ) ,
zones is defined as the principal task. Then, let’s Vi be the 1 2
d2 d2
task variable that represents the repulsive effect over each
0 1 0
individual because of the presence of other individuals. For 0 0 1
this, consider (5) to be expressed for both agents (human
and robot) like where d2 = (x1 − x2 )2 + (y1 − y2 )2 .
(1)
V̇1 J 01×2 g The minimal norm solution for this task is expressed as,
= ẋ + 1 ,
V̇2 01×2 J(2) g2 ẋd2 = J†2 (q̇2d + K2 q̃2 ) , (19)
:= J1 ẋ + g, (14) T
where q2d := [γd , xf d (t), yf d (t)] , q̃2 := q2d − q2 , and,
(i)
Pn Pn K2 := k2 I3 . In this way, if the desired trajectory for
where J := j=1 Joj , gi := j=1 gj are the Jacobians
the formation is the human trajectory, i.e., xf d (t) =
and compensation motion factor for each individual i = xh , yf d (t) = yh , then it results in a trivial solution for
{1, 2} under the presence of n human obstacles. the human leader and a control action for the robot, which
In this way, the minimal norm solution for the formation are given by
in this task is expressed as, " ! !#
0 γd − γ
ẋd2 = J2†
ẋh + k2 0 ,
ẋd1 = J†1 V̇d + K1 Ṽ − g , (15) ẏh 0
where Vd := [Vd1 , Vd2 ]T and Ṽd := [Vd1 − V1 , Vd2 − V2 ]T . k2 (y1 − y2 )γ̃
k (x − x2 )γ̃
Note that in the expression (14) the resultant Jacobian = 2 1 .
ẋh
J1 brings out an uncoupled condition for the agents, i.e., ẏh
each individual is in charge of avoiding its obstacles. In
consequence, the robot agent is controlled to avoid its 3.5 Tertiary task: Formation shape
human obstacles, and, as a trivial fact, the human agent
avoids its obstacles by himself without affecting the system
Finally, the tertiary task is related to the formation shape,
stability.
i.e., the distance between the individuals. Let’s consider
The incorporation of g improves the performance of the q3 := d as the only formation shape variable, where
evasion of humans compared to other approaches for p
d = (x1 − x2 )2 + (y1 − y2 )2 , (20)
common dynamic obstacles. This is because it does not
only take into account the linear motion of the obstacle, which is the distance between them. By considering its
but also it is related to the angular motion of the human time derivative, it results
obstacle, and, its dynamic shape. q̇3 = J3 ẋ (21)
235
CHAPTER 8. MOBILE ROBOTS
r (1) . (26)
ẏd 8
3.7 Discussion 4
HUMAN LEADER
2
-10
leader trajectory. In this way, to get a rigid formation, the 0 2 4 6 8
[m]
10 12 14 16 18
236
CHAPTER 8. MOBILE ROBOTS
evasion of the dynamic obstacle. Note that by including view of the trajectories developed by the robot during the
the motion compensation term g it is reduced the meddling evasion of the dynamic obstacle. Note that by including the
in the social zone during the evasion as consequence of motion compensation term g, it is reduced the meddling
considering the variation of the field with the motion, in the social zone during the evasion by getting a longer
which in this case generates a fastest deviation to avoid but an early evasion.
the human social zone. This effect can be also seen in a
lower potential repulsion values over the robot during the
d˜ γ̃
simulation time (see Fig. 3c).
0.5
1
d˜ γ̃ 0
[rad]
[m]
0.5 -0.5 0.5
1
0
-1
[rad]
0
[m]
-0.5
0.5
-1 0 50 100 150 0 50 100 150
[s] [s]
-1.5 0
(a) Distance error. (b) Orientation error.
0 50 100 0 50 100
[s] [s] V2
×10 -3 -1.4
5
(a) Distance error. (b) Orientation error.
4 -1.6
×10 -3 V2
8 3 -1.8
[m]
[J]
-1.5
2 -2
6 -1.6
1 -2.2 g=0
[m]
-1.7
[J]
4 g 6= 0
0 -2.4
-1.8 0 50 100 150 8 10 12
2 g=0 [s] [m]
-1.9
g 6= 0
0 (c) Repulsive potential (d) Dynamic object “g”
0 50 100 11 12 13 field over robot. effect.
[s] [m]
(c) Repulsive potential (d) Dynamic object “g”
Fig. 5. Formation motion errors during frontal meddling.
field over robot. effect.
5. CONCLUSION
Fig. 3. Formation motion errors during lateral meddling.
This paper has presented a novel null-space based (NSB)
4.2 Simulation 2: Front meddling motion control to follow a human in an unstructured
scenario by considering a differential drive mobile robot.
For this simulation it is proposed to consider a frontal For this, it has been proposed the definition of a “social”
meddling between the robot and a human obstacle with potential field, which according the researchers, improves
time variant velocity. In this case, it is emphasized the the social acceptance of the mobile robots during human-
improving of the performance, regarding the avoidance of robot interactions. Additionally, it has been supposed a
the meddling in the elliptical social field when its major non-holonomic motion of the human obstacles that allows
axis is not fixed but defined as a := a(t) = b + ka νh , where to define a more natural behavior during human obstacle
b is a constant that defines the minor axis of the potential evasion. As a complementary part of the motion control,
field, ka is a positive constant design, and, νh is the linear it has been proposed a flexible formation control where
velocity of the human obstacle. the human is considered as part of the formation. For
The time-parametrized trajectories are presented in Fig. this, it has been necessary to consider a leader/human-
4. centered scheme. Finally, simulation results are presented
to contrast the positive effect that has the inclusion of the
3
HUMAN LEADER
potential field variation during the motion of the human
2
1
DYNAMIC HUMAN OBSTACLE
obstacle, regarding lower meddling in the social zone of
ROBOT FOLLOWER
0 t 3= 31 [sec] the human obstacles. It is believed that the proposed
[m]
-1
algorithm improves the social acceptance of mobile robots
-2
-3
t 1= 7 [sec]
t 2= 19 [sec] during interactions with humans.
-4
STATIC HUMAN OBSTACLE
0 2 4 6 8 10 12 14 16 18 REFERENCES
[m]
Antonelli, G., Arrichiello, F., and Chiaverini, S. (2009).
Fig. 4. Top view trajectories. Experiments of formation control with multirobot sys-
tems using the null-space-based behavioral control.
Additionally, the motion errors for each sub-task are pre- IEEE Transactions on Control Systems Technology,
sented in Fig. 5, where it is also presented an augmented 17(5), 1173–1182.
237
CHAPTER 8. MOBILE ROBOTS
238
CHAPTER 9
NEW ENERGIES
CHAPTER 9. NEW ENERGIES
Abstract: The Microgrids are part of a special class of power systems that offer an attractive
option for the use of sustainable energy. The study imposes several challenges from the point of
view of control; in the literature this problem has been addressed by assuming that the dynamics
of power converters has been drastically simplified. In this article, we exploit the Hamiltonian
structure exhibited by the network and we develop a distributed control scheme for a mesh
topology including dynamic converters.
240
CHAPTER 9. NEW ENERGIES
Hamiltonian systems via another port-controlled Hamil- the same direction than a given basic loop), −1 if points
tonian system (PCH). As a result, the controller design out the basic ambit (if decreases in the opposite direction
and the stability analysis are remarkable simplified. Relax- than the basic loop) and 0 if does not belong to the basic
ations of some assumptions, technical proofs and extensive cutset (basic loopset).
numerical testing are deferred to a journal article follow.
As usual, the lumped elements are of three kinds, namely,
The rest of the paper is organized in the following way: sources, energy stores (inductors and capacitors) and dis-
Section 2 presents the Hamiltonian model for the meshed sipators. Besides, as in Brayton and Moser (1964), it is
network. Section 3 is devoted to the power converter considered that the circuit is complete, so that the sources,
model and the design of local controllers. In Section 4 the the capacitors and some (voltage-controlled) dissipators
complete system is presented and its stability properties are considered as branches while all the inductors and some
are stated. Finally, the desired steady–state behavior is (current-controlled) dissipators are chords, leading to the
formulated in Section 5. partitions
" # " #
i1 v1
i v
2. MICROGRID STRUCTURE it = iC ; vt = vC ; ic = Rc ; vc = Rc
iL vL
iRt vRt
A Microgrid is an electrical network connected to a dis- (3)
tribution system, of low and medium voltage, combining where v1 , i1 ∈ Rn1 are the port variables of the sources,
(typically renewable) energy generating units and loads. vC , iC ∈ Rn2 the variables associated with the capacitors
In the network can be identified: Distributed Generation and vRt , iRt ∈ Rn3 the corresponding to the tree dissi-
(DG) units, power converters, transmission lines and loads. pators, such that n1 + n2 + n3 = n − 1. For the co-tree
In this section the mathematical model of each component variables, vRc , iRc ∈ Rn4 conform the co-tree dissipators,
is presented from PCH perspective, later to present a com- while vL , iL ∈ Rn5 the inductors, with n4 +n5 = b−(n−1).
prehensive and modular model of Microgrid. Finally, the
control law for the inverters are extended to a distributed Under the partition presented, the matrix H takes the
control law that stabilizes the complete Microgrid. form " #
H1R H1L
H = HCR HCL (4)
2.1 Network dynamical model HRR HRL
where each sub-matrix, of appropriate dimensions, repre-
In this section the structure and dynamic of the net- sents the interconnections among the different tree and
work that interconnects the generation systems (i.e. power co-tree elements.
converters) with the loads are addressed. The network is
viewed as an electrical circuit showing that its dynamic The port variables of the capacitors and inductors are
behavior corresponds to the exhibited by a Hamiltonian related with the energy function of the network Ha :
system. An approach based on Graph theory (see Bollobás Rn2 ×n5 → R≥0 defined as
(1998)) that closely follows Avila-Becerril et al. (2015) is n2
X n5
X
considered. Ha (qC , λL ) = HaCi (qCi ) + HaLi (λLi ) (5)
Consider that the power network is represented by a i=1 i=1
graph composed by n nodes and b edges. Hence, there where qCi ∈ R is the i-th entry of the electrical capacitor
exist n − 1 independent current constraints and b − n + 1 charges vector qC ∈ R2 and λLi ∈ R the i-th entry of the
independent voltages constrains, established by Kirchhoff linkage inductor fluxes vector λL ∈ R5 . Then, it holds that
laws. Consider now a given tree of the circuit (integrated
by the n nodes and n − 1 edges such that no loops are ∂Ha (q, λ)
q̇Ci = fCi ; eCi = ; i = 1, . . . , n2 (6a)
formed) and its corresponding co-tree (given by the set of ∂qCi
b−(n−1) edges that do not belong to the tree). Under these ∂Ha (q, λ)
conditions and exploiting the concepts of basic cutsets λ̇Li = eLi ; fLi = ; i = 1, . . . , n5 (6b)
∂λLi
and loopsets of the graph (also see Wellstead (1979)), the
While the dissipators satisfies
current and voltage constraints can be obtained as
fRti = ψti (eRti ); eRci = ψci (fRci ) (7)
i vt where ψti (·), ψci (·) are assumed bijective functions and
[ I H ] t = 0, −H IT
= 0 (1)
ic vc fRti , eRti , eRci , fRci are, respectively, the i-th entry of
fRt ∈ Rn3 , eRt ∈ Rn3 , eRc ∈ Rn4 , fRc ∈ Rn4 .
where the matrix H ∈ R(n−1)×b−(n−1) , known as the fun-
damental loop matrix, completely characterizes the topol- Microgrid’s Topology. Typical topologies can be found
ogy of the circuit, while I is a generic identity matrix of in Avila-Becerril et al. (2015), in terms of the fundamental
proper dimensions. From (1) above, it is clear that the loop matrix. In this paper, the analysis is focused in the
structure of H determines how the tree currents and the so–called Mesh network since it states the more general
co-tree voltages can be generated as linear combinations and reliable topology. We assume that each transmission
of the co-tree currents and the tree voltages, respectively, line is modeled by the π model Kundur et al. (1994), i.e.,
via a series circuit composed by a resistor and an inductor
it = −Hic , vc = H T vt . (2) with a shunt capacitor. Under this structure, the lines are
The entries of H (and H T ) are 1 if a co-tree current (a two–port systems giving the possibility to connect either
tree voltage) points into a given basic ambit (decreases in a source or a load.
241
CHAPTER 9. NEW ENERGIES
The structure of a Mesh topology satisfies: These systems are composed by a microsource that fed
a power converter. In this paper the microsources are
F.1. All the sources are connected to all the loads via
considered as constant voltage sources. Note that including
transmission lines.
a source dynamic in a Hamiltonian framework states
F.2. The loads are parallel connected with one capacitor
only an additional step in the formulation of a single
of the π model.
Hamiltonian system that describes the whole Microgrid.
F.3. The sources are parallel connected with one capacitor
of the π model. In Figure 1 is presented the topology of the i-th electronic
F.4. There exist transmission lines that connects a source inverter, i.e. it consist of the aforementioned voltage source
with another source. which delivers a voltage Vi ∈ R > 0, a switching array, that
modulates the input voltage via the signal ui ∈ R and a
As shown in Avila-Becerril et al. (2015), if the resistors second order LC filter.
involved in the π model are considered as tree dissipators
then n3 = n5 and identifying the co–tree dissipators with
the loads, HRL = I, HRR = 0 of proper dimension, while
HCR = I and H1R = 0. In addition, due to F.1 and
F.2, n1 = n2 = n4 while F.3 leads to the elimination,
for analysis purposes, of each of the capacitors parallel
connected to the sources.
Even that the network model can be formulated under
this general scenario Avila-Becerril et al. (2015), with Fig. 1. Illustrative power converter DC/AC
the aim to, first, situate the contribution with respect
the related literature Schiffer et al. (2014), Simpson-Porco
Let x1i ∈ R denote the linkage inductor flux and x2i ∈ R
et al. (2013), and second, to simplify the presentation, the
the electrical capacitor charge of each converter. From a
following assumption is considered:
direct application of Kirchhoff laws, the dynamic model
A.1. The network is lossless or dominantly inductive. for the i-th device, i = 1, 2, . . . , n1 , is given by
Under assumption A.1, the tree dissipators are no longer
included. The model can be obtained from equations (1) ẋ1i = −Ci−1 x2i + V ui (10a)
and (6), under F.1-4. An advantage of this model lies in ẋ2i = L−1 − f1i
i x1i (10b)
property that it exhibits a port–controlled Hamiltonian
structure (Van der Schaft (1999)). Indeed, defining the where it has been assumed a linear constitutive relation-
state variables x3 = qC and x4 = λL , the model can be ship for both the inductors and the capacitors with Li ∈
written in matrix form as R > 0 the inductance and Ci ∈ R > 0 the capacitance.
ẋ34 = J34 ∇x34 Ha (x3 , x4 ) − F34 fRc + G34 e1 (8) If Hci : R × R → R≥0 is the stored energy function, given
T T T by
where x34 = x3 x4 ∈ R(n1 +n3 ) , ∇x34 Ha (x3 , x4 ) =
1 1 −1 2
∂Ha (x3 ,x4 )
and Hci (x1i , x2i ) = L−1 2
i x1i + Ci x2i , (11)
∂x34
2 2
0 −HCL T 0 then, model (10), can be equivalently written as a port–
J34 = T = −J34 ; G34 = T
HCL 0 H1L controlled Hamiltonian system of the form
I 0
F34 = ẋ12i = J12i ∇x12i Hci (x1i , x2i ) + G12i ui − (12)
0 f1i
subject to the constraints T
where x12i = [ x1i x2i ] ∈ R2 while
f1 = H1L fL , eRc = eC = ψc (fRc ) (9)
0 −1 Vi
This equation shows the demanded current to the sources J12i = = −JT12i ; G12i =
1 0 0
as a linear combination of the transmission line currents
connected to them. In the sequel, it is assumed ideal In this context, the port-variables of each converter are
sources in the sense that they can provide any amount the capacitor voltage e1i = Ci−1 x2i ∈ R and the output
of current, so that only steady state stability issues can be current f1i ∈ R.
approached.
The structure considered for the power converters is only
illustrative. Actually, the switching array is just a repre-
3. SOURCES AND LOADS DYNAMICAL MODELS
sentation of several topologies used in practice and has
been over simplified for presentation purposes. However,
In this section we study the inclusion of power converter
it has been widely shown (see for example Ortega et al.
models into the Microgrid description and loads modeling
(2013)) that common converter topologies can be repre-
from an energy–based perspective.
sented under a Hamiltonian structure.
3.1 Source model It is now convenient to illustrate how the output voltage
of the converter can be controlled to achieve a prescribed
In the sequel, sources are viewed as a one–port dynamical value. To do this, it is necessary to introduce the concept
systems with port–variables given by the output voltage of admissible trajectories which refers to the set of state
e1i and the delivered current f1i with i = 1, 2, . . . , n1 . trajectories that a given dynamical system can reproduce
242
CHAPTER 9. NEW ENERGIES
under a specific input. For the defined Microgrid sources, Ortega et al. (2013). It is clear that the port–variables of
this set is given by the solutions of the loads attached to the network, the current fRc and the
0 voltage eRc , can be related with a port–controlled Hamil-
ẋ?12i = J12i ∇x12i? Hci
?
+ G12i u?i − , (13) tonian system. Moreover, if the interconnection between
iLi
this system and the network (8) is carried out in a power
that recover the behavior of (12) under the input u?i . preserving way, it is obtained another Hamiltonian system.
Equation (13) allows to identify x?12i as the desired steady– We now assume that the system operates around a pre-
state behavior for the sources variables. Thus, it is possible scribed admissible trajectory, as already illustrated in (13)
to immediately formulate a control problem by defining the for the power converter sub–system. Hence, the concept
error signal x̃12i = x12i −x?12i and finding the control input of Incremental Passivity, studied in Pavlov and Marconi
ui that renders this variable to zero. In the next proposi- (2008) and in the case of electrical circuits in Jayaward-
tion a controller that achieves asymptotic stabilization of hana et al. (2007), states a way to characterize the input–
the prescribed behavior for each non interconnected power output behavior of the loads via the following assumption
converters is developed, leaving to Section 4 the Microgrid
stabilization. A.5 The port–variables of each load, related by the con-
Proposition 1. Consider a DC/AC power converter of the stitutive relationship (9), satisfies the incremental
form (12) and assume passivity condition
T
A.2 The state x12i and the output current f1i are available (eRc − e?Rc ) ψc−1 (eRc ) − ψc−1 (e?Rc ) > 0
for measurement. for a given admissible trajectory e?Rc (t) ∈ Rn1 .
A.3 The parameters Li and Ci are known.
A.4 The prescribed steady–state behavior x?12i is known. 4. MICROGRID STABILIZATION
Under these conditions, the control law
In this section a complete model for the approached
ui = Vi−1 [ẋ?1i + Ci−1 x?2i − K1i L−1
i x̃1i ] (14)
Microgrid is presented. Consider first the interconnection
with the desired state satisfying between the sources (power converters) and the network.
ẋ?2i − L−1 ? −1
i x1i + iLi − K2i Ci x̃2i = 0 (15) Notice that the n1 individual power converters of the form
and K1i > 0 and K2i > 0, guarantees that (12) can be pilled up as
lim x̃12i = 0 ẋ1 = −C −1 x2 + V u
t→∞
ẋ2 = L−1 x1 − f1
Proof. From (12) and (13) the error dynamic is given by with total stored energy function
x̃˙ 12i = J12i ∇x̃12i H̃ci (x̃1i , x̃2i ) + G12i ũi (16) 1 1
Hc (x1 , x2 ) = xT1 L−1 x1 + xT2 C −1 x2 (20)
where 2 2
1 1 −1 2
H̃ci (x̃1i , x̃2i ) = L−1 2
i x̃1i + Ci x̃2i (17) where x1 = col{x1i } ∈ Rn1 , x2 = col{x2i } ∈ Rn1 ,
2 2 V = diag{Vi } ∈ Rn1 ×n1 , u = col{ui } ∈ Rn1 , L−1 =
?
and ũi = ui − ui .
diag{L−1
i }∈R
n1 ×n1
and C −1 = diag{Ci−1 } ∈ Rn1 ×n1 .
On the other hand, (14) and (15) can be written as
On the one hand, the vector f1 ∈ Rn1 stand for the
G12i ũi = −K12i ∇x̃12i H̃ci (x̃1i , x̃2i ) (18) currents injected to the network satisfying the constraint
with K12i = diag {K1i , K2i } ∈ R . Substitution of (18)
2×2 (9), which in terms of the network stored energy takes the
in (16) leads to the closed–loop error dynamic form
∂Ha (x34 )
x̃˙ 12i = [J12i − K12i ] ∇x̃12i H̃ci (x̃1i , x̃2i ) (19) f1 = H1L (21)
∂x4
If H̃ci (x̃1i , x̃2i ) in (17) is considered as a Lyapunov function with Ha (x34 ) as in (5) and where it has been used the
candidate, its time derivative along the trajectories of the identity (6b). On the other hand, the voltage of the source
error dynamic (19) is given by ports are given by the output converter voltages
T e1 = C −1 x2 (22)
H̃˙ ci (x̃1i , x̃2i ) = − ∇x̃12i H̃ci K12i ∇x̃12i H̃ci < 0
While the variables fRc , eRc ∈ Rn1 of the network load
expression that, due to the linear structure of the Lya- ports, under (7) and (6a), can be represented as
punov function, implies the claimed asymptotic stability
∂Ha (x34 )
property. fRc = ψc−1 (23)
∂x3
2 Representing the power converters model in a Hamiltonian
structure, using (20), together with (8), the port variables
In Section 4 it is shown that these stability properties are (21), (22) and (23), leads to a dynamic description of the
preserved if these devices are interconnected through the complete Microgrid.
network.
Proposition 2. The dynamic behavior of the complete Mi-
3.2 Load model crogrid conformed by the network (8) with sources (10a-
10b) and loads satisfying assumption A.2, can be repre-
Modeling of electrical loads in a Hamiltonian framework is sented by the port-Hamiltonian system
a topic that has been widely studied since many years ago ẋ = JT ∇x HT (x) − gRT Ψ(x34 ) + GT u (24)
243
CHAPTER 9. NEW ENERGIES
and matrices of appropriate dimensions so that H̃T is non-increasing and its argument x̃ is
0 −I 0 0
0 0
bounded. Moreover, since H̃˙ is zero only in x̃ = 0 the
I 0 0 −H1L T 0 0 equilibrium is asymptotically stable.
JT = 0 0 0 −HCL = −JT ; gRT = 0 I ;
T T
2
0 H1L HCL 0 I 0
Remark. The importance of the presented result lies in
Vu
0 0 the fact that the controller (27-28) guarantees that, for any
GT u = Ψ(x34 ) = −1 ∂Ha (x34 ) admissible trajectory, the error between the actual value of
0 ψc
0 ∂x3 the capacitors parallel connected with the loads, will tend
to the desired value x?3 . Therefore, by ensuring that this
2 desired value corresponds to a sinusoidal function with a
We now can develop a control strategy to stabilize the prescribed amplitude and frequency, then both voltage and
complete grid following the same procedure than the frequency stability of the power network will be achieved.
followed in Section 3.1. In this sense, it is necessary to
define the admissible trajectories which for system (24) 5. STEADY STATE DESIRED BEHAVIOR
are the solutions of
The proposed method for specifying the desired values
ẋ? = JT ∇x? HT (x? ) − gRT Ψ? (x?34 ) + GT u? (25) is based on the following rationale: The steady state be-
where u ∈ R is the control input that generates the
? n1 havior of the voltage C −1 x?2 establish the voltage Ca−1 x?3 ,
admissible trajectory x? ∈ R3n1 +n2 . So that, with the accordingly a natural choice is to associate to each power
definition of the desired system, the error variable is set as converter the voltage
x̃ = x − x? and their corresponding error dynamic is Ci−1 x?2i = A?i sin(ωs t + δi? ),
x̃˙ = JT ∇x̃ HT (x̃) − gRT Ψ̃(x̃34 ) + GT ũ where ωs ∈ R takes the same value for all the power
where ũ = u − u? . In this case, the associated stored converters, while the magnitude Ai : R≥0 → R≥0 and
energy–like function takes the form the phase δi ∈ R≥0 → S must be determined to get
an adequate power flow. Let the complex admittance be
1 denoted as Yik := Gik + jBik ∈ C with conductance
H̃T (x̃) = x̃T P x (26)
2 Gik ∈ R and susceptance Bik ∈ R and let Ni be the set
−1 −1 −1 −1
with the matrix P = diag L , C , Ca , La > 0. of neighbors of the i − th node for which Yik 6= 0. That
Under the foregoing scenario, it is possible to formulate the said, the desired active and reactive power at the i − th
main stabilization result of the paper, i.e. the proof that node for a lossless microgrid Pi? : Sn1 +n2 × Rn1 +n2 → R
the local controllers developed for the power converters and Q?i : Sn1 +n2 × Rn1 +n2 → R, are obtained as
are capable of stabilize the entire Microgrid. This result is X
Pi? = |Bik |A?i A?k sin(δi? − δk? ) (29a)
included in the next
k∼Ni
Proposition 2. Consider a Microgrid system of the form X
Q?i = |Bii |A?2
i − |Bik |A?i A?k cos(δi? − δk? ) (29b)
(24). Assume A.1–A.3 from Proposition 1 are verified and
k∼Ni
in addition assume that
with X
A.4 The parameters La and Ca are known. Bii := B̂ii + Bik
Under these conditions, the control law k∼Ni
u = V −1 [ẋ?1 + C −1 x?2 − K1 L−1 x̃1 ] (27) and B̂ii ∈ R the shunt susceptance.
with K1 = diag{K1i } ∈ R n1 ×n1
and the desired state The power flow equations above (29) are static and model
satisfying the constraints the network when it is balanced, that is, the net sum
of power consumption, injections and dissipated power is
ẋ?2 − L−1 x?1 + H1L L−1 ?
a x4 − K2 C
−1
x̃2 = 0, zero, and determine the desired steady state operation
ẋ?3 ? ˜
+ HCL ∇x4 HT − K3 ∇x̃3 HT + ϕ−1 ?
?
c (vC ) = 0 (28) of the network. The steady state can be determined by
? T T
ẋ4 − H1L ∇x?2 − HCL ∇x?3 − K4 ∇x̃4 H̃ = 0 finding, for a given set of load conditions, the active and
where K1 , K2 , K3 , K4 are diagonal positive gains, guaran- reactive power flow of the network and the magnitudes and
tees that phase angles of all nodes.
lim x̃ = 0 In other words, one way to generate the desired trajectories
t→∞
is solving the power flow equations (29). We propose to fix
Proof. The control law in equations (27) and (28) can be the active and reactive power at the loads, through the
equivalently written as resistances, and solve equations (29a) and (29b) in order
to calculate the desired magnitude A?i and the phases δi?
GT ũ = −KT ∇x̃ H̃T of the n1 nodes with voltage C −1 x?2 . Once founded the
with KT = diag{K1 , K2 , K3 , K4 } ∈ R(4n1 )×(4n1 ) > 0. voltages C −1 x?2 = A?i sin(ωs t + δi? ) that meets the load
Using this expression and considering (26) as a Lyapunov power demand, the restriction (28) is incorporated for
244
CHAPTER 9. NEW ENERGIES
x?1 ∈ Rn1 such that the control law u ∈ Rn1 in (14) can be Hill, D.J. and Chen, G. (2006). Power systems as dynamic
implemented. networks. In Circuits and Systems, 2006. ISCAS 2006.
Proceedings. 2006 IEEE International Symposium on,
Note that the algebraic equations (29) are non linear, both 4–pp. IEEE.
in the voltage and in the angle, therefore the solution Jayawardhana, B., Ortega, R., Garcı́a-Canseco, E., and
involves the use of a numerical methods, for example Castanos, F. (2007). Passivity of nonlinear incremental
Newton-Raphson. In the last years, the control community systems: Application to pi stabilization of nonlinear rlc
has been engaged in research from solubility conditions circuits. Systems & control letters, 56(9), 618–622.
of the power flow equations (see for example Simpson- Kundur, P., Balu, N., and Lauby, M. (1994). Power system
Porco et al. (2015); Dvijotham et al. (2015) and the stability and control, volume 7. McGraw-hill New York.
references therein) to the optimization problem (Madani Lasseter, R.H. (2002). Microgrids. In Power Engineering
et al. (2015); Wei and Bandi (2015) between others). Society Winter Meeting, 2002. IEEE, volume 1, 305–
However these studies are out of the scope of this paper. 308. IEEE.
Madani, R., Lavaei, J., and Baldick, R. (2015). Convexi-
6. CONCLUSION fication of power flow problem over arbitrary networks.
In IEEE 54th Ann. Conf. Decis. Contr.(CDC).
This paper has addressed the problem of stabilizing a Marwali, M.N., Jung, J., Keyhani, A., et al. (2007).
Meshed Microgrid which, unlike as is usual in the liter- Stability analysis of load sharing control for distributed
ature, has been included the dynamics of the power con- generation systems. IEEE Transactions on Energy
verters. It has been shown that a distributed control law Conversion, 22(3), 737.
developed for each source converter preserves its stabiliz- Ortega, R., Perez, J.A.L., Nicklasson, P.J., and Sira-
ing properties even when the converters are interconnected Ramirez, H. (2013). Passivity-based control of Euler-
to the network through a Mesh topology. Crucial to reach Lagrange systems: mechanical, electrical and electrome-
the presented results has been the Hamiltonian structure chanical applications. Springer Science & Business Me-
exhibited by the different devices that conform the grid. dia.
Pavlov, A. and Marconi, L. (2008). Incremental passivity
and output regulation. Systems & Control Letters,
ACKNOWLEDGEMENTS
57(5), 400–409.
Pedrasa, M.A. and Spooner, T. (2006). A survey of
Part of this work was supported by DGAPA-UNAM under techniques used to control microgrid generation and
grant IN116516. storage during island operation. In Proceedings of
the 2006 Australasian Universities Power Engineering
REFERENCES Conference (AUPEC’06), 1–6.
Schiffer, J., Ortega, R., Astolfi, A., Raisch, J., and Sezi,
Avila-Becerril, S., Espinosa-Pérez, G., and Fernández- T. (2014). Conditions for stability of droop-controlled
Carrillo, P. (2015). Stability and consensus of electrical inverter-based microgrids. Automatica, 50(10), 2457–
circuits via structural properties. IFAC-PapersOnLine, 2469.
48(13), 111–116. Simpson-Porco, J.W., Bullo, F., et al. (2015). A solvability
Barklund, E., Pogaku, N., Prodanović, M., Hernandez- condition for reactive power flow. In 2015 54th IEEE
Aramburo, C., and Green, T.C. (2008). Energy Conference on Decision and Control (CDC), 2013–2017.
management in autonomous microgrid using stability- IEEE.
constrained droop control of inverters. Transactions on Simpson-Porco, J.W., Dörfler, F., and Bullo, F. (2013).
Power Electronics, IEEE, 23(5), 2346–2352. Synchronization and power sharing for droop-controlled
Bollobás, B. (1998). Modern graph theory, volume 184. inverters in islanded microgrids. Automatica, 49(9),
Springer Science & Business Media. 2603–2611.
Brayton, R. and Moser, J. (1964). A theory of nonlinear Van der Schaft, A. (1999). L2-gain and passivity in
networks i. Quart. Appl. Math, 22(1), 1–33. nonlinear control. Springer-Verlag New York, Inc.
Dorfler, F. and Bullo, F. (2013). Kron reduction of Wei, E. and Bandi, C. (2015). Fairness considerations in
graphs with applications to electrical networks. Circuits network flow problems. In 2015 54th IEEE Conference
and Systems I: Regular Papers, IEEE Transactions on, on Decision and Control (CDC), 6909–6914. IEEE.
60(1), 150–163. Wellstead, P. (1979). Introduction to physical system
Dvijotham, K., Chertkov, M., and Low, S. (2015). A modelling. Academic Press London.
differential analysis of the power flow equations. arXiv
preprint arXiv:1506.08814.
Fang, X., Misra, S., Xue, G., and Yang, D. (2012). Smart
gridthe new and improved power grid: A survey. Com-
munications Surveys & Tutorials, IEEE, 14(4), 944–980.
Farhangi, H. (2010). The path of the smart grid. Power
and Energy Magazine, IEEE, 8(1), 18–28.
Guerrero, J.M., Chandorkar, M., Lee, T.L., and Loh,
P.C. (2013). Advanced control architectures for intel-
ligent microgrids, part i: decentralized and hierarchical
control. IEEE Transactions on Industrial Electronics,
60(4), 1254–1262.
245
CHAPTER 9. NEW ENERGIES
Abstract: Para operar un módulo fotovoltaico en el punto de máxima potencia, es necesario tener
una carga que sea capaz de consumir dicha potencia, es por eso que en este artı́culo se propone una
interconexión a la red eléctrica del módulo mediante dos convertidores de potencia, uno de ellos es un
convertidor elevador, el cual tendrá la tarea de mantener el módulo fotovoltaico en el punto de máxima
potencia, y el segundo es un inversor, el cual se encargará de transmitir la potencia extraı́da del modulo
a la red eléctrica. Estos dos objetivos se pueden lograr con un buen diseño de dos leyes de control, las
cuales se diseñarán con técnicas basadas en pasividad.
Keywords: Control basado en pasividad, interconexión de sistemas pasivos, convertidor cd/cd elevador,
inversor, sistemas fotovoltaicos, MPPT conectado a la red eléctrica.
246
CHAPTER 9. NEW ENERGIES
PI para el convertidor cd/cd elevador y para el inversor como R1 0 0 0 0 0
MPPT. Una evaluación numérica en MATLAB se presenta en 0 R2 0 0 0 0
la Sec. 5. Finalmente, se presentan comentarios finales y trabajo
0 0 R3 0 0 0
futuro en la Sec. 6. R=
0 0 0 R4 0 0
0 0 0 0 R5 0
0 0 0 0 0 R6
2. ANÁLISIS DEL SISTEMA
0 −1 0 0 0 0
1 0 −u1 0 0 0
El sistema con el que se trabajará a lo largo de este estudio
0 u1 0 −1 0 0
se presenta en la Fig. 2, el cual consta de ocho secciones: J(u) =
0 0 1 0 −1 0
i) el módulo fotovoltaico (MFV), ii) un filtro de entrada, iii 0 0 0 1 0 −u2
un convertidor cd/cd elevador, iv) un inductor de enlace, v) 0 0 0 0 u2 0
un capacitor que genera el bus de dc, vi) un inversor, vii) −1
un inductor de enlace con la red eléctrica y viii) dos bloques C1 0 0 0 0 0
0 L−1 0 0 0 0
que generan dos leyes de control, una para cada convertidor. 2
Tomando en cuenta las leyes de Kirchhoff y las relaciones 0 0 C −1
0 0 0
D= 3
0 0 0 L−14 0 0
Convertidor 0
Elevador L4 Inversor L6 0 0 0 C5−1 0
C1 0 0 0 0 0 L−1 6
= =
C5 Vr Algunas caracterı́sticas del sistema son las siguientes:
PVM =
• El sistema cuenta con dos entradas de control, una de ellas
u1 u2
se usará para MPPT del MPV y la otra para inyectar a la
red la potencia extraı́da del MPV.
Ley de control Ley de control • Ninguna de las dos señales de control son afı́n a la entrada,
y sólo actúa en cuatro canales del sistema.
• El sistema es sub-actuado.
Fig. 2. Sistema conectado a la red eléctrica • La matriz R > 0, R = RT ; J = −J T y D > 0, D = DT .
constitutivas de los elementos, entonces es posible llegar a la 2.1 Prueba de estabilidad del convertidor
representación matemática mostrada en (1-6):
1 La función de almacenamiento de energı́a del sistema es
C1 ż1 + z1 + z2 = ip (1)
R1 1
L2 ż2 + R2 z2 − z1 + u1 z3 = 0 (2) V (x) = xT Dx (8)
2
1 cuya derivada evaluada a lo largo de las trayectorias del sistema
C3 ż3 + z3 + z4 − u1 z2 = 0 (3)
R3 está dada por:
L4 ż4 + R4 z4 − z3 + z5 = 0 (4) V̇ (x) = xT D[(J(u1 , u2 ) − R)∇x H(x) + E]. (9)
1
C5 ż5 + z5 + u2 z6 − z4 = 0 (5) Notar que xT D = ∇Tx H(x) y J(u1 , u2 ) es una matriz an-
R5
L6 ż6 + R6 z6 − u2 z5 = −Vr (6) tisimétrica que se encuentra en una forma cuadrática. Por lo
tanto, este termino es cero para todo tiempo, entonces:
donde z1 es el voltaje del MPV, z2 es la corriente de entrada del
convertidor elevador, z3 es el voltaje de salida del convertidor V̇ (x) = −∇Tx H(x)R∇x H(x) + ∇Tx H(x)E. (10)
elevador, z4 es la corriente del inductor de enlace, z5 es el Ahora, se define µ = E y y = ∇x H(x) entonces
voltaje de entrada del inversor, z6 es la corriente de salida del
V̇ (x) = −y T Ry + y T µ (11)
inversor.
por lo tanto, el sistema es estrictamente pasivo a la salida desde
Sin embargo, en lugar de trabajar con voltajes y corrientes en µ −→ y. Esto quiere decir que con µ = 0, V̇ (x) es definida
el sistema, se realizará un cambio de variables para trabajar con negativa, entonces podemos concluir estabilidad asintótica al
flujos y cargas, recordando que VC = Cq e iL = Lφ, donde q punto de equilibrio x = 0.
es carga del capacitor, y φ flujo magnético del inductor. Como
resultado, el modelo matemático se puede representar en forma Conjectura 1. Debido a que la ley de control se encuentra
matricial a través de una estructura Hamiltoniana (7): en el término antisimétrico, el cual es cero en la prueba de
estabilidad, el sistema siempre es, global y asintóticamente
ẋ = (J(u1 , u2 ) − R)∇x H(x) + E (7) estable, sin importar la ley de control, con entrada µ = 0.
donde
1 El problema que se resuelve en este documento es para cuando
H(x) = xT Dx µ es diferente de cero.
2
qC1 x1 ip
φL 2 x 2 0 3. ANÁLISIS DE PASIVIDAD DEL SISTEMA
qC3 x3 0
x= = E= Debido a que la señal de control no entra en todos los canales
φL 4 x 4 0
qC x5 0 del sistema, se propone separar al sistema en cuatro sub-
5
φL 6 x6 −Vr sistemas, dos de ellos que no dependan de ninguna de las
247
CHAPTER 9. NEW ENERGIES
señales de control, los otros dos sub-sistemas deberán depender Se define Υ2 = E2 y Y2 = ∇x̄2 H(x̄2 ) entonces
de una de las señales de control cada uno, es decir
V̇ (x̄2 ) = −Y2T R̄2 Y2 + Y2T Υ2 (23)
Σ1 : (12)
por lo tanto el sistema es estrictamente pasivo a la salida desde
1 1 Υ2 −→ Y2 .
ẋ1 = − x1 − x2 + ip (13)
R1 C1 L2
Σ2 : (14) 3.3 Análisis de pasividad de Σ3
x̄˙ 2 = [J2 u1 − R̄2 ]∇x̄2 H2 (x̄2 ) + E2 (15)
Se propone la función de almacenamiento de energı́a
1
H2 (x̄2 ) = x̄T2 D2 x̄2 (16) 1 2
2 V (x4 ) = x (24)
2L4 4
donde
1 con derivada evaluada a lo largo de las trayectorias del sistema
x1 dada por
0 −1 R2 0
J2 = R̄2 = E2 = C11 1 R4 1 1
1 0 0 R3 V̇ (x4 ) = x4 − x4 + x3 − x5 . (25)
− x4
L4 L4 L4 C3 C5
−1
x2 L2 0 Se define µ4 = 1
C3 x3 − C5 x5 ,
1
L4 x4 entonces
1
y4 =
x̄2 = D2 = .
x3 0 C3−1
V̇ (x4 ) = −R4 y42 + y4T µ4 (26)
por lo tanto es estrictamente pasivo a la salida desde µ4 −→ y4 .
Σ3 :
R4 1 1 3.4 Análisis de pasividad de Σ4
ẋ4 = − x4 + x3 − x5
L4 C3 C5
Σ4 : Se propone la función de almacenamiento de energı́a
1
x̄˙ 4 = [J4 u2 − R̄4 ]∇x̄4 H4 (x̄4 ) + E4 V (x̄4 ) = x̄T4 D4 x̄4 (27)
1 2
H4 (x̄4 ) = x̄T4 D4 x̄4 con derivada evaluada a lo largo de las trayectorias del sistema
2
donde V̇ (x̄4 ) = ∇Tx̄4 H(x̄4 )[(J4 u2 − R̄4 )∇x̄4 H(x̄4 ) + E4 ]. (28)
" 1 #
Debido a que J4 es una matriz anti-simétrica este término es
0 −1 R5 0 x4
J4 = R̄4 = E4 = L4 cero para todo tiempo, entonces
1 0 0 R6
−Vr V̇ (x̄4 ) = −∇Tx̄4 H(x̄4 )R̄4 ∇x̄4 H(x̄4 ) + ∇Tx̄4 H(x̄4 )E4 . (29)
−1
x5 C5 0 Se define Υ4 = E4 y Y4 = ∇x̄4 H(x̄4 ) entonces
x̄4 = D4 = .
x6 0 L−1
6 V̇ (x̄4 ) = −Y4T R̄4 Y4 + Y4T Υ4 (30)
3.1 Análisis de pasividad de Σ1 por lo tanto el sistema es estrictamente pasivo a la salida desde
Υ4 −→ Y4 .
Se propone la función de almacenamiento de energı́a Una vez realizada la prueba de pasividad de los cuatro sub-
1 2 sistemas, es posible ver al sistema en general como una inter-
V (x1 ) = x (17)
2C1 1 conexión de sistemas pasivos como el mostrado en la Fig.3,
con derivada evaluada a lo largo de las trayectorias del sistema y por los teoremas de pasividad de sistemas retroalimenta-
dada por dos, podemos concluir que el sistema es pasivo ip −→ y1 y
−Vr −→ z6 .
1 1 1
V̇ (x1 ) = x1 − x1 − x2 + ip . (18) Una vez realizada la descomposición y garantizando que la ley
C1 R1 C1 L2
de control no afecta las propiedades de estabilidad del sistema,
Se define µ1 = − L12 x2 + ip , y1 = C11 x1 entonces se trabajará con los sistemas Σ2 y Σ4 para el diseño de las dos
1 leyes de control, debido a que la estructura de Σ2 y Σ4 son
V̇ (x1 ) = − y12 + y1T µ1 (19)
R1 iguales, se presentará la técnica en general y después se aplicara
por lo tanto es estrictamente pasivo a la salida desde µ1 −→ y1 . para cada caso.
Se propone la función de almacenamiento de energı́a Como se mencionó anteriormente, la ley de control no aparece
1 de forma explı́cita en la prueba de estabilidad, por lo tanto se
V (x̄2 ) = x̄T2 D2 x̄2 (20) propone trabajar con el sistema incremental que se obtiene al
2 definir un error del estado y del control, como se muestra a
con derivada evaluada a lo largo de las trayectorias del sistema
continuación:
V̇ (x̄2 ) = ∇Tx̄2 H(x̄2 )[(J2 u1 − R̄2 )∇x̄2 H(x̄2 ) + E2 ]. (21) x̃(2,4) = x̄(2,4) − x?(2,4) (31)
Debido a que J2 es una matriz anti-simétrica este término es ũ(1,2) = u(1,2) − u?(1,2) (32)
cero para todo tiempo, entonces
donde x?(2,4) y u?(1,2) , son los valores deseados, los subı́ndices
V̇ (x̄2 ) = −∇Tx̄2 H(x̄2 )R̄2 ∇x̄2 H(x̄2 ) + ∇Tx̄2 H(x̄2 )E2 . (22) (2,4) se refieren al sub-sistema Σ2 y Σ4 y los sub-indices (1,2)
248
CHAPTER 9. NEW ENERGIES
V̇(2,4) (x̃(2,4) ) =
ip µ1 y1
+
- Σ 1
− ∇Tx̃(2,4) H(2,4) (x̃(2,4) )R̄(2,4) ∇x̃(2,4) H(2,4) (x̃(2,4) )+
+ ∇Tx̃(2,4) H(2,4) (x̃(2,4) )J(2,4) ∇x?(2,4) H(2,4) (x?(2,4) )ũ(1,2) +
y3 Σ (u )
2 1 µ3 entonces el sistema es pasivo desde la entrada U(2,4) −→
Y(2,4) , donde
-1 υ2 ũ1
µ4 y4 U2 = =
υ3 Ẽ2
+
- Σ 3
Y2 =
γ2
=
T
−∇x?2 H2 (x?2 )J2 ∇x̃2 H2 (x̃2 )
γ3 ∇x̃2 H2 (x̃2 )
por lo tanto
z6 y5 µ5
Σ (u )
4 2 -1
Vr V̇2 (x̃2 ) = γ2T υ2 − γ3T R̄2 γ3 + γ3T υ3 .
Entonces, el sistema Σ2 es pasivo desde υ2 −→ γ2 y estricta-
(38)
249
CHAPTER 9. NEW ENERGIES
250
CHAPTER 9. NEW ENERGIES
Voltaje del MFV real y deseado Corriente del inductor de enlace L4 deseado y real
40
30 0.3
Amperes [A]
Volts [V]
z1
20 0.2 z4
z1d
0.1 z4d
10
0
0
0 0.5 1 1.5 2 2.5 3 3.5 −0.1
0 0.5 1 1.5 2 2.5 3 3.5
Tiempo (s)
Tiempo (s)
Corriente del inductor L2 deseado y real Voltaje del capacitor C5 deseado y real Voltaje del capacitor C5 deseado y real
5 658
800
4
Amperes [A]
600 656
z
Volts [V]
Volts [V]
3 2 z5 z
654 5
2 z2d 400
z z5d
5d 652
1 200
0 650
0 0.5 1 1.5 2 2.5 3 3.5 0
0 0.5 1 1.5 2 2.5 3 3.5 1.2 1.25 1.3 1.35 1.4 1.45 1.5 1.55 1.6
Tiempo (s)
Tiempo (s) Tiempo (s)
Voltaje del capacitor C3 deseado y real Corriente del inductor de enlace L6 deseado y real
800
2
600
Amperes [A]
Volts [V]
1
z3
400
z
6
z3d 0
z6d
200
−1
0
0 0.5 1 1.5 2 2.5 3 3.5 −2
0 0.5 1 1.5 2 2.5 3 3.5
Tiempo (s)
Tiempo (s)
Fig. 5. Primeros tres estados y estados deseados del sistema Fig. 6. Últimos tres estados y estados deseados del sistema
por la potencia del MPV. Como se quiere trabajar en el punto 1
Señal de control u1
0.06
Señal de control u1
debe ser igual al de entrada menos la potencia disipada por las 0.05
Amplitud
Amplitud
0.6
Ps = Pe − Pp
0.2
0.035
0 0.03
0 0.5 1 1.5 2 2.5 3 3.5 0 0.5 1 1.5 2 2.5 3 3.5
donde Ps es la potencia de salida, Pe es la potencia del MPV, Tiempo (s) Tiempo (s)
siguiente manera
0.2 0.1
Amplitud
Amplitud
0.1
0
0
Pe = z1 ip .
−0.1 −0.1
−0.2
−0.2
−0.3
251
CHAPTER 9. NEW ENERGIES
Potencias en el sistema
PMPV
150 Pextraída
Ppérdidas
100
50
0
Potencia [W]
−50
−100
−150
−200
252
CHAPTER 9. NEW ENERGIES
Abstract: The concept of sustainability is known more than 30 years ago, being a topical worked in the
academic sphere more frequently than in industry. The pillars of sustainability (financial, environmental
and social) are not commonly found in process control and industrial projects as one of its drivers. In the
environmental area, reducing consumption of energy, raw materials and waste production is essential.
This article describes a procedure to determine how to improve the energy efficiency of a dynamic
process reducing energy consumption in the system while fulfilling its control objectives. Bond Graph
method is used for modeling a complex case in its mechanical and electrical versions. The magnitude of
various components of the system is manipulated so that during its transient behavior the system response
reaches a steady state consuming the least possible amount of energy, without sacrificing production
efficiency. To force the system to meet a given reference and reduce power consumption, a PID
controller is applied to manipulate the external source. An evolutionary algorithm is used to modify the
elements magnitude and to tune the PID controller. This proposal focuses on saving energy in controlled
dynamic industrial processes through an optimization process that significantly improves performance.
Key Words: Sustainability, Evolutionary Computation, Dynamical Systems, Energy Efficiency, MAGO
algorithm, Bond Graph, PID.
has a minimal environmental footprint. The challenge for
1. INTRODUCTION
control engineering is to achieve a more sustainable way for
process operation, using less energy, less raw materials, and
1.1 Sustainable Industrial Processes producing less waste, besides satisfying various behavior
references.
An industrial process with a focus on sustainability is defined
as the set of stages through which the processing of raw Energy consumption is part of the environmental shaft of
products materials, products, residues and waste, taking into sustainability and represents a high impact factor in any
account the rational use of energy, optimizing utilization of industrial process. This work focuses on reducing the energy
materials and minimizing or eliminating the presence of demand of an industrial system while meeting its goals of
residues and waste (Pérez & Meza, 2013). Globally the issue control.
of sustainability in development since 1983, when the
Brundtland Commission (CFR, 1987) the term sustainability
1.2 Energy Efficiency
was defined as "the ability to meet the needs of the present
generation without compromising the ability of future Energy Efficiency actions are understood as activities that are
generations to meet their own needs." In a study by the World aimed at improving or maintaining production at the lowest
Bank (WBG, 2011) makes evident that by including the hub possible energy consumption (Pareja & Parra, 2014). In
of sustainability (financial, environmental and social) in others words, optimize energy use. The purpose is to obtain
every action of the organization, two positive effects arise: the expected performance in a production process, but with
There is an improvement in its financial performance, the least possible energy. For these characteristics can talk
resulting in more than 5% better performance with respect to about optimization of industrial processes. This optimization
those not including these criteria, and demonstrates how is linked directly to maximize or minimize some indicator of
investors are decided by those companies that have the three performance or process efficiency.
pillars of sustainability as their top three priorities.
A very common case, and an easy way to illustrate the term
Nowadays, what matters is not only that the system performs energy efficiency, is finding the optimum value of a given
under its designed nominal conditions but that its operation function and that is also global. This amount can be a
253
CHAPTER 9. NEW ENERGIES
maximum or minimum value, as appropriate. This article provides the state equations of the system in terms of energy,
presents a method to improve energy efficiency in a based on power.
production process that consumes energy, by manipulating
The power flow between systems, subsystems and
the magnitude of some components of the system. The
components is represented by a line or "bond" which carries
efficiency in its natural and controlled behaviors, with
associated the variables effort e(t) and flow f(t), Figure 2, the
different dynamical instantiations of the manipulated
product of these variables gives the instantaneous power P(t)
elements, is compared.
transmitted by the "bond".
The remainder of this paper is divided into four sections.
Section 2 presents the problem to be solved by the
description of a mechanical system, its electrical equivalent
and the equations of state using the Bond Graph technique. In
section 3 the energy consumption of a dynamic element of
the system is evaluated. Similarly, the total consumption by a
comparison between energy consumption in the element and Fig. 2. Graphical representation of instantaneous power.
the external source without control action vs power Other important variables in the description of dynamic
consumption with control action, manipulating three separate systems are momentum p(t) and displacement Q(t). The
system components. In section 4, the analysis is rethought moment is defined as the integral of the effort over time
modifying the system by controlling the external source via a (Equation 1). The displacement Q(t) is the integral of flow
PID controller and looking for minimal power consumption over time (Equation 2).
in both the element of interest and in the source. The
t
conclusions are presented in Section 5.
P(t ) e(t )dt (1)
0
2. PROBLEM TO BE SOLVED
t
Q(t ) f (t )dt (2)
2.1 Mechanical System 0
254
CHAPTER 9. NEW ENERGIES
L1
DC
C2
L2
Fig. 5. Inertial port.
I0 C1
R1
Sources of flow (Figure 6) are those elements that supply DC
kinetic energy to the system. They are represented by Sf, and
physical examples are intensity or speed sources and
hydraulic pumps. The sources of effort (Figure 6) inject
potential energy. They are represented by Se. Examples are Fig.8. Equivalent electrical System
voltage, pressure, force.
From the scheme, flow equations in capacitors and effort
equations in coils can be obtained, as follows:
Flow 2 = Intensity in capacitor C1
Fig. 6. Flow and Effort sources representation. p1 (3)
f 2 Io
L1
2.3 System Modeling
Flow 8 = Intensity in capacitor C2
Bond Graph technique is used for modeling mechanical p1 p 2
f8 (4)
system of Figure 1. The first thing necessary is to determine L1 L 2
the "bond" of each element and junction in which an effort or
flow is applied (each bond has an associated effort and flow). Effort 4 = Voltage in coil L1
q1 p1 p 2 q 2
After defining all physical interactions occurring in the e4 Va R1( ) (5)
system proceeds to link the bond (Vera Alvarez & Caballero C1 L1 L2 C 2
Ocaña, 2008) taking into consideration the following criteria:
Effort 12 = Voltage in coil L2
• The O symbol is used for the junction of bonds where their p1 p 2 q 2
effort is the same. e12 R1( ) Vb (6)
L1 L2 C 2
• The 1 symbol is used for the junction of bonds where their
flow is the same. Where p1 and p2 are the momentum in coils 1 and 2
respectively, and q1 and q2 are the displacements in
• The bonds using a source (effort or flow) enter the junction.
capacitors, all time-dependent variables.
• The bonds that not use a source (effort or flow) leave the
junction. Considering that:
dq(t )
Given the above criteria, the Bond Graph model equivalent to f (t ) (7)
the system in its mechanical and electrical versions is dt
presented in Figure 7. dp(t )
e(t ) (8)
dt
It is replaced in the above equations and have:
dq1(t ) p1(t )
f2 Io (9)
dt L1
255
CHAPTER 9. NEW ENERGIES
Tf
En12(t ) Pot12(t )dt (14) Given the nonlinearity, uncertainty in the convexity and the
0 structure of the model in differential equations, this is a
complicated problem to solve by traditional optimization
algorithms. For this reason, a heuristic technique for its
3. ENERGY ANALYSIS solution is used: The Multidynamics Algorithm for Global
The equations of energy consumption for a system element, Optimization (MAGO, by its acronym in English); formally
the coil 2, were presented. It aims to minimize the energy presented in (Hernandez & Ospina, 2010).
consumed by this element in its transitory state until it The Multidynamics Algorithm for Global Optimization
reaches its natural value (voltage in electrical circuit, force in (MAGO) is an evolutionary heuristic method that transforms
mechanical version). Manipulating the magnitude of another groups of individuals to achieve a goal, likening their
element of the system the coil 2 reaches its natural value behavior to the process of evolution of species. Unlike other
voltage with minimal power consumption. In Section 4, the evolutionary algorithms, MAGO does not used genetic
manipulation of another element of the system is performed operators for the evolutionary process but statistical from the
by a PID control. same population.
Next, energy analysis taking as a starting point the behavior In this paper, MAGO is used to modify the magnitude of
in the natural state of the system is presented. The results are some system elements in the transitory in order to reduce
compared after handling other system elements, which are: energy consumption in the coil L2 and also to calculate the
the resistor R1 and capacitors C1 and C2 (Figure 9): parameters of the PID controller to force behavior in the coil
L2 while achieving minimum energy consumption
3.1 Natural Response throughout the system.
Table 1 presents results of energy consumption for this case. MAGO was born as a hybrid among distribution estimation
algorithms and the statistical quality control. As with other
Table 1. Results of Natural State evolutionary algorithms MAGO begins with a population of
possible solutions distributed randomly throughout the search
Energy Units of Energy space, which is divided autonomously by the algorithm into
Coil 2 5,9973E+03 three subgroups with their own evolution, called Emergent
Total 5,3093E+06 Dynamics, Crowd Dynamics and Accidental Dynamics.
3.2 Energy Control by System Elements The MAGO algorithm uses the covariance matrix of the
population of each generation to establish an exploration
3.2.1 Energy Control by Resistance Handling. distribution and creating three subgroups or dynamics that
The control action is placed in the circuit resistance R1, so make all individuals in each generation. Emergent Dynamics
that its magnitude remains time dependent. Control is directly (G1) creates a small group of individuals within the
manipulating the magnitude of the resistor R1. population around the individual with better genetic
characteristics, displacing the best ones and making them
L1 compete with each other. This group is the evolutionary elite
of each generation, that is, the fittest individuals contributing
DC
C2
L2
with their genes to the next generation. They are spread
I0 C1
throughout the search space. The second group, the Crow
R1(t)
Dynamics (G2), generates individuals around the current
DC
population mean and first standard deviation. This population
subgroup, located in a sector of the search space, is changing
its location during the evolutionary process. The crow
dynamics can be close to the emergent dynamics, but never
Fig. 9. Electrical System with variable resistor
enough to be confused. Only until there is sufficiency and
The goal is to minimize the energy consumed by the coil L2 necessary to ensure full exploration of the search space
in the transient using the control action in the resistance R1 conditions, these two dynamics could be merged within the
(t), until it reaches its natural voltage value. territory, usually at the end of the evolutionary cycle. The
third group, the Accidental Dynamics (G3), follows the
3.2.2 Optimization Algorithm
quantum speciation, in that is established throughout the
The objective function is expressed by equation 15. search space generation by generation in isolation from other
individuals of the other dynamics. Their role is to maintain
Tf
p1(t ) p 2(t ) q 2(t )
2
Tf
min Pot12(t )dt min R(t ) Vb L2 dt diversity and ensure the stability of the algorithm. These
R (t ) 0 R (t )
0 L1 L 2 C 2
subgroups cannot interbreed. In every generation the union of
s.t.
the three subgroups formed all the current population. In
Figure 10 the MAGO basic flow is shown.
R(t ) 0t
(15)
256
CHAPTER 9. NEW ENERGIES
For the model of Figure 9 the following parameters were set: Energy on coil 2 Units of Energy
Without Control (R1, C1, Fixed) 5,997E+03
L1 = 1 L2 = 1 C1 = 1 C2 = 1 With Control (C1 Variable) 5,914E+03
R = 1,56 (Initial Value to calculate R(t)) Total Energy Units of Energy
Without Control (R1, C1, Fixed) 5,309E+06
Va = 5 Vb = 5 I0 = 20 With Control (C1 Variable) 5,595E+06
The simulation begins with a quasi-optimum value of R,
which has an apparent minimum consumption of energy in A slight decrease in the power consumption of the coil L2
the coil 2. Since the percentage of saved energy and the after the control action from capacitor C1 in the system is
variables behavior are compared, the units of each element seen.
are ignored.
257
CHAPTER 9. NEW ENERGIES
3.4 Energy Control by Capacitor Handling C2 (variable) 4.1 Natural Response (Without Controller)
The results of energy consumption for the natural state case
The results in the implementation of control for the case of
are presented in Table 5.
the capacitor C2 are presented in Table 4.
Table 5. Natural State Response
Table 4. Comparison 3
Energy Units of Energy
Energy on coil 2 Units of Energy
Coil 2 1,014E+03
Without Control (C2 Fixed) 5,997E+03
Total 4,466E+06
With Control (C2 Variable) 3,303E+03
Total Energy Units of Energy 4.2 Response with Resistor R2 controlled
Without Control (C2 Fixed) 5,309E+06
With Control (C2 Variable) 7,247E+06 The reference is set to a magnitude of 10 units in the L2
element (voltage in the electrical system and force in the
mechanical system). The initial value of R2 is 0.7.
The decrease in energy consumption in the coil L2 is visible
when the transitory is handled from C2 to reach the natural In Figure 12 the proposed implementation of a PID controller
response. varying the value of the resistor R2 as a method of
controlling the intensity source is appreciated. The results of
The results show that near the C2 element, a PID controller energy consumption for this case are presented in Table 6. ,
must be located to minimize energy consumption in the In Figures 13 and 14, from the results for energy optimization
transitory state of the system and to force a reference in L2, can be observed the control action and the energy
behavior on element L2. consumption (momentum) in L2.
L1
DC
4. CONTROL BASED ON ENERGY EFFICIENCY C2
L2
From the above results it is evident that it is possible to I0 P2(t)
PID R2(t) C1
R
minimize energy consumption in both the system and as in a DC
specific element (coil L2), manipulating different elements of
the system.
In order to control the steady state response of the system, in
this case, momentum P2 and a reference in coil L2, by the Fig. 12. System configuration with PID control.
particular configuration of the system placing a resistor in
parallel with the source was decided. This arrangement is Table 6. Results for R2 Controlling only energy
equivalent to a controllable external source. See Figure 11. consumption in L2.
Additionally, it will be noted how power consumption is Energy Units of Energy
changed in the coil. The magnitude of the additional Coil 2 1,0105E+03
resistance will be varied by adjusting the parameters of a PID Total 4,6496E+06
controller.
L1
DC
C2
L2
I0 R2(t) C1
R
DC
258
CHAPTER 9. NEW ENERGIES
259
CHAPTER 9. NEW ENERGIES
5. CONCLUSIONS our-common-future-brundtland-report/p26349
Retrieved: May 30, 2015, 13:46
It is possible to modify the energy consumption of an Borutzky, W. (Ed.). (2011). Bond Graph Modelling of
engineering system applying advanced optimization Engineering Systems. New York, NY: Springer New
algorithms and using PID controllers, so that such York. http://doi.org/10.1007/978-1-4419-9368-7.
consumption is minimal and in turn the system meets a Breedveld, P. C. (1985). Multibond graph elements in
desired behavior. physical systems theory. Journal of the Franklin
Institute, 319(1-2), 1–36. http://doi.org/10.1016/0016-
The proposed method first was applied to achieve energy 0032(85)90062-6.
savings in the transitory state of the system. Therefore, in Hernández, J. A., & Ospina, J. D. (2010). A multi dynamics
processes where consumption in the transitory state is greater, algorithm for global optimization. Mathematical and
energy savings will also be higher. Computer Modelling, 52(7-8), 1271–1278.
http://doi.org/10.1016/j.mcm.2010.03.024
The method was also successful when it is applied to achieve Karnopp, D., Margolis, D., & Rosenberg, R. (2006). System
a desired behavior and in turn minimizing energy Dynamics. (John Wiley and Songs, Ed.). New Jersey.
consumption. With a broader sensitivity analysis, better Mukherjee, A., Karmakar, R., & Samantaray, A. (2006).
energy savings for the established control conditions can be Bond Graph in Modeling, Simulation and Fault
found. Identification. (I. K. International Publishing, Ed.).
New Delhi
The proposed method is independent of the energy domain of Pareja, R., & Parra, L. (2014). Eficiencia Energética:
the engineering system. Here it was applied to mechanical Herramienta para la sostenibilidad (1st ed.). Colombia.
and electrical system engineering cases. Its extension to Paynter Henry. “An epistemic prehistory of Bond Graphs”.
mechatronic or multidomain systems, by Bond Graph En P. Breedveldand G. Dauphin-Tanguy, Eds., Bond
modeling, is elementary. Graphs for Engineers. Amsterdam. North-Holland.
Pérez, J. L., & Meza, V. S. (2013). Los procesos industriales
Among the experiments, it was apparent that it is possible to sostenibles y su contribución en la prevención de
minimize energy consumption or increase energy efficiency problemas ambientales. Industrial Data, 16(1).
of the system, including additional elements. A possible Vera Álvarez, C., & Caballero Ocaña, M. (2008).
future work is to find both the optimal location and the best Introduccion a La Tecnica De Bond Graph En La
element that minimize energy consumption and maximize the Dinamica Ferroviaria. Ferroviaria’9R, 291–308.
system efficiency while simultaneously the given conditions Retrieved from http://ruc.udc.es/bitstream/2183/10640
are met. /1/CC41 art 28.pdf.
WBG - World Bank Group. (2011). International Finance
Including sustainability as one of the criteria for tuning Corporation, The business case for sustainability.
controllers adds more value to the organization from the http://www.ifc.org/wps/wcm/connect/topics_ext_conte
financial, producing savings from planning, gaining control nt/ifc_external_corporate_site/ifc+sustainability/learnin
engineering greater recognition, generating lower risks and g+and+adapting/knowledge+products/publications/publ
better environmental and social performance. ications_brochure_businesscaseforsustainability
Retrieved: May 30, 2015. 13:00.
Extending the control of sustainable industrial systems
throughout the lifecycle of the process or product, in addition
to meeting the nominal production conditions, is an option to
facilitate the assessment of the water footprint, carbon
footprint and energy footprint, compliance with current
regulations and becomes an excellent complement to the
traditional indicators of projects.
ACKNOWLEDGEMENTS
REFERENCES
CFR - Council on Foreign Relations (1987) Brundtland
Report. http://www.cfr.org/economic-development/
report-world-commission-environment-development-
260
CHAPTER 9. NEW ENERGIES
Abstract: The rapid transformation that the electric grid is facing towards the smart grid is
a high trend in research nowadays. In particular, developed countries have being working in
several projects that test and validate the new power system model given the advantages that it
offers such as create a more efficient grid, increase reliability, robustness and promote a rational
use of energy . In contrast, Colombia and similar countries recently began the attention to smart
grid projects. Therefore, new tools that encourage the work on control systems oriented to the
smart grids should be developed.
This paper presents a low cost microgrid small scale model designed and built on a laboratory
environment. The project intents to serve as a tool in order to fill the gap between the knowledge
and the interest associated with microgrid and smart grid at the E.E. undergraduate level.
A clear description of the main functionalities and operation of the microgrid model such as
distributed generation, intentional islanding, and sensitive loads is given. Also, two of the most
important key aspects of the smart grid are explored in the prototype: advanced metering
infrastructure and demand response. Finally, the project proves to be an example of motivation
for the smart grid research in our academic sector.
261
CHAPTER 9. NEW ENERGIES
262
CHAPTER 9. NEW ENERGIES
4. SMART GRID FEATURES OFF: If, P rice = High;
P BP P articipants = (1)
ON: If, P rice = Low.
The use of the test bed extends from the microgrid
operation to include two of the main characteristics of The simulated customers that participate on the IBP react
the smart grid. The advanced measurement infrastructure as a result of some incentive signal. The participants of
263
CHAPTER 9. NEW ENERGIES
IBP sign a contract with the energy provider in which that reason, an optimized storage system that takes all
the company gives some incentive (rate discounts, money, the power delivered from the renewable energy sources and
etc.) in exchange for apply direct control and disconnect avoids injecting it into the main grid is needed. Another
them. The direct control will happen only in some special option is to implement a control strategy that ensures not
situations defined in the same agreement. to produce more energy than the energy consumed by
the microgrid. In a real microgrid application this could
For demonstration purposes the negotiation terms consist
happen not only when the microgrid loads fail but also
that the participants (some residential) will disconnect
when the DG supply is greater than the power consumed
from the microgrid each time the highest peak in the
by the load.
energy demand profile occur. The participants on this DR
program will act depending on the condition shown in eq. Island mode The Fig. 7 (from 35s to 105s) shows
(2). an example of the islanded operation. Figs. 7b, 7c, 7d
show that even though all load nodes have 120V only
OFF If: 18h ≤ t < 22h. the hospital is consuming energy, which indicates that
IBP P articipants = (2) the residential and mall node are disabled by the central
ON Otherwise.
controller. The hospital (sensitive load) keeps operating
5. EXPERIMENTAL RESULTS normally and consuming all its power without being af-
fected by the blackout.
In order to show the correct operation of the small scale
The behavior of the microgrid test bed during the island
microgrid test bed, a test protocol divided in two parts
mode illustrates the priority that the microgrid gives to
has been designed. First, six different scenarios working
sensitive loads and how helpful a microgrid system is when
in real-time prove the different operation modes of the
those type of loads are involved.
microgrid moving towards islanded mode and returning
to the grid connected mode. Second, another scenario to Between 35s and 105s in Fig. 7e it seems that the DG
test the implementation of the DR programs has a fixed produces more energy than its maximum capacity which
simulation time. In the DR scenario one week simulation is 15W, but this is because the battery supports the DG
takes seven real-time minutes. Due to the space limitation to supply all the power necessary to the sensitive loads.
only the performance of the microgrid in grid-connected,
islanded and DR modes are presented.
Fig. 6. Different Load Failure during microgrid grid con- Fig. 7. Island mode
nected operation
A special event is shown in Fig. 6 between 123s and Demand response For the analysis of the different
160s. When all the loads are off due to their failures, DG DR programs operation, data was collected by setting the
continues to deliver its power and notice that the power system running in a period of one week simulated time for
registered by the main grid sensor (in black straight line) each program. The objective of this demonstration is to
is below 0W. This case shows how the microgrid delivers present the behavior of the system when a group of users
those 15W from the DG to the main grid. decide to participate in a DR program. By changing the
pattern of the energy the DR participants could receive
For the majority of countries, the trade of energy between economic benefits such as lower prices, subsidies or directly
customers and the energy system is not regulated. For payment for the amount of energy they reduce. At the
264
CHAPTER 9. NEW ENERGIES
.
Fig. 11. Consumption of energy of mall in DR price
program
Fig. 8. One week Load demand profile in Bogotá, Colombia Residential load represents a power reduction of 14W
Xm Colombia (2012) approximated in the main grid node (see Fig. 12).
Fig. 14 demonstrates how the commercial node does not
Price plan: Fig. 14 and 15 show how the residential have any reduction on its energy consumption during
and commercial power consumptions behaves when they the incentive base program. This illustrate that the com-
decide to participate in the price plan. In the different mercial customers are not interested in participate in a
simulations, the limit of power for the low price rate program where the energy company has the decision to
was 1950 MW. When the power demand in Bogotá data connect/disconnect. In general, the mall does not want to
corresponds to a value over that limit the price signal will allow direct control because it can lead not only to comfort
show high price. Then, at that moment some residential degradation but also to economic losses.
customers and circuits of the mall will be turned-off.
During the two peak periods of the day (See Fig. 10) the
the residential node reduced the energy consumption from
the average of 19W to 14W. The same situation happens
with the mall (See Fig. 11), it reduces its consumption
on the high price periods from 23W to 15W. Notice that
in both results only a part of the consumers reacts to
the price signal not the whole node (residential or mall).
The changes on the consumers use of energy also has an Fig. 12. Energy delivered from the main grid
direct impact on the energy consumption of the main grid
reducing the energy deliver from 65W to 50W as it is
shown in Fig. 9. Finally, during the weekend no one is
disconnected from the microgrid because the consumption
of Bogotá never cross 1950 MW limit, then the price signal
is always low price (See Fig. 8).
265
CHAPTER 9. NEW ENERGIES
is needed. For instance, the test bed includes: power sys- Heydt, G.T. (2010). The next generation of power dis-
tem integration, control systems, communication network tribution systems. Smart Grid, IEEE Transactions on,
development and electronic design. 1(3), 225–235.
Jeon, J.h., Kim, J.y., Kim, H.m., Kim, S.k., Cho, C., Kim,
The microgrid small scale test bed was presented to E.E
J.m., Ahn, J.b., and Nam, K.y. (2010). Development
students from last semester and motivate some of them
of Hardware In-the-Loop Simulation System for Testing
to develop their senior projects in topics related with
Operation and Control Functions of Microgrid. IEEE
microgrid and smart grids. The different projects where: an
Transactions on Power Electronics, 25(12), 2919–2929.
automation of a low power educational fuel cell system, a
doi:10.1109/TPEL.2010.2078518.
communication protocol oriented to smart grids, a robotic
Jin, G.Y. and Tseng, K.J. (????). System Design for the
platform of electric vehicles and a ring main unit (electrical
Building-integrated Microgrid Test-bed - a Case Study.
measurement device) prototype.
Lasseter, R.H. (2002). MicroGrids. In 2002 IEEE Power
Finally, the project was shown to CODENSA the energy Engineering Society Winter Meeting. Conference Pro-
distribution company of Bogotá. Given the positive opin- ceedings (Cat. No.02CH37309), volume 1, 305–308. Ieee.
ion that the test bed received, the university get the funds doi:10.1109/PESW.2002.985003.
to create a new version of the test bed which will be a Lasseter, R.H., Eto, J.H., Schenkman, B., Stevens, J.,
micro smart grid testbed prepared for the company. Vollkommer, H., Klapp, D., Linton, E., Hurtado, H., and
Roy, J. (2011). CERTS Microgrid Laboratory Test Bed.
IEEE Transactions on Power Delivery, 26(1), 325–332.
7. CONCLUSION doi:10.1109/TPWRD.2010.2051819.
Medina, J., Muller, N., and Roytelman, I. (2010). Demand
In this work a hardware based small scale microgrid test response and distribution grid operations: Opportunities
bed that integrates control systems, communications, sens- and challenges. IEEE Transactions on Smart Grid, 1(2),
ing technology and power systems in order to illustrates 193–198.
the concept of the microgrid model as well as advanced National Energy Technology Laboratory (2008). AD-
measurement infrastructure and demand response is pre- VANCED METERING INFRASTRUCTURE.
sented. The energy monitoring of the system affords the Nikos, P. and Hatziargyriou (????). More
possibility to obtain in real-time the state of the microgrid microgrids EU pilot projects. URL
and operate it secure and efficiently. In addition, the use http://www.microgrids.eu/index.php?page=kythnos.
of a friendly human user interface which emulates a real Rahimi, F. and Ipakchi, A. (2010). Demand response as a
control system allow a good interaction between the user market resource under the smart grid paradigm. Smart
and the test bed in order to enhance the comprehension of Grid, IEEE Transactions on, 1(1), 82–88.
how a microgrid works. Also, the real time demonstration Song, W.Z., De, D., and Tan, S. (2012). A wireless smart
of the demand response behavior allows to enhance the grid testbed in lab. Wireless Communications, IEEE,
comprehension of this concept and understand how the 19(June), 58–64.
demand side participation can benefit the customer as well Stanovich, M.J., Leonard, I., Sanjeev, K., Steurer, M.,
as the energy company. Roth, T.P., Jackson, S., and Bruce, M. (2013). De-
With these features, the test bed can be used as a tool for velopment of a smart-grid cyber-physical systems
teaching and training in introductory smart grids subjects testbed. 2013 IEEE PES Innovative Smart Grid
as well as be a platform for research. Finally, the project is Technologies Conference, ISGT 2013, 1–6. doi:
an example for the region and similar countries to develop 10.1109/ISGT.2013.6497874.
small scale smart grid test beds which can encourage Tatcho, P., Zhou, Y., Li, H., and Liu, L. (2010). A real
students and energy companies to get involved in smart time digital test bed for a smart grid using RTDS.
grid control research. Power Electronics for Distributed Generation Systems
(PEDG), 2010 2nd IEEE International Symposium on,
658–661. doi:10.1109/PEDG.2010.5545920.
REFERENCES Ubilla, K., Jimenez-Estevez, G.a., Hernadez, R., Reyes-
Chamorro, L., Hernandez Irigoyen, C., Severino, B.,
Adinolfi, F., Burt, G., Crolla, P., D’Agostino, F., Saviozzi, and Palma-Behnke, R. (2014). Smart Microgrids as a
M., and Silvestro, F. (2014). Distributed Energy Re- Solution for Rural Electrification: Ensuring Long-Term
sources Management in a Low Voltage Test Facility. Sustainability Through Cadastre and Business Models.
IEEE Transactions on Industrial Electronics, 0046(c), IEEE Transactions on Sustainable Energy, 5(4), 1310–
1. doi:10.1109/TIE.2014.2377133. 1318. doi:10.1109/TSTE.2014.2315651.
Bonfiglio, a., Delfino, F., Pampararo, F., Procopio, R., Valverde, L., Rosa, F., and Bordons, C. (2013). Design,
Rossi, M., and Barillari, L. (2012). The Smart Poly- planning and management of a hydrogen-based micro-
generation Microgrid test-bed facility of Genoa Univer- grid. Industrial Informatics, IEEE Transactions on,
sity. Proceedings of the Universities Power Engineering 9(3), 1398–1404.
Conference. doi:10.1109/UPEC.2012.6398656. Xm Colombia (2012). Xm: Colombia Power System
Chen, B., Butler-purry, K.L., Goulart, A., and Kundur, Operator.
D. (2014). Implementing a Real-Time Cyber-Physical
System Test Bed in RTDS and OPNET. 1–6.
Conference, M.C., Kabalci, E., Hossain, E., and Bayindir,
R. (2014). Microgrid Test-bed Design with Renewable
Energy Sources. 907–911.
266
CHAPTER 9. NEW ENERGIES
Abstra
tIn this arti
le, we have modeled and simulated a photovoltai
(PV) System in order
to
ompare two dierent approa
hes of Maximum Power Point Tra
king (MPPT)
ontrol
stru
tures. Firstly, we analysed the system
ontrolled with Perturb & Observe MPPT dire
tly
applied to the duty
y
le and the ne
essary simulations were performed and
ompared with
the experimental results. Se
ondly, we
oupled an average
urrent mode
ontrol stru
ture to
the MPPT, improving the settling time and redu
ing the steady-state os
illations. Finally, we
presented the resulting
on
lusions.
Keywords: Solar Energy, Pie
ewise Linear Analysis, Average Current Control Mode, DC/DC
Power Converters, Maximum Power Point Tra
king.
1. INTRODUCTION
DC DC
Pv
Photovoltai
(PV) energy is a lo
ally available renewable DC AC
resour
e with a lower
arbon footprint whi
h be
omes Ipv Vpv
essential due to the environmental impa
ts
aused by the
growing use of the
onventional fossil fuels. In addition, MPPT PWM Control
re
ent eorts seek to make this kind of energy more fea-
sible through the improvement of the system
omponents Figure 1. Photovoltai
system
ontrol
e
ien
y and the implementation of governmental poli
ies
to en
ourage its use. For instan
e, the solar PV global de Brito et al. (2013)), whi
h is
apable of mat
hing the
ele
tri
al
hara
teristi
s of the
onne
ted load to the MPP
apa
ity has in
reased from 16 GW in 2008 to 177 GW
in 2014, (REN21 (2015)), and its study has be
ome more
hara
teristi
s lo
ated in the P-V
urve. There are some
attra
tive to resear
hes of dierent elds. works in the literature where the MPPT algorithm dire
tly
ae
ts the duty
y
le of the
onverter, (E. Koutroulis and
The photovoltai
system is mainly
omposed by the el- Voulgaris (2001); N. Femia and Vitelli (2004)). This kind
ements shown in gure 1. The PV sour
e transforms of systems presents an inadequate voltage or
urrent regu-
sunlight dire
tly into ele
tri
ity and its behavior shows lation. In
onsequen
e the
onverter presents an in
reased
a nonlinear
hara
teristi
with a maximum power point swit
hing stress and losses, (Villalva et al. (2010)). Other
(MPP) des
ribed by the
urrent-voltage (I-V)
urve and works in the literature present MPPT systems a
ompa-
the power-voltage (P-V)
urve, (Sera et al. (2007); Tian nied by
ontrol stru
tures that improve the performan
e
et al. (2012); Barth et al. (2016)). The MPP is
onstantly of its stationary and transitory responses, (Villalva et al.
hanging a
ording to the
onne
ted load and the envi- (2010); Kislovski (1990)). For instan
e, the presen
e of a
ronmental
onditions, hen
e it is ne
essary to adopt an proportional and integral
ontroller
an redu
e losses and
intermediate stage between the PV sour
e and the load, stress, and
an improve the settling time of the
onverter
known as an Maximum power point tra
king (MPPT) and avoids os
illation and overshoot, (M. G. Villalva and
system. This guarantees
onstant power and the highest Ruppert (2009)).
energy harvesting possible. The load may be represented
by a resistan
e in the
ase of a non regulated DC bus. It In this work, we present a Photovoltai
System
ontrolled
an also be represented by a
onstant voltage load, in the with two stru
tures that
ontains the MPPT te
hnique,
ase of a regulated DC bus or a battery, (Chiang et al. Figure 2. Firstly, we analyzed the system
ontrolled with
Perturb & Observe MPPT. We performed simulations and
(2009); Farahat et al. (2012)).
ompared them with experimental results. Se
ondly, the
The MPPT system is
omposed mainly of a DC-DC
on- MPPT
ontrol was
oupled with an average
urrent mode
verter, (Eri
kson and Maksimovi
(2001); Rashid (2004)),
ontrol stru
ture. We simulated and
ompared the results
ontrolled by an MPPT algorithm, (Verma et al. (2016); with the rst
ontrol stru
ture ndings. For these tests,
267
CHAPTER 9. NEW ENERGIES
268
CHAPTER 9. NEW ENERGIES
Panel 25 °c Panel 25 °c
Rs IP V 3
50 G=1000
2.5
+ 2
40
G=800
G=600
P[W]
30 G=400
I[A]
1.5 G=1000
Iph Rp VP V 1
G=800 20
G=600
10
ID IRp
0.5 G=400
− 0
0 5 10 15 20
0
0 5 10 15 20
V[V] V[V]
Figure 3. Five Parameters Model (a) V-I
urve for syk50-18m (b) P-V
urve syk50-18m so-
solar panel at 25 C lar panel at 25 C
where: Panel 1000 W/m2 Panel 1000 W/m2
IP V Current at the terminals of the panel. 4
50 T= 25°c
P[W]
30
I[A]
Rs Equivalent series resistan
e. 2
T= 25°c
20
Ns Number of
ells
onne
ted in series. (
) I-V
urve syk50-18m so- (d) P-V
urve syk50-18m so-
VT Thermal diode voltage (kT /q ). lar panel at 1000 W/m2 lar panel at 1000 W/m2
The equation 1 is mathemati
al impli
it and it is usually Figure 4. Voltage vs.
urrent and voltage vs. power de-
solved using numeri
al methods, su
h as Newton Raphson. livered by a photovoltai
panel. Variation in (a)(b)
However, in simulation, this approa
h may lead to longer radiation and (c)(d) temperature.
simulation times and
onvergen
e problems. Therefore, it
was ne
essary to express the equation 1 in a expli
it way, 2.2 Hybrid system modeling
using the Lambert W approa
h, (Efstratios I. Batzelis and
Papathanassiou (2014)). Both systems of Figure 2 present three
ongurations or
states depending of the values (ON , OF F ) of the diode
The photovoltai
arrays do not exhibit a linear relationship and the transistor, Table 2.2.
V − I , and parameters su
h as parasiti
resistan
es Rp
and Rs need to be adjusted with experimental data or Table 2. State Transition Table
with spe
ial devi
es be
ause most
ommer
ial panels do
State T D
not provide this information. In (Villalva et al. (2009))
S1 ON OF F
the parameters of the Rp and Rs are
al
ulated from an S2 OF F ON
iterative pro
ess that adjusts the V −I
urve with the basi
S3 OF F OF F
data provided by the manufa
turer, su
h as short
ir
uit
urrent, open voltage,
urrent and voltage at the maximum Figure 5 depi
ts the three
ir
uit
ongurations of both
power point and maximum power. In this way it is possible solar systems.
to estimate the parameters for a reliable model of the
solar panel without resorting to spe
ialized measuring The whole system
an be expressed as:
devi
es. We used the solar panel SYK50-18M. Table 1 ẋ = Ai x + B (2)
shows its
hara
teristi
data at standard test
onditions; where i ∈ {1, 2, 3}. The A and B matri
es are dierent for
temperature of 25◦ C and radiation of 1000W/m2 . ea
h
ontrol stru
ture.
Table 1. Parameters for a solar panel SYK50-
18M. First Control Stru
ture This systems is of third order
and its A and B matri
es are dened as follows:
Behavior under standard
onditions STD RL 1
− L L 0
Power at the maximum power point (Pmax) 50 W
Open
ir
uit voltage (Vo
) 22.24 V A1 = − C1in 0 0 (3)
Voltage at maximum power point (Vmpp) 18 V 0 0 − RC1out
Short
ir
uit
urrent (Is
) 3.06 A R 1
Current at maximum power point (Impp) 2.78 A − L L − L1
1
TC Isc 0.034%/◦ K A2 = − Cin 0 0 (4)
1 1
TC Uc −0.34%/◦ K Cout 0 − RCout
Cell number N s 36
Diode ideality fa
tor a 1.3 0 0 0
Parallel resistan
e Rp 454.9310 Ω A3 = 0 0 0 (5)
Series resistan
e Rs 0.2660 Ω 0 0 − RC1out
0
The graphs in Figure 4 were simulated with parameters Rp B = fC(xin2 ) (6)
and Rs obtained as is shown in (Villalva et al. (2009)). In 0
graphi
s 4(a) and 4(b) shown V-I and V-P
urves respe
-
tively for a variant radiation and graphi
4(
) and 4(d) the where f (x2 ) is the expli
it solution of the equation 1 using
behavior of the sour
e under temperature variations. the Lambert W approximation, (Efstratios I. Batzelis and
269
CHAPTER 9. NEW ENERGIES
RL L x1 − RLL 1
L 0 0
− 1 0 0 0
+ A1 =
0
Cin (8)
+ + 0 − RCout 0
1
vP V x2 Cin x3 Cout R 0 1 0 0
R 1 1
− − −L L −L 0
PV − − 1 0 0 0
A2 =
1
Cin (9)
(a) State 1. T = ON , D = OF F Cout 0 − RCout 0
1
0 1 0 0
RL L x1
0 0 0 0
+ 0 0 0 0
A3 = (10)
+ + 0 0 − RC1out 0
vP V x2 Cin x3 Cout R 0 1 0 0
− − 0
PV − f (x2 )
(b) State 2. T = OF F , D = ON B = Cin
(11)
0
−Vref
+ This system
an be represented by the hybrid automata in
+ +
270
CHAPTER 9. NEW ENERGIES
relationship between the simulated data and the data P&O algorithm experimental (550 W/m2 and 25 Ohm).
a
quired experimentally is observed. There is a more 30
Current(A), Referece(0.1)
PowerPv(W),VoltagePv(V)
pronoun
ed ripple in Figure 8, be
ause the a
quisition
ard Reference
PWM
has an input lter that ensures the digital
ontrol and also 20
PowerPv
the number of plotted points is mu
h smaller. Voltage
Pv
CurrentPv
10
5
Figure 11 shows a zoom of the duty
y
le signal. The
0 values vary periodi
ally around the MPP, even though the
−5
graphi
s vary somewhat, they show that the period has an
0 0.1 0.2 0.3 0.4
Time (S)
0.5 0.6 0.7 equal number of steps.
Duty C. experimental variation Duty C. simulated variation
Reference (0.1)100%
4.5
Reference (0.1)100%
4.2
4.45
4.4 4.15
4.35 4.1
4.3
4.05
4.25
4.2 4
0.62 0.64 0.66 0.68 0.7 0.62 0.64 0.66 0.68 0.7
2 Time (S) Time(S)
PWM experimental sweep (550 W/m and 25 Ohm).
25 Reference PWM
(a) a (b) b
Current(A), Referece(0.1)
PowerPv(W),VoltagePv(V)
20 PowerPv
VoltagePv Figure 11. Zoom in PWM signal (a) experimental and (b)
15 Current Pv
simulated.
10
Tests with other radiation levels are also depi
ted, for
5
example in Figure 12 a maximum power point tra
king
0 for a radiation of approximately 390W/m2 and a load 50Ω
−5
are shown.
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7
P&O experimental (~390 W/m2 and 50 Ohm). P&O simulation (~390 W/m 2 and 50 Ohm).
Time (S) 20 20
Reference PWM Reference PWM
Current(A), Referece(0.1)
Current(A), Referece(0.1)
Power (W),Voltage (V)
PowerPv(W),VoltagePv(V)
PowerPv PowerPv
Figure 8. PWM experimental sweep.
Pv
15 15
Voltage Pv Voltage Pv
Current Pv Current Pv
10 10
Pv
5 5
0 0.1 0.2 0.3 0.4 0.5 0.6 0 0.1 0.2 0.3 0.4 0.5 0.6
the maximum power point in simulation and experimen- Time (S) Time (S)
tally. The arrival times at MPP are very
lose and the (a) (b)
dieren
es lie in the impossibility to maintain a
onstant
radiation and temperature, be
ause the test took pla
e Figure 12. Results (a) experimental and (b) simulation for
under atmospheri
onditions. radiation of approximately 390W/m2 and 50Ω
For the se
ond
ontrol stru
ture we used the same param-
P&O algorithm Simulink simulation (550 W/m2 and 25 Ohm). eters as the rst. The parameters of the PI
ontroller are
30 dened as: kp = 0.5, ki = 100, M = 2. Figure 13 shows
Current(A), Referece(0.1)
ReferencePWM
20
Power
Pv where the se
ond
ontrol stru
ture presents better settling
VoltagePv times and lower steady-state os
illations.
Current
10 Pv
4. CONCLUSION
0
In this work, simulation and experimental results of a
0 0.1 0.2 0.3 0.4 0.5 photovoltai
system
onformed by a solar sour
e, a boost
Time (S) power
onverter, a Perturb and Observe algorithm and
a purely resistive load are presented. The system was
Figure 9. Perturb and observe simulated algorithm.
ontrolled with two MPPT stru
tures. The rst stru
ture
271
CHAPTER 9. NEW ENERGIES
Parameter Model Performan
e for Photovoltai
Panels
60 Using Only Referen
e Data. (September), 235245.
Barth, N., Jovanovi
, R., Ahzi, S., and Khaleel, M.A.
50 (2016). {PV} panel single and double diode models:
Optimization of the parameters and temperature depen-
40 den
e. Solar Energy Materials and Solar Cells, 148, 87
98. doi:http://dx.doi.org/10.1016/j.solmat.2015.09.003.
30 Solar Cells and Storage.
Chiang, S.J., Shieh, H.J., and Chen, M.C. (2009). Model-
20 ing and
ontrol of pv
harger system with sepi
on-
Power
photovoltai
systems.
38
Industrial Ele
troni
s, 62(6), 34993507. doi:
37 10.1109/TIE.2014.2369456.
Farahat, M., Metwally, H., and Mohamed, A.A.E.
36 Power Out: System 1 (2012). Optimal
hoi
e and design of dierent
Power Out: System 2 topologies of d
d
onverter used in {PV}
35
Power Reference systems, at dierent
limati
onditions in
34
egypt. Renewable Energy, 43, 393 402. doi:
http://dx.doi.org/10.1016/j.renene.2011.10.021.
0.51 0.52 0.53 0.54 0.55 0.56 0.57 0.58 Filippov, A.F. (1988). Dierential equations with dis-
t
ontinuous rigth-side (1960-russian). Kluwer A
ademi
Publishers.
(b) Zoom Gallo, G., Osorio, G., and Olivar, G. (2012). Event-driven
simulation of lippov systems: The
ase of a nonlinear
Figure 13. Comparison between the two
ontrol te
hniques wien bridge os
illator. In Cir
uits and Systems (LAS-
a
ts dire
tly on the duty
y
le of the
onverter. In the CAS), 2012 IEEE Third Latin Ameri
an Symposium on,
se
ond stru
ture, the MPPT
ontrol is
oupled with an 14. doi:10.1109/LASCAS.2012.6180299.
average
urrent mode
ontrol stru
ture. The last stru
ture Kislovski, A.S. (1990). Dynami
behavior of a
onstant-
improves the settling time and the steady-state os
illations frequen
y bu
k
onverter power
ell in a photovoltai
of the
onverter. battery
harger with a maximum power tra
ker. Tenth
Brazilian Power Ele
troni
s Conf.
ACKNOWLEDGEMENTS M. G. Villalva, J.R.G. and Ruppert, E.F. (2009). Analysis
and simulation of the p&o mppt algorithm using a
Resear
h supported by COLCIENCIAS. Proje
t: "Análisis linearized photovoltai
array model. Tenth Brazilian
de la e
ien
ia de
onvertidores ele
tróni
os de poten
ia en Power Ele
troni
s Conf.
apli
a
iones de genera
ión de energía elé
tri
a, a partir de N. Femia, G. Petrone, G.S. and Vitelli, M. (2004). Op-
fuentes no
onven
ionales". timizing duty-
y
le perturbation of P&O MPPT te
h-
nique. Ele
troni
s Spe
ialists Conf, 3(5), 11981208.
REFERENCES Rashid, M. (2004). Power Ele
troni
s: Cir
uits, Devi
es,
and Appli
ations. Pearson/Prenti
e Hall.
Aoun, N., Chenni, R., Nahman, B., and Bou
houi
ha, K.
(2014). Evaluation and Validation of Equivalent Five-
272
CHAPTER 9. NEW ENERGIES
273
CHAPTER 9. NEW ENERGIES
Abstract: With the rise in the use of renewable energies, solar panels have proven to be
reliable and have a favorable cost-benefit ratio, producing energy free of noise and air pollution.
Solar panels are subject to considerable variations in working conditions due to changes in
solar irradiation levels and temperature, that affect its semiconductor properties. To be able to
profit as much as possible from this source of energy, control of the modules and perturbation
rejection is very important to obtain the highest viable amount of electrical power. This work
is concerned with the on-line identification and control of a photovoltaic system using neural
networks. Having on-line identification and control allow the system to be more adaptable to
changes in weather and other variations than with common off-line methods.
Keywords: Photovoltaic systems, solar energy, high order neural networks, Kalman filter,
maximum power point tracking.
274
CHAPTER 9. NEW ENERGIES
uk = h(xk )
Substituting this in (2) to obtain an unforced system:
xk+1 = F (xk , h(xk )) = F̃ (xk ). (4)
Fig. 1. VI graph of solar panel
Defining a discrete-time recurrent high order neural net-
and the other represents the leakage current from the work (Alanis et al., 2007):
capacitor (Rsh ). The equation that defines the behavior
of said equivalent model is (Cubas et al., 2014): x̂ik+1 = (wi )T z i (x̂k , uk ), i = 1, ..., n (5)
q(VP V +IP V Rs ) VP V + IP V Rs
IP V = Icc − Io (e nkT − 1) − (1) where x̂i is the sate of the i-th neuron, n is the state
Rsh dimension, wi is the respective on-line adapted weight
where k is the Boltzman constant, T is the absolute vector, and z i (x̂k , uk ) is given by:
temperature in the photovoltaic panel, Io is the inverse Y dij(1) Y dij(L )
z i (x̂k , uk ) = [z i1 ...z iLi ]T = [ ψij ... ψij 1 ]T (6)
saturation current of the diode, q is the charge of the
j∈I1 j∈IL1
electron, n is the ideality factor of the diode.
From (1), it is clear that there exists a relationship between where Li is the respective number of high order connec-
the voltage and the current in the photovoltaic panel. tions, I1 , I2 , ..., ILi is a collection of non-ordered subsets of
This relationship can be observed in Fig. 1, which shows 1, 2, ..., n, and ψi is given by:
the existence of a unique maximum power point, PM P P , ψi = [S(x̂1 )...S(x̂n )u1 ...um ]T (7)
for each solar panel depending on the temperature and
irradiance at the moment of measurement. where S(·) is defined as a logistic function.
To be able to successfully follow the maximum power Assuming that the system (2) is observable, it is approx-
point of the solar panel, it is necessary to have a reference imated by the discrete time RHONN parallel representa-
voltage which corresponds to that point, and to design a tion (Rovithakis and Christodoulou, 2000) :
controller to track that reference voltage; this is commonly
known as a maximum power point tracker (MPPT). In xik+1 = (wi∗ )T z i (xk , uk ) + zi (8)
the literature, there are several algorithms that have been
developed for this purpose, based on neural networks, where xi is the i-th plant state, zi is a bounded approxi-
incremental conductance, fuzzy logic, etc, (Esram and mation error, which can be reduced by increasing the num-
Chapman, 2007). In this paper, the MPPT algorithm used ber of adjustable weights (Rovithakis and Christodoulou,
is known as the perturb and observe method which can be 2000) .
implemented in real-time, and it is one of the algorithms Assuming that there exists an ideal weight vector wi∗ such
most commonly used for this purpose. This algorithm is that the norm of the approximation error can be minimized
based on the following criterion: the voltage of the solar on a compact set Ωzi ⊂ RLi . The ideal weight vector
panel is perturbed and if for this new value the power wi∗ is used only for analysis, assuming that it exists and
obtained has been incremented, then a change in that is an unknown constant (Rovithakis and Christodoulou,
direction will be spurred; if on the contrary, the new power 2000) . Defining the estimate of the weight as wi and the
value has decreased, a new perturbation will be realized in estimation error as:
the opposite direction.
w̃ki = wi∗ − wki (9)
The next step is to manipulate the voltage of the panel
(VP V ) to track the voltage generated by the algorithm Since wi∗ is assumed to be a constant, the next expression
(VM P P ), this is accomplished by using a DC-DC Buck is true:
converter, discussed in section 3, which forces the electrical i
output power of the solar panel to reach the desired value. w̃k+1 − w̃ki = wki − wk+1
i
(10)
2.2 Recurrent High Order Neural Networks (RHONN) 2.3 Kalman Filter
In the field of neural networks, usually k denotes a sam- The Kalman filter (KF) estimates the state of a linear
pling step, where k ∈ 0 ∪ Z+ . Also considering the tradi- system with additive state and output white noises (Feld-
tional definitions of | · | as the absolute value and k · k as kamp, L.A., Feldkamp, T.M., Prokhorov, 2001; Brown
275
CHAPTER 9. NEW ENERGIES
with:
276
CHAPTER 9. NEW ENERGIES
In order to obtain a discrete-time model, the Euler dis- In order to test the performance of the proposed neural
cretization method is used: controller, a simulation is developed implementing the
x1k+1 = x1k + αk1 + α2 x3k uk DC-DC Buck converter and the solar panel by means
x2k+1 = x2k + α3 x2k + α4 x3k of Simscape Power Systems 2 blocks, which allows to
(17) consider the components dynamics for future real-time
x3k+1 = x3k + α3 x2k + α6 x1k uk
y(k) = xk1 implementation.
For the realized test, the temperature in the cell is con-
where, sidered constant at 25 ◦ C. At the beginning a 0 W/m2
ts iP V k ts irradiance is applied, then at 1.5 seconds it is increased
αk1 = , α2 = − , to 500 W/m2 , finally at 3 and 4.5 seconds the irradiance
C1 C1 is changed to 1000 and 3000 W/m2 respectively. At the
ts ts beginning of the simulation the plant is left in open-loop
α3 = − , α4 = ,
RC2 C2 to identify the states, then at 0.2 seconds the loop is
ts ts closed and the controller starts to operate. In Fig. 5, the
α5 = − , α6 = . theoretical power level given by (1), using the photovoltaic
L L
panel model and applying the previously described irradi-
3.2 Identification ance changes, is shown in the red dot line; and the power
obtained by the proposed MPPT is displayed in blue line.
Based on the structure of (17), the RHONN proposed for It can be seen that the convergence time of the controller
the DC-DC Buck converter is as follows: and the tracking error are within acceptable boundaries.
x̂1k+1 = wk11 x̂1k + wk12 S(x̂3k ) + wk13 S(iP V k ) + 0.01uk
x̂2k+1 = wk21 x̂2k In Fig. 6, the identification errors are shown. It can be
(18) seen that the estimated state with the biggest error is the
x̂3k+1 = wk31 x̂3k + wk32 x̂2k + wk33 uk
1 2
ŷk = x̂k Simscape Power Systems is a trademark of The MathWorks, Inc.
277
CHAPTER 9. NEW ENERGIES
278
CHAPTER 9. NEW ENERGIES
279
CHAPTER 9. NEW ENERGIES
Abstract: Organic Rankine Cycle (ORC) technology has demonstrated to be a suitable tool for
recovering waste heat at low temperatures. The fluctuating nature of the waste heat source (varying
temperature and mass flow) makes of waste heat recovery applications a challenging task. In this
contribution Model Predictive Control (MPC) and more classical PID-like controllers are investigated,
where special attention is paid to the analysis of the control performance for heat source profiles coming
from different applications. A dynamic model of a real regenerative ORC unit equipped with a single
screw expander developed in the Modelica language is used to test and compare the PID and MPC based
control strategies. Results show that for low amplitude variations PID and MPC can perform equally
good, but in case of large variations MPC is a more effective control strategy as it allows a safer and
more efficient operation, operating close to the boundary conditions where production is maximized.
Keywords: Model Predictive control, Renewable energy systems, Process control, organic Rankine
Cycle.
1. INTRODUCTION at expander inlet that can damage the expansion machine Wei
et al. (2007).
Reducing the world-wide industrial energy consumption is Most of the current studies are restricted to guarantee safety
a major concern in order to ensure guarantee a sustainable conditions by regulating the superheating (see Grelet et al.
development. Despite all efforts to achieve a more efficient (2015) and the references therein), but little attention has been
production, waste heat losses are still an important concern. An paid to the performance of the power unit in terms of energy
attractive technology able to recover heat at low temperatures production. In order to maximize the output power the evap-
is the Organic Rankine Cycle (ORC) system. orating temperature is usually considered as the most relevant
ORC power units stand out for their reliability and cost- controlled variable Quoilin et al. (2011). In Feru et al. (2014),
effectiveness Verneau (1979),Angelino et al. (1984). Replacing the modeling and control of a waste heat recovery system for a
the water by an organic compound opened new challenges, Euro-VI heavy-duty truck engine was achieved through the use
regarding the cycle design, selection of the fluid, modeling, of a switching model predictive control strategy to guarantee
simulation and control design Sun and Li. (2011); Colonna and safe operation of the WHR system and to maximize output
Van Putten. (2007). Such thermodynamic units are designed power. Also in the automotive field, the problem of maximizing
to operate around certain steady-state conditions, however due the power produced by an ORC waste heat recovery system on
to the highly fluctuating nature of the heat source, they are board a diesel-electric railcar is tackled using dynamic real-time
forced to operate at part-load conditions. Control design plays optimization Peralez et al. (2015). In Hernandez et al. (2015),
an essential role to enable a safe and optimal performance of an experimental study is conducted using an 11 kWel pilot plant,
the ORC unit. Safe operation is achieved by an accurate reg- showing that the constrained Model Predictive Control (MPC)
ulation of the superheating, since it is already recognized that outperforms PID based strategies, as it allows to accurately
low values for superheating maximize the cycle efficiency Her- regulate the evaporating temperature with a lower control effort
nandez et al. (2014) and avoid the formation of liquid droplets while keeping the superheating in a safer operating range.
In this study we investigate the performance of MPC and PID
? The results presented in this paper have been obtained within the frame of based strategies to optimally recover waste heat through ORC
the IWT SBO-110006 project The Next Generation Organic Rankine Cycles technology. Using existent components from the ThermoCycle
(www.orcnext.be), funded by the Institute for the Promotion and Innovation library Quoilin et al. (2014), a dynamic model of an ORC
by Science and Technology in Flanders, Belgium. This financial support is system is built for simulation purposes. The model dynamics
gratefully acknowledged.
280
CHAPTER 9. NEW ENERGIES
are coherent with those observed on real systems as presented During the simulations performed in this paper, the generator
in Desideri et al. (2014), where the dynamic models is exper- rotational speed is kept constant at 3000 rpm to emulate an
imentally validated using a low-capacity (11 kWe) waste heat installation directly connected to the grid. The circulating pump
recovery unit equipped with a single screw expander. Using (N pp ) is a vertical variable speed 14-stage centrifugal pump
the developed dynamic model, insight on the system dynamics with a maximum pressure of 14 bar and 2.2 kWe nominal
and optimal operation is achieved, resulting in the development power.
of a real-time optimizer to compute the optimal evaporating
Starting from the bottom of the scheme it is possible to rec-
temperature which maximizes the power generation. The con-
ognize the liquid receiver (b) installed at the outlet of the
troller’s task is to track the optimal set-point generated by the
condenser (a) where the fluid is collected in saturated liquid
optimizer while ensuring a minimum superheating value for
safely operation. condition. From the receiver outlet, the fluid is pumped (c)
through the regenerator (d) cold side, and the evaporator (e),
The paper is structured as follows. Section 2 introduces the where it is heated up to superheated vapor, reaching its maxi-
architecture and main characteristics of the ORC system. Next, mum temperature at the evaporator outlet. The fluid, after being
in section 3 the Extended Prediction Self-Adaptive (EPSAC) expanded in the volumetric machine (f), enters the regenerator
approach to MPC used in this study is briefly described. A hot side, and then it flows into the condenser (a) to close the
low-order model suitable for prediction is then developed using cycle.
parametric identification as described in section 4. The control
In order to assess the performance of the different devel-
structure, design and tuning of the proposed PID and MPC
oped control strategies a dynamic model of the ORC system
based strategies is described in section 5, followed by the
(Fig. 1) has been developed in the Modelica language using
simulation results in section 6. Finally a conclusion section
existent components from the ThermoCycle library Quoilin
summarizes the main outcome of this contribution.
et al. (2014). The developed model is then exported into
Simulink/Matlab
R
environment by means of the Functional
2. PROCESS DESCRIPTION
Mock-Up Interface (FMI) open standard.
This section describes the architecture and main characteristics
of the ORC system used for evaluating the performance of the 2.2 Conditions for optimal operation of an ORC unit
developed control strategies.
In order to optimally operate an ORC unit, two main conditions
need to be satisfied: i) keep the cycle in a safe condition
2.1 The Organic Rankine Cycle System during operation and ii) maximize the net output power. Safe
operation of the ORC unit is important as it allows a longer
In order to assess the performance of the different developed life expectancy in all components. In this concern, an accurate
control strategies, a dynamic model of the ORC system pre- regulation of superheating (∆Tsh ), is the main priority since a
sented in Fig. 1 has been developed in the Modelica language minimum value of superheating has to be guaranteed in order to
using existent components from the Thermo Cycle library avoid a wet expansion (i.e., formation of liquid droplets at the
Quoilin et al. (2014). The developed model is then exported expander inlet that can damage the expansion machine). The
superheating is defined as:
into Simulink/Matlab environment by means of the Functional
Mock-Up Interface (FMI) open standard. ∆Tsh = Texp,in − Tsat,ev (1)
The system based on a regenerative cycle and solkatherm 2.3 Optimal evaporating temperature
(SES36) as working fluid, shows a nominal power of 11 kWe.
The expander is originally a single screw compressor adapted Previous studies have demonstrated the existence of an optimal
to run in expander mode. It drives an asynchronous generator evaporating temperature which maximizes the output power for
connected to the electric grid through a four-quadrant inverter, a given heat source conditions, where a real-time optimizer
which allows varying the generator rotational speed (Nexp ). (RTO) can be built using a steady-state model Quoilin et al.
281
CHAPTER 9. NEW ENERGIES
(2011) or by means of extremum-seeking algorithm Hernandez these additions is the discrete time convolution of ∆U =
et al. (2016). In this paper the first approach is chosen, thus {δ u(t|t), . . . , δ u(t + Nu − 1|t)} with the impulse response
leading to the following correlation used in the RTO: coefficients of the system (G matrix), where Nu is the
Tsat,opt = −290.915 + 183.33 ∗ log10 (Th f ) + 10.636 ∗ ṁh f (4) chosen control horizon.
Equation (4) is valid in the range of 0.5 ≤ ṁh f ≤ 1.5 kg/s and The control ∆U is the solution to the following constrained
90 ≤ Th f ≤ 125 ◦ C given a constant saturation temperature in optimization problem:
the condenser of psat,cd = 1.4 bar.
N2
x(t + k|t) and n(t + k|t) can be predicted by recursion of the Table 1. Nominal operating conditions considered
process model (6) and by using filtering techniques on the noise for the Identification Procedure
model (7), respectively De Keyser (2003).
Parameter Description Value Unit
3.2 Computing the optimal control action N pp Pump rotational speed 1680 rpm
Nexp Expander rotational speed 3000 rpm
Tsat,ev Evaporating temperature 100 ◦C
A key element in linear MPC is the use of base (or free) and
optimizing (or forced) response concepts Maciejowski. (2002). ∆Tsh Superheating 20 ◦C
282
CHAPTER 9. NEW ENERGIES
This strategy is based on a PI controller which is used to track The present study focuses on investigating which are the advan-
the optimal evaporating temperature Tsat,opt . The PI controller is tages of using advanced controllers such as MPC compared to
tuned using the transfer function which relates the speed in the PID-like strategies for the optimal operation of an ORC system
pump to the evaporating temperature (11) for the following de- in waste heat recovery applications. The control strategy task
sign specifications: settling time Tset = 60 s, overshoot percent is to accurately regulate the evaporating temperature (given by
OS% = 0 and robustness Ro = 0.7, obtaining the PI parameters: the RTO), in order to maximize the energy production, while
1 1 avoiding formation of liquid droplets that could damage the
PITsat,ev = K p 1 + = 0.189 1 + (12)
Ti s 1.7813 s expander by ensuring an small amount of superheating ∆Tsh .
Thus in this study we will focus on answering two questions:
During the implementation phase the clamping anti-reset
windup scheme is used to clip the control action into the per- • which are the heat source conditions which represent the
missible range of the pump (table 2). main challenge for any controller? and
283
CHAPTER 9. NEW ENERGIES
Case I − slow and low Case II − fast and low Case III − slow and large Case IV − fast and large
130 130 130 130
90 90 90 90
1000 2000 3000 1000 2000 3000 1000 2000 3000 1000 2000 3000
1 1 1 1
Fig. 3. Heat source profiles due to temperature Th f and mass flow rate mh f variations. Case I: slow and low amplitude variations.
Case II: fast and low amplitude variations. Case III: slow and large amplitude variations. Case IV: fast and large amplitude
variations.
• which controller produces the highest output power while
keeping the safety conditions? Controller comparison − Case III
20
In order to answer those questions, we introduce four different 15
∆ Tsh (°C)
100
90
80
• Case III: slow and large amplitude transitions. Th f = 500 1000 1500 2000 2500 3000 3500
1600
The three strategies tested for cases I and II (not shown here) 1400
4
controllers have a high enough bandwidth and superheating 3
284
CHAPTER 9. NEW ENERGIES
10
De Keyser, R. (2003). Model based Predictive Control for
5
Linear Systems, chapter invited in UNESCO Encyclopaedia
0
500 1000 1500 2000 2500 3000 3500 of Life Support Systems (EoLSS). Article contribution
6.43.16.1, Oxford, 35 pages.
110
Desideri, A., van den Broek, M., Gusev, S., Lemort, V., and
Quoilin, S. (2014). Experimental campaign and modeling
Tsat,ev (°C)
100
1600
1400
6592.
1200
Grelet, V., Dufour, P., Nadri, M., Lemort, V., and Reichel, T.
500 1000 1500 2000 2500 3000 3500 (2015). Explicit multi-model predictive control of a waste
heat rankine based system for heavy duty trucks. In In IEEE
6
Conference on Decision and Control, Osaka, Japan.
Wexp (kW)
285
CHAPTER 10
NONLINEAR SYSTEMS
CHAPTER 10. NONLINEAR SYSTEMS
Abstract: Proper functioning of unmanned surface vessels requires effective trajectory tracking.
To achieve this, upon selecting the method for control system design, the knowledge required
regarding vessel dynamics described by a model must be taken into account. In this case, it would
be advantageous to design the controller requiring the least task effort in modeling for fulfilling
the tracking requirements. This paper proposes an active disturbance rejection control system
based on linear filters applied to tracking arbitrary trajectories of an underactuated unmanned
surface vessel, as an alternative that uses a simple model. Finally, a performance comparison
between the proposed control and another control technique (whose design requires a more
complex model) is presented.
Keywords: Disturbance rejection, Nonlinear control, Marine systems, Flatness based control.
287
CHAPTER 10. NONLINEAR SYSTEMS
displacement speeds u and y, yaw speed r, driving force F 3. ACTIVE DISTURBANCE REJECTION CONTROL
and steering torque T.
3.1. Active disturbance rejection control
The dynamics of a system of order n can be expressed
according to (2), where µ is the control action, b is a
system constant; and f(…) is a function that depends on
unknown and non-linear system dynamics and on its
endogenous and exogenous disturbances w:
x n f x, x, , x ( n 1) , u, w b (2)
288
CHAPTER 10. NONLINEAR SYSTEMS
1
2 ef 1e f 0 e f (10) ex U F cos u sin F
e
u cos r
b
z1 z 2 y U F sin (14)
z 2 3 z 2 e x
(11) v sin vr cos xr
v cos vr sin yr
1
2 3 1 z 2 0 z1 2 e x
b It is considered that r varies faster than the remaining
4. CONTROL SYSTEM DESIGN variables and can be used as an intermediate control; also,
the control of r is carried out using torque T.
4.1. Problem formulation
Using (14) the control can be decoupled through (15). In
The goal is that a UUSV can follow arbitrary trajectories addition, the remainder terms in (14) can be encapsulated
defined in time without restrictions in their form and with in disturbance dynamics, resulting in decoupled dynamics
speed of approximately 1 m/s., xd and yd will be used to (16).
denote a point of the desired trajectory at a given time
instant. The proposed control system is presented in x U F cos u sin F
(15)
Figure 2. y U F sin u cos r
4.2. Trajectory Controller Design
ex x f x , ,
e f , ,
(16)
In the dynamic model presented in (1) the states
y y y
derivatives can be isolated (12).
x u cos v sin In this manner, the system can be controlled by two
independent control laws (17) similar to (6)
y u sin v cos
(12)
r 2 s 2 1 s 0
x ex ,
u U u u u U vrvr U rr r U F F s s 3
2
(17)
v Vv (v, r )v Vr (v, r )r Vuvuv Vur ur VT T
2 s 1 s 0
2
289
CHAPTER 10. NONLINEAR SYSTEMS
The desired control action is isolated from (15), obtaining 4.4. Anti-saturation mechanism
(18); due to the presence of u in the denominators, its
Controllers using integral terms often present saturation,
value in the controller should be set in a range between
resulting in performance reduction. An anti-saturation
umin and umax using the definition of the function (19). The
mechanism is used to avoid this problem, using the
values umin > 0 and umax are chosen such that the desired
projection of saturated actuator errors (27) with respect to
performance of r is not greatly reduced, thus the desired
the desired actuator values (20) on the decoupled control
control can be expressed by (20).
presented in (17). Consequently, the corrections are
cos sin calculated by (28), where βF and βr are parameters that
Fd U F UF x (18) determine the aggressiveness of the anti-saturation
r sin
cos y
d
mechanism and Δµx and Δµy are the required corrections
u u to be applied in (17).
x U F cos
e V H sin F
y 2 3 1 z 2 0 z1 2 e y .
e
y U F sin V H cos H In addition, because torque T may also present saturation
v sin vr cos xr (21) T sat(Td , Tmin , Tmax ), (30)
v cos vr sin yr
an anti-saturation mechanism is provided based on the
RT T
difference between the effective torque and the desired
4.3. Design of angular velocity controller torque
Since the angular velocity r was considered to behave as a r RT T T Td , (31)
system actuator, this should be controlled on its respective
subsystem. Using the torque T as an actuator in so the resulting controller in form of a linear flatness-
based filter is
er RT T , (22)
h1 er r (32)
where er is the error between the angular velocity
r 1er 1 r 0 m1
measured in the system and the desired angular velocity
(20). In a similar way to (16), the subsystem is presented 4.5. Trajectory generator
based on a change of variable and a term of disturbance
dynamics Since the initial values of the desired trajectory xd, yd may
be distant from the UUSV’s origin point, a trajectory
er r f r , , . (23) reference generator xr,, yr is provided, based on a
simplified mathematical model of the UUSV in the form
Since the subsystem is a first order system, the control
law (4) will be used as well as the selection of parameters x r u r cos r
from (5) y r u r sin r (33)
1s 0 r
r er , (24)
s u r a
s 1 s 0 s 2 r r r ,
2 2 2
(25) where the control objective is that xr, yr follow xd, yd. The
controller for the simplified UUSV (33) can be developed
The desired torque is calculated by using a procedure similar to the one used for the original
Td 1 / RT r . (26) UUSV.
290
CHAPTER 10. NONLINEAR SYSTEMS
5. SIMULATION RESULTS AND ANALYSIS the trajectory. Figure 4 shows the tracking of position
variables separately.
The parameters of the vehicle in (Wondergem et. al.
2011) were used to carry out simulations:
25.8 0 0
M 0 33.8 1.0115
0 1.0115 2.76
0 0 33.8v 1.0115r
C 0 0 25.8u
33.8v 1.0115r 25.8u 0
0.72 1.33 u 0 0
D 0 0.86 36.28 v 0.11
0 0.11 5.04 v 0.5
Figure 3. Trajectory tracking
Table 1 shows the parameters used for the UUSV
controller and the trajectory generator. All parameters are
expressed in IS units.
291
CHAPTER 10. NONLINEAR SYSTEMS
292
CHAPTER 10. NONLINEAR SYSTEMS
Abstract: This work proposes an ADRC (Active Disturbance Rejection Control) estrategy
using GPI (Generalized Proportional Integral) observers to automatically stabilize a riderless
moving bicycle with a time variant forward speed. A bicycle has an unstable and non-linear
behavior when in its upright position that can be modeled as a LPV (Linear-Parameter-Varying)
system, making the design of feedback controllers for stabilizing this plant a challenging problem.
The ADRC scheme simplifies the bicycle model to a linear system by grouping all unmodelled
linear and non-linear terms of the model together with external disturbances into an additive
unified disturbance signal as equivalent input of the plant, which is estimated via the GPI
observer and then rejected through the control law. The control strategy effectiveness is validated
by a co-simulation between ADAMS and MATLAB, which exhibits a high performance of the
proposed technique on a non-linear bicycle model closer to the real world physics than simplified
mathematical models.
293
CHAPTER 10. NONLINEAR SYSTEMS
pero perturbado. Bajo esta técnica, las perturbaciones son pares actuando en el sistema debido a las fuerzas gravi-
unificadas en una señal equivalente a la entrada de control, tacional y centrı́fuga y un balance de momento angular
que tiene en cuenta de manera conjunta las dinámicas no con respecto al eje x [Goldstein, 1953], cuando se mueve a
modeladas y perturbaciones externas asociadas. Esta señal cierta velocidad de avance:
de perturbación es estimada mediante un observador GPI Jwϕ̈t mt ghwϕt Dδ̇ tv t mt hδ tv t. (1)
y rechazada en lı́nea por medio de una ley de control por
realimentación de estado que es función de esta estimación La ecuación (1) representa un sistema dinámico de orden
[Cortés Romero et al., 2010]. La estrategia de control n 2, que relaciona la rotación en la dirección δ t, con el
basada en observador GPI, logra que el sistema de lazo ángulo de inclinación ϕt. Esta ecuación tiene dos polos
cerrado recupere el desempeño del modelo lineal nominal ubicados en ¾
tanto como la sintonización de las ganancias del observador mt gh
lo permitan. También, esta estrategia tiene una propiedad p1,2 , (2)
de acción integral que asegura la regulación del error de J
y un cero en
seguimiento a cero, permitiendo una estabilización robusta mt v th
de la bicicleta en su posición vertical. z , (3)
D
2. MODELAMIENTO DE LA BICICLETA que depende de la velocidad de avance v t. En el Cuadro
1, están consignados los parámetros requeridos por la
Cuando la bicicleta está en movimiento, su naturaleza ecuación (1). Estos parámetros se obtuvieron de los valores
inestable se forma por el desbalance de fuerzas y momentos de masa y su distribución en los cuerpos de un modelo
que se originan por la reacción que ejerce el suelo en CAD (Computer Assisted Design) de una bicicleta de
dirección contraria de su peso, su velocidad de avance y montaña real, desarrollado con el software Solid Edge ST4
los giros producidos por la dirección del manubrio. Existen tal y como se muestra en la Fig. 2.
otros efectos que se producen por el movimiento giratorio Cuadro 1. Parámetros de la bicicleta entera.
de las ruedas y el contacto continuo que existe entre ellas
Item Sı́mbolo Valor Unidad
y alguna superficie irregular del suelo, los cuales son, pre- Masa total de la bicicleta mt 12.514197 Kg
cesión giroscópica, deformación, deslizamiento y fricción Momento de inercia con respecto al eje x J 4.095365 Kg m2
dinámica. Todos estos factores imponen en el sistema una Producto de inercia con respecto a los ejes xz D 3.492466 Kg m2
Distancia entre ejes w 1.06016 m
dinámica compleja que dificulta su estabilización. Altura del centro de gravedad h 0.502656 m
Distancia horizontal del centro de gravedad b 0.514899 m
Gravedad g 9.807 m~s2
2.1 Ecuaciones del Movimiento de la Bicicleta Velocidad de avance v t Variable m~s
Considere inicialmente el esquema simplificado que se Item Name Quantity Mass (Item)
muestra en la Fig. 1, el cual, describe a la bicicleta como 1 Marco Trasero 1 4.850 kg
8
un sistema que se desplaza en un plano horizontal, con 1 2 Rueda Trasera 1 2.058 kg
3 Piñones 1 0.550 kg
su masa concentrada en el punto (b,h), y con un eje de 4 Biela 2 0.356 kg
dirección vertical, lo que implica que tiene un ángulo con 1
1
5 Plato 2 1 0.129 kg
6 Plato 1 1 0.141 kg
respecto al eje x de α 90°, y una distancia horizontal 7 Plato 3 1 0.074 kg
con respecto al punto de contacto de la rueda frontal de 2
1 8 Manubrio + Tenedor Frontal 1 1.950 kg
c 0. Este sistema tiene tres grados de libertad (δ,ϕ,ψ) con 9 Rueda Frontal 1 2.051 kg
4
2
6
3 5 7 1
ϕ 1 1 1
294
CHAPTER 10. NONLINEAR SYSTEMS
normal y por restricciones no-holonómicas en su dirección a la aceleración gravitacional gK0 , y otra parte v 2 K2
longitudinal y lateral. La Fig. 3 muestra el esquema de que se origina por los efectos centrı́fugos y giroscópicos.
la bicicleta conceptual con sus cuatro cuerpos rı́gidos, su Los coeficientes de las matrices M, C1 , K0 y K2 fueron
sistema de coordenadas (x,y,z), y sus tres grados de liber- calculados por medio del algoritmo propuesto en [Meijaard
tad, los cuales son, el ángulo de inclinación ϕt, el ángulo et al., 2007], el cual utiliza los parámetros consignados en
de dirección δ t y la velocidad de avance definida como el Cuadro 2, que se derivan del modelo CAD dividido en
v t Rrw θ̇r t. Este modelo tampoco tiene en cuenta el cuatro cuerpos tal y como se muestra en la Fig. 4. Este
par de perturbación ejercido sobre el eje de la dirección, algoritmo entrega los siguientes resultados:
que se genera por el efecto de la precesión giroscópica en 4.09537 0.31541 0 4.11551
M , C1
la rueda frontal. 0.31541 0.16216 0.58511 0.51821 ,
(5)
K0
61.76730 4.80033 , K 0 6.69605 .
O
x
4.80033 1.31521 2
0 0.57886
y
θf Cuadro 2. Parámetros básicos de la bicicleta.
θr δ Parámetro Sı́mbolo Valor
Distancia entre ejes w 1.06 m
Paso c 0.06 m
Ángulo del cabezal α arctan(3.51) rad
Gravedad g 9.81 m~s2
Velocidad de avance v variable m/s
Rueda trasera:
Radio Rrw 0.32 m
Masa mrw 2.61 kg
α Momentos de inercia de masa Axx , Ayy , Azz 0.09, 0.18, 0.09 kg m2
O Marco trasero:
x ϕ Posición del centro de masa xrf , yrf , zrf 0.42, 0.003, 0.56 m
c Masa mrf 5.91 kg
z w < B 0 Bxz = < 0.32 0 0.05 =
@ xx A @ A
@ A @ A
Momentos de inercia de masa @ Byy 0 A @ 0.56 0 A kg m2
ψ @ A @ A
@ sym. Bzz A @ sym. 0.26 A
> ? > ?
Marco frontal:
Posición del centro de masa xf f , yf f , zf f 0.94, 0, 0.75 m
Figura 3. Esquema de la bicicleta en su forma básica. Masa mf f 1.95 kg
< C 0 Cxz = < 0.14 0 0.01 =
@ xx A @ A
@ A @ A
Momentos de inercia de masa @ Cyy 0 A @ 0.13 0 A kg m2
@ A @ A
@ sym. Czz A @ sym. 0.03 A
> ? > ?
Rueda frontal:
Radio Rf w 0.32 m
Masa mf w 2.05 kg
Momentos de inercia de masa Dxx , Dyy , Dzz 0.09, 0.18, 0.09 kg m2
295
CHAPTER 10. NONLINEAR SYSTEMS
como base el análisis reportado en [Cerone et al., 2010], de avance. La otra zona, en donde la velocidad está por
las trayectorias de sus valores propios para varios valores encima de vc , la estabilidad podrı́a lograrse con acciones
de v t y las trayectorias de la parte real de sus valores de control tenues y lentas en comparación con la zona
propios como función de v t, se muestran en las Figuras anterior. Con esta información podrı́a diseñarse un sistema
5 y 6 respectivamente, ambas en un intervalo de 0, 10 de control que sea capaz de estabilizar la bicicleta con una
m/s. mı́nima velocidad de avance, asegurando su efectividad
para velocidades más elevadas.
Cuando v t es cero, el sistema tiene cuatro valores propios
reales, ubicados en λ1 3.8804, λ2 2.5721, λ3 3.8819 La estrategia de control propuesta, será diseñada para
y λ4 2.6598, los cuales están marcados con cı́rculos estabilizar la bicicleta con una velocidad v t C 1.5 m/s, la
rojos en la Fig. 5. La ubicación de λ1 y λ2 en el semiplano cual, es más baja que la mı́nima velocidad de estabilización
derecho, explica la inestabilidad de la bicicleta en su conseguida en [Michini and Sean Torrez, 2007].
posición vertical. Este serı́a el estado más crı́tico para
controlar ϕt, actuando solamente sobre δ t. A medida 2.2 Modelo Dinámico Virtual Construido en ADAMS
que v t aumente, λ1 y λ2 se convierten en complejos
conjugados y sus partes reales disminuyen hasta ubicarse Para construir un modelo dinámico en ADAMS, el modelo
dentro del semiplano izquierdo y estabilizar el sistema, CAD desarrollado en Solid Edge, fue importado al entorno
a partir de v t 3.15 m/s. También, λ4 se incrementa ADAMS/View y se completaron los siguientes procedi-
desplazándose hacia el semiplano derecho, retornando el mientos:
sistema a su condición inestable desde v t 3.95 m/s.
1. Se ingresa el valor de masa de cada cuerpo del modelo
CAD.
Parte imaginaria de los valores propios de la matriz A(v(t)) (rad/s)
20
15
2. Se interconectan todos los cuerpos mediante articula-
ciones de revolución.
10
3. Se establece para las articulaciones, el modelo de
5 fricción de Coulomb que se muestra en la Fig. 7, con
0
los siguientes parámetros:
-5
µs 0.05, µd 0.03, vs 0.1, vd 2vs .
-10
-15
µs
-20
-16 -14 -12 -10 -8 -6 -4 -2 0 2 4
Parte real de los valores propios de la matriz A(v(t)) (1/s) µd
Coeficiente de fricción
5 −µd
Parte real de los valores propios de la matriz A(v(t)) (1/s)
0
−µs
Velocidad de deslizamiento
-5
296
CHAPTER 10. NONLINEAR SYSTEMS
5. Se agregan los actuadores que moverán la bicicleta. perturbaciones externas. Este sistema sin perturbaciones,
6. Se construyen acoples virtuales para simular las rela- ζ t 0, es diferencialmente plano debido a que interna-
ciones de transmisión de las articulaciones enlazadas. mente sus variables son expresadas como funciones diferen-
7. Se expecifican las variables de entrada y salida que ciales de la salida y t [Sira Ramı́rez and Sunil K. Agrawal,
serán manipuladas de forma externa. 2004]. Bajo el enfoque ADRC, las dinámicas internas y las
8. Se exporta el modelo no lineal de ADAMS/View hacia perturbaciones externas del sistema (9), son asociadas en
MATLAB/Simulink, por medio de la herramienta un término aditivo unificado, llamado ξ t. De esto, resulta
ADAMS/Controls. un sistema simplificado que puede ser definido como:
La Fig. 8 muestra el modelo dinámico completado en y n t %ut ξ t. (10)
ADAMS. El bloque naranja representa el modelo exporta-
do a MATLAB. El objetivo de control es gobernar la salida y t de (10) y
obligarla a seguir alguna trayectoria fluida de referencia
[Tt]
Bicycle_2
Traction Torque
[LA]
Lean angle definida como y t, independientemente de la función
[St]
[SA]
Steering angle
de perturbación desconocida pero uniformemente acotada
Steering Torque [RW] ξ t. Para esto se asume lo siguiente:
Rear wheel angular speed
[D]
External Force
[C]
Time
External_Force (a). La función de perturbación ξ t es completamente
Adams bicycle model Actuator_2
y x
desconocida, mientras que la ganancia %, es perfec-
z tamente conocida.
z
(b). Siendo m un número entero positivo, la m-ésima deri-
x
y
vada con respecto al tiempo de ξ t es uniformemente
y absolutamente acotada. Es decir, que existe un valor
constante Km tal que supt Tξ m tT B Km .
Coupler_1 La caracterı́stica (a), asegura la independencia de ξ t
con respecto a ut. Esto permite proponer una ley de
Actuator_1
control sin limitaciones que sea robusta, aunque haya
incertidumbre en el valor real de %. La caracterı́stica (b),
gravity fue definida para establecer la existencia de una solución
para la ecuación diferencial (10).
297
CHAPTER 10. NONLINEAR SYSTEMS
y t, x2 t ẏ t, . . . , xn t y n1 t, xn1 t ξ t, . . . , Un término de prealimentación, para los casos cuando
xnm t ξ m1 t. Entonces, el modelo aumentado del la trayectoria de referencia de la salida plana, puede
sistema en espacio de estado, resulta ası́ [Cortés Romero conocerse y/o manipularse.
et al., 2011]: La realimentación del estado x̂t, estimado por el
observador GPI y ponderado por las ganancias del
ẋt Axt B%ut Eξ m t, y t Cxt, (12) controlador.
<0
Un término de rechazo activo de las perturbaciones,
@ 1 0 ... 0 =
A
<
@ 0 =
A
@0 0 1 ... 0 A @ A el cual, es básicamente la estimación aproximada de
@ A @ A
con A @
@
A
A > Rnmnm , B @
@ 1n-ésima posición
A
A > la función de perturbación ξˆt z t, que también
@ A @ A
@0
@ 0 0 ... 1 A
A
@
@ A
A entrega el observador GPI.
@0 0 0 ... 0 A @ 0 A
> ? > ?
Con respecto al sistema (10), la ley de control basada en
Rnm1 , C > R1nm , y E
T
1 0 ... 0 0 0 ... 1 > observador GPI para el seguimiento de trayectorias, tiene
Rnm1 .
la siguiente estructura:
Q
Ahora, denotando x̂j t como la estimación de xj t para n1
1 γk x̂k1 t y t x̂n1 t ,
n
y t
k
j 1, 2, . . . , n m, el observador extendido GPI propuesto ut
para la estimación del estado xt, es planteado de la % k 0
siguiente manera: (16)
x̂˙ t Ax̂t B%ut Ly t ŷ t, ŷ t C x̂t, (13) en donde el conjunto de los coeficientes γ0 , . . . , γn1 ,
son seleccionados para que todas las raı́ces del polinomio
en donde x̂t
T
x̂1 t x̂2 t . . . x̂nm t es la estima- pc s sn γn1 sn1 . . . γ0 estén ubicadas en el
ción del vector de estado y L
T
`nm1 . . . `1 `0 es el semiplano izquierdo de C. El error de seguimiento de la
vector de ganancias del observador. salida de lazo cerrado, ey t y t y t, es gobernado
por el siguiente polinomio caracterı́stico:
xt
eyn t γn1 eyn1 t . . . γ0 ey t
Definiendo el error de estimación como ẽx t
x̂t, el vector del error de estimación resultante ẽx t
Q γ ẽ
T n1 (17)
ẽx1 t ẽx2 t . . . ẽxnm t , satisface ξ t x̂n1 t k xk1 t.
ALC ẽx tEξ Ae ẽx tEξ m t, (14)
m k 0
ẽ˙ x t t
La ley de control (16), conduce la trayectoria del error
con Ae > Rnmnm y su polinomio caracterı́stico en de seguimiento del sistema controlado, hacia una pequeña
función de la variable compleja s, que está dado por: vencindad de cero, que puede reducirse tanto como se
Pẽx s detsI Ae snm `nm1 snm1 . . . `1 s `0 . desee, mediante una selección apropiada del conjunto de
(15) los coeficientes γ0 , . . . , γn1 . La única restricción para
la selección de estos coeficientes, es que pc s sea un
Los coeficientes `nm1 , . . . , `1 y `0 de la función poli-
polinomio Hurwitz, es decir, con todas sus raı́ces a la
nomial Pẽx s, deben seleccionarse para que todas sus
izquierda del eje imaginario de C.
raı́ces estén ubicadas a la izquierda del eje imaginario del
plano complejo C. Entonces, todas las trayectorias de los
errores de estimación ẽx1 t, ẽx1 t, . . . , ẽxnm1 t y 4. CASO DE ESTUDIO
ẽxnm t, convergen globalmente a una pequeña vecin-
dad de cero donde permanecerán acotadas. Esta vecin- En la estrategia propuesta, se diseña un controlador de
dad puede ajustarse reduciéndola tan pequeña como se velocidad de avance para el sistema de tracción, que per-
desee, mediante una adecuada selección de las ganancias mite a la bicicleta seguir perfiles suaves de velocidad con
`nm1 , . . . , `1 y `0 . aceleraciones controladas, y luego, se diseña un controlador
de inclinación, para maniobrar el sistema de dirección,
Una aproximación de bajo orden para la función de per- permitiendo compensar la inclinación durante su recorrido.
turbación (ξ˙t 0), requiere un diseño de un observador
de alto ancho de banda, el cual, podrı́a entregar estima-
ciones ruidosas en las aplicaciones prácticas. Aumentando 4.1 Controlador de Velocidad de Avance
el orden de la aproximación, se soluciona el problema de
estimaciones ruidosas y se construye una mejor aproxi- Asumiendo que v t Rrw θ̇r t, se propone controlar la
mación de la función de perturbación, pero el problema velocidad angular de la rueda trasera θ̇r t, aplicándole el
de localización de los valores propios del observador y las par requerido a su eje de giro. La dinámica de la tracción en
ganancias de la estrategia de control, para que impongan la bicicleta, puede simplificarse como un conjunto de fuer-
sobre la planta un seguimiento óptimo de la referencia, se zas y momentos que intervienen en la rueda trasera para
vuelve más difı́cil. generar su movimiento. La Fig. 9, muestra en un diagrama
de cuerpo libre, las fuerzas y momentos concentrados en la
3.3 Controlador Basado en Observador GPI rueda trasera, en donde v es la velocidad lineal del centro
de masa, Fn es la fuerza normal concentrada en el eje,
El controlador es diseñado con el objetivo de establecer la cual, es exactamente el peso de la bicicleta, asumiendo
de una manera dominante, un polinomio caracterı́stico del que se mueve sobre una superficie plana, Ff es la fuerza
error de seguimiento de alguna trayectoria deseada. Para de fricción que ejerce el suelo sobre la rueda y Trw es el
realizar esto, la ley de control debe estar compuesta por: par aplicado por el actuador.
298
CHAPTER 10. NONLINEAR SYSTEMS
299
CHAPTER 10. NONLINEAR SYSTEMS
Controlador δ t
en Observador GPI
este polinomio fueron seleccionadas las ganancias K0 18
Estabilizador Controlador Tδ t ϕt y K1 9, cuyos valores ubican las raı́ces en 6 3 y
Seguidor
establecen un tiempo de asentamiento de 1.53 segundos en
Modelo de δ t
la Bicicleta la respuesta transitoria del error de seguimiento. La acción
v t +
Controlador Trw t v t de control parcial us t, puede convertirse seguidamente
en la referencia de posición angular δ t, por medio de la
de Velocidad
de Avance
Controlador PI ecuación (28).
300
CHAPTER 10. NONLINEAR SYSTEMS
Según la metodologı́a de la sección 3, basándose en el Reemplazando esta ley de control en el sistema de la-
sistema simplificado (35) y la aproximación del mode- zo abierto (35), la dinámica del error de seguimiento
lo interno de la función de perturbación d4 ξ2 t~dt4 eδ t δ t δ t del sistema de control de lazo cerrado,
d4 r2 t~dt4 0, un observador GPI extendido es formulado quedará dominantemente gobernada por eδ ts2 K3 s
de la siguiente manera: K2 0. Un diseño admisible de la ley de control, se obtuvo
x̂˙ t Ax̂t Bκut F δ t δ̂ t, δ̂ t C x̂t, (36) con las ganancias K2 1760 y K3 88, las cuales, ubican
las raı́ces del polinomio caracterı́stico en: 57.3 30.7.
T
con x̂t x̂1 t x̂2 t x̂3 t x̂4 t x̂5 t x̂6 t , la ma- Estas raı́ces establecen un tiempo de asentamiento de 0.15
triz A > R66 con la misma forma de (12), B segundos, en la respuesta transitoria del error de segui-
T miento.
0 1 0 0 0 0 , C 1 0 0 0 0 0 y el vector de ganancias
F ´5 ´4 ´3 ´2 ´1 ´0 T . x̂1 t, x̂2 t y x̂3 t son las 5. SIMULACIÓN
versiones estimadas de δ t, δ̇ t y ξ2 t respectivamente.
La dinámica del vector del error de estimación que resul- Para verificar la estrategia de control propuesta, fue
ta del observador GPI extendido, será dominada por los necesario desarrollar una co-simulación con ADAMS y
valores propios de la matriz A F C . Para este caso, el MATLAB. La Fig. 18 muestra un diagrama de bloques
método de relaciones caracterı́sticas (CRA) no es eficiente con la estrategia de control completa, programada en
en la sintonización de las ganancias ´0 , ´1 , . . . , ´5 del ob- MATLAB/Simulink. Las derivadas con respecto al tiempo
servador GPI extendido, por la estructura de sexto orden de la trayectoria de referencia δ t fueron aproximadas
en la que se ha definido el modelo aumentado del sistema por la siguiente función de transferencia:
en espacio de estado. Una alternativa mejor elaborada sωn2
Ds , (40)
para seleccionar una localización aceptable de los valores s 2ςωn ωn2
2
propios del observador GPI extendido consistirı́a en utili-
que corresponde a una derivada filtrada por un sistema de
zar un método de optimización. Un regulador cuadrático
segundo orden, con una respuesta transitoria caracterizada
lineal (LQR), es el adecuado para solucionar este proble-
por los parámetros ς y ωn . Un diseño admisible para el fil-
ma de localización porque tiene como objetivo encontrar
tro se obtuvo con ωn 50 y ς 0.75, los cuales, establecen
para un sistema en espacio de estado expresado como
un ancho de banda de 3.5 Hz y un tiempo de asentamiento
ẋt Axt But, una ley de control ut Kxt que
de 0.115 segundos en su respuesta transitoria.
minimiza el siguiente ı́ndice de desempeño [Naidu, 2002]:
ª
V S 0
x tQxt u tRut dt,
T T
(37) 2
1.5
R 0.1,
0.5
con los cuales, el método de optimización LQR entre- v t
´5 ´4 . . . ´0 T :
Tiiempo (seg)
det sI A F C s6 ´5 s5 ´4 s4 ´3 s3 ´2 s2 ´1 s ´0 , 16
(38) 14
12
siguiente:
Figura 12. Par aplicado en la rueda trasera Trw t.
1 ˙
ut δ̈ t K3 δ̂ t δ̇ t K2 δ̂ t δ t ξˆ2 t ,
κ
(39) En la co-simulación, se demuestra que las estrategias de
control de velocidad e inclinación logran estabilizar la
en donde δ t, δ̇ t y δ̈ t son la referencia de posición bicicleta actuando en ella desde el reposo (v 0 0 m/s),
angular de la dirección y sus derivadas respectivamente. y con una inclinación inicial de 5.5° (ϕ0 0.096 rad).
301
CHAPTER 10. NONLINEAR SYSTEMS
20 15
10
15 5
Par (Nm)
10 0
10
-5
5 -10
5 0 0.2 0.4 0.6 0.8 1
Fuerza (N)
Par (Nm)
Tiempo (seg)
0
-5 0
-10
-5
-15
-20
-10
0 5 10 15 20 25 30 35 40 0 5 10 15 20 25 30 35 40
Tiempo (seg) Tiempo (seg)
Figura 13. Fuerza externa aplicada en el marco trasero. Figura 16. Par aplicado en la dirección Tδ t.
En la Fig. 11 se muestra la velocidad v t alcanzada por donde excede los lı́mites de saturación preestablecidos (-
la bicicleta y se observa el desempeño del controlador PI 10 a 10 Nm). Esto se debe a la inclinación inicial y su
en el seguimiento de la referencia v t. El par Trw t arranque desde el reposo.
generado por la acción PI se muestra en la Fig. 12. Algunos
sobrepicos de velocidad que aparecen cuando la referencia
es constante se originan por el perfil de fuerza externa de
la Fig. 13 aplicado al marco trasero.
x 10
-3 ϕt
0.1 0.1 8 ϕ̂t
Ángulo (rad)
0.05 6
Ángulo (rad)
0.08
4
0
2
0.06 -0.05
0 0.5 1 0
Ángulo (rad)
Tiempo (seg)
0.04 -2
11 12 13 14 15 16
Tiempo (seg)
0.02
-0.02
0 5 10 15 20
Tiempo (seg)
25 30 35 40 Figura 17. Simulación de la bicicleta en ADAMS/View.
Figura 14. Salida de inclinación ϕt y su estimación ϕ̂t. La Fig. 17, exhibe la simulación interactiva de la bicicleta
controlada en el entorno ADAMS/View.
δ t
δ̂ t
-0.5
1
-1
Se demuestra en los resultados que el esquema de control
0.5
0 0.5 1 1.5 2 ADRC propuesto, cumple satisfactoriamente con el objeti-
Ángulo (rad)
Tiempo (seg)
vo de estabilización de la bicicleta en su posición vertical.
0
-0.86
Aunque el controlador estabilizador es capaz de generar
una referencia apropiada para balancear la inclinación,
Ángulo (rad)
-0.5
-0.88
-0.9
tiene sus limitaciones debido a que está basado en un
-1
modelo simplificado de segundo orden. Se ha pensado para
-0.92
-1.5
11 12 13 14 15
Tiempo (seg)
16 17
un trabajo posterior, diseñar un controlador estabilizador
0 5 10 15 20
Tiempo (seg)
25 30 35 40
más efectivo con alguna otra estrategia para sistemas no
lineales, basándose en el modelo LPV de cuarto orden.
Figura 15. Salida de dirección δ t y su estimación δ̂ t. El controlador seguidor exhibe un desempeño destacable
en el seguimiento de trayectorias y rechazo de las per-
turbaciones, que puede mejorarse aumentando el orden
Ante los cambios de velocidad y la fuerza externa inyec- de la aproximación del modelo interno de la función de
tada en el sistema, la Fig. 14 muestra claramente que perturbación. El controlador de velocidad de avance tiene
el controlador de inclinación es efectivo en la estabiliza- un buen desempeño para los requerimientos de este tra-
ción de ϕt. El controlador estabilizador hace su parte bajo, pero debe mejorarse el seguimiento de la referencia
generando la referencia balanceadora δ t, mientras que si se considera en el futuro plantear una estrategia de
el controlador seguidor cumple con su tarea de imponer control que además de estabilizar la inclinación, posibilite
un seguimiento robusto de esa referencia al sistema de el desplazamiento siguiendo alguna trayectoria.
dirección, tal y como se muestra en la Fig. 15. La acción de
control Tδ t generada por el controlador seguidor, mues- Aunque el controlador estabilizador fue diseñado sobre un
tra en la Fig. 16 que el mayor par aplicado al manubrio punto de operación, asumiendo una velocidad de avance
se produce en los primeros segundos de la simulación, en constante, puede observarse su contundencia estabilizando
302
CHAPTER 10. NONLINEAR SYSTEMS
ϕ̂t
δ̂ t
˙
δ̂ t Observador
+
GPI
ξˆ2 t Extendido
K2
P
+
Tδ t
H s K3 1~κ
+
Saturador
P
wJ
d d
K1 2 Ds ϕt
us t vm mt h
δ t
dt δ̇ t
dt δ̈ t
vm Saturador
Modelo de
K0 δ t
Kp la Bicicleta
en ADAMS v t
v t +
+ + Trw t
S
+
Ki
+
Saturador
Integrador
+
Anti-Windup Tϕ t
Ka
Perturbación
la bicicleta en un rango por encima y por debajo del Cerone, V., Andreo, D., Larsson, M., and Regruto, D.
valor nominal (vm 1.5 m/s). El desempeño y robustez (2010). Stabilization of a Riderless Bicycle, A Linear-
del controlador estabilizador se reduce tanto como la Parameter-Varying Approach [applications of control].
velocidad de avance disminuya del valor nominal, hasta el Control Systems, IEEE.
punto donde el sistema de lazo cerrado se vuelve inestable Chen, C.K. and Dao, T.K. (2010). A Study of Bicycle
(cuando v t 0.85 m/s). Dynamics via System Identification. Symposium on
Computer, Communication, Control and Automation.
La efectividad de la estrategia de control propuesta, radica
en la estimación activa de las posibles no-linealidades y Cortés Romero, J., Luviano Juárez, A., Álvarez Salas, R.,
perturbaciones que afectan al modelo virtual durante su and Sira Ramı́rez, H. (2010). Fast identification and
libre movimiento, por medio de dos observadores GPI, los control of an uncertain brushless dc motor using alge-
cuales, reestablecen el desempeño del modelo lineal sim- braic methods. In 12th International Power Electronics
plificado de la bicicleta, permitiendo ası́, que la estructura Congress (CIEP), 9 – 14.
lineal de la leyes de control establecidas en los lazos de Cortés Romero, J., Ramos, G., and Coral Henriquez, H.
estabilización y seguimiento, impongan las dinámicas de (2014). Generalized proportional integral control for
las referencias de entrada. El seguimiento de la referencia periodic signals under active disturbance rejection ap-
de velocidad de avance que establece el controlador PI al proach. In ISA Transactions - The Journal of Automa-
sistema de tracción, ayuda notablemente al controlador tion, volume 53, 1901 – 1909.
de inclinación porque limita la magnitud y la tasa de Cortés Romero, J., Hebertt Sira Ramirez, and Alberto Lu-
variación de las perturbaciones exógenas que afectan al viano Juarez (2011). Control Lineal Robusto de Siste-
sistema, incrementando su controlabilidad puesto que se mas no Lineales Diferencialmente Planos. In Revista
controlan las variaciones de sus parámetros dependientes Iberoamericana de Automática e Informática Industrial,
de esta variable. Un trabajo interesante para desarrollar volume 8, 14 – 28.
a futuro, serı́a proponer alguna estrategia de control que Gao, Z. (2006). Active Disturbance Rejection Control:
sea igualmente dependiente de la velocidad de avance, con a Paradigm Shift in Feedback Control System Design.
sus ganancias auto-sintonizables mediante algún criterio In American Control Conference, 2006, 7 pp.–. doi:
de optimización, para mejorar la robustez del sistema de 10.1109/ACC.2006.1656579.
lazo cerrado ante las perturbaciones e incertidumbres del Goldstein, H. (1953). Classical Mechanics, 186. Addison-
modelo. Wesley, first edition.
Gordon Wilson, D. (2004). Bicycling Science, chapter 8,
263 – 310. The MIT Press, third edition.
REFERENCIAS Iuchi, K., Hiroshi Niki, and Toshiyuki Murakami (2005).
Attitude Control of Bicycle Motion by Steering Angle
Canudas de Wit, C. and Tsiotras, P. (1999). Dynamic and Variable COG Control. In 31st Annual Conference
tire friction models for vehicle traction control. In of IEEE Industrial Electronics Society, 2005. IECON
Decision and Control, 1999. Proceedings of the 38th 2005., 6, 6 pp.–.
IEEE Conference on, volume 4, 3746 – 3751. doi:
10.1109/CDC.1999.827937.
303
CHAPTER 10. NONLINEAR SYSTEMS
304
CHAPTER 10. NONLINEAR SYSTEMS
305
CHAPTER 10. NONLINEAR SYSTEMS
0-5V
ZT
A Fd
USB D
ZC D
A
TDA
0-5V
mb
SC
Fg
220 V3
Fig.1. Diagrama de instrumentación. El cambio de velocidad de la columna de aire dentro del tubo
se puede modelar mediante (7):
2.2. Modelado matemático
𝑑𝑣𝑎 𝑔(𝑓, 𝑦) − 𝑣𝑎
= (7)
El modelo matemático del levitador neumático involucra 𝑑𝑡 𝜏𝑎
elementos mecánicos, aerodinámicos y eléctricos. La Fig.2
muestra las fuerzas que actúan sobre la esfera ubicada dentro La relación 𝑔(𝑓, 𝑦) depende de la frecuencia 𝑓 del voltaje
del tubo. Buckner et al (2009). aplicado al ventilador centrífugo y de la altura medida con
respecto al orificio de entrada del aire al tubo 𝑦, y se obtiene
Aplicando la segunda ley de Newton se obtiene (1) y (2): experimentalmente para diferentes valores de 𝑓 y de 𝑦, 𝜏𝑎 es
la constante de tiempo del sistema del flujo de aire.
𝑑2 𝑦 𝑑𝑣𝑏
∑ 𝐹𝑦 = 𝑚𝑏 2
= 𝑚𝑏 (1) Finalmente, el sensor de distancia (GP2D12) se puede
𝑑 𝑡 𝑑𝑡
modelar como un sistema de primer orden (8):
𝑑𝑣𝑏
𝑚𝑏 = −𝐹𝑔 + 𝐹𝐷 (2) 𝑑𝑦𝑠 𝑦𝑏 − 𝑦𝑠
𝑑𝑡 = (8)
𝐹𝑔 es el peso de la esfera y 𝐹𝐷 es la fuerza de arrastre que 𝑑𝑡 𝜏𝑠
actúa sobre ella. Dichas fuerzas están dadas en (3) y (4):
En donde 𝑦𝑏 es la posición de la esfera, 𝑦𝑠 es la posición del
𝐹𝑔 = 𝑚𝑏 𝑔 (3) sensor y 𝜏𝑠 es su constante de tiempo que se obtiene
1 experimentalmente.
𝐹𝐷 = 𝐶𝐷 𝜌𝑎 𝐴(𝑣𝑎 − 𝑣𝑏 )2 (4)
2
La velocidad del aire producido por el ventilador en un punto
En donde, 𝑚𝑏 es la masa de la esfera, 𝐶𝐷 es el coeficiente de determinado a una distancia 𝑦 de la boca de insuflación está
arrastre, 𝜌𝑎 es la densidad del aire, 𝐴 es el área frontal de la dada por (9), Escoda (2008), Ortiz (2011)
esfera, 𝑣𝑎 la velocidad del aire en el tubo y 𝑣𝑏 la velocidad 𝐶𝑣𝑖 √𝐴0
de la esfera. 𝑣 = 𝑔(𝑓, 𝑦) = [𝑚⁄𝑠] (9)
𝑦𝑏
En donde 𝑣 es la velocidad del flujo de aire en m/s en un
Combinando (2), (3) y (4) resulta (5): punto dado, 𝑦𝑏 es la altura de la esfera en metros, 𝑣𝑖 es la
velocidad de salida del aire en la boca de insuflación, 𝐴0 es
𝑑𝑣𝑏 1
𝑚𝑏 = −𝑚𝑏 𝑔 + 𝐶𝐷 𝜌𝑎 𝐴(𝑣𝑎 − 𝑣𝑏 )2 (5) el área libre de la boca de insuflación, 𝐶 es una constante que
𝑑𝑡 2 se obtiene de tablas. Para este caso 𝐶 = 5 y 𝐴0 =
0.01538 𝑚 2
Para una esfera, (5) toma la forma (6):
2.3 Datos experimentales
𝑑𝑣𝑏 3 𝐶𝐷 𝜌𝑎 (𝑣𝑎 − 𝑣𝑏 )2
= −𝑔 + (6)
𝑑𝑡 8 𝜌𝑏 𝑅 Se obtienen con el fin de calcular la relación entre 𝑓 , 𝑣𝑎 e
𝑅 es el radio de la esfera y 𝜌𝑏 su densidad. 𝑦𝑏 y a partir de ella evaluar una expresión para 𝑔(𝑓, 𝑦).
En la tabla 1 se muestran los datos de la variación de la
velocidad del aire en la boca del tubo en función del voltaje
de control aplicado al variador y de la frecuencia del mismo.
306
CHAPTER 10. NONLINEAR SYSTEMS
𝑚𝑏 [𝐾𝑔] 0.0279
Tabla 1. Variación de la velocidad del aire vs voltaje de
𝑅[𝑚] 0.07
control aplicado
2
V (Volt) f (Hz) (m/s) 𝐴[𝑚 ] 0.01538
0 0 0 𝐾𝑔
𝜌𝑏 [ 3 ] 19.42
𝑚
0.6 6.88 1.91 𝐾𝑔
𝜌𝑎 [ 3 ] 1.196
1.66 19.44 5.08 𝑚
𝜏𝑎 [𝑠] 2
2.12 24.88 6.66
𝜏𝑠 [𝑠] 0.04
2.57 30.20 7.92
𝐶 0.5
2.91 34.28 9.02
𝐶𝐷 0.05
3.25 38.36 9.99
3.56 41.91 10.88 Reemplazando estos valores en el modelo descrito por (6),
3.70 43.68 11.32 (7), (8) y (15), resulta (16):
𝑑𝑣𝑏
Con los datos de la tabla 1 se obtiene la relación entre el = −9.81 + 0.1649(𝑣𝑎 − 𝑣𝑏 )2
voltaje de control aplicado al variador de velocidad (𝑉), la 𝑑𝑡
frecuencia de la alimentación aplicada a la turbina (𝑓) en 𝐻𝑧 𝑑𝑣𝑎 1.498(3.029𝑉 + 0.097)(𝑉𝑠 − 0.764) − 𝑣𝑎
y la velocidad de salida del aire en la boca del tubo (𝑣𝑖 ) en = (16)
𝑑𝑡 2
𝑚/𝑠 (10), (11):
𝑑𝑦𝑠 𝑦𝑏 − 𝑦𝑠
𝑓 = 11.874𝑉 − 0.242 [𝐻𝑧] (10) =
𝑑𝑡 0.04
𝑣𝑖 = 3.029𝑉 + 0.097 [𝑚⁄𝑠] (11)
Tomando como variables de estado: 𝑥1 = 𝑣𝑏 , 𝑥2 = 𝑣𝑎 , 𝑥3 =
De las curvas de respuesta dadas por el fabricante para el 𝑦𝑠 y como punto de equilibrio 𝑦𝑏 = 40𝑐𝑚 se obtiene (17):
sensor de distancia GP2D12 se obtiene (12) que relaciona el
voltaje de salida del sensor con la distancia del cuerpo, 𝑥̇1 = −9.81 + 0.1649(𝑥2 − 𝑥1 )2
Abarca, Loor y Robalino (2008). 𝑥̇ 2 = −0.5𝑥2 + 0.075 + 2.35𝑉 (17)
𝑥̇ 3 = −25𝑥3 + 1000
21.78
𝑉𝑠 = + 0.402 [𝑉𝑜𝑙𝑡] (12)
𝑦𝑏 La ecuación (17) describe el comportamiento dinámico del
En donde 𝑉𝑠 es la salida del sensor en voltios y 𝑦𝑏 la levitador, representa un sistema no lineal que al linealizarlo
distancia de la esfera al sensor en cm. Como se trabaja con alrededor del punto de equilibrio propuesto arroja la
voltajes de 0 a 5V se utilizó un amplificador con ganancia siguiente ecuación de estado:
1.9 con lo cual la (12) se transforma en (13) y (14):
𝑥̇1 −2.543 2.543 0 𝑥1 0
41.4 [𝑥̇ 2 ] = [ 0 −0.5 0 ] [𝑥2 ] + [2.35] 𝑉
𝑉𝑠 = + 0.764 [𝑉𝑜𝑙𝑡] (13)
𝑦𝑏 𝑥̇ 3 0 0 −25 𝑥3 0
41.4 𝑥1
𝑦𝑏 = [𝑐𝑚] (14) 𝑣𝑏 = [1 0 0] [𝑥2 ] (18)
𝑉𝑠 − 0.764
𝑥3
Combinando (9), (11) y (14) y organizando las unidades se
obtiene (15) La función de transferencia correspondiente a (18) que
relaciona la velocidad de la esfera con el voltaje de control
𝑔(𝑓, 𝑦) = 1.498(3.029𝑉 + 0.097)(𝑉𝑠 − 0.764) (15) aplicado al variador de velocidad está dada por (19):
307
CHAPTER 10. NONLINEAR SYSTEMS
Donde 𝑢(𝑡) y 𝑦(𝑡) son las variables de entrada y salida 𝐽 = (𝐺𝑢 + 𝑓 − 𝑤)𝑇 (𝐺𝑢 + 𝑓 − 𝑤) + 𝜆𝑢𝑇 𝑢 (28)
respectivamente y 𝑒(𝑡) es un ruido blanco de media cero.
La ley de control para el GPC queda (29):
El algoritmo de control consiste en aplicar la secuencia de Δ𝑢 = 𝐾(𝑤 − 𝑓) (29)
control que minimice la función de coste (22):
𝑁2 𝑁𝑢
Siendo 𝐾, la primera fila de (𝐺 𝑇 𝐺 + 𝜆𝐼)−1 𝐺 𝑇
𝐽 = ∑ 𝛿(𝑗)[𝑦(𝑡 + 𝑗| 𝑡) − 𝑤(𝑡 + 𝑗)] 2 + ∑ 𝜆(𝑗)[Δ𝑢(𝑡 + 𝑗 − 1)] 2 (23)
𝑗=𝑁1 𝑗=1 3.2 Prueba de los controladores diseñados
Donde 𝑦(𝑡 + 𝑗|𝑡) es la predicción óptima de la salida 𝑤(𝑡 +
𝑗) es la trayectoria de referencia, 𝛿(𝑗) y 𝜆(𝑗) son factores de Se estimaron diferentes controladores tomando como base la
ponderación, 𝑁1 y 𝑁2 son el horizonte mínimo y máximo de variación en el factor de ponderación del esfuerzo de control.
En todos los casos se realizaron las pruebas del sistema
predicción y 𝑁𝑢 es el horizonte de control.
variando el set-Point del 60% al 70% de la altura máxima
El último término de (22) y con 𝐶(𝑧 −1 ) = 1, se puede
que puede alcanzar la esfera dentro del tubo del levitador.
escribir en la forma (24):
1 𝐹 (𝑧 −1 )
−𝑗 𝑗
= 𝐸𝑗 (𝑧 −1 )
+ 𝑧 (24) La fig. 3 corresponde a la respuesta del sistema con el
∆𝐴(𝑧 −1 ) ∆𝐴(𝑧 −1 ) controlador predictivo con 𝜆 = 10 , horizonte mínimo de
predicción 𝑁1 = 1 y horizontes máximos de predicción y de
Con 𝐸𝑗+1 = 𝐸𝑗 + 𝑓𝑗,0 𝑧 −𝑗 y 𝐹𝑗 = [1 − 𝐸𝑗 𝐴̂]𝑧 𝑗 control 𝑁1 = 𝑁2 = 5
75
Así (22) se puede escribir matricialmente como (25):
70
𝑦 = 𝐺𝑢 + 𝐹(𝑧 −1 )𝑦(𝑡) + 𝐺 ′ (𝑧 −1 )Δ𝑢(𝑡 − 1) (25)
65
Variables [%]
Donde: 60
Set-Point
𝐺𝑗 = 𝐵𝐸𝑗 = 𝑔𝑜 + 𝑔1 𝑧 −1 + 𝑔2 𝑧 −2 + ⋯ 55 Ley de Control
Posición
𝑦(𝑡 + 𝑑 + 1|𝑡) 𝑔𝑜 0 … 0 50
𝑦(𝑡 + 𝑑 + 2|𝑡) 𝑔 𝑔𝑜 … 0
𝑦=[ ] 𝐺=[ 1 ] 45
⋮ ⋮ ⋮ ⋮ ⋮
𝑦(𝑡 + 𝑑 + 𝑁|𝑡) 𝑔𝑁−1 𝑔𝑁−2 … 𝑔𝑜 40
0 5 10 15 20 25
t [s]
Fig. 3. Cambio de la posición de la esfera del 60% al 70%
con el controlador predictivo 𝜆 = 10
308
CHAPTER 10. NONLINEAR SYSTEMS
60 Set-Point
Ley de control Al analizar los valores obtenidos para las métricas de la
55 Posición integral del error, se observa que, a medida que se disminuye
50
el factor de ponderación del esfuerzo de control, dichos
valores disminuyen, lo cual indica que la respuesta del
45 sistema converge más rápido hacia el set-point pero a costa
40
de un mayor esfuerzo de control, esto se ve en el criterio IE
0 5 10 15 20 25 (índice de energía) que es un indicador de dicho esfuerzo.
t [s]
Fig. 5. Cambio de la posición de la esfera del 60% al 70% Por su parte, el controlador PI presenta los índices mayores
con el controlador predictivo 𝜆 = 0.5 debido a su respuesta más oscilatoria y al mayor tiempo de
establecimiento.
Finalmente se trabajó con un controlador PI en el cual sus
Los controladores predictivos implementados mostraron un
parámetros se estimaron utilizando asignación de polos
desempeño adecuado, las respuestas de la posición de la
Iserman (1991), García (2011). La fig. 6 muestra la respuesta
esfera con estos controladores se muestra en las fig. 3, 4 y 5
del sistema con este controlador
y en ellas se aprecia que al aplicar un cambio en el set-point
del 60% al 70% el sistema presenta inicialmente un
80 sobreimpulso significativo (22%, 24% y 30%) con tiempos
75 de establecimiento de 13 s, 11 s y 10 s, respectivamente sin
oscilaciones significativas y con error de estado estable
70
aceptable igual a cero. Por otro lado, el controlador PI (fig. 6)
Variables [%]
55 4. CONCLUSIONES
50
Con los dos algoritmos de control, el sistema presentó un
45 desempeño adecuado en cuanto a exactitud (error de estado
0 5 10 15 20 25 30 35 40
t [s] estable aceptable), estabilidad y velocidad de respuesta se
Fig. 6. Cambio de la posición de la esfera del 60% al 70% refiere, lo que indica que este tipo de controladores son
con el controlador PI
309
CHAPTER 10. NONLINEAR SYSTEMS
apropiados para regular sistemas no lineales y con dinámicas Jornadas de Automática Ciudad Real. Recuperado el 20
complejas. de marzo de 2014 de:
http://www.ceautomatica.es/old/actividades/jornadas/XX
Los controladores predictivos pueden considerarse como una V/documentos/39-anessiuore.pdf.
opción posible y robusta a problemas de control de National Instrument. (2012). User guide and specifications
complejidad media a elevada debido a la facilidad y rapidez NI USB 6008/6009.
con que éstos pueden ser implementados en una aplicación Ortiz, P. Ramírez, J. Cardona, L (2011) Modelo matemático
de control determinada, su desventaja radica en que se y control de un sistema de fluidos. Instituto Tecnológico
requiere del conocimiento del modelo del sistema en forma Metropolitano. ISBN: 978-958-8743-08-0
precisa, para obtener un buen desempeño del sistema.
REFERENCIAS
310
CHAPTER 10. NONLINEAR SYSTEMS
Abstract: Fourier-Transform Infrared spectroscopy (FTIR) is a powerful tool for inferring chemical
composition of a sample from the analysis of its infrared spectrum of absorption or emission. The
industrial usage of this analyzer is typically reduced to species characterization, without taking advantage
of the potential in quantitative analysis. In the case of the polymers, there are some variables related
with product quality, which are strongly related to molecular properties of polymer chains, such as
Molecular Weight Distribution (MWD). Using state estimation techniques and monomer concentration
measurements is possible to obtain an average of molecular weight distribution. A comparison using
a nonlinear state estimation technique, the Extended Kalman Filter (EKF), is presented using either
the reactor temperature or the monomer concentration as measured variable under different scenarios of
noise and sampling time. The EKF based on monomer concentration produced better estimates according
with the used performance indexes.
Keywords: State estimation, Extended Kalman Filter, Optical spectroscopy, Polymerization, Average
molecular weight distribution, Process monitoring.
311
CHAPTER 10. NONLINEAR SYSTEMS
3437
0.6
2997 987
Polymers have some variables related to the product quality
2952
1444 (rheological properties, mechanical strength, chemical resis-
0.4 1244 tance, thermal stability, etc.), and these in turn are strongly re-
1192 lated to molecular properties of polymer chains, such as Molec-
0.2
1151 ular Weight Distribution (MWD), Chain Sequence distribution
(CSD) and Long-Chain Branching (LCD) (Apostolos et al.,
1732
2006). In this sense, process control is being redirected to the
0.0 control of these properties in order to produce desired product
4000 3500 3000 2500 2000
-1
1500 1000 500
specifications.
Wavenumber /cm
Due to the complexity of most polymerization processes, con-
Fig. 1. FTIR spectrum of PMMA (Duan et al., 2008). trol algorithms require the use of reliable process models,
which can provide useful insight about molecular properties of
online validation, and model maintenance (Feng et al., 2015). polymer chains. Molecular properties are typically studied by
Chemometric models are based on multivariate statistics and means of population balances at the expense of a considerable
are usually identified using partial least-squares (PLS), prin- computational burden which is not feasible in monitoring and
cipal component regression (PCR) or methods based on Petri control applications. Consequently, there are some methods to
nets. Depending on structure of the calibration models, recali- approximate a model based on population balances, including
bration periods should be specified. the method of moments, that is the most applied, but some
alternatives like orthogonal collocation method or fixed pivot
3. PROCESS MONITORING USING ONLINE methods have been used with prominent results (Apostolos
SPECTROSCOPY et al., 2006).
Estimation techniques in polymerization are focused in the use
It is well-known that most industries still have challenges re- of Kalman filtering strategies. The work of Kiparissides and
lated to measurement and control issues. For instance, the oil coworkers presented an online optimizing control of molecular
sand industry requires online measurements of many chem- weight properties in a batch polymerization reactor (Kiparis-
ical and physical properties such as bitumen, water content, sides et al., 2002). The strategy used a state/parametric estima-
clays fraction, chloride concentration, viscosity, etc. The first tion step. In the first case, an Extended Kalman filter (EKF)
challenge is to develop in-situ probes resistant to aggressive produced an estimate of the state variables and the termination
environments (erosion and corrosion) reducing replacing time. rate constant, which is a time-varying kinetic parameter. In a
Another challenge is related to presence of multi-phase flows different contribution, a comparison among three formulations
and process units where interface measurements are required. of the EKF and a nonlinear moving horizon estimator (MHE) is
Also, there are some complex compositions in the streams, made to predict the concentration of impurities at the beginning
characterized by different particle size distributions (PSD). The of the batch using the complete nonlinear model and available
last challenge is about the timing of lab data analysis, which are measurements (Salau et al., 2014). The results showed accept-
not suitable for online applications as real-time process control able results for the MHE at expenses of high computational
and optimization (Feng et al., 2015). Applications using FTIR burden. Similar approaches has been studied in (Shahrokhi
including polymerization processes are presented as follows. and Fanaei, 2002), where a EKF was implemented for MWD
estimation. Then, a cascade control loop was designed, with
3.1 FTIR applications molecular weight M w as controlled variable of the master loop
and the reactor temperature T as controlled variable of the slave
Applications of FTIR in the chemical process industry are loop.
focused in polymerization and crystallization reactions, where In (Shahrokhi and Fanaei, 2001) a discussion about the draw-
some internal properties such particle size distribution (PSD), backs and implementation issues of the EKF is presented. In
molecular weight distribution (MWD) or crystal size distri- this contribution, the importance of the observability condition,
bution (CSD) are fundamental to assess the quality of the and its fulfillment for a proper estimation is remarked. How-
products. However, there are additional industries where FTIR ever, the authors also remarked the possibility of designing a
is successfully employed, such as the pharmaceutic industry, reduced-order state estimator when the observability condition
medicine, and atmospheric gas monitoring. In (Gallignani et al., is not met. The observable states are estimated by an EKF and
2014), a quality control procedure using FTIR to determine the remaining states by an open-loop observer if the detectabil-
concentration of furosemide is presented. This work emphasize ity property is guaranteed. This observability issues are found in
the advantages of FTIR compared to common pharmaceutical most polymerization cases, because moments of dead polymer
analysis. Other applications are found in the area of gas obser- are not present in measurable properties.
vations in greenhouses and FTIR imaging for histopathology
in prostate cancer. Additionally to monitoring and control pur- The control of molecular properties of polymers are tackled
poses, FTIR spectrometry can be useful for the study of kinet- using feedforward loops in (Graichen et al., 2005), where an
ics in high-throughput reactive systems. In polymerization, it EKF is designed in order to estimate the reaction heat and heat
is a powerful technique for determination of catalyst activity, transfer coefficients. Again, as molecular properties moments
312
CHAPTER 10. NONLINEAR SYSTEMS
313
CHAPTER 10. NONLINEAR SYSTEMS
314
CHAPTER 10. NONLINEAR SYSTEMS
−5 −5
x 10 x 10
1.58
1.58
1.56
Average Molecular Weight [kg/kmol]
1.52
1.54
1.5
1.52
1.48
EKFT EKFT
r r
1.5
EKFM 1.46 EKFM
Plant Plant
1.48 1.44
0 2 4 6 8 10 0 2 4 6 8 10
Time (h) Time (h)
Fig. 3. State estimation without measurement noise and differ- Fig. 4. State estimation with measurement noise and unknown
ent initial state initial state
Model uncertainties were not considered. At 2 hours, the 1.58
x 10
−5
match. 1.46
0 2 4 6 8 10
Time (h)
315
CHAPTER 10. NONLINEAR SYSTEMS
−5
x 10 Daoutidis, P., Soroush, M., and Kravaris, C. (1990). Feedfor-
1.51 EKFT
ward/feedback control of multivariable nonlinear processes.
1.505 EKFM
r
AIChE Journal, 36(10), 1471–1484.
Duan, G., Zhang, C., Li, A., Yang, X., Lu, L., and Wang, X.
Average Molecular Weight [kg/kmol]
Plant
1.495
conia made by using a poly (methyl methacrylate) template.
1.49 Nanoscale Research Letters, 3(3), 118–122.
1.485 Eaton, M.T. and Ricker, N.L. (1995). Extended Kalman Fil-
1.48
tering for Particle Size Control in a Fed-batch Emulsion
1.475
Polymerization Reactor. In American Control Conference
(ACC), 1995, June, 2697–2701. Seattle, Washington.
1.47
Feng, E., Domlan, E., and Kadali, R. (2015). Spectroscopic
1.465
0.1 0.2 0.3 0.4 0.5 0.6 0.7
measurements in oil sands industry-from laboratories to
Time (h) real-time applications. IFAC Proceedings Volumes (IFAC-
PapersOnline), 48(8), 199–204.
Fig. 6. State estimation for different sampling time of filters Gallignani, M., Rondón, R.a., Ovalles, J.F., and Brunetto, M.R.
(detailed) (2014). Transmission FTIR derivative spectroscopy for esti-
mation of furosemide in raw material and tablet dosage form.
Error Index EKFTr EKFM
Acta Pharmaceutica Sinica B, 4(5), 376–383.
Without noise and x̂0 = x0
Graichen, K., Hagenmeyer, V., and Zeitz, M. (2005). Adap-
MSE 1.985 × 10−20 1.985 × 10−20
ITAE 9.252 × 10 −10 9.253 × 10−10
tive Feedforward Control with Parameter Estimation for the
Effect of initial estimated state Chylla – Haase Polymerization Reactor. 3049–3054.
MSE 1.048 × 10−15 8.515 × 10−16 Kiparissides, C., Seferlis, P., Mourikas, G., and Morris, A.J.
ITAE 5.974 × 10−9 5.459 × 10−9 (2002). Online Optimizing Control of Molecular Weight
Effect of measurement noise Properties in Batch Free-Radical Polymerization Reac-
MSE 3.292 × 10−15 1.266 × 10−15 tors. Industrial & Engineering Chemistry Research, 41(24),
ITAE 2.410 × 10−7 3.850 × 10−8 6120–6131.
Effect of sampling time Ling, C. and Kravaris, C. (2016). State Observer Design for
MSE 7.611 × 10−16 7.614 × 10−16 Monitoring the Degree of Polymerization in a Series of Melt
ITAE 2.332 × 10−7 3.531 × 10−8 Polycondensation Reactors. Processes, 4(1), 4.
Table 3. Error index for all scenarios Monsalve-Bravo, G.M., Moscoso-Vasquez, H.M., and Al-
varez, H. (2014). Scale-up of continuous reactors: Using
take into account realistic aspects as the sampling time of the phenomenological-based models. Chimica Oggi/Chemistry
composition measurements, because EKFM preserves the best Today, 32(3), 20–26.
performance compared to conventional EKFTr , even when Othman, N.S., Fevotte, G., Peycelon, D., Egraz, J., and Suau,
information is available every 10 s. In that way, current process J. (2004). Control of polymer molecular weight using near
monitoring in polymerization system can be improved, com- infrared spectroscopy. AIChE Journal, 50(3), 654–664.
bining both FTIR spectroscopy and nonlinear state estimation Ray, W.H. (1985). Polymerization Reactor Control. 1985
techniques. American Control Conference, 3–8.
Finally, it is important to emphasize that many estimation Salau, N.P.G., Trierweiler, J.O., and Secchi, A.R. (2014). State
schemes and especially the EKF needs a statistical identifica- Estimation of Chemical Engineering Systems Tending To
tion step to find out the real noise covariances by using appro- Multiple Solutions. Brazilian Journal of Chemical Engineer-
priate methods like the autocovariance least square (ALS). Ad- ing, 31(03), 771–785.
ditional analysis to be performed consist to incorporate the tem- Santos, A.F., Silva, F.M., Lenzi, M.K., and Pinto*, J.C. (2005).
perature and monomer concentration measurements together Monitoring and Control of Polymerization Reactors Using
to achieve a more robust estimator, and that probably may NIR Spectroscopy. Polymer-Plastics Technology and Engi-
deal with time-delay and asynchronous information. Finally, neering, 44(1), 1–61.
more reliable nonlinear filters like the Unscented Kalman Filter Shahrokhi, M. and Fanaei, M.A. (2001). State Estimation in a
(UKF), the Nonlinear Moving Horizon Estimator (NMHE), and Batch Suspension Polymerization Reactor. Iranian Polymer
the Particle Filters need to be tested for comparison purposes. Journal, 10(3), 1026–1065.
Shahrokhi, M. and Fanaei, M.A. (2002). Molecular Weight
REFERENCES Control of a Batch Polymerization Reactor. Iranian Polymer
Journal, 11(6), 403–411.
Apostolos, K., Dimitris, M., Vassilis, S., Christos, C., and Shenoy, A.V., Prasad, V., and Shah, S.L. (2010). Comparison
Costas, K. (2006). Dynamic Optimization of Molecular of unconstrained nonlinear state estimation techniques on a
Weight Distribution in Batch Polymerization Reactors, vol- MMA polymer reactor. IFAC Proceedings Volumes (IFAC-
ume 39. IFAC. PapersOnline), 9(PART 1), 159–164.
316
CHAPTER 10. NONLINEAR SYSTEMS
Abstract: This paper addresses the problem of optimal predefined-time stability. Predefined-
time stable systems are a class of fixed-time stable dynamical systems for which the minimum
bound of the settling-time function can be defined a priori as a explicit parameter of the system.
Sufficient conditions for a controller to solve the optimal predefined-time stabilization problem
for a given system are provided. These conditions involve a Lyapunov function that satisfy both
a certain differential inequality for guaranteeing predefined-time stability and the steady-state
Hamilton-Jacobi-Bellman equation for guaranteeing optimality. Finally, this result is applied to
the predefined-time optimization of the sliding manifold reaching phase.
317
CHAPTER 10. NONLINEAR SYSTEMS
Consequently, as an extension of the ideas presented in be tuned by a particular selection of the parameters of the
Bernstein (1993); Sánchez-Torres et al. (2015); Haddad system (1), it is said that the origin of the system (1) is
and L’Afflitto (2016), this paper addresses the problem of predefined-time stable.
optimal predefined-time stabilization, namely the problem
of finding a state-feedback control that minimizes certain The following Lyapunov-like lemma provides a character-
performance measure, guaranteeing at the same time ization of predefined-time stability.
predefined-time stability of the closed-loop system. In Lemma 2.1. (Sánchez-Torres et al., 2014) Assume there
particular, sufficient conditions for a controller to solve the exist a continuous radially unbounded function V : Rn →
optimal predefined-time stabilization problem for a given R+ ∪ {0}, and real numbers α > 0 and 0 < p ≤ 1, such
system are provided. These conditions involve a Lyapunov that:
function that satisfy both a certain differential inequality V (0) = 0 (4)
for guaranteeing predefined-time stability and the steady-
state Hamilton-Jacobi-Bellman equation for guaranteeing V (x) > 0, ∀x 6= 0, (5)
optimality. Finally, this result is applied to the predefined- and the derivative of V along the trajectories of the system
time optimization of the sliding manifold reaching phase. (1) satisfies
α
In the following, Section 2 presents the mathematical V̇ ≤ − exp(V p )V 1−p . (6)
p
preliminaries needed to introduce the proposed results.
Section 3 exposes the main results of this paper, which Then, the origin is globally predefined-time stable for (1)
are the sufficient conditions for a controller to solve and Tf = α1 .
the optimal predefined-time stabilization problem and a
particularization to affine systems. Section 4 shows the 2.2 Optimal Control
application of the obtained results to the predefined-time
optimization of the sliding manifold reaching phase, and Consider the controlled system
the simulation results are shown in Section 5. Finally, ẋ(t) = f (x(t), u(t)), x(0) = x0 , (7)
Section 6 presents the conclusions of this paper.
where x ∈ Rn is the system state, u ∈ Rm is the system
2. MATHEMATICAL PRELIMINARIES input, which is restricted to belong to a certain set U ⊂ Rm
of the admissible controls, and f : Rn × Rm → Rn is a
nonlinear function with f (0, 0) = 0.
2.1 Predefined-Time Stability
The system (7) is to be controlled to minimize the
Consider the system performance measure
Z tf
ẋ(t) = f (x(t)), x(0) = x0 , (1)
J(x0 , u(·)) = L(x(t), u(t))dt, (8)
where x ∈ Rn is the system state and f : Rn → Rn is a 0
nonlinear function such that f (0) = 0, i.e.the origin is an where L : Rn × Rm → R is a continuous function, assumed
equilibrium point of (1). to be convex in u. To this end, define the minimum cost
First, the concepts of finite-time, fixed-time and function as
Z tf
predefined-time stability are reviewed. ∗
J (x(t), t) = min L(x(τ ), u(τ ))dτ . (9)
Definition 2.1. (Polyakov, 2012) The origin of (1) is u∈U t
globally finite-time stable if it is globally asymptotically
stable and any solution x(t, x0 ) of (1) reaches the Defining the Hamiltonian, for p ∈ Rn (usually called the
equilibrium point at some finite time moment, i.e., ∀t ≥ costate)
T (x0 ) : x(t, x0 ) = 0, where T : Rn → R+ ∪ {0}. H(x, u, p) = L(x, u) + pT f (x, u), (10)
Definition 2.2. (Polyakov, 2012) The origin of (1) is fixed- the Hamilton-Jacobi-Bellman (HJB) equation can be
time stable if it is globally finite-time stable and the written as ( !)
settling-time function is bounded, i.e. ∃Tmax > 0 : ∀x0 ∈ ∂J ∗ (x, t)
T
∂J ∗ (x, t)
Rn : T (x0 ) ≤ Tmax . 0 = min H x, u, + , (11)
u∈U ∂x ∂t
Remark 2.1. Note that there are several choices for Tmax .
For instance, if the settling-time function is bounded by and it provides a sufficient condition for optimality.
Tm , it is also bounded by λTm for all λ ≥ 1. This motivates For infinite-horizon problems (limit as tf → ∞), the cost
the following definition. does not depend on t anymore and the partial differential
Definition 2.3. (Sánchez-Torres et al., 2014) Let T be the equation (11) reduces to the steady-state HJB equation
set of all the bounds of the settling time function for the !
T
system (1), i.e., ∂J ∗ (x)
0 = min H x, u, . (12)
T = {Tmax > 0 : ∀x0 ∈ Rn : T (x0 ) ≤ Tmax } . (2) u∈U ∂x
The minimum bound of the settling-time function Tf , is
defined as: 3. OPTIMAL PREDEFINED-TIME STABILIZATION
Tf = inf T = sup T (x0 ). (3)
x0 ∈Rn The main result of this paper is presented in this section.
Definition 2.4. (Sánchez-Torres et al., 2014) For the case First, the notion of optimal predefined-time stabilization
of fixed time stability when the time Tf defined in (3) can is defined.
318
CHAPTER 10. NONLINEAR SYSTEMS
Z ∞
Definition 3.1. Consider the optimal control problem for
the system (7) J(x0 , φ∗ (x(·))) = −V̇ (x(t))dt
Z ∞ 0
= − lim V (x(t)) + V (x0 )
min J(x0 , u(·)) = L(x(t), u(t))dt (13) t→∞
u∈U (Tc ) 0
= V (x0 ).
where
U(Tc ) = {u(·) : u(·) stabilizes (7) Now, to prove (22), let u(·) ∈ U(Tc ) and let x(t) be the
solution of (7), so that
in a predefined time Tc }.
∂V
This problem is called the optimal predefined-time V̇ (x(t)) = f (x(t), u(t)).
stabilization problem for the system (7). ∂x
Then
The following theorem gives sufficient conditions for a ∂V
controller to solve this problem. L(x(t), u(t)) = L(x(t), u(t)) + f (x(t), u(t)) − V̇ (x(t))
∂x !
Theorem 3.1. Assume there exist a continuous radially ∂V T
unbounded function V : Rn → R+ ∪ {0}, real numbers = H x(t), u(t), − V̇ (x(t)).
∂x
Tc > 0 and 0 < p ≤ 1, and a control law φ∗ : Rn → Rm
such that:
Since u(·) stabilizes (7) in predefined time Tc , using (18)
V (0) = 0 (14) and (19) we have
V (x) > 0, ∀x 6= 0, (15) Z ∞" ! #
φ∗ (0) = 0 (16) ∂V T
J(x0 , u(·)) = H x(t), u(t), − V̇ (x(t)) dt
∂V 1 0 ∂x
f (x, φ∗ (x)) ≤ − exp(V p )V 1−p (17)
∂x Tc p = − lim V (x(t)) + V (x0 )+
! t→∞
!
∂V T Z ∞
H x, φ∗ (x), =0 (18) ∂V T
∂x H x(t), u(t), dt
0 ∂x
! !
∂V T Z ∞
H x, u, ≥ 0, ∀u ∈ U(Tc ). (19) ∂V T
∂x = V (x0 ) + H x(t), u(t), dt
0 ∂x
Then, with the feedback control ≥ V (x0 )
!
∂V T = J(x0 , φ∗ (x(·))).
u∗ (·) = φ∗ (x(·)) = arg min H x, u, , (20)
u∈U (Tc ) ∂x
the origin of the closed-loop system Remark 3.1. The conditions (18) and (19) together
are exactly the steady-state Hamilton-Jacobi-Bellman
ẋ(t) = f (x(t), φ∗ (x(t))) (21) equation (12).
is predefined-time stable with Tf = Tc . Moreover, the Remark 3.2. It is important that the optimal predefined-
feedback control law (20) minimizes J(x0 , u(·)) in the sense time stabilizing controller u∗ = φ∗ (x) characterized by
that Theorem 3.1 is a feedback controller.
J(x0 , φ∗ (x(·))) = min J(x0 , u(·)) (22)
u∈U (Tc ) Although Theorem 3.1 provides sufficient conditions
= V (x0 ), (23) for a controller to solve the optimal predefined-time
i.e., the feedback control law (20) solves the optimal stabilization problem for a given system, it does not
predefined-time stabilization problem for the system (7). provide a closed form expression for the feedback
controller. Instead, the feedback controller is obtained by
Proof. Predefined-time stability with predefined time Tc solving (20). To obtain a closed form expression for the
follows directly from the conditions (14)-(17) and applying controller, the result of Theorem 3.1 is specialized to affine
the Lemma 2.1 to the closed-loop system (21). systems of the form
To prove (23), let x(t) be the solution of system (21). Then, ẋ(t) = f (x(t)) + B(x(t))u(t), x(0) = x0 , (24)
∂V where x ∈ R is the system state, u ∈ R is the system
n m
V̇ (x(t)) = f (x(t), φ∗ (x(t))). control input, f : Rn → Rn is a nonlinear function with
∂x
f (0) = 0 and B : Rn → Rn×m .
From the above and (18) it follows
L(x(t), φ∗ (x(t))) = L(x(t), φ∗ (x(t)))+ The performance integrand is also specialized to
∂V L(x, u) = L1 (x) + L2 (x)u + uT R2 (x)u, (25)
f (x(t), φ∗ (x(t))) − V̇ (x(t))
∂x ! where L1 : R → R, L2 : R → R
n n 1×m
and R2 : R →
n
319
CHAPTER 10. NONLINEAR SYSTEMS
Corollary 3.1. Assume there exist a continuous radially which is the hypothesis (19). Applying Theorem 3.1, the
unbounded function V : Rn → R+ ∪{0}, and real numbers result is obtained.
Tc > 0 and 0 < p ≤ 1 such that
V (0) = 0 (26) Remark 3.3. The feedback controller (31) provided by
V (x) > 0, ∀x 6= 0, (27) Corollary 3.1 is an inverse optimal controller in the
" " T ## following sense: instead of solving the steady-state HJB
∂V 1 ∂V
f (x) + B(x) − R2−1 (x) L2 (x) + B(x) equation directly to minimize some given performance
∂x 2 ∂x measure, it is defined a family of predefined-time
1 stabilizing controllers that minimize a certain cost
≤− exp(V p )V 1−p (28) function. In this case, one can flexibly specify L2 (x) and
Tc p
R2 (x), while from (34) L1 (x) is parametrized as
L2 (0) = 0 (29)
∂V
L1 (x) +
∂V
f (x)− L1 (x) = φ∗ T (x)R2 (x)φ∗ (x) − f (x) ≥ 0. (35)
∂x ∂x
T Remark 3.4. It is not always easy to satisfy the hypotheses
1 ∂V ∂V (26)-(30) of Corollary 3.1. However, for affine systems of
L2 (x) + B(x) R2−1 (x) L2 (x) + B(x) = 0.
4 ∂x ∂x relative degree one the functions L2 (x) and R2 (x) can be
(30) easily chosen to fulfill these conditions.
Then, with the feedback control This motivates the following section.
T
1 ∂V
u∗ = φ∗ (x) = − R2−1 (x) L2 (x) + B(x) , (31) 4. INVERSE OPTIMAL PREDEFINED-TIME
2 ∂x
STABLE REACHING LAW
the origin of the closed loop system
ẋ(t) = f (x(t)) + B(x(t))φ∗ (x(t)) (32) In this section, first, some basic concepts corresponding
is predefined-time stable with Tf = Tc . Moreover, the to integral manifolds and sliding mode manifolds are
performance measure J(x0 , u(·)) is minimized in the sense reviewed. For this purpose, consider again the autonomous
of (22) and unforced system (1)
J(x0 , φ∗ (x(·))) = V (x0 ), (33) ẋ(t) = f (x(t)), x(0) = x0 ,
i.e., the feedback control law (31) solves the optimal
predefined-time stabilization problem for the system (24). Definition 4.1. (Drakunov and Utkin, 1992) Let σ : Rn →
Rm be a smooth function, and define the manifold S =
Proof. We can see that the hypotheses of Theorem {x ∈ Rn : σ(x) = 0}. If for an initial condition x0 ∈ S, the
Theorem 3.1 are satisfied. The control law (31) follows solution x(t, x0 ) ∈ S for all t, the manifold S is called an
∂
integral manifold.
from ∂u H x, u, ∂V∂x = 0 with L(x, u) specialized to
(25). Then, setting u∗ = φ∗ (x) as in (31), the conditions Definition 4.2. (Drakunov and Utkin, 1992) If there is
(26), (27) and (28) become the hypotheses (14), (15) and a nonempty set N ⊂ Rn − S such that for every initial
(17), respectively. The hypothesis (16) follows from (29). condition x0 ∈ N , there is a finite time ts > 0 in which
∂
the system state reaches the manifold S then the manifold
Since φ∗ (x) satisfies ∂u H x, u, ∂V u=φ∗ (x)
= 0, and
∂x S is called an sliding mode manifold.
noticing that (30) can be rewritten in terms of φ∗ (x) as
Remark 4.1. A sliding mode on a certain sliding manifold
∂V can only appear if f is a non-smooth (possibly
L1 (x) + f (x) − φ∗ T (x)R2 (x)φ∗ (x) = 0. (34)
∂x discontinuous) function. For this case, the solutions of (1)
the hypothesis (18) is directly verified. are understood in the Filippov sense (Filippov, 1988).
Finally, from (18), (31) and the positive definiteness of With the above definitions, the main objective of the
R2 (x) it follows controller is to optimally drive the trajectories of affine
∂V ∂V system (24) to the set S in a predefined time. The function
H x, u, = L(x, u) + [f (x) + B(x)u] σ : Rn → Rm is selected so that the motion of the system
∂x ∂x
(24) restricted to the sliding manifold σ(x) = 0 has a
∂V
= L(x, u) + [f (x) + B(x)u] − desired behavior.
∂x
∂V The dynamics of σ are described by
L(x, φ∗ (x)) − [f (x) + B(x)φ∗ (x)]
∂x σ̇(t) = a(x(t)) + G(x(t))u(t), σ(x(0)) = σ0 , (36)
∂V where a(x) = ∂σ ∂σ
= L2 (x) + B(x) (u − φ∗ (x))+ ∂x f (x) and G(x) = ∂x B(x).
∂x
It is assumed that σ(x) is selected such that the matrix
uT R2 (x)u − φ∗ T (x)R2 (x)φ∗ (x) G(x) ∈ Rm×m has inverse for all x ∈ Rn . It means that
= −2 φ∗ T (x)R2 (x)(u − φ∗ (x))+ the system (36) has relative degree one.
uT R2 (x)u − φ∗ T (x)R2 (x)φ∗ (x) Now, consider the optimal predefined time stabilization
∗ T ∗ problem (13) for the system (36). The aim is to choose
= [u − φ (x)] R2 (x)[u − φ (x)]
the functions V , L2 and R2 such that the hypotheses of
≥ 0, Corollary 3.1 are satisfied. To this end, assume that V (σ)
320
CHAPTER 10. NONLINEAR SYSTEMS
is a Lyapunov function candidate. Its derivative along the Table 1. Parameters of the pendulum model
trajectories of the system (36) closed loop with (31) (41).
σ̇ = a(x) + G(x)φ∗ (x),
Parameter Values Unit
is calculated as M 1 kg
∂V L 1 m
V̇ = [a(x) + G(x)φ∗ (x)] J = M L2 1 kg · m2
∂σ
∂V 1 Vs 0.2 kg · m2 · s−1
= a(x) + G(x) − R2−1 (x)LT2 (x)− Ps 0.5 kg · m2 · s−2
∂σ 2 g 9.8 m · s−2 2
!#
T
1 −1 ∂V
R (x)GT (x) . The simulations were conducted using the Euler
2 2 ∂σ integration method, with a fundamental step size of 1 ×
Then, choosing 10−3 s. The initial conditions for the system (41) were
T selected as: x1 (0) = π/2 rad and x2 (0) = 0 rad/s. In
L2 (x) = 2aT (x) G−1 (x) R2 (x), (37) addition, the controller gains were adjusted to: Tc = 1,
Tc p k = 2 and p = 1/2.
R2 (x) = exp(−V p (σ(x))) GT (x)G(x) , (38)
2
the derivative of V becomes
3.5
1 ∂V 2
V̇ = − exp(V p ) .
Tc p ∂σ 3
1.5
It can easily be checked that
V (σ) = ci (σ T σ)i > 0, ∀σ 6= 0 (40) 1
1 4c
with i = p+1 and 4i2 c = (p+1) 2 = 1, satisfies (39). 0.5
1 2
V (σ) = c p+1 σ p+1 , 1.5
Tc p p 2p
R2 (x) = 2
exp(−c p+1 σ p+1 ), and 1
2J
Tc p p 2p g Vs
L2 (x) = exp(−c p+1 σ p+1 ) − sin(x1 ) − x2 − 0.5
J L J
Ps 0
0 0.5 1 1.5 2 2.5 3 3.5 4
sign(x2 ) + kx2 , time [s]
J
and u∗ = φ∗ (x) is implemented as in (31). Rt
Figure 2. Function J(t) = 0
[L1 + L2 u + uT R2 u]dτ .
Finally, according to (35) the resulting function L1 is
2 Note that σ(t) = 0 for t ≥ 0.827 s < Tc = 1 s (Fig.
2J 2 p 2p 1 ∂V ∂V
L1 (x) = exp(c p+1 σ p+1 ) L2 (x) + − a(x). 1). Once the system states slide over the sliding manifold
4Tc p J ∂σ ∂σ
σ(x) = 0, this motion is governed by the reduced order
5. SIMULATION RESULTS system
ẋ1 (t) = −kx1 (t) = x2 .
The simulation results of the Example 5.1 are presented This imply that the system state tends exponentially to
in this section. The pendulum parameters are shown in zero at a rate of k1 (Fig. 3). Fig. 4 shows the control signal
Table 1. (torque) versus time. Finally, from Fig. 2, it can be seen
321
CHAPTER 10. NONLINEAR SYSTEMS
0
control in dynamic systems. International Journal of
−0.5 Control, 55, 1029–1037.
Filippov, A.F. (1988). Dierential equations with
−1
discontinuous righthand sides. Kluwer Academic
−1.5
x1(t) Publishers Group, Dordrecht.
x (t)
2
Fraguela, L., Angulo, M., Moreno, J., and Fridman, L.
−2 (2012). Design of a prescribed convergence time uniform
0 0.5 1 1.5 2 2.5 3 3.5 4
time [s] robust exact observer in the presence of measurement
noise. In Decision and Control (CDC), 2012 IEEE 51st
Figure 3. Evolution of the states. Annual Conference on, 6615–6620.
Haddad, W.M. and L’Afflitto, A. (2016). Finite-Time
Stabilization and Optimal Feedback Control. IEEE
12
Transactions on Automatic Control, 61(4), 1069–1074.
doi:10.1109/TAC.2015.2454891.
10
Haimo, V. (1986). Finite time controllers. SIAM Journal
8
on Control and Optimization, 24(4), 760–770.
Moulay, E. and Perruquetti, W. (2005). Lyapunov-
6
based approach for finite time stability and stabi-
lization. In Proceedings of the 44th IEEE Con-
u(t)
322
CHAPTER 10. NONLINEAR SYSTEMS
Resumen: En el control de procesos se utilizan ecuaciones diferenciales parciales (EDPs) para describir
procesos físicos mediante las leyes de transferencia de masa, energía o cantidad de movimiento y
ecuaciones fundamentales. Sin embargo, los modelos de EDP conducen a sistemas de infinita dimensión,
que son inviables para el control y simulación de procesos. Las técnicas de reducción de dimensionalidad
como la descomposición ortogonal propia (DOP) son necesarias para obtener modelos de orden reducido
adecuados para los sistemas de control. El método DOP ha funcionado para la reducción de
dimensionalidad en los dominios espaciales cartesianos con mallado regular. No obstante, una limitación
del método DOP corresponde al orden de almacenamiento de la información empírica de los sistemas en
la matriz de datos para sistemas EDP con dominio no cartesiano. Por lo tanto, nuestra contribución es
mostrar que las funciones DOP y sus respectivas varianzas son invariantes con respecto a la orden de
almacenamiento. Usando COMSOL-Matlab/Simulink y el método DOP, se utilizó un sistema de
transferencia de calor en dos dimensiones con varios órdenes de almacenamiento en la matriz de datos,
con el fin de comparar los respectivos modelos de orden reducido. También proporcionamos la
aplicación de la interfaz COMSOL-Matlab para obtener modelos de orden reducido usando un calentador
de oficina (sistema de calor de dos dimensiones).
Palabras clave: DOP, mallado irregular, COMSOL, EDP.
323
CHAPTER 10. NONLINEAR SYSTEMS
artículo es aplicar el método de elementos finitos utilizando solución numérica EDO. Para cargar un modelo en Matlab es
un mallado irregular y DOP, para obtener el modelo de orden utilizado el comando mphload, el cual permite crear una
reducido en un sistema EDP; e igualmente, estudiar el efecto estructura que contiene todas las instrucciones del modelo en
del orden de almacenamiento de los datos de simulación o COMSOL. Las variables o expresiones pueden ser evaluadas
empíricos en la matriz de datos. en la ventana de trabajo por medio del uso del comando
mpheval, en el cual se especifican las coordenadas espaciales
Las simulaciones presentadas en este artículo fueron y temporales a evaluar. El mallado utilizado para simular el
realizadas utilizando la plataforma de simulación COMSOL modelo consta de diversos elementos tales como: elementos y
Multiphysics y Matlab. COMSOL es una herramienta para el vértices, si se requiere conocer esta información es posible
análisis de elementos finitos en sistemas físicos. Aunque la utilizar el comando mphxmeshinfo.
configuración de COMSOL es eficaz a la hora de resolver
varios tipos de problemas, el tiempo de simulación puede Finalmente, es posible obtener el sistema EDO lineal en
llegar a ser grande, dependiendo de muchos factores como: espacios de estado, correspondiente a la solución numérica
número de variables, complejidad del sistema EDP y del del modelo EDP, para ello se utiliza el comando mphstate.
número de nodos utilizados en el mallado. El método DOP Este comando importa en Matlab las matrices que
no puede ser implementado en COMSOL, por lo cual se representan el sistema, las cuales son utilizadas
realiza utilizando COMSOL con Matlab. En este artículo se posteriormente para la obtención del modelo de orden
presenta la simulación de transferencia de calor reducido. Para mayores detalles se puede referir a la
bidimensional utilizando un mallado regular e irregular, documentación de estos programas (COMSOL, 2015).
evaluando el efecto del orden de almacenamiento en el
modelo de orden reducido (MOR).
3. REDUCCIÓN DE DIMENSIONALIDAD USANDO DOP
La estructura en la que se presenta este documento es la En la descomposición ortogonal propia, se busca representar
siguiente: en la sección 2 se presenta una descripción de la las variables del sistema como la suma de funciones base
plataforma de simulación. En la sección 3 se presenta una ortonormales según (1).
breve introducción al método DOP. En la sección 4 son
presentados los resultados de simulaciones para los casos de
estudio, en el cual se involucra la transferencia de calor en ( ) ̂( ) ∑ () ( ) ( )
dos dimensiones. En la sección 5 se presentan las
conclusiones de este trabajo.
En donde ( ) representan un conjunto de funciones base
2. PLATAFORMA DE SIMULACIÓN ortonormales (funciones base DOP) en el dominio espacial z
y ( ) son los coeficientes variantes con el tiempo o
COMSOL posee algunos de los modelos más comunes de
aplicación en las ciencias e ingeniería como: acústica, coeficientes DOP asociados con cada función base. La
dinámica de fluidos computacionales, procesos químicos, reducción de modelos a través de DOP puede realizarse a
corrosión, electroquímica, geomecánica, transferencia de través de 5 pasos, a saber:
calor entre otros (COMSOL, 2010). Una vez es definido el
módulo de simulación o el grupo de EDPs a utilizar se 1. Generación de la matriz snapshot: La matriz snapshot es
especifica la geometría de estudio. En la interfaz del modelo construida por las respuestas del sistema cuando se
se pueden introducir varios tipos de condiciones al sistema de introduce señales ricas en frecuencias en las entradas. En
EDPs como: valores iniciales, condiciones de frontera y la simulación son tomados k observaciones utilizando un
condiciones iniciales. Finalmente se realiza el mallado sobre tiempo de muestreo determinado ( ), según (2).
la geometría de estudio para poder realizar la simulación.
( ( ) ( )) ( )
La interfaz entre COMSOL y Matlab puede ser utilizada
cargando el modelo de COMSOL en Matlab, en donde el 2. Obtención de las funciones base DOP: estas funciones
usuario puede introducir o cambiar cualquiera de los son obtenidas a través del cálculo de la descomposición
parámetros numéricos de la simulación como: propiedades en valores singulares (SVD) de la matriz .
del material, valores iniciales, condiciones iniciales o de
frontera, por medio del uso de funciones de Matlab en ( )
COMSOL, o por medio de comandos desde la interfaz de
Matlab, para mayor detalle se puede referir a la en donde y son matrices ortonormales, y es la
documentación que acompaña esta conexión (COMSOL, matriz que contiene los valores singulares de la matriz
2013). snapshoot. Las columnas de son las funciones base
DOP.
Al utilizar COMSOL con Matlab se pueden efectuar diversas
acciones como, por ejemplo: cargar un modelo de COMSOL 3. Selección de las funciones DOP más relevantes: se
en la ventana de Matlab, evaluar una expresión o variable, realiza verificando los valores singulares de la matriz
extraer la información relacionada con el mallado utilizado, . El grado de correlación entre una función base
modificar las condiciones iniciales y de frontera, extraer la
324
CHAPTER 10. NONLINEAR SYSTEMS
( ) y los datos se obtiene a través del valor singular En esta sección se presentan algunos benchmarks utilizando
correspondiente ( ). Es importante resaltar que los reducción de dimensionalidad por el método de DOP
valores singulares se encuentran ordenados de mayor a presentado previamente. El primer benchmark es un modelo
menor, indicando que la primera función base DOP es el de transferencia de calor bidimensional, utilizando varios
elemento más importante. El criterio para determinar el ordenes de almacenamiento, esta simulación busca evaluar el
número de funciones base está dado en (4), donde r es el efecto que tiene el orden de almacenamiento en la matriz de
rango de la matriz de datos. datos para el MOR. El segundo benchmark es un sistema de
calefacción representado por un modelo de transferencia de
∑ calor bidimensional con geometría irregular, con esta
( ) simulación se muestra el uso de elementos finitos en un
∑
mallado irregular y DOP en la obtención del modelo de orden
reducido de un sistema EDP.
Como los valores propios son positivos, esto implica
que: Para verificar el desempeño de los modelos de orden
reducido, se calcular un error relativo con respecto a los datos
( ) generados por el modelo original. El error es tomado como el
máximo valor calculado del error, y se muestra en (12).
la cantidad es utilizada para determinar el orden de
truncamiento de las funciones base DOP. El número de | ( ) ( )|
funciones DOP base es determinado por la fracción de ( )
( )
los primeros valores singulares que sea lo
suficientemente grande para representar la información
de los datos. En ese orden de ideas se puede construir el En donde ( ) es el valor del modelo original en la
modelo de orden reducido con un número de ecuaciones coordenada espacial z en el tiempo t y ( ) es el valor
n. del modelo de orden reducido.
4. Construcción del modelo de orden reducido: para 4.1 Transferencia de calor con diferentes órdenes de
obtener el modelo de los primeros n coeficientes DOP, se almacenamiento.
utiliza la proyección de Galerkin. Lo anterior consiste en
definir una función residual con la diferencia entre el En el primer benchmark se presenta la transferencia de calor
modelo original y el modelo aproximado, esta se muestra bidimensional en una placa, cuya temperatura es fijada en dos
en (6). de sus fronteras (ver Fig. 1), la altura y el ancho de la placa
corresponden a 0,15 m. La expresión que modela este
̇( ) () () () ( ) fenómeno es la mostrada en (13).
( ) ̇( ) () () () ( ) ( )
Remplazando el modelo aproximado por su
aproximación ( ) en (7) se tiene: La densidad ( ) de la placa es de 2700 , la capacidad
̇( ) () () () ( )
() () ( )
en donde , y .
5. Validación del modelo de orden reducido. Fig. 1: Geometría de la placa con cuatro entradas
325
CHAPTER 10. NONLINEAR SYSTEMS
() ( ) () ( )
( )
() ( ) () ( ) ( )
( )
En la simulación de este fenómeno, las derivadas parciales En la tabla 1, se presentan los primeros 10 valores singulares
con respecto a las dimensiones (x e y) se remplazan por una para los dos modelos de orden reducido en mención,
aproximación con diferencias finitas, este sistema adquiere la mostrando igualdad para ambos órdenes de almacenamiento.
forma matricial presentada en (14). Donde ( ) es el vector Lo anterior, implica que independientemente del tipo de
de temperaturas de todos los puntos de la malla en el instante ordenamiento utilizado en los nodos, los valores obtenidos
t y ( ) es un vector con las entradas del sistema. mediante el método DOP son iguales.
b)
326
CHAPTER 10. NONLINEAR SYSTEMS
Valor U
modelo de orden reducido representa de manera adecuada el -0.0416
-0.0422
30
350
25
20
340 20
15
Temperatura - K
10 10
330
5
320 Coordenada en y 0 0
Coordenada en x
310
300
b)
0
10
-0.041
15 -0.0412
20 -0.0414
25
20
Valor U
10 15
Coordenada en x 25 5 -0.0416
0
Coordenada en y
-0.0418
b) -0.042
-0.0422
30
25
20
20
0.4 15
10 10
5
0.2 0 0
Error relativo % en T
Coordenada en y Coordenada en x
0
Fig. 5: Comparación primer modo U1
-0.2
-0.4 a)
-0.6
30
25
20
20
15 0.15
10 10
5
Coordenada en y 0 0
Coordenada en x 0.1
0.05
puede notar que ambos modos son iguales para los dos 20
20
25
5
independientemente del tipo de ordenamiento utilizado en los Coordenada en y 0 0
Coordenada en x
nodos, los valores singulares y modos obtenidos mediante el
método DOP son iguales. b)
a)
327
CHAPTER 10. NONLINEAR SYSTEMS
0.05
0
donde ( ) es el vector de temperaturas de todos los puntos
de la malla en el instante t y ( ) es un vector con las
-0.05 entradas del sistema. En este caso las entradas del sistema son
30
25
la fuente de calor y la temperatura ambiente . Este
20
15
20 sistema EDO es de orden 3648.
10 10
5
Coordenada en y 0 0
Coordenada en x
( )
328
CHAPTER 10. NONLINEAR SYSTEMS
330
Temperatura - K
325
320
0.1
0.08
0.06
0.04
0.02
0.12
coordenada en y-m 0.1
0.08
0.06
0 0.04
0.02
0 coordenada en x - m
a)
325
a)
320
0.1
0.08
0.06
0.04
0.02
0.12
coordenada en y-m 0.1
0.08
0.06
0 0.04
0.02
0 coordenada en x - m
b)
329
CHAPTER 10. NONLINEAR SYSTEMS
325
Temperatura - K
320
315
0.1
0.08
0.06
0.04 0.12
0.1
0.08
0.02 0.06
coordenada en y - m 0.04
0 0.02
0 coordenada en x - m
b)
335
Temperatura - K
330
325
320
0.1
b) 0.08
0.06
0.12
0.04 0.1
0.08
0.02 0.06
0.04
coordenada en y - m 0.02
0 0 coordenada en x - m
a)
330
Temperatura - K
325
a)
b)
330
CHAPTER 10. NONLINEAR SYSTEMS
331
CHAPTER 10. NONLINEAR SYSTEMS
332
CHAPTER 11
OBSERVERS
CHAPTER 11. OBSERVERS
Abstract: This paper presents a soft sensor to estimate the biomass concentration in a batch
bioprocess used in production of δ-endotoxins of Bacillus thuringiensis, subject to delayed
measurements. The soft sensor proposed is based on a cascade observer-predictor algorithm.
The observer stage is based on a class of second order sliding mode algorithms, allowing a fixed-
time estimation of the biomass. Additionally, the prediction stage offsets the effect of the delay
in measurements. Simulations show the feasibility of the proposed observer.
334
CHAPTER 11. OBSERVERS
preliminaries in order to introduce the basics of fixed time ẋ1 (t) = b1 (x1 (t), x2 (t))x3 (t)
stability and predictor stability. The Section 4 presents ẋ2 (t) = b21 (x1 (t), x2 (t))x3 (t)
simulation results of the cascade observer-predictor for + f2 (x2 (t)) + b22 x4 (t) (3)
the δ-endotoxins production process of Bt. Finally, the
conclusions of this paper are exposed in the Section 5. ẋ3 (t) = b3 (x1 (t), x2 (t))x3 (t)
ẋ4 (t) = b4 (x1 (t))x3 (t)
where
2. BATCH PROCESS MODEL WITH DELAYED µ(x1 (t), x2 (t))
b1 (x1 (t), x2 (t)) = − + ms
MEASUREMENT yx/s
∗
f2 (x2 (t)) = K3 QA (od − x2 (t))
The model of the δ-endotoxins production of Bt proposed b21 (x1 (t), x2 (t)) = −K1 (µ(x1 (t), x2 (t)) − ke (t)) − K2
on (Amicarelli et al., 2010; Rómoli et al., 2016) is used. In b22 = −K2
this paper the block-wise form of the equations is modified
to allow a straightforward design of a second order sliding b3 (x1 (t), x2 (t)) = µ(x1 (t), x2 (t)) − ks (x1 (t)) − ke (t)
mode observer. The model equations are b4 (x1 (t)) = ks (x1 (t))
(4)
µ(sp , od )
ṡp = − + ms xv and with the measurements
yx/s T
y (t) = [x1 (t − τ ) x2 (t − τ )] (5)
ȯd = K3 QA (o∗d − od ) − K1 (µ(sp , od ) − ke (t)) xv
(1) The block-wise form (3)-(5) allows a straightforward
− K2 (xv + xs )
design of a second order sliding mode observer. The
ẋv = (µ − ks (sp ) − ke (t)) xv nominal parameters for the system (3) are given in Table
ẋs = ks xv 1.
where sp is the substrate concentration, od is the Table 1. Nominal Parameters of the BT model.
dissolved oxygen concentration, xv is the vegetative cells
Parameter Values Unit
concentration, xs is the sporulated cells concentration, µ
µmax 0.65 h−1
is the specific growth rate, yx/s is the growth yield, ms is
yx/s 0.37 g · g−1
the maintenance constant, QA is the airflow that enters Ks 3 g · L−1
the bio-reactor, o∗d is the oxygen saturation concentration, Ko 1 × 10−4 g · L−1
K1 is the oxygen consumption dimensionless constant ms 5 × 10−3 g · g−1 · h−1
by growth, K2 is the oxygen consumption constant for ks,max 0.5 h−1
maintenance, K3 is the ventilation constant, ks is the Gs 1 g · L−1
spore formation kinetics and ke (t) is the specific cell death Ps 1 g · L−1
rate. Furthermore, the constitutive equations for µ(sp , od ) ke,max 0.1 h−1
(Monod-based), ks (sp ) and ke (t) are given by: Ge 5 h
Pe 4.9 h
sp od K1 3.795 × 10−3 dimensionless
µ(sp , od ) = µmax
Ks + sp Ko + od K2 0.729 × 10−3 h−1
K3 2.114 × 10−3 L−1
1 1
ks (sp ) = ks,max − QA 1320 L · h−1
1 + eGs (sp −Ps ) 1 + eGs (sp,ini −Ps ) o∗d 0.00759 g · L−1
1 1 tini 0 h
ke (t) = ke,max − sp,ini 32 g · L−1
1 + e−Ge (t−Pe ) 1 + e−Ge (tini −Pe )
(2)
3. PROPOSED SOFT SENSOR SCHEME
where µmax is the maximum specific growth rate, Ks is the
substrate saturation constant, Ko is the oxygen saturation
3.1 Observability Analysis
constant,ks,max is the maximum spore formation, ke,max
is the maximum specific cell death rate, Gs is the gain
constant of the sigmoid equation for spore formation rate, Let the vector H which contains the measured outputs of
Ge is the gain constant of the sigmoid equation for specific the system (3), x1 (t − τ ), x2 (t − τ ) and their derivatives
cell death rate, Ps is the position constant of the sigmoid be defined as
T
equation for spore formation rate, Pe is the position H = [ x1 (t − τ ) x2 (t − τ ) ẋ1 (t − τ ) ẋ2 (t − τ ) ] (6)
constant of the sigmoid equation for specific cell death Similarly to the analysis presented in Sánchez et al.
rate, sp,ini is the initial glucose concentration and tini is (2015), the observability analysis for the system (3)
the initial fermentation time. determines the existence of a diffeomorphism between
Assumption 2.1. It is assumed that the measurements of the vector H and the delayed state vector x =
T
the outputs sp and od are continuously measured with a [ x1 (t − τ ) x2 (t − τ ) x3 (t − τ ) x4 (t − τ ) ] .
delay time τ > 0. The delay τ is considered to be known
and constant. The existence of this diffeomorphism can be evaluated,
at least locally, by checking if the observability matrix
∂H
Defining x1 = sp , x2 = od , x3 = xv , x4 = xs and defined as O = ∂x(t−τ ) is invertible. For the system (3),
considering the Assumption 2.1, the model (1) can be the observability matrix is calculated from (6) and is given
written as: by
335
CHAPTER 11. OBSERVERS
10 0 0 x̂τ1 = β1−1 x̂τ1s
0 1 0 0
x̂τ2 = β2−1 x̂τ2s
O= (7)
∗ ∗ b1 (x1 (t − τ ), x2 (t − τ )) 0
∗ ∗ b21 (x1 (t − τ ), x2 (t − τ )) b22 x̂˙ τ1s = bs1 (x̂τ1 , x̂τ2 )x̂τ3 + k11 φ1 (x̃τ1s )
(10)
where it follows that the determinant of (7) is det(O) = x̂˙ τ2s = f2s (x̂τ2 ) + bs21 (x̂τ1 , x̂τ2 )x̂τ3 + bs22 x̂τ4 + k21 φ1 (x̃τ2s )
b22 b1 (x1 (t − τ ), x2 (t − τ )). Therefore, this system is −1
x̂˙ τ = b3 (x̂τ , x̂τ )x̂τ + k12 [bs (x̂τ , x̂τ )] φ2 (x̃τ )
3 1 2 3 1 1 2 1s
observable for t ≥ τ . However, it can be shown that
−1
|det(O)| achieves a very small value (about 1 × 10−9 ), x̂˙ τ4 = b4 (x̂τ1 )x̂τ3 + k22 [bs22 ] φ2 (x̃τ2s )
which compromises the numerical invertibility of the where x̂τ1 , x̂τ2 , x̂τ1s , x̂τ2s , x̂τ3 and x̂τ4 are the estimates of xτ1 ,
observability matrix O (Sánchez et al., 2015). xτ2 , xτ1s , xτ2s , xτ3 and xτ4 , respectively; x̃τ1s = xτ1s − x̂τ1s and
To overcome this numerical drawback, the following x̃τ2s = xτ2s − x̂τ2s are the error variables; the observer input
1
scaling transformation of the state is proposed: injections φ1 (·) and φ2 (·) are of the form φ1 (·) = b·e 2 +
3
x1s (t − τ ) = β1 x1 (t − τ ) θb·e 2 and φ2 (·) = 12 b·e0 +2θ·+ 23 θ2 b·e2 , with the parameter
(8) θ ≥ 0, the function b·eα = |·|α sign(·) is defined for α ≥ 0,
x2s (t − τ ) = β2 x2 (t − τ )
where sign(x) = 1 for x > 0, sign(x) = −1 for x < 0 and
with β1 and β2 real positive constants to be defined sign(0) ∈ {−1, 1}; and λ1 , λ2 > 0, and k11 , k12 , k21 , k22
thereafter. are the observer positive gains.
Thus, using the notation xτi = xi (t − τ ) for i = 1, . . . , 4, The SMO (10) was proposed in a previous paper (Sánchez
the system (3) under the scaling (8) becomes: et al., 2015). This observer is fixed-time convergent
ẋτ1s = bs1 (xτ1 , xτ2 )xτ3 and also has time-invariance property, according to the
ẋτ2s = f2s (xτ2 ) + bs21 (xτ1 , xτ2 )x3 + bs22 xτ4 definition of Khosravian et al. (2015a). A detailed stability
(9) test of observer (10) without delay in measurements has
ẋτ3 = b3 (xτ1 , xτ2 )xτ3
ẋτ4 = b4 (xτ1 )xτ3 been previously published (Sánchez et al., 2015). However,
the problem considered in this paper is to estimate the
where bs1 (xτ1 , xτ2 ) = β1 b1 (xτ1 , xτ2 ), f2s (xτ2 ) = β2 f2 (xτ2 ), current state x (t) when the measurements of the output
bs21 (xτ1 , xτ2 ) = β2 b21 (xτ1 , xτ2 ) and bs22 = β2 b22 . are delayed such that the output measurement at time t is
y (t) = h (x (t − τ )) for some know constant delay τ ≥ 0.
3.2 Observer-Predictor Scheme In this sense a prediction stage it is proposed to offset the
effect of the delay in the measurement.
In this section a cascade observer-predictor scheme is rep-
resented. Based in the structure presented in Khosravian Prediction Stage (Model + Delay): Second, based on
et al. (2015b), the proposed scheme is composed for a SMO (Khosravian et al., 2015a) a Smith predictor compensating
and Smith predictor. A block diagram of this proposal is the delay may be considered as
shown in Figure 1. In this figure the sensor block separately ẋp (t) = x̂˙ τ (t) + f (xp (t)) − f (xp (t − τ )) (11)
block process is proposed to clarify, in this paper, the
problem of delay occurs in the dynamics of the sensor. where the prediction of the current state is denoted by
xp ∈ Rn and x̂τ is the estimate x subject to delayed output
measurements (5). Moreover, with the system model (3)
and the known delay τ for output measurement (5), it
is possible to know the dynamics of the predicted states
without delay f (xp (t)) and delayed f (xp (t − τ )).
The stability of the Observer-Predictor structure is
such that the estimate state converge asymptoti-
cally/exponentially to the system trajectories (1)-(2), if
the estimates provided by the Observer (SMO) converge
asymptotically/exponentially to the delayed system state
(Khosravian et al., 2015a). In this sense the definition of fi-
nite time convergent include asymptotically/exponentially
convergent and fixed-time convergent of SMO (10) is a
stronger form of finite time (Polyakov, 2012). In the next
section the simulation results are presented.
4. SIMULATION RESULTS
Figure 1. Observer-Predictor scheme This section presents the numerical simulation results
of the proposed estimation structure. The simulations
parameters were:
An explanation of the scheme of Figure 1 is as follows.
• Fundamental step size of 1 × 10−5 [h]. This time is
Observation Stage (SM Observer): First, from (8)-(9) small due to requirement of robust differentiation in
the following Sliding Mode Observer is proposed in order the estimation scheme.
to provide an estimation of the delayed state variables: • Model parameters like shown on Table 1.
336
CHAPTER 11. OBSERVERS
35 −3 −3
x 10 x 10
7.41
32
30 x1
8 x2
x estimated 7.4
1
337
CHAPTER 11. OBSERVERS
10 1 10 10
x4 x4
8 0.8 8 8
x4 estimated x4 estimated
sporulated cells concentration [g/L]
4 0.4 4 4
0.2
2 2 2
0
0 0 0
−0.2
−2 −2 −2
0 2 4 6 8 10 0 0.1 0.2 0 2 4 6 8 10 0 0.5
time [h] time [h] time [h] time [h]
Figure 5. Sporulated cells concentration xs with τ = 5 × Figure 7. Sporulated cells concentration xs with τ =
10−5 [h]. 0.1[h].
−3 −4
12 12 x 10 x 10
2 2
10 10 1.8 x3 estimated
x3 predicted
vegetative cells concentration [g/L]
1.6
8 8
1.4
6 6 1.2
1 1
4 4
0.8
2 2 0.6
x3
0.4
0 x3 estimated 0
x3 predicted 0.2
−2 −2
0 2 4 6 8 10 0 0.2 0.4 0 0
time [h] time [h] 0 0.002 0.004 0.006 0.008 0.01 0 0.5 1
Delay [h] −3
x 10
Figure 6. Vegetative cells concentration xv with τ = 0.1[h].
Figure 8. Delay effect to estimate xv .
simulations shown the feasibility of the cascade observer- nonlinear state estimation. Journal of Process Control,
predictor proposed. 21(1), 119–129.
Guo, Y. and Huang, B. (2015). State estima-
REFERENCES tion incorporating infrequent, delayed and integral
measurements. Automatica, 58, 32–38. doi:
Amicarelli, A., di Sciascio, F., Toibero, J.M., and Alvarez, 10.1016/j.automatica.2015.05.001.
H. (2010). Including Dissolved Oxygen Dynamics into Guo, Y., Zhao, Y., and Huang, B. (2014). Development
the Bt delta-Endotoxins Production Process Model and of soft sensor by incorporating the delayed infrequent
its Application to Process Control. Brazilian Journal of and irregular measurements. Journal of Process Control,
Chemical Engineering, 27(1), 41–62. 24(11), 1733–1739.
Bequette, B. (2002). Behavior of a CSTR with a Khosravian, A., Trumpf, J., and Mahony, R. (2015a).
recirculating jacket heat transfer system. In Proceedings State Estimation for Nonlinear Systems with Delayed
of the American Control Conference., volume 4, 3275 – Output Measurements. (Cdc), 6330–6335. doi:
3280. 10.1016/j.automatica.2016.01.024.
Dochain, D. (2003). State and parameter estimation in Khosravian, A., Trumpf, J., Mahony, R., and Hamel, T.
chemical and biochemical processes: a tutorial. Journal (2015b). State Estimation for Invariant Systems on Lie
of Process Control, 13(8), 801 – 818. Groups with Delayed Output. Automatica, 68, 254–265.
Gopalakrishnan, A., Kaisare, N.S., and Narasimhan, Polyakov, A. (2012). Nonlinear feedback design for fixed-
S. (2011). Incorporating delayed and infrequent time stabilization of linear control systems. IEEE
measurements in Extended Kalman Filter based Transactions on Automatic Control, 57(8), 2106–2110.
338
CHAPTER 11. OBSERVERS
−3
x 10
0.02 2
0.016 1.6
0.014 1.4
0.012 1.2
0.01 1
0.008 0.8
0.006 0.6
0.004 0.4
0.002 0.2
0 0
0 0.002 0.004 0.006 0.008 0.01 0 0.5 1
Delay [h] −3
x 10
339
CHAPTER 11. OBSERVERS
Resumen: Se propone una estrategia de control para un motor diésel con turbocompresor de
geometrı́a variable basada en retroalimentación estática de los estados. La técnica utilizada
permite evitar el uso de la técnica de extensión dinámica del sistema para linealización exacta.
Adicionalmente se propone un estimador de par de carga. El desempeño del controlador y el
estimador propuestos se verifica por medio de simulaciones en el modelo no lineal del sistema
motor-turbocompresor.
340
CHAPTER 11. OBSERVERS
En la literatura aparecen estrategias para el control del presentados en Outbib et al. (2006) y Marcelin et al.
sistema de alimentación de aire de motores diésel turbo- (2009). Los autores muestran sus resultados por medio de
cargados separado del control de la velocidad. simulaciones numéricas.
Existen diferentes controladores para regular el aire ali- En este trabajo que se propone un controlador para
mentado al motor diésel por medio de un sistema turbo- regular la velocidad de un motor diésel, la relación aire
cargador. En Jankovic et al. (2000) se emplea linealización combustible AFR y el flujo de gases de escape de manera
entrada-salida y diseño de Lyapunov para la obtención simultánea con cambios en el par de carga. Además, se
de un controlador no lineal. En Nieuwstadt et al. (2000) propone una estimador de par de carga para el motor
se presentan controladores PI descentralizados con la es- diésel.
timación de variables como la presión. En los dos casos
anteriores se presentan resultados experimentales. El controlador propuesto en este artı́culo se basa en
el trabajo presentado en Rodriguez et al. (2006) en
En Larsen et al. (2000) se utiliza un enfoque de pasivación donde se muestra una forma alterna para linealización
indirecta utilizando un par entrada-salida para estabilizar entrada-salida de sistemas linealizables por retroalimen-
la dinámica cero del sistema y por medio de otro par en- tación dinámica del estado.
trada salida se realiza la pasivación, los autores presentan
resultados en simulación. En Upadhyay et al. (2002) se El estimador propuesto se basa en la técnica de inmersión
diseña un controlador basado en modos deslizantes para e invarianza, Astolfi et al. (2008). La prueba de estabilidad
el control de la inyección de aire, donde el desempeño del se realiza utilizando la aproximación lineal de la dinámica
controlador se evalúa mediante simulaciones. de lazo cerrado.
En Ayadi and Houcine (2004) se linealiza el modelo del Este artı́culo se organiza como sigue. En la sección 2 se
turbocompresor alrededor de un punto de operación para presenta una descripción del modelo de un motor diésel
diseñar un controlador basado en la propiedad de planitud equipado con un turbocompresor de geometrı́a variable. La
diferencial con respecto a la salida definida por el error en sección 3 discute el diseño del controlador propuesto. En
la presión del múltiple de escape. Los autores presentan la sección 4 se presenta un estimador para el par de carga
resultados en simulación. En Ortner and del Re (2007) del motor. La sección 5 muestra los resultados obtenidos
se utilizó control predictivo para optimizar el sistema mediante simulación. Finalmente la sección 6 presenta las
de aire del motor, en este artı́culo presentan resultados conclusiones de este trabajo.
experimentales. En Plianos et al. (2007) se propone un
2. MODELO DEL SISTEMA
controlador por medio del enfoque clásico de linealización
MOTOR-TURBOCOMPRESOR DE GEOMETRÍA
por realimentación dinámica de estados.
VARIABLE
Para realizar el control de velocidad del motor diésel en
Outbib et al. (2002) se presenta un controlador no lineal El modelo del sistema motor-turbocompresor que se utiliza
por diseño de Lyapunov donde se utiliza del flujo de com- en este artı́culo se obtiene de los modelos utilizados en
bustible como variable de control. En Song and Grigoriadis Jankovic et al. (2000) para el control de flujo de aire y
(2003) se controla la velocidad de un motor diesel con en Outbib et al. (2006) para el control de velocidad en
modelo simplificado en presencia de perturbaciones en el motores de combustión interna diésel.
par de carga y con retardos de transporte. El esquema
de diseño de control propuesto se basa en ganancias pro- Al utilizar la ley de conservación de la materia en el
gramadas y un enfoque de parámetros lineales variables múltiple de admisión se puede escribir
(LPV) y desigualdades matriciales (LMI’s). d
ma = Wc − ṁao (1)
dt
En Wahlström and Nielsen (2010) se presenta una estra- donde Wc es el flujo de aire que entra desde el compresor
tegia que toma en cuenta el control de la velocidad y el al múltiple de admisión y ṁao es el flujo de aire que se
sistema de alimentación de aire del motor. Se utilizan bombea dentro de la cámara de combustión. Aplicando la
controladores PID para regular directamente los valores ley de gases ideales en el múltiple de admisión
deseados de la fracción de gases de escape recirculados
(EGR) y la relación aire combustible (AFR). A diferencia
de otras estrategias en donde se controlan de forma indirec- p1 V1
ma = (2)
ta a través de la presión en los múltiples de admisión y de rT1
escape. Los autores de este trabajo proponen una función donde ma es la masa de aire, r es la constante para el aire,
de costo para sintonizar las ganancias del controlador PID. V1 , p1 , T1 son el volumen, la presión y la temperatura del
múltiple de admisión, respectivamente. ṁao está dada por
En Mendoza-Soto and Alvarez-Icaza (2012) se controla la expresión
en conjunto la velocidad del motor diésel y el sistema de
alimentación de aire utilizando una combinación de control
predictivo generalizado y linealización entrada-estado. En ṁao = ηv (ṁao )th (3)
dicho trabajo se utiliza un modelo que toma en cuenta la con
dinámica de velocidad y la alimentación de aire.
En Haoping et al. (2014) los autores utilizan una varia- nVcy ωp1
(ṁao )th = (4)
ción del modelo utilizado en Mendoza-Soto and Alvarez- 4πrT1
Icaza (2012) y usan una combinación de los controladores donde ηv es la eficiencia volumétrica, ω es la velocidad
angular del motor, Vcy es el volumen de cada cilindro, p1
341
CHAPTER 11. OBSERVERS
342
CHAPTER 11. OBSERVERS
p1 p2 ỹ1 = y1 − ȳ1 (33)
Ω= ω > 0, > 1, > 1, Pc > 0 (21)
pa pa ỹ2 = y2 − ȳ2 (34)
El objetivo del controlador es regular la relación aire- ỹ3 = y3 − ȳ3 (35)
combustible AF R , la fracción de gases de escape recircu- Al derivar respecto al tiempo la ecuación (33) y conside-
lados EGR y la velocidad angular del motor a valores de rando que las señales de referencia ȳ son constantes para
operación deseados. Las señales de control deseadas que el caso de regulación se obtiene
producen los valores de AF R y EGR deseados se obtienen
por medio de · q3 (ω, p1 )
ū2 = Γ1 Γ2 ū1 (22) ỹ 1 = q1 (ω) u1 + q2 (ω, p1 ) + (36)
u1
ū3 = (Γ1 + 1) ū1 (23) donde
q
Γ1a + Γ1a 2 + 4(1 − EGR )AF R
Γ1 = (24) k1
2 q1 (ω) = (37)
ω
Γ1a = AF R (1 − EGR ) + 15.6EGR − 1 q2 (ω, p1 ) = (k3 p1 ηv (ω) − k4 τL ) (38)
EGR q3 (ω, p1 ) = k2 ηv2 (ω) ωp21 (39)
Γ2 = (25)
1 − EGR para el proceso de realimentación estática como se utiliza
Dada una velocidad angular ω̄ deseada, un par de carga en Outbib et al. (2006) y como se propone en Rodriguez
τL , los puntos de equilibrio del sistema (17) − (20) y las et al. (2006) el control u1 puede proponerse de tal forma
ecuaciones (22) − (25) se obtiene el flujo de combustible que la estructura de lazo cerrado resulta con una estruc-
y el valor de las variables de estado que producen AF R y tura lineal en términos de una ganancia K1 entonces
EGR como · q3 (ω, p1 )
ke2 k4 ω̄τL ỹ 1 = q1 (ω) u1 + q2 (ω, p1 ) + = −K1 ỹ1 (40)
ū1 = (26) u1
2
k1 ke2 + k2 (Γ1 Γ2 + Γ1 ) + k3 ke (Γ1 Γ2 + Γ1 ) de donde resulta la señal de control
−ū1 + ū2 + ū3
p̄1 = (27) q
ke ηv (ω̄) ω̄
2
− µ1 − (q2 + K1 ỹ1 ) + (q2 + K1 ỹ1 ) − 4q1 q3
k7 u1 = (41)
p̄2 = pa 1 − P̄c (28) 2q1
k8 ū3
µ con K1 > 0.
(−ū2 + ke ηv (ω̄) ω̄ p̄1 ) p̄1
P̄c = −1 (29)
kc pa Para el diseño de la señal de control u2 se toma la ecuación
donde los términos con barra superior representan los (34) y se obtiene la primera derivada respecto al tiempo
valores de referencia para cada una de las variables.
· Pc
3. FORMULACIÓN DEL CONTROL ỹ 2 = k5 kc µ + u2 − ks p1 (42)
p1
pa −1
El sistema dinámico dado por las ecuaciones (17) − (20)
definiendo la dinámica
es linealizable por retroalimentación dinámica del estado
Mendoza-Soto and Alvarez-Icaza (2012).
· Pc
El método de control que se presenta en Rodriguez et al. ỹ 2 = k5 kc µ + k5 u2 − k5 ks p1 = −K2 ỹ2 (43)
p1
(2006) es un alternativa para controlar por retroalimen- pa −1
tación estática en sistemas que son linealizables entrada-
salida utilizando retroalimentación dinámica del estado. entonces se puede proponer la señal u2 como
Entonces, de acuerdo a Rodriguez et al. (2006) existe una
ley de control de realimentación de estados estática linea- K2 Pc
lizante entrada-salida para el sistema dinámico (17)−(20). u2 = − ỹ2 − kc µ + ks p1 (44)
k5 p1
−1pa
Para el desarrollo del controlador por retroalimentación
con K2 > 0.
estática se toman como salidas
La señal de control u3 se diseña a partir de la ecuación
(35). Al derivar la salida ỹ3 una vez respecto al tiempo
y1 = ω (30) resulta
y2 = p1 (31)
1−µ
k8 k8 pa p2
y 3 = Pc + p2 − (32) · 1
k6 k6 1 − µ p a ỹ 3 = k8 1 − µ (ks p1 + u1 − u2 ) − k7 Pc (45)
p2
donde la salida y3 se toma como se propone en Marcelin pa
et al. (2009).
que no depende de la señal de control u3 por lo que
Proponiendo las señales de salida en términos de las se requiere derivar nuevamente respecto al tiempo para
señales de error obtener
343
CHAPTER 11. OBSERVERS
·· · dβ (ω)
ỹ 3 = ϕ1 + ϕ2 + ϕ3 − ϕ4 u3 = −2K3 ỹ 3 − K32 ỹ3 (51) ż = − z (61)
dω
de donde resulta Al definir
·
2K3 ỹ 3 + K32 ỹ3 + ϕ1 + ϕ2 + ϕ3
u3 = (52) dβ (ω)
ϕ4 =Γ (62)
con K3 > 0. dω
con Γ > 0 se obtiene una dinámica asintóticamente estable
En este punto es posible expresar el resultado principal de para z
este artı́culo.
P roposición 1: Considere el sistema dinámico (17) − (20) ż = −Γz (63)
en lazo cerrado con las leyes de control definidas en (41),
entonces el par de carga del motor puede obtenerse a partir
(44) y (52). Entonces existen ganancias K1 , K2 y K3 tales
de
que la dinámica en lazo cerrado es asintóticamente estable
en el conjunto abierto (21).
k4 τ̂L − β (ω)
Demostración : En términos de las coordenadas de error τL = (64)
k4
en la salida la dinámica en lazo cerrado puede expresarse
por las ecuaciones siguientes donde β (ω) = Γω, y τ̂L puede calcularse al resolver la
· ecuación (59).
ỹ 1 + K1 ỹ1 = 0 (53)
Al sustituir el par de carga en la dinámica del motor
·
ỹ 2 + K2 ỹ2 = 0 (54) en lazo cerrado con el controlador definido por (41), (44)
·· · y (52) por el par de carga estimado se obtiene la forma
ỹ 3 + 2K3 ỹ 3 + K32 ỹ3 = 0 (55) siguiente
donde es evidente que con K1 > 0, K2 > 0 y K3 > 0 se
obtiene un sistema en lazo cerrado asintóticamente estable ẋ = f (x, β (x, z)) (65)
en el conjunto abierto (21). T
con x = [ ω p1 p2 Pc ] . La aproximación lineal de (65) y
T
4. ESTIMACIÓN DEL PAR DE CARGA (61) alrededor del punto de equilibrio x̄ = ω̄ p̄1 p̄2 P̄c
y z = 0 queda como
Se define el error de estimación de par de carga como sigue
z = k4 τL − k4 τ̂L + β (ω) (56) ẋδ = Axδ + Bzδ (66)
donde k4 τ̂L + β (ω) es el valor estimado del par de carga żδ = −Γzδ (67)
y β (ω) es una función de la velocidad. De acuerdo a la
técnica de inmersión e invariancia, el objetivo es ahora
garantizar que el error de par converja a cero. Al obtener Note que el sistema (66) − (67) es un sistema interconec-
la derivada de z respecto al tiempo tado con Bzδ como el término de interconexión. Dado que
A es Hurwitz es posible utilizar la proposición 4.10 de
· dβ (ω) R. Sepulchre and Kokotovic (1997) para concluir estabili-
ż = −k4 τ̂ L + ω̇ (57) dad asintótica local.
dω
344
CHAPTER 11. OBSERVERS
5. RESULTADOS EN SIMULACIÓN
Flujo de combustible
0.03
Para la simulación de los resultados se utilizó Matlab. La
u 1 [kg/s]
0.02
simulación del controlador se realizó tomando un tiempo
de muestreo de 1 ms y asignando valores de ganancia 0.01
K1 = 1, K2 = 1.5, K3 = 3 y Γ = 10. 0
0 5 10 15
u 2 [kg/s]
y en lı́nea punteada el valor de la referencia para cada 0.05
estado. Se dan cambios escalón cada 5 segundos en el par 0
de carga aplicado al motor para observar el desempeño del -0.05
control. El par de carga aplicado es inicialmente de 1000 0 5 10 15
u 3 [kg/s]
se ajusta el valor de EGR = 0 y la relación AF R = 21 y 0.2
20
Velocidad angular
AFR
104.75
10
ω [rad/s]
104.7
0
0 5 10 15
104.65 tiempo [s]
104.6
0 5 10 15
6 1100
p 2 [Pa]
5 1000
4 900
0 5 10 15
3
0 5 10 15 tiempo [s]
3
EGR . En esta prueba se ha mantenido constante el par
2
de carga en 1000 N m y una velocidad de referencia
constante en 1000 rpm, mientras se aplican cambios en
1
0 5 10 15 la fracción de gases de escape recirculados de 0, 0.1 y
tiempo [s]
0.2 cada 5 segundos. Se observa que al incrementar la
cantidad de gases de escape recirculados la potencia en
el compresor empieza a reducirse debido a la disminución
Figura 2. Estados con cambios en el par de carga de los gases que impulsan la turbina. En la figura (5)
se observa el comportamiento de las señales de control
obtenidas. Se observan etapas transitorias al momento de al aplicar cambios en el flujo EGR . También se observa
aplicar cambios del par de carga. También se observa el que la relación aire combustible no puede mantenerse en
comportamiento de la relación aire combustible ante los condiciones de recirculación de gases de escape. La relación
cambios en la carga del motor. En todos los casos las AF R solo puede mantenerse constante si se incrementa
señales llegan a su valor de referencia. En la figura (4) se el flujo de gases de escape que impulsa la turbina, lo
muestra el comportamiento del estimador de par de carga que ocurre cuando no hay gases de escape recirculados
propuesto en la sección IV . El par de carga estimado τ̂m se al múltiple de admisión.
presenta en lı́nea continua y el par de carga τm a estimar
en lı́nea discontinua. Se observa que el valor estimado llega 6. CONCLUSIÓN
al valor real para diferentes cambios escalón en el par de
carga.
En este trabajo se ha propuesto un controlador para un
La figura (6) se muestra el comportamiento de los estados motor Diésel con sistema turbocompresor y de recircula-
para cambios en la fracción de gases de escape recirculados ción de gases de escape utilizando una estrategia de re-
345
CHAPTER 11. OBSERVERS
u 1 [kg/s]
104.7199 0.02
104.7198 0.01
104.7197 0
0 5 10 15 0 5 10 15
3.6
u 2 [kg/s]
0.05
p 1 [Pa]
3.4
0
3.2
3 -0.05
0 5 10 15 0 5 10 15
4.2
u 3 [kg/s]
0.16
p 2 [Pa]
4
0.14
3.8
3.6 0.12
0 5 10 15 0 5 10 15
2.2
22
P c [W]
AFR
2
20
1.8
1.6 18
0 5 10 15 0 5 10 15
tiempo [s] tiempo [s]
Figura 5. Estados con cambios de EGR Figura 6. Señales de control y relación aire combustible
troalimentación estática de los estados. Adicionalmente se a vehicle diesel engine speed and air path. Journal of
ha propuesto un estimador de par de carga por el método Dynamic Systems, Measurement, and Control, 136, 1–7.
de inmersión e invarianza. Las simulaciones muestran el Jankovic, M., Jankovic, M., and Kolmanovsky (2000).
desempeño del controlador propuesto funcionando en lazo Constructive Lyapunov control design for turbocharged
cerrado con el estimador de par. El esquema de control diesel engines. IEEE Transactions on Control Systems
utilizado muestra un buen desempeño para alcanzar las Technology, 8(2), 288–299.
referencias cuando se presentan cambios de tipo escalón Larsen, M., Jankovic, M., and Kokotovic, P.V. (2000).
en el par o cambios en las señales de referencia. Indirect passivation design for a diesel engine model. In
pth = 43 M J T2 = 550 K τ = 0.15 s Proceedings of the 2000 IEEE International Conferen-
pa = 100000 P a Vcy = 0.0018 m3 n=5 ce on Control Applications, 25–27. Anchorage, Alaska
V2 = 0.001 m3 V1 = 0.005 m3 ηt = 0.9 USA.
bλ = 0.01543 cλ = −0.00014073 µ = 0.285 Marcelin, D., Nicolas, L., and Houcine, C. (2009). Dyna-
aλ = 0.0371 J = 4000 kg · m2 ηc = 0.9 mic feedback linearization applied to asymptotic trac-
J
cp = 1012.2 kgK r = 287 k JK
g
ηm = 0.8 king: generalization about the turbocharged diesel en-
α0 = 7.73 × 10−1 α1 = −1.54 × 10−3 gine outputs choice. In American Control Conference,
α2 = 2.49 × 10−6 T1 = Ta = 300 K 3458–3463. St. Louis, MO, USA.
Cuadro 1. Parámetros del motor diésel. Mendoza-Soto, J.L. and Alvarez-Icaza, L. (2012). Genera-
lized predictive control of a turbocharged diesel engine.
In American Control Conference, 5725–5730. Montréal,
REFERENCIAS Canada.
Nieuwstadt, M., Kolmanovsky, I., Stefanopoulou, and
Astolfi, A., Karagiannis, D., and Ortega, R. (2008). Nonli- A. Jankovic, M. (2000). EGR-VGT control schemes:
near and Adaptive Control with Applications. Springer- experimental comparison for a high-speed diesel engine.
Verlag., London. IEEE Control Systems Magazine, 20(3), 63–79.
Ayadi, Mounir. Langlois, N. and Houcine, C. (2004). Ortner, P. and del Re, L. (2007). Predictive control of a
Polynomial control of nonlinear turbocharged diesel diesel engine air path. IEEE Transactions on Control
engine model. In Proceedings of the IEEE Conference on Systems Technology, 15(3), 449–456.
Industrial Technology (ICIT), 1384–1389. Hammamet, Outbib, R., Dovifaaz, X., Rachid, A., and Ouladsine, M.
Tunisia. (2002). Speed control of a diesel engine: A nonlinear
Haoping, W., Yang, T., Jerome, B., and Ahmed, E.H. appoach. In Proceedings of the 2002 American Control
(2014). Modeling and dynamical feedback control of Conference, 3293–3294. Anchorage, Alaska USA.
346
CHAPTER 11. OBSERVERS
Outbib, R., X., D., A., R., and M., O. (2006). A theoretical
control strategy for a diesel engine. Journal of Dynamic
Systems, Measurement, and Control, 128, 453–457.
Plianos, A., Achir, A., Stobart, R., Langlois, N., and Cha-
fouk, H. (2007). Dynamic feedback linearization based
control synthesis of the turbocharged diesel engine. In
Proceedings of the 2007 American Control Conference,
4407–4412. New York, USA.
R. Sepulchre, M.J. and Kokotovic, P.V. (1997). Construc-
tive Nonlinear Control. Springer-Verlag, London.
Rodriguez, H., Astolfi, A., and Ortega, R. (2006). On the
construction of static stabilizers and static output trac-
kers for dynamically linearizable systems, related results
and applications. International Journal of Control, 79,
1523–1537.
Song, Q. and Grigoriadis, K.M. (2003). Diesel engine speed
regulation using linear parameter varying control. In
IEEE Proceedings of the American Control Conference,
779–784. Denver, Colorado.
Upadhyay, D., Utkin, V.I., and Giorgio, R. (2002). Mul-
tivariable control design for intake flow regulation of
a diesel engine using sliding mode. In Proceedings of
the 15th Triennial World Congress IFAC, 1389–1394.
Barcelona, Spain.
Wahlström, Johan. Eriksson, L. and Nielsen, L. (2010).
Egr-vgt control and tuning for pumping work minimi-
zation and emission control. IEEE Transactions on
Control Systems Technology, 18, 993–1003.
347
CHAPTER 11. OBSERVERS
Abstract: In this work, a family of non linear observers for linear time invariant
systems is presented. The algorithms have the capability of providing an estimate
of the system internal state in finite time. The observer converges in fixed-time
since it exists an upper bound for the convergence time which is independent of the
initial error. This note also provides a methodology to apply the proposed algorithms
when unknown inputs are present, and how to retain its properties. Simulation
examples are provided to contrast the behavior of the nonlinear observer with a
classic Luenberger one.
348
CHAPTER 11. OBSERVERS
2. ALGORITHM DESCRIPTION Also notice that this terms are only zero if the
estimation error is zero, while the linear term in the
A general LTI system is considered. The system is algorithm becomes zero whenever the estimation
described by error belongs to ker(C).
ẋ = A x + B u(t), This algorithm has the following properties
(1)
y(t) = C x, Theorem 1. Consider system (1) and the algorithm
where x ∈ Rn , u(t) ∈ Rr , and y(t) ∈ Rq . Matrices (2). If the pair (A, C) is observable, then for any
A, B, and C have appropriate dimensions an are t? > t0 N (t) ≥ α(t? )I for t ≥ t? . Fix t? to make α
assumed to be known. The only available signals constant. Then x̂(t) ≡ x(t) for
p1 +1 p2 +1
are the input u(t) and the output y(t). Addition- λM 2 (P )np1 λM 2 (P )np2
?
ally, it is required that the pair (A, C) is observable t≥t + p1 −1
+ p2 −1
,
in order to be able of design an observer with 2 2 (1 − p1 )k1 αp1 +1 2 2 (p2 − 1)k2 αp2 +1
arbitrary speed of convergence. where λM (·) denotes the greatest eigenvalue of
Let x̂(t) be the estimate of x(t). Under the previous its argument. This means that (2) converges in
assumptions, the following is an observer for system finite time. Since the time needed for the estimate
(1): to reach the actual state does not depend on the
initial error, it is said that the algorithm converges
x̂˙ =A x̂ + B u(t) − L(C x̂ − y(t)) in fixed-time. 4
2
X
− P −1 N (t)
p
ki dN (t)x̂ − ψ(t)c i . (2) The larger the value of α, the smaller the time
i=1
needed to converge. Taking t? bigger makes α(t? )
larger; however, the function α(t? ) is upper bounded,
The symbol d·cp takes the signed p power of its
then there exists a value for t? from which α
argument in the scalar case and for vectors it is
does not increase further. The value of α can be
understood element-wise, that is, d·cp = |·|p sign(·).
controlled through the design of K. About this,
The exponents pi have to be chosen such that
it is recommendable to make the real part of the
p1 ∈ [0, 1) and p2 > 1, and the gains ki need
eigenvalues of A + KC close to zero.
only to be positive. Matrix L is designed to make
A − LC Hurwitz, and matrix P is computed from 2.1 Convergence Analysis and Proof of Theorem 1
the algebraic Lyapunov equation
>
A − LC P + P A − LC = −R, R = R> > 0. The convergence analysis of (2) is going to be
Matrix N (t) and signal ψ(t) are related by the handled through the study of the error dynamics.
equality ψ(t) = N (t)x(t), and are computed as For this purpose, let us define the estimation error
as e(t) = x̂(t) − x(t). Taking the time derivative of
d > e(t) yields
N = − A + KC N − N A + KC + C > C,
dt X 2
N (t0 ) = 0, ė = A − LC e − P −1 N (t) ki dN (t)ecpi . (3)
i=1
d
ψ = − (A + KC)> ψ + C > y(t) If the solution e(t) = 0 is attractive and stable
dt then the algorithm converges to x(t). To investigate
− N (t) Ky(t) − Bu(t) ,
that matter, take as Lyapunov function candidate
ψ(t0 ) = 0. V (e) = 21 e> P e, and its derivative along the trajec-
Here, the matrix K has to be proposed such that tories of (3) satisfies
the matrix A + KC is anti-Hurwitz. Taking the 1 X 2
ki
dynamics of N (t), and the derivative of N (t)x(t), V̇ (t) ≤ − e> (t)R e(t) − pi
kN (t)e(t)k1pi +1 ,
the dynamics for ψ(t) is gotten. 2 i=1
n
(4)
Notice that N (t) is the constructibility gramian
(Hespanha, 2009, Chap. 15) of system χ̇ = (A + because
KC)χ with output Cχ. If the pair (A, C) is ob- 1
e> (t)N (t)dN (t)e(t)cpi ≥ pi kN (t)e(t)kp1i +1 .
servable, so is the pair (A + KC, C), since the n
term KCχ represents an output injection. Then From (4) it is clear that e(t) = 0 is asymptotically
N (t) acquires full rank, and becomes bounded from stable. Furthermore, due to the observability of
below, i.e. N (t) ≥ α(t? )I for t ≥ t? > t0 . Since system (1), matrix N (t) is bounded from below by
K is chosen to make the matrix A + KC anti- αI for t ≥ t? for some t? , then
Hurwitz, N (t) will remain bounded (Abou-Kandil ki
et al., 2003, Chap. 1). kN (t)e(t)kp1i +1 ≥
n pi
pi2+1
The property ψ(t) = N (t)x(t), and the full rank ki αpi +1 2 pi +1
349
CHAPTER 11. OBSERVERS
pi2+1 limt→∞ y(t) = 0 implies limt→∞ x(t) = 0 with
ki αpi +1 2 pi +1
V̇ (t) ≤ − V 2 (t), independence of the unknown input and the initial
n pi λM (P ) state. This property can be tested through the
with solution following criteria:
1−p
2
pi −1 i Theorem 2. (Hautus, 1983)(Moreno, 2001) The
1−pi 2 2 (1 − pi )ki αpi +1 system (6) is strong? detectable if and only if
V (t) ≤ V 2 (t? ) − pi +1
(t − t?
) .
λM (P )npi
2
s I −D
rank = n + q, ∀ s ∈ C+ 0
(5) C 0
Notice that V (t) satisfies both inequalities for all and
t ≥ t? . Between t0 and t? , the Lyapunov function rank CD) = q,
does not increase, this is prevented by the quadratic +
term in its derivative. For the convergence time cal- where C0 is the closed right half plane of the
culation, it is assumed that V (t) remains constant complex plane. 4
?
in the interval [t0 , t ). The first condition means that all the transmission
For i = 2, the appropriate inequality is going to zeros of the system have negative real part, and
be used to estimate the time needed to guarantee that the system is detectable. The second state-
V (t) ≤ 1, whereas the inequality for i = 1 will ment requires that m ≥ q, in other words, it is
be used to calculate the necessary time for V (t) to required at least the same number of outputs as
decrease from a unitary value to zero. the unknown inputs.
According to the described process, (5) is rewritten Suppose the rank condition for CD is met. Define
for i = 2: the matrix
p2 +1 ! S
λM (P )n
2 p2
1 S = 1 , S ∈ Rm×m (7)
?
t − t ≥ p2 −1 1 − p2 −1 . S2
2 2 (p2 − 1)k2 αp2 +1 V 2 (t? ) with
?
Taking the limit when V (t ) → ∞, it is clear that S1 = (C D)+ = (D> C > C D)−1 D> C > , S1 ∈ Rq×m ,
for
p2 +1 S2 = Q Im − C D S1 , S2 ∈ R(m−q)×m ,
λ 2 (P )np2
t ≥ t? + p2 −1 M = t1 and Q ∈ R(m−q)×m such that S2 has full row rank.
2 2 (p2 − 1)k2 α p 2 +1 Take the n × n matrix
V (t) ≤ 1. Now assume that V (t1 ) = 1, using S1 C
T
inequality (5) with i = 1, we proceed to estimate T = 1 = S2 C , (8)
T2
the time needed to reach zero. This happens when M
the rhs of (5) equals zero, then V (t) ≡ 0 for
p1 +1
with M ∈ R(n−m)×n in such a way that makes T
λ 2 (P )np1 invertible and M D = 0.
t ≥ t1 + p1 −1 M .
2 2 (1 − p1 )k1 αp1 +1 Taking the linear transformation z = T x, the
Then the time needed for V (t) to decrease from its system is transformed into
initial value to zero is at most ż1 = Ā11 z1 + Ā12 z2 + B̄1 u(t) + ν(t),
p1 +1 p2 +1 (9)
λ 2 (P )np1 λ 2 (P )np2 ż2 = Ā21 z1 + Ā22 z2 + B̄2 u(t),
M M
t? + + ,
2
p1 −1
2 (1 − p1 )k1 αp1 +1 2
p2 −1
2 (p2 − 1)k2 αp2 +1
with Ā = T AT −1 , B̄ = T B, and zi = Ti x,
i = {1, 2}. Define as output ȳ(t) = S y(t), this
which is independent from the initial error. This yields
concludes the proof of Theorem 1.
ȳ (t) z1
ȳ(t) = 1 = .
ȳ2 (t) C̄22 z2
3. UNKNOWN INPUT OBSERVER DESIGN
With this transformation, the state is partitioned
In this section, a perturbed linear system will be into the corrupted part by the unknown input
considered. Take again into consideration system z1 , and the part free from its effect z2 . Notice
(1), but now with an unknown input ν(t) ∈ Rq that the corrupt part can be recovered directly
from the output, whereas the other part has to
ẋ = A x + B u(t) + D ν(t),
(6) be reconstructed from ȳ2 . This transformation is
y(t) = C x. classic and can be found in many texts, see for
It is assumed that matrix D is also known. Without example (Kudva et al., 1980) and (Moreno, 2001).
loss of generality, it is assumed that rank(C) = m,
To be able to reconstruct the full state, at least
and rank(D) = q.
asymptotically, it is required that the pair (A, C)
The existence of an observer capable of estimating is detectable. If m = q only ȳ1 would be available,
x without the knowledge of ν(t) is conditioned and from the detectability it can be concluded that
to the system to be strong? detectable (Hautus, z2 → 0 as t → ∞, then any estimate of z2 that goes
1983). If the system is strong? detectable, then to zero can be used. If m > q, i.e. there are more
350
CHAPTER 11. OBSERVERS
outputs than unknown inputs, a better estimation K = [3.7 3.9]> . For R = I2 , this configuration
of z2 can be attempted. The detectability of pair yields
(A, C) implies the detectability of pair (Ā22 , C̄22 ),
1 42 36
and an observer for z2 can be designed using ȳ2 as P −1 = .
output, and z1 and u(t) as input. The interesting 17 36 60
case is when (Ā22 , C̄22 ) is observable. In such case, For the simulation, the input u was set in cos(t); for
the velocity of convergence for a z2 observer can the unknown input, a triangle wave of amplitude
be assigned. If one uses the observer proposed in and period 2 was chosen. The system was initialized
Section 2, it is possible to reconstruct z2 in fixed- in x0 = [10 − 5 3]> . To compare the response
time, and also the original state using the relation of the proposed algorithm, a linear one was also
x = T −1 z. simulated. For the linear observer, the gain L̄ =
[4 6]> was chosen. For both observers, the initial
Let us summarize the previous discussion. Suppose conditions were set in zero, ẑ2 (t0 ) = 0.
that the pair (A, C) is observable, that rank(CD) =
q, and that m > q. Under these conditions the From Figure 1 to 3, the estimate of x provided
observer proposed in Section 2 can be used to by the proposed observer and the linear observer
estimate, in fixed-time, the internal state of a linear can be compared with the true state. In the plots,
system with unknown inputs. This is done through it can be observed that the estimate produced
the state and output transformation z = T x, ȳ(t) = with the nonlinear algorithm got closer to the true
S y(t), with T as in (8) and S as in (7). This puts state much faster than the linear one. This can be
the system into the form shown in (9). Then the corroborated in Figure 4, in which the norm of the
methodology presented in the previous section can estimation error was plot. From this last graph, the
be applied to design an observer for the subsystem finite-time convergence can be established in less
than two seconds.
ż2 = Ā22 z2 + Ā21 z1 + B̄2 u(t),
ȳ2 (t) = C̄22 z2 . To exhibit the fixed-time convergence, the simula-
tion was repeated with different initial conditions.
This allows the fixed-time reconstruction of the The initial value of the estimation error was in-
original state through x̂ = T −1 ẑ with ẑ = creased in one order of magnitude in each successive
[ȳ1> ẑ2> ]> . simulation. As can be seen in Fig 5, the convergence
time stay under 2.6[s] although the initial error
4. NUMERICAL EXAMPLE was increased above 106 in magnitude. Between
the trajectories for O(105 ) and O(106 ) there is
To exemplify the observer design proposed in pre- no appreciable difference, and this characteristic is
vious sections, let us consider the following system: maintained for orders of magnitude beyond 106 .
" # " # " #
1 2 −1 0 −1 x1
ẋ = −7 −6 2 x + 0 u(t) + 2 ν(t), 10
−1 −2 0 1 2
5
1 0 1
y(t) = x.
2 1 0 t[s]
(10) 1 2 3 4 5
-5
Now, the system has to be transformed in order to
decouple the corrupted and uncorrupted dynamics. Real
- 10 Nonlinear estimate
For this matter, define the transformation matrix Linear estimate
as - 15
1 0 1
T1
T = = 2 1 0 (11) Fig. 1. Estimate of x1 (t).
T2
0 1 −1
In this case, matrix S is I2 , which is clear after x2
40
comparing T and C. Applying the transformation Real
to the system, it becomes Nonlinear estimate
30
ż1 = −2 z1 + [1 − 1]z2 + u(t) + ν(t), Linear estimate
−3 1 1 0 20
ż2 = z2 + z1 + u(t),
−4 0 2 −1 (12)
10
ȳ1 = z1 ,
ȳ2 = [1 0]z2 . t[s]
1 2 3 4 5
Since the pair (Ā22 , C̄22 ) is observable, we can
proceed to design an observer for z2 . - 10
351
CHAPTER 11. OBSERVERS
x3 ACKNOWLEDGEMENT
Real The authors thank the financial support from
25 Nonlinear estimate
Linear estimate
PAPIIT-UNAM (Programa de Apoyo a Proyectos
20 de Investigación e Innovación Tecnológica), project
IN113614; Fondo de Colaboración II-FI UNAM,
15
Project IISGBAS-100-2015; CONACyT (Consejo
10 Nacional de Ciencia y Tecnologı́a), project 241171;
and CONACyT CVU: 491701, and 620585.
5
t[s]
REFERENCES
1 2 3 4 5
Abou-Kandil, H., Freiling, G., Ionescu, V., and
Jank, G. (2003). Matrix Riccati Equations in
Fig. 3. Estimate of x3 (t).
Control and Systems Theory. Birkhäuser Basel.
Atassi, A. and Khalil, H. (2000). Separation results
e(t) for the stabilization of nonlinear systems using
60 different high-gain observer designs. Systems &
Nonlinear estimate Control Letters, 39(3), 183 – 191.
50 Linear estimate
Chen, W. and Saif, M. (2006). Fault detection and
40
isolation based on novel unknown input observer
design. In 2006 American Control Conference.
30 Cruz-Zavala, E., Moreno, J.A., and Fridman, L.
(2012). Asymptotic stabilization in fixed time
20 via sliding mode control. In Decision and Control
10
(CDC), 2012 IEEE 51st Annual Conference on,
6460–6465.
t[s] Cruz-Zavala, E., Moreno, J.A., and Fridman,
1 2 3 4 5
L.M. (2011). Uniform robust exact differentia-
tor. IEEE Transactions on Automatic Control,
Fig. 4. Error norm.
56(11), 2727–2733.
Hautus, M. (1983). Strong detectability and ob-
e(t) servers. Linear Algebra and its Applications, 50,
60 353 – 368.
O(10
40
)
O(10 )
30
O(10 5
O(10
20
)
352
CHAPTER 11. OBSERVERS
based state updates. In 17th IFAC World
Congress, 1051–1067.
Rueda-Escobedo, J., Moreno, J.A., and Oliva-
Fonseca, P. (2016). Finite-time state estimation
for lti systems with a first-order sliding mode. In
14th International Workshop on Variable Struc-
ture Systems, to appear.
Yin, S., Wang, G., and Karimi, H.R. (2014). Data-
driven design of robust fault detection system for
wind turbines. Mechatronics, 24(4), 298 – 306.
353
II. M ODEL OF A D IESEL E NGINE The flo w Wxi in (1) describes the flo w through the EGR
valve. It is given by
In this section a mean-value model of the airpath of a s CHAPTER
11. OBSERVERS
turbocharged diesel engine with EGR is described. For a Aegr (xegr )px 2pi pi
detailed derivation of the mean-value model see [13, 15] and Wxi = √ 1− (3)
RTx px px
the references quoted therein. A schematic diagram of the
considered turbocharged diesel engine including the EGR where Aegr (xegr ) is the effective area of the EGR valve,
and VGT is shown in Figure 1. A third order nonlinear model Tx = 509K is the exhaust manifold temperature, and R =
can be derived using the conservation Observerof mass and Designs energy, the 287 forkgKJ
ais Turbocharger
the gas constant. Moreover, the flo w Wie from the
ideal gas law for modeling the intake andSystem intake manifold into the cylinders is modeled by the speed-
exhaust manifold of a Diesel Engine
pressure dynamics, and a firstorder differential equation with density equation [15]
time constant τ for modeling the power transfer dynamics pi N V d
Marı́a Tejada Zuñiga ∗ Matti Noack ∗∗ Johann Wie = ηv ∗∗
Reger (4)
of the VGT. Under the assumption that the intake and 120Ti R
exhaust manifold temperatures, the∗ compressor and turbine with the volume efficiency ηv = 0.87, the engine speed
Pontificia Universidad Católica del Perú, Lima, Peru
efficiencies, the volumetric efficiency and the time constant N , the intake manifold pressure Ti = 313K, and the
(e-mail: mctejadaz@pucp.pe)
τ of the turbocharger are constant, ∗∗ this modeling
Control Engineering approach
Group, displacement volume Vd =Ilmenau,
Technische Universität 0.002m3 . The turbine flo w Wxt
results in the nonlinear model [13, 15]P.O. Box 10 05 65, D-98684, in kg/sIlmenau,
is given byGermany
(e-mail: matti.noack@tu-ilmenau.de, reger@ieee.org)
RTi px px
ṗi = (Wci + Wxi − Wie ) Wxt = (axvgt + b) c −1 +d
Vi pa pref
RTx We consider observer designs for an exhaust-gas r s
ṗAbstract: (1) turbocharger based ona third order (5)
x = (Wie − Wxi − Wxt + Wf ) Tref 2pa pa
nonlinear Vx model of the air path of a diesel engine. To propose × an observer, the 1 −system is analyzed
in view 1 of different observability properties for nonlinear systems. We derive explicit T x p x p x constraints
Ṗc = (−Pc + ηm Pt ) .
for the τ system state guaranteeing observability. Based
with on that wea design
the parameters = −0.136, a high-gain
b = 0.176, observerc = 0.4,
and a sliding mode observer
Here pi is the intake manifold pressure, px the exhaust man-
for the system. The
d performance
= 0.6, the of
referenceboth designs
pressure p has
ref = been assessed
101.3kP a, and the
through a simulation study.
ifold pressure, Pc the power transfered by the turbocharger, reference temperature Tref = 298K. Finally, the turbine
τ = 0.11s the time constant, ηm = 0.98 the mechanical power Pt is modeled by the equation
Keywords: Nonlinear µ
efficiency, and Vi = 0.006m 3
, Vx = Systems,
0.001m3 the Observability
volumes of Analysis, High-Gain Observer, Sliding-Mode pa
Observer, Turbocharger
the intake and exhaust manifold. In the diesel engine model Diesel Engine P t = W xt c p T x η t 1 − (6)
px
(1), Wci describes the relation between the flo w through the
compressor and the1.power. INTRODUCTION
This relation is modeled as
withefficiency
the turbine based on ytheηt operating
efficienc = 0.76. Furthermore,
point of the theengine,
engine speed N and the fueling
driven by the energy in the exhaust gas. rate W f are considered as
ηc Pc known external parameters.
Over the last decades, Wci = dieselengines µ have , gained interest The second feedback path to the intake manifold from
(2)
because they provide higher cp Ta fuel pi
pa
efficiency
− 1 than gasoline As the shown in Figure 1 the exhaust gas recirculation and the
exhaust manifold is duepathsto exhaust gas recirculation
engines. However, diesel engines have the disadvantage to which is controlled turbocharger introduce feedback in the turbocharged
diesel engine. Furthermore, by an exhaust
since the gas recirculation
exhaust gas recircu-(EGR)
require
where ηac = complicated exhaust gas
0.61 is the compressor treatment.
efficienc y, Ta =One 298Kway valve. The oxygen in the intake is replaced by recirculated
totheachieve
ambientatemperature,
better power cp =utilization J
1014.4 kgK ,and cv =an J
improved
727.4 lation and the turbocharger are both driven by the exhaust
treatment of exhaust
kgK
gas, the turbocharged diesel engine with EGR is a coupled profile
exhaust gases, thereby reducing the temperature
pressuregases is the µuse = ofpcpturbochargers.
c −cv
heat at constant and volume, = 0.286 a of thesystem.
combustion and hence the emissions of nitrogen
Fig. 1 shows nonlinear
and the pa =structure
101.3kP aofthe such turbocharger
pressure. system.
The controlThe
oxides. interactions areengine
relatively complex.
(1) are The
the VGT
constant, ambient
inputs of the diesel model
actuator is usually used to control the intake manifold
EGR position xegr and the VGT position xvgt . The EGR
Wci
Ti , p i , m i
absolute pressure (MAP) while the EGR valve controls the
position and the VGT position range between 0 and 1.
Wxi Intake Manifold mass air flow (MAF) into the cylinders. Both the EGR and
However, to simplify the considerations, the effective areas
Tic , pc EGR VGT paths are driven by the exhaust gases. However, both
AegrtheandEGR Avgtand = VGT
axvgt + b are directly
actuator used as with
are equipped control
position
xr Tr , p x inputs instead
sensors. of
For the
a EGR
detailed valve position
description x egr and
refer tothe VGT
Ladommatos
Intercooler
et al.x(1996)
position vgt . Sinceandthe area Aegr as well as the
effective therein.
references
effective area Avgt are monotonically increasing functions
Cooler Vast variables
in their research[15], has been dedicated
the positions xegrtoand thexvgtcontrol
can beof diesel
Tc , p c Wex Tx , p x , m x
uniquely determined from the effective areas. Furthermore, litera-
engines. Many controllers were proposed in the
the ture, e.g., Lyapunov
Aegr andcontroller design (Janković and Kol-
Exhaust Manifold
control inputs Avgt are constrained due to the
Wxt manovsky, 2000),EGR controllers based on a Linear Parameter-
minimal and maximal and VGT positions. In particular,
the Varying (LPV) model for turbocharged diesel engine
are (Jung
Nt
Compressor Turbine input constraints of the nonlinear diesel engine model
VGT
xv
and
given by Glover, 2003), indirect passivation (Larsen et al.,
Tt , p t 2000), predictive control (Ferreau et al., 2007), feedback
Ta , p a
Oxicat
linearization Aegr ∈ [0m
(Dabo
2
et ,al., × 10−4and
1.82009) m2 ]sliding mode control
(7)
(Utkin et al., Avgt 2000).
∈ [0.04, Other approaches for air-to-fuel ratio
0.176].
control of turbocharged diesel engines were also proposed
In the next sections, a NMPC controller is designed in order
Fig.1.1. Schematic
Schematic diagram of a turbocharged diesel enginediesel with EGR.
in the literature (Fredriksson, 1999; Guzzella and Onder,
Fig. diagram of a turbocharged engine to control the diesel engine
2013; Janković et al., described
1998). The by the model (1).
different solutions are
with EGR (Herceg et al., 2006). mostly nonlinear and model-based. Usually it is assumed
that the state variables be measurable or somehow com-
The function of the turbocharger is to increase the power putable.
density of the engine by pushing additional fresh air into However, full state information is not the case for many
the cylinders of the engine in order to burn more fuel systems in practice, as was explained in Fredriksson and
without smoke generation, allowing for the injection of Egardt (2002). The intake manifold pressure or boost
more fuel without reaching the smoke bound. The turbine pressure is measurable. The compressor power may be
has a variable geometry (VGT) so as to adapt the turbine
354
CHAPTER 11. OBSERVERS
computed in principle. But the sensors used for computing mass fraction variables are ignored since they are not
the compressor power are quite expensive, and, therefore, coupled with other engine dynamics and are difficult to
it is often assumed to be unmeasurable. So as to relax measure. The temperature variables are omitted because
this assumption the use of observers for estimating the temperature dynamics are slow when compared with pres-
unmeasurable states may be considered a solution; first sure and flow dynamics.
designs have been proposed only in the recent past (Glenn
The detailed derivation of the eighth-order nonlinear
et al., 2011; Salehi et al., 2013; Qu et al., 2015).
model of the engine under investigation may be found
The focus of this contribution is on estimating the states in Jung et al. (2002). In Table 2 all variables and pa-
of this turbocharger model. To this end, the model is rameters are listed. Reducing the consideration to the
analyzed in terms of observability first. Within the observ- third-order model, we may refer to just three states pi ,
ability analysis we shall consider two concepts of observ- px , and Pc that represent the intake manifold pressure,
ability for nonlinear systems, namely, weak observability the exhaust manifold pressure, the power transferred by
and uniform observability. Local weak observability is an the turbocharger, respectively. Then the following system
indispensable condition for the design of observers, see description is obtained:
Hermann and Krener (1977). The uniform observability, as
introduced by Gauthier and Bornard (1981), is a stronger RTi
ṗi = (Wci + Wxi − Wie )
concept and allows for the stable design of High-Gain
Vi
observers. Afterwards, we investigate the application of RTx
Σ: ṗx = (Wie + Wf − Wxi − Wxt ) (1)
two different observer approaches on the turbocharger
Vx
1
system: a high-gain observer and a sliding-mode observer. Ṗ = (−P + P )
c c t
The performances of both are compared in terms of design, τ
complexity and robustness.
Following the analysis of the turbine mass-flow presented
The paper is structured as follows: The following section by Schollmeyer (2010), the flow rates as well as the
recalls the nonlinear model of the turbocharger system, compressor and turbine power are given by
based on a simplification of a more complete physical
model. Section 3 presents the observability analysis of the ηc Pc
Wci = ,
nonlinear system using the observability rank condition cp Ta ( ppai )µ − 1
analytically and more extensive numerical calculations. In r
Aegr (xegr )px 2pi pi
Sections 4 and 5 we present the high-gain observer and Wxi = √ (1 − ),
sliding-mode observer design, respectively. Both observers RTx px px
are simulated and results are presented in Section 6. pi N Vd
Wie = ηv ,
Conclusions are drawn in Section 7. 120RTi
pa px
2. MODELING Wxt = (axvgt + b) c −1 +d
px pref
r s
The modeling of the air path system of a diesel engine, Tref 2pi pa
which is composed of a turbocharger with variable turbine × 1− ,
Tx px px
geometry, exhaust gas recirculation and intercooler, is not µ
an easy task and consists largely of identification and map pa
Pt = Wxt cp Tx ηt 1 −
creation, as Wahlström and Eriksson (2011) had presented px
in detail. Moreover, complicated functional relationships
arise, for example, by attaching flow functions. with the constant parameters: compressor and turbine effi-
ciency ηc and ηt , volumetric efficiency ηv , intake manifold
Through versatile measurements describing the couplings, temperature Ti , exhaust manifold temperature Tx , and
a comprehensive information flow exists for the system time constant of the turbocharger power transfer τ .
which then can be used for observation. However, this
The control inputs of the diesel engine model based on (1),
results in a very complex mathematical model to be
are the EGR position xegr and the VGT position xvgt . For
mastered. Fortunately, simplifications and approximations
simplifying the considerations of the model, however, the
have been made.
effective areas of the valves, i.e. Avgt = axvgt + b and Aegr ,
Against this background, here we use a fully parameterized are used as control inputs. Furthermore, the control inputs
nonlinear model of the turbocharger which is based on Aegr and Avgt are constrained due to the minimal and
the approach from Janković and Kolmanovsky (2000). maximal EGR and VGT positions. The intake manifold
This is a simplified third-order nonlinear model derived pressure or boost pressure, is measurable and forms the
from an eighth-order nonlinear mean-value model of the system output.
air path of a turbocharged diesel engine with EGR and
VGT. The full-order air path model is of eighth order with It should be noted that the model has a singularity when
states: intake and exhaust manifold pressure (pi and px ), the pressure in the intake manifolds equals the ambient
oxygen mass fractions in the intake and exhaust manifolds, pressure, that is, if pi = pa then the compressor flow
turbocharger speed and the two states describing the becomes infinite. There exists also another singularity
actuator dynamics for the two control signals. when the turbine stalls. The simplified model can not
handle this situation and, as explained in (Fredriksson
For more simplicity the model is reduced to a third-order and Egardt, 2002), the system is only valid on the set
model subject to the following assumptions: The oxygen Ω = {(pi , px , Pc ) : pi > pa , px > pa , Pc > 0}. Fortunately,
355
CHAPTER 11. OBSERVERS
it may be demonstrated that Ω is invariant, thus, every solvability. The Jacobian of the observability map is called
trajectory starting in Ω stays in Ω (Janković et al., 1998). observability matrix, given by
∂
Table 1. Parameters of the model (Janković h(x)
∂x
and Kolmanovsky, 2000) ∂
LF h(x)
Description Value Unit ∂x
N Engine Speed 2000 [rpm]
On (x) :=
∂ L2 h(x) ,
F
Wf Fuel rate 5 [kg/h] ∂x .
R Specific gas constant 287 J ..
kg·K
pi Intake manifold pressure - [hP a] ∂ n−1
L h(x)
px Exhaust manifold pressure - [hP a] ∂x F
pref Reference pressure 101.3 [hP a] where ∂x∂
(·) denotes the gradient.
pa Ambient pressure 101.3 [hP a]
Pc Compressor Power - [kW ] Here it is important to note that the observability map
Ti Intake manifold temperature 313 [K] and its Jacobian will actually also depend on the input.
Tx Exhaust manifold temperature 509 [K] Excluding that these inputs may cause conflicts with
Tref Referent Temperature 298 [K] observability, naturally, leads to the stronger concept of
Ta Ambient Temperature 298 [K]
uniform observability. A system is uniformly observable if
Vi Volume of the intake manifold 0.006 [m3 ]
Vx Volume of the exhaust manifold 0.001 [m3 ] the observability map may be solved uniquely for the state,
Vd Displacement Volume 0.002 [m3 ] whatever the input. In this case, every input is a so-called
ηm Turbo mechanical efficiency 98 [%] universal input.
ηc Compressor isentropic efficiency 61 [%]
ηt Turbine isentropic efficiency 76 [%]
The analytical expression of the observability matrix for
ηv Volume efficiency 87 the turbocharger system is rather lengthly, and not easy to
[%]
J analyze. In order to get an impression of observability, the
cP Specific heat at constant pressure 1014.4
kg·K
J
analysis is made first for the free system, i.e. with inputs
cv Specific heat at constant volume 727.4 kg·K
µ constant 0.286 −
indentical to zero. In this case the observability matrix is
a Parameter a −0.136 − 1 0 0
b Parameter b 0.176 − ∗ β1 (x1 ) 0
c Parameter c 0.4 −
O3 (x) = ∂β3 (x2 ) (3)
d Parameter d 0.6 − ∗ ∗ β1 (x1 )
∂x2
where
For our analysis we may refer to system (1) in terms of RTi ηc 1
system β1 (x1 ) = µ
Vi cp Ta pi
ẋ = F (x, u) , pa −1
Σ: (2)
y = h(x) , cp ηt Tx pa µ px px
β3 (x2 ) = 1− b c −1 +d
where here F : R3 ×R2 → R3 and h : R3 → R differentiable τ px pa pref
r r
in x sufficiently often. The function expressions of F (x, u) Tref 2pa pa
correspond to the right side of the system (1), that is × (1 − )
Tx px px
x = (pi , px , PC )> and h(x) = x1 .
In order to probe whether the autonomous system is at
least locally weakly observable. For showing this, we use the
3. OBSERVABILITY observability rank condition presented by Hermann and
Krener (1977), i.e., we check whether rank O3 (x) = 3.
For nonlinear systems there exist various observability Thus, it is only necessary to verify that its diagonal
concepts. In this contribution, in particular, we consider elements are different from zero.
weak observability and uniform observability which are
significant for the observer approaches to be applied later, In the region of interest Ω we have that β1 (x1 ) 6= 0. Fig. 2
following Besançon (2007). shows that the values of the third element of the diagonal
will also never be zero. Therefore we conclude that the
Consider a single-output system of class Σ from (2). In system is at least locally weakly observable on the set Ω.
general, the observability map is
The remaining analysis is concerned with the evaluation of
h(x) the inputs to determine whether these are universal. The
LF h(x) complete model in affine form is given by
2
on (x) :=
LF h(x)
,
.. ! ! !
. ∗ ∗ 0
n−1
LF h(x) F (x, u) = ∗ + ∗ u1 + ∗ u2 ,
∗ 0 ∗
where LkF describes the k-th Lie derivative with respect
to F (·, u). Observability may then be characterized by the in this way it is easy to see that the inputs are not
solvability of the resulting system of nonlinear equations coupled. Then for simplifying the analysis, we analyzed
for the state x. In view of the implicite function theorem, each input independent from each other. That is, analyzing
the Jacobian of this map may be used for showing local u1 we set u2 ≡ 0 and vice versa. It turns out that the
356
CHAPTER 11. OBSERVERS
5
x 10
1.22
1.2
1.18
1.16
pi [Pa]
1.14
1.12
1.1
1.08
−3 −2.5 −2 −1.5 −1 −0.5 0
det(O3) 9
x 10
5
x 10
∂β3 (x2 ) 1.28
Fig. 2. Values of ∂x2 in the region of interest
1.26
px [Pa]
help of computational software. However, for getting an
1.18
impression we may plot the determinant as a function
of states, considering the dynamics of the system and 1.16
the entire range of input values. Note that the range of 1.14
550
450
357
CHAPTER 11. OBSERVERS
1.22
5
x 10 example, the generalized super-twisting observer (GSTO)
and homogeneous SM observers Moreno (2013). Those ap-
1.2 proaches may guarantee different properties, but are well-
1.18
studied only for second order systems. The SM observer
that we adopt here for the third order system (1) was first
1.16 proposed in (Moreno and Dochain, 2013). For a system in
p [Pa]
1.14
2
1.12
ẑ˙1 = −L k1 |ŷ − y| 3 sign(ŷ − y) + ẑ2
1
1.1
ẑ˙2 = −L2 k2 |ŷ − y| 3 sign(ŷ − y) + ẑ3
1.08
−12 −10 −8 −6 −4 −2 0 ẑ˙3 = −L3 k3 sign(ŷ − y) + φ(ẑ, u, u̇)
det(O3) 5
x 10
5
x 10
ŷ = ẑ1
1.28
1.18
dynamics. Thus, one is left with choosing some appropriate
1.16
L > 0.
1.14
In the original coordinates, the SM observer for system
1.12
(1) again only requires the inversion of O3 (x). In order
1.1
−12 −10 −8 −6 −4 −2 0 to have a representation similar to the HG observer let
L = 1 . Then the observer in original coordinates reads
det(O3) 5
x 10
2
800
−1 |ŷ − y| 3 sign(ŷ − y)
∂o 3 (x̂)
750
x̂˙ = F (x̂, u) − l() |ŷ − y| 31 sign(ŷ − y)
700 ∂ x̂
sign(ŷ − y)
650
600
ŷ = x̂1
(8)
Pc [W]
550
300
−12 −10 −8 −6 −4 −2 0 6. RESULTS
det(O3) 5
x 10
358
CHAPTER 11. OBSERVERS
4 4
x 10 x 10
0.5 0.5
ε =0.3 L1
ε =0.5 L2
0 ε =0.9 0 L
3
−0.5 −0.5
ep [Pa]
i
ep
−1 −1
i
−1.5 −1.5
−2 −2
−2.5 −2.5
0 0.05 0.1 0.15 0.2 0.25 0 0.05 0.1 0.15 0.2 0.25
Time[s] Time[s]
4 4
x 10 x 10
0.5 0.5
ε =0.3 L1
ε =0.5 L2
0 ε =0.9 0 L
3
−0.5
−0.5
−1
ep [Pa]
x
−1
ep
x
−1.5
−1.5
−2
−2
−2.5
−3 −2.5
0 0.05 0.1 0.15 0.2 0.25 0 0.05 0.1 0.15 0.2 0.25
Time[s] Time[s]
500 180
ε =0.3 L1
ε =0.5 160 L2
400 ε =0.9
L3
140
120
300
100
eP [W]
c
eP
200 80
c
60
100
40
20
0
0
−100 −20
0 0.05 0.1 0.15 0.2 0.25 0 0.05 0.1 0.15 0.2 0.25
Time[s] Time[s]
Fig. 5. Estimation errors using the high-gain observer Fig. 6. Estimation errors using the sliding-mode observer
On the other hand, the Figures 8–9 show a slower conver-
gence of the error for the Sliding Modes observer, however,
design values have been kept as in the same. No particular in this case, the measurement noise is not present in the
parameter tuning was made. results.
It can be seen that the one with largest L yields a faster For the two observer approaches taken in consideration the
convergence of the observation error. Furthermore, this estimated intake and exhaust manifold pressure as well as
method improves the HG observer since, remarkably, the the compressor power are very close to the state variables
peaking phenomenon turns out not present in the results. of the real system.
The performance of the designed observers has been as-
sessed by imposing measurement noise. More precisely, in 7. CONCLUSION
the simulations the measured output was contaminated
by additive normally distributed noise with zero mean
and standard deviation 3.4832 × 103 . The transient per- The observability analysis for a model of turbocharger in a
formance of the estimation errors is shown in Figures 7–9. diesel engine has been performed using weak and uniform
observability concepts. Operating the system in the invari-
The convergence velocity of the observation error increases ant set of the usual operation range we have shown that
while the gain of the nonlinear part increases. There the system is uniformly observable. Based on this, we have
exists a trade-off for the HG observer in the selection designed a high-gain observer and a sliding-mode observer
of this gain. That is, for high gains the perturbation is so as to obtain estimates for the turbocharger states.
dominated better, but to the price that the measurement Imposing a similar estimation error dynamics, simulations
noise is amplified and the peaking phenomenon is more with experimental testbed-data show that both observers
pronounced. are able to reconstruct suitably the desired states. How-
359
CHAPTER 11. OBSERVERS
4
x 10
0.5 500
HGO HGO
0 400
−0.5 300
ep [Pa]
eP [W]
−1 200
c
i
−1.5 100
−2 0
−2.5 −100
0 0.1 0.2 0.3 0.4 0.5 0 0.1 0.2 0.3 0.4 0.5
Time[s] Time[s]
4
x 10
0 200
SMO SMO
−0.5 150
−1 100
ep [Pa]
eP [W]
c
i
−1.5 50
−2 0
−2.5 −50
0 0.1 0.2 0.3 0.4 0.5 0 0.1 0.2 0.3 0.4 0.5
Time[s] Time[s]
Fig. 7. Estimation error wrt. state pi in presence of noise Fig. 9. Estimation error wrt. state PC in presence of noise
ACKNOWLEDGMENT
4
x 10
0.5
HGO
The authors gratefully acknowledge insightful discussions
0
with Jaime Moreno on the observability properties and
−0.5 observer design for nonlinear systems.
−1
REFERENCES
ep [Pa]
x
360
CHAPTER 11. OBSERVERS
states for air path flow regulation in a variable geometry Wahlström, J. and Eriksson, L. (2011). Modelling diesel
turbocharger exhaust gas recirculation diesel engine. engines with a variable-geometry turbocharger and ex-
Int. Journal of Engine Research, 12(6), 501–512. haust gas recirculation by optimization of model param-
Guzzella, L. and Onder, C. (2013). Introduction to Mod- eters for capturing non-linear system dynamics. Journal
eling and Control of Internal Combustion Engine Sys- of Automobile Engineering, 225(7), 960–986.
tems. Springer, Berlin.
Herceg, M., Raff, T., Findeisen, R., and Allgöwer, F.
(2006). Nonlinear model predictive control of a tur-
bocharged diesel engine. In IEEE Conf. on Control
Appl., 2766–2771.
Hermann, R. and Krener, A.J. (1977). Nonlinear control-
lability and observability. IEEE Trans. on Automatic
Control, 22(5), 728–740.
Janković, M., Janković, M., and Kolmanovsky, I. (1998).
Robust nonlinear controller for turbocharged diesel en-
gines. In American Control Conf., 1389–1394.
Janković, M. and Kolmanovsky, I. (2000). Constructive
Lyapunov control design for turbocharged diesel en-
gines. IEEE Trans. on Control Systems Techn., 8(2),
288–299.
Jung, M. and Glover, K. (2003). Control-oriented linear
parameter-varying modelling of a turbocharged diesel
engine. In IEEE Conf. on Control Appl., 155–160.
Jung, M., Ford, R.G., Glover, K., Collings, N., Christen,
U., and Watts, M.J. (2002). Parameterization and
transient validation of a variable geometry turbocharger
for mean-value modeling at low and medium speed-load
points. Technical report, SAE.
Ladommatos, N., Balian, R., Horrocks, R., and Cooper,
L. (1996). The effect of exhaust gas recirculation on
combustion and nox emissions in a high-speed direct-
injection diesel engine. Technical report, SAE.
Larsen, M., Janković, M., and Kokotović, P.V. (2000).
Indirect passivation design for a diesel engine model.
In IEEE Conf. on Control Appl., 309–314.
Moreno, J.A. (2013). On discontinuous observers for
second order systems: properties, analysis and design. In
Advances in Sliding Mode Control, 243–265. Springer.
Moreno, J.A. and Dochain, D. (2013). Finite time converg-
ing input observers for nonlinear second-order systems.
In 52th IEEE Conf. on Decision and Control, 3554–
3559.
Moreno, J.A. and Vargas, A. (2000). Approximate high-
gain observers for uniformly observable nonlinear sys-
tems. In 39th IEEE Conf. on Decision and Control,
784–789.
Qu, Q., Wang, H., and Tian, Y. (2015). Nonlinear observer
based sliding mode control for a turbocharged diesel
engine air-path equipped with EGR and VGT. In
Chinese Automation Congress, 121–126. Wuhan.
Röbenack, K. (2012). Structure matters – some notes
on high gain observer design for nonlinear systems.
Systems, Signals and Devices, 9, 1–7.
Salehi, R., Shahbakhti, M., Alasty, A., and Vossoughi, G.
(2013). Nonlinear observer design for turbocharger in a
si engine. In American Control Conf., 5231–5236.
Schollmeyer, M. (2010). Beitrag zur modellbasierten Lad-
edruckregelung für Pkw-Dieselmotoren. Ph.D. thesis,
Leibniz Universtät Hannover.
Utkin, V., Chang, H.C., Kolmanovsky, I., Cook, J., et al.
(2000). Sliding mode control for variable geometry
turbocharged diesel engines. In American Control Conf.,
584–588.
361
CHAPTER 12
OPTIMIZATION
CHAPTER 12. OPTIMIZATION
Abstract: The optimal controller LQR (Linear Quadratic Regulator) is presently used for controlling
complex systems when these have strict performance requirements. For its design, it is necessary to
have a state space model and also the value of the Q and R matrices, which are essential components
for the proper performance of the LQR algorithm on the process. The weighting matrices Q and R
should be defined by the analyst to balance the regulation performance and the control effort, and to
compare the results with the specified design goals. Iterative methods such as trial and proof, Bryson
method or pole allocation, are commonly used for the tuning of those matrices; methods that to find
a solution, take a significant time and in many cases fail to meet certain performance features in the
process. The application of the LQR controller has been limited by the uncertainty in finding the
right weighting factors. In this paper, the evolutionary algorithm MAGO (Multidynamics Algorithm
for Global Optimization) is used for calculating the Q and R matrices. The LQR is tested on
Benchmark type plants. The results are compared with PID (proportional, integral, derivative)
controllers of the optimal and optimal-robust types. With the evolutionary process, uncertainty in the
calculation of the matrices Q and R is eliminated. In addition, the LQR controller performs better
when optimized with the MAGO algorithm.
Key Words: LQR, PID, Optimization, Robustness, Evolutionary Algorithms, Performance Criteria.
363
CHAPTER 12. OPTIMIZATION
2013; Hassani 2014) drivers in tuning PID controllers (Li et 2.1 State Space Models
al., 2006; Hernández-Riveros, Urrea-Quintero, Carmona-
Cadavid, 2014), showing successful solutions in each case The concept of state is proposed by Kalman in the early
applied. 1960s, and initiates what is known as modern control.
AE are widely studied as a heuristic tool for solving Modern control refers to the control in state space. Its
nonlinear systems, continuous, discontinuous, convex and analysis is focused on three types of variables in the
not convex optimization problems where traditional dynamic system, the input variable, the output variable and
methods are not effective, and in many cases, to support the state variables. Their general representation is given in
successful solutions. AE are based on biological or natural equations (1-2):
principles for the study and design of human-made systems
(Yu Xinjie and Mitsuo Gen, 2010), such as the theory of x t f x, u, t (1)
natural selection (Darwin, 1859), heredity (Ayala and Kiger, y t g x, u, t (2)
1984) and population genetics (Fisher, 1930).
In this paper, the use of an evolutionary algorithm, Where (1) is the equation of state of the system and (2) is
MAGO, (Hernández and Ospina, 2010) as a tool to the equation of system output. After linearizing equations
minimize a characteristic performance index in a control (1-2) around a point of operation, the following invariants
loop, to thereby obtaining optimal values of the controller equations in time (3-4) are obtained.
parameters is proposed. MAGO only requires two
parameters provided by the analyst: the number of x t Ax t Bu t (3)
generations and population size; features that facilitate its y t Cx t Du t (4)
use. The results obtained for adjusting the weights of the Q
and R matrices in the LQR controller are compared with
Where A is an nxn matrix, of the system states, B is an
LQR tuned by trial and proof (since this is one of the most
nxm input matrix, C is a kxn output matrix and D is a direct
used methods) and two PID strategies (optimum PID and transmission kxm matrix.
optimum-robust PID). These last strategies are also tuned
with MAGO because they have better performance when 2.2 Transfer Function Model
compared with traditional methods for tuning PID
controllers optimizing some criterion of overall performance The representation of the process in transfer function and
(Hernández-Riveros, Urrea-Quintero, Carmona-Cadavid, block diagrams is widely used for the analysis and design of
2014). PID strategies as comparative method are applied control systems. Although most processes are highly
because they are one of the control strategies most used in nonlinear, the approximation in transfer function is very
the industry. useful for analysis of the system in open and closed loop
This paper is divided into the four following topics: the operating around an operating point (Ogunnaike and
problem of process representation, description of LQR and Harmon, 1994). Its general form of representation is given
PID controllers, evolutionary tuning, result analysis and by (5).
some conclusions. y s GP s u s (5)
2. PROCESS REPRESENTATION Where, Gp(s) represents the "transfer" of the input u(s)
to the output y(s) of the process. Thus, an input-output
A model is characterized by a structure that represents the structure of the system to be analyzed is obtainined.
formalization or abstraction of a given study process. At
present, there are different applications for models and 2.3 Benchmark Models of the Plant
therefore several analytical approaches have yielded
different structures or forms of models, such as those based As a model of the plant, two test representations proposed
on data (deterministic, stochastic), based on time by (Åström and Hägglund, 2000) are selected; the transfer
(continuous, discrete), symbolic (mathematical, conceptual), function representation is used for tuning the PID controller
etc.; these representations will be used upon the design and its state space representation for tuning the LQR.
criterion.
It should be remembered when achieving the objective 2.4 Plant in Transfer Function
of modeling, that the model should not be too complex,
however, it should give a sufficient description of the The models of the plant are present on equations (6) y (7).
system. That is, every model has a degree of uncertainty, K p e m s
G( s) (6)
either by lack of information or structural uncertainty, or by T s 2 mTm1s 1
2 2
m1
the approximation or parametric uncertainty (Roffel and
Betlem, 2006).
K p e m s
For the design of controllers are used two typical G(s)
(1 Tm1s)(1 Tm 2 s ) (7)
representations of a system, they are state space and transfer
function. These representations depend on the selected The values used in the plant represented by (6) are: Kp =
control structure. Currently, it is a fact that these two 1, τm = 1, ξ = 1 y Tm1 in a range of 1, 10 and 20. For (7),
approaches, input-output and state space, complement each the following values are used: Kp = 1, τm = 1, Tm1 = 1 and
other, offering a much wider range for control systems. Tm2 = a*Tm1, where a ≤ 1. In Tables 1 and 2, the transfer
364
CHAPTER 12. OPTIMIZATION
Table 1: Transfer Function for the plant (6) Tuning controllers consists in determining the best values of
a set of parameters to drive the selected control strategy
Plant in the equation (6) ensuring that the process behavior meets design
e s Gp1_ servo1 (s) Gp1_ reg1 (s) specifications. The tuning may be also optimal under some
G p1_ servo1 ( s) 2 performance criterion. There are two forms of operation of
s 2s 1 the controller: as regulator, where the desired value ysp
e s Gp1_ servo 2 (s) Gp1_ reg 2 (s) (reference value or Set-Point) remains constant so the
G p1_ servo 2 (s)
100s 2 20s 1 system should be insensitive to disturbances, and as
servomechanism, where the desired value ysp can change
e s
Gp1_ servo3 (s) Gp1_ servo3 (s) Gp1_ reg 3 (s) over time, so that a good tracking of this is desired.
400s 40s 1
2
J xT t Qx t uT t Ru t dt
the process is controllable, this is why the models obtained
from (6) and (7) in the controllable canonical form, must (9)
t1
meet the criterion of controllability, this criterion indicates
the capacity of the system to reach a final status from any
initial state. Where Q is an n×n matrix and R is an mxm matrix, semi-
definite and positive-definite respectively, assigning weights
to each of the terms of the function, i.e. a balance in output
and input "energies" of the plant is performed. The control
law that governs this structure of optimal states feedback
control is given in (10).
u t Kx t (10)
K t R 1 BT P t (11)
P t P t BR 1 BT P t Q P t A AT P t (12)
365
CHAPTER 12. OPTIMIZATION
Equation (12) is the matrix Riccati differential equation. possible to obtain optimum tuning parameters for the LQR
When the final time (tf) of the integral in (9) tends to controller. This is the path taken in this paper.
infinity, the solutions of the Riccati equation will remain
constant at the value of the initial time, therefore equation 3.3 Optimal PID Controller
(12) equals zero, obtaining the algebraic equation Riccati.
There is only one solution for P (t), when solving this The Proportional Integral Derivative Controller, PID, is one
equation, if it is replaced in (11) an optimal constant value K of the feedback control algorithms most widely used in
industry and with an extensive background (Åström and
for states feedback is obtained.
Hägglund, 1995; O'Dwyer, 2009). The output, u, (control
3.2 Tuning methods for LQR action) is a composite of three effects, Kc the proportional
action, Ti the integral time and Td the derivative time, which
For tuning the Q and R parameters of LQR controller, are calculated on the basis of the difference between the
different methods are used, (Ghoreishi, 2001), some of them desired value (Set-Point ) of the controlled variable ysp and
the actual controlled variable y. The control law in
are:
continuous time for the ideal PID controller is presented in
a. Bryson Method: Based on the normalization of
equation (15).
variables seeking the maximum acceptable value will
1
t
de t
u t K c * e t e t dt Td
be one. This is because the units of the states "x" (for
(15)
the Q matrix) and control action "u" (for the R
Ti tini dt
matrix) are very different. See equation (13).
Although in many cases this method can produce
very good results, usually is applied as a starting An optimal PID controller consists in adjusting its
point and then the tuning is refined by trial and proof. parameters Kc, Ti and Td so that an objective function (error
between the actual output of the plant regarding the desired
Q diag q1 ,..., qn R diag r1 ,..., rm
value and / or effort control) is minimized. See Equation
n m
(16).
1 1 (13)
qi 2
xi max i 1
rj 2
u j max i 1
J ( K c , Ti , Td ) min J ITAE t e t dt
x
x 0 (16)
b. Trial and Proof Method: (or heuristic method) is an 3.4 Optimal Robust PID
iterative process that makes use of prior knowledge
and behavior of the system to control with the A robust PID controller is a control strategy that meets a
purpose of adjusting the matrices Q and R such that a measure of specific robustness that is, finding a quantitative
desired behavior is achieved. The method requires a measure of how stable is the control loop. For knowing
large amount of time to obtain a satisfactory robustness have different alternatives, highlighting the
response. One commonly used option is to take as a methods based on the internal model control (Rivera et al,
starting matrices Q and R, those in equation (14a). 1986) and those considering the tuning from the values of
Q P * C ' C margin of phase and closed loop gain (Lee, 2004). The latter
(14a) leaded to the use of the maximum sensitivity function
R dig r1 ,..., rm
(called Ms) as a measure of system robustness (Aström &
With a positive real multiplicative factor P varying Hägglund, 2004; Alfaro et al, 2009).
until obtaining the desired value and for R a constant The margins of phase, Am, and the phase, φm, in the
value that its size will depend on the number of frequency domain are defined in equations (17) and (18),
entries to the system u(t). respectively
Another trial and proof way for tuning the LQR 1
controller is the manual application of the Bryson Am (17)
method, equation (14b). Gc ( j p )G( j p )
Q dig q1 ,..., qn m arg Gc ( jg )Gp ( jg ) (18)
(14b)
R dig r1 ,..., rm
c. Pole Allocation Method: With this method, the Where, ωp and ωg are obtained through the equations
poles of the closed loop system are placed in certain (19) and (20).
values defined by the analyst, all thanks to prior
knowledge of the process. For using this method it is Gc ( j p )G( j p ) 1 (19)
necessary to have actual measurements or otherwise
apply observers to estimate the actual value of the arg Gc ( jg )G( jg ) (20)
process variables, for which there is no physical
measurement. The gain margin as a measure of robustness indicates
It is evident that although there are different tuning that in the event that the system model is incorrect, the static
methods for the LQR controller, all require at the end a gain may increase in a factor Am before the system becomes
tuning by trial and proof to achieve the desired results in the unstable. Typical specification values for the gain margin
process. Some methods need bay by trial and proof a are 2≤ Am ≤5.
starting point for some possible values of the parameters Q
and R. It can be concluded that with just this method is
366
CHAPTER 12. OPTIMIZATION
367
CHAPTER 12. OPTIMIZATION
covariance matrix of the population of each generation, to (28). If N1, N2 and N3 are the cardinalities of the sets G1,
establish a distribution of exploration and create three G2 and G3, then the cardinality of the Emerging Dynamics,
subgroups or dynamics making the whole individuals in Crowd Dynamics and Accidental Dynamics are set,
each generation. The subgroup named Emerging Dynamics respectively. This way of defining the elements of each
(G1) creates a small group of individuals around the group is dynamic in nature and autonomous in MAGO.
individual with better genetic characteristics; this group is Cardinalities depend on the dispersion of the whole
the evolutionary elite of each generation, i.e. fittest population in generating j.
individuals to contribute with their genes to the next
generation. The Crowd Dynamics (G2) follows a similar XM ( j ) diag ( S ( j )) x
process, around the current population mean, applied to a G1 x Pob( j )
XM ( j ) diag ( S ( j ))
largest portion of the population. This dynamics is always
close to the emerging dynamics, but never close enough to XM ( j ) 2 diag ( S ( j )) x
be confused. Only until there are sufficient and necessary
XM ( j ) diag ( S ( j )), or , (28)
conditions to ensure full exploration of the search space, G2 x Pob( j )
these two dynamics could be merged within a territory, XM ( j ) diag ( S ( j )) x
usually at the end of the evolutionary. The Accidental XM ( j ) 2 diag ( S ( j ))
dynamics (G3) is the quantum speciation. It is established in
x XM ( j ) 2 diag ( S ( j )), or ,
isolation from individuals of the other dynamics in G3 x Pob( j )
generation by generation. This portion of the population is x XM ( j ) 2 diag ( S ( j ))
always formed spontaneously and contains a number of
entirely new individuals. Thus the main diagonal of the Where, XM ( j ) is the mean of the actual population and
covariance matrix is different from zero, ensuring numerical Pob( j ) G1 G2 G3 .
stability of the evolutionary process. G1, G2 and G3 cannot
interbreed. Following, the pseudo algorithm of MAGO is presented.
The Emerging Dynamics is created with N1 test
individuals being better than their predecessors, and the Pseudocode of MAGO
original elite group of individuals who were not defeated by 1: j = 0, Generation of the initial population, with a
the individuals tested. A number of the fittest individuals uniform randomly distribution in the search
move towards the best of the entire population. For each of space.
these individuals a mutant is created using a stochastic 2: Repeat
crossover between the chosen individual and the best of all. 3: Evaluate each individual by means of the
This mutant is evaluated using the objective function, and if objective function
its value is better than that obtained in the test individual, 4: Calculate the covariance matrix of the
the mutant is who goes to the next generation rather than its population and the first, second and third
predecessor. For each individual, i, in the generation, j, xi j , dispersion.
a test individual, t, is created according to the rule presented 5: Calculate the N1, N2 and N3 cardinalities of
in equation (26). groups G1, G2 and G3.
t i B m
x j x j F j x j x j (26) 6: Select N1 of the best individuals, modify
according to the objective function and make
them compete. Winners pass to the next
Where xB j is the best of all individuals in the generation
generation j + 1.
j, and the mean of the population is xm j . To incorporate 7: Sampling N2 individuals from the uniform
information about the present characterizations between the
j distribution in the hyper rectangle [LB (j), UB
problem variables, the matrix F is chosen as defined in
j (j)], and move on to the next generation j + 1.
equation (27). Where, S is the simple covariance matrix 8: Sampling N3 individuals from the uniform
of the individuals in generation j.
distribution throughout the whole search space
F j S j S j (27) and move to the next generation j + 1.
9: j = j + 1
10: Do until a termination criterion is satisfied.
The Crowd Dynamics is created with N2 individuals
through a uniform distribution in the hyper-rectangle
[ LB UB ] . Figure 1 shows the flowchart of the MAGO.
j j
j
diagonal diag (S ) are taken into account for the
cardinality of each dynamics in the generation j. If Pob is
j
368
CHAPTER 12. OPTIMIZATION
PID J ( K c , Ti , Td ) min J ITAE t e t dt
x
x 0
(31)
Sujeto _ a :1.4 M s 2
Where, Kc, Ti, Td are tuning parameters taking positive
real values for the robust optimal PID controller.
5. RESULT ANALYSIS
MAGO has been used for tuning the robust LQR and
PID controllers. MAGO setting parameters are 150
generations and 100 individuals. The ITAE values obtained
in simulations using MAGO as an optimization tool are
presented In Table 3. Column 1 shows the models described Figure 2: Response of plant 2 as regulator.
in Tables 1 and 2; column 2 shows the control loop
performance index ITAE, which is divided into four
columns: first, the values obtained in (Hernández-Riveros,
Urrea-Quintero, Carmona-Cadavid, 2014) for the PID
controller, in the second, the PID with an additional
robustness criterion, third the LQR selecting optimally via
369
CHAPTER 12. OPTIMIZATION
370
CHAPTER 12. OPTIMIZATION
REFERENCES
Alfaro V. M., Vilanova R., Arrieta O. 2009. Robust tuning of Two-
Degree-of-Freedom (2-DoF) PI/PID based cascade control
systems. Journal of Process Control, 19(10), 1658-1670.
Åström K. J., Hägglund T., 1995. PID Controllers: Theory,
Design, and Tuning, ISA, Research Triangle Park, NC.
Åström K. J., Hägglund T., 2000. Benchmark Systems for PID
Control. IFAC Workshop on Digital Control -Past, present,
and future of PID Control. Spain.
Åström K. J., Hägglund T. 2004. Revisiting the Ziegler–Nichols
step response method for PID control. Journal of process
control, 14(6), 635-650.
Figure 7: Dynamics Evolution (cardinalities) of MAGO. Ayala F. J., Kiger J. A. 1984. Genética moderna. Ediciones
Omega, Barcelona, España.
Beker-Jr G. A., Gammel J. L. 1961. The Padé Approximant.
Journal of Mathematical Analysis and Applications, 2, pp. 21-
30.
Darwin C. 1859. The origin of species. London: John Murray.
Fisher R. A., (1930). The genetical theory of natural
selection. Oxford University Press, Oxford
Fleming P. J., Purshouse R. C. 2002. Evolutionary algorithms in
control systems engineering: a survey. Control Engineering
Practice, 10(11), 1223-1241.
Ghoreishi S. A., Mohammad, A. N., Basiri S. O. 2011. Optimal
Design of LQR Weigthing Matrices based on Intelligent
Optimization Methods, International Journal of Intelligent
Inofrmation Processing, 2(1), pp.
Glacier C., Hognas G., Makita P., Toivonen H. T. 1991.
Approximation of delay systems - a case study. International
Journal of Control, 53 (2), pp. 369-390.
Hangos K., Cameron I., 2001. Process Modelling and Model
Analysis. Elsevier Science. Vol. 4 Process Systems
Figure 8: Response of plant 2 as servomechanism. Engineering.
Hassani K., Lee Won-Sook. 2014. Optimal Tuning of linear
6. CONCLUSIONS Quadratic Regulators Using Quantum Particle Swarm
When talking about tuning controllers, either PID or LQR Optimization. In Proceedings of the Int. Conference of Control,
(strategies that are discussed in this paper), everything is Dynamic Systems, and Robotic.59,1-59,8.
Hernández J. A., Ospina J. D., 2010. A multi dynamics algorithm
reduced to the characterization and adjusting of the value of
for global optimization. Mathematical and Computer
three or more tuning parameters, generally obtained by trial Modelling, 52(7), 1271-1278.
371
CHAPTER 12. OPTIMIZATION
372
CHAPTER 12. OPTIMIZATION
Abstract: In this paper, we propose an extended form of bio-inspired heuristic algorithm, called
Evolutive Extension (EvE) in this work, based on the evolutive approach. EvE is an optimization
algorithm which considers several nature examples to construct a list of norms and procedures
in search to obtain a better performance than other heuristic algorithms. This method, as
many similar ones, is focused on the task of finding an optimum solution to a minimization
problem. Using as a base method the Differential Evolution algorithm (DE), EvE includes the
abstraction of natural phenomena like elitism, extinction, symbiosis, parasitism, and different
forms of reproduction. The study of performance is carried out by a statistical analysis of the
results obtained from several executions over a list of typical test functions. The performance
of the algorithm and its comparison against the performance of the original form of Differential
Evolution is shown. This investigation includes the analysis of the calibration parameters for
EvE, also seen from a statistical approach. The presented results show a better performance
for EvE algorithm, due to its low number of iterations required to achieve the minimum for
each of the test functions proved in this paper. Also, for EvE algorithm it is possible to define
a region in which one lies the calibration parameters that allows to the algorithm to solve all
the presented tests functions in the less number of iterations, which gives an easy reference to
tune up the algorithm.
373
CHAPTER 12. OPTIMIZATION
transformations to modify and improve these test the superiority of each member, forcing them to
representations (Martı (2003)). To put in practice compete between all of them (Price et al. (2006)). Let
this scheme in a computational algorithm, it is necessary: P RN be the number of individuals P Rpg that conforms
to specify a chromosomic representation, where the a population, and let V P Rpg be the p-th mutant vector
number of chromosomes is proportional to the size of the with p = 1, ..., P RN , from the g-th generation of vectors.
problem; an initial population built in this representation; Then, the recombination process is defined by:
and a stop criterion based on a finite number of iterations g
V P Rp,m if rand(0, 1) < Cr
or in an error bound. Also, an evaluation measure is g
Up,m = or m = Irand , (1)
necessary to establish a selection-elimination criterion P Rg otherwise,
p,m
for chromosomes, and finally it is needed to define one
g
or more recombination rules, and one or more mutation where Up,m is the m-th element of the p-th trial vector U
rules (Fleetwood (2004)). from the g-th generation, Cr is a recombination crossover,
which directly modulates how the combination between
As we mentioned before, there are many optimization the original parent P R and the mutant V P R happens,
algorithms in the bio-inspired approach, and many of and Irand is an aleatory positive integer in the range [1, M ]
them use the evolutive structure. Evolutive algorithms with m = 1, ..., M (where M is the dimension of the search
commonly use a binary codification to represent the space), and it allows to guarantee that Upg 6= P Rpg while
chromosomic form for each candidate solution in the search there exist enough genetic diversity within the population.
space of a problem. This binary representation simplifies
the execution of tasks as recombination and mutation, In the current literature, several variations of DE
because it is only necessary to replace some bits into the algorithms are presented. Such variations are defined
complete chain to perform those instructions, but it is by the expression that allows the calculation of the
also necessary to translate the original form of the search mutant vectors V P Rpg (Storn and Price (1995)). The
space to a binary representation, and backwards to the most common and generalized form according to Das and
original form after all the evolutive operations (Schmitt Suganthan (2011), is the random/1 form, calculated as
(2001)). Only the translation cost can be expensive for follows:
some applications, where it is preferable to work directly g g g
V P Rpg = P Rr1 + F (P Rr2 − P Rr3 ), (2)
over the original search space.
where r1 6= r2 6= r3 are aleatory positive integers
Differential Evolution (DE) is a bio-inspired algorithm uniformly distributed in the range [1, P RN ]. F , called
that deals with that situation, when the search space is mutation factor, is a constant to fix in the interval [0, 2]
a subset of <n . Using a set of simple arithmetic rules, DE (Fleetwood (2004)), and it regulates the capability of
deals with the mutation and recombination tasks, showing exploration (near to 2) and exploitation (near to 0) for
a great capacity to solve problems of high complexity the DE algorithm.
(Storn and Price (1997)).
A selection process is carried out based on the performance
The aim of this work, is to propose an extension of the DE of each recombined individual, by the expression:
algorithm, the Evolutive Extension (EvE) algorithm. EvE g
Up if f (Upg ) < f (P Rpg ),
is based on the inclusion of operations than can represent P Rpg+1 = (3)
P Rpg otherwise,
some natural phenomena trying to improve the searching
capacity of the original algorithm. These new rules are where f ( ) represents an evaluation function, or cost
introduced in order to modulate the capacity of greedy function that maps from the search space <n to <. Also,
search, multiple initialization, premature convergence for each iteration, it is convenient to include a global
and regional search, as characteristics that define the superiority memory with the form:
g
performance of a heuristic algorithm, according to Storn Up if f (Upg ) < f (Gbest),
and Price (1995). Gbest = (4)
Gbest otherwise,
The main contribution of this work is the proposal of such that, at the end of the execution of DE, Gbest is
the EvE algorithm itself. Of course, it is necessary to presented as the best solution found.
include not only the structure of the algorithm, but also
its performance, comparing it with the original Differential 3. EVOLUTIVE EXTENSION ALGORITHM
Evolution algorithm, to show the superiority of EvE. Also,
this work shows a statistical approximation to the problem
of calibration for EvE. The structure of this paper is the DE algorithm takes directly a set of candidate solutions
following: Section 2 presents the original DE algorithm; from the search space, and operates them to improve
Section 3 presents the EvE algorithm; Section 4 presents the cost of each one in search of solving an optimization
the test functions used in this paper; and Section 5 presents problem. As we mentioned before, this evolutive process
and discusses the results of DE and EvE solving the uses mutation and recombination in a simple set of
minimization problem of each test function. Finally, we arithmetic rules. But is it possible to improve the searching
offer some conclusions about all the implementation. capacity of the algorithm? There exist another natural
behavior than can be expressed as a simple arithmetic
rule in search of obtaining better and/or faster results?
2. DIFFERENTIAL EVOLUTION ALGORITHM EvE algorithm tries to implement an answer to those
questions, considering basic evolutive behaviors inspired in
Differential Evolution is a heuristic algorithm based on biology, such as elitism, extinction, symbiosis, parasitism
the survival of a population into an environment that and different forms of reproduction. Elitism and extinction
374
CHAPTER 12. OPTIMIZATION
have been considered in another bio-inspired algorithms, Ex + 1 if kGbest − P Rig k = 0,
but never in a scheme as complete as EvE. Ex = (8)
Ex otherwise.
Elitism: Our first approach is the elitism. In nature, Ex increases every time a member of the population
elitism represents the recognition of the best members of replicates the best individual found at the moment. When
a population, propitiating their survival and reproduction. Ex surpass an extinction bound, all the members of the
Computationally speaking, elitism represents the addition population are reinitialized, and the chromosomes of the
of certain memory to an algorithm. A memory driven best solution found are reinserted in the population, in a
by attributes has an effective and subtle effect in random index:
the search, allowing the identification of an attribute 0n
P Rrandi(P gn−1
RN ) = Gbest , (9)
than represents a set of possible solutions. Thus, an
already identified attribute can prevent that a non-revised where the super index 0n represents the generation in
candidate solution be accessed, due that it contains the which the population has been reinitialized.
already exploited attribute, or, on the other hand, the Reproduction: For the standard DE algorithm,
identified attribute can redirect the search to non-revised reproduction only happens by the recombination of
candidate solutions which have the desired attribute. EvE the chromosomes of a parent and a mutant generated
takes into consideration two kinds of memory: under the same index. EvE algorithm recombines
- A short term memory, where the attributes of recently diversified individuals in search of generating a more
visited solutions are stored. This memory allows to aggressive attitude, covering faster the search space. The
perform intense exploration over specific region of the first reproduction model is the multipoint crossing. Let
search space. P Rig and P Rjg be two members of the population, which
- A long term memory, where the frequencies of the belong to the same region (expression 6 or 7), then, they
occurrence of attributes are stored, seeking to identify can generate two new members by:
the regions in the search space defined by the g+1a
P R(i|j),m g
= P Ri,m ,
attributes. Long term memory also allows to diversify mod2=0
(10)
the search orientation. P Rg+1 a g
(i|j),mmod26=0 = P Rj,m ,
Let X be the search space and N (x) the selection of g+1b g
P R(i|j),m mod2=0
= P Rj,m ,
each one of the P RN individuals that forms the EvE’s g+1b g (11)
population. Then: P R(i|j),m mod26=0
= P Ri,m .
En (x) = Nn∗ (x) \ P Ri , (5) Expression 10 represents the formation of one of two
children, the (a) child, in which the odd chromosomes
represents the association of each P R with a previously are taken from the parent with index j, and the even
declared attribute, after checking its compliance. That is, chromosomes are taken from the parent with the index i.
the equivalence class where each P R is associated with the The expression 11 represents the formation of the second
compliance of an attribute. Then, EvE uses two attributes child (b), whose chromosomes are taken in the contrary
to perform its elitism memory: way. We also use the index (i|j) to represent that each
∃P Rig , ε, γ : kP Rig − P Rig+1 k < ε, one of the children can take the index of any of the two
(6) parents, only if they improve their evaluation cost, that
|f (P Rig ) − f (P Rig+1 )| > γ, is, if child (a) or (b) surpass both parents, it takes the
and index of the parent with the worst performance; if one of
the children surpass the performance of only one parent,
∃P Rig , δ, σ : kP Rig − P Rig+1 k > δ, it takes its index; if both children surpass both parents,
(7)
|f (P Rig ) − f (P Rig+1 )| < σ. child (a) takes the i index, and child (b) takes the j index.
Using the elitist memory of EvE algorithm, it is possible
Expression 6 defines a significant cost change region, in to establish an attraction between two members of the
which a small modification in an individual causes a population. Let P Rig and P Rjg be two individuals with a
high change in its evaluation cost, so, the search can be common attribute, but with kP Rig − P Rjg k > ε, where ε is
diversified allowing a great variety of evaluation costs;
a fixed constant that represents at least kU B−LBk/2 (U B
whereas expression 7 defines a significant step region,
and LB are the upper and lower bounds of each dimension
in which it is necessary to perform great modifications
of the search space, respectively). Then:
to the individuals in order to obtain a small change
in its evaluation cost: in this region, the search can be g+1 1 g 1 g
P R(i|j),m = P Ri,m + P Rj,m , (12)
intensified by the steps required by each individual to 2 2
perform an improvement. For each individual, 6 and 7 where, the individual with the worst evaluation cost is
represent mutually exclusive attributes. eliminated from the population, and, as in the previous
reproduction model, the index (i|j) represents that the
Extinction: Our second proposed rule is extinction. In
child takes the index of the parent with the worst
nature, extinction is the total elimination of the members
performance, if the mentioned child is not the individual
of a population, due to their incapacity to adapt to
with the worst performance.
the environment. For EvE algorithm, it is necessary
to compensate the greedy characteristic that produces Parasitism: Another natural phenomena considered in the
premature convergence in the population. Let Ex = 0 EvE algorithm is parasitism. In nature, parasitism is the
represent a counter, then: survival of an individual at the expense of a second one.
375
CHAPTER 12. OPTIMIZATION
In the EvE algorithm, we represent that idea giving to a 5. EXECUTION AND RESULTS
member of the population the capacity to mimic another
one of better performance, in search of guaranteeing its
survival. Let P Rig and P Rjg be two individuals of EvE’s In order to solve each of the test functions presented
population, and there exists recognition of P Rig ’s good in the last section, it is necessary to specify all the
performance, and P Rjg has the parasite property. Then: execution conditions. First, for each test function, N = 10,
g+1 g g
that is, x ∈ <10 . Also, for all x∗ values, we tolerate
P Rj,m = (1 − ρ)P Ri,m + (ρ)P Rj,m , (13) an error of e = ±10−6 . Finally, for EvE algorithm it
where ρ is a constant in the range [0, 1], and defines is necessary to establish some programing factors. The
an “infection ratio”. Parasitism helps the EvE algorithm extinction bound is set as 3P RN trying to guarantee
to assign good properties to individuals with bad that the extinction phenomena will not be triggered
performance, which allows to perform a more aggressive by a population with a low genomic variety, and the
search. infection ratio is 0.25, as a fixed small step for the
parasitism phenomena that allows that the parasite
Symbiosis: The final nature example used to create does not duplicate the parasited individual. In practice,
EvE algorithm is the symbiosis phenomena. In nature, those parameters show good results, avoiding excessive
symbiosis establishes a relationship of mutual benefit. Let reinitialization or premature convergence. In order to
P Rig be an individual with symbiotic attitude, then, its compare the original DE algorithm against EvE, we use
mutation model takes the form: the same number of individuals in both of them, P RN =
V P Rig = P Rig + F (Gbest − P Rig ) 50.
g g (14)
+ F (P Rr1 − P Rr2 ), The first approach is to let the calibration parameters for
which simulates a mutual benefit relation between P Rig DE and EvE be selected in a random way, thus Cr ∈ [0, 1]
and Gbest. The first one guarantees that its children and F ∈ [0, 2], in a uniform distribution. That is, we want
contain information of the individual of best performance, to know for which calibration parameters each algorithm
and the second one ensures its reproduction. In Equation requires the less number of iterations to solve each of the
14, r1 6= r2 are random integers uniformly distributed in minimization problems. Then, we perform 1000 executions
the range [1, P RN ] (Qin and Suganthan (2005)). for each algorithm, trying to cover most of the Cr × F
plane, and every time, Cr and F are randomly chosen.
4. TEST FUNCTIONS Then, for DE and EvE, we can identify a “comfort zone”,
CZ, as the Cr ×F region where a test function is minimized
The study of the performance of a heuristic algorithm is with the less number of iterations. Figure 1 and Figure 2
commonly realized through the data mining from several show an example of the comfort zone.
executions over a set of test functions. According to Price
et al. (2006), it is possible to classify those test functions
in three kinds:
· Unimodal functions: Only contain a global minimum, Table 2. Unconstrained Multimodal Test Functions
with no local minimums.
· Unconstrained multimodal functions: Present at least Ackley: q P
two local minimums, but their global minimum does −0.2 1 N
x2 1
PN
N j=1 j cos (2πxj )
not depend of the domain of the function. f (x) = −20e − eN j=1 + 20 + e,
· Constrained multimodal functions: There is no global −30 ≤ xj ≤ 30, j = 1, ..., N, f (x∗ ) = 0, x∗ = 0.
minimum defined for these functions. The minimum Rastrigin:
PN
evaluation value depends of the domain of the f (x) = (x2 − 10 cos (2πxj ) + 10),
j=1 j
function. −6 ≤ xj ≤ 6, j = 1, ..., N, f (x∗ ) = 0, x∗ = 0.
Salomon: q
For this paper, we use the test functions presented in PN
f (x) = − cos (2πkxk) + 0.1kxk + 1, kxk = x2j ,
Tables 1, 2 and 3. j=1
−50 ≤ xj ≤ 50, j = 1, ..., N, f (x∗ ) = 0, x∗ = 0.
Table 1. Unimodal Test Functions
Sphere:
PN 2
f (x) = x ,
j=1 j
−10 ≤ xj ≤ 10, j = 1, ..., N, f (x∗ ) = 0, x∗ = 0.
Generalized Rosenbrock: Table 3. Constrained Multimodal Test Functions
PN −1
f (x) = j=1
(100(xj+1 − x2j )2 + (xj − 1)2 ),
−20 ≤ xj ≤ 20, j = 1, ..., N, f (x∗ ) = 0, x∗ = 1. Schwefel:
Schwefels Ridge: 1
PN p
PN Pk f (x) = − N x sin ( |xj |),
j=1 j
f (x) = k=1
(( j=1 xj )2 ), −500 ≤ xj ≤ 500, j = 1, ..., N, f (x∗ ) = −418.983, x∗ = 420.9687.
−50 ≤ xj ≤ 50, j = 1, ..., N, f (x∗ ) = 0, x∗ = 0. Rana:
PN −1
f (x) = xj sin (α) cos (β) + x(j+1)modN cos (α) sin (β),
p j=1 p
For Tables 1, 2 and 3, x∗ represents the value in which the α= |xj+1 + 1 − xj |, β = |xj+1 + 1 + xj |,
test function has its minimum evaluation cost, f (x∗ ). −512 ≤ xj ≤ 512, j = 1, ..., N, f (x∗ ) = −511.708, x∗ = −512.
376
CHAPTER 12. OPTIMIZATION
Figure 1. Comfort zone, in red, for EvE in the sphere function Figure 3. Comfort zone, in red, for EvE in the Rastrigin function
Figure 2. Comfort zone, in red, for DE in the sphere function Figure 4. Comfort zone, in red, for DE in the Rastrigin function
The z axis in Figures 1 and 2 represent the number of Table 5. Results for Unconstrained Multimodal Test
iterations required to solve the problem. As we can see, Functions
both algorithms can solve the sphere function of <10 .
Function DE CZ EvE CZ Success ratio
The red region represents the comfort zone, in which the Ackley 0.66 ≤ Cr ≤ 1 0.55 ≤ Cr ≤ 1 P (DE) = 0.641
problem is solved in less than 500 iterations. It is easy 0.21 ≤ F ≤ 0.48 0.13 ≤ F ≤ 0.26 P (EvE) = 0.887
to see that, for EvE algorithm, the comfort zone is more Rastrigin 0.36 ≤ Cr ≤ 0.86 0.50 ≤ Cr ≤ 0.96 P (DE) = 0.744
clearly defined that for DE algorithm. 0.78 ≤ F ≤ 0.96 0.11 ≤ F ≤ 0.28 P (EvE) = 0.903
Salomon 0.44 ≤ Cr ≤ 1 0.54 ≤ Cr ≤ 0.98 P (DE) = 0.573
The process is as follows: after 1000 executions for each 0.63 ≤ F ≤ 1.12 0.18 ≤ F ≤ 0.76 P (EvE) = 0.781
algorithm in each function, a vector that storages the
number of iterations required to achieve the minimum Table 5 still exhibits the superiority of EvE over
evaluation value is read, and if the number of iterations is DE. Although the success ratio is considerably low for
at most 500, the index of that stored value is used to access Unconstrained Multimodal functions in comparison with
to the Cr and F values that allow that performance. Then, the success ratio achieved in Unimodal functions, it
the minimum and maximum values of Cr and F that have is important to remember that we only consider as a
the “at most 500 iterations” property defines the CZ of DE success the achievement of the minimum in less than 500
and EvE. To avoid to take in consideration some minimum iterations.
or maximum at the extremes of the distribution, it is
necessary to discard the calculated value if it is isolated, Finally, Table 6 summarizes the results for the most
that is, if this value has no at least another point with the difficult kind of test functions used in this paper, the
property of “at most 500 iterations” in a neighborhood of Constrained Multimodal functions. Also, Figures 5 and 6
0.01, in order to guarantee that there is at least two (Cr , F ) present an example of the CZ for these functions.
coordinated pairs in the CZ bounds. Using the CZ of each
Table 6. Results for Constrained Multimodal Test
algorithm, 1000 new executions are performed. The success Functions
ratio is computed by the number of success with less than
500 iterations over the number of executions. Function DE CZ EvE CZ Success ratio
Schwefel 0.40 ≤ Cr ≤ 0.99 0.35 ≤ Cr ≤ 1 P (DE) = 0.693
Table 4 summarizes the results of DE and EvE algorithm 0.36 ≤ F ≤ 0.81 0.12 ≤ F ≤ 0.32 P (EvE) = 0.910
Rana 0.32 ≤ Cr ≤ 0.99 0.33 ≤ Cr ≤ 1 P (DE) = 0.469
for the Unimodal test functions.
0.38 ≤ F ≤ 1.72 0.13 ≤ F ≤ 1.05 P (EvE) = 0.687
Table 4. Results for Unimodal Test Functions
Function DE CZ EvE CZ Success ratio For all the three kinds of test functions considered in this
Sphere 0.23 ≤ Cr ≤ 0.97 0.31 ≤ Cr ≤ 1 P (DE) = 0.841
0.11 ≤ F ≤ 0.61 0.13 ≤ F ≤ 0.42 P (EvE) = 0.997
paper, EvE algorithm shows a superiority in the capacity
Generalized 0.55 ≤ Cr ≤ 0.95 0.53Cr 0.95 P (DE) = 0.712 to achieve the minimum evaluation value in less iterations.
Rosenbrock 0.10 ≤ F ≤ 0.81 0.13 ≤ F ≤ 0.51 P (EvE) = 0.991 Also, it is important to note that no execution has success
Schwefels 0.32 ≤ Cr ≤ 0.97 0.28 ≤ Cr ≤ 0.99 P (DE) = 0.883
Ridge 0.13 ≤ F ≤ 0.78 0.13 ≤ F ≤ 0.87 P (EvE) = 0.998
ratio of 1, and that is intrinsically a heuristic behavior,
there exist random components in both algorithms that
cause to mismatch some parts of the execution, like the
As we can see in Table 4, Unimodal test functions do initialization step, or the selection of the components of
not represent a high challenge, neither for DE nor for the mutation equations. Nevertheless, the superiority of
EvE. Table 5 presents the results for the Unconstrained EvE algorithm surpass with 0.218 points the performance
Multimodal functions, and Figure 3 and Figure 4 show of DE in the worst EvE case. For the better DE case, EvE
an example of the CZ of each algorithm in this kind of has a superior performance with 0.115 points of success
functions. ratio over the DE’s performance.
377
CHAPTER 12. OPTIMIZATION
REFERENCES
Bäck, T. (1996). Evolutionary algorithms in theory
and practice: evolution strategies, evolutionary
programming, genetic algorithms. Oxford university
press.
Bilchev, G. and Parmee, I.C. (1995). The ant colony
metaphor for searching continuous design spaces. In
Evolutionary Computing, 25–39. Springer.
Figure 5. Comfort zone, in red, for EvE in the Rana function Cooper, L. (1964). Heuristic methods for
location-allocation problems. Siam Review, 6(1),
37–53.
Cormen, T.H., Leiserson, C.E., Rivest, R.L., and Stein, C.
(2001). Introduction to algorithms, volume 6. MIT press
Cambridge.
Das, S. and Suganthan, P.N. (2011). Differential evolution:
a survey of the state-of-the-art. IEEE Transactions on
Evolutionary Computation, 15(1), 4–31.
Eiben, A.E. and Schoenauer, M. (2002). Evolutionary
Figure 6. Comfort zone, in red, for DE in the Rana function computing. Information Processing Letters, 82(1), 1–6.
Fleetwood, K. (2004). An introduction to differential
6. CONCLUSIONS evolution. In Proceedings of Mathematics and Statistics
of Complex Systems (MASCOS) One Day Symposium,
In this paper, a new bioinspired heuristic algorithm is 26th November, Brisbane, Australia.
presented, and its performance is discussed. Using the Houck, C.R., Joines, J., and Kay, M.G. (1995). A
evolutionary approach, and the Differential Evolution genetic algorithm for function optimization: a matlab
algorithm as a base of construction, we take some natural implementation. NCSU-IE TR, 95(09).
processes as inspiration to construct a set of operations Martı, R. (2003). Procedimientos metaheurısticos en
that improve the performance of the EvE algorithm, from optimización combinatoria. Matemátiques, Universidad
the extinction process to avoid the premature convergence de Valencia, 1(1), 3–62.
behavior, to the parasitism phenomena, that allows to Mühlenbein, H., Gorges-Schleuter, M., and Krämer,
explore the search space using a convex combination. As a O. (1988). Evolution algorithms in combinatorial
competition scheme, we take several classic test functions optimization. Parallel Computing, 7(1), 65–85.
and run over them the EvE algorithm and the original Pearl, J. (1984). Heuristics: intelligent search strategies for
DE algorithm to make a point of comparison. In all the computer problem solving. Addison-Wesley Pub. Co.,
cases, EvE has a better performance than DE, but it is Inc., Reading, MA.
important to mention that the execution cost is higher Pérez-González, A., Begovich, O., and Ruiz-Leon, J.
for EvE, than for DE, this is obviously due to the set of (2014). Modeling of a greenhouse prototype using PSO
rules adding to extend the algorithm. Nevertheless, this algorithm based on a LabView application. In 11th
cost is not far above than DEs cost, thanks to the use of International Conference on Electrical Engineering,
a programing scheme of one rule at a time, that is, if an Computing Science and Automatic Control (CCE), 1–6.
individual is recombined, it is not candidate anymore for IEEE.
a multipoint crossing procedure, and so on, until the end Price, K., Storn, R.M., and Lampinen, J.A. (2006).
of each generation. Differential evolution: a practical approach to global
optimization. Springer Science & Business Media.
Another important characteristic that arises from our Qin, A.K. and Suganthan, P.N. (2005). Self-adaptive
results is the definition of the CZ. It is easy to see that differential evolution algorithm for numerical
for DE algorithm, CZ is not clearly defined for all the test optimization. In IEEE Congress on Evolutionary
function, whereas for EvE, we can conclude that a CZ of Computation, volume 2, 1785–1791. IEEE.
0.55 ≤ Cr ≤ 0.95 and 0.18 ≤ F ≤ 0.26 covers all our test Schmitt, L.M. (2001). Theory of genetic algorithms.
functions, providing EvE a better generalization property. Theoretical Computer Science, 259(1), 1–61.
As future work, EvE will be compared against another Storn, R. and Price, K. (1995). Differential evolution-a
heuristic algorithms. Also, it will be necessary to formally simple and efficient adaptive scheme for global
define its execution cost. And, at the same time, it optimization over continuous spaces, volume 3. ICSI
is proposed as future work the formal explanation of Berkeley.
every EvE rule and process (remembering that this is an Storn, R. and Price, K. (1997). Differential evolution–a
introductory paper); as well as the addition of new rules simple and efficient heuristic for global optimization over
that may improve its performance. continuous spaces. Journal of global optimization, 11(4),
341–359.
ACKNOWLEDGEMENTS
378
CHAPTER 12. OPTIMIZATION
Abstract: This paper considers gain parameter election of a class of nonlinear controller-
observer for a systems with uncertainty, using Paerto-set approach. The presented observer
based on Super-twisting observer is used for velocity estimation/reconstruction signals based
on the available plant input/output information and can be calculated on-line. The signal then
is injected to the classical sliding mode controller to provide a feedback control in presence of
unmodeled dynamics and perturbations. To guarantee the convergence of the proposed algorithm
the gain parameters must be time-varying and depending on available current measurements.
The optimal selection of the gains are proposed in this work. A simulation study on a 2 degrees
of freedom Stewart platform is presented to show the effectiveness of the scheme.
379
CHAPTER 12. OPTIMIZATION
• An adaptive sliding mode controller, which uses the The feedback control law u X̂, t must derive the states
state estimated on-line by the adaptive Super-Twist
observer; of the systems to the desired values. i.e., lim X̂1 = X1∗ ,
t→∞
• Optimization gain parameters of the named observer
lim X̂2 = 0. By defining the ”ideal” sliding surface (Utkin
and controller by Pareto-set approach; t→∞
• Application of the studied approach to a new two (1992)(:
DOF Stewart platform as a solar tracker ·
σ X̂ := X̂ 1 + C X̂1 − X1∗ = X̂2 + C X̂1 − X1∗ = 0
2. SYSTEM DESCRIPTION AND CONTROL DESIGN (6)
where C = diag(c1, c2 ) is a diagonal
matrix
with positive
Consider the mathematical model of 2 DOF Stewart
elements, the SM controller u X̂, t structure can be
platform
defined as
Ẋ1 (t) = X2 (t)
−1
Ẋ2 (t) = f (X, t) + g (X1 , t) u + ξ (t) (1) u X̂, t = −k X̂, t [g (X1 , t)] Sign σ X̂ (7)
y (t) = X1 (t)
where the state estimates X̂ are generated by the observer
- X1 (t) , X2 (t) ∈ IR2 are the states of the system; (2) with varying parameters, that is,
> ·
1/2
- X (t) := X1> (t) , X2> (t) ∈ IR4 ; X̂ 1 = X̂2 + λ X̂, t
y − X̂1
Sign y − X̂1
· (8)
X̂ 2 = f X̂, t + g (X1 , t) u + α X̂, t Sign y − X̂1
- u ∈ IR2 is a control to be designed,
- y (t) ∈ IR2 is a measurable output at time t ≥ 0,
2.2 Gain parameters tuning
- ξ (t) ∈ IR2 is bounded unmeasurable term including
external and internal perturbations/uncertainties: for all First by the physical reasons we may accept the following
t ≥ 0, kξ (t)k ≤ ξ + < ∞, assumptions.
- f (X, t) ∈ IR2 H1 The vector-function f (X, t) and g (X1 , t) are Lips-
chitzian with respect to the first argument on t ≥ 0,
- g (X1 , t) ∈ IR2×2 i.e.,
Remark 1. Notice that in the model (1) only the half of
f (X, t) − f X̂, t
≤Lf
X − X̂
= Lf kek
coordinates (X1 (t)) is available in time; X2 (t) is supposed
g (X1 , t) − g X̂1 , t
≤Lg
X1 −X̂ 1
= Lg ke1 k
to be estimated online.
H2 The matrix g (X1 , t) function is invertible everywhere,
2.1 Observed sliding mode control design namely
−1
< ∞, kgk := tr {gg | }
+ 2
[g (X1 , t)]
≤ g−1
To estimate the non-measurable coordinate X2 (t) in (1)
we apply a popular second-order sliding mode (super- Notice that by (5)
twisting) observers Shtessel et al. (2014):
2 2
2
·
1/2
Fe t, X1 , X2 , X̂2
≤ 2Lf kek + 2 ξ + (9)
X̂ 1 = X̂2 + λ
y − X̂1
Sign y − X̂1
· (2) The classical second-order scheme corresponds to the case
X̂ 2 = f X̂, t + g (X1 , t) u + αSign y − X̂1 Lf = 0. The situation, corresponding the inequality
The parameters α and λ are constant in the original (9), has not been considered
elsewhere. The adaptation
publications. The vector function Sign (z) is defined as parameters k = k X̂, e1 , t of the controller (7) and
follows
Sign (z) := (sign (z1 ) , ..., sign (zn ))| λ X̂, t , α X̂, t of the observer (8) are designed as
1 if zi > 0
(3) k = k X̂, e1 , t := (p0 )−1 ρ + κ X̂, e1 , t
sign (zi ) := −1 if zi < 0
i
∈ [−1, 1] if z = 0 p p
i +γ p0 /2 , ρ ≥ p0 /2
(10)
New variable e := X − X̂ which characterizes the
κ X̂, e1 , t :=
f X̂, t + αSign (e1 ) +
error of estimated state and is governed by the following
equations: C X̂2 + λ ke1 k1/2 Sign (e1 )
·
e1 = e2 − λ ke1 k1/2 Sign (e1 ) and p p
(4)
·
e2 = Fe t, X1 , X2 , X̂2 − αSign (e1 ) (η1 (γ) +p p1 /2)/p1 ke1 k
if ke1 k ≥ > 0
where
h i
λ X̂, t := and
Fe t, X1 , X2 , X̂2 = f (X, t) − f X̂, t + ξ (X, t) (5)
p (11)
(η1 (γ) +p /2)/p1
p 1
describes unmodeled dynamics and external perturbation ke1 k <
if
effects. Obviously, it is unmeasurable. α X̂, t := α > 0
380
CHAPTER 12. OPTIMIZATION
The initial values of the observer X̂2 (0) and it is clear that simultaneous optimization two differ-
should be not
so
ent functions by the same arguments p1 is impossible.
far from the real values X2 (0) fulfilling
X2 (0) − X̂2 (0)
≤ That’s why we need to apply the, so-called, ”Pareto-set
e+ 1
2 . approach”. A standard technique for generating the Pareto
set in multicriteria optimization problems is to minimize
3. OPTIMIZATION OF THE CONTROLLER (convex) weighted sums of the different objectives for
PARAMETERS various different settings of the weights. Pareto efficiency,
or Pareto optimality, is a state of allocation of resources in
which it is impossible to make any one individual better off
The control gain (10) contains three free parameters
without making at least one individual worse off Ehrgott
p0 , p1 , p2 which should be selected in such a way that the
(2005).
resulting behavior would be as better as possible. Without
loss of generality we may select the parameters p0 , p1 and The term is named after Vilfredo Pareto (1848–1923), an
p2 of the controller as follows Italian engineer and economist who used the concept in
p0 + p1 + p2 = 1 his studies of economic efficiency and income distribution.
Many real-world problems involve simultaneous optimiza-
implying tion of several incommensurable and often competing ob-
p2 = 1 − p1 − p0 (16) jectives. Usually, there is no single optimal solution, but
Indeed, since rather a set of alternative solutions. These solutions are
1 optimal in the wider sense that no other solutions in the
arg minV = arg min V search space are superior to them when all objectives are
u∈Uadm u∈Uadm p0 + p1 + p2
considered. They are known as Pareto-optimal solutions
the multiplication of the function V by any positive Ehrgott (2005).
constant does not change the optimal control parameters.
If we fix the value p0 ∈ (0, 1), then in view of (16) we In our case the Pareto set looks as in the Fig.1.
have only one free parameters p1 ∈ (0, 1) which should be It this figure
selected to fulfill the following optimality requirements:
2 P is the
Pareto set,
1) to make the ”workability zone” e+
2 = as much
F1 (p1 ) Futop = min F1 (p1 ) , min F2 (p1 ) is the utopia
p1 ∈(0,1) p1 ∈(0,1)
as possible which corresponds to the minimization of
point,
the function
πP {Futop } is the projection of the utopia point Futop
1 The proof is described in Keshtkar et al. (2016) to the Pareto set.
381
CHAPTER 12. OPTIMIZATION
20
15
States X1
10
5 x1
x2
x1opt
0 x2opt
0 2 4 6 8 10
Time, s
382
CHAPTER 12. OPTIMIZATION
u1
2000
u2
Control signals, N
u1opt
1000
u2opt
−1000
−2000
0 2 4 6 8 10
Time, s
the control signals we see a reduction in the control signal Jornada, D. and Leon, V.J. (2016). Biobjective robust op-
values. timization over the efficient set for pareto set reduction.
European Journal of Operational Research, 252(2), 573
Finally, Fig. 6 illustrates the time-variations of the adap-
– 586.
tive control parameters k = k (x̂ (t) , t) , λ = λ (x̂ (t) , t)
Jung, J., Huh, K., Fathy, H.K., and Stein, J.L. (2006).
providing the desired dynamics of the considered closed-
Optimal robust adaptive observer design for a class of
loop system.
nonlinear systems via an h-infinity approach. American
25
control conference, In Proceedings of the 2006, 3637–
lamda 3642.
20 k Keshtkar, S., Keshtkar, J., and Poznyak, A. (2016). Adap-
tive sliding mode control for solar tracker orientation.
15 2016 American Control Conference, (accepted).
lamda, k
383
CHAPTER 12. OPTIMIZATION
Energies, 3, 1586–1603.
Zheng, Y., Fatta2, H.A.A., and Loparo, K.A. (2000). Non-
linear adaptive sliding mode observer-controller scheme
for induction motors. Int. J. Adapt. Control Signal
Process, 14, 245–273.
384
CHAPTER 12. OPTIMIZATION
1. INTRODUCTION with the addition of the rules of Deb (2000) for handling
constrains is presented, as a tool for solving real world
In recent years, the use of metaheuristics has been a very optimization problems.
effective tool for solving real world problems. In engi- The four-bar mechanism is used in many industrial ap-
neering, synthesis is the process for designing mechanical plications because of its simplicity; however, its synthesis
systems; the dimensional synthesis is the length calculus for trajectory tracking is a complex task that can not be
of the components of a mechanism for producing a specific solved efficiently by using deterministic methods. Three
movement or behavior. The application of metaheuristics case studies are presented here, in order to test the ef-
in this kind of numerical problems has been treated previ- ficiency of the MBSO algorithm and the quality of its
ously in related literature as in Santiago-Valentin (2016); results; this cases are variations of a four-bar mechanism
Jiménez et al. (2014); Portilla-Flores et al. (2014), among that are designed for controlling specific trajectories. In the
others. modeling of the problems was included a normalization
The Brainstorm Optimization Algorithm is a relatively function for the links (Capistran-Gumersindo, 2015), in
new algorithm proposed by Shi (2011), that has been used order to improve the aesthetic of the mechanisms; this
successfully in engineering optimization problems as in means that the ratio between the bars should be balanced.
Duan et al. (2013), Jordehi (2015) and Fernández-Jiménez A mechanism is more aesthetic as it is better suited for its
(2014), among others. Subsequently a modification of manufacturing; it implies an enhance on stress distribu-
the original algorithm was proposed by (Zhan et al., tion, the use of cheaper and/or less specialized materials,
2012) called Modified Brainstorm Optimization Algorithm etc.
(MBSO) which computationally improves the performance This paper is distributed as follows: In Section 2 the
of the original algorithm; this algorithm is used in this methodology is introduced, including the analysis of the
work. In this paper, a new proposal based on MBSO four-bar mechanism, the optimization strategy and the
? The authors would like to thank Instituto Politécnico Nacional
computational implementation. In Section 3 the results of
the case studies are analyzed, while the final discussion is
(IPN) of México, for its support via Secretarı́a de Investigación y
Posgrado (SIP) with the project SIP-20161615.
385
CHAPTER 12. OPTIMIZATION
presented in Section 4. Finally, there is an appendix with required four-bar mechanisms it is necessary to consider
the data tables resulting from the simulation carried out. the following constraints:
• Law of Grashof: This law establishes that for a planar
2. METHODOLOGY
four-bar linkage, the sum of the shortest bar and
the largest bar cannot be larger than the sum of
This research addresses the dimensional synthesis of a
the remaining bars, if a continual relative rotation
four-bar mechanism with three different case studies for
between two elements is desired (Norton, 1995). It is
trajectory control; so, as a first point the general model of
expressed in (4),
this mechanism is developed.
r1 + r2 ≤ r3 + r4 (4)
2.1 Kinematics of the mechanism
So, the relationships in (5) must be satisfied,
The four-bar mechanism is one of the most used devices
in machinery design due to its simplicity for controlling r2 < r3 , r3 < r4 , r4 < r1 (5)
movements with one freedom degree. These mechanisms • Sequence of angles: The coupler of the mechanism
have been well studied in the corresponding literature; a must pass in a logical order along the points that
detailed description is in Sanchez-Marquez et al. (2014). delimited the trajectory (precision points). It can be
Fig. 1 shows a planar mechanism whose elements are: a achieved if the angular positions corresponding to
reference bar (r1 ), a crank bar (r2 ), a couple bar (r3 ) and each of the precision points are ordered in magnitude
a rocker bar (r4 ).The equation of the closed loop is in (1): as shown in (6), where n is the number of precision
r1 + r4 = r2 + r3 (1) points in the desired path.
n−1
X
θ2i − θ2i+1 ≤ 0 (6)
i=1
It is important to note that equations (2), (3), and the where x is a vector formed by the variables that represents
expressions of the mechanism kinetics are enough to cal- the magnitude of the bars, as expressed in (9),
culate any position of point C along the trajectory of the
mechanism.
x = [r1 , r2 , r3 , r4 , rcx , rcy ]T = [x1 , x2 , x3 , x4 , x5 , x6 , ]T
(9)
2.2 Design constraints
The merge of functions (7) and (8) forms the weighted sum
Since the case studies presented are taken from real- indicated in (10), where w1 and w2 are the coefficients for
world problems, in order to solve the synthesis of the controlling the aesthetic of the mechanism. With a value
386
CHAPTER 12. OPTIMIZATION
g2 (p) = p2 − p3 ≤ 0
g3 (p) = p3 − p4 ≤ 0 The vector of design variables is defined by (22):
387
CHAPTER 12. OPTIMIZATION
n
X
i Algorithm 1 Modified Brain Storm Optimization
Error1 = [(C1xd − Cxi )2 + (C1yd
i
− Cyi )2 ]
1: Randomly generate N ideas and evaluate them
i=1 2: while (CurrentEval 6 EvalM ax) do
Xn
3: Cluster the N ideas into M clusters, keep the
i
Error2 = [(C2xd − Cxi )2 + (C2yd
i
− Cyi )2 ] (25) best idea in each cluster as the cluster center
i=1 4: if (rand(0, 1) < PR ) then
This problem is subject to the constraints in (26): 5: Randomly select a cluster and replace its center
with a randomly generated idea
g1 (p) = p1 + p2 − p3 − p4 ≤ 0 6: end if
7: for (i = 1 to N ) do
g2 (p) = p2 − p3 ≤ 0 8: if (rand(0, 1) < PG ) then . Create an idea based on one
g3 (p) = p3 − p4 ≤ 0 cluster
9: Randomly select a cluster j with probability
g4 (p) = p4 − p1 ≤ 0 Pj
10: if (rand(0, 1) < PC1 ) then
g5 (p) = p10 − p11 ≤ 0 11: Add random values to the selected cluster
g6 (p) = p11 − p12 ≤ 0 center, to generate a new idea Yi
12: else
g7 (p) = p12 − p13 ≤ 0 13: Add random values to a random idea from the
selected cluster, to generate a new idea Yi
g8 (p) = p13 − p14 ≤ 0
14: end if
g9 (p) = p14 − p15 ≤ 0 15: else . Create an idea based on two clusters
16: Randomly select two clusters, j1 and j2
g10 (p) = p15 − p16 ≤ 0 17: if (rand(0, 1) < PC2 ) then
g11 (p) = p16 − p17 ≤ 0 18: Combine two cluster center and add a
random value to generate a new idea Yi ;
g12 (p) = p17 − p18 ≤ 0 19: else
20: Combine two random ideas from the two
g13 (p) = p18 − p19 ≤ 0 (26) clusters and add a value to generate a
new idea Yi ;
2.5 Optimization Strategy 21: end if
22: end if
The Brainstorm Optimization algorithm was proposed 23: Evaluate the idea Yi and replace Xi if Yi is
by Shi (2011), based in four ideas developed by Osborn better than Xi ;
(1957). Zhan et al. (2012) presented an enhanced version, 24: end for
MBSO, which is used in this work. It is inspired on the 25: end while
process of solving a problem by humans when they get
together to interchange ideas and opinions, complementing tion of weights; these combinations start with w1 = 1 and
each other in order to reach a general solution. In the w2 = 0, and by subtracting 0.1 and adding 0.1 respectively
algorithm the ideas are represented with vectors of real they finish in w1 = 0.1 and w2 = 0.9. The number of
numbers corresponding to the design variables. The ideas evaluations of the objective function was taken as a stop
are grouped accordingly to their similarity, in a process condition, with the values of 350,000, 150,000 and 350,000
called clustering; then, it is determined whether one of for CS1, CS2, and CS3, respectively; the values of the
these central ideas will be replaced for a completely new complementary parameters are listed in the Table 3.
one. New ideas can be formed from one or two clusters, and The constraints are handled by implementing in MSBO
a process of competition is produced (Zhan et al., 2013); the feasibility rules of Deb (2000), which dictate that:
the complete MBSO is described in the Algorithm 1.
Table 2. Parameters of the MBSO algorithm
MBSO uses the parameters listed in the Table 2. The pro-
cess of creating a new idea is governed by the relationship Number of ideas N
in (27). This equation indicates that the new idea Y1 in Number of clusters M
the nth dimension (d) is a random number between the Probability of replacing a full idea PR
limits Ld and Hd if and only if a random number is lower Probability of replacing one component of the idea Pr
than a preset probability Pr . Otherwise, the new idea will Probability of creating a new idea based in one cluster PG
be generated by adding noise; this noise is produced from First probability of center cluster idea PC1
the difference between two elements of the same dimension Second probability of center cluster idea PC2
from another two ideas, picked randomly and multiplied
by a random number between 0 and 1. The function rand Table 3. Tuned parameters for all cases
generates random numbers with a uniform distribution.
Parameter CS1 CS2 CS3
N 45 100 100
d rand(Ld , Hd ) if (rand(0, 1) < Pr ) M 10 20 30
Y1new =
pdi + rand(0, 1)(pda − pdb ) otherwise PR 0.4 0.4 0.4
PG 0.8 0.8 0.8
(27)
PC1 0.8 0.7 0.8
PC2 0.8 0.7 0.7
Experiment design A set of 300 simulations was executed
Pr 0.01 0.01 0.01
for each case study, with 30 executions for each combina-
388
CHAPTER 12. OPTIMIZATION
(1) Between two feasible individuals the one with the best
value in the objective function is selected.
(2) Between a feasible individual and another unfeasible,
the feasible one will be chosen.
(3) Between two infeasible individuals, the individual
with the lower sum of constraint violations will be
accepted.
The feasibility is measured by the sum of constraint
violation determined by (28), where p is the number of
constraints; only inequality constraints were used in the
three case studies:
X p
SV R = max(0, gi (p)) (28)
i=1
3. RESULTS
In all the three case studies it was found that, without the
normalization function, the mechanisms tend to be quite
disproportionate, specially in CS2 and CS3 ; e.g., in the
second case the value of r2 for a combination of weights
w1 = 1 and w2 = 0 is about ten times smaller than the rest
of the bars. These results can be consulted in the appendix
of this paper. On the other hand, if it is given more weight Fig. 2. Mechanism for the first case study
to the normalization function, the obtained mechanisms Fig. 3. It is noteworthy that in this case the trajectory
tend to present unexpected behaviors since bars are more marked by the mechanism has a error that could be
like each other and they appear to be more a double rocker questionable for the purpose of an optimal trajectory
device. The best aesthetic and functional mechanisms were tracking, but otherwise complies with the ratio of the bars;
generated when the weight assigned to the normalization without the normalization function the mechanism would
coefficient was in the range 0 < w2 < 0.5. be infeasible in the real world.
Table 4. Statistical analysis - CS1 As in the second case, for CS3 the mechanisms produced
Weight 0.1+0.9
without a normalization tend to be physically irrepro-
Average 0.5107 σ2 0.3311 Best 0.0706
ducible; the algorithm generated its best result with w1 =
Median 0.1998 σ 0.5754 Worse 2.0423
0.7 and w2 = 0.3. This mechanism is shown in Fig. 4.
Weight 0.5+0.5
4. CONCLUSION
Average 1.2645 σ2 0.4008 Best 0.15381
Median 1.2354 σ 0.633 Worse 2.43293
As noted in the results section, the use of an additional
Weight 0.9+0.1
2 function for normalization allow to control the size of the
Average 0.3881 σ 0.0862 Best 0.13537
bars by simply varying the proportion in a weighted sum.
Median 0.2995 σ 0.2936 Worse 1.30035
Depending on how much accuracy is required in the tra-
Weight 1+0
jectory less weight should be assigned to the normalization
Average 0.3823 σ2 0.2603 Best 0.0019
function; on the other hand, if the weight assigned to the
Median 0.0454 σ 0.5102 Worse 1.6361
normalization function is greater than 0.5 the obtained
MBSO generated good results from the point of view mechanisms have bars that tend to be very similar among
of engineering as they converged to feasible solutions themselves, generating unexpected behaviour. This sug-
in the three problems, and the values of the objective gests that the adequate range for aesthetic mechanisms
function shown quality results corresponding to acceptable should be w2 ∈ [0, 0.5]. This range allows to have a balance
trajectories. Tables 4, 5, and 6 show the statistic analysis Table 5. Statistical analysis - CS2
for CS1, CS2, and CS3, respectively, taking into account
the extreme and the medium cases regarding the weights. Weight 0.1+0.9
In the first two case studies, the value σ 2 < 1 indicates Average 0.0924 σ2 2.03E-17 Best 0.0924
that the results tend to be accurate. Median 0.0924 σ 4.51E-09 Worse 0.0924
Weight 0.5+0.5
For CS1, the best result was obtained with the combina- Average 0.1247 σ2 4.70E-13 Best 0.1247
tion of weights of 0.9 to the mean square error function Median 0.1247 σ 6.86E-07 Worse 0.1247
and 0.1 to the normalization function; a simulation of the Weight 0.9+0.1
corresponding mechanism is shown in Fig. 2. Average 0.0379 σ2 9.37E-10 Best 0.0379
For the second problem the mechanisms generated without Median 0.0379 σ 3.06E-05 Worse 0.03803
389
CHAPTER 12. OPTIMIZATION
REFERENCES
Capistran-Gumersindo, E. (2015). Cómputo inspirado en
la naturaleza para la optimizacion mono-objetivo de un
efector final de tres dedos. Master’s thesis, Laboratorio
Nacional de Informatica Avanzada, Mexico.
Deb, K. (2000). An efficient constraint handling method
for genetic algorithms. Computer Methods in Applied
Mechanics and Engineering.
Duan, H., Li, S., and Shi, Y. (2013). Predator–prey
brain storm optimization for dc brushless motor. IEEE
Transactions on Magnetics, 49(10), 5336–5340.
Fernández-Jiménez, M. (2014). Optimización de un
mecanismo de cuatro barras usando el algoritmo de op-
timización basado en tormenta de ideas. Master’s thesis,
Laboratorio Nacional de Informatica Avanzada, Mexico.
Jiménez, V.P., Sánchez, O.F.A., and Mauledoux, M.
Fig. 3. Mechanism for the second case study (2014). Diseño de un efector final de tres dedos para
agarre óptimo. DYNA: revista de la Facultad de Mi-
nas. Universidad Nacional de Colombia. Sede Medellı́n,
81(184), 93–101.
Jordehi, A.R. (2015). Brainstorm optimisation algorithm
(bsoa): An efficient algorithm for finding optimal lo-
cation and setting of facts devices in electric power
systems. International Journal of Electrical Power &
Energy Systems, 69, 48–57.
Norton, R.L. (1995). Diseño de Maquinaria, una Introduc-
cion a la Sintesis y Analisis Mecanismos y Maquinas.
McGraw Hill.
Osborn, A. (1957). Applied Imagination-Principles and
Procedures of Creative Writing. Charles Scribner’s Sons.
Portilla-Flores, E., Santiago-Valentin, E., Solano-Palma,
A., Vega-Alvarado, E., and Calva-Yáñez, B. (2014).
Cálculo de fuerza para agarre óptimo de un efector
final de tres dedos utilizando el algoritmo de forrajeo de
bacterias. Second International Conference on Advanced
Mechatronics, Design and Manufacturing Technology.
Sanchez-Marquez, A., Vega-Alvarado, E., Portilla-Flores,
E.A., and Mezura-Montes, E. (2014). Synthesis of a
planar four-bar mechanism for position control using the
harmony search algorithm. In Electrical Engineering,
Computing Science and Automatic Control (CCE), 2014
11th International Conference on, 1–6. IEEE.
Santiago-Valentin, E. (2016). Optimización de sistemas
mecatronicos utilizando herramientas de inteligencia ar-
Table 6. Statistical analysis - CS3
Fig. 4. Mechanism for the third case study
Weight 0.1+0.9
between the aforementioned aesthetics and the precise Average 1.1679 σ2 0.6338 Best 0.13004
control of the trajectories, in order to produce solutions Median 1.265 σ 0.7961 Worse 2.64519
that are acceptable from the point of view of engineering. Weight 0.5+0.5
Analyzing the MSBO from the point of view of computing, Average 2.57 σ2 6.1875 Best 0.191
the algorithm meets the requirements for its use in op- Median 1.5815 σ 2.4874 Worse 8.2547
competitive results that suggest that MSBO can be use Average 2.4647 σ2 15.435 Best 0.4662
as a tool for real world problems. However, the algorithm Median 0.8776 σ 3.9288 Worse 12.4385
390
CHAPTER 12. OPTIMIZATION
Appendix A. RESULTS OF THREE STUDY CASES Table A.2. Main weights of the CS2
OF THE MAIN WEIGHTS
Weight 0.1+0.9
Table A.1. Main weights of the CS1 r1 r2 r3 r4
Best 1.861134 1.790211 1.811698 1.861134
Weight 0.1+0.9
Worse 1.861238 1.790278 1.811783 1.861238
r1 r2 r3 r4 rcx rcy
rcx rcy F.O
Best 36.084 34.994 36.084 36.084 35.799 35.894
Best 1.808446 1.846364 0.092499
Worse 49.701 12.906 13.411 49.663 -1.164 12.869
Worse 1.808516 1.84644 0.092499
θ0 x0 y0 θ21 θ22 θ23
Weight 0.5+0.5
Best 4.078 -17.102 41.158 3.095 3.266 3.427
r1 r2 r3 r4
Worse 5.703 19.934 22.020 0.548 0.714 0.892
Best 1.927416 1.785747 1.927416 1.927416
θ24 θ25 θ26 F.O
Worse 1.925981 1.784889 1.925981 1.925981
Best 3.578 3.725 3.868 0.0706
rcx rcy F.O
Worse 1.087 1.335 1.681 2.0423
Best 1.69976 1.946801 0.12476
Weight 0.5+0.5
Worse 1.699511 1.947457 0.124764
r1 r2 r3 r4 rcx rcy
Weight 0.7+0.3
Best 30.476 27.127 30.044 30.46 25.245 25.209
r1 r2 r3 r4
Worse 41.972 23.304 41.962 41.96 -50.923 -44.808
Best 1.946624 1.844081 1.946624 1.946624
θ0 x0 y0 θ21 θ22 θ23
Worse 1.957514 1.850633 1.957514 1.957514
Best 3.802 -8.144 52.152 2.61 2.7974 2.991
rcx rcy F.O
Worse 1.9 -60 18.436 3.484 3.5913 3.696
Best 1.585674 1.963814 0.093646
θ24 θ25 θ26 F.O
Worse 1.587307 1.956146 0.093749
Best 3.191 3.399 3.606 0.1538
Weight 0.9+0.1
Worse 3.797 3.894 3.987 2.4323
r1 r2 r3 r4
Weight 0.9+0.1
Best 1.990289 1.921494 1.990289 1.990289
r1 r2 r3 r4 rcx rcy
Worse 2.012848 1.934196 2.012847 2.012848
Best 39.452 14.133 22.785 37.883 23.342 17.332
rcx rcy F.O
Worse 43.732 6.0066 10.835 43.717 -28.829 59.999
Best 1.477738 1.947855 0.037909
θ0 x0 y0 θ21 θ22 θ23
Worse 1.482921 1.929627 0.038037
Best 3.696 0.309 52.83 2.267 2.636 2.985
Weight 1+0
Worse 2.862 -49.718 36.512 3.882 4.021 4.155
r1 r2 r3 r4
θ24 θ25 θ26 F.O
Best 20.32431 2.12592 20.31793 20.3206
Best 3.343 3.717 4.115 0.1353
Worse 33.67168 2.040306 33.64924 33.65048
Worse 4.284 4.411 4.534 1.3
rcx rcy F.O
Weight 1+0
Best 2.252166 -0.04672 0.002893
r1 r2 r3 r4 rcx rcy
Worse 2.328561 -0.10588 0.003544
Best 50.44 6.191 8.794 47.872 -8.510 -10.107
Worse 59.527 8.582 8.595 59.518 8.484 -40.094
θ0 x0 y0 θ21 θ22 θ23
Best 4.513 27.018 35.655 0.508 1.022 1.385
Worse 3.253 59.999 46.489 1.153 1.277 1.406
θ24 θ25 θ26 F.O
Best 1.716 2.049 2.417 0.0019
Worse 1.531 1.650 4.641 1.6361
391
CHAPTER 12. OPTIMIZATION
392
CHAPTER 13
POWER SYSTEMS
CHAPTER 13. POWER SYSTEMS
∗
Faultad de Ingeniería, Universidad Nacional Autónoma de México
(e-mail:michrojasg@comunidad.unam.mx,gerardoe@unam.mx).
Resumen
En el presente trabajo se muestra una método para obtener condiciones de existencia de estado es-
tacionario constante en circuitos eléctricos al emplear propiedades estructurales que presentan este
tipo de sistemas. En trabajos recientes se han podido establecer condiciones de estado estacionario
para circuitos de corriente directa hasta con dos cargas CPL; el objetivo principal del presente escrito
es mostrar que es posible establecer este tipo de condiciones de existencia de estado estacionario de
manera sistemática vía sus propiedades estructurales.
Keywords: Estado estacionario constante, CPL, árbol, co-árbol, circuitos eléctricos, matriz de
transferencia.
394
CHAPTER 13. POWER SYSTEMS
Y (s) = G(s)U (s) + k (1) se presentó en Sanchez et al. (2013) como una condición
necesaria y suficiente, pero sólo representa la cantidad máxi-
donde G(s) ∈ Rm×m es la función de transferencia, Y (s) = ma de potencia que el sistema es capaz de disipar(Barvanov
L {y(t )} es el vector de salidas (voltajes en las cargas), U (s) = et al. (2015)), es decir, es la cota superior de potencia que
L {u(t )} es el vector de entradas (corrientes en las cargas), la CPL puede demandar a la fuente. Cabe mencionar que la
donde el operador L indica la transformada de Laplace y condición dada por (7) es necesaria y suficiente para circuitos
k ∈ Rm representa la relación entre las entradas (corrientes eléctricos en donde m = 1, es decir, sólo tienen dentro de su
de las cargas) y la fuente de excitación, donde m es el número configuración una CPL, si m > 1 se tiene la siguiente propo-
de CPL dentro del sistema. Las entradas u y las salidas y son sición
variables conjugadas de puerto, es decir, el producto entre Proposición 2. {Barvanov et al. (2015), Proposición 1} Se asu-
éstas (u T y) tiene unidades de potencia, además establecen me que existe una matriz T := d i ag {t i } ∈ Rm×m tal que
la relación de las CPL con el sistema como se muestra en la · ¸
T G(0) +G T (0)T Tk
siguiente ecuación >0 (8)
(T k)T 21Tm T P
ū i (g i ū + k i ) = − P i ; i ∈M (6)
|{z} Con base en lo anterior es posible definir el espacio de bordes
ȳ i ū i C 1 (G) como un espacio vectorial real b-dimensional de todas
las funciones E (G) ∈ R donde se encuentran las corrientes
donde G T (0) := [g 1 g 2 · · · g m ], con base en la (6), es posible que fluyen a través de los bordes, de la misma forma es posi-
realizar la siguiente proposición ble definir el espacio dual C 1 (G) donde se encuentran todos
Proposición 1. El estado estacionario constante existe si y los voltajes v. Las variables de puerto de los elementos que
sólo si la solución de (6) sobre la entrada u ∈ R. se encuentran inter-conectados, al ser circuitos eléctricos,
♦ deben cumplir con las restricciones impuestas por las leyes
de Kirchoff, dichas restricciones se encuentran dadas en fun-
donde la condición para que (6) tenga solución es la siguiente ción de cusets y loopsets básicos que se generan en gráficas
especiales, árbol yco-árbol.
1 T Xm Un árbol es un subgráfica conectada que posee n nodos y
k [G(0) +G T (0)]−1 k ≥ Pi (7) n − 1 bordes (ramas) de modo que en su composición no
2 i =1
existan trayectorias cerradas, los bordes restantes (cuerdas),
sí lo anterior es cierto, según la definición, existe estado es- b − (n − 1) corresponden a los bordes del co-árbol. EL cutset
tacionario constante. En realidad la condición dada por (7) 3 Si desea conocer más de teoría de grafos consulte Gross and Yellen (1999)
395
CHAPTER 13. POWER SYSTEMS
i t = −Hi c vc = H T v t (14) RL L
iL
El objetivo del presente es mostrar que para circuitos eléc- + iC P L
tricos el método descrito en Barvanov et al. (2015) se puede rC
Vc C CPL
aplicar para circuitos con una estructura más compleja al
−
tomar en cuenta las propiedades estructurales de los circui-
tos eléctricos y en una primera aproximación sin importar el RL L
número de cargas en su estructura. En la siguiente sección iL
se muestra el análisis con un circuito con una carga para iC P L
realizar el estudio de la existencia de estado estacionario para +
comprender y aterrizar mejor los conceptos descritos en este
capítulo. Vc C rC CPL
E
−
3. Circuitos con CPL
396
CHAPTER 13. POWER SYSTEMS
para obtener esta representación 4 , en la Figura 3 se muestra por lo tanto el árbol asociado al circuito está compuesto
la simplificación del circuito ha analizar. por todos los nodos y por el conjunto de bordes {6, 1, 5, 4},
por consiguiente el co-árbol esta compuesto por {2, 3, 8, 7}.
RL L Al aplicar la teoría de grafos, para un circuito con estructura
iL
radial, la matriz fundamental de trayectorias cerradas 5 se
+ iC P L encuentra dada por
Vc C rC CPL · ¸
− HC ↑ HC L
H= . (18)
HR↑ HRL
RL L
iL en donde y de acuerdo a las características estructurales de
iC P L la red las sub matrices que la definen quedan representadas
+
por
Vc C rC CPL
HC ↑ = I ∈ Rn1 ×n1
−
HC L = ant i d i ag {1} ∈ Rn1 ×n1
(19)
HR↑ = 0 ∈ Rn2 ×n2
HRL = I ∈ Rn3×n3
Figura 3. Red radial simplificada. Para obtener la representación entrada-salida correspon-
Para fines de presentar el resultado principal de este trabajo diente a las cargas para la red radial se sigue la metodología
se realizan las siguientes consideraciones empleada en Avila-Becerril et al. (2015), en donde las varia-
bles de puerto de los elementos eléctricos se representan co-
mo
n1 Capacitores
n2 Resistencias de árbol ∂H a (q, φ)
(17) q̇ = i C , vC = = ∇q H a
n3 Inductores ∂q
n4 Fuentes de corriente que representan a las CPL ∂H a (q, φ) (20)
φ̇ = v L , iL = = ∇φ H a
∂φ
de igual forma, el grafo asociado corresponde al mostrado en
la Figura 4. i R t = − f t (v R t ), v Rc = − f c (i Rc )
6 5
∂HT ( f a , e a )
· ¸ · ¸
fa
3 7
x= ; ∇x HT (x) = ∂ fa = eC
ea ∂HT ( f a , e a ) fL (21)
2 8
∂e a
u = iC P L
A 1 B 4 C
en donde las relaciones entre las variables de puerto están
para elegir la representación en árbol y co-árbol se deben de dados por las siguientes ecuaciones
cumplir las siguientes observaciones
f˙a = −HC ↑ f ↑ − HC L f L
Observación 1. Las variables eléctricas de los elementos con- (22)
ė a = −HCT↑ e c + HR↑ e R t
trolados por voltaje pertenecen al árbol.
Observación 2. Las variables eléctricas cuyos elementos son en donde el modelo Hamiltoniano de esta red se encuentra
controlados por corriente pertenecen al co-árbol representado por el siguiente conjunto de ecuaciones
Observación 3. Para este análisis se desprecia el efecto de la
fuente de voltaje. ẋ = [J − R]∇x HT (x) + g u + kd
Observación 4. El número de capacitores es igual al número y = g T ∇x HT (x) (23)
de inductores, es decir n 1 = n 3 . T
z = k ∇x HT (x)
Observación 5. El número de las resistencias de árbol es igual
al número de inductores n 2 = n 3 . y al emplear la matriz fundamental de trayectoria cerrada se
Observación 6. El número de fuentes de corriente (cargas de obtiene
5 Recordar que la matriz fundamental de trayectoria cerrada representa un
potencia constante) es igual al número de resistencias de
capacitores n 1 = n 4 . mapeo lineal entre los elemento de árbol y co-árbol
4 En teoría de circuitos este método se conoce como teorema de superposi-
ción
397
CHAPTER 13. POWER SYSTEMS
· ¸
0 −HC L Para las configuraciones anillo y radial, en este momento
J −R = se encuentra desarrollando las condiciones de existencia de
HCTL −HRLT
(R t−1 )−1 HRL
· ¸ (24) estado estacionario bajo la misma metodología.
−HC ↑
g= T −1 −1
−HRL (R t ) HR↑ Agradecimientos
En donde la matriz de transferencia, desde la corriente hasta
el voltaje de la carga para circuitos eléctricos con esta estruc- Parte de este trabajo se realizó con el apoyp de DGAPA-
tura se expresa como UNAM con el número I N 116516.
Referencias
1
T
C Ls 2 +C HRL R t−1 HRL s + HC L HCTL (25) Avila-Becerril, S., Espinosa-Perez, G., and Fernanadez, P.
T T
d i ag {HC ↑ (Ls + HRL R t−1 HRL ) − HC L HRL HR↑ R t−1 } (2015). Dynamic characterization of typilcal electrical cir-
cuits via structural properties.
en donde la matriz de transferencia tiene como componentes Barvanov, N., Ortega, R., Griño, R., and Polyak, B. (2015).
en su diagonal principal las funciones de transferencia de On existence and satability of equilibria of linear time-
cada carga, es posible llegar a esta simplificación gracias a las invariant systems with constant power loads. IEEE
propiedades estructurales que exhibe el sitema. Para aplicar Transactions on Circuits and Systems, 62(12), 1–8.
la metodología antes descrita se obtiene el valor de la matriz Belkhayat, M., Cooley, R., and Witulski, A. (1995). Large sig-
de transferencia cuando s = 0 nal stability criteria for distributed systems with constant
power loads. IEEE.
T
HRL T
R t−1 HRL − HC L HRL HR↑ R t−1 Fujisawa, T. and Kuh, E. (1971). Some results on existence
. (26) and uniqueness of solutions of nonlinear networks. IEEE
HC L HCTL Transactions on Circuit Theory.
Gross, J.L. and Yellen, J. (1999). Graph theory and its applica-
Para establecer la condición de máxima potencia que una
tions. Chapman and Hall.
carga puede demandar en un arreglo con n cargas es necesa-
Higgins, T.J. (1949). History of the operational calculus as
rio analizar el comportamiento carga por carga, de esa forma
used in electric ccircuit analysis. Electrical Engineering, 42–
es posible establecer condiciones de máxima potencia, dicha
45.
condición es la siguiente
Hill, D. (1993). Nonlinear dynamic load models with recovery
for voltage stability studies. IEEE Transactions on Power
E2 Systems, 8(1), 166–177.
P j ≤ Pn (27)
4 j =1 r j Jusoh, A.b. (2004). The instability effect of constsant power
loads. National Power and Energy Conference.
donde P j es la máxima potencia que puede tener la j −ési ma Karlsson, D. and Hill, D. (1994). Modelling and indentifica-
carga, n es el total de cargas presentes en el circuito. tion of nonlinear dynamic loads in power systems. IEEE
Transactions on Power Systems, 9(1), 157–166.
Overbye, T. (1994). Effect of load modelling on analysis of po-
Con base en la condición anterior y la estructura del sistema,
wer system voltage stability. Electrical Power and Electrical
es posible realizar la siguiente proposición
Systems, 16(5), 329–338.
Proposición 3. Para un circuito eléctrico con la estructura Roska, T. (1978). On the uniqueness of solutions of nonlinear
mostrada en la Figura 2 existe estado estacionario, si y sólo dynamic networks and systems. IEEE Transactions on
si cada carga cumple con la condición (27) Circuit and Systems, CAS-25(3), 161–169.
Sanchez, S., Ortega, R., Bergna, G., Molinas, M., and Griño, R.
de este modo, es posible establecer condiciones de existencia
(2013). Conditions for existence of equilibrium points of
de estado estacionario con base en las propiedades estructu-
systems with constant power loads. Conferenc.
rales de los circuitos eléctricos.
S.Sanchez and Molinas, M. (2012). Assessment of a stability
4. Conclusiones analysis tool for constant power loads in dcc-grids. In
15th International Power Electronics and Motion Control
Se establecieron condiciones de existencia de estado estacio- Conference.
nario constante para un tipo específico de circuitos eléctricos Wellestad, P. (1979). Introduction to physical system mode-
con n CPL considerando sus propiedades estructurales, es lling. Avademic Press London.
cierto que en el presente análisis se busca establecer con- X.Liu, A.F. and Cross, A. (2007). Negative input resistence
diciones de existencia solamente para un tipo específico de compensator for a constant power load. IEEE Transactions
topología, pero, a través del conocimiento de las propiedades on Industrial Electronics, 54(6), 3188–3196.
estructurales de las topologías básicas es posible establecer
condiciones de existencia vía sus propiedades estructurales,
lo cual es un campo de estudio abierto para el análisis de cir-
cuitos eléctricos, así como la extensión de estas condiciones
para circuitos de AC para estudiar problemas característicos
de estos sistemas como el colapso de voltaje e inestabilidad
en general en estos sistemas.
398
CHAPTER 13. POWER SYSTEMS
Resumen: El objetivo de este trabajo fue la implementación y validación de dos técnicas de control
adaptativo por modelo de referencia MRAC digital, sobre un microcontrolador de 16 bits, comparadas
con un controlador PID convencional ante perturbaciones para el control de posición angular de un
balancín equilibrado con el impulso de un motor sin escobillas. El sistema está conformado por una barra
metálica la cual tiene acoplado a su centro de gravedad un rodamiento y en uno de sus extremos tiene el
motor sin escobillas con hélice, el cual produce una fuerza de empuje que actúa sobre la barra haciendo
cambiar la posición de ésta a un ángulo determinado. La velocidad del motor es controlada a través del
control electrónico de velocidad ESC, empleando un microcontrolador como sistema electrónico que
realimenta el ángulo de posición para generar las salidas de control al motor. Para la validación se
implementó un sistema de control PID, un sistema de control MRAC con dos parámetros de adaptación y
un controlador MRAC Proporcional Diferencial MRAC-PD. Los tres controladores se validaron con y
sin perturbaciones, demostrando que el comportamiento de los dos controladores MRAC presenta
mejores resultados ante sistemas variantes en el tiempo o sometidos a perturbaciones externas.
Palabras claves: Control Adaptativo por Modelo de referencia, Control PID, Modelo de un Balancín,
Motor Brushless.
399
CHAPTER 13. POWER SYSTEMS
sobre el sistema las cuales son de interés para la obtención Teniendo en cuenta los datos anteriores, la ecuación 1 se
del modelo matemático (Corvalán Aravena & Negroni Vera puede reemplazar como se presenta en la ecuación 5 y de esta
2012; Martín Ballesteros & Del Río Carbajo 2013). Para manera obtener el modelo de la planta en términos de
obtener el modelo matemático aproximado de este sistema, se Laplace.
emplea la ecuación del movimiento de rotación de un sólido
alrededor de su eje central de inercia, la cual es análoga a la ( )
segunda ley de Newton y se representa mediante la ecuación
1 (Viltres La Rosa 2012).
Aplicando la transformada de Laplace a la ecuación 5 se
( ) obtiene la función de transferencia.
( ) ⁄
( )
( )
( )
( )
( )
( )
400
CHAPTER 13. POWER SYSTEMS
( ) ( ) ( )
( ( ) )
Partiendo de la ecuación 16 la función de transferencia en
lazo cerrado que relaciona la salida Y del sistema con la
entrada R, se obtiene el modelo de referencia representado en
la ecuación 17.
( )
( ( ) )
Fig. 3. Diagrama de bloques del control digital PID en
Una vez obtenido el modelo de referencia se procede a
Simulink.
obtener las ecuaciones que describen el comportamiento de
los dos parámetros de adaptación. Para obtener los dos
El valor de referencia es comparado con el valor recibido del
parámetros se reemplaza la ecuación 17 en la ecuación 18
potenciómetro y el valor producido por la diferencia entre
obteniendo las ecuaciones 19 y 20.
estos dos valores es enviado al bloque PID, el cual crea la
señal de control correspondiente para la corrección de ese
error. A la salida del PID el bloque conv1 convierte los datos ( ) ( )
a formato unit8, facilitando de esta manera que el bloque
Serial Send pueda transmitir los datos a través del puerto ( )
serial. ( ( ) )
( )
( )
( ) ( )
( ( ) )
( )( ) ( )
( ) ( )
( ( ) )
2.4 Controlador MRAC con Dos Parámetros de Adaptación
La fig.4 corresponde al diagrama de bloques, hecho en
Proporcional
Matlab Simulink, que describe las ecuaciones del controlador
MRAC para dos parámetros.
Para la obtención de los modelos de referencia se debe
plantear un modelo de referencia a partir de la respuesta en
lazo cerrado del sistema. La ecuación 13 y 14 corresponde a
la salida Y de la planta como el producto en términos de
Laplace entre la ley de control U y la planta G (Duraisamy &
Dakshinamurthy 2010).
( ) ( )
( )( ) ( )
401
CHAPTER 13. POWER SYSTEMS
La ecuación 23 corresponde a un controlador MRAC con ley controlador PID, controlador MRAC dos parámetros y
de control PD con filtro. controlador MRAC PD respectivamente sin la pesa.
Como se puede apreciar en las figuras 6 a la 9, los tres
( )
controladores responden bien a la misma posición
preestablecida, teniendo un tiempo de establecimiento menor
Partiendo de la ecuación 23 y realizando el procedimiento de 5 segundos y no mostrando diferencias significativas que
anterior como se hizo en el control proporcional de dos con lleven a la inestabilidad del sistema.
parámetros se obtiene la salida del control representada en 24
y 25. Por otra parte, las figuras 9, 10 y 11 corresponden a las
pruebas realizadas con la perturbación dada por la pesa a los
( ) misms tres controladores. En los tres casos, la perturbación se
( )( )( ) ( )
( ) realiza cuando la respuesta del sistema ya se encuentra en
estado estable. Para los tres casos se realizó la perturbación
( ) pasados los 5 segundos a la respuesta al escalón.
( )
( ) ( )
( )( )
( ( ) ( ) )
3. RESULTADOS
Las pruebas a los diferentes tipos de controladores se
realizaron en un dispositivo Arduino Mega, para lo que se
discretizaron todas las ecuaciones de los sistemas de control
empleando un periodo de muestreo de 50 ms. Estas pruebas
consisten en analizar la respuesta del sistema de control sin y
con perturbaciones, para lo que se empleó una pesa de 60
gramos que se ubica en el brazo del balancín a una distancia
de 20 cm con respecto al centro del motor. Al incluir esta Fig. 8. Salida del controlador MRAC-PD.
pesa al balancín se modifican sus parámetros y así su modelo
matemático. Esta prueba con la pesa se realiza cuando el
sistema ya se encuentra en estado estable ante una posición
de setpoint dada. Las figuras 6, 7 y 8 corresponden al
402
CHAPTER 13. POWER SYSTEMS
6. CONCLUSIONES
Uno de los objetivos de este proyecto era el de demostrar, de
Fig. 9. Salida del controlador PID cuando se pone una pesa forma académica, como los sistemas de control avanzados, en
de 60 gramos después de su estado estable. este caso los adaptativos, se comportan de forma adecuada
para sistemas no lineales o invariantes en el tiempo. Este
En la figura 9, se puede apreciar que para el caso de los trabajo continua con la investigación de sistemas de control
sistemas de control lineales, en este caso PID, estos adaptativos que se inició para una mesa vibratoria,
controladores no tienen la capacidad de responder ante planteándose como un primer acercamiento a la
perturbaciones en los parámetros del sistema. implementación de estos controladores para dispositivos
altamente inestables como lo son los helicópteros
cuadrotores. No obstante, la implementación directa del
controlador diseñado no puede aplicarse para un cuadrotor ya
que el modelo del balancín tiene un tiempo de respuesta
lento, lo cual no es aceptable para un sistema inestable como
el mencionado.
Asimismo, es importante tener en cuenta que el costo
computacional de los controladores adaptativos planteados es
elevado, diferente a la ecuación de diferencias de orden 2 que
se implementa en el control PID digital. Lo anterior se debe a
que el sistema digital debe simular el comportamiento de la
planta junto con el controlador para el modelo de referencia,
y a su vez ejecutar las ecuaciones de diferencias requeridas
por los parámetros adaptativos de las ecuaciones 21 y 22 para
Fig. 10. Salida del controlador MRAC con dos parámetros el caso de control proporcional con dos parámetros de
proporcionales cuando se pone una pesa de 60 gramos adaptación y las ecuaciones 26 y 27 para el control adaptativo
después de su estado estable. proporcional diferencial.
Finalmente, una ventaja de este desarrollo fue la construcción
de algoritmos para ser implementados en hardware,
optimizados para el procesamiento en microcontroladores de
16 bits. Como desventaja es importante recalcar que la
velocidad de procesamiento, limitada por la velocidad de
procesamiento del microcontrolador, puede ser mejorada al
implementarla en un sistema embebido como la Raspberry pi
o una Beagleboard.
REFERENCES
Clarke, D.W., 1988. Application of generalized predictive
control to industrial processes. IEEE Control systems
magazine, 8(2), pp.49–55.
Corvalán Aravena, I. & Negroni Vera, J.J., 2012. Modelo de
un Control Aproximado de un Cuadricóptero Basado
en Equilibrio de un Balancín. Trilogía Ciencia -
Fig. 11. Salida del controlador MRAC-PD cuando se pone Tecnología -Sociedad, 24(34), pp.81–91.
una pesa de 60 gramos después de su estado estable.
Duraisamy, R. & Dakshinamurthy, S., 2010. An adaptive
Las figuras 10 y 11 muestran la respuesta de los controles optimisation scheme for controlling air flow process
MRAC de dos parámetros y el PD, cuando estos son with satisfactory transient performance. Maejo
403
CHAPTER 13. POWER SYSTEMS
404
CHAPTER 13. POWER SYSTEMS
*Universidad Nacional de Colombia Sede Medellín, Maestría en Ingeniería Eléctrica; e-mail: cmflorezr@unal.edu.co.
**XM E.S.P, Doctorado en Sistemas Energéticos e-mail: jfvilla@xm.com.co
Abstract: En los sistemas eléctricos de potencia al presentarse una contingencia, se produce una
reacomodación natural no controlada en los ángulos de las barras que va acompañada de una
redistribución de los flujos y cargas por los elementos del sistema. Por otro lado, el desarrollo de las
unidades de medición fasorial en estos sistemas han posibilitado la medición del ángulo de voltaje
visualizando así una forma de poder controlar esta variable. Este trabajo propone entonces un posible
esquema de control fasorial para los sistemas eléctricos el cual modifique la dinámica de la evolución de
los ángulos de una manera ordenada y así a pesar de las diferentes contingencias, mantener el sistema
dentro de los valores tolerables por los elementos de la red. El control propuesto es implementado en un
IEEE 14 obteniendo resultados satisfactorios, conservando el sistema en sus ángulos determinados y de
esta forma evitando efectos adversos como sobre cargas en líneas de transmisión.
Keywords: PMU, Control de Frecuencia, WAMS, Control PI, Control en cascada, Sistemas eléctricos de
potencia, Control de Ángulo.
eléctrico en una cascada de eventualidades (Machowski et al.
1. INTRODUCCIÓN
2008).
Los sistemas de potencia que suministran energía a
Se clasifican tres tipos de estabilidades en los sistemas
instalaciones eléctricas hoy en día están mucho más
eléctricos de potencia: Estabilidad de Frecuencia, al cual se le
susceptibles a colapsos de lo que han estado en los últimos
asocia un sistema de estrategias de control de frecuencia.
años debido a que estos sistemas cada vez más dependen de
Estabilidad de Voltaje, al cual se le asocia un sistema de
fuentes de generación que están localizadas remotamente de
estrategias de control de voltaje. Y finalmente la estabilidad
los centros de carga (Cano Esperón 2004; Kundur 1994; Ilic
por el ángulo del voltaje, la cual no cuenta con una estrategia
2007). Dos factores fomentan que la generación esté alejada
de control definida tal como los dos tipos de estabilidades
de los centros de carga:
anteriores (Kundur 1994).
i. Las economías de compra de energía desde fuentes
remotas a más bajo costo a diferencia de generación 1.1 Inestabilidad por ángulo de fase
local más costosa.
405
CHAPTER 13. POWER SYSTEMS
En cuanto a la inestabilidad transitoria, esta puede ocurrir comunicación que trasmiten los valores de las variables más
debido al lento despeje de fallas en el sistema de transmisión. importantes del sistema de energía eléctrica, como son
Generalmente la inestabilidad transitoria de ángulo ocurre potencia activa, voltajes, corrientes, entre otras, a través de
cuando una falla en el sistema de transmisión cerca de la las líneas de interconexión hasta los centros de control, y a su
planta de generación no es despejada lo suficientemente vez hasta las centrales eléctricas que, regulando la generación
rápido para evitar un desbalance prolongado entre la salida de energía, controlan tanto los intercambios como la
mecánica y eléctrica del generador (Cañizares et al. 2005). frecuencia del sistema (Cañizares et al. 2005).
Una inestabilidad transitoria inducida por falla no ha sido la
causa de ningún apagón en los años recientes. Sin embargo, El desarrollo de la tecnología de medición fasorial de voltaje
los generadores necesitan estar protegidos de los daños que en los diferentes nodos del sistema eléctrico abrió una nueva
pueden resultar cuando la protección en los sistemas de posibilidad en el control de voltaje. Manejando la potencia
transmisión es muy lenta para operar (Vaschetti et al. 2012) activa en los grandes sistemas de generación se puede
controlar tanto la frecuencia del sistema interconectado,
como el intercambio de energía entre países (Fijalkowski
2. INTRODUCCIÓN AL CONTROL ANGULAR 2009).
En la actualidad, el control de la frecuencia en los sistemas de Actualmente, los sistemas de control utilizados para el
potencia, específicamente el AGC (Automatic Generation control de frecuencia asumen condiciones de estado
Control) y el control de los intercambios entre países y zonas estacionario, donde una pequeña perturbación en la
interconectadas, se apoya en los sistemas de comunicación generación, por lo general, causa una disminución de
que transmiten los valores de los intercambios medidos a velocidad, haciendo que las unidades que intervienen en el
través de las líneas de interconexión hasta los centros de control automático de generación actúen. El sistema se
control y a su vez hasta las centrales que, por medio de su reacomoda angularmente a nuevos valores. Algunas unidades
generación, pueden controlar tanto estos intercambios como permanecen en su punto de operación, esto es, no modifican
también la frecuencia del sistema (Corsi et al. 2004). El su potencia eléctrica entregada, mientras que las otras están
desarrollo de la tecnología de medición del fasor de tensión programadas para realizar el control de frecuencia y si
PMU (Phasor Measurement Unit) en los diferentes puntos del modifican la potencia entregada al sistema. Al colapsar una
sistema eléctrico abre una nueva posibilidad, en el sentido de unidad generadora, el sistema pierde velocidad, se producen
controlar esta variable mediante la potencia de las centrales efectos de baja frecuencia, grandes áreas del sistema se van
generadoras y de esta manera controlar de una forma afectadas con sobrecargas y disminución en los valores de los
novedosa tanto el intercambio entre países y zonas voltajes, debido a que por lo general los puntos de control son
interconectadas como la frecuencia del sistema distantes a los puntos donde se presenta el evento, generando
interconectado. Así se abre la posibilidad de lo que restricciones técnicas que encarecen el costo de operación del
podríamos llamar un AGC distribuido sin el uso de los sistema.
canales de telecomunicación.
Los sistemas de energía eléctrica son uno de los motores Se propone el desarrollo de un esquema de control angular
principales para el desarrollo económico y social de las para aplicarlo a algunos generadores del sistema eléctrico con
naciones, conformado por un conjunto de elementos como la el objeto de mantener el ángulo en las barras dentro de unos
generación, el transporte, la distribución y el consumo de la valores deseados lo cual podría traer como beneficios una
energía eléctrica (CIGRE 2007). Estos elementos constituyen mayor seguridad del sistema eléctrico, así como la
un sistema complejo e integrado, el cual necesita estar posibilidad de una mayor economía en el funcionamiento de
coordinado y regulado para garantizar el uso racional de los los mismos.
recursos naturales y de infraestructura, que debe cumplir
estándares de calidad en la prestación del servicio con 3. ESTRUCTURA DEL CONTROL ANGULAR
criterios de seguridad y confiabilidad (Standards & Process
2013).
Una posible estrategia para el control angular a un nivel
La seguridad de los sistemas de potencia requiere de la primario estaría dada por el flujograma presentado en la
coordinación y control de las principales variables del Figura 1, en la cual se puede ver como el control de ángulo
sistema: voltaje, frecuencia y ángulo. Esto, aún en presencia podría ser adaptado al control clásico de frecuencia. Esto trae
de fallas del sistema, perturbaciones de la carga y salidas como ventaja la aproximación de la realidad a una
eventuales de unidades de generación (Corsi 2000). Si esta implementación del mismo ya que un sistema no reemplaza
coordinación no se dá, el sistema se vulnera y puede llevar a al otro, sino que por el contrario lo complementa.
una secuencia de eventos que causen colapsos de voltaje y
hasta apagones generalizados, afectando el suministro de El posible esquema angular de control se puede dividir
energía eléctrica (Bialek 2005). principalmente en dos etapas. La primera etapa nos dice lo
mismo que el control clásico de frecuencia, el sistema ante un
El control actual de la frecuencia en los sistemas de potencia, evento lleva a las máquinas a un nuevo punto de operación en
específicamente el AGC y el control de intercambios de el cual el voltaje y la frecuencia permanecen constantes pero
energía entre países, se apoya en los sistemas de su distribución angular cambia. Es en este punto en el cual la
406
CHAPTER 13. POWER SYSTEMS
segunda etapa aparece, se hace un cálculo de error angular con motivo de tener un sistema al borde del colapso y de esta
con el cual se procede a determinar una acción de control que manera resaltar dos diferentes escenarios, uno con control
se adicionará al error de velocidad de la máquina el cual será angular y otro sin él.
asumido por el control de frecuencia que finalmente
determina la señal calculada al elemento final de control, en Adicionalmente a los controles primarios se tiene un esquema
este caso los servomotores que dan paso al vapor o al agua. de control secundario AGC, el cual se puso a prueba frente a
En otras palabras, la esencia del control angular se encuentra la estrategia de control propuesta.
en la forma en la que se podría anidar un control, en este caso El esquema de control angular propuesto no necesita un
en cascada, para el lazo del control de frecuencia. reajuste de los demás esquemas de control, lo cual es una
ventaja ya que el sistema de control existente no es
modificado.
407
CHAPTER 13. POWER SYSTEMS
Tabla 1 se puede ver el cambio del punto de operación antes se garantizara que el lazo de control interno fuera más rápido
de la falla y después de ella con el fin de hacerse a una idea que el lazo externo.
del funcionamiento del control de ángulo.
Tabla 1. Valores Prefalla y Postfalla sin y con control angular
Generador 2 Generador 3 Generador 4 Generador 5
PRF PFSC PFC PRF PFSC PFC PRF PFSC PFC PRF PFSC PFC
Pot 150 151 160 60 60 77 40 40 52 30 32 78
An -18 -31 -25 -39 -48 -40 -37 -48 -38 -45 -57 -48
*PRF: prefalla, PFSC: posfalla sin control de ángulo, PFC: posfalla con
control de ángulo
De esta misma forma la Tabla 2 presenta el nivel porcentual
con control angular y sin él, de las líneas 1-2 y 1-5 las cuales
son las líneas que se distribuyen principalmente la
transmisión perdida.
Tabla 2. Valores porcentuales de carga Lineas 1-2 y 1-5
Linea 1-2 Linea 1-5
PRF PFSC PFC PRF PFSC PFC
%Load 68 116 96 75 98 84
Figura 4. Porcentaje de carga de las lineas 1-2
En cuanto a las líneas de transmisión, se puede evidenciar el
Tanto de la Tabla 1 como de la Tabla dos pueden notarse mismo fenómeno anterior, donde la oscilación no es
importantes cambios. En cuanto a los ángulos, estos se prolongada y adicionalmente su nuevo punto de operación es
encuentran menos dispersos lo que favorece la estabilidad más bajo pudiendo evitar así una cascada de eventualidades
angular; mientras que la carga de las líneas se reparte de otra en las líneas de distribución por sobretensiones
manera evitando una posible cadena de eventos.
4.3 Análisis dinámico de resultados
5. CONCLUSIONES
Para tener un análisis más completo de la dinámica del
control angular, se presenta a continuación la dinámica del En cuanto al control angular, se pudo observar como de
nodo Slack y el porcentaje de carga de dos de las líneas forma mutua ambos controladores asumieron la carga sin
alternas que soportan la contingencia. entrar en algún tipo de conflicto y adicionalmente cumplen el
objetivo de regular la generación de forma tal que la
1100
Control Clásico distribución angular del sistema no se ve mayormente
1050
Control Angular afectada. Si bien no se enfatizó en la incidencia de los
reguladores de voltaje en este trabajo, se espera para trabajos
1000
futuros analizar si estos juegan un papel importante en la
Potencia [MW]
950
estabilidad y robustez del control angular.
408
CHAPTER 13. POWER SYSTEMS
REFERENCIAS
409
CHAPTER 13. POWER SYSTEMS
Resumen: Los sistemas de control para motores de inducción (MI) de alto desempeño representan
un tópico de interés en la comunidad de ingenierı́a aplicada, debido a que las nuevas aplicaciones
en las que este tipo de máquina rotatoria está involucrada, exigen la maximización del desempeño
dinámico y la minimización del consumo de energı́a. Bajo el contexto actual, se presenta en este
artı́culo un controlador no lineal basado en pasividad para alto desempeño que realiza el seguimiento
de velocidad y norma de flujos magnéticos de rotor, en el que se propone una polı́tica de minimización
de pérdidas de potencia buscando reducir el consumo de energı́a del motor de inducción sin degradar
el desempeño dinámico. La utilidad de esta propuesta se muestra mediante simulación numérica
donde se compara el consumo de energı́a en dos escenarios, en el primero se emplea una norma de
flujos magnéticos constante equivalente al valor nominal de la máquina y en el segundo escenario se
aplica una norma de flujos magnéticos ad hoc, la cual es variante en el tiempo y busca reducir las
pérdidas de potencia. En ambos escenarios se asigna el mismo perfil de velocidad y par de carga.
Los resultados muestran que implementando la polı́tica de minimización de pérdidas de potencia se
mejora la eficiencia del MI en un 17.74 % para el perfil de velocidad y par de carga evaluados.
410
CHAPTER 13. POWER SYSTEMS
como en el marco ab. Esta metodologı́a de control se ha emplear este marco de referencia es que se evita la depen-
ido afianzando ya que además de realizar seguimiento de dencia explı́cita de la posición del rotor, lo que simplifica
velocidad ası́ como de la norma del flujo magnético de en gran medida el análisis del sistema (Liu et al., 1989).
rotor, al mismo tiempo respeta todas las condiciones bajo Este modelo es conocido en la literatura como el modelo
las cuales sus propiedades de estabilidad se establecen ab (Seely, 1962), (Meisel, 1984), modelo de Stanley (Krish-
permitiendo mejorar drásticamente el desempeño (Mujica nan, 2001) o modelo en el marco de referencia fijo al estator
and Espinosa-Pérez, 2014). Estas cualidades han sido (Krause et al., 2002) y está dado por
mostradas y evaluadas de forma experimental en (Mujica
˙ Lsr Rr np Lsr Us
and Espinosa-Pérez, 2015), donde se determina cuál es el Is = −γIs + 2
ψr − ωJ ψr + (1a)
método para la obtención de velocidad y aceleración que σLr σLr σ
permite mejorar aún mas el desempeño. Rr Rr Lsr
ψ̇r = − ψr + (np ωJ ) ψr + Is (1b)
Por otro lado, es sabido que las pérdidas de potencia en Lr Lr
el MI están dadas por la diferencia de potencia eléctrica 1 np Lsr T B τL
ω̇ = I J ψr − ω− (1c)
y mecánica. Estas se producen en el cobre de los deva- J Lr s J J
| {z }
nados tanto del estator como del rotor, por la histéresis τe
y corrientes parásitas de estator, ası́ como las pérdidas
por fricción y rozamiento con el aire del rotor (Chapman, donde ψr ∈ R2 es el vector de encadenamientos de flujos
2005). El diseño de los MI se han mejorado tanto que en magnéticos de rotor, Is ∈ R2 el vector de corrientes
aplicaciones a velocidad nominal constante estas pérdidas de estator, ω la velocidad en el eje del motor, Ls , Lr ,
son pequeñas, pero cuando se tiene velocidades diferentes Lsr > 0 son las inductancias en estator, rotor y mutua
de la nominal, dichas pérdidas se incrementan significati- respectivamente, τe es el par electromagnético, Rs , Rr > 0
vamente. Una forma de minimizar estas pérdidas es por las resistencias en estator y rotor respectivamente, np el
medio de una selección adecuada de la norma del flujo número de par de polos, J > 0 el momento de inercia del
magnético de rotor. rotor, B ≥ 0 el coeficiente de amortiguamiento mecánico
o fricción viscosa, τL el par de carga externo aplicado al
En este sentido, existen diversos trabajos como el presen- L2
eje del rotor, Us ∈ R2 los voltajes de estator, σ̄ = 1 − Lssr
tado en (Vedagarbha et al., 1997), donde se diseña un Lr
controlador no lineal que logra seguimiento exponencial el coeficiente de dispersión o coeficiente de Blondel con
sin singularidades y mejora la eficiencia de la máquina. σ = Ls σ̄, mientras que
2
Asimismo, en (Bodson et al., 1995) presenta un algoritmo Lsr Rr Rs 0 −1
para seleccionar la referencia de flujos de rotor del MI γ= + , J , = −J T .
σL2r σ 1 0
considerando lı́mites máximos de voltaje y corriente a
velocidad constante. Por otro lado, en (Kumar et al., 2015) Los subı́ndices (·)s y (·)r son usados la denotar variables
se presenta un análisis de como mejorar el desempeño de estator y rotor respectivamente.
dinámico de los sistemas de control del MI para optimizar
eficiencia utilizando un control basado en modelo. Sin em-
bargo, en todos estos trabajos reportados anteriormente, la 3. CONTROL NO LINEAL BASADO EN PASIVIDAD
estrategia de control no es capaz de realizar seguimiento DE ALTO DESEMPEÑO PARA MI
de velocidad y norma de flujos magnéticos de rotor con
grandes tasas de variación en el perfil deseado, estas limi-
taciones encuentran solución en el resultado mostrado en Con el objetivo de evaluar el resultado principal, se consi-
el presente artı́culo. dera un esquema de control no lineal basado en pasividad
de alto desempeño para el seguimiento de velocidad y
El presente artı́culo está organizado de la siguiente manera: norma de flujos magnéticos descrito en detalle en (Mu-
En la sección 2 se presenta el modelo matemático del MI, jica et al., 2014), es decir, el lı́mt→∞ |ω − ωd | = 0 y
mientras que en la sección 3 se muestra las ecuaciones del lı́mt→∞ |kψr k − kψrd k| = 0, donde ωd es la velocidad de
PBC de alto desempeño para el MI. En la sección 4 se rotor deseada y β , kψrd k la norma de flujos magnéticos
presenta el desarrollo del resultado principal, el cual es la de rotor deseada. Dicho controlador, es generado a partir
obtención de una polı́tica de selección de la norma de flujos del modelo del MI descrito en (1) bajo las siguientes
magnéticos que minimiza las pérdidas de potencia. Los suposiciones:
resultados de simulación de dos experimentos se presentan
en la sección 5 y por último en la sección 6 se presenta la S.1 Se dispone de medición de las señales de corrientes
discusión y conclusiones. de estator Is , velocidad ω y aceleración del rotor ω̇.
Estas dos últimas señales se obtienen como salida de
un sistema dinámico a partir de la medición de posición
2. MOTOR DE INDUCCIÓN
del rotor.
S.2 Todos los parámetros del modelo son conocidos.
En este artı́culo, se considera el modelo matemático del MI
S.3 El par de carga τL es una función desconocida, la cual
trifásico de múltiples pares de polos tipo jaula de ardilla,
es estimada en lı́nea.
representado en un plano bifásico ortogonal equivalente
S.4 La velocidad deseada del rotor ωd es una función
por medio de la transformación de Blondel (Blondel et al.,
acotada y dos veces diferenciable.
1913). Adicionalmente, se considera también que las fases
S.5 La norma de flujo magnético de rotor deseado β es
son simétricas y además distribuidas sinusoidalmente, la una función estrictamente positiva, suave y acotada.
permeabilidad magnética en los núcleos laminados infinita,
lo cual, desprecia los efectos en las ranuras, las pérdidas Por lo tanto, se define el error de estados y su dinámica
en el hierro y en los devanados. La principal ventaja de como
411
CHAPTER 13. POWER SYSTEMS
" # " #
e Is ėIs z˙1 = z2 (11a)
e= eψr = x − xd =⇒ ė = ėψr = ẋ − ẋd (2) z˙2 = z3 (11b)
eω ėω
z˙3 = −λ31 z1 − 3λ21 z2 − 3λ1 z3 + λ31 θ
donde el vector de estados está dado por x , [IsT , ψrT , ω]T ∈ + 3λ21 z4 + 3λ1 z5 − λ22 z4 − 2λ2 z5 + λ22 ω̄d (11c)
R5 y el vector de estados deseados se establece como
z˙4 = z5 (11d)
xd , [Isd
T T
, ψrd , ωd ]T .
z˙5 = −λ22 z4 − 2λ2 z5 + λ22 ω̄d (11e)
donde θ es la posición de rotor medida por el codificador
En consecuencia, la ley de control impone la siguiente
incremental unido al eje del rotor, z1 la posición estimada,
estructura para los voltajes de estator
2 ω = z2 la velocidad estimada de rotor, ω̇ = z3 la
np Lsr Lsr Rr aceleración estimada, ωd = z4 la velocidad deseada o de
Us = σ I˙sd + J ωd ψrd + + R s Isd
Lr L2r consigna para el PBC, ω̇d = z5 la aceleración deseada y
Lsr Rr ω̄d ≈ ωd la señal exógena que define el perfil de velocidad
− ψrd − KIs eIs , (3) que el usuario desea alcanzar, con
L2r T T
donde se incluye un término de amortiguamiento constante [z1 (0) z2 (0) z3 (0) z4 (0) z5 (0)] = [θ(0) 0 0 ω̄d (0) 0] .
KIs en el error de corrientes. Ahora, se procede a definir La estabilidad del sistema dinámico descrito en (11) se
las expresiones necesarias para su implementación. Las cumple para todo λ1 > 0 y λ2 > 0. Sin embargo, como
corrientes deseadas de estator están dadas por primera aproximación, se recomienda asignar a λ1 , λ2 >
2|max(ω̇d )| (dos veces el valor máximo de aceleración
Lr Rr
Isd = ψ̇rd − np ωd J ψrd + ψrd , (4) deseada, ası́ se preserva el contenido frecuencial de la señal
Rr Lsr Lr
de entrada). Note, que conforme λ1 → +∞, este método
mientras que los flujos de rotor deseados variantes en el será una mejor aproximación al operador derivada, por
tiempo se obtienen como solución del sistema dinámico lo htanto más sensible ial ruido mientras que el término
Rr β̇ β λ31 λ31 z4 + λ32 z5 − λ13 ż5 → 0. Este último es empleado
ψ̇rd = np ωd + τd J ψ rd + ψ rd , ψ rd (0) = , 1 1
np β 2 β 0 para compensar el desfase natural del sistema compuesto
(5) por (11a)-(11c). En este trabajo se consideró λ1 = 12000
con esta última expresión se puede reescribir la ecuación y λ2 = 800.
(4) y se tiene que las corrientes de estator deseadas
Lr 1 Lr β̇ 4. RESULTADO PRINCIPAL
Isd = τd J ψrd + ψrd + ψrd (6)
Lsr np β 2 Lsr Rr Lsr β
Partiendo del hecho que el PBC del MI, dado por (3)-(7),
dependen ahora de τd . Por lo tanto, se define el par garantiza un correcto seguimiento de la velocidad deseada
electromagnético deseado τd como y de la norma del flujo de rotor deseada y considerando
τd = J ω̇d + Bωd + τ̂L − Kω eω , (7) que ambas señales se incorporan como entradas de forma
donde independiente a la ley de control. Entonces, se propone
Z una polı́tica de selección de norma de flujos magnéticos
τ̂L = −Kωi eω dt, Kωi > 0, τ̂L (0) = 0. (8) que minimice las pérdidas de potencia de acuerdo a las
condiciones de operación de la máquina a través del cálculo
Ahora bien, la ley de control (3) requiere para su imple- en lı́nea de dicha norma. En la Figura 1 se muestra la
mentación de la derivada respecto al tiempo de Isd , esta estrategia de control propuesta.
se propone como ω. d
" ! # ωd Diferenciador sucio
L τ̇ 2τ β̇ τ Velocidad ω de 3do orden
r d d d
I˙sd = − 3 J ψrd + J ψ̇rd Aceleración ω
. compensado
Lsr np β2 β β2 Perfil
velocidad
" ! ! # Error mecánico
Estimador
τLe Posición θ
Lr β̈β − β̇ 2 β̇ ψ̇rd ωd par de carga PBCV3 Encoder
Generador perfiles . -+
+ 2
ψrd + ψ̇rd + , (9) velocidad, ω..d .
incremental
Rr Lsr β β Lsr aceleración y
derivada aceleración ωd Par τd Derivada de par τd Voltajes
estator
1024 PPR
electromecánico electromecánico
deseado deseado Us A
ası́ también y por consiguiente la derivada de τd es Perfil flujo +
Us1
Motor
magnético
ψrd + Transfor UsB
mación-1
τ̇d = J ω̈d + B ω̇d + τ̂˙L − Kω ėω .
óptimo + inducción
Flujos de rotor Us2
(10) deseados
+
-
Blondel
UsC 1HP
β
.
A partir de esta estructura, se puede demostrar que la ley Generador perfiles
flujos magnéticos β
.. Corrientes de
Derivada
corrientes de
deseados, primera y estator deseadas estator
de control (3) preserva propiedades de estabilidad, para segunda derivada β deseadas . Corrientes
Isd Isd estator IsA IsB IsC
-
detalles consultar (Mujica et al., 2014). Error eléctrico
+
Perfil
flujo
Is1
Transformación
magnético Is2 Blondel
3.1 Perfiles Deseados de Velocidad y Aceleración
Figura 1. Esquema de control propuesto.
El PBC de alto desempeño para MI requiere de la medición
de velocidad y aceleración, para tal propósito se emplea el
siguiente método de diferenciación numérica fundamenta- 4.1 Pérdidas de potencia en el MI
do en un sistema dinámico lineal que es llamado diferen-
ciador sucio de tercer orden compensado y está descrito Sean las pérdidas de conversión de potencia dadas por
por P otP er = UsT Is − τe ω, (12)
412
CHAPTER 13. POWER SYSTEMS
sustituyendo (3) y (4) en (12) y considerando que la ley Se puede observar de (16) que debido a que los parámetros
de control garantiza convergencia, es decir, eI = eω → 0 de la máquina son positivos esta expresión siempre será
se tiene positiva. Por otro lado, evaluando la segunda derivada
L2 np Lsr parcial de (15) con respecto a β, se tiene
P otP er = Ls − sr I˙sd T
Isd − T
ωd ψrd J Isd
Lr Lr ∂ 2 P otP er Rr L2r Rs τd2 Rs
2 =6 + 2 2 +2 2 >0
Lsr Rr T Lsr Rr T ∂β 2 n2p Lsr np β 4 Lsr
+ 2
+ Rs Isd Isd − ψrd Isd − τd ωd
Lr L2r lo cual garantiza que la β ∗ es un mı́nimo global de (13).
(13)
Buscando reescribir la ecuación anterior en términos de los 4.2 Polı́tica de minimización de pérdidas de potencia
flujos de rotor deseado, es conveniente sustituir (5) en (9)
de tal forma que se obtiene Si se considera a la norma de flujo magnético de rotor que
" ! minimiza las pérdidas de potencia, dada por la ecuación
2
1 Lr Rr τ τ d β̇ L r β̈ (16), como entrada de referencia de la ley de control,
I˙sd = − 2 d4 − Lr ωd 2 + + I
Lsr np β β β Rr β entonces el PBC logrará los objetivos de control reducien-
! # do el consumo de energı́a. Sin embargo la función (16)
Rr τd Lr np ωd β̇ Lr τ̇d aplicada directamente presenta una singularidad en la ley
+ np ωd + + + J ψrd
np β 2 Rr β np β 2 de control, este inconveniente se muestra especı́ficamente
en la ecuación (5) cuando el motor se aproxima a velocidad
Por lo tanto, la expresión (13) se tiene solo en términos de cero. Dicho esto, se presenta una polı́tica de minimización
β y τd de la forma de pérdidas de potencia que evita la singularidad en la ley
Ls Lr Rs Lr Ls 1 de control y está dada por el siguiente sistema dinámico
P otP er = +2 β̇β + − β̈β
L2sr Rr L2sr L2sr Rr Rr 1
κ = [sign(β ∗ − βM in ) + |sign(β ∗ − βM in )|] (17a)
L2r Ls Lr τd2 β̇ Lr Ls L2r Rs 2
+ − 2 2 + 2 + + 2 2 β̇ 2 ς˙1 = ς2 (17b)
Lsr np np β 3 Rr L2sr Rr Lsr
2 2 ς˙2 = −λ23 ς1 − 2λ3 ς2 + λ23 (κβ ∗ + (1 − κ)βM in ) (17c)
Lr Ls Lr Lr Ls Lr τ̇d τd
+ − 2 β̈ β̇ + − 2 donde βM in = 2BωN om es la norma de flujos magnéticos
Rr2 L2sr Rr L2sr n2p np β2
2
2 mı́nima establecida para magnetizar al rotor, κ la variable
Rr L Rs τd Rs que discrimina si el valor de β ∗ alcanza a βM in y λ3
+ 2
+ 2r 2 2
+ 2 β2 (14)
np Lsr np β Lsr determina la respuesta en frecuencia del sistema. Esto
si se considera implica que en la ley de control descrita en (3) se deberá
β̇β Lr L2 β̇τ 2 ahora considerar que β = ς1 , β̇ = ς2 y β̈ = ς˙2 , con
I˙sd
T
Isd = 2 + 2 β̈β − 2 r 2 3d T T
Lsr Lsr Rr Lsr np β [ς1 (0) ς2 (0)] = β̄ ∗ (0) 0 .
2
Lr L L2 τ̇d τd
+ 2
β̇ 2 + 2 r 2 β̈ β̇ + 2 r 2 2 ,
Rr Lsr Rr Lsr Lsr np β 5. EVALUACIÓN NUMÉRICA
T Lr
ψrd J Isd = − τd , La evaluación numérica fue implementada en MATLAB-
Lsr np Simulink. Para el PBC de alto desempeño de MI se eligió
2 !2 como ganancia eléctrica KIs = 80, ganancias mecánicas
T Lr τd 1 Lr β̇ Kω = 1 y Kωi = 45 y el periodo de muestreo fue de
Isd Isd = + + β2
Lsr np β Lsr Rr Lsr β 0.1 ms. En la Tabla 1, se muestran los parámetros del MI
y empleados en la presente evaluación, si se considera que es
! un motor alimentado por 220 VRM S a 60 Hz con potencia
T 1 Lr β̇ nominal de 1 HP.
ψrd Isd = + β2
Lsr Rr Lsr β Tabla 1. Parámetros motor de inducción
Par de polos np = 2
Para efectos del cálculo de una expresión que minimize la Resistencia de estator Rs = 2.516 Ω
ecuación presentada en (14), se considera el par deseado Resistencia de rotor Rr = 1.9461 Ω
τd constante y que la norma del flujo de rotor es constante, Inductancia de estator Ls = 0.2340 mH
i.e., β̇ = β̈ = 0 entonces las pérdidas de potencia son Inductancia de rotor Lr = 0.2302 mH
Rr L2r Rs τd2 Rs Inductancia mutua Lsr = 0.2226 mH
P otP er = + + 2 β2 Fricción viscosa B = 1.1 × 10−4 N·m·s/rad
n2p L2sr n2p β 2 Lsr Coeficiente momento de inercia J = 6.04675 × 10−3 kg·m2
donde el extremo dedicha funciónes
∂P otP er Rr L2 Rs τd2 Rs 5.1 Experimento 1
= −2 2
+ 2r 2 +2 2 β (15)
∂β np Lsr np β 3 Lsr
donde la norma de flujo de rotor que minimiza las pérdidas En este experimento se utiliza una referencia de velocidad
de potencia esta dada por constante de 862.5 rpm que equivale al 50 % de la velocidad
vs nominal, además se aplicó un par de carga de 2.06 Nm
u
u L2sr Rr L2r que corresponde al 50 % del valor nominal, este último
β =t
∗
+ |τd |. (16) fue incorporado como fricción viscosa. Con el objetivo
Rs n2p n2p
de identificar que valor de norma de flujo magnético de
413
CHAPTER 13. POWER SYSTEMS
pérdidas. 0.6
1.6
0.5
β
1.4 β∗ 0.4
Norma de flujos magnéticos de rotor [ Wb ]
0.3
1.2
0.2
1
0.1
0.8
0
0 10 20 30 40 50 60
Tiempo [ s ]
0.6
Figura 4. Perfiles de norma de flujos de rotor
0.4
El objetivo de este experimento es comparar la magnitud
0.2
de pérdidas de potencia en ambos escenarios evaluados
bajo las mismas condiciones, es decir, aplicando el mismo
0
0 10 20 30 40 50 60
perfil de velocidad deseado, par de carga y parámetros de
Tiempo [ s ] sintonı́a.
Figura 2. Referencias de normas de flujo de rotor.
Como resultados de esta evaluación, se puede ver en la
400 Figura 5 el correcto seguimiento del perfil de velocidad en
P otEle ambos escenarios, donde ω en la velocidad obtenida con el
350 P otM ec
P otP er
escenario 1 y ω ∗ la velocidad que corresponde al escenario
300
2.
1000
Potencia instantánea [ W ]
250 ωd
800
ω
ω∗
200 600
400
150
Velocidad angular [ rpm ]
200
100
0
50
-200
0 -400
0 10 20 30 40 50 60
Tiempo [ s ]
-600
Figura 3. Potencia eléctrica, mecánica y pérdidas de po-
tencia. -800
-1000
Por otro lado, en la Figura 3 se pueden observar la 0 10 20 30
Tiempo [ s ]
40 50 60
414
CHAPTER 13. POWER SYSTEMS
200
se asignada adecuadamente la norma de flujos magnéticos
180
kUs k
kUs∗ k
de rotor. Por lo tanto, si se cuantifica la energı́a disipada
160
en pérdidas para el segundo experimento, se tiene que
en el escenario 1 el consumo de energı́a es: 6267.833
Norma de voltaje de estator [ V ]
140
J, mientras que en el escenario 2, empleando la norma
120 calculada por (16) el consumo de energı́a es: 5155.869 J, lo
cual representa un ahorro del 17.74 % considerando los 60
100
segundos que dura el experimento. Como trabajo futuro
80 se busca implementar en forma experimental la polı́tica
60
de reducción de consumo energético propuesta en este
artı́culo.
40
20 REFERENCIAS
0
0 10 20 30 40 50 60 Bensiali, N., Etien, E., and Benalia, N. (2015). Convergen-
Tiempo [ s ]
ce analysis of back-emf mras observers used in sensorless
Figura 6. Comparación de norma de voltajes de estator control of induction motor drives. Mathematics and
4.5 Computers in Simulation.
4
kIs k Blondel, A., Mailloux, C., and Adams, C. (1913). Syn-
kIs∗ k
chronous motors and converters: theory and methods of
3.5 calculation and testing. McGraw-Hill Book Company.
Norma de corrientes de estator [ A ]
415
CHAPTER 13. POWER SYSTEMS
Abstract: In this paper the characterization of a class of electrical circuits is carried out in
terms of both stability properties and steady state behavior. The main contribution is the
interpretation of the electrical topology in terms of mathematical properties derived from the
structure of their models. In this sense, it is explained at what extent the topology by itself
defines the dynamic behavior of the systems.
416
CHAPTER 13. POWER SYSTEMS
417
CHAPTER 13. POWER SYSTEMS
Proof. The error dynamic is given by Property 1. If the number of tree resistors is equal to
x̃˙ = [J − R] P x̃ + GẼ1 (17) the number of inductors and they are one to one series
connected then
with the energy–like function
1 HRL = I2 ∈ Rn3 ×n3 ; HRR = 01 ∈ Rn3 ×n4 (23)
Ha (x̃) = x̃T P x̃ (18) with I2 as already defined and 01 a zero matrix.
2
−1
and the identities ṽC = ∇x̃1 Ha (x̃) = C q̃, ĩL = Property 2. If the number of co–tree resistors is equal to
∇x̃2 Ha (x̃) = L−1 φ̃. Take the function (18) as Lyapunov the number of capacitors and they are one to one parallel
function candidate, its time derivative along (17) satisfies connected then
Ḣa = −zRT−1 z ≤ 0 (19) HCR = I3 ∈ Rn2 ×n2 ; H1R = 02 ∈ Rn1 ×n2 (24)
with RT = diag{R11 , R22 } and R11 , R22 symmetric with I3 as already defined and 02 a zero matrix.
positive definite matrices defined by Remark 4. Typical networks exhibit these properties (see
T Avila-Becerril et al. (2015)). But is also found when
R11 = Rc + HRR Rt HRR modeling of inductors and capacitors losses are considered.
R22 = Rt−1 + HRR Rc−1 HRR
T
The dynamical behavior of the electrical network (5) under
while T
HCR ṽC Properties 1 and 2 is described by
z= (20) ẋ = J∇x Ha (x) + F1 (x) + GE1 (25)
HRL ĩL
where
The proof is completed by noting that the maximal in- −fc−1 (∇q Ha (x))
F1 (x) =
variant set where H̃˙ a = 0 is z = 0. To guarantee −ft−1 (∇φ Ha (x))
that ṽC = ĩL = 0 are the only solutions, brings out Under Properties 1 and 2 the dissipation terms only
T
ker{HCR } = ker{HRL } = 0 as the sufficient conditions depend on the state; this allows to relax the assumptions
that assure asymptotic stability of (x̃, ṽ1 ) = (0, 0). about fc (·) and ft (·).
2 Proposition 3. Consider the electrical network described
by (25) with v1? a constant input such that the steady
The proposition below shows that, in the case of equilib- state behavior is well posed. Assume A.2 holds and that
rium points, the same conditions still guarantee asymp-
totic stability for nonlinear capacitors and inductors. A.3 The maps fc (·) and ft (·) define incremental output
Proposition 2. Consider an electrical network described by strictly passive operators in the sense that
model (5) with v1? a constant input. In addition, assume (x1 − x2 )T fc−1 (x1 ) − fc−1 (x2 ) > 0
that (x1 − x2 )T ft−1 (x1 ) − ft−1 (x2 ) > 0
A.1 The resistors involved in the circuit are characterized hold for x1 6= x2 and considering xi as input.
by linear constitutive relationships.
A.2 The equilibrium point x? that correspond to v1? lo- Under these conditions the equilibrium point (x? , v1? ) is
cally satisfies x? = argmin{Ha (x)}. locally asymptotically stable.
Then, the equilibrium point (x? , v1? ) is locally asymptoti- Proof. The equilibria of the system is characterized by
cally stable if J∇x? Ha (x? ) + F1 (x? ) + GE1? = 0
T
ker{HCR } = ker{HRL } = 0. (21) Thus, if A.2 holds it is possible to consider the Lyapunov
function candidate H0 : Rn2 +n5 → R≥0 , defined in (22),
Proof. The total stored energy Ha (x) is a nonlinear func-
which leads, under the condition v1 = v1? , to
tion. Therefore, under the assumption A.1 the equilibria
is characterized by the solutions of Ḣ0 (x) = −(∇x Ha (x) − ∇x? Ha (x? ))T [F1 (x) − F1 (x? )]
[J − R] ∇x? Ha (x? ) + GE1? = 0 Using (3a) and (3b) and since A.3 holds it is clear that
If assumption A.2 holds, then, it is possible to consider Ḣ0 (x) < 0 with the maximal invariance set defined as
the Lyapunov function candidate H0 : Rn2 +n5 → R≥0 ?
ε = {(q, φ) | (vC − vC ) = 0, (iL − i?L ) = 0} . (26)
reported by Jayawardhana et al. (2007)
2
H0 (x) = Ha (x)−xT ∇x? Ha (x? )−(Ha (x? )−x?T ∇x? Ha (x? ))
(22) 4. TOPOLOGICAL STRUCTURE OF TYPICAL
whose time derivative along the trajectories of (5), under NETWORKS
A.1 and v1 = v1? , yields
Ḣ0 (x) = −z T RT−1 z. In this section it is stated the structure of the matrix
2 H for a generic network that captures, in an unified
Remark 3. The result above considers just the stability way, the characteristics of typical networks (For details
of equilibrium points. The tracking problem, up to the on typical networks see the work of Fernández-Carrillo
authors knowledge, imposes an open problem. et al. (2015)). The generic network satisfies Properties
1 and 2, hence, its dynamic behavior is represented by
Motivated by Proposition 1 and Proposition 2, the final model (25) leaving HCL ∈ Rn2 ×n5 and H1L ∈ Rn1 ×n5
result of this section focuses in a particular class of circuits to be characterized. Consider that the n5 inductors are
characterized by two properties. divided into three types, namely: nr r-inductors, that
418
CHAPTER 13. POWER SYSTEMS
belong to a trajectory that connects a source with a • The entries different from zero of each row of Mr ∈
capacitor, ns s-inductors that belong to a trajectory that Rn1 ×nr shows the connection of sources with r-
connects a source with another source and np p-inductors inductors.
that connects a capacitor with another capacitor, such that • The i − th row of Mr , i = 1, 2, . . . , n1 is divided into
nr + ns + np = n5 . Moreover, assume that the capacitors z sections leading to
belong at least to one of the following classes βi = [ βir1 βir2 · · · βirz ] (29)
C.1 The i−th capacitor, i ∈ {1, ..., n2 }, shares cutset with where each βirj ∈ R1×rj , j = 1, 2, . . . , z, has only one
ri r-inductors. entry equal to 1 if the i − th source is connected to
C.2 The i−th capacitor, i ∈ {1, ..., n2 }, shares cutset with the j − th capacitor. Otherwise, the vector is zero.
pi p-inductors. • Since two sources can not be connected to the same
while the voltages sources satisfies that r-inductor, each column of Mr has only one entry
different from zero.
C.3 The i−th voltage source, i ∈ {1, ..., n1 }, shares cutset • The sum of the entries different from zero of the i−th
with mi ∈ {1, ..., n5 } inductors. row of Mr equals ρi .
• The entries different from zero of each column of
So that, the rows of matrix HCL can be divided into: i)
Ms ∈ Rn1 ×ns stand for the connection of sources with
capacitors that hold simultaneously conditions C.1 and
other source.
C.2 and ii) capacitors that hold only with condition C.2.
• The sum of the entries different from zero of the i−th
The columns of HCL are divided into three blocks each
row of Ms equals γi . Therefore, ρi + γi = mi of C.3.
one corresponding to r, s and p inductors, respectively.
• Matrix 0p ∈ Rn1 ×np is a zero matrix that exhibits the
Following the stated organization, matrix HCL takes the fact that sources can not be related with p-inductors.
form T
1r1 0 · · · 0 0s N1 Property 5. The vector 1n1 ∈ Rn1 is a right eigenvector of
0 1T · · · 0 0s N2 matrix MsT satisfying MsT 1n1 = 0.
r2
. .. . . .. .. .. This general network can be specialized to the typical
.. . . . . .
circuits, for example the Mesh Network.
−HCL = 0 0 · · · 1r 0s Nz
T
; (27)
0 0 · · · 0z 0s Nz+1 Mesh Network In this topology each load is connected to
. .. . . .. .. .. all sources, then ri = n1 for all i ∈ {1, 2, . . . , r}. Also, the
.. . . . . . number of sources equals the number of capacitors, then
0 0 ··· 0 0s Nn2 n1 = n2 . Finally, there are not neither p-inductors nor C.2
where class capacitors.
• 1Tri ∈ R1×ri , i = 1, . . . , z, are vectors filled with ones The matrices that topologically characterize a Mesh net-
denoting the condition stated in C.1. In this case it work are
has been assumed that there exist z capacitors of this
−1Tn1 0 · · · 0 0s
kind. It holds that r1 + r2 + · · · + rz = nr . 0 −1T · · · 0 0s
• The zero columns 0s ∈ R1×ns reflects the fact that n1 n ×n
HCL = . .. .. .. .. ∈ R 1 5 , (30)
any capacitor can be connected to s type inductors. .. . . . .
• Row vectors Ni ∈ R1×np include the possibility that 0 0 · · · −1Tn1 0s
a given capacitor can be simultaneously connected
to r and p type inductors. If the i − th capacitor where n1 is the number of sources and ns < n1 is the
is connected to one of the p-inductors, appears a 1, number of sources connected to other source, while
otherwise is 0. If the capacitor only hold C.1, Ni = 0. H1L = [ Mr Ms ] (31)
• In the rows that go from z + 1 to n2 only appear vec- with the particular feature that all the partitions βirj of
tors Ni since they correspond to C.2 class capacitors. the i − th row of Mr include an element different from zero
since all the sources are connected to all capacitors.
Property 3. Each column of the matrix N = col {N1 , . . . , Nn2 } ∈
Rn2 ×np is composed by one 1, one −1 and the rest of the
entries equal to zero. 5. STEADY–STATE CHARACTERIZATION: TWO
CASE STUDY
Property 4. The vector 1n2 ∈ Rn2 , i.e., the vector filled
with ones of dimension n2 , is a left eigenvector of matrix The aim of this section is to exploit the structure of
N satisfying 1Tn2 N = 0. the Fundamental Loop Matrix H under two different
Concerning the structure of matrix H1L ∈ Rn1 ×n5 , it is scenarios.
obtained from (10), which is given by
i1 = −H1L iL 5.1 Lossless DC network
taking into account that H1R = 0. Their columns are
Consider a network operating under constant voltage
divided into three blocks corresponding to r, s and p type
sources, with possibly nonlinear capacitors and inductors,
inductors, respectively, where the third one is zero due
with linear co–tree resistances and assuming that the tree
to the fact that sources do not belong to cutsets where
resistances Rt = 0. This scenario is frequently considered
p-inductors are involved. This matrix takes the form
in practice, for example, in Electrical Power Systems under
H1L = [ Mr Ms 0p ] (28) the assumption that the lines are dominantly inductive.
419
CHAPTER 13. POWER SYSTEMS
Properties 1 and 2 hold. Hence, model (5) reduces to included. Hence, the circuit dynamic is described by (5)
−1 under (14) and is represented as
Rc −HCL 0
ẋ = T ∇x Ha (x) + T v1 (32) P −1 ż = [J − R] z + Ge1
HCL 0 H1L
T
T
where z = vC iTL and J, R and G previously defined.
For stability properties of the network, Proposition 2 can
be directly applied, since Since the steady state behavior is now time–varying, the
? T −1
Ḣ0 (x) = −(vC − vC ?
) Rc (vC − vC )≤0 admissible trajectories are given as solution of
? P −1 ż ? = [J − R] z ? + Gv1? (36)
and by direct substitution of vC in (32), the maximal
invariant set of the system corresponds to iL = i?L , As usual (Desoer and Kuh (1969)), it is assumed that
condition that proves that asymptotic stability is attained. voltage and currents are of the form
The second part is devoted to the characterization of
the steady state behavior defined by the equilibria of the f (t) = F cos(ωt + φ) = Re(Fejωt )
system, which are the solutions of with the phasor F = F ejφ . Therefore, admissible trajecto-
−Rc−1 vC
?
− HCL i?L = 0 (33a) ries are defined by
HCLT ?
vC + H1LT ?
v1 = 0, (33b) Re jωP Z? ejωt − (J − R)Z? ejωt = Re(G)(V?1 ejωt )
?
for a given v1 , the characterization can be carried out leading to
for the capacitor voltages or the inductor currents. If the −1
Z? = [jωP − (J − R)] GV?1 (37)
capacitor voltages are chosen, it is possible to write that
−1 Considering that the circuit satisfies Properties 1 and 2,
? T T ? i.e., identities (23) and (24) hold, the model reduces to
vC = − HCL HCL HCL H1L e1 (34)
since, from Property 3, HCL is row full rank. sC + Rc−1 HCL
sP − (J − R) = T
−HCL sL + Rt
Departing from this last expression, in the following propo-
sition it is illustrated how exploiting the structures for where s = jω, C ∈ Rn2 ×n2 is the capacitance matrix and
HCL and H1L it is straightforward to concluded the steady L ∈ Rn5 ×n5 is the inductance matrix. Hence, it is obtained
state operation that is achieved by a given network. The that ?
T
result is illustrated for the case of a Mesh circuit. VC A1 H1L V?1
Z? = ? = (38)
IL T
A2 H1L V?1
Proposition 4. Consider a Mesh electrical circuit charac-
terized by (30) and (31). Assume that with
T
−1
A1 = − YCR + HCL YLR HCL HCL YLR (39)
• The network is lossless, i.e., Rt = 0.
T T −1
• Propositions 1 and 2 hold. A2 = YLR − YLR HCL YCR + HCL YLR HCL HCL YLR
• The vector of voltage sources v1 is composed by n1 and diagonal admitance matrices
constant values.
Under these conditions capacitor voltages achieves average YCR = sC + Rc−1 (40)
consensus (Bai et al. (2011)) in the sense that −1 −1 −1 −1 −1
YLR = (sL + Rt ) =s (L + s Rt ) =s D (41)
?
vC = α1n1 . (35)
For illustrative purposes, the steady state behavior of
with capacitor voltages is characterized assuming a Mesh net-
n1
1 X work. In accordance with the partitions of HCL and H1L
α= v? previously introduced, the inductance takes the form
n1 m=1 1m
L = diag{Lr , Ls }
the steady state average value of v1 .
with Lr ∈ Rnr ×nr , Ls ∈ Rns ×ns . In addition, the former
Remark 5. A direct corollary of the last proposition refers
must be divided into z matrices as
to the case when v1? = v̄1 1n1 , with v̄1 ∈ R. Under
this condition voltage capacitor consensus (see Bai et al. Lr = diag{Lri }; i = 1, 2, . . . , z
?
(2011)) is achieved in the sense that vC = v1? . where Lri ∈ Rn1 ×n1 since for a Mesh network nr = n1 .
Remark 6. Interestingly enough, if inductive loses are in- From Properties 1 and 2, the dimension of Rc is n1 × n1 ,
cluded, Rt 6= 0, consensus is not longer preserved. Instead due to the fact that n1 = n2 , while Rt ∈ Rn5 ×n5 , i.e.,
of, the entries of the matrix Rc−1 + HCL Rt−1 HCL T
depends equals the dimension of L.
on the values of the tree resistors. However, it seems
that this situation allows for designing compensation tech- Under the aforementioned partitions, matrix D introduced
niques, i.e., adding new lumped elements to the circuit, in (41) is given by
such that the desired behavior is accomplished. D = diag{Dr , Ds } (42)
where Dr = diag{Dri } with Dri = (Lri + s Rti )−1 , −1
420
CHAPTER 13. POWER SYSTEMS
with Mri ∈ Rn1 ×ri , i = 1, 2, . . . , z, matrices composed by Fernández-Carrillo, P., Avila-Becerril, S., and Espinosa-
the vectors introduced in (29) exhibiting only one entry Pérez, G. (2015). Consenso en sistemas eléctricos de
different from zero on each of their columns. potencia mediante compensación. Congreso Nacional
Proposition 5. Consider a Mesh electrical circuit charac- de Control Automtico (AMCA) 2015, 527 – 532.
terized by (30) and (31). Assume that Jayawardhana, B., Ortega, R., Garcı́a-Canseco, E., and
Castanos, F. (2007). Passivity of nonlinear incremental
• Propositions 1 and 2 hold. systems: Application to pi stabilization of nonlinear rlc
• The vector of voltage sources v1 is composed by n1 circuits. Systems & control letters, 56(9), 618–622.
sinusoidal functions. Jeltsema, D. and Scherpen, J. (2003). A dual relation
Under these conditions the input/output relationship be- between port-hamiltonian systems and the brayton–
tween the capacitor voltage phasors V?C and the voltage moser equations for nonlinear switched rlc circuits.
source phasors V?1 is given by Automatica, 39(6), 969–979.
T T
Maschke, B.M., Van der Schaft, A.J., and Breedveld, P.C.
1n1 Dr1 Mr1 (1995). An intrinsic hamiltonian formulation of the
1T D M T dynamics of lc-circuits. Circuits and Systems I: Fun-
? −1 n1 r2 r2 ?
VC = −M .. V1 (43) damental Theory and Applications, IEEE Transactions
.
on, 42(2), 73–82.
1Tn1 Drr Mrz
T
Ortega, R., Jeltsema, D., and Scherpen, J. (2003). Power
2
where M = s C + sRc−1 + diag 1Tn1 Dri 1n1 , i = shaping: A new paradigm for stabilization of nonlinear
1, 2, . . . , z. rlc circuits. Automatic Control, IEEE Transactions on,
Remark 7. The idea to manipulate the capacitance, in- 48(10), 1762–1767.
ductance and resistance values to attain a given steady Ortega, R., van der Schaft, A., Castanos, F., and As-
state behavior is not new. In many applications this pro- tolfi, A. (2008). Control by interconnection and stan-
cedure is recognized as compensation and is related with dard passivity-based control of port-hamiltonian sys-
the addition of new elements parallel or series connected tems. Automatic Control, IEEE Transactions on,
with the originals. The advantage offered by the approach 53(11), 2527–2542.
presented in this paper, is that the compensation analysis Van der Schaft, A. (2010). Characterization and partial
can be carried out in a systematic way. synthesis of the behavior of resistive circuits at their
terminals. Systems & Control Letters, 59(7), 423–428.
6. CONCLUDING REMARKS Van der Schaft, A. and Maschke, B. (2013). Port-
hamiltonian systems on graphs. SIAM Journal on Con-
In this paper a dynamic characterization of a class of trol and Optimization, 51(2), 906–937.
electrical circuits has been presented. The characterization Weiss, L. and Mathis, W. (1997). A hamiltonian formu-
contemplates both stability properties and steady behav- lation for complete nonlinear rlc-networks. Circuits and
ior; its main feature is that it is based on the structural Systems I: Fundamental Theory and Applications, IEEE
properties of the networks which have been obtained by Transactions on, 44(9), 843–846.
using arguments from the Graph theory. It has been shown Wellstead, P. (1979). Introduction to physical system
that the structure of these circuits strongly defines their modelling. Academic Press London.
dynamic behavior. The usefulness of the results reported
in this paper lies in the possibility to carry out the charac-
terization in a very systematic way. In addition, it has been
shown that they offer an alternative to deal with problems
like compensator location to attain a prescribed behavior.
ACKNOWLEDGEMENTS
REFERENCES
Avila-Becerril, S., Espinosa-Pérez, G., and Fernández-
Carrillo, P. (2015). Stability and consensus of electrical
circuits via structural properties. IFAC-PapersOnLine,
48(13), 111–116.
Bai, H., Arcak, M., and Wen, J. (2011). Cooperative
control design: a systematic, passivity-based approach.
Springer Science & Business Media.
Bollobás, B. (1998). Modern graph theory, volume 184.
Springer Science & Business Media.
Brayton, R. and Moser, J. (1964). A theory of nonlinear
networks i. Quart. Appl. Math, 22(1), 1–33.
Desoer, C. and Kuh, E. (1969). Basic Circuit Theory.
McGraw-Hill.
421
CHAPTER 13. POWER SYSTEMS
Abstract: With the increasing penetration of smart grid technologies, prediction and forecast-
ing task of the power system variables such as load and spot price became a difficult task.
Traditional methodologies lack performance to capture the new dynamics of the power system.
The Holt Winters model is widely used to estimate seasonal series. This model requires an
heuristic setting of its non-dynamic parameters. This paper proposes a comparative study
between Kalman Filter and Moving Horizon Estimator as optimal dynamic identification
methods for the Holt-Winters model parameters. In order to evaluate the proposed optimal
setting performance, the energy price and load time series of the Colombian electrical power
system were used as case study, furthermore the classical estimation methods are chosen as
reference frames.
422
CHAPTER 13. POWER SYSTEMS
and regression based methodologies are compared with the model and also to improve estimation techniques pre-
exponential smoothing models including the Holt Winters sented.
in short, medium and long term horizons. Here the lowest
error is achieved with the Holt-Winters model at different 500
Precio [$/kWh]
But concluded that due to the non-inclusion of external 300
0
0 1000 2000 3000 4000 5000 6000 7000 8000
Now, presented models are applied to estimation and pre- Time [Hours]
seasonality are set, as prices and load time series. These 1.7
Load [kWh]
papers reports a reduction in error estimation and low 1.6
1.5
computational burden when the Holt Winters models is 1.4
applied to short term load and price estimation but in long 1.3
0 20 40 60 80 100 120 140 160 180
and medium term time windows the results do not exhibit Time [Days]
423
CHAPTER 13. POWER SYSTEMS
3. THE HOLT-WINTERS MODEL In order to improve the identified model for the series of
price and load, the optimal parameters were calculated
Holt-Winters (HW) models are often used in seasonal as: α = 0.00038, γ = 0.00033, δ = 0.15888 and α =
time series with disagreeable trending, seasonal or random 0.0057971, γ = 0.0006222, δ = 0.0046903, respectively.
variation, and they are grouped in additive or multiplica- The simulation results with these parameters are shown
tive. This model is the result of Holt (1957) and Peter in Fig. 3 and 4.
Winston (1960) investigations, which proposed prediction
methods whose principles are founded on the second-order
exponential smoothing (Goodwin, 2010), (Hyndman et al., 450
Price series
2008). In (Gardner Jr., 2006) a complete state of the ajuste
400
art about exponential smoothing models and the different
forms of the Holt-Winters models are presented. Because 350
the profiles of the proposed series, an additive seasonality
300
and a linear trend will be assumed (seven days for load
Price [$/kWh]
and 24 hours for price), based on this, an additive trend 250
model will be used as follows:
µk = αyk − αSk−p + (1 − α)µk−1 + (1 − α)Tk−1 (1) 200
150
Tk = γµk − γµk−1 + (1 − γ)Tk−1 (2)
100
Sk = δyk − δµk + (1 − δ)Sk−p (3)
where µk is the level component, k T is the trend com- 50
ponent, Sk is the seasonal component and p is the period
in a seasonal cycle and α, γ and δ are parameters of the 0
1000 2000 3000 4000 5000 6000
model. Time [Hours]
1−α 1−α −α 0 ··· 0 1.55
−αγ 1 + αγ −αγ 0 ··· 0
1.5
0 0 0 1 ··· 0
. . ..
A=
. .
(7) 1.45
0 0 0 0
. . . . 1.4
. . . .
. . . . ··· 1
1.35
−δ(1 − α) −δ(1 − α) (δα + (1 − δ)) 0 · · · 0
T 1.3
B = [ α αγ 0 · · · 0 δ − δα ] (8) 20 40 60 80 100 120
Time [Days]
yk = µk + Tk + Sk−p+1 (9)
C = [1 1 1 0 ··· 0] (10) Fig. 4. Optimal adjustment HW model for load series
D = [0] (11) As can be seen in both figures, the model with yd = 0, does
This model does not have an input (exogenous) vector. not adjust properly to the series, and since the adjustment
When the model is used for prediction, it is assumed as was not satisfactory, it was decided to make an adjustment
an autonomous system. Regarding the initial values of the based on parameter optimization to achieve a better result
states there are several heuristics to estimate these values and capture the trend of the series. This model will be used
as described in (Segura and Vercher, 2001), due to the for implementing the proposed estimation methods.
ease in calculations compared to other methods, the initial
state of this model are calculated with the methodology 4. MOVING HORIZON ESTIMATOR
proposed by (Makridakis et al., 1998): The Moving Horizon Estimator (MHE) is an estimation
technique based on optimization. In MHE, the problem of
• µ0 = D̄1 , where D̄1 is the mean of all measured data
state estimation of a given system is solved, considering
for the first seasonal cycle of the series.
all previous measurements available in a specific time
• T0 = D̄1 −p
D̄2
, where D̄2 is the mean of all measured window. Similarly to the Kalman filter, MHE takes into
data for the second seasonal cycle of the series. account the stochastic variables in the model, like the
• Sok−p = D k
D̄1
, for k = 1, 2, ..., p; where Dk is the given measurement and model noises. Therefore it acts as a
measure in the instant k from the first seasonal cycle. complete and optimal estimator (Muske et al., 1993), (Rao
424
CHAPTER 13. POWER SYSTEMS
et al., 2001), (Pawlowski et al., 2010). In the literature, where yk+j ∈ Rny is the measurement data in the instant
several applications of the MHE for dynamic systems are k + j, ŷk+j ∈ Rny is the data estimate in the instant k + j,
found. This paper explores its use with the HW model. x̂ ∈ Rnx is the data estimate at a given time, vk+j ∈ Rnv
4.1 Mathematical structure of Moving Horizon Estimator are the measurement noises in the instant k + j.
In order to solve the optimization problem of the MHE, it In order to simplify the solution of the optimization
is necessary to know the matrices associating the current problem, based on (13), equation (23) can be rewritten
state xk with the current output yk , a previous state as follows:
xk−N , determined by a measurement horizon N . Those
relationships are given by the expression (12) and (13) ˆ T Q(ỹ − ỹ)
ˆ + ω̃ T Rω̃
respectively, taken from (Valencia et al., 2011). min (ỹ − ỹ)
x̂k−N ,ω̃
(24)
x̃ = Φxk−N + γ ũ + αw̃ (12)
subject to: ỹˆ = Γx̂k−N + Λω ω̃
ỹ = Γxk−N + Λũ + Λω ω̃ (13) where:
where x̃ ∈ RN nx , ỹ ∈ RN ny , ũ ∈ R(N +1)nx , ω̃ ∈ R(N +1)nw
and they are defined in the expressions (14) to (22).
T
R 0 0 ···
x̃ = [ xk−N +1 xk−N +2 xk−N +3 · · · xk ] (14) 0 R 0 ···
R=
... .. .. .. ∈ RN n v (25)
T . . .
Φ = AT (A2 )T (A3 )T · · · 6(AN )T (15) 0 ··· ··· R
B 0 0 ··· Q 0 0 ···
AB B 0 ··· 0 Q 0 ···
Q=
.. .. ... .. .. ..
∈ RN n w (26)
A2 B AB . . . . .
γ= (16)
.. .. . . .. 0 ··· ··· Q
. . . .
N −1 N −2
once it is replaced (13) with ũ = 0, in (24), the expression
A B A B ··· B (27) is obtained .
T
ũ = (uk−N )T (uk−N +1 )T · · · (uk−1 )T (17)
min ˆ T Q(Γxk−N + Λω ω̃ − ỹ)
(Γxk−N + Λω ω̃ − ỹ) ˆ + ω̃ T Rω̃
x̂k−N ,ω̃
T
ω̃ = (ωk−N )T (ωk−N +1 )T · · · (ωk−1 )T (18) (27)
Λ = Cγ (22) x̂k−N
where Bω ∈ Rnw ×nw is the relationship matrix between x= (31)
ω̃
the uncertainty modeling with states; likewise if change B
with Bω , can be found α = γ, Λ = Λω . The solution of the optimization problem can be replaced
in (12) and (13) to calculate the estimation value of xk and
In the state estimation problem, the objective is to min- yk . In summary, the steps for performing the estimation
imize errors between the measurement and estimation with MHE are the follows:
at every instant of measurement, and likewise minimize
modeling errors, as shown in (23). (1) Define the estimation window N with N measure-
ment data known.
0
(2) Define the covariance matrix R and Q.
X (3) Define the matrix Bω , this is usually choose as B
min k(yk+j − ŷk+j )k2Q + kωk+j k2R Boulkroune et al. (2010).
x̂k−N ,ωk+j
j=−N +1
(23) (4) Compute Γ, Λ, Q and R.
subject to: (5) Compute H and F .
x̂k+j+1 = Ax̂k+j + Buk+j + Bω ωk+j (6) Compute x̂k−N and ω̃ with the optimization solve.
ŷk+j = C x̂k+j + vk+j (7) Compute xk y yk .
425
CHAPTER 13. POWER SYSTEMS
10
Price [$/kWh]
Price [$/kWh]
Solving the problem with the described steps generates the 300
5 200
ŷk . This value should be compared with the original data 100
series. 0
0
1000 2000 3000 4000 5000 6000 1000 2000 3000 4000 5000 6000
Time [Hours] Time [Hours]
Price [$/kWh]
Price [$/kWh]
300
Section 2, based on this, the objective is to estimate the Time [Hours] Time [Hours]
2
x 10
AR 0, 9543
1.8 ARM A 3, 49 × 104
ARIM A 0, 03
Load [kWh]
1.6
1.4
Load [kWh]
1.5
1
Load series
HW and MHE estimation, N=2 1.4
Among the presented methodologies the Holt Winters with
1.3 Load series
MHE presents the smallest estimation error. This happens
0
20 40 60 80 100 120
1.2
20 40
HW and MHE estimation, N=7
60 80 100 120
when the lenght of the estimation window had similar
7
Time [Days]
32
Time [Days]
value to the series seasonality. The lenght of the estimation
1.7
x 10
2.5
x 10
window had a direct impact on the optimization result.
1.6 2
This performance is explained by the model structure and
Load [kWh]
Load [kWh]
1.5
1.5
1
Load series the optimization method used. With the proper seasonal
1.4
1.3
0.5
HW and MHE estimation, N=30
values the model captures all the information of the sea-
Load series
1.2
HW and MHE estimation, N=12 0 sonal series. The MHE can replicate the identified cycle in
20 40 60 80 100 120 40 60 80 100 120
Time [Days] Time [Days] the estimation of the remaining data series cycles. Changes
in the seasonal cycles are adjusted in the next estimation,
Fig. 6. Holt Winters model with MHE estimation for Load this clearly depends on the estimation window size getting
the minimum error if the window size has exactly the sea-
The obtained results are compared using the mean squared sonality value, if the windows size is different the correction
error (MSE) between the estimated series and the original is not optimal. When the window size is smaller than the
426
CHAPTER 13. POWER SYSTEMS
seasonal cycle of the series, the lack of complete seasonal Montgomery, D.C., Jennings, C.L., and Kulahci, M.
data increases the error. Likewise, as the measurement (2015). Introduction to Time Series Analysis and Fore-
window is larger than the period of the seasonality the casting. Wiley, 2nd edition.
setting error cumulates and the performance decreases. Muske, K., Rawlings, J., and Lee, J.H. (1993). Receding
Hence it is best to select for this class of systems a horizon Horizon Recursive State Estimation. In American Con-
of measurement equal to the seasonality of the series, such trol Conference, 1993, 900–904.
that the design of the estimator is consistent with the series Ng, S. and Perron, P. (1995). Unit Root Tests in ARMA
properties. Models with Data-Dependent Methods for the Selec-
tion of the Truncation Lag. Journal of the Amer-
5. CONCLUSIONS ican Statistical Association, 90(429), 268–281. doi:
10.1080/01621459.1995.10476510.
This work successfully presented a comparative study be-
Niimura, T. (2006). Forecasting Techniques for Dereg-
tween an optimal estimation methodology and traditional
ulated Electricity Market Prices - Extended Sur-
tools in order to estimate energy load and price time
vey. In Power Systems Conference and Exposition,
series in the Colombian market. Based on a consolidated
2006. PSCE ’06. 2006 IEEE PES, 51–56. doi:
estimation model as Holt-Winters a state space model
10.1109/PSCE.2006.296248.
was proposed allowing to use optimal estimation method-
Pawlowski, A., Guzman, J., Rodrguez, F., Berenguel,
ologies to identify the Holt-Winters model states. The
M., and Snchez, J. (2010). Application of time-series
methodology was validated comparing the results with
methods to disturbance estimation in predictive con-
Kalman filter, ARMA, AR, MA and ARIMA models. The
trol problems. In 2010 IEEE International Sympo-
MHE methodology achieved the minimal estimation error.
sium on Industrial Electronics (ISIE), 409–414. doi:
It is remarkable that the proposed estimation methodol-
10.1109/ISIE.2010.5637867.
ogy highly depends of the relationship between the time
Ramı́rez, A. (2013). Métodos utilizados para el pronóstico
series seasonality and the measurement window size. The
de demanda de energı́a eléctrica en sistemas de dis-
proper selection on the estimation window brings to the
tribución. Thesis, Univerdida Tecnolgica de Pereira.
estimation methodology the ability to compensated the
Rao, C.V., Rawlings, J.B., and Lee, J.H. (2001).
exogenous variables effects in time series.
Constrained linear state estimationa moving horizon
ACKNOWLEDGEMENTS approach. Automatica, 37(10), 1619–1628. doi:
10.1016/S0005-1098(01)00115-7.
This project was supported by Colciencias with the schol- Segura, J.V. and Vercher, E. (2001). A spreadsheet mod-
arship given to Julián Rendón as a part of the project eling approach to the HoltWinters optimal forecasting.
“Jovenes investigadores” call number 645 the doctoral European Journal of Operational Research, 131(2), 375–
scholarship of Pablo Deossa call 528. 388. doi:10.1016/S0377-2217(00)00062-X.
Singh, A., Ibraheem, I., Khatoon, S., Muazzam, M., and
REFERENCES Chaturvedi, D. (2012). Load forecasting techniques and
methodologies: A review. In 2012 2nd International
Boulkroune, B., Darouach, M., and Zasadzinski, M. Conference on Power, Control and Embedded Systems
(2010). Moving horizon state estimation for linear (ICPCES), 1–10. doi:10.1109/ICPCES.2012.6508132.
discrete-time singular systems. IET Control The- Snchez, P. and Velsquez, J.D. (2010). Problemas de
ory and Applications, 4(3), 339–350. doi:10.1049/iet- investigacin en la prediccin de series de tiempo con redes
cta.2008.0280. neuronales artificiales.
Chatfield, C. and Yar, M. (1988). Holt-Winters Fore- Suganthi, L. and Samuel, A.A. (2012). Energy mod-
casting: Some Practical Issues. Journal of the Royal els for demand forecastingA review. Renewable and
Statistical Society. Series D (The Statistician), 37(2), Sustainable Energy Reviews, 16(2), 1223–1240. doi:
129–140. doi:10.2307/2348687. 10.1016/j.rser.2011.08.014.
Gardner Jr., E.S. (2006). Exponential smooth- Valencia, F., Lopez, J., Marquez, A., and Espinosa, J.
ing: The state of the artPart II. Interna- (2011). Moving horizon estimator for measurement
tional Journal of Forecasting, 22(4), 637–666. doi: delay compensation in model predictive control schemes.
10.1016/j.ijforecast.2006.03.005. In 2011 50th IEEE Conference on Decision and Control
Goodwin, P. (2010). The Holt-Winters Approach to Ex- and European Control Conference (CDC-ECC), 6678–
ponential Smoothing: 50 Years Old and Going Strong. 6683. doi:10.1109/CDC.2011.6161346.
Foresight: The International Journal of Applied Fore- Weron, R. (2014a). Electricity price forecasting: A review
casting, (19), 30–33. of the state-of-the-art with a look into the future.
Haghi, H.V. and Tafreshi, S.M. (2007). An overview International Journal of Forecasting, 30(4), 1030–1081.
and verification of electricity price forecasting models. doi:10.1016/j.ijforecast.2014.08.008.
In Power Engineering Conference, 2007. IPEC 2007. Weron, R. (2014b). A review of electricity price forecast-
International, 724–729. IEEE. ing: The past, the present and the future. HSC Research
Hyndman, R., Koehler, A.B., Ord, J.K., and Snyder, R.D. Reports HSC/14/02, Hugo Steinhaus Center, Wroclaw
(2008). Forecasting with Exponential Smoothing: The University of Technology.
State Space Approach. Springer Science & Business XM (2016). Intelligent Information Management. URL
Media. http://informacioninteligente10.xm.com.co.
Krafft, M. and Mantrala, M.K. (2009). Retailing in the
21st Century: Current and Future Trends. Springer
Science & Business Media.
427
CHAPTER 13. POWER SYSTEMS
428
CHAPTER 13. POWER SYSTEMS
La función principal de este convertidor electrónico es Suposición 2.1. Para propósitos de control, iL y vO están
elevar el voltaje de salida a un nivel mayor que el voltaje disponibles para medición y todos los parámetros son
de la fuente. Debido a esta caracterı́stica, el uso del conocidos, excepto la resistencia de carga R.
convertidor elevador se extiende a diversas aplicaciones
como; fuentes ininterrumpidas de alimentación, tecnologı́a En la práctica el objetivo final de control es mantener
vehicular, sistemas fotovoltaicos, celdas de combustible de vO regulado a un valor constante vref a pesar de las
hidrógeno y telecomunicaciones, Mohan et al. (2003), entre variaciones en la resistencia de carga. Bajo este escenario
otros. de regulación, al definir una señal de control constante
La configuración convencional de este convertidor se mues- u = ū ∈ U, los puntos de equilibrio de (1) son obtenidos
tra en la Fig. 1, donde vi es el voltaje de entrada, iL es la como
vi
corriente promedio del inductor, vO es el voltaje promedio v̄O = ,
de salida, u es el ciclo de trabajo (considerado como señal 1 − ū
v̄O v̄ 2 (2)
de control), L, C y R denotan el valor de la inductancia, īL = = O .
capacitancia y resistencia de carga. Finalmente M es el R(1 − ū) Rvi
dispositivo activo de conmutación tipo MOSFET y D un Observación 2.2. Para calcular el punto de equilibrio īL
diodo. es necesario conocer la resistencia de carga R con exacti-
tud; sin embargo, en la práctica esta resistencia presenta
variaciones.
Formulación del problema: considere el modelo del
convertidor elevador dado por (1) considerando la ob-
servación (2.1) y la suposición (2.1). Diseñar, si es posi-
ble, una ley de control de retroalimentación de estados
dinámica adaptable, tal que el equilibrio en lazo cerrado
Fig. 1. Configuración convencional de un convertidor ele-
vador. E := (īL , v̄O ) ∈ I × V
sea asintóticamente estable y con una región de atracción
Al aplicar las leyes de voltaje y corriente de Kirchhoff para definida.
ambos casos cuando M se encuentra apagado y encendido,
se obtiene el modelo no lineal promediado (libre de rizo)
3. DISEÑO DEL CONTROLADOR
para el diagrama eléctrico de la Fig. 1 como, Ortega et al.
(1998); Sira-Ramirez and Silva-Ortigoza (2006)
Este trabajo de investigación aborda el enfoque “back-
1h i stepping” para asegurar la regulación del voltaje de sal-
i̇L = vi − (1 − u)vO ida de un convertidor elevador a un valor deseado. Este
L (1)
1h vO i enfoque recursivo es ampliamente conocido por ofrecer si-
v̇O = (1 − u)iL − , multáneamente la construcción de una ley de control y una
C R
función de Lyapunov, esto para estabilizar asintóticamente
donde [iL , vO ]> ∈ I × V, u ∈ U, con conjuntos definidos el punto de equilibrio de sistemas no lineales con una
por forma estricta de retroalimentación, Khalil (2000); Mar-
I := {iL ∈ R : 0 < imin < iL < imax }, quez (2003); Krstić et al. (1995); Sepulchre et al. (1997).
V := {vO ∈ R : 0 < vi < vO < vmax }, Sin embargo, no es posible aplicar directamente esta
U := {u ∈ R : 0 < u < umax < 1}. técnica para estabilizar el modelo (1), ya que este no posee
la forma estricta necesaria.
Nótese que las cotas máximas y mı́nimas se encuentran
relacionadas al dimensionamiento fı́sico del convertidor. El Considerando lo anterior, esta sección muestra como las
modelo (1) se dice que es bilineal, puesto que la señal de ideas recursivas de la técnica “backstepping” estándar son
control u multiplica directamente a las variables de estado. consideradas para diseñar una ley de control de retroali-
Este fenómeno es después explotado en la etapa de diseño. mentación de estados con una dinámica definida, que da
solución al problema formulado, aun cuando se presentan
Observación 2.1. El modelo (1) es desarrollado asumiendo variaciones en la resistencia de carga. En este sentido,
las siguientes consideraciones, Bacha et al. (2014); Sira- el esquema propuesto, inspirado por el control por modo
Ramirez and Silva-Ortigoza (2006). corriente, hace uso de un lazo de voltaje con acción pro-
porcional integral (PI) para generar una señal de referencia
• El convertidor opera en modo de continuo de con- de corriente. Ası́ también, implementa un lazo de corriente
ducción (MCC). diseñado a partir del enfoque recursivo “backstepping”, lo
• Los dispositivos de conmutación y elementos pasivos que resulta en un controlador con dinámica cero estable
son ideales, lo que indica que las perdidas de voltaje en el dominio de interés (vease Fig. 2). Nótese que el
por conmutación y resistencias parásitas son despre- control indirecto del voltaje de salida mediante el control
ciadas. de corriente, es un enfoque robusto ante la fase no mı́nima
• La fuente de alimentación provee un voltaje con- impuesto por el cero (ubicado en el semiplano derecho
stante. complejo) de la función de transferencia vO (s)/u(s).
429
CHAPTER 13. POWER SYSTEMS
430
CHAPTER 13. POWER SYSTEMS
431
CHAPTER 13. POWER SYSTEMS
432
CHAPTER 13. POWER SYSTEMS
R1 ||R2 = 4.1 Ω, cuando ML está encendido 30
R= (46)
R1 = 8.5 Ω, cuando ML está apagado.
Voltaje [V]
25
Estas variaciones se efectúan a una razón de 2.5 Hz. Se
observa que el controlador propuesto asegura la regulación 20
vO
de vO a un valor fijo de 24 V, a pesar de cambios vref
15
repentinos en la carga (Fig. 4a). Es notable, que en 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
Corriente [A]
observada en la Fig. 4b donde se muestra una variación
de 12.5 A a 5 A, que corresponde a una potencia de 10
salida nominal de 150 W y de 67 W, respectivamente.
5 iL
Simultáneamente la señal de control en Fig. 4c muestra
iref
una correcta compensación a los cambios de carga, ya 0
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
que u es cercana al 50% para carga nominal y después Tiempo [s]
se ajusta a 48% sin mostrar saturación alguna. En la Fig.
4d se muestra el comportamiento de las señales auxiliares b)
433
CHAPTER 13. POWER SYSTEMS
434
CHAPTER 13. POWER SYSTEMS
Abstract: Current advances of smart grids are causing a demand for new operations and
services, specially to the low voltage consumers grid. Among these processes,load balancing has
a prominent capability for ensuring stable states between feeders. This paper presents some
results about the design of an automated process to load balancing feeders in final consumption
units of a small urban smart grid using a timed sliced Hierarchical Petri net. The main objective
is to analyze the proposed system and establish an efficient and reliable workflow to automate
load balancing and ensure stability while minimizes intervention. As a result it improves the
quality of power service to the low voltage final consumers.
435
CHAPTER 13. POWER SYSTEMS
Then, we have also review about PN use in SG. It will . Definition 2.2.3. Hierarchical Petri net by Place
addressed, some specific definitions of PN that will be Bounded Substitution.In a PN structure give by
important for the development of this proposal. ”(3)”,Silva (2012):
N = (P, T, F ). (3)
2.1 Review Stage
There is, an Y sub-net which limited by place
so the replacement of this Y sub-net, generates
As load balancing method within AFR context, we first
another net N’ = (P’, T’, F’ ), where: i) P’ =
can mention the transfers overload concentrated method
P \T ∪{S y }, S y is the new element that replaces
(losses and loads) into specific feeder, working from three-
Y ; ii)T’ =T \Y ;iii)F’ =F \Int(Y ),Int(Y )is the inner
phase final consumption units. This method, is based on
Y arcs set, Silva (2012).
minimizing current consumption achieved by Fuzzy ma-
chine inference and Newton-Rhampsom’s optimization al- 3. PROPOSAL
gorithm, between power consumption and power variation
in each feeder,Siti et al. (2011). According, of hypothesis and the opportunity to improve-
Another method, focused only on single-phase final con- ment the load balancing process in the current structure
sumption units, minimizing power and voltage consump- of the LV distribution grid. We propose in this article:
tion use an hybrid genetic algorithm. In this case, modeling an intelligent process for load feeder balancing in
also taking up load transfer but reconnecting single- final consumption units using a Timed Hierarchical Petri
phase consumers in same feeder in grid, with lower load Net (THPN), in order to obtain reliably and efficiently
level,Shahnia et al. (2014). workflow, formally validated.
We can also mention, the hybrid load consumption algo- In fact, this process is based on algorithms of artificial
rithm model for final consumption units in LV grids,based intelligence as fuzzy logic and optimization systems; and
on the Unified Modeling Language (UML)-PN paradigm, also forecast algorithms based on stochastic process as
Sicchar et al. (2011). Which connects data acquisition, Markov chains,Sicchar et al. (2015).
classification, programming and consumption forecast, and The contribution of the validated model, could be useful
sending best selection for arrangement switching feeders in in reshaping modernization of the existing LV distribution
load balancing. grid structure (legacy system). It can be used as the
In currently article, We will continue load balancing in implementation of a specific service in the process of
final consumption units feeders model but, based on hier- automatic feeders reconfiguration in final consumption
archical PN paradigm, using balancing diagnostic,current consumer units, within the scope of small urban smart
and load consumption prediction, minimization consump- grid.
tion and optimal arrangement sequence selection flow al- Through hierarchical PN will be performed validation
gorithms as hierarchical sub-processes in main PN. system design including its inner sub-processes,that make
Thereby, contributing with efficient process in load balanc- feeders reconfiguration system, called ”DPMS”. Which is
ing, which can be used as an alternative method and/or explained in following sub-section.Similarly, use of Timed
interface in existent LV grid, or as support process in transitions in proposal model are intended to represent the
supervision center of a small urban smart grid (SUSG). most realistic way possible an entire period by simulation
processing system, for consumer units feeders balancing.
2.2 Definitions In this particular case, takes a granular period of 60
. Definition 2.2.1.Petri Net.A Petri net structure is a minutes, i.e., seeking feeder reconfiguration lasting one
directed weighted bipartite graph,according ” (1)”,Silva hour, depending on sample consumption obtained at 10
(2012): minute intervals derived from real-time measurement, in
N = (P, T, A, w). (1) the case of a SUSG. Thus, it is not covered in this case,
the load balancing flow in existing secondary grid, which
where: ”P ” is the finite set of places,P 6= ∅. ”T ” is is the subject of future work.
the finite set of transitions, T6= ∅.”A”⊆(PxT)∪(TxP)
is the set of arcs from places to transitions and In this article, will be developed according to initial pro-
from transitions to places.”w ”:A → {1,2,3,· · ·} is posal of the authors,Sicchar et al. (2011) however having a
the weight function on the arcs. where: ”P ” is the contribution a broad and integrated PN with hierarchical
finite set of places,P 6= ∅. ”T ” is the finite set of description of its sub-process. It will be based on the
transitions, T6= ∅.”A”⊆(PxT)∪(TxP) is the set of system developed to consumer units energy consumption
arcs from places to transitions and from transitions diagnose,Sicchar et al. (2015) but now considering beyond
to places.”w ”:A → {1,2,3,· · ·} is the weight function imbalance diagnostic and consumption forecasting stage,
more two stages: minimizing current consumption and
. on the arcs.
Definition 2.2.2. Timed Petri Net.Defined by ”(2)”,Silva switching sequence selection.
(2012):
3.1 DPMS Architecture
N = (P, T, A, w, M 0, f ). (2)
where: (P, T, A,w,M0 ) is a marked Petri net, Silva The process system, is called ”DPMS” because of its four
(2012),”f ”: T→R+ is a firing time function that stages or sub-processes:”Diagnose”, ”Prevision”, ”Mini-
assigns a positive real number to each transition on mization” and ”Selection”, each with a specific algorithm.
the net. So, these formed the DPMS system.
436
CHAPTER 13. POWER SYSTEMS
437
CHAPTER 13. POWER SYSTEMS
438
CHAPTER 13. POWER SYSTEMS
439
CHAPTER 13. POWER SYSTEMS
440
CHAPTER 13. POWER SYSTEMS
Abstract: The increasing use of renewable technologies in power generation may require its
participation on ancillary services like frequency regulation. For the specific case of wind sources,
this may lead to participation in frequency control loops. This paper focuses on the simulation
of the performance of the LFC scheme for a multi-area power system, with participation of
DFIG turbines in the frequency control loops, through the synthetic inertia method.
Keywords: Power systems, load frequency control, wind turbine, pitch angle, multi-area
system.
441
CHAPTER 13. POWER SYSTEMS
442
CHAPTER 13. POWER SYSTEMS
Electromagnetic torque [N*m]
6000
4000
2000
0
0 500 1000 1500 2000 2500 3000 3500
Rotor speed [rpm]
Fig. 3. Wind-turbine operation curve for different wind Fig. 4. Wind-turbine simplified model with additional con-
speeds (Thomsen, 2006) trol loop for LFC contribution (based on (Ramtharan
operating point below its maximum power. As seen from et al., 2007a)).
figure 3, this results in the operating point of the wind-
turbine being moved to the right regarding to its point of additional loops are aggregated to the LFC: one containing
maximum power extraction. a transfer function offering an output proportional (multi-
plying by K1 ) to the frequency change rate (corresponding
The mechanical torque in each of the curves shown in to the primary response of the wind turbine); and the
Figure 3 is given by the equation (Thomsen, 2006): other loop having the task of restoring the power delivered
by the wind turbine after its participation in the load
Pm frequency control. The deviation frequency signal is pre-
Tm = . (1) filtered, previously, by a filter with gain Ka .
wshaf t Gb p
Both the simplified model and the synthetic inertia model
In equation (1), ωshaf t is the rotational speed of the
are depicted in figure 4, where:
wind-turbine shaft, Gb is the gearbox ratio, p denotes
the number of pole pairs in the generator and Pm is the and
mechanical power, which is defined as:
• ωr is the rotor angular speed,
• n is the number of wind-turbines,
1 • iqr is the quadrature rotor current,
Pm = ρπR2 v 3 Cp . (2)
2 • vqr is the quadrature rotor voltage, and
• X1 , X2 , X3 and T1 are constant values representing
There, ρ = 1.225 kg/m3 is the air density, R = 45 m is the
relationships between the internal wind generator
blade’s length, v is the wind speed and Cp is the efficiency
parameters (see (Ramtharan et al., 2007a) for a
coefficient described below:
detailed description of them).
116 −12.5 A 2M W wind-turbine, is selected to perform the simula-
Cp = 0.22(( ) − 0.4β − 5)e λt , (3) tions, the parameters of this turbine are presented in table
λt
A.2 in section 5.
with λt a parameter given by Thomsen (2006):
Moreover, for wind speeds equal or over the rated wind
speed of the wind-turbine, a pitch angle control should
1 be performed to maintain the angular speed of the wind-
λt = 1 0.035 (4)
Rωshaf t − 1+β 3 turbine at its nominal value. Additionally, when a fre-
+0.08β
v quency event occurs, and the turbine is operating under
In this way, the value of the coefficient Cp will be de- the action of pitch control, it is proposed to add a loop of
pending on the pitch angle β, the wind speed v and the additional control where the pitch angle is increased by a
rotational speed ωshaf t . Thus, for each wind speed value value proportional to the frequency deviation (this value
an operating point slightly moved to the right is taken. At is denoted by the constant Rβ ). This pitch control scheme
this operating point, the torque is given by equation (5), is illustrated in Figure 5 with a PI controller.
where the value of Kop = 0.3 is a constant. The whole wind turbine model with the inclusion of control
loops described above is presented in figure 6, where the
Top = Kop v 2 (5) variable n indicates the number of generating units, and
the variable vw indicates the wind speed. This model
Besides the simplified model previously described, is is also has an additional element compared with the models of
necessary to use a fraction of the power generated by the (Ramtharan et al., 2007a) and (Moore and Ekanayake,
wind turbine in order to contribute to the LFC. Using the 2009), the variable Pref , which constitutes the power
so-called synthetic inertia model (see (Ramtharan et al., that the wind-turbine would deliver if it would not be
2007a) and Moore and Ekanayake (2009), a couple of contributing to system frequency control.
443
CHAPTER 13. POWER SYSTEMS
+ +
- - -
+ +
-
+
- + +
+
-
+
Fig. 6. Schematic diagram for wind-turbine integration in the LFC for a single area system (Moore and Ekanayake,
2009).
Fig. 7. IEEE 9 bus system with an arbitrary multi-area For simulation purposes, load disturbances are applied in
partition. each area as follows: In area 1 a perturbation of 0.01 p.u. is
made at 90 seconds, in area 2 a perturbation of 0.08 p.u. is
Simulations were performed using the parameters pre- performed at 60 seconds, and in area 3 a load disturbance
sented in table A.1 (see Appendix), taken from (Anderson of 0.06 p.u. is applied at 30 seconds. Also, the wind speed
and Fouad, 2002) and using 100 M V A as base power. profile employed to feed the wind units is shown in figure
The work is performed under the assumption of area 1 9 as considered in in (Bernard et al., 2013).
444
CHAPTER 13. POWER SYSTEMS
4.2 Results
50.1
Frequency [Hz]
rotor current iqr , and the pitch angle β in wind tur-
bines. The parameters for these controllers were calculated
with the Gradient Descent method, in Matlab Design 49.9
Optimization-Based PID Controller toolbox (see table 1).
49.8 Without wind turbines in LFC
Table 1. Parameters for different PI controllers
With wind turbines in LFC
Parameter Area 1 Area 2 Area 3 iqr Pitch
P 0 0 0 0 7.19 49.7
0 50 100 150 200
I -0.05 -0.05 -0.028 2.70 0.53 Time [s]
Figures 10 - 12 present the frequency deviations for each Fig. 11. Frequency deviation in area 2.
area with and without contribution of wind turbines in
the LFC system of area 3. As the AGC, by design, leads
to the local control of disturbances minimizing the effects
in the other areas, the expected consequence is no sensible 50.1
operational difference between the frequency deviations of
Frequency [Hz]
each area. However, for each area, the inclusion of the 50
wind farm in LFC of area 3 actually helps to compensate
frequency deviations in the LFC, but only when there 49.9
is enough wind to produce the required power. During
the periods of low wind, disturbance effects are harder 49.8
to compensate than the conventional case, due to the Without wind turbines in LFC
lack of renewable power in the system. Besides this, 49.7 With wind turbines in LFC
it is important to highlight that the presence of hydro
generation in area 1 implies a greater inertia, diminishing 0 50 100 150 200
Time [s]
even more the effects of frequency disturbances. The effects
over areas 2 and 3 are pretty similar, because these areas Fig. 12. Frequency deviation in area 3.
have similar conditions.
The power exchanged between area 3 and the other areas
0.2
is shown in figure 12. As it was said before, both frequency With wind turbines in LFC
Without wind turbines in LFC
and power deviations are low when there is enough wind
Power exchanged [p.u.]
0.15
speed to sustain the power contribution of the wind farm.
Once wind speed is low, area 3 needs to increase the power 0.1
absorbed from the order areas in order to reduce the effects 0.05
of local disturbances in the local area frequency.
0
50.05 −0.05
50 −0.1
0 50 100 150 200
Frequency [Hz]
49.9
Fig. 13. Power exchange for area 3.
49.85
Without wind turbines in LFC 5. CONCLUSION
49.8 With wind turbines in LFC
The simulation of the performance of the LFC scheme for a
49.75 multi-area power system, with participation of DFIG tur-
0 50 100 150 200
Time [s] bines in the frequency control loops, through the synthetic
inertia method, with PI controllers for the AGC control
Fig. 10. Frequency deviation in area 1. was presented. The results show that wind turbines are
useful for frequency regulations tasks in primary control.
The responses of the rotor voltage vqr and the pitch angle However, the variability of wind and the effects of the de-
βref are depicted in figures 14 and 15, respectively. These creasing inertia from conventional units can be dangerous
figures show how the participation of wind turbines in LFC for frequency performance. Additionally, it is also shown
can be more stressful for them. However, a less aggressive that the participation of wind turbines in LFC introduces
performance could be obtained with a better tuning of the more stress in the operation of these units, requiring the
involved PI controller, or by the implementation of another exploration of control techniques that help to reduce these
kind of controllers. efforts for the wind units.
445
CHAPTER 13. POWER SYSTEMS
ternational, 1–5.
40
Quadrature rotor voltage [V]
446
CHAPTER 13. POWER SYSTEMS
×106 7
1.0 Inlet pressure 0.6 MPa Inlet pressure 0.6 MPa
Inlet pressure 1.0 MPa 6 Inlet pressure 1.0 MPa
Clearance volume
0.8 Inlet closed 5
Pressure, Pa
Speed, m/s
0.6 4
3
0.4 Swept volume
2
0.2 1
0.0 0
0.0 0.2 0.4 0.6 0.8 1.0 1.2 0.00 0.02 0.04 0.06 0.08 0.10 0.12
Volume, m3 ×10−3 Time, s
Fig. 10. Piston velocity vs. displacement Fig. 12. Piston velocity vs. displacement
π
7 2
Inlet pressure 0.6 MPa Rotation accelerates
6 Inlet pressure 1.0 MPa 3 Rotation decelerates
8π
Rotationangle ϕ, rad
1 Leading edge
4 4π
3 1
8π
Trailing edge
2 Inlet port
0
1
0 − 18 π
0.00 0.05 0.10 0.15 0.20 0.25 0.00 0.05 0.10 0.15 0.20 0.25
Displacement, m Displacement, m
Fig. 11. Piston velocity vs. displacement Fig. 13. Piston velocity vs. displacement
Two inlet pressures 1.0 MPa and 0.6 MPa are compared
and the configuring factors are identified (Fig. 10).
π
By performing such simulations within the expected inlet 2
Inlet pressure 0.6 MPa
pressure range with a certain step, a matrix of these Inlet pressure 1.0 MPa
Rotation angle ϕ, rad
1
4π
4.5 Parameters description
447
CHAPTER 13. POWER SYSTEMS
5. RESULTS AND DISCUSSIONS Imran, M., Usman, M., Park, B.S., and Lee, D.H. (2016).
Volumetric expanders for low-grade and waste heat re-
The developed model is based on parameters of the se- covery applications. Renewable and Sustainable Energy
lected linear motor and the servomotor and allows to Reviews, 57, 1090–1109.
synchronize the rotation with the linear movement of the Lemort, V., Quoilin, S., Cuevas, C., and Lebrun, J. (2009).
expander. By decreasing the linear speed during the intake Testing and modeling a scroll expander integrated into
phase, the pressure loss at the inlet can be reduced. How- an organic rankine cycle. Applied Thermal Engineering.,
ever, reduced translation speed means lower frequency and 29, 3094–3102.
the mass flow rate through the expander, so the optimum Mikalsen, R. and Roskilly, A. (2008). The design and sim-
has to be found (Fig. 12). ulation of a two-stroke free-piston compression ignition
engine for electrical power generation. Applied Thermal
The rotation of the piston can be increased or decreased
Engineering, 28(56), 589–600.
when necessary for a better fit of the inlet. A moderate
Petrichenko, D., Tatarnikov, A., and Papkin, I. (2015).
rotating acceleration/deceleration is achievable in combi-
Approach to electromagnetic control of the extreme
nation with the piston deceleration during the intake. The
positions of a piston in a free piston generator. Modern
final deceleration must be adjusted so the leading edge of
Applied Science, 9(1), 119–128.
the skirt opening reaches the outlet port at the end of
Tran, X.B. and Yanada, H. (2013). Dynamic friction
the stroke. In this case this angle is π/2 (Fig. 13). Two
behaviors of pneumatic cylinders. Intelligent Control
rotation/translation profiles for different inlet pressures
and Automation, 4(2).
are shown on Fig. 14
The shape of the inlet port needs to be adjusted to Appendix A. NOMENCLATURE
”follow” an optimal intake profile in order to keep the
inlet intersection area around its maximum during the A.1 Latin characters
intake phase. Otherwise higher pressure ratios will require
high dynamics from electromagnetic train or will cause D diameter (m)
relatively high intake losses. F force (N)
L length (m)
The presented model is focused on the inlet control and m mass (kg)
therefore simplified by setting the outlet pressure as con- n number of ports (-)
stant. p pressure (Pa)
S intersection area (m2 )
6. CONCLUSIONS
t time (s)
V volume (m3 )
The dynamics of the system depend on the linear generator
x displacement (m)
used. Industrial linear motors are characterized by large
moving masses. It is possible to reduce the weight of
the piston if magnets are directly attached to it. A high A.2 Greek characters
dynamics of the piston movement is necessary to increase
β port angle (rad)
the resulting frequency and the volumetric flow rate of the
∆ difference (-)
machine. A relatively high static force is required to keep
υ speed (m/s)
the piston under control at its extreme positions. The use
φ angle of rotation (rad)
of a position encoder ensures the high accuracy of the inlet
ϕ filling factor (-)
timing, which is crucial for the system efficiency.
χ relative displacement (-)
The proposed system contains no bouncing devices such
as gas- or mechanical springs, which are typically used A.3 Subscript
in free piston machines. Instead, the piston movement
is fully controlled, so its velocity becomes zero at both cyl cylinder
extreme positions. A higher system efficiency is expected dis discharge
since mechanical wear or thermodynamic irreversibilities el electromagnetic
are avoided. fr friction
max maximal
REFERENCES min minimal
Bell, I.H., Wronski, J., Quoilin, S., and Lemort, V. (2014). open opening
Pure and pseudo-pure fluid thermophysical property port port
evaluation and the open-source thermophysical property rot rotation
library coolprop. Industrial & engineering chemistry su supply
research, 53(6), 2498–2508.
Gusev, S., Ziviani, D., De Viaene, J., Derammelaere, S.,
and van den Broek, M. (2016). Modelling and prelimi-
nary design of a variable-bvr rotary valve expander with
an integrated linear generator. In Proceedings of the
17th International Refrigeration and Air Conditioning
Conference at Purdue.
448
CHAPTER 13. POWER SYSTEMS
Resumen: En este artı́culo se demuestra formalmente que un sistema fotovoltaico (FV), con
arquitectura basada en MPPT-distribuido, está constituido por una interconexión pasiva
de sistemas pasivos. El resultado se presenta, primero, para un inversor FV tipo string
y posteriormente, se plantea la extensión de los resultados a inversores FV con MPPT-
distribuido. Como consecuencia, este estudio establece las bases para el diseño de controladores
de seguimiento de punto de máxima potencia y de inyección de potencia activa/reactiva en
el punto de conexión común, para sistemas FV con MPPT-distribuido, desde una perspectiva
basada en la interconexión de sistemas.
449
CHAPTER 13. POWER SYSTEMS
450
CHAPTER 13. POWER SYSTEMS
i pv1 io
+
vpv1
= +
MFV 1 MC 1 vo1 Inversor Trifásico
- -
=
u1 ea
ia
i pv2
SW1 SW2 SW3
Lg RLg
+
vpv2
= + +
ib eb
MFV 2 MC 2 vo2 Co
- - vo
= - Lg RLg
ic ec
u2
SW4 SW5 SW6
Lg RLg
i pvN
+ = +
MFV N vpvN MC N voN
- -
= ua ub uc
uN
Notar que, J es una matriz anti-simétrica de 2 × 2, la cual como la función de almacenamiento de energı́a del subsis-
se conoce como matriz de interconexión. Además, R es tema Σ1k , entonces la derivada de H(z1k ) a lo largo de las
una matriz diagonal definida positiva, tal que las siguientes trayectorias del subsistema está dada por:
propiedades se satisfacen J = −J T y R = RT > 0. Por
último, observar que el modelo en (2) se puede representar Ḣ(z1k ) = z1k ż1k
a través de una estructura interconectada como se ilustra −1 −1 2
= −RC C1 z1k + z1k e1k .
en la Fig. 4. 1
451
CHAPTER 13. POWER SYSTEMS
[ ]
Ahora, si se define como salida pasiva a vo 0
( ) 0 vo .
∂H(zk ) g(x) =
Y= , −id −iq
∂zk
Notar que, nuevamente Ji es una matriz anti-simétrica, en
entonces, se satisface la siguiente desigualdad: este caso de 3×3, y Ri es una matriz diagonal semi-definida
Ḣ(zk ) < Y T Ek . (4) positiva, tal que las siguientes propiedades se satisfacen
Resultado 3. El sistema Σk representado matemáticamen- Ji = −JiT y Ri = RTi ≥ 0.
te por:
Σk : L2e (U ) → L2e (U ) Ahora, tomando la derivada del Hamiltoniano a lo largo
Ek 7→ Y = Σk (Ek ) de las trayectorias del sistema (6) se obtiene:
El modelo del convertidor cd/ca trifásico (inversor ) se Comentario 5. Nuevamente, observar que el control de
presenta en esta sección en coordenadas del marco de potencia activa/reactiva en el punto de interconexión a
referencia dq. Este modelo también puede ser escrito, en la red eléctrica, deberá ser diseñado tal que, se preserven
forma compacta, a través de una estructura hamiltoniana las propiedades de pasividad del sistema Σi .
dada por:
4. INVERSOR TIPO STRING
∂Hi (x)
ẋ = [Ji − Ri ] + g(x)m + ϑ (6) En esta sección se presenta el análisis de estabilidad
∂x
para un inversor tipo string. Como se ilustra en la Fig.
donde x = [x1 , x2 , x3 ]T = [Lg id , Lg iq , Co vo ]T representa el 5, esta topologı́a FV consiste de una cadena de MFV,
vector de variables del sistema, m = [md , mq ]T representa un único MC y un inversor. A partir de los resultados
el vector de entradas de control, y ϑ = [−ed , −eq , io ]T re- obtenidos en las secciones anteriores se demostrará, en las
presenta el vector de entradas externas. El Hamiltoniano, siguientes subsecciones, que un SFV, con inversor tipo
en este caso, está dado por string, está constituido por la interconexión pasiva de
1 1 1 sistemas pasivos.
Hi (x) = L−1 x2 + L−1 x2 + C −1 x2
2 g 1 2 g 2 2 o 3 4.1 Propiedades de Pasividad SFV String
y
[ ]
0 −ωdq Lg 0 RLg 0 0 El análisis toma como referencia el sistema ilustrado en
Ji = ωdq Lg 0 0 ; Ri = 0 RLg 0 , la Fig. 4, el cual satisface las siguientes restricciones de
0 0 0 0 0 0 interconexión (con k = 1):
452
CHAPTER 13. POWER SYSTEMS
CD + CD -1
vpv1 vo i pv1 e11 C11 z11
+
CD - CA Σ11
-
- -1
L z21 e21
Gn Σ 21(u1)
MFV n -1 e31
C21 z31
-1
i pv2 + e12 C11 z12
Σ12
Figura 5. SFV con inversor string. -
−1
e11 = ipv1 − L z21 -1
L z22 e22
e21 = C1−1 z11
+
0 Σ 22(u2) + 0
e31 = −io
+ -1 e32
e41 = C2−1 z31 . C21 z32 -
A partir de estas ecuaciones y usando los resultados 2-4,
para cualquier ipv1 ∈ L2e (U ), y cualquier T ≥ 0 se cumple e4 io
que Ψ(.)
En esta sección se presenta el análisis para el SFV con 5.1 Propiedades de Pasividad SFV con MPPT Distribuido
MPPT-distribuido. De hecho, éste es una extensión del
resultado que se presentó en la sección 4. El análisis se El sistema interconectado, ilustrado en la Fig. 6, satisface
realizó únicamente para la conexión en cascada de 2 MC las siguientes restricciones de interconexión ∀ j ∈ {1, 2}:
453
CHAPTER 13. POWER SYSTEMS
e1j = ipvj − L−1 z2j Ricardo Alonso, Eduardo Román, Asier Sanz, Vı́ctor
E. Martı́nez Santos, and Pedro Ibáñez. Analysis of
e2j = C1−1 z1j Inverter-Voltage Influence on Distributed MPPT Archi-
e3j = ∆io − io tecture Performance. IEEE Transactions on Industrial
2 Electronics, Vol. 59, No. 10, October 2012.
∑
e4 = C2−1 z3j Hugues Renaudineau, Fabrizio Donatantonio, Julien Font-
j=1
chastagner, Giovanni Petrone, Giovanni Spagnuolo,
Jean-Philippe Martin, and Serge Pierfederici. A PSO-
A partir de estas ecuaciones, para cualquier ipvj ∈ L2e (U ) Based Global MPPT Technique for Distributed PV
∀ j ∈ {1, 2}, y cualquier T ≥ 0 se cumple que Power Generation. IEEE Transactions on Industrial
Electronics, Vol. 62, No. 2, February 2015.
⟨C1−1 z11 , e11 ⟩T + ⟨L−1 z21 , e21 ⟩T + ⟨C2−1 z31 , e31 ⟩T + Matthias Kasper, Dominik Bortis, and Johann W. Kolar.
Classification and Comparative Evaluation of PV Panel-
+⟨C1−1 z12 , e12 ⟩T + ⟨L−1 z22 , e22 ⟩T + ⟨C2−1 z32 , e32 ⟩T +
Integrated DC/DC Converter Concepts. IEEE Transac-
+⟨io , e4 ⟩T = tions on Power Electronics, Vol. 29, No. 5, May 2014.
⟨C1−1 z11 , ipv1 ⟩T −1
+ ⟨C1 z12 , ipv2 ⟩T ≥ Arjan Van der Schaft. L2-Gain and Passivity Techniques
in Nonlinear Control. Springer, 2000.
ϵ11 ∥ (Σ11 (e11 ))T ∥22 −β11 + ϵ12 ∥ (Σ12 (e12 ))T ∥22 −β12 Y. Mahmoud, W. Xiao y H. H. Zeineldin. A Simple
−β21 − β31 − β21 − β32 ≥ Approach to Modeling and Simulation of Photovoltaic
Modules. IEEE Trans. Sustain. Energy, vol. 3, no. 1,
En consecuencia, el sistema interconectado de la Fig. 6, pp. 185-186, Jan. 2012.
con entrada R. Cisneros, M. Pirro, G. Bergna, R. Ortega, G. Ippoliti,
T
(ipv1 , ipv2 ) , M. Molinas. Global Tracking Passivity-Based PI Con-
y salida trol of Bilinear Systems: Application to the Boost and
( −1 )T
C1 z11 , C1−1 z12 Modular Multilevel Converters. Control Engineering
Practice,43, October 2015, 109-119.
es estrictamente pasivo a la salida, lo cual implica que un
SFV con MPPT-distribuido, con k = 2, es L2 ganancia-
finita, Arjan Van der Schaft (2000).
6. CONCLUSIÓN
REFERENCIAS
R. C. N. P. Podgurski y D. J. Perreault. Submodule
Integrated Distributed Maximum Power Point Tracking
for Solar Photovoltaic Applications. IEEE Trans. Power
Electron., vol. 28, no. 6, pp. 2957-2967, Jun. 2013.
Freddy Chan y Hugo Calleja. Reliability Estimation of
Three Single-Phase Topologies in Grid-Connected PV
Systems. IEEE Trans. Ind. Electron., vol. 58, no. 7, pp.
2683-2689, Jul. 2011.
Samir Kouro, Jose I. Leon, Dmitri Vinnikov, y Leopoldo G.
Franquelo. Grid-Connected Photovoltaic Systems: An
Overview of Recent Research and Emerging PV Conver-
ter Technology. IEEE Industrial Electronics Magazine,
vol. 9, no. 1, pp. 47-61, Mar. 2015.
454
CHAPTER 13. POWER SYSTEMS
Abstract: This paper is concerned with the power dispatch in a microgrid. The dispatch problem
is formulated as linear program. Thus, the proposed solution is the application of neural network
that solves linear programming on-line. This proposal in motivated by the increasing electric
energy demand and the rising need to incorporate sustainable energy sources to the power grid
in a reliable scheme. A microgrid is an interconnection of distributed energy sources (DES),
with the tendency to include renewable energies that offer many advantages to customers and
utilities. The different DES that compose the microgrid are controlled independently to track
the optimal reference provided by the proposed method in order to supply a demanded power
output minimizing the consumed power from the main grid.
Dynamical systems which can solve real-time optimization The proposed approach has attractive features such
were introduced in Pyne (1956) . Since then, other major as: fixed time convergence to the optimization problem
contributions have been proposed by Korovin and Utkin solution and a fixed number of parameters (one in this
(1974) , Pazos and Bhaya (2009) , Wang (1993) and Wilson case), regardless of the optimization problem dimension.
(1986). Due to its inherent massive parallelism, these Therefore, it offers the scalability characteristic, that
455
CHAPTER 13. POWER SYSTEMS
allows the possibility of adding other energy sources to selecting Tmax (ρ) ∈ T and calculating the inverse of the
the microgrid. settling-time function, allowing the tuning of ρ.
In the following, Section 2 presents the mathematical It is worth to note, the true fixed stabilization time
preliminaries and some useful definitions. In Section 3 the for a system designed based on prescribed-time stability
basis of the linear programming problem are established is unknown but bounded by Tmax (ρ). In contrast, a
and the proposed algorithm is presented. Section 4 designed system with predefined-time stability has a
describes the microgrid connection and the controllers known stabilization time.
implemented along with the linear programming problem
for the power dispatch and shows the simulation results. Definition 2.5. (Predefined-time based design). The par-
Finally, in Section 5 the conclusions are presented. ticular case when for the system (1), the time Tf defined
in (3) can be tuned by a particular selection of the system
2. MATHEMATICAL PRELIMINARIES parameters ρ, Tf = Tf (ρ), is referred to the notion of the
predefined-time stability.
Consider the system With the definition of a predefined-time attractive set, the
ξ˙ = f (t, ξ) (1) following lemma provides a Lyapunov characterization of
where ξ ∈ Rn and f ∶ R+ × Rn → Rn . For this system, its
a class of these sets on the state space:
initial conditions or initial states are ξ(t0 ) where t0 ∈ R+ . Lemma 2.1. (Lyapunov function Sanchez-Torres et al. (2015)).
The time variable t is defined on the interval [t0 , ∞). If there exists a continuous radially unbounded function
V ∶ Rn → R+ ∪ {0}
such that V (x) = 0 for x ∈ M and any solution x(t) satisfies
The idea of the sliding mode control is highly related with
the finite- time stability. This time however often depends
V̇ ≤ − exp(V p )V 1−p
on the initial conditions of the system. The case when α
(4)
convergence time is uniform and independent of the initial p
conditions is known as fixed time stability Cruz-Zavala for α > 0 and 0 < p ≤ 1, then the set M is globally
et al. (2010). Polyakov (2012) gives a precise statement of predefined-time attractive for the system (1) and Tmax =
the fixed time stability :
α
1
+ t0 .
Definition 2.1. (Globally fixed-time attraction ). Let a non-
empty set M ⊂ Rn . It is said to be globally fixed-time Proof See Sanchez-Torres et al. (2015).
attractive for the system (1) if any solution ξ(t, ξ0 ) of (1) Definition 2.6. (Predefined-time stabilizing function). For
reaches M in some finite time moment t = T (ξ0 ) and the x ∈ Rn , the predefined-time stabilizing function is defined
settling-time function T (ξ0 ) ∶ Rn → R+ ∪{0} is bounded by as
some positive number Tmax , i.e. T (ξ0 ) ≤ Tmax for ξ0 ∈ Rn .
Φp (x) = exp (∥x∥ )
1 p x
∥x∥
p (5)
p
where 0 < p ≤ 1.
Note that there are several choices for Tmax , for example
if T (x0 ) ≤ Tm for a positive number Tm , also T (x0 ) ≤ λTm
with λ ≥ 1. This motivates the definition of a set which With the definition of the stabilizing function, let the
contains all the bounds of the settling-time function. following dynamic system:
Definition 2.2. (Settling-time set). Let the set of all the Lemma 2.2. (Predefined-time stable dynamical system). For
bounds of the settling-time function for system (1) be every initial condition x0 , the system
defined as follows:
T = {Tmax ∈ R+ ∶ T (x0 ) ≤ Tmax } . ẋ = − Φp (x)
1
(2) (6)
Tc
In addition, the minimum bound for the settling-time with Tc > 0, and 0 < p ≤ 1 is predefined-time stable with
function of (1) is defined as: settling-time Tc . That is, x(t) = 0 for t > t0 + Tc in spite of
Definition 2.3. (Minimum bound for the settling-time set). the x0 value.
Consider the set T defined in (2), let the time Tf ∈ R such
Proof See Sanchez-Torres et al. (2015).
that
Tf = {T ∈ T ∶ T ≤ Tmax , ∀Tmax ∈ T } . (3) In order to apply the previous result to control design,
consider the dynamic system
ξ˙ = ∆(ξ, t) + u
Note that for some systems Tmax can be tuned by a
particular selection of the system parameters, this notion (7)
refers to the prescribed-time stability which is given in with ξ, u ∈ R and ∆ ∶ R+ × R → R . The main
n n n
Fraguela et al. (2012) and the predefined-time stability objective is to drive the system (7) to the point ξ = 0
Sanchez-Torres et al. (2015). The prescribed-time stability in a predefined fixed time in spite of the unknown non-
based design and the predefined-time stability based vanishing disturbance ∆(ξ, t). A solution to this problem
design are explained in the following definitions. which does not require an individual selection of each
Definition 2.4. (Prescribed-time based design). Consider the of the n control variables based on the unit control is
set T defined in (2). The particular case when for the presented in the following theorem:
system (1), Tmax can be tuned by a particular selection Theorem 2.1. (Predefined-time multivariable control). Let
of the system parameters ρ, Tmax = Tmax (ρ), is referred to the function φ(ξ, t) to be bounded as ∥∆(ξ, t)∥ ≤ δ, with
the notion of the prescribed-time stability which is given 0 < δ < ∞ a known constant. Then, by selecting the control
in Fraguela et al. (2012). This design is performed by input
456
CHAPTER 13. POWER SYSTEMS
u = −(+ δ)
1 ξ For this case, the multivariable activation functions
∥ξ∥ are ϕ (x, l, h) = [ ϕ1 (x, l1 , h1 ) . . . ϕn (x, ln , hn ) ] , with
exp(∥ξ∥) T
Tc
with Tc being a scalar, the system (7) is globally ϕi (x, li , hi ) of the form
⎧
⎪ xi − li
⎪− exp(∥x − l∥) if xi ≤ li
predefined-time stable with settling-time upper bounded
⎪
⎪
⎪ ∥x − l∥
⎪
⎪
by Tc .
ϕi (x, li , hi ) = ⎨0 if li < xi < hi
⎪
⎪
⎪ −
⎪
Proof : See Sanchez-Torres et al. (2015).
⎪
⎪− exp(∥x − h∥) if xi ≥ hi
x h
⎪
i i
⎩ ∥x − h∥
2.1 Linear Programming Problem Statement (18)
and
φ (σ) = −
σ
∥σ∥
Let the following linear programming problem: exp(∥σ∥). (19)
⎧
⎪
⎪
⎪
⎪
minx cT x Therefore, with the structure given by (13) and the KKT
⎨s.t. Ax = b
⎪
(8)
⎪
multiplier as in (16) and (17), with activation functions
⎪
⎪ l≤x≤h
⎩ (18) and (19), the following theorem presents a RNN which
solves (8) in predefined-time.
where x = [ x1 . . . xn ] ∈ Rn are the decision variables,
T
Hence, the Lagrangian of (8) is Ts > 0, the point x∗ is globally predefined-time stable with
L (x, y, z) = cT x + z T x + y T (Ax − b) . (9) settling-time Ts .
The KKT conditions establish that x∗ is a solution for (8) Proof : The dynamics of (15) is given by
if and only if x∗ , y and z in (8)-(9) are such that σ̇ = A (−c + AT y + z) . (21)
∇x L (x , y, z) = c + z + A y = 0
∗ T
(10)
Ax∗ − b = 0
Therefore, with the selection of y as in (16), the system
(11) (21) reduces to
zi x∗i = 0 if li < x∗i < hi , ∀i = 1, . . . , n. (12)
σ̇ = −
1 σ
Ts ∥σ∥
exp(∥σ∥).
3. A RECURRENT NEURAL NETWORK (RNN) FOR Thus, from Theorem 2.1, a sliding mode is induced on the
LINEAR PROGRAMMING PROBLEM manifold σ = 0. Therefore, the set Ωe is predefined-time
attractive with settling-time Ts .
On the manifold σ = 0, the equivalent value of φ is
Following the KKT approach, from Loza-Lopez et al.
the solution of σ̇ = 0. Resulting to φeq = 0 or yeq =
(2015) a recurrent neural network which solves the
(AAT )−1 [Ac − Az]. Therefore, the dynamics of (13) on
problem (8) in finite time is proposed. For this purpose, let
Ωe = {x ∈ Rn ∶ Ax − b = 0} and Ωd = {x ∈ Rn ∶ l ≤ x ≤ h} .
According to (8), x∗ ∈ Ω where Ω = Ωd ∩ Ωe . that manifold is
ẋ = −cΛ + Λz. (22)
From (10), let
ẋ = −c + AT y + z, (13) With the selection of z as in (17), the resulting system (22)
is
then, y and z must be designed such that Ω is an attractive
ẋ = −cΛ + Λ (∥c∥ + ) ϕ (x, l, h) .
1
set, fulfilling conditions (10)-(12). For this case, in addition Ts
Consider the Lyapunov function V = ∥x∥. Its derivative is
to condition (12), z is considered such that
zi ≥ 0 if xi ≥ hi given by V̇ = ∥x∥
{
xT
ẋ. Therefore
zi ≤ 0 if xi ≤ li
, (14)
xT
and the variable σ ∈ Rm is defined as V̇ = [−cΛ + Λ (∥c∥ + ) ϕ (x, l, h)]
1
∥x∥ Ts
σ = Ax − b. (15) (23)
xT 1
≤ [ ϕ (x, l, h)] .
In order to obtain predefined-time stability to the solution ∥x∥ Ts
x∗ , the terms y and z are proposed in (13) as
Replacing the Lyapunov function
y = (AAT )−1 [Ac − Az + φ (σ)]
1 ⎧
⎪
(16) ⎪
⎪≤ − exp(V ) if x < l or x > h
1
V̇ ⎨
Ts
⎪
Ts (24)
⎪
⎩= 0 if l ≤ x ≤ h
⎪
and
z = (∥c∥ + ) ϕ (x, l, h)
1
(17)
Ts From Theorem 2.1, the set Ωd is predefined-time attractive
respectively, where Ts > 0. with settling-time Ts .
457
CHAPTER 13. POWER SYSTEMS
Lambda
9
predefined-time convergence to the set Ω. Now, by using
the equivalent control method, the solution of ẋ = 0 and
8 Lambda
Lambda Reference
c + A yeq + zeq = 0.
T Wind Speed
13
Speed (m/s)
Therefore, the condition (10) is fulfilled, implying the point 12
10
Remark 3.1. Note that, in contrast to the most of the 0 2 4 6 8 10 12 14 16 18
RNN presented in the literature, this scheme only needs Time (sec)
Power (W)
400 Pv Maximum Power
458
CHAPTER 13. POWER SYSTEMS
0 1500
Power (W)
-100 1400
-200 1300
-300
1200
Load Power
0 1 2 3 4 5 6 7 8 9 10 Optimizer Load Power
1100
Time(sec)
1000
Figure 3. Battery power and reference. 0 5
Time(sec)
10 15
Power (W)
400
conditions. A discrete equivalent controller is applied to
the each converter, and they switch according to the sign 200 Grid Power
Grid Power Reference
line is the power given or extracted by the BBS, and the Wind Power Tracking
100
red dot line is a sinusoidal power reference to track.
Power (W)
50
459
CHAPTER 13. POWER SYSTEMS
460
CHAPTER 13. POWER SYSTEMS
Abstract: This paper proposes a methodology for the stability assessment of a power grid based
on percolation theory. Once it is proved that the rotor angle stability depends on the initial
condition and the topological properties of the network, tools provided by complex network
and percolation theory are used to identify the most vulnerable buses from a transient stability
perspective in order to locate the best places to install control to prevent the spread of lack
of synchronism through the grid. Finally, the proposed approach is tested on the transmission
network system of IEEE 14.
Keywords: Smart Power Grid, Transient Stability, Complex Networks, Percolation Theory.
1. INTRODUCTION However, power systems are large scale systems, and power
engineers need to predict those scenarios based on guesses;
sometimes, criteria like bigger capacity are used. However,
Power system is the set of elements that generate, transmit they are still guesswork made by humans, and are prone
and distribute electrical energy to consuming agents such to mistakes. (M.Pavella (1994))
as residential buildings, factories, street lighting, and so on.
Most of power generators are electromechanical systems, Currently, the power grid is continuously growing thanks
they use the energy of water or fossil fuels to generate to new uses of electrical energy and demand vegetative
mechanical energy, which is then converted into electrical growth. Also, the envisioned future power generation will
energy (A.R. Bergen (1999)). Normal operating conditions rely on renewable energy sources and they are highly
or steady state satisfy the following conditions: stochastic, there will be an increasing number of transient
disturbances acting on the power system. Considering
- All machines rotate at the same velocity and they are these future conditions, it is going to be more difficult to
never accelerated. guess the important fault scenarios.
- The voltage and currents generated have the same
frequency. This paper proposes an approach of transient stability
analysis in the power grid based on topological information
The first condition is called frequency stability while the by identifying the most vulnerable generators of the grid
second is known as voltage stability. When all generators using percolation theory. The main objective of identifying
rotate at the same velocity, the relative difference between the vulnerable nodes is to place controls on them in order
their rotor angles remains constant. Rotor angle stability to prevent the loss of synchronism in the other generators
is the ability of the power system to maintain this syn- of the grid.
chronism.
Transient Stability is the name given to the maintenance
of rotor angle stability when the system is subjected to 2. REVIEW OF TRANSIENT STABILITY ANALYSIS
large disturbances (Kundur et al. (2004)).
There are several methods for transient stability analysis The equation that governs rotor motion of a synchronous
of power systems such as: machine is known as the swing equation, and is based on
the elementary principle in dynamics which states that
- Automatic Learning: Based on machine learning tech- accelerating torque is the product of the moment of inertia
niques.(Wehenkel (1998)) of the rotor times its angular acceleration. It can be written
- Direct methods: Based on obtaining Lyapunov func- for the synchronous generator in the form
tions for simple models of the systems. (Varaiya et al.
(1985))
- Simulations in time domain: Solving differential equa- H d2 δ dδ
= Pa = Pm − P e − D , (1)
tions using numerical methods. (M.Pavella (1994)) 180f dt2 dt
In practical situations, the most common way to check where H is the inertia constant of the machine, f is the
transient stability is running vast time-domain simulations frequency of the system, δ is the rotor angle, Pa is the
for those that are considered the important fault scenarios. accelerating power, Pm is the mechanical power, Pe is the
461
CHAPTER 13. POWER SYSTEMS
electrical power and D is the damping coefficent of the Consider a network whose edges are present with proba-
machine (J.Machoswki (2008)). bility p; percolation studies the emergence of paths that
percolate through the grid. For great values of p a lot edges
Constants coefficients H and D depend on the size and
are present, then big clusters of nodes connected by edges
capability of the machine, f is a constraint of the system,
can form. The creation of a giant component during the
and Pm depends on the external mechanical power (water,
percolation process is known as a percolation transition
oil, etc.), and for this analysis it is taken as a constant.
(Albert and lszl Barabsi (2002)).
Imagine taking a network and removing some fraction of
This equation shows that every time the mechanical and its vertices or edges, what effect this will have on the
the electrical power are unequal the second derivative of performance of the network?, this is a practical interest
the rotor angle is proportional to that difference; meaning that Percolation Theory can help us to analyze. It can help
that there is an acceleration in the machine. us to understand the macroscopic behavior of networks
Considering Pm as a constant, the main variable to analyze in relation to the microscopic states of its components
is the electric power; for simplified models this power is (Newman (2010)).
described as
4. TRANSIENT STABILITY APPROACH FROM A
|E 0 ||E 0 | PERCOLATION THEORY PERSPECTIVE
Pe = Pmax · sinδ = 1 2 · sinδ (2)
X
First, the power grid must by modelled as a complex
For (2) the generator is supplying power through a trans- network. The regional transmission system was simulated
mission system. Voltage |E10 | is the transient internal volt- but the local distribution system will be excluded from
age of the generator and |E20 | is the voltage at the receiving the analysis. The vertices of a complex network represent
end and X is the transfer reactance between |E10 | and |E20 | the generation centrals, and load or switching substations;
(J.Grainger (1994)). meanwhile, the edges are modelled as the high voltage
Most systems have more than one synchronous machine; transmission lines. The topology is not hard to determine,
for a system with n generators, the active output of the a power network has a spatial and geographic interpreta-
ith generator is given by tion, and its the same given to its graph model.
The admittance matrix of a power network has the same
formulation of the Laplacian of a graph: the diagonal
Pe = Re{Ei Ii∗ }
elements of the admittance matrix are equal to the degree
n
X matrix and the non-diagonal elements are the same to the
= Re Ei Yij∗ Ej∗ adjacency matrix.
j=1
n
For a graph G formally defined as G = (V, E, ω), where V
X is a finite set of vertices, E is a set of edges, and ω is the
= Ei2 Gii + Ei Ej [Bij sin(δi − δj ) + Gij cos(δi − δj )]
weight between vertices (Mesbahi and Egerstedt (2010)).
j=1
The Laplacian L(Gw ) is defined as
(3)
X
|Ei | and |Ej | are the internal transient voltages, and Bij [∆(Gw )]ii = wij (4)
and Gij are the susceptance and conductance between {j|(vj ,vi ) E(G)}
generators i and j (P.Kundur (1994)).
Equation (3) demonstrates that rotor angle stability de- wij if vi vj E(G)
[A(Gw )]ij = (5)
pends highly on the initial conditions of the system and 0 otherwise
its transfer admittance, which is heavily reliant on the
topological structure of the power network. L(Gw ) = ∆(Gw ) − A(Gw ) (6)
3. PERCOLATION THEORY AND NETWORK This paper considers the following considerations for the
RESILIENCE modelling of the power grid.
- It only considers the high voltage system and it is
Percolation theory describes the behavior of connected assumed balanced.
clusters in a graph, in order to understand this concept - The impedances between bus and neutral are ne-
assume that some liquid is poured on top of a porous glected.
material, and wonder if the liquid will be able to reach - All transmission line and transformers are mod-
the bottom of it (Broadbent and Hammersley (1957)). elled as weighted edges, the weight is equal to the
Percolation is a pervasive concept that arises not only impedance between nodes.
from atomic and molecular solids in physics, it also comes - All parallel lines are modelled as an equivalent single
from social, technological, and natural systems (Strogatz line.
(2001); ben Avraham and Glasser (2007); S et al. (2000)), - Shunt impedances of lines are considered.
information diffusion in online social networks (Barrat - Loads are modelled as constant admittances.
et al. (2008)), resistor networks (Kirkpatrick (1973)), for- - All generator have the same constant of inertia (13,08
est fires (Drossel and Schwabl (1992)), among others. s) and damping constant (2 s).
462
CHAPTER 13. POWER SYSTEMS
Frequency [Hz]
59
14
13 1
58.5
12 2
58
11 3
57.5
0.5 1 1.5 2 2.5 3 3.5 4
time [s]
10 4
Fig. 2. Frequency variation due to fault on bus 1
9 5 60
Gen 01
Gen 02
59.95 Gen 06
8 6
Gen 08
7
59.9
59.8
59.75
For the transient stability simulations it is used the Power
System Analysis Toolbox - PSAT. For this specific case, all
59.7
faults are balanced three phase faults on the buses, not on
the high voltage lines; which means there are not changes 59.65
of topology before or after the fault. In addition, all faults
had a duration of 2 seconds. 59.6
0.5 1 1.5 2 2.5 3 3.5 4
For each system, a fault is simulated on every bus asso- time [s]
463
CHAPTER 13. POWER SYSTEMS
REFERENCES
Albert, R. and lszl Barabsi, A. (2002). Statistical mechan-
ics of complex networks. Rev. Mod. Phys.
A.R. Bergen, V. (1999). Power Systems Analysis. Tom
Robbins, USA.
Barrat, A., Barthlemy, M., and Vespignani, A. (2008).
Dynamical Processes on Complex Networks. Cambridge
University Press, New York, NY, USA, 1st edition.
ben Avraham, D. and Glasser, M.L. (2007). Diffusion-
limited one-species reactions in the bethe lattice. Jour-
nal of Physics: Condensed Matter, 19(6), 065107.
Broadbent, S.R. and Hammersley, J.M. (1957). Percola-
tion processes. I. Crystals and Mazes. Proceedings of the
Cambridge Philosophical Society, 53, 629–641.
Drossel, B. and Schwabl, F. (1992). Self-organized critical
Fig. 4. Unsynchronized generators in IEEE-14 bus test case forest-fire model. Phys. Rev. Lett., 69, 1629–1632.
J.Grainger, W. (1994). Power System Analysis. United
States of America. McGraw-Hill Education, USA.
J.Machoswki, J.W.Bialek, J. (2008). Power System Dy-
namics: Stability and Control. John Wiley y Sons,
United Kingdom.
Kirkpatrick, S. (1973). Percolation and conduction. Rev.
Mod. Phys., 45, 574–588.
Kundur, P., Paserba, J., Ajjarapu, V., Andersson, G.,
Bose, A., Canizares, C., Hatziargyriou, N., Hill, D.,
Stankovic, A., Taylor, C., Cutsem, T.V., and Vittal, V.
(2004). Definition and classification of power system
stability ieee/cigre joint task force on stability terms
and definitions. IEEE Transactions on Power Systems,
19(3), 1387–1401.
Mesbahi, M. and Egerstedt, M. (2010). Graph theoretic
methods in multiagent networks. Princeton series in ap-
plied mathematics. Princeton University Press, Prince-
ton (N.J.).
M.Pavella, P. (1994). Transient Stability of Power Sys-
Fig. 5. Unsynchronized generators in IEEE-14 bus test case tems: Theory and Practice. John Wiley y Sons, United
using an overdamping in machine 2 Kingdom.
Newman, M. (2010). Networks: An Introduction. Oxford
operate and prevent or decrease the potential damage of
University Press, Inc., New York, NY, USA.
a blackout.
P.Kundur (1994). Power System Analysis. United States
5. CONCLUSIONS of America. McGraw-Hill Education, USA.
re Regulacion de Energia y Gas CREG, C. (1995). Res.
025 de 1995 - Codigo de Redes. Ministerio de Minas y
This paper provides an approach of network transient
Energia - Republica de Colombia.
stability analysis using percolation and complex networks
S, S., G, W., L., D.A., N, J., and D, S. (2000). Social
theory.
percolation models. PHYSICA. A, 277, 239–247.
Dynamic equations demonstrates that transient stability Strogatz, S.H. (2001). Exploring complex networks. Na-
relies heavily on the networks topological properties. In ture, 410(6825), 268–276.
addition, the power grid characteristics show that it can Varaiya, P., Wu, F.F., and Chen, R.L. (1985). Direct
be easily modelled as a complex network allowing us to methods for transient stability analysis of power sys-
use percolation theory analysis tools on it. tems: Recent results. Proceedings of the IEEE, 73(12),
1703–1715.
For a selected system, a balanced three phase fault was Wehenkel, L. (1998). Automatic Learning Techniques in
simulated in the generator and it was observed the fre- Power Systems. Kluwer Academic Publishers Norwell,
quency variation of all the generators in the network. USA.
IEEE 14 bus system was tested and the results show
how the lost of synchronism spreads through the grid;
by identifying the bus associated to a generator which
fault causes the most accelerated spreading and placing
a control device on it can help the grid to delay and
464
CHAPTER 14
PROCESS CONTROL
CHAPTER 14. PROCESS CONTROL
Abstract: This paper presents the development of a novel control law with parameter updating for
trajectory tracking in a boiler-turbine system, which is a MIMO system where all state variables are
strongly coupled. The proposed controller is simple and based on comprehensible concepts. Its design
consists in representing the mathematical model using numerical methods, and then the control actions
are computed through a linear algebra technique to accomplish the target of trajectory tracking control,
by last, the parameter updating is based in the Lyapunov theory. The advantages of this method are the
condition for the tracking error tends to zero, and the calculation of control actions, are obtained simply
by solving a system of linear equations. Besides, the control technique has the ability to follow
trajectories for two outputs simultaneously.
Keywords: Control System Design, Linear Algebra, Adaptive Control, Nonlinear Model, Tracking
Trajectory Control.
466
CHAPTER 14. PROCESS CONTROL
controller for the boiler-turbine, that should be simple and et al., 2014, Cheein et al.,2016). This simple approach
easy to implement, is a challenging assignment. suggests that knowing the value of the desired state, a value
for the control action which forces the system to move from
The control of boiler-turbine has attracted the attention of its current state to the desired one may be found. The
many researches in the last years. Different control strategies theoretical fundamentals are based on easily understandable
have been proposed in the literature, where some of them use concepts and there is no need of complex calculations to
linearization techniques (Chen et al., 2004; Tan et al., 2005; attain the control signal. The adaptive scheme is obtained
Zheng et al., 2006; Tan et al., 2008; Sarailoo et al., 2012). analyzing the system stability and the asymptotic stability of
Also in (Dimeo et al., 1995; Fang et al., 2004; Wu et al., the complete controllers is proven through Lyapunov theory.
2009; Wu et al., 2010) different control methods based on
Hinf are applied. In (Fang et al., 2004) the loop-shaping H\inf The paper is organized as follows: in Section 2, the dynamic
method is used to design the feedback controller and the final model of a boiler-turbine is presented. The methodology of
controller is reduced to a multivariable PI form. In (Wu et al., the controller design is shown in Section 3. The development
2009; Wu et al., 2010), the nonlinear boiler-turbine of the parameter updating is detailed in Section 4. In Section
dynamics were represented by the Takagi and Sugeno fuzzy 5, the theoretical results are validated with simulation results
model. Then, a fuzzy Hinf state feedback control law was of the control algorithm. Finally, Section 6 presents the
synthesized in terms of linear matrix inequalities (LMIs). In conclusions.
(Thangavelusamy et al., 2013) the authors propose a Fuzzy
logic based Sliding Mode Control (FSMC) to a drum-type 2. DYNAMIC MODEL OF THE MOBILE ROBOT
boiler–turbine system with a proportional integral controller.
The drawback of these methods lies in the need of linearizing A 160MW oil-fired electrical power plant model is used in
the model for a certain range of operation. this paper constituted with a drum type boiler and a turbine.
The model is based on the P16/G16 at the Sydvenska Kraft
Several authors suggest the combined application of different AB plant in Malmö, Sweden (Bell et al., 1987),. This model
control techniques such as genetic algorithm (GA), fuzzy is widely used in the specialized literature (Tan et al., 2008),
control, gain scheduling, sliding mode, for the control of (Tan et al., 2005), (Fang et al., 2004; Sarailoo et al., 2012), .
boiler-turbine. In (Wu et al., 2012) the authors uses several The boiler dynamic model is provided by both physical and
techniques jointly, first, a nonlinear predictive control based empirical methods based on data acquired from a series of
on an online Recording Horizon Control (RHC) algorithm experiments and identifications which capture all the relevant
with genetic algorithm is designed. Second, an Hinf fuzzy characteristics of the process. The nonlinear boiler-turbine
controller is implemented. Finally a switching control dynamic model is given by the following equations:
strategy is applied to choose between the two described
controllers. In (Wu et al., 2013) is presented a data-driven
fuzzy modelling strategy and predictive controller for boiler- x1 0.0018u2 x19/8 0.9u1 0.15u3
turbine. In this work the behavoir of the boiler-turbine is
x2 (0.073u2 0.016) x1 0.1x2
9/8
(1)
dived into a number of local regions taking into account the
x (141u (1.1u 0.19) x ) / 85
measurement data, then a multimodel-based model predictive 3 3 2 1
control (MMPC) is developed. Finally a Data-driven Direct
Predictive Controller (DDPC) is designed with an online
update of the predictor. In (Ghabraei et al, 2015) a robust y1 x1
adaptive sliding mode controller (RASMC) for a y 2 x2 (2)
multivariable nonlinear model of boiler– turbine is presented. y 0.05(0.13073 x 100 q / 9 67.975)
First the authors develop an input–output feedback 3 3 e
467
CHAPTER 14. PROCESS CONTROL
0 0
TABLE I 0 5 x1 6 u3 x3
Typical operating points of Bell and Astrom model
#1 #2 #3 #4 #5 #6 #7 (5)
x0 75.60 86.40 97.20 108 118.8 129.6 140.4
1
0 0 0 x1
7 x1 0 x2
x20 15.27 36.65 50.52 66.65 85.06 105.8 128.9
8
1/8
u30 0.183 0.256 0.340 0.433 0.543 0.663 0.793 Where the parameters are the same ones that (1)
Au x Bx (6)
3. CONTROL DESIGN
Where (6) represent the real dynamic model of the boiler-
The basic objective of any design for boiler-turbine control
turbine. It will use to find the expressions of control actions
system is to make the electrical output y2 follows the load
for tracking trajectories.
demand rapidly while the water level y3 and drum steam
pressure y1 within the allowed limits are maintained. Besides
fulfilling the previous objective, the proposed controller is Remark 3:The real parameters of the system boiler-turbine
are θ 1 , , 9 , while the estimated parameters used by
T
capable of following the allowable trajectory for the outputs
y1 and y2, holding y3 constant within certain limits/bounds.
468
CHAPTER 14. PROCESS CONTROL
the proposed controller will be θˆ ˆ1 , , ˆ9 Where the values y3d, x1, u1, u2, u3 are known. The value of
T
. Finally the
x3ez is obtained using algorithms that find minimum of
error in the parameters election will be unconstrained multivariable function using derivative-free
θ ( θ θ) 1 , , 9 .
ˆ method (Brent, 2013).
T
1 x1d k1ex Using the process of Gaussian elimination (Strang, 2006) the
1
σ 2 x2 d k 2 ex
analytic expressions of the control actions are found
3 x3 ez k3 ex
2
1 3
3 u1 1 9 x1
1 1 6
3
is represented by en e x1
2
ex ex2
2
3
2
.
The proposed controller design is completely finished by
The value of x3ez is found using the latest expression of (2). It using the relationships given in (8), which will be used to
aims to force y3 for achieving the desired output value y3d by generate the control actions in (10).
varying x3ez
Theorem 1: If the system behaviour is ruled by (6) and the
y3 0.05(0.13073 x 100 q / 9 67.975) controller is designed by (10), then en 0 with t
3 x ,x e x ,u
1 3 1
3 ez 1 4. ADAPTIVE LAW
9
1
45.59u
9
1
2.514u
9
3
2.096
9
67.975 In this section the existence of parametric uncertainty is
considered, where the estimated parameters used by the
controller are far from the real values of the plant. This
uncertainty, when is not taken into consideration, affects the
y3 y3 d when x3 x3 ez performance of the developed controller. To compensate this
. (8) effect a methodology is proposed to adapt the internal
469
CHAPTER 14. PROCESS CONTROL
When parametric uncertainty is considered, the control action Analyzing the right side of the equation, we see that all terms
designed in (9) is modified using the estimated parameters of are pre-multiplied by matrices that contain terms which
the model instead of real parameters depend on the errors in the parameters. Therefore each term
can be written as a function of θ as follows:
ˆ 1σ A
uA ˆ 1Bx
ˆ (11)
0 0 0 0 0 0 0 0 0
Where 9
Bx 0 0 0 0 0 0 x1 8
x2 0 θ
ˆ ˆ ˆ ˆ ˆ x1 ˆ1ˆ6
1 1 4 1 4 6
1 9/8 Bx G 1 θ
ˆ 1 0
A x1 0
ˆ4
ˆ 1
Where G1 is a 3x9 matrix. In the same way the remaining
0
1/8
5
x1 terms are rewritten
ˆ4ˆ6 ˆ6
ˆ 1σ G θ
AA 2
ˆ 1
ˆ
AA Bx G θ
0 0 0 3
Bˆ ˆ7 x1 ˆ 0
1/8
8
Where G2 and G3 are a 3x9 matrix. Then (13) can be replaced
ˆ9 0 0 by
Aˆ A u x Bˆ B x (12) e x Ke x Gθ
Aˆ A Aˆ 1
σA ˆ x B
ˆ 1Bx ˆ B x V e x K e x e x G θ θ θˆ
T T T
(16)
which is equivalent to
Analyzing (16) is defined the following parameter-updating
laws for the proposed adaptive controller.
ˆ 1σ AA
σ x AA ˆ 1Bx
ˆ Bx
θˆ G e x
T
Analyzing the left side of the equation and considering (20) (17)
and (21) in the proof of Theorem 1 we have:
470
CHAPTER 14. PROCESS CONTROL
Substituting this expression in (16) we have applied on the original time-continuous system. The outputs
y1d(t) and y2d(t) will be variable trajectories while y3d(t) will
V e x K e x 0
T
(18) be maintained constant and equal to zero (Wu et al., 2010;
Chen, 2013), .
y
1 2
variables of the boiler-turbine and the desired states to be desired trajectory, Cy di
( t ) yi ( t ) dt the
followed of the controller. The adaptation block updating the
i
2 0
values of the θ̂ according (17) make that the tracking error quadratic error of the output yi with i 1, 2, 3 . Thus, the
e x (t ) tends to zero (18). cost function can be represented by the combination of all
quadratic errors,
C
1
C y1
Cy Cy (19)
2 2 3
Fig. 2. Scheme. Adaptive Control Law. y1d 9 sin(0.02(t 100)) 0.02(t 100) 86.4
y2 d 9 sin(0.03(t 100)) 0.05(t 100) 36.65
5. SIMULATION RESULTS
100 t 800
471
CHAPTER 14. PROCESS CONTROL
For each simulation, the controller parameters are Fig. 4. Monte Carlo experiment. Outputs Errors.
chosen in a random way, as specified in (Tempo et
al., 2007).
472
CHAPTER 14. PROCESS CONTROL
In this simulation the compared controller are C1= TS Despite that, the parameters tend to a fixed value.
Control, C2= Linear Control and C3 = our proposal
controller, where C1 and C2 are developed in(Wu et al.,
2010). Fig. 6 shows the outputs, where in D represents the
desired values for each output, while the Fig. 7 shows the
control action. As it can be seen the response of the controller
C3 have a better performance for the stipulated trajectory
specially for y2 that is the main variable to follow. In this one
it can be observed that the C3 follows more quickly the
desired requirements.
473
CHAPTER 14. PROCESS CONTROL
REFERENCES
Bell, R. and K. J. Åström (1987). Dynamic models for
boiler-turbine-alternator units: data logs and parameter
estimation for a 160 MW unit, Lund Institute of Technology,
Department of Automatic Control.
Brent, R. P. (2013). Algorithms for minimization without
derivatives, Courier Corporation.
Cheein, F. A. A., Scaglia, G., Torres-Torriti, M., Guivant,
J., Prado, A. J., Arnò, J., ... & Rosell-Polo, J. R. (2016).
Algebraic path tracking to aid the manual harvesting of olives
using an automated service unit. Biosystems Engineering,
142, 117-132.
Cheein, F. A. and G. Scaglia (2013). "Trajectory Tracking
Controller Design for Unmanned Vehicles: A New
Methodology." Journal of Field Robotics: n/a-n/a.
Chen, P.-C. and J. S. Shamma (2004). "Gain-scheduled ℓ
1-optimal control for boiler-turbine dynamics with actuator
saturation." Journal of Process Control 14(3): 263-277.
Fang, F., L. Jizhen, et al. (2004). Output tracking control
of a nonlinear boiler-turbine unit. Decision and Control,
Fig. 11. Evolution of parameters estimates using parameter
2004. CDC. 43rd IEEE Conference on.
updating
Ghabraei, S., Moradi, H., & Vossoughi, G. (2015).
Multivariable robust adaptive sliding mode control of an
6. ACKNOWLEDGEMENTS industrial boiler–turbine in the presence of modeling
imprecisions and external disturbances: A comparison with
This work was partially funded by the Consejo Nacional de type-I servo controller. ISA transactions, 58, 398-408.
Investigaciones Científicas y Técnicas (CONICET – National Khalil, H. K. and J. Grizzle (1996). Nonlinear systems,
Council for Scientific Research), Argentina and the Prentice hall New Jersey.
Universidad Nacional de San Juan (UNSJ), San Juan, Romoli, S., Scaglia, G. J. E., Serrano, M. E., Godoy, S.
Argentina. A., Ortiz, O. A., & Vega, J. R. (2014). Control of a Fed-
Batch Fermenter Based on a Linear Algebra Strategy. IEEE
7. CONCLUSION Latin America Transactions, 12(7), 1206-1213.
Sarailoo, M., B. Rezaie, et al. (2012). "MLD model of
In this paper, a new control law with parameter updating for boiler-turbine system based on PWA linearization approach."
trajectory tracking in a boiler-turbine was proposed. The International Journal of Control Science and Engineering
control actions for a zero-error trend, are obtained through 2(4): 88-92.
linear algebra techniques by means of the resolution of a Scaglia, G., L. Q. Montoya, et al. (2009). "Numerical
linear system. These techniques are based on simple methods methods based controller design for mobile robots." Robotica
and do not require the use of coordinate transformation, and 27(02): 269-279.
complicated adaptive scheme; the process model and the Serrano, M. E., Scaglia, G. J., Godoy, S. A., Mut, V., &
values of y1d and y2d are the only variables needed to be Ortiz, O. A. (2014). Trajectory tracking of underactuated
known. surface vessels: A linear algebra approach. IEEE
Transactions on Control Systems Technology, 22(3), 1103-
The main advantages of the proposed controller are the 1111.
simplicity of the design procedure and the ability to follow Strang, G. (2006). Linear Algebra and Its applications.
any path allowable for y1 and y2 since, in normal operating USA.
conditions. The different tests carried out in this work prove Tan, W., F. Fang, et al. (2008). "Linear control of a
the good performance of the proposed controller design boiler–turbine unit: Analysis and design." ISA transactions
procedure, even when they are compared with a controller of 47(2): 189-197.
the literature. In fact, the system behavior was tested, Tan, W., H. J. Marquez, et al. (2004). "Multimodel
reaching better performance than those ones obtained by (Wu analysis and controller design for nonlinear processes."
et al., 2010). The parameter updating is obtained through the Computers & chemical engineering 28(12): 2667-2675.
study of the stability analysis based on Lyapunov theory. The Tan, W., H. J. Marquez, et al. (2005). "Analysis and
convergence to zero of the tracking errors for the controller control of a nonlinear boiler-turbine unit." Journal of Process
and for the adaptation law were demonstrated by the Control 15(8): 883-891.
Lyapunov theory. The results proved that the proposed Tan, W., J. Liu, et al. (2004). "Tuning of PID controllers
controller is capable of tracking a desired trajectory with a for boiler-turbine units." ISA transactions 43(4): 571-583.
small error when the dynamic parameters are adapted. Tempo, R. and H. Ishii (2007). "Monte Carlo and Las
Vegas Randomized Algorithms for Systems and Control*:
474
CHAPTER 14. PROCESS CONTROL
0 0 0 0 0 0 0 0 0
1 1
A A σ A Bx x Bx Bx 0 0 0 0 0 0 9
x1 8 x2 0 θ
σ Bx x Bx (20) 0 0 0 0 0 0 0 0 x1
σ x 0 (25)
Bx G 1θ
According to (7) we have σ x e x Ke x , where
Proceeding in the same way and using the Symbolic Math
K diag k1 , k 2 , k3 0 and e x x1 d x1 , x2 d x2 , x3 ez x3 ,
T
V
1
ex ex 0
T
(22) positions a11 , a12 , a13 , a24 , a35 , a36 , whose expression are
2
V ex ex ex K ex 0
T T
(23)
475
CHAPTER 14. PROCESS CONTROL
ˆ ˆ x 8
Whose nonzero elements are:
2 ˆ2ˆ 3ˆ5ˆ 1ˆ
1
3ˆ3 ˆ
a11 ˆ1 ˆ1ˆ6
1
14 1 4 6
b11 x ˆ ˆ ˆ ˆ ˆ ˆ ˆ ˆ ˆ x
2
1 3 5 7 3 4 9 2 6 8 2
a12 ˆ4
1
ˆ3ˆ5ˆ8 x2 x1 8
ˆ2ˆ6ˆ7 x1
9
8
ˆ ˆ ˆ
a13 ˆ3
1
2ˆ3 x1 8 1 4 6
ˆ ˆ ˆ8 x2 ˆ7 x1
9
6
8
4 6
b12 ˆ ˆ
4 4
2
a24 ˆ4 ˆ5ˆ7 x1 ˆ ˆ ˆ
1
1 ˆ ˆ
4 6 6 4 6
2 x1
a35
8
9
ˆ4 ˆ8 x2 ˆ7 x1 8
b24 ˆ
1 4
2ˆ5 x1
a36 ˆ3
8
ˆ x ˆ x
9
ˆ ˆ
x1 8 9
6 4 6
b35 ˆ4 8 2 7 1
8
9
ˆ9 x2
ˆ x
ˆ
ˆ 1Bx
ˆ is equivalent to: 5ˆx1ˆ
8 9
The expression of the AA b36 ˆ6 8 2
ˆ7 x1 8
4 6
b11 b19
ˆ θ G θ The matrices G 1 , G 2 , G 3 are completely constituted by
ˆ Bx
AA
1
(27)
3
know values. The matrix G is equal to G G 2 G 3 G1
b91 b99
and your expression is
T
a11 b11 0 0
a12 b12 0 0
a13 b13 0 0
0 a24 b24 0
Gθ = 0 0 a35 b35
θ
0 0 a36 b36
9
0 x1 8
0
0 x2 0
0 0 x1
476
CHAPTER 14. PROCESS CONTROL
Abstract: In this paper is presented an approach of Numerical Methods Controller (NMCr) based on an
empirical linear model of the processes. The controller is developed for self-regulating processes with an
open loop response similar as a first order plus dead time (FOPDT) model, and can be tuned using the
characteristic parameters taken from the reaction curve. The performance of the proposed controller is
tested in two nonlinear chemical processes and the results are compared, by simulations, against a PID
controller using the ISE performance index to measure it.
Keywords: numerical methods, chemical processes, nonlinear processes, model approximation, FOPDT,
characteristic parameters.
FOPDT model, also tuning equations are presented based on
1. INTRODUCTION
the characteristic parameters of the process, which are
The methodology based on linear algebra and numerical obtained from the reaction curve method. The results
methods concepts to design control algorithms is a simple obtained, using the proposed approach, in several higher
and relatively new method that allows the control of highly order linear systems showed a good performance.
nonlinear systems. It have been applied to the design of
This work shows the application of the proposed Numerical
different class of control systems such as trajectory tracking
Method Controller (NMCr), based on a FOPDT model of the
of mobile robots and UAV’s (Scaglia, G. et al. (2006, 2009),
actual process, to nonlinear chemical processes. The
Rosales, A. et al. (2010), Guerrero, F. et al. (2013)), chemical
performance and robustness of the proposed controller are
plants and bioprocesses (Quintero, O. L. et al. (2008),
tested through simulations in two nonlinear chemical
Scaglia, G. et al. (2014), Suvire, R. et al. (2013), Godoy, S.
processes. The first process is a mixing tank with variable
et al. (2013)) just to name some applications. Its main
dead time, and the second process is a continuous stirred tank
advantage is that the conditions for the tracking error tends to
reactor. In both cases the results are compared against a PID
zero and control actions are obtained with low computational
controller.
cost which makes it easy to implement in a microcontroller.
These conditions are found by solving a system of linear This paper is organized as follows: Section II presents the
equations and finding the conditions for the system to have basic concepts required to controller design, section III
exact solution. presents the development of the proposed controller. Then in
section IV the simulation results are presented divided in two
For all the previous applications, the controllers were
cases of study and finally conclusions and future works are
designed based on the complete process model. Therefore,
presented in section V.
for each different process a different control law is produced
and its application is just for the process under analysis. 2. BASIC CONCEPTS
Besides, the development of a complete model for industrial
processes is difficult mainly due to the complexity of the 2.1 First Order Plus Dead Time System (FOPDT)
process, the lack of knowledge of some process parameters
and the possible higher order of the manipulated variables. Introducing a step change at the process input and then
Therefore, the methodology of numerical controller method recording the transmitter output, produces a reaction curve
using complete models can produce more complex that represent the dynamic behaviour of actual process
controllers. An efficient alternative modelling method for (Seborg, D. E. et al. (2011)). The typical open loop response
process control is the use of empirical models, most times is overdamped as is shown in Fig. 1, and it is called a FOPDT
FOPDT models can work satisfactory for analysis and design system. The transfer function of a First Order Plus Dead
of process control (Camacho O. and Smith C, 2000). Time system is:
( )
In a previous work, Guevara, L. et al. (2016) presented an ( ) (1)
approach of numerical methods controller (NMCr) based on a ( )
477
CHAPTER 14. PROCESS CONTROL
Where, is the gain of the process, is the time constant of 3. CONTROLLER DESIGN
the process and is the dead time or delay.
In this section, it is shown the development of the proposed
numerical methods controller based on a FOPDT model. The
transfer function with the Taylor approximation is considered
for design purposes. Simplifying some terms in (3) and
reorganizing for ease of calculation, two parameters ( and
) are included, they contain the characteristic parameters
of the system ( and ). Consecuently, The transfer function
can be represented as follows:
( )
( ) (5)
( )
Where:
(6)
Fig. 1. Typical Response FOPDT.
(7)
2.2 Taylor Approximation for Dead Time
Then, solving (5) a second order differential equation is
Dead time is not specified in the methodology based on linear obtained:
algebra or numerical methods, therefore, an approximation to
replace the dead time must be considered. For instance, in ̈ ̇ (8)
this article the Taylor approximation is used, similar to Representing (8) in matrix form, it is found:
Camacho, O. et al. (1997). Hence, the dead time expression is
substituted as follows: ̇
[ ] [ ][ ] [ ] (9)
̇
(2)
Where, and are the state variables of the system and
Replacing (2) in (1), it is obtained: the controller output. Then, the system can be described by
the following differential equations:
( ) (3)
( )( ) ̇ (10)
2.3 Euler Approximation ̇ (11)
Using the Euler approximation in the previous equations, the
This approximation is used for discretization of an integral
following equations are obtained:
term, replacing it by a finite difference. Thus, if the slope
value is known in a time period , an estimate value of the (12)
next state could be obtained using a linear
approximation (Chaves, E. (2010)). (13)
478
CHAPTER 14. PROCESS CONTROL
the integral of error multiplied by the second tuning Energy balance around tank:
parameter . For implement it, the integral term is represent
( ) ( ) ( ) ( ) ( ) ( ) ( ( )
by:
( )
( )) ( ) ( ) (21)
(15)
Where, is the total error accumulated represented by Pipe delay between the tank and the sensor location:
(16) and is the sampling period: ( ) ( ) (22)
(16) Transportation lag (Variable delay time):
Adding this integral term into (14), the following equation is (23)
obtained: ( ) ( )
Temperature transmitter:
(17)
( ) ( )
[ ( )] (24)
Assuming that the current value of variable is the same
required for a next state . Then, replacing (16) in (13) Valve position:
and solving the output controller , the control law is
( )
obtained: [ ( ) ( )] (25)
Valve equation:
( ) ( )√ (26)
(18)
Where:
Where:
( ) = mass flow of hot stream, lb/min
(19)
( ) = mass flow of cold stream, lb/min
Equation (19) represents the derivative term of the controlled = liquid heat capacity at constant pressure, Btu/lb-°F
variable. The sampling time value ( ) is recommended to be
in the range /10 < < /4. = liquid heat capacity at constant volume, Btu/lb-°F
The discrete approximation of the derivative, works well if ( ) = hot flow temperature, °F
the process does not present noise, but if noise is present, it is ( ) = cold flow temperature, °F
necessary to add a low-pass filter, as is done in a PID
controller when the derivative term is present. ( ) = liquid temperature in the mixing tank, °F
The integral term eliminates the steady state error, but a ( ) = equal to ( ) delayed by , °F
higher value of produces a more oscillatory response. = dead time or transportation lag, min
Therefore, it should be found a relationship between the two
tuning parameters. Therefore, several tests in systems with = density of the mixing tank contents, lbm/
different characteristics parameters were done, and
= liquid volume,
minimizing the Integral Squared Error (ISE), a criterion for
getting an initial value of was obtained. close to 1 is a ( ) = transmitter output signal on a scale from 0 to 1
good starting value, and substituting it into (20), the integral
tuning parameter can be obtained. ( ) = valve position, from 0 (closed) to 1 (open)
( ) = fraction of controller output, from 0 to 1
(20)
= valve flow coefficient, gpm/
4. STUDY CASES
= specific gravity, dimensionless
The proposed controller was tested in two very common
chemical processes whose models have already been used in = pressure drop across the valve, psi
previous works. = time constant of the temperature sensor, min
4.1 Mixing Tank = time constant of the actuator, min
The mixing tank is a nonlinear chemical process. The tank = pipe cross section,
receives two streams, a hot stream ( ), and a cold stream = pipe length, ft
( ). The outlet temperature is measured at a point 125 ft
downstream from the tank. The complete model (Camacho, The steady-state values and parameters used in this example
O. and Smith C., 2000), it is described by the following are presented in Camacho, O. and Smith C., (2000), except
equations. and which has new values: 80 ft and 0.08 respectively.
479
CHAPTER 14. PROCESS CONTROL
In the reactor, an exothermic reaction takes place (A→B). To test the proposed controller, set point changes were made
The reactor is surrounded by a jacked where a cooling liquid to test tracking references and also disturbances were
flows to remove the heat of reaction. The process information introduced to prove regulation tasks. The controller
and steady-state values used in this example were taken from performance is measured using the Integral of Squared Error
Rojas, R. et al. (2014). The complete model is described by (ISE). The results are compared with a PI controller, tuned
the following equations. using the equations proposed by Dahlin (Smith, C. et al.
(1997)), since this control is the most popular alternative in
Component balance on reactant A: process control.
( )
( ) ( ) ( ) (27) 5.1 Characteristic and tuning parameters
Energy balances on the reactor: Introducing a step change at the input (valve position) and
then recording the transmitter output, the characteristic
( )
( ) ( ) ( ) parameters of the FOPDT models are taken from the result
reaction curves and are presented in Table 1.
[ ( ) ( )] (28)
Table 1. Characteristic Parameters
Energy balances on the jacket: Process
Parameter
( ) ( )
[ ( ) ( )] [ ( ) ( )] (29) Mixing Tank 0.89 2.29 1.33 0.58
CSTR 1.03 5.55 2.45 0.44
The reaction rate for this CSTR:
( ) ( )
( ) (30) Using these values and formulas for Dahlin synthesis, the
tuning parameters for PI controller are obtained. The integral
The valve equation: parameter used in NMCr+I is obtained using (20). The tuning
( ) parameters of both controllers are presented in Table 2.
( ) (31)
Table 2. Controller Tuning Parameters
Where:
PI NMCr+I
( ) = concentration of reactant in the reactor, lbmol/ Process
480
CHAPTER 14. PROCESS CONTROL
481
CHAPTER 14. PROCESS CONTROL
482
CHAPTER 14. PROCESS CONTROL
483
CHAPTER 14. PROCESS CONTROL
Abstract: Fed-batch fermenters have gained particular attention due to its wide range of high valued
products production possibilities. Nowadays, the optimization and control of these systems is an
important task given its significant economic impact and particular characteristics. The paper proposes a
linear algebra based controller for nonlinear and multivariable bioprocesses. It involves finding the
control actions to make the system follow predefined optimal concentration profiles. The controller
parameters are selected with a Monte Carlo experiment. Finally, the good performance of the controller is
proved under normal conditions and adding perturbations in the control action.
Keywords: nonlinear dynamics control, fed-batch fermentation, penicillin production, profiles tracking
control.
to batch (Liang and Chen, 2003). All this avoid the
1. INTRODUCTION
possibility of using classic industrial controllers, like PI and
On the one side, the increasingly demand of high added value PID controllers, being necessary to implement control
products, with specific characteristics for the market, has algorithms specifically developed for bioprocesses (De
made essential the dispose of reliable tools in the production Battista et al., 2012).
area. Due to this fact, in the last few years the optimization
The objective of this work is to develop a control technique
and control techniques development has become a
for a fed-batch penicillin production process. It consists in
fundamental topic for scientific investigations. On the other
tracking predefined state variables profiles (preferably
side, the bioprocesses industry has demonstrated to be an
optimized ones) with minimal error. The methodology was
excellent option for the obtaining of a long variety of
originally designed for robotics systems by Scaglia et al
products in a natural and ecological way. Although, this kind
(Scaglia et al., 2009, Scaglia et al., 2010) and lately extended
of processes have many complications, and that is what make
to other systems like unmanned vehicles (Cheein and Scaglia,
them to be the biggest challenge from control engineer’s
2014), chemical processes (Serrano et al., 2014a),
viewpoint (Ashoori et al., 2009).
underactuated surface vessels (Serrano et al., 2013, Serrano et
The biological processes have been characterized for using al., 2014b), between others, all of them with excellent results
microscopic organisms to obtain valuable substances. For in the path tracking.
example: recombinant proteins, vaccines and antibiotics in
The main advantage of this procedure is the application of
the pharmaceutical industry, or beer, wine, yeast in the
linear algebra for the controller design. Here the control
manufacturing of agro-food goods, biogas and compost in the
action (substrate feed rate) is obtained solving a linear
treatment of urban and industrial solid organic wastes and
equations system, although the controller structure rises from
wastewater (Mangesh and Jana, 2008), or biopolymers in the
a nonlinear mathematical model. This means that the
chemical industry (Chung et al., 2015), between others.
controller performance is independent from the operation
A bioreactor can be operated in one of the following forms: point, obtaining good result even when the initial conditions
batch, fed-batch or continuous. The continuous operation and the parameters change.
mood is characterized to work in a stationary way, while in
The paper is organized as follows: The description of the
batch and fed-batch procedures, the states vary in the time.
system and the process is presented in Section 2. The
This particularity of batch and fed-batch processes added to
controller design and the selection of its parameters are
the many difficulties that any bioprocess present, make the
exposed in Section 3. While in Section 4, the simulation
control of them to be an arduous task to achieve. Some of the
results are shown, this include: the performance under normal
mentioned complications are: the nonlinear and unstable
conditions and with perturbation in the control action.
dynamic behavior of microorganisms which means strong
Finally, in Section 5, the conclusions are exposed.
modeling approximations; the presence of numerous external
disturbances; troubles for most of the representative variables
2. SYSTEM AND PROCESS DESCRIPTION
on-line measurement; both the initial states of the process and
the parameters of the model may vary randomly from batch
484
CHAPTER 14. PROCESS CONTROL
The system under study is a fed-batch bioreactor for YX/S Yield factor for substrate to biomass (gX/g S) 0.47
penicillin production. The substrate and the microorganism
used are glucose and Penicillum crysogenum, respectively. YP/S Yield factor for substrate to product (gP/g S) 1.2
The equations that model the process were originally SF Feed concentration (gS/L) 500
proposed in (Cuthrell and Biegler, 1989), from which many
optimization and control papers have been written (Lim et al.,
1986, Luus, 1993, Riascos and Pinto, 2004, Ronen et al.,
2002). It is a single input multi output system (SIMO), where 3. CONTROLLER DESIGN
the input is the substrate feed rate, and the outputs are the
cells, product (penicillin) and substrate concentrations inside 3.1 Controller Structure
the reactor. The mathematical model of the process is:
For the application of this technique, it is necessary to know
X the mathematical model that represents the process, and the
X t µ X , S X U reference profiles that are expected to be followed by the
S FV
P
system.
P t S X K deg P U (1)
S FV The most important variables within de bioreactor are the
S t µ X , S X S P ms S X 1 S U
cells and product concentration, so their reference profiles are
YP / S K m S
YX / S SF V expected to be tracked by the controller.
Table 2. Parameters of penicillin biosynthesis model In order to achieve the objective presented above, numerical
methods are used to determine the evolution of the system.
Parameter Definition Value The method developed by Euler is used in order to integrate
µmax Maximum specific biomass growth rate (h-1) 0.11 numerically an ordinary differential equation given an initial
ρmax Maximum specific production rate (gP/gX h) 0.0055 value.
485
CHAPTER 14. PROCESS CONTROL
ref n 1 S ref n n
Replacing Eq. (4) in (3) is obtained the following expression. S k S S S Xn Xn ms S n
µ X n , Sn Sn X n X n
n
486
CHAPTER 14. PROCESS CONTROL
The following procedure aims to find the best values for kζ,
such that the tracking error is minimized. In this test, the
Monte Carlo algorithm is applied. The experiment consist in
simulate the process a number of times using random values
of kζ. Then, the total error is calculated for each iteration. The
kζ that make a minimum total error are selected.
To determine the number of simulations (N) is used Eq. (14)
(Tempo and Ishii, 2007). Note that in order to limit the
chance of a wrong answer, an appropriate confidence (δ) and
accuracy (ε) must be indicated.
1
log
N (14)
1
log
1 Fig. 2. Reference and real profiles of cells and penicillin
Depending on the precision to be obtained, δ and ε values are concentration, obtained under normal operation conditions.
selected. For this study, δ=0.01 and ε=0.005 Consequently,
N=1000.
The tracking error ||en|| was defined in Eq. (13) and the total
error is found with the following expression:
1250
E= en (15)
n 1
Figure 2 present how the real cells and product concentration Figure 4 shows the perturbed control actions. Figures 5 and 6
follow perfectly the references. Finally, Fig. 3 shows the show the tracking of cells and product profiles in the
tracking error. As it can be seen, this error tends to zero as the perturbed system. Finally, Fig. 7 shows the tracking error.
process moves forward. The disturbances in the control action cause an increasing of
the tracking error, however, it remains at acceptable levels.
487
CHAPTER 14. PROCESS CONTROL
6. CONCLUSIONS
This controller has several advantages over others: it can be
easily applied, there is no need to have advanced knowledge
about automatic control to do it; the methodology has less
mathematical complexity because it allows obtaining the
control action as a solution of linear equations system,
although the controller structure arises from a nonlinear
mathematical model. As a result, this technique can be used
in a long variety of systems. Furthermore, this controller is
versatile against different disturbances; this was supported by
the tests made, which results show that the controller
manages to achieve the reference profile successfully at any
time, and prove its good performance.
Fig. 5. Reference and real cells concentration profiles
applying perturbations in the control action.
ACKNOWLEDGMENT
Grateful to the National Council of Scientific and
Technological Research (CONICET) and the Chemical
Engineering Institute (IIQ) from the National University of
San Juan for making possible this research.
REFERENCES
ASHOORI, A., MOSHIRI, B., KHAKI-SEDIGH, A. &
BAKHTIARI, M. R. 2009. Optimal control of a
nonlinear fed-batch fermentation process using
model predictive approach. Journal of Process
Control, 19, 1162-1173.
CUTHRELL, J. E. & BIEGLER, L. T. 1989. Simultaneous
optimization and solution methods for batch reactor
control profiles. Computers & Chemical
Engineering, 13, 49-62.
Fig. 6. Reference and real penicillin concentration profiles
CHEEIN, F. A. & SCAGLIA, G. 2014. Trajectory tracking
applying perturbations in the control action.
controller design for unmanned vehicles: A new
methodology. Journal of Field Robotics, 31, 861-
887.
CHUNG, H., YANG, J. E., HA, J. Y., CHAE, T. U., SHIN, J.
H., GUSTAVSSON, M. & LEE, S. Y. 2015. Bio-
based production of monomers and polymers by
metabolically engineered microorganisms. Current
opinion in biotechnology, 36, 73-84.
488
CHAPTER 14. PROCESS CONTROL
DE BATTISTA, H., PICÓ, J. & PICÓ-MARCO, E. 2012. TEMPO, R. & ISHII, H. 2007. Monte Carlo and Las Vegas
Nonlinear PI control of fed-batch processes for Randomized Algorithms for Systems and Control*:
growth rate regulation. Journal of Process Control, An Introduction. European journal of control, 13,
22, 789-797. 189-203.
GEORGE, J. 2014. On adaptive loop transfer recovery using
Kalman filter-based disturbance accommodating
control. Control Theory & Applications, IET, 8, 267-
276.
LIANG, J. & CHEN, Y. 2003. Optimization of a fed-batch
fermentation process control competition problem
using the NEOS server. Proceedings of the
Institution of Mechanical Engineers, Part I: Journal
of Systems and Control Engineering, 217, 427-432.
LIM, H., TAYEB, Y., MODAK, J. & BONTE, P. 1986.
Computational algorithms for optimal feed rates for
a class of fed‐batch fermentation: Numerical results
for penicillin and cell mass production.
Biotechnology and Bioengineering, 28, 1408-1420.
LUUS, R. 1993. Optimization of fed‐batch fermentors by
iterative dynamic programming. Biotechnology and
Bioengineering, 41, 599-602.
MANGESH, M. G. & JANA, A. K. 2008. A comparison of
three sets of DSP algorithms for monitoring the
production of ethanol in a fed-batch baker’s yeast
fermenter. Measurement, 41, 970-985.
RIASCOS, C. A. & PINTO, J. M. 2004. Optimal control of
bioreactors: a simultaneous approach for complex
systems. Chemical Engineering Journal, 99, 23-34.
RONEN, M., SHABTAI, Y. & GUTERMAN, H. 2002.
Optimization of feeding profile for a fed-batch
bioreactor by an evolutionary algorithm. Journal of
biotechnology, 97, 253-263.
SCAGLIA, G., MONTOYA, L. Q., MUT, V. & DI
SCIASCIO, F. 2009. Numerical methods based
controller design for mobile robots. Robotica, 27,
269-279.
SCAGLIA, G., ROSALES, A., QUINTERO, L., MUT, V. &
AGARWAL, R. 2010. A linear-interpolation-based
controller design for trajectory tracking of mobile
robots. Control Engineering Practice, 18, 318-329.
SERRANO, M., SCAGLIA, G., ABALLAY, P., ORTIZ, O.
& MUT, V. 2014a. LINEAR ALGEBRA AND
OPTIMIZATION BASED CONTROLLER
DESIGN FOR TRAJECTORY TRACKING OF
TYPICAL CHEMICAL PROCESS. Latin American
Applied Research, 44, 313-318.
SERRANO, M. E., SCAGLIA, G., MUT, V., ORTIZ, O. &
JORDAN, M. 2013. Trajectory Tracking Controller
Based on Linear Algebra: a case study in
underactuated surface vessels. Journal of Control
Engineering and Applied Informatics, 15, 15-25.
SERRANO, M. E., SCAGLIA, G. J., GODOY, S. A., MUT,
V. & ORTIZ, O. A. 2014b. Trajectory tracking of
underactuated surface vessels: A linear algebra
approach. Control Systems Technology, IEEE
Transactions on, 22, 1103-1111.
STRANG, G. 2006. Linear algebra and its applications,
Thomson, Brooks/Cole, Belmont, CA. ISBN 0-030-
10567-6.
489
CHAPTER 14. PROCESS CONTROL
Abstract: En este artículo se parte de un estudio que indica la necesidad de la comunidad caficultora del
departamento de Antioquia de contar con un equipo que permita determinar la calidad de granos de café
tostados de manera objetiva y automática. Se analiza el proceso de beneficio del café, sus componentes y
principales parámetros para determinar la calidad posterior al proceso de tostión (o torrefacción). Se
plantea la determinación de 3 atributos principales: densidad, color y humedad, este último de gran
complejidad de estimación, debido a la dificultad para adquirir sensores que permitan hacerlo
directamente en grano. Se propone una metodología basada en inteligencia artificial con redes neuronales
para hacer dicha estimación y se diseña la mejor alternativa de un dispositivo portátil para lograr los
resultados antes descritos.
Keywords: Análisis de calidad, granos de café tostados, visión por computador, software libre, bajo
costo.
el concepto de “Cafés Especiales” (Cooperativa de
1. INTRODUCCIÓN
Caficultores de Antioquia).
Colombia ha sido por tradición un país exportador de café de
En el año 2014, en la Gobernación de Antioquia se lanzó el
alta calidad. Según (Federación Nacional de Cafeteros de
concurso “¿Quién se le mide?”, mediante el cual se
Colombia, 2016) mientras que en el mes de octubre de 1956
identifican problemas de la región en diferentes áreas, una de
se producían 279.000 sacos de café de 60kg cada uno, en la
ellas la de Cafés Especiales, en la cual la Secretaría de
actualidad en el mes de mayo de 2016 se producen
Productividad y Competitividad indicó, a partir de un estudio
1’163.000. De forma similar, el precio en el mismo mes en
y como conocedor del sector caficultor del departamento, la
1956 era de 74.5 centavos de dólar por libra de 453.6gr
sentida necesidad de un equipo que permita evaluar la calidad
excelso, y en 2016 es de 144.66 (Federación Nacional de
en los granos de café tostados.
Cafeteros de Colombia, 2016). Las exportaciones de Café
para el mes de abril de 2016 fueron de 92.283 toneladas Dicho equipo debe evaluar algunos atributos que permiten
métricas por valor de US277.374.000 (Departamento determinar la calidad del café, en grano, posterior al proceso
Administrativo Nacional de Estadística, 2016). de tostión. Además, para adaptarse a las necesidades del
sector, debe ser un equipo no invasivo, de modo que no se
Estas cifras dan cuenta del crecimiento del sector, la
destruya la muestra, que requiera poca cantidad de la misma,
valorización del Café en el mercado internacional, y la
posibilidad de medición in situ, material de fabricación no
importancia del producto para la economía Colombiana. Sin
contaminante del alimento, peso bruto inferior a 10Kg,
embargo ha sido un sector golpeado por altibajos en los
equipo resistente a la temperatura y humedad, entre otras
precios, la tendencia ha sido al alza en consumo a nivel
características (Gobernación de Antioquia, 2014).
global, aunque un poco más baja en mercados como la Unión
Europea, EEUU y Japón, que juntos representan más del 50% La metodología desarrollada y estructura del artículo es la
del consumo global, presenta más dinamismo en mercados siguiente: primero se determinan cuáles son los atributos de
como África y Asia (Organización Internacional del Café, calidad más relevantes, particularmente para los cafés
2015). especiales del departamento de Antioquia (capítulo 2).
Posteriormente se hace un análisis sobre la viabilidad de
Como una estrategia para brindar valor agregado al café de la
implementar un dispositivo que pueda determinarlos de
región Antioqueña, la Gobernación de Antioquia y la
forma objetiva, se selecciona la tecnología más adecuada para
Cooperativa de Caficultores de Antioquia han buscado
tal fin (capítulo 3) y se implementa buscando viabilidad
posicionar el café de esta región como un café de altísima
económica (capítulo 4). Finalmente se realiza la validación a
calidad, con atributos consistentes, verificables y sostenibles
partir de muestras de diferentes niveles de tostión y se
y por los cuales están dispuestos a pagar precios superiores,
obtienen resultados y se analizan (capítulo 5).
que redunden en un mayor bienestar de los productores, con
490
CHAPTER 14. PROCESS CONTROL
2. ATRIBUTOS PARA DETERMINACIÓN DE - Sabor: se puede calificar como suave, dulce, ácido,
CALIDAD EN CAFÉ TOSTADO afrutado, pronunciado, alto y propio del café.
Antes de analizar la calidad de un grano de café tostado, es - Cuerpo: corresponde a la persistencia que tiene la bebida en
necesario conocer el proceso que éste surte para poder la boca y a la manera como se desplaza por la lengua hacia la
convertirse en un grano de la calidad exacta para producir el garganta. Una buena bebida de café presenta cuerpo
resultado esperado en una taza de café. La primera parte es el completo, moderado o balanceado
beneficio, mediante el cual el café pasa del árbol a la taza.
- Acidez: se refiere a esa chispa ligeramente picante que se
2.1 Beneficio del café siente en la lengua y que hace que el bebedor se estremezca
por un instante. Puede llegar a ser indeseable cuando se
El proceso de beneficio del café, aunque no es la finalidad de
califica como agria, vinosa, picante, acre, astringente o
este artículo describirlo detalladamente, consiste en las
ausente
siguientes etapas:
- Impresión Global: se refiere a la calificación general de la
- Cosecha: se recolecta del cultivo
bebida de café.
- Despulpado: se retira la pulpa por medios mecánicos.
A nivel de ingeniería, algunas de estas características son
- Retiro del mucílago: se retira por medio de tanques de difíciles de obtener, puesto que dependen de la percepción de
fermentación o vía mecánica un experto. Esta dependencia del ser humano es a su vez una
desventaja, debido a que esta persona está sujeta a una gran
- Lavado: se elimina totalmente el mucílago.
cantidad de factores externos que afectan la medición, como
- Secado: se elimina humedad vía exposición al sol o en enfermedades, estado anímico, nutricional, etc. Es por ello
equipos mecánicos. que la ingeniería debe ayudar a la determinación de ciertas
características que indiquen calidad en los granos de café y
- Trilla: se obtiene el café almendra o café verde, retirando el de allí la necesidad que indica el estudio de la Gobernación
pergamino. El resultado es el café excelso. de Antioquia.
- Tostión: se tuesta el café en hornos, lo que cambia sus 2.4 Atributos de calidad
propiedades físicas y químicas. También conocido como
torrefacción. El proceso varía dependiendo de la forma cómo En general, cualquier característica física o química que se
se aplica el calor, intensidad y tiempo del mismo. pueda medir de forma exacta en el café tostado, generará un
indicador más para poder determinar su calidad. Sin
- Molido: reducción del tamaño para facilidad de extracción embargo, no hay instrumentación disponible para hacer
de aromas y compuestos solubles cualquier medición que se desee. El primer paso es entonces
2.2 Composición determinar cuáles son estos atributos requeridos para las
necesidades particulares, y posteriormente determinar, desde
El café tostado está compuesto por diferentes elementos, los la ingeniería, cuál es la forma más adecuada de hacer estas
cuales, si bien no son indicadores directos de calidad, de mediciones, e incluirlas en un equipo con las características
forma indirecta afectan o benefician la calidad de los granos antes descritas.
(Café de Colombia, 2010):
Según el informe del estudio realizado por la Secretaría de
- Agua: un grano verde contiene del 10 al 13%, mientras que Productividad y Competitividad de la Gobernación de
uno tostado no debe superar el 5%. Antioquia (Gobernación de Antioquia, 2014), las
- Materias grasas: un grano contiene del 15 al 20% de materia características más importantes que se deben medir en el café
grasa para las necesidades de Antioquia son:
- Proteínas: un grano de café verde contiene un 11%. - Color: mundialmente medida en escala Agtron (o número
Agtron), el cual va de 0 a 100, siendo 0 el más oscuro y 100
- Alcaloides: La cafeína es el principal alcaloide. El café el más claro.
Arábigo contiene de 1 a 1.5%, lo cual lo hace más suave que
el Robusta con un 1.6 al 2.7%. - Densidad: expresada en g/l.
- Materias minerales: pequeñas cantidades de potasio, calcio, - Peso: no se tuvo en cuenta, por ser irrelevante a nivel de
magnesio y fósforo. ingeniería, ya que depende del tamaño de la muestra. Se
considera que ya está incluida en la densidad. Se exige
- Ácidos clorogénicos: expertos atribuyen a estos ácidos las mínimo precisión de 0.01g.
propiedades antioxidantes y antivirales.
- Humedad: en rangos del 1 al 70%.
2.3 Propiedades organolépticas
- Dureza: medida en kgf, se plantea como variable opcional
Son aquellas que pueden ser detectadas a través de los para el equipo.
sentidos y, para las cuales, los catadores expertos en café
pueden hacer una apreciación subjetiva de la calidad del Con la especialización que se tiene en cafés especiales en el
grano tostado. Estas son (Café de Colombia, 2010): departamento de Antioquia y aledaños, es imposible
determinar cuáles características son las ideales para un buen
- Aroma: es un olor bien conocido. café, ya que éstas dependen del mercado en el cual se va a
491
CHAPTER 14. PROCESS CONTROL
492
CHAPTER 14. PROCESS CONTROL
Un algoritmo que identifique los picos en cada una de las 3 Existen formas de medir de forma indirecta una variable, a
curvas R, G y B a través de una cámara digital, permitirá partir de otra u otras cuyo valor ya se conozca. Para esto se
determinar qué tan oscuro o claro están los granos. Sin requiere de una gran cantidad de datos obtenidos con equipos
embargo, nótese que n los niveles más negros hay una gran calibrados para dichas variables, de modo que se pueda
cantidad de pixeles, debido a sombras; dichos pixeles no estimar las características o parámetros en diversas áreas de
deben ingresar al algoritmo de detección de valores máximos. estudio, que son de gran interés para los investigadores, que
Para el caso de la figura, se podría determinar que el máximo trabajan en lograr que sus modelos predigan estos parámetros
de rojo verde y azul están en valores en el eje horizontal de con un alto grado de certeza y confiabilidad. Según la
52, 65 y 23 respectivamente, el promedio de estos 3 valores literatura, la técnica más utilizada para obtener estimaciones
es 46,67, el cual es un claro indicador de la oscuridad o de parámetros es el clásico análisis de regresión; sin
claridad de los granos presentes. embargo, la calidad de las estimaciones por medio de esta
técnica se ven limitadas cuando se presenta un problema no
Posteriormente este valor se puede convertir a escala Agrton lineal (Baykasolu, Dereli, & Tani, 2004). Las redes
a través de una relación lineal y, por medio de un instrumento neuronales artificiales establecen un método distinto que se
calibrado en dicha escala, se puede obtener el valor de color basa en el aprendizaje y determinación de las relaciones
de forma precisa. presentes entre las variables de entrada y salida de un
conjunto determinado de datos.
3.2 Densidad
Este atributo es de fácil medición, debido a que en el En (Muharrem, Sema, Serap, & Yurtsever, 2013) se utiliza el
mercado se encuentran fácilmente básculas digitales con color para estimar nivel de humedad y niveles nutricionales
precisión de 0.01g. Se requiere posibilidad de comunicación de hojas de maní, y en (Behroozi Khazaei, Tavakoli Hashjin,
con una unidad central de procesamiento, puesto que la Ghassemian, Khoshtaghaza, & Banakar, 2013) se utiliza la
indicación es típicamente visual, por lo tanto con algún visión para estimar el nivel de humedad a partir del
protocolo de comunicación serial sería posible enviar el dato encogimiento de la uva al secarse y las características del
de peso a la unidad central. La báscula seleccionada es marca cambio de color, utilizando regresiones lineales.
BBG.
Las redes neuronales artificiales son un modelo simplificado
Se debe garantizar una muestra de café de volumen del cerebro y tienen la capacidad de emular algunas
constante, en un recipiente idóneo para manipulación de características propias de los humanos, como la capacidad de
alimentos, preferiblemente fabricado en vidrio o acero memorizar y de asociar acciones, su forma de obtener
inoxidable. Se selecciona una placa de Petri de cristal de gran conocimiento es a partir de ejemplos, es decir, no se
utilización a nivel de laboratorios. Su peso es de 32g y establecen reglas predefinidas para llegar a una solución, sino
volumen interno una vez ubicada la tapa es de 80ml. que la red neuronal define sus propias reglas durante el
proceso de aprendizaje, redefiniendo su comportamiento para
Conocido el peso (m) y volumen (v), se calcula la densidad adaptarse a una serie de datos diferentes y lograr obtener una
utilizando la ecuación: solución confiable.
493
CHAPTER 14. PROCESS CONTROL
El entrenamiento de la red neuronal backpropagation consta corresponde a la suma de productos entre los pesos asociados
de dos etapas, en la primera etapa el patrón de entrada es a cada neurona y el error proporcional de la neurona anterior:
asociado a la red y propagado a través de las capas hasta
llegar a la capa de salida. Obtenidos los valores de salida de ∑ (3)
la red neuronal, se comienza la segunda etapa, comparando
dichos valores con la salida deseada para determinar el error. Este procedimiento se repite hasta llegar a las neuronas de la
Se ajustan los pesos de la última capa proporcionalmente al capa de entrada; obtenido el valor de error para todas las
error generado. Se pasa a la capa anterior con una neuronas de la red, se procede a la actualización de los pesos
retropopagación del error, ajustando los pesos y repitiendo (w’), para ello se toma el valor actual del peso y se adiciona
este procedimiento hasta llegar a la capa de entrada. De esta el producto entre el error de la neurona, la derivada de la
forma se reajustan los pesos de las conexiones de la red función de activación de la suma de productos calculados
neuronal, conocidos los valores de entradas y salidas anteriormente (h), el valor de entrada de la neurona y un
deseadas del problema que se busca solucionar. parámetro momento (n) que determina la capacidad de
convergencia del algoritmo de entrenamiento:
Para iniciar el proceso de aprendizaje de las redes neuronales,
se utiliza un patrón de entradas y salidas de características de
(4)
café tostado, a partir de 10 muestras de café en diferentes
puntos de tostión, a las cuales se les mide densidad y color
con el procedimiento antes descrito y, a través de otro equipo Este algoritmo de entrenamiento debe iterar de manera
calibrado, se obtiene su humedad. Este procedimiento se continua hasta converger en un mínimo valor de error
repite 10 veces para cada muestra, obteniendo un total de 100 permisible en la salida de la red, en este caso, un valor
valores que se ingresan al algoritmo de entrenamiento para la aproximado de las variables humedad y Quantik. Finalmente,
red Backpropagation, una muestra de esta base de datos se después de aplicar el algoritmo de entrenamiento se obtienen
presenta en la Tabla 1. dos redes neuronales artificiales diferentes con la siguiente
configuración: una primera red neuronal artificial que consta
Tabla 1. Patrón de entradas y salidas de variables de café. de una capa de entrada con 10 neuronas, una capa oculta con
2 neuronas y una capa de salida con 1 neurona (10:2:1), y
MUESTRA R G B GRAY PESO HUMEDAD QUANTIK
tiene como estímulos de entradas los valores de peso, RGB y
1 94 92 86 92 24,63 0,5 115
GRAY obtenidos con la balanza y el procesamiento de la
2 93 89 82 90 24,63 0,5 115
imagen de la muestra de café analizada, para generar en la
3 93 92 87 92 24,63 0,5 115
salida un valor estimado de humedad; la segunda red consta
4 89 86 79 86 24,63 0,5 115
de una capa de entrada con 30 neuronas, una capa oculta con
5 91 87 80 87 24,63 0,5 115
5 neuronas y una capa de salida con 1 neurona (30:5:1), y
6 94 93 89 93 24,63 0,5 115
cuenta con los valores de RGB y GRAY obtenidos de la
7 94 90 85 91 24,63 0,5 115
imagen como estímulos de entrada, para determinar en la
8 94 93 87 93 24,63 0,5 115 salida su valor estimado de nivel de tostión en escala Agtron.
9 94 93 89 93 24,63 0,5 115
10 91 87 81 88 24,63 0,5 115 Haciendo una breve descripción del algoritmo implementado
y su secuencia de funcionamiento, se presentan las siguientes
Para aplicar el algoritmo de entrenamiento se procede de la etapas básicas: obtener el valor del peso de una muestra de
siguiente forma; se inicializan los pesos de la red en valores grano de café tostado por medio de la balanza digital;
aleatorios y menores que 1 (wij); se ingresan los valores de adquirir una imagen de la muestra de grano a partir de la
entradas (x1, x2, … xn) correspondientes a las variables peso, cámara; aplicar técnicas de visión artificial para obtener
RGB y GRAY, y se especifica la salida deseada que debe información relevante de la imagen adquirida, comenzando
obtener la red (y), correspondiente a las variables humedad y con un filtro de tamaño para eliminar regiones que no aportan
Quantik respectivamente; se calculan los valores de salida de información, aplicar una función de histograma a la imagen
cada neurona (yn) a partir de la sumatoria de productos entre para obtener el valor de los componentes RGB, cambiar el
los pesos y los valores de entrada a la red, y se aplica una formato de imagen a escala de grises y nuevamente aplicar
función de activación sigmoidal (f): una función de histograma para obtener el valor del
componente GRAY; ingresar estos valores obtenidos como
∑ (2) patrones de entrada a las dos redes neuronales entrenadas
previamente, para obtener los valores de humedad y nivel de
Se aplica el mismo procedimiento para obtener todos los tostión en escala Agtron (Fig. 4).
valores de salida en las neuronas de la capa intermedia hasta
llegar a la capa de salida; se calculan los términos de error
(en) para todas las neuronas y se propagan hacia atrás de
forma proporcional para cada neurona, en la capa de salida el
valor del error es equivalente a la diferencia entre el valor
deseado para la variable asignada y el valor real obtenido en
la salida de la red, en las capas ocultas y de entrada el error
494
CHAPTER 14. PROCESS CONTROL
Este equipo debe entonces ser capaz de alojar los elementos Fig. 6. Pantalla táctil 3.2” ref uLCD-32PTU
descritos en el capítulo anterior, además de contar con una
unidad de procesamiento capaz de recopilar toda la La capacidad de procesamiento de la Raspberry obviamente
información y mostrarla en una interfaz hombre máquina. no puede compararse con un computador industrial o uno de
línea corporativa de última generación, por lo tanto las tareas
En el mercado existen diferentes alternativas para desarrollar que se le programen deben evaluarse desde el punto de vista
algoritmos de visión por computador. Uno de ellos, tal vez el del desempeño. Por ejemplo en aplicaciones de visión, debe
más potente, es LabVIEW con su toolkit “Vision evaluarse muy de cerca si el procesamiento es en línea de un
Development” (National Instruments, 2016), el cual permite video, o simplemente desde una fotografía. El grupo de
adquirir imágenes de cámaras convencionales de bajo costo, investigación ICARO ya ha realizado aplicaciones en este
propias o especializadas, para hacer todo el desarrollo de tipo de plataformas con procesamiento en video y funciona
estos algoritmos a través de una gran cantidad de librerías. moderadamente bien. Para el caso particular, se requiere
Posibilita el trabajo paralelo en procesadores, haciendo los únicamente una fotografía del café para hacer todo el análisis,
algoritmos altamente eficientes. Sin embargo, su principal por lo tanto la capacidad de procesamiento de la Raspberry es
debilidad es la dependencia de un computador (personal o de adecuada.
escritorio) para la ejecución de dichos algoritmos, y la única
alternativa son las SmartCameras capaces de ejecutarlos y 4.2 Diseño de la estructura principal
que el mismo fabricante ofrece, pero a un costo superior a los
U$2.000 y hasta los U$10.000 y más (National Instruments, La estructura principal debe ser en un material apto para la
2016). De forma similar está Matlab, el cual también tiene industria alimenticia. El acero inoxidable es,
una alta dependencia de computador. Ambos, LabVIEW y indiscutiblemente, este material por excelencia. Debe
Matlab, tienen adicionalmente costo de licencia, el cual debe contener las siguientes características:
adquirirse para poder utilizar el desarrollo.
495
CHAPTER 14. PROCESS CONTROL
(1) (2)
(3)
5. RESULTADOS
Apertura superior para instalación de la pantalla táctil, con - Medición de dureza (opcional): no cumple.
puerta para protección (Fig. 10).
5.2 Análisis de muestras de café
Y espacio oculto (en la parte superior entre pantalla y
lámpara) para ubicación de la unidad central de Para realizar las pruebas y validación del prototipo, se toman
procesamiento y adaptadores de voltaje. 5 muestras de café y, a cada una, se le realizan 10 pruebas en
similares condiciones, retirándola, cambiando de posición los
granos, y volviéndola a ingresar, esto con el fin de evaluar el
desempeño del mismo. A continuación el análisis de los
resultados de 3 de las muestras, en las otras no incluidas en
este artículo los resultados fueron similares.
496
CHAPTER 14. PROCESS CONTROL
Como se puede ver en la Tabla 2, los valores de densidad y en la medición y un número Agtron con una variación más
humedad son muy estables. En el caso de la densidad el valor grande, en 3 casos en valor 90, 1 vez en 90 y en 6 ocasiones
mínimo fue de 0.47 y el máximo de 0.471g/ml, error que es en 100. En esta muestra aplica también el análisis realizado
despreciable. En el caso de la humedad el valor mínimo fue para la muestra #1.
de 3.52 mientras que el máximo fue de 3.82, con una
variación de 0.30%. Para el número Agtron, se tuvieron Tabla 4. Muestra #5: Café sin tostión (crudo)
valores inferiores de 45 (equivalencia a medio) y máximos de ANÁLISIS DENSIDA HUMEDA AGTRON AGTRON
D D REAL OBTENID
60 (equivalencia a medio claro), es decir, se tuvo un error O
considerable en números y un cambio de un nivel en la
41 0,85 9,7 100 100
equivalencia; esto se puede generar debido a la reubicación y
posición de los granos, lo cual es de esperarse debido a que la 42 0,85 9,7 100
imagen cambia de condiciones como sombras o 43 0,849 9,65 90
irregularidades en la tostión en los granos, aunque también se
debe a un error típico en todos los instrumentos de medición. 44 0,849 9,7 100
48 0,849 9,68 95
1 0,47 3,6 50 55
49 0,849 9,66 100
2 0,47 3,74 50
50 0,849 9,65 90
3 0,471 3,81 45
4 0,47 3,82 50
5.3 Tiempo de ejecución
5 0,47 3,63 55
24 0,407 0,99 40
6. CONCLUSIONES
497
CHAPTER 14. PROCESS CONTROL
bajar el costo del equipo pero aumentando el costo Konica Minolta. (s.f.). Food Industry Apps. Recuperado el 21
computacional del mismo y generando mayor tiempo para de Junio de 2016, de Food Industry Apps:
efectuar el análisis. Aunque dicho tiempo resulta ser http://sensing.konicaminolta.asia/wp-
adecuado para la aplicación particular, se podría reducir con content/uploads/2011/05/FoodIndustryApps.pdf
equipos de mejores especificaciones, aunque aclarando que León, K., Mery, D., Pedreschi, F., & León, J. (2006). Color
esto aumentaría también el costo del mismo. measurement in L a b units from RGB digital
images. Food Research International Volume 39, Issue
Se diseña, construye y valida un equipo que cumple con las 10, 1084–1091.
especificaciones exigidas, que servirá de apoyo a las Muharrem, K., Sema, K., Serap, G., & Yurtsever, S. (2013).
cooperativas, laboratorios, microempresas y, en general, Utilization of color parameters to estimate moisture
tostadores de café, con tecnología de bajo costo importada, content and nutrient levels of peanut leaves. Turkish
pero integración y desarrollo local. Journal of Agriculture and Forestry, 604-612.
National Instruments. (2016). Módulo Vision Development.
6. AGRADECIMIENTOS Recuperado el 21 de Junio de 2016, de Módulo Vision
Development:
A la Gobernación de Antioquia y Secretaría de Productividad http://sine.ni.com/np/app/main/p/ap/vision/lang/es/pg/1/s
y Competitividad por haber lanzado el concurso “¿Quién se n/n17:vision/fmid/8519/
le mide?” y haber realizado el estudio de necesidades de National Instruments. (2016). NI Smart Cameras. Recuperado
caficultores en el departamento de Antioquia el 21 de Junio de 2016, de NI Smart Cameras:
respectivamente, recursos con los que se pudo ejecutar este http://sine.ni.com/np/app/main/p/bot/no/ap/vision/lang/es
proyecto y financiar la patente en trámite (solicitud ante SIC /pg/1/sn/n25:device,n17:vision,n21:11601/
# 15 280069 0000 0000 del 2015-11-24). OpenCV. (2016). OpenCV. Recuperado el 21 de Junio de
2016, de OpenCV: http://opencv.org/
REFERENCIAS
Organización Internacional del Café. (2015). Informe del
Baykasolu, A., Dereli, T., & Tani, S. (Noviembre de 2004). mercado de café Marzo de 2015. Londres: OIC.
Prediction of cement strength using soft computing Raspberry. (2016). Raspberry Pi 2. Recuperado el 21 de
techniques. Cement and Concrete Research, 2083-2090. Junio de 2016, de Raspberry Pi 2:
Behroozi Khazaei, N., Tavakoli Hashjin, T., Ghassemian, H., https://www.raspberrypi.org/wp-
Khoshtaghaza, M., & Banakar, A. (2013). Application of content/uploads/2015/01/Pi2ModB1GB_-comp.jpeg
Machine Vision in Modeling of Grape Drying Process. Sun, D.-W. (2016). Computer vision technology for food
Journal of Agricultural Science and Technology, 1095- quality evaluation. Londres: Elsevier.
1106.
Café de Colombia. (2010). Clasificaciones de calidad.
Recuperado el 20 de Junio de 2016, de Clasificaciones
de calidad:
http://www.cafedecolombia.com/particulares/es/sobre_el
_cafe/el_cafe/clasificaciones_de_calidad/
Cooperativa de Caficultores de Antioquia. (s.f.). Cafés
Especiales. Recuperado el 20 de Junio de 2016, de Cafés
Especiales:
http://www.cafedeantioquia.com/index.php/gestion-
comercial/cafes-especiales
Departamento Administrativo Nacional de Estadística.
(2016). Colombia, exportaciones de café, carbón,
petróleo y sus derivados, ferroníquel y no tradicionales,
según valores y kilos netos 1992 - 2016. Bogotá: DANE.
Everard, C., O’Callaghan, D., Fagan, C., O’Donnell, C.,
Castillo, M., & Payne, F. (2007). Computer Vision and
Color Measurement Techniques for Inline Monitoring of
Cheese Curd Syneresis. Journal of Dairy Science.
Volume 90, Issue 7, 3162–3170.
Federación Nacional de Cafeteros de Colombia. (2016).
Precio externo del café colombiano - promedio mensual.
Bogotá: Federación Nacional de Cafeteros de Colombia.
Federación Nacional de Cafeteros de Colombia. (2016).
Producción registrada - mensual. Bogotá: Federación
Nacional de Cafeteros de Colombia.
Gobernación de Antioquia. (2014). Reto #6 Cafés Especiales.
Medellín: Gobernación de Antioquia.
498
CHAPTER 14. PROCESS CONTROL
Abstract: Set Theoretic Methods in Control using random sequences have been widely used previously as
a tool for process design. In this paper, we present experimental results of determining control sets by
using low-discrepancy deterministic sequences as an alternative to random sequences. We show that in
certain situations this approach allows to reduce both significantly, the experimental error and the
computational time.
v) Methods based on selecting random sequence, i.e., Monte
1. INTRODUCTION
Carlo methods [Gómez 2009].
At present, almost all nonlinear control strategies are based
Various papers reported the use of stochastic sequences
on state-space models of the system to be controlled. These
[Calderon et al 2012, Alzate et al 2013, Alzate et al 2015,
models are usually obtained by first principles modeling or
Zuluaga-Bedoya et al 2015], and, for these papers, the
by system identification techniques. Then, these models are
number of samples is based on Hoeffding’s bound.
used to emulate the real system during design stage or as part
of the controller itself. Once the model is available in a state- This paper proposes the implementation of low-discrepancy
space representation including system restrictions and system deterministic sequences instead of random sequences. These
uncertainties, there are countless methods and design type of deterministic sequences are the core of the so called
techniques that use different mathematical tools. Quasi Monte Carlo integration methods. For several
applications, it has been reported that their use instead of
In particular, Set Methods in Control have been reported as
random sequences reduce the experimental error [Acebrón et
an alternative tool to evaluate the controllability of chemical
al 2005, Dick et al 2013, Atanassov et al 2008, Boyle et al
processes. This is because the Reachable, Controllable and
1997, Hokayem et al 2003, Jank 2005].
Reversible sets are useful for the process controllability
analysis, and the subsequent control system design [Alzate et This paper is structured as follows. Section 2 describes Set
al 2015, Gómez-Pérez et al 2015, Zuluaga-Bedoya et al Theoretic Methods in Control and low-discrepancy
2015]. deterministic sequences. Section 3 describes the
phenomenological based semi-physical model and the
Initial approximation to calculate Reachable and Controllable
algorithm to calculate the different sets. Section 4 presents
Sets are based on concepts of the theory of Optimal Control
the results obtained and their analysis and, finally, Section 5
[Grantham et al 1975]. Afterwards, and until now several
presents the conclusions and future works.
articles have been published based on a wide variety of
approximation methods, some of the most relevant are listed
below:
2. BASIC CONCEPTS
i) Methods based on the solution of partial differential
equations of the Hamilton-Jacobi-Bellman-Isaacs (HJBI) type 2.1 Set Theoretic Methods in Control
[Kurzhanskiy et al 2002, Kurzhanskiy et al 2011, Oishi et al
2006]. In this section, we present the basic definitions of Set
Theoretic Methods in Control used in this paper. The sets
ii) Level Set Methods [Mitchell 2004, Cross et al 2008]. used are Reachable, Controllable and Reversible.
iii) Ellipsoidal approximation methods [Kurzhanskiy et al Consider the nonlinear dynamic continuous system
1997, Kurzhanskiy et al 2002]. On this technique, a toolbox
is available for Matlab® (Ellipsoidat Toolbox) [Kvasnica et
al 2004]. ̇ ( ( ) ( ) ) (1)
iv) Methods based on Polytopes [Bravo et al 2005].
499
CHAPTER 14. PROCESS CONTROL
() ∑ (7)
( ) * ( ) + (3)
3) Reversible Set: Given a set and a time , the Reversible Halton Sequence: Let be the first prime
Set is the set of all state vectors that belong to Reachable Set number. The Halton sequence in dimensions is
of in a time and the Controllable Set of in a time . given by
( ) * ( ) ( )+ (4) . () () ( )/ (8)
500
CHAPTER 14. PROCESS CONTROL
501
CHAPTER 14. PROCESS CONTROL
502
CHAPTER 14. PROCESS CONTROL
503
CHAPTER 14. PROCESS CONTROL
Abstract: In this paper, a model based diagnosis system for a reverse osmosis desalination
module with spiral wound configuration is developed. At first, a mathematical model based on
differential and algebraic equations of the reverse osmosis system is obtained. From this model, a
structural analysis technique is performed which allows us to obtain a structural model of the plant
defined by a set of constraints. After the structural model a set of analytical redundancy relations
(ARR) is obtained by applying the ranking algorithm of constraints. The comparison between the
system in nominal operation with the system under different faults shows that all faults of interest
were detectable and isolable.
Keywords: Fault Diagnosis, Reverse Osmosis, Analytical Redundancy Relations.
504
CHAPTER 14. PROCESS CONTROL
work.
505
CHAPTER 14. PROCESS CONTROL
506
CHAPTER 14. PROCESS CONTROL
0.95 1.4
solved to find all unknown variables from the known
0 10 20 30 40 0 10 20 30 40
ones. Table 2 shows the Ranking Algorithm (Blanke,
Feed Concentration (Kg/m3) Permeate Concentration (Kg/m3)
36 0.125 2006)
35.5
35
0.12
Table 2. Ranking Algorithm
0.115
34.5
34 0.11
Ranking Algorithm
0 200 400 600 0 200 400 600
507
CHAPTER 14. PROCESS CONTROL
x 10
-5 RRA7 x 10
-5 RRA8 x 10
-4 RRA9
-0.5 10
Thus a total of 12 signals were used for fault detection -1 -6
-8
5
-1.5 0
and isolation analysis. -2 -10
-5
-2.5 -12
0 500 1000 1500 0 500 1000 1500 0 500 1000 1500
4. SIMULATION RESULTS RRA10 ERR1 ERR2
0.02 0.5 0.1
508
CHAPTER 14. PROCESS CONTROL
RRA1 RRA2 x 10
-3 RRA3 As a future work we propose the use of nonlinear
0.02 0.02 2
observers of certain variables of interest, which will
0 0 0
allow for obtaining additional redundancy relations to
-0.02
0 500 1000 1500
-0.02
0 500 1000 1500
-2
0 500 1000 1500 detect and isolate other faults of interest.
RRA4 x 10
-3 RRA5 RRA6
50 2 0.1
ACKNOWLEDGMENT
0 0 0
The authors acknowledge the support of the National
-50
0 500 1000 1500
-2
0 500 1000 1500
-0.1
0 500 1000 1500 Innovation Program for Competitiveness and
10
x 10
-3 RRA7
0.02
RRA8
0.02
RRA9 Productivity (Innovate Peru) entity that financed the-
5 0 0 FINCyT-IA-2013 207 project, under which this article
0 -0.02 -0.02 was developed.
-5 -0.04 -0.04
0 500 1000 1500 0 500 1000 1500 0 500 1000 1500
509
CHAPTER 14. PROCESS CONTROL
510
CHAPTER 14. PROCESS CONTROL
caso, seguido dos resultados experimentais. Por fim, a seção 𝑍̂ = 𝑇𝑃𝑇 (04)
6 finaliza o trabalho com a conclusão. E a matriz residual, que assimila as variações no espaço
observação gerado pelos autovetores associados aos 𝑚 − 𝛼
2 MÉTODOS ESTATÍSTICOS MULTIVARIADOS
menores autovalores, é dada por:
Os métodos estatísticos multivariados são capazes de extrair
informações de grandes conjuntos de dados multivariados 𝐸 = 𝑍 − 𝑍̂ (05)
e, apesar de vários métodos diferentes, todos compartilham 2.2 Análise de Componentes Independentes (ICA)
da semelhança de identificar variáveis “artificiais”, que são A grande diferença entre PCA e ICA está no princípio
combinações lineares ou não-lineares das variáveis utilizado para o tratamento dos dados. Enquanto as
originais. Neste trabalho, utilizaram-se duas técnicas: componentes principais são baseadas na descorrelação,
análise de componentes principais (PCA) e análise de obtendo sinais ou dados não correlacionados, gaussianos ou
componentes independentes (ICA). Esses métodos são não, as componentes independentes são baseadas no
brevemente descritos a seguir. Para maiores detalhes, princípio de independência estatística, obtendo sinais ou
consultar Jackson (1991) para o PCA e Hyvärinen et al dados de variáveis não gaussianas independentes.
(2001) para o ICA.
Para se definir um modelo para ICA, considera-se um
2.1 Análise de Componentes Principais (PCA) conjunto de 𝑛 variáveis aleatórias 𝑥1 , … , 𝑥𝑛 , que foram
A ideia central do PCA é reduzir a dimensionalidade de um observadas e medidas. De acordo com Hyvärinen et al
conjunto de dados, que consiste de um grande número de (2001), estas variáveis podem ser escritas como uma
variáveis correlacionadas, com a perda mínima de combinação linear de 𝑛 variáveis estatisticamente
informação. Isto é possível por meio da transformação dos independentes 𝑠1 , … , 𝑠𝑛 , como segue:
dados em um novo conjunto de variáveis, ditas
𝑥𝑖 = 𝑎𝑖1 𝑠1 + 𝑎𝑖2 𝑠2 + ⋯ + 𝑎𝑖𝑛 𝑠𝑛 (06)
componentes principais (PCs), que são não correlacionadas
Em que: 𝑖 = 1, … , 𝑛, e 𝑎𝑖𝑗 são coeficientes escalares reais
e ordenadas de modo que as primeiras componentes
contenham a maior parte da variação presente em todas as com 𝑖, 𝑗 = 1, … , 𝑛.
variáveis originais. Este é considerado o modelo básico do ICA e descreve
Para aplicação do método, quando a matriz de dados como os dados observados são gerados a partir de um
originais é coletada, esta deve ser centralizada e processo de mistura dos componentes independentes 𝑠𝑗 .
normalizada. Assim, dada uma matriz 𝑋𝑚𝑥𝑝 de dados Sendo que, os coeficientes da mistura 𝑎𝑖𝑗 são
coletados, onde 𝑚 é o número de medições (observações) e desconhecidos. Tanto as componentes da mistura como os
𝑝 o número de variáveis, a normalização da matriz de dados componentes independentes são estimados a partir de 𝑥𝑖 .
nos dá uma matriz 𝑍𝑚𝑥𝑝 de média zero e covariância igual Essas estimações são realizadas assumindo-se algumas
a sua matriz de correlação. De acordo com Jackson (1991), hipóteses, de modo que as componentes tornem-se o mais
o PCA é realizado efetuando-se a diagonalização da matriz independente possível umas das outras.
de covariância dos dados normalizados, isto é: Usando notação matricial, em que 𝐱 é o vetor de variáveis
𝑆 = 𝐸Λ𝐸 𝑇 (01) aleatórias cujos elementos são as misturas, 𝐬 é o vetor de
Em que: 𝐸 é uma matriz 𝑝𝑥𝑝 ortogonal chamada de matriz componentes independentes e A é a matriz de mistura, de
acordo com tem-se:
de autovetores e Λ é uma matriz diagonal 𝑝𝑥𝑝 chamada
matriz de autovalores. 𝐱 = A𝐬 (07)
Assim, como por hipótese, as componentes de 𝐬 são
Os autovalores estão dispostos em ordem decrescente, estatisticamente independentes e pode-se achar um sistema
sendo que cada autovalor expressa uma porcentagem da de “desmistura” ou separação W, em que os componentes
variância total dos dados. Por meio dos autovetores, pode- reconstruídos 𝐬̂ sejam o mais independente possível. Neste
se transformar as 𝑝 variáveis, que são correlacionadas, em caso, tem-se que:
outras 𝑝 variáveis, não-correlacionadas, como segue:
𝐬̂ = W𝐱 (08)
𝑃𝑐 = 𝑍𝐸 (02) Nas condições normais de operação dos dados 𝐱, W e 𝐬̂ são
As 𝛼 primeiras componentes principais tem a maior obtidos utilizando-se o algoritmo FastICA proposto por
variância e podem representar todas as 𝑝 variáveis originais Hyvärinen et al (2001). Como acontece no PCA, pode-se
sem muita perda de informações. Estas, podem ser reduzir a dimensão dos dados selecionando-se as 𝑑 linhas
calculadas apenas com os autovetores associados aos 𝛼 de W dominantes que tem o maior efeito na variação dos
primeiros autovalores, ou seja: elementos do vetor de componentes principais, usando-se a
norma euclidiana. Assim, tem-se duas matrizes reduzidas,
𝑇 = 𝑍𝑃 (03) 𝑊𝑑 correspondente a parte dominante e 𝑊𝑒 correspondente
A matriz 𝑃𝑝𝑥𝛼 também é chamada de loading vectors. Em a parte excluída. Da mesma forma, pode-se selecionar as 𝑑
consequência, a matriz estimada dos dados, obtida do PCA, colunas da matriz de mistura A, resultando nas matrizes 𝐴𝑑
é dada por: e 𝐴𝑒 . Em consequência disto, tem-se os seguintes vetores de
componentes independentes:
511
CHAPTER 14. PROCESS CONTROL
512
CHAPTER 14. PROCESS CONTROL
deve ter em seus dados de treinamento esse tipo de controlador para o modelo benchmark são: o horizonte de
informação. O método utilizado neste trabalho e proposto modelo foi ajustado para 80, as matrizes de supressão foram
por Tian et al (2011a) baseia-se no ângulo formado entre o escolhidas como matrizes identidades, o horizonte de
principal autovetor dos dados detectados com mau predição e o horizonte de controle como sendo 10 e 1,
desempenho e os principais autovetores dos dados respectivamente, e o tempo de amostragem foi estabelecido
previamente treinados no modelo. Assumindo-se que se tem em 1 minuto. Além disso, o “set point” das variáveis
𝑔 causas (𝐶1 , 𝐶2 , … , 𝐶𝑔 ) de deterioração de desempenho do controladas foram ajustadas para 1 e 0.8, respectivamente,
controlador MPC, previamente conhecidas, um conjunto de e a variância do distúrbio 𝐷 de 0.01. O conjunto de variáveis
dados 𝑋𝑖 é coletado para cada causa 𝐶𝑖 e treinado para monitoradas foi estabelecido como sendo:
adquirir um modelo de referência. Esse modelo é composto
pelo principal autovetor 𝑒𝑖 , que corresponde ao maior 𝑋 = [𝑼1 𝑼𝟐 𝒆𝒑1 𝒆𝒑𝟐 𝒀1 𝒀𝟐 ] (22)
autovalor da matriz de covariância da amostra dos dados, já 5.2 Treinamento dos Dados de Referência
que o autovetor correspondente ao maior autovalor explica Para criar o modelo de referência usando-se o DPCA e o
a maior parte das informações do subespaço. Assim, cada DICA, simulou-se o sistema em malha fechada com uma
causa 𝐶𝑖 define um subespaço 𝑉𝑖 que é representado pelas janela de 1000 minutos, considerando-se as condições de
características do seu autovetor 𝑒𝑖 . ajustes descritas na seção 5.1, que são as condições normais
do sistema. A partir destes dados, o benchmark foi criado,
Quando um baixo desempenho é detectado pelo
utilizando-se as técnicas estatísticas multivariadas já
procedimento de avaliação de desempenho, examina-se
descritas nesse trabalho, para ser aplicado no
qual subespaço {𝑉𝑖 }𝑔 𝑖=1 está mais próximo do subespaço
monitoramento do sistema de controle MPC.
dos dados monitorados. Para isto, define-se o ângulo entre
os dois subespaços, considerando-se 𝑉𝑖 o subespaço dos Para o diagnóstico das causas de deterioração do controle,
dados do modelo, sendo 1 ≤ 𝑖 ≤ 𝑔 e 𝑉𝑚 o subespaço dos estas causas devem ser previamente conhecidas e um
dados monitorados: treinamento dos dados com ocorrência destas causas deve
ser realizado. Para a coluna de destilação Wood-Berry,
𝑒𝑖 𝑇 𝑒𝑚 quatro causas foram estabelecidas por Tian (2011a) e
𝜃𝑖,𝑚 = 𝑐𝑜𝑠 −1 ( ) (21)
‖𝑒𝑖 ‖‖𝑒𝑚 ‖ seguidas por esse trabalho, conforme segue na Tabela 1.
Assim, o subespaço 𝑉𝑖 que tiver o menor ângulo 𝜃𝑖,𝑚 é o Estas quatro causas foram: mudança da variância da
mais similar ao subespaço 𝑉𝑚 e o atual mau desempenho é distúrbio (𝐶1 ), alteração nas restrições das variáveis
provavelmente causado pela causa 𝐶𝑖 . controladas (𝐶2 ), aumento do primeiro ganho estático de
100% (𝐶3 ), e mudança no horizonte de controle do
5 ESTUDO DE CASO controlador (𝐶4 ).
Para demonstrar a efetividade das técnicas, estas foram Tabela 1. Causas e parâmetros dos dados treinados
aplicadas em um sistema de destilação Wood- Berry. Este Causa Condição de Operação Parâmetro/Variável Valor
processo tem sido usado em muitos trabalhos como Tian C1
Perturbação no
Variância 0.015
(2011a) e é ilustrado pela Fig. 1. distúrbio
Saturação nas Restrições das
C2 ±0.7
restrições saídas
Incompatibilidade de Primeiro ganho
C3 25.6
modelo do processo estático
Controlador mau Horizonte de
C4 3
sintonizado controle
Similar a criação do modelo DPCA e DICA, os dados com
as quatro causas foram simulados em uma janela de 1000
minutos. Com estes dados, o benchmark para diagnóstico
pode ser feito, seguindo os passos da seção 4. A simulação
em malha fechada do conjunto das variáveis monitoradas
sob sistema de controle MPC referente ao período de
referência e os quatro períodos de treinamento são
mostrados em Fig. 2, Fig. 3 e Fig. 4.
Fig. 1. Coluna de destilação Wood-Berry 5.3 Resultados Experimentais
5.1 Modelo de coluna de destilação Wood-Berry
Após definir o benchmark, um protótipo para o
A coluna de destilação Wood-Berry é um sistema de duas monitoramento do MPC foi desenvolvido usando-se o
entradas e duas saídas para mistura de metanol e água. O DPCA e o DICA, para então avaliar a capacidade destes
diagrama apresentado na Fig. 1 indica as variáveis métodos em identificar anomalias no controle do sistema.
envolvidas na simulação. Esta foi realizada em malha Os limites de controle das estatísticas T², SPE, 𝐼²𝑑 e 𝐼²𝑒 são
fechada, usando-se a estratégia de controle MPC com definidos seguindo a seção 3. A cada nova medição, as
restrições impostas nas entradas de ±0.2, nas saídas de estatísticas são recalculadas para um monitoramento on-line
±1.5 e no incremento das entradas de ±0.05, seguindo a e, para avaliar o desempenho deste, quatro falhas foram
mesma estratégia usada por Tian (2011a). Os parâmetros do inseridas no sistema, de acordo com a Tabela 2.
513
CHAPTER 14. PROCESS CONTROL
514
CHAPTER 14. PROCESS CONTROL
REFERÊNCIAS
AlGhazzawi, A. and Lennox, B. (2009). Model predictive
control monitoring using multivariate statistics. J.
Process Control, 19(2), 314–327.
Chiang, L. H., Russell, E. L., and Braatz, R. D. (2001). Fault
detection and diagnosis in industrial systems. Springer,
London.
Desborough, L. and Harris, T. (1993). Performance
assessment measures for univariate
feedforward/feedback control. Canadian J. of
Chemical Eng., 71, 605–616.
Harris, T.J. (1989). Assessment of closed-loop
performance. Canadian J. Chemical Eng., 67(10), 856–
861.
Fig. 8. Monitoramento DICA para F3 e F4 Harris, T.J., Boudreau, F., and Macgregor, J.F. (1996).
O método proposto nesse trabalho, descrito na seção 4, Performance assessment of multivariable feedback
retrata o ângulo entre o autovetor de cada período controllers. Automatica, 32(11), 1505–1518.
monitorado e o autovetor de cada período treinado. O menor Harris, T.J., Seppala, C.T., and Desborough, D. (1999). A
ângulo indicará a causa atual do mau desempenho. Os review of performance monitoring and assessment
resultados são mostrados na Fig. 9 e estes demonstram que techniques for univariate and multivariate control
o método proposto para diagnóstico encontra com sucesso systems. J. Process Control, 9(1), 1–17.
a causa raiz da degradação do desempenho para todas as Huang, B. (1997). Multivariate Statistical Methods for
falhas inseridas. Control Loop Performance Assessment. Ph.D. thesis,
Dept. Chemical Eng., University of Alberta, Canada.
Hyvärinen, A., Karhunen, J. and Oja, E. (2001).
Independent Component Analysis. John Wiley & Sons,
Inc, New York.
Jackson, J. (1991). A user’s Guide to Principal
Components. Wiley, New York.
Jelali, M. (2006). An overview of control performance
assessment technology and industrial applications.
Control Engineering Practice, 14(5), 441–466.
Lee, J. M., Yoo, C. and Lee, I. B. (2004). Statistical process
monitoring with independent component anlaysis. J. of
Process Control, 14(5), 467–485.
Loquasto, F. and Seborg, D.E. (2003a). Model predictive
controller monitoring based on pattern classification
and pca. In Proc. 2003 American Control Conference,
Fig. 9. Resultados do diagnóstico volume 3, 1968–1973. Denver, Colorado.
6 CONCLUSÃO Qiang, Z. and Shaoyuan, L. (2006). Performance
Monitoring and Diagnosis of Multivariable Model
Este trabalho propôs uma abordagem para avaliação de Predictive Control Using Statistical Analysis. Chinese
desempenho de controladores MPC, utilizando métodos J. Chem Eng, 14(2), 207-215.
estatísticos multivariados. Os métodos utilizados foram o Stefatos, G. and Ben Hamza, A. (2010). Dynamic
PCA e o ICA. Estatísticas de monitoramento foram independent component analysis approach for fault
utilizadas para o monitoramento on-line e seus limites de detection and diagnosis. Expert Systems with
confiança foram calculados, conforme descritos no Applications, 37, 8606–8617.
trabalho, com o intuito de detectar as anormalidades no Tian, X., Chen, G. and Chen, S. (2011a). A data-based
desempenho do controlador. A estratégia proposta para approach for multivariate model predictive control
monitoramento mostrou-se capaz de detectar tais performance monitoring. Neurocomputing, 74, 588–
anormalidades, embora mais eficiente no uso do ICA, pois, 597.
os cálculos das estatísticas de monitoramento usadas para o Tian, X., Chen, G., Cao, Y. and Chen, S. (2011b).
PCA assumem que os dados têm uma distribuição Performance monitoring of MPC based on dynamic
gaussiana, o que pode levar a um pobre desempenho de principal component analysis. In Proc. 18th IFAC
monitoramento. O método proposto para diagnóstico da World Congress, 13139–13144, Italy.
causa raiz das anormalidades também se mostrou eficiente Zhang, Q. and Li, S.Y. (2006). Performance monitoring and
em seu diagnóstico. Os resultados apresentados neste diagnosis of multivariable model predictive control
trabalho demonstram a eficácia destes métodos no using statistical analysis. Chinese J. Chem. Eng., 14(2),
monitoramento do MPC, visto a grande importância deste 207–215.
na indústria.
515
CHAPTER 14. PROCESS CONTROL
516
CHAPTER 14. PROCESS CONTROL
517
CHAPTER 14. PROCESS CONTROL
518
CHAPTER 14. PROCESS CONTROL
1300
Valor medio
Incertidumbre
Biomasa
1098
896
T [K]
694
Secado
Biomasa
Pirólisis seca 492
Aire
Gas de síntesis
290
0 0.14 0.28 0.42 0.56 0.7 0.84
Carbonizado z[m]
Combustión Combustión
Figura 5. Perfil de temperatura experimental
de carbonizado
Tabla 3. Composición del gas de sı́ntesis, µ -
Reacciones valor medio, σ - desviación estándar
de reducción Reducción
µ σ
%H2 18.94 1.39
%CO 20.72 1.15
Figura 3. Esquema del gasificador GEK %CO2 8.77 1.06
%CH4 2.18 0.29
Tabla 2. Caracterı́sticas del gasificador de
10kW y requerimientos de la biomasa mediciones en los termopares. De igual forma, en la Tabla
3 se presenta la composicón del gas de sı́ntesis registrada
Rango de potencia de salida 2 − 10kW en el analizador de gases, donde el valor de la media se
Consumo de biomasa 160 − 320 kg/dia
considera una composición nominal, mientras que con la
Rango de flujo de gas 5 − 27 m3 /hr
desviación estándar se define una banda de incertidumbre
Tamaño de partı́cula 1.27 − 3.81 cm
Porcentaje de humedad máxima
en las mediciones.
30 %
de la biomasa
Proporción de carbono fijo 4. CONSIDERACIONES PARA LA SIMULACIÓN
> 0.25
y volátiles
Contenido de cenizas > 5% El punto de partida es el modelo obtenido en Santamarı́a
(2016), en el cual se muestra que se reproduce el comporta-
21.5
miento estático del gasificador con un tren de 30 RCTAs y
donde se desprecian los primeros 10 cm del reactor debido
10.5 a que en dicha zona no se lleva a cabo ninguna reacción.
De igual forma se determina que el mejor ajuste a los
18.5 datos experimentales se obtiene con un P e = 6; de este
26.5
trabajo también se toman las condiciones de alimentación,
las caracterı́sticas de la biomasa y los parámetros de la
73.9
62.5
53.5
84
55
519
CHAPTER 14. PROCESS CONTROL
XH2
dp0 [cm] 2.54 ±20 %
95 % C 95 % C
Composición 8.8
2.5 % H 2 − 4% H
del carbonizado
2.5 % O 1 − 3% O
4.4
tan los valores nominales de los parámetros mencionados N = 13
N = 30
anteriormente, ası́ como el porcentaje de perturbación que 0
0.1 0.2233 0.3467 0.47 0.5933 0.7167 0.84
se considera para cada uno.
z[m]
(a) Concentración H2
24
5. RESULTADOS
20
Se realizaron simulaciones considerando una discretización
uniforme del reactor con 30 RCTAs, variando de manera 16
aleatoria los parámetros de la Tabla 4 dentro del rango
XCO
permitido; a partir de estos resultados se generó una ban- 12
da de incertidumbre debida a la perturbación de dichos
parámetros. El paso siguiente es obtener una discretización 8
del gasificador de menor orden cuya solución se encuentre
dentro de dicha banda; debido a que el trabajo de Canales 4
N = 13
(2013) no abarca la configuración Imbert, como una pri- N = 30
0
mera aproximación se realizó la distribución no uniforme 0.1 0.2233 0.3467 0.47 0.5933 0.7167 0.84
de los nodos a prueba y error hasta obtener perfiles que z[m]
cumplieran con la condición de estar dentro de la banda (b) Concentración CO
de incertidumbre. El número de tanques considerado para
este trabajo es de N = 13 , en las Figs. 6 a 8 se muestran 10
los perfiles obtenidos cuando el gasificador se representa
con 13 RCTAs, la banda de incertidumbre para cada caso 8
y el modelo cuando N = 30.
De las Figs. 6 a 8 es posible observar que las zonas 6
XCO2
520
CHAPTER 14. PROCESS CONTROL
REFERENCIAS
105 All-Power-Labs (2016). Power pallet support. URL
http://www.allpowerlabs.com/support/support
-power-pallet.
ρC
70
Badillo, U. (2014). Modelado, Observación y Control de
Reactores de Gasificación. Tesis de doctorado, Progra-
35
ma de Maestrı́a y Doctorado en Ingenierı́a UNAM.
Basu, P. (2006). Combustion and gasification in fluidized
N = 13 beds. CRC press.
N = 30 Bhavanam, A. y Sastry, R. (2011). Biomass gasification
0
0.1 0.2233 0.3467 0.47 0.5933 0.7167 0.84 processes in downdraft fixed bed reactors: a review.
z[m]
International Journal of Chemical Engineering and Ap-
(b) Densidad del carbonizado plications, 2(6), 425–433.
Bird, R.B., Stewart, W.E., y Lightfoot, E.N. (1960).
Figura 7. Perfiles espaciales de densidades de sólidos Transport phenomena. Madison, USA.
Canales, L. (2013). Reducción de orden en modelos para
1200
reactores de gasificación. Tesis de maestrı́a, Programa
de Maestrı́a y Doctorado en Ingenierı́a UNAM.
1020 Di Blasi, C. y Branca, C. (2013). Modeling a stratified
downdraft wood gasifier with primary and secondary air
840 entry. Fuel, 104, 847–860.
T [K]
521
CHAPTER 14. PROCESS CONTROL
Abstract: A common kernel used in scientific computing is the stencil computation. FPGA
based heterogeneous systems has been used to overcome stencil algorithm performance limita-
tions due to the memory bandwidth on CPU and GPU based systems. Performance improvement
is achieved through the combination of several data flow optimization techniques, taking
advantage of the FPGA inherent parallelism. However 2D array architectures used in most
cases, involves the need of considerable amount of FPGAs to simulate problems with sizes that
can be treated by a CPU or GPU based system at a lower cost. In this document is presented
a FPGA based system that use a one-dimensional array of processing elements for a stencil
computation. The data-path is designed to reuse the processing elements and reduce the number
of memory transfers using registers arrays to store data temporarily. The proposed architectures
are implemented in a ZedBoard Zynq Evaluation and Development Kit using Vivador Design
Suite and Xilinx SDK. System performance is evaluated for the approach to numerical solution
of heat transfer problems modelled with the heat equation for the one-dimensional case using
stencil algorithm. An architecture for the Laplace equation for the two-dimensional case derived
from the heat equation approach is proposed.
522
CHAPTER 14. PROCESS CONTROL
523
CHAPTER 14. PROCESS CONTROL
This kernel is used to calculate each one of the mesh points 3.1 Architecture for the approach to the numerical solution
as shown in Algorithm 2. of the one dimensional heat equation
Algorithm 2 Pseudocode for the stencil computation to A baseline architecture is designed for a sequential imple-
obtain the approach to the numerical solution of Laplace mentation of the stencil algorithm. The block diagram is
equation with the explicit scheme shown in Figure 4.
for n from 0 to N − 1 do
for j from 1 to h − 1 do
for i from 1 to l − 1 do
n+1
ui,j ← 0.25(uni+1,j + uni−1,j + uni,j+1 + uni,j−1 )
end for
end for
end for
3. SYSTEM DESCRIPTION
524
CHAPTER 14. PROCESS CONTROL
END
Fig. 5. Flow chart for the stencil algorithm implemented. From the architecture with concurrent processing and
Operations outside the dashed line are executed in PS storage is developed a data-path with a two-dimensional
and those found within the dashed line are executed register array to obtain the approach to the numerical
in PL. The sequence for the execution of the stencil solution of the 2D Laplace equation. The block diagram
algorithm is coordinated by the control unit which of the implemented circuit for a 8 × N mesh is shown in
is described in VHDL as a finite state machine. The Fig. 8.
counters kj and kn are used to control the loops. The
counter ka is used to address the RAM. 4. EVALUATION
525
CHAPTER 14. PROCESS CONTROL
R5 0
1 RAM5 R50 R51 R52 PE5
The FPGA resources utilization respect to the PE is
summarized in Table 5 for A1 , A2 and A3 . This reports
R6 0
1
RAM6 R60 R61 R62 PE6 corresponds to implementation using 65536x32 RAM for
RAM7 A1 and A2 and 512x32 RAM for A3 . The parallel imple-
mentations for 62 PE could not be performed because the
circuit exceed the number of available FPGA slices.
ZYNQ-7 Control
PS Unit
4.2 Numerical results
Data out Control J N
(b)
The stencil scheme used has four floating-point operations.
Therefore with a 100 M Hz clock the system has a peak Fig. 9. (a) Approach to the numerical solution of the 1D
performance of 400 MFLOPS in A1 and 12 GFLOPS in heat equation obtained with the baseline architecture.
A2 and A3 . However, considering the number of mesh (b) Percent error in comparison with Matlabr simu-
points, the stencil floating-point operations and the algo- lation. The plots are made using the Matlabr mesh
rithm execution time, the system performance in GFLOPS function.
corresponds to the values shown in Table 4.
526
CHAPTER 14. PROCESS CONTROL
Table 5. FPGA resources utilization for the approximation to the numerical solution of
heat equation.
A1 A2 A3
6 PE 14 PE 30 PE 6 PE 14 PE 30 PE
Slice LUTs/ 2461 9381 20131 41953 9212 23244 42600
53200 (4.63%) (17.63%) (37.84%) (78.86%) (17.32%) (43.69%) (80.08%)
Slice Registers/ 1692 1404 1792 2581 1641 2068 3136
106400 (1.59%) (1.32%) (1.68%) (2.43%) (1.54%) (1.94%) (2.95%)
F7 Muxes/ 182 142 246 374 160 2240 256
26600 (0.68%) (0.53%) (0.92%) (1.41%) (0.6%) (8.42%) (0.96%)
F8 Muxes/ 27 59 59 187 0 1056 128
13300 (0.2%) (0.44%) (0.44%) (1.41%) (0%) (7.94%) (0.96%)
Block RAM Tile/ 58 58 58 58 4 0 16
140 (41.43%) (41.43%) (41.43%) (41.43%) (2.86%) (0%) (11.43%)
DSPs/ 4 24 56 120 24 56 120
220 (1.82%) (10.91%) (25.45%) (54.55%) (10.91%) (25.45%) (54.55%)
5. CONCLUSIONS AND FUTURE WORK ceedings of the 11th International Symposium on Ap-
plied Reconfigurable Computing, ARC 2015, Springer
In this paper is presented a FPGA based heterogeneous International Publishing pp. 383-392, 2015
system for stencil computation. The system is designed [3] K. Sano, Y. Hatsuda and S. Yamamoto, Multi-FPGA
for the approach to the numerical solution of parabolic Accelerator for Scalable Stencil Computation with Con-
PDE for a one dimensional heat transfer problem with stant Memory-Bandwidth, IEEE Transactions on Par-
initial value and boundary conditions. The implementation allel and Distributed Systems, vol. PP, 2014
is made using the SoC architecture of a XC7Z020CLG484- [4] R. Kobayashi and K. Kise, Scalable stencil-
1 Xilinx FPGA of the ZedBoard. computation accelerator by employing multiple FPGA,
IPSJ Transactions on Advanced Computing Systems,
Three different architectures based in the explicit finite pp. 1-13, 2013
difference method are described. Performance analysis [5] V. Bandishti, I. Pananilath and U. Bondhugula, Tiling
shows the improvement achieved in terms of execution Stencil Computations to Maximize Parallelism, en Pro-
time for the stencil algorithm with two proposed par- ceedings of the International Conference for High Per-
allel architectures. The speedup factor led to determine formance Computing, Networking, Storage and Anal-
that implemented architectures offer different performance ysis, IEEE, pp. 1-11, 2012
optimization due to the memory structure. However in [6] J. M. Cecilia, J. L. Abellán, J. Fernández, M. E. Aca-
both cases the use of the registers array allows to take cio, J. M. Garcı́a, M. Ujaldón, Stencil computations on
advantage of spatial and temporal locality reducing the heterogeneous platforms for the Jacobi method: GPUs
need of memory transfer operations. versus Cell BE, The Journal of Supercomputing,
For future work more deep performance analysis in terms Springer Science+Business Media, vol. 62, No. 2, pp.
of accuracy, precision, data transfer, scalability and power 787-803, 2012
consumption could be performed. This evaluation should [7] R. Kobayashi, S. Takamaeda-Yamazaki and K. Kise,
allow performance comparison with CPU and GPU based Towards a Low-Power Accelerator of Many FPGAs for
systems. Otherwise the study of variation for the im- Stencil Computations, en Proceedings of the Third In-
plemented architectures to address bigger heat transfer ternational Conference on Networking and Computing,
problems in 2D using parabolic and elliptic PDEs could IEEE, pp. 343-349, 2012
be developed. [8] K. Sano, Y. Hatsuda and S. Yamamoto, Scalable
Streaming-Array of Simple Soft-Processors for Stencil
ACKNOWLEDGEMENTS Computations with onstant Memory-Bandwidth, en
Proceeding of the 19th Annual International Sympo-
This work was supported by the AE&CC research group sium on Field-Programmable Custom Computing Ma-
of the Instituto Tecnológico Metropolitano through the chines, IEEE, pp. 234-241, 2011
project P14208. Luis Castaño acknowledges the financial [9] R. Strzodka, M. Shaheen, D. Pajak and H. Seidel,
support by the scholarship Estudiantes Sobresalientes de Cache Accurate Time Skewing in Iterative Stencil
Posgrado Universidad Nacional de Colombia. Computations, en Proceedings of the International
Conference on Parallel Processing, IEEE, pp. 571-581,
2011
REFERENCES
[10] A. R. Brodtkorb, C. Dyken, T. R. Hagen,
[1] Schmitt, C., Schmid, M., Hannig, F., Teich, J., J. M. Hjelmervik and O. O. Storaasli, State-of-the-art
Kuckuk, S. and Harald Köstler, A Challenge of in heterogeneous computing, Scientific Programming,
Portable and High-Speed FPGA Accelerator, in Pro- IOS Press Amsterdam, vol 18, pp. 1-33, 2010
ceedings of the Second International Workshop on [11] K. Datta, S. Kamil, S. Williams, L. Oliker, J. Shalf
High-Performance Stencil Computations, HiStencils and K. Yelick, Optimization and Performance Model-
2015, pp. , 2015 ing of Stencil Computations on Modern Microproces-
[2] T. Usui, R. Kobayashi and K. Kise, A Challenge of sors, SIAM Review, vol. 51, no. 1, pp. 129-159, 2009
Portable and High-Speed FPGA Accelerator, in Pro-
527
CHAPTER 14. PROCESS CONTROL
Abstract: This paper aims to develop a simple but efficient control technique based on a linear algebra
approach for tracking optimal profiles of a nonlinear multivariable fed-batch bioprocess. The
methodology proposed allows, knowing the desired states, to find the values for the control actions by
solving a system of linear equations. Its main advantage is that the condition for the tracking error tends
to zero. The efficiency of the proposed controller is tested through several simulations. The optimal
controller parameters are selected through Montecarlo Randomized Algorithm in order to minimize a
cost index.
Keywords: Multivariable control, trajectory tracking, nonlinear systems, algebraic approaches, optimal
trajectories.
the processes (Alford, 2006). Besides that, in the open-loop
1. INTRODUCTION
control strategies, the main disadvantage is that no
Fed-batch processes are widely used in the biotechnological compensation is made for modeling mismatch or random
industry, which is demanding for more efficient, reliable and disturbances during the process operation (Lee et al., 1999,
safer processes to optimize production and improve power Chang, 2003, Chung et al., 2006, Soni and Parker, 2004). It is
quality (De Battista et al., 2012). In a fed-batch operation, therefore important to design a controller to track the optimal
one or more nutrients are gradually supplied to the bioreactor, policy considering disturbance compensation for the closed-
but no product is withdrawn until the process is finished. Its loop control problem.
main advantages are the avoidance of substrate overfeeding The aim of this work is to solve the problem of tracking
which can inhibit the growth of microorganisms and optimal profiles of an important biological system, which has
catabolite repression. On the other hand, from the control a complex dynamics and a strong nonlinearity. The proposed
engineer's viewpoint, the fed-batch fermentation is methodology to achieve the stated objective is based on
characterized for a large number of obstacles: complex solving a system of linear equations. One of the key features
dynamic behavior of microorganisms, the process model of this technique is its simple approach, which suggests that
usually contains strongly time-varying parameters, changes in knowing the value of the desired state, analyzing the
initial conditions, input saturation, external disturbances and conditions for a system to have an exact solution and then
the stiffness and nonlinearity of the model equations (Rani solving the system of linear equations; it can find the values
and Rao, 1999, Bayen and Mairet, 2013, Renard et al., 2006, for the control actions, which forces the system to move from
Lee et al., 1999, Johnson, 1987). its current state to the desired one.
Several control techniques are studied today associated with The main advantages of this method are its simplicity,
optimization and control of bioprocess, such as: bio-inspired versatility and accuracy even under parametric uncertainty.
algorithms (Rocha et al., 2014), genetic algorithms (Sarkar The methodology for the controller design is very simple,
and Modak, 2003, Sarkar and Modak, 2004), robust control nonlinear model is used; thus, its performance is independent
(Renard et al., 2006, Renard and Wouwer, 2008), nonlinear of the operating point, and has an excellent performance
fuzzy control (Cosenza and Galluzzo, 2012),evolutionary against the set point changes. The optimal controller
algorithms (Ronen et al., 2002), model predictive control parameters are selected through Montecarlo Experiments in
(MPC) (Ashoori et al., 2009) and nonlinear MPC (Craven et order to minimize a proposed cost index. The computing
al., 2014, Santos et al., 2012), adaptive stochastic algorithms power required to perform the mathematical operations is
(Carrasco and Banga, 1997), neural network model (Saint- low. Furthermore, the developed algorithm is easier to
Donat et al., 1991, Tholudur and Ramirez, 1996), etc. Most implement in a real system because the use of discrete
of the control literature for fed-batch cultures focuses on equations allows direct adaptation to any computer system or
open-loop operation owing to the highly nonlinear and programmable device. Moreover, because its simplicity and
inherently difficult dynamic behavior (Berber, 1996). These the mathematical tools that it use, this methodology is
methods have good results in biological processes; however, applicable to many systems, not only to bioprocesses.
they have limitations regarding the need for advanced The case study proposed for control is the Lee-Ramirez fed-
specific knowledge, the difficulty of mathematical processing batch bioreactor (Lee and Ramirez, 1992), developing a
(especially in nonlinear systems), trouble with real-time mathematical model for the induced foreign protein
implementation and the need for a complicated database of production by recombinant bacteria in a fed-batch bioreactor.
528
CHAPTER 14. PROCESS CONTROL
529
CHAPTER 14. PROCESS CONTROL
b 1 1
x1 1 1 u
A
x ( N x3,n ) x3, n 1, n
1 1 x2 1 x1 g u
u x2 1 1 2, n
1 1 (5) 1
u1 g ( I x5,n )
( N x3 ) x3 x3 x1 x2 x1
u
1 2
Y
1 x1,ref , n 1 k1 x1,ref , n x1, n x1, n
x4 x1 Rx2 x1
1 ( I x5 )
x4 x4
T0
x5 x1 x2,ref , n 1 k2 x2, ref , n x2, n x2, n x
1, n
x1, n g
T0 x
Remark 1. The variables x6 and x7 are not directly related 2, n
with the control variables, therefore, are not considered in the x3,ez , n 1 k3 x3,ez , n x3, n x3, n g
x1, n x2, n x1, n
controller design. T0 Y
According to the rule of numerical integration of Euler, the
differential equations can be approximated as follow: x4,ref , n 1 k4 x4,ref , n x4, n x4, n x1, n
x1, n
Rx2, n
T0 x4, n x4, n
xi ,n 1 xi ,n
xi (6)
x5,ez , n 1 k5 x5,ez , n x5, n x5, n
(10)
T0 x1, n
T
0
Where T0 is the sample time, xi,n represents the state variable i
in n instant and xi,n+1 the state variable i in (n+1) instant.
Defining the following expression: To simplify the mathematical treatment, the equations system
errori ,n1 errori ,n is expressed as follows:
Now, replacing (9) in each differential expression appearing 0 (a31a52 a32 a51 )(a31b4 b3a41 ) (a31b5 b3a51 )(a31a42 a32 a41 )
in (5), the process model can be rewritten, see (10). This equations system is solved for each sampling period by
The values of x1, x2, and x4 are the references to follow, iteration methods, where the unknown variables x3,ez,n+1 and
therefore are known (the reference values as well as the real x5,ez,n+1 (sacrificed variables) are calculated.
system values). Note that, the Eq. (10) is a system of five Once the values of x3,ez,n+1, x5,ez,n+1 are found, the matrix A
equations and two unknowns, which normally has no and b are completely known at (n) time. Therefore, the
solution. Therefore, the unknown variables of this system are control variables u1,n and u2,n (u vector) can be calculated
defined as “sacrificed variables” and are written as xi,ez solving the system (10) by the least squares method:
corresponding, in this case, to x3,ez and x5,ez. The key of this
technique is that the values adopted by such variables forces ( AT A)u AT b u ( AT A)1 AT b (13)
the equation system (10) to have exact solution, which
implies error not only be minimal, but equal zero. The solution allows finding the control actions (u1,n and u2,n)
to be applied at time n to follow the desired trajectories with
a minimal error.
530
CHAPTER 14. PROCESS CONTROL
Remark 3. The following constraints on the control variables The aim in this subsection is to find the values of the
are considered (Balsa-Canto et al., 2000): 0.0≤ u1 ≤1.0 and controller parameters, for which the tracking error is minimal
0.0≤ u2 ≤1.0. (the bioreactor behavior directly depends on the adjustment
Remark 4. In Eq. (8) note that: of parameters ki ). To achieve the target, a Montecarlo
▪ If ki = 0, the reference trajectory is reached in only one step. experiment is performed.
Remark 5. The parameters ki, i = {1,2,3,4,5}, satisfied 0 < ki In the field of systems and control, Montecarlo methods have
< 1, which allows the tracking error tends to zero. been found useful especially for problems related to
Remark 6. The tracking error is the value of the difference robustness of uncertain systems (Tempo and Ishii, 2007).
between the reference and real trajectory, and is calculated In this work, the Montecarlo experiment consisted on
as: randomize the controller parameters and then simulate the
en e1,n 2 e2,n 2 e4,n 2 ; E=T0 en
process, this is repeated a large number of trials (N) and the
(14)
tracking error is calculated in every one.
n
Now, considering by definition that Montecarlo randomized
x1,ref ,n x1,n algorithm (MCRA) is a randomized algorithm that may
e1,n (15) produce an incorrect result, but the probability of such an
x1,max incorrect result is bounded (Motwani and Raghavan, 1995),
the number of simulations necessary to ensure a certain
x2,ref ,n x2,n degree of confidence and accuracy (confidence boundaries) is
e2,n (16)
achieved using the following expression (Tempo and Ishii,
x2,max
2007):
x4,ref ,n x4,n 1
e4,n (17) log (18)
x4,max N
log 1
1
Where: x1,max= 1,9 L, x2,max= 13,92 g/L, and x4,max= 3,1 g/L
Note that the tracking error is dimensionless. Where δ= confidence and ε= accuracy.
It is fixed δ= 0.01 and ε= 0.005. Therefore, N ≥ 920.
Theorem 1. If the system behavior is ruled by (10) and the
controller is designed by (13), then, ei,n→0, n→∞, when The initial conditions used to simulate the system are shown
profile tracking problems are considered. The proof of this in Table 1.
theorem is not shown because space reasons. Feeding concentrations and parameters values can be seen in
TRAJECTORY E
Table 2.
OPTIMAL PROFILES
TRACKING N Table 1 Initial conditions for the state variables [g/L].
GENERATION PLANT
CONTROLLER V
(x1,ref, x2,ref and x4,ref) x1,0 x2,0 x3,0 x4,0 x5,0 x6,n x7,0
(u1 and u2) I
R 1.0 0.1 40.0 0.01 0.01 1.0 0.01
O
N
M Table 2 Feeding concentrations and parameters (Tholudur
STATE VARIABLES
E and Ramirez, 1997).
MEASUREMENT
N
(x1, x2, x3, x4, x5, x6, x7)
T N (g/L) I(g/L) Y p (h-1)
40 100 0.51 0.09
Figure 1. Architecture of the trajectory tracking controller.
The final time for the process is Tf = 10 h and the sample time
Figure 1 shows the architecture of the control system for simulations is T0 = 0.1 h
proposed herein. In this work, the optimal profiles are taken The ki values found for the minimum tracking error after
from literature (Balsa-Canto et al., 2000), the focus is in the 1000 simulations (total simulation time: 75 min) are
trajectory tracking of such profiles. presented in Table 3.
531
CHAPTER 14. PROCESS CONTROL
Figure 2. Tracking optimal profiles of desired variables (x1, x2, and x4) in normal operation conditions.
A very good controller performance can be observed, the 4.3. Parametric uncertainty.
optimal desired profiles are successfully tracking as can be
In this subsection, the MC Method is applied to make an
seen in Fig. 2.
analysis of the system in case of appearing modeling errors.
Note that the tracking error defined by (14), remains low and
For this, a determined error is introduced in the model
acceptably bounded as shows Fig. 3.
parameters (15% above and below their nominal values) and
perform 500 simulations (N = 500). In each simulation the
parameters are chosen in a random way by MC sampling
experiment (Auat Cheein et al., 2013). The parameters under
disturbance are the concentration of nutrient feed stream N,
the concentration of inducer in the inducer feed stream I, the
Monod-type constant p and the growth yield coefficient Y.
The initial conditions and the sampling time are the same
used in the previous subsection.
The number of simulations taken to carry out this test is
considering the confidence δ= 0.01 and accuracy ε= 0.01.
Therefore, replacing in (18), N ≥ 500.
Figures 4 shows the behavior of the system with the
controller proposed in this work, which presents a good
Figure 3. Tracking error (e) in normal operation conditions. response against parametric uncertainty.
It is noteworthy that the controller design is focused in the
It is noteworthy that not only the desired variables follow the tracking optimal profile of the desired variables, not of
optimal profiles, but also the other variables track their control variables. This is one of the advantages, since it
respective reference profiles (it is not show here for space allows control of the system even under disturbances.
reasons). The simulation tests carried out along this work show a very
good performance of the control law developed.
Figure 4. Tracking optimal profiles (desired variables x1, x2, and x4) under parametric uncertainty (±15%, 500 simulations).
532
CHAPTER 14. PROCESS CONTROL
533
CHAPTER 15
ROBOTICS
CHAPTER 15. ROBOTICS
Keywords: Consenso promedio, robot móvil (3,0), sistemas multiagentes, control disparado por
eventos.
535
CHAPTER 15. ROBOTICS
operativos, redes, comunicación efectiva, autonomı́a e in- agentes se da de forma sı́ncrona o ası́ncrona, demostrando
teligencia artificial, logrando posicionar a un CPS como el el desarrollo de algoritmos que garantizan el consenso. En
ente tecnológio para la cuarta revolución industrial, como particular, en Olfati-Saber and Murray (2004a); Olfati-
es comentado en Shiyong Wang and Zhang (2016). Saber et al. (2007) estudia algoritmos para lograr consenso,
considerando tiempo invariante en el sistema, el cual cons-
Ahora bien, la actividad colaborativa requiere de algorit-
ta de una red de agentes que determinan una topologı́a de
mos de control basados en red (NCS: Networked Control
comunicación fija o variante, representada por grafos con
Systems) que determinan un correcto funcionamiento de
topologı́a de tipo balanceados y fuertemente conectados,
los elementos que interactuán en un CPS, de acuerdo a
como elementos necesarios y suficientes para garantizar
Gao et al. (2008). Es evidente considerar que el consumo
convergencia.
energético es prioritario, debido al alto procesamiento de
señales, por consiguiente es necesario evaluar la posibilidad La motivación por el uso de microprocesadores embebidos
de optimizar y extender los recursos energéticos bajo el de recursos limitados para reunir información y realizar las
paradigma de control de sistemas disparado por eventos. actualizaciones del controlador en los agentes, hace posible
la generación de estrategias de control que sean activadas
La ingenierı́a de control establece la formalidad en anali- de forma ası́ncrona o por eventos. Particularmente, en Di-
zar un sistema, a partir de su modelo matemático, para marogonas and Frazzoli (2009); Dimarogonas and Johans-
lograr estabilidad del sistema ante perturbaciones, donde
son (2009); Seyboth et al. (2011) se presentan estrategias
las dinámicas que predominan su estudio son determinan-
de control distribuido activado por eventos para el SMA,
tes para elegir un análisis adecuado, y sean evaluadas en
sólo requiriendo información de los agentes vecinos para
tiempo continuo o tiempo discreto. Para ello se requiere de
la implementación del controlador, con la consideración
la implementación adecuada de las estrategias de control
del formalismo en tiempo discreto. En consecuencia, se
en las unidades de procesamiento, que indudablemente
requiere el análisis de los sistemas involucrados en tiempo
trabajan en tiempo discreto, de ahı́ la importancia de
discreto, o en la ocurrencia de un evento, es decir, de forma
distinguir que los sistemas se pueden emular de forma
ası́ncrona, y con control centralizado o descentralizado,
sı́ncrona y ası́ncrona, es decir, para determinar las acti-
resultados en este sentido son presentados en Seyboth et al.
vaciones de las acciones de control en cada cierto intervalo
(2013); Fax and Murray (2004); Yan et al. (2014); Olfati-
de tiempo o bajo la restricción de la dinámica del sistema,
Saber and Murray (2004b).
respectivamente, y actualmente basado en eventos, como
en Durand and Marchand (2009). La variedad de SMA donde se ha aplicado el consenso trae
consigo un discernimiento en la diversidad de sistemas me-
En un sistema de control sı́ncrono es la progresión autóno- catrónicos, por lo que el interés particular se encuentra en
ma del tiempo, contı́nuo o discreto, lo que dispara la ejecu-
robots móviles, como lo planteado en Yamaguchi (2003),
ción de las acciones, mientras que en los sistemas de control
para coordinar desplazamiento de los agentes y realizar
ası́ncrono, y en especı́fico basados en eventos, es la propia
tareas de colaboración.
evolución dinámica del sistema la que decide cuándo se
ejecutará la próxima acción de control. Esta circunstancia, En este trabajo se analiza el problema de consenso pro-
al ser implementada en una unidad de procesamiento, para medio desde la perspectiva de tiempo ası́ncrono, primor-
una aplicación de alta demanda de cálculo, se traduce en dialmente en la ocurrencia de un evento, para probar
una reducción en la comunicación del bus de datos y el que la propuesta del algoritmo garantice estabilidad al
uso del procesador, generando ahorro de energı́a. En la promedio de los estados de forma arbitraria para esquemas
Fig. 1 se muestra la diferencia entre el muestreo basado en de comunicación de grafos fuertemente conectados y no
tiempo continuo t, sı́ncrono T , y ası́ncrono tk (en eventos), direccionados.
distinguir que las señales conservan el comportamiento de
f (t), de ahı́ la iniciativa de utilizar señales discretas y En la sección 2 se presenta los recursos matemáticos
recientemente señales ası́ncronas, las cuales contienen la involucrados en el desarrollo del presente trabajo, en la
información necesaria. sección 3 se define la estrategı́a de control y se demuestra
la estabilidad de la ley de control, aunado al caso particular
del SMA integrado por cuatro robots móviles tipo (3,0),
en la sección 4 se analiza los resultados de simulación, y
finalmente, establecer las conclusiones y perspectivas en la
sección 5.
2. RECURSOS MATEMÁTICOS
536
CHAPTER 15. ROBOTICS
Se denomina un grafo dirigido si las aristas tienen dirección 2.3 Control disparado por eventos
del vértice i (punto inicial) al vértice j (punto final). Ası́,
el grado del vértice i es el número de aristas que tienen En este trabajo, el estudio se enfoca a la clase de sistemas
punto inicial en el mismo. Caso contrario en un grafo no dinámicos de la forma:
dirigido, en el cual las aristas no tienen dirección y por lo
tanto el grado del vértice es el número de aristas incidentes ẋ = f (x) + g(x)u (6)
en sı́ mismo. donde: x ∈ X ⊂ Rn , u ∈ U ⊂ Rp , f y g son funciones
Definición 2. La matriz de adyacencia de un grafo G es Lipschitz que se desvanecen al origen. Si el sistema admite
la matriz A(G) de tamaño n × n y cuyas entradas están un control por retroalimentación k : X → U que estabiliza
dadas por aij = 1 si (i, j) E y aij = 0 en otros casos. al sistema asintóticamente, entonces existe una Función de
Lyapunov de Control (CLF) V : X → R, la cual es una
Definición 3. La matriz de grado asociada al grafo G es función suave, definida positiva tal que:
una matriz diagonal D(G) cuyos elementos di,j correspon- ∂V ∂V
V̇ = f (x) + g(x)k(x) (7)
den al grado de salida del nodo i. ∂x ∂x
Es necesario notar que si se asume que k es suave,
Definición 4. La matriz Laplaciana se denota por L(G) de entonces V existe y es suave. En el presente trabajo solo
tamaño n × n cuyas entradas están dadas por el valor de se requerirá que V sea suave, lo cual es menos restrictivo.
dij si i = j o −aij si i 6= j. También puede determinarse En el marco de trabajo de control disparado por eventos
por: generalmente se trabaja con dos funciones:
L = D − A. (1)
Función evento ē : X × X → R indica si es necesario
Para grafos no dirigidos, la matriz Laplaciana es simétrica (ē ≤ 0) o no recalcular y aplicar la señal de control
y semi-definida positiva, i.e, L = LT ≥ 0. al proceso. La función de evento ē toma el valor del
estado actual x como una entrada y una memoria m
2.2 El problema de consenso para un SMA del valor que tomo el estado la última vez en la que
ē fue negativa.
Consenso significa buscar un acuerdo respecto al valor de
Función de retroalimentación k : X → U, la cual se
una variable de interés, la cual depende de los estados
usa como en el caso clásico.
de todos los agentes que sean miembros del sistema.
Un algoritmo de consenso es una regla de interacción Definición 5. Un control por retroalimentación disparado
que especifica el intercambio de información entre un por eventos (k, ē) se dice ser semi-uniformemente MSI
agente y todos sus vecinos en la red multiagente. En (Minimal Sampling Interval) si para todo δ > 0 y todo
estos algoritmos la variable de interés es el estado de x0 en la esfera de radio δ con centro en el origen B(δ), la
información de cada agente, por lo que se asume que existe duración de tiempo entre dos eventos consecutivos, puede
una interacción vecino a vecino entre los integrantes del ser acotado por debajo por algún τ > 0.
equipo. En consecuencia, cada agente actualiza su valor Se sabe que para un sistema no lineal de la forma (6)
de información de acuerdo a los estados de sus vecinos. con un control disparado por eventos semi-uniformemente
La representación de un grafo considera que existe un MSI (ē, k), la solución con condiciones iniciales x0 ∈ X en
SMA, donde cada agente se denota por xi y a su vez cada el instante t = 0, se define como la solución del sistema:
uno presenta la dinámica de un integrador simple:
ẋi (t) = ui (t), i ∈ N (2) ẋ = f (x) + g(x)k(m) (8)
donde: ui (t) es la entrada de control y N el número de m = x si ē(x, m) ≤ 0, x 6= 0
(9)
agentes. Por lo tanto, la forma matricial de la dinámica ṁ = 0 en otros casos
de los agentes xi = [x1 , ..., xN ]T , las entradas de control con x(0) := x0 and m(0) = x(0)
u = [u1 , ..., uN ]T y las condiciones iniciales x(0) permiten
Teorema 2.1. (Fórmula universal de disparo por evento).
representar al SMA (2).
Si existe una CLF para el sistema (6), entonces el control
De acuerdo a Seyboth et al. (2013) el protocolo estándar disparado por eventos (ē, k), es MSI, suave en X \ {0} y es
de consenso para un SMA se define: tal que:
ui (t) = −ΣjNi (xi (t) − xj (t)), (3) ∂V ∂V
f (x) + g(x)k(m) < 0, x ∈ X \ {0} (10)
con Ni que representa a los agentes vecinos j con respecto ∂x ∂x
al elemento i.
ki (x) := −bi (x)δi (x)γ(x) (11)
Asumiendo que el grafo es conectado y no dirigido la ley de
control (3) resuelve global y asintóticamente el problema ē(x, m) := −a(x) − b(x)k(m) (12)
q
de consenso promedio. El valor promedio de los estados de −σ a(x)2 + θ̄(x)b(x)∆(x)b(x)T (13)
los agentes se define como:
t→∞ 1
X con las consideraciones planteadas en Marchand et al.
xi (t) −−−→ xi (0). (4) (2013).
N
iN
2.4 Modelo cinemático del robot (3,0)
Finalmente, el sistema en lazo cerrado puede ser escrito en
forma vectorial como: De acuerdo a la clasificación establecida por Campion and
ẋ(t) = −Lx(t), (5) Bastin (1996) el robot móvil (3,0) es un robot holónomo
537
CHAPTER 15. ROBOTICS
con tres ruedas omnidireccionales que tiene tres grados de Considere la dinámica del modelo como un integrador
movilidad y no contempla grado de direccionabilidad, para simple y además los teoremas de estabilidad de Lyapunov
moverse en cualquier dirección con cualquier orientación. permiten dar condiciones suficientes para la estabilidad en
Una vista isométrica del robot móvil se muestra en la los puntos de equilibrio. Por lo tanto se toma en cuenta lo
Fig. 2, donde puede identificarse un sistema de coordenada siguiente:
inercial {w} y un sistema de coordenadas {m} se fijan en el
Si una función V (x) satisface la condición (16) se dice
plano de movimiento y en el robot móvil, respectivamente.
que es definida positiva. En cambio, si satisface la
Sea η̇w = [ẋw ẏw φ̇w ]T y η̇m = [ẋm ẏm φ̇m ]T la velocidad condición más débil de V (x) ≥ 0 para x 6= 0 se dice
linear y angular del robot móvil con respecto al sistema ser semidefinida positiva.
de coordenadas inercial {w} y al sistema de coordenadas
del móvil {m}, respectivamente. Se asume que el robot V (0) = 0, V (x) > 0 en u. (16)
móvil tiene una estructura rı́gida, con ruedas que no se Una función se dice definida negativa o semidefinida
deforman, no deslizan y su movimiento es en un plano negativa si −V (x) es definida positiva o semidefinida
horizontal (suelo) con un punto de contacto entre la rueda positiva, respectivamente. Si V (x) no tiene signo
y el suelo; considerando los ángulos descritos por el eje definido con respecto a alguno de estos casos se dice
Ym y el eje axial de la rueda como δ1 = π6 y δ3 = π3 , indefinida.
el modelo cinemático se puede representar en (14). Cabe Teorema 3.1. (Estabilidad Lyapunov). El origen es esta-
mencionar que los valores de los ángulos δ1 y δ2 forman ble si existe una función definida positiva y continuamente
parte del diseño del robot móvil ya que maximizan su diferenciable, tal que V̇ (x) es semidefinida negativa, y
destreza (Villarreal-Cervantes et al., 2013).
es asimétricamente estable si V̇ (x) es definida negativa.
ẋw = ẋm cos φw − ẏm sin φw (Khalil, 1996).
ẏw = ẋm sin φw + ẏm cos φw (14)
φ̇w = φ̇m Considerese entonces el siguiente vector de estado asociado
El mapeo entre la velocidad lineal de las ruedas y la al sistema multiagente, x = [x1 , x2 , ..., xN ]T , para el
velocidad angular y lineal del sistema de coordenadas del sistema definido por (6) una función (17), la cual es
aplicada al sistema en lazo cerrado (5), debe elegirse los
robot móvil se representa en (15) donde θ̇ = [θ̇1 θ̇2 θ̇3 ]T elementos de L tal que V (x) sea una función candidata de
es la velocidad angular de las ruedas, rr = 0,0625m es el Lyapunov válida.
radio de las ruedas y la distancia entre el centro geométrico
del móvil y la rueda se representa como L = 0,287m. 1
V (x) = xT Lx (17)
1√ 1 2
θ˙1 rr = 3ẋm + ẏm + Lφ̇m donde: L representa la matriz Laplaciana asociada al grafo
2 2 fuertemente conectado y no direccionado.
θ˙2 rr = −ẏm + Lφ̇m (15)
˙ 1√ 1 Note que esta función de Lyapunov, es cero en el sub-
θ3 rr = − 3ẋm + ẏm + Lφ̇m
2 2 espacio nulo de L y es mayor que cero fuera de él, debido
a la topologı́a del grafo, i.e. L ≥ 0. Si se introduce el
complemento ortogonal del espacio nulo de L, se tiene que
1 T 1 2 ⊥
2 x Lx ≥ 2 λ2 (L)kyk , donde x = x0 + y, y ∈ ker L
y λ2 representa el segundo valor propio de la matriz L,
que es el mı́nimo valor propio diferente de cero. Puesto
que kyk representa una medida de la distancia d(x, ker L)
la función de Lyapunov cumple con todas las condiciones
para ser considerada una candidata de Lyapunov. Ahora
analizaremos si V (x) es una CLF. Para ello, se evalúa la
derivada temporal de V (x) a lo largo de las trayectorias
del sistema con control dado por (3):
1 1
V̇ (x) = ẋT Lx + xT Lẋ
2 2
= −(Lx)T (Lx) = kLxk2 (18)
≤ −λ2 (LT L)kyk2
Figura 2. Diagrama esquemático del robot móvil (3,0). Por lo tanto, la Función Candidata de Lyapunov es una
Función de Lyapunov y el sistema es estable.
Por otro lado, se puede observar que el sistema represen-
3. DISEÑO DE LA ESTRATEGIA DE CONTROL tado en (2), puede tomar la forma de un sistema lineal y
como consecuencia ser representado mediante la siguiente
Se describe el diseño de una ley de control basado en even- ecuación:
tos para un SMA basándose en el esquema centralizado, ẋ = Ax + Bu. (19)
para este propósito, es necesario determinar una función
de control de Lyapunov y posteriormente desarrollar el Ası́, el sistema (19) puede ser escrito en la forma ẋ =
control disparado por eventos, aplicado a robots móviles f (x) + g(x)u considerando f (x) = Ax y g(x) = B. Por lo
tipo (3,0). tanto, la técnica de control basada en eventos descrita en
538
CHAPTER 15. ROBOTICS
la sección 1.5 se puede aplicar de forma sistemática. En del SMA, en particular, dichos agentes serán las variables
consecuencia, el siguiente paso es determinar las funciones de estado que definen la dinámica de cuatro robots móviles
necesarias a(x) y b(x) para obtener la retroalimentación tipo (3,0), con condiciones iniciales diversas, para lograr
ki (x) y la función de evento e. Ası́, igualando (18) con (8) el consenso en la posición y la orientación, a través de
resulta: la topologı́a de comunicación centralizada, definida por
∂V ∂V el grafo y ejecutando la técnica de control disparado por
a(x) := f (x) = 0, b(x) := g(x) = xT LI, eventos, representada en el esquema de la Figura 4.
∂x ∂x
∆(x) := I, θ̄(x) = b(x)bT (x) − 2a(x), σ = 0,9,
De acuerdo a la descripción del robot móvil tipo (3, 0),
donde: I es la matriz identidad. Cabe mencionar que es posible obtener el modelo matemático que defina el
debido a la representación del sistema, para este caso del desplazamiento en el sistema coordenado (X1 , X2 ), con-
integrador simple, el valor de la matriz A es nulo. Por lo siderando su estructura fı́sica y los tres grados de libertad
tanto, el valor de a(x) que depende de A y la derivada de de movilidad. Por consiguiente, el modelo cinemático del
la función de Lyapunov es cero. Ası́, las ecuaciones que robot móvil (3, 0) es posible redefinir:
describen la acción de control y la función de evento son:
ẋwi = ẋmi cos φwi − ẏmi sin φwi
ki (x) := −bi (x)δi (x)γ(x), (20) ẏwi = ẋmi sin φwi + ẏmi cos φwi (25)
q
ē(x, m) := −b(x)k(m) − σ θ̄(x)b(x)∆(x)b(x)T , (21) φ̇wi = φ̇mi
p donde: i identifica al agente iN , ẋmi define la velocidad
θ̄(x)b(x)∆(x)b(x)T lineal sobre el eje de referencia X1 , ẏmi define la velocidad
γ(x) := T
si x ∈ S (22) lineal sobre el eje de referencia X2 , y Φi define la orien-
0 b(x)∆(x)b(x) si x ∈
/S tación del robot, en un sistema coordenado descrito por
(X1 , X2 ) en el sistema de referencia {w}.
El interés es verificar que el sistema en lazo cerrado T
Sea el vector de estado x = [x11 x12 x21 x22 x31 x32 ] =
definido por (3), en su ejecución, tiene como salida el R R R T
consenso de los estados que conforman al SMA, los cuales xw xw yw yw φw φw y el vector de señal de con-
h iT
han sido definidos como en (6). A través del Laplaciano T
trol u = [u1 u2 u3 ] = ẋm ẏm φ̇m , la representación
se define la topologı́a de comunicación y con ello se
tiene de forma implı́cita la comunicación centralizada. En en el vector de variables de estado x del sistema (25) se
este sentido y de forma particular, se considera el grafo define en (26), y considere el compensador no lineal (27).
balanciado y no direccionado mostrado en la Fig. 3, para
un caso especı́fico del SMA, integrado por 4 agentes. ẋ11 = x12
ẋ12 = u1 cos(x32 ) − u2 sin(x32 )
ẋ21 = x22
(26)
ẋ22 = u1 sin(x32 ) + u2 cos(x32 )
ẋ31 = x32
ẋ32 = u3
La estrategia de control consiste en realizar el desplaza- Con la finalidad de validar los resultados teóricos obte-
miento de sistemas puntuales, los cuales serán miembros nidos, se realizan pruebas a través de simulaciones en
539
CHAPTER 15. ROBOTICS
x1i (m) 3 2 1 2 3
x2i (m) 1 2 3 4
YW (m)
2.5
θi (rad) −(1/2)π 0 (5/6)π (2/3)π
2
0.8
encuentro, particularmente, el promedio de sus condiciones
0.4
consenso de los tres estados que definen la locomoción de
0.3
los miembros que integran al SMA, se representa en el 0.2
esquema de la Fig. 4. 0.1
0
0 1 2 3 4 5 6 7 8
Tiempo (s)
15
10
−5
−10
−15
−20
−25
−30
0 1 2 3 4 5 6 7 8
t (s)
Figura 4. Estrategia de control disparado por eventos para
el consenso en la dinámica de los estados del SMA que
determinan la traslación y orientación. Figura 7. Respuesta de la función de retroalimentación.
540
CHAPTER 15. ROBOTICS
4.5
Durand, S. and Marchand, N. (2009). Further
Agente
Agente
Agente
1
2
3
Results on Event-Based PID Controller. In
4
Agente 4
Proceedings of the European Control Conference
3.5
2009, 1979–1984. Budapest, Hongrie. URL
3
http://hal.archives-ouvertes.fr/hal-00368535.
Département Automatique.
x2i (m)
2.5
2
Agente
Agente
3
4 A general formula for event-based stabilization of non-
1.5
linear systems. Automatic Control, IEEE Transactions
1
on, 58(5), 1332–1337.
Olfati-Saber, R. and Murray, R.M. (2004a). Consensus
x3i (rad)
0.5
541
CHAPTER 15. ROBOTICS
542
CHAPTER 15. ROBOTICS
Z(s) a3 3,384
= = (4a)
Uz (s) s2 + a1 s s2
+ 3,613s
Φ(s) b3 −67,3
= 2 = 2 (4b)
Uφ (s) s + b1 s + b2 s + 6,542s + 25,5
Θ(s) c3 60,4
= 2 = 2 (4c)
Uθ (s) s + c1 s + c2 s + 4,302s + 28,24
Ψ(s) d3 3,828
= 2
= 2 (4d)
Uψ (s) s + d1 s s + 4,225s
543
CHAPTER 15. ROBOTICS
544
CHAPTER 15. ROBOTICS
sistema (16a)-(16b) es un observador para el subsistema 0 I2 0 0
(14a)-(14b), i.e. lı́mt→∞ (xξη (t) − x̂ξη (t)) = 0. 0 0 gA(ψ) 0
0 0 0 I2
Hξη1 =
Demostración. Definimos el error de observación como K1
−1
− A (ψ) 0 −K3 I2 −β1
x̃ξη = xξη − x̂ξη (17) g
y
La dinámica del error de observación resulta
0 0 0 0
x̃˙ ξη = (Aξη − Lξη Cξη ) x̃ξη (18) 0 0 0 0
Hξη2 = 0 0 0 0
y ya que el par (Cξη , Aξη ) es observable, se puede escoger K2 −1
Lξη tal que Aξη − Lξη Cξη es Hurwitz y los errores de 0 − A (ψ) 0 β1 − K4 I2
observación converjan asintóticamente a cero. g
Nota 1. Cabe mencionar que se tiene un observador local y la dinámica del error de seguimiento para el observador
para la dinámica en X, Y , θ y φ, ya que esta basado en ê˙ ξη = Gξη1 eξη + Gξη2 êξη + Lξη Cξη (eξη − êξη ) (23)
la aproximación de primer orden y considerando altura y
ángulo de guiñada constantes. donde
0 0 0
0
Ahora, modificamos la ley de control (13) utilizando las 0 0 0
0
0
velocidades longitudinales y rotacionales estimadas ξˆ2 y η̂2 , Gξη1 = 0 0 0
respectivamente, obtenidas a partir del observador (16a) K1
−1
− A (ψ) 0 β2 − K3 I2 0
con lo que obtenemos g
1 y
uη =β3−1 β1 η̂2 + β2 η1 + η̈1d − A−1 (ψ) K1 (ξ1 − ξ1d )
g 0 I2 0 0
i i 0 0 gA(ψ) 0
ˆ ˙
+K2 (ξ2 − ξ ) − K3 (η1 − η d ) − K4 (η̂2 − η̇ d ) (19)
d
1 1 1 Gξη2 = 0 0 0 I2
K2 −1
Proposición 3. Considere el subsistema (14a), el observa- 0 − A (ψ) −β2 −K4 I2
dor (16a) y la ley de control (19). Las matrices Hξη y g
Aξη − Lξη Cξη son Hurwitz. Entonces en el sistema en lazo
cerrado (14a)-(16a)-(19), los errores de seguimiento y de Para estudiar el sistema completo, organizamos las ecua-
observación para la dinámica en X, Y , θ y φ convergen ciones para ėξη y ê˙ ξη en forma matricial como
asintóticamente a cero, i.e. lı́mt→∞ xξη (t) − xdξη (t) = 0 ėξη Hξη1 Hξη2 eξη
= (24)
y lı́mt→∞ (xξη (t) − x̂ξη (t)) = 0. ê˙ ξη Gξη1 + Lξη Cξη Gξη2 − Lξη Cξη êξη
545
CHAPTER 15. ROBOTICS
PC de Visión
FPS y cuentan con una tarjeta interna de procesamiento de
Sistema de cámaras OptiTrack
imágenes, lo cual libera parcialmente al sistema operativo
USB de realizar dicha tarea. Por medio del puerto USB las
cámaras reciben alimentación y realizan el envı́o de datos.
Ethernet
Cabe mencionar, que del sistema de visión sólo se obtienen
la posición en el espacio XY Z y el ángulo de guiñada ψ;
los ángulos de alabeo y cabeceo, φ y θ, respectivamente,
Router
se obtienen a partir de los sensores con los que cuenta el
Ethernet
AR.Drone 2.0.
Conexión Wi-Fi
Sistema de control y comunicación. Se cuenta con una
Helicóptero de 4 rotores AR.Drone 2.0 Laptop de Control
Laptop Core i5 con un sistema operativo Ubuntu 12.04 con
ROS en su versión Fuerte Turtle, que realiza los cálculos
del control y los envı́a al AR.Drone 2.0 vı́a Wi-Fi. ROS
Figura 2. Plataforma experimental. es un framework flexible de código abierto, que permite la
escritura de software para el control de robots. Se trata
5. EXPERIMENTACIÓN de una colección de herramientas, librerı́as y convenciones
que simplifican la creación de comportamientos complejos
5.1 Plataforma experimental y robustos en robots. ROS permite el diseño de manera
individual de procesos que se pueden acoplar de manera
Para validar los resultados teóricos obtenidos, la estrategia flexible en tiempo de ejecución. Estos procesos se pueden
de control con observador diseñada se implementa en una agrupar en paquetes que pueden ser fácilmente comparti-
plataforma experimental. La plataforma se muestra en la dos y distribuidos, con lo que se logra la reutilización de
Figura 2 y está formada por un AR.Drone 2.0, un sistema código.
de visión y un sistema de control y comunicación.
La comunicación entre la PC de visión y la Laptop de
AR.Drone 2.0. Es un helicóptero de 4 rotores comercial, control se hace a través de una transmisión basada en una
fabricado por la empresa francesa Parrot. El AR.Drone 2.0 arquitectura cliente/servidor (NatNet SDK) a través de
se puede controlar a través de aplicaciones para equipos una red local mediante una conexión Ethernet.
con arquitectura de PC con sistemas operativos basados
en Linux, utilizando una conexión Wi-Fi. El AR.Drone 2.0 5.2 Experimentos
cuenta con una estructura con tubos de fibra de carbono,
con piezas de plástico nylon con 30 % de fibra de vidrio, 2 A continuación se presenta un experimento utilizando las
cascos de protección de polipropileno expandido, un casco velocidades longitudinales y rotacionales en los ejes X, Y
para exteriores y otro para interiores, nanorevestimiento y Z obtenidas a partir de los observadores diseñados en
que repele lı́quidos, ası́ como espuma para aislar el centro este trabajo. En los experimentos, primero se procedió a
inercial de las vibraciones producidas por los motores. despegar al AR.Drone, de ahı́ ponerlo en modo hover y
En la parte electrónica, el AR.Drone 2.0 cuenta con un entonces comenzar con el seguimiento de trayectoria. Una
procesador ARM Cortex A8 de 1 GHz de 32 bits con Linux vez terminado el seguimiento de trayectoria, se vuelve a
2.6.32, una memoria RAM DDR2 de 1 GHz, un puerto poner en modo hover y posteriormente se aterriza.
USB 2.0 que se puede utilizar para grabar video en una
memoria USB o conectar un GPS, 2 cámaras de video, una La trayectoria deseada en el plano XY es una Lemniscata
vertical QVGA de 60 FPS y una horizontal HD 720p de de Gerono dada por
T
30 FPS. d 2πt 4πt
ξ1 (t) = 1.0 cos , 0.5 sin (28)
Los sensores con los que cuenta el AR.Drone 2.0 son: T T
con un periodo de T = 60 s. Se tienen una altura y un
Giroscopio de 3 ejes con precisión de 2000o /seg
ángulo de guiñada constantes, donde la altura deseada es
Acelerómetro de 3 ejes con precisión de ± 50 mg
de z d = 1 m y el ángulo de guiñada deseado es de ψ d = 0o .
Magnetómetro de 3 ejes con precisión de 6o
La Fig. 3 muestra el movimiento del AR.Drone 2.0 en el
Sensor de presión con una precisión de ± 10 Pa
espacio.
Sensor de ultrasonido para medir la posición vertical
Las figuras 4 y 5 presentan los errores de posición y
El AR.Drone 2.0 tiene 4 motores inrunner sin escobillas de
orientación del AR.Drone, respectivamente.
28,500 RPM y 14.5 W con controladores con CPU AVR
de 8 MIPS.
6. CONCLUSIONES Y TRABAJO FUTURO
Sistema de visión. La localización del AR.Drone 2.0
(posición y orientación) se realiza a través de un sistema de Este trabajo presenta una estrategia de control para lo-
visión que consta de 12 cámaras marca OptiTrack modelo grar el seguimiento de trayectorias de un helicóptero de
Flex 13 y una PC Core i3 con Windows 7 que tiene 4 rotores AR.Drone 2.0 de Parrot. Dicha estrategia de
el software Motive:Tracker. Dicho software está diseñado control esta diseñada utilizando el método de backstepping
para el rastreo de objetos en 6 DOF con alta precisión y está basada en el uso de observadores para reconstruir
y con soporte para el procesamiento en tiempo real. Las las velocidades longitudinales y rotacionales del AR.Drone.
cámaras Flex 13 tienen una resolución de 1.3 MP a 120 Como se puede observar en los resultados experimentales
546
CHAPTER 15. ROBOTICS
0.9
1
1
estrategia de control no lineal y se emplearán observadores
0.8 no lineales para estimar las velocidades longitudinales y
0.5
0.5 rotacionales, con lo que se pretende mejorar el desempeño
0
0
y obtener resultados más generales. También, se plantea el
-0.5
uso de varios helicópteros de 4 rotores AR.Drone 2.0 para
Distancia X (m)
Distancia Y (m) -0.5 -1 formar un sistema multi-agente y diseñar estrategias de
control para la coordinación de movimiento de este tipo
de sistemas, en particular, para lograr vuelo en formación.
Error de Posición en X
Bristeau, P. J., Callou, F., Vissiére, D., Petit, N., 2011. The
0
navigation and control technology inside the ar.drone
Error (m)
-0.1
-0.2 Pégard, C., 2013. Modeling the quad-rotor mini-
-0.3
rotorcraft. In: Quad Rotorcraft Control. Advances in
5 10 15 20 25 30 35 40 45 50 55
Tiempo (s) Industrial Control. Springer London, Ch. 2, pp. 23–34.
Error de Posición en Z Hernandez, A., Copot, C., De keyser, R., Vlas, T., Nascu,
0
I., 2013. Identification and path following control of an
Error (m)
-0.1
ar.drone quadrotor. 2013 17th International Conference
-0.2
5 10 15 20 25 30 35 40 45 50 55
on System Theory, Control and Computing (ICSTCC),
Tiempo (s) 583–588.
Khalil, H. K., 2002. Nonlinear Systems. Prentice Hall.
Kokotovic, P. V., 1992. The joy of feedback: nonlinear and
Figura 4. Errores de posición del AR.Drone. adaptive. IEEE Control Systems Magazine 12 (3), 7–17.
Krstic, M., Kokotovic, P. V., kanellakopoulos, I., 1995.
Error de Orientación en ? Nonlinear and Adaptive Control Design. John Wiley &
Sons, Inc.
Error (rad)
-0.04
Mokhtari, A., M’Sirdi, N. K., Meghriche, K., Belaidi, K.,
5 10 15 20 25 30 35 40 45 50 55
2012. Feedback linearization and linear observer for a
Tiempo (s) quadrotor unmanned aerial vehicle. Advanced Robotics
Error de Orientación en 3 20 (1), 71–91.
Error (rad)
-0.1
International Conference on Unmanned Aircraft Sys-
tems (ICUAS), 756–767.
-0.2
5 10 15 20 25 30 35 40 45 50 55 Wang, X., Shirinzadeh, B., 2015. Nonlinear augmented
Tiempo (s)
observer design and application to quadrotor aircraft.
Nonlinear Dynamics 80 (3), 1463–1481.
Figura 5. Errores de orientación del AR.Drone.
obtenidos, los errores de observación convergen a cero y el
AR.Drone logra el seguimiento de trayectorias.
Para este tipo de vehı́culo, el error que se tiene en el
seguimiento de trayectorias puede considerarse aceptable
debido a que se despreciaron los efectos no lineales que
se presentan debido al desbalanceo y deformaciones en
las hélices, ejes y engranes del AR.Drone. Ası́ mismo,
547
CHAPTER 15. ROBOTICS
Resumen: En este artículo se presenta un control robusto QFT diseñado bajo la herramienta ‘QFT
Control Toolbox for Matlab (QFTCT)’, con el objetivo de mantener en autobalance un robot móvil
basado en péndulo invertido que emplea un sistema de procesamiento embebido Beaglebone Black,
sistema en el cual se implementa el controlador mediante ecuaciones en diferencias. Los resultados
obtenidos demuestran que en el modelo péndulo invertido se debe asegurar un centro de masa bajo,
mediante el anexo de un contrapeso en la parte inferior del cuerpo del prototipo, aumentando de tal forma
las prestaciones del control QFT en aspectos de estabilidad robusta y rechazo de perturbaciones sobre el
autobalance del robot. Además se determina que la dificultad de garantizar una distribución de peso sobre
el cuerpo del péndulo genera una inclinación constante que lleva al sistema a ser inestable en espacios de
funcionamiento reducidos, causa que es solucionada mediante un control PID u otro tipo de control,
capaz delimitar el desplazamiento del móvil del prototipo. Estableciendo finalmente que el control de un
sistema péndulo invertido requiere de dos controladores autobalance y posición.
Palabras claves: Control robusto, Quantitative feedback theory (QFT), Péndulo Invertido, Robot de
autobalance.
de la acción de control para conseguir movimientos caóticos
1. INTRODUCTION
para hacer pivotar el péndulo sobre el carro con ruedas. El
La mayoría de métodos de análisis y diseño de sistemas modelo matemático del sistema se obtuvo por medio de
realimentados para mecanismo tipo péndulo invertido ecuaciones de Euler-LaGrange, determinando las condiciones
consideran que se dispone de una planta suficientemente necesarias para el movimiento caótico de la varilla por medio
exacta. No obstante, en aplicaciones reales no se cuenta con de la función de Melnikov. Se ha tomado un enfoque no
un modelo estricto ya que existe un error considerable de lineal del sistema del péndulo invertido, lo que ha permitido
modelo así como también de un alto grado de incertidumbre implementar un controlador no lineal para estabilizar el
dentro del modelo estimado (Sánchez 1996). Para diseñar sistema tal como se propone en (Frías 2013), donde se
controladores con buenas prestaciones de desempeño ante determino una funcion candidata de Lyapunov en
procesos con alto grado de incertidumbre, en los últimos combinación con el principio de invariancia de La Salle para
treinta años se han desarrollado un conjunto de teorías de diseñar el controlador de estabilización del sistema en lazo
control avanzado que permiten mitigar los efectos de la cerrado. Se pudo demostrar que el sistema es asintóticamente
incertidumbre del sistema, en las que se encuentran el método estable localmente, alrededor del punto de equilibrio es
H∞ (Francis 1983), método LTF (Loop Tranfer Recovery) inestable, con un dominio de atracción calculable.
(Stein 1981), método de diseño IMC (Internal Model
Control) (Zafiriou 1989), método de Kharatinov (Barmish En este trabajo se presenta el desarrollo de un control robusto
1993), Método de Sintesís-µ (Packard 1993), método GPC a partir de la teoría QFT para la estabilidad un sistema
(Generalized Predictive Control) (Clarke 1988) y el método basado en la estructura de un péndulo invertido, la cual se
QFT (Quantitavive Feedback Theory) (Horowitz 1982), caracteriza por ser un sistema no lineal, de fase no mínima,
teorías que a través de los años han realizado aportes inestable con considerables incertidumbres paramétricas, ya
importantes en el desarrollo de técnicas de control avanzado, que se consideran variaciones del peso, fricción de las ruedas,
aplicados principalmente en las áreas de procesos químicos, largo del robot, cambios del centro de masa e inercia del
robótica, estructuras flexibles y control de aeronaves (Dorato móvil (Jun 2011); lo que hace necesario implementar técnicas
1993). Para el control de sistemas subactuados tipo péndulo de control robusto, en espacial aplicando la teoría de
invertido se han aplicado ampliamente estas técnicas de realimentación cuantitativa QFT. En la sección 2. se
control avanzado, tal como se propone en (Pérez Polo 2012), describen las generalidades de la metodología de control
donde se desarrolló un control de posición por asignación de robusto QFT. En la sección 3. se plantea el prototipo basado
polos más integrador, de un péndulo invertido por medio de en péndulo invertido implementado, el modelo matemático
movimientos caóticos, cuyo objetivo es incrementar la fuerza de la planta mediante el uso de ecuaciones Euler-Lagrange y
548
CHAPTER 15. ROBOTICS
Estabilidad Robusta | | | |
549
CHAPTER 15. ROBOTICS
̈ ̈ ̇ ̇
̈ ̈
( )
̇
[ ]
550
CHAPTER 15. ROBOTICS
[ ]
551
CHAPTER 15. ROBOTICS
( ) ( )( )
Fig. 5. Intercepto de contornos para cada frecuencia de
interés.
Donde representa el controlador diseñado, representa la
A partir de los contornos, se diseña el controlador utilizando planta nominal, es la salida del sistema, es la salida del
la técnica loop-shapping, la cual permite agregar polos y controlador, es la señal de referencia y es la perturbación
ceros de manera que la función del lazo cumpla unas de entrada a la planta. A partir de la ecuación 16 y el
restricciones determinadas. En este caso, se logró cumplir los algoritmo que se propone en (Xue 2007), se determinó que el
requisitos establecidos por las especificaciones de diseño con sistema de control diseñado, aunque es estable externamente,
un margen de fase y ganancia mínimo de y no cumple la condición de estabilidad interna ya que existe
respectivamente, y rechazo de perturbación en las frecuencias cancelación de polos inestables en la función del lazo
de interés, ya que el controlador aportó la fase y ganancia . No obstante se alcanzaron las
suficiente a la función del lazo nominal para que la especificaciones de desempeño planteadas desde el enfoque
respuesta en el diagrama Nichols quedara por encima de la QFT.
intersección de cada contorno. El controlador está constituido
por un integrador, el cual hace que el sistema mantenga un Para validar el controlador se realiza un análisis de las
error de estado estable cercano acero, y un polo y dos ceros respuesta del sistema en lazo cerrado ante señales de prueba,
reales, quienes movieron horizontalmente la función de lazo para determinar tanto en el dominio del tiempo como en el
hasta alcanzar la zona de las especificaciones deseadas. En domino de la frecuencia las especificaciones de desempeño
la Fig. 6. Se observa la respuesta de la función de lazo del controlador. Los límites de las especificaciones aparecen
nominal con el controlador diseñado de la ecuación 15. como líneas punteadas, y la respuesta del sistema en lazo
cerrado debe estar por debajo de estos límites.
552
CHAPTER 15. ROBOTICS
En la Fig. 7 representa el análisis de la estabilidad robusta, en La Fig. 9 muestra la respuesta en el dominio del tiempo de
donde se observa que el controlador diseñado para todas las todas las plantas del espacio de incertidumbre cuando el
frecuencias de interés cumple con los criterios de estabilidad sistema tiene una señal de referencia igual a cero y es
robusta del sistema. sometido a una señal de perturbación de carga tipo escalón
unitario. Para todas las posibles respuestas, se observa que
cumplen con un tiempo de establecimiento inferior a 1s, y la
salida en estado estable es cercana a cero. Eso es coherente
con el análisis que se realizó en el domino de la frecuencia de
la figura 8.
5. RESULTADOS
Se validó el comportamiento del sistema real frente a la
planta simulada, con el fin de asegurar que el controlador
implementado tenga un comportamiento similar al
controlador diseñado, y cumpla con las especificaciones de
desempeño. En la Fig. 10 se aprecia un seguimiento cercano
al 85% del sistema real ante el modelo de simulación. Esto
indica que el uso de ecuaciones de euler-lagrange para
identificar el modelo matemático del robot de autobalance
basado en péndulo invertido, es adecuado y se pueden
conseguir resultados aceptables.
Fig. 8. Análisis de rechazo de perturbaciones en el dominio
de la frecuancia.
Se realizaron pruebas en las cuales se observó el
comportamiento del sistema ante la variación de la referencia
de equilibrio, perturbaciones sobre el cuerpo del péndulo y
553
CHAPTER 15. ROBOTICS
554
CHAPTER 15. ROBOTICS
obtuvo un mejor rendimiento de la evaluación del ángulo del robot desde el punto de partida, siendo este un gran
vertical de inclinación del robot móvil. problema al momento de evaluar el sistema en áreas con poco
espacio y obstáculos sobre la misma, efecto que es
Con relación al dispositivo procesador de datos Beagle Bone eliminado con un controlador extra, encargado del
Black®, se encontró que es un sistema rápido, versátil y con desplazamiento horizontal del carro del robot móvil de
una cantidad aceptable de entradas y salidas analógicas y autobalance basado en péndulo invertido.
digitales, dichas características fueron convenientes para el
desarrollo del sistema, aunque presento sensibilidad al ruido REFERENCIAS
producido por lo micromotores, por ende se hizo necesario
aislar la etapa de potencia de la de control atreves, de un Acosta, R. «Obtención de la ecuación de Euler-Lagrange
puente H, de tecnología Mosfet que requiere bajo consumo utilizando los vectores base y vectores recíprocos.»
de energía, esto puso fin al problema descrito. Artículo de investigaciòn Acosta Ingenierìa , 2004:
pp. 17 - 2.
La identificación matemática del sistema fue en un 90% Barmish, B. R., & Kang, H. I. « A survey of extreme point
acertada, esto se vio reflejado en el comportamiento óptimo results for robustness of control systems.»
del controlador sobre el modelo real, ocasionando que al Automatica vol. 29, no 1, 1993: pp. 13-35.
cambiar valores en las incertidumbres físicas del robot; como beagleboard. https://beagleboard.org. 03 de 20 de 2014.
lo son la masa y la altura, mantuviera su equilibrio vertical y https://beagleboard.org/blog/2014-02-03-project-
estuviera limitado en su posición horizontal. Al realizar las spotlight-robotics-cape/ (último acceso: 30 de 06 de
pruebas físicas de los motores, en cuanto a fuerza utilizando 2015).
un dinamómetro, se pudo establecer que el torque máximo Biernson, G. Principles of Feedback Control. New York:
generado por cada uno de ellos era aproximadamente de 6.9 John Wiley & Sons, 1988.
N.m, esto con el fin de efectuar el respectivo escalamiento de Callister, W. D., & Rethwisch, D. G. Materials science and
la señal de control y limitar la energía transmitida desde los engineering: an introduction . New York: Wiley.,
actuadores al robot móvil. 2007.
Chait, Y., & Yaniv, O. «Multi‐input/single‐output computer‐
aided control design using the quantitative feedback
Ya que toda técnica no robusta de control estima el análisis
theory.» International Journal of Robust and
de una sola planta de un sistema cualquiera, no tiene en
Nonlinear Control, 1993: pp. 47-54.
cuenta factores que pueden variar las condiciones de la
Clarke, D. W. «Applications of Generalized Predictive
misma, como lo son la masa, centro de masa y altura, en el
Control to Industrial Processes.» IEEE Control
caso del prototipo móvil de autobalance, con la técnica de
Systems Society, 1988: pp. 49 - 55.
realimentación cuantitativa QFT, fue posible alterar todos
d'Azzo, J. J., Sheldon, S. N., & Houpis, C. H. Linear control
estos factores y tenerlos por incertidumbres, lo cual genera no
system analysis and design with MATLAB. New
solo una sino un conjunto de plantas para las cuales el
York: Marceld Dekk, 2003.
controlador que se obtiene, sigue siendo eficiente y mantiene
Dorato, P., Tempo, R., & Muscato, G. «Bibliography on
estable el sistema.
robust control.» Automatica, vol. 29, no 1, 1993: p.
201-213.
El controlador QFT diseñado para el modelo del prototipo de Dorf, R. C., & Bishop, R. H. Modern control systems. New
robot móvil basado en péndulo invertido que obtuvo un Jersey: Prentice Hall, 2011.
excelente rechazo de perturbación, rapidez de respuesta y Francis, G. Zames y B. A. «Feedback, Minimax Sensitivity,
estabilidad robusta, fue en última instancia el controlador que and Optimal Robustness.» IEEE transactions on
se implementó, sin embargo fue necesario un control PID de automatic control vol. 28, nº 5, 1983: pp. 585 - 601.
posición, para limitar el área de acción del prototipo, por Frías, O. O. G. «Estabilización del Péndulo Invertido Sobre
tanto, se realiza una fusión de los dos controladores lo cual Dos Ruedas mediante el método de Lyapunov.»
genero una robustez de un 95% aproximadamente, hasta el Revista Iberoamericana de Automática e
punto de poder variar la referencia de equilibrio sin tener Informática Industrial RIAI, 2013: 30-36.
problemas con presencia de perturbaciones en la ruedas y Gera, A., & Horowitz, I. «Optimization of the loop transfer
cuerpo, además de que elimina la importancia de que la function†.» International Journal of Control, 1980:
arquitectura implementada debe tener el peso distribuido pp. 389-398.
sobre el cuerpo. Horowitz, I. «Fundamental theory of automatic linear
feedback control systems.» IRE Transactions on
El control robusto QFT, es eficaz y mantiene el robot móvil Automatic Control, 1959: pp. 5-19.
de autobalance en equilibrio, cumpliendo la especificaciones Horowitz, I. «Quantitative feedback theory.» In IEE
de desempeño a pesar de no cumplir las condiciones de Proceedings D-Control Theory and Applications ,
estabilidad interna a cause de cancelación de polos inestables. 1982: pp. 215-226.
Estas condiciones de funcionamiento están sujetas a la Horowitz, Isaac. «Fundamental theory of automatic linear
arquitectura implementada sobre el cuerpo del robot, ya que feedback control systems.» IRE Transactions on
debe tener el peso distribuido sobre la referencia de balance, Automatic Control, 1959: pp. 5-19.
generando desplazamiento horizontal críticamente en el carro
555
CHAPTER 15. ROBOTICS
556
CHAPTER 15. ROBOTICS
Resumen
El control servovisual de robots manipuladores representa una opción natural en ambientes
no estructurados y es especialmente atractivo cuando la labor a realizar puede expresarse
directamente en coordenadas de imagen. En el presente artı́culo se muestra una propuesta
de control servovisual no calibrado para robots planares en el cual no se requiere el modelo
dinámico del sistema. El método se valida mediante simulaciones numéricas y se compara
con otra metodologı́a reportada en la literatura, obteniendo un buen desempeño por parte
del esquema propuesto.
Keywords:
Robotic manipulators, Visual control, GPI Observers, Robust control, Kinematics
557
CHAPTER 15. ROBOTICS
558
CHAPTER 15. ROBOTICS
559
CHAPTER 15. ROBOTICS
ė1 =e2 (24) Comentario 3.1. El primer término del lado derecho
ė2 =H −1 (f v − C v s) + H −1 − Λė , de (39) fue incluido en este trabajo a diferencia del algorit-
v v w1 (25)
mo presentado en Arteaga-Pérez y Gutiérrez-Giles (2014).
donde e1 , e y Lo anterior se debe a que el amortiguamiento añadido por
el término ê2 depende del ancho de banda del sistema de
w1 , − (H v ÿ r + C v ẏ r + D v ẏ + g v ) . (26) visión, que usualmente suele ser muy pobre con respecto a
la dinámica del manipulador. El inconveniente de añadir
Para emplear los observadores GPI se hacen las siguientes este término disipativo es la necesidad de medición de la
suposiciones Arteaga-Pérez y Gutiérrez-Giles (2014). velocidad articular q̇.
4
Suposición 3.1. El vector w1 puede ser localmente mode-
lado como un vector de polinomios de Taylor dependientes Comentario 3.2. Como se supone que no se conoce exac-
del tiempo, de grado p − 1, más un término residual, i.e., tamente la matriz de rotación de la cámara Rφ , para
implementar la ley de control no se debe de emplear (17)
p−1
sino
X τ = J T R̂φ f v . (40)
w1 = ai ti + r a (t) , (27)
i=0 Utilizando (24), (25), (28)–(31) y (32)–(37) se puede
donde ai es un vector de coeficientes constantes y r a (t) es obtener la dinámica del error de observación
un vector de términos residuales.
4 ¨ẽ1 + λp+1 ẽ˙ 1 + λp ẽ1 = H −1
c (−K v s + K v ẽ2
Suposición 3.2. El vector r a (t) y por lo menos sus prime- −kd R̂φ J −T q̇ − C c s + w̃1
ras p derivadas con respecto al tiempo, existen.
4 = r b (t) + w̃1 , (41)
donde
Dadas las suposiciones anteriores, se puede escribir el
modelo interno para la señal desconocida w1 (t) como
r b (t) , H −1
c (−K v s + K v ẽ1 − C c s)
ẇ1 = w2 (28) + H −1
c − I w̃ 1 . (42)
ẇ2 = w3 (29) Derivando p–veces (41) con respecto al tiempo, se obtiene
..
. (p+2) (p+1)
ẽ1 + λp+1 ẽ1 + ...
ẇp−1 = wp (30) (p)
. . . +λ1 ẽ˙ 1 + λ0 ẽ1 = r a(p) (t) + r (t) . (43)
b
ẇp = r (p)
a (t) (31)
Si se define
Con base en este modelo interno, se propone el observador
ẽ1
de estados .
z , .. , (44)
(p+1)
ê˙ 1 = ê2 − Λe + λp+1 ẽ (32) ẽ1
ê˙ 2 = λp ẽ (33) se puede reescribir la dinámica (43) como un sistema lineal
˙ 1 = ŵ2 + λp−1 ẽ invariante en el tiempo definido por
ŵ (34)
˙ 2 = ŵ3 + λp−2 ẽ
ŵ (35) ż = Az + Br̄ ab . (45)
..
. A continuación se enuncia el resultado principal de este
˙ p−1 = ŵp + λ1 ẽ trabajo.
ŵ (36)
Proposición 3.1. Sea una trayectoria deseada en coorde-
˙ p = λ0 ẽ ,
ŵ (37) nadas de imagen y d acotada con al menos sus primeras
p + 2 derivadas con respecto al tiempo acotadas y la ley
donde ẽ , e1 − ê1 y λ0 , . . . , λp+1 ∈ < n×n
son las matrices
de control (39) en conjunto con el observador (32)–(37) en
diagonales de coeficientes del vector de polinomios
lazo cerrado con (11). Defı́nase una región D ∈ <n(p+1)
como
ρ(σ) = σ p+2 + λp+1 σ p+1 + · · · + λ1 σ + λ0 , (38) n o
con σ ∈ <n . A continuación se propone la ley de control D , z ∈ <n(p+1) |kzk ≤ ymax , (46)
basada en las señales obtenidas por el observador
donde ymax es una constante arbitrariamente grande. En-
tonces, es posible encontrar una combinación en las raı́ces
f v = −kd R̂φ J −T q̇ − K v (ê2 + Λe) − ŵ1 , (39) del polinomio (38) para hacer que las variables deslizantes
(s, ṡ), los errores de seguimiento (e, ė, ë) y los errores
donde kd > 0, K v ∈ <n×n es una matriz de ganancias, de observación z definidos en (44) tiendan a una vecindad
positiva definida y R̂φ es el estimado de la matriz de arbitrariamente pequeña alrededor del origen.
rotación de la cámara. 4
560
CHAPTER 15. ROBOTICS
[píxeles]
Para poner a prueba la validez del método propuesto se
realizó una comparación con el algoritmo GPI estándar 200
˙ p−1 = ŵp + λ1 ẽ
ŵ (51) 2200
[píxeles]
2000
˙ p = λ0 ẽ ,
ŵ (52)
1800
en conjunto con la ley de control (c.f. (39))
1600
f v = Ĥ v −2ξωn ê2 − ωn2 e − ŵ1 , (53) 0 10 20 30 40 50 60 70 80
0 10 20 30 40 50 60 70 80
Se utilizó el modelo de un robot de dos grados de libertad, [seg]
dado en forma paramétrica por (Spong et al., 2006)
h i
θ1 + 2θ2 c2 θ3 + θ2 c2 Figura 3. Cámara calibrada, seguimiento de trayectoria:
q̈ +
θ 3 + θ 2 c2 θ3 deseada (- - -), GPI estándar (—) y PBC-GPI (—).
h i h i
−θ2 s2 q̇2 −θ2 s2 (q̇1 + q̇2 ) θ4 c1 + θ5 c12
q̇ + + 40
θ2 s2 q̇1 0 θ5 c12
h i 20
θ6 q̇1 + θ8 sign(q̇1 )
[píxeles]
=τ. (54)
θ7 q̇2 + θ9 sign(q̇2 ) 0
−20
Con el objeto de llevar a cabo una simulación lo más rea-
−40
lista posible, se utilizaron los valores reportados en Reyes 0 10 20 30 40 50 60 70 80
y Kelly (2001).
40
561
CHAPTER 15. ROBOTICS
REFERENCIAS
◦ Arteaga-Pérez, M.A. (1998). On the properties of a dyna-
Figura 5. Cámara rotada 5[ ], seguimiento de trayectoria:
deseada (- - -), GPI estándar (—) y PBC-GPI (—). mic model of flexible robot manipulators. ASME Jour-
nal of Dynamic Systems, Measurement, and Control,
600
120, 8–14.
500 Arteaga-Pérez, M.A. y Bueno-López, M. (2015). 3D visual
servoing control for robot manipulators without parame-
400 tric identification. IEEE Latin America Transactions,
13(3), 569–577.
300
Arteaga-Pérez, M.A. y Gutiérrez-Giles, A. (2014). On the
[píxeles]
200
GPI approach with unknown inertia matrix in robot
manipulators. International Journal of Control, 87(4),
100 844–860.
Bueno-López, M. y Arteaga-Pérez, M.A. (2012). Fuzzy
0
vs nonfuzzy in 2D visual servoing for robot manipula-
−100
tors. International Journal of Advanced Robotic Sys-
1600 1700 1800 1900 2000
[píxeles]
2100 2200 2300 2400 tems, 10(1), 1–11.
Bueno-López, M. (2012). Control servovisual de robots
manipuladores en 3D. Dr. eng., Universidad Nacional
Figura 6. Cámara rotada 45[◦ ], seguimiento de trayectoria: Autónoma de México, Avenida Universidad 3000, Co-
deseada (- - -) y PBC-GPI (—). yoacán, México, D.F.
Chaumette, F. y Hutchinson, S. (2007). Visual servo
una pequeña rotación de la cámara, el desempeño del GPI control part ii: Advanced approaches. IEEE Robotics
estándar se degrada bastante como puede apreciarse en and Automation Magazine, 109–118.
la Figura 5. Como puede verse en esta misma figura, el Gutierrez-Giles, A., Arteaga-Perez, M.A. y Sira-Ramirez,
desempeño del PBC–GPI siguió siendo parecido al caso H. (2016). Generalized proportional integral observer-
de perfecta calibración de la cámara. based force control in robot manipulators. REVISTA
Por último, si la cámara está rotada un ángulo considerable IBEROAMERICANA DE AUTOMATICA E INFOR-
como en el caso (iii), el sistema con controlador GPI MATICA INDUSTRIAL, 13(2), 238–246.
estándar se vuelve inestable. En contraste el controlador Hutchinson, S., Hager, G.D. y Corke, P.I. (1996). A
PBC–GPI servovisual propuesto en este trabajo sigue tutorial on visual servo control. IEEE Transactions on
manteniendo un buen desempeño, como se muestra en la Robotics and Automation, 12 (5), 651–670.
Figura 6. Kelly, R. (1996). Robust asymptotically stable visual
servoing of planar robots. Robotics and Automation,
5. CONCLUSIONES IEEE Transactions on, 12(5), 759–766.
Reyes, F. y Kelly, R. (2001). Experimental evaluation of
El esquema de controlador–observador propuesto en este model–based controllers on a direct−drive robot arm.
trabajo tiene algunas ventajas sobre los existentes en la Mechatronics, 11, 267–282.
literatura. Primero, no requiere conocer el modelo dinámi- Sciavicco, L. y Siciliano, B. (1996). Modeling and Control
co del manipulador, lo que resulta en una implementa- of Robot Manipulators. McGraw-Hill.
ción más sencilla. Además es fácil de sintonizar, lo que Sira-Ramı́rez, H., Ramı́rez-Neria, M. y Rodrı́guez-Ángeles,
también puede representar una ventaja en la práctica. A. (2010). On the linear control of nonlinear mechanical
Debido a que el diseño está inspirado en el control basado systems. 49th IEEE Conference on Decision and Con-
en pasividad, a diferencia de los esquemas basados en el trol, Atlanta, GA, USA.
principio de equivalencia cierta, el controlador no cancela Slotine, J.J.E. y Li, W. (1987). On the adaptive control
dinámicas beneficiosas para la estabilización del sistema, of robot manipulators. The International Journal of
e.g., términos disipativos, por lo que se logra un mejor Robotics Research, 6(3), 49–59.
desempeño comparativamente. La robustez del algoritmo Spong, M.W., Hutchinson, S. y Vidyasagar, M. (2006).
se puso a prueba mediante simulaciones numéricas, en el Robot Modeling and Control. John Wiley and Sons.
562
CHAPTER 15. ROBOTICS
Resumen: En este trabajo se aborda el problema de diseño de trayectorias caóticas en robots móviles. En
particular, con base en el mapa de Hénon se inducen comportamientos caóticos en las velocidades
lineales y angulares de un robot móvil diferencial tipo Khepera III, con propósitos de patrullaje
aprovechando las propiedades del caos. Se reportan tanto resultados numéricos como experimentales. Por
último, se muestra que las trayectorias del robot Khepera III son impredecibles.
Palabras clave: Robot móvil diferencial, caos, diseño de trayectorias, mapa de Hénon, aplicaciones de
robots caóticos.
comportamiento caótico en las velocidades de sus ruedas y no
1. INTRODUCCION
es resultado directo de la manipulación de su posición.
Los estudios de comportamientos de búsqueda de alimentos El trabajo se organiza de la siguiente manera: En la sección 2
en grupos de animales (especialmente de las hormigas), son se discute acerca de los sistemas caóticos y sus propiedades,
de gran ayuda en la solución de muchos problemas, sobre describiendo con detalle el mapa de Hénon. En la sección 3
todo de optimización. Recientemente, los biólogos han se describe el modelo matemático del robot diferencial,
descubierto que las hormigas realizan actividades de hablando de las características del robot Khepera III, el cual
búsqueda de manera caótica (Li, et al., 2014). será utilizado en la etapa numérica y experimental de este
Tratando de imitar este comportamiento, se han realizado trabajo. En la sección 4 se describen los experimentos
diversos trabajos en el diseño de trayectorias caóticas para realizados y se muestran los resultados numéricos y
robots móviles. Algunos utilizan sistemas caóticos junto con experimentales obtenidos. Finalmente, en la sección 5 se
las ecuaciones de comportamiento de un robot diferencial mencionan las conclusiones y el trabajo futuro.
para obtener un modelo matemático que le brinda a este un
movimiento caótico (Nakamura & Sekiguchi, 2001); otros 2. SISTEMAS CAOTICOS
(Martins-Filho & Macau, 2006) utilizan el conocido mapa La teoría de caos puede definirse como el estudio cualitativo
estándar, el cual es un sistema caótico simple discreto de dos del comportamiento dinámico aperiódico mostrado por
dimensiones, para generar una serie de puntos localizados sistemas deterministas no lineales. Puede ser observado en
caóticamente en un plano para ser seguidos por el robot; gran cantidad de sistemas naturales e ingenieriles.
mientras que otros (Volos, et al., 2012) emplean un generador Entre las principales propiedades de los sistemas caóticos se
aleatorio de bits caótico. También ha habido trabajos, (Curiac encuentran las siguientes:
& Volosencu, 2014) que estudian la generación de Alta sensibilidad a condiciones iniciales.
comportamientos caóticos de robots móviles durante el Deterministas con comportamiento aparentemente
recorrido de una trayectoria fronteriza.
aleatorio.
Entre las aplicaciones que se dan a los robots con trayectorias
caóticas se encuentran el patrullaje de áreas o fronteras, Oscilatorios, pero no periódicos.
búsqueda de elementos peligrosos (por ejemplo explosivos), Sensibilidad a condiciones iniciales indica que el sistema
rescate, exploración de terrenos inhóspitos, etc. caótico tendrá evoluciones temporales drásticamente
En este trabajo se diseñan trayectorias caóticas en robots diferentes, a pesar que éstas se generen a partir de
móviles. A continuación mencionamos las principales condiciones iniciales ligeramente diferentes. Esta
diferencias a los anteriores trabajos al emplear el mapa de característica es deseable en robots de patrullaje, ya que
Hénon para proporcionar velocidades a las ruedas de un robot ocasionaría que la trayectoria generada caóticamente del
móvil diferencial (Khepera III), de manera que se genere una mismo sea altamente impredecible. Sin embargo, las
trayectoria caótica que permita al robot patrullar un área de trayectorias son deterministas debido a que, conociendo las
trabajo de manera eficiente e impredecible para observadores ecuaciones y las condiciones iniciales del sistema, es posible
externos (intrusos, etc). La principal ventaja del método predecir su comportamiento. Tomando en cuenta estas
utilizado en este trabajo radica en el hecho de que la propiedades podemos decir que un robot, cuya trayectoria sea
trayectoria caótica seguida por el robot emerge del caótica será altamente impredecible para el observador
563
CHAPTER 15. ROBOTICS
externo, pero transparente para el diseñador y para cualquier En el trabajo realizado por Suster-Jadlosvska (2010) se
observador que tenga conocimiento de las ecuaciones y los reporta el siguiente modelo para un robot diferencial tomando
valores de las condiciones iniciales utilizadas para su diseño. en cuenta las siguientes consideraciones:
4. EXPERIMENTACION Y RESULTADOS
El objetivo que se persigue en este trabajo es emplear el mapa
caótico de Hénon (1) para propiciar un comportamiento
caótico a las velocidades lineal ( ) y angular ( ) del robot
Fig. 1. Atractor caótico de Hénon. móvil diferencial descritos por (2).
3. MODELO MATEMATICO DE UN ROBOT Despejando las velocidades de las ruedas izquierda y derecha
DIFERENCIAL de las ecuaciones (3) y (4), obtenemos los histogramas
mostrados en la figura 3. Se puede observar que los valores
Un robot móvil diferencial es aquel cuyo movimiento está
de son mayormente negativos y los de positivos, lo cual
basado en dos ruedas de tracción independiente, colocadas en
es indicio de que las trayectorias del robot se basarán
ambos lados del cuerpo del robot (figura 2). El movimiento
principalmente en giros cerrados hacia la izquierda. Debido a
se consigue aplicando diferentes velocidades a cada una de
esto, a los valores de y se les realiza una dispersión de
las ruedas motrices. De esta forma, se puede hacer que el
manera que el histograma de las velocidades de las ruedas sea
robot avance en línea recta fijando ambos motores a la misma
uniforme, como se muestra en la figura 4. De esta manera, la
velocidad, o que gire en una u otra dirección aplicando
dispersión permite realizar un re-escalamiento de los valores
diferentes velocidades. En este trabajo se usará el modelo
aproximado del robot Khepera III. a un rango utilizable por el robot, limitando sus velocidades
mínimas y máximas que puede alcanzar.
564
CHAPTER 15. ROBOTICS
b)
a)
a)
( ) ( )
( ) ( )
.
Paso 2: ( ) ( ) ( )
( ) ( )
Paso 3: ( )
( )
565
CHAPTER 15. ROBOTICS
Paso 5: ((( ) ( ))
) ,
((( ) ( ))
)
Paso 6: segundos.
566
CHAPTER 15. ROBOTICS
Fig.10. Trayectoria del robot para 500 iteraciones del Fig. 12. Trayectoria del robot generada por las condiciones
algoritmo iniciales del inciso b).
567
CHAPTER 15. ROBOTICS
5. CONCLUSIONES
568
CHAPTER 15. ROBOTICS
569
CHAPTER 15. ROBOTICS
Resumen: Este trabajo aborda el problema de formación de redes de robots móviles sin
colisiones con base en la teorı́a de sistemas complejos. En particular, la formación de 5 robots
móviles conectados en estrella, utilizando el modelo linealizado y controlable obtenido por
retroalimentación dinámica del modelo de un robot diferencial tipo Khepera III. Se diseña una
trayectoria para el robot maestro en forma de “8”, lográndose el seguimiento en formación
por un grupo de robots evitando colisiones. Los resultados se proporcionan en simulaciones
numéricas.
Palabras clave: Formación de robots, robot diferencial Khepera III, sistemas complejos.
1. INTRODUCCIÓN vecinos con los que esten conectados con propósitos del
objetivo de control.
En años recientes, el control de sistemas multiagente ha
atraı́do la atención de las ciencias computacionales y de 2. SISTEMAS COMPLEJOS
la comunidad de control. Con ayuda de las investigaciones
en los campos de sistemas complejos y comportamientos En esta sección, se proporciona breve descripción sobre
colectivos se han propuesto algunos protocolos de control redes dinámicas complejas y las condiciones de sincro-
para lograr el desempeño y beneficios de agentes coordi- nización. En este trabajo se consideran redes dinámicas
nados tal como la formación. Entre las ventajas que pre- complejas de nodos idénticos, linealmente acoplados me-
sentan los sistemas multiagente se encuentran: un grupo diante la primer variable de estado de cada nodo, siendo
aumenta la eficiencia y efectividad de la tarea asignada, el cada nodo un subsistema dinámico n-dimensional descrito
uso de grupos de robots permite enfrentar fallas de mejor como sigue
forma, son capaces de realizar tareas que son imposibles ẋi = f (xi ) + ui , i = 1, 2, . . . , N, (1)
para un solo robot, etc. T
donde xi = (xi1 , xi2 , ..., xin ) ∈ Rn son las variables de
En Adel Abbaspour (2015) se logra formación óptima con estado del nodo i, ui = (ui1 , 0, ..., 0)T ∈ Rn es la señal de
la ayuda de varias técnicas de optimización numéricas. control del nodo i y es definida por
Se utilizan como base robots con ruedas con el propósito XN
de manipular un objeto en común. Finalmente, se utiliza ui1 = c aij Γxj , i = 1, 2, . . . , N, (2)
una aproximación virtual para demostrar cómo se logra j=1
la formación óptima. En Shanying Zhu (2015) utilizan la constante c > 0 representa la fuerza de acoplamiento
el modelo de una partı́cula propulsada para describir los de la red compleja y Γ ∈ Rn×n es una matriz constante
comportamientos colectivos emergentes en un grupo de de conexiones que indica que variables de estado están
agentes móviles. En Yanyan Dai (2015) se plantea una acopladas. Mientras que, A = (aij ) ∈ RN ×N es la
estrategia de formación para múltiples robots con evasión matriz de acoplamiento, la cual representa la topologı́a de
de obstáculos y restricciones de velocidad. acoplamiento de la red compleja. Si existe una conexión
Este trabajo propone la formación de un grupo de robots entre el nodo i y el nodo j, entonces aij = 1; de otra
móviles con evasión de colisiones utilizando la teorı́a de forma aij = 0 para i 6= j. Los elementos de la diagonal de
sistemas complejos. Primeramente se presenta el modelo la matriz de acoplamiento A se definen como sigue
XN XN
matemático de un robot diferencial y como llevarlo a
un sistema equivalente lineal donde es posible aplicar el aii = − aij = − aji , (3)
controlador. Posteriormente se propone el uso de la teorı́a j=1, j6=i j=1, j6=i
de grafos y sistemas complejos para formar una red de 5 sı́ el grado del nodo i es di , entonces aii = −di , i =
robots. Estos robots pueden obtener información de los 1, 2, . . . , N. Sı́ asumimos que la red compleja está conecta-
570
CHAPTER 15. ROBOTICS
571
CHAPTER 15. ROBOTICS
ẋ = x˙1 ,
ẏ = x˙2 , (22)
θ̇ = x˙3 .
572
CHAPTER 15. ROBOTICS
y sign(k) es la f unción signo que para efectos en este c11 = c21 = 0.3, c12 = c22 = 3, δ12x = δ13x = δ14x =
trabajo se utiliza la defición dada en Kwakernaak et al. δ15x = 50, δ12y = δ13y = δ14y = δ15y = 50, r21 = r31 =
(1991), r41 = r51 = r12 = r32 = r42 = r52 = r13 = r23 = r43 =
1, si k > 0, r53 = r14 = r24 = r34 = r54 = r15 = r25 = r35 = r45 = 10,
sign(k) = 0, si k = 0, (30) m = 0.5.
t t
Con (Xd , Yd ) = (4sen( 10 ), 4sen( 20 )) y distancias propues-
−1, si k < 0. tas entre los robots para la formación:
∆12x = −1, ∆13x = 1, ∆14x = −1, ∆15x = 1, ∆12y =
4.2 Formación de una red de robots en topologı́a estrella 1, ∆13y = 1, ∆14y = −1, ∆15y = −1.
Para la formación de una red de robots, se conectarán 5
nodos utilizando el modelo lineal descrito en la sección 3.1.
En la red se usarán conexiones dirigidas en estrella para
tener un mejor funcionamiento del controlador como se
plantea en Martı́nez Clark (2014). La figura 3 muestra el
grafo utilizado, se tiene un robot maestro N1 y 4 robots
esclavos N2, N3, N4 y N5.
573
CHAPTER 15. ROBOTICS
574
CHAPTER 15. ROBOTICS
575
CHAPTER 15. ROBOTICS
Abstract: The main goal of this paper is to achieve asymptotically stable walking for a bipedal robot
over an Aperiodic Gait pattern. The five-link planar bipedal robot has one degree of under-actuation, four
actuators and point feet. Trajectory tracking and multi-orbital stability were performed using Generalized
Proportional Integral (GPI) controller. The stability of the multi-periodical walking was tested through
the computation of Poincaré return maps. This analysis was performed by numerical simulation with
a gait pattern reconfiguration. The results of the simulation show that the GPI control achieved a
performance robust enough to overcome the gait pattern reconfiguration without changes in the control
law with an event-based action. The robot analyzed corresponds to a prototype under development at the
Control Laboratory of the Universidad Nacional de Colombia, Bogotá.
576
CHAPTER 15. ROBOTICS
The bipedal walking is a sequence of changes between single studied as the collision of rigid bodies. This impact phase
and double support. Here, it is assumed that the double support together with the continuous dynamic of the single support
phase is instantaneous and that the end of the support leg has phase form a hybrid model. As proposed by Grizzle et al.
unilateral constraints. In this way it does not have rebound nor (2001) and Westervelt et al. (2003), it is mandatory to take in
slip and its vertical reaction force is repulsive and the lateral consideration external forces affecting the system at the impact
force is inside of the friction cone. This constraint converts time. For this reason the exchange of support leg phase of the
the end of the support leg into a pivot, which transforms the walking was studied with the robot as an unpinned system with
system into a five degrees of freedom mechanism. The robot an augmented model that takes into account a fix point over
has four actuators: two between the torso and the thighs and two the robot’s body, corresponding to pe := [phe pve ]T , where pe
more in the knees. Although the actuators have flexible joints is the position of the end of the support leg, phe refers to the
to the links, here the stiffness of the union is configured to its horizontal coordinate of that point and pve to the vertical one.
maximum value, it in order to avoid the additional dynamic of Hence, the generalized coordinates vector for impact phase is
the strings. As in Tzafestas et al. (1996), Westervelt et al. (2007) qe := [qTs pTe ]T . With the new generalized coordinate vector
and Arcos-Legarda et al. (2016) the Lagrange’s differential the dynamic model is described as:
equation was used to find the model of the robot in the single
support phase to get the Euler-Lagrange expressed as:
De (qe )q̈e +Ce (qe , q̇e )q̇e + Ge (qe ) = Be (qe )u + δ Fext , (2)
Ds (qs )q̈s +C(qs , q̇s )q̇s + Gs (qs ) = Bs (qs )u + δ (qs , q̇s ) + ζ , (1) where δ Fext represents impulsive forces caused by the ground
reaction in the swing leg at the impact time. Based on the
where qs := [q1 q2 . . . q5 ]T is the generalized coordinates concept of momentum conservation, the angular momentum
vector, Ds (qs ) is the inertial matrix, Cs (qs , q̇s )q˙s is a vector of before and after the impact must be the same and even when the
Centripetal and Coriolis forces, Gs (qs ) is a vector of forces angular position remains constant in that instantaneous period
associated to the gravity, Bs (qs )u is a vector of generalized of time, the velocity undergoes a sudden change. It is shown in
forces, δ (qs , q̇s ) takes into account the uncertainties of the the following equation:
model and ζ is an unknown vector of the external disturbances.
− −
De (q+ +
e )q̇e − De (qe )q̇e = Fext , (3)
where (q− − + +
e , q̇e ) and (qe , q̇e ) represent the position and the
velocity just before and after the impact, respectively; Fext is a
forces vector which represents the effect of the ground reaction
on each joint. Let us define F2 := [F2v F2h ]T as the reaction
force in the swing leg, where F2v refers to the tangential reaction
and F2h to the normal one. Following the the procedure of
Westervelt et al. (2007), the principle of virtual work could be
used, which yields:
Fext = E2 (q− T
e ) F2 , (4)
• q− +
e ≡ qe
• The velocity of the end swing leg would be equal to zero
Then the following equation can be extracted:
q1
q2
d p2 ∂ p2 (q−e ) +
= q̇e ,
dt ∂ qe
E2 (q− +
e )q̇e = 0. (6)
−q4 −q3
The Eq. (5) and (6) are collected in a matrix form to obtain
Θ(qs)
+ −1
Fig. 2. Multibody Model of the Bipedal Robot q̇e De (q+
e ) −E2 (q−
e )T De (q−
e )q̇−
e
= , (7)
F2 E2 (q−
e ) 02×2 02×1
As was exposed by Hurmuzlu and Marghitu (1994), the event
of exchange of the support leg produces an impact that is often which can be summarized in
577
CHAPTER 15. ROBOTICS
−1
e = ∆(qe , q̇e ),
− −
q̇+ (8) ξ = D11 (qs ) − D12 (qs )D−1 ∗...
22 (qs )D21 (qs )
n h i
F2 = Σ(q− −
e , q̇e ). (9)
· · · ∗ −D12 (qs )D22 (qs )−1 ξ̃2 (qs , q̇s ) − Ω2 (qs , q̇s ) + . . .
Finally, the hybrid system is described in the Eq. (1) and (8) as: o
· · · + ξ̃1 (qs , q̇s ) − Ω1 (qs , q̇s ) ,(14)
and
q̈s = f (qs , q̇s ) + Ds (qs )−1 Bs (qs )u pv2 6= 0
Σ: (10) −1
e = ∆(qe , q̇e )
q̇+ − −
pv2 = 0 κ (qs ) = D11 (qs ) − D12 (qs )D−1
22 (qs )D21 (qs ) B1 (qs ). (15)
where Based on the simplified model described in the Eq. (13) the
following assumptions are proposed.
f (qs , q̇s ) = Ds (qs )−1 [−C(qs , q̇s )q̇s − Gs (qs ) + δ (qs , q̇s ) + ζ ].
• First, here and now ξ will be considered as a total dis-
turbances vector, in which the endogenous and exogenous
3. CONTROL DESIGN disturbances are lumped.
• Second, the total disturbances ξ and their m derivatives
The control of bipedal robots has several challenges. For in- are considered bounded.
stance, it has to deal with the fact that this system is nonlinear • Finally, it is assumed that each component of the vector ξ
and Multi-Input Multi-Output (MIMO). One of the biggest can be approximated by a time polynomial with order m,
(m+1)
challenges in the control of bipedal robots is that they are hybrid so that the dtd (m+1) ξi ≈ 0, ∀i = 1, 2, 3, 4 .
systems, because of their continuous and discrete dynamic.
Finally, the point-feet structure of our prototype makes it an With the above assumptions, the following control law is pro-
underactuated mechanism with its nominal state as an unstable posed:
system. Bellow a feedback control design is provided to tackle
all the difficulties that bipedal robots with dynamic walking,
u = κ −1 (qs (s)) s2 qd (s) − KGPI (qb (s) − qd (s)) , (16)
like the one studied in this paper faces. First, the Eq. (1) is
simplified as follow: where qd is the references vector for the controlled joints and
KGPI is a diagonal transfer function matrix with the following
Ds (qs )q̈s + Ω(qs , q̇s ) = Bs (qs )u + ξ̃ (qs , q̇s ), (11) structure:
where:
KGPI11 0 0 0
0 KGPI22 0 0
KGPI = ,
0
(17)
Ω(qs , q̇s ) := C(qs , q̇s )q̇s + Gs (qs ), 0 0 KGPI33
0 0 0 KGPI44
ξ̃ (qs , q̇s ) := δ (qs , q̇s ) + ζ .
and
The Eq. (11) is fragmented to separate the actuated part of the αi,n+m sn+m + · · · + αi,1 s + αi,0
system from the underactuated part. Now, let us fragment the KGPIii = , ∀i = 1, · · · , 4.
s m+1 (sn−1 + αi,2n+m−1 sn−2 + αi,2n+m+1 )
position vector and the output matrix as (18)
If the control law in the Eq. (16) is inserted into the Laplace
qb(N−1×1) B1 (qs )(N−1×p) transformation of the simplified model Eq. (13) the close loop
qs := , Bs (qs ) := ,
q(N1×1 ) 0(1×p) ] system takes the following form:
where p is the number of actuators. Once the new structure of
the variables was established, the model was fragmented as: s2 qb (s) = κ (qs (s)) κ −1 (qs (s)) s2 qd (s) · · ·
(19)
· · · −KGPI (qb (s) − qd (s)))] + ξ (s),
D11 (qs ) D12 (qs ) q̈b Ω (q , q̇ )
+ 1 s s = ... by simplifying this equation we get
D21 (qs ) D22 (qs ) q̈N Ω2 (qs , q̇s )
B1 (qs ) ξ̃1 (qs , q̇s )
··· = u+ . (12) s2 qb (s) = s2 qd (s) − KGPI (qb (s) − qd (s)) + ξ (s). (20)
0 ξ̃2 (qs , q̇s )
The Eq. (20) describes the close loop behavior and has a special
The fragmented model representation is used to express the feature that corresponds to transforming of the system into a
model as a function of the actuated variables, which is decoupled one. This feature is important because, based on that
transformation the tracking control problem can be studied as
22 (qs )D21 (qs ) q̈b + Ω1 (qs , q̇s ) + . . .
D11 (qs ) − D12 (qs )D−1 a bunch of Single-Input and Single-Output (SISO) problems,
h i instead of a more complex MIMO system. The fragmentation
· · · + D12 (qs )D−1
22 (qs ) ξ̃ (q ,
2 s s q̇ ) − Ω (q ,
2 s s q̇ ) = ... of the dynamic produces the following system of equations:
· · · = B1 (qs )u + ξ̃1 (qs , q̇s ),
s2 qb,1 (s) = s2 qd,1 (s) − KGPI11 (qb,1 (s) − qd,1 (s)) + ξ1 (s),
then, it is simplified to obtain ..
q̈b = κ (qs )u + ξ , (13) .
where s2 qb,4 (s) = s2 qd,4 (s) − KGPI44 (qb,4 (s) − qd,4 (s)) + ξ4 (s).
578
CHAPTER 15. ROBOTICS
s2n+m + αi,2n+m−1 s2n+m−1 + · · · + αi,1 s + αi,0 ei (s) =
···
· · · = sm+1 ξi (s) sn−1 + αi,2n+m−1 sn−2 + · · · + αn+m+1 , · · ·
· · · ∀i = 1, · · · , 4,
(22)
At the right side of the Eq. (22) the product sm+1 ξi (s) is close
to zero, for which the error’s dynamics take the following form:
579
CHAPTER 15. ROBOTICS
b,2
3.5 3.5
the Poincaré’s method to search for the presence of a periodic
q
q
3 3
orbit in the evolution of the states of a passive bipedal robot.
0.2 0.2
20 20
0.15 0 0.15 0
Based in the fact that our prototype is a point feet robot, the
sl 0.1 −20 d/dt qb,1 sl 0.1 −20 d/dt qb,2 more convenient approach to analyze the global stability of the
robot is the Poincaré’s method. But, with the aperiodic walk
performed in the simulation test in this section, the Poincaré’s
−0.3 0 method can not be applied without several conditions. There-
fore, the method of Poincaré’s section is used to identify peri-
qb,3
qb,4
−0.4 −0.5
odic behavior in the dynamic of the biped at each gait pattern.
−0.5
0.2
−1
0.2 The computation of the return map for the each step length
20 50
0.15 0 0.15 0 is done with the procedure presented in Chen et al. (2007), it
sl 0.1 −20 d/dt qb,3 sl 0.1 −50 d/dt qb,4 is done through the sampling of the states in a time when a
condition called the Poincaré’s section is satisfied. Poincaré’s
section could either be a function of the robot states or can be
Fig. 4. States’ convergence to stable cycles in the changes of determined by external events. Here the Poincaré’s section is
step length determined by the switching foot event, which depends of the
ground level.
there, it is possible to see the evolution from one step length to
another, and additionally it is shown the cycles or stable orbits First, let us define the state vector as x := [qTs q̇Ts ]T , then a
at each step length. sampling done in the Poincaré’s section is divided in two sets,
one for each step length in the simulations, after that, they are
The evolution of the walk is shown in a stick diagram in the Fig. used to build a couple of functions that take values just before
5. The behavior of the robot’s gait with the longest step length the impact time and are defined as
is presented in the left side of the Fig. 5, while in the right side
it is shown the gait pattern for the step length corresponding to
10 cm. xsl− (k + 1) = Psl (xsl− (k)) ∀sl = 10cm, 20cm, (24)
where xsl− (k) are the kth samples of the state vector just before
0.8 0.8
the impact, xsl− (k + 1) are the state vector just before the next
0.7 0.7 impact and Psl (x− (k)) are functions called the Poincaré return
map. The stability test is summarized in the tasks of evaluating
∗
the existence of two fix points xsl− such that
0.6 0.6
0.5 0.5
∗ ∗
0.4 0.4
xsl− (k + 1) = P(xsl− (k)) = xsl− (25)
y (m)
The sets of states of the robot in each switching foot are saved
0 0
in vectors as bellow
−0.1 −0.1
2.8 3 3.2 10.2 10.4 10.6 ∗
x (m) x (m)
xsl− (k − (ρ − 1)) − xsl−
x− (k − (ρ − 2)) − x−∗
sl sl
χsl (k) = .. , (26)
Fig. 5. Evolution of the walk in different gait patterns .
∗
xsl− (k) − xsl−
5.1 Evaluation of the gait stability
where ρ represents the number of switching feet executed in
the simulation. Linear regressions are performed between the
Although the stability of the tracking control can be ensure
with the selection of the right roots for the error’s dynamics,
χsl (k) and χsl (k − 1) values. The result of the regressions give
linear approximations of (24) with the following form
the gait stability depends too of the walking pattern. Several
approaches to deal with the global stability of the walking have
∗ ∗
been proposed, two of the more accepted are: the Zero Moment (xsl− (k + 1) − xsl− ) = Φsl ∗ (xsl− (k) − xsl− ) (27)
Point (ZMP) proposed by Vukobratović and Borovac (2004),
which has been successful testes with robots with flat feet like where the eigenvalues of Φsl must have a magnitude lower than
ASIMO Sakagami et al. (2002); in the other hand it is the one to guarantee the asymptotic stability of the walking. It was
580
CHAPTER 15. ROBOTICS
evaluated and the stability of each gait pattern was proved, so it rejection control of a biped robotic system using high-order
is possible conclude that the robot has stable walk for both step extended state observer. ISA transactions, 62, 276–286.
lengths. McGeer, T. (1990). Passive walking with knees. In Robotics
and Automation, 1990. Proceedings., 1990 IEEE Interna-
6. CONCLUSION tional Conference on, 1640–1645. IEEE.
Sakagami, Y., Watanabe, R., Aoyama, C., Matsunaga, S., Hi-
A time-invariant GPI controller was developed to control a gaki, N., and Fujimura, K. (2002). The intelligent asimo:
planar bipedal robot with point feet structure. The robot keeps System overview and integration. In Intelligent Robots and
stable walking even with gait pattern changes during the robot Systems, 2002. IEEE/RSJ International Conference on, vol-
is in operation. The controller designed achieves stable walking ume 3, 2478–2483. IEEE.
in the reconfiguration of trajectories and it was confirmed by Tzafestas, S., Raibert, M., and Tzafestas, C. (1996). Robust
the Poincaré test at each walking pattern. sliding-mode control applied to a 5-link biped robot. Journal
of Intelligent and Robotic Systems, 15(1), 67–133.
The trajectory generation strategy allows switching from one Vukobratović, M. and Borovac, B. (2004). Zero-moment point-
step length to another in a short period of time. The use of thirty five years of its life. International Journal of Hu-
the planning trajectories strategy permits the changes between manoid Robotics, 1(01), 157–173.
short to long steps and in the opposite way, without the design Westervelt, E.R., Grizzle, J.W., Chevallereau, C., Choi, J.H.,
of a controller for each step length wanted. Thus, the risk to and Morris, B. (2007). Feedback control of dynamic bipedal
fall in an instability state caused by aperiodic gait, produced by robot locomotion, volume 28. CRC press.
the step length changes, were successfully tackled to ensure the Westervelt, E.R., Grizzle, J.W., and Koditschek, D.E. (2003).
dynamic walking stability. Hybrid zero dynamics of planar biped walkers. Automatic
Control, IEEE Transactions on, 48(1), 42–56.
REFERENCES
Arcos-Legarda, J., Cortes-Romero, J., and Tovar, A. (2016).
Active disturbance rejection control based on generalized
proportional integral observer to control a bipedal robot with
five degrees of freedom. In American Control Conference
(ACC), 2016. IEEE.
Arcos-Legarda, J., Tovar, A., Cortés, J., Dı́az, H., and
Sarmiento, L. (2013). Multivariable gpi control of a gait
exoskeleton for people with walking disabilities. In II In-
ternational Congress of Engineering Mechatronics and Au-
tomation (CIIMA).
Chen, H. et al. (2007). Passive dynamic walking with knees:
A point foot model. Ph.D. thesis, Massachusetts Institute of
Technology.
Chevallereau, C., Grizzle, J.W., and Shih, C.L. (2009). Asymp-
totically stable walking of a five-link underactuated 3-d
bipedal robot. Robotics, IEEE Transactions on, 25(1), 37–
50.
Grizzle, J.W., Abba, G., and Plestan, F. (2001). Asymptotically
stable walking for biped robots: Analysis via systems with
impulse effects. Automatic Control, IEEE Transactions on,
46(1), 51–64.
Grizzle, J.W., Chevallereau, C., Sinnet, R.W., and Ames, A.D.
(2014). Models, feedback control, and open problems of 3d
bipedal robotic walking. Automatica, 50(8), 1955–1988.
Grizzle, J., Westervelt, E., and Canudas-de Wit, C. (2003).
Event-based pi control of an underactuated biped walker.
In Decision and Control, 2003. Proceedings. 42nd IEEE
Conference on, volume 3, 3091–3096. IEEE.
Hamed, K.A. and Grizzle, J.W. (2013). Robust event-based sta-
bilization of periodic orbits for hybrid systems: Application
to an underactuated 3d bipedal robot. In American Control
Conference (ACC), 2013, 6206–6212. IEEE.
Hill, J. and Fahimi, F. (2015). Active disturbance rejection for
walking bipedal robots using the acceleration of the upper
limbs. Robotica, 33(02), 264–281.
Hurmuzlu, Y. and Marghitu, D.B. (1994). Rigid body collisions
of planar kinematic chains with multiple contact points. The
international journal of robotics research, 13(1), 82–92.
Martı́nez-Fonseca, N., Castañeda, L.Á., Uranga, A., Luviano-
Juárez, A., and Chairez, I. (2016). Robust disturbance
581
CHAPTER 15. ROBOTICS
Resumen:
Dos de los problemas de control en ingenierı́a, es el diseño y la sintonización para satisfacer cierto
desempeño en una planta o sistema. El uso eficiente del controlador radica principalmente en
la correcta sintonización. En este artı́culo se está interesado en el problema de sintonización del
control PD en el espacio de operación para un robot móvil omnidireccional con base en el método
de optimización y su solución se proporciona a través del algoritmo de evolución diferencial.
Se analizan, comparan y discuten los resultados obtenidos en simulación y experimental
dando como resultado un comportamiento adecuado en simulación pero un pobre desempeño
en los resultados experimentales. Con el propósito de mejorar el comportamiento en forma
experimental se mencionará algunas direcciones de investigación.
Keywords: Sintonización óptima, control PID, evolución diferencial, robot móvil (3,0).
582
CHAPTER 15. ROBOTICS
√ √
estructura-control de sistemas mecatrónicos (Villarreal- β̄1 = − sin x3 +√ 3 cos x3 β̄2 = sin x3 + √3 cos x3
Cervantes et al., 2010, 2014), entre otros. β̄3 = cos x3 + 3 sin x3 β̄4 = cos x3 − 3 sin x3
Por tal motivo, en este trabajo se propone un método En el problema de optimización se desea encontrar la mejor
de optimización a través del planteamiento formal de un solución en los parámetros del sistema de control. Por tal
problema de optimización para la sintonización del sistema motivo, el vector p de la ecuación (6) se considera como el
de control de un robot móvil omnidireccional. La idea vector de variables de diseño en donde se involucran a los
principal del trabajo es proporcionar una visión general de parámetros del control PD del RMO.
la problemática que emerge al sintonizar las ganancias del
sistema de control fuera de lı́nea y a través de un método
de optimización. p = [kp1 , kp2 , kp3 , kd1 , kd2 , kd3 ]T (6)
La estructura de este artı́culo se organiza de la siguiente
manera: En la sección 2 se formula y plantea el método 2.2 Función objetivo
de sintonización para el robot móvil omnidireccional. El
algoritmo que soluciona el problema de optimización se Con el propósito de garantizar una precisión en el segui-
describe en la Sección 3. Los resultados numéricos y miento de la trayectoria y un consumo de energı́a relativa-
experimentales se discuten en la Sección 4 y finalmente mente bajo, se propone como función objetivo optimizar
en la Sección 5 se proporcionan las conclusiones y trabajo error de posición, error de orientación (e) y el par de en-
a futuro. trada (u) para cada rueda del RMO en el espacio de coor-
denadas cartesianas, dado por la ecuación (7). Ası́ mismo
se incluye el factor de ponderación µ para resolver el POD.
2. MÉTODO DE OPTIMIZACIÓN PARA LA
Por último, para normalizar las unidades en los términos
SINTONIZACIÓN DEL SISTEMA DE CONTROL DEL
(e1 , e2 , e3 ), se realiza la conversión del error angular a un
ROBOT MÓVIL OMNIDIRECCIONAL
error lineal en e3 obteniendo el factor L2 , donde L es la
distancia del centro del robot móvil a la llanta.
Para sintonización óptima del controlador se plantea un
problema de optimización dinámica (POD) con el propósi- J¯ = µ1 J¯1 + µ2 J¯2 (7)
to de encontrar los parámetros del control PD en el espacio donde:
de operación de un robot movil omnidireccional (RMO) Z tf Z tf Z tf !
considerando una trayectoria predefinida, a su vez garan- J¯1 = 2
e1 + 2
e2 + L 2
e23 dt
tizar el mı́nimo error de posición, orientación y consumo 0 0 0
Z Z Z !
de energı́a. Formalmente el problema de optimización se tf tf tf
establece en las ecuaciones (1)-(3), donde J¯ ∈ R es la fun- J¯2 = u21 + u22 + u23 dt
ción objetivo a minimizar, p∗ es el vector de las variables 0 0 0
de diseño, en gi ∈ R se encuentran las restricciones de
desigualdad y hi ∈ R las restricciones de igualdad, del 2.3 Restricciones de diseño
problema de optimización.
Z tf Para un POD, es necesario contemplar el modelo dinámico
M ∗in ¯
Jdt (1) en el espacio de estados del RMO (Villarreal-Cervantes
p 0
et al., 2015) como una restricción activa en el problema de
Sujeto a: optimización dada por ẋ = f (x) + g(x)u.
gi (t) < 0 ∀ i = 1, 2, ..., o (2)
Por otro lado, la función de epicicloide se establece como
hi (t) = 0 ∀ i = 1, 2, ..., p (3)
la trayectoria a seguir altamente no lineal incluido en
el problema de optimización como una restricción de
2.1 Variables de diseño igualdad. Su parametrización se observa en la ecuación
(8).
En esta investigación, se propone el controlador PD en
el espacio de operación para un RMO, el cual se observa
en la ecuación (4), donde kp = diag[kp1 , kp2 , kp3 ] ∈ R3×3 x̄d = 0,8181 cos(2πf t) + 0,1818 cos(9πf t)
ȳd = 0,8181 sin(2πf t) − 0,1818 sin(9πf t) (8)
y kd = diag[kd1 , kd2 , kd3 ] ∈ R3×3 son las matrices con
φ̄d = 0,4363 sin(3,33πf t)
ganancias de dicho controlador, J = R3×3 dada por la
ecuación (5) es la matriz Jacobiana que relaciona las Para que el problema de optimización obtenga mejores
fuerzas y pares ejercidos en el robot móvil con las fuerzas resultados, se propone utilizar el polinomio de Bézier de 5to
ejercidas por las ruedas del robot móvil, e ∈ R3 es error orden con el propósito de generar una trayectoria suave del
de posición (lineal y angular) y ė ∈ R3 es la velocidad del punto de inicio del RMO al punto inicial de la trayectoria
error. parametrizada de la función epicicloide. La ecuación que
u = J T (kp e + kd ė) (4) describe el polinomio de Bézier está dada por la ecuación
(9) tal que ∆i = tt2i −t
−t1 donde: ti es el tiempo, t1 = 0 es
1
β̄1
el tiempo inicial de la trayectoria Bézier y t2 = 3 es su
2
3 sin x3 − β̄2
3 3 respectivo tiempo final.
J = β̄3
− 23 cos x3 β̄4 (5)
3 3
1 1 1
3L 3L 3L
donde: ϕi = ∆5i (126 − 420∆i + 540∆2i − 315∆3i + 70∆4i ) (9)
583
CHAPTER 15. ROBOTICS
4. RESULTADOS
El algoritmo comienza con una población de NP indivi-
duos. La población inicial distribuye a los individuos alea- Para llevar acabo las pruebas en simulación se utilizaron
toriamente por el espacio de búsqueda. Las generaciones los parámetros cinemáticos y dinámicos del prototipo real
siguientes se denotan como: G = 0, 1, 2, ..., Gmax . Cada del robot móvil omnidireccional que se encuentra en el
individuo está representado con un vector D-parámetros laboratorio de Mecatrónica del “CIDETEC-IPN” en la
(variables de diseño p de la ecuación (6)), resultando: Ciudad de México, donde los parámetros de dicho robot
xi,G = x1i,G , ...xD
i,G . se observan en la Tabla 1. Estos parámetros son necesarios
Después de obtener la población inicial se inicia un proceso para la simulación en MATLAB del modelo dinámico del
iterativo en el que se aplica a cada individuo los operadores robot móvil. Se utiliza el método de integración de Euler
básicos de mutación, cruza y selección. con la condición inicial x0 = 0 ∈ R6 , el tiempo de
integración ∆t = 0,005s y el tiempo final de sesenta y
3.1 Mutación tres segundos.
El operador de mutación conocido como ED/Rand/1/Bin Tabla 1. Parámetros del robot móvil omnidi-
es uno de los más utilizados en la literatura, donde la reccional (Peñaloza-Mejı́a et al., 2015).
mutación se da generando un individuo nuevo para cada P arámetro Descripción V alor U nidades
individuo de la generación anterior, tal como se observa en r Radio de la llanta 0.0625 m
la ecuación (10). L Dist del centro a la llanta 0.287 m
ui,G+1 = xr1 ,G + F (xr2 ,G − xr3 ,G ) (10) m M asa del robot móvil 16.31 kg
kg
J Inercia de la llanta 5.82E −4
Donde los subı́ndices r1 , r2 , r3 son tres individuos de la ge- m2
kg
Iz Inercia del robot móvil 0.51
neración G aleatoriamente elegidos y diferentes entre ellos. m2
584
CHAPTER 15. ROBOTICS
Con el propósito de visualizar el desempeño del robot derivativa de los estados y posiblemente a la imprecisión
móvil omnidireccional para el seguimiento de una tra- de los parámetros del robot.
yectoria altamente no lineal, se implementa para su so-
lución el algoritmo presentado en la Sección 4. Para el El error producido entre la posición y orientación durante
algoritmo es necesario definir los parámetros siguientes: el seguimiento de trayectoria se observa en las Figs. 1d y
1e, donde para el caso de simulación prevalece un error
los lı́mites del vector de variable de diseño p, los cua-
menor en comparación con el experimental, esto conlleva
les se establecen como pmin ∈ [0, 0, 0, 0, 0, 0] y pmax ∈
a que aún existe problemáticas para pasar las soluciones
[1000, 1000, 1000, 1000, 1000, 100], la condición de paro re-
en simulación a la práctica real. También se observa que
presentado por el número de iteraciones e1 = 100. En este
existe una relación importante en la asignación de µ1 y µ2 ,
trabajo se utilizó la siguiente propuesta para los valores
donde entre más ponderación tenga el error mejores serán
de las ponderaciones de la función objetivo de la ecuación
los resultados proporcionados por el algoritmo.
(7): donde µ1 = 0,999 y µ2 = 0,001. En el caso de
estudio de diseño de sistemas mecatrónicos a través de un En las Figs. 2a y 2b se muestra el comportamiento que
proceso de optimización, en (Portilla-Flores et al., 2011) se tiene el par de entrada con respecto a cada rueda del
resuelve dicho problema mediante un algoritmo evolutivo RMO en simulación y caso experimental, ası́ mismo, se
considerando cinco corridas para validar el desempeño del observa en la Fig. 2b que la señal de control se satura,
algoritmo, por tal motivo en este trabajo se propone la esto se le atribuye a que en el problema de optimización
misma métrica de cinco corridas del algoritmo, con el no se considera el ruido generado por la estimación de la
objetivo de encontrar la mejor solución. velocidad angular de las ruedas. De igual forma, se observa
Los resultados obtenidos se muestran en la Tabla 2. En que la saturación produce sobre impulsos que pasan a 5
la primer columna se muestra el caso de estudio y los N · m, donde para el prototipo experimental que se utiliza
el par de entrada máximo es de 5 N · m para cada motor
valores propuestos en las ponderaciones, en la columna
del RMO. En simulación no sucede dicho comportamiento,
dos se muestran las corridas para cada uno de los casos de
se observa que existe un buen seguimiento de trayectoria
estudio, en las columnas 3,...,8, se observan los resultados
y un par de entrada en el rango de 1 N · m.
óptimos de las ganancias del controlador dados por el
algoritmo de ED, en la columna 9 se observa el valor de 5. CONCLUSIÓN
la función objetivo (J)¯ propuesta para este trabajo, en la
columna 10 se muestra el valor del error producido por el En este trabajo se presenta un método de sintonización
RMO (J¯1 ), en la columna 11 se muestra el par de entrada para el robot móvil omnidireccional con base en la formu-
en las ruedas (J¯2 ) y por último en la columna 12 se muestra lación de un problema de optimización dinámico no lineal.
el tiempo de ejecución del algoritmo para cada corrida de
los casos de estudio. Los resultados en simulación indican que es posible obte-
ner las ganancias del control PD en el espacio de ope-
Al examinar a detalle los resultados obtenidos por las ración con base en un método de optimización, con el
corridas, se visualiza que la quinta corrida es la que mejor propósito de proporcionar un comportamiento aceptable
desempeño presenta y se resalta en color negro. Para las en el seguimiento de una trayectoria altamente no lineal.
corridas de uno a tres sus valores óptimos están muy Sin embargo, los resultados experimentales indican que se
cercanos a su lı́mite superior pmax , llegando a la conclusión requiere de una formulación diferente para poder realizar
que su área de búsqueda está limitada y el algoritmo no una similitud de al menos un 90 % con los resultados en
pudo encontrar mejores soluciones. En las corridas cuatro simulación.
y cinco se visualiza que sus valores óptimos se encuentran
distribuidos dentro de pmin y pmax concluyendo que en el Es importante hacer notar que para una trayectoria lenta
intervalo propuesto se pueden encontrar mejores solucio- y sencilla, como por ejemplo un cı́rculo, este método de
nes. sintonización es viable, sin embargo, para trayectorias
altamente no lineales como las presentadas en este trabajo,
Una vez que se encuentra una buena solución de las ga- la correspondencia de la simulación con la realidad dista
nancias del controlador, se procede a realizar los resultados mucho.
experimentales. Haciendo una comparación entre la simu-
lación y lo experimental, se observa primero la simulación El trabajo futuro de esta investigación es incorporar en
en la Fig. 1a donde se realiza el seguimiento de trayectoria el problema de optimización, la robustez de los resultados
muy similar a la propuesta. Para el caso experimental de obtenidos ante incertidumbres del sistema no lineal mode-
la Fig. 1b se observa que el prototipo del robot móvil lado en forma de ecuación diferencial con el propósito de
bajo circunstancias reales no realiza la trayectoria ade- tener una correspondencia de al menos un 90 %.
cuadamente con las ganancias óptimas encontradas por
el algoritmo. También se muestra el comportamiento de REFERENCIAS
la orientación del RMO en las Figs. 1c y 1d, donde se Albertos, P. (2007). Lights and shadows of the intelligent
observa que en simulación la orientación del robot móvil control. In European Control Conference, 4440–4441.
sigue la trayectoria propuesta con una gran similitud, Algoul, S., Alam, M., Hossain, M., and Majumder, M.
para el caso experimental existen errores muy notables (2011). Multi-objective optimal chemotherapy control
en la orientación del robot móvil. La no correspondencia model for cancer treatment. Medical and Biological
de los resultados experimentales con la simulación, se le Engineering and Computing, 49, 51–65.
atribuye principalmente a las dinámicas no modeladas en Deb, K. (2000). An efficient constraint handling method
la simulación como el ruido en la estimación de la señal for genetic algorithms. Computer methods in applied
mechanics and engineering, 186(2), 311 – 338.
585
CHAPTER 15. ROBOTICS
2 903,879 542,041 804,838 154,291 298,662 6,394 0,070 0,066 0,003 0,94
3 906,479 734,172 466,998 44,889 143,653 5,887 0,008 0,005 0,002 0,95
4 480,954 637,488 272,761 48,838 258,023 3,624 0,027 0,024 0,002 0,94
5 514.785 25.640 582.770 3.637 172.636 5.726 0.002 0.0002 0.002 0.95
P romedio 688,991 472,777 618,704 58,253 239,168 5,540 0,023 0,020 0,002 0,94
(x1 , x2 ) (x1 , x2 )
0.8 0.8
(x̄d , ȳd ) (x̄d , ȳd )
0.6 0.6
0.4 0.4
0.2 0.2
Yw [m]
Yw [m]
0 0
-0.2 -0.2
-0.4 -0.4
-0.6 -0.6
-0.8 -0.8
586
CHAPTER 15. ROBOTICS
dos Santos Coelho, L. and Pessoa, M.W. (2011). A tuning tracking system. Applied Mathematics and Compu-
strategy for multivariable {PI} and {PID} controllers tation, 274, 690 – 699.
using differential evolution combined with chaotic zas- Storn, R. and Price, K. (1997). Differential evolution–
lavskii map. Expert Systems with Applications, 38(11), a simple and efficient heuristic for global optimization
13694 – 13701. over continuous spaces. Journal of global optimization,
Fleming, P. and Purshouse, R. (2002). Evolutionary 11(4), 341–359.
algorithms in control systems engineering: a survey. Villarreal-Cervantes, M.G. (2014). Control pid robusto
Control Engineering Practice, 10(11), 1223 – 1241. de un robot manipulador con base en un problema de
Karer, G. and Skrjanc, I. (2016). Interval-model-based optimización dinámica. Congreso Latinoamericano de
global optimization framework for robust stability and Control Automático, 1428–1434.
performance of {PID} controllers. Applied Soft Compu- Villarreal-Cervantes, M.G., Cruz-Villar, C.A., Alvarez-
ting, 40, 526 – 543. Gallegos, J., and Portilla-Flores, E.A. (2010). Differen-
Peñaloza-Mejı́a, O., Márquez-Martı́nez, L.A., Alvarez, J., tial evolution techniques for the structure-control design
Villarreal-Cervantes, M.G., and Garcı́a-Hernández, R. of a five-bar parallel robot. Engineering Optimization,
(2015). Motion control design for an omnidirectional 42(6), 535–565.
mobile robot subject to velocity constraints. Mathema- Villarreal-Cervantes, M.G., Cruz-Villar, C.A., and
tical Problems in Engineering, 2015, 1. Alvarez-Gallegos, J. (2014). Synergetic structure-
Portilla-Flores, E.A., Mezura-Montes, E., Alvarez- control design via a hybrid gradient-evolutionary
Gallegos, J., Coello-Coello, C.A., Cruz-Villar, C.A., algorithm. Optimization and Engineering, 16(3),
and Villarreal-Cervantes, M.G. (2011). Parametric 511–539.
reconfiguration improvement in non-iterative concurrent Villarreal-Cervantes, M.G., Guerrero-Castellanos, J.F.,
mechatronic design using an evolutionary-based Ramı́rez-Martı́nez, S., and Sánchez-Santana, J.P.
approach. Engineering Applications of Artificial (2015). Stabilization of a (3,0) mobile robot by means
Intelligence, 24(5), 757–771. of an event-triggered control. {ISA} Transactions, 58,
Reynoso-Meza, G., Blasco, X., Sanchis, J., and Martı́nez, 605 – 613.
M. (2014). Controller tuning using evolutionary multi- Zhang, J. and Yang, S. (2014). An incremental-pid-
objective optimisation: Current trends and applications. controlled particle swarm optimization algorithm for
Control Engineering Practice, 28, 58 – 73. eeg-data-based estimation of operator functional state.
Reynoso-Meza, G., Sanchis, J., Blasco, X., and Freire, R.Z. Biomedical Signal Processing and Control, 14, 272 – 284.
(2016a). Evolutionary multi-objective optimisation with
preferences for multivariable {PI} controller tuning.
Expert Systems with Applications, 51, 120 – 133.
Reynoso-Meza, G., Sanchis, J., Blasco, X., and Martı́nez,
M. (2016b). Preference driven multi-objective optimi-
zation design procedure for industrial controller tuning.
Information Sciences, 339, 108 – 131.
Ruano, A.E., Ge, S.S., Guerra, T.M., Lewis, F.L., Princi-
pe, J.C., and Colnaric, M. (2014). Computational inte-
lligence in control. Annual Reviews in Control, 38(2),
233 – 242.
Sabir, M.M. and Ali, T. (2016). Optimal {PID} controller
design through swarm intelligence algorithms for sun
587
CHAPTER 15. ROBOTICS
Resumen: Este artículo describe un sistema basado en visión artificial para el control de trayectoria
de un robot diferencial en un espacio de dos dimensiones. Dicho sistema puede descomponerse en
tres elementos clave: El sistema de visión artificial, las comunicaciones inalámbricas y el control de
la trayectoria. En primer lugar se describe el sistema completo en forma breve, para
posteriormente mostrar el diseño del control y los resultados obtenidos. Se espera que el sistema
sirva como plataforma experimental para el desarrollo futuro de proyectos en los temas de visión
artificial, sistemas multi-robóticos (MRS) y robótica colaborativa.
Palabras clave: Visión artificial, Comunicación Inalámbrica, Control de Trayectoria.
Este documento se encuentra organizado de la siguiente
1. INTRODUCCIÓN
forma: En la sección 2 se describe la plataforma para el
Los sistemas robóticos están inspirados en la naturaleza y control de trayectoria; en las secciones 3, 4 y 5, se explica el
copian el comportamiento de animales como aves, peces, sistema de visión artificial (SVA), el protocolo de
mamíferos e insectos, además de los seres humanos. La idea comunicación inalámbrica y los algoritmos de generación de
central de estos sistemas es que tengan la capacidad de trayectorias y control de posición respectivamente.
ejecutar tareas que simplifiquen nuestra vida o que nos Finalmente en las secciones 6 y 7 se presentan los resultados
reemplacen en aquellas que son demasiado peligrosas o obtenidos, conclusiones y trabajo futuro en este tema.
difíciles de hacer (Kazuo T, 1999).
Para lograrlo, los sistemas robóticos deben poseer
2. PLATAFORMA
instrumentos que les permitan interactuar con su entorno, ya
sea para medir el estado de cierta variable, o para actuar La plataforma de control de trayectoria está constituida por el
sobre la misma y cambiar su estado. Una de las variables más robot diferencial (dos o más robots en futuros trabajos), una
importantes que debe medir y/o manipular un robot es su cámara, un sistema de comunicación inalámbrica y una unidad
posición. Antes de realizar una tarea en la que requiera de procesamiento (computador), Figura 1.
desplazarse por el entorno, este debe ser capaz de identificar
su posición para luego establecer cómo llegar a un
determinado lugar. Por lo tanto, establecer el estado de la
posición inicial y la trayectoria que este robot debe realizar,
es una de las áreas más estudiada por los sistemas robóticos
autónomos, ya sean aéreos, acuáticos o terrestres (Kazuhiro
K, 2007)
En este trabajo se presenta un sistema constituido por un
robot diferencial capaz de recorrer una trayectoria punto a
punto, rechazando perturbaciones en su posición o
desviación. Se presenta la plataforma de control de posición y
generación de trayectorias, integrando las áreas de Visión
Artificial, Procesamiento de Señales, Telecomunicaciones y
Teoría de Control.
Figura 1. Plataforma para el control de trayectoria.
588
CHAPTER 15. ROBOTICS
2.2 Cámara
589
CHAPTER 15. ROBOTICS
Si el valor de la desviación estándar es superior a la cual al ser conectado al computador actúa como coordinador,
mitad de la media (esto implica que los máximos se lo que posibilita usar el máximo nivel de potencia de
encuentran dispersos, debido a la presencia de transmisión al disponer de una fuente de alimentación
verdes que no son exclusivos de la etiqueta), ilimitada permitiendo garantizar un mayor rango de
entonces el valor de umbral será la media, esto con cobertura. El otro tipo de dispositivo, es el XBee-PRO ZB
el fin de tratar de eliminar todo el verde que S2B, el cual fue configurado como dispositivo final en el
realmente no pertenece a la etiqueta. robot. Por otro lado, para la configuración y pruebas de estos
módulos se hizo uso de XCTU que tiene las herramientas
Si el valor de la desviación es igual o inferior a la para un rápido prototipado y puesta a punto de los módulos.
mitad de la media (esto implica que los máximos
están menos dispersos, debido a que los verdes son La trama API para el envío de la información debió ser
únicamente de la etiqueta), entonces, para considerar creada de forma dinámica. Dado que la aplicación de visión
la mayor cantidad de verde se define como umbral la artificial y la interfaz de control de trayectoria fueron
diferencia entre la media y la desviación estándar. desarrolladas en el software Matlab, se usó la misma
plataforma para la creación de las tramas de los módulos
Después de aplicar el proceso de umbralización automático XBee.
anteriormente mencionado las imágenes de la muestra
conservan total o parcialmente el ruido introducido. Para Para la creación de la trama se tuvo en cuenta solo los datos
eliminarlo se clasifican como zonas de menor área que la de posición y la suma de chequeo. Los datos son definidos
etiqueta y el resultado es el mostrado en la Figura 4. por la aplicación de visión artificial, y el valor del checksum
debe ser calculado para cada nueva trama.
Para la recepción de datos, el reto era almacenar toda la trama
de llegada y obtener de ella los campos de interés. Es decir,
de toda la trama recibida solo tomar los valores del comando
enviado (posición). Para esto inicialmente se almacena toda
la trama en un vector, con el fin de evitar pérdidas de bytes.
Posteriormente se obtienen con punteros los valores de
Figura 4. Eliminación de ruido. (a) Etiqueta pre-procesada.
coordenadas de destino, y se efectúa su conversión a enteros
(b) Etiqueta umbralizada. (c) Etiqueta final, luego de eliminar
para facilitar hacer uso de los mismos.
ruido.
590
CHAPTER 15. ROBOTICS
que la distancia entre ellos también lo sea; estos son los dos esfuerzo de control que debe hacer cada motor para
requerimientos principales para diseñar el controlador de minimizar la desviación. Por lo anterior, cada acción de
trayectoria. Como se muestra en la Figura 7, (Xa, Ya) control (ciclo de trabajo) es el error asociado a cada caso (a
corresponden a la posición actual, (Xd, Yd) es la posición cada motor) multiplicado por el controlador así:
deseada correspondiente, VA y VB son la velocidad de los
motores del robot, “alfa” y “d” son la desviación y la
distancia respectivamente.
(1)
Robot
La velocidad del robot se controla por medio del ciclo de Figura 9. Diagrama de bloques del sistema de control de
trabajo de la señal PWM que reciben los motores. Este ciclo trayectoria simplificado.
de trabajo es la acción de control que se genera al comparar
los dos requerimientos (desviación y distancia) con sus
valores actuales. El robot es un sistema MISO con dos 5.1 Modelo matemático del robot
entradas - una salida y con el fin de simplificarlo se hace la
suposición de que los motores siempre tendrán una velocidad Para encontrar el controlador C, primero se debe modelar el
distinta de cero en toda la dinámica de la trayectoria. Lo robot conociendo como cambia su ángulo de giro en función
anterior tiene varias implicaciones. En primer lugar, el robot de las velocidades de cada rueda, y por consiguiente del ciclo
no se detendrá en toda la trayectoria sino que tendrá de trabajo de cada motor. La Figura 10 muestra la dinámica
movimiento continuo (más adelante se estudia como del movimiento del robot.
detenerlos al final de la trayectoria), y en segundo lugar los
dos requerimientos (desviación y distancia) se simplifican a
uno solo. Esto se debe a que cuando se desea controlar el
ángulo y los motores no se detienen, se corrige al mismo
tiempo la distancia “d” entre los puntos. Por lo tanto, no se
hace necesario realimentar la distancia y compararla, por lo
que el diagrama de bloques del sistema de control es más
simple. Para determinar cómo se comporta la señal de error
que ingresa al controlador se debe dividir el ángulo de
desviación en dos casos, ver Figura 8.
(2)
Debido a que el requerimiento es que la desviación sea
mínima, el error es la referencia menos el ángulo de
orientación actual del robot. Así, en cada caso de la Figura 8,
el error es el negativo del ángulo alfa. Cada caso representa el
591
CHAPTER 15. ROBOTICS
Dónde:
( )
( ) ( ) { } (5)
592
CHAPTER 15. ROBOTICS
Dónde:
‖ ‖ (12)
√( ) ( )
( ) ( )
Al aumentar el tiempo de retardo en la realimentación (mayor ( ) (13)
tiempo de procesamiento y envío de datos) se observa un √( ) ( )
aumento en las oscilaciones y en el tiempo muerto, Figura 15.
Esto se debe a que los sistemas con retardo en su respuesta En esta ecuación:
presentan raíces de fase no mínima, que dan lugar a Representa la coordenada en x de la posición deseada
inestabilidad si no se realiza un control apropiado. (Ogata, Representa la coordenada en x de la posición actual
2010) Representa la coordenada en y de la posición deseada
Representa la coordenada en y de la posición actual
Representa la coordenada en x de la posición de
orientación
Representa la coordenada en y de la posición de
orientación
Es el ángulo alfa a controlar
593
CHAPTER 15. ROBOTICS
Figura 16. Fila A, robot con trayectoria estable. Fila B, trayectoria inestable. Fila C, trayectoria estable con oscilación.
Debido a que por medio de la ley de cosenos se obtiene el El robot comienza su movimiento en la posición 1, Figura 6.
ángulo más pequeño entre dos vectores (Fuller, Tarwater), se El sistema de visión genera las coordenadas y las envía al
desconoce el signo de esté, Figura 8. Para determinarlo, se robot, este calcula su desviación mediante (13) y (14),
usa el siguiente producto vectorial: determinando la potencia (ciclo de trabajo) para cada motor
mediante (15). El robot se dirige en línea recta hacia la
‖⃗ ‖ (⃗⃗⃗⃗⃗ ⃗⃗⃗⃗ ) posición 2 y cuando el valor de la ecuación (12) alcanza un
(⃗⃗⃗⃗ (14) umbral mínimo, el sistema de visión le envía la nueva
posición deseada 3, y el robot continúa su desplazamiento.
⃗⃗⃗⃗ ) Para detenerse al final de la trayectoria, el código reconoce la
última posición deseada (en este caso la posición 5. Figura 6)
Luego si ‖ ⃗ ‖ entonces es negativo, y será positivo y usa (12) para determinar cuándo apagar los motores.
en caso contrario. Este resultado es de suma importancia para
la orientación del robot, pues las acciones de control de cada Para validar el modelo del sistema de control se implementa
motor dependen del signo del ángulo , como puede el algoritmo de la sección 5.3 para diferentes valores de y
verificarse en (1) y en la Figura 8. Al reemplazar (10) y (13) de tiempo de retardo. En la Figura 16, se muestra el resultado
en (1) el algoritmo implementado en la tarjeta será entonces: de las pruebas. La rotación estable proporciona la trayectoria
más rápida aunque no la más óptima, se espera trabajar en
( ) ( ) trayectorias optimizadas en el futuro.
( )
√( ) ( )
(15) En el experimento se aplicaron perturbaciones al robot, como
( ) ( ) cambiar su dirección o su posición y pudo notarse que esté
( )
√( ) ( ) recuperó su trayectoria para redirigirse nuevamente al punto
deseado, mostrando así un resultado satisfactorio del control
Donde es adicionada para garantizar el de trayectoria.
requerimiento de que la velocidad de los motores siempre
Se realizaron numerosas pruebas en el control para puntos de
será distinta de cero para cuando .
llegada preestablecidos, registrando el número de veces que
En la sección experimental se entra en detalle sobre el
el robot llego a la posición deseada sin desviarse en
mecanismo de parada del robot.
direcciones aleatorias, los resultados son mostrados en la
Tabla 1.
6. EXPERIMENTO
Número Tiempo promedio
Se realizó el montaje mostrado previamente en la Figura 1 de recorrido [s]
ubicando la cámara a un metro por encima del suelo con el
brazo de una lámpara, asegurando condiciones adecuadas de Éxitos 25 36
luz y ubicando al robot en una superficie sin componentes
Fallos 5 54
importantes de verde. Al iniciar el código en MATLAB se
abre una ventana que muestra la posición actual del robot y Total 30
un puntero para que el usuario pueda crear la trayectoria
punto a punto, escogiendo todas las posiciones deseadas de Tabla 1. Cuantificación de la efectividad del sistema
manera sucesiva, ver Figura 5. El algoritmo de visión diseñado.
artificial es capaz de eliminar todo el ruido y componentes de
verde que no hacen parte de las etiquetas para diferentes
condiciones de iluminación.
594
CHAPTER 15. ROBOTICS
AGRADECIMIENTOS
Este trabajo ha sido financiado por la Universidad Nacional
de Colombia a través del Programa Nacional de Semilleros
de Investigación, Creación e Innovación en la Modalidad de
Proyectos para la introducción en la investigación, creación o
innovación 2013-2015.
REFERENCIAS
Ailon, A., "Control strategies for groups of autonomous
wheeled mobile robots with restricted inputs,"
in Electrical & Electronics Engineers in Israel (IEEEI),
2012 IEEE 27th Convention of , vol., no., pp.1-4, 14-17
Nov. 2012
F. Cádiz, S. A. Hevia, S. A. Reyes, “Mecánica clásica”,
Departamento de física, Pontificia Universidad Católica
de chile, 2013, pp. 27-34.
G. Fuller, D. Tarwater. "Vectores, Planos y Rectas" En:
Geometría analítica. Pearson education. P.314-349.
Jing Zhou; Dejun Mu; Feisheng Yang; Guanzhong Dai, "A
novel approach for analyzing collective dynamics of
large-scale multi-robot system in task allocation," in
Information and Automation (ICIA), 2014 IEEE
International Conference on , vol., no., pp.1137-1142, 28-
30 July 2014 doi: 10.1109/ICInfA.2014.6932820
Karl J. Aström, T. Hägglund, “Control PID avanzado”, Lund
Institute of Technology, Suecia, Pearson education S.A.
2009, pp. 184-188.
Kazuhiro K, “Robotics as Systems Integration”. Integration
Technology. IEEE International Conference on. 2007.
Kazuo T. “Human Friendly Robotics, a Challenge to New
Robot Applications”. International Conference on
Intelligent Robots and Systems. IEEE. 1999, Volume: 2,
Pages: 609 – 609.
595
CHAPTER 15. ROBOTICS
Abstract: En este artículo se expone el trabajo realizado para desarrollar un sistema de control de
movimiento para un manipulador móvil, el cual fue implementado por medio de un servidor ejecutado en
la terminal del sistema operativo Linux Mint instalado en un sistema embebido modelo Fit-PC3i
embarcado en el robot, el cual adicionalmente realiza las funciones de comunicación entre la estación
base y la estación de control. Se desarrollaron leyes de control de tipo discreto para resolver el problema
de regulación y seguimiento de trayectorias, con el fin de facilitar su implementación y simplificar el
desarrollo de los programas, sin embargo el énfasis radica en la programación de los mismos en un
sistema de este tipo.
Keywords: sistema embebido, control discreto, manipulador móvil.
1. INTRODUCCION convergencia asintótica del mismo, centrando su estudio en
los diferentes métodos de control empleados.
Un aspecto de gran interés en el desarrollo de sistemas de
control es el empleo de sistemas embebidos con suficiente En Hendzel Z., et. al. (2011), proponen un algoritmo de
capacidad de cómputo para el cálculo de las diferentes control discreto implementado en un robot móvil de dos
operaciones a realizar. En el caso de Bruzzonea, G., et. al. ruedas, el cual se prueba para mostrar el desempeño a través
(2009), se analizó la posibilidad y validación de emplear de varios experimentos. En Horowitz, B., et. al. (2003) se
sistemas embebidos en aplicaciones practicas de robótica describe detalladamente una metodología de diseño de
mediante el uso de programas desarrollados en distribuciones controladores para vehículos autónomos mediante la
de Linux con las cuales se reportan una gran compatibilidad implementación de sistemas embebidos, a través del cual se
en aplicaciones de automatización. En Suppiah, et. al. (2014), pueden validar su efectividad basado en el tiempo de
se presentan técnicas que relacionan el uso de sistemas computo que proporciona la combinación del control con la
embebidos en el sector académico con el uso de plataformas aplicación. Por esta razón, es posible hablar extensamente de
experimentales, si bien el trabajo está enfocado en el área las ventajas que en años recientes ha aportado el uso de
educativa se evidencia el amplio potencial del uso y software libre para aplicaciones en robótica, puesto que
aplicación de los sistemas embebidos en sistemas robóticos. permiten mediante el uso de algortimos implementados en
C/C++ obtener aplicaciones que no dependan de otros, al
Además que en la literatura, también se encuentran varios realizarse desde cero estos programas pueden ser modificado
trabajos donde el control del robot se realiza de manera o reestructurado cada vez que sea necesario, sin necesidad de
inalámbrica con el fin de ser implementado en un sistema desechar la aplicación que se tenia funcionando; otra
embarcado para resolver el problema de regulación o de característica muy importante se obtiene del tiempo de
seguimiento de trayectorias, como es el caso de Wang Z., et. ejecución ya que puede decirse que es en tiempo real para
al. (2012), en el cual se describe el desarrollo de un cada una de las tareas programadas, siendo los eventos, una
controlador para la estimación de movimiento y su posterior de las técnicas de programación más empleadas. Sin
prueba de estabilidad empleando el método de Lyapunov, los embargo, en ninguno de estos trabajos se presenta a detalle la
resultados se muestran por medio de imágenes obtenidas en arquitectura de hardware o el software desarrollado para la
tiempo real y con simulaciones que demuestran la implementación de los controles reportados.
convergencia de la regulación visual propuesta; por otro lado
en Chen J., et. al. (2005), únicamente se describen tres Por lo que resulto atractivo plantear una propio con el fin de
enfoques a través de los cuales se propone el uso de un validar su efectividad en el manipulador móvil que se tiene,
controlador a partir de una función que garantice la para aprovechar de forma adecuada las características físicas
596
CHAPTER 15. ROBOTICS
del sistema embebido con que se cuenta y la implementación en la estación móvil se encarga de calcular la ley de control
directa de controladores en tiempo discreto que se puede en lazo cerrado que dirige los movimientos de los diferentes
realizar en este tipo de dispositivos, lo que conlleva también a actuadores, tanto del brazo como de la plataforma, para que
una reducción considerable en la cantidad de recursos el manipulador móvil alcance la posición solicitada
computacionales que se requieren al ejecutarse cada uno de empleando un sistema operativo Linux Mint 13 Maya.
los programas desarrollados. En este artículo se presenta el
desarrollo e implementación de una arquitectura de control
que opera desde un sistema embebido, embarcado en un
manipulador móvil teleoperado por medio de un Gamepad.
Los algoritmos de control se desarrollaron en tiempo discreto,
debido a que su implementación se realiza en un dispositivo
digital.
597
CHAPTER 15. ROBOTICS
q1 = a tan 2 ( px , py ) (1)
(
q 3 = a tan 2 D, ± 1- D 2 ) (2)
con
px2 + py2 + pz2 - a22 - a32
D=
2a2 a3
( )
Fig. 3. Leyes de control desarrolladas.
q 2 = a tan 2 px2 + py2 , pz -
(3) 3.1 Control para regulación del móvil
2atan 2 (( a2 + a3 cosq 3 ) , ( a3 sin q 3 ) ) A partir del modelo presentado en (5), se propone una ley de
control de estructura variable que permite resolver el
De igual forma, a partir del modelo cinemático continúo del problema de regulación del robot de acuerdo a su orientación
robot móvil, dado por, inicial. Los detalles de esta propuesta de control se presentan
en Hernández-Oliva, et al., (2014).
Para el primer caso, se busca que el robot se acerque a la
(4)
posición deseada sobre el eje X1, buscando una orientación
final dada por x3=0, cuando esto se logra se conmuta la ley de
control para U1 y el móvil se aproxima al origen, las leyes
obtenidas son,
Orosco (2003), obtuvo el modelo discreto exacto, que se
presenta en (5):
u2 =
( k3 -1) x3 (6)
T
x2 ( k2 - 1)
u1 = (7)
(5) æ é k - 1 ùö
2y sin ç x3 + ê 3 ú÷
è ë 2 ûø
x1 ( k1 - 1)
u1¢ = (8)
æ é k - 1 ùö
2y cos ç x3 + ê 3 ú÷
definiendo la función ψ como:
è ë 2 ûø
Para la segunda estructura de control, se busca que el robot se
ì æT ö acerque a la posición deseada aproximándose primero al eje
ï sin ç u2 ( kT )÷
è ø X2, de manera que tenga una orientación final dada por
ï si u2 ( kT ) ¹ 0
2
ï x3=/2 y se conmuta de nuevo la ley de control para U1 y el
y éëu2 ( kT ) ùû = í u2( kT )
ï T móvil se aproxima al origen.
ï si u2 ( kT ) = 0
ï p p
î 2 + k3 x3 - k3 - x3
u2 = 2 2 (9)
3. ALGORITMOS DE CONTROL DESARROLLADOS T
Debido a que los algoritmos de control deben ser pæ pö
+ k2 ç x2 - ÷ - x2
implementados en el sistema embebido, se desarrollaron 2 è 2ø
controladores discretos a partir del modelo discreto exacto u1 = (10)
æ1 p ö
del brazo antropomórfico y de la plataforma móvil, 2y sin ç x3 (1+ k3 ) + (1- k3 )÷
considerado como un modelo desacoplado al manipulador è2 2 ø
móvil, Fig. 3.
598
CHAPTER 15. ROBOTICS
pæ pö 1
+ k2 ç x2 - ÷ - x2 u2 = é Dx3d - g t ex-3 + ds t- ùû
Të
(17)
2 è 2ø
u1¢ = (11)
æ1 p ö
2y cos ç x3 (1+ k3 ) + (1- k3 )÷
è2 2 ø 3.3 Control de posición para el brazo antropomórfico.
Estas leyes de control discretas fueron programadas en el El control de posición para el brazo manipulador se realizó
embebido, desarrolladas en el programa mover móvil que se utilizando un PID con compensación de gravedad en tiempo
presentará en la sección 4. discreto. Las ecuaciones fueron implementadas en el sistema
3.2 Control para seguimiento de trayectoria para el embebido mediante el programa mover brazo que se
móvil. presentará en la sección 4. El detalle de cómo fue calculado
este control se observa en Hernández-Oliva (2015).
El control por modos deslizantes en tiempo discreto es muy
útil en aplicaciones prácticas debido a la simplicidad de su
implementación, ya que los problemas que se presentan en 4. DESARROLLO E IMPLEMENTACIÓN DEL
tiempo continuo, como el castañeo, pueden ser reducidos y SOFTWARE EN EL SISTEMA EMBEBIDO
hasta eliminados si se garantiza una adecuada selección de
los parámetros de control. A partir del modelo presentado en Hoy en día existe un gran interés en el uso de controladores
(5), se propone una ley de control por modos deslizantes en precisos, en este sentido el uso de sistemas embebidos
tiempo discreto para el seguimiento de trayectorias del móvil, aplicados a robótica ha tenido gran aceptación, puesto que un
la definición de esta ley de control se detalla en Niño-Suarez, solo dispositivo puede sustituir componentes de hardware
et. al. (2012). que requieren mayor implementación proporcionando en
algunos casos peso extra al robot.
A partir del modelo del error dado en (12) y de las superficies
deslizantes definidas en (13), En este trabajo, las leyes de control en tiempo discreto fueron
implementadas mediante programas que son ejecutados en el
æ T ö
ex+1 = ex1 + 2u1y ( u2 ) cos ç ex3 + x3d + u2 ÷
Fit-PC3i.
è 2 ø El primero, tiene como finalidad configurar la red
inalámbrica y la instalación de drivers propios requeridos por
- Dx1d el sistema operativo Linux Mint 13 Maya con el que cuenta el
sistema embebido, el diagrama de flujo del programa de
æ T ö
ex+2 = ex2 + 2u1y ( u2 ) sin ç ex3 + x3d + u2 ÷ configuración de la red se presenta en la Fig. 4, de forma
è 2 ø (12) general.
- Dx2d
s y = e - ex + g e
+
x2 2
-
y x2 (13)
s t = ex+ - ex + g t ex-
3 3 3
2 w1x2 + w1y2
u1 =
y ( u2 )
(16)
599
CHAPTER 15. ROBOTICS
En la Fig. 5, se observa los diagramas de flujo de los En la (Fig. 7), se observa el diagrama de flujo general de este
programas desarrollados para el servidor y el cliente. programa.
Esta función permite la instalación del servidor y la creación Fig. 6. Diagrama de flujo programa Joystick.
de la red de comunicación para el envío y recepción de los
datos. El siguiente programa desarrollado es joystick, se De acuerdo a la información que ingresa del programa
encuentran disponibles versiones de Linux para enlazarse un Joystick, el programa motores determina que acción realizar,
dispositivo de juego, son programas de código abierto. Su si calcular la ley de control para el desplazamiento del móvil
utilizó como base un programa por eventos, cada evento es o para el posicionamiento del brazo. Por lo cual, la operación
empleado para leer información de los diferentes botones del del programa motores tiene dos etapas donde cada una se
control para juego, también de esta forma es posible encarga de calcular la operación respectiva, de acuerdo al
deshabilitar o habilitar botones. En el caso del prototipo tipo de tarea que se desea que el robot realice, desplazarse o
desarrollado se utiliza el Gamepad modelo F710 de la marca posicionar el brazo. Las dos operaciones no se pueden
600
CHAPTER 15. ROBOTICS
601
CHAPTER 15. ROBOTICS
limitado por una frecuencia de muestreo, y por lo tanto se rescate y exploración”. XV Congreso Latinoamericano
puede implementar en una gran variedad de sistemas de Control Automático, CLCA 2012. Lima, Perú,
embebidos, lo que resulta cada vez más atractivo por la Octubre 23-26.
facilidad y simplicidad en la programación y por las Orosco, G. R. (2003). “Modelado, análisis y control de
características de hardware con que cuentan. vehículos articulados y robots móviles. Teoría y
experimentos”, Tesis de doctorado. CINVESTAV.
Los programas de control y comunicaciones se desarrollaron
Hernández-Oliva, Noemi, Niño-Suarez, Paola A., Portilla-
en C y C++, para que fueran ejecutados en el sistema
Flores, Edgar A., (2014). “Esquema discreto de control
embebido embarcado en el móvil; esto se logró gracias a que
de posición para teleoperación de un manipulador
el sistema embebido puede ser conectado directamente a una
móvil”. Second International Conference on Advanced
terminal y a un monitor, razón por la cual es factible verificar
Mechatronics, Design, and Manufacturing Technology -
la operatividad del software desarrollado. Un aspecto de gran
AMDM 2014, pp. 223-228, Bogotá Colombia, Octubre
importancia fue lograr una reducción de las tarjetas
22-24.
electrónicas con que contaba el prototipo previo, ya que fue
Hernández-Oliva, Noemi (2015). “Sistema de control de
factible programar funciones, que requerían un dispositivo
movimiento para un manipulador móvil”. Tesis de
adicional, para que el sistema embebido las realizará,
Maestría en Tecnología de Cómputo. Instituto
obteniendo una reducción en el peso de la plataforma móvil.
Politécnico Nacional CIDETEC.
Como trabajo a futuro, se prevé la implementación de Vaishak, N. & Ram Chandra, C. (2012). “Embedded Robot
cámaras en el robot, que permitan resolver el problema de Control System Based On an Embedded Operating
regulación visual y cuya información también sea procesada System, the Combination of Advanced RISC
en el sistema embebido por medio de la adecuación de Microprocessor (ARM), DSP and ARM Linux”.
módulos adicionales de programación, puesto que el servidor International Journal Of Engineering And Innovative
instalado permite esa conectividad. Technology (IJEIT), 2(6), pp. 143-147.
REFERENCIAS
Bruzzonea, G., Caccia, M., Raverab, G. and Bertone, A.
(2009). “Standard Linux for embedded real-time robotics
and manufacturing control systems”. Robotics and
Computer-Integrated Manufacturing. No. 25: 178–190.
Suppiah, R. and Fauzi, M. B. A. (2014). “Introducing
Embedded Systems Development on a Robotics-Based
Platform”. 2014 International Conference of Teaching,
Assessment and Learning (TALE), pp. 103 – 108,
Wellington, New Zealand, Diciembre 8-10.
Wang Z., Liu Y., Cail B., and Zhao J. (2012). "Planar Region
Alignment based Visual Regulation for Mobile Robot".
Proceedings of IEEE International Conference on
Mechatronics and Automation, pp. 1978 – 1983,
Chengdu, China, Agosto 5-8.
Chen J., Dixon W., Dawson D., and Galluzo T. (2005).
"Navigation and control of a wheeled mobile robot".
Proceedings of the IEEE/ASME International
Conference on Advanced Intelligent Mechatronics, pp.
1145 – 1150, Monterrey, California, USA, Julio 24-28.
Hendzel Z. and Szuster M. (2011). "Discrete neural dynamic
programming in wheeled mobile robot control".
Commun Nonlinear Sci Numer Simulat, 16:2355 – 2362.
Horowitz, B., Liebman, J., Ma, C., Koo, T., Sangiovanni-
Vincentelli, A., & Sastry, S. (2003). “Platform-based
embedded software design and system integration for
autonomous vehicles”. Proceedings Of The IEEE, 91(1),
pp. 198-211.
Sánchez-Cristo A. F., Niño-Suarez P., Portilla E., Avilés O.,
Villegas G., (2013). “Manipulador móvil teleoperado
por un sistema de control discreto”. XI Congreso
Iberoamericano de Ingeniería Mecánica, CIBIM 2013.
La Plata-Argentina, Noviembre 11-14.
Niño-Suárez P. A., Portilla-Flores E. A., Sánchez-Cristo A.
F., Villegas-Medina G. (2012). “Manipulador móvil para
602
CHAPTER 16
SIGNAL PROCESSING
CHAPTER 16. SIGNAL PROCESSING
604
CHAPTER 16. SIGNAL PROCESSING
los cuales están circulando fluidos de dos o más fases. A se dispone de dispositivos confiables y de costo moderado
este tipo de flujos, en el que coexisten dos o más fases, se para la medición directa de las propiedades de interés de
les conoce como fluidos multifásicos. Un ejemplo tı́pico de las pulpas, como su densidad y viscosidad. De aquı́, se tiene
este tipo de fluidos se puede evidenciar en las plantas de que para modelar y representar adecuadamente el com-
procesamiento de minerales y alimentos. En estas indus- portamiento de estos fluidos al interior de una tuberı́a, se
trias, es común procesar el material de interés con agua deben identificar las propiedades de mezcla correctamente
(generalmente) para formar una pulpa o lechada, la cual y conocer los factores que limitan el transporte de estas al
es transportada a las diferentes unidades de procesamiento interior de una tuberı́a.
donde se realiza el tratamiento correspondiente. Ası́, en las
plantas de procesamiento de minerales se suele denominar
como pulpa a la mezcla de partı́culas sólidas (materia Diferentes estudios se han desarrollado para la estimación
prima de interés) suspendidas en un lı́quido. de las propiedades de densidad y viscosidad de fluidos
usando técnicas de procesamiento digital de señales.
Algunos de estos estudios se han enfocado en la estimación
El carácter multifásico de los fluidos puede deberse al de estas propiedades a través de montajes a nivel de labo-
origen mismo del fluido, o a una condición operativa de ratorio, pero no se han encontrado trabajos en que se lleve
la lı́nea de conducción. El primer caso hace referencia este análisis desde el dominio de la frecuencia, en tiempo
a la fuente, es decir, cuando en un tanque previo a la real y para estimaciones en lı́nea. Los resultados obtenidos
lı́nea las materias primas son adicionadas para formar una en los trabajos de Abdallah et al. (2013), Heinisch et al.
mezcla, o existen cambios de fase. La condición operativa (2015), Manzaneque et al. (2014) y Eris et al. (2015), se
por su parte, hace referencia a los cambios de fase que se basan en la aproximación a sistemas resonantes para la
producen al variar alguna de las condiciones de transporte determinación de las propiedades del fluido mediante la
en la lı́nea, como por ejemplo temperatura o presión, que identificación de la frecuencia de corte (wc ) y el factor de
pueden causar cambios de fase parciales durante el trans- calidad (Q) para la respuesta en frecuencia. Sin embargo,
porte del fluido. Con base en lo anterior, y dada la impor- y con un análisis previo de la señal disponible en el proceso
tancia de las lı́neas de conducción como parte fundamental de transporte de fluidos a lo largo de una lı́nea, no fue posi-
de las plantas de procesamiento, resulta necesario identi- ble identificar o reconocer un comportamiento resonante.
ficar y estudiar los aspectos relevantes del fenómeno que
interesan desde la ingeniera de procesos. Dichos aspectos,
se refieren a comprender cómo están viajando los fluidos Para el caso de procesamiento de señales continuas, existen
y cuanta energı́a se requiere para hacerlos mover (∆P ) diferentes técnicas factibles de implementar en tiempo
entre las diferentes unidades de operación (Benretem et al., real, las cuales suministran información suficiente para
2010). ser analizada en lı́nea. Algunas de estas, cómo Wavelet
y la transformada de Fourier en tiempo corto STFT,
permiten obtener la información espectral de señales no
Las propiedades de los fluidos multifásicos afectan direc- estacionarias (Shensa, 1992). Esta última técnica se basa
tamente el comportamiento reológico de estos y el cálculo en la división de la señal original en porciones finitas, sobre
de la caı́da de presión (∆P ) cuando fluyen. Ası́, la impor- las cuales se aplica la transformada de Fourier para obtener
tancia de la reologı́a en el estudio del comportamiento de la información de los componentes espectrales en el mo-
fluidos al interior de una tuberı́a, radica principalmente en mento en que ocurren (Flórez et al., 2009). Sin embargo, y
que dependiendo de la cantidad y forma de los sólidos, los dado el principio de incertidumbre de Heisenberg, la STFT
efectos inerciales de las partı́culas pueden o no dominar presenta problemas de resolución tiempo-frecuencia para
las propiedades de la mezcla. En el caso en que se tiene los cuales se han desarrollado técnicas basadas en análisis
partı́culas muy grandes, los efectos inerciales dominan y de multiresolución (Martı́nez and Castto, 2002).
la viscosidad aparente de la mezcla será muy similar a la
del lı́quido. Por lo tanto, se puede presentar asentamiento
de las partı́culas al interior de la tuberı́a formando un De otro lado, la transformada Wavelet (TW) se funda-
flujo heterogéneo. Caso contrario se tiene cuando las menta en el análisis multiresolución, en el cual la señal
partı́culas son muy finas, puesto que la presencia de estos es analizada para diferentes frecuencias con diferentes
sólidos usualmente produce un incremento significativo en resoluciones (resolución variable), y ası́ dar respuesta a los
la viscosidad de la mezcla comparado con la del lı́quido problemas en resolución de la transformada STFT. Con
transportador. Al interior de la tuberı́a, en este tipo de la transformada Wavelet es posible obtener representa-
ciones adecuadas de las señales tanto en el dominio tem-
fluidos no se depositan las partı́culas y resultan en flujo
poral como en el de la frecuencia de manera simultánea,
homogéneo, pero pasan de un comportamiento de tipo
Newtoniano a no Newtoniano (He et al., 2004). permitiendo identificar los momentos de ocurrencia para
determinadas componentes espectrales, como por ejemplo
los cambios en estado estacionario o transitorios. Ası́, el
Todos los factores mencionados anteriormente, relaciona- principio de operación de la transformada Wavelet con-
dos al cálculo de las propiedades de las pulpas o fluidos siste en una serie de etapas de filtrado y decimación de
multifásicos condicionan los procesos de transporte, y por la señal de interés. De lo anterior, al aplicar la TW a
tanto cualquier variación en dichas propiedades se puede una señal, se está realizando un proceso de filtrado en
ver reflejada como cambios bruscos en las variables de el dominio temporal usando filtros paso-bajo y paso-alto
operación, tales como la caı́da de presión en la lı́nea y que se encargan de eliminar bajas o altas frecuencias de
caudal o flujo volumétrico. Estas últimas son consideradas la señal según corresponda. Este proceso se repite para
normalmente en las plantas de procesamiento, dado que no las señales resultantes del proceso anterior, y se conoce
605
CHAPTER 16. SIGNAL PROCESSING
como etapa de descomposición. El número de veces que En los trabajos de Meyer (1993), Mallat (1989) y Shensa
se repite el proceso de filtrado para la señal original, (1992) se explica con detalle la deducción de la trans-
corresponde al número de niveles de descomposición de formada Wavelet, ası́ como el procedimiento de cálculo,
la señal, obteniendo como resultado un grupo de señales a caracterı́sticas y formas de representación, diferentes al
diferentes bandas de frecuencia que representan la misma banco de filtros.
señal.
606
CHAPTER 16. SIGNAL PROCESSING
607
CHAPTER 16. SIGNAL PROCESSING
ciones (N umIte ). Posteriormente se determina la presión Después de seleccionar los parámetros para el cáluclo
de salida conociendo ρP , µP y QP en cada iteración i, lo de la DWT, se verifica que la longitud de la señal sea
anterior con el fin de determinar la caı́da de presión del igual al tiempo de simulación seleccionado para el análisis
fluido en la lı́nea. Adicional, y previo a la implementación (i = N umIte ), y se procede a calcular la transformada
se selecciona el tipo de Wavelet y el número de niveles en Wavelet discreta, con lo cual se obtienen los coeficientes
los que se desea descomponer la señal para el cálculo de la de la transformada acorde al tipo de Wavelet y niveles
DWT. La selección del tipo de Wavelet y número de nive- seleccionados. Luego, se realiza un tratamiento de los
les se realiza con base el objetivo del análisis, dado que de- resultados previo a obtener el resultado gráfico de la
pendiendo del tipo que se seleccione se fijará la resolución DWT. Teniendo en cuenta que la descomposición de la
temporal, y con base en los niveles se determinará las señal se realiza a diferentes frecuencias de corte, y en
bandas de frecuencia, y por tanto la resolución en este do- cada nivel la longitud de las señales corresponde con la
minio. Para el caso desarrollado en este trabajo, se evaluó mitad de la señal en el nivel anterior, se deben separar
el comportamiento de diferentes tipos de transformada los coeficientes por escala (k) y posteriormente realizar un
Wavelet con el fin de identificar cual de estas permitı́a proceso de remuestreo para llevar cada señal al tamaño de
tener una buena resolución temporal, teniendo en cuenta la señal original. Finalmente se gráfican los resultados de
que los tiempos de perturbación al sistema son conocidos. los coeficientes para su correspondiente escala.
Para el caso de los niveles, se probaron diferentes valores
entre 2 - 6, y se seleccionó aquel que permitiera establecer 5. RESULTADOS.
diferencias entre los dos tipos de perturbaciones aplicadas
al sistema. De lo anterior, para la implementación de la Con el fin de ilustrar el análisis de la respuesta en frecuen-
DWT se seleccionó la función madre Daubechies 2, la cual cia para la señal de caı́da de presión de un fluido ante per-
además de sus propiedades de ortogonalidad y asimetrı́a, turbaciones en sus propiedades (densidad y viscosidad), se
permitió cumplir con el objetivo del presente trabajo. Para presenta en las Figura 5 la respuesta del sistema ante una
más detalle acerca de la función Wavelet seleccionada y perturbación tipo impulso en la densidad y en la viscosi-
sus propiedades se recomienda consultar los trabajos de dad. Para este caso, se consideró como entrada únicamente
(Mallat, 1989), (Meyer, 1993) . una de las propiedades dejando la otra constante.
608
CHAPTER 16. SIGNAL PROCESSING
609
CHAPTER 16. SIGNAL PROCESSING
Fig. 7. Descomposición por niveles usando la transformada Wavelet para la señal de caı́da de presión.
(2003) se presenta un ejemplo de lo mencionado, en el que Lo anterior es posible, si se conoce cuales son las variables
se usa el criterio de umbrales para el filtrado de ruido en susceptibles a perturbaciones, que además pueden generar
diversas aplicaciones. cambios en el proceso.
6. CONCLUSIONES.
Basados en lo anterior, y a partir de los resultados
Con base en los resultados obtenidos y partiendo del obtenidos para los coeficientes de la transformada Wavelet,
conocimiento de la dificultad que se tiene para separar en los cuales es posible distinguir patrones referentes a
los efectos de la densidad y la viscosidad sobre la caı́da cada una de las perturbaciones realizadas, es posible pro-
de presión de un fluido fluyendo, el análisis en frecuen- poner dentro de trabajos futuros una implementación en
cia resulta ser una herramienta de gran ayuda para la lı́nea. Esta última utilizando la metodologı́a propuesta,
identificación de patrones que permita diferenciar dichos pero cambiando en cada instante de tiempo el vector de
efectos, que desde el dominio temporal no son posibles de tiempo a analizar. Es decir, tomando como entrada los
reconocer, cuando no se dispone de equipos de medición valores registrados por los sensores de presión y flujo en el
o estimadores para todas las variables de interés de un proceso real, y desplazando el vector conforme transcurre
proceso. Este caso resulta muy frecuente en los procesos el tiempo. Ası́, a partir de esto también es posible desarrol-
de transporte de fluidos multifásicos, para los cuales no lar un protocolo de detección de perturbaciones, con base
se dispone de dispositivos confiables, o para el caso de en el perfilado de las componentes Wavelet de la señal
que se cuente con ellos, implican altos costos dado los de caı́da de presión. Dicho protocolo puede estar basado
ambientes fuertemente erosivos de los fluidos como las en un método de umbrales o cualquier otro que permita
pulpas minerales. separar el ruido de la señal de los cambios debidos a las
perturbaciones.
610
CHAPTER 16. SIGNAL PROCESSING
611
CHAPTER 16. SIGNAL PROCESSING
Palabras clave: cifrado, telemedicina, caos, mapa logı́stico, análisis de seguridad, señales
electrofisiológicas.
612
CHAPTER 16. SIGNAL PROCESSING
generadas por el cerebro en el orden de los microvolts), esquema propuesto es seguro y eficiente para su uso en
Torres-Garcı́a et al. (2013). telemedicina.
Uno de los objetivos de la criptografı́a es brindar confi- La organización del trabajo es de la siguiente forma: en la
dencialidad, donde básicamente un algoritmo criptográfi- Sección 2 se muestran los detalles del algoritmo de cifrado
co convierte texto claro a texto cifrado, con el uso de caótico propuesto. Los resultados experimentales y análisis
una clave secreta que sólo personas autorizadas conocen. de seguridad se presentan en la Sección 3, con base a
Por otra parte, los sistemas caóticos presentan propieda- simulaciones en MatLab. Finalmente, las conclusiones son
des estrechamente relacionadas con buenos procesos en mencionadas en la Sección 4.
criptografı́a, como en difusión (cambiar el valor de texto
claro), permutación (cambiar la posición del texto claro), 2. ALGORITMO DE CIFRADO PROPUESTO
amplio espacio de claves, complejidad del sistema fuente y
pseudoaleatoriedad, Alvarez y Li (2006). Es por ello, que La arquitectura implementada en el algoritmo de cifrado
en los últimos años se han propuesto varios esquemas de propuesto utiliza dos procesos llamados permutación y
cifrado caótico para biometrı́a, imágenes y texto, ver Cui difusión. Estos procesos consisten en cambiar tanto la
(2010), Murillo-Escobar et al. (2015a), Murillo-Escobar posición como el valor de cada uno de los elementos de
et al. (2015b), Patidar et al. (2009), Zhou et al. (2014), la señal clara, con base a secuencias caóticas generadas
Khan et al. (2007) y Murillo-Escobar et al. (2014a). por el mapa logı́stico.
En la literatura, se han presentado varios esquemas de El mapa logı́stico es un mapa uni-dimensional amplia-
autentificación paciente-médico para telemedicina basados mente utilizado en crecimiento de poblaciones biológicas,
en dos factores de seguridad (cifrado caótico y tarjetas comportamiento complejo, generadores de números pseu-
inteligentes) o tres factores de seguridad (cifrado caótico, doaleatorios y criptografı́a caótica, May (1976). En este
tarjetas inteligentes y biometrı́a), por ejemplo en Jiang trabajo, el mapa logı́stico es implementado como fuente de
et al. (2014), Xie et al. (2013), Lu et al. (2015), Yan secuencias caóticas, el cual esta descrito matemáticamente
et al. (2013) y Chaudhry et al. (2015). Sin embargo, como sigue
los esquemas de autentificación propuestos y el respectivo xi+1 = axi (1 − xi ), (1)
análisis de seguridad se presentan de forma descriptiva a
donde i = 1, 2, ..., j representa el número de iteraciones,
nivel global de autentificación, sin detalles criptográficos
xi ∈ (0, 1) es el estado del sistema, con la condición inicial
del algoritmo de cifrado, ni un análisis de seguridad crip-
x0 ∈ (0, 1), parámetro de control a ∈ (3.999, 4) para
tográfico basado en resultados numéricos. Debido a que
garantizar secuencias caóticas y evitar ventanas periódicas,
se omiten detalles del cifrado caótico y consecuentemente
Pareek et al. (2006).
el análisis de seguridad correspondiente, la seguridad del
cifrado no es verificado ni validado. Se conoce que los sistemas caóticos uni-dimensionales pre-
sentan desventajas criptográficas como poco espacio de
Algunos avances de cifrado caótico de datos clı́nicos se
claves secretas, distribución no uniforme de datos caóticos,
han presentado en la literatura. Recientemente, se han
rangos caóticos discontinuos, entre otros, Arroyo et al.
propuesto esquemas de cifrado de señales clı́nicas como
(2008). Sin embargo, estos mapas de baja dimension tam-
EEG y ECG, basado en mapas caóticos como logı́stico
bién son sumamente atractivos por su simple estructura
y Henon, sin embargo todos ellos carecen de un análisis
que requiere poca memoria y operaciones aritméticas para
de seguridad básico como espacio de claves, sensibilidad
generar datos de forma eficiente, Cristian-Iulian y Vasile-
a la clave, sensibilidad a la señal clara, histogramas,
Gabriel (2007). Por lo tanto, en el algoritmo propuesto,
correlación, análisis diferencial con NPCR y UACI, tiempo
se implementan operaciones para eliminar dichas desven-
de cifrado, entre otros, para demostrar y validar el sistema
tajas con base a trabajo reciente en Murillo-Escobar et al.
criptográfico propuesto, ver por ejemplo Lin (2016), Lin
(2015a) y Murillo-Escobar et al. (2015b).
et al. (2014), Chen et al. (2012), Sufi et al. (2011), Lin
y Wang (2011) y Parveen et. al (2011). Las señales electrofisiológicas tales como el ECG y EEG
son adquiridas de una base de datos de PhysioBank
En este trabajo, se presenta un novedoso algoritmo crip-
ATM, Physionet.org (2016). Las muestras corresponden
tográfico simétrico basado en dos mapas logı́sticos con dis-
a señales reales tanto en amplitudes como en tiempo.
tribución mejorada, una ronda de permutación-difusión,
La duración de las señales obtenidas son de 10 segundos
y una clave secreta de 128 bits representada por 32 ca-
con distinta frecuencia de muestreo Fs (de 100 Hz para
racteres hexadecimales para brindar confidencialidad a
ECG y 160 Hz para EEG, ver fig. 1). La señal de ECG
información clı́nica en aplicaciones de telemedicina, parti-
es obtenida de la base de datos “Apnea-ECG database
cularmente para la transmisión segura de ECG y EEG. El
(apnea-ecg)” y señal “a01” con duración de 10 segundos.
proceso general del algoritmo se basa en trabajos recientes
En el caso del EEG, se obtiene de la base de datos
presentados en Murillo-Escobar et al. (2015a) y Murillo-
”EEG Motor Movement/Imagery Dataset (eegmmidb)”,
Escobar et al. (2015b), en donde se cifra imagen a color
grabación “S001/S001R01” y senal “Fc5” con duracion
RGB y plantilla de huella dactilar, respectivamente. La
de 10 segundos. A diferencia del trabajo presentado en
principal diferencia radica en el pre-procesamiento pro-
Cardoza et al. (2016), donde se capta la señal clı́nica
puesto (transformación) sobre la señal clara para que el
de pulsos cardı́acos directamente del paciente a través
algoritmo realice el adecuado cifrado de cualquier tipo
de un fotopletismógrafo y posteriormente es cifrada, el
de señal electrofisiológica sin importar su naturaleza o
presente trabajo se enfoca únicamente en el cifrado de
su amplitud. Se presenta un análisis de seguridad básico
señales electrofisiológicas humanas y no en la medición
basado en simulaciones en MatLab y se muestra que el
directa del paciente.
613
CHAPTER 16. SIGNAL PROCESSING
Se considera como señal clara a toda aquella señal filtrada, donde i = 1, 2, 3, . . . , `, P ∈ [1, `] es el vector pseudo-
amplificada y digitalizada, la cual puede ser de distin- aleatorio de permutación y round es la operación de
tos tipos como electrocardiograma ECG, electroencefalo- redondeo al valor más cercano. Sin embargo, P contiene
grama EEG, electromiograma EMG, electroretinograma valores repetidos que son determinados por software y
ERG, electrofonograma EPG, electrooculograma EOG, reemplazados por los faltantes de forma automática para
entre otros, cuyas frecuencias de muestreo puede variar. incrementar la seguridad. El vector para el proceso de
Por otra parte, el algoritmo tienen la capacidad para cifrar difusión se determina de la siguiente forma
cualquier tipo de señal electrofisiológica de cualquier am- Di = xL1 ∗ 1000 + Z mod 1 (5)
T −`+i
plitud con poco o mucho ruido, esto por la transformación donde i = 1, 2, 3, . . . , `; Di ∈ (0, 1). La multiplicación
que lleva la señal antes de ser cifrada. Sin embargo, señales por 1000 se utiliza para obtener una distribución de
electrofisiológicas con mucho ruido no son útiles para la datos del mapa logı́stico más uniforme, lo que ayu-
toma de decisiones, por lo que es de vital importancia que da a incrementar la seguridad, Murillo-Escobar et al.
las señales estén bien filtradas antes del cifrado. Se define (2015a). Finalmente, el proceso de cifrado se realiza con
como señal clara a P ∈ R compuesta por dos vectores que la siguiente operación
corresponden a la amplitud A y al tiempo t en segundos,
ambos con una longitud de datos ` = Fs t. El algoritmo de Ci = (AT (Pi ) + Di ) mod 1, (6)
cifrado se aplica únicamente al vector A y consiste en los donde i = 1, 2, 3, . . . , ` y C ∈ (0, 1) es la señal cifrada
pasos descritos a continuación. (criptograma) con precisión decimal de 10−15 .
Paso 5 (Agregar datos): El valor de Pmin , Pmax y Z
Paso 1 (definir la clave secreta): Se utiliza una clave son agregados al final del criptograma en la siguiente
secreta de 128 bits definida por 32 caracteres hexadeci- forma
males de la misma forma presentada en Murillo-Escobar C`+1 = Z,
et al. (2015a), ver tabla 2. Esta se divide en cuatro C`+2 = 1 si Pmin < 0 o C`+2 = 0 si Pmin ≥ 0,
secciones para la inicialización de secuencias caóticas de C`+3 = abs(Pmin )/100000,
dos mapas logı́sticos. Además, con este procedimiento, C`+4 = 1 si Pmax < 0 o C`+4 = 0 si Pmin ≥ 0,
se elimina el espacio de claves pequeño que presentan C`+5 = abs(Pmax )/100000,
los mapas uni-dimensionales. donde abs representa el valor absoluto.
Paso 2 (transformación): El sistema criptográfico (di- Descifrado: El proceso de descifrado consiste en invertir
gital) obtiene la señal clara y determina máximos-mı́ni- todos los pasos del cifrado. Para ello, se debe obtener
mos para transformar las amplitudes de la misma en del criptograma los valores Z, Pmin y Pmax . Con el uso
amplitudes con valores entre (0,1). Para obtener esta de exactamente la misma clave secreta (a nivel de bits),
nueva señal clara transformada, se determina el valor calcular los vectores pseudoaleatorios P y D de la misma
mı́nimo de A con Pmin = min (A), donde Pmin ∈ R es forma en que en el paso 4. El proceso de descifrado se
un escalar que representa el valor mı́nimo en A y min determina como sigue
representa la función para determinar el valor mı́nimo D(Pi ) = (Ci − Di ) mod 1, (7)
en A. También, se encuentra el valor máximo en A con
Pmax = max (A), donde Pmax ∈ R es un escalar que donde i = 1, 2, 3, . . . , `, y D ∈ (0, 1) es la señal clara
representa el valor máximo en A y max representa la transformada descifrada. Para obtener la señal clara con
función para determinar el valor máximo en A. Después, amplitudes originales, se escala D con el uso de Pmin y
se determina la siguiente transformación Pmax .
a)
2
A − (Pmin − 0.01)
AT = , (2)
Amplitud
1
Pmax + 0.01
(mV)
0
sigue −100
Z = Z + ATi + xL2I+1−i mod 1, (3) −200
0 1 2 3 4 5 6 7 8 9 10
Tiempo (seg)
donde i = 1, 2, 3, . . . , `, Z ∈ (0, 1) es una variable inicia-
lizada en cero para incrementar la seguridad y tiene una
Figura 1. Señales electrofisilógicas adquiridas de Physio-
precisión decimal de 10−15 , ATi son los elementos de la
Bank. a) ECG de persona saludable y b) EEG canal
señal clara transformada, xL2 es la secuencia caótica del
Fc5.
mapa logı́stico 2 (se toman los últimos ` valores) y mod
es la operación de módulo. Tabla 2. Llave secreta y su distribución en dos mapas
Paso 4 (cifrado): El mapa logı́stico 1 es iterado T = ` + logı́sticos, Murillo-Escobar et al. (2015a).
1000 con a1 y x10 como se muestra en tabla 2. Se
determina una secuencia para permutación y otra para 3. RESULTADOS EXPERIMENTALES
difusión con (4) y (5), respectivamente.
Un algoritmo de cifrado caótico debe presentar excelen-
Pi = round xL1 T −`+i ∗ (` − 1) + 1, (4) tes propiedades de pseudoaleatoriedad en la información
614
CHAPTER 16. SIGNAL PROCESSING
Llave secreta Parámetro de control Condición inicial criptogramas muy distintos si dos claves secretas muy
32 dı́gitos H1 , H2 , . . . , H32 donde H ∈ [0 − 9, A − F ] parecidas son utilizadas. Esto aporta seguridad contra
hexadecimal
(H1 ,H2 ,...,H8 )10 (H17 ,H18 ,...,H24 )10 ataques diferenciales, donde se busca alguna relación en-
cálculos A= C=
232 +1
(H9 ,H10 ,...,H16 )10
232 +1
(H25 ,H26 ,...,H32 )10
tre dos o más criptogramas cifrados con claves secre-
B= D=
232 +1 232 +1 tas muy parecidas. En la tabla 3, se muestra el coe-
logı́stico 1 a1 = 3.999 + (α1 ∗ 0.001) x10 = β1 mod 1
ficiente de correlación (determinado con Eq. (8)) entre
α1 = (A + B + Z) mod 1 β1 = C + D + Z tres criptogramas de electrocardiogramas ECG1, ECG2 y
ECG3, y la correlación entre tres criptogramas de elec-
logı́stico 2 a2 = 3.999 + (α2 ∗ 0.001) x20 = β2 mod 1 troencefalogramas EEG1, EEG2 y EEG3, generados por
α2 = (A + B) mod 1 β2 = C + D
tres claves secretas “11223344556677889900AABBCCD-
cifrada y además, poseer caracterı́sticas intrı́nsecas para DEEFF”, “11223345556677889900AABBCCDDEEFF” y
resistir los ataques criptoanalı́ticos como búsqueda exhaus- “11223344556677889900AABCCCDDEEFF”. Con base a
tiva, análisis estadı́stico, análisis diferencial, entre otros los resultados, la correlación entre criptogramas es muy
que pueden quebrantar un sistema criptográfico basado en cercano a cero, lo que indica prácticamente nula corre-
caos, Alvarez y Li (2006). lación entre criptogramas y alta sensibilidad a la clave
secreta.
En esta sección, se muestran algunos análisis de seguridad
sobre el esquema de cifrado propuesto como espacio de PN PN PN
claves, sensibilidad a la clave secreta secreta, sensibilidad Corr = q PN
N×
i=0
Pn
(xi ×yi )−
i=0
xi ×
PN
i=0
yi
Pn (8)
a la señal clara, histogramas, correlación y tiempo de N×
i=0
(xi (
)2 −
i=0
xi )
2
× N×
i=0
(yi )2 −(
i=0
yi )
2
0.8
Amplitud
0.6
0.4
3.3 Sensibilidad a la señal clara
0.2
0
0 1 2 3 4 5 6 7 8 9 10
0.6
efecto, se generan dos criptogramas C1 y C2 de dos señales
0.4
0.2
claras muy parecidas con el uso de la misma clave secreta.
0
En la fig. 3 se muestran dos señales claras para el caso de
0 1 2 3 4 5 6 7 8 9 10
Tiempo (seg) ECG, donde únicamente el elemento 100 (en 1 segundo)
es modificado por una milésima. Para determinar que tan
Figura 2. Señales cifradas con el algoritmo propuesto. a) distintos son ambos criptogramas, se utilizan dos cálculos
ECG y b) EEG. llamados NPCR (tasa de cambio de pixel neto) y UACI
(intensidad de cambio promedio unificado), con la ecuación
(9) y (11), respectivamente.
3.1 Ataque exhaustivo Pi=`
W (i)
N P CR = i=1 × 100 (9)
El ataque exhaustivo consiste en probar cada una de las `
posibles claves secretas hasta encontrar la señal clara. Para donde
que el sistema criptográfico pueda resistir a este tipo de 0 si C1 (i) = C2 (i)
ataque, en la actualidad se requiere de al menos 2100 W (i) = (10)
1 si C1 (i) 6= C2 (i)
posibles combinaciones, Alvarez y Li (2006). En el cifrado y
propuesto, se tienen 2128 posibles combinaciones por lo que i=`
se considera que el esquema propuesto puede resistir este 100 X
U ACI = |E1 (i) − E2 (i)| (11)
tipo de ataque. ` i=1
615
CHAPTER 16. SIGNAL PROCESSING
a) a)
2 25
Frecuencia
20
Amplitud
1
(mV)
15
X:g1
Y:g−0.08 10
0
5
−1 0
0 1 2 3 4 5 6 7 8 9 10 −1 −0.5 0 0.5 1 1.5 2
b) b)
2 50
Frecuencia
40
Amplitud
1
(mV)
30
20
0
X:g1
Y:g−0.079 10
−1 0
0 1 2 3 4 5 6 7 8 9 10 −150 −100 −50 0 50 100 150 200
Tiempog(seg)
Figura 3. Señales muy parecidas para análisis diferencial. Figura 5. Histogramas de señal cifrada. a) ECG y b) EEG.
a) ECG y b) Mismo ECG con cambio en 1 seg.
el tiempo que requiere cifrar y descifrar señales con las
Tabla 4. Sensibilidad a la señal clara. caracterı́sticas mencionadas.
Señal NPCR UACI
Tabla 5. Tiempo de cifrado y descifrado.
ECG 99.6019 33.3286
Señal Cifrado (seg) Descifrado (seg)
EEG 99.7507 32.9186
ECG 0.0960 0.1086
3.4 Histogramas EEG 0.1587 0.1648
Sensibilidad - -
100
a clave secreta
√
Sensibilidad - - -
50 a señal clara
√
Error cuadrado - - -
0 medio MSE
−1 −0.5 0 0.5 1 1.5 2 √ √
Tiempo de cifrado - -
√
b) Desviación de - - -
80 porcentaje residual
Frecuencia
60
40 4. CONCLUSIONES
20
0
−150 −100 −50 0 50 100 150 200
Se presento un algoritmo criptográfico basado en mapa
logı́stico para aplicaciones en telemedicina y se motró la
eficiencia y seguridad criptográfica en ECG y EEG, me-
Figura 4. Histogramas de señal clara. a) ECG y b) EEG. diante un análisis de seguridad básico como espacio de
claves secretas, sensibilidad a la clave secreta, sensibilidad
a la señal clara, histogramas y tiempo de cifrado. Los
3.5 Tiempo de cifrado
resultados indican que el algoritmo propuesto puede ser
aplicado en telemedicina para el monitoreo remoto de
Las aplicaciones de telemedicina requieren interacción en- señales electrofisiológicas de forma segura.
tre paciente-médico en tiempo real. Por este motivo, el
tiempo de cifrado y descifrado debe ser mı́nimo para dis- Otros análisis de seguridad, análisis estadı́sticos y me-
minuir el retraso en la transmisión de datos. Con base en diciones métricas sobre la calidad de cifrado, deben ser
simulaciones en MatLab y la función Tic-Toc, se determina aplicados en el esquema propuesto en este trabajo pa-
el tiempo para cifrar y descifrar señales electrofisiológicas ra incrementar la confiabilidad en aspectos de seguri-
con el algoritmo propuesto en este trabajo. La señales dad, como análisis de autocorrelación, frecuencia flotan-
utilizadas tienen 1000 datos en ECG y 1600 datos en EEG, te, análisis estadı́stico de pseudoaleatoriedad con NIST,
con duración de 10 segundos. En la tabla 5 se presenta error medio cuadrado MSE, ataque de sólo texto claro
616
CHAPTER 16. SIGNAL PROCESSING
elegido/conocido, ataque de ruido, entre otros. Además, May R.M. (1976). Simple mathematical models with very
implementar otras señales clı́nicas electrofisiológicas como complicated dynamics. Nature, 261, 459-467.
electromiogramas EMG, eletrooculogramas EOG o elec- Murillo-Escobar M.A., Abundiz-Pérez F., Cruz-Hernández
troretinogramas ERG. También, mostrar la confidenciali- C. y López-Gutiérrez R.M. (2014a). A novel symmetric
dad de información generada por esfignomanómetros digi- text encryption algorithm based on logistic map. Pro-
tal (presión sanguı́nea), estetoscopio digital (aoscultación ceedings of the International Conference on Communi-
en arterias y órganos), oximetro de pulso (saturacion de cations, Signal Processing and Computers, 49-53.
oxigeno en la sangre), desmatoscopio digital (observación Murillo-Escobar M.A., Cruz-Hernández C., Abundiz-
de la piel), sistemas de ultrasonido, entre otros. Pérez F., y López-Gutiérrez R.M. (2014b). Cifra-
do caótico de plantilla de huella dactilar en sistemas
REFERENCIAS biométricos. Congreso Latinoamericano de Control Au-
tomático 2014, 18-23.
Alvarez G. y Li S. (2006). Some Basic Cryptographic Murillo-Escobar M.A., Cruz-Hernández C., Abundiz-
Requirements for Chaos-Based Cryptosystems. Inter- Pérez F., López-Gutiérrez R.M., y Acosta Del Campo
national Journal of Bifurcation and Chaos, 16(8), 2129- O.R. (2015a). A RGB image encryption algorithm based
2151. on total plain image characteristics and chaos. Signal
Arroyo D., Alvarez G., and Fernandez V. (2008). On Processing, 109, 119-131.
the inadequacy of the logistic map for cryptographic Murillo-Escobar M.A., Cruz-Hernández C., Abundiz-
applications. X Reunión Espanola sobre Criptologı́a y Pérez F., y López-Gutiérrez R.M. (2015b). A robust
Seguridad de la Información, 77-82. embedded biometric authentication system based on
Cardoza L., López R.M., Murillo M.A., Mendez R. and fingerprint and chaotic encryption. Expert Systems with
Cruz C. (2016). Secure Transmission of Biomedical In- Applications, 42, 8198-8211.
formation. IEEE Latin America Transactions, sometido. Pareek N.K., Patidar V. y Sud K.K. (2006). Image
Chaudhry S.A., Mahmood K., Naqvi H. y Khan M.K. encryption using chaotic logistic map. Image and Vision
(2015). An improved and secure biometric authentica- Computing, 24, 926-934.
tion scheme for telecare medicine information systems Patidar V., Pareek N.K. y Sud K.K. (2009). A new
based on elliptic curve cryptography. Journal of Medical substitution-diffusion based image cipher using chaotic
Systems, 39(175), 1-12. standard and logistic maps. Communications in Nonli-
Chen C.-K., Lin C.-L., Chiang C.-T. y Lin S.-L. (2012). near Science and Numerical Simulation, 14, 3056-3075.
Personalized information encryption using ECG signals Parveen S., Parashar S. y Izharuddin (2011). Technique
with chaotic functions. Information Sciences, 193, for providing security in medical signals. International
125–140. Conference on Multimedia, Signal Processing and Com-
Cristian-Iulian R. y Vasile-Gabriel I. (2007). Aspects re- munication Technologies, 68-71.
garding chaotic maps hardware implementations. Revue Physionet.org (2016). PhysioBank ATM. Online, disponi-
Roumaine Des Sciences Techniques, 52, 219-227. ble en: https://physionet.org/cgi-bin/atm/ATM [citado
Cui D. (2010). A Novel Fingerprint Encryption Algorithm 20 Mayo 2016].
Based on Chaotic System and Fractional Fourier Trans- Sufi F., Han F., Khalil I. y Hu J. (2011). A chaos-
form. International Conference on Machine Vision and based encryption technique to protect ECG packets for
Human-machine Interface, 168-171. time critical telecardiology applications. Security and
Inzunza-González E. y Cruz-Hernández C. (2013). Double Communication Network, 4, 515–524.
Hyperchaotic Encryption for Security in Biometric Sys- Torres-Garcı́a A.A., Reyes-Garcı́a C.A., Villaseñor-Pineda
tems. Nonlinear Dynamics and Systems Theory, 13(1), L. Y Ramı́rez-Cortés J.M. (2013). Análisis de señales
55-68. electroencefalográficas para la clasificación de habla
Jiang Q., Ma J., Lu X. y Tian Y. (2014). Robust chaotic imaginada. Revista Mexicana de Ingenierı́a Biomédica,
map-based authentication and key agreement scheme 34(1), 23-39.
with strong anonymity for telecare medicine information Vergeles-Blanca J.M. (2011). La telemedicina. Desarrollo,
systems. Journal of Medical Systems, 38(12), 1-8. ventajas y dudas. Búsqueda Bibliográfica. Internet y las
Khan M.K., Zhang J. y Tian L. (2007). Chaotic secure Nuevas Tecnologı́as, 59-61.
content-based hidden transmission of biometric templa- WHO (1998). A health telematics policy in support
tes. Chaos, Solitons & Fractals, 32, 1749-1759. of WHO’s Health-For-All strategy for global health
Lin C.-F. (2016). Chaotic visual cryptosystem using development: report of the WHO group consultation on
empirical mode decomposition algorithm for clinical health telematics. World Health Organization, 11–16
EEG signals. Journal of Medical Systems, 40(52), 1-10. December, Geneva, Switzerland.
Lin C.-F., Shih S.-H. y Zhu J.-D. (2014). Chaos based Xie Q., Zhang J. y Dong N. (2013). Robust anonymous
encryption system for encrypting electroencephalogram authentication scheme for telecare medical information
signals. Journal of Medical Systems, 38(49), 1-10. systems. Journal of Medical Systems, 37(9911), 1-8.
Lin C.F. y Wang, B.S.H. (2011). A 2D chaos-based visual Yan X., Li W., Li P., Wang J., Hao X. y Gong P. (2013). A
encryption scheme for clinical EEG signals. Journal of Secure biometrics-based authentication scheme for tele-
Marine Science and Technology, 19(6), 666-672. care medicine information systems. Journal of Medical
Lu Y., Li L., Peng H., Xie D. y Yang Y. (2015). Robust Systems, 37(9972), 1-6.
and efficient biometrics based password authentication Zhou Y., Bao L. y PhilipChen C. (2014). A new 1d chaotic
scheme for telecare medicine information systems using system for image encryption. Signal Processing, 97, 172-
extended chaotic maps. Journal of Medical Systems, 182.
39(65), 1-10.
617
CHAPTER 16. SIGNAL PROCESSING
618
CHAPTER 16. SIGNAL PROCESSING
en la precisión para identificar parámetros y el tiempo A1 El edificio está inicialmente en reposo de acuerdo
requerido para la estimación. Para señales complejas, se a (2a) y (2b), y por lo tanto u(0), u̇(0) y ü(0) son cero.
ha superado el problema de múltiples efectos de reflexión A2 Únicamente la base de la estructura es excitada du-
y transmisión, evitando usar técnicas de procesamiento de rante una actividad sı́smica mediante el movimiento del
señales. Por otra parte, se estiman los desplazamientos y terreno ug , ver (2c).
velocidades en cada uno de los pisos. A3 El edificio está libre de esfuerzo en el techo de acuerdo
Este artı́culo está organizado de la siguiente manera: la con (2d).
sección 2 describe el modelo de edificio y el proceso de dis- A4 El modelo en (1), permite tratar el modelo de edificio
cretización usados, evitando emplear transformaciones de de cortante como una barra de cortante elástica y por
coordenadas para resolver el modelo original. La sección 3 lo tanto los efectos de propagación de onda pueden ser
describe el diseño del observador adaptable, que combina estudiados.
un estimador de estados con un esquema de identificación
de parámetros. Los resultados de simulación se presentan 2.1 Semidiscretización espacial
en la sección 4, y finalmente, las conclusiones de este
trabajo se presentan en la sección 5. En la práctica los edificios están construidos por diferentes
materiales, geometrı́as, diferente número de columnas en
2. MODELO MATEMÁTICO cada piso, etc, de manera que la respuesta estructural
de cada uno de ellos es diferente. En consecuencia, el
Considere el edificio de cortante de múltiples pisos mos- modelo (1) es discretizado en la variable espacial en puntos
trado en la Fig. 1, el cual es excitado por una acción donde se desea el comportamiento estructural, por ejemplo
sı́smica üg . Generalmente la dinámica de este sistema es en los pisos. Usando una semidiscretización basada en
gobernada por una ecuación diferencial de segundo orden, el método de diferencias finitas, se producen diferentes
(Chopra, 1995). Sin embargo, ésta no permite estudiar los velocidades de cortante y coeficientes de amortiguamiento.
efectos de propagación de las ondas. En consecuencia, un Para los puntos 1 hasta el n − 1 se usa una aproximación
modelo novedoso basado en el principio de movimiento centrada de segundo orden y un backward de primer orden
inducido por ondas sı́smicas es propuesto en (Morales- en el punto n, dando como resultado una representación
Valdez y Alvarez-Icaza, 2015), dando como resultado la matricial acompañada de un vector de estados más un
ecuación de onda con amortiguamiento de Kelvin, vector de condiciones de frontera. Además, se realiza
H
una expansión en la matriz de estados para poner el
vector de condiciones de frontera en función de señales de
aceleración medibles. De manera que el modelo (1) puede
ser aproximado por
2 2
ü = βM Λu + ηM Λu̇ + büg (3)
2 2
donde βM y ηM son matrices ∈ Rp×p , con p = (n + 1), y
se definen de la siguiente manera
0 0 0 ··· 0 0 0 0 0 ···
0 β12 0 ··· 0 0 η12 0 0···
β22 ··· η22 0···
2
βM = 0 0 0 , 2
ηM = 0 0
.. .. .. .. .. .. .. .. ..
..
Figura 1. Edificio de cortante clásico . . . . . . . . . .
0 0 · · · 0 βn2 0 0 · · · 0 ηn2
2
∂ ui 2
∂ u̇i 2
∂ ui (4)
βi2 2
+ ηi2 2 = (1)
∂y ∂y ∂t2 Note que los elementos de las matrices en (4) están re-
donde, βi es la velocidad de cortante, ηi es una cons- lacionados con las propiedades de cada piso. Una venta-
tante proporcional a la fuerza de amortiguamiento de la ja importante de (3) es que no necesita transformación
estructura y ui , i = 0, 1, 2, . . . , n es el desplazamiento de de coordenadas para encontrar la solución del sistema.
cada masa mi para cualquier instante de tiempo t. Las Además,
condiciones iniciales y de frontera para este modelo (1)
son las siguientes:
0 0 0 ··· 00 1
1 −2 1 ··· 0
0 0
u(y, 0) = 0 0≤y≤H (2a)
1 0 1 −2 ··· 0
1 0
Λ= . .. ... . .. , b= (5)
u̇(y, 0) = 0 0≤y≤H (2b) 2
∆h .. ..
. .
. ..
. . .
0 0 · · · 1 −2 1 0
u(0, t) = ug 0<t (2c)
0 0 · · · 0 1 −1 0
µu̇(H, t) = 0 0<t (2d)
donde µ es el módulo de cortante. Además se supone lo u = [u0 , u1 , u2 , . . . , un ]T , u̇ = [u̇0 , u̇1 , u̇2 , . . . , u̇n ]T ,
(6)
siguiente: ü = [ü0 , ü1 , ü2 , . . . , ün ]T
619
CHAPTER 16. SIGNAL PROCESSING
donde, 0p×p e Ip×p denotan las matrices nula e identidad donde, Θβ , Θη ∈ Rp×1 . Por lo tanto, (9) es ahora de la
de tamaño (p × p), respectivamente. forma
z = ψΘβ + ψ̇Θη (16)
donde los elementos de los vectores Θβ y Θη son los
parámetros a identificar del modelo. Una forma de para-
3. OBSERVADOR ADAPTABLE metrizar (16) se muestra a continuación
z = ΥΘ (17)
En esta sección se diseña un observador adaptable, que
combina un observador de estados con un esquema de donde Θ = [ΘTβ , ΘTη ]T ∈ R2p×1 y Υ = [ψ, ψ̇] ∈ Rp×2p es
identificación con el objetivo de reconstruir las señales no el regresor, formado por las señales no medibles de despla-
medibles y recuperar los parámetros desconocidos en el zamiento y velocidad, sin embargo estas serán estimadas
mismo instante de tiempo, respectivamente. Se asume que por el observador de estados.
2 2
en un edificio real las matrices de parámetros βM y ηM
son desconocidas y que los desplazamientos y velocidades Mı́nimos cuadrados Sea Θ̂ = [Θ̂Tβ , Θ̂Tη ]T el vector de
no pueden ser medidos durante la actividad sı́smica. parámetros estimados en (17), tal que la salida estimada
está dada por
ẑ = ΥΘ̂ (18)
3.1 Identificación paramétrica entonces, los parámetros Θ̂ son identificados empleando el
algoritmo de mı́nimos cuadrados normalizado con factor
Sea la ecuación de onda con amortiguamiento de Kelvin de olvido, dado por las siguientes ecuaciones, (Ioannou y
discretizada (3), la cual puede ser reescrita como Sun, 1989)
2 2 P ΥT ΥP
z = βM Λu + ηM Λu̇, con z = (ü − büg ) (9) Ṗ = αP − (19)
m2
˙
replanteando (9) como Θ̂ = P ΥT ε (20)
620
CHAPTER 16. SIGNAL PROCESSING
mediciones en las condiciones de frontera son conocidas. entonces, el error de estimación x̃r es acotado. Además si
El modelo reducido empleado es el siguiente el vector de parámetros estimados Θ̂ satisface la condición
[ ] [ ][ ] [ ] de excitación persistente, entonces la dinámica de x̃˙ r se
u̇r 0n×n In×n ur 0n×1 reduce a
= 2 − ü (22)
v̇r βM r Λr ηM 2
r Λr vr ln×1 g x̃˙ r (t) = Ar∗ x̃r (t) (33)
[ 2 2
] T [ ]
zr (t) = (ür + lüg ) = βM r Λr ηM r Λr [ur vr ] (23) 0n×n In×n
Ar∗ = 2 2 (34)
(1 + γ)βM r Λr (1 + γ)ηM r Λr
ur = [u1 − u0 , u2 − u0 , u3 − u0 , . . . , un − u0 ]T ,
vr = [v1 − v0 , v2 − v0 , v3 − v0 , . . . , vn − v0 ]T , y por lo tanto, el error de observación converge exponen-
(24)
ür = [ü1 − ü0 , ü2 − ü0 , ü3 − ü0 , . . . , ün − ü0 ]T , cialmente a cero, pues la matriz resultante Ar∗ es Hurwitz.
T
l = [1, 1, . . . , 1]
4. RESULTADOS DE SIMULACIÓN
donde ur , vr , v̇r ∈ Rn×1 son los vectores de despla-
zamiento, velocidad y aceleración relativos, respectiva- A continuación se presentan los resultados de simula-
mente; l ∈ Rn×1 ; βM 2 2 2 2 2
r = diag[β1 , β2 , . . . , βn ] y ηM r = ción para un edificio de 6 pisos, de 18 m altura y con
2 2 2 n×n
diag[η1 , η2 , . . . , ηn ] ambos ∈ R . Además, pisos uniformemente espaciados, con separación de en-
tre pisos de 3 m cada uno. La discretización espacial
fue llevada a cabo en cada uno de los 6 niveles, don-
−2 1 0 ··· 0
1 −2 1 ··· 0 de las mediciones son registradas. El tiempo de mues-
1 .. .. .. .. ..
treo empleado para la simulación es de 0,001 s. Con
Λr = . (25)
∆h2
0
. . . . el objetivo de probar la robustez del algoritmo, se pro-
··· 1 −2 1 ponen propiedades heterogéneas para cada piso, de ma-
0 ··· 0 1 −1 nera que las velocidades de cortante y los coeficientes
de amortiguamiento a identificar son todos diferentes,
de manera que el observador adaptable propuesto usa es decir, βM = diag[405, 300, 250, 600, 430, 150] m/s y
el siguiente estimador de estados para estimar xr = ηM = diag[30, 42, 40, 25, 20, 10] N s/m. Para el caso de la
[uTr , vrT ]T ∈ R2n×1 , excitación sı́smica, se usó un registro sı́smico recolectado
en el edificio instrumentado Jalapa, ubicado en la ciudad
x̂˙ r (t) = Âr (t)x̂r (t) + Br üg (t) + Lz̃r (t) (26) de México en el evento ocurrido el 21 de enero de 2003, con
ẑr (t) = D̂r (t)x̂r (t), z̃r (t) = zr − ẑr (t) (27) epicentro en Colima, cuyo acelerograma se muestra en la
Fig. 2. Además, suponiendo que el algorı́tmo propuesto es
Note que el estimador de estados (26) es implementable, robusto a ruido de medición, el acelerograma de la Fig. 2
debido a que el par (Ar , Dr ) en (22)-(23) es completamente fue contaminado con ruido Gaussiano de media cero y
observable con rango 2n, (Jiménez-Fabián y Alvarez-Icaza, varianza 1, como se ilustra en la Fig. 3
2010) y por lo tanto todas las mediciones no disponibles
pueden ser estimadas. Además, éste tiene la estructura de 30
un observador Luenberger (Luenberger, 1971), donde x̂r ,
ẑr ∈ R2n×1 , Âr and D̂r son los estimados de xr , zr , Ar y 20
Dr , respectivamente. Las matrices estimadas son definidas
como 10
üg [cm/s2 ]
[ ]
0n×n In×n 0
Âr (t) = 2d 2d (28)
βM r (t)Λr ηM r (t)Λr
[ ]
2d 2d
−10
D̂r (t) = βM r (t)Λr ηMr (t)Λ r (29)
−20
donde, βM2d d
2
r (t) y ηM r (t) están conformadas por los estima-
2 2
dos de βM r y ηM r , producidos por el algoritmo de mı́nimos −30
cuadrados en (19) y (20), respectivamente
−40
0 5 10 15 20 25
además, sea el error de estimación Tiempo [s]
x̃r (t) = xr (t) − x̂r (t) (30)
Figura 2. Excitación sı́smica
cuya dinámica satisface la ecuación diferencial
Aplicando la excitación sı́smica únicamente en la base del
x̃˙ r (t) = ẋr (t) − x̂˙ r (t) (31a) edificio, y usando mediciones de aceleración de los pisos,
el observador adaptable propuesto logra recuperar la res-
= [Âr (t) − LD̂r (t)]x̃r (t) + ζ(t) (31b) puesta estructural del edificio identificando los parámetros
estructurales de referencia. La Fig. 4 muestra la evolución
donde ζ(t) = [Ãr (t) − LD̃r (t)]xr (t); Si la ganancia del de las velocidades de cortante identificadas, mientras los
observador L satisface coeficientes de amortiguamiento identificados son presen-
L = [0n×n γIn×n ]T , γ > 0 (32) tados en la Fig. 5. En ambos casos es posible observar que
621
CHAPTER 16. SIGNAL PROCESSING
20 700
10 600
üg [cm/s2 ]
500
Θ̂β [m/s]
0
400
−10
300
−20
q
200
−30
100
−40
0 5 10 15 20 25
0
Tiempo [s] 0 1 2 3 4
Tiempo [s]
Figura 3. Excitación sı́smica con ruido Gaussiano
la convergencia de los parámetros se logra en un tiempo Figura 4. Evolución de las velocidades de cortante identi-
menor a los 2 s, lo que confirma la eficacia del algorit- ficadas
mo. Para tiempos posteriores al señalado, la convergencia
paramétrica se mantiene. Otra manera de mostrar que el η̂1 η̂2 η̂3 η̂4 η̂5 η̂6
esquema de identificación propuesto funciona satisfacto-
riamente es observando la norma del error de estimación 60
∥ε∥2 = ∥z − ẑ∥2 , la cual efectivamente disminuye a medida
transcurre el tiempo, con convergencia a cero antes de 50
los 2 s. Una ventaja importante es que a partir de los
Θ̂η [N s/m]
5. CONCLUSIÓN AGRADECIMIENTOS
Se ha presentado un observador adaptable para edificios Los autores agradecen el apoyo de CONACYT. El primero
modelados bajo el enfoque de propagación de ondas, capaz también agradece el apoyo de la Coordinación de Estudios
de recuperar los parámetros estructurales desconocidos, de Posgrado de la Universidad Nacional Autónoma de
como son las velocidades de propagación de onda y el México. Esta investigación fue realizada con apoyo de los
amortiguamiento de Kelvin. A diferencia del enfoque tra- proyectos UNAM-PAPIIT IN109414 y IN109316.
dicional, en este artı́culo se ha replanteado el esquema de
identificación fuera de lı́nea como uno de identificación REFERENCIAS
de parámetros y estimación de estados en tiempo real. Chang, P.C., Flatau, A., y Liu, S. (2003). Review paper:
Adicionalmente, se han estimado los desplazamientos y Health monitoring of civil infrastructure. Structural
velocidades en cada uno de los pisos. Este nuevo algoritmo Health Monitoring, 2(3), 257–267.
mejora los métodos tradicionales de monitoreo de salud Chopra, A. (1995). Dynamics of Structures: Theory and
estructural. Además, los parámetros identificados pueden Applications to Earthquake Engineering. Upper Saddle
ser usados como indicadores de daño al compararlos con River, NJ, Prentice Hall.
622
CHAPTER 16. SIGNAL PROCESSING
1 30
u̇6
20
û˙ 6
0.8
Velocidad [cm/s]
kεk2 = kz − ẑk2
10
0.6
0
0.4
−10
0.2
−20
0 −30
0 1 2 3 4 0 5 10 15 20 25
Tiempo [s] Tiempo [s]
Figura 6. Norma del error paramétrico Figura 8. Comparación entre la velocidad real y su esti-
mada
0.8 the new modal time-histories method. The structural
u6 design of tall and special building, 21(9), 621–641.
0.6 û6 Michel, C. y Gueguen, P. (2010). Time-frequency analysis
Desplazamiento [cm]
623
CHAPTER 17
SYSTEM IDENTIFICATION
CHAPTER 17. SYSTEM IDENTIFICATION
625
CHAPTER 17. SYSTEM IDENTIFICATION
de orden reducido, en nuestro caso de estudio, el modelo tales de la velocidad, concentración y temperatura que se
desarrollado por Badillo-Hernandez et al. (2013) y Canales repiten muchas veces a lo largo del reactor. Bajo esta visión
(2013). el coeficiente de dispersión D constituye un parámetro
a determinar de acuerdo al tipo de lecho empacado del
La importancia de conocer la dispersión que existe dentro
equipo y régimen de flujo, entre otras propiedades fı́sicas
de un reactor de gasificación lleva a buscar distintas meto-
del fluido. En otras palabras, el grado de mezclado en el
dologı́as no destructivas para determinar de forma experi-
patrón de flujo de una corriente en un reactor es modelado
mental el coeficiente de dispersión y ası́ poder ajustar los
mediante el coeficiente de dispersión D.
modelos de parámetros concentrados.
Un método comúnmente utilizado en la Ingenierı́a Quı́mica 2.2 Modelo NRCTAs (Badillo-Hernandez et al., 2013)
para determinar el grado de dispersión presente en reacto-
res de tanque agitado y flujo pistón es la técnica de traza- Partiendo de la ecuación (1), se han desarrollado méto-
dores expuesta en Levenspiel (2011). Ésta es aplicada para dos de alta complejidad que logran aproximar el compor-
reacciones donde el estado de agregación de la materia es tamiento de sistemas dinámicos de dimensión infinita a
lı́quido. Por otra parte, los casos de estudio presentados en sistemas de dimensión finita. De acuerdo al esquema de
este trabajo están referidos a reacciones donde el estado de modelado clásico de ingenierı́a de reactores (Coste et al.,
agregación de la materia es principalmente gas, por lo cual 1961; Levenspiel, 1962), se puede proponer una configura-
el propósito es desarrollar una metodologı́a de trazadores ción de un conjunto de N tanques interconectados en serie
basados en Levenspiel (2011) aplicable a los reactores de y/o cascada-retro-flujo, que pueda representar el carácter
gasificación de biomasa. distribuido de los sistemas de procesos, como se muestra
en la Fig. 1.
En consecuencia, basados en un modelo dinámico en
parámetros concentrados de orden reducido para gasifica-
dores se propone encontrar una metodologı́a no destructiva
que permita identificar el grado de dispersión (inverso del
número de Peclet) presente para una clase de reactores de
gasificación tomando como punto de partida la técnica de
trazadores presentada en Levenspiel (2011). Figura 1. Representación de un tren con N tanques inter-
La estructura del trabajo es la siguiente, en la sección 2 conectados en cascada con flujos de retromezclado
se presenta el modelo que describe el comportamiento del
reactor de gasificación mediante un tren de N reactores Bajo esta idea, Badillo-Hernandez et al. (2013) presenta un
continuos de tanque agitado. En la sección 3 se presentan modelo que logra capturar la dinámica global del sistema,
algunos conceptos necesarios para aplicar la técnica de el cual se puede visualizar de forma compacta por medio
trazadores. Durante la sección 4 se extiende la técnica del sistema algebraico diferencial:
mostrada utilizando métricas de forma, para ası́, en la Ms
Vk ċs,k = qk−1 cs,k−1 − qkM s cs,k + KM MDa rk Vk , (2a)
sección 5, aplicarla a dos reactores de gasificación de flujos
descendientes, uno estratificado y el otro en configuración
H
Vk η̇k = α qk−1 H
ηk−1 − qout,k ηk − Qη,k (c, η) + qkb,H ηk+1 ,
Imbert. Finalmente, en la sección 6 se presentan las (2b)
Mg
conclusiones obtenidas después de analizar los resultados 0 = qk−1 cg,k−1 − q1M g cg,k + qkb,M g cg,k+1 + MDa rk Vk ,
obtenidos. (2c)
2. MODELO DINÁMICO DE UN REACTOR DE donde
GASIFICACIÓN DH,k Ak DH,k Ak
qkH = vH,k Ak + , qkb,H =
PeH ∆zk+1 PeH ∆zk+1
2.1 Preliminares
La ecuación (2) es utilizada para representar la dinámica
Con base en el enfoque de ingenierı́a de reactores quı́micos, de un tanque interno k, se extiende de forma analóga para
es posible formular esquemas de control si se parte de el primero y el último. El desarrollo y detalle del modelo
la ecuación de transporte generalizada (1), que surge de completo se extiende en Badillo-Hernandez et al. (2013) y
fenómenos de transporte y unifica las leyes de conservación Hernández-Torres (2016).
de momento, energı́a y masa (Bird, Stewart, y Lightfoot
1960). 3. TÉCNICA DE TRAZADORES
Almacenamiento Convección Dispersión Generación
z}|{ z }| { z }| { z }| {
∂t ϕ = −(∇ · ϕU) + ∇ · D∇ϕ + S(ϕ, r) , (1) La técnica de trazadores es un experimento estı́mulo-
respuesta, esto es, a través de una excitación se pretende
donde
conocer la cantidad de tiempo que le toma a una determi-
ϕ ≡ variable que se conserva por unidad de volumen nada concentración pasar a lo largo de un reactor desde
U ≡ vector de las velocidades de flujo convectivo un punto inicial (entrada) a un punto final (salida), Fig. 2.
D ≡ coeficiente de transporte de dispersión Como resultado, la curva generada a la salida del reactor se
S ≡ término fuente generalizado conoce como distribución de tiempos de residencia (RTD
r ≡ vector de coordenadas espaciales por sus siglas en inglés).
La dispersión longitudinal axial modela fenómenos de flujo Para estimular al sistema se considera solamente el flujo
de naturaleza estadı́stica, es decir, fluctuaciones elemen- en estado estacionario del fluido a través de un recipiente,
626
CHAPTER 17. SYSTEM IDENTIFICATION
∂C
= D∇2 C, (4)
∂t
se puede obtener una ecuación que caracterice el grado de
retromezclado del fluido que circula en dirección x
∂C ∂2C
=D 2, (5)
∂t ∂x
donde D se denomina coeficiente de dispersión longitudinal
Figura 2. Experimento estimulo-respuesta. o axial.
sin reacción y sin cambios de densidad. La RTD se define Realizando una transformación (Levenspiel, 1999) de la
a partir de la función E(t) que satisface ecuación (5) se llega a la expresión
Z ∞ 2
E(t)dt = 1 (3) ∂C D ∂ C ∂C
= − , (6)
0 ∂θ uL ∂z 2 ∂z
donde la integral es denominada como la normalización de donde u es la velocidad de flujo, L es la longitud del
la distribución (Levenspiel, 2011). reactor, z = (ut + x)/L y θ = t/t = tu/L. A el grupo
D
adimensional uL se le conoce como número de dispersión
del recipiente y mide el grado de dispersión axial que existe
a lo largo del reactor. Por otro lado, P e = uLD es conocido
como el número de Peclet .
627
CHAPTER 17. SYSTEM IDENTIFICATION
1. Llevar al reactor de gasificación a un estado estacio- además poder hacerlo ante la presencia de incertidumbre
nario (punto de encendido). paramétrica. Debido a que las curvas de trazador, Fig. 4,
2. Suministrar
6
un pulso de aire caliente en un tiempo
1.2
poseen una estructura asimétrica, se propone explotar esta
t0 , con flujo constante, a distintas temperaturas ∆T propiedad con base en un análisis mediante métricas de
y duraciones
5 ∆t. 1
forma, las métricas utilizadas son: oblicuidad, curtosidad
3. Observar y tomar lectura a la salida del reactor de la y coeficiente de variación.
curva 4generada por el pulso. 0.8
Oblicuidad. Es el tercer momento (µ3 ) respecto a la me-
4. Calcular el tiempo medio y la varianza (t, σ 2 ).
dia, también llamado medida de asimetrı́a o deformación.
5. Calcular
3 los tiempos de residencia del rastreador.
0.6
Su objetivo es estudiar la deformación horizontal de los
6. De acuerdo a la condición de frontera que se posea,
valores de la variable respecto al valor central de la media.
se elige
2
la ecuación correspondiente y se calcula
0.4 el
Dependiendo del valor de la oblicuidad se sabe que:
coeficiente de dispersión.
7. Se compara
1
el D/uL obtenido mediante simulación
0.2
Si obl < 0, la mayorı́a de los datos se encuentran a la
con el valor nominal correspondiente a dicha simula- izquierda de la media.
0 0
ción. 0 20 40 60 80 100 120 140 160 180 200 0 0.5Si obl > 10, entonces1.5 la mayorı́a de los datos se
30 1.2 encuentran a la derecha de la media.
1.2
Si obl = 0, la muestra corresponde a una distribución
25 1 normal con una simetrı́a perfecta.
1
N 3
20 0.8
0.8
1 X xi − µ µ3
obl = = 3, (7)
15
0.6
0.6 N i=1 σ σ
10
0.4
0.4 donde σ, es la desviación estándar y µ, es la media
aritmética.
5 0.2
0.2
Curtosidad. El concepto de curtosis o apuntamiento de
0
00 20 40 60 80 100 120 140 160 180 200
0
0
Fischer
0.5
pretende1
comparar1.5la curva de una distribución
120 140 160 180 200 0 0.5 1 1.5
con la curva de la variable normal en función de la cantidad
1.2 de valores extremos de la distribucion. Basándose en el
dato de que una distribución normal cumple con
1
µ4
= 3, (8)
0.8 σ4
la curtosidad se puede definir como
0.6
N 4
1 X xi − x µ4
0.4 k= = 4 − 3, (9)
N i=1 σ σ
0.2
Figura 4. Curvas de los tiempos de residencia para tres Coeficiente de variación. Es la relación entre la des-
distintos números de Peclet con un pulso de entrada viación estándar y su media
∆Te = 700 [K] y una duración de ∆t = 30 [seg]. σ
coef v = . (10)
µ
La Fig. 4 presenta la curva RTD obtenida utilizando el
modelo (2). Sin embargo, al realizar varias pruebas y Con el uso de estas nuevas métricas, complementándolas
modificar parámetros tanto del modelo (2), como del pulso con el método de RTD, se pretende estimar la disper-
de entrada del rastreador, los resultados obtenidos no sión ante incertidumbre paramétrica, ya que al realizar
logran ser plausibles para garantizar una buena estimación las pruebas experimentales no se conocen con certeza los
del número de Peclet, ya que estos varı́an dependiendo valores de algunos parámetros. Por esta razón es necesario
de la cantidad de datos utilizados para calcular t̄ y σ 2 , realizar la estimación del coeficiente de dispersión ante
afectando la solución de la ecuación de recipiente abierto la presencia de perturbaciones en los parámetros que en
(Hernández-Torres, 2016). general son más inciertos. Con base en los parámetros
elegidos en el estudio de Canales (2013), se toman tres de
4. TÉCNICA DE TRAZADORES MEDIANTE ellos ya que son los que principalmente pueden provocar
ANÁLISIS DE MÉTRICAS DE FORMA una estimación errónea del número de Peclet: la energı́a
de activación de la pirólisis Ep1 , el diámetro inicial de la
Con los resultados obtenidos al aplicar el método anterior partı́cula dp0 , y la entalpı́a de reacción del carbonizado
e indentificando las limitaciones que éste tiene para los ∆Hc5 . En consecuencia, se decide perturbar dichos coefi-
reactores de gasificación, surge el interés de mejorar la cientes dentro de un rango considerable y observar como
estimación del número de dispersión (número de Peclet) y esto afecta los resultados obtenidos.
628
CHAPTER 17. SYSTEM IDENTIFICATION
5. APLICACIÓN DE LA METODOLOGÍA 10
8
En la Fig. 5 se presenta el número de Peclet nominal vs
la métrica del coeficiente de variación ante la presencia de 7
4
2.8 2.75
3
2.75 2.7
2
2.65 2 3 4 5 6 7 8 9 10
2.7
2.65 2.6
50 16
2.6 2.55
45
14
2.55 2.5 40
12
2.45 35
2.5
10
30
2.45 2.4
25 8
2.35
1
1
10 20 10
6
15
2.75
4
10
2.7
2
5
2.65
0 0
2 3 4 5 6 7 8 9 10 2 3 4
2.6
2.55
Figura 6. Número de Peclet estimado contra el nominal
2.5
(arriba) y ECM para un pulso de Te = 700 [K]
2.45 (abajo).
2.4 3 %H y 2 − 3 %O, esto es debido a que para este reactor
2.35
en particular la composición del carbonizado influye en
1
1
10 10
la predicción correcta del gas de sı́ntesis que se obtie-
ne comparada contra datos experimentales (Santamarı́a,
Figura 5. Métricas de forma ante la presencia de pertur- 2016). El modelo del gasificador Imbert corresponde a una
baciones con un pulso de entrada de Te = 700 [K]. planta piloto propiedad del Instituto de Ingenierı́a, en el
cual el aire se alimenta a una temperatura de 600◦ C, por
Una manera de visualizar con mayor claridad la estimación lo que para no dañar al reactor en una situación real,
del número de Peclet en relación con el Peclet nominal se considera un incremento máximo de Te = 200 [K],
se puede ver en la Fig. 6; en esta gráfica se esclarece la dejando como grado de libertad la duración del pulso ∆t,
banda de incertidumbre que se puede llegar a tener al no para este trabajo se presentan los resultados obtenidos
conocer al 100 % algunos parámetros. Ası́ mismo también cuando ∆t = 10 [s]. Aplicando la métodologı́a descrita
se observa el error medio cuadrático (EMC) resultante que anteriormente se obtuvieron las Figs. 7 a 9.
se logró al estimar el grado de dispersión, en dicha gráfica
se percibe un error por debajo del 5 % para dos parámetros
6. CONCLUSIONES
(Ep1 y dp0 ).
Se diseñó una metodologı́a basada en métricas estadı́sticas
5.2 Gasificador tipo Imbert
que fue probada ante incertidumbre paramétrica en dos
modelos de reactores de gasificación. Se demostró me-
Al igual que en el reactor anterior, se aplicó el método diante simulación que el número de PeH es robustamente
diseñado para determinar mediante simulación el número identificable con un EMC del 3 %. Para el gasificador
de Peclet presente en un gasificador tipo Imbert. El modelo estratificado se observó que el parámetro ∆Hc5 , es el
y los parámetros utilizados son tomados de Santamarı́a que influye más en la buena estimación de la dispersión,
(2016), la principal diferencia entre el modelo presentado mientras que con los demás parámetros se crea una banda
anteriormente y el Imbert radica en que para el último la de incertidumbre muy pequeña.
alimentación de biomasa y aire se da en zonas distintas, pe-
ro se preserva la dirección a la cual fluyen. Los parámetros Para el caso de la configuración Imbert se observa que el
que se variaron fueron los mismos que en el gasificador parámetro Ep1 es el que más influye en la estimación del
estratificado, añadiendo el cambio en la composición del número de Peclet dentro del reactor, esto es debido a que
carbonizado, pasando de ser 100 %C a 95 %C con 2 − dicho parámetro corresponde al 65 % del valor considerado
629
CHAPTER 17. SYSTEM IDENTIFICATION
25 Coeficiente de variación
10
±10% dp
±5% ∆Hc5
20 ±2% Ep1
9
2 − 3% H
Sin Perturbar
∆T [K]
15
P eH estimado
8
10 7
6
5
5
0
0 50 100 150 200 250
t [s] 4
4 5 6 7 8 9 10
3 P eH nominal
30
2.5
25
2 Ep1
dp
20 ∆Hc5
Eθ
1.5
%error P e
%H
15
1
0.5 10
0 5
0 0.5 1 1.5 2 2.5 3
θ
0
4 5 6 7 8 9 10
Figura 7. Curva de tiempo de residencia para P e = 6 Pe
630
CHAPTER 17. SYSTEM IDENTIFICATION
Abstract: This article describes a methodology that enables students and people associated with this
process know the operation and controls the most significant parameters of a binary distillation column.
To do this data is used of an event IFAC. Which start from a linearized model of the plant, that is, at a
given time. In which the composition in the bottom, top and pressure are controlled. To this end, a
comparison of two control strategies performed as PID (Proportional-Integrator-Derivative) and LQR
(Linear Quadratic Regulator), which control a MIMO process of three input variables and three output
variables. To achieve this, it has developed a program in MATLAB® / Simulink
Keywords: Chemical Industry, Distillation Columns, L.Q.R (Lineal Quadratic Regulator), PID Control
(Proportional-Integrator-Derivative).
supply) and four output variables (compositions at the top,
1. INTRODUCTION
pressure, the temperature at the button and structure at the
Distillation is, so far, a widely used separation technique in button) (Fig. 1) (Seborg, Edgar, & Duncan, 2004).
the process industry around the world. In United, Estates
exist around 40.000 distillation columns, which represent
approximately 3% of total energy consumption in the
country. As a result, improving this process and its control
could positively affect decrease energy consumption, increase
product quality and enhance protection of environmental
resources.
Column distillation’s monitoring and modelling, under
traditional methods, constitute complex tasks due to the
variety of characteristics within the process. Besides these
features, the process itself is nonlinear and susceptible to
operative restrictions, which reduce the efficacy of linear
controllers (Werdan, 2016).
A column is physically composed three major components: Fig .1 Individual step-response models for distillation column
(i) a condenser and an accumulator on the top; (ii) a column with three inputs and four outputs. Each model represents the
where the contact with liquid/vapour takes place; and (iii) a step response for 120 minutes (Hokanson, 1992).
re-heater at the bottom. On the other hand, a distillation
column includes a complex dynamic because of variables As a starting point, in (1) will be considered able to balance
with six degrees of freedom, which include pressure, flux and the function of a binary column of distillation. This equation
temperature and composition. These last two at both the top involves an input feed (F), a distilled composition on the lid
as well as the button (Borroto, 2015). (D), and structure at the button (B).
Previous works have achieved significant results. For F D B (1)
example, shows a response of a column with a controlled In (2) is the result of applying components as in (1). In these
variable (CV) before a manipulated variable (MVs), equations, it could observe that the input feed is the sum of (i)
represented on the X and Y-axis respectively. This reaction is a distillate flow times its composition; and (ii) sediment
a MIMO process with three inputs variables (heat, reflux,
631
CHAPTER 17. SYSTEM IDENTIFICATION
discharge times its balance. It is important to point out that both, only stem pressure of every component, as well as the
this equation is valid for both light and heavy components as molar fraction of each component in the solution, represented
well. by (8). Here, PA is the A’s component partial pressure in the
F ZF1 D XD1 B XB1 steam; PA is the pressure of pure A’s steam and X A is the
(2)
molar fraction of A in the liquid state. For the calculation of
From above two concepts raise. One of the concepts is the the vapour in equilibrium, the bubble point procedure is used,
reflux ratio. This represented as the returning flow to the with a pressure of 1 atm on (9, 10 and 11).
dome (L), or as the quantity of condensation that returns to
the column, divided by the produced in the distillery (D); as PA PA X A (8)
shown in (3).
L PA PA PB (9)
R (3)
D PA PA X A PB 1 X A (10)
Similarly, ass in (4) represent the boiling or evaporation that
PA X A
returns to the column as vapour (V) and the resulting product yA (11)
(B). P
On these equations, P is the system’s total pressure, PB is the
V
S (4) partial pressure of pure component B in the vapour and X A is
B the molar fraction of A on the steam state.
The other concept to take into considerations is the recovery
fraction. It could calculate as a division between moles
Once steam and liquid in equilibrium are calculated, the data
leaving the dome by moles entering the dome. This
of steam (x) and liquid (y) will be graphed to obtain a trend
represents how much of the most volatile component that
line of y = f(x) and x= f (y). From these, the coefficients will
goes into the column, is going to come out the dome. Ass in
be extracted and entered into the database, mentioned
(5) shows this relationship. In addition, a higher value means
previously (Otiniano, 2004).
a better purification process.
1.1 Controller Design based on (PID)
DXD1
f1 (5) The PID controller is the most common control algorithm
FZF1 today. Process control, more than 95% of the control loops
To know how much could retrieve from the heavier mixture, are of PID type, although most of them are PI control
the lighter mixture subtracts the value of one. This could do (O’Dwyer, 2012).
on the bottom and the feeder. As in the previous process, a
lower value is preferable ass in (6) explains this (Findley,
Generally, in a control system requirements are different. The
1983).
key factors are to present good performance against
B (1 X B1) disturbances and noise load measuring conditions robustness
fh (6) to process changes and adjusted response as possible to the
F (1 ZF1) point. The compromise between these requirements is
Another value to be calculated is the vapour pressure. To do fundamental. For a system with only feedback error, the
so, a database with several components and their constants attempt to meet all demands with PID control is tough
should enter on the Antoine equation, which describes the (Tanda, 2016).
relationship between temperature and pure substances
1.2 Linear Quadratic Regulators (L.Q.R)
saturation pressure of the steam as shown in (7). Here, Pv is
the vapour pressure of the pure component, while A, B and C Assuming that the equation of states could measure the whole
constants are the characteristics of every single compound. T system's state showed on (15 and 16), then, it is possible to
is the temperature in Celsius degrees. design a state feedback control, which allows calculating u.
By doing this, it will be possible to minimise the value of the
B
A quadratic objective function, the infinite horizon defined by
T C (7)
Pv 10 (12) (Guzmán, 2012) where ´ Q ´ and ´ R ´ are symmetric real
definite matrices.
To apply the above equation should be selected components
more and less volatile mix (Distilled and Sediment,
J ( x Qx 1 u T ru)dt
T
632
CHAPTER 17. SYSTEM IDENTIFICATION
On (13), P is a positive matrix defined as a solution of the u 1 = Temperature value on the boiler, at the button of the
algebraic equation of Riccati as in (14) (Lewis, 1986). tower
u 2 Refrigerant temperature at the condenser
AT P PA PB r 1 B T P Q 0 (14)
It could prove that, as A and B are a solid pair, first and u 3 Reflux controller on the top
second terms correspond to the controlled output energy and y1
Y is the output vector that includes three outputs:
the control signal, respectively, minimising L.Q.R strategy. y y2
However, this correlation is inversely proportional which y
means that the lower is one; the greater should be the other. 3
The role of the regulator is to establish a minimum error y1 The most volatile component´s composition at the button
between behaviours in conflict, as lower as possible, as the of the tower
greater the control, the lower the output (Guzmán, 2012). y 2 The most volatile component´s composition on the top
Sckoaert and Rawlings show how to design an M.P.C
of the tower
controller, with a finite horizon as a counterpart of an L.Q.R.
(Moriano & Freddy, 2012) (Rawlings S. P., 1998) (Rawlings y 3 Pressure on the top of the tower
J. a., 2010). There is also intermediate transition states:
x1 The most volatile component´s structure in the condenser
2. METHODOLOGY AND METHODS. x10 The most unstable component´s composition in the
cauldron
2.1 Process description x11 Pressure
x 2 The most volatile component´s composition in the 1
st
The data about the distillation column considered in this
article contains eight trays, one input feed, one condenser, plate
x 9 The most volatile component´s composition in the 8
th
and one boiler. These data could use on space estate matrix.
A very practical problem could include a binary distillation plate
column characterized by a pressure variation as described,
w1 Feed changing concentration on the entrance
which depicts a distillation column to control (Fig. 2).
y m Outputs of the systems, which could be measured.
This design is subject to control or regulate three
outputs y1, y 2 , y 3 , against non-measurable disturbs w1, . For
a faster and desirable response, the following limitations
should consider:
u1 2.5 , u2 2.5 . For all three cases t 0 , u3 0.30
For w1 1 y t 0 . In this problem, measured outputs are
chosen, and for this reason, the criteria for selection should
consider the statement of the problem. In general, a controller
that uses the fewer number of measured outputs is desirable
(E.J.Davison, 1967).
633
CHAPTER 17. SYSTEM IDENTIFICATION
Temperature at the Product or composition on the top( The aforementioned means that the vapour pressure ( Xv )
refrigerant or condenser Cto ) [g/ml x100 =%, o en or in
more fluid pressure ( X l ) equals one atm, as shown in the
which means temperature molar fraction which is
on the top exit ( Tr ) [˚C] dimensionless] following equation (21). Stressing that the pressure variation
Reflux (Re) [g/min] Pressure on the top (Pt) [ N/ m 2 or in the model description assumed a value constant 1 atm.
Pa or Pascal]
Xv X l 1atm
Disturbances (21)
Input feed, flow or mixture ( Fv ) [g/min] Related at a given time, using linearized data at a given time,
to reflect this behaviour the response of the input variables
and process output uncoupled, i.e., nine transfer functions
2.3 Controller design based on PID (three output variables that combine analysed three output
variables).
The design part of a plant model in state space previously For the self-tuning of the PID control algorithm is first
defined matrices A, B, C and D, which can establish the performed a study of controllability of the output variables,
behaviour of the three inputs combined with the three that is to say, know the final state and the initial situation of
outputs. the matrix, as shown in (22) and (23).
Thus, through a program in MATLAB® constant signals and
open loop input (Te-Re) were applied (Te-Co) and (Ebb), to X (n) X (0) matrix arrays (22)
analyse the behaviour of the outputs (Co-Bottom), (Co-Top)
and (Pt). Thus all three functions are chosen coupled transfer (23)
(18, 19 and 20 respectively), which were most affected by [ D *CB *CAB *CA* 2B...C * A* n 1* B] n
these changes at its input and, that verifying that both affect Such matrix Controllability, shows that the process is not
the change in the outputs. controllable. Then Observability analysis of the output
Output 1 (Composition at the bottom): variables is the final state, and the initial condition of the
0.0025 S 10 0.0007 S 9 0.0001S 8 (18) array is known, X (n) -X (0) = matrix arrays such as in (24).
S 0.3102 S 0.0389 S 0.0026 S 0.0001S
11 10 9 8 7
The open-loop system coupled and uncoupled and three L.Q.R provides a great freedom for designing and delivers a
respectively (Fig, 3). low computational cost for control law. This makes it
possible to implement a digital control system. However,
although L.Q.R control is proportional, it is necessary to
include an integral action for removing possible stationary
errors over those variables deemed necessaries (Ying Y,
1992).
To apply L.Q.R, it develops a program that allows obtaining
a step-like response with an optimal value. After that,
calculate Q’s structure for every single element, where the
diagonal of the matrix are the restrictions of the elements or
the value of the states. In (25) indicates that maximum and
minimum values of the outputs should take for every
intermediate space
1
Q (25)
Fig. 3 Answer open-loop process, when excited with a step Maximum Value Minimum Value 2
signal coupling Namely, R should calculate. But, in this case, the values are
every intermediate space inputs. Then, they replaced in (26).
Then, to stabilise the system, a PID controller is This could achieve simulating the system as in (Fig. 5). by
implemented, establishing the conditions setpoint variables this, maximum and minimum of process.
normalised considering the composition on the bottom 1 (26)
R
(sediment) is as close to zero, the structure at the top Maximum Value Minimum Value 2
(distilled) is as close to one, and pressure on top is as close to
one.
634
CHAPTER 17. SYSTEM IDENTIFICATION
Fig. 3 Response process when excited with a step signal to open loop and decoupling.
635
CHAPTER 17. SYSTEM IDENTIFICATION
Table 2 shows all values in the quadratic performance matrix K lqr A, B, Q, R (27)
Q, which obtained by replacing the outputs on every
intermediate space.
Table.4 feedback values for each state
Table 2 Values Hall adds Matrix Quadratic performance
Q. Name Value Name Value
44. 0 0 0 0 0 0 0 0 0 0
K1 0.0041 K7 0.0017
44 K2 7.16X10-4 K8 -0.0036
0 44. 0 0 0 0 0 0 0 0 0 K3 -0.2491 K9 0.3980
44
0 0 44. 0 0 0 0 0 0 0 0 K4 0.0024 K10 8.619x10-4
44 K5 -0.0037 K11 -0.0024
0 0 0 44. 0 0 0 0 0 0 0
44 K6 -0.0965
0 0 0 0 44. 0 0 0 0 0 0
44
0 0 0 0 0 44. 0 0 0 0 0
44 Show the model for distillation column system in a space
0 0 0 0 0 0 44. 0 0 0 0 state, where x1 to x11 were calculated as intermediate states
44
0 0 0 0 0 0 0 44. 0 0 0 (Fig. 7).
44
0 0 0 0 0 0 0 0 44. 0 0
44
Following the program developed, a step-like input of
0 0 0 0 0 0 0 0 0 44. 0 amplitude one applied to the data found previously as shown
44 in (Fig. 6). The behaviour of the three outputs (composition
0 0 0 0 0 0 0 0 0 0 12.
5 at the button, composition on the top, pressure on the top) can
see. They do not follow any reference, but they do minimise
Next, the values of the quadratic performance values R the error of the reference. By applying L.Q.R strategy, the
showed in Table 3. behaviour could determine.
Table 3 Found quadratic matrix R, production values Following the execution of the program, eleven intermediate
44.44 0 0 stated calculated (Fig. 8). Their values identified which fulfill
the purpose of this kind of control (Fig. 9).
0 44.44 0
0 0 12.5
The values found previously replaced on (27) to obtain
feedback gains for controller optimization, as shown in Table
4.
636
CHAPTER 17. SYSTEM IDENTIFICATION
637
CHAPTER 17. SYSTEM IDENTIFICATION
Seborg, D., Edgar, T., & Duncan, M. (2004). Process Werdan, J. P. (2016, Abril 08). Modelagem Empirica de
Dynamics and Control. NJ-EEUU: John Wiley&Sons, Colunas de Destilação Utilizando Redes Neurais de
Inc. Wavelets para Otimização E Controle de Processos.
Tanda, M. a. (2016). Herramienta para la sintonía robusta BEQ. Retrieved Abril 10, 2016.
decontroladores PI/PID de dos grados de libertad. Ying Y, M. a. (1992). Bilinear Control Strategy for paper
Revista Iberoamericana de Automática e Informática making process (Vol. 111).
Industrial RIAI, Vol 13, 22-31.
638
CHAPTER 17. SYSTEM IDENTIFICATION
Abstract: In this paper we present a new model for skid-steering mobile robots, taking into
account the relation between the friction and angular velocity of the vehicle. Then, by using
the parameter estimation toolbox of Simulink-Matlab, we calculate the model parameters with
a good fitting to real data. Finally, we design a state space feedback controller with integral
action, to make the controller robust against the friction phenomena that present the vehicle
when turning. The simulation results show good performance of the controller in reference
tracking as well as control actions.
639
CHAPTER 17. SYSTEM IDENTIFICATION
[ ] [ ]
vx r r [ω ]
= 0.5 − r r L
(6)
ω ωR
YG YG
It is noteworthy to mention that we found very important
to multiply the term pw |ω| to the original matrix R̄,
because we realized that the friction in x and y direction
Fig. 1. Angular and speed control of Skid-Steer Mobile is proportional to the angular speed of the car. Now, using
Robot (SSMR) in path following applications. equation (6) we can rewrite the model equation (1) in
function of the new state space vector [ωL , ωR ]T as shown
manipulated by the Controller Unit through pulse width in (7),
modulation (PWM). Each motor has a gear transmission
with a reduction 16:1, an encoder autonics E50s and a [ ] [ ]
ω˙L ω
current sensor Allegro ACS709. Additionally the system M̄ p + C̄p L + R̄(q̇) = B̄τ, (7)
need an angle sensor for the rotation control, whereby ω˙R ωR
a magnetometer on a NAVIO board was added to the where,
instrumentation of the robot. The Figure 2 illustrates the
main components in the control system and the interaction [ ][ ] [ ]
between them. KmL 0 iaL iaL
B̄τ = B̄N = B̄Km (8)
0 KmR iaR iaR
m m
1
M̄ p = mx2G + I mx2G + I (9)
2 −( )(
YG YG
[ ]
rmXG ω −YG YG
C̄p = (10)
2 −1 −1
[ ] [ ] [ ]
ω˙L −1 iaL ω
= (M̄ p) (B̄Km = −C̄p L − R̄). (11)
ω˙R iaR ωR
Fig. 2. Drive system scheme of SSMR. A magnetometer
Finally, we can rewrite (7) defining and augmented state
sensor is also required for angle control of the Robot.
space vector x(t) = [ωL , ωR , θ, iaL , iaR ]T , as result the
model can be written as
2.1 Skid-Steer Mobile Robot Model
The mathematical model of the vehicle can be divided in ω˙L −1 ¯ −1
three parts: kinematics, dynamics and drive models. This ω˙R −M¯ p Cp 0 M¯ p B̄Km
θ̇ = −0.5YG 0.5YG 0 0 0
paper focuses especially on the dynamic part. The main −N KbL /La 0 0 −Ra/La 0 + · · · (12)
equation that describes the dynamics of the robot is given ia˙L
0 − N KbR 0 0 − Ra/La
by (C.Linares and A Gallegos (2015); Jeon et al. (2014); ia˙R
Elshazly et al. (2014))
0 0 −1
0 0 [ ] M¯ p R̄
M̄ (q)η̇ + C̄(q̇)η + R̄(q̇) = B̄(q)τ ; (1) Kpwm VL 0
+ La 0
VR − 0 (13)
where η = [vx , ω]T , vx is the relative front velocity, ω is the Kpwm 0
rotational velocity of the robot, τ is the torque input and 0
La
q = [X, Y, θ]T represents the generalized coordinates of the
center of mass of the robot. The matrices M̄ , B̄, C̄andR̄ are 2.2 Parameter Estimation and Validation
given respectively:
[ ] Once the model was determined, we performed a param-
m 0 eter estimation (grey box) using MATLAB Simulink. The
M̄ = (2) Table 1 shows some parameters which were measured from
0 mXG θ̇
the robot directly and the Table 2 shows the estimated
[ ]
0 mXG θ̇ parameters. A series of experiments around the operation
C̄ = (3) region was carried out to acquire the real signals used in
−mXG θ̇ 0
estimation and validation of the model parameters.
[ ]
Frx (q̇) The data obtained in the test was divided in two part, 70%
R̄ = pw |ω| (4)
XG Fry (q̇) + Mr for training and 30% for validation. Then, the data was
[ ] standardize to a standard deviation of one for being used
1 1 1 in the parameter estimation toolbox of Simulink-Matlab,
B̄ = (5)
r −c c as shown in Figure 3.
640
CHAPTER 17. SYSTEM IDENTIFICATION
Table 1. System parameters Model Validation The input data for the experiments
were generated from the pilot with a remote control ”Spek-
Parameter Rep. Value Unit
trum Dx8i” and using the MATLAB function ”idinput”
Robot mass without wheels m 58 kg
around the operation point. The Figure 5 illustrates the
Wheel radius r 0.4 Meters
Wheel wide w 0.06 Meters
results of the estimation for each state variable: Angular
Robot wide a 0.4 Meters velocity of the left wheel ωL , Angular velocity of the right
Robot long L 0.6 Meters wheel ωR , rotation angle of the robot θ, Left motor current
Motor resistance Ra 0.33 Ω iaL and Right motor current iaR . As is depicted in Figure
Motor gear ratio N 16 5, the model parameters found fit pretty good to the real
data.
−1 c̄ −1
−M¯ p Cp 0 M¯ p B̄Km
1 1
A = − Y + Y YR + YL
0 0 0 (15)
Fig. 4. The picture shows the Rover: A four wheel skid- R L
steering mobile Robot. N KbL Ra
− 0 0 − 0
La La
Table 2. Estimated parameters
0 0
0 0
Parameter Rep. Value Unit 0 0
y coordinate rotation center YG 0.38 m
B= Kpwm
(16)
x coordinate rotation center XG 0.059 m La 0
Electromotive force constant RM KbR 0.0525 V /rad/sec Kpwm
Electromotive force constant LM KbL 0.0692 V /rad/sec 0 0
Motor torque constant RM KmR 0.0320 N.m/Amp La
Motor torque constant LM KmL 0.0379 N.m/Amp
Motor inductance La 0.0688 H
Where Kpwm is the conversion a duty cycle given by:
Moment of inertia of the robot I 15 kg.m2 V oltBatt /100.
Friction coefficient in x direction ux 0.01
Friction coefficient in y direction uy 1.2 [ ]
c̄ = rmXG C1 C2
Proportional constant of friction pw 0.184 Cp (17)
(YR + YL )2 C3 C4
641
CHAPTER 17. SYSTEM IDENTIFICATION
3. CONTROL DESIGN
Fig. 7. Rover control actions. Dashed line control action for
Using the same sample time used in the parameter iden- left wheel, solid line for the right wheel. x axis is time
tification, the continuous time system given by (14) was in seconds, and y axis is magnitude in percentage.
discretized using zero-order hold method, as result we have
the discrete time model given in (18),
xk+1 = AD xk + BD uk (18)
Where AD and BD are the discrete-time counterparts of
the continuous time state matrix A and input matrix B.
For the sake of simplicity we use the same notation to
represent the continuous and discrete time versions of the
state x and input u.
642
CHAPTER 17. SYSTEM IDENTIFICATION
REFERENCES
Alhelou, M., Dib, A., and Albitar, C. (2015). Trajectory
tracking for skid-steering mobile robots using sliding-
mode control. In Control Conference (ASCC), 2015 10th
Asian, 1–6. IEEE.
C.Linares, A.A.E.R. and A Gallegos, J. (2015).
Seguimiento robusto de trayectorias en la transportación
de objetos por un robot móvil. In AMCA (ed.),
Congreso Nacional de Control Automático, pp. 417
–422.
Elshazly, O., Abo-Ismail, A., Abbas, H.S., and Zyada, Z.
(2014). Skid steering mobile robot modeling and control.
In Control (CONTROL), 2014 UKACC International
Conference on, 62–67. IEEE.
Jeon, S., Jeong, W., and Park, D. (2014). A stable tracking
control of skid steered mobile platform. In Informatics
in Control, Automation and Robotics (ICINCO), 2014
11th International Conference on, volume 2, 556–561.
IEEE.
Kozlowski, K. and Pazderski, D. (2004). Modeling and
control of a 4-wheel skid-steering mobile robot. Int. J.
Appl. Math. Comput. Sci, 14(4), 477–496.
Mandow, A., Martinez, J.L., Morales, J., Blanco, J.L.,
Garcia-Cerezo, A., and Gonzalez, J. (2007). Experi-
mental kinematics for wheeled skid-steer mobile robots.
In Intelligent Robots and Systems, 2007. IROS 2007.
IEEE/RSJ International Conference on, 1222–1227.
IEEE.
Morales, J., Martı́nez, J.L., Mandow, A., Pequeno-Boter,
A., and Garcı́a-Cerezo, A. (2010). Simplified power con-
sumption modeling and identification for wheeled skid-
steer robotic vehicles on hard horizontal ground. In In-
telligent Robots and Systems (IROS), 2010 IEEE/RSJ
International Conference on, 4769–4774. IEEE.
Ortigoza, R., Aranda, M., Ortigoza, G., Guzmán, V.,
Vilchis, M., González, G., Lozada, J., and Carbajal,
M. (2012). Wheeled mobile robots: a review. Latin
America Transactions, IEEE (Revista IEEE America
Latina), 10(6), 2209–2217.
Siegwart, R., Nourbakhsh, I.R., and Scaramuzza, D.
(2011). Introduction to autonomous mobile robots. MIT
press.
Velázquez, R. and Lay-Ekuakille, A. (2011). A review of
models and structures for wheeled mobile robots: Four
case studies. In Advanced Robotics (ICAR), 2011 15th
International Conference on, 524–529. IEEE.
643
CHAPTER 17. SYSTEM IDENTIFICATION
Abstract: The work presented aimed to find an ARMAX model for a gas mixer describing its dynamics.
The method considered conducting an identification nonparametric which was to excite the mixer input
signals by a computer to determine basic but valuable information generated. Then a parametric
identification which consisted excite the mixer input with a pseudo random binary signal to obtain the
data with which the model coefficients were obtained was performed. Before it was necessary to define
the model structure and the method of least squares model errors minimized. The cross-validation tests
and correlation performed showed the effectiveness of the method. Based on the results of the mixer, it is
concluded that the Valve-oxigen duct-transmisor and Compressor-air duct-transmisor subsystems are
fourth-order delay.
Keywords: Mixer, Model ARMAX, experimental modeling, least squares method, cross validation.
644
CHAPTER 17. SYSTEM IDENTIFICATION
(2)
(t ) [ y(t 1).. y(t na), , , u(t 1)..u(t nb), , , e(t 1)..e(t nc)]
T
(3)
a1..ana , , , b1..bnb , , , c1..cnc t
645
CHAPTER 17. SYSTEM IDENTIFICATION
646
CHAPTER 17. SYSTEM IDENTIFICATION
Datos de Trabajo del Ducto de Oxígeno Datos de Trabajo del Ducto de Aire Para el modelo estimado del ducto de aire el
4 3.5
MSE es 0.0165 y el MSE para el modelo
Entrada u(k) 2 estimado del ducto de oxígeno es 0.0610.
Entrada u(k)
0 3
La estructura de modelo implementada en
-2 Simulink para los subsistemas Ducto de aire y
-4 2.5 Ducto de oxígeno del mezclador se muestra en
200 250 300 0 50 100
Muestras(k) Muestras(k)
la figura 5.
3.5 4
3
Salida y(k)
2
3
1.5
1 2.5
200 250 300 0 50 100
Muestras(k) Muestras(k)
0.5
-0.5
0 5 10 15 20 25
0.4
0.2
-0.2
-25 -20 -15 -10 -5 0 5 10 15 20 25
Retardo k
0.5
0
Figura 5. Diagrama Simulink para la respuesta
-0.5
0 5 10 15 20 25 del mezclador ante los subsistemas del
0.4
Función de correlación cruzada entre entrada FA y residuos de Salida FM mezclador.
0.2
-0.2
Los resultados de la prueba de validación
-0.4
cruzada tanto del subsistema Ducto de Oxígeno
-25 -20 -15 -10 -5 0 5 10 15 20 25
Retardo k como del Ducto de Aire muestran el buen ajuste
entre los datos observados y los datos estimados
Figura 4. Autocorrelación y correlación cruzada con modelo ARMAX.
de los subsistemas del mezclador.
Los resultados de la validación por simulación
para los subsistemas se muestran en la figura 6.
647
CHAPTER 17. SYSTEM IDENTIFICATION
648
CHAPTER 17. SYSTEM IDENTIFICATION
649
CHAPTER 18
TRAFFIC CONTROL
650
CHAPTER 18. TRAFFIC CONTROL
Abstract: This paper presents the comparative analysis of two types of road traffic models
including motorcycle dynamics: A macroscopic model corresponding to an extension of the
S-model, and the microscopic model found in the Simulation of Urban MObility (SUMO)
simulator, namely the Kraußmodel. Furthermore, both models are tested in a case study and
validated by means of the app Traffic Sensors which has been developed for acquiring multimodal
traffic volumes. simulation results show that the accuracy of the modified S-model is improved
by including an optimal motorcycle factor.
Currently, large cities have an increasing number of ve- case study. Microscopic simulation is achieved thanks to
hicles and externalities such as congestion, pollution and the Simulation of Urban MObility (SUMO) open-source
increased fuel consumption, among others. However, not package. SUMO has been actively developed from 2002
only cars are contributing to the problem, but also other by the Institute of Transportation Systems at the Ger-
vehicles such as public transportation buses, trucks, bicy- man Aerospace Center (DLR), featuring a validated car-
cles, pedestrians and motorcycles (Bellemans et al., 2002). following model and several tools for network and de-
Hence, the proposed traffic models should represent ev- mand modelling. Furthermore, the Traffic Control In-
ery dynamic of urban traffic composition and their re- terface (TraCI) allows to interact with the simulation
lationships, regardless of its macroscopic or microscopic in real-time. Specifically, in this work the implemen-
modelling. The main goal of these models is to manage tation of TraCI for the Matlab
R
environment, namely
traffic networks efficiently, in order to monitor, control and TraCI4Matlab has been used (Acosta et al., 2014). The
take decisions in real time. The microscopic traffic models macroscopic simulation was performed in Matlab
R
con-
describe dynamics for each vehicle as a particle, involving sidering the S model proposed in (Lin et al., 2012), which
characteristics such as speed, acceleration, position and is a well-known macroscopic model used mainly in control
manoeuvres of single vehicles, while macroscopic models applications. These simulations were carried out in a case
consider the traffic as a fluid with properties such as study located in the city of Medellin (Colombia) and val-
density, average speed and flow. However, each model is idated with data obtained through Traffic Sensors, which
used for different purposes (He et al., 2014). Microscopic is an Android application that allows to collect traffic data
models generally have a high computational cost in large- for different vehicle types.
scale networks hindering real-time applications and they
are not easy to validate with real data. On the other hand, This paper is organized as follows: Sections 1 and 2 present
macroscopic models are often inaccurate, but have few a review of microscopic and macroscopic models and
variables allowing an easy validation and a lower com- simulation. Section 3 presents a case study in the city of
putational cost. Actually, the inclusion of other kinds of the Medellı́n. Section 4 briefly describes the software tools
vehicles (different to cars) in macroscopic or microscopic developed for simulation and data acquisition. In Section
simulation is not easy because they consider vehicles with 5 experimental results are showed. Finally, in section 6,
similar characteristics to cars. In this sense, if motorcy- conclusions are presented.
cles or other vehicles are taken into account in an urban
network, they could lead to an inaccurate simulation. 1. MICROSCOPIC TRAFFIC MODELS
The aim of this work is to compare the incorporation Microscopic traffic models consider the behaviour of each
of motorcycle dynamics in microscopic and macroscopic vehicle using first-order differential equations “in a frame-
simulation including motorcycle dynamics in a simplified work close to Newtonian mechanics” (Bellomo and Dogbe,
? Research supported by: COLCIENCIAS under the doctoral schol- 2011). Thus, these models consider the dynamics of each
arship, convocation number 647 and a special acknowledgement to vehicle in relation to the vehicles that surround it. For this
COLCIENCIAS project: Modelamiento y control de tráfico urbano reason, microscopic models are known as models based
en la ciudad de Medellı́n etapa 2, código 1118-669-45309, CT 202- on stimulus and response. The importance of microscopic
2015. Universidad Nacional de Colombia Proyecto HERMES 25374. models lies in the difficulty of macroscopic models to
651
CHAPTER 18. TRAFFIC CONTROL
represent different phenomena such as the type of vehicles Where ṽ(t) is the velocity of the leader, g(t) is the gap
and drivers and the effect of lane changes (Kesting et al., between the two vehicles, τ is the follower braking reaction
2008). time, v̄, a and b are the average speed of the two cars,
their maximum acceleration and deceleration, respectively.
Microscopic models have allowed to describe the heteroge-
Finally and η are the noise amplitude and a random
neous behaviour of drivers (Ossen and Hoogendoorn, 2011)
number between 0 and 1. The Krauß model is a collision
and lane change manoeuvres and their impacts, which
free model whose main assumption is that the braking
validates several features found empirically in macroscopic
distance of the follower plus the distance given by his/her
models (Kesting et al., 2007; Laval and Daganzo, 2006).
reaction time is lower than the braking distance of the
These models consist of two main types: longitudinal mod-
leader plus the space between the leader and the follower.
els, where the most remarkable are car following models,
which describe the movement of vehicles along a lane; and Although microscopic traffic simulators have been success-
lateral models, which describe lane change and overtaking fully validated in many scenarios around the world, there
manoeuvres. are still some aspects that need further improvements,
including the refinement of lane change models and the
The general form of a car following model is described by
incorporation of mixed traffic conditions, specially in sce-
the differential equations (4) and (5) (Wilson and Ward,
narios from countries like India and Colombia, where there
2011).
is a lower lane-dependency in the driving behaviour.
g(t) − ṽ(t)τ
vsaf e = ṽ(t) + (4) leave µu,d,o gu,d,o (k)
v̄/b + τ αu,d,o (k) = min ,
cd
vdes = min{v(t) + ah, vsaf e , vmax } (5) (12)
qu,d,o (k) arriv βu,d,o (k)(Cd,o − nd,o (k))
v(t + h) = max{0, vdes − aη} (6) +αu,d (k), ,
cd cd
652
CHAPTER 18. TRAFFIC CONTROL
lane free
Where l, Nu,d and vu,d are the average length of the
vehicles, the number of lanes in link (u, d) and the free
flow speed. Hence, the S model is described by equations Fig. 2. Map of case study and distribution of tasks.
(16)–(7)
2.1 S Model extension The experimental data were obtained on August 20th 2015
at 9:00 am (-5 GMT) with the help of four participants
In order to include the effect of motorcycles in the S labelled in Figure 2 with numbers from 1 to 4. The data
model, the Passenger Car Equivalent (PCE) concept is collected was classified into cars and motorcycles for all
added, where one possible representation of motorcycles is cases. Each participant had a variable to be measured,
through a factor between vehicles (cars) and motorcycles. that is: Participant 1 measures the inflow to link (u, d)
enter
This concept is clearly defined in (Yperman, 2011), and αu,d (k), participant 2 measures the arrival flow at the
arrive
can be included explicitly in the inflow as: end of the queue αu,d (k), participant 3 measures the
leave
outflow from link (u, d) αu,d (k) and participant 4 mea-
enter
αu,d venter
(k) = αu,d menter
(k) + MF αu,d (k) (17) sures the green and red time of traffic light gu,d (k). The
information of each measurement point was recorded in
venter menter plain text through the android mobile application, namely
Where αu,d (k) is the measured vehicle inflow, αu,d (k)
is the measured motorcycle inflow and MF is the motor- Traffic Sensors, developed in the project (MOYCOT,
cycle factor which can take values between 0 and 1. 2016) specifically for this purpose. Traffic Sensors is briefly
described in section 4.
3. CASE STUDY This case study was modelled in the Simulation of Ur-
ban MObility (SUMO) microsimulator (including motor-
In order to validate the macroscopic and microscopic cycles), to compare the results with the experimental data
models, a traffic experiment was proposed in the city of and with the results obtained with the S model.
653
CHAPTER 18. TRAFFIC CONTROL
Matlab TraCI SUMO that the value yielding the minimum mean square error
application simulation is MF = 0.6. Additionally, Figure 4(a) shows the traffic
light signal with G and R representing, respectively, the
TraCI4Matlab sumolib4matlab NetworkhEditor green and red traffic light states.
h Online h Networkhparsing forhSUMO
h interaction h Demand h Sethuphthe
h generation h scenario G
h Importhfrom
h OSMh
Normalized error
output a wide variety of aggregated and disaggregated
data including vehicle trajectories, densities, emissions 0.8
and traffic lights states. These data can be obtained at
the end of the simulation in form of XML files, or in Optimal
0.6
execution time through the TraCI API. On the other hand,
the macroscopic models were developed and analysed in
Matlab
R
. Furthermore, traffic lights control strategies 0 0.2 0.4 0.6 0.8 1
are also developed in this environment in the MOYCOT Moto Factor
project (Noreña et al., 2015). These reasons motivated (b) Moto factor against normalized mean
the development of an implementation of TraCI for the square error
Matlab language, which was called TraCI4Matlab. Thanks
to the commands provided by TraCI4Matlab, it is possible Fig. 4. Traffic light signal and Moto Factor calculation.
to retrieve and change the properties of the objects in
the SUMO simulation in real time, as showed in figure Furthermore, the same scenario was implemented in the
3. Additionally, TraCI4Matlab was developed using agile SUMO microsimulator and compared with the macro-
methodologies including the re-engineering of the available scopic simulation, through the modified S model (2.1) with
Python implementation, which shortened the development typical parameters (theoretical data), and with the real
time considerably. Furthermore, tools for traffic demand data (experimental data). The main states of the link, total
generation in Matlab and graphical network editing have number of vehicles and total queued vehicles are showed
been developed under the same methodologies (Acosta in Figure 5. In addition, Figure 5 shows some accuracy
et al., 2015). differences between the theoretical and simulation data.
The first ones are apparently closer to the real data, with
4.2 Traffic Sensors an error of 2.39 veh and 2.7 veh for the number of vehicles
and vehicles in queue, respectively. On the other hand,
Traffic Sensors is a lightweight Android application aimed the errors for simulation data are 4.02 veh and 1.89 veh.
to the collection of multi-modal traffic data and traffic In this way, theoretical results show an improvement of
lights states, which allowed the validation of the micro- 13.89% over the results obtained in microsimulation. This
scopic and macroscopic simulations described in this work. improvement was achieved thanks to the inclusion of the
The data obtained with traffic sensors is time-stamped optimal motorcycle factor for this specific link.
using the clock of the device, enabling synchronized col-
laborative experiments. Furthermore, Traffic Sensors syn- 6. CONCLUSION AND FURTHER RESEARCH
chronizes these data with a remote server, simplifying the
collection and analysis tasks. This paper compared a microscopic and macroscopic
model in an experimental scenario including motorcycles.
5. EXPERIMENTAL RESULTS Both simulations were performed using traditional meth-
ods, however, in the macroscopic modelling an equivalence
Taking into account the mean square error in the number factor between cars and motorcycle (MF ) was included
of vehicles between the experimental data and theoretical using the S model, making a substantial difference. Further
data (S model), the optimal Passenger Car Equivalent research will include a dynamic motorcycle factor or other
(PCE), MF , was found as showed in Figure 4(b), where representation, where there will be an explicit interaction
several simulations were performed varying the PCE be- between motorcycles and cars considering the particulari-
tween 0 and 1 with an increment of 0.1. It was found ties of each vehicle.
654
CHAPTER 18. TRAFFIC CONTROL
655
CHAPTER 18. TRAFFIC CONTROL
Abstract: This paper contains the results of simulation in the implementation of a Generalized
Predcitive Control (GPC) Model for a traffic light system of an isolated intersection of network
traffic, using the SUMO microsimulator as simulated plant and the linear prediction model
called Modified Multi-class Queueing Networks (MMQM) for the GPC.
656
CHAPTER 18. TRAFFIC CONTROL
657
CHAPTER 18. TRAFFIC CONTROL
x2 (n + 1) = x2 (n) + a2 (n) − α2,6 x2 (n) (4) the estimation through optimization procedures based
x6 (n + 1) = x6 (n) + α2,6 x2 (n) − α6,12 x6 (n) − ... on input/output data. The identified parameters were
−α6,13 x6 (n) − α6,14 x6 (n) (5) fi,max in this case f 1, f 2, f 3, f 4 related with the max-
imum output flow of vehicles that can cross from one
x12 (n + 1) = x12 (n) + α6,12 x6 (n) − gu2 (n)f2 link to another through the intersection. They depend
x13 (n + 1) = x13 (n) + α6,13 x6 (n) − gu2 (n)f2 directly on the input demand of vehicles ai (n) of each
x14 (n + 1) = x14 (n) + α6,14 x6 (n) − gu2 (n)f2 link and must take nonnegative values. The parameters
α1,5 , α2,6 , α3,7 , α4,8 , α5,9 , α5,10 , α5,11 , α6,12 , α6,13 , α6,14 ,
x3 (n + 1) = x3 (n) + a3 (n) − α3,7 x3 (n) (6)
α7,15 , α7,16 α7,17 , α8,18 , α8,19 , α8,20 are not estimated, they
x7 (n + 1) = x7 (n) + α3,7 x3 (n) − α7,15 x7 (n) − ... are assumed to be a constant and balanced for a distribu-
−α7,16 x7 (n) − α7,17 x7 (n) (7) tion of vehicles on each turning direction.
x15 (n + 1) = x15 (n) + α7,15 x7 (n) − gu2 (n)f3
An input signal was designed in such a way that the green
x16 (n + 1) = x16 (n) + α7,16 x7 (n) − gu2 (n)f3 light time changes in an interval between 10 and 50 seconds
x17 (n + 1) = x17 (n) + α7,17 x7 (n) − gu2 (n)f3 to ensure that a cycle time (60 seconds) contains both
x4 (n + 1) = x4 (n) + a4 (n) − α4,8 x4 (n) (8) phases. The identified parameters are shown in Table 2
x8 (n + 1) = x8 (n) + α4,8 x4 (n) − α8,18 x8 (n) − ... Table 2. Parameter estimation
−α8,19 x8 (n) − α8,20 x8 (n) (9)
Parameter Value
x18 (n + 1) = x18 (n) + α8,18 x8 (n) − gu1 (n)f4 f1 0.0072
x19 (n + 1) = x19 (n) + α8,19 x8 (n) − gu1 (n)f4 f2 0.0046
x20 (n + 1) = x20 (n) + α8,20 x8 (n) − gu1 (n)f4 f3 0.0053
f4 0.0067
3.2 Experimental set up
Figure 4 shows the comparison between the MMCM model
The controlled signal for the MMQM model and SUMO and the SUMO data when the input signal is applied.
was g1 (n) and the total light cycle was set to a constant Both respond to the control signal, decreasing the vehicles
value cd = 60[s]. The input vehicle demand was considered accumulation when the green light time increases and vice-
as a disturbance (in this case known) for estimating the versa. It can also be noticed that the model trend fits the
parameters. The inputs were chosen in order to avoid SUMO data.
saturating the intersection. Additionally, the cumulative
sum of the vehicles entering each link, for one hour (3600 70
seconds) is as follows. Control signal g1[s]
Queue link 1 SUMO[Veh.]
60
Queue link 1 MMQM[Veh.]
1600 50
1400
40
1200
30
1000
Vehicles
20
800
10
600
0
400 0 500 1000 1500 2000 2500 3000 3500
Time[s]
200
(a) Phase 1 comparison
0
0 500 1000 1500 2000 2500 3000 3500
Time[s]
70
Control signal g2 [s]
Fig. 3. Input demand 60
Queue link2 SUMO[Veh.]
Queue link 2 MMQM[Veh.]
o3a 400
0
o4a 400 0 500 1000 1500 2000 2500 3000 3500
Time[s]
658
CHAPTER 18. TRAFFIC CONTROL
3.4 State estimator denote minimum and maximum values for the phase
duration ui . The sum of phase durations is the total cycle
A Kalman filter based estimator was designed using the time (60s), the maximum and minimum values of queue
extended model described in equations (Kalman, 1960). length yi based on link the capacity and the variables are
This model includes an integral action for representing the subject to the MMQN model.
uncertainties in the system.
4. SIMULATION RESULTS
x(n + 1) = Ax(n) + Bu(n) + Gw(n) + Bd d(n) (10)
The obtained queue length of each link with yref = 0 and
y(n) = Cx(n) + v(n)
constant input demands are shown in following figures:
d(n + 1) = d(n) + Bwd wd (n)
The terms Gw and Vn corresponds to the process and mea- 12 9
Vehicles
Vehicles
follows: 6
5
4 3
2
2
x(n + 1) A Bd
x(n) B 1
= + u(n) (11) 0 0
" w(n) # (a) vihicles queue link 1 (b) vihicles queue link 2
G 0
+ wd (n)
0 Bwd 12 9
8
10
7
8 6
" #
Vehicles
Vehicles
5
x(n) 6
4
2
2
1
0 0
0 500 1000 1500 2000 2500 3000 3500 0 500 1000 1500 2000 2500 3000 3500
Covariance matrices tuned to design the filter were Qn = Time [s] Time [s]
105 I y Rn = 10I, Bd = B. Finally, the state observer is: (c) vihicles queue link 3 (d) vihicles queue link 4
in section 3.3 ):
3
2.8
N
X N
X
min ||y(n + j) − yref ||2Q + ||u(n + j)||2R 2.6
U
j=1 j=0
2.4
(13)
2.2
0 500 1000 1500 2000 2500 3000 3500
Time[s]
Subject to:
x(n + j + 1) = Ax(n + j) + Bu(n + j) Fig. 6. GPC control signal with constant demand
y(n + j) = Cx(n + j)
10 ≤ ui (n + j) ≤ 50 ∀j = 1, ..., N In order to test the response of the GPC controller in
0 ≤ yi (n + j) ≤ 44 ∀j = 1, ..., N another situation, a change on the input demand was
tested. The constant input changes to variable demand
u1 (n + j) + u2 (n + j) = 60 ∀j = 1, ..., N in the middle of the simulation time, t = 1800s. Thus,
figure 7 shows how the controller is looking for keeping
The aim is to have the minimum queue length, thus the the network balance by reducing the green time of Phase
reference signal is set to zero (no queue). The constraints 1.
659
CHAPTER 18. TRAFFIC CONTROL
4 4
x 10 x 10
3.8 5
u1 u1
3.6 u2 4.5 u2
3.4 4
3.2 3.5
Time [ms]
Time [ms]
3 3
2.8 2.5
2.6 2
2.4 1.5
2.2 1
0 500 1000 1500 2000 2500 3000 3500 0 500 1000 1500 2000 2500 3000 3500
Time [s] Time[s]
Fig. 7. GPS control signal with variable demand Fig. 9. GPC control signal reference exchange
4.1 Reference change Table 3. Comparison of strategies
10
8
Daganzo, C.F. (1995). Requiem for second-order fluid
approximations of traffic flow. Transportation Research
6
Part B, 29(4), 277–286.
4
Kalman, R. (1960). A new approach to linear filtering
2 and prediction problems. Journal of basic Engineering,
0 82(1), 35–45.
0 500 1000 1500 2000 2500 3000 3500
Time[s] Krajzewicz, D., Erdmann, J., Behrisch, M., and Bieker,
L. (2012). Recent Development and Applications of
Fig. 8. Vehicles in the queue link 2 reference exchange {SUMO - Simulation of Urban MObility}. International
Journal On Advances in Systems and Measurements,
In Figure 9, it is observed that the time interval in Green 5(3), 128–138.
of Phase 2 decreases in order to achieve the reference. Le, T., Vu, H.L., Nazarathy, Y., Vo, Q.B., and Hoogen-
doorn, S. (2013). Linear-quadratic model predictive con-
5. CONCLUSION trol for urban traffic networks. Transportation Research
Part C: Emerging Technologies, 36, 498–512.
Optimal controllers are subject to covariance matrices that Lighthill, M.J. and Whitham, G.B. (1955). On Kinematic
define the weights to give importance to some control ob- Waves. II. A Theory of Traffic Flow on Long Crowded
jectives over others. Therefore the tuning of these matrices Roads. Proceedings of the Royal Society A: Mathemati-
is a critical part in the design of these controllers. If the cal, Physical and Engineering Sciences, 229(1178), 317–
value of the R matrix is very big the controller does not 345.
respond to significant changes in demand, because the Norena, L., Sarrazola, A., Jim, E., Jaramillo, O., Portilla,
control variable is strongly penalized. C., Alejandro, M., and Espinosa, J. (2015). Model Pre-
The total number of vehicles waiting at the interaction dictive Control of an Urban Traffic Intersection Based
during simulation, compared between strategies fixed time on the Modified Multi-class Queuing Networks Models.
and GPC, as shown in the Table 3 1–4.
Papageorgiou, M., Kiakaki, C., Dinopoulou, V., and Kot-
The GPC controller has cumulated 695 vehicles less at the sialos, a. (2003). Review of road traffic control strategies.
intersection compared with to the fix time control strategy. Proceedings of the IEEE, 91(12), 2043–2067.
660
CHAPTER 18. TRAFFIC CONTROL
661
CHAPTER 18. TRAFFIC CONTROL
Abstract: This article presents a Nonlinear Model Predictive Control (MPC) for controlling
the lateral dynamics of a passenger vehicle, based on a planar Single Track Model. This MPC
controller is aimed at tracking suitable reference values on its states in order to achieve a
lane change manoeuvre. Simulation results are showed for an scenario with a single vehicle and
another scenario with two vehicles in the target lane, for different initial conditions that validate
the proposed nonlinear MPC.
662
CHAPTER 18. TRAFFIC CONTROL
where:
4. SIMULATION RESULTS FOR A SINGLE VEHICLE
• m is the vehicle’s mass.
• ẋ0 is the vehicle’s velocity in direction of its longitu- The MPC controller presented in section 3 has been tested
dinal axis, assumed constant. in simulation using Matlab
R
and Simulink, based on the
• Iz is the vehicle’s yaw moment of inertia. parameters found in Table 1. The parameters related to
• lf , lr are the distances from the vehicle’s center of the vehicle are the same used by (Rajamani, 2011), and
mass G to its front and rear wheels, respectively. correspond to a passenger sedan. Satisfactory results were
• Cαf , Cαr are the cornering stiffness for the front and obtained with a sample time Ts = 0.5, though Ts = 0.05
rear wheels, respectively. could have been used, as in (Anderson et al., 2010), where
real experiments have been made. The prediction horizon
The set of equations (1) defines the nonlinear model was chosen as N = 10, since the interest is on modelling
ẋ = f (x, u) to be used in the MPC controller. It is worth lane changes. This value corresponds to 5s, and can be
to note that small tire slip angles are assumed. Hence, regarded as the approximate maximum duration of a lane
the wheel’s lateral forces acting on the vehicle can be change, according to (Toledo and Zohar, 2015). Finally,
considered as linear functions of the corresponding slip the, δmin , δmax and ∆δmax restrictions values have been
angles, with the cornering stiffness acting as the relating taken from (Anderson et al., 2010) and (Falcone et al.,
proportionality constant. The slip angle is defined as the 2007).
angle between the velocity vector acting on the tire and
its orientation. Moreover, small angle approximations are Figure 2 shows the trajectory obtained with the proposed
made in regard to the vehicle’s slip angle, defined as the MPC. A reference value of 3.3m in the YW state was set,
663
CHAPTER 18. TRAFFIC CONTROL
4 0.03
Y ∆δ
3.5 Yref ∆δmax
0.02
3
0.01
2.5
∆δ (rad/s)
Y (m)
2 0
1.5
−0.01
1
−0.02
0.5
0 −0.03
0 5 10 15 20 0 5 10 15 20
t (s) t (s)
Fig. 2. Simulation results for the single vehicle scenario, Fig. 4. Simulation results for the single vehicle scenario,
showing the obtained trajectory showing the rate of change of the front wheel steering
angle
0.2
δ
0.15 δmax
0.1
0.05
δ (rad)
−0.05
−0.1
−0.15
Fig. 5. Notation for the position of vehicles involved in a
−0.2
0 5 10 15 20
lane change
t (s) dp,q (k + j) ≥ dsaf e , ∀j = 1, ..., N (9)
With:
Fig. 3. Simulation results for the single vehicle scenario,
showing the front wheel steering angle dp,q (k + j) = krp q
W (k + j) − rW (k + j)k (10)
which can be taken as the approximate distance between Equations (9) and (10) state that the euclidean distance
the center of two lanes. Figures 3 and 4 show the control between the subject vehicle and the vehicles in the target
action calculated by the MPC, its rate of change and the lane must be greater than a given safety distance dsaf e .
corresponding constraints. Note that this approach assumes that the subject vehicle
knows the position of those in the target lane, which could
be achieved with technologies such as vehicular communi-
5. INCLUDING VEHICLES IN THE TARGET LANE cations, DGPS and computer vision. Another interesting
aspect is that the subject vehicle should accurately pre-
dict the position of vehicles in the target lane within the
Consider a scenario with one vehicle that wants to execute
prediction horizon, which could be achieved through one
a lane change (called “subject vehicle”, from now), iden-
of the many well-known car-following models (Li and Sun,
tified with the superscript p, and vehicles in the target
2012).
lane identified with the superscripts p − 1 and p + 1 as
showed in figure 5. An additional constraint is required to
prevent the subject vehicle to collision with the vehicles in 6. SIMULATION RESULTS CONSIDERING
the target lane, which can be defined, as follows. Let: VEHICLES IN THE TARGET LANE
664
CHAPTER 18. TRAFFIC CONTROL
0.2
1
δ(XW (0) = 10)
0.15 1
δ(XW (0) = 0)
0.1 δmax
0.05
δ (rad)
0
−0.05
−0.1
3
35
Y (m)
0 15
0 5 10 15 20
t (s)
10
7. CONCLUSIONS REFERENCES
Anderson, S.J., Peters, S.C., Pilutti, T.E., and Iag-
In this article, a nonlinear Model Predictive Control nemma, K. (2010). An optimal-control-based frame-
(MPC) of a passenger vehicle aimed for lane changes work for trajectory planning, threat assessment, and
was presented. This MPC is based on a simplified Single semi-autonomous control of passenger vehicles in hazard
Track Model (STM), including two states that describe avoidance scenarios. International Journal of Vehicle
the position of the vehicle in a global coordinate system, Autonomous Systems, 8(2), 190–216.
so that proper reference values can be tracked in order Bifulco, G.N., Pariota, L., Simonelli, F., and Di Pace,
to execute the lane change maneuver. The effectiveness R. (2013). Development and testing of a fully Adap-
of the proposed MPC could be validated in simulation. tive Cruise Control system. Transportation Research
Furthermore, an additional constraint was included in Part C: Emerging Technologies, 29, 156–170. doi:
order to consider the influence of vehicles in the target 10.1016/j.trc.2011.07.001.
665
CHAPTER 18. TRAFFIC CONTROL
666