In-Situ Optimal Control of Nutrient Solution For Soilless Cultivation
Feng Chen¹, Haidong He², Yongning Tang
Department of Automation, University of Science and Technology of China, Hefei, China
School of Information Technology, Illinois State University, Chicago, USA
¹chenfeng@ustc.edu.cn, ²haidongee@gmail.com, ytang@illinois.edu
Abstract: Soilless cultivation is one of the main production modes in precision agriculture. Nutrient solution for soilless cultivation primarily consists of macroelements such as $\mathrm{NO_3^-}$, $\mathrm{H_2PO_4^-}$, $\mathrm{K^+}$, $\mathrm{Ca^{2+}}$, $\mathrm{Mg^{2+}}$, and $\mathrm{SO_4^{2-}}$. In the vegetable-nutrient solution system, there are complicated correlations between greenhouse vegetable growth and nutrient solution, so it is a challenge to achieve optimal control of nutrient solution. In this work, based on Q-learning, we propose, for the first time, an in-situ optimal control method of nutrient solution compositions for greenhouse vegetables. Instead of modeling the correlations between greenhouse vegetable growth and nutrient solution, this method searches for an optimal control policy through systematic interaction with the environment. The effect of nutrient solution compositions on the photosynthetic rate of greenhouse vegetables is experimentally investigated, and on this basis the reward function is designed. The experimental results show that our method is effective and practical.

Keywords: nutrient solution; in-situ; optimal control; Q-learning

I. INTRODUCTION

Soilless cultivation is one of the main production modes for greenhouse agriculture. The closed-cycle control system for nutrient solution is a key technique in soilless cultivation. It involves growing greenhouse vegetables by collecting leftover nutrient solution, blending it with fresh nutrient solution, and recirculating the mixture in subsequent irrigation [1]. The nutrient solution can be drained back and reused to avoid wasting nutrients and to reduce the environmental impact, thus fulfilling recent directives for environmental protection.

Nutrient solution for soilless cultivation primarily consists of macroelements such as $\mathrm{NO_3^-}$, $\mathrm{H_2PO_4^-}$, $\mathrm{K^+}$, $\mathrm{Ca^{2+}}$, $\mathrm{Mg^{2+}}$, and $\mathrm{SO_4^{2-}}$. Optimal control of nutrient solution for soilless cultivation can minimize water and fertilizer losses and maximize the yield and quality of greenhouse vegetables [2].

Greenhouse vegetables are influenced not only by the surrounding climate but also by the nutrient solution. Vegetable growth and its nutrient environment form a dynamic and complex system under the greenhouse environment. Because the growth process of greenhouse vegetables exhibits large time delay and uncertainty, it is extremely difficult to model the correlations between greenhouse vegetable growth and nutrient solution. Therefore, it is a challenge to achieve optimal control of nutrient solution. The purpose of this work is to find an in-situ optimal control policy for nutrient solution under a natural climate environment.

Research on optimal control of nutrient solution has received considerable attention in the recent two decades [3]. Several efforts have been made to use nutrient solution as efficiently as possible in soilless cultivation. In 1999, Savvas et al. first proposed an algorithm to replenish and reuse the drain nutrient solution in closed hydroponic systems by dispensing as many nutrients and as much water into it as needed to maintain a target electrical conductivity in the irrigation solution. To achieve this in each watering application, the dilution ratios of the concentrated fertilizer solutions are automatically adjusted by this algorithm in relation to the volume and the electrical conductivity of the drain solution [4]. Lykas et al. presented a method to control recirculating nutrient solutions in hydroponic crops that uses correction algorithms to manage soilless cultures by measuring electrical conductivity (EC) and pH [5]. Based on least-squares fitting with a BP neural network, Yang et al. developed a method to control the pH and EC of nutrient solution [6]. Zhang et al. studied an unlimited-growth cultivation mode for greenhouse tomato grown under deep flow technique conditions. By researching control techniques for the root-system microenvironment of the nutrient solution, they developed a control system for the pH, EC, and temperature of the nutrient solution [7].

All the aforementioned studies control the nutrient solution relying only on pH and electrical conductivity (EC) values. Because pH represents the total acidity and basicity of the solution and EC represents the total activity of ions, there is no functional relation between pH, EC, and the macroelements of the nutrient solution [8]. Such methods therefore cannot achieve optimal control of nutrient solution compositions.

Based on Q-learning, we propose, for the first time, an in-situ optimal control algorithm for nutrient solution compositions. Instead of modeling the complex correlations between greenhouse vegetable growth and its nutrient environment, this algorithm finds an optimal control policy for nutrient solution compositions through systematic interaction with the environment. The correlations between the nutrient solution compositions and the photosynthetic rate of greenhouse vegetables are experimentally investigated. On this basis, the reward function is designed. The experimental results demonstrate that this algorithm is effective and feasible.

II. Q-LEARNING

Reinforcement learning is a type of unsupervised learning and is consistent with the behaviorism theory presented by Brooks [9]. Recently, reinforcement learning has been a focus of machine learning and has been applied successfully in many fields [10-13]. Q-learning is one of the most important reinforcement learning algorithms [14]. In the reinforcement learning model, an agent acts on the environment and receives a reinforcement signal, either a punishment or a reward, induced by the environment state transition. The learning task for an agent is to search for an optimal policy, that is, an action selection rule corresponding to each given state. The basic model of reinforcement learning is shown in Fig. 1.

Photosynthetic rate can represent the real-time status of greenhouse vegetable growth. Current sensor techniques make it possible to measure the photosynthetic rate of greenhouse vegetables online. Therefore, we choose photosynthetic rate as the evaluation indicator of the real-time growing status of greenhouse vegetables.

The correlations between nutrient solution compositions and the photosynthetic rate of the greenhouse vegetable are explored. Through two years of experimental investigation, we have the following findings:
• In a given greenhouse climate, proper composition concentrations of nutrient solution can increase the photosynthetic rate of greenhouse vegetables. Too high or too low a composition concentration of nutrient solution will decrease the photosynthetic rate (shown in Fig. 2).
[Fig. 1: basic model of reinforcement learning, with labels State s, Reward r, and Action a.]
[Fig. 2: plot not recoverable from the extracted text.]
2011 3rd International Conference on Advanced Computer Control (ICACC 2011)
[...] $S_1, S_2, \ldots, S_i, \ldots, S_n$ is adopted to represent the greenhouse vegetable growing process, where $i = 1$ denotes the initial time and $i = n$ the terminal time in one iteration of the nutrient solution control. The related definition is given as follows.

Definition: in-situ optimal control of nutrient solution for greenhouse vegetables is defined as a quadruple $(S, A, P, R)$, where $S$ is a finite set of the growing states of the greenhouse vegetable; $A = A_N \times A_P \times A_K \times A_C \times A_M \times A_S$ is a finite set of executable joint actions, where $A_N$, $A_P$, $A_K$, $A_C$, $A_M$, and $A_S$ are the component action sets of the joint action set $A$, used to control $\mathrm{NO_3^-}$, $\mathrm{H_2PO_4^-}$, $\mathrm{K^+}$, $\mathrm{Ca^{2+}}$, $\mathrm{Mg^{2+}}$, and $\mathrm{SO_4^{2-}}$ in the nutrient solution, respectively; $P$ is the state transition probability; and $R: S \times A \to \mathbb{R}$ is a numerical reward function. The control system aims at finding a policy that obtains the optimal discounted cumulative reward for any discrete state sequence when $P$ and $R$ are unknown.

The reward function is the real-time photosynthetic rate, defined as $\mathrm{PHO}/\mathrm{PHOMAX}$, where PHOMAX is the maximal possible photosynthetic rate.

As the states and actions are continuous values, the search space is extremely large. Considering the delay and imprecision that characterize greenhouse vegetable growth, $\Delta\mathrm{PHO}$ and $\Delta\mathrm{PAR}$ are segmented according to formulas (2) and (3) to reduce the search space.

$$
\Delta\mathrm{PHO} =
\begin{cases}
pho_1, & -1.0 \le \Delta\mathrm{PHO} < -0.5 \\
pho_2, & -0.5 \le \Delta\mathrm{PHO} < 0 \\
pho_3, & \Delta\mathrm{PHO} = 0 \\
pho_4, & 0.0 < \Delta\mathrm{PHO} < 0.5 \\
pho_5, & 0.5 \le \Delta\mathrm{PHO} < 1.0
\end{cases}
\tag{2}
$$

$$
\Delta\mathrm{PAR} =
\begin{cases}
par_1, & -50 \le \Delta\mathrm{PAR} < -30 \\
par_2, & -30 \le \Delta\mathrm{PAR} < -10 \\
par_3, & -10 \le \Delta\mathrm{PAR} \le 10 \\
par_4, & 10 < \Delta\mathrm{PAR} < 30 \\
par_5, & 30 \le \Delta\mathrm{PAR} < 50
\end{cases}
\tag{3}
$$

Here, the units of $\Delta\mathrm{PHO}$ and $\Delta\mathrm{PAR}$ are $\mu\mathrm{mol}/(\mathrm{m^2 \cdot s})$.

Considering both the possible variation range of nutrient [...] basic unit. The basic unit is determined according to the following formula:

$$
\Delta x = \frac{Y_{\max} - Y_{\min}}{a}
\tag{4}
$$

Here, $Y_{\max}$ and $Y_{\min}$ are the maximum and minimum of the nutrient solution compositions, and $a$ is a constant generally set to 5 or 10. The search space can be significantly reduced with state segmentation and action definition.

Furthermore, to avoid being trapped in a local optimum, the Boltzmann distribution is used to select the joint actions, which achieves an effective tradeoff between exploration and exploitation during the search process.

IV. ALGORITHM DESCRIPTION

Let $\alpha$ be the learning rate and $\gamma$ the discount rate, where $\alpha, \gamma \in (0, 1)$. A policy $f$ is a function from states to actions, $f: S \to A$, and $f(x)$ denotes the action to be conducted in state $x$. For a fixed policy $f$, let $V_f(x)$ be the value function for policy $f$, that is, the expected reward given that the process begins in state $x$ and follows policy $f$ thereafter. The control algorithm for nutrient solution is described as follows:

    Q ← initialized action values
    k = 0    // k is the iteration index
    Let $s_e$ be the goal state
    1: Repeat
        s ← current state
        If $s = s_e$ then goto 1 endif
        Select an action to perform according to the Boltzmann distribution:
            $p(a, s) = \dfrac{e^{Q(s,a)/T}}{\sum_{b \in A} e^{Q(s,b)/T}}$
            // T is a temperature parameter that adjusts the degree of randomness in action choice
        If the concentration of composition i ≥ the maximum concentration of composition i then $a_i$ = null endif
        Perform action a and update the function Q(s, a):
            $Q(s, a) \leftarrow (1 - \alpha)\, Q(s, a) + \alpha\, [\, r + \gamma V(y) \,]$
            // $V(y) = Q(y, f(y))$; for each $y \in S$: $f(y) \leftarrow a$ such that $Q(y, a) = \max_{b \in A} Q(y, b)$
        Revise the policy value: $V_f(s) = \max_{b \in A} Q(s, b)$
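The action selection and update steps of the algorithm can be sketched in Python as follows. This is a minimal illustration, not the authors' implementation: the dictionary-based Q-table, the function names, the toy states and actions, and the numeric values of `alpha`, `gamma`, and `temperature` are all assumptions made for the example.

```python
import math
import random


def boltzmann_probs(q_row, temperature):
    """p(a, s) = exp(Q(s,a)/T) / sum_b exp(Q(s,b)/T) for one state's Q-values."""
    # Shifting by the max avoids overflow and leaves the probabilities unchanged.
    q_max = max(q_row.values())
    weights = {a: math.exp((q - q_max) / temperature) for a, q in q_row.items()}
    total = sum(weights.values())
    return {a: w / total for a, w in weights.items()}


def select_action(q_row, temperature, rng=random):
    """Sample one action from the Boltzmann distribution over q_row."""
    probs = boltzmann_probs(q_row, temperature)
    actions = list(probs)
    return rng.choices(actions, weights=[probs[a] for a in actions], k=1)[0]


def q_update(Q, s, a, r, y, alpha, gamma):
    """Q(s,a) <- (1 - alpha) Q(s,a) + alpha [r + gamma V(y)], V(y) = max_b Q(y,b)."""
    v_y = max(Q[y].values())
    Q[s][a] = (1 - alpha) * Q[s][a] + alpha * (r + gamma * v_y)
    return Q[s][a]


# Hypothetical toy table: two states and two actions, all values start at zero.
states = ["pho3", "pho4"]
actions = ["add", "null"]
Q = {s: {a: 0.0 for a in actions} for s in states}

# One update after observing reward r in state s and arriving in state y.
new_q = q_update(Q, s="pho3", a="add", r=0.8, y="pho4", alpha=0.5, gamma=0.9)
probs = boltzmann_probs(Q["pho3"], temperature=1.0)
chosen = select_action(Q["pho3"], temperature=1.0, rng=random.Random(0))
```

A large `temperature` makes the choice close to uniform (more exploration), while a small one concentrates probability on the action with the highest Q-value (more exploitation).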
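The state segmentation of formulas (2) and (3) and the reward PHO/PHOMAX can be illustrated with a short Python sketch. The function names are hypothetical, and combining the two labels into a single state tuple is an assumption; the text does not state how the two segmented quantities form the state.

```python
def segment_dpho(dpho):
    """Map a change in photosynthetic rate to pho1..pho5 following formula (2)."""
    if -1.0 <= dpho < -0.5:
        return "pho1"
    if -0.5 <= dpho < 0.0:
        return "pho2"
    if dpho == 0.0:
        return "pho3"
    if 0.0 < dpho < 0.5:
        return "pho4"
    if 0.5 <= dpho < 1.0:
        return "pho5"
    raise ValueError(f"dPHO {dpho} is outside the range covered by formula (2)")


def segment_dpar(dpar):
    """Map a change in PAR to par1..par5 following formula (3)."""
    if -50 <= dpar < -30:
        return "par1"
    if -30 <= dpar < -10:
        return "par2"
    if -10 <= dpar <= 10:
        return "par3"
    if 10 < dpar < 30:
        return "par4"
    if 30 <= dpar < 50:
        return "par5"
    raise ValueError(f"dPAR {dpar} is outside the range covered by formula (3)")


def discrete_state(dpho, dpar):
    """Assumed state encoding: the pair of segment labels (25 combinations)."""
    return (segment_dpho(dpho), segment_dpar(dpar))


def reward(pho, pho_max):
    """Reward as the ratio PHO / PHOMAX."""
    return pho / pho_max


state = discrete_state(dpho=0.2, dpar=-15.0)
r = reward(pho=12.5, pho_max=25.0)
```

Values outside the ranges in formulas (2) and (3) raise an error here because the text does not say how they should be handled.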
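The basic unit of formula (4) and the joint action set $A$ can be sketched as below. The text defining the per-nutrient actions is missing from this copy, so the sketch assumes, purely for illustration, that each nutrient's action is either "add one basic unit" or null; the names `basic_unit`, `joint_actions`, and `mask_saturated` are hypothetical.

```python
from itertools import product

# The six macroelements controlled by A_N, A_P, A_K, A_C, A_M, A_S.
NUTRIENTS = ("NO3-", "H2PO4-", "K+", "Ca2+", "Mg2+", "SO4^2-")


def basic_unit(y_max, y_min, a=10):
    """Formula (4): delta_x = (Y_max - Y_min) / a, with a typically 5 or 10."""
    return (y_max - y_min) / a


def joint_actions(per_nutrient_actions=("add", None)):
    """Cartesian product of the per-nutrient action sets (assumed: add or null)."""
    return list(product(per_nutrient_actions, repeat=len(NUTRIENTS)))


def mask_saturated(action, conc, conc_max):
    """Set a_i to null (None) where composition i has reached its maximum."""
    return tuple(
        None if c >= c_max else a_i
        for a_i, c, c_max in zip(action, conc, conc_max)
    )


dx = basic_unit(y_max=20.0, y_min=0.0, a=10)
all_actions = joint_actions()
masked = mask_saturated(
    action=("add",) * 6,
    conc=[1.0, 5.0, 1.0, 1.0, 1.0, 1.0],
    conc_max=[5.0] * 6,
)
```

Under this assumption the joint action set has 2^6 = 64 elements, which shows how a coarse basic unit keeps the search space small.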
Greenhouse vegetables and their nutrient environment form a dynamic and complex system. Because it is difficult to model the correlations between greenhouse vegetable growth and nutrient solution compositions, it is a challenge to achieve optimal control of nutrient solution. Based on [...]

[11] O Jangmin, L. B. Jongwoo, Byoung-Tak Zhang, et al., "Adaptive stock trading with dynamic asset allocation using reinforcement learning," Information Sciences, vol. 176, pp. 2121-2147, 2006.
[12] K. Macek, I. Petrovic, N. Peric, "A reinforcement learning approach to obstacle avoidance of mobile robots," 7th International Workshop on Advanced Motion Control, July 2002, pp. 462-466.
[13] Steven D. Whitehead, Long-Ji Lin, "Reinforcement learning of non-Markov decision processes," Artificial Intelligence, vol. 73, pp. 271-306, 1995.
[14] C. J. C. H. Watkins, P. Dayan, "Technical notes: Q-learning," Machine Learning, vol. 82, pp. 39-46, 1992.
[15] Weijie Jiang, Wei Liu, New technique for soilless cultivation of vegetable, Beijing: Jindun Press, 1998.
[16] F. Lafont, J. F. Balmat, "Optimized fuzzy control of a greenhouse," Fuzzy Sets and Systems, vol. 128, pp. 47-59, 2002.