You are on page 1of 7

Computers and Electronics in Agriculture 130 (2016) 13–19

Contents lists available at ScienceDirect

Computers and Electronics in Agriculture


journal homepage: www.elsevier.com/locate/compag

Original papers

A computational environment to support research in sugarcane


agriculture
Carlos Driemeier a, Liu Yi Ling a, Guilherme M. Sanches a,b, Angélica O. Pontes a,
Paulo S. Graziano Magalhães a,b,⇑, João E. Ferreira c
a
Brazilian Bioethanol Science and Technology Laboratory (CTBE), National Center for Research in Energy and Materials (CNPEM), Caixa Postal 6192, CEP 13083-970 Campinas,
São Paulo, Brazil
b
School of Agriculture Engineering – FEAGRI, University of Campinas – UNICAMP, Campinas, SP, Brazil
c
Institute of Mathematics and Statistics – IME, University of São Paulo – USP, São Paulo, SP, Brazil

a r t i c l e i n f o a b s t r a c t

Article history: Sugarcane is an important crop for tropical and sub-tropical countries. Like other crops, sugarcane agri-
Received 26 October 2015 cultural research and practice is becoming increasingly data intensive, with several modeling frameworks
Received in revised form 23 August 2016 developed to simulate biophysical processes in farming systems, all dependent on databases for accurate
Accepted 4 October 2016
predictions of crop production. We developed a computational environment to support experiments in
sugarcane agriculture and this article describes data acquisition, formatting, storage, and analysis. The
potential to support creation of new agricultural knowledge is demonstrated through joint analysis of
Keywords:
three experiments in sugarcane precision agriculture. Analysis of these case studies emphasizes spatial
Precision agriculture
Sugarcane
and temporal variations in soil attributes, sugarcane quality, and sugarcane yield. The developed compu-
Database tational framework will aid data-driven advances in sugarcane agricultural research.
Workflow Ó 2016 Elsevier B.V. All rights reserved.

1. Introduction Precision Agriculture (PA), which comprises a set of tools to help


farmers understand and manage soils and crops inherent spatial
Sugarcane is an important crop mainly in tropical and sub- and temporal variability. PA relies on collection, analysis, process-
tropical countries. Brazil is the largest sugarcane producer, with ing, and synthesis of voluminous georeferenced data, which can be
9 Mha cultivated to produce 659 million Mg of sugarcane in the collected from a number of different technologies (Zamykal and
2015/2016 season, resulting in 34,600 Mg of sugar and 29 billion Everingham, 2009). Research and technology on PA have advanced
L of ethanol (CONAB, 2015). In addition to sugar and ethanol, Brazil considerably in the past 20 years (Bramley, 2009). Due to its
is today the country with the largest installed capacity of biomass- intense use of information, PA has grown and evolved to incorpo-
based electricity generation (IRENA, 2015). In 2015, the supply of rate the best of multidisciplinary science and technology (Zamykal
electricity from biomass had estimated growth of 7%, with a total and Everingham, 2009), requiring farmers to look at their business
generation over 22 TW h, where sugarcane accounts for 80%. from different perspectives (Srinivasan, 2006).
Several modeling frameworks such as AUSCANE, QCANE, Sugarcane production system, however, differs substantially
APSIM, MOSICAS and CANEGRO (Marin et al., 2011) are increas- from major staple crops, affecting development and adoption of
ingly being employed to simulate biophysical process in sugarcane agricultural technologies. Comparison between a major cereal
farming systems. They are all dependent on databases, exemplify- (e.g., wheat) and sugarcane highlights some key differences. Wheat
ing the many ways in which agriculture is moving towards inten- area worldwide is 215 Mha, primarily in temperate zones, com-
sive data acquisition and processing. In addition, agriculture pared to 26 Mha of sugarcane, primarily in tropical developing
worldwide is witnessing a growing adoption of the so-called countries, especially in Brazil. Furthermore, the harvested part of
cereal crops is the grain, with mean yields of 3.2 Mg ha1 for wheat,
compared to 71 Mg ha1 for harvesting the stalks of sugarcane in
⇑ Corresponding author at: Brazilian Bioethanol Science and Technology Labora- 2013 (FAO, 2015). Differences in area and location make sugarcane
tory (CTBE), National Center for Research in Energy and Materials (CNPEM), Caixa a small fraction of the global market for agricultural technology. In
Postal 6192, CEP 13083-970 Campinas, São Paulo, Brazil.
addition, the high tonnage of sugarcane requires dedicated
E-mail address: paulo.graziano@bioetanol.org.br (Paulo S. Graziano Magalhães).

http://dx.doi.org/10.1016/j.compag.2016.10.002
0168-1699/Ó 2016 Elsevier B.V. All rights reserved.
14 C. Driemeier et al. / Computers and Electronics in Agriculture 130 (2016) 13–19

technologies, such as tailored yield monitors (Magalhães and Cerri,


2007). Due to specificities of the sugarcane system, and despite
rapid adoption of auto-steer in tractors and harvesters (Bramley
and Trengove, 2013; Silva et al., 2011), PA is not yet adopted by
the sugarcane-based sugar-ethanol industry as it is for other agri-
cultural systems (Gebbers and Adamchuk, 2010). According to sur-
veys conducted in Brazil (Anselmi et al., 2014; Avanzi et al., 2014;
Silva et al., 2011) and Australia (Bramley and Trengove, 2013),
low PA adoption can be explained by four factors: relative advan-
tage (usefulness), compatibility, trialability and observability. For
sugarcane production, perceived usefulness is correlated with
increased crop yield, reduced costs, and improved management.
On the other hand, high costs of equipment, lack of qualified staff
and lack of information on PA technologies were pointed by sugar-
cane farmers as the main barriers (Silva et al., 2011).
In this context, efforts have been primarily dedicated to exper-
iments aiming at establishing the scientific grounds and demon- Fig. 1. Tasks for handling of raw data prior to insertion into the database.
strating the advantages of PA techniques applied to sugarcane
(Portz et al., 2011; Rodrigues et al., 2012). Because of these goals,
characterization of soil and plant attributes in experiments is much 2.2. Database
more comprehensive than the expected for large-scale PA practice.
Furthermore, testing data acquisition technologies and contextual- A relational database for sugarcane agricultural experiments
izing their outputs are important goals of the experimentation was created and named BDAgro – CTBE Database of Agricultural
stage. Considering the above, treating the diversity of measurable Experiments, as detailed in a Technical Memorandum (Pontes
attributes is a critical point in experimentation for sugarcane PA. et al., 2014). BDAgro was constructed having PostgreSQL as rela-
The data-driven character of PA has attracted the attention of tional database management system and pgAdmin as database
the research community from many different areas. For instance, administration and development platform.
there are studies on clustering algorithm to delineate management BDAgro conceptual model, i.e. entity-relationship model
zones (Tagarakis et al., 2012), data acquisition techniques with (Elmasri and Navathe, 2010), includes entities associated to man-
remote sensing (Mulla, 2013; Song et al., 2009), and software agement and responsibilities (e.g., records of projects and responsi-
architecture for data analysis and integration of sensor based PA ble persons). Nevertheless, more relevant for the analytical
monitoring (Chen et al., 2015). purposes of the computational environment, BDAgro represents
In this work, we present a computational environment created agricultural experiments through the following entities:
to support sugarcane agricultural research, including but not lim-
ited to research in PA. Data acquisition, formatting, verification,
 Experiment is defined by a certain land area during a certain
storage, and analysis are discussed. To demonstrate the applicabil-
period of time. The land area is most often an open agricultural
ity of the computational environment, data of soil chemistry, sug-
field, but may also be inside close environments such as
arcane quality, and sugarcane yield from three experiments are
greenhouses.
jointly analyzed and discussed.
 Event is one important fact within one experiment. Events may
be of three types: (i) intervention, associated with change in
experimental land area (e.g., harvest, soil fertilization); (ii) char-
2. Computational environment
acterization, associated with data acquisition without change in
land area (e.g., characterization of soil attributes); and (iii) plan-
2.1. Handling of raw data
ning, representing a record associated with neither physical
change in land area nor new data acquisition (e.g., nutrient
Sugarcane agricultural experiments may include several
prescription).
sources of raw data, including data acquired by different analytical
laboratories and by various types of sensors. The current version of  Static data is data generated by events. It is termed static
the computational environment is able to process data in matrix because each event is defined at a specific moment within
formats. Processing of images (e.g., from unmanned aerial vehicles one experiment. Static data has x and y spatial coordinates as
and satellite) is foreseen as a future upgrade for the system. attributes. Additional attributes depend on type of static data
The database has an expandable set of allowed matrix formats – (i.e., on type of measurements). Soil attributes, sugarcane qual-
essentially one matrix format for each type of measurement. To ity, and sugarcane yield are examples of types of static data.
assure that matrixes are properly recorded in the database, we rou-  Dynamic data is data acquired continuously during the course
tinely handle raw data following the tasks presented in Fig. 1. of one experiment. Meteorological information is one example
Using spreadsheets, raw data from sensors and laboratory files of dynamic data.
are converted into data matrixes consistent with the predefined
database formats. Such data matrixes are verified and then We will refer to these entities as the article follows. However,
inserted into the database. Importantly, data acquisition and for- the analysis of the case studies does not yet include any dynamic
matting are performed by agricultural field scientist, while verifi- data because of the complexity of agricultural analysis using fine
cation and insertion are performed by computational workers. temporal granularity.
Among other advantages, this division of tasks assures an indepen-
dent verification of data veracity. Verifications include matrix for- 2.3. Data analysis
mats, measurement units, and typical range of values acceptable
for a certain measured attribute. Once verified, data matrixes are We adopted the Work-Event-Data-flow (WED-flow) approach
inserted into the database using python-generated SQL scripts. (Ferreira et al., 2010) as the methodology for modeling analysis
C. Driemeier et al. / Computers and Electronics in Agriculture 130 (2016) 13–19 15

workflows. WED-flow addresses the integration of three main flow The third experiment was conducted in a 10-ha commercial
paradigms: workflow, event-flow and data-flow. These three flow sugarcane field (Fig. 2c) that belongs to GRUPO USJ, a sugar and
paradigms combine the concepts of workflow composition, trans- ethanol plant located in Araras, São Paulo State in the southeast
actions (i.e., activities), events, and data states. At an abstract level, region of Brazil (22° 230 3800 S, 47° 180 0400 W). Sugarcane variety
the WED-flow is modeled as a SAGA (Garcia-Molina and Salem, SP80-3280 was planted in 2007 and was mechanically green har-
1987), composed from SAGA steps, each of which is enclosed in a vested along the cropping seasons. Liming and fertilization were
transaction. The main advantage of WED-flow approach is the inte- performed according to usual recommendations for sugarcane crop
gration of three main flow paradigms supporting modeling of anal- planting (Raij et al., 1997) at a fixed rate. For the subsequent ratoon
ysis workflows. More concretely, in this paper, we assume that: the crops (2009–2011), no fertilizers were applied. A total of 117 sam-
flow of work is a set of analysis steps; the flow of analysis event is a ple points were established on a 30 m regular grid to sample the
set of constraints (that once satisfied triggers the following analy- sugarcane quality parameters and physical and chemical soil attri-
sis step); and flow of data is a set of data states generated by each butes. Additional 14 refinement points were also taken. Plant sam-
analysis step. An analytical module was created within BDAgro by pling occurred just before the respective harvests at the peak of the
constructing a set of tables containing the data states resulting sugarcane maturity. The three sites (Fig. 2a–c) had been under con-
from analysis workflows. The statistical computing integrated to tinuous sugarcane cultivation for more than 30 years.
the database is performed with the R programming language. Annually the areas were harvested using sugarcane harvesters
equipped with auto-guidance system with a RTK signal (Trimble,
Navigation Ltd, Sunnyvale, CA, USA) and a yield monitor (Simpro-
3. Case studies
cana, Enalta, São Carlos, Brazil). Annually, after harvest, soil sam-
ples were taken again at same grid to diagnosis some deficiency
Data presented in this paper has been collected from 2007 to
and recommend fertilizer application using VRT when applicable
2014 in different sugarcane fields. Sugarcane is a semi-perennial
(field 1 and 2). At each field, the experiment has been conducted
crop, which typically grows in cycles of four to six years, being har-
for the whole sugarcane cycle, i.e. from planting, following through
vested and fertilized annually.
the consecutive ratoons, during 4 cycles.

3.1. Three Experiments


3.2. Data acquisition techniques
The first and second field experiments were conducted from
2010 to 2014 in two adjacent 50 ha areas in a commercial sugar- Soil and crop data have been acquired with two types of tech-
cane field in Serra Azul – SP, Brazil (21° 160 4100 S and 47° 320 niques: (i) sampling at grid points, for physical and chemical soil
1000 W), which belongs to Pedra Mill. Before sugarcane planting, a attributes and (ii) scanning the area by soil (soil apparent electrical
survey of the area was carry out in November 2010 to establish conductivity) and crop ‘‘on-the-go” sensors (crop spectrum reflec-
the soil chemical and physical conditions and nutrient need for tance), which are mounted on vehicles equipped with differential
crop implementation. In the first field experiment (Fig. 2a), the global positioning system (DGPS). Data acquisition with on-the-
area was divided into a regular 50 m grid with 204 sample points go sensors has relatively lower costs and is promising for large-
located using a differential global positioning system (DGPS) scale PA practice. On the other hand, sampling at grid points is
(Ag114TM, Trimble, Navigation Ltd, Sunnyvale, CA, USA). Additional labor intensive, requiring manual collection of samples for offline
refinement points were taken when necessary (Fig. 2a). Soil sample laboratory analysis. Such sampling approach is likely of limited
was taken at two depths (0–0.20 and 0.20–0.40 m) at each grid applicability for large-scale PA. However, for the purpose of exper-
point and a wet-chemical analysis was done to determine soil imentation, sampling-based techniques are employed because
physical and chemical attributes (macro and micronutrients). In they allow measurement of much more diverse sets of soil and
the second experiment, the area was divided into a regular plant attributes, thus expanding the scope of the experiments.
150 m grid with 24 sample points (Fig. 2b) located using the same Sample collection at grid points is recorded in the database as
technology. Soil samples were taken at the same depths, and sent events of characterization, which generate static data. Measured
to a laboratory for wet-chemical analysis. In both areas, sugarcane attributes of soil chemistry include pH, contents of organic matter
was planted in April 2011. Annually the area was fertilized with K, (OM), and concentrations of macronutrients (P and K) and
P and N and soil samples were taken at the same spots. micronutrients (B, Mg, S, Ca, Mn, Fe, Cu, and Zn). Soil samples were

Fig. 2. Photographs from land areas of experiments 1–3 (A-C). Grid points (black dots), grid refinement points (red crosses, A and C), and the track from the sugarcane
harvester (yellow lines in the rectangular area zoomed at the inset of (A) are shown). (For interpretation of the references to colour in this figure legend, the reader is referred
to the web version of this article.)
16 C. Driemeier et al. / Computers and Electronics in Agriculture 130 (2016) 13–19

collected from two soil layers: 0–0.20 and 0.20–0.40 m deep in removed by a filtering step. Any entry deviating from the mean
experiments 1 and 2; 0–0.20 and 0.20–0.50 m deep in experiment by more than three standard deviations (for a given attribute)
3. Sugarcane quality parameters (Fiber, Brix, and Pol) were mea- was treated as an outlier.
sured from sugarcane samples collected immediately before har- Following linearization and filtering, the next steps aim at rep-
vest. Sugarcane yield was measured by an on-the-go yield resenting all measurements of one experiment as a matrix of
monitor that determines sugarcane yield during the harvesting entries U = {uij}, where i is the index for grid points and j for mea-
operation (Magalhães and Cerri, 2007). In the database, yield is a sured attributes. Sampling at grid points straightforwardly gener-
type of static data associated with a characterization event. Each ates data in this matrix format. Nevertheless, in case of proximal
event of yield characterization is simultaneous to the intervention measurements, such as those performed in grid refinement points
event of harvest (Section 2.2). In this article, we analyze sugarcane (Fig. 2), uij is calculated as the average of the proximal points. On
yield, whereas we omit data from other on-the-go sensors (of the other hand, each attribute from on-the-go sensor is a function
apparent electrical conductivity, crop vegetation indexes). uj(x,y), where (x,y) are the spatial coordinates of the sensor track on
the field. The value of uij is estimated by employing linear regres-
sion to fit a plane to uj(x,y) points within a circle of 50 m diameter
3.3. Analysis workflow centered at the grid point i.
With attributes estimated at grid points, Moran0 s I spatial auto-
We developed a data analysis workflow aiming at providing correlation, Ij, is calculated by considering the connection matrix M
synthetic views of (i) spatiotemporal variability in each measured whose terms equal one for neighbor grid points, and zero other-
attribute and (ii) correlations among soil attributes that are pre- wise. The matrix L is obtained by normalizing M to have each
sumably associated with soil pH. The model of the analysis work- row summing to unit. Then, Ij = zjT L zj, where zj is the vector of
flow is sketched in Fig. 3, representing flow of analysis the mean-centered normalized attribute estimated at grid points,
(represented by boxes), flow of events (represented by arrows),  j)/rj (Cliff and Ord, 1973).
zij = (uij  u
and flow of data states (represented by databases). Note that Measured attributes vary widely concerning their range of val-
results of analysis (Fig. 3) are correlations, spatial autocorrelations, ues as well as their data acquisition technologies. Therefore, preci-
and outputs from principal component analysis (PCA). In essence, sion in attribute measurement also vary widely. Typical attribute
all these results are based on correlations, which would be detri- precisions were estimated as ‘‘noise” levels sj. For attributes deter-
mentally affected by nonlinearities and outliers. mined by sampling at grid points, sj was estimated as the standard
In order to avoid such issues, linearization and filtering steps deviation of measurements in proximal points. For attributes
are applied beforehand in the workflow. Linearization takes the obtained from on-the-go sensors, sj was estimated from the stan-
logarithm of concentrations of soil components (OM, macronutri- dard error of the linear regression coefficient that estimates uij.
ents and micronutrients), keeping other attributes unchanged. Principal Component Analysis (PCA) was employed to reduce
The logarithm scale reduced the positive skewness from concen- the dimensionality of the attribute space. Prior to PCA, missing
tration distributions and is additionally justifiable due to ubiquity data in matrix {uij} was imputed by the Expectation-
of linear relations with log (concentration) found in physical- Maximization (EM) algorithm associated with a multivariate nor-
chemistry (Atkins and Paula, 2010). Outliers in data sets are mal model (Johnson and Wichern, 2007), stopping EM iterations
when imputed uij change way less than sj. This imputation
approach preserves the data covariance structure, thus being well
suited as data preparation method for PCA. PCA was performed
from correlation matrixes, which means that each attribute con-
tributes with normalized, unit variance.

4. Results and discussion

4.1. Spatial autocorrelations

The ‘‘null hypothesis” of PA is uniform management of the field


(Whelan and McBratney, 2000). A corollary is that, to be relevant
for PA, a measured attribute should present positive spatial auto-
correlation. That is, an attribute j having spatial autocorrelation
Ij  0 is consistent with random spatial distribution and, therefore,
cannot be used to justify any site-specific action on the field. On
the other hand, greater values of Ij might justify site-specific
interventions.
Considering the diversity of attributes characterized in the
three experiments, spatial autocorrelation Ij is distributed between
0.2 and 0.8 (Fig. 4). Negative Ij is observed mainly in experi-
ment 2 (Fig. 4B). Most likely negative Ij can be interpreted as
Ij  0 perturbed by statistical noise, which are greater in experi-
ment 2 due to fewer grid points (24) (Fig. 2). Noteworthy,
Webster and Oliver (2007) recommend that no fewer than 100
sampling points are necessary to estimate the variability of a cer-
tain field. Furthermore, it is important to note that, for the three
experiments, many attributes are close to Ij  0 so that such mea-
sured attributes should not be employed to support site-specific
Fig. 3. Data analysis workflow. interventions on the field.
C. Driemeier et al. / Computers and Electronics in Agriculture 130 (2016) 13–19 17

Fig. 4. Inter-year correlation against Moran0 s I spatial autocorrelation for attributes of soil chemistry (black; OM, P, K, B, Mg, S, Ca, Mn, Fe, Cu, and Zn), sugarcane quality (blue;
Fbr, Brix, Pol), and sugarcane yield (red; Yi). Results from experiments 1–3 are shown in plots A-C. Year of measurement is represented by the inclination of the labels, as
indicated in the legends. Distinct soil layers are represented by Bold (0–0.20 m) and Italic (either 0.20–0.40 m or 0.20–0.50 m) labels. Diagonal dashed lines (y = x) and
horizontal dotted line (zero inter-year correlation) are shown to guide the eyes. (For interpretation of the references to colour in this figure legend, the reader is referred to the
web version of this article.)

4.2. Spatiotemporal variability in soil attributes appreciable correlations and spatial autocorrelations are only occa-
sionally observed in one (experiment 3) out of three experiments
Statistical noise tends to reduce the magnitude of all types of suggests that significant spatiotemporal variability in sugarcane
correlations, including spatial autocorrelation and correlation quality may often remain undetected.
between sequential measurements of a given attribute. Fig. 4 plots
inter-year correlation against spatial autocorrelation Ij. The inter-
year correlation is an index for the temporal stability of the spatial
4.4. Spatiotemporal variability in sugarcane yield
variability of a given attribute. For attributes of soil chemistry
(black labels), a clear trend is observed in experiment 1 (Fig. 4A),
Yield is certainly a major aim in crop management. In other
with attributes clustered near the diagonal dashed line (y = x).
crops, yield spatial variations are commonly found to be rather
Indeed, inter-year correlation is positively correlated (r = 0.73)
stable across several harvests (Godwin et al., 2003). In sugarcane,
with spatial autocorrelation. This trend is consistent with inter-
however, we have been observing a lack of temporal stability in
year correlation and spatial autocorrelation being both substan-
yield spatial variability. In experiment 1, yield in 2014 has a posi-
tially reduced by statistical noise. The correlation between inter-
tive (0.2) correlation with yield in 2013. This level of correlation
year correlation and spatial autocorrelation is weaker but still pos-
locates yield close to the diagonal trend observed in Fig. 4A, which
itive in experiments 2 and 3 (r = 0.54, r = 0.32, respectively). This
would be consistent with temporal stability reduced mainly by sta-
result indicates that statistical noise is also a major factor reducing
tistical noise. On the other hand, yield in 2013 has a near zero cor-
inter-year correlation and spatial autocorrelations in experiments
relation with yield in 2012 (Fig. 4A). This yield data is quite below
2 and 3. In these two experiments, most attributes of soil chemical
the diagonal trend of Fig. 4A and the lack of inter-year correlation
attributes are above the diagonal dashed line (Fig. 4B and C), i.e., for
cannot be attributed to statistical noise. Even more extreme case is
most attributes inter-year correlation is greater than spatial auto-
observed in experiment 2, where correlation between 2012 and
correlation. This observation might result from inherently lower
2011 yields is negative (0.6).
spatial autocorrelations in experiments 2 and 3, due to the greater
A more detailed investigation indicated two major causes for
spacing between grid points in experiment 2, and perhaps due to
the zero or negative inter-year correlations in yield. One cause is
the more homogeneous, flat terrain of experiment 3.
occasional sensor failure. Readings from yield monitor are very
noisy (Fig. 5a), even after removal of outliers (Fig. 5b). The example
4.3. Spatiotemporal variability in sugarcane quality of Fig. 5b has mean yield of 93 Mg ha1 and standard deviation of
47 Mg ha1. Typical precision (sj) in yield is reduced to about 1–
In experiments 1 and 2, attributes of sugarcane quality have 4 Mg ha1 due to multiple readings used to estimate uij by regres-
spatial autocorrelation and inter-year correlation of approximately sion (Fig. 3). However, with such noisy signal, some sensor failures
zero (blue labels in Fig. 4A and B, respectively). This observation remain unfiltered, detrimentally affecting the precision of yield
indicates that sugarcane quality is consistent, at least approxi- estimates and reducing associated correlations. The second cause
mately, with random spatiotemporal variations. Experiment 3 of low inter-year correlations originates from real changes in yield,
shows a different behavior. In year 2011, Brix and Pol (quality attri- especially due to damages inflicted to the sugarcane crop. Harvest
butes associated with sugar content) have spatial autocorrelation operation may damage the roots of the sugarcane plants, occasion-
Ij  0.3 and correlation with previous year of 0.2 (Fig. 4C). These ally pulling roots out of the soil. The result of such damage is a
are significant correlations, demonstrating that within-field spatial localized yield reduction in the next harvests, thus reducing
variability and temporal stability of Brix and Pol are possible issues inter-year correlation. Such damages to roots may become visible
to be managed by PA techniques. Correlations of even higher mag- as gaps in the sugarcane field. Indeed, field observations by agrono-
nitudes are observed in sugarcane fiber contents (‘Fbr’ labels in mists have been suggesting that crop damaging during harvest is
Fig. 4C), reinforcing that spatiotemporal management of sugarcane the major cause for declining yield across the multiyear crop cycles
quality is possible, at least in principle. Nevertheless, the fact that (Zamykal and Everingham, 2009).
18 C. Driemeier et al. / Computers and Electronics in Agriculture 130 (2016) 13–19

Fig. 5. Reads from yield monitor before (a) and (b) after the filter step of the analysis workflow. Data from 2013 harvest of experiment 1.

Fig. 6. Loadings plot from principal component analysis of soil attributes presumably associated with soil pH (OM, Ca, Mg, and pH). Results from experiments 1–3 are shown
in plots A-C. Year of measurement is represented by the inclination of the labels, as indicated in the legends. Distinct soil layers are represented by Bold (0–0.20 m) and Italic
(either 0.20–0.40 m or 0.20–0.50 m) labels. Dotted lines are a pair of orthogonal axes rotated to have one axis aligned with the clusters of pH loadings (A and C).

4.5. PCA of attributes associated with soil pH clustering of loadings (Fig. 6B), which might be due to poorer
statistics arising from fewer grid points (Fig. 2).
The numerous soil and plant attributes exist in an abstract Experiments 1 and 3 also share the relative directions of loading
space of high dimensionality where visualization of the informa- clusters. The centroids of the pH clusters form an angle slightly
tion is impossible. Hence, reduction of dimensionality is often greater than 90° with the centroids of OM clusters
desirable and PCA is a statistical technique to do it. Fig. 6 presents (Fig. 6A and C). This relative direction indicates a tendency to have
the PC1& PC2 loadings from PCA applied to attributes judged to be slightly negative correlations between pH and OM, consistent with
potentially associated with soil pH. The attributes are pH itself, the decreasing pH due to higher levels of decomposing OM. Further-
main elements of lime (Mg and Ca, applied to soils to increase their more, the loading clusters of Ca and Mg are observed in-between
pH), and organic matter (OM, whose decomposition is thought to those of pH and OM (Fig. 6A and C). Proximity with loadings of
decrease soil pH). PCA was applied separately for each experiment. pH is consistent with Ca and Mg causing increases in soil pH. On
Analyzed attributes span sequential years as well as the two soil the other hand, the proximity between loadings of OM and of lime
layers that were characterized. Our interest in soil pH is due to elements (Mg and Ca) is consistent with OM acting as storage med-
the possibility of pH control using lime broadcasted on the soil sur- ium for Mg and Ca available in soils. These relative directions in
face with variable rate technology, which could reduce costs by loadings plots demonstrate the possibility of finding common
applying lime prescribed according to site-specific demands. behavior across multiple sugarcane agricultural experiments.
As a first observation, experiments 1 and 3 show loadings that
are clustered (Fig. 6A and C). In each experiment, there is one clus- 5. Conclusions
ter for pH, one for lime elements (Mg and Ca), and one for OM. Such
clustering indicates high correlation between soil top and bottom This paper described a computational environment to support
layers. The clustering also indicates that spatial variations of pH, research in sugarcane agriculture. Data acquisition, formatting,
Ca, Mg, and OM are mostly preserved along the years. This is and verification steps are performed prior to data insertion into a
instructive and perhaps surprising, considering that experiments dedicated database named BDAgro. Data analysis is integrated to
1 and 3 employed distinct strategies to control soil pH (Section 3.1). the database by recording data states generated through data anal-
Loadings of OM are comparatively more spread than loadings of ysis workflows. Such workflows can be tailored for the specific
pH, Ca, and Mg (Fig. 6A and C). This observation is consistent with aims of each study.
noisier OM measurements as well as with significant dynamics in The computational environment was employed to jointly ana-
spatial variability of soil OM. Experiment 2 does not present such lyze three experiments in sugarcane precision agriculture (PA).
C. Driemeier et al. / Computers and Electronics in Agriculture 130 (2016) 13–19 19

These experiments comprised characterization of soil attributes, Approach. In: Meersman, R., Dillon, T., Herrero, P. (Eds.), On the Move to
Meaningful Internet Systems: Otm 2010, PT I, Hersonissos, Greece, pp. 150–167.
sugarcane quality, and sugarcane yield. Our analysis showed low
Garcia-Molina, H., Salem, K., 1987. Sagas. In: ACM SIGMOD International Conference
spatial autocorrelations and low inter-year correlations for most on Management of Data. San Francisco, pp. 249–259.
of the sugarcane quality and yield attributes. This finding revealed Gebbers, R., Adamchuk, V.I., 2010. Precision agriculture and food security. Science
important limitations in measurements of sugarcane quality, while 327, 828–831. http://dx.doi.org/10.1126/science.1183899.
Godwin, R.J., Wood, G.A.A., Taylor, J.C., Knight, S.M., Welsh, J.P., 2003. Precision
yield seems to be primarily impaired by crop damages inflicted by farming of cereal crops : a review of a six year experiment to develop
the mechanical harvest processes, such as ratoon damage and soil management guidelines. Biosyst. Eng. 84, 375–391. http://dx.doi.org/10.1016/
compaction. Furthermore, our multivariate analysis of features S1537-5110(03)00031-X.
IRENA, 2015. Renewable energy capacity statistics 2015. Irena 44. http://dx.doi.org/
associated with soil pH revealed common behavior in two experi- 10.1016/j.renene.2014.09.059.
ments, evidencing the possibility of common underlying principles Johnson, R.A., Wichern, D.W., 2007. Statistical Analysis. Pearson - Prentice Hall,
to be identified across multiple sugarcane agricultural environ- Upper Saddle River.
Magalhães, P.S.G., Cerri, D.G.P., 2007. Yield monitoring of sugar cane. Biosyst. Eng.
ments. Overall, these case studies demonstrate the usefulness of 96, 1–6. http://dx.doi.org/10.1016/j.biosystemseng.2006.10.002.
the computational environment for supporting research in sugar- Marin, F.R., Jones, J.W., Royce, F., Suguitani, C., Donzeli, J.L., Filho, W.J.P., Nassif, D.S.
cane agriculture, which will positively impact the sustainability P., 2011. Parameterization and evaluation of predictions of DSSAT/CANEGRO for
Brazilian sugarcane. Agron. J. 103, 304. http://dx.doi.org/10.2134/
and competitiveness of the sugar-ethanol industry. agronj2010.0302.
Mulla, D.J., 2013. Twenty five years of remote sensing in precision agriculture: key
Acknowledgements advances and remaining knowledge gaps. Biosyst. Eng. 114, 358–371. http://dx.
doi.org/10.1016/j.biosystemseng.2012.08.009.
Pontes, A.O., Ling, L.Y., Sanches, G.M., Magalhães, P.S.G., Ferreira, J.E., Driemeier, C.E.,
Project funded by CNPq and Fundação de Amparo à Pesquisa do 2014. BDAgro – CTBE Database of Agricultural Experiments, Technical
Estado de São Paulo (FAPESP grants 2011/01817-9 and Memorandum, MeT 11. Campinas. available at <https://forms.cnpem.
2015/01587-0). br/formularios/prodbiblio/DB/2338/MeT%20112014.pdf>.
Portz, G., Molin, J.P., Jasper, J., 2011. Active crop sensor to detect variability of
nitrogen supply and biomass on sugarcane fields. Precis. Agric. 13, 33–44.
References http://dx.doi.org/10.1007/s11119-011-9243-4.
van Raij, B., Cantarella, H., Quaggio, J.A., Furlani, A.M.C., 1997. Recomendações de
Anselmi, A.A., Bredemeier, C., Federizzi, L.C., Molin, J.P., 2014. Factors related to adubação e calagem para o Estado de São Paulo. Instituto Agronômico,
adoption of precision agriculture technologies in southern Brazil. In: ISPA (Ed.), Fundação IAC, Campinas. 285p.
Proc. of the 12th International Conference on Precision Agriculture. ISPA, Rodrigues Jr., F.A., Magalhães, P.S.G., Franco, H.C.J., 2012. Soil attributes and leaf
Sacramento, California, p. 11. nitrogen estimating sugar cane quality parameters: brix, pol and fibre. Precis.
Atkins, P., Paula, J. de, 2010. Physical Chemistry. Oxford University Press, Oxford. Agric. 14, 270–289. http://dx.doi.org/10.1007/s11119-012-9294-1.
Avanzi, J.C., Borghi, E., Bortolon, L., Bortolon, E.S.O., Luchiari Jr., A., Inamasu, R.Y., Silva, C.B., de Moraes, M.A.F.D., Molin, J.P., 2011. Adoption and use of precision
Bernardi, A.C.C., 2014. Adoption Level of Precision Agriculture for Brazilian agriculture technologies in the sugarcane industry of São Paulo state, Brazil.
Farmers - 2011/2012 crop year. In: ISPA (Ed.), Proc. of the 12th International Precis. Agric. 12, 67–81. http://dx.doi.org/10.1007/s11119-009-9155-8.
Conference on Precision Agriculture. ISPA, Sacramento, California, pp. 1–2. Song, X., Wang, J., Huang, W., Liu, L., Yan, G., Pu, R., 2009. The delineation of
Bramley, R., Trengove, S.A.M., 2013. Precision agriculture in Australia: present agricultural management zones with high resolution remotely sensed data.
status and recent development. Eng. Agric. 33, 575–588. Precis. Agric. 10, 471–487. http://dx.doi.org/10.1007/s11119-009-9108-2.
Bramley, R.G.V., 2009. Lessons from nearly 20 years of Precision Agriculture Srinivasan, A., 2006. Precision Agriculture: An overview. Handbook of Precision
research, development, and adoption as a guide to its appropriate application. Agriculture, pp. 3–18.
Crop Pasture Sci. 60, 197–217. http://dx.doi.org/10.1071/CP08304. Tagarakis, A., Liakos, V., Fountas, S., Koundouras, S., Gemtos, T.A., 2012.
Chen, N., Zhang, X., Wang, C., 2015. Integrated open geospatial web service enabled Management zones delineation using fuzzy clustering techniques in
cyber-physical information infrastructure for precision agriculture monitoring. grapevines. Precis. Agric. 14, 18–39. http://dx.doi.org/10.1007/s11119-012-
Comput. Electron. Agric. 111, 78–91. 9275-4.
Cliff, A.D., Ord, J.K., 1973. Spatial Autocorrelation. Pion, London. Webster, R., Oliver, M.A., 2007. Geostatistics for Environmental Sciences. John Wiley
CONAB, Ministério da Agricultura, Abastecimento, P., 2015. SAFRA 2015/16 - & Sonns, Ltd. http://dx.doi.org/10.1002/9780470517277.
Terceiro levantamento. Whelan, B.M., McBratney, A.B., 2000. The ‘‘null hypothesis” of precision agriculture
Elmasri, R., Navathe, S.B., 2010. Fundamental of Database Systems. Addison- management. Precis. Agric. 2, 265–279.
Wesley. Zamykal, D., Everingham, Y.L., 2009. Sugarcane and Precision Agriculture:
FAO, Food and Agricultere Organization of the United Nations, 2015. Statistics Quantifying Variability Is Only Half the Story – A Review. In: Lichtfouse, E.
Division – FAOSTAT) [www Document]. URL http://faostat3.fao.org (accessed (Ed.), Climate Change, Intercropping, Pest Control and Beneficial
10.1.15). Microorganisms. Springer, Netherlands, Dordrecht, pp. 189–218. http://dx.doi.
Ferreira, J., Takai, O., Malkoviski, S., Pu, C., 2010. Reducing Exception Handling org/10.1007/978-90-481-2716-0.
Complexity in Business Process Modeling and Implementation: The WED-Flow

You might also like