You are on page 1of 4

Приложение 2

к пис. № 1015/04-8 от 28.07.2014г.

The Poseidon Project: An Introduction


The use of Cognitive Computing for real-world societal challenges

Robert-Jan Sipsa,b,c Wing-Yan Manb, Bram Haversb Gert-Jan Keizerc


VU University
a b
Center for Advanced Studies Dutch Courage Foundation
c

Department of Computer Science IBM Netherlands Amsterdam, The Netherlands


Amsterdam, The Netherlands Amsterdam, The Netherlands treasurer@dutchcourage.org
hjsips@vu.nl {wingyanman,
bram_havers}@nl.ibm.com

Abstract— This paper presents an introduction to the Poseidon (1) To accommodate the increasing world population,
Project. An open collaborative research project aiming to reduce combined with the growing preference for animal
the Water Footprint in (crop) agriculture, by opening up access based diets, global food production needs to increase
to low-cost in situ sensing and cognitive computing to farmers
substantially, and
globally. Currently the Poseidon Project, is in the inception
phase. We give an overview of the motivation, the underlying (2) Global soil and water resources are limited, vulnerable
scientific principles, and a future outlook on the project. and coupled with the very processes that govern
climate change.
Index Terms—Water Footprint, Open Computing, Cognitive
Computing, Machine Learning Currently, many regions in the world are already seeing the
effects of the continuously increasing demand. A prominent
I. INTRODUCTION example is Central Asia, as observed by Sorg et al. [6]
Climate Change is widely agreed to be already a reality [1].
The extreme weather in recent years [2][3], provides visible “Water shortages in summer will place the entire region’s
proof of the changes in our global climate and the need to agricultural system under pressure, thus fuelling tensions
mitigate the consequences. Countries in the developed world, that have existed since the collapse of the Soviet Union in
such as the Netherlands and the UK are preparing themselves the early 1990s. The high water demand for irrigation has
for increased risk of flooding from ocean and rivers [4], by already transformed downstream sections of powerful
rapidly increasing the available budgets for Flood Control. rivers such as the Syr Darya, Amu Darya and Ili into small
rivulets, thus exacerbating the drying-out of the Aral Sea
In the developing world however, most severe impact is
expected in food security and access to clean drinking water and Lake Balkash.”
[1]. In many countries in Africa, Asia and Latin America,
where agriculture is the major source of income –e.g. 60-70% Research and policy have traditionally observed shortages
of the population in Africa relies on agriculture for their in water as national or regional problems. Hoekstra and
income- the impact of Climate Change is expected to be Mekonnen [7] observe that governments have traditionally
potentially disastrous [5], strengthened by the fact that the low matched national water demand to national water supply,
access to technology in the developing world makes the studied at local, national or river-basin level, thereby ignoring
agricultural systems less resistant to the effects of Climate the global dimension of water demand patterns.
Change [1]. However, given the active global trade in water-intensive
Even more stress is expected on global agriculture, through commodities, such as cotton or bovine meat, they propose to
rising population and increasing welfare in many parts of the measure the Water Footprint of a nation, rather than its
world, changing both the amount of food necessary, as well as national water use. Figure 1 depicts the Virtual Water balance
the types of food demanded. In a recent review paper, Lal [5] of nations, showing in red the countries which are high
concludes: importers of fresh water and in green the countries which
export fresh water.
By using this definition, Chapagain et al. [8] conclude that
EU25 consumers are 20% responsible for the decline of the
Aral Sea in Central Asia, the most famous example of the will discuss how Cognitive Computing and Open Source hardware are
effects of water extraction for irrigation. expected to contribute.
All in all, mitigation of the global water shortage should
aim at 2 core elements: III. OPEN COMPUTING
Significant progress has been made in ubiquitous
computing. The rise of Open Source hardware, such as the
Raspberry Pi and Arduino, dramatically reduced the cost of
hardware deployment in, most prominently research and
education.
For instance, Fisher and Gould [16] report the use of Open
Source hardware in the context of Precision Agriculture
Research. In a comparison with commercial-grade hardware,
they conclude that Open Source hardware can be a reasonable-
Figure 1. Virtual water balance per country and direction of gross virtual water flows quality, low-cost alternative for commercial solutions, which
related to trade in agricultural and industrial products over the period 1996–2005. Only often create a large cost boundary for in field research projects;
the biggest gross flows (>15 Gm3∕y) are shown. [6] the reported Open Source soil moisture probe promises great
cost-reduction for Precision Irrigation; the reported hardware
(1) Increasing the resistance of third-world agriculture solution costs around $100,-, where commercial, closed,
towards the effects of climate change and devices cost at least the tenfold.
(2) Increasing awareness about the global dimension of However, they conclude that significant effort has to be
water shortage. spent in the calibration of sensors and that the use of these open
solutions requires significant technical knowledge from the
In this paper, we describe the Poseidon Project, aimed at users; moreover, the quality of low-cost sensors may lag
using recent technological advances to tackle the first of these. behind compared to commercial devices.

II. INCREASING RESISTANCE: PRECISION IRRIGATION IV. COGNITIVE COMPUTING


Precision irrigation is defined as “site-specific irrigation Cognitive computing promises to have a disruptive effect
management that relies on variable application of water”. It is on computer science [9]. At its core are problems that humans
emerging as a potential solution to increase the productivity find simple to solve, and machines find difficult. Applications
and reduce the environmental impact of irrigated agriculture include question answering, medical diagnosis, natural
[5]. Studies have demonstrated that using precision irrigation, language processing, etc. Recent projects, such as IBM’s
water-use by crop-agriculture can be reduced by up to 20% in highly publicized Watson question answering computer, have
Australia [12], helping tremendously in lowering agricultural highlighted that new approaches and massive computing can
footprint. combine to make significant progress on these problems.
The situation is even more striking in the arid regions of Example domains include like Astronomy [9], Medicine [10]
Central Asia, where excessive, non-beneficial water use as and Flood Control [4].
deep percolation and an inadequate use of the available soil These machine learning based approaches bring a radical
water and groundwater contribution is commonly practiced. change: rather than spending significant effort on programming
Simulations [14] have demonstrated that water usage could be a solution, effort is spent devising evaluation metrics and
easily reduced by 25%, through implementation of mild deficit gathering annotated data – ground truth data – for training and
irrigation. It may be expected that more advanced models and evaluating a system.
realtime, in situ monitoring could bring even more reduction in Similar approaches have been shown successful on (low-
water usage [15]. cost) sensors, as demonstrated in the work done by Mititelu et
However, despite the promise of lowering water usage and al.[19] and Overeem et al.[18] These former of these studies
the obvious need for water-usage reduction in many parts of demonstrates that, using Machine Learning, we may detect
the world, precision irrigation has not found global uptake as of normal and abnormal behavior in a levee, without making
today. For instance, Roth et al. observe that in the Australian assumptions of the underlying geophysical structure, thereby
Cotton Industry, one of the most advanced agricultural reducing the reliance on mathematical modelling. The latter is
industries in terms of irrigation-scheduling tools only ~40% of an example of Social Sensing, systems in which users are
production is aided by Soil-Moisture probes [13]. encouraged to share data and/or Social Media sources are used
Vellidis et al. [15] observe 3 core reasons for a low uptake of to learn about the physical world. In the work by Overeem et
precision agriculture, current systems are either (1) too expensive, (2) al., the embedded sensors in Smart Phones can be correlated
too unreliable or (3) too complicated for uptake in a farm with the ambient temperature and, as such, enable highly
environment. granular climate measurements in an urban environment; a
We believe that recent developments in Computer Science can highly relevant topic in Urban Water Management, where
bring solutions to all of these blockades, enabling a leap-frog in
adaption of precision agriculture. In the remainder of this paper, we
current systems are to coarse in time and space to effectively Within the data collection layer, data from the remote
model the microclimate within a city. devices are collected and abstracted (to compensate for
Social Sensing is gaining uptake and application, yet differences in measurement unit, granularity, etc.). Moreover,
various authors have noted that correct analysis of Big and the data of sensors on close geographical and temporal distance
human-generated data is often very difficult and error-prone. are collected in Measurement vectors (analogous to the
Grimberg et al. [20] demonstrate this, by analyzing Twitter CrowdTruth Annotation Vectors), for details on CrowdTruth,
and Foursquare data generated around Hurricane Sandy (2012). we refer to [17] and [22]. In short, using this paradigm, we
They observer that interesting patterns can be derived, like the consider the measurements of groups of low-cost sensors as
the stop and start of nightlife before and after the storm and the opinions on the actual value, rather than individual
large increase in grocery shopping before Sandy. However, measurements.
using these same data, they also demonstrate that one could In contrast to, e.g. the work by Mititelu et al. [19] where
easily draw wrong conclusions, for example that Manhattan unsupervised machine learning (Kohonen SOM) is used to
(the area with most active Twitter users) was the pinnacle of detect outliers, a common practice in anomaly detection; this
the storm. approach would allow for a evaluation of individual sensors
Boyd et al. [21] issue an even wider claim against the idea through the CrowdTruth metrics, as reported in [22], [23] and
that Big Data would solve issues of bias and descriptiveness, [24]. The crowdsourcing concepts of worker and micro-task
often assumed in applications of Big Data. They suggest that are transferred in our approach to sensor and temporal unit, e.g.
the analysis of Human Generated Big Data would require a soil-moisture sensor – worker - completes one measurement
interaction between Social Sciences and Computer Science, to –microtask- per hour. Similarly, the CrowdTruth-metrics,
better deal with the underlying social phenomena found in identified to detect, e.g. spam, can be transferred to sensory
human generated data. data. Worker Quality becomes Sensor Quality (e.g. identifying
Recent work by Aroyo and Welty, CrowdTruth [17], aims failing sensors). For a full overview of metrics, we refer to
to tackle these issues, by harnessing the span of opinion (or [22]. Within the metrics layer, these CrowdTruth metrics are
disagreement on the semantics of text and images, by means of applied to the vector representations of our sensory data.
Crowd-Sourcing. It is based on two main observations; (1) On top of these metrics (which result in a normalized
often, the interpretation of data is a matter of semantic numerical value) and the reported measurement vectors,
interpretation, multiple opinions may hold and may correct, classifiers are trained, to identify 3 classes: (1) need for
e.g. on the meaning of text and terminology and (2) humans irrigation, (2) no need, (3) over-irrigated. In the future the input
and computers can work together successfully, on tasks which dimensions for these classifiers are expected to be extended
are difficult for computers, but easy for humans and vice-versa. with, e.g. weather reports (one may choose not to irrigate, or to
An example of the former is the understanding of language, irrigate extra) based on weather data.
which is difficult for computers and easy for humans. The Finally, within the presentation layer, results are reported
latter can be exemplified by, e.g., large calculations; easy for to the farmers by means of a mobile push message (through
machines and difficult for humans. Currently,the Crowd- app, or sms).
Watson system applies CrowdTruth to generate training data
for IBMs Watson System and is focused on annotation of Text,
Images and Video [22].
V. THE POSEIDON PROJECT
The Poseidon Project aims to combine Open Source
hardware, CrowdTruth and Machine Learning to increase the
reliability of low-cost sensor measurements in the context of
precision agriculture At the heart of Poseidon is an Open
Source cognitive computing platform, based on IBM Bluemix 1.
Figure 2 depicts the core components of the platform,
which we will discuss in more detail below.
Within the in situ sensing layer. are the physical Open Figure 2. High level architecture for the Poseidon Platform.
Source measurement devices deployed in an agricultural field.
These devices consist of Open Source computing platforms
(like arduino and raspberry pi) and sensors, which may span a VI. DISCUSSION
wide variety of configurations and quality. The Open Devices
Within this paper we have presented a framework for
will connect to the data collection layer, using the open
precision irrigation based on low-cost sensors. This framework,
MQTT 2 protocol.
exploits advances in Cognitive Computing and Open Source
hardware, to tackle the current inhibitors for large-scale
1
http://www.bluemix.net adoption of precision irrigation.
2
http://www.mqtt.net In the coming decades, rising population and consumption
are expected to place additional stress on the available natural
resources of the world. The main premise behind the Poseidon Cancer using Electonic Medical Records. Brain Informatics and
Project is that we can no longer afford to perceive these Health. Lecture Notes in Computer Science. 8609. pp. 132-141.
challenges within geographical boundaries (i.e. as a regional [12] Hedley, C., Yule, I. and Bradbury, S. (2010). Analysis of
problem), or within the boundaries of a scientific discipline precision irrigation for variable soils at five pastoral and arable
(i.e., hydrology problem). Solutions may be found by applying production sites in New Zwaland. In 19th World Soil Congress.
advances of some fields on the issues of another field. One pp. 1-6.
example thereof is to apply CrowdTruth and the related metrics [13] Roth, G., Harris, G., Gillies, M., Montgomery, J. and
in the context of low-cost sensors, yet, we may expect that Wigginton, D. Water-use effciciency and productivity trends in
other approaches, such as the use of Crowd-Sourcing and Australian irrigated cotton: a review. Crop & Pasture Science,
Niche-Sourcing could prove to be successful in Precision 2013, 64, 1033–1048.
Agriculture. We may expect to see applications in which [14] Pereira, L. S., Paredes, P., Сholpankulov, E. D., Inchenkova, O.
farmers can share and vote on pictures of crops to help their P., Teodoro, P. R., & Horst, M. G. (2009). Irrigation scheduling
peers to identify health and quality issues (niche sourcing) or to strategies for cotton to cope with water scarcity in the Fergana
Valley, Central Asia. Agricultural water management, 96(5),
have large crowds identify anomalies in crops (crowd
723-735.
sourcing). These approaches have proven to be successful in,
[15] Vellidis, G., V.13 Garrick, S. Pocknee, C. Perry, C. Kvien, M.
e.g., astrophysics [25] and could potentially help to preserve
Tucker. (2007). How wireless will change agriculture. In:
harvests of the future.
Stafford, J.V. (Ed.), Precision Agriculture ‘07 – Proceedings of
ACKNOWLEDGMENTS the Sixth European Conference on Precision Agriculture
(6ECPA), Skiathos, Greece, pp. 57-67.
We thank the Dutch Courage foundation for enabling this
[16] Fisher, D., Gould, P. (2012). Open-Source Hardware is a low-
project, and the volunteers at IBM and Delft University of cost alternative for scientific instrumentation and research.
Technology, who generously offer their spare time in the Modern Instrumentation. 2012, 1, 8-20.
development of this platform. [17] Aroyo, L., Welty, C. (2014). Truth is a Lie: 7 Myths about
Human Annotation. AI Magazine. 2014 (in press).
REFERENCES
[18] Overeem, A., R Robinson, J. C., Leijnse, H., Steeneveld, G. J., P
[1] Kurukulasuriya, Pradeep; Rosenthal, Shane. (2013). Climate Horn, B. K., & Uijlenhoet, R. (2013). Crowdsourcing urban air
Change and Agriculture : A Review of Impacts and Adaptations. temperatures from smartphone battery
World Bank, Washington, DC. temperatures. Geophysical Research Letters, 40(15), 4081-4085.
[2] Lal, R.; Delgado, J.A.; Gulliford, J.; Nielsen, D.; Rice, C.W.; [19] Mititelu, G., Sips, R., Havers, B., Szlavik, Z., (2012). Towards
van Pelt, R.S. Adapting agriculture to drought and extreme an Universial Change Detection Framework in Leveers.
events. J. Soil Water Conserv. 2012, 67, 162–166. Proceedings of the 22nd BeNeLux Conference on Artficial
[3] Lyall, S. Heat, Flood or Icy Cold, Extreme Weather Rages Intelligence., Maastricht, The Netherlands.
Worldwide; The New York Times: New York, NY, USA, 2013. [20] Grinberg, N., Naaman, M., Shaw, B., & Lotan, G. (2013, July).
[4] Sips, R., van der Vlis, A., Nagel, R. And Havers, B. (2013). A Extracting Diurnal Patterns of Real World Activity from Social
Case for Evidence Based Levee Management using Sensor Media. In ICWSM
Technology. IBM Journal of Research and Development. 57(5) [21] Boyd, D., & Crawford, K. (2012). Critical questions for big
pp. 5:1-5:12. data: Provocations for a cultural, technological, and scholarly
[5] Lal, R. (2014). Climate Strategic Soil Management. Challenges. phenomenon. Information, Communication & Society, 15(5),
2014, 5, 43-74. 662-679
[6] Sorg, A., Bolch, T., Stoffel, M., Solomina, O., & Beniston, M. [22] Inel, O., Khamkham, K., Cristea, T., Dumitrache, A., Rutjes, A.,
(2012). Climate change impacts on glaciers and runoff in Tien van der Ploeg, J., Romaszko, L., Aroyo., L., Sips, R. (2014).
Shan (Central Asia). Nature Climate Change, 2(10), 725-731. CrowdTruth: Machine-Human Computation Framework for
Harnessing Disagreement in Gathering Annotated Data.
[7] Hoekstra, A., Mekonnen, M. (2012). The water footprint of
International Semantic Web Conference 2014. (in press).
humanity. Proceedings of the National Academy of Sciences,
109(3). 3232-3237. [23] Aroyo, L., Welty, C.: Measuring crowd truth for medical
[8] Chapagain, A., Hoekstra, A., Savenije, H. and Gautam, R. relation extraction. In:AAAI2013 Fall Symp. on Semantics for
(2006). The water footprint of cotton consumption: an impact of Big Data. (2013)
the worldwide consumption of cotton products on the water [24] Soberón, G., Aroyo, L., Welty, C., Inel, O., Lin, H., Overmeen,
resources of cotton producing countries. Ecological economics. M.: Measuringcrowd truth: Disagreement metrics combined
60(1). 186-203. with worker behavior filters. In: Proc. of CrowdSem2013
[9] Kelly III, J., Hamm, S. (2013). Smart Machines: IBMs Watson Workshop), ISWC2013. (2013)
and the Era of Cognitice Computing. Colombia University [25] Lintott, C. J., Schawinski, K., Slosar, A., Land, K., Bamford, S.,
Press. Thomas, D and Vandenberg, J. (2008). Galaxy Zoo:
[10] Engbersen, A., Brink, A., and Sips, R. (2013). SKA: The morphologies derived from visual inspection of galaxies from
Ultimate Big Data Challenge. World Summit on Big Data and the Sloan Digital Sky Survey. Monthly Notices of the Royal
Organizational Design. Sorbonne, Paris, 2013. Astronomical Society, 389(3), 1179-1189.
[11] Hoogendoorn, M., Moons, L., Numans, M. and Sips, R., (2014).
Utilizing Data Mining for Predictive Modeling of Colorectal

You might also like