Professional Documents
Culture Documents
#vmworldapps
Disclaimer
Technical feasibility and market demand will affect final delivery. Pricing and packaging for any new technologies or features
discussed or presented have not been determined.
Whats Common?
Whats Common?
Big Data allows you to find opportunities you didnt know you had
Fast Data allows you to respond to opportunities before they are gone
Big Data
Petabytes vs. Gigabytes Democratize BI
Fast Data
Low latency expectations Horizontal scale
Big Data
Petabytes vs. Gigabytes Democratize BI
Fast Data
Low latency expectations Horizontal scale
Big Data
Petabytes vs. Gigabytes Democratize BI
Flexible Data
Multi-structured data Developer productivity
Fast Data
Low latency expectations Horizontal scale
Big Data
Petabytes vs. Gigabytes Democratize BI
Flexible Data
Multi-structured data Developer productivity
Cloud Delivery
Virtualized Offered -as-a-Service
10
Elastic
12
Polling UI CEP/BAM
(vendor proprietary)
Transform
OLTP Query/Update
Analytic Queries Other Data Online DB Delay Transform Batch Data Warehouse
(structured)
13
ACQUIRE
TRANSFORM
ANALYZE
Batch
SQL
SQL DB MPP DB
HDFS, MapReduce
Visualization
Analytics
14
Derived Insight
HDFS, MapReduce
Online Apps
Online DB
15
HDFS, MapReduce
In-Memory Data Grids Buffer data, process events, In-memory Map-reduce (VMWare GemFire, SQLFire, Oracle Coherence, etc.)
16
Stream Processing Derive insight with continuous event processing (Apache S4, STORM, Esper, StreamBase, GemFire)
Cannot lose events else derived value is questionable Post processing Raw, derived events (batch analytics)
17
STAGE to SCALE
Staged Events Driven Architecture
18
19
20
Works with rigid or flexible Schema (JSON, Objects, SQL) Cache frequently used DB data for transform, massaging Partitioned listeners for filtering, event transform, etc.
Handle thousands of concurrent connections Replicated Region Synchronous replication for slow changing data
21
21
Continuously Available
22
22
23
Distributed Processing
When data changes, subscribers are pushed Async events reliably -All related data is accessible at memory speeds
24
Distributed framework for unbounded streams Custom App processing code that filters, routes, joins
multiple streams
Main proposition is horizontally, elastically scalable processors Simple config model to create processing pipelines
25
26
27
27
28
29
30
Store all raw, derived data in Hadoop Async, parallel write behind from data grid into HDFS
Each partition writes batches in parallel for max throughput
31
Use Spring Integration to orchestrate the pipeline Patterns: Pub-sub, splits, routers, Xfrm, etc.
32
33
Multi-Site Capability
Active Everywhere
Data
34
Multi-Site Capability
Active Everywhere
35
Distribute
GemFire can keep clusters that are distributed around the world synchronized in real-time and can operate reliably in Disconnected, Intermittent and Low-Bandwidth network environments.
36
37
...
...
...
...
External Sources
Fast Ingest --XTP-SQLFire
...
Transaction Data
...
38
THANK YOU
39
APP-CAP1250
#vmworldapps