Professional Documents
Culture Documents
IBM BigInsights:
Bringing you big value from Big Data
IBM’s approach
Portfolio overview
BigInsights
• Open source core platform with Apache Hadoop
• IBM technologies for enhanced analytics
• How BigInsights fits within a broader IT infrastructure
300,000 tweets
> 1 PB per day
1 in 3 decisions based on information they
don’t trust, or don’t have
per minute
gas turbines
200 million emails
per minute 220,000 photos
per minute 1 in 2 Business leaders say they don’t
have access to the information they
need to do their jobs
1 ZB = 1 billion TB
IT
Business Users
Delivers a platform
Determine what to enable creative
question to ask discovery
IT Business
Structures the Explores what
data to answer questions could be
that question asked
Monthly sales reports Brand sentiment
Profitability analysis Product strategy
Customer surveys Maximum asset utilization
1 “Analytics Pays Back $13.01 for Every Dollar Spent” Nucleus Research, September 2014
2 “Analytics: The speed advantage” IBM Institute for Business Value, 2014
Multi-channel customer
sentiment and experience a
analysis
Detect life-threatening
conditions at hospitals in
time to intervene
Enterprise
Time Series Warehouse Predictive Case Management
and Mart What Could Happen?
Ingestion Zone
and
Operational
Geo Spatial Information
Descriptive Analytic Applications
Landing and What Has Happened?
Archive Zone
Video & Analytic
Image Appliances Cloud Services
Exploration and
Discovery
Relational What Do You Have?
Information Governance, Security and Business Continuity ISV Solutions
Social Network
Engage:
Implementing infrastructure and 22%-27% 25% 0%
change
running pilot activities
2012 to 2014 2015
Explore:
Exploring internal use cases and 43%-47% 53% 125%
increase
developing a strategy 2012 to 2014 2015
Educate:
Learning about 24%-26% 10% 250%
decrease
big data capabilities 2012 to 2014 2015
• Integrate and
manage the full IBM ANALYTICS PLATFORM
Built on Spark. Hybrid. Trusted.
variety, velocity and
volume of Big Data
Discovery Predictive Prescriptive Content
• Apply advanced & Exploration Analytics Analytics Analytics
analytics
• Visualize all available Business Intelligence
data for ad-hoc
analysis
Data Content Hadoop & Data
• Support workload Mgmt Mgmt NoSQL Warehouse
optimization and
scheduling Information Integration & Governance
• Provide for security
and governance Spark Analytics Operating System
On premises Machine Learning On cloud
• Integrate with
enterprise software Data at rest & In-motion. Inside & outside the firewall. Structured & unstructured.
lHelium SW
Apache Hadoop
Distributed file system, popular API (MapReduce)
for clustered computing
Originally designed for batch processing of massive
data volumes, varied data formats
Apache Spark
General purpose, high-speed data processing
engine for clustered computing
In-memory processing, popular built-in libraries
(e.g., machine learning)
No built-in storage. Attaches to other data stores
(e.g., Hadoop Distributed File System)
ODPi Members include: Ampool, Altiscale, ArenaData, AsiaInfo, Capgemini, DataTorrent, EMC, GE,
Hortonworks, IBM, Infosys, NEC, Pivotal, PLDT, SAS, Squid Solutions, SyncSort, Telstra, Toshiba, UNIFi,
VMware, WANdisco, Xiilab, zData and Zettaset.
27
27 © 2016 IBM Corporation
Big SQL query federation = virtualized data access
Transparent
Appears to be one source
Programmers don’t need to know how /
where data is stored
Heterogeneous
Accesses data from diverse sources
Autonomous
Non-disruptive to data sources, existing
applications, systems.
High Performance
Optimization of distributed queries
SQL tools,
applications Data sources
Spreadsheet-like
interface
Explore, manipulate
data without writing
code
Invoke pre-built
functions
Generate charts
Export results of
analysis
Create custom plug-ins
...
2. Scale out R
1
Data Sources
• Partitioning of large data (“divide”)
• Parallel cluster execution of 3 Scalable
pushed down R code (“conquer”) Statistic
• All of this from within the R s Engine
environment (Jaql, Map/Reduce
are hidden from you
• Almost any R package can run in
this environment 2
Or, push R
3. Scalable machine learning functions R Packages
• A scalable statistics engine that right on the
provides canned algorithms, and data
an ability to author new ones, all
via R Embedded R Execution
37 © 2016 IBM Corporation
Overview of BigInsights
Insights in
microseconds
Connectivity to varied
data sources
39 © 2016 IBM Corporation
Limited use license: Cognos BI
Model, explore, analyze
data from many sources
Connection to BigInsights
via Big SQL
Lower Skill
+ Less Cost
Buy only what you need.
Start small and grow. EQUALS
http://www.ibm.com/cloud
42 © 2016 IBM Corporation
http://www.bluemix.net
Summary and Fast Start
$100M
$24B Announced investment
in IBM Interactive
Experience, creating
9
Investment Analytics
10 new labs worldwide
in both organic
development $1B Solution
Centers
and 30+ To bring Developing
acquisitions cognitive curriculum
services and and training for
applications analytics with
to market
1,000
universities
BigInsights
Enables firms to exploit growing variety, velocity, and volume of data
Delivers diverse range of analytics
Leverages and extends open source
Provides enterprise-class features and supporting services
Complement existing software investments and commercial offerings
IBM advantage
Full solution spanning software, hardware & services
Rapid technology advances through partnerships with IBM Research
Global reach
IBM’s Expertise - takes the guesswork out and delivers savings in time and cost for your
early enablement and success
IBM’s Analytics Solution - provides unmatched capabilities for processing and analyzing all
types of data
Skills & Knowledge Transfer - ensures knowledge transfer and training roadmap for skills
enablement in your organization for new analytics requirements
Standard Research Use Case Selection Product Selection Skills & Knowledge Services Soluiton
Roadmap Success
Time to insights
Knowledge Transfer
Stampede Analytics Prototypes
Solution
BVA / Roadmaps Success
IBM Expertise
https://www-01.ibm.com/software/data/services/stampede.html
THINK
IBM big data • IBM big data • IBM big data © 2016 IBM Corporation