Professional Documents
Culture Documents
We Do Hadoop
Spring
Page 1 2015
© Hortonworks Inc. 2011 – 2015. All Rights Reserved
Hadoop emerged as foundation of new data architecture
Apache Hadoop is an open source data platform for
Application
managing large volumes of high velocity and variety of data
Batch Processing
MapReduce • Built by Yahoo! to be the heartbeat of its ad & search business
Storage • Donated to Apache Software Foundation in 2005 with rapid adoption by
HDFS
large web properties & early adopter enterprises
• Incredibly disruptive to current platform economics
2 New Data
New Internet of Things
Business Value
Docs, emails
Server logs
LAGGARDS
ERP CRM SCM
2012 Traditional
2.8 Zettabytes
Data
Marts
Business
Analytics
Visualization
& Dashboards
Archive Data off EDW
Move rarely used data to Hadoop as active
archive, store more data longer
° ° ° ° ° ° ° ° ° N
Financial Services Improved Customer Service Insurance Underwriting Aggregate Banking Data as a Service
Cross-sell & Upsell of Financial Products Risk Analysis for Usage-Based Car Insurance Identify Claims Errors for Reimbursement
Unified Household View of the Customer Searchable Data for NPTB Recommendations Protect Customer Data from Employee Misuse
Telecom Analyze Call Center Contacts Records Network Infrastructure Capacity Planning Call Detail Records (CDR) Analysis
Inferred Demographics for Improved Targeting Proactive Maintenance on Transmission Equipment Tiered Service for High-Value Customers
360° View of the Customer Supply Chain Optimization Website Optimization for Path to Purchase
Retail Localized, Personalized Promotions A/B Testing for Online Advertisements Data-Driven Pricing, improved loyalty programs
Customer Segmentation Personalized, Real-time Offers In-Store Shopper Behavior
Supply Chain and Logistics Optimize Warehouse Inventory Levels Product Insight from Electronic Usage Data
Manufacturing Assembly Line Quality Assurance Proactive Equipment Maintenance Crowdsource Quality Assurance
Single View of a Product Throughout Lifecycle Connected Car Data for Ongoing Innovation Improve Manufacturing Yields
Electronic Medical Records Monitor Patient Vitals in Real-Time Use Genomic Data in Medical Trials
Healthcare Improving Lifelong Care for Epilepsy Rapid Stroke Detection and Intervention Monitor Medical Supply Chain to Reduce Waste
Reduce Patient Re-Admittance Rates Video Analysis for Surgical Decision Support Healthcare Analytics as a Service
Unify Exploration & Production Data Monitor Rig Safety in Real-Time Geographic exploration
Oil & Gas
DCA to Slow Well Declines Curves Proactive Maintenance for Oil Field Equipment Define Operational Set Points for Wells
Single View of Entity CBM & Autonomic Logistic Analysis Sentiment Analysis on Program Effectiveness
Government
Prevent Fraud, Waste and Abuse Proactive Maintenance for Public Infrastructure Meet Deadlines for Government Reporting
Systems of Insight
• Centralized Architecture
Multiple applications on a shared data set
DATA LAKE with consistent levels of service
Drivers:
1. Cost Optimization
2. Advanced Analytic Apps
SCOPE
Page 11 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
HDP: Any Data, Any Application, Anywhere
Any Data Anywhere
Deploy applications fueled by clickstream, sensor, Implement HDP naturally across the
social, mobile, geo-location, server log, and other complete range of deployment options
new paradigm datasets with existing legacy
datasets. commodity appliance cloud
ERP
CRM
SCM
Clickstream
Web
Geoloca3on
Internet
of
Server
Files,
emails
&
Social
Things
Logs
hybrid
Any Application
• Deep integration with ecosystem
partners to extend existing Over 70 Hortonworks Certified YARN Apps
investments and skills
• Broadest set of applications through
the stable of YARN-Ready applications
(6 total)
40% Dev
Dec 2013 Staff
Three Perficient
Nov 2013
Production
Aug 2013 Production
Apps
Training Cluster
June 2013 (3 total)
Begin July 2013 & Dev 60 Nodes
Hortonworks Begins 2 PB
Hadoop
Execution Partnership
Customer Momentum
• 330+ customers (as of year-end 2014)
2. Partnerships
✓ Technology Partnership
✓ Customer Partnership
3. Open-Source + Open-Community
✓ Development Model
✓ Business Model - SUBSCRIPTION, NO LICENSES
✓ Leadership
Governance Governance
… Governance
Resource Management
Cluster N
Cluster 1
Cluster 2
Governance Security Batch Interactive Real-time
YARN
Page 18 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
The value of HDP’s Centralized Architecture
Hortonworks Data Platform Other Hadoop Vendors
A centralized architecture built on YARN Siloed “with” YARN Architecture
Governance
Shared services ensure consistent, effective
Fragmented and bolt-on application of the key
Consistent polices implemented for governance &
Security services increases complexity and risk
Services security. Single point for operations
streamlines deployment
Operations
Resource Mgmnt
Applications require separate clusters:
Efficient Hardware
Shared storage and processing creates lack of sharing creates duplication of storage
Resources efficiencies: less hardware, less cost. and processing: more hardware, more data
movement, and more cost
People
Governance Shared metadata & management of lifecycle Application specific, requires multiple definitions
Consistent Security Comprehensive policy, consistent enforcement Inconsistent app specific security policy, increases risk
Services
Operations Single configuration point, eases deployment, production Configure multiple clusters, resource intense
Resource Mgmnt Efficient sharing & predictable performance for all apps Resources monopolized by specific apps and users
Efficient Hardware Shared resources result in less hardware, less cost Expensive, each cluster/app requires dedicated hardware
Resources
People Single team to manage/maintain cluster Multiple cluster, multiple teams to manage, added cost
Engines New engines slide on YARN seamlessly New engines require costly integration effort
Ease of Applications New applications inherit consistent services Extended deployment time, re-implementing services
Expansion
Clusters More apps & data expand cluster. No new cluster. More apps & data require new cluster and new costs
DATA MANAGEMENT
Deployment Choice
Linux Windows On-Premise Cloud
Teradata u u u u
Informatica u u u HDP is
Oracle u u Apache Hadoop
• Connection Slider 11 11
• Partnership Ambari 34 27
Oozie 3 2
We partner with customers with subscription
Zookeeper 2 1
offering. Our success is predicated on yours.
Knox 13 3
Ranger 10 n/a
TOTAL 161 108
Source: Apache Software Foundation. As of 11/7/2014.
Page 23 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Open Source IS the standard for platform technology
Modern platform standards are defined by open communities
Roadmap matches user For Hadoop, the ASF provides guidelines and
requirements not vendor a governance framework and the open
monetization requirements community defines the standards for Hadoop.
Performance 5% Project 3
Configuration
25%
.
Executing Jobs
20%
Full Lifecycle Subscription Support .
Cluster Administration
18%
Support through EVERY phase of adoption of .
HDP Upgrades
3% your Hadoop project to ensure your success
Enhancement Requests
3% Project N
TOTAL 100% Architecture &
Development Implementation Production Expansion
Hortonworks Support
HDP: A completely open data platform HDP: A centralized architecture built on YARN
Platforms are ultimately defined by open communities. Any application, any data, anywhere.
BusinessObjects BI
HDP 2.2
RDBMS
EDW
MPP
& Integration
Broad Partnerships
Governance
Data Access
Operations
HANA
Security
INFRASTRUCTURE
YARN
Over 600 partners work with us to
Data Management
certify their applications to work with
Hadoop so they can extend big data
to their users
SOURCES
EXISTING
Clickstream
Web
&Social
Geoloca3on
Sensor
&
Server
Logs
Unstructured
Systems
Machine
BusinessObjects BI
OPERATIONAL TOOLS
DATA
DATA SYSTEMS
SYSTEM INFRASTRUCTURE
MicrosoN
Analy3cs
PlaOorm
System
The forward-looking statements made in this prospectus relate only to events as of the date
on which the statements are made and we undertake no obligation to update any of the
information in this presentation.
Trademarks
Page 31 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Hortonworks is a trademark of Hortonworks, Inc. in the United States and other jurisdictions.