
Big Data Overview

© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Big Data: Where is it coming from?
Today, decisions are based on transactions.

[Diagram labels: Business Model, Data Model, Transaction Collection, Business Intelligence, Business Decisions; transactional data ($€¥)]

[Diagram labels, other data sources: social media, web pages, blogs, images, instant messaging (IM), email, search engine, audio, video]

Tomorrow: better decisions
[Diagram: today's data context, divided into unmanaged content and managed content, with external or discarded content set against internal content. Every layer carries the label "Operation, Governance, Security".]

• Source types: social media, sensor data, multimedia, document management, message data, business transactions and interactions
• Other labels on the content side: clicks, search, web, RFID, collaboration, sensor devices, feeds, images, file hosting, messaging system, blog, GPS, video, file sharing, IM, forum, audio, VoIP, events, other content management
• Business systems: CRM, ERM, SCM, FMS, HRM; transaction data ($€¥)
• Processing chain: classic ETL processing, enterprise data warehouse, business intelligence & analytics (analytical dashboards, reports, visualization)
What exactly is Big Data?
Datasets whose volume, velocity, variety and complexity exceed the ability of commonly used
software tools to capture, process, store, manage, and analyze them.

[Diagram: Big Data at the center of four dimensions: velocity, volume, variety, complexity]

Information sources: CRM, SCM, ERP; video; IT ops; email; transactional data ($€¥); mobile; audio; texts; social media; search; images

*Gartner, Inc., “‘Big Data’ Is Only the Beginning of Extreme Information Management”, Mark A. Beyer, Anne Lapkin, Nicholas Gall, Donald Feinberg, Valentin T. Sribar, published 7 April 2011

• Both structured and unstructured data present new opportunities for buyers, service providers,
researchers and consumers as a means for better understanding the world.
HP Big Data Solution

HP’s HAVEn – big data platform

HAVEn stands for:

• Hadoop/HDFS: catalog massive volumes of distributed data
• Autonomy IDOL: process and index all information
• Vertica: analyze at extreme scale in real time
• Enterprise Security: collect & unify machine data
• nApps: powering HP Software + your apps

Data sources: social media, video, audio, email, texts, mobile, transactional data, documents, IT/OT, search engine, images
hp.com/haven
HP offers a complete analytics platform
HP offers a wide array of optimized platforms
[Diagram: HP analytics platform in four stages: Data Sources, Data Processing, Data Analytics, User Interfaces]

• Structured data sources: databases, warehouses, ERP, CRM
• Machine data: log files
• Unstructured data sources: social media, customer calls, emails
• Processing and analytics: Vertica (SQL-compliant analytics), Hadoop (HP Hadoop solutions, connectors), Autonomy (meaning-based analytics)
• User interfaces: business users, applications
• Supporting offerings: HP Hadoop AppSystems & Reference Architectures, SL4500 architecture, consulting services, partner ecosystem (Intel, SAS, Syncsort)


Apache Hadoop

© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Apache Hadoop is a software framework

An open source, Linux-based platform for data storage and data processing that is:
• Scalable
• Fault tolerant
• Distributed

It has recently been ported to Windows by Hortonworks for Microsoft.

Two core Hadoop system components
• Hadoop Distributed File System (HDFS): storage for Big Data; self-healing, high-bandwidth clustered storage
• MapReduce: data processing; distributed computing framework

It has the flexibility to store and mine any type of data
• Query previously inaccessible structured and unstructured data
• Not bound by a single schema

It excels at processing complex data
• Scale-out architecture divides workloads across multiple nodes
• Flexible file system eliminates ETL bottlenecks

It scales economically
• Deployable on commodity hardware
• Open source platform guards against vendor lock-in
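
To make the MapReduce processing model more concrete, here is a minimal single-process sketch in plain Python. It only imitates the three logical steps (map, shuffle, reduce) on a word-count task. It does not use the Hadoop API, and the function names (map_phase, shuffle, reduce_phase, word_count) and the sample records are hypothetical, chosen for illustration. On a real cluster, the framework would run the map and reduce steps in parallel on many nodes over data stored in HDFS.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(record: str) -> Iterator[Tuple[str, int]]:
    """Map step: emit a (word, 1) pair for every word in one input record."""
    for word in record.lower().split():
        yield word, 1


def shuffle(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Shuffle step: group all emitted values by key.

    In this sketch the grouping happens in one in-memory dictionary;
    a distributed framework would move the pairs between nodes instead.
    """
    grouped: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return dict(grouped)


def reduce_phase(key: str, values: List[int]) -> Tuple[str, int]:
    """Reduce step: collapse all values for one key into a single count."""
    return key, sum(values)


def word_count(records: Iterable[str]) -> Dict[str, int]:
    """Run map, shuffle and reduce in sequence over the input records."""
    mapped = (pair for record in records for pair in map_phase(record))
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    # Hypothetical sample input: each string stands for one input record.
    sample = ["big data big clusters", "data at scale"]
    print(word_count(sample))
```

Because each record is mapped independently and each key is reduced independently, the work can be divided across multiple nodes, which is the scale-out property listed above.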
Source: Hortonworks website
