
Big Data and Hadoop Essentials

Harsha Vachhani
July 23, 2017
Google Developer Group
Women Techmakers
Event
Table of Contents
Big Data Concept, Origin, Use Cases

Big Data challenges, traditional solutions, comparison between RDBMS and Hadoop

Hadoop - Introduction, Origin

Hadoop Architecture - HDFS and MapReduce

Hadoop Ecosystem

Hadoop Distributions

3 Vs of Big Data

History of Apache Hadoop

Core Hadoop Framework

Why we need a file system

MapReduce

MapReduce Example
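
As an illustration of the map, shuffle, and reduce phases, here is a minimal word-count sketch in plain Python. Word count is the conventional teaching example and is an assumption here, not necessarily the example shown on the slide. The function names (`map_phase`, `shuffle_phase`, `reduce_phase`, `word_count`) are hypothetical and do not come from the Hadoop API; the code runs in a single process and only mimics what a Hadoop job would distribute across a cluster.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Shuffle: group all emitted values by key (done by the framework in Hadoop)."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: sum the counts collected for one word."""
    return key, sum(values)


def word_count(lines):
    """Run the three phases in sequence on an in-memory list of lines."""
    mapped = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle_phase(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    sample = ["big data needs hadoop", "hadoop stores big data"]
    print(word_count(sample))
```

In a real cluster, the map calls would run in parallel on the nodes holding each HDFS block, and the shuffle would move data across the network so that every value for a given key reaches the same reducer.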

YARN

Key Benefits of the Hadoop 2.0 YARN Component

Improved cluster utilization
Highly scalable
Beyond Java
Novel programming models and services
Agility

Hadoop Cluster

Hadoop Ecosystem

Hadoop Distribution - Vendors and Hosting

Thank You

