Welcome to Scribd!

UNIT II Hadoop

Uploaded by

0% found this document useful (0 votes)

8 views11 pages

HDFS is a distributed file system that stores data across multiple nodes. It uses three daemons - the NameNode, Secondary NameNode, and DataNodes. The NameNode manages metadata and maintains file system namespace. Secondary NameNode assists NameNode by periodically merging metadata edits. DataNodes store data blocks and service read/write requests. MapReduce is the processing layer, using a JobTracker to coordinate jobs and TaskTrackers to process tasks on DataNodes.

Original Description:

Original Title

UNIT-II-Hadoop

Copyright

Available Formats

PPTX, PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PPTX, PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

8 views11 pages

UNIT II Hadoop

Uploaded by

Yuvaraj V, Assistant Professor, BCA

Copyright:

Available Formats

Download as PPTX, PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 11

Search inside document

HDFS

• Features of HDFS
– HDFS is a distributed file system which is horizontally scalable and
reliable.
– Data in HDFS is stored on multiple nodes in a distributed manner.
– HDFS is developed in Java.
– The architecture of HDFS is inspired from Google File System.

Dr.N.G.P. Arts and Science College

Coimbatore,Tamil Nadu, India 1
Hadoop 1.x - HDFS

• Three major daemons perform the responsibilities of

HDFS in Hadoop 1.x.
• They are
– NameNode
– DataNode

– Secondary NameNode.

Dr.N.G.P. Arts and Science College

Coimbatore,Tamil Nadu, India 2
Hadoop 1.x - HDFS
Hadoop 1.x - HDFS

• NameNode
– It is the master server and maintains the metadata of the stored
files.
– NameNode stores the name, size, owner, group, permissions etc. of a
file as the metadata of the data files.

Dr.N.G.P. Arts and Science College

Coimbatore,Tamil Nadu, India
Hadoop 1.x - HDFS

• Secondary NameNode
– Runs on a separate machine in the cluster.

– It manages the metadata for the NameNode, i.e. it reads the file
system edits and creates the updated metadata for the NameNode.

– If the existing NameNode fails, then the updated metadata is used to

set up a new NameNode.

Dr.N.G.P. Arts and Science College

Coimbatore,Tamil Nadu, India
Hadoop 1.x - HDFS

• DataNode
– It runs on the slave nodes.

– The machines which host the DataNode daemon store the data files.
– It provides access to the files when requested by the client.
– The DataNode periodically sends heartbeat messages to the
NameNode indicating that it is alive.

Dr.N.G.P. Arts and Science College

Coimbatore,Tamil Nadu, India
Hadoop 1.x - HDFS

Dr.N.G.P. Arts and Science College

Coimbatore,Tamil Nadu, India
Hadoop 1.x – MapReduce
Hadoop 1.x - MapReduce

• MapReduce is the data processing layer of Hadoop.

– Job Tracker

– Task Tracker

Dr.N.G.P. Arts and Science College

Coimbatore,Tamil Nadu, India
Hadoop 1.x – MapReduce
Hadoop 1.x - MapReduce

• Job Tracker
– Job tracker is the single instance running on the master server.

– The job to be executed is first submitted to the Job Tracker.

– Job tracker also initiates separate tasks on various DataNodes in the
cluster.

Dr.N.G.P. Arts and Science College

Coimbatore,Tamil Nadu, India
Hadoop 1.x – MapReduce

• Task Tracker
– Task tracker runs on the slave nodes along with the DataNode
daemons.
– Task tracker initiates the tasks which are assigned by the Job Tracker.
– Task tracker returns the status of the tasks running on the slave
machines to the job tracker.

Dr.N.G.P. Arts and Science College

Coimbatore,Tamil Nadu, India 9
Hadoop 1.x – MapReduce

Dr.N.G.P. Arts and Science College

Coimbatore,Tamil Nadu, India 10
Dr.N.G.P. Arts and Science College
Coimbatore,Tamil Nadu, India 11

AWS Module 3 - AWS Global Infrastructure
Document36 pages
AWS Module 3 - AWS Global Infrastructure
dsadasdasdas
0% (1)
AWS SysOps Administrator Certification Training
Document3 pages
AWS SysOps Administrator Certification Training
ather zaya
No ratings yet
Hadoop Interview Questions New
Document9 pages
Hadoop Interview Questions New
Rupali Shetty
No ratings yet
Hadoop: Data Processing and Modelling
From Everand
Hadoop: Data Processing and Modelling
Garry Turkington
No ratings yet
Comparison of Open-Source Cloud Management Platforms - OpenStack and OpenNebula
Document5 pages
Comparison of Open-Source Cloud Management Platforms - OpenStack and OpenNebula
Carlos Vásquez
No ratings yet
2 Soamanager Crear Puerto Logico
Document47 pages
2 Soamanager Crear Puerto Logico
Michaela Velgica Pacheco
No ratings yet
Hadoop Interview Questions
Document17 pages
Hadoop Interview Questions
patricia
No ratings yet
Hadoop Overview: Open Source Framework Processing Large Amounts of Heterogeneous Data Sets Distributed Fashion
Document62 pages
Hadoop Overview: Open Source Framework Processing Large Amounts of Heterogeneous Data Sets Distributed Fashion
Mousoomi Baruah
No ratings yet
Hadoop Chapter 1
Document6 pages
Hadoop Chapter 1
Swati
No ratings yet
Bda 201070046 01
Document24 pages
Bda 201070046 01
HARSH NAG
No ratings yet
Hadoop Architecture
Document30 pages
Hadoop Architecture
Nandini Malviya
No ratings yet
Hadoop Week 2
Document40 pages
Hadoop Week 2
Rahul Kolluri
No ratings yet
Bda - Unit 2
Document56 pages
Bda - Unit 2
Kajal Vaniya
No ratings yet
Shortnotes For Cloud
Document22 pages
Shortnotes For Cloud
Mahi Mahi
No ratings yet
HDFS 79
Document74 pages
HDFS 79
bhargavi
No ratings yet
Module-2 PPT-1
Document126 pages
Module-2 PPT-1
Lahari bilimale
No ratings yet
Unit IV Notes
Document34 pages
Unit IV Notes
Apoorva Rauniyar
No ratings yet
School of Computer Engineering: Kalinga Institute of Industrial Technology Deemed To Be University Bhubaneswar-751024
Document260 pages
School of Computer Engineering: Kalinga Institute of Industrial Technology Deemed To Be University Bhubaneswar-751024
21053386
No ratings yet
Module 2.1
Document21 pages
Module 2.1
Priyanka Bandagale
No ratings yet
Hdfs
Document7 pages
Hdfs
temp41304
No ratings yet
Unit V Cloud Technologies and Advancements 8
Document33 pages
Unit V Cloud Technologies and Advancements 8
Jaya Prakash M
No ratings yet
Hadoop Architecture: Er. Gursewak Singh Dcse
Document12 pages
Hadoop Architecture: Er. Gursewak Singh Dcse
Daisy Kawatra
No ratings yet
By Pallavi Mandal Class: CS-B Roll No.: 2014BCS1150
Document17 pages
By Pallavi Mandal Class: CS-B Roll No.: 2014BCS1150
neerendra pratap singh
No ratings yet
Unit - 3
Document34 pages
Unit - 3
sixit37787
No ratings yet
Unit 3 Da
Document43 pages
Unit 3 Da
aadityapawar210138
No ratings yet
Hadoop 1.x Architecture: Name: Siddhant Singh Chandel PRN: 20020343053
Document4 pages
Hadoop 1.x Architecture: Name: Siddhant Singh Chandel PRN: 20020343053
Siddhant Singh
No ratings yet
Basic Hadoop Interview Questionsxyzz
Document18 pages
Basic Hadoop Interview Questionsxyzz
shubham rathod
No ratings yet
Hadoop Interview1
Document27 pages
Hadoop Interview1
paramreddy2000
No ratings yet
Cloud Computing - Unit 3
Document38 pages
Cloud Computing - Unit 3
lightfreezzer
No ratings yet
Unit 3 - Hadoop
Document10 pages
Unit 3 - Hadoop
badaltanwarr
No ratings yet
Large-Scale Data Analytics: Traditional Database Systems
Document11 pages
Large-Scale Data Analytics: Traditional Database Systems
venkata
No ratings yet
Unit 5-Cloud PDF
Document33 pages
Unit 5-Cloud PDF
GOKUL b
No ratings yet
Hadoop Interview Guide
Document34 pages
Hadoop Interview Guide
Nadeem Khan Khan
100% (1)
Untitled
Document37 pages
Untitled
asha
No ratings yet
Lecture Notes Hadoop
Document11 pages
Lecture Notes Hadoop
sakshi kureley
No ratings yet
Unit 3
Document15 pages
Unit 3
xcgfxgvx
No ratings yet
Module 2 Hadoop
Document23 pages
Module 2 Hadoop
additiladdha
No ratings yet
Fbda Unit-3
Document27 pages
Fbda Unit-3
Aruna Aruna
No ratings yet
Unit-Iv CC&BD CS71
Document148 pages
Unit-Iv CC&BD CS71
Hael
No ratings yet
Business Intelligence & Big Data Analytics-CSE3124Y
Document26 pages
Business Intelligence & Big Data Analytics-CSE3124Y
splokbov
No ratings yet
UNIT V-Cloud Computing
Document33 pages
UNIT V-Cloud Computing
Jayanth V 19CS045
No ratings yet
Hadoop Interview Questions
Document14 pages
Hadoop Interview Questions
satish.sathya.a2012
No ratings yet
Hadoop Interview Questions
Document14 pages
Hadoop Interview Questions
satish.sathya.a2012
No ratings yet
Unit-2 Hadoop HDFS Hadoopecosystem
Document25 pages
Unit-2 Hadoop HDFS Hadoopecosystem
sisodiyaa853
No ratings yet
CS8791-Cloud Computing UNIT 5 Notes
Document33 pages
CS8791-Cloud Computing UNIT 5 Notes
Quarantine 2.0
No ratings yet
Jenny Blog
Document12 pages
Jenny Blog
Amit Bhartiya
No ratings yet
Hadoop Presentaton
Document47 pages
Hadoop Presentaton
Jhumri Talaiya
No ratings yet
SEN-762 Advanced Big Data Analytics
Document39 pages
SEN-762 Advanced Big Data Analytics
بالیراجپوت
No ratings yet
Introduction To Big Data and Hadoop
Document29 pages
Introduction To Big Data and Hadoop
Manoj K Upadhyaya
100% (1)
Bda Summer 2022 Solution
Document30 pages
Bda Summer 2022 Solution
Vivek
No ratings yet
Big Data Hadoop Questions
Document7 pages
Big Data Hadoop Questions
Bala Giridhar
No ratings yet
Hadoop Week 3
Document60 pages
Hadoop Week 3
Rahul Kolluri
No ratings yet
Hadoop Overview
Document16 pages
Hadoop Overview
Sunil D Patil
100% (1)
Cloud Computing - Unit 5 Notes
Document33 pages
Cloud Computing - Unit 5 Notes
steffinamorin L
No ratings yet
Unit 2
Document30 pages
Unit 2
Awadhesh Maurya
No ratings yet
CC Unit-5
Document33 pages
CC Unit-5
Rajamanikkam Rajamanikkam
No ratings yet
Prepared By: Manoj Kumar Joshi & Vikas Sawhney
Document47 pages
Prepared By: Manoj Kumar Joshi & Vikas Sawhney
kavitha
No ratings yet
Cheat Sheet 1
Document2 pages
Cheat Sheet 1
Anusha Gupta
No ratings yet
Unit 5 Print
Document32 pages
Unit 5 Print
sivapunithan S
No ratings yet
Hadoop, A Distributed Framework For Big Data
Document55 pages
Hadoop, A Distributed Framework For Big Data
sonia choudhary
No ratings yet
HDFS Commands Updated
Document87 pages
HDFS Commands Updated
sowjanya kandukuri
No ratings yet
Unit - II
Document64 pages
Unit - II
praneelp2000
No ratings yet
Module 2
Document131 pages
Module 2
ARUN KUMAR P
No ratings yet
Hadoop Beginner's Guide
From Everand
Hadoop Beginner's Guide
Garry Turkington
Rating: 4 out of 5 stars
4/5 (7)
Dse-Iv E (Theory-Programming) : BATCH: 2019
Document11 pages
Dse-Iv E (Theory-Programming) : BATCH: 2019
Yuvaraj V, Assistant Professor, BCA
No ratings yet
18mit13c U2
Document13 pages
18mit13c U2
Yuvaraj V, Assistant Professor, BCA
No ratings yet
Programme:: B.Sc. CS/IT/CA
Document12 pages
Programme:: B.Sc. CS/IT/CA
Yuvaraj V, Assistant Professor, BCA
No ratings yet
7.6.3 Exploring IPv6 Addressing On Routers
Document3 pages
7.6.3 Exploring IPv6 Addressing On Routers
Yuvaraj V, Assistant Professor, BCA
No ratings yet
Basic IP Servies - Assignment
Document20 pages
Basic IP Servies - Assignment
Yuvaraj V, Assistant Professor, BCA
No ratings yet
Open Ele-Cyber Security-II-NAAC Hours
Document7 pages
Open Ele-Cyber Security-II-NAAC Hours
Yuvaraj V, Assistant Professor, BCA
No ratings yet
IPV-4 ROUTING-Assignment Questions With Solutions
Document15 pages
IPV-4 ROUTING-Assignment Questions With Solutions
Yuvaraj V, Assistant Professor, BCA
No ratings yet
7.2.3 Routing Troubleshooting Tools
Document7 pages
7.2.3 Routing Troubleshooting Tools
Yuvaraj V, Assistant Professor, BCA
No ratings yet
Elective-I-Fundamental of Networks-Assignment
Document42 pages
Elective-I-Fundamental of Networks-Assignment
Yuvaraj V, Assistant Professor, BCA
No ratings yet
Lecture 17
Document136 pages
Lecture 17
Yuvaraj V, Assistant Professor, BCA
No ratings yet
B.SC - IT-UG-CBCS-2019-STRUCTURE & SCHEME-19.3.21 (WITHOUT COP) - Merged-Merged
Document54 pages
B.SC - IT-UG-CBCS-2019-STRUCTURE & SCHEME-19.3.21 (WITHOUT COP) - Merged-Merged
Yuvaraj V, Assistant Professor, BCA
No ratings yet
Cyber Security-Unit-I
Document64 pages
Cyber Security-Unit-I
Yuvaraj V, Assistant Professor, BCA
No ratings yet
Cyber Security-Unit-II
Document29 pages
Cyber Security-Unit-II
Yuvaraj V, Assistant Professor, BCA
No ratings yet
I-Mha - Unit-Iii
Document23 pages
I-Mha - Unit-Iii
Yuvaraj V, Assistant Professor, BCA
No ratings yet
LECT3
Document87 pages
LECT3
Yuvaraj V, Assistant Professor, BCA
No ratings yet
7.2.1 IPv4 Routing Overview
Document5 pages
7.2.1 IPv4 Routing Overview
Yuvaraj V, Assistant Professor, BCA
No ratings yet
Package Program
Document1 page
Package Program
Yuvaraj V, Assistant Professor, BCA
No ratings yet
Unit III SE
Document99 pages
Unit III SE
Yuvaraj V, Assistant Professor, BCA
No ratings yet
C - Unit-Iii
Document25 pages
C - Unit-Iii
Yuvaraj V, Assistant Professor, BCA
No ratings yet
DBMS Functions Program
Document8 pages
DBMS Functions Program
Yuvaraj V, Assistant Professor, BCA
No ratings yet
I-Mha - Unit-V
Document6 pages
I-Mha - Unit-V
Yuvaraj V, Assistant Professor, BCA
No ratings yet
Os Unit Ii
Document69 pages
Os Unit Ii
Yuvaraj V, Assistant Professor, BCA
No ratings yet
Operators in SQL
Document6 pages
Operators in SQL
Yuvaraj V, Assistant Professor, BCA
No ratings yet
Os Unit Iv
Document62 pages
Os Unit Iv
Yuvaraj V, Assistant Professor, BCA
No ratings yet
Unit I Normalization
Document51 pages
Unit I Normalization
Yuvaraj V, Assistant Professor, BCA
No ratings yet
Hive Lecture Notes
Document17 pages
Hive Lecture Notes
Yuvaraj V, Assistant Professor, BCA
100% (1)
Static Partitioning in Hive
Document2 pages
Static Partitioning in Hive
Yuvaraj V, Assistant Professor, BCA
No ratings yet
Creating Tables in Hive
Document3 pages
Creating Tables in Hive
Yuvaraj V, Assistant Professor, BCA
No ratings yet
Advanced+Functions+in+Hive (Students Dat)
Document2 pages
Advanced+Functions+in+Hive (Students Dat)
Yuvaraj V, Assistant Professor, BCA
No ratings yet
CDP Private Cloud Fundamentals 200810
Document31 pages
CDP Private Cloud Fundamentals 200810
Nav SerVa
No ratings yet
Slide-3 (Cloud Computing Models)
Document43 pages
Slide-3 (Cloud Computing Models)
worksukaina
No ratings yet
Rekap Nilai Monev Ikm 2022 Umum
Document9 pages
Rekap Nilai Monev Ikm 2022 Umum
Nanda Bya
No ratings yet
Videos ArchivoTrabajo Final
Document70 pages
Videos ArchivoTrabajo Final
Jorge Portella
No ratings yet
Application Layer Overview and Web/HTTP
Document41 pages
Application Layer Overview and Web/HTTP
ramna k
No ratings yet
Horizontally Scaling and Vertically Scaling
Document4 pages
Horizontally Scaling and Vertically Scaling
Shoeb Ahmed Khan
No ratings yet
IBM MQ With Weblogic Using SSL Connectivity
Document30 pages
IBM MQ With Weblogic Using SSL Connectivity
Nghiêm Tuấn
No ratings yet
Microsoft Cloud Services: Windows Azure Service Windows Azure Appfabric
Document14 pages
Microsoft Cloud Services: Windows Azure Service Windows Azure Appfabric
Mithu gopi
No ratings yet
Cloud Curriculum
Document7 pages
Cloud Curriculum
Vinay Kumar
No ratings yet
AWS Sample Paper 4
Document29 pages
AWS Sample Paper 4
killerboy5652
No ratings yet
Applsci 12 00140
Document14 pages
Applsci 12 00140
Vũ Nguyễn Trần
No ratings yet
p2p and DHT in IT
Document27 pages
p2p and DHT in IT
KAUSHAL SINGH
No ratings yet
IBM Websphere Application
Document3 pages
IBM Websphere Application
Denazareth Jesus
No ratings yet
Architecture Web Server PDF
Document15 pages
Architecture Web Server PDF
sushil4056
No ratings yet
Hyper-V Over SMB
Document17 pages
Hyper-V Over SMB
Carlos Díaz
No ratings yet
How Can We Achieve Load Balance and Fault Tolerance of SOAP Over HTTP Web Service in The Tibco Domain? Details Resolution
Document8 pages
How Can We Achieve Load Balance and Fault Tolerance of SOAP Over HTTP Web Service in The Tibco Domain? Details Resolution
kuruguntla
No ratings yet
Kubernetes+ CKA +0300+ +logging Monitoring
Document19 pages
Kubernetes+ CKA +0300+ +logging Monitoring
Rafael Francisco Do Prado
No ratings yet
HDFS Vs AFS
Document4 pages
HDFS Vs AFS
abulfaiziqbal
No ratings yet
AWS Core Services
Document5 pages
AWS Core Services
Aakash Jain
No ratings yet
May Jun 2023
Document2 pages
May Jun 2023
Satyam Dash
No ratings yet
Rmi Ejb
Document7 pages
Rmi Ejb
pravesh sharma
No ratings yet
Eucalyptus Cloud Computing PDF
Document2 pages
Eucalyptus Cloud Computing PDF
Brad
No ratings yet
Fdwreyt 54 W
Document3 pages
Fdwreyt 54 W
Juan Roger HC
No ratings yet
Question Bank III Unit CC
Document2 pages
Question Bank III Unit CC
Prabha K
No ratings yet
A Performance Evaluation of Containers Running On Managed Kubernetes Services
Document5 pages
A Performance Evaluation of Containers Running On Managed Kubernetes Services
Tony Guo
No ratings yet
Products Whizcard Saa c02!26!23
Document132 pages
Products Whizcard Saa c02!26!23
HARSHAD BAJPAI
No ratings yet