Apache Hadoop is an open-source framework that efficiently processes large volumes of data across a cluster of commodity hardware. It divides large files into smaller pieces that are stored and processed in parallel across multiple machines. A Hadoop cluster comprises three key components: the Hadoop Distributed File System (HDFS) for storage, MapReduce for processing, and YARN for resource management. HDFS follows a master/slave architecture with a single NameNode that coordinates data access and stores metadata, and DataNodes that store file blocks and perform read/write operations according to the NameNode's instructions.
Apache Hadoop is an open-source, scalable, and fault-tolerant framework written in Java. It efficiently processes large volumes of data on a cluster of commodity hardware. Hadoop is not only a storage system; it is a platform for both large-scale data storage and data processing. In this lecture, we take a look at how Apache Hadoop works under the hood. When Apache Hadoop is fed a huge file, the framework divides that big file into smaller pieces and stores them across multiple machines so they can be processed in parallel. This is why Hadoop interconnects an army of widely available and relatively inexpensive machines into a Hadoop cluster. No matter the size of the file the user feeds to Hadoop, each cluster accommodates three functional layers: the Hadoop Distributed File System (HDFS) for data storage, Hadoop MapReduce for processing, and Hadoop YARN for resource management. A short write example appears below.
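To make this concrete, here is a minimal sketch of writing a file through the standard HDFS Java client. The NameNode URI, path, and payload are illustrative placeholders, not values from this lecture. The point is that the client writes one logical stream, while HDFS transparently splits it into blocks placed on DataNodes.

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.FSDataOutputStream;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;

    import java.nio.charset.StandardCharsets;

    public class HdfsWriteSketch {
        public static void main(String[] args) throws Exception {
            Configuration conf = new Configuration();
            // Placeholder NameNode URI; in practice this comes from core-site.xml.
            conf.set("fs.defaultFS", "hdfs://namenode-host:9000");

            try (FileSystem fs = FileSystem.get(conf)) {
                // The client writes one logical stream; HDFS splits it into
                // blocks and places replicas on DataNodes chosen by the NameNode.
                Path file = new Path("/user/demo/big-file.txt"); // placeholder path
                try (FSDataOutputStream out = fs.create(file, true)) {
                    out.write("example payload".getBytes(StandardCharsets.UTF_8));
                }
            }
        }
    }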
We then get a brief introduction to HDFS, a distributed file system that follows a master/slave architecture. It consists of a single NameNode and many DataNodes. In the HDFS architecture, a file is divided into one or more blocks of 128 MB (the size can be changed in the configuration), and the blocks are stored on separate DataNodes. DataNodes are responsible for operations such as block creation, deletion, and replication according to the NameNode's instructions. Apart from that, they are responsible for performing read/write operations on the file system. The NameNode acts as the master server and the central controller for HDFS. It holds the file system metadata and maintains the file system namespace. The NameNode oversees the condition of the DataNodes and coordinates access to data, as the sketch below shows.
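The block division and the NameNode's role as metadata keeper can both be observed from the client side. The following sketch, again with a placeholder NameNode URI and file path, overrides the default 128 MB block size through the standard dfs.blocksize property and then asks the NameNode for the block locations of a file; each BlockLocation lists the DataNodes holding a replica of that block.

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.BlockLocation;
    import org.apache.hadoop.fs.FileStatus;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;

    public class HdfsBlockSketch {
        public static void main(String[] args) throws Exception {
            Configuration conf = new Configuration();
            conf.set("fs.defaultFS", "hdfs://namenode-host:9000"); // placeholder
            // Override the default 128 MB block size; the value is in bytes.
            conf.setLong("dfs.blocksize", 256L * 1024 * 1024);

            try (FileSystem fs = FileSystem.get(conf)) {
                Path file = new Path("/user/demo/big-file.txt"); // placeholder
                FileStatus status = fs.getFileStatus(file);

                // The NameNode answers this metadata query: for each block of
                // the file, which DataNodes hold a replica.
                BlockLocation[] blocks =
                    fs.getFileBlockLocations(status, 0, status.getLen());
                for (BlockLocation block : blocks) {
                    System.out.printf("offset=%d length=%d hosts=%s%n",
                        block.getOffset(), block.getLength(),
                        String.join(",", block.getHosts()));
                }
            }
        }
    }

Note that dfs.blocksize is a client-side property applied when a file is created, so setting it here only affects files written by this client; the cluster-wide default lives in hdfs-site.xml.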