Hadoop Distributed File System

Published by: Tikhon Bernstam on Feb 22, 2008
Copyright:Attribution Non-commercial

Hadoop Distributed File System
Dhruba Borthakur, June 2007
 
Goals of HDFS
Very Large Distributed File System
  - 10K nodes, 100 million files, 10 PB
Assumes Commodity Hardware
  - Files are replicated to handle hardware failure
  - Detects failures and recovers from them
Optimized for Batch Processing
  - Data locations exposed so that computations can move to where data resides
  - Provides very high aggregate bandwidth
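
The idea of moving computation to where the data resides can be illustrated with a small scheduling sketch. This is not HDFS or Hadoop code; the function name `pick_node_for_task` and the node and block names are hypothetical, and the only assumption is that some service exposes a map from block IDs to the nodes that store them.

```python
from typing import Dict, List, Optional


def pick_node_for_task(
    block_id: str,
    block_locations: Dict[str, List[str]],
    free_nodes: List[str],
) -> Optional[str]:
    """Choose a node to process a block, preferring one that already stores it.

    block_locations is a hypothetical map of block ID -> nodes holding a replica.
    free_nodes lists nodes that currently have spare compute capacity.
    """
    holders = block_locations.get(block_id, [])
    # First choice: a free node with a local replica, so no data crosses the network.
    for node in free_nodes:
        if node in holders:
            return node
    # Fallback: any free node; the block would then be read remotely.
    return free_nodes[0] if free_nodes else None


if __name__ == "__main__":
    locations = {"blk_1": ["node2", "node3"]}
    print(pick_node_for_task("blk_1", locations, ["node1", "node3"]))
```

Because each block has several replicas, a scheduler of this kind usually has more than one local candidate, which is one reason replication helps throughput as well as fault tolerance.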
 
Distributed File System
Single Namespace for entire cluster
Data Coherency
  - Write-once-read-many access model
  - Client does not see a file until the creator has closed it
Files are broken up into blocks
  - Typically 128 MB block size
  - Each block replicated on multiple DataNodes
Intelligent Client
  - Client can find location of blocks
  - Client accesses data directly from DataNode
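
The block and replica model above can be sketched in a few lines. This is a toy model under stated assumptions, not the HDFS implementation: the replication factor of 3, the round-robin placement, and the names `split_into_blocks`, `place_replicas`, and `locate` are all illustrative choices. Real HDFS uses its own placement policy.

```python
from typing import Dict, List, Tuple

BLOCK_SIZE = 128 * 1024 * 1024  # 128 MB, the typical block size from the slide


def split_into_blocks(file_size: int, block_size: int = BLOCK_SIZE) -> List[Tuple[int, int]]:
    """Return (offset, length) for each block of a file that is file_size bytes long."""
    blocks = []
    offset = 0
    while offset < file_size:
        length = min(block_size, file_size - offset)  # the last block may be shorter
        blocks.append((offset, length))
        offset += length
    return blocks


def place_replicas(num_blocks: int, datanodes: List[str], replication: int = 3) -> Dict[int, List[str]]:
    """Assign each block to `replication` distinct DataNodes (round-robin, an assumption)."""
    if replication > len(datanodes):
        raise ValueError("not enough DataNodes for the requested replication")
    return {
        b: [datanodes[(b + r) % len(datanodes)] for r in range(replication)]
        for b in range(num_blocks)
    }


def locate(offset: int, placement: Dict[int, List[str]], block_size: int = BLOCK_SIZE) -> List[str]:
    """Look up which DataNodes hold the block containing a given byte offset."""
    return placement[offset // block_size]


if __name__ == "__main__":
    mb = 1024 * 1024
    blocks = split_into_blocks(300 * mb)
    placement = place_replicas(len(blocks), ["dn1", "dn2", "dn3", "dn4"])
    print(len(blocks), locate(200 * mb, placement))
```

In this sketch the client only consults the placement map to find a block; it would then read the bytes directly from one of the returned DataNodes, which mirrors the "Intelligent Client" bullets.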
