Welcome to Scribd, the world's digital library. Read, publish, and share books and documents. See more
Standard view
Full view
of .
Look up keyword
Like this
0 of .
Results for:
No results containing your search query
P. 1
Hadoop Distributed File System

Hadoop Distributed File System



|Views: 1,967|Likes:
Published by Tikhon Bernstam

More info:

Published by: Tikhon Bernstam on Feb 22, 2008
Copyright:Attribution Non-commercial


Read on Scribd mobile: iPhone, iPad and Android.
download as PDF, TXT or read online from Scribd
See more
See less





Hadoop Distributed File System
Dhruba Borthakur June, 2007
Goals of HDFS
Very Large Distributed File System
10K nodes, 100 million files, 10 PB
Assumes Commodity Hardware
Files are replicated to handle hardware failure
Detect failures and recovers from them
Optimized for Batch Processing
Data locations exposed so that computationscan move to where data resides
Provides very high aggregate bandwidth
Distributed File System
Single Namespace for entire cluster 
Data Coherency
Write-once-read-many access model
Client does not see a file until the creator hasclosed it
Files are broken up into blocks
Typically 128 MB block size
Each block replicated on multiple DataNodes
Intelligent Client
Client can find location of blocks
Client accesses data directly from DataNode

Activity (15)

You've already reviewed this. Edit your review.
1 hundred reads
1 thousand reads
talk2parimi liked this
talk2parimi liked this
karthikeya liked this
guptesanket liked this
Ashoka Vanjare liked this
Nive Ditha liked this
shashwat2010 liked this
Dhiraj Shrestha liked this

You're Reading a Free Preview

/*********** DO NOT ALTER ANYTHING BELOW THIS LINE ! ************/ var s_code=s.t();if(s_code)document.write(s_code)//-->