Welcome to Scribd. Sign in or start your free trial to enjoy unlimited e-books, audiobooks & documents.Find out more
Standard view
Full view
of .
Look up keyword
Like this
0 of .
Results for:
No results containing your search query
P. 1


Ratings: (0)|Views: 6|Likes:
Published by sandeep_nagar29

More info:

Categories:Types, School Work
Published by: sandeep_nagar29 on Feb 17, 2009
Copyright:Attribution Non-commercial


Read on Scribd mobile: iPhone, iPad and Android.
download as PDF or read online from Scribd
See more
See less


Hadoop Distributed File System
Dhruba Borthakur June, 2007
Goals of HDFS
Very Large Distributed File System
10K nodes, 100 million files, 10 PB
Assumes Commodity Hardware
Files are replicated to handle hardware failure
Detect failures and recovers from them
Optimized for Batch Processing
Data locations exposed so that computationscan move to where data resides
Provides very high aggregate bandwidth
Distributed File System
Single Namespace for entire cluster 
Data Coherency
Write-once-read-many access model
Client does not see a file until the creator hasclosed it
Files are broken up into blocks
Typically 128 MB block size
Each block replicated on multiple DataNodes
Intelligent Client
Client can find location of blocks
Client accesses data directly from DataNode

You're Reading a Free Preview

/*********** DO NOT ALTER ANYTHING BELOW THIS LINE ! ************/ var s_code=s.t();if(s_code)document.write(s_code)//-->