Welcome to Scribd. Sign in or start your free trial to enjoy unlimited e-books, audiobooks & documents.Find out more
Download
Standard view
Full view
of .
Look up keyword
Like this
1Activity
0 of .
Results for:
No results containing your search query
P. 1
21mpd7qycyacgj9aa4ut

21mpd7qycyacgj9aa4ut

Ratings: (0)|Views: 6|Likes:
Published by sandeep_nagar29

More info:

Categories:Types, School Work
Published by: sandeep_nagar29 on Feb 17, 2009
Copyright:Attribution Non-commercial

Availability:

Read on Scribd mobile: iPhone, iPad and Android.
download as PDF or read online from Scribd
See more
See less

06/14/2009

 
Hadoop Distributed File System
Dhruba Borthakur June, 2007
 
Goals of HDFS
Very Large Distributed File System
 –
10K nodes, 100 million files, 10 PB
Assumes Commodity Hardware
 –
Files are replicated to handle hardware failure
 –
Detect failures and recovers from them
Optimized for Batch Processing
 –
Data locations exposed so that computationscan move to where data resides
 –
Provides very high aggregate bandwidth
 
Distributed File System
Single Namespace for entire cluster 
Data Coherency
 –
Write-once-read-many access model
 –
Client does not see a file until the creator hasclosed it
Files are broken up into blocks
 –
Typically 128 MB block size
 –
Each block replicated on multiple DataNodes
Intelligent Client
 –
Client can find location of blocks
 –
Client accesses data directly from DataNode

You're Reading a Free Preview

Download
scribd
/*********** DO NOT ALTER ANYTHING BELOW THIS LINE ! ************/ var s_code=s.t();if(s_code)document.write(s_code)//-->