
Distributed File Systems

1
Organization
 General concepts
 Introduction
 NFS
 AFS
 CODA

2
Introduction
 File systems were originally developed for
centralized computer systems and desktop
computers.
 A file system is an operating system
facility that provides a convenient programming
interface to disk storage.

3
Introduction
 The main purposes of using files in operating systems
are:
 Permanent storage of information
 Sharing of information: a file can be created by one
application and then shared with different applications.

 A file system is a subsystem of an operating system
that performs file management activities such as
organization, storage, retrieval, naming, sharing and
protection of files.

4
What Distributed File Systems Provide

 Access to data stored at servers using file system
interfaces
 What are the file system interfaces?
 Open a file, check status of a file, close a file
 Read data from a file
 Write data to a file
 Lock a file or part of a file
 List files in a directory, create/delete a directory
 Delete a file, rename a file, add a symlink (a special
type of file that contains a reference to another file or
directory in the form of an absolute or relative path) to a file
 etc.
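As a concrete illustration, the sketch below exercises these operations in Python against a local POSIX file system; a DFS offers the same interface but services each call at a remote server. (The fcntl locking calls are Unix-only.)

```python
import fcntl
import os

fd = os.open("demo.txt", os.O_CREAT | os.O_RDWR, 0o644)  # open a file
os.write(fd, b"hello")                  # write data to a file
os.lseek(fd, 0, os.SEEK_SET)
print(os.read(fd, 5))                   # read data from a file
print(os.fstat(fd).st_size)             # check status of a file
fcntl.lockf(fd, fcntl.LOCK_EX)          # lock the file
fcntl.lockf(fd, fcntl.LOCK_UN)          # unlock it again
os.close(fd)                            # close the file

os.mkdir("subdir")                      # create a directory
os.symlink("demo.txt", "subdir/alias")  # add a symlink (relative path)
print(os.listdir("."))                  # list files in a directory
os.rename("demo.txt", "renamed.txt")    # rename a file
os.remove("subdir/alias")               # delete the symlink
os.rmdir("subdir")                      # delete the directory
os.remove("renamed.txt")                # delete the file
```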
5
DFS?
 A distributed file system (DFS) is a file system whose data
is stored on one or more servers and accessed over a network.
 The DFS makes it convenient to share information and files
among users on a network in a controlled and authorized
way.
 The server allows the client users to share files and store data
just as if they were storing the information locally.
 However, the servers have full control over the data and give
access control to the clients.
 A DFS makes it possible to restrict access to the file system
depending on access lists or capabilities on both the servers
and the clients, depending on how the protocol is designed.

6
Why DFSs are Useful
 Data sharing among multiple users
 User mobility
 Location transparency
 Backups and centralized management

7
Cont…
 Clients and Servers:
 Clients access files and directories that are provided
by one or more file servers.
 File Servers provide a client with a file service
interface and a view of the file system
 Servers allow clients to perform operations from the
file service interface on the files and directories
 Operations: add/remove, read/write
 Servers may provide different views to different
clients
8
Desirable features of DFS
 A good distributed file system should have the following features.
 Transparency:-
 Location: a client cannot tell where a file is located; the
name of a file does not reveal any hint of the file's physical storage
location.
• Location transparency: the location of the file does not appear in the name of
the file
• ex: /server1/dir1/file specifies the server but not where the server is
located -> the server can move the file in the network without changing
the path

 Migration: a file can transparently move to another server
 Replication: multiple copies of a file may exist
 Concurrency: multiple clients access the same file

9
Cont…
 Flexibility: In a flexible DFS, it must be
possible to add or replace file servers.
 Also, a DFS should support multiple
underlying file system types (e.g., various
Unix file systems, various Windows file
systems, etc.)

10
Cont…
 Reliability:
 Consistency: employing replication and allowing
concurrent access to files may introduce consistency
problems.

 Security: clients must authenticate themselves, and servers
must determine whether clients are authorized to perform the
requested operations. Furthermore, communication between
clients and the file server must be secured.

 Fault tolerance: clients should be able to continue working
if a file server crashes. Likewise, data must not be lost, and
a restarted file server must be able to recover to a valid
state.

11
Cont…
 Performance: In order for a DFS to offer good
performance it may be necessary to distribute
requests across multiple servers.
 Multiple servers may also be required if the
amount of data stored by a file system is very
large.

12
Cont…
 Scalability: A scalable DFS will avoid
centralized components such as a centralized
naming service, a centralized locking facility,
and a centralized file store.
 A scalable DFS must be able to handle an
increasing number of files and users.
 It must also be able to handle growth over a
geographic area (e.g., clients that are widely
spread over the world), as well as clients from
different administrative domains.

13
Cont…
 Other features are:

 User mobility
 Simplicity and ease of use

 High availability

 High reliability

 Security

 Heterogeneity

14
Accessing Remote Files
 There are several models to service a client’s file access request when the
accessed file is a remote file:

 Remote service model: the client’s request is performed at the server’s node.

 Data-caching model: attempts to reduce the amount of network traffic by
taking advantage of the locality found in file accesses. The client’s request
is performed at the client’s own node using the cached data. Compared with
the remote service model, this model greatly reduces network traffic.

 Almost all existing distributed file systems implement some form of
caching.

 One problem, referred to as the cache consistency problem, is keeping the
cached data consistent with the original file content.

15
Units of Data Transfer
 In systems that use the data-caching model, an important issue is deciding
the unit of data transfer. The possible models are:

 File-level transfer model. In this model, when an operation requires file
data to be transferred across the network in either direction between a
client and a server, the whole file is moved.

 Block-level transfer model. In this model, file data transfers across the
network between a client and a server take place in units of file blocks.

 Byte-level transfer model. In this model, file data transfers across the
network between a client and a server take place in units of bytes.

 Record-level transfer model. In this model, the unit of transfer is the
record.
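A hedged sketch of how the first two models differ at the client, in Python; server.fetch_file and server.fetch_block are hypothetical RPCs, and the 8 KB block size is an assumption for illustration.

```python
BLOCK_SIZE = 8192  # assumed block size

def read_file_level(server, path):
    # File-level model: the whole file crosses the network once; every
    # later read is satisfied from the local copy.
    return server.fetch_file(path)                # hypothetical RPC

def read_block_level(server, path, offset, length):
    # Block-level model: only the blocks that cover the requested byte
    # range [offset, offset + length) cross the network.
    first = offset // BLOCK_SIZE
    last = (offset + length - 1) // BLOCK_SIZE
    data = b"".join(server.fetch_block(path, b)   # hypothetical RPC
                    for b in range(first, last + 1))
    start = offset - first * BLOCK_SIZE
    return data[start:start + length]
```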

16
Semantics of File Sharing
 (a) On a single processor, when a read follows a write,
the value returned by the read is the value just written.
 (b) In a distributed system with caching, obsolete values may
be returned.

17
Cont….

Four ways of dealing with shared files in a distributed system.

18
Semantics of File Sharing: UNIX

 Every operation on a file is instantly visible to all
parties
 A Read following a Write will return the value just
written
 For all users of the file
 Enforces (requires) a total global order on all file
operations to return the most recent value

 A distributed file server must replicate this behavior

19
Semantics of File Sharing: UNIX
 Distributed UNIX Semantics
 Could use a single centralized server which would thus serialize all file
operations
 Provides poor performance
 In the case of a DFS, it is possible to achieve such semantics
if there is only a single file server and no client-side caching
is used.
 In practice, such a system is unrealistic because caches are
needed for performance and write-through caches are
expensive.
 Furthermore deploying only a single file server is bad for
scalability.
 Because of this, it is in practice impossible to achieve UNIX
semantics with distributed file systems.

20
Semantics of File Sharing: Session
 In the case of session semantics, changes to an open
file are only locally visible.
 Only after a file is closed are changes propagated to
the server (and other clients).
 This raises the issue of what happens if two clients
modify the same file simultaneously.
 It is generally up to the server to resolve conflicts
and merge the changes.

21
Semantics of File Sharing: Immutable files
 Immutable files cannot be altered after they have
been closed.
 In order to change a file, instead of overwriting the
contents of the existing file a new file must be
created.
 This file may then replace the old one as a whole.
 Problems with this approach include a race condition
when two clients try to replace the same file at the
same time, and the question of what to do with
processes that are reading a file at the same time as
it is being replaced by another process.

22
Semantics of File Sharing: Atomic
Transactions
 In the transaction model, a sequence of file manipulations can
be executed indivisibly, which implies that two transactions
can never interfere.
 Changes are all or nothing
 Begin-Transaction
 End-Transaction
 System responsible for enforcing serialization
 Ensuring that concurrent transactions produce results consistent with
some serial execution
 Transaction systems commonly track the read/write component
operations
 The transaction model gives implementers of distributed systems the
familiar aid of atomicity
 Commit and rollback are both very useful in simplifying
implementation; a minimal sketch of the idea follows below
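As a minimal sketch of all-or-nothing file updates, the Python below stages changes in a copy and commits with an atomic rename; a real transaction system would add concurrency control and logging, which this sketch omits.

```python
import os
import shutil

def run_transaction(path, mutate):
    """Apply mutate() to a file so the change is all or nothing."""
    staging = path + ".txn"              # Begin-Transaction: stage a copy
    shutil.copyfile(path, staging)
    try:
        mutate(staging)                  # a sequence of file manipulations
        os.replace(staging, path)        # End-Transaction: atomic commit
    except Exception:
        os.remove(staging)               # rollback: the original is untouched
        raise
```

Either the rename happens and every change in the sequence becomes visible at once, or the staging copy is discarded and none of it does.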
 This is the standard model for databases, but it is expensive to
implement.
23
Caching
 Caching is often used to improve the performance of
a DFS.
 In a DFS caching involves storing either a whole
file, or the results of file service operations.
 Caching can be performed at two locations: at the
server and at the client.
 Server-side caching makes use of file caching
provided by the host operating system. This is
transparent to the server and helps to improve the
server’s performance by reducing costly disk
accesses.

24
Cont…
 Client-side caching comes in two flavours: on-disk
caching, and in-memory caching.
 On-disk caching involves the creation of (temporary)
files on the client’s disk. These can either be
complete files (as in the upload/download model) or
they can contain partial file state, attributes, etc.
 In-memory caching stores the results of requests in
the client-machine’s memory. This can be process-
local (in the client process), in the kernel, or in a
separate dedicated caching process.

25
Caching…

Cache hit: the requested data is found in the local cache, so no remote
access is needed.
For example, a web browser program might check its local cache on
disk to see if it has a local copy of the contents of a web page at a
particular URL.

The percentage of accesses that result in cache hits is known as the
hit rate or hit ratio of the cache.
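The bookkeeping behind the hit ratio is simple enough to show directly; in this toy sketch, fetch stands in for whatever remote access a miss would require.

```python
class CountingCache:
    def __init__(self):
        self.entries = {}
        self.hits = 0
        self.accesses = 0

    def get(self, key, fetch):
        self.accesses += 1
        if key in self.entries:
            self.hits += 1                  # cache hit: served locally
        else:
            self.entries[key] = fetch(key)  # cache miss: go to the origin
        return self.entries[key]

    def hit_ratio(self):
        return self.hits / self.accesses if self.accesses else 0.0
```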
26
Caching…
 Key decisions in distributed systems in case of
caching:
 Granularity of cached data (large versus small)
 Cache size (large versus small, fixed versus dynamically
changing)
 Replacement policy
 In distributed systems a file-caching scheme should
also address the following key decisions:
 Cache location
 Modification propagation
 Cache validation

27
Caching…

28
Caching…

29
Caching…
 Modification propagation-
 The aim is keeping file data cached at multiple client
nodes consistent.
 There are several approaches concerning:
 When to propagate modifications made to cached data to
the corresponding file server
 How to verify the validity of cached data
 The modification propagation scheme used has a critical
effect on the system’s performance and reliability.
 The file semantics supported depends greatly on the
modification propagation scheme used.

30
Caching…
 Write-through scheme:
 When a cache entry is modified, the new value is immediately
sent to the server to update the master copy of the file.
 Advantages: reliability and suitability for UNIX-like
semantics.
 Drawback: every write causes network traffic to the server,
which makes write-through caching expensive.

 Delayed-write scheme:
 The aim is to reduce network traffic for writes.
 Writes are not propagated to the server immediately, but in
the background later on; with the write-on-close variant, the
server receives updates only after the file is closed.
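A sketch contrasting the two schemes above; server.put is a hypothetical upload RPC, and for brevity neither class handles reads or eviction.

```python
class WriteThroughCache:
    def __init__(self, server):
        self.server, self.entries = server, {}

    def write(self, path, data):
        self.entries[path] = data
        self.server.put(path, data)   # every write reaches the server now

class DelayedWriteCache:
    def __init__(self, server):
        self.server, self.entries, self.dirty = server, {}, set()

    def write(self, path, data):
        self.entries[path] = data     # only the local cache is updated
        self.dirty.add(path)          # remember to propagate it later

    def close(self, path):            # write-on-close variant
        if path in self.dirty:
            self.server.put(path, self.entries[path])
            self.dirty.discard(path)
```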

31
Caching…
 The delayed-write scheme helps performance for write
accesses for the following reasons:
 Write accesses complete more quickly because the new
value is written only in the cache of the client performing
the write.
 Delaying propagation (as in write-on-close) has the benefit
of avoiding redundant writes: data overwritten before it is
propagated never needs to be sent at all.
 Gathering all file updates and sending them together to
the server is more efficient than sending each update
separately.
 Reliability problem: modifications not yet sent to
the server from a client’s cache will be lost if the
client crashes.
32
Caching…
 Cache validation schemes:
 File data may simultaneously reside in the caches of multiple nodes.
 The modification propagation policy only specifies when the master
copy of a file at the server node is updated upon modification of a
cache entry; a validation scheme decides when other cached copies
are checked against it.
 Client-initiated approach:
 Check on file open. With this option, a client’s cache entry is
validated only when the client opens the corresponding file for use. It
is suitable for supporting session semantics.
 Server-initiated approach:
 The server maintains a record of which files are cached by which
clients, and notifies them when their cached copies become invalid.

33
File Replication
 One way to improve the performance and fault
tolerance of a DFS is to replicate its
content.
 A replicating DFS maintains multiple copies
of files on different servers.
 This can prevent data loss, protect a system
against down time of a single server, and
distribute the overall workload.

34
File Replication
 There are three approaches to replication in a DFS:

 Explicit replication: the client explicitly writes files to
multiple servers. This approach requires explicit support from
the client and does not provide transparency.

 Lazy file replication: the server copies files to other servers
after the files are written. Remote copies are only brought up to
date some time after a file has changed, which affects the
consistency of the file state.

 Group file replication: write requests are simultaneously sent
to a group of servers. This keeps all the replicas up to date, and
allows clients to read consistent file state from any replica.

35
File Replication
 Differences between Replication and Caching
 A replica is associated with a server, whereas a cached
copy is normally associated with a client.
 The existence of a cached copy is primarily dependent on
the locality in file access patterns, whereas the existence
of a replica normally depends on availability and
performance requirements.
 As compared to a cached copy, a replica is widely known,
secure, available, complete and accurate.
 A cached copy is dependent upon a replica.

36
File Replication …
 The possible benefits offered by the replication
of data are:
 Increased availability
 Increased reliability

 Improved response time

 Reduced network traffic

 Improved system throughput

 Better scalability

37
Fault tolerance
 Fault tolerance is an important issue in the design of a
distributed file system. The characteristics of distributed
systems make several kinds of fault situations possible.
 The primary file properties that directly influence the
ability of a distributed file system to tolerate faults are:

 Availability. The fraction of the time for which the file is
available for use. Replication is a mechanism for improving
the availability of a file.
 Robustness. The power of a file to survive crashes of the storage
device.
 Recoverability. The ability to be rolled back to an earlier, consistent state
when an operation on the file fails or is aborted by the client.

38
Distinctions between Stateful vs Stateless
service
The file servers that implement a distributed file service can be stateless or stateful.
 Failure recovery:
 Stateful server
 A stateful server loses all its volatile state in a crash.
• It can restore its state by a recovery protocol based on a dialog with
clients, or abort the operations that were underway when the crash
occurred.
• The server needs to be aware of client failures in order to reclaim space
allocated to record the state of crashed client processes (orphan
detection and elimination).
 Stateless server
 With a stateless server, the effects of server failures and recovery are almost
unnoticeable.
 The main advantage of stateless servers is that they can easily recover
from failure. Because there is no state that must be restored, a failed server
can simply restart after a crash and immediately provide services to clients
as though nothing happened.
 A newly restarted server can respond to a self-contained request without
any difficulty.
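The sketch below illustrates, in the spirit of stateless protocols such as NFS (up to version 3), why recovery is trivial: every request carries all the context the server needs, so a freshly restarted server can answer it with no restored state. The dict standing in for on-disk storage is an assumption of the sketch.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class ReadRequest:
    file_handle: bytes   # identifies the file; no server-side open-file table
    offset: int          # absolute position, not a per-client cursor
    count: int           # number of bytes wanted

def handle_read(store: dict, req: ReadRequest) -> bytes:
    # The server consults only durable data (here, a dict standing in for
    # disk) plus the request itself; no volatile per-client state exists.
    data = store[req.file_handle]
    return data[req.offset:req.offset + req.count]
```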

39
NFS Introduction
 Network File System (NFS) is a distributed
file system protocol originally developed
by Sun Microsystems in 1984, allowing a user
on a client computer to access files over
a network in a manner similar to how local
storage is accessed.
 The basic idea behind NFS is that each file
server provides a standardized view of its
local file system (each NFS server supports
the same model).
 NFS allows an arbitrary number of clients and
servers to share a common file system over
local area or wide area networks.
40
NFS Introduction
 NFS has evolved through versions 1, 2, 3 & 4.
 NFS is a collection of protocols that together
provide clients with a model of a distributed
file system.
 The NFS protocols have been designed in
such a way that different implementations
should easily interoperate. In this way, NFS
can run on a heterogeneous collection of
computers.
 An open standard with clear and simple
interfaces.
41
Cont…
 Supports many of the design requirements
already mentioned:
 transparency
 heterogeneity

 efficiency

 fault tolerance

42
NFS Architecture

(a) The remote access model
(b) The upload/download model

43
Cont…
There are benefits and drawbacks to both models.
The remote access model -
 In the first approach (The remote access model) all operations are
performed at the server itself, with clients simply sending commands to
the server.
 It makes it possible for the file server to order all operations and therefore
allow concurrent modifications to the files.
 A drawback is that the client can only use files if it has contact with the file
server. If the file server goes down, or the network connection is broken, then
the client loses access to the files.
The upload/download model -
 In the second model (The upload/download model), files are downloaded
from the server to the client. Modifications are performed directly at the
client after which the file is uploaded back to the server.
 It avoids generating network traffic for every operation. Also, a client can
potentially use a file even if it cannot access the file server.
 A drawback of performing operations locally and then sending an updated file
back to the server is that concurrent modification of a file by different clients
can cause problems.
44
NFS Architecture…

A client accesses the file system using the system calls provided by its local operating
system.
The local OS file system interface is replaced by an interface to the Virtual File System
(VFS), which is a standard for interfacing to different (distributed) file systems. The whole
idea of the VFS is to hide the differences between various file systems.
Operations on the VFS interface are either passed to a local file system, or passed to a
separate component known as the NFS Client, which takes care of handling access to
files stored at a remote server.
In NFS, all client-server communication is done through RPCs.
On the server side, the NFS server is responsible for handling incoming client requests.
The RPC stub unmarshals requests and the NFS server converts them to regular VFS file
operations that are subsequently passed to the VFS layer.
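A sketch of that dispatch idea; the mount-point test and the nfs_client and local_fs objects are illustrative assumptions, not the real VFS interface.

```python
def vfs_dispatch(path, operation, args, nfs_client, local_fs,
                 remote_prefix="/remote"):
    """Route a file operation to the local file system or the NFS client."""
    if path.startswith(remote_prefix):
        # Remote file: the NFS client turns the operation into an RPC.
        return nfs_client.call(operation, path, *args)
    # Local file: handled by the ordinary local file system code.
    return getattr(local_fs, operation)(path, *args)
```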
45
NFS Architecture…
 An important advantage of this scheme is that
NFS is largely independent of local file
systems.
 It really does not matter whether the operating
systems at the client or server implements a
UNIX file system, a Windows 2000 file
system, or even an old MS-DOS file system.
 The only important issue is that these file
systems conform to the file
system model offered by NFS.

46
File System Model

47
Communication

a) Reading data from a file in NFS version 3.
b) Reading data using a compound procedure in
version 4. With this procedure, several RPCs can be
grouped into a single request.

48
NFS. Characteristics
 Stateless server, so the user's identity and access rights must be checked
by the server on each request.
 In the local file system they are checked only on open()
 Every client request is accompanied by the user ID and group ID
 Server is exposed to imposter attacks unless the user ID and group ID are
protected by encryption
 Kerberos has been integrated with NFS to provide a stronger and more
comprehensive security solution
 Mount operation:
• mount(remotehost, remotedirectory, localdirectory)
 Server maintains a table of clients who have mounted file systems at that
server
 Each client maintains a table of mounted file systems holding:
< IP address, port number, file handle>

“A file handle is a number that the operating system assigns temporarily to a
file when it is opened. The handle is used throughout the session to access
the file. A special area of main memory is reserved for file handles, and
the size of this area determines how many files can be open at once.”
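A sketch of the client-side mount table described above; the address and handle bytes are made up, and real NFS file handles are opaque values chosen by the server.

```python
mount_table = {
    # local mount point: (server IP, port, file handle of the remote root)
    "/mnt/projects": ("192.0.2.10", 2049, b"\x00\x2a\x17\x9c"),
}

def resolve(path):
    """Map a local path to the remote file system it lives on, if any."""
    for mount_point, (ip, port, root_handle) in mount_table.items():
        if path == mount_point or path.startswith(mount_point + "/"):
            rest = path[len(mount_point):] or "/"
            return ip, port, root_handle, rest
    return None  # not under any NFS mount: a purely local path
```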
49
NFS Naming
 The fundamental idea underlying the NFS naming model is to
provide clients with completely transparent access to a remote file
system as maintained by a server.

 This transparency is achieved by letting a client mount a remote
file system into its own local file system.

 Instead of requiring clients to mount an entire file system, NFS
allows them to mount only part of a file system.

50
NFS Naming

Mounting (part of) a remote file system in NFS

51
NFS Naming

Mounting nested directories from multiple servers in NFS


An NFS server can itself mount directories that are exported by other servers. However, it is
not allowed to export those directories to its own clients.
Instead, a client will have to explicitly mount such a directory from the server that maintains it.
This restriction comes from simplicity: if a server could export a directory that it mounted from
another server, it would have to return special file handles that include an identifier for a server.
NFS does not support such file handles.
52
NFS. Automounting
 A problem with the NFS naming model is deciding when a remote file
system should be mounted.
 Consider a large system with thousands of users. Assume that each user
has a local directory /home that is used to mount the home directories of
other users.

 For example: Alice’s home directory may be locally available to her as
/home/Alice, although the actual files are stored on a remote server. This
directory can be automatically mounted when Alice logs into her workstation.

 Mounting every user’s home directory at login could incur a lot of
communication and administrative overhead. In addition, it would require
that all users be known in advance.

 On-demand mounting of a remote file system is handled in NFS by an
automounter, which runs as a separate process on the client machine. The
principle underlying an automounter is relatively simple.

53
Cont…
 Consider a simple automounter, and assume that
the home directories of all users are
available through the local directory /home.
 When a client machine boots, the automounter starts by
mounting this directory.
 The effect of this local mount is that whenever a
program attempts to access /home, the OS kernel
will forward a lookup operation to the NFS client,
which in this case will forward the request to the
automounter in its role as NFS server, as shown in
the figure below.
54
Cont…

55
Cont…
 The automounter first creates a subdirectory /alice in
/home. It then looks up the NFS server that exports
Alice’s home directory and subsequently mounts
that directory at /home/alice.
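A sketch of that behaviour, assuming a hypothetical directory.find(user) service that maps a user to the server exporting that home directory; real automounters are driven by mount maps instead.

```python
class Automounter:
    """Mount a user's home directory only when a lookup first touches it."""

    def __init__(self, directory, mount_fn):
        self.directory = directory   # hypothetical: user -> (server, export)
        self.mount_fn = mount_fn     # performs the actual NFS mount
        self.mounted = set()

    def lookup(self, path):
        parts = path.split("/")      # expect paths like /home/<user>/...
        if len(parts) >= 3 and parts[1] == "home":
            user = parts[2]
            if user not in self.mounted:
                server, export = self.directory.find(user)
                self.mount_fn(server, export, "/home/" + user)
                self.mounted.add(user)
        return path  # once mounted, normal NFS lookups take over
```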

56
NFS. File Attributes
 An NFS file has a number of associated attributes. In version
3, the set of attributes was fixed and every implementation
was expected to support those attributes. With version 4, the
set of file attributes has been split into a set of mandatory
attributes that every implementation must support, a set of
recommended attributes that should be preferably supported,
and an additional set of named attributes.

57
Cont…

58
Synchronization: File Locking in NFS

The lock operation is used to request a read or write lock on a consecutive range of bytes in a
file. If the lock cannot be granted due to another, conflicting lock, the client gets back an error
message and has to ask the server again at a later time. Alternatively, the client can request to
be put on a FIFO-ordered list maintained by the server: as soon as the conflicting lock has been
removed, the server will grant the next lock to the client at the top of the list.
Removing a lock from a file is done by means of the locku operation.
Using the renew operation, a client requests the server to renew the lease on its lock.
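A sketch of the server-side bookkeeping this implies. Ranges are half-open [start, end), every lock is treated as exclusive for brevity, and lease expiry is omitted.

```python
from collections import deque

class ByteRangeLocks:
    def __init__(self):
        self.granted = []        # (client, start, end) locks currently held
        self.waiting = deque()   # FIFO list of blocked lock requests

    def _conflicts(self, start, end):
        return any(s < end and start < e for _, s, e in self.granted)

    def lock(self, client, start, end):
        if self._conflicts(start, end):
            self.waiting.append((client, start, end))  # queue in FIFO order
            return "error: conflicting lock, request queued"
        self.granted.append((client, start, end))
        return "granted"

    def locku(self, client, start, end):
        self.granted.remove((client, start, end))      # release the lock
        # Grant the request at the top of the list if it no longer conflicts.
        if self.waiting and not self._conflicts(*self.waiting[0][1:]):
            self.granted.append(self.waiting.popleft())
```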

59
NFS Client Caching

A server may delegate some of its rights to a client when a file is opened.

An important consequence of delegating a file to a client is that the server needs
to be able to recall the delegation, for example,
when another client on a different machine needs to obtain access rights to
the file. Recalling a delegation requires that the server can do a callback to the
client.

60
NFS The security architecture

61
NFS. Access Control

62
NFS. Scalability
 The performance of a single server can be increased
by the addition of processors, disks and controllers.
 To improve performance further, additional servers
must be installed and the file systems must be
reallocated between them.

63
Andrew File System (AFS)

64
Introduction
Andrew File System (AFS) is a distributed file system
that was developed at Carnegie Mellon University (CMU).

The design of AFS reflects an intention to support information
sharing on a large scale by minimizing client-server
communication.

It is designed to be heterogeneous and scalable, and it runs
efficiently on variants of UNIX.

Main Goal:
AFS provides scalability to thousands of workstations at one
site while offering users, applications & administrators the
convenience of a shared file system.

65
AFS Design
Two design characteristics:

• Whole-file serving

• Whole-file caching

66
AFS Design (cont.)

Whole-file serving: the entire contents of directories and
files are transmitted to client computers by AFS servers.

67
AFS Design (cont.)
Whole-file caching: once a copy of a file or a chunk has been
transferred to a client computer, it is stored
in a cache on the local disk. The cache
contains several hundred of the files most
recently used on that computer. The cache
is permanent, surviving reboots of the client
computer. Local copies of files are used to
satisfy clients’ open requests in preference to
remote copies whenever possible.

68
Implementation

AFS is implemented as two software
components that exist as UNIX processes
called Vice and Venus.

• Vice: the name given to the server
software that runs as a user-level
UNIX process in each server computer.

• Venus: a user-level process that runs in
each client computer and corresponds
to the client module.

69
Distribution of processes in AFS: each workstation runs user
programs and a Venus process on top of a UNIX kernel; each
server runs a Vice process; workstations and servers
communicate over the network.
70
Implementation (cont.)
The files available to user processes
running on workstations are either local or
shared.
• Local files are handled as normal UNIX
files, stored on a workstation’s disk and
are available only to local user
processes.

• Shared files are stored on servers, and
copies of them are cached on the local
disks of workstations.
71
Implementation (cont.)
The UNIX kernel in each workstation and
server is a modified version of BSD UNIX.

The modifications are designed to intercept
open, close and some other file system
calls when they refer to files in the shared
name space, and pass them to the Venus
process in the client computer.

Venus manages the cache, removing the
least recently used files when a new file is
acquired from a server, to make the
required space if the partition is full.
72
Implementation (cont.)
The workstation cache is usually large
enough to accommodate several hundred
average-sized files, rendering the
workstation largely independent of the Vice
servers once a working set of the current
user’s files and frequently used system files
has been cached.

73
Implementation (cont.)
• User programs use conventional UNIX
pathnames to refer to files, but AFS uses file
identifiers (fids) in the communication between
the Venus and Vice processes.

• The Vice servers accept requests only in terms
of fids.

• Venus translates the pathnames supplied by
clients into fids using a step-by-step lookup to
obtain the information from the file directories
held in Vice servers.
74
open(FileName, mode)
  UNIX kernel: if FileName refers to a file in shared file space, pass
  the request to Venus.
  Venus: check the list of files in the local cache. If the file is not
  present, or there is no valid callback promise, send a request for the
  file to the Vice server that is the custodian of the volume containing
  the file.
  Vice: transfer a copy of the file and a callback promise to the
  workstation. Log the callback promise.
  Venus: place the copy of the file in the local file system, enter its
  local name in the local cache list and return the local name to UNIX.
  UNIX kernel: open the local file and return the file descriptor to the
  application.

read(FileDescriptor, Buffer, length)
  UNIX kernel: perform a normal UNIX read operation on the local copy.

write(FileDescriptor, Buffer, length)
  UNIX kernel: perform a normal UNIX write operation on the local copy.

close(FileDescriptor)
  UNIX kernel: close the local copy and notify Venus that the file has
  been closed.
  Venus: if the local copy has been changed, send a copy to the Vice
  server that is the custodian of the file.
  Vice: replace the file contents and send a callback to all other
  clients holding callback promises on the file.

75
Implementation of file system calls in AFS
Cache Consistency Mechanisms

• Files and directories are cached on the local
disks of clients using a consistency mechanism
based on callbacks.

• When Vice supplies a copy of a file to a Venus
process, it also provides a callback promise,
which is a token issued by the Vice server.

• The callback promises are stored with the
cached files on the workstation disks and have
two states: valid and cancelled.

76
Cache Consistency Mechanisms (cont.)

 If the token is cancelled, then a fresh copy of
the file must be fetched from the Vice server; but
if the token is valid, then the cached copy can
be opened and used without reference to Vice.

 When a workstation is restarted after a failure or a
shutdown, Venus must send a cache validation request,
containing the file modification timestamp, to the server
for each file with a valid token.
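A sketch of Venus's decision on open(), given the two token states; vice.fetch is a hypothetical stand-in for the real Vice RPC.

```python
from dataclasses import dataclass

@dataclass
class CacheEntry:
    local_copy: bytes
    promise: str          # "valid" or "cancelled"

def venus_open(path, cache, vice):
    entry = cache.get(path)
    if entry is not None and entry.promise == "valid":
        return entry.local_copy   # no communication with Vice is needed
    # Token cancelled, or file not cached: fetch a fresh copy; Vice logs
    # a new callback promise for this workstation.
    data = vice.fetch(path)
    cache[path] = CacheEntry(data, "valid")
    return data
```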

77
Management Issues

Overview of other aspects of AFS: UNIX kernel modifications,
bulk transfers, economy, reliability, security, availability,
threads, location database, read-only replicas, partial file
caching, wide-area support, performance.
78
 Management Issues
AFS has the ability to move volumes between file
servers for load-balancing purposes. It also has the
ability to dump a snapshot of a volume to tape. The
dump-volume functionality supports three modes of
operation:
 dumping the entire volume,
 dumping only the parts of the volume that reside on the local
disk, and
 dumping only the volume header files.

79
 Security
AFS uses Kerberos authentication to enforce
security without inhibiting legitimate uses of
the system.

80
 Performance

Whole-file caching and the callback
protocol led to dramatically reduced loads
on the server.

81
  Availability

AFS is available for most Berkeley-derived
UNIX variants including Ultrix, SunOS,
AIX, and HP/UX.

82
 Reliability
As servers go down, only non-replicated
portions of the AFS filespace become
inaccessible, leaving the rest of the file
system intact (undamaged). Replicated
data is automatically fetched from one of
the remaining servers. The clients
automatically balance requests between
all the servers containing a particular file.

83
 Bulk Transfers

AFS transfers files between clients and
servers in 64-kilobyte chunks. The use of
such a large packet size is an important
aid to performance, minimizing the effect
of network latency. Thus the design of AFS
enables the use of the network to be
optimized.

84
 UNIX Kernel Modifications

The UNIX kernel in AFS hosts is altered so
that Vice can perform file operations in
terms of file handles instead of the
conventional UNIX file descriptors.

85
 Location database

Every server contains a copy of a
fully replicated location database
giving a mapping of volume names to
servers.

86
 Threads

The implementations of Vice and Venus
make use of a threads package to enable
requests to be processed concurrently at
both the client (where several user
processes may have file access requests in
progress concurrently) and at the server.

87
 Read-only replicas

Volumes containing files that are
frequently read but rarely modified, such
as the UNIX /bin and /usr/bin directories
of system commands and the /man directory
of manual pages, can be replicated as
read-only volumes at several servers.

88
 Partial File Caching

AFS allows file data to be transferred
and cached in 64-kbyte blocks while still
retaining the consistency semantics and
other features of the AFS protocol.

89
 Wide-area support

AFS supports multiple administrative cells,
each with its own servers, clients, system
administrators and users. Each cell is a
completely autonomous environment, but
a federation of cells can cooperate in
presenting users with a uniform, seamless
file name space.

90
CODA File System

91
Introduction
 Coda was developed at Carnegie Mellon University (CMU)
in the 1990s.
 Coda descends from version 2 of the Andrew File System (AFS),
which was also developed at CMU.
 Coda is integrated with a number of UNIX-based operating
systems such as LINUX.
 The Coda file system allows a client to continue operation despite
being disconnected from a server.
 Coda was designed to be a scalable and highly available
distributed file system.

 An important goal was to achieve a high degree of naming and
location transparency, so that the system would appear to its users
very similar to a pure local file system.

92
Overall Organization Of AFS

Virtue workstations give users and processes access to the file
system; the centrally administered Vice servers maintain the
collection of files.
93
Continue…
 Venus
 Every Virtue workstation hosts a user-level process called
Venus, which is similar to an NFS client.
 It is responsible for providing access to the files that are
maintained by the Vice file servers.
 Venus is also responsible for allowing the client to continue
operation even if access to the file servers is impossible.

94
Internal Organization Of a Virtue
Workstation
1. A client application invokes an operation on a file.
2. The VFS layer intercepts all calls from client applications.
3. The VFS forwards each call either to the local file system or to
Venus.
4. Venus communicates with the Vice file servers using RPC.
95
Communication
 Interprocess communication in Coda is performed using RPC2.

 Each time a remote procedure is called, the RPC2 client starts a
new thread that sends an invocation request to the server and
blocks until it receives an answer.

 While the request is being processed, the server sends back messages
to the client to let it know it is still working.

 If the server dies, the thread will report the failure back to the
calling application.

 RPC2 supports side effects.

96
Side Effect in Coda’s RPC2 system

97
Side Effect
 A side effect is a mechanism by which the client and server can
communicate using an application-specific protocol.
 Example:
 A client opening a file at a video server.
 RPC2 allows the client and server to set up a separate
connection for transferring the video data to the client on time.
 Connection setup is done as a side effect of an RPC call.
 The RPC2 runtime system provides an interface of side-effect
routines.
 There are routines for setting up a connection and routines
for transferring data.
 These routines are automatically called by the RPC2 runtime
system at the client and server.

98
RPC2 Support Multicasting

a) Sending an invalidation message one at a time.


b) Sending invalidation messages in parallel.
99
Naming
 Coda provides a globally shared name space that is
maintained by the Vice servers.
 Clients have access to this name space through a special
subdirectory in their local name space, such as /afs.
 Files are grouped into units referred to as volumes.
 A volume corresponds to a partial subtree in the shared name
space as maintained by the Vice servers.
– A volume is the basic unit of mounting
– Namespace: /afs/filename
– Name lookup can cross mount points: support for detecting
crossing and automounts
 This supports a high degree of naming transparency.

100
Continue…

101
File Identifiers
 The collection of files may be replicated and distributed
across multiple Vice servers.
 Each file in Coda is contained in exactly one volume.
 A volume may be replicated across several servers.
 Logical volume: represents a possibly replicated physical
volume and is associated with a Replicated Volume Identifier
(RVID).
 Each physical volume has its own Volume Identifier (VID).
 Coda assigns each file a 96-bit file identifier:
a 32-bit RVID followed by a 64-bit file handle.
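A sketch of that layout using Python's struct module: a big-endian 32-bit RVID followed by a 64-bit file handle packs into 12 bytes, i.e. 96 bits.

```python
import struct

def pack_fid(rvid: int, file_handle: int) -> bytes:
    return struct.pack(">IQ", rvid, file_handle)   # 4 + 8 bytes = 96 bits

def unpack_fid(fid: bytes):
    return struct.unpack(">IQ", fid)               # -> (rvid, file_handle)

fid = pack_fid(42, 7)
assert len(fid) * 8 == 96
assert unpack_fid(fid) == (42, 7)
```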

102
Continue…
Locating a file from its identifier:
1. The client passes the RVID part of the file identifier to the
volume replication database, which returns the list of VIDs
associated with that RVID.
2. With a VID, the client looks up the server that is currently
hosting that particular replica of the logical volume; the lookup
returns the current location of that specific physical volume.
The 64-bit file handle then identifies the file within the volume.
103
Sharing Files in Coda

104
Client Caching
 Cache consistency in Coda is maintained by callbacks.
 The server keeps track of which clients have a copy of a file
cached locally.
 The server records a callback promise for each such client.
 When a client updates its local copy of the file, it notifies
the server.
 The server then sends an invalidation message to the other clients.
 This invalidation message is called a callback break.
 The server discards the callback promise it held for each
client to which it just sent an invalidation.
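A sketch of the server-side bookkeeping behind callback promises and callback breaks; client.callback_break is a hypothetical notification RPC, and storage is reduced to a dict.

```python
class ViceServer:
    def __init__(self):
        self.files = {}      # path -> current file contents
        self.promises = {}   # path -> set of clients holding a promise

    def fetch(self, client, path):
        self.promises.setdefault(path, set()).add(client)  # record promise
        return self.files[path]

    def update(self, writer, path, data):
        self.files[path] = data
        # Callback break: invalidate every other client's cached copy.
        for client in self.promises.get(path, set()) - {writer}:
            client.callback_break(path)
        self.promises[path] = {writer}   # broken promises are discarded
```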

105
Client Caching

106
SUMMARY
 Introduction to distributed file system
 NFS
 AFS
 CODA

107
