Distributed File System Overview
The Andrew File System (AFS) differs from traditional DFS implementations by emphasizing whole-file caching and location independence. In AFS, when a file is opened, the entire file is cached on the local disk, allowing subsequent accesses to be handled locally without frequent server communication. This reduces server load and improves access speed compared to traditional DFS designs that may contact the server on each access. AFS identifies files by a unique, location-independent identifier (fid), which makes naming independent of machine location and supports file mobility. Consistency is managed by writing modified files back to the server when they are closed, which differs from systems that propagate updates in real time.
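The sketch below illustrates that open, read/write locally, update-on-close flow in miniature; the FileServer and AfsClient classes and the in-memory dictionaries are assumptions made for illustration, not actual AFS interfaces.

```python
# AFS-style whole-file caching sketch: fetch the entire file on open, serve
# reads/writes from the local copy, push changes back only on close.

class FileServer:
    def __init__(self):
        self.files = {}          # fid -> file contents (master copy)

    def fetch(self, fid):
        return self.files[fid]   # ship the whole file to the client

    def store(self, fid, data):
        self.files[fid] = data   # replace the master copy on close


class AfsClient:
    def __init__(self, server):
        self.server = server
        self.cache = {}          # fid -> locally cached copy
        self.dirty = set()       # fids modified since open

    def open(self, fid):
        self.cache[fid] = self.server.fetch(fid)   # whole-file transfer

    def read(self, fid):
        return self.cache[fid]                     # served locally, no server call

    def write(self, fid, data):
        self.cache[fid] = data                     # local update only
        self.dirty.add(fid)

    def close(self, fid):
        if fid in self.dirty:                      # propagate changes on close
            self.server.store(fid, self.cache[fid])
            self.dirty.discard(fid)
        del self.cache[fid]


server = FileServer()
server.files["fid-001"] = b"hello"
client = AfsClient(server)
client.open("fid-001")
client.write("fid-001", b"hello, AFS")
client.close("fid-001")                            # the change reaches the server here
print(server.files["fid-001"])                     # b'hello, AFS'
```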
Caching improves DFS performance by reducing network traffic and speeding up data access through local storage of frequently accessed disk blocks. Repeated accesses can then be handled locally, drastically decreasing the time needed to fetch data compared to going to the remote service each time. However, caching complicates system management because of the cache consistency problem: ensuring that client-cached data remains consistent with the server's master copy, especially in environments with frequent writes. Factors influencing caching effectiveness include the size of the cached units, the cache location (memory or disk), the cache update policy (e.g., write-through or delayed write), and the frequency and pattern of data accesses.
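As a rough illustration of the update-policy trade-off mentioned above, the following sketch contrasts a write-through cache with a delayed-write (write-back) cache; both classes and the dictionary standing in for the server are hypothetical.

```python
# Write-through vs. delayed-write block caches, illustrated with a plain dict
# playing the role of the server's storage.

class WriteThroughCache:
    """Every write goes to the server immediately; the cache never holds dirty data."""
    def __init__(self, server):
        self.server = server
        self.blocks = {}

    def write(self, block_id, data):
        self.blocks[block_id] = data
        self.server[block_id] = data      # synchronous update: reliable but slower

    def read(self, block_id):
        if block_id not in self.blocks:
            self.blocks[block_id] = self.server[block_id]
        return self.blocks[block_id]


class DelayedWriteCache:
    """Writes stay local and are flushed later, trading reliability for speed."""
    def __init__(self, server):
        self.server = server
        self.blocks = {}
        self.dirty = set()

    def write(self, block_id, data):
        self.blocks[block_id] = data
        self.dirty.add(block_id)          # the server copy is now stale

    def flush(self):
        for block_id in self.dirty:       # e.g. run periodically or on close
            self.server[block_id] = self.blocks[block_id]
        self.dirty.clear()


server = {}
wt = WriteThroughCache(server)
wt.write("b1", b"data")                   # the server sees this write immediately
wb = DelayedWriteCache(server)
wb.write("b2", b"data")                   # the server does not see this until flush()
wb.flush()
```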
The Network File System (NFS) illustrates remote file access techniques by implementing location-transparent file access through a client/server architecture in which clients access files over a network as if they were local. NFS provides a mount protocol that seamlessly integrates remote directories into the client's directory structure, promoting transparent access. NFS uses a write-through cache policy in which changes are immediately communicated back to the server, ensuring reliability but potentially at the cost of performance due to network latency. This design allows consistent and reliable file access even when files reside on remote servers, balancing transparency with performance.
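A toy sketch of the mount idea follows: paths under a mounted prefix are transparently resolved against a remote server, while other paths stay local. The RemoteServer and ClientFileSystem classes are illustrative stand-ins, not the actual NFS mount or RPC protocols.

```python
# Mounting a remote directory into the local namespace: the caller uses an
# ordinary local-looking path and never sees which server answers the request.

class RemoteServer:
    def __init__(self, exported_files):
        self.exported_files = exported_files      # path on the server -> contents

    def read(self, remote_path):
        return self.exported_files[remote_path]


class ClientFileSystem:
    def __init__(self):
        self.local_files = {}
        self.mounts = {}                          # local prefix -> (server, remote prefix)

    def mount(self, local_prefix, server, remote_prefix):
        self.mounts[local_prefix] = (server, remote_prefix)

    def read(self, path):
        for prefix, (server, remote_prefix) in self.mounts.items():
            if path.startswith(prefix + "/"):
                # transparent redirection to the remote server
                remote_path = remote_prefix + path[len(prefix):]
                return server.read(remote_path)
        return self.local_files[path]


server = RemoteServer({"/export/home/alice/notes.txt": b"remote data"})
fs = ClientFileSystem()
fs.mount("/mnt/shared", server, "/export/home/alice")
print(fs.read("/mnt/shared/notes.txt"))           # b'remote data', via a local-looking path
```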
Stateful DFS services maintain information about client sessions, such as open files and connection identifiers, which improves performance because clients can refer to a compact handle and the server can keep files open across operations. However, all of that state is lost on a crash, requiring complex recovery. Stateless services keep no state information, enabling easier recovery after a crash, but every client request must be self-contained, carrying the full file name and offset rather than a short handle established at open time, which can reduce performance. Thus, stateful services tend to offer better performance but lower fault tolerance, whereas stateless services are more robust at the cost of performance.
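The following sketch contrasts the two designs under those assumptions: a stateful server keeps an open-file table that a crash wipes out, while a stateless server answers each self-contained request with no memory of previous ones. All class and method names are hypothetical.

```python
# Stateful vs. stateless file service in miniature.

class StatefulServer:
    def __init__(self, files):
        self.files = files
        self.open_table = {}                     # handle -> [filename, offset]
        self.next_handle = 0

    def open(self, filename):
        handle = self.next_handle
        self.next_handle += 1
        self.open_table[handle] = [filename, 0]  # the server remembers the cursor
        return handle

    def read(self, handle, nbytes):
        filename, offset = self.open_table[handle]
        data = self.files[filename][offset:offset + nbytes]
        self.open_table[handle][1] = offset + len(data)
        return data

    def crash(self):
        self.open_table.clear()                  # all per-client state is lost


class StatelessServer:
    def __init__(self, files):
        self.files = files                       # no per-client state at all

    def read(self, filename, offset, nbytes):    # each request carries full details
        return self.files[filename][offset:offset + nbytes]


files = {"notes.txt": b"abcdef"}
sf = StatefulServer(files)
h = sf.open("notes.txt")
print(sf.read(h, 3))                             # b'abc'; the server tracked the offset
sf.crash()                                       # the handle h is now meaningless

sl = StatelessServer(files)
print(sl.read("notes.txt", 3, 3))                # b'def'; nothing to lose in a crash
```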
Dynamic file mobility in the Andrew DFS impacts security and management by allowing files to be moved across servers without changing their unique identifiers (fids), which supports load balancing and efficient resource use but complicates security management. Such mobility requires robust authentication and access controls to ensure that moving a file does not open a path to unauthorized access; AFS addresses this with Kerberos-based authentication. Management complexity also increases because dynamic movement entails tracking file locations in real time, requiring tools for administrative oversight and scalable location tracking so that files remain accessible and protected throughout migration.
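One way to picture the management side is a location database that maps each fid to its current server and is updated on migration, with an access check standing in for real authentication (Kerberos in AFS). The sketch below uses hypothetical names and is not an actual AFS mechanism.

```python
# Location tracking for mobile files: the fid never changes, only its record
# in the location database; access control is enforced regardless of where
# the file currently lives.

class LocationDatabase:
    def __init__(self):
        self.where = {}                 # fid -> current server name

    def register(self, fid, server):
        self.where[fid] = server

    def migrate(self, fid, new_server):
        self.where[fid] = new_server    # fid unchanged; only the location record moves

    def locate(self, fid):
        return self.where[fid]


class AccessController:
    def __init__(self, acl):
        self.acl = acl                  # fid -> set of authorized users

    def check(self, user, fid):
        if user not in self.acl.get(fid, set()):
            raise PermissionError(f"{user} may not access {fid}")


locdb = LocationDatabase()
acl = AccessController({"fid-42": {"alice"}})
locdb.register("fid-42", "server-A")
locdb.migrate("fid-42", "server-B")     # load balancing: move the file
acl.check("alice", "fid-42")            # permissions still enforced after the move
print(locdb.locate("fid-42"))           # server-B, same fid as before
```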
A Distributed File System (DFS) manages file transparency through location transparency and location independence. Location transparency ensures that the name of a file does not give any hint about its physical storage location, which makes sharing data more convenient by hiding the distribution of files across multiple machines. Location independence means that the file's name does not need to change if its physical storage location changes, promoting better abstraction and separation of the naming and storage hierarchies. The benefits include easier file sharing, improved abstraction, and simplified system management, as users interact with the file system as if all files were local even though they are distributed across different machines.
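A minimal sketch of the two properties, assuming a simple two-level mapping (path to fid, fid to server), appears below; both tables are illustrative only.

```python
# Location transparency: the path reveals nothing about where the file lives.
# Location independence: the file can move without the path (or fid) changing.

name_to_fid = {"/projects/report.txt": "fid-7"}   # naming hierarchy
fid_to_server = {"fid-7": "server-A"}             # storage hierarchy

def resolve(path):
    fid = name_to_fid[path]                       # the name itself carries no location
    return fid, fid_to_server[fid]

print(resolve("/projects/report.txt"))            # ('fid-7', 'server-A')

# The file migrates: only the storage mapping changes, the name stays the same.
fid_to_server["fid-7"] = "server-B"
print(resolve("/projects/report.txt"))            # ('fid-7', 'server-B')
```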
Cache consistency policies significantly affect the design and operation of a DFS by determining how well client-side data stays synchronized with server data. One strategy is client-initiated consistency checking, where the client periodically queries the server to verify data validity, often on file open or at set intervals. Server-initiated approaches instead have the server notify clients when their cached data changes, which requires more complex mechanisms to ensure all clients receive updates promptly. Disabling caching during write operations is another way to guarantee consistency. These strategies influence performance: tighter coherence control can reduce scalability and increase latency but ensures reliable data consistency.
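The following sketch shows a client-initiated check in its simplest form: before using a cached copy, the client compares a version number with the server and refetches only when stale. The Server and CachingClient classes, and the use of version numbers rather than timestamps, are assumptions for illustration.

```python
# Client-initiated consistency check on open: a cheap version query decides
# whether the cached copy can be reused or must be refetched.

class Server:
    def __init__(self):
        self.files = {}                # name -> (version, data)

    def get_version(self, name):
        return self.files[name][0]     # cheap validity check

    def fetch(self, name):
        return self.files[name]        # full transfer only when needed


class CachingClient:
    def __init__(self, server):
        self.server = server
        self.cache = {}                # name -> (version, data)

    def open(self, name):
        server_version = self.server.get_version(name)
        cached = self.cache.get(name)
        if cached is None or cached[0] != server_version:
            self.cache[name] = self.server.fetch(name)   # stale or missing: refetch
        return self.cache[name][1]


server = Server()
server.files["a.txt"] = (1, b"v1")
client = CachingClient(server)
print(client.open("a.txt"))            # fetches b'v1'
server.files["a.txt"] = (2, b"v2")     # the server copy changes
print(client.open("a.txt"))            # the check detects version 2 and refetches b'v2'
```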
A single global name structure in DFS provides a location-independent approach where all files are part of one unified naming hierarchy, simplifying cross-system file access and management. This simplifies administrative tasks and improves user interaction by presenting a coherent namespace without needing explicit mounts. However, the disadvantages include potential scalability issues, as system complexity can increase with the number of files and systems in the namespace. Additionally, maintaining global consistency can be challenging, especially in large and dynamic environments where files frequently change location or ownership.
Location independence in DFS enhances system performance and flexibility by allowing files to be relocated across the network without altering their names. This separation of the naming and storage hierarchies improves abstraction, enabling easier file migration to balance loads or accommodate changes in system architecture. It allows better resource utilization and flexibility in managing storage, as administrators can optimize storage locations dynamically without interrupting user access. Moreover, it facilitates scaling, as adding new storage or rebalancing existing storage can occur seamlessly, increasing overall system efficiency.
File replication enhances availability and performance in DFS by duplicating files across multiple machines, which improves access speed and provides redundancy so that a failure of one machine does not prevent access to the file. Availability improves because multiple copies on independent machines provide failover capability. The primary challenge introduced, however, is maintaining consistency among the replicas: when one copy is modified, all other copies must reflect that change, which is difficult to manage, especially if atomic and serialized invalidation is not guaranteed.
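As a rough sketch of that consistency challenge, the code below propagates each write to every replica and invalidates any replica it cannot update so a stale copy is never served; the classes and the best-effort fallback are illustrative assumptions, not a real replication protocol.

```python
# Replication with write propagation: a write must reach every replica, and a
# replica that cannot be updated is invalidated rather than left stale.

class Replica:
    def __init__(self, name):
        self.name = name
        self.data = {}
        self.valid = set()

    def store(self, key, value):
        self.data[key] = value
        self.valid.add(key)

    def invalidate(self, key):
        self.valid.discard(key)        # stop serving a stale copy


class ReplicatedFile:
    def __init__(self, replicas):
        self.replicas = replicas

    def write(self, key, value):
        for r in self.replicas:
            try:
                r.store(key, value)    # ideally atomic across all copies
            except Exception:
                r.invalidate(key)      # fall back: never serve stale data

    def read(self, key):
        for r in self.replicas:        # any valid replica can serve the read
            if key in r.valid:
                return r.data[key]
        raise KeyError(key)


group = ReplicatedFile([Replica("A"), Replica("B"), Replica("C")])
group.write("report", b"v1")
print(group.read("report"))            # b'v1' from any available replica
```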