The zebra striped network file system

November 1995

Author:
John Henry Hartman
Univ. of California, Berkeley

Publisher:

University of California at Berkeley
Computer Science Division 571 Evans Hall Berkeley, CA
United States

Order Number:UMI Order No. GAX95-29334

Bibliometrics

Abstract

This dissertation presents a new network file system, called Zebra, that provides high performance file access and is highly available. Zebra stripes file data across its servers, so that multiple servers may participate in a file access and the file access bandwidth therefore scales with the number of servers. Zebra is also highly available because it stores parity information in the style of a RAID (Patterson88) disk array; this increases storage costs slightly but allows the system to continue operation even while a single storage server is unavailable.

Zebra is different from other striped network file systems in the way in which it stripes data. Instead of striping individual files (file-based striping), Zebra forms the data written by each client into an append-only log, which is then striped across the servers. In addition, the parity of each log is computed and stored as the log is striped. I call this form of striping log-based striping, and its operation is similar to that of a log-structured file system (LFS) (Rosenblum91). Zebra can be thought of as a log-structured network file system: whereas LFS uses a log abstraction at the interface between a file server and its disks, Zebra uses a log abstraction at the interface between a client and its servers. Striping logs, instead of files, simplifies Zebra's parity mechanism, reduces parity overhead, and allows clients to batch together small writes.

I have built a prototype implementation of Zebra in the Sprite operating system (Ousterhout88). Measurements of the prototype show that Zebra provides 4-5 times the throughput of the standard Sprite file system or NFS for large files, and a 15-300% improvement for writing small files. The utilizations of the system resources indicate that the prototype can scale to support a maximum aggregate write bandwidth of 20 Mbytes/second, or about ten clients writing at their maximum rate.

Cited By

Chang F and Gibson G Automatic I/O hint generation through speculative execution Proceedings of the third symposium on Operating systems design and implementation, (1-14)

Contributors

John Henry Hartman
The University of Arizona
- Publication Years1991 - 2021
- Publication counts32
- Citation count1,338
- Available for Download13
- Downloads (cumulative)15,047
- Downloads (12 months)1,184
- Downloads (6 weeks)206
- Average Downloads per Article1,157
- Average Citation per Article42
View Full Profile

Index Terms

The zebra striped network file system
1. Networks
  1. Network architectures
  2. Network protocols
2. Software and its engineering
  1. Software organization and properties
    1. Contextual software domains
      1. Operating systems
        File systems management

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Recommendations

The Zebra striped network file system

Zebra is a network file system that increases throughput by striping the file data across multiple servers. Rather than striping each file separately, Zebra forms all the new data from each client into a single stream, which it then stripes using an ...
The Zebra striped network file system

Zebra is a network file system that increases throughput by striping file data across multiple servers. Rather than striping each file separately, Zebra forms all the new data from each client into a single stream, which it then stripes using an ...
The Zebra striped network file system
SOSP '93: Proceedings of the fourteenth ACM symposium on Operating systems principles

Zebra is a network file system that increases throughput by striping file data across multiple servers. Rather than striping each file separately, Zebra forms all the new data from each client into a single stream, which it then stripes using an ...

Browse Theses

Sections

Cited By

Index Terms