DOI: 10.1145/1374596.1374604
Research article

GIGA+: scalable directories for shared file systems

Published: 11 November 2007

Abstract

High-performance computing (HPC) clusters with thousands of compute nodes are increasingly common and, with the advent of multi-core CPUs, will pose a significant challenge for storage systems: scaling to handle the I/O generated by applications running in parallel across tens of thousands of threads. One such challenge is building scalable directories for cluster storage, i.e., directories that can store billions to trillions of entries and handle hundreds of thousands of operations per second.

Published In

PDSW '07: Proceedings of the 2nd international workshop on Petascale data storage: held in conjunction with Supercomputing '07
November 2007
72 pages
ISBN: 9781595938992
DOI: 10.1145/1374596
Conference Chair: Garth A. Gibson

Publisher

Association for Computing Machinery

New York, NY, United States

Conference

SC '07
Acceptance Rates

Overall Acceptance Rate 17 of 41 submissions, 41%
