Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Semantic-Aware Metadata Organization Paradigm in Next-Generation File Systems

Published: 01 February 2012 Publication History

Abstract

Existing data storage systems based on the hierarchical directory-tree organization do not meet the scalability and functionality requirements for exponentially growing data sets and increasingly complex metadata queries in large-scale, Exabyte-level file systems with billions of files. This paper proposes a novel decentralized semantic-aware metadata organization, called SmartStore, which exploits semantics of files' metadata to judiciously aggregate correlated files into semantic-aware groups by using information retrieval tools. The key idea of SmartStore is to limit the search scope of a complex metadata query to a single or a minimal number of semantically correlated groups and avoid or alleviate brute-force search in the entire system. The decentralized design of SmartStore can improve system scalability and reduce query latency for complex queries (including range and top-k queries). Moreover, it is also conducive to constructing semantic-aware caching, and conventional filename-based point query. We have implemented a prototype of SmartStore and extensive experiments based on real-world traces show that SmartStore significantly improves system scalability and reduces query latency over database approaches. To the best of our knowledge, this is the first study on the implementation of complex queries in large-scale file systems.

Cited By

View all
  • (2018)H2CloudProceedings of the 47th International Conference on Parallel Processing10.1145/3225058.3225083(1-10)Online publication date: 13-Aug-2018
  • (2018)Scalable Metadata Management Techniques for Ultra-Large Distributed Storage Systems -- A Systematic ReviewACM Computing Surveys10.1145/321268651:4(1-37)Online publication date: 31-Jul-2018
  • (2017)A workload-aware flash translation layer enhancing performance and lifespan of TLC/SLC dual-mode flash memory in embedded systemsMicroprocessors & Microsystems10.1016/j.micpro.2016.12.00952:C(343-354)Online publication date: 1-Jul-2017
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image IEEE Transactions on Parallel and Distributed Systems
IEEE Transactions on Parallel and Distributed Systems  Volume 23, Issue 2
February 2012
190 pages

Publisher

IEEE Press

Publication History

Published: 01 February 2012

Author Tags

  1. File systems
  2. metadata management
  3. performance evaluation.
  4. scalability

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 29 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2018)H2CloudProceedings of the 47th International Conference on Parallel Processing10.1145/3225058.3225083(1-10)Online publication date: 13-Aug-2018
  • (2018)Scalable Metadata Management Techniques for Ultra-Large Distributed Storage Systems -- A Systematic ReviewACM Computing Surveys10.1145/321268651:4(1-37)Online publication date: 31-Jul-2018
  • (2017)A workload-aware flash translation layer enhancing performance and lifespan of TLC/SLC dual-mode flash memory in embedded systemsMicroprocessors & Microsystems10.1016/j.micpro.2016.12.00952:C(343-354)Online publication date: 1-Jul-2017
  • (2017)`MaaS'Journal of Intelligent Manufacturing10.1007/s10845-015-1076-y28:8(1871-1891)Online publication date: 1-Dec-2017
  • (2015)A Proximity-Aware Interest-Clustered P2P File Sharing SystemIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2014.232703326:6(1509-1523)Online publication date: 1-Jun-2015
  • (2015)Multi-Granularity Locality-Sensitive Bloom FilterIEEE Transactions on Computers10.1109/TC.2015.240101164:12(3500-3514)Online publication date: 1-Dec-2015
  • (2014)FASTProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis10.1109/SC.2014.67(754-765)Online publication date: 16-Nov-2014

View Options

View options

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media