Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1629575.1629577acmconferencesArticle/Chapter ViewAbstractPublication PagessospConference Proceedingsconference-collections
research-article

FAWN: a fast array of wimpy nodes

Published: 11 October 2009 Publication History

Abstract

This paper presents a new cluster architecture for low-power data-intensive computing. FAWN couples low-power embedded CPUs to small amounts of local flash storage, and balances computation and I/O capabilities to enable efficient, massively parallel access to data.
The key contributions of this paper are the principles of the FAWN architecture and the design and implementation of FAWN-KV--a consistent, replicated, highly available, and high-performance key-value storage system built on a FAWN prototype. Our design centers around purely log-structured datastores that provide the basis for high performance on flash storage, as well as for replication and consistency obtained using chain replication on a consistent hashing ring. Our evaluation demonstrates that FAWN clusters can handle roughly 350 key-value queries per Joule of energy--two orders of magnitude more than a disk-based system.

References

[1]
Flexible I/O Tester. http://freshmeat.net/projects/fio/.
[2]
M. Al-Fares, A. Loukissas, and A. Vahdat. A scalable, commodity, data center network architecture. In Proc. ACM SIGCOMM, Aug. 2008.
[3]
L.A. Barroso and U. Hölzle. The case for energy-proportional computing. phComputer, 40 (12): 33--37, 2007.
[4]
BerkeleyDB Reference Guide. Memory-only or Flash configurations. http://www.oracle.com/technology/documentation/berkeley-db/db/ref/program/ram.html.
[5]
W. Bowman, N. Cardwell, C. Kozyrakis, C. Romer, and H. Wang. Evaluation of existing architectures in IRAM systems. In Workshop on Mixing Logic and DRAM, 24th International Symposium on Computer Architecture, June 1997.
[6]
A.M. Caulfield, L.M. Grupp, and S. Swanson. Gordon: Using flash memory to build fast, power-efficient clusters for data-intensive applications. In 14th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS'09), Mar. 2009.
[7]
J.S. Chase, D. Anderson, P. Thakar, A. Vahdat, and R. Doyle. Managing energy and server resources in hosting centers. In Proc. 18th ACM Symposium on Operating Systems Principles (SOSP), Oct. 2001.
[8]
H. Dai, M. Neufeld, and R. Han. ELF: an efficient log-structured flash file system for micro sensor nodes. In Proceedings of the ACM Conference on Embedded Networked Sensor Systems (SenSys), Nov. 2004.
[9]
J. Dean and S. Ghemawat. MapReduce: Simplified data processing on large clusters. In Proc. 6th USENIX OSDI, Dec. 2004.
[10]
G. DeCandia, D. Hastorun, M. Jampani, G. Kakulapati, A. Lakshman, A. Pilchin, S. Sivasubramanian, P. Vosshall, and W. Vogels. Dynamo: Amazon's highly available key-value store. In Proc. 21st ACM Symposium on Operating Systems Principles (SOSP), Oct. 2007.
[11]
Dell XS11-VX8. Dell fortuna. "http://www1.euro.dell.com/content/topics/topic.aspx/emea/corporate/pre%ssoffice/2009/uk/en/2009_05_20_brk_000?c=uk&l=en", 2009.
[12]
F. Douglis, F. Kaashoek, B. Marsh, R. Caceres, K. Li, and J. Tauber. Storage alternatives for mobile computers. In Proc. 1st USENIX OSDI, pages 25--37, Nov. 1994.
[13]
L. Ganesh, H. Weatherspoon, M. Balakrishnan, and K. Birman. Optimizing power consumption in large scale storage systems. In Proc. HotOS XI, May 2007.
[14]
A. Gara, M.A. Blumrich, D. Chen, G. L.-T. Chiu, et al. Overview of the Blue Gene/L system architecture. IBM J. Res and Dev., 49 (2/3), May 2005.
[15]
S. Ghemawat, H. Gobioff, and S.-T. Leung. The Google file system. In Proc. 19th ACM Symposium on Operating Systems Principles (SOSP), Oct. 2003.
[16]
A. Greenberg, N. Jain, S. Kandula, C. Kim, P. Lahiri, D. Maltz, P. Patel, and S. Sengupta. VL2: A scalable and flexible data center network. In Proc. ACM SIGCOMM, Aug. 2009.
[17]
S.D. Gribble, E.A. Brewer, J.M. Hellerstein, and D. Culler. Scalable, distributed data structures for Internet service construction. In Proc. 4th USENIX OSDI, Nov. 2000.
[18]
C. Guo, H. Wu, K. Tan, L. Shi, Y. Zhang, and S. Lu. DCell: A scalable and fault-tolerant network structure for data centers. In Proc. ACM SIGCOMM, Aug. 2008.
[19]
C. Guo, G. Lu, D. Li, H. Wu, X. Zhang, Y. Shi, C. Tian, Y. Zhang, and S. Lu. BCube: A high performance, server-centric network architecture for modular data centers. In Proc. ACM SIGCOMM, Aug. 2009.
[20]
J. Hamilton. Cooperative expendable micro-slice servers (CEMS): Low cost, low power servers for Internet scale services. http://mvdirona.com/jrh/TalksAndPapers/JamesHamilton_CEMS.pdf, 2009.
[21]
Intel. Penryn Press Release. http://www.intel.com/pressroom/archive/releases/20070328fact.htm.
[22]
Iozone. Filesystem Benchmark. http://www.iozone.org.
[23]
JFFS2. The Journaling Flash File System. http://sources.redhat.com/jffs2/.
[24]
B. Johnson. Facebook, personal communication, Nov. 2008.
[25]
R.H. Katz. Tech titans building boom. IEEE Spectrum, Feb. 2009.
[26]
A. Kawaguchi, S. Nishioka, and H. Motoda. A flash-memory based file system. In Proc. USENIX Annual Technical Conference, Jan. 1995.
[27]
L. Lamport. The part-time parliament. ACM Trans. Comput. Syst., 16 (2): 133--169, 1998. ISSN 0734-2071.
[28]
S.-W. Lee, B. Moon, C. Park, J.-M. Kim, and S.-W. Kim. A case for flash memory SSD in enterprise database applications. In Proc. ACM SIGMOD, June 2008.
[29]
Y. Li, B. He, Q. Luo, and K. Yi. Tree indexing on flash disks. In Proceedings of 25th International Conference on Data Engineering, Mar. 2009.
[30]
K. Lim, P. Ranganathan, J. Chang, C. Patel, T. Mudge, and S. Reinhardt. Understanding and designing new server architectures for emerging warehouse-computing environments. In International Symposium on Computer Architecture (ISCA), June 2008.
[31]
J. MacCormick, N. Murphy, M. Najork, C.A. Thekkath, and L. Zhou. Boxwood: abstractions as the foundation for storage infrastructure. In Proc. 6th USENIX OSDI, Dec. 2004.
[32]
G. Mathur, P. Desnoyers, D. Ganesan, and P. Shenoy. Capsule: an energy-optimized object storage system for memory-constrained sensor devices. In Proceedings of the ACM Conference on Embedded Networked Sensor Systems (SenSys), Oct. 2006.
[33]
Memcached. A distributed memory object caching system. http://www.danga.com/memcached/.
[34]
Microsoft Marlowe. Peering into future of cloud computing. http://research.microsoft.com/en-us/news/features/ccf-022409.aspx, 2009.
[35]
D. Myers. On the use of NAND flash memory in high-performance relational databases. M.S. Thesis, MIT, Feb. 2008.
[36]
S. Nath and P.B. Gibbons. Online maintenance of very large random samples on flash storage. In phProc. VLDB, Aug. 2008.
[37]
S. Nath and A. Kansal. FlashDB: Dynamic self-tuning database for NAND flash. In Proceedings of ACM/IEEE International Conference on Information Processing in Sensor Networks, Apr. 2007.
[38]
Netezza. Business intelligence data warehouse appliance. http://www.netezza.com/, 2006.
[39]
nilfs. Continuous snapshotting filesystem for Linux. http://www.nilfs.org.
[40]
M. Polte, J. Simsa, and G. Gibson. Enabling enterprise solid state disks performance. In phProc. Workshop on Integrating Solid-state Memory into the Storage Hierarchy, Mar. 2009.
[41]
Project Voldemort. A distributed key-value storage system. http://project-voldemort.com.
[42]
S. Quinlan and S. Dorward. Venti: A new approach to archival storage. In Proc. USENIX Conference on File and Storage Technologies (FAST), pages 89--101, Jan. 2002.
[43]
E. Riedel, C. Faloutsos, G.A. Gibson, and D. Nagle. Active disks for large-scale data processing. phIEEE Computer, 34 (6): 68--74, June 2001.
[44]
S. Rivoire, M.A. Shah, P. Ranganathan, and C. Kozyrakis. JouleSort: A balanced energy-efficient benchmark. In Proc. ACM SIGMOD, June 2007.
[45]
M. Rosenblum and J.K. Ousterhout. The design and implementation of a log-structured file system. ACM Transactions on Computer Systems, 10 (1): 26--52, 1992.
[46]
S.W. Schlosser, J.L. Griffin, D.F. Nagle, and G.R. Ganger. Filling the memory access gap: A case for on-chip magnetic storage. Technical Report CMU-CS-99-174, Carnegie Mellon University, Nov. 1999.
[47]
F.B. Schneider. Byzantine generals in action: implementing fail-stop processors. ACM Trans. Comput. Syst., 2 (2): 145--154, 1984. ISSN 0734-2071.
[48]
I. Stoica, R. Morris, D. Karger, M.F. Kaashoek, and H. Balakrishnan. Chord: A scalable peer-to-peer lookup service for Internet applications. In Proc. ACM SIGCOMM, Aug. 2001.
[49]
M.W. Storer, K.M. Greenan, E.L. Miller, and K. Voruganti. Pergamum: Replacing tape with energy efficient, reliable, disk-based archival storage. In Proc. USENIX Conference on File and Storage Technologies, Feb. 2008.
[50]
A. Szalay, G. Bell, A. Terzis, A. White, and J. Vandenberg. Low power Amdahl blades for data intensive computing, 2009.
[51]
J. Terrace and M.J. Freedman. Object storage on CRAQ: High-throughput chain replication for read-mostly workloads. In Proc. USENIX Annual Technical Conference, June 2009.
[52]
N. Tolia, Z. Wang, M. Marwah, C. Bash, P. Ranganathan, and X. Zhu. Delivering energy proportionality with non energy-proportional systems -- optimizing the ensemble. In Proc. HotPower, Dec. 2008.
[53]
D. Tsirogiannis, S. Harizopoulos, M.A. Shah, J.L. Wiener, and G. Graefe. Query processing techniques for solid state drives. In Proc. ACM SIGMOD, June 2009.
[54]
R. van Renesse and F.B. Schneider. Chain replication for supporting high throughput and availability. In Proc. 6th USENIX OSDI, Dec. 2004.
[55]
WattsUp. .NET Power Meter. http://wattsupmeters.com.
[56]
M. Weiser, B. Welch, A. Demers, and S. Shenker. Scheduling for reduced CPU energy. In Proc. 1st USENIX OSDI, pages 13--23, Nov. 1994.
[57]
M. Wu and W. Zwaenepoel. eNVy: A non-volatile, main memory storage system. In Proc. 6th International Conf. on Architectural Support for Programming Languages and Operating Systems (ASPLOS), Oct. 1994.
[58]
D. Zeinalipour-Yazti, S. Lin, V. Kalogeraki, D. Gunopulos, and W.A. Najjar. Microhash: An efficient index structure for flash-based sensor devices. In Proc. 4th USENIX Conference on File and Storage Technologies, Dec. 2005.
[59]
Q. Zhu, Z. Chen, L. Tan, Y. Zhou, K. Keeton, and J. Wilkes. Hibernator: Helping disk arrays sleep through the winter. In Proc. 20th ACM Symposium on Operating Systems Principles (SOSP), Oct. 2005.

Cited By

View all
  • (2024)Aleph Filter: To Infinity in Constant TimeProceedings of the VLDB Endowment10.14778/3681954.368202717:11(3644-3656)Online publication date: 1-Jul-2024
  • (2024)Beyond Bloom: A Tutorial on Future Feature-Rich FiltersCompanion of the 2024 International Conference on Management of Data10.1145/3626246.3654681(636-644)Online publication date: 9-Jun-2024
  • (2023)LEED: A Low-Power, Fast Persistent Key-Value Store on SmartNIC JBOFsProceedings of the ACM SIGCOMM 2023 Conference10.1145/3603269.3604880(1012-1027)Online publication date: 10-Sep-2023
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SOSP '09: Proceedings of the ACM SIGOPS 22nd symposium on Operating systems principles
October 2009
346 pages
ISBN:9781605587523
DOI:10.1145/1629575
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 11 October 2009

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. cluster computing
  2. design
  3. energy efficiency
  4. flash
  5. measurement
  6. performance

Qualifiers

  • Research-article

Conference

SOSP09
Sponsor:

Acceptance Rates

Overall Acceptance Rate 131 of 716 submissions, 18%

Upcoming Conference

SOSP '24

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)127
  • Downloads (Last 6 weeks)6
Reflects downloads up to 21 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Aleph Filter: To Infinity in Constant TimeProceedings of the VLDB Endowment10.14778/3681954.368202717:11(3644-3656)Online publication date: 1-Jul-2024
  • (2024)Beyond Bloom: A Tutorial on Future Feature-Rich FiltersCompanion of the 2024 International Conference on Management of Data10.1145/3626246.3654681(636-644)Online publication date: 9-Jun-2024
  • (2023)LEED: A Low-Power, Fast Persistent Key-Value Store on SmartNIC JBOFsProceedings of the ACM SIGCOMM 2023 Conference10.1145/3603269.3604880(1012-1027)Online publication date: 10-Sep-2023
  • (2023)InfiniFilter: Expanding Filters to Infinity and BeyondProceedings of the ACM on Management of Data10.1145/35892851:2(1-27)Online publication date: 20-Jun-2023
  • (2023)EVStore: Storage and Caching Capabilities for Scaling Embedding Tables in Deep Recommendation SystemsProceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 210.1145/3575693.3575718(281-294)Online publication date: 27-Jan-2023
  • (2023)Junkyard Computing: Repurposing Discarded Smartphones to Minimize CarbonProceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 210.1145/3575693.3575710(400-412)Online publication date: 27-Jan-2023
  • (2023)TH-iSSD: Design and Implementation of a Generic and Reconfigurable Near-Data Processing FrameworkACM Transactions on Embedded Computing Systems10.1145/356345622:6(1-23)Online publication date: 9-Nov-2023
  • (2023)Rambda: RDMA-driven Acceleration Framework for Memory-intensive µs-scale Datacenter Applications2023 IEEE International Symposium on High-Performance Computer Architecture (HPCA)10.1109/HPCA56546.2023.10071127(499-515)Online publication date: Feb-2023
  • (2023)An Active Storage System for Intelligent Data Analysis and ManagementDesign and Applications of Emerging Computer Systems10.1007/978-3-031-42478-6_6(143-183)Online publication date: 17-Aug-2023
  • (2022)Accelerating range queries of primary and secondary indices for key-value separationProceedings of the 13th Symposium on Cloud Computing10.1145/3542929.3563479(226-239)Online publication date: 7-Nov-2022
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media