Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

L24: Parallelism, performance, energy efficiency, and cost trade-offs in future sensor platforms

Published: 05 September 2013 Publication History

Abstract

Networks of sensors must process large amounts of intermittently-available data in situ. This motivates the investigation of means for achieving high performance when required, but ultra-low-power dissipation when idle. One approach to this challenge is the use of embedded multiprocessor systems, leading to trade-offs between parallelism, performance, energy efficiency, and cost. To evaluate these trade-offs and to gain insight for future system designs, this article presents the design, implementation, and evaluation of a miniature, energy-scalable, 24-processor module, L24, for use in embedded sensor systems.
Analytic results and empirical evidence motivating such embedded multiprocessors is provided, and a parallel fixed-point fast Fourier transform implementation is presented. This application is used as a challenging but realistic evaluator of the presented hardware platform. Through a combination of hardware measurements, instruction-level microarchitectural simulation, and analytic modeling, it is demonstrated that the platform provides idle power dissipation over an order of magnitude lower than systems employing a monolithic processor of equivalent performance, while dynamic power dissipation remains competitive.
Taking into account both application computation and interprocessor communication demands, it is shown that there may exist an optimum operating voltage that minimizes either time-to-solution, energy usage, or the energy-delay product. This optimum operating point is formulated analytically, calibrated with system measurements, and evaluated for the hardware platform and application presented.

References

[1]
Actel Corporation. 2009. IGLOO family of low-power Flash-based FPGAs. http://www.actel.com/.
[2]
Adler, R., Flanigan, M., Huang, J., Kling, R., Kushalnagar, N., Nachman, L., Wan, C.-Y., and Yarvis, M. 2005. Intel mote 2: An advanced platform for demanding sensor network applications. In Proceedings of the 3rd International Conference on Embedded Networked Sensor Systems (SenSys'05). ACM Press, New York, NY, 298--298.
[3]
Atmel Corporation. 2007. Datasheet, AT91 ARM Thumb-based Microcontrollers, AT91SAM7S256. http://www.atmel.com.
[4]
Atmel Corporation. 2008. Datasheet, AT91 ARM Thumb-based Microcontrollers, AT91SAM9261. http://www.atmel.com.
[5]
Barroso, L. A. and Hölzle, U. 2007. The case for energy-proportional computing. Comput. 40, 12, 33--37.
[6]
Benbasat, A. Y. and Paradiso, J. A. 2005. A compact modular wireless sensor platform. In Proceedings of the 4th International Symposium on Information Processing in Sensor Networks (IPSN'05). IEEE Press, Piscataway, NJ, 56.
[7]
Beutel, J., Kasten, O., and Ringwald, M. 2003. Poster abstract: Btnodes—a distributed platform for sensor nodes. In Proceedings of the 1st International Conference on Embedded Networked Sensor Systems (SenSys'03). ACM Press, New York, NY, 292--293.
[8]
Dally, W. and Towles, B. 2004. Principles and Practices of Interconnection Networks. Morgan Kaufmann, Burlington, MA.
[9]
Duhamel, P. and Vetterli, M. 1990. Fast Fourier transforms: A tutorial review and a state of the art. Signal Process. 19, 4, 259--299.
[10]
Edmonds, N., Stark, D., and Davis, J. 2005. Mass: Modular architecture for sensor systems. In Proceedings of the 4th International Symposium on Information Processing in Sensor Networks (IPSN'05). IEEE Press, Piscataway, NJ, 53.
[11]
Ekanayake, V., Clinton Kelly, I., and Manohar, R. 2004. An ultra low-power processor for sensor networks. In Proceedings of the 11th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS-XI). ACM Press, New York, NY, 27--36.
[12]
Freescale Semiconductor Inc. 2007. Semiconductor packaging technologies: System-in-package, package-on-package and redistributed chip packaging: Progressing toward 3G radio-in-package. http://www.freescale.com.
[13]
Frigo, M. 1999. A fast Fourier transform compiler. In Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI'99). ACM, New York, NY, 169--180.
[14]
Grama, A. Y., Gupta, A., and Kumar, V. 1993. Isoefficiency: Measuring the scalability of parallel algorithms and architectures. IEEE Parallel Distrib. Technol. 1, 3, 12--21.
[15]
Gupta, A. and Kumar, V. 1993. The scalability of FFT on parallel computers. IEEE Trans. Parallel Distrib. Syst. 4, 8, 922--932.
[16]
Hammel, T. and Rich, M. 2007. A higher capability sensor node platform suitable for demanding applications. In Proceedings of the 6th International Conference on Information Processing in Sensor Networks (IPSN'07). ACM, New York, NY, 138--147.
[17]
Hempstead, M., Tripathi, N., Mauro, P., Wei, G.-Y., and Brooks, D. 2005. An ultra low power system architecture for sensor network applications. In Proceedings of the 32nd Annual International Symposium on Computer Architecture (ISCA'05). IEEE Computer Society, Washington, DC, 208--219.
[18]
Hill, J. L. and Culler, D. E. 2002. Mica: A wireless platform for deeply embedded networks. IEEE Micro 22, 6, 12--24.
[19]
Horowitz, M., Indermaur, T., and Gonzalez, R. 1994. Low-power digital design. In Proceedings of the IEEE Symposium on Low Power Electronics. Digest of Technical Papers. 8--11.
[20]
Kautz, W. H. 1968. Bounds on directed (d,k) graphs. Theory of cellular logic networks and machines AFCRL-68-0668 Final report, 20--28.
[21]
Koopman, P. and Siewiorek, D. 1988. The impact of Rent's rule on massive parallelism. In Proceedings of the 2nd Symposium on the Frontiers of Massively Parallel Computation. 59--62.
[22]
Koren, I. and Krishna, C. 2007. Fault Tolerant Systems. Morgan Kaufmann Publishers Inc. San Francisco, CA.
[23]
Landman, B. S. and Russo, R. L. 1971. On a pin versus block relationship for partitions of logic graphs. IEEE Trans. Comput. 20, 1469--1479.
[24]
Lymberopoulos, D., Priyantha, N. B., and Zhao, F. 2007. mPlatform: A reconfigurable architecture and efficient data sharing mechanism for modular sensor nodes. In Proceedings of the 6th International Conference on Information Processing in Sensor Networks (IPSN'07). ACM, New York, NY, 128--137.
[25]
Lymberopoulos, D. and Savvides, A. 2005. Xyz: A motion-enabled, power aware sensor node platform for distributed sensor network applications. In Proceedings of the 4th International Symposium on Information Processing in Sensor Networks (IPSN'05). IEEE Press, Piscataway, NJ, 63.
[26]
McIntire, D., Ho, K., Yip, B., Singh, A., Wu, W., and Kaiser, W. J. 2006. The low power energy aware processing (leap)embedded networked sensor system. In Proceedings of the 5th International Conference on Information Processing in Sensor Networks (IPSN'06). ACM Press, New York, NY, 449--457.
[27]
Nachman, L., Kling, R., Adler, R., Huang, J., and Hummel, V. 2005. The Intel mote platform: A Bluetooth-based sensor network for industrial monitoring. In Proceedings of the 4th International Symposium on Information Processing in Sensor Networks (IPSN'05). IEEE Press, Piscataway, NJ, 61.
[28]
Panchapakesan, G. and Sengupta, A. 1999. On a lightwave network topology using Kautz digraphs. IEEE Trans. Comput. 48, 10, 1131--1138.
[29]
Parallax Inc. 2007. Propeller#8482; P8X32A Preliminary Datasheet. http://www.parallax.com.
[30]
Park, C., Liu, J., and Chou, P. H. 2005. Eco: An ultra-compact low-power wireless sensor node for real-time motion monitoring. In Proceedings of the 4th International Symposium on Information Processing in Sensor Networks (IPSN'05). Number 54. IEEE Press, Piscataway, NJ.
[31]
Polastre, J., Szewczyk, R., and Culler, D. 2005. Telos: Enabling ultra-low power wireless research. In Proceedings of the 4th International Symposium on Information Processing in Sensor Networks (IPSN'05). IEEE Press, Piscataway, NJ, 48.
[32]
Portilla, J., de Castro, A., de la Torre, E., and Riesgo, T. 2006. A modular architecture for nodes in wireless sensor networks. J. Univ. Comput. Sci. 12, 3, 328--339.
[33]
Rolim, J., Tvrdík, P., Trdlička, J., and Vrt'o, I. 1998. Bisecting de Bruijn and Kautz graphs. Discrete Appl. Math. 85, 1, 87--97.
[34]
Sakurai, T. and Newton, A. 1990. Alpha-power law MOSFET model and its applications to CMOS inverter delay and other formulas. IEEE J. Solid-State Circuits 25, 2, 584--594.
[35]
Salinger, P. and Tvrdík, P. 1998. All-to-all scatter in Kautz networks. In Proceedings of the 4th International Euro-Par Conference on Parallel Processing (Euro-Par'98). Springer-Verlag, London, UK, 1057--1061.
[36]
Schott, B., Bajura, M., Czarnaski, J., Flidr, J., Tho, T., and Wang, L. 2005. A modular power-aware microsensor with > 1000x dynamic power range. In Proceedings of the 4th International Symposium on Information Processing in Sensor Networks (IPSN'05). IEEE Press, Piscataway, NJ, 66.
[37]
Stanley-Marbell, P. 2008. Ladon—A 24-processor low-power performance-scalable processor module for sensor platforms. Tech. rep. ESR-2008-05, Technische Universiteit Eindhoven. January.
[38]
Stanley-Marbell, P. 2011. Parallelism, performance, and energy-efficiency tradeoffs for in situ sensor data processing. IEEE Embed. Syst. Let. 3, 1, 16--19.
[39]
Stanley-Marbell, P., Caparrós Cabezas, V., and Luijten, R. 2011. Pinned to the walls: Impact of packaging and application properties on the memory and power walls. In Proceedings of the 17th IEEE/ACM International Symposium on Low-Power Electronics and Design (ISLPED'11). IEEE Press, Piscataway, NJ, 51--56.
[40]
Stanley-Marbell, P. and Marculescu, D. 2007. Sunflower: Full-system, embedded microarchitecture evaluation. In Proceedings of the 2nd European conference on High Performance Embedded Architectures and Compilers (HiPEAC'07). Lecture Notes in Computer Science, vol. 4367, Springer-Verlag, Berlin Heidelberg, 168--182.
[41]
Taylor, W., Leonard, J., and Stewart, L. C. 2011. Efficient tilings of de Bruijn and Kautz graphs. CoRR abs/1101.1932.
[42]
Texas Instruments Inc. 2006. Datasheet, MSP430x22x2, MSP430x22x4 mixed signal microcontroller. http://www.ti.com/.
[43]
Warneke, B. and Pister, K. J. 2004. An ultra-low energy microcontroller for Smart Dust wireless sensor networks. In Proceedings of the IEEE International Solid-State Circuits Conference Digest of Technical Papers (ISSCC'04). 316--317.
[44]
XMOS Ltd. 2009. XS1-G4000 datasheet.
[45]
Yamashita, S., Shimura, T., Aiki, K., Ara, K., Ogata, Y., Shimokawa, I., Tanaka, T., Kuriyama, H., Shimada, K., and Yano, K. 2006. A 15 × 15mm, 1, μA, reliable sensor-net module: Enabling application-specific nodes. In Proceedings of the 5th International Conference on Information Processing in Sensor Networks (IPSN'06). ACM Press, New York, NY, 383--390.

Cited By

View all
  • (2015)How device properties influence energy-delay metrics and the energy-efficiency of parallel computationsProceedings of the Workshop on Power-Aware Computing and Systems10.1145/2818613.2818744(31-35)Online publication date: 4-Oct-2015

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Transactions on Embedded Computing Systems
ACM Transactions on Embedded Computing Systems  Volume 13, Issue 1
August 2013
332 pages
ISSN:1539-9087
EISSN:1558-3465
DOI:10.1145/2501626
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Journal Family

Publication History

Published: 05 September 2013
Accepted: 01 October 2011
Revised: 01 May 2011
Received: 01 December 2010
Published in TECS Volume 13, Issue 1

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. FFT
  2. Parallelism
  3. low power
  4. sensor networks

Qualifiers

  • Research-article
  • Research
  • Refereed

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)0
Reflects downloads up to 21 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2015)How device properties influence energy-delay metrics and the energy-efficiency of parallel computationsProceedings of the Workshop on Power-Aware Computing and Systems10.1145/2818613.2818744(31-35)Online publication date: 4-Oct-2015

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media