Nothing Special   »   [go: up one dir, main page]

skip to main content
10.5555/1874620.1874878acmconferencesArticle/Chapter ViewAbstractPublication PagesdateConference Proceedingsconference-collections
research-article

Latency criticality aware on-chip communication

Published: 20 April 2009 Publication History

Abstract

Packet-switched interconnect fabric is a promising on-chip communication solution for many-core architectures. It offers high throughput and excellent scalability for on-chip data and protocol transactions. The main problem posed by this communication fabric is the potentially-high and nondeterministic network latency caused by router data buffering and resource arbitration. This paper describes a new method to minimize on-chip network latency, which is motivated by the observation that only a small percentage of on-chip data and protocol traffic is latency-critical. Existing work focusing on minimizing average network latency is thus suboptimal. Such techniques expend most of the design, area, and power overhead accelerating latency-noncritical traffic for which there is no corresponding application-level speedup.
We propose run-time techniques that identify latency-critical traffic by leveraging network data-transaction and protocol information. Latency-critical traffic is permitted to bypass router pipeline stages and latency-noncritical traffic. These techniques are evaluated via a router design that has been implemented using TSMC 65nm technology. Detailed network latency simulation and hardware characterization demonstrate that, for latency-critical traffic, the proposed solution closely approximates the ideal interconnect even under heavy load while preserving throughput for both latency-critical and noncritical traffic.

References

[1]
P. Gratz, C. Kim, K. Sankaralingam, H. Hanson, P. Shivakumar, S. W. Keckler, and D. Burger, "On-chip interconnection networks of the TRIPS chip," IEEE Micro, vol. 27, no. 5, pp. 41--50, Sept. 2007.
[2]
"Tilera TILE64 chip-multiprocessor," http://www.tilera.com.
[3]
L. Seiler, et al., "Larrabee: a many-core x86 architecture for visual computing," ACM Trans. on Graphics, vol. 27, no. 3, pp. 6--23, Aug. 2008.
[4]
D. Pham, et al., "Overview of the architecture, circuit design, and physical implementation of a first-generation cell processor," IEEE J. Solid-State Circuits, vol. 41, no. 1, pp. 179--196, Jan. 2006.
[5]
S. Vangal, et al., "An 80-tile 1.28 TFLOPS network-on-chip in 65nm CMOS," in Proc. Int. Solid-State Circuits Conf., Feb. 2007, pp. 98--589.
[6]
W. J. Dally and B. Towles, Principles and Practices of Interconnection Networks. San Francisco, CA: Morgan Kaufmann Pub., 2003.
[7]
A. Kumar, L.-S. Peh, P. Kundu, and N. K. Jha, "Express virtual channels: towards the ideal interconnection fabric," in Proc. Int. Symp. Computer Architecture, June 2007.
[8]
Z. Lu, M. Liu, and A. Jantsch, "Layered switching for networks on chip," in Proc. Design Automation Conf., June 2007, pp. 122--127.
[9]
R. Mullins, A. West, and S. Moore, "Low-latency virtual-channel routers for on-chip networks," in Proc. Int. Symp. Computer Architecture, 2004, pp. 188--197.
[10]
A. Kumar, P. Kundu, A. Singh, L.-S. Peh, and N. Jha, "A 4.6Tbits/s 3.6GHz single-cycle NoC router with a novel switch allocator in 65nm cmos," in Proc. Int. Conf. on Computer Design, Oct. 2007, pp. 63--70.
[11]
N. E. Jerger, M. Lipasti, and L.-S. Peh, "Circuit-switched coherence," IEEE Computer Architecture Letters, vol. 6, no. 1, pp. 5--8, 2007.
[12]
N. Eisley, L.-S. Peh, and L. Shang, "In-network cache coherence," in Proc. Int. Symp. Microarchitecture, Dec. 2006, pp. 321--332.
[13]
E. Bolotin, Z. Guz, I. Cidon, R. Ginosar, and A. Kolodny, "The power of priority: NoC based distributed cache coherency," in Proc. Int. Symp. Networks-on-Chip, May 2007, pp. 117--126.
[14]
"CACTI: An integrated cache access time, cycle time, area, leakage, and dynamic power model," http://quid.hpl.hp.com:9082/cacti/.
[15]
M. R. Marty and M. D. Hill, "Virtual hierarchies to support server consolidation," in Proc. Int. Symp. Computer Architecture, June 2007, pp. 46--56.
[16]
N. L. Binkert, R. G. Dreslinski, L. R. Hsu, K. T. Lim, A. G. Saidi, and S. K. Reinhardt, "The M5 simulator: Modeling networked systems," IEEE Micro, vol. 26, no. 4, pp. 52--60, 2006.
[17]
"SPLASH2 website," http://www-flash.stanford.edu/apps/SPLASH/.
[18]
M.-L. Li, R. Sasanka, S. V. Adve, Y.-K. Chen, and E. Debes, "The ALPbench benchmark suite for complex multimedia applications," in Proc. Int. Symp. Workload Characterization, Oct. 2005, pp. 34--35.

Cited By

View all
  • (2015)A Lightweight Early Arbitration Method for Low-Latency Asynchronous 2D-Mesh NoC'sProceedings of the 52nd Annual Design Automation Conference10.1145/2744769.2744777(1-6)Online publication date: 7-Jun-2015
  • (2013)Proactive aging management in heterogeneous NoCs through a criticality-driven routing approachProceedings of the Conference on Design, Automation and Test in Europe10.5555/2485288.2485536(1032-1037)Online publication date: 18-Mar-2013
  • (2010)Efficient throughput-guarantees for latency-sensitive networks-on-chipProceedings of the 2010 Asia and South Pacific Design Automation Conference10.5555/1899721.1899846(529-534)Online publication date: 18-Jan-2010

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
DATE '09: Proceedings of the Conference on Design, Automation and Test in Europe
April 2009
1776 pages
ISBN:9783981080155

Sponsors

  • EDAA: European Design Automation Association
  • ECSI
  • EDAC: Electronic Design Automation Consortium
  • SIGDA: ACM Special Interest Group on Design Automation
  • The IEEE Computer Society TTTC
  • The IEEE Computer Society DATC
  • The Russian Academy of Sciences: The Russian Academy of Sciences

Publisher

European Design and Automation Association

Leuven, Belgium

Publication History

Published: 20 April 2009

Check for updates

Qualifiers

  • Research-article

Conference

DATE '09
Sponsor:
  • EDAA
  • EDAC
  • SIGDA
  • The Russian Academy of Sciences

Acceptance Rates

Overall Acceptance Rate 518 of 1,794 submissions, 29%

Upcoming Conference

DATE '25
Design, Automation and Test in Europe
March 31 - April 2, 2025
Lyon , France

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 16 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2015)A Lightweight Early Arbitration Method for Low-Latency Asynchronous 2D-Mesh NoC'sProceedings of the 52nd Annual Design Automation Conference10.1145/2744769.2744777(1-6)Online publication date: 7-Jun-2015
  • (2013)Proactive aging management in heterogeneous NoCs through a criticality-driven routing approachProceedings of the Conference on Design, Automation and Test in Europe10.5555/2485288.2485536(1032-1037)Online publication date: 18-Mar-2013
  • (2010)Efficient throughput-guarantees for latency-sensitive networks-on-chipProceedings of the 2010 Asia and South Pacific Design Automation Conference10.5555/1899721.1899846(529-534)Online publication date: 18-Jan-2010

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media