Abstract
This paper explores using information about program branch probabilities to optimise reconfigurable designs. The basic premise is to promote utilization by dedicating more resources to branches which execute more frequently. A hardware compilation system has been developed for producing designs which are optimised for different branch probabilities. We propose an analytical queueing network performance model to determine the best design from observed branch probability information. The branch optimisation space is characterized in an experimental study for Xilinx Virtex FPGAs of two complex applications: video feature extraction and progressive refinement radiosity. For designs of equal performance, branch-optimised designs require 24% and 27.5% less area. For designs of equal area, branch optimised designs run upto 3 times faster. Our analytical performance model is shown to be highly accurate with relative error between 0.12 and 1.1 x 10− 4.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Celoxica Limited, Handel-C Language Reference Manual, version 3.1, document number RM-1003-3.0 (2002)
De Micheli, G.: Synthesis and Optimization of Digital Circuits. McGraw-Hill, New York (1994)
Harriss, T., Walke, R., Kienhuis, B., Deprettere, E.: Compilation from Matlab to process networks. Design Automation for Embedded Systems 7, 385–403 (2002)
Mencer, O., Huebert, H., Morf, M., Flynn, M.J.: StReAm: Object-oriented programming of stream architectures using PAM-Blox. In: Grünbacher, H., Hartenstein, R.W. (eds.) FPL 2000. O. Mencer, H. Huebert, M. Morf and M.J. Flynn, vol. 1896, pp. 595–604. Springer, Heidelberg (2000)
Mitrani, I.: Probabalistic Modelling. Cambridge University Press, Cambridge (1998)
Moller, T., Trumbore, B.: Fast, minimum storage ray-triangle intersection. Journal of Graphics Tools 2(1), 21–28 (1997)
Onvural, R.O.: Survey of closed queueing networks with blocking. ACM Computing Surveys 22(2), 83–121 (1990)
Styles, H., Luk, W.: Accelerating radiosity calculations using reconfigurable platforms. In: Proc. IEEE Symp. on Field-Programmable Custom Computing Machines, pp. 279–281 (2002)
Veen, A.H.: Dataflow machine architecture. ACM Computing Surveys 18(4), 365–396 (1986)
Weinhardt, M., Luk, W.: Pipeline vectorisation. IEEE Trans. on Comput.-Aided Design 20(2), 234–248 (2001)
Ziegler, H., So, B., Hall, M., Diniz, P.C.: Coarse-grain pipelining on multiple FPGA architectures. In: Proc. IEEE Symp. on Field-Programmable Custom Computing Machines, pp. 77–86 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Styles, H., Luk, W. (2003). Branch Optimisation Techniques for Hardware Compilation. In: Y. K. Cheung, P., Constantinides, G.A. (eds) Field Programmable Logic and Application. FPL 2003. Lecture Notes in Computer Science, vol 2778. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45234-8_32
Download citation
DOI: https://doi.org/10.1007/978-3-540-45234-8_32
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40822-2
Online ISBN: 978-3-540-45234-8
eBook Packages: Springer Book Archive