Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3218603.3218621acmconferencesArticle/Chapter ViewAbstractPublication PagesislpedConference Proceedingsconference-collections
research-article
Public Access

RMAC: Runtime Configurable Floating Point Multiplier for Approximate Computing

Published: 23 July 2018 Publication History

Abstract

Approximate computing is a way to build fast and energy efficient systems, which provides responses of good enough quality tailored for different purposes. In this paper, we propose a novel approximate floating point multiplier which efficiently multiplies two floating numbers and yields a high precision product. RMAC approximates the costly mantissa multiplication to a simple addition between the mantissa of input operands. To tune the level of accuracy, RMAC looks at the first bit of the input mantissas as well as the first N bits of the result of addition to dynamically estimate the maximum multiplication error rate. Then, RMAC decides to either accept the approximate result or re-execute the exact multiplication. Depending on the value of N, the proposed RMAC can be configured to achieve different levels of accuracy. We integrate the proposed RMAC in AMD southern Island GPU, by replacing RMAC with the existing floating point units. We test the efficiency and accuracy of the enhanced GPU on a wide range of applications including multimedia and machine learning applications. Our evaluations show that a GPU enhanced by the proposed RMAC can achieve 5.2x energydelay product improvement as opposed to GPU using conventional FPUs while ensuring less than 2% quality loss. Comparing our approach with other state-of-the-art approximate multipliers shows that RMAC can achieve 3.1x faster and 1.8x more energy efficient computations while providing the same quality of service.

References

[1]
A. Fehske et al., "The global footprint of mobile communications: The ecological and economic perspective," IEEE Communications Magazine, vol. 49, no. 8, 2011.
[2]
F. Xia et al., "Internet of things," International Journal of Communication Systems, vol. 25, no. 9, p. 1101, 2012.
[3]
J. Gubbi et al., "Internet of things (iot): A vision, architectural elements, and future directions," Future generation computer systems, vol. 29, no. 7, pp. 1645--1660, 2013.
[4]
B. Yao et al., "Multifractal analysis of image profiles for the characterization and detection of defects in additive manufacturing," Journal of Manufacturing Science and Engineering, vol. 140, no. 3, p. 031014, 2018.
[5]
M. S. Riazi et al., "Camsure: Secure content-addressable memory for approximate search," ACM TECS, vol. 16, no. 5s, p. 136, 2017.
[6]
J. Han et al., "Approximate computing: An emerging paradigm for energy-efficient design," in IEEE ETS, pp. 1--6, IEEE, 2013.
[7]
C. Liu et al., "A low-power, high-performance approximate multiplier with configurable partial error recovery," in IEEE/ACM DATE, p. 95, IEEE, 2014.
[8]
M. Imani et al., "Resistive configurable associative memory for approximate computing," in DATE, pp. 1327--1332, IEEE, 2016.
[9]
M. Imani et al., "Masc: Ultra-low energy multiple-access single-charge tcam for approximate computing," in IEEE/ACM DATE, pp. 373--378, IEEE, 2017.
[10]
X. Jiao et al., "Energy-efficient neural networks using approximate computation reuse," in DATE, pp. 1223--1228, IEEE, 2018.
[11]
M. Imani et al., "Approximate computing using multiple-access single-charge associative memory," IEEE TETC, 2016.
[12]
Y. Kim et al., "Orchard: Visual object recognition accelerator based on approximate in-memory processing," in IEEE/ACM ICCAD, pp. 25--32, IEEE, 2017.
[13]
M. Imani et al., "Ultra-efficient processing in-memory for data intensive applications," in DAC, p. 6, ACM, 2017.
[14]
J. Von Neumann, "Probabilistic logics and the synthesis of reliable organisms from unreliable components," Automata studies, vol. 34, pp. 43--98, 1956.
[15]
F. Imani and other, "Factal pattern recognition of image profiles for manufacturing process monitoring and control," in ASME MSEC, p. 1, 2018.
[16]
S. Hashemi et al., "tldrum: A dynamic range unbiased multiplier for approximate applications," in ICCAD, pp. 418--425, IEEE Press, 2015.
[17]
S. Narayanamoorthy et al., "Energy-efficient approximate multiplication for digital signal processing and classification applications," TVLSI, vol. 23, no. 6, pp. 1180--1184, 2015.
[18]
P. Kulkarni et al., "Trading accuracy for power with an underdesigned multiplier architecture," in IVLSI, pp. 346--351, IEEE, 2011.
[19]
M. Imani et al., "Acam: Approximate computing based on adaptive associative memory with online learning," in IEEE/ACM ISLPED, pp. 162--167, 2016.
[20]
M. Imani et al., "Canna: neural network acceleration using configurable approximation on gpgpu," in IEEE ASPLOS, IEEE, 2018.
[21]
M. Imani et al., "Cfpu: Configurable floating point multiplier for energy-efficient computing," in DAC, pp. 1--6, IEEE, 2017.
[22]
D. Jeon et al., "Design methodology for voltage-overscaled ultra-low-power systems," TCAS II, vol. 59, no. 12, pp. 952--956, 2012.
[23]
K. He et al., "Circuit-level timing-error acceptance for design of energy-efficient dct/idct-based systems," TCSVT, vol. 23, no. 6, pp. 961--974, 2013.
[24]
D. Mohapatra et al., "Design of voltage-scalable meta-functions for approximate computing," in DATE, pp. 1--6, IEEE, 2011.
[25]
M. Imani et al., "Remam: low energy resistive multi-stage associative memory for energy efficient computing," in IEEE ISQED, pp. 101--106, IEEE, 2016.
[26]
K. Bhardwaj et al., "Power-and area-efficient approximate wallace tree multiplier for error-resilient systems," in ISQED, pp. 263--269, IEEE, 2014.
[27]
R. Ubal et al., "Multi2sim: a simulation framework for cpu-gpu computing," in PACT, pp. 335--344, ACM, 2012.
[28]
"Aflopoco {online}. available:http://flopoco.gforge.inria.fr/,"
[29]
"Amd app sdk v2.5 {online}. available: http://www.amd.com/stream,"
[30]
"Caltech 101 {online}. http://www.vision.caltech.edu/image_datasets/caltech101/,"
[31]
S. Che et al., "Rodinia: A benchmark suite for heterogeneous computing," in Workload Characterization, 2009. IISWC 2009. IEEE International Symposium on, pp. 44--54, Ieee, 2009.
[32]
Y. LeCun et al., "The mnist database of handwritten digits," 1998.
[33]
"Uci machine learning repository." http://archive.ics.uci.edu/ml/datasets/Human+Activity+Recognition+Using+Smartphones.
[34]
"Uci machine learning repository." http://archive.ics.uci.edu/ml/datasets/ISOLET.

Cited By

View all
  • (2024)Hardware-Efficient Logarithmic Floating-Point Multipliers for Error-Tolerant ApplicationsIEEE Transactions on Circuits and Systems I: Regular Papers10.1109/TCSI.2023.332632971:1(209-222)Online publication date: Jan-2024
  • (2024)Design of a Hardware-Efficient Floating-Point Multiplier with Dynamic Segmentation2024 19th Conference on Ph.D Research in Microelectronics and Electronics (PRIME)10.1109/PRIME61930.2024.10559705(1-4)Online publication date: 9-Jun-2024
  • (2024)Design Wireless Communication Circuits and Systems Using Approximate ComputingDesign and Applications of Emerging Computer Systems10.1007/978-3-031-42478-6_20(531-565)Online publication date: 14-Jan-2024
  • Show More Cited By

Index Terms

  1. RMAC: Runtime Configurable Floating Point Multiplier for Approximate Computing

        Recommendations

        Comments

        Please enable JavaScript to view thecomments powered by Disqus.

        Information & Contributors

        Information

        Published In

        cover image ACM Conferences
        ISLPED '18: Proceedings of the International Symposium on Low Power Electronics and Design
        July 2018
        327 pages
        ISBN:9781450357043
        DOI:10.1145/3218603
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Sponsors

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 23 July 2018

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. Approximate Computing
        2. Deep Neural Network
        3. Energy Efficiency
        4. Floating Point Multiplications

        Qualifiers

        • Research-article
        • Research
        • Refereed limited

        Funding Sources

        Conference

        ISLPED '18
        Sponsor:

        Acceptance Rates

        Overall Acceptance Rate 398 of 1,159 submissions, 34%

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)122
        • Downloads (Last 6 weeks)20
        Reflects downloads up to 16 Nov 2024

        Other Metrics

        Citations

        Cited By

        View all
        • (2024)Hardware-Efficient Logarithmic Floating-Point Multipliers for Error-Tolerant ApplicationsIEEE Transactions on Circuits and Systems I: Regular Papers10.1109/TCSI.2023.332632971:1(209-222)Online publication date: Jan-2024
        • (2024)Design of a Hardware-Efficient Floating-Point Multiplier with Dynamic Segmentation2024 19th Conference on Ph.D Research in Microelectronics and Electronics (PRIME)10.1109/PRIME61930.2024.10559705(1-4)Online publication date: 9-Jun-2024
        • (2024)Design Wireless Communication Circuits and Systems Using Approximate ComputingDesign and Applications of Emerging Computer Systems10.1007/978-3-031-42478-6_20(531-565)Online publication date: 14-Jan-2024
        • (2023)SG-Float: Achieving Memory Access and Computing Power Reduction Using Self-Gating Float in CNNsACM Transactions on Embedded Computing Systems10.1145/362458222:6(1-22)Online publication date: 9-Nov-2023
        • (2023)A Survey on Approximate Multiplier Designs for Energy Efficiency: From Algorithms to CircuitsACM Transactions on Design Automation of Electronic Systems10.1145/361029129:1(1-37)Online publication date: 24-Jul-2023
        • (2023)Approximation Opportunities in Edge Computing Hardware: A Systematic Literature ReviewACM Computing Surveys10.1145/357277255:12(1-49)Online publication date: 3-Mar-2023
        • (2023)Approximate Computing: Hardware and Software Techniques, Tools and Their ApplicationsJournal of Circuits, Systems and Computers10.1142/S021812662430001033:04Online publication date: 20-Sep-2023
        • (2023)On the Design of Iterative Approximate Floating-Point MultipliersIEEE Transactions on Computers10.1109/TC.2022.321646572:6(1623-1635)Online publication date: 1-Jun-2023
        • (2023)Novel, Configurable Approximate Floating-point Multipliers for Error-Resilient Applications2023 24th International Symposium on Quality Electronic Design (ISQED)10.1109/ISQED57927.2023.10129296(1-7)Online publication date: 5-Apr-2023
        • (2023)On the Facilitation of Voltage Over-Scaling and Minimization of Timing Errors in Floating-Point Multipliers2023 IEEE 29th International Symposium on On-Line Testing and Robust System Design (IOLTS)10.1109/IOLTS59296.2023.10224887(1-7)Online publication date: 3-Jul-2023
        • Show More Cited By

        View Options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Login options

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media