DOI: 10.1007/978-3-030-45237-7_5
Article

How Many Bits Does it Take to Quantize Your Neural Network?

Published: 25 April 2020

Abstract

Quantization converts neural networks into low-bit fixed-point computations that can be carried out by efficient integer-only hardware, and it is standard practice for deploying neural networks on real-time embedded devices. However, like their real-numbered counterparts, quantized networks are not immune to malicious misclassification caused by adversarial attacks. We investigate how quantization affects a network’s robustness to adversarial attacks, which is a formal verification question. We show that neither robustness nor non-robustness is monotonic in the number of bits used for the representation, and that neither is preserved by quantization from a real-numbered network. For this reason, we introduce a verification method for quantized neural networks that, using SMT solving over bit-vectors, accounts for their exact, bit-precise semantics. We built a tool and analyzed the effect of quantization on a classifier for the MNIST dataset. We demonstrate that, compared to our method, existing methods for the analysis of real-numbered networks often derive false conclusions about their quantizations, both when determining robustness and when detecting attacks, and that existing methods for quantized networks often miss attacks. Furthermore, we applied our method beyond robustness, showing how the number of bits used in quantization increases the gender bias of a predictor for students’ grades.
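
To make the bit-vector encoding concrete, the sketch below shows how the exact integer semantics of a single quantized ReLU layer, together with a local robustness query, could be phrased for the Z3 SMT solver via its Python API. This is a minimal illustration under assumptions, not the tool described in the paper: the layer size, weights, biases, rescaling shift, reference input, and perturbation radius are all made up, and a full encoding would cover every layer and the argmax over all output classes.

# A minimal sketch, assuming a toy 2-input/2-output quantized ReLU layer.
from z3 import (BitVec, BitVecVal, SignExt, Extract, If, And, Xor,
                Solver, sat)

IN_BITS, ACC_BITS = 8, 32     # 8-bit activations, 32-bit accumulator
W = [[3, -2], [1, 4]]         # illustrative integer weights (assumed)
B = [5, -7]                   # illustrative integer biases (assumed)
SHIFT = 4                     # assumed fixed-point rescaling shift

def ext(x):
    """Sign-extend an 8-bit value to the accumulator width."""
    return SignExt(ACC_BITS - IN_BITS, x)

def layer(xs):
    """Bit-precise layer semantics: multiply-accumulate, rescale by an
    arithmetic right shift, apply ReLU, saturate to the int8 range."""
    ys = []
    for row, b in zip(W, B):
        acc = BitVecVal(b, ACC_BITS)
        for w, x in zip(row, xs):
            acc = acc + BitVecVal(w, ACC_BITS) * ext(x)
        acc = acc >> SHIFT                                  # >> is an arithmetic shift in z3py
        acc = If(acc < 0, BitVecVal(0, ACC_BITS), acc)      # ReLU
        acc = If(acc > 127, BitVecVal(127, ACC_BITS), acc)  # saturation
        ys.append(Extract(IN_BITS - 1, 0, acc))
    return ys

# Concrete reference input (assumed) and a symbolic input within L-infinity radius 2.
x0 = [BitVecVal(v, IN_BITS) for v in (20, 30)]
x  = [BitVec(f"x{i}", IN_BITS) for i in range(2)]

s = Solver()
for xc, xv in zip(x0, x):
    d = ext(xv) - ext(xc)        # compute the difference in the wide type to avoid wraparound
    s.add(And(d >= -2, d <= 2))

y0, y = layer(x0), layer(x)
# Both outputs lie in [0, 127], so a signed 8-bit comparison decides the predicted
# class; the class flips iff the ordering of the two output neurons flips.
s.add(Xor(y0[0] >= y0[1], y[0] >= y[1]))

print("adversarial input found" if s.check() == sat else
      "robust within radius 2")

If the solver answers sat, its model is a concrete low-bit input within the perturbation bound whose predicted class differs from that of the reference input; unsat certifies robustness within that radius under exactly the fixed-point arithmetic encoded above.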




Information

Published In

Tools and Algorithms for the Construction and Analysis of Systems: 26th International Conference, TACAS 2020, Held as Part of the European Joint Conferences on Theory and Practice of Software, ETAPS 2020, Dublin, Ireland, April 25–30, 2020, Proceedings, Part II
Apr 2020
436 pages
ISBN:978-3-030-45236-0
DOI:10.1007/978-3-030-45237-7
Open Access: This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made. The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

Publisher

Springer-Verlag

Berlin, Heidelberg

Cited By

  • (2024) Getting a-Round Guarantees: Floating-Point Attacks on Certified Robustness. Proceedings of the 2024 Workshop on Artificial Intelligence and Security, pp. 53–64. DOI: 10.1145/3689932.3694761 (online 6 Nov 2024)
  • (2024) Counterexample Guided Neural Network Quantization Refinement. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 43(4), 1121–1134. DOI: 10.1109/TCAD.2023.3335313 (online 1 Apr 2024)
  • (2024) Certified Quantization Strategy Synthesis for Neural Networks. Formal Methods, pp. 343–362. DOI: 10.1007/978-3-031-71162-6_18 (online 9 Sep 2024)
  • (2023) Quantitative Robustness Analysis of Neural Networks. Proceedings of the 32nd ACM SIGSOFT International Symposium on Software Testing and Analysis, pp. 1527–1531. DOI: 10.1145/3597926.3605231 (online 12 Jul 2023)
  • (2023) Precise Quantitative Analysis of Binarized Neural Networks: A BDD-based Approach. ACM Transactions on Software Engineering and Methodology 32(3), 1–51. DOI: 10.1145/3563212 (online 27 Apr 2023)
  • (2023) QNNRepair: Quantized Neural Network Repair. Software Engineering and Formal Methods, pp. 320–339. DOI: 10.1007/978-3-031-47115-5_18 (online 6 Nov 2023)
  • (2023) An Automata-Theoretic Approach to Synthesizing Binarized Neural Networks. Automated Technology for Verification and Analysis, pp. 380–400. DOI: 10.1007/978-3-031-45329-8_18 (online 24 Oct 2023)
  • (2023) QEBVerif: Quantization Error Bound Verification of Neural Networks. Computer Aided Verification, pp. 413–437. DOI: 10.1007/978-3-031-37703-7_20 (online 17 Jul 2023)
  • (2022) QVIP: An ILP-based Formal Verification Approach for Quantized Neural Networks. Proceedings of the 37th IEEE/ACM International Conference on Automated Software Engineering, pp. 1–13. DOI: 10.1145/3551349.3556916 (online 10 Oct 2022)
  • (2021) Bit-Precise Verification of Discontinuity Errors Under Fixed-Point Arithmetic. Software Engineering and Formal Methods, pp. 443–460. DOI: 10.1007/978-3-030-92124-8_25 (online 6 Dec 2021)
