Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3368089.3417065acmconferencesArticle/Chapter ViewAbstractPublication PagesfseConference Proceedingsconference-collections
research-article

Reducing DNN labelling cost using surprise adequacy: an industrial case study for autonomous driving

Published: 08 November 2020 Publication History

Abstract

Deep Neural Networks (DNNs) are rapidly being adopted by the automotive industry, due to their impressive performance in tasks that are essential for autonomous driving. Object segmentation is one such task: its aim is to precisely locate boundaries of objects and classify the identified objects, helping autonomous cars to recognise the road environment and the traffic situation. Not only is this task safety critical, but developing a DNN based object segmentation module presents a set of challenges that are significantly different from traditional development of safety critical software. The development process in use consists of multiple iterations of data collection, labelling, training, and evaluation. Among these stages, training and evaluation are computation intensive while data collection and labelling are manual labour intensive. This paper shows how development of DNN based object segmentation can be improved by exploiting the correlation between Surprise Adequacy (SA) and model performance. The correlation allows us to predict model performance for inputs without manually labelling them. This, in turn, enables understanding of model performance, more guided data collection, and informed decisions about further training. In our industrial case study the technique allows cost savings of up to 50% with negligible evaluation inaccuracy. Furthermore, engineers can trade off cost savings versus the tolerable level of inaccuracy depending on different development phases and scenarios.

Supplementary Material

Auxiliary Teaser Video (fse20ind-p115-p-teaser.mp4)
This paper shows how development of DNN based object segmentation can be improved by exploiting the correlation between Surprise Adequacy (SA) and model performance. The correlation allows us to predict model performance for inputs without manually labelling them. This, in turn, enables understanding of model performance, more guided data collection, and informed decisions about further training. In our industrial case study the technique allows cost savings of up to 50% with negligible evaluation inaccuracy. Furthermore, engineers can trade off cost savings versus the tolerable level of inaccuracy depending on different development phases and scenarios.
Auxiliary Presentation Video (fse20ind-p115-p-video.mp4)
This paper shows how development of DNN based object segmentation can be improved by exploiting the correlation between Surprise Adequacy (SA) and model performance. The correlation allows us to predict model performance for inputs without manually labelling them. This, in turn, enables understanding of model performance, more guided data collection, and informed decisions about further training. In our industrial case study the technique allows cost savings of up to 50% with negligible evaluation inaccuracy. Furthermore, engineers can trade off cost savings versus the tolerable level of inaccuracy depending on different development phases and scenarios.

References

[1]
William H Beluch, Tim Genewein, Andreas Nürnberger, and Jan M Köhler. 2018. The power of ensembles for active learning in image classification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 9368-9377.
[2]
Roy De Maesschalck, Delphine Jouan-Rimbaud, and Désiré L Massart. 2000. The mahalanobis distance. Chemometrics and intelligent laboratory systems 50, 1 ( 2000 ), 1-18.
[3]
Robert Feldt, Simon Poulding, David Clark, and Shin Yoo. 2016. Test Set Diameter: Quantifying the Diversity of Sets of Test Cases. In Proceedings of the IEEE International Conference on Software Testing, Verification, and Validation (ICST 2016 ). 223-233.
[4]
D. Feng, C. Haase-Schütz, L. Rosenbaum, H. Hertlein, C. Gläser, F. Timm, W. Wiesbeck, and K. Dietmayer. 2020. Deep Multi-Modal Object Detection and Semantic Segmentation for Autonomous Driving: Datasets, Methods, and Challenges. IEEE Transactions on Intelligent Transportation Systems ( 2020 ), 1-20.
[5]
Yarin Gal, Riashat Islam, and Zoubin Ghahramani. 2017. Deep bayesian active learning with image data. In Proceedings of the 34th International Conference on Machine Learning-Volume 70. JMLR. org, 1183-1192.
[6]
Jinhan Kim, Robert Feldt, and Shin Yoo. 2019. Guiding Deep Learning System Testing using Surprise Adequacy. In Proceedings of the 41th International Conference on Software Engineering (ICSE 2019 ). IEEE Press, 1039-1049. https: //doi.org/10.1109/ICSE. 2019.00108
[7]
Kimin Lee, Kibok Lee, Honglak Lee, and Jinwoo Shin. 2018. A Simple Unified Framework for Detecting Out-of-Distribution Samples and Adversarial Attacks. In Advances in Neural Information Processing Systems 31, S. Bengio, H. Wallach, H. Larochelle, K. Grauman, N. Cesa-Bianchi, and R. Garnett (Eds.). Curran Associates, Inc., 7167-7177. http://papers.nips.cc/paper/7947-a-simple-unified-framework-fordetecting-out-of-distribution-samples-and-adversarial-attacks.pdf
[8]
Dimitris Liparas, Lefteris Angelis, and Robert Feldt. 2012. Applying the Mahalanobis-Taguchi strategy for software defect diagnosis. Automated Software Engineering 19, 2 ( 2012 ), 141-165.
[9]
Lei Ma, Felix Juefei-Xu, Jiyuan Sun, Chunyang Chen, Ting Su, Fuyuan Zhang, Minhui Xue, Bo Li, Li Li, Yang Liu, Jianjun Zhao, and Yadong Wang. 2018. DeepGauge: Comprehensive and Multi-Granularity Testing Criteria for Gauging the Robustness of Deep Learning Systems. CoRR abs/ 1803.07519 ( 2018 ). arXiv: 1803.07519 http://arxiv.org/abs/ 1803.07519
[10]
Shiqing Ma, Yingqi Liu, Wen-Chuan Lee, Xiangyu Zhang, and Ananth Grama. 2018. MODE: Automated Neural Network Model Debugging via State Diferential Analysis and Input Selection. In Proceedings of the 2018 26th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering (Lake Buena Vista, FL, USA) ( ESEC/FSE 2018). ACM, New York, NY, USA, 175-186. https://doi.org/10.1145/3236024.3236082
[11]
Sachin Mehta, Mohammad Rastegari, Anat Caspi, Linda Shapiro, and Hannaneh Hajishirzi. 2018. ESPNet: Eficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation. arXiv: 1803. 06815 [cs.CV]
[12]
Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, Alban Desmaison, Andreas Kopf, Edward Yang, Zachary DeVito, Martin Raison, Alykhan Tejani, Sasank Chilamkurthy, Benoit Steiner, Lu Fang, Junjie Bai, and Soumith Chintala. 2019. PyTorch: An Imperative Style, High-Performance Deep Learning Library. In Advances in Neural Information Processing Systems 32, H. Wallach, H. Larochelle, A. Beygelzimer, F. d'Alché-Buc, E. Fox, and R. Garnett (Eds.). Curran Associates, Inc., 8024-8035.
[13]
Kexin Pei, Yinzhi Cao, Junfeng Yang, and Suman Jana. 2017. DeepXplore: Automated Whitebox Testing of Deep Learning Systems. In Proceedings of the 26th Symposium on Operating Systems Principles (Shanghai, China) (SOSP '17). ACM, New York, NY, USA, 1-18. https://doi.org/10.1145/3132747.3132785
[14]
Mennatullah Siam, Mostafa Gamal, Moemen Abdel-Razek, Senthil Yogamani, Martin Jagersand, and Hong Zhang. 2018. A comparative study of real-time semantic segmentation for autonomous driving. In Proceedings of the IEEE conference on computer vision and pattern recognition workshops. 587-597.
[15]
Karen Simonyan and Andrew Zisserman. 2014. Very Deep Convolutional Networks for Large-Scale Image Recognition. CoRR abs/1409.1556 ( 2014 ). http://arxiv.org/abs/1409.1556
[16]
Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, and Andrew Rabinovich. 2015. Going deeper with convolutions. In Proceedings of the IEEE conference on computer vision and pattern recognition. 1-9.
[17]
Yuchi Tian, Kexin Pei, Suman Jana, and Baishakhi Ray. 2018. DeepTest: Automated Testing of Deep-neural-network-driven Autonomous Cars. In Proceedings of the 40th International Conference on Software Engineering (Gothenburg, Sweden) (ICSE '18). ACM, New York, NY, USA, 303-314. https://doi.org/10.1145/3180155.3180220
[18]
Tianyi Wu, Sheng Tang, Rui Zhang, and Yongdong Zhang. 2018. CGNet: A Light-weight Context Guided Network for Semantic Segmentation. arXiv: 1811. 08201 [cs.CV]
[19]
Shin Yoo. 2019. SBST in the Age of Machine Learning Systems: Challenges Ahead. In Proceedings of the 12th International Workshop on Search-Based Software Testing (Montreal, Quebec, Canada) ( SBST '19). IEEE Press, Piscataway, NJ, USA, 2-2. https://doi.org/10.1109/SBST. 2019. 000-2
[20]
Ekim Yurtsever, Jacob Lambert, Alexander Carballo, and Kazuya Takeda. 2019. A Survey of Autonomous Driving: Common Practices and Emerging Technologies. CoRR abs/ 1906.05113 ( 2019 ). arXiv: 1906.05113 http://arxiv.org/abs/ 1906.05113
[21]
H. Zhang and W. K. Chan. 2019. Apricot: A Weight-Adaptation Approach to Fixing Deep Learning Models. In 2019 34th IEEE/ACM International Conference on Automated Software Engineering (ASE). 376-387.

Cited By

View all
  • (2024)See the Forest, not Trees: Unveiling and Escaping the Pitfalls of Error-Triggering Inputs in Neural Network TestingProceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis10.1145/3650212.3680385(1605-1617)Online publication date: 11-Sep-2024
  • (2024)CooTest: An Automated Testing Approach for V2X Communication SystemsProceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis10.1145/3650212.3680373(1453-1465)Online publication date: 11-Sep-2024
  • (2024)Safety of Perception Systems for Automated Driving: A Case Study on ApolloACM Transactions on Software Engineering and Methodology10.1145/363196933:3(1-28)Online publication date: 15-Mar-2024
  • Show More Cited By

Index Terms

  1. Reducing DNN labelling cost using surprise adequacy: an industrial case study for autonomous driving

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    ESEC/FSE 2020: Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering
    November 2020
    1703 pages
    ISBN:9781450370431
    DOI:10.1145/3368089
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 08 November 2020

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Autonomous Driving
    2. Deep Neural Network
    3. Software Testing

    Qualifiers

    • Research-article

    Conference

    ESEC/FSE '20
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 112 of 543 submissions, 21%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)61
    • Downloads (Last 6 weeks)7
    Reflects downloads up to 18 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)See the Forest, not Trees: Unveiling and Escaping the Pitfalls of Error-Triggering Inputs in Neural Network TestingProceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis10.1145/3650212.3680385(1605-1617)Online publication date: 11-Sep-2024
    • (2024)CooTest: An Automated Testing Approach for V2X Communication SystemsProceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis10.1145/3650212.3680373(1453-1465)Online publication date: 11-Sep-2024
    • (2024)Safety of Perception Systems for Automated Driving: A Case Study on ApolloACM Transactions on Software Engineering and Methodology10.1145/363196933:3(1-28)Online publication date: 15-Mar-2024
    • (2024)Synthetic Datasets for Autonomous Driving: A SurveyIEEE Transactions on Intelligent Vehicles10.1109/TIV.2023.33310249:1(1847-1864)Online publication date: Jan-2024
    • (2024)Neuron importance-aware coverage analysis for deep neural network testingEmpirical Software Engineering10.1007/s10664-024-10524-x29:5Online publication date: 25-Jul-2024
    • (2024)Label Engineering Methods for ML SystemsIntelligent Systems and Applications10.1007/978-3-031-66336-9_33(464-474)Online publication date: 1-Aug-2024
    • (2023)FMCW Radar Sensors with Improved Range Precision by Reusing the Neural NetworkSensors10.3390/s2401013624:1(136)Online publication date: 26-Dec-2023
    • (2023)Adopting Two Supervisors for Efficient Use of Large-Scale Remote Deep Neural NetworksACM Transactions on Software Engineering and Methodology10.1145/361759333:1(1-29)Online publication date: 23-Nov-2023
    • (2023)Prioritizing Testing Instances to Enhance the Robustness of Object Detection SystemsProceedings of the 14th Asia-Pacific Symposium on Internetware10.1145/3609437.3609446(194-204)Online publication date: 4-Aug-2023
    • (2023)In Defense of Simple Techniques for Neural Network Test Case SelectionProceedings of the 32nd ACM SIGSOFT International Symposium on Software Testing and Analysis10.1145/3597926.3598073(501-513)Online publication date: 12-Jul-2023
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media