Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3691620.3695067acmconferencesArticle/Chapter ViewAbstractPublication PagesaseConference Proceedingsconference-collections
research-article

Bridging the Gap between Real-world and Synthetic Images for Testing Autonomous Driving Systems

Published: 27 October 2024 Publication History

Abstract

Deep Neural Networks (DNNs) for Autonomous Driving Systems (ADS) are typically trained on real-world images and tested using synthetic images from simulators. This approach results in training and test datasets with dissimilar distributions, which can potentially lead to erroneously decreased test accuracy. To address this issue, the literature suggests applying domain-to-domain translators to test datasets to bring them closer to the training datasets. However, translating images used for testing may unpredictably affect the reliability, effectiveness and efficiency of the testing process. Hence, this paper investigates the following questions in the context of ADS: Could translators reduce the effectiveness of images used for ADS-DNN testing and their ability to reveal faults in ADS-DNNs? Can translators result in excessive time overhead during simulation-based testing? To address these questions, we consider three domain-to-domain translators: CycleGAN and neural style transfer, from the literature, and SAEVAE, our proposed translator. Our results for two critical ADS tasks - lane keeping and object detection - indicate that translators significantly narrow the gap in ADS test accuracy caused by distribution dissimilarities between training and test data, with SAEVAE outperforming the other two translators. We show that, based on the recent diversity, coverage, and fault-revealing ability metrics for testing deep-learning systems, translators do not compromise the diversity and the coverage of test data nor do they lead to revealing fewer faults in ADS-DNNs. Further, among the translators considered, SAEVAE incurs a negligible overhead in simulation time and can be efficiently integrated into simulation-based testing. Finally, we show that translators increase the correlation between offline and simulation-based testing results, which can help reduce the cost of simulation-based testing. Our replication package is available online [1].

References

[1]
Replication package for the paper. 2024. [Online; accessed Aug 25, 2024].
[2]
Jie M. Zhang, Mark Harman, Lei Ma, and Yang Liu. Machine learning testing: Survey, landscapes and horizons. IEEE Trans. Softw. Eng., 48:1--36, Jan 2022.
[3]
Andrea Stocco, Brian Pulfer, and Paolo Tonella. Model vs system level testing of autonomous driving systems: a replication and extension study. Empir. Softw. Eng., 28(3):73, 2023.
[4]
Shuncheng Tang, Zhenya Zhang, Yi Zhang, Jixiang Zhou, Yan Guo, Shuang Liu, Shengjian Guo, Yan-Fu Li, Lei Ma, Yinxing Xue, and Yang Liu. A survey on automated driving system testing: Landscapes and trends. ACM Trans. Softw. Eng. Methodol., 32(5):124:1--124:62, 2023.
[5]
Mohammad Hossein Amini, Shervin Naseri, and Shiva Nejati. Evaluating the impact of flaky simulators on testing autonomous driving systems. Empir. Softw. Eng., 29(2):47, 2024.
[6]
Reza Matinnejad, Shiva Nejati, Lionel C. Briand, Thomas Bruckmann, and Claude Poull. Automated model-in-the-loop testing of continuous controllers using search. In Günther Ruhe and Yuanyuan Zhang, editors, Search Based Software Engineering - 5th International Symposium, SSBSE 2013, St. Petersburg, Russia, August 24--26, 2013. Proceedings, volume 8084 of Lecture Notes in Computer Science, pages 141--157. Springer, 2013.
[7]
Carlos A. González, Mojtaba Varmazyar, Shiva Nejati, Lionel C. Briand, and Yago Isasi. Enabling model testing of cyber-physical systems. In Andrzej Wasowski, Richard F. Paige, and Øystein Haugen, editors, Proceedings of the 21th ACM/IEEE International Conference on Model Driven Engineering Languages and Systems, MODELS 2018, Copenhagen, Denmark, October 14--19, 2018, pages 176--186. ACM, 2018.
[8]
Shiva Nejati, Lev Sorokin, Damir Safin, Federico Formica, Mohammad Mahdi Mahboob, and Claudio Menghi. Reflections on surrogate-assisted search-based testing: A taxonomy and two replication studies based on industrial ADAS and simulink models. Inf. Softw. Technol., 163:107286, 2023.
[9]
Ian Goodfellow, Yoshua Bengio, and Aaron Courville. Deep Learning. MIT Press, 2016. http://www.deeplearningbook.org.
[10]
Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, and Alexei A. Efros. Image-to-image translation with conditional adversarial networks. In 2017 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, Honolulu, HI, USA, July 21--26, 2017, pages 5967--5976. IEEE Computer Society, 2017.
[11]
Jun-Yan Zhu, Taesung Park, Phillip Isola, and Alexei A. Efros. Unpaired image-to-image translation using cycle-consistent adversarial networks. In IEEE International Conference on Computer Vision, ICCV 2017, Venice, Italy, October 22--29, 2017, pages 2242--2251. IEEE Computer Society, 2017.
[12]
Leon A. Gatys, Alexander S. Ecker, and Matthias Bethge. Image style transfer using convolutional neural networks. In 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, Las Vegas, NV, USA, June 27--30, 2016, pages 2414--2423. IEEE Computer Society, 2016.
[13]
Fujun Luan, Sylvain Paris, Eli Shechtman, and Kavita Bala. Deep photo style transfer. In 2017 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, Honolulu, HI, USA, July 21--26, 2017, pages 6997--7005. IEEE Computer Society, 2017.
[14]
Yongcheng Jing, Yezhou Yang, Zunlei Feng, Jingwen Ye, Yizhou Yu, and Mingli Song. Neural style transfer: A review. IEEE Transactions on Visualization and Computer Graphics, 26(11):3365--3385, 2020.
[15]
Andrea Stocco, Brian Pulfer, and Paolo Tonella. Mind the gap! a study on the transferability of virtual versus physical-world testing of autonomous driving systems. IEEE Transactions on Software Engineering, 49(4):1928--1940, 2023.
[16]
Matteo Biagiola, Andrea Stocco, Vincenzo Riccio, and Paolo Tonella. Two is better than one: digital siblings to improve autonomous driving testing. Empir. Softw. Eng., 29(4):72, 2024.
[17]
Stefano Carlo Lambertenghi and Andrea Stocco. Assessing quality metrics for neural reality gap input mitigation in autonomous driving testing. In 17th IEEE International Conference on Software Testing, Verification and Validation (ICST 2024), 2024. To appear. Available at https://arxiv.org/abs/2404.18577.
[18]
Kexin Pei, Yinzhi Cao, Junfeng Yang, and Suman Jana. Deepxplore: automated whitebox testing of deep learning systems. Commun. ACM, 62(11):137--145, 2019.
[19]
Jinhan Kim, Robert Feldt, and Shin Yoo. Evaluating surprise adequacy for deep learning system testing. ACM Trans. Softw. Eng. Methodol., 32(2):42:1--42:29, 2023.
[20]
Alex Kulesza and Ben Taskar. Determinantal Point Processes for Machine Learning. Now Publishers Inc., Hanover, MA, USA, 2012.
[21]
Mohammed Oualid Attaoui, Hazem M. Fahmy, Fabrizio Pastore, and Lionel C. Briand. Supporting safety analysis of image-processing dnns through clustering-based approaches. ACM Trans. Softw. Eng. Methodol., 33(5):130:1--130:48, 2024.
[22]
Udacity Jungle Dataset. https://www.kaggle.com/datasets/andy8744/udacity-self-driving-car-behavioural-cloning, 2023. [Online; accessed 29-November-2023].
[23]
KITTI Dataset. https://www.cvlibs.net/datasets/kitti/, 2023. [Online; accessed 29-November-2023].
[24]
vKITTI Dataset. https://europe.naverlabs.com/research/computer-vision/proxy-virtual-worlds-vkitti-1/, 2023. [Online; accessed 29-November-2023].
[25]
BeamNG.tech Website. https://beamng.tech, 2023. [Online; accessed 3-March-2023].
[26]
Github repo for cyber-physical systems testing tool competition. https://github.com/sbft-cps-tool-competition/cps-tool-competition, 2023. [Online; accessed 10-April-2023].
[27]
Florian Tambon, Foutse Khomh, and Giuliano Antoniol. Gist: Generated inputs sets transferability in deep learning. ACM Trans. Softw. Eng. Methodol., Jun 2024.
[28]
Tim Salimans, Ian J. Goodfellow, Wojciech Zaremba, Vicki Cheung, Alec Radford, and Xi Chen. Improved techniques for training gans. In Daniel D. Lee, Masashi Sugiyama, Ulrike von Luxburg, Isabelle Guyon, and Roman Garnett, editors, Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, December 5--10, 2016, Barcelona, Spain, pages 2226--2234, 2016.
[29]
Konstantin Shmelkov, Cordelia Schmid, and Karteek Alahari. How good is my gan? In Vittorio Ferrari, Martial Hebert, Cristian Sminchisescu, and Yair Weiss, editors, Computer Vision - ECCV 2018 - 15th European Conference, Munich, Germany, September 8--14, 2018, Proceedings, Part II, volume 11206 of Lecture Notes in Computer Science, pages 218--234. Springer, 2018.
[30]
Karen Simonyan and Andrew Zisserman. Very deep convolutional networks for large-scale image recognition. In Yoshua Bengio and Yann LeCun, editors, 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7--9, 2015, Conference Track Proceedings, 2015. Available at https://arxiv.org/abs/1409.1556.
[31]
Quoc V. Le, Marc'Aurelio Ranzato, Rajat Monga, Matthieu Devin, Greg Corrado, Kai Chen, Jeffrey Dean, and Andrew Y. Ng. Building high-level features using large scale unsupervised learning. In Proceedings of the 29th International Conference on Machine Learning, ICML 2012, Edinburgh, Scotland, UK, June 26 - July 1, 2012, 2012. Available at https://icml.cc/2012/papers/73.pdf.
[32]
Diederik P. Kingma and Max Welling. Auto-encoding variational bayes. In Yoshua Bengio and Yann LeCun, editors, 2nd International Conference on Learning Representations, ICLR 2014, Banff, AB, Canada, April 14--16, 2014, Conference Track Proceedings, 2014. Available at https://arxiv.org/abs/1312.6114.
[33]
Swaroopa Dola, Matthew B. Dwyer, and Mary Lou Soffa. Distribution-aware testing of neural networks using generative models. In 43rd IEEE/ACM International Conference on Software Engineering, ICSE 2021, Madrid, Spain, 22--30 May 2021, pages 226--237. IEEE, 2021.
[34]
Andrea Stocco, Michael Weiss, Marco Calzana, and Paolo Tonella. Misbehaviour prediction for autonomous driving systems. In Gregg Rothermel and Doo-Hwan Bae, editors, ICSE '20: 42nd International Conference on Software Engineering, Seoul, South Korea, 27 June - 19 July, 2020, pages 359--371. ACM, 2020.
[35]
Rafael Padilla, Sergio L. Netto, and Eduardo A. B. da Silva. A survey on performance metrics for object-detection algorithms. In 2020 International Conference on Systems, Signals and Image Processing (IWSSIP), pages 237--242, 2020.
[36]
Coco challenge. https://cocodataset.org/#home, 2023. Accessed: 2023-12-05.
[37]
Matteo Biagiola, Stefan Klikovits, Jarkko Peltomäki, and Vincenzo Riccio. SBFT tool competition 2023 - cyber-physical systems track. In IEEE/ACM International Workshop on Search-Based and Fuzz Testing, SBFT@ICSE 2023, Melbourne, Australia, May 14, 2023, pages 45--48. IEEE, 2023.
[38]
Paolo Arcaini and Ahmet Cetinkaya. CRAG at the SBFT 2023 tool competition - cyber-physical systems track. In IEEE/ACM International Workshop on Search-Based and Fuzz Testing, SBFT@ICSE 2023, Melbourne, Australia, May 14, 2023, pages 41--42. IEEE, 2023.
[39]
Vincenzo Riccio and Paolo Tonella. Model-based exploration of the frontier of behaviours for deep learning system testing. In Prem Devanbu, Myra B. Cohen, and Thomas Zimmermann, editors, ESEC/FSE '20: 28th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, Virtual Event, USA, November 8--13, 2020, pages 876--888. ACM, 2020.
[40]
Trevor Hastie, Robert Tibshirani, and Jerome Friedman. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer, New York, NY, 2nd edition, 2009.
[41]
Unity game engine. https://unity.com, 2023. Accessed: 2023-12-05.
[42]
Nvidia dave2. https://developer.nvidia.com/blog/deep-learning-self-driving-cars/, 2023. Accessed: 2023-12-05.
[43]
Udacity self-driving challenge 2. https://github.com/udacity/self-driving-car/tree/master/challenges/challenge-2, 2023. Accessed: 2023-12-05.
[44]
Yolov5. https://github.com/ultralytics/yolov5, 2023. Accessed: 2023-12-05.
[45]
Chitta Ranjan. Understanding Deep Learning: Application in Rare Event Prediction. Connaissance Publishing, Dec 2020. URL: www.understandingdeeplearning.com.
[46]
Xavier Glorot and Yoshua Bengio. Understanding the difficulty of training deep feedforward neural networks. In Yee Whye Teh and D. Mike Titterington, editors, Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, AISTATS 2010, Chia Laguna Resort, Sardinia, Italy, May 13--15, 2010, volume 9 of JMLR Proceedings, pages 249--256. JMLR.org, 2010.
[47]
Naive Random Test Generator. https://github.com/sbft-cps-tool-competition/cps-tool-competition/tree/main/sample_test_generators, 2023. [Online; accessed 09-May-2024].
[48]
Leland McInnes and John Healy. Accelerated hierarchical density based clustering. In 2017 IEEE International Conference on Data Mining Workshops (ICDMW), pages 33--42, 2017.
[49]
Fitash Ul Haq, Donghwan Shin, Shiva Nejati, and Lionel Claude Briand. Can offline testing of deep neural networks replace their online testing? Empir. Softw. Eng., 26(5):90, 2021.
[50]
BeamNG AI. https://beamngpy.readthedocs.io/en/latest/beamngpy.html#beamngpy.api.vehicle.AIApi, 2023. [Online; accessed 09-May-2024].
[51]
Richard C. Dorf. Modern Control Systems. Pearson, 13th edition, 2016.
[52]
Vincenzo Riccio and Paolo Tonella. When and why test generators for deep learning produce invalid inputs: an empirical study. In 45th IEEE/ACM International Conference on Software Engineering, ICSE 2023, Melbourne, Australia, May 14--20, 2023, pages 1161--1173. IEEE, 2023.
[53]
Fitash Ul Haq, Donghwan Shin, Shiva Nejati, and Lionel C. Briand. Comparing offline and online testing of deep neural networks: An autonomous car case study. In 13th IEEE International Conference on Software Testing, Validation and Verification, ICST 2020, Porto, Portugal, October 24--28, 2020, pages 85--95. IEEE, 2020.
[54]
Yuchi Tian, Kexin Pei, Suman Jana, and Baishakhi Ray. Deeptest: automated testing of deep-neural-network-driven autonomous cars. In Michel Chaudron, Ivica Crnkovic, Marsha Chechik, and Mark Harman, editors, Proceedings of the 40th International Conference on Software Engineering, ICSE 2018, Gothenburg, Sweden, May 27 - June 03, 2018, pages 303--314. ACM, 2018.
[55]
Yolo-ret: Towards high accuracy real-time object detection on edge gpus. https://developer.nvidia.com/embedded/community/jetson-projects/yolo_ret, 2023. Accessed: 2023-12-05.
[56]
Deep Learning for Object Detection with DIGITS. https://developer.nvidia.com/blog/deep-learning-object-detection-digits/, 2023. [Online; accessed 09-May-2024].
[57]
Accelerating Large-Scale Object Detection with TensorRT. https://developer.nvidia.com/blog/large-scale-object-detection-tensorrt/, 2023. [Online; accessed 09-May-2024].
[58]
End-to-End Deep Learning for Self-Driving Cars. https://developer.nvidia.com/blog/deep-learning-self-driving-cars/, 2023. [Online; accessed 09-May-2024].
[59]
Guannan Lou, Yao Deng, Xi Zheng, Mengshi Zhang, and Tianyi Zhang. Testing of autonomous driving systems: Where are we and where should we go? In Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, ESEC/FSE 2022, page 31--43, New York, NY, USA, 2022. Association for Computing Machinery.
[60]
Apollo Platform. https://github.com/ApolloAuto/apollo, 2023. [Online; accessed 09-May-2024].
[61]
Autoware: Open-source software for urban autonomous driving. https://github.com/autowarefoundation/autoware?tab=readme-ov-file, 2023. [Online; accessed 09-May-2024].
[62]
Alexey Dosovitskiy, German Ros, Felipe Codevilla, Antonio Lopez, and Vladlen Koltun. CARLA: An open urban driving simulator. In Proceedings of the 1st Annual Conference on Robot Learning, pages 1--16, 2017.
[63]
Jean Pierre Allamaa, Panagiotis Patrinos, Herman Van der Auweraer, and Tong Duy Son. Sim2real for autonomous vehicle control using executable digital twin. IFAC-PapersOnLine, 55(24):385--391, 2022.
[64]
Afsoon Afzal, Deborah S. Katz, Claire Le Goues, and Christopher Steven Timperley. Simulation for robotics test automation: Developer perspectives. In 14th IEEE Conference on Software Testing, Verification and Validation, ICST 2021, Porto de Galinhas, Brazil, April 12--16, 2021, pages 263--274. IEEE, 2021.
[65]
Markus Borg, Raja Ben Abdessalem, Shiva Nejati, François-Xavier Jegeden, and Donghwan Shin. Digital twins are not monozygotic - cross-replicating ADAS testing in two industry-grade automotive simulators. In 14th IEEE Conference on Software Testing, Verification and Validation, ICST 2021, Porto de Galinhas, Brazil, April 12--16, 2021, pages 383--393. IEEE, 2021.
[66]
Junlin Han, Mehrdad Shoeiby, Lars Petersson, and Mohammad Ali Armin. Dual contrastive learning for unsupervised image-to-image translation. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pages 746--755, 2021.
[67]
Mengshi Zhang, Yuqun Zhang, Lingming Zhang, Cong Liu, and Sarfraz Khurshid. Deeproad: Gan-based metamorphic testing and input validation framework for autonomous driving systems. In Marianne Huchard, Christian Kästner, and Gordon Fraser, editors, Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering, ASE 2018, Montpellier, France, September 3--7, 2018, pages 132--142. ACM, 2018.
[68]
Husheng Zhou, Wei Li, Zelun Kong, Junfeng Guo, Yuqun Zhang, Bei Yu, Lingming Zhang, and Cong Liu. Deepbillboard: systematic physical-world testing of autonomous driving systems. In Gregg Rothermel and Doo-Hwan Bae, editors, ICSE '20: 42nd International Conference on Software Engineering, Seoul, South Korea, 27 June - 19 July, 2020, pages 347--358. ACM, 2020.
[69]
Felipe Codevilla, Antonio M Lopez, Vladlen Koltun, and Alexey Dosovitskiy. On offline evaluation of vision-based driving models. In Proceedings of the European Conference on Computer Vision (ECCV), pages 236--251, 2018.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
ASE '24: Proceedings of the 39th IEEE/ACM International Conference on Automated Software Engineering
October 2024
2587 pages
ISBN:9798400712487
DOI:10.1145/3691620
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 27 October 2024

Check for updates

Badges

Author Tags

  1. image-to-image translation
  2. autonomous driving systems (ADS)
  3. deep learning
  4. generative adversarial networks
  5. online testing

Qualifiers

  • Research-article

Funding Sources

Conference

ASE '24
Sponsor:

Acceptance Rates

Overall Acceptance Rate 82 of 337 submissions, 24%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 47
    Total Downloads
  • Downloads (Last 12 months)47
  • Downloads (Last 6 weeks)14
Reflects downloads up to 14 Feb 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media