DeepRoad: GAN-based metamorphic testing and input validation framework for autonomous driving systems
Proceedings of the 33rd ACM/IEEE international conference on automated …, 2018•dl.acm.org
While Deep Neural Networks (DNNs) have established the fundamentals of image-based
autonomous driving systems, they may exhibit erroneous behaviors and cause fatal
accidents. To address the safety issues in autonomous driving systems, a recent set of
testing techniques have been designed to automatically generate artificial driving scenes to
enrich test suite, eg, generating new input images transformed from the original ones.
However, these techniques are insufficient due to two limitations: first, many such synthetic …
autonomous driving systems, they may exhibit erroneous behaviors and cause fatal
accidents. To address the safety issues in autonomous driving systems, a recent set of
testing techniques have been designed to automatically generate artificial driving scenes to
enrich test suite, eg, generating new input images transformed from the original ones.
However, these techniques are insufficient due to two limitations: first, many such synthetic …
While Deep Neural Networks (DNNs) have established the fundamentals of image-based autonomous driving systems, they may exhibit erroneous behaviors and cause fatal accidents. To address the safety issues in autonomous driving systems, a recent set of testing techniques have been designed to automatically generate artificial driving scenes to enrich test suite, e.g., generating new input images transformed from the original ones. However, these techniques are insufficient due to two limitations: first, many such synthetic images often lack diversity of driving scenes, and hence compromise the resulting efficacy and reliability. Second, for machine-learning-based systems, a mismatch between training and application domain can dramatically degrade system accuracy, such that it is necessary to validate inputs for improving system robustness.
In this paper, we propose DeepRoad, an unsupervised DNN-based framework for automatically testing the consistency of DNN-based autonomous driving systems and online validation. First, DeepRoad automatically synthesizes large amounts of diverse driving scenes without using image transformation rules (e.g. scale, shear and rotation). In particular, DeepRoad is able to produce driving scenes with various weather conditions (including those with rather extreme conditions) by applying Generative Adversarial Networks (GANs) along with the corresponding real-world weather scenes. Second, DeepRoad utilizes metamorphic testing techniques to check the consistency of such systems using synthetic images. Third, DeepRoad validates input images for DNN-based systems by measuring the distance of the input and training images using their VGGNet features. We implement DeepRoad to test three well-recognized DNN-based autonomous driving systems in Udacity self-driving car challenge. The experimental results demonstrate that DeepRoad can detect thousands of inconsistent behaviors for these systems, and effectively validate input images to potentially enhance the system robustness as well.
ACM Digital Library