Abstract
Nowadays the use of deep learning (DL) based systems is widely extended in several areas such as facial recognition, voice and audio processing or perception systems. The training process that must be performed for proper functionality requires a large amount of data with the required characteristics needed for the task to be executed. The process of obtaining new adequate training data is complex and tedious, therefore multiple techniques such as data augmentation or transfer learning have been developed in order to have a greater amount of knowledge in the network without the need to search for new data sources. The aim of this paper is to study the effect of the inclusion of knowledge from transfer learning in a 2D image detector trained with real world and synthetic data from multiple sources. The detector that is going to be trained will be focused on autonomous driving tasks, therefore we decide to use KITTI as the real world data source and our AD PerDevkit (based on CARLA) and Virtual-KITTI as the synthetic sources.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Geiger, A., Lenz, P., Urtasun, R.: Are we sready for autonomous driving? the kitti vision benchmark suite. In: Conference on Computer Vision and Pattern Recognition, CVPR (2012)
Sun, P., et al.: Scalability in perception for autonomous driving: Waymo open dataset. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR (2020)
Dosovitskiy, A., Ros, G., Codevilla, F., Lopez, A., Koltun, V.: CARLA: An open urban driving simulator. In: Proceedings of the 1st Annual Conference on Robot Learning, pp. 1–16 (2017)
Gaidon, A., Wang, Q., Cabon, Y., Vig, E.: Virtual worlds as proxy for multi-object tracking analysis. In: Proceedings of the IEEE conference on Computer Vision and Pattern Recognition, pp. 4340–4349 (2016)
Cabon, Y., Murray, N., Humenberger, M.: Virtual kitti 2 (2020)
Unity. https://unity.com/
Shorten, C., Khoshgoftaar, T.M.: A survey on image data augmentation for deep learning. J. Big Data 6(1), 1–48 (2019). https://doi.org/10.1186/s40537-019-0197-0
Weng, X., et al.: All-In-One Drive: A Large-Scale Comprehensive Perception Dataset with High-Density Long-Range Point Clouds (2021)
de la Peña, J., et al.: AD PerDevKit: An autonomous driving perception development kit using CARLA simulator and ROS - Accepted ITSC (2022)
Girshick, R.: Fast R-CNN (2015)
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: Towards real-time object detection with region proposal networks (2016)
Redmon, J., Farhadi, A.: YOLOv3: An incremental improvement. arXiv:1804.02767 (2018)
Bochkovskiy, A., Wang, C.Y., Liao, H.Y.M.: YOLOv4: Optimal speed and accuracy of object detection. arXiv:2004.10934 (2020)
Jocher, G., et al.: ultralytics/yolov5: v6.0 - yolov5 models, roboflow integration, tensorflow export, opencv dnn support. (2021). https://doi.org/10.5281/zenodo.5563715
Tan, M., Pang, R., Le, Q.V.: EfficientDet: Scalable and efficient object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 10781-10790 (2020)
Lin, T.-Y., et al.: Microsoft COCO: common objects in context. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8693, pp. 740–755. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10602-1_48
Acknowledgements
This work has been funded in part from the Spanish MICINN/FEDER through the Techs4AgeCar project (RTI2018-099263-B-C21) and from the RoboCity2030-DIH-CM project (P2018/NMT- 4331), funded by Programas de actividades I+D (CAM), cofunded by EU Structural Funds and Scholarship for Introduction to Research activity by University of Alcalá
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Antunes, M., Bergasa, L.M., Araluce, J., Gutiérrez, R., Arango, J.F., Ocaña, M. (2023). Including Transfer Learning and Synthetic Data in a Training Process of a 2D Object Detector for Autonomous Driving. In: Tardioli, D., Matellán, V., Heredia, G., Silva, M.F., Marques, L. (eds) ROBOT2022: Fifth Iberian Robotics Conference. ROBOT 2022. Lecture Notes in Networks and Systems, vol 590. Springer, Cham. https://doi.org/10.1007/978-3-031-21062-4_38
Download citation
DOI: https://doi.org/10.1007/978-3-031-21062-4_38
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-21061-7
Online ISBN: 978-3-031-21062-4
eBook Packages: EngineeringEngineering (R0)