Abstract
A vast amount of research has been conducted about deep learning and its applications in Computer Vision (CV). However, the application to project an object instance onto a real image or video in a semantically coherent manner, such that the projected object is indistinguishable from a real object, is only in its infancy. In our research, we aim to evaluate a generative model which is able to generate and place an object instance onto an image in a semantically coherent manner using a where and a what module; both of these employ Generative Adversarial Networks (GANs). Furthermore, we improve the shape generation by adding a classifier before the training data is used. Finally, we intend to increase the training stability by using an alternative training methodology and adjusting the Jenson-Shannon divergence to the Wasserstein distance. The implication of this work is the improved stability of an existing generative model, which inserts instances onto an image. Furthermore, we were also able to improve its performance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Gulrajani, I., Ahmed, F., Arjovsky, M., Dumoulin, V., Courville, A.: Improved Training of Wasserstein GANs. Curran Associates Inc. (2017)
de Hoog, J., Pepermans, M., Mercelis, S., Hellinckx, P.: Towards a scalable distributed real-time hybrid simulator for autonomous vehicles. In: Xhafa, F., Leu, F.-Y., Ficco, M., Yang, C.-T. (eds.) 3PGCIC 2018. LNDECT, vol. 24, pp. 447–456. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-02607-3_41
Harshvardhan, G.M., Mahendra, K.G., Manjusha, P., Siddharth, S.R.: A comprehensive survey and analysis of generative models in machine learning. Comput. Sci. Rev. 38, 100285 (2020)
Gui, J., Sun, Z., Wen, Y., Tao, D., Ye, J.: A review on generative adversarial networks: algorithms, theory, and applications. arXiv preprint arXiv:2001.06937 (2020)
Lee, D., Liu, S., Gu, J., Liu, M.-Y., Yang, M.-H., Kautz, J.: Context-aware synthesis and placement of object instances. Curran Associates Inc (2018)
Oussidi, A., Elhassouny, A.: Deep generative models: Survey. ISCV (2018)
Goodfellow, I., et al.: Generative adversarial nets. Curran Associates Inc. (2014)
Seunghoon, H., Dingdong, Y., Jongwook, C., Honglak, L.: Inferring semantic layout for hierarchical text-to-image synthesis. arXiv preprint arXiv:1801.05091 (2018)
Pan, Z., Yu, W., Yi, X., Khan, A., Yuan, F., Zheng, Y.: Recent progress on generative adversarial networks (GANs): a Survey. IEEE Access (2019)
Jaderberg, M., Simonyan, K., Zisserman, A., Kavukcuoglu, K.: Spatial transformer networks. Curran Associates Inc. (2015)
Balemans, D., De Boeck, Y., de Hoog, J., Anwar, A., Mercelis, S., Hellinckx, P.: Towards hybrid camera sensor simulation for autonomous vehicles. In: Barolli, L., Takizawa, M., Yoshihisa, T., Amato, F., Ikeda, M. (eds.) 3PGCIC 2020. LNNS, vol. 158, pp. 291–300. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-61105-7_29
Martin, A., Léon, B.: Towards principled methods for training generative adversarial networks. arXiv preprint arXiv:1701.04862 (2017)
Zhu, J.-Y., Park, T., Isola, P., Efros, A.A.: Unpaired image-to-image translation using cycle-consistent adversarial networks. ICCV (2017)
Martin, A., Soumith, C., Léon, B.: Wasserstein generative adversarial networks. PMLR (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Duym, J., Anwar, A., de Hoog, J., Mercelis, S., Hellinckx, P. (2023). Improving Context-Aware Synthesis and Placement of Object Instances. In: Barolli, L. (eds) Advances on P2P, Parallel, Grid, Cloud and Internet Computing. 3PGCIC 2022. Lecture Notes in Networks and Systems, vol 571. Springer, Cham. https://doi.org/10.1007/978-3-031-19945-5_29
Download citation
DOI: https://doi.org/10.1007/978-3-031-19945-5_29
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-19944-8
Online ISBN: 978-3-031-19945-5
eBook Packages: EngineeringEngineering (R0)