Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3240508.3240699acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Unsupervised Learning of 3D Model Reconstruction from Hand-Drawn Sketches

Published: 15 October 2018 Publication History

Abstract

3D objects modeling has gained considerable attention in the visual computing community. We propose a low-cost unsupervised learning model for 3D objects reconstruction from hand-drawn sketches. Recent advancements in deep learning opened new opportunities to learn high-quality 3D objects from 2D sketches via supervised networks. However, the limited availability of labeled 2D hand-drawn sketches data (i.e. sketches and its corresponding 3D ground truth models) hinders the training process of supervised methods. In this paper, driven by a novel design of combination of retrieval and reconstruction process, we developed a learning paradigm to reconstruct 3D objects from hand-drawn sketches, without the use of well-labeled hand-drawn sketch data during the entire training process. Specifically, the paradigm begins with the training of an adaption network via autoencoder with adversarial loss, embedding the unpaired 2D rendered image domain with the hand-drawn sketch domain to a shared latent vector space. Then from the embedding latent space, for each testing sketch image, we retrieve a few (e.g. five) nearest neighbors from the training 3D data set as prior knowledge for a 3D Generative Adversarial Network. Our experiments verify our network's robust and superior performance in handling 3D volumetric object generation from single hand-drawn sketch without requiring any 3D ground truth labels.

References

[1]
Martin Arjovsky, Soumith Chintala, and Léon Bottou. 2017. Wasserstein gan. arXiv preprint arXiv:1701.07875 (2017).
[2]
Song Bai, Xiang Bai, Zhichao Zhou, Zhaoxiang Zhang, and Longin Jan Latecki. 2016. Gift: A real-time and scalable 3d shape search engine. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 5023--5032.
[3]
David Berthelot, Tom Schumm, and Luke Metz. 2017. BEGAN: Boundary Equilibrium Generative Adversarial Networks. arXiv preprint arXiv:1703.10717 (2017).
[4]
Angel X Chang, Thomas Funkhouser, Leonidas Guibas, Pat Hanrahan, Qixing Huang, Zimo Li, Silvio Savarese, Manolis Savva, Shuran Song, Hao Su, et al. 2015. Shapenet: An information-rich 3d model repository. arXiv preprint arXiv:1512.03012 (2015).
[5]
Christopher B Choy, Danfei Xu, JunYoung Gwak, Kevin Chen, and Silvio Savarese. 2016. 3d-r2n2: A unified approach for single and multi-view 3d object reconstruction. In European Conference on Computer Vision. Springer, 628--644.
[6]
Frederic Cordier, Hyewon Seo, Jinho Park, and Jun Yong Noh. 2011. Sketching of mirror-symmetric shapes. IEEE Transactions on Visualization and Computer Graphics 17, 11 (2011), 1650--1662.
[7]
Johanna Delanoy, Adrien Bousseau, Mathieu Aubry, Phillip Isola, and Alyosha Efros. 2017. What You Sketch Is What You Get: 3D Sketching using Multi-View Deep Volumetric Prediction. arXiv preprint arXiv:1707.08390 (2017).
[8]
Mathias Eitz, Kristian Hildebrand, Tamy Boubekeur, and Marc Alexa. 2010. Sketch-based 3D shape retrieval. In ACM SIGGRAPH 2010 Talks. ACM, 5.
[9]
Yotam Gingold, Takeo Igarashi, and Denis Zorin. 2009. Structured annotations for 2D-to-3D modeling. In ACM Transactions on Graphics (TOG), Vol. 28. ACM, 148.
[10]
Rohit Girdhar, David F Fouhey, Mikel Rodriguez, and Abhinav Gupta. 2016. Learning a predictable and generative vector representation for objects. In European Conference on Computer Vision. Springer, 484--499.
[11]
Bill Green. 2002. Canny edge detection tutorial. Internet: http://www.scribd.com/doc/40036113/Canny-Edge-Detection-Tutorial (2002).
[12]
Xiaoguang Han, Chang Gao, and Yizhou Yu. 2017. DeepSketch2Face: A Deep Learning Based Sketching System for 3D Face and Caricature Modeling. arXiv preprint arXiv:1706.02042 (2017).
[13]
Takeo Igarashi, Satoshi Matsuoka, and Hidehiko Tanaka. 2007. Teddy: a sketching interface for 3D freeform design. In Acm siggraph 2007 courses. ACM, 21.
[14]
Sergey Ioffe and Christian Szegedy. 2015. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In International Conference on Machine Learning. 448--456.
[15]
Diederik P Kingma and Max Welling. 2013. Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114 (2013).
[16]
Guillaume Lample, Neil Zeghidour, Nicolas Usunier, Antoine Bordes, Ludovic Denoyer, and Marc'Aurelio Ranzato. 2017. Fader Networks: Manipulating Images by Sliding Attributes. arXiv preprint arXiv:1706.00409 (2017).
[17]
Bo Li, Yijuan Lu, Afzal Godil, Tobias Schreck, Masaki Aono, Henry Johan, Jose M Saavedra, and Shoki Tashiro. 2013. SHREC13 track: large scale sketch-based 3D shape retrieval.
[18]
Bo Li, Yijuan Lu, Chunyuan Li, Afzal Godil, Tobias Schreck, Masaki Aono, Martin Burtscher, Hongbo Fu, Takahiko Furuya, Henry Johan, et al. 2014. Shrec14 track: extended large scale sketch-based 3D shape retrieval. In Eurographics workshop on 3D object retrieval, Vol. 2014.
[19]
Hod Lipson and Moshe Shpitalni. 2000. Conceptual design and analysis by sketching. AI EDAM 14, 5 (2000), 391--401.
[20]
Hod Lipson and Moshe Shpitalni. 2007. Optimization-based reconstruction of a 3D object from a single freehand line drawing. In ACM SIGGRAPH 2007 courses. ACM, 45.
[21]
Jerry Liu, Fisher Yu, and Thomas Funkhouser. 2017. Interactive 3D modeling with a generative adversarial network. arXiv preprint arXiv:1706.05170 (2017).
[22]
Ziwei Liu, Ping Luo, Xiaogang Wang, and Xiaoou Tang. 2015. Deep learning face attributes in the wild. In Proceedings of the IEEE International Conference on Computer Vision. 3730--3738.
[23]
Zhaoliang Lun, Matheus Gadelha, Evangelos Kalogerakis, Subhransu Maji, and Rui Wang. 2017. 3D shape reconstruction from sketches via multi-view convolutional networks. arXiv preprint arXiv:1707.06375 (2017).
[24]
Jitendra Malik and Dror Maydan. 1989. Recovering three-dimensional shape from a single image of curved objects. IEEE Transactions on Pattern Analysis and Machine Intelligence 11, 6 (1989), 555--566.
[25]
Daniel Maturana and Sebastian Scherer. 2015. Voxnet: A 3d convolutional neural network for real-time object recognition. In Intelligent Robots and Systems (IROS), 2015 IEEE/RSJ International Conference on. IEEE, 922--928.
[26]
Alec Radford, Luke Metz, and Soumith Chintala. 2015. Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv preprint arXiv:1511.06434 (2015).
[27]
Ryan Schmidt, Azam Khan, Karan Singh, and Gord Kurtenbach. 2009. Analytic drawing of 3D scaffolds. In ACM Transactions on Graphics (TOG), Vol. 28. ACM, 149.
[28]
Cloud Shao, Adrien Bousseau, Alla Sheffer, and Karan Singh. 2012. CrossShade: shading concept sketches using cross-section curves. ACM Transactions on Graphics 31, 4 (2012).
[29]
Abhishek Sharma, Oliver Grau, and Mario Fritz. 2016. Vconv-dae: Deep volumetric shape learning without object labels. In Computer Vision--ECCV 2016 Workshops. Springer, 236--250.
[30]
Hang Su, Subhransu Maji, Evangelos Kalogerakis, and Erik Learned-Miller. 2015. Multi-view convolutional neural networks for 3d shape recognition. In Proceedings of the IEEE international conference on computer vision. 945--953.
[31]
Daniel Sykora, Ladislav Kavan, Martin Čadík, Ondřej Jamriska, Alec Jacobson, Brian Whited, Maryann Simmons, and Olga Sorkine-Hornung. 2014. Ink-and-ray: Bas-relief meshes for adding global illumination effects to hand-drawn characters. ACM Transactions on Graphics (TOG) 33, 2 (2014), 16.
[32]
Jiajun Wu, Chengkai Zhang, Tianfan Xue, William T Freeman, and Joshua B Tenenbaum. 2016. Learning a probabilistic latent space of object shapes via 3d generative-adversarial modeling. In Advances in Neural Information Processing Systems. 82--90.
[33]
Zhirong Wu, Shuran Song, Aditya Khosla, Fisher Yu, Linguang Zhang, Xiaoou Tang, and Jianxiong Xiao. 2015. 3d shapenets: A deep representation for volumetric shapes. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1912--1920.
[34]
Bing Xu, Naiyan Wang, Tianqi Chen, and Mu Li. 2015. Empirical evaluation of rectified activations in convolutional network. arXiv preprint arXiv:1505.00853 (2015).
[35]
Fisher Yu, Ari Seff, Yinda Zhang, Shuran Song, Thomas Funkhouser, and Jianxiong Xiao. 2015. Lsun: Construction of a large-scale image dataset using deep learning with hands in the loop. arXiv preprint arXiv:1506.03365 (2015).
[36]
Jun-Yan Zhu, Taesung Park, Phillip Isola, and Alexei A Efros. 2017. Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks. arXiv preprint arXiv:1703.10593 (2017).

Cited By

View all
  • (2024)Single Free-Hand Sketch Guided Free-Form Deformation For 3D Shape Generation2024 IEEE International Conference on Multimedia and Expo (ICME)10.1109/ICME57554.2024.10687931(1-6)Online publication date: 15-Jul-2024
  • (2024)Meta-Learning 3D Shape Segmentation Functions2024 10th International Conference on Automation, Robotics and Applications (ICARA)10.1109/ICARA60736.2024.10553035(516-520)Online publication date: 22-Feb-2024
  • (2024)Hyper-MD: Mesh Denoising with Customized Parameters Aware of Noise Intensity and Geometric Characteristics2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR52733.2024.00445(4651-4660)Online publication date: 16-Jun-2024
  • Show More Cited By

Index Terms

  1. Unsupervised Learning of 3D Model Reconstruction from Hand-Drawn Sketches

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MM '18: Proceedings of the 26th ACM international conference on Multimedia
    October 2018
    2167 pages
    ISBN:9781450356657
    DOI:10.1145/3240508
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 15 October 2018

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. generative model
    2. sketch modeling
    3. unsupervised learning

    Qualifiers

    • Research-article

    Conference

    MM '18
    Sponsor:
    MM '18: ACM Multimedia Conference
    October 22 - 26, 2018
    Seoul, Republic of Korea

    Acceptance Rates

    MM '18 Paper Acceptance Rate 209 of 757 submissions, 28%;
    Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)73
    • Downloads (Last 6 weeks)9
    Reflects downloads up to 13 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Single Free-Hand Sketch Guided Free-Form Deformation For 3D Shape Generation2024 IEEE International Conference on Multimedia and Expo (ICME)10.1109/ICME57554.2024.10687931(1-6)Online publication date: 15-Jul-2024
    • (2024)Meta-Learning 3D Shape Segmentation Functions2024 10th International Conference on Automation, Robotics and Applications (ICARA)10.1109/ICARA60736.2024.10553035(516-520)Online publication date: 22-Feb-2024
    • (2024)Hyper-MD: Mesh Denoising with Customized Parameters Aware of Noise Intensity and Geometric Characteristics2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR52733.2024.00445(4651-4660)Online publication date: 16-Jun-2024
    • (2023)3D Reconstruction from 2D Plans Exemplified by Bridge StructuresRemote Sensing10.3390/rs1503067715:3(677)Online publication date: 23-Jan-2023
    • (2023)TreeSketchNet: From Sketch to 3D Tree Parameters GenerationACM Transactions on Intelligent Systems and Technology10.1145/357983114:3(1-29)Online publication date: 24-Mar-2023
    • (2023)Sketch2PQ: Freeform Planar Quadrilateral Mesh Design via a Single SketchIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2022.317085329:9(3826-3839)Online publication date: 1-Sep-2023
    • (2023)Deep Learning for Free-Hand Sketch: A SurveyIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2022.314885345:1(285-312)Online publication date: 1-Jan-2023
    • (2023)Democratising 2D Sketch to 3D Shape Retrieval Through Pivoting2023 IEEE/CVF International Conference on Computer Vision (ICCV)10.1109/ICCV51070.2023.02127(23218-23229)Online publication date: 1-Oct-2023
    • (2023)Generative urban design: A systematic review on problem formulation, design generation, and decision-makingProgress in Planning10.1016/j.progress.2023.100795(100795)Online publication date: Jul-2023
    • (2023)Sketch2Photo: Synthesizing photo-realistic images from sketches via global contextsEngineering Applications of Artificial Intelligence10.1016/j.engappai.2022.105608117(105608)Online publication date: Jan-2023
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media