research-article

Unsupervised Learning of 3D Model Reconstruction from Hand-Drawn Sketches

Authors:

Yi FangAuthors Info & Claims

MM '18: Proceedings of the 26th ACM international conference on Multimedia

Pages 1820 - 1828

https://doi.org/10.1145/3240508.3240699

Published: 15 October 2018 Publication History

Abstract

3D objects modeling has gained considerable attention in the visual computing community. We propose a low-cost unsupervised learning model for 3D objects reconstruction from hand-drawn sketches. Recent advancements in deep learning opened new opportunities to learn high-quality 3D objects from 2D sketches via supervised networks. However, the limited availability of labeled 2D hand-drawn sketches data (i.e. sketches and its corresponding 3D ground truth models) hinders the training process of supervised methods. In this paper, driven by a novel design of combination of retrieval and reconstruction process, we developed a learning paradigm to reconstruct 3D objects from hand-drawn sketches, without the use of well-labeled hand-drawn sketch data during the entire training process. Specifically, the paradigm begins with the training of an adaption network via autoencoder with adversarial loss, embedding the unpaired 2D rendered image domain with the hand-drawn sketch domain to a shared latent vector space. Then from the embedding latent space, for each testing sketch image, we retrieve a few (e.g. five) nearest neighbors from the training 3D data set as prior knowledge for a 3D Generative Adversarial Network. Our experiments verify our network's robust and superior performance in handling 3D volumetric object generation from single hand-drawn sketch without requiring any 3D ground truth labels.

References

[1]

Martin Arjovsky, Soumith Chintala, and Léon Bottou. 2017. Wasserstein gan. arXiv preprint arXiv:1701.07875 (2017).

[2]

Song Bai, Xiang Bai, Zhichao Zhou, Zhaoxiang Zhang, and Longin Jan Latecki. 2016. Gift: A real-time and scalable 3d shape search engine. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 5023--5032.

[3]

David Berthelot, Tom Schumm, and Luke Metz. 2017. BEGAN: Boundary Equilibrium Generative Adversarial Networks. arXiv preprint arXiv:1703.10717 (2017).

[4]

Angel X Chang, Thomas Funkhouser, Leonidas Guibas, Pat Hanrahan, Qixing Huang, Zimo Li, Silvio Savarese, Manolis Savva, Shuran Song, Hao Su, et al. 2015. Shapenet: An information-rich 3d model repository. arXiv preprint arXiv:1512.03012 (2015).

[5]

Christopher B Choy, Danfei Xu, JunYoung Gwak, Kevin Chen, and Silvio Savarese. 2016. 3d-r2n2: A unified approach for single and multi-view 3d object reconstruction. In European Conference on Computer Vision. Springer, 628--644.

[6]

Frederic Cordier, Hyewon Seo, Jinho Park, and Jun Yong Noh. 2011. Sketching of mirror-symmetric shapes. IEEE Transactions on Visualization and Computer Graphics 17, 11 (2011), 1650--1662.

Digital Library

[7]

Johanna Delanoy, Adrien Bousseau, Mathieu Aubry, Phillip Isola, and Alyosha Efros. 2017. What You Sketch Is What You Get: 3D Sketching using Multi-View Deep Volumetric Prediction. arXiv preprint arXiv:1707.08390 (2017).

Digital Library

[8]

Mathias Eitz, Kristian Hildebrand, Tamy Boubekeur, and Marc Alexa. 2010. Sketch-based 3D shape retrieval. In ACM SIGGRAPH 2010 Talks. ACM, 5.

Digital Library

[9]

Yotam Gingold, Takeo Igarashi, and Denis Zorin. 2009. Structured annotations for 2D-to-3D modeling. In ACM Transactions on Graphics (TOG), Vol. 28. ACM, 148.

Digital Library

[10]

Rohit Girdhar, David F Fouhey, Mikel Rodriguez, and Abhinav Gupta. 2016. Learning a predictable and generative vector representation for objects. In European Conference on Computer Vision. Springer, 484--499.

[11]

Bill Green. 2002. Canny edge detection tutorial. Internet: http://www.scribd.com/doc/40036113/Canny-Edge-Detection-Tutorial (2002).

[12]

Xiaoguang Han, Chang Gao, and Yizhou Yu. 2017. DeepSketch2Face: A Deep Learning Based Sketching System for 3D Face and Caricature Modeling. arXiv preprint arXiv:1706.02042 (2017).

Digital Library

[13]

Takeo Igarashi, Satoshi Matsuoka, and Hidehiko Tanaka. 2007. Teddy: a sketching interface for 3D freeform design. In Acm siggraph 2007 courses. ACM, 21.

Digital Library

[14]

Sergey Ioffe and Christian Szegedy. 2015. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In International Conference on Machine Learning. 448--456.

Digital Library

[15]

Diederik P Kingma and Max Welling. 2013. Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114 (2013).

[16]

Guillaume Lample, Neil Zeghidour, Nicolas Usunier, Antoine Bordes, Ludovic Denoyer, and Marc'Aurelio Ranzato. 2017. Fader Networks: Manipulating Images by Sliding Attributes. arXiv preprint arXiv:1706.00409 (2017).

[17]

Bo Li, Yijuan Lu, Afzal Godil, Tobias Schreck, Masaki Aono, Henry Johan, Jose M Saavedra, and Shoki Tashiro. 2013. SHREC13 track: large scale sketch-based 3D shape retrieval.

[18]

Bo Li, Yijuan Lu, Chunyuan Li, Afzal Godil, Tobias Schreck, Masaki Aono, Martin Burtscher, Hongbo Fu, Takahiko Furuya, Henry Johan, et al. 2014. Shrec14 track: extended large scale sketch-based 3D shape retrieval. In Eurographics workshop on 3D object retrieval, Vol. 2014.

Digital Library

[19]

Hod Lipson and Moshe Shpitalni. 2000. Conceptual design and analysis by sketching. AI EDAM 14, 5 (2000), 391--401.

Digital Library

[20]

Hod Lipson and Moshe Shpitalni. 2007. Optimization-based reconstruction of a 3D object from a single freehand line drawing. In ACM SIGGRAPH 2007 courses. ACM, 45.

Digital Library

[21]

Jerry Liu, Fisher Yu, and Thomas Funkhouser. 2017. Interactive 3D modeling with a generative adversarial network. arXiv preprint arXiv:1706.05170 (2017).

[22]

Ziwei Liu, Ping Luo, Xiaogang Wang, and Xiaoou Tang. 2015. Deep learning face attributes in the wild. In Proceedings of the IEEE International Conference on Computer Vision. 3730--3738.

Digital Library

[23]

Zhaoliang Lun, Matheus Gadelha, Evangelos Kalogerakis, Subhransu Maji, and Rui Wang. 2017. 3D shape reconstruction from sketches via multi-view convolutional networks. arXiv preprint arXiv:1707.06375 (2017).

[24]

Jitendra Malik and Dror Maydan. 1989. Recovering three-dimensional shape from a single image of curved objects. IEEE Transactions on Pattern Analysis and Machine Intelligence 11, 6 (1989), 555--566.

Digital Library

[25]

Daniel Maturana and Sebastian Scherer. 2015. Voxnet: A 3d convolutional neural network for real-time object recognition. In Intelligent Robots and Systems (IROS), 2015 IEEE/RSJ International Conference on. IEEE, 922--928.

Digital Library

[26]

Alec Radford, Luke Metz, and Soumith Chintala. 2015. Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv preprint arXiv:1511.06434 (2015).

[27]

Ryan Schmidt, Azam Khan, Karan Singh, and Gord Kurtenbach. 2009. Analytic drawing of 3D scaffolds. In ACM Transactions on Graphics (TOG), Vol. 28. ACM, 149.

Digital Library

[28]

Cloud Shao, Adrien Bousseau, Alla Sheffer, and Karan Singh. 2012. CrossShade: shading concept sketches using cross-section curves. ACM Transactions on Graphics 31, 4 (2012).

Digital Library

[29]

Abhishek Sharma, Oliver Grau, and Mario Fritz. 2016. Vconv-dae: Deep volumetric shape learning without object labels. In Computer Vision--ECCV 2016 Workshops. Springer, 236--250.

[30]

Hang Su, Subhransu Maji, Evangelos Kalogerakis, and Erik Learned-Miller. 2015. Multi-view convolutional neural networks for 3d shape recognition. In Proceedings of the IEEE international conference on computer vision. 945--953.

Digital Library

[31]

Daniel Sykora, Ladislav Kavan, Martin Čadík, Ondřej Jamriska, Alec Jacobson, Brian Whited, Maryann Simmons, and Olga Sorkine-Hornung. 2014. Ink-and-ray: Bas-relief meshes for adding global illumination effects to hand-drawn characters. ACM Transactions on Graphics (TOG) 33, 2 (2014), 16.

Digital Library

[32]

Jiajun Wu, Chengkai Zhang, Tianfan Xue, William T Freeman, and Joshua B Tenenbaum. 2016. Learning a probabilistic latent space of object shapes via 3d generative-adversarial modeling. In Advances in Neural Information Processing Systems. 82--90.

Digital Library

[33]

Zhirong Wu, Shuran Song, Aditya Khosla, Fisher Yu, Linguang Zhang, Xiaoou Tang, and Jianxiong Xiao. 2015. 3d shapenets: A deep representation for volumetric shapes. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1912--1920.

[34]

Bing Xu, Naiyan Wang, Tianqi Chen, and Mu Li. 2015. Empirical evaluation of rectified activations in convolutional network. arXiv preprint arXiv:1505.00853 (2015).

[35]

Fisher Yu, Ari Seff, Yinda Zhang, Shuran Song, Thomas Funkhouser, and Jianxiong Xiao. 2015. Lsun: Construction of a large-scale image dataset using deep learning with hands in the loop. arXiv preprint arXiv:1506.03365 (2015).

[36]

Jun-Yan Zhu, Taesung Park, Phillip Isola, and Alexei A Efros. 2017. Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks. arXiv preprint arXiv:1703.10593 (2017).

Cited By

Wang FSheng JJiang KZhang ZZheng JZhao B(2024)Single Free-Hand Sketch Guided Free-Form Deformation For 3D Shape Generation2024 IEEE International Conference on Multimedia and Expo (ICME)10.1109/ICME57554.2024.10687931(1-6)Online publication date: 15-Jul-2024
https://doi.org/10.1109/ICME57554.2024.10687931
Hao YHuang HYuan SFang Y(2024)Meta-Learning 3D Shape Segmentation Functions2024 10th International Conference on Automation, Robotics and Applications (ICARA)10.1109/ICARA60736.2024.10553035(516-520)Online publication date: 22-Feb-2024
https://doi.org/10.1109/ICARA60736.2024.10553035
Wang XWei HFan XZhao D(2024)Hyper-MD: Mesh Denoising with Customized Parameters Aware of Noise Intensity and Geometric Characteristics2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR52733.2024.00445(4651-4660)Online publication date: 16-Jun-2024
https://doi.org/10.1109/CVPR52733.2024.00445
Show More Cited By

Index Terms

Unsupervised Learning of 3D Model Reconstruction from Hand-Drawn Sketches
1. Computing methodologies
  1. Machine learning
    1. Learning paradigms
      1. Unsupervised learning

Recommendations

3D Shape Reconstruction from Free-Hand Sketches
Computer Vision – ECCV 2022 Workshops
Abstract
Sketches are arguably the most abstract 2D representations of real-world objects. Although a sketch usually has geometrical distortion and lacks visual cues, humans can effortlessly envision a 3D object from it. This suggests that sketches encode ...
Improve Deep Learning with Unsupervised Objective
Neural Information Processing
Abstract
We propose a novel approach capable of embedding the unsupervised objective into hidden layers of the deep neural network (DNN) for preserving important unsupervised information. To this end, we exploit a very simple yet effective unsupervised ...
Enhancing Sparse Retrieval via Unsupervised Learning
SIGIR-AP '23: Proceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region

Recent work has shown that neural retrieval models excel at text ranking tasks in a supervised setting when given large amounts of manually labeled training data. However, it remains an open question how to train unsupervised retrieval models that are ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences

MM '18: Proceedings of the 26th ACM international conference on Multimedia

October 2018

2167 pages

ISBN:9781450356657

DOI:10.1145/3240508

General Chairs:
Susanne Boll
University of Oldenburg, Germany
,
Kyoung Mu Lee
Seoul National University, Korea
,
Jiebo Luo
University of Rochester, USA
,
Wenwu Zhu
Tsinghua University, China
,
Program Chairs:
Hyeran Byun
Yonsei University, Korea
,
Chang Wen Chen
State Univ. Of New York at Buffalo, USA
,
Rainer Lienhart
University of Augsburg, Germany
,
Tao Mei
JD AI, China

Copyright © 2018 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 15 October 2018

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

MM '18

Sponsor:

SIGMM

MM '18: ACM Multimedia Conference

October 22 - 26, 2018

Seoul, Republic of Korea

Acceptance Rates

MM '18 Paper Acceptance Rate 209 of 757 submissions, 28%;

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

25
Total Citations
View Citations
889
Total Downloads

Downloads (Last 12 months)73
Downloads (Last 6 weeks)9

Reflects downloads up to 13 Nov 2024

Other Metrics

View Author Metrics

Citations

Cited By

Wang FSheng JJiang KZhang ZZheng JZhao B(2024)Single Free-Hand Sketch Guided Free-Form Deformation For 3D Shape Generation2024 IEEE International Conference on Multimedia and Expo (ICME)10.1109/ICME57554.2024.10687931(1-6)Online publication date: 15-Jul-2024
https://doi.org/10.1109/ICME57554.2024.10687931
Hao YHuang HYuan SFang Y(2024)Meta-Learning 3D Shape Segmentation Functions2024 10th International Conference on Automation, Robotics and Applications (ICARA)10.1109/ICARA60736.2024.10553035(516-520)Online publication date: 22-Feb-2024
https://doi.org/10.1109/ICARA60736.2024.10553035
Wang XWei HFan XZhao D(2024)Hyper-MD: Mesh Denoising with Customized Parameters Aware of Noise Intensity and Geometric Characteristics2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR52733.2024.00445(4651-4660)Online publication date: 16-Jun-2024
https://doi.org/10.1109/CVPR52733.2024.00445
Poku-Agyemang KReiterer A(2023)3D Reconstruction from 2D Plans Exemplified by Bridge StructuresRemote Sensing10.3390/rs1503067715:3(677)Online publication date: 23-Jan-2023
https://doi.org/10.3390/rs15030677
Manfredi GCapece NErra UGruosso M(2023)TreeSketchNet: From Sketch to 3D Tree Parameters GenerationACM Transactions on Intelligent Systems and Technology10.1145/357983114:3(1-29)Online publication date: 24-Mar-2023
https://dl.acm.org/doi/10.1145/3579831
Deng ZLiu YPan HJabi WZhang JDeng B(2023)Sketch2PQ: Freeform Planar Quadrilateral Mesh Design via a Single SketchIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2022.317085329:9(3826-3839)Online publication date: 1-Sep-2023
https://doi.org/10.1109/TVCG.2022.3170853
Xu PHospedales TYin QSong YXiang TWang L(2023)Deep Learning for Free-Hand Sketch: A SurveyIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2022.314885345:1(285-312)Online publication date: 1-Jan-2023
https://doi.org/10.1109/TPAMI.2022.3148853
Chowdhury PBhunia ASain AKoley SXiang TSong Y(2023)Democratising 2D Sketch to 3D Shape Retrieval Through Pivoting2023 IEEE/CVF International Conference on Computer Vision (ICCV)10.1109/ICCV51070.2023.02127(23218-23229)Online publication date: 1-Oct-2023
https://doi.org/10.1109/ICCV51070.2023.02127
Jiang FMa JWebster CChiaradia AZhou YZhao ZZhang X(2023)Generative urban design: A systematic review on problem formulation, design generation, and decision-makingProgress in Planning10.1016/j.progress.2023.100795(100795)Online publication date: Jul-2023
https://doi.org/10.1016/j.progress.2023.100795
Liu HXu YChen F(2023)Sketch2Photo: Synthesizing photo-realistic images from sketches via global contextsEngineering Applications of Artificial Intelligence10.1016/j.engappai.2022.105608117(105608)Online publication date: Jan-2023
https://doi.org/10.1016/j.engappai.2022.105608
Show More Cited By

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents