Computer Science > Computer Vision and Pattern Recognition

arXiv:2306.14709 (cs)

[Submitted on 26 Jun 2023]

Title:Self-supervised novel 2D view synthesis of large-scale scenes with efficient multi-scale voxel carving

Authors:Alexandra Budisteanu, Dragos Costea, Alina Marcu, Marius Leordeanu

View PDF

Abstract:The task of generating novel views of real scenes is increasingly important nowadays when AI models become able to create realistic new worlds. In many practical applications, it is important for novel view synthesis methods to stay grounded in the physical world as much as possible, while also being able to imagine it from previously unseen views. While most current methods are developed and tested in virtual environments with small scenes and no errors in pose and depth information, we push the boundaries to the real-world domain of large scales in the new context of UAVs. Our algorithmic contributions are two folds. First, we manage to stay anchored in the real 3D world, by introducing an efficient multi-scale voxel carving method, which is able to accommodate significant noises in pose, depth, and illumination variations, while being able to reconstruct the view of the world from drastically different poses at test time. Second, our final high-resolution output is efficiently self-trained on data automatically generated by the voxel carving module, which gives it the flexibility to adapt efficiently to any scene. We demonstrated the effectiveness of our method on highly complex and large-scale scenes in real environments while outperforming the current state-of-the-art. Our code is publicly available: this https URL.

Comments:	11 pages, 3 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
Cite as:	arXiv:2306.14709 [cs.CV]
	(or arXiv:2306.14709v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2306.14709

Submission history

From: Alina Marcu M.Sc [view email]
[v1] Mon, 26 Jun 2023 13:57:05 UTC (4,819 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Self-supervised novel 2D view synthesis of large-scale scenes with efficient multi-scale voxel carving

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Self-supervised novel 2D view synthesis of large-scale scenes with efficient multi-scale voxel carving

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators