Computer Science > Computer Vision and Pattern Recognition

arXiv:2206.10066 (cs)

[Submitted on 21 Jun 2022]

Title:RendNet: Unified 2D/3D Recognizer With Latent Space Rendering

Authors:Ruoxi Shi, Xinyang Jiang, Caihua Shan, Yansen Wang, Dongsheng Li

View PDF

Abstract:Vector graphics (VG) have been ubiquitous in our daily life with vast applications in engineering, architecture, designs, etc. The VG recognition process of most existing methods is to first render the VG into raster graphics (RG) and then conduct recognition based on RG formats. However, this procedure discards the structure of geometries and loses the high resolution of VG. Recently, another category of algorithms is proposed to recognize directly from the original VG format. But it is affected by the topological errors that can be filtered out by RG rendering. Instead of looking at one format, it is a good solution to utilize the formats of VG and RG together to avoid these shortcomings. Besides, we argue that the VG-to-RG rendering process is essential to effectively combine VG and RG information. By specifying the rules on how to transfer VG primitives to RG pixels, the rendering process depicts the interaction and correlation between VG and RG. As a result, we propose RendNet, a unified architecture for recognition on both 2D and 3D scenarios, which considers both VG/RG representations and exploits their interaction by incorporating the VG-to-RG rasterization process. Experiments show that RendNet can achieve state-of-the-art performance on 2D and 3D object recognition tasks on various VG datasets.

Comments:	CVPR 2022 Oral
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2206.10066 [cs.CV]
	(or arXiv:2206.10066v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2206.10066

Submission history

From: Ruoxi Shi [view email]
[v1] Tue, 21 Jun 2022 01:23:11 UTC (5,033 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:RendNet: Unified 2D/3D Recognizer With Latent Space Rendering

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:RendNet: Unified 2D/3D Recognizer With Latent Space Rendering

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators