Computer Science > Computer Vision and Pattern Recognition

arXiv:2312.06594 (cs)

[Submitted on 11 Dec 2023 (v1), last revised 23 Sep 2024 (this version, v2)]

Title:Mitigating Perspective Distortion-induced Shape Ambiguity in Image Crops

Authors:Aditya Prakash, Arjun Gupta, Saurabh Gupta

Abstract:Objects undergo varying amounts of perspective distortion as they move across a camera's field of view. Models for predicting 3D from a single image often work with crops around the object of interest and ignore the location of the object in the camera's field of view. We note that ignoring this location information further exaggerates the inherent ambiguity in making 3D inferences from 2D images and can prevent models from even fitting to the training data. To mitigate this ambiguity, we propose Intrinsics-Aware Positional Encoding (KPE), which incorporates information about the location of crops in the image and camera intrinsics. Experiments on three popular 3D-from-a-single-image benchmarks: depth prediction on NYU, 3D object detection on KITTI & nuScenes, and predicting 3D shapes of articulated objects on ARCTIC, show the benefits of KPE.

Comments:	ECCV 2024, Project Page: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2312.06594 [cs.CV]
	(or arXiv:2312.06594v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2312.06594

Submission history

From: Aditya Prakash [view email]
[v1] Mon, 11 Dec 2023 18:28:55 UTC (992 KB)
[v2] Mon, 23 Sep 2024 14:23:07 UTC (922 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Mitigating Perspective Distortion-induced Shape Ambiguity in Image Crops

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Mitigating Perspective Distortion-induced Shape Ambiguity in Image Crops

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators