Computer Science > Computer Vision and Pattern Recognition

arXiv:2309.09301 (cs)

[Submitted on 17 Sep 2023 (v1), last revised 27 Sep 2023 (this version, v3)]

Title:RenderIH: A Large-scale Synthetic Dataset for 3D Interacting Hand Pose Estimation

Authors:Lijun Li, Linrui Tian, Xindi Zhang, Qi Wang, Bang Zhang, Mengyuan Liu, Chen Chen

View PDF

Abstract:The current interacting hand (IH) datasets are relatively simplistic in terms of background and texture, with hand joints being annotated by a machine annotator, which may result in inaccuracies, and the diversity of pose distribution is limited. However, the variability of background, pose distribution, and texture can greatly influence the generalization ability. Therefore, we present a large-scale synthetic dataset RenderIH for interacting hands with accurate and diverse pose annotations. The dataset contains 1M photo-realistic images with varied backgrounds, perspectives, and hand textures. To generate natural and diverse interacting poses, we propose a new pose optimization algorithm. Additionally, for better pose estimation accuracy, we introduce a transformer-based pose estimation network, TransHand, to leverage the correlation between interacting hands and verify the effectiveness of RenderIH in improving results. Our dataset is model-agnostic and can improve more accuracy of any hand pose estimation method in comparison to other real or synthetic datasets. Experiments have shown that pretraining on our synthetic data can significantly decrease the error from 6.76mm to 5.79mm, and our Transhand surpasses contemporary methods. Our dataset and code are available at this https URL.

Comments:	Accepted by ICCV 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2309.09301 [cs.CV]
	(or arXiv:2309.09301v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2309.09301

Submission history

From: Lijun Li [view email]
[v1] Sun, 17 Sep 2023 15:30:58 UTC (53,768 KB)
[v2] Tue, 19 Sep 2023 02:12:40 UTC (53,768 KB)
[v3] Wed, 27 Sep 2023 16:02:13 UTC (53,768 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:RenderIH: A Large-scale Synthetic Dataset for 3D Interacting Hand Pose Estimation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:RenderIH: A Large-scale Synthetic Dataset for 3D Interacting Hand Pose Estimation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators