Computer Science > Computer Vision and Pattern Recognition

arXiv:2308.09525 (cs)

[Submitted on 18 Aug 2023]

Title: Improving 3D Pose Estimation for Sign Language

Authors: Maksym Ivashechkin, Oscar Mendez, Richard Bowden


Abstract: This work addresses 3D human pose reconstruction from single images. We present a method that combines Forward Kinematics (FK) with neural networks to ensure a fast and valid prediction of 3D pose. Pose is represented as a hierarchical tree/graph whose nodes correspond to human joints and model their physical limits. Given a 2D detection of keypoints in the image, we lift the skeleton to 3D using neural networks that predict both the joint rotations and the bone lengths. These predictions are then combined with skeletal constraints using an FK layer implemented as a network layer in PyTorch. The result is a fast and accurate approach to the estimation of 3D skeletal pose. Through quantitative and qualitative evaluation, we demonstrate that the method is significantly more accurate than MediaPipe in terms of both per-joint positional error and visual appearance. Furthermore, we demonstrate generalization across different datasets. The PyTorch implementation runs in 100 to 200 milliseconds per image (including CNN detection) using CPU only.
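To make the forward-kinematics step concrete, the following is a minimal sketch of how per-joint rotations and bone lengths can be composed along a skeleton tree to produce 3D joint positions. It is not the authors' PyTorch layer: it uses plain Python, it does not model joint limits, and every name in it (`forward_kinematics`, `rot_z`, `parents`, `rest_directions`) is a hypothetical choice made for illustration. It assumes joints are ordered so that each parent precedes its children, and that each bone has a unit rest direction expressed in its parent's frame.

```python
import math


def rot_z(theta):
    """3x3 rotation matrix about the z-axis (angle in radians)."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]


def matmul(a, b):
    """Product of two 3x3 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def matvec(m, v):
    """Product of a 3x3 matrix and a 3-vector."""
    return [sum(m[i][k] * v[k] for k in range(3)) for i in range(3)]


def forward_kinematics(parents, local_rotations, bone_lengths,
                       rest_directions, root=(0.0, 0.0, 0.0)):
    """Return one 3D position per joint.

    parents[j]         index of the parent of joint j, or -1 for the root
    local_rotations[j] 3x3 rotation of joint j relative to its parent
    bone_lengths[j]    length of the bone from parents[j] to j (unused for root)
    rest_directions[j] unit direction of that bone in the parent's frame
    """
    n = len(parents)
    global_rot = [None] * n
    positions = [None] * n
    for j in range(n):
        p = parents[j]
        if p < 0:
            # The root joint sits at the given origin.
            global_rot[j] = local_rotations[j]
            positions[j] = list(root)
        else:
            # The bone is scaled to its length, then rotated by the parent's
            # accumulated rotation and attached to the parent's position.
            bone = [bone_lengths[j] * d for d in rest_directions[j]]
            offset = matvec(global_rot[p], bone)
            positions[j] = [positions[p][k] + offset[k] for k in range(3)]
            # Accumulate rotations down the chain.
            global_rot[j] = matmul(global_rot[p], local_rotations[j])
    return positions


if __name__ == "__main__":
    # Hypothetical three-joint arm: shoulder -> elbow -> wrist.
    parents = [-1, 0, 1]
    rotations = [rot_z(math.pi / 2), rot_z(-math.pi / 2), rot_z(0.0)]
    lengths = [0.0, 0.30, 0.25]
    directions = [[0.0, 0.0, 0.0], [1.0, 0.0, 0.0], [1.0, 0.0, 0.0]]
    for joint in forward_kinematics(parents, rotations, lengths, directions):
        print([round(c, 3) for c in joint])
```

Because positions are built only from rotations and bone lengths, every output skeleton keeps consistent bone lengths by construction, which is the sense in which an FK step can help keep a predicted pose valid.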

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2308.09525 [cs.CV]
	(or arXiv:2308.09525v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2308.09525

Submission history

From: Maksym Ivashechkin
[v1] Fri, 18 Aug 2023 13:05:10 UTC (9,206 KB)
