Computer Science > Computer Vision and Pattern Recognition

arXiv:2211.01311 (cs)

[Submitted on 2 Nov 2022 (v1), last revised 3 Nov 2022 (this version, v2)]

Title: Distill and Collect for Semi-Supervised Temporal Action Segmentation

Authors: Sovan Biswas, Anthony Rhodes, Ramesh Manuvinakurike, Giuseppe Raffa, Richard Beckwith

Abstract: Recent temporal action segmentation approaches need frame annotations during training to be effective. These annotations are very expensive and time-consuming to obtain, which limits the performance of such approaches when only limited annotated data is available. In contrast, we can easily collect a large corpus of in-domain unannotated videos by scavenging through the internet. Thus, this paper proposes an approach for the temporal action segmentation task that can simultaneously leverage knowledge from annotated and unannotated video sequences. Our approach uses multi-stream distillation that repeatedly refines and finally combines their frame predictions. Our model also predicts the action order, which is later used as a temporal constraint while estimating frame labels to counter the lack of supervision for unannotated videos. Finally, our evaluation of the proposed approach on two different datasets demonstrates its capability to achieve performance comparable to full supervision despite limited annotation.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2211.01311 [cs.CV]
	(or arXiv:2211.01311v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2211.01311

Submission history

From: Sovan Biswas
[v1] Wed, 2 Nov 2022 17:34:04 UTC (3,469 KB)
[v2] Thu, 3 Nov 2022 17:45:26 UTC (3,469 KB)
