Implicitly using Human Skeleton in Self-supervised Learning: Influence on Spatio-temporal Puzzle Solving and on Video Action Recognition

Mathieu Riand; Laurent Dollé; Patrick Le Callet

doi:10.5220/0010689500003061

Communication Dans Un Congrès Année : 2021

Implicitly using Human Skeleton in Self-supervised Learning: Influence on Spatio-temporal Puzzle Solving and on Video Action Recognition

(1, 2) , (2) , (1)

1
2

Mathieu Riand

Fonction : Auteur
PersonId : 1328861
IdHAL : mathieu-riand

Image Perception Interaction

CEA Tech Pays-de-la-Loire

Laurent Dollé

Fonction : Auteur
PersonId : 990346
IdHAL : ldollecea

CEA Tech Pays-de-la-Loire

Patrick Le Callet

Fonction : Auteur
PersonId : 15969
IdHAL : patrick-le-callet
ORCID : 0000-0002-2143-7063
IdRef : 060370068

Image Perception Interaction

Résumé

In this paper we studied the influence of adding skeleton data on top of human actions videos when performing self-supervised learning and action recognition. We show that adding this information without additional constraints actually hurts the accuracy of the network; we argue that the added skeleton is not considered by the network and seen as a noise masking part of the natural image. We bring first results on puzzle solving and video action recognition to support this hypothesis.

Mots clés

Self-supervised Learning Siamese Network Skeleton Keypoints Action Recognition Few-shot Learning

Domaines

Intelligence artificielle [cs.AI]

Fichier principal

106895.pdf (1.29 Mo)

Origine : Fichiers éditeurs autorisés sur une archive ouverte
licence : CC BY - Paternité

Mathieu Riand : Connectez-vous pour contacter le contributeur

https://hal.science/hal-03946524

Soumis le : jeudi 19 janvier 2023-11:03:06

Dernière modification le : mardi 23 avril 2024-10:18:03

Archivage à long terme le : jeudi 20 avril 2023-18:27:35

Dates et versions

hal-03946524 , version 1 (19-01-2023)

Licence

Paternité

Identifiants

HAL Id : hal-03946524 , version 1
DOI : 10.5220/0010689500003061

Citer

Mathieu Riand, Laurent Dollé, Patrick Le Callet. Implicitly using Human Skeleton in Self-supervised Learning: Influence on Spatio-temporal Puzzle Solving and on Video Action Recognition. ROBOVIS 2021 : 2nd International Conference on Robotics, Computer Vision and Intelligent Systems, Oct 2021, Online streaming, France. ⟨10.5220/0010689500003061⟩. ⟨hal-03946524⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CEA INSTITUT-TELECOM CNRS INRIA EC-NANTES UNAM DRT LS2N LS2N-IPI NANTES-UNIVERSITE

32 Consultations

14 Téléchargements

Implicitly using Human Skeleton in Self-supervised Learning: Influence on Spatio-temporal Puzzle Solving and on Video Action Recognition

Résumé

Mots clés

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Altmetric

Partager