Spatio-Temporal Deviation Calibration for Skeleton-Based Human Action Recognition

  • Gerard Marcos Freixas*
  • , Zunlei Feng*
  • , Kelvin Ting Zuo Han
  • , Cheng Jin*
  • , Jiacong Hu
  • , Jie Lei
  • , Xingjiao Wu
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Human Action Recognition (HAR) plays an important role in various applications such as video surveillance, human-computer interaction, and healthcare. In recent years, there has been increasing interest in using skeleton-based representations for HAR due to their robustness to changes in appearance and viewpoints. However, skeleton-based approaches suffer from the class similarity problem due to the high intraclass variability and low interclass separability, because of the inherent structure of the human skeleton. Graph Convolutional Networks (GCNs) have reached remarkable results, modeling spatio-temporal relationships of a skeleton sequence. Motivated by the class similarity problem and the lack of a GCN optimization tool for skeleton-based HAR context, we propose a simple gradient-based re-training pipeline applicable to any state-of-the-art GCN model that constrains low-confidence predictions, guiding the model towards the ground-truth categories. By focusing on the relevant frames and nodes of each category, certain incorrect patterns yielded by these low-confidence samples are ignored, leading to a notable optimization of the model. Experimental results demonstrate the flexibility and effectiveness of the proposed method, improving up to 3% the accuracy of mainstream GCN recognizers.

Original languageEnglish
Title of host publicationNeural Information Processing - 31st International Conference, ICONIP 2024, Proceedings
EditorsMufti Mahmud, Maryam Doborjeh, Kevin Wong, Andrew Chi Sing Leung, Zohreh Doborjeh, M. Tanveer
PublisherSpringer Science and Business Media Deutschland GmbH
Pages180-194
Number of pages15
ISBN (Print)9789819669622
DOIs
StatePublished - 2025
Externally publishedYes
Event31st International Conference on Neural Information Processing, ICONIP 2024 - Auckland, New Zealand
Duration: 2 Dec 20246 Dec 2024

Publication series

NameCommunications in Computer and Information Science
Volume2287 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference31st International Conference on Neural Information Processing, ICONIP 2024
Country/TerritoryNew Zealand
CityAuckland
Period2/12/246/12/24

Keywords

  • class similarity problem
  • graph convolutional network
  • skeleton-based human action recognition

Fingerprint

Dive into the research topics of 'Spatio-Temporal Deviation Calibration for Skeleton-Based Human Action Recognition'. Together they form a unique fingerprint.

Cite this