Zero-Shot Action Recognition with Knowledge Enhanced Generative Adversarial Networks

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Zero-Shot Action Recognition (ZSAR) aims to recognise action classes in videos that have never been seen during model training. In some approaches, ZSAR has been achieved by generating visual features for unseen classes based on the semantic information of the unseen class labels using generative adversarial networks (GANs). Therefore, the problem is converted to standard supervised learning since the unseen visual features are accessible. This approach alleviates the lack of labelled samples of unseen classes. In addition, objects appearing in the action instances could be used to create enriched semantics of action classes and therefore, increase the accuracy of ZSAR. In this paper, we consider using, in addition to the label, objects related to that action label. For example, the objects 'horse' and 'saddle' are highly related to the action 'Horse Riding' and these objects can bring additional semantic meaning. In this work, we aim to improve the GAN-based framework by incorporating object-based semantic information related to the class label with three approaches: replacing the class labels with objects, appending objects to the class, and averaging objects with the class. Then, we evaluate the performance using a subset of the popular dataset UCF101. Our experimental results demonstrate that our approach is valid since when including appropriate objects into the action classes, the baseline is improved by 4.93%.

Original languageEnglish
Title of host publicationIJCCI 2021 - Proceedings of the 13th International Joint Conference on Computational Intelligence
EditorsThomas Back, Christian Wagner, Jonathan Garibaldi, H. K. Lam, Marie Cottrell, Juan Julian Merelo, Kevin Warwick
PublisherScience and Technology Publications, Lda
Pages254-264
Number of pages11
ISBN (Electronic)9789897585340
DOIs
Publication statusPublished - 2021
Event13th International Joint Conference on Computational Intelligence, IJCCI 2021 - Virtual, Online
Duration: 25 Oct 202127 Oct 2021

Publication series

NameICETE International Conference on E-Business and Telecommunication Networks (International Joint Conference on Computational Intelligence)
Volume2021-October
ISSN (Print)2184-2825

Conference

Conference13th International Joint Conference on Computational Intelligence, IJCCI 2021
CityVirtual, Online
Period25/10/2127/10/21

Keywords

  • Generative Adversarial Networks
  • Human Action Recognition
  • Zero-Shot Learning

Fingerprint

Dive into the research topics of 'Zero-Shot Action Recognition with Knowledge Enhanced Generative Adversarial Networks'. Together they form a unique fingerprint.

Cite this