Learning Representational Invariances for Data-Efficient Action Recognition

Last update: Nov 22, 2022

Overview

Learning Representational Invariances for Data-Efficient Action Recognition

Official PyTorch implementation for Learning Representational Invariances for Data-Efficient Action Recognition. We follow the code structure of MMAction2.

See the project page for more details.

Installation

We use PyTorch-1.6.0 with CUDA-10.2 and Torchvision-0.7.0.

Please refer to install.md for installation.

Data Preparation

First, please download human detection results and put them in the corresponding folder under data: UCF-101, HMDB-51, Kinetics-100.

Second, please refer to data_preparation.md to prepare raw frames of UCF-101 and HMDB-51. (Instructions of extracting frames from Kinetics-100 will be available soon.)

(Optional) You can download the pre-extracted ImageNet scores: UCF-101, HMDB-51.

Training

We use 8 RTX2080 Ti GPUs to run our experiments. You would need to adjust your training schedule accordingly if you have less GPUs. Please refer to here.

Supervised learning

PORT=${PORT:-29500}

python -m torch.distributed.launch \
--nproc_per_node=8 \
--master_port=$PORT \
tools/train.py \
$CONFIG \
--launcher pytorch ${@:3} \
--validate

You need to replace $CONFIG with the actual config file:

For supervised baseline, please use config files in configs/recognition/r2plus1d.
For strongly-augmented supervised learning, please use config files in configs/supervised_aug.

Semi-supervised learning

PORT=${PORT:-29500}

python -m torch.distributed.launch \
--nproc_per_node=8 \
--master_port=$PORT \
tools/train_semi.py \
$CONFIG \
--launcher pytorch ${@:3} \
--validate

You need to replace $CONFIG with the actual config file:

For single dataset semi-supervised learning, please use config files in configs/semi.
For cross-dataset semi-supervised learning, please use config files in configs/semi_both.

Testing

# Multi-GPU testing
./tools/dist_test.sh $CONFIG ${path_to_your_ckpt} ${num_of_gpus} --eval top_k_accuracy

# Single-GPU testing
python tools/test.py $CONFIG ${path_to_your_ckpt} --eval top_k_accuracy

NOTE: Do not use multi-GPU testing if you are currently using multi-GPU training.

Other details

Please see getting_started.md for the basic usage of MMAction2.

Acknowledgement

Codes are built upon MMAction2.

Learning Representational Invariances for Data-Efficient Action Recognition

Related tags

Overview

Learning Representational Invariances for Data-Efficient Action Recognition

Installation

Data Preparation

Training

Supervised learning

Semi-supervised learning

Testing

Other details

Acknowledgement

Owner

Virginia Tech Vision and Learning Lab

Motion planning environment for Sampling-based Planners

Unofficial PyTorch implementation of Attention Free Transformer (AFT) layers by Apple Inc.

Fbone (Flask bone) is a Flask (Python microframework) starter/template/bootstrap/boilerplate application.

Train neural network for semantic segmentation (deep lab V3) with pytorch in less then 50 lines of code

LUKE -- Language Understanding with Knowledge-based Embeddings

Resources complimenting the Machine Learning Course led in the Faculty of mathematics and informatics part of Sofia University.

tf2onnx - Convert TensorFlow, Keras and Tflite models to ONNX.

Beginner-friendly repository for Hacktober Fest 2021. Start your contribution to open source through baby steps. 💜

Code for "Graph-Evolving Meta-Learning for Low-Resource Medical Dialogue Generation". [AAAI 2021]

Fast Scattering Transform with CuPy/PyTorch

Complete the code of prefix-tuning in low data setting

A project that uses optical flow and machine learning to detect aimhacking in video clips.

Hso-groupie - A pwnable challenge in Real World CTF 4th

AtlasNet: A Papier-Mâché Approach to Learning 3D Surface Generation

Using VideoBERT to tackle video prediction

Winning solution of the Indoor Location & Navigation Kaggle competition

A simple configurable bot for sending arXiv article alert by mail

Unofficial PyTorch code for BasicVSR

Unity Propagation in Bayesian Networks Handling Inconsistency via Unity Smoothing

ParmeSan: Sanitizer-guided Greybox Fuzzing