Global-Local Attention for Emotion Recognition

Last update: Apr 21, 2022

Requirements

Python 3
Install tensorflow (or tensorflow-gpu) >= 2.0.0
Install the remaining dependencies:

pip install cython
pip install opencv-python==4.3.0.36 matplotlib numpy==1.18.5 dlib

Dataset

We provide the NCAER-S dataset with the original images and the extracted faces (for each image, a .txt file with the 4 bounding box coordinates).

The dataset can be downloaded from Google Drive.

Note that the dataset and labels should be structured as follows:

NCAER-S 
│
└───images
│   │
│   └───class_1
│   │   │   img1.jpg
│   │   │   img2.jpg
│   │   │   ...
│   └───class_2
│       │   img1.jpg
│       │   img2.jpg
│       │   ...
│   
└───crop
│   │
│   └───class_1
│   │   │   img1.txt
│   │   │   img2.txt
│   │   │   ...
│   └───class_2
│       │   img1.txt
│       │   img2.txt
│       │   ...
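
Before training, it can help to confirm that every image has a matching face file. The following is a minimal sketch, not part of the repository: the helper name `find_missing_crops` and the set of accepted image extensions are assumptions, and it only relies on the layout shown above (an `images/` tree mirrored by a `crop/` tree with `.txt` files).

```python
from pathlib import Path

# Assumed set of image extensions; adjust to match the actual dataset.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png"}


def find_missing_crops(dataset_root):
    """Return images (relative to images/) that have no .txt file under crop/."""
    root = Path(dataset_root)
    images_dir = root / "images"
    crop_dir = root / "crop"

    missing = []
    # Images are expected one level down: images/<class_name>/<file>
    for image_path in sorted(images_dir.glob("*/*")):
        if image_path.suffix.lower() not in IMAGE_SUFFIXES:
            continue
        relative = image_path.relative_to(images_dir)
        # images/class_1/img1.jpg should pair with crop/class_1/img1.txt
        expected_crop = (crop_dir / relative).with_suffix(".txt")
        if not expected_crop.is_file():
            missing.append(relative.as_posix())
    return missing
```

Calling `find_missing_crops("data/NCAER-S")` would return an empty list when the two trees line up; the path here is only an example.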

Running

Our code supports these types of execution with argument -m or --mode:

#extract faces from the <train, val or test> dataset (paths specified in config.py)
python run.py -m extract --dataset_type=train

#train the model with the configuration specified in config.py
python run.py -m train

#evaluate the trained model on the dataset <dataset_type>
python run.py -m eval --dataset_type=test --trained_weights=path/to/weights
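
To make the command-line surface easier to see at a glance, here is a rough sketch of how these options could be declared with `argparse`. It is an illustration only and not the actual contents of run.py: the function name `build_parser`, the defaults, and the help strings are assumptions; only the option names and the mode values come from this README.

```python
import argparse


def build_parser():
    """Hypothetical parser mirroring the options described in this README."""
    parser = argparse.ArgumentParser(
        description="Global-Local Attention for Emotion Recognition (sketch)"
    )
    parser.add_argument(
        "-m", "--mode",
        required=True,
        choices=["extract", "train", "eval"],
        help="type of execution",
    )
    parser.add_argument(
        "--dataset_type",
        choices=["train", "val", "test"],
        default="train",  # assumed default
        help="dataset split used by the extract and eval modes",
    )
    parser.add_argument(
        "--trained_weights",
        default=None,
        help="path to the trained weights (eval mode)",
    )
    parser.add_argument(
        "--resume",
        default=None,
        help="path to weights to continue from, or 'last' (train mode)",
    )
    return parser
```

With this sketch, `build_parser().parse_args(["-m", "eval", "--dataset_type=test"])` yields a namespace whose `mode` is `"eval"` and whose `dataset_type` is `"test"`.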

Evaluation

Our trained model is available at weights/glamor-net/Model.

First, download the dataset and extract it into the "data/" directory.
Then specify the paths to the test data (images and crop) in config.py:

config = config.copy({
    'test_images': 'path_to_test_images',
    'test_crop':   'path_to_test_cropped_faces',  # (.txt files)
})

Run the following command to evaluate the model. We use classification accuracy as the evaluation metric.

# Evaluate our model on the test set
python run.py -m eval --dataset_type=test --trained_weights=weights/glamor-net/Model
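
For reference, classification accuracy is the fraction of samples whose predicted class equals the ground-truth class. The snippet below is a small stand-alone sketch of that definition, not the repository's evaluation code; the function name `classification_accuracy` is hypothetical.

```python
def classification_accuracy(true_labels, predicted_labels):
    """Fraction of predictions that match the ground-truth labels."""
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label lists must have the same length")
    if len(true_labels) == 0:
        raise ValueError("cannot compute accuracy on an empty set")
    # Count positions where the predicted class equals the true class.
    correct = sum(1 for t, p in zip(true_labels, predicted_labels) if t == p)
    return correct / len(true_labels)
```

For example, three correct predictions out of four samples give an accuracy of 0.75.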

Training

First, extract the faces from the train set (the val set is optional):

Specify the path to the dataset in config.py (train_images, val_images, test_images)
Specify the desired face-extracted output path in config.py (train_crop, val_crop, test_crop)

config = config.copy({

    'train_images': 'path_to_training_images',
    'train_crop':   'path_to_training_cropped_faces',  # (.txt files)

    'val_images': 'path_to_validation_images',
    'val_crop':   'path_to_validation_cropped_faces',  # (.txt files)

})

Perform face extraction for each dataset type you need by running the command:

python run.py -m extract --dataset_type=<train, val or test>

Start training:

# Train a new model from scratch
python run.py -m train 

# Continue training a model that you had trained earlier
python run.py -m train --resume=path/to/trained_weights

# Resume from the last checkpoint
python run.py -m train --resume=last

Prediction

We support prediction on a single image or on all images in a directory with these commands:

# Predict on single image
python predict.py --trained_weights=weights/glamor-net/Model --input=test_images/1.jpg --output=path/to/out/directory

# Predict on images in directory
python predict.py --trained_weights=weights/glamor-net/Model --input=test_images/ --output=out/
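
Since `--input` accepts either a file or a directory, one way such an argument could be resolved into a list of image paths is sketched below. This is an assumption about the behavior, not the code in predict.py: the helper name `collect_input_images` and the accepted extensions are hypothetical.

```python
from pathlib import Path

# Assumed set of image extensions; adjust as needed.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png"}


def collect_input_images(input_path):
    """Return a sorted list of image paths for a file or directory input."""
    path = Path(input_path)
    if path.is_dir():
        # Directory input: keep only regular files with an image extension.
        return sorted(
            p for p in path.iterdir()
            if p.is_file() and p.suffix.lower() in IMAGE_SUFFIXES
        )
    if path.is_file():
        # Single-image input: wrap it in a list so callers handle one case.
        return [path]
    raise FileNotFoundError(f"input path does not exist: {input_path}")
```

Returning a list in both cases lets the prediction loop stay the same whether one image or a whole directory is given.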

Use the help option to see a description of all available command-line arguments.

Owner

Minh Nhat Le
