Implementation of "StrengthNet: Deep Learning-based Emotion Strength Assessment for Emotional Speech Synthesis"

Last update: Dec 20, 2022

Related tags

Deep Learning StrengthNet

Overview

StrengthNet

Implementation of "StrengthNet: Deep Learning-based Emotion Strength Assessment for Emotional Speech Synthesis"

https://arxiv.org/abs/2110.03156

Dependency

Ubuntu 18.04.5 LTS

GPU: Quadro RTX 6000
Driver version: 450.80.02
CUDA version: 11.0

Python 3.5

tensorflow-gpu 2.0.0b1 (cudnn=7.6.0)
scipy
pandas
matplotlib
librosa

Environment set-up

For example,

conda create -n strengthnet python=3.5
conda activate strengthnet
pip install -r requirements.txt
conda install cudnn=7.6.0

Usage

Run python utils.py to extract .wav to .h5;
Run python train.py to train a CNN-BLSTM based StrengthNet;

Evaluating new samples

Put the waveforms you wish to evaluate in a folder. For example, / /
Run python test.py --rootdir / /

This script will evaluate all the .wav files in / /, and write the results to / / /StrengthNet_result_raw.txt.

By default, the output/strengthnet.h5 pretrained model is used.

Citation

If you find this work useful in your research, please consider citing:

@misc{liu2021strengthnet,
      title={StrengthNet: Deep Learning-based Emotion Strength Assessment for Emotional Speech Synthesis}, 
      author={Rui Liu and Berrak Sisman and Haizhou Li},
      year={2021},
      eprint={2110.03156},
      archivePrefix={arXiv},
      primaryClass={cs.SD}
}

Resources

The ESD corpus is released by the HLT lab, NUS, Singapore.

The strength scores for the English samples of the ESD corpus are available here.

Acknowledgements:

MOSNet: https://github.com/lochenchou/MOSNet

Relative Attributes: Relative Attributes

License

This work is released under MIT License (see LICENSE file for details).

Implementation of "StrengthNet: Deep Learning-based Emotion Strength Assessment for Emotional Speech Synthesis"

Related tags

Overview

StrengthNet

Dependency

Environment set-up

Usage

Evaluating new samples

Citation

Resources

Acknowledgements:

License

Owner

RuiLiu

Repository of the paper Compressing Sensor Data for Remote Assistance of Autonomous Vehicles using Deep Generative Models at ML4AD @ NeurIPS 2021.

An official implementation of "Exploiting a Joint Embedding Space for Generalized Zero-Shot Semantic Segmentation" (ICCV 2021) in PyTorch.

Depth image based mouse cursor visual haptic

A Python training and inference implementation of Yolov5 helmet detection in Jetson Xavier nx and Jetson nano

A Deep Learning based project for creating line art portraits.

Multi-Glimpse Network With Python

Improving Calibration for Long-Tailed Recognition (CVPR2021)

Simple STAC Catalogs discovery tool.

Videocaptioning.pytorch - A simple implementation of video captioning

Code for the ECIR'22 paper "Evaluating the Robustness of Retrieval Pipelines with Query Variation Generators"

[ICCV 2021] Counterfactual Attention Learning for Fine-Grained Visual Categorization and Re-identification

Implementation of the Point Transformer layer, in Pytorch

Weighted K Nearest Neighbors (kNN) algorithm implemented on python from scratch.

A Moonraker plug-in for real-time compensation of frame thermal expansion

Allele-specific pipeline for unbiased read mapping(WIP), QTL discovery(WIP), and allelic-imbalance analysis

NuPIC Studio is an all-in-one tool that allows users create a HTM neural network from scratch

Graduation Project

[CVPR'2020] DeepDeform: Learning Non-rigid RGB-D Reconstruction with Semi-supervised Data

Based on the paper "Geometry-aware Instance-reweighted Adversarial Training" ICLR 2021 oral

An official implementation of MobileStyleGAN in PyTorch

Implementation of "StrengthNet: Deep Learning-based Emotion Strength Assessment for Emotional Speech Synthesis"

Related tags

Overview

StrengthNet

Dependency

Environment set-up

Usage

Evaluating new samples

Citation

Resources

Acknowledgements:

License

Owner

RuiLiu

Repository of the paper Compressing Sensor Data for Remote Assistance of Autonomous Vehicles using Deep Generative Models at ML4AD @ NeurIPS 2021.

An official implementation of "Exploiting a Joint Embedding Space for Generalized Zero-Shot Semantic Segmentation" (ICCV 2021) in PyTorch.

Depth image based mouse cursor visual haptic

A Python training and inference implementation of Yolov5 helmet detection in Jetson Xavier nx and Jetson nano

A Deep Learning based project for creating line art portraits.

Multi-Glimpse Network With Python

Improving Calibration for Long-Tailed Recognition (CVPR2021)

Simple STAC Catalogs discovery tool.

Videocaptioning.pytorch - A simple implementation of video captioning

Code for the ECIR'22 paper "Evaluating the Robustness of Retrieval Pipelines with Query Variation Generators"

[ICCV 2021] Counterfactual Attention Learning for Fine-Grained Visual Categorization and Re-identification

Implementation of the Point Transformer layer, in Pytorch

Weighted K Nearest Neighbors (kNN) algorithm implemented on python from scratch.

A Moonraker plug-in for real-time compensation of frame thermal expansion

Allele-specific pipeline for unbiased read mapping(WIP), QTL discovery(WIP), and allelic-imbalance analysis

NuPIC Studio is an all­-in-­one tool that allows users create a HTM neural network from scratch

Graduation Project

[CVPR'2020] DeepDeform: Learning Non-rigid RGB-D Reconstruction with Semi-supervised Data

Based on the paper "Geometry-aware Instance-reweighted Adversarial Training" ICLR 2021 oral

An official implementation of MobileStyleGAN in PyTorch

NuPIC Studio is an all-in-one tool that allows users create a HTM neural network from scratch