Unofficial PyTorch Implementation of Multi-Singer

Last update: Dec 28, 2022

Related tags

Deep Learning Multi-Singer

Overview

Multi-Singer

Unofficial PyTorch Implementation of Multi-Singer: Fast Multi-Singer Singing Voice Vocoder With A Large-Scale Corpus.

Requirements

See requirements in requirement.txt:

linux
python 3.6
pytorch 1.0+
librosa
json, tqdm, logging

TODO

1026: upload code
1024: implement multi-singer & perceptual loss
1023: implement singer encoder

Getting started

Apply recipe to your own dataset

Put any wav files in data directory
Edit configuration in config/config.yaml

1. Pretrain

Pretrain the Singer Embedding Extractor using repository here, and set the 'enc_model_fpath' in config/config.yaml

Note: Please set params as those in 'encoder/params_data' and 'encoder/params_model'.

2. Preprocess

Extract mel-spectrogram

python preprocess.py -i data/wavs -o data/feature -c config/config.yaml

-i your audio folder

-o output acoustic feature folder

-c config file

3. Train

Training conditioned on mel-spectrogram

python train.py -i data/feature -o checkpoints/ --config config/config.yaml

-i acoustic feature folder

-o directory to save checkpoints

-c config file

4. Inference

python inference.py -i data/feature -o outputs/  -c checkpoints/*.pkl -g config/config.yaml

-i acoustic feature folder

-o directory to save generated speech

-c checkpoints file

-c config file

5. Singing Voice Synthesis

For Singing Voice Synthesis:

Take modified FastSpeech for mel-spectrogram synthesis
Use synthesized mel-spectrogram in Multi-Singer for waveform synthesis.

Acknowledgements

Citation

Please cite this repository by the "Cite this repository" of About section (top right of the main page).

Question

Feel free to contact me at [email protected]

Unofficial PyTorch Implementation of Multi-Singer

Related tags

Overview

Multi-Singer

Requirements

TODO

Getting started

Apply recipe to your own dataset

1. Pretrain

Note: Please set params as those in 'encoder/params_data' and 'encoder/params_model'.

2. Preprocess

3. Train

4. Inference

5. Singing Voice Synthesis

Acknowledgements

Citation

Question

Owner

SunMail-hub

[CVPR 2020] GAN Compression: Efficient Architectures for Interactive Conditional GANs

Official PyTorch implementation of "ArtFlow: Unbiased Image Style Transfer via Reversible Neural Flows"

YOLOPのPythonでのONNX推論サンプル

Code for `BCD Nets: Scalable Variational Approaches for Bayesian Causal Discovery`, Neurips 2021

Torchreid: Deep learning person re-identification in PyTorch.

PyTorch implementation for the paper Visual Representation Learning with Self-Supervised Attention for Low-Label High-Data Regime

Generate text captions for images from their CLIP embeddings. Includes PyTorch model code and example training script.

Stratified Transformer for 3D Point Cloud Segmentation (CVPR 2022)

Code for NeurIPS2021 submission "A Surrogate Objective Framework for Prediction+Programming with Soft Constraints"

An implementation for the loss function proposed in Decoupled Contrastive Loss paper.

This is a collection of simple PyTorch implementations of neural networks and related algorithms. These implementations are documented with explanations,

pixelNeRF: Neural Radiance Fields from One or Few Images

This repository provides an efficient PyTorch-based library for training deep models.

Official Pytorch implementation of the paper "Action-Conditioned 3D Human Motion Synthesis with Transformer VAE", ICCV 2021

A pytorch implementation of the CVPR2021 paper "VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild"

NudeNet: Neural Nets for Nudity Classification, Detection and selective censoring

Yolo Traffic Light Detection With Python

Using Clinical Drug Representations for Improving Mortality and Length of Stay Predictions

PyTorch implementation of Towards Accurate Alignment in Real-time 3D Hand-Mesh Reconstruction (ICCV 2021).

Contour-guided image completion with perceptual grouping (BMVC 2021 publication)