Replication Package for AequeVox: Automated Fairness Testing for Speech Recognition Systems

Last update: Aug 28, 2022

Related tags

Deep Learning AequeVox

Overview

AequeVox

Replication Package for AequeVox: Automated Fairness Testing for Speech Recognition Systems

README under development.

Python Packages Required

numpy
scipy
math
librosa
random
time
json
threading
re
nltk

ASR Specific Packages

Google Cloud

speech
storage

Microsoft Azure

azure.cognitiveservices.speech

IBM Cloud

ibm_watson
ibm_watson.websocket
ibm_cloud_sdk_core.authenticators
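Before running the scripts, it can help to confirm that the packages above are importable. The snippet below is a minimal sketch, not part of the replication package: the helper missing_modules is hypothetical, and the Google Cloud import names (google.cloud.speech, google.cloud.storage) are assumptions based on the package names listed above.

```python
import importlib.util

# Import names are assumptions derived from the package lists in this README.
REQUIRED_MODULES = [
    "numpy",
    "scipy",
    "librosa",
    "nltk",
    "google.cloud.speech",
    "google.cloud.storage",
    "azure.cognitiveservices.speech",
    "ibm_watson",
    "ibm_cloud_sdk_core",
]


def missing_modules(names):
    """Return the module names from `names` that cannot be located."""
    missing = []
    for name in names:
        try:
            # find_spec imports parent packages for dotted names and raises
            # ModuleNotFoundError (an ImportError) when a parent is absent.
            found = importlib.util.find_spec(name) is not None
        except (ImportError, ValueError):
            found = False
        if not found:
            missing.append(name)
    return missing


if __name__ == "__main__":
    for name in missing_modules(REQUIRED_MODULES):
        print(f"missing: {name}")
```

Only the ASR packages for the services you intend to query need to be installed.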

The code is separated into two sections: Generation and Analysis.

Generation:

transGen.py

Lists all transformation types and magnitudes to be used. The lists can be modified as necessary.
Requires the file names of all the original speech files to be specified.

Generates transformed speech files named {Original File Name}{Transformation Type Abbreviation}{Magnitude of Transformation Parameter, theta}.wav

List of Abbreviations.

A - Amplitude
C - Clipping
D - Drop
F - Frame
HP - Highpass
LP - Lowpass
N - Noise
S - Scale
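As an illustration of the naming scheme above, the following sketch builds the output file name for a given transformation. The function transformed_name and the dictionary keys are hypothetical, and the sketch assumes that the original file name is used without its .wav extension; transGen.py may differ.

```python
# Abbreviations from the list above; the long-form keys are illustrative.
ABBREVIATIONS = {
    "Amplitude": "A",
    "Clipping": "C",
    "Drop": "D",
    "Frame": "F",
    "Highpass": "HP",
    "Lowpass": "LP",
    "Noise": "N",
    "Scale": "S",
}


def transformed_name(original, transformation, theta):
    """Build {name}{abbreviation}{theta}.wav for one transformed file."""
    # Assumption: drop a trailing .wav from the original name, if present.
    stem = original[:-4] if original.lower().endswith(".wav") else original
    return f"{stem}{ABBREVIATIONS[transformation]}{theta}.wav"


print(transformed_name("speaker1.wav", "Noise", 10))  # speaker1N10.wav
```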

GCP_Recog.py

Requires Google Cloud client libraries and associated keys.

Takes a group name and the list of all original files in the group to generate transcripts.

MS_Recog.py

Requires Microsoft Azure client libraries and associated key and region.

Takes a group name and the list of all original files in the group to generate transcripts.

IBM_Recog.py

Requires IBM client libraries and associated key and service URL.

Takes a group name and the list of all original files in the group to generate transcripts.

compASR.py

Takes the names of two ASR systems and the group names, and computes a distance metric. It produces text files containing the distance metrics for the specified groups.

Users are requested to use the distance metrics to calculate the D values for each transformation.
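This README does not specify which distance metric compASR.py computes. As one common choice for comparing transcripts, the sketch below computes a word-level edit (Levenshtein) distance between two transcripts. The function word_edit_distance is hypothetical and is not taken from the replication package.

```python
def word_edit_distance(reference, hypothesis):
    """Word-level Levenshtein distance between two transcripts."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    # prev[j] holds the distance between the processed prefix of ref
    # and the first j words of hyp.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, 1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, 1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


print(word_edit_distance("the cat sat", "the bat sat"))  # 1
```

Under this assumption, a larger distance between the transcript of an original file and the transcript of its transformed version indicates a larger change in the ASR output.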

Owner

Sai Sathiesh