Recognition of 38 speech commands in russian. Based on Yandex Cup 2021 ML Challenge: ASR

Last update: May 05, 2022

Overview

Speech_38_ru_commands

Recognition of 38 speech commands in russian. Based on Yandex Cup 2021 ML Challenge: ASR

Программа умеет распознавать 38 ключевых слов на русском языке , произнесенных в микрофон из списка:

дальше, вперед, назад, вверх, вниз, выше, ниже, домой, громче, тише, лайк, дизлайк, следующий, предыдущий, сначала, перемотай, выключи, стоп, хватит, замолчи, заткнись, останови, пауза, включи, смотреть, продолжи, играй, запусти, ноль, один, два, три, четыре, пять, шесть, семь, восемь, девять.

Используемая модель была подготовлена для соревнования Yandex Cup 2021 ML Challenge: ASR. Получило 3 место из 54 участников. с показателем точности 92.01

Скачать модель по ссылке https://disk.yandex.ru/d/L053qF-0OPKlog

Пример запуска программы:

python speech_38_ru_commands.py --porog 1.2

где , число 1.2 - это порог уверенности в команде. Можно задавать в диапазоне 0.0 - 7.9999

Recognition of 38 speech commands in russian. Based on Yandex Cup 2021 ML Challenge: ASR

Related tags

Overview

Speech_38_ru_commands

Owner

Andrey

NLP-based analysis of poor Chinese movie reviews on Douban

Collection of useful (to me) python scripts for interacting with napari

Korean extractive summarization. 2021 AI 텍스트 요약 온라인 해커톤 화성갈끄니까팀 코드

Torchrecipes provides a set of reproduci-able, re-usable, ready-to-run RECIPES for training different types of models, across multiple domains, on PyTorch Lightning.

Simple bots or Simbots is a library designed to create simple bots using the power of python. This library utilises Intent, Entity, Relation and Context model to create bots .

A practical and feature-rich paraphrasing framework to augment human intents in text form to build robust NLU models for conversational engines. Created by Prithiviraj Damodaran. Open to pull requests and other forms of collaboration.

Text-to-Speech for Belarusian language

Words-per-minute - A terminal app written in python utilizing the curses module that tests the user's ability to type

In this workshop we will be exploring NLP state of the art transformers, with SOTA models like T5 and BERT, then build a model using HugginFace transformers framework.

auto_code_complete is a auto word-completetion program which allows you to customize it on your need

Negative sampling for solving the unlabeled entity problem in NER. ICLR-2021 paper: Empirical Analysis of Unlabeled Entity Problem in Named Entity Recognition.

Python utility library for compositing PDF documents with reportlab.

A library for Multilingual Unsupervised or Supervised word Embeddings

Stuff related to Ben Eater's 8bit breadboard computer

An extensive UI tool built using new data scraped from BBC News

ACL'22: Structured Pruning Learns Compact and Accurate Models

This repository contains the code for "Generating Datasets with Pretrained Language Models".

SpeechBrain is an open-source and all-in-one speech toolkit based on PyTorch.

A toolkit for document-level event extraction, containing some SOTA model implementations

Reformer, the efficient Transformer, in Pytorch