Speech Rankings

This project mimics CSRankings to generate an ordered list of researchers in speech/spoken language processing along with their possible research topics, based on recent publications on important venues of the field, so as to help students seeking for PhD studies to find desirable advisors.

How to use

The pre-generated report is available at here. To build it by yourself,

Run prepare_data.py to build publications.json and authors.json, or simply use the data provided, covering those from 2011 to 2021.
Run export.py to generate the report.

How does it work

We scrape author metadata and publication data of the following three types of venues from DBLP, including:

Speech venues: Interspeech, Speech Communications, SLT, SSW, ASRU, IWSLT
Mixed venues: ICASSP, TASLP
General venues: NeurIPS, ICML, ICLR, ACL, EMNLP, NAACL, KDD, AAAI, IJCAI

All publications in Speech venues are included. Paricularly for Interspeech, section/field of each paper are collected from ISCA Archive to show possible research topics of each researcher. So are the keywords from IEEE Xplore for papers published on IEEE-held venues. Keywords (as well as titles) are also used to filter out non-speech papers in Mixed venues by a set of rules. Titles are used to identify speech papers in General venues. Researchers are sorted by the total number of publications.

The collected data contain errors, and the project is neither intended to index speech-related papers nor to compare researchers in the field.

A CSRankings-like index for speech researchers

Related tags

Overview

Speech Rankings

How to use

How does it work

Owner

Mutian He

End-to-end image captioning with EfficientNet-b3 + LSTM with Attention

PRAnCER is a web platform that enables the rapid annotation of medical terms within clinical notes.

내부 작업용 django + vue(vuetify) boilerplate. 짠 하면 돌아감.

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

LCG T-TEST USING EUCLIDEAN METHOD

Index different CKAN entities in Solr, not just datasets

MILES is a multilingual text simplifier inspired by LSBert - A BERT-based lexical simplification approach proposed in 2018. Unlike LSBert, MILES uses the bert-base-multilingual-uncased model, as well as simple language-agnostic approaches to complex word identification (CWI) and candidate ranking.

Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.

A programming language with logic of Python, and syntax of all languages.

Awesome-NLP-Research (ANLP)

📔️ Generate a text-based journal from a template file.

Segmenter - Transformer for Semantic Segmentation

Spert NLP Relation Extraction API deployed with torchserve for inference

Use AutoModelForSeq2SeqLM in Huggingface Transformers to train COMET

This project aims to conduct a text information retrieval and text mining on medical research publication regarding Covid19 - treatments and vaccinations.

Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.

天池中药说明书实体识别挑战冠军方案；中文命名实体识别；NER; BERT-CRF & BERT-SPAN & BERT-MRC；Pytorch

profile tools for pytorch nn models

AIDynamicTextReader - A simple dynamic text reader based on Artificial intelligence

[EMNLP 2021] Mirror-BERT: Converting Pretrained Language Models to universal text encoders without labels.