Finding Label and Model Errors in Perception Data With Learned Observation Assertions

Last update: Oct 14, 2022

Related tags

Text Data & NLP loa

Overview

Finding Label and Model Errors in Perception Data With Learned Observation Assertions

This is the project page for Finding Label and Model Errors in Perception Data With Learned Observation Assertions.

Please read the paper for full technical details.

Installation

In the root directory, run

pip install -e .

Examples

We provide an example of the Lyft Level 5 percetion dataset. We have provided model predictions for convenience, but you will need to download the dataset here.

All of the scripts are available in examples/lyft_level5. In order to run the scripts, do the following:

Set the data directories in constants.py.
Learn the priors with learn_priors.py.
Run LOA with prior_lyft.py.

You can visualize the results with viz_track.py.

Citation

If you find this project useful, please cite us at

@article{kang2021finding,
  title={Finding Label and Model Errors in Perception Data With Learned Observation Assertions},
  author={Kang, Daniel and Arechiga, Nikos and Pillai, Sudeep and Bailis, Peter and Zaharia, Matei},
}

and contact us if you deploy LOA!

Finding Label and Model Errors in Perception Data With Learned Observation Assertions

Related tags

Overview

Finding Label and Model Errors in Perception Data With Learned Observation Assertions

Installation

Examples

Citation

Owner

Stanford Future Data Systems

NL. The natural language programming language.

Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form.

A script that automatically creates a branch name using google translation api and jira api

Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further languages

This repository contains examples of Task-Informed Meta-Learning

Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for text-labeling and text-classification, includes Word2Vec, BERT, and GPT2 Language Embedding.

Common Voice Dataset explorer

This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"

Disfl-QA: A Benchmark Dataset for Understanding Disfluencies in Question Answering

A combination of autoregressors and autoencoders using XLNet for sentiment analysis

pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation

This code is the implementation of Text Emotion Recognition (TER) with linguistic features

Implementation of some unbalanced loss like focal_loss, dice_loss, DSC Loss, GHM Loss et.al

TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP

(ACL 2022) The source code for the paper "Towards Abstractive Grounded Summarization of Podcast Transcripts"

ConferencingSpeech2022; Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge

RecipeReduce: Simplified Recipe Processing for Lazy Programmers

🕹 An esoteric language designed so that the program looks like the transcript of a Pokémon battle

BiNE: Bipartite Network Embedding

Code repository of the paper Neural circuit policies enabling auditable autonomy published in Nature Machine Intelligence