DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generative Transformers

Last update: Dec 15, 2022

Overview

DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generative Transformers

Authors: Jaemin Cho, Abhay Zala, and Mohit Bansal (UNC Chapel Hill)
Paper

Visual Reasoning

Please see ./paintskills for our DETR-based visual reasoning skill evaluation.

Reference

Please cite our paper if you use our dataset in your works:

@article{Cho2022DallEval,
  title         = {DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generative Transformers},
  author        = {Jaemin Cho and Abhay Zala and Mohit Bansal},
  year          = {2022},
  archivePrefix = {arXiv},
  primaryClass  = {cs.CV},
  eprint        = {2202.04053}
}

Owner

Jaemin Cho

GitHub Repository https://arxiv.org/abs/2202.04053

A unet implementation for Image semantic segmentation

Unet-pytorch a unet implementation for Image semantic segmentation 参考网上的Unet做分割的代码，做了一个针对kaggle地盐识别的，请去以下地址获取数据集: https://www.kaggle.com/c/tgs-salt-id

3 Jun 29, 2022

A High-Performance Distributed Library for Large-Scale Bundle Adjustment

MegBA: A High-Performance and Distributed Library for Large-Scale Bundle Adjustment This repo contains an official implementation of MegBA. MegBA is a

336 Dec 27, 2022

VisualGPT: Data-efficient Adaptation of Pretrained Language Models for Image Captioning

VisualGPT Our Paper VisualGPT: Data-efficient Adaptation of Pretrained Language Models for Image Captioning Main Architecture of Our VisualGPT Downloa

140 Dec 28, 2022

Interpretable-contrastive-word-mover-s-embedding

Interpretable-contrastive-word-mover-s-embedding Paper Datasets Here is a Dropbox link to the datasets used in the paper: https://www.dropbox.com/sh/n

0 Nov 02, 2021

Learning High-Speed Flight in the Wild

Learning High-Speed Flight in the Wild This repo contains the code associated to the paper Learning Agile Flight in the Wild. For more information, pl

391 Dec 29, 2022

SHRIMP: Sparser Random Feature Models via Iterative Magnitude Pruning

SHRIMP: Sparser Random Feature Models via Iterative Magnitude Pruning This repository is the official implementation of "SHRIMP: Sparser Random Featur

0 Dec 16, 2021

DiscoBox: Weakly Supervised Instance Segmentation and Semantic Correspondence from Box Supervision

The Official PyTorch Implementation of DiscoBox: Weakly Supervised Instance Segmentation and Semantic Correspondence from Box Supervision

3 Oct 15, 2021

MAGMA - a GPT-style multimodal model that can understand any combination of images and language

MAGMA -- Multimodal Augmentation of Generative Models through Adapter-based Finetuning Authors repo (alphabetical) Constantin (CoEich), Mayukh (Mayukh

331 Jan 03, 2023

Implement of homography net by pytorch

HomographyNet Implement of homography net by pytorch Brief Introduction This project is based on the work Homography-Net: @article{detone2016deep, t

4 May 19, 2022

EfficientNetV2 implementation using PyTorch

EfficientNetV2-S implementation using PyTorch Train Steps Configure imagenet path by changing data_dir in train.py python main.py --benchmark for mode

86 Dec 29, 2022

Pseudo-Visual Speech Denoising

Pseudo-Visual Speech Denoising This code is for our paper titled: Visual Speech Enhancement Without A Real Visual Stream published at WACV 2021. Autho

94 Oct 22, 2022

PyTorchVideo is a deeplearning library with a focus on video understanding work

PyTorchVideo is a deeplearning library with a focus on video understanding work. PytorchVideo provides resusable, modular and efficient components needed to accelerate the video understanding researc

2.7k Jan 07, 2023

Build Graph Nets in Tensorflow

Graph Nets library Graph Nets is DeepMind's library for building graph networks in Tensorflow and Sonnet. Contact 5.2k Jan 05, 2023

Toolkit for collecting and applying prompts

PromptSource Promptsource is a toolkit for collecting and applying prompts to NLP datasets. Promptsource uses a simple templating language to programa

998 Jan 03, 2023

Repository relating to the CVPR21 paper TimeLens: Event-based Video Frame Interpolation

TimeLens: Event-based Video Frame Interpolation This repository is about the High Speed Event and RGB (HS-ERGB) dataset, used in the 2021 CVPR paper T

544 Dec 19, 2022

PyTorch experiments with the Zalando fashion-mnist dataset

zalando-pytorch PyTorch experiments with the Zalando fashion-mnist dataset Project Organization ├── LICENSE ├── Makefile - Makefile with co

31 Sep 25, 2021

NeurIPS 2021 Datasets and Benchmarks Track

82 Dec 11, 2022

Prefix-Tuning: Optimizing Continuous Prompts for Generation

Prefix Tuning Files: . ├── gpt2 # Code for GPT2 style autoregressive LM │ ├── train_e2e.py # high-level script

530 Jan 04, 2023

MixRNet(Using mixup as regularization and tuning hyper-parameters for ResNets)

MixRNet(Using mixup as regularization and tuning hyper-parameters for ResNets) Using mixup data augmentation as reguliraztion and tuning the hyper par

2 Jan 16, 2022

Implementation of "StrengthNet: Deep Learning-based Emotion Strength Assessment for Emotional Speech Synthesis"

StrengthNet Implementation of "StrengthNet: Deep Learning-based Emotion Strength Assessment for Emotional Speech Synthesis" https://arxiv.org/abs/2110

65 Dec 20, 2022

DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generative Transformers

Related tags

Overview

DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generative Transformers

Visual Reasoning

Reference

Owner

Jaemin Cho

A unet implementation for Image semantic segmentation

A High-Performance Distributed Library for Large-Scale Bundle Adjustment

VisualGPT: Data-efficient Adaptation of Pretrained Language Models for Image Captioning

Interpretable-contrastive-word-mover-s-embedding

Learning High-Speed Flight in the Wild

SHRIMP: Sparser Random Feature Models via Iterative Magnitude Pruning

DiscoBox: Weakly Supervised Instance Segmentation and Semantic Correspondence from Box Supervision

MAGMA - a GPT-style multimodal model that can understand any combination of images and language

Implement of homography net by pytorch

EfficientNetV2 implementation using PyTorch

Pseudo-Visual Speech Denoising

PyTorchVideo is a deeplearning library with a focus on video understanding work

Build Graph Nets in Tensorflow

Toolkit for collecting and applying prompts

Repository relating to the CVPR21 paper TimeLens: Event-based Video Frame Interpolation

PyTorch experiments with the Zalando fashion-mnist dataset

NeurIPS 2021 Datasets and Benchmarks Track

Prefix-Tuning: Optimizing Continuous Prompts for Generation

MixRNet(Using mixup as regularization and tuning hyper-parameters for ResNets)

Implementation of "StrengthNet: Deep Learning-based Emotion Strength Assessment for Emotional Speech Synthesis"