The Curious Layperson: Fine-Grained Image Recognition without Expert Labels (BMVC 2021)

Last update: Dec 27, 2022

Related tags

Deep Learning bmvc2021

Overview

The Curious Layperson: Fine-Grained Image Recognition without Expert Labels

Subhabrata Choudhury, Iro Laina, Christian Rupprecht, Andrea Vedaldi

Code will be released soon.

Abstract:

Most of us are not experts in specific fields, such as ornithology. Nonetheless, we do have general image and language understanding capabilities that we use to match what we see to expert resources. This allows us to expand our knowledge and perform novel tasks without ad-hoc external supervision. On the contrary, machines have a much harder time consulting expert-curated knowledge bases unless trained specifically with that knowledge in mind. Thus, in this paper we consider a new problem: fine-grained image recognition without expert annotations, which we address by leveraging the vast knowledge available in web encyclopedias. First, we learn a model to describe the visual appearance of objects using non-expert image descriptions. We then train a fine-grained textual similarity model that matches image descriptions with documents on a sentence-level basis. We evaluate the method on two datasets and compare with several strong baselines and the state of the art in cross-modal retrieval.
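Since the code is not yet released, the following is only a toy sketch of the sentence-level matching idea from the abstract, not the authors' implementation. It assumes a bag-of-words cosine similarity in place of the learned textual similarity model, and a plain-text description in place of the output of the image description model. All function names (`sentences`, `bow`, `cosine`, `document_score`, `best_match`) and the scoring rule (average of each description sentence's best match in the document) are hypothetical choices made for illustration.

```python
import math
import re
from collections import Counter


def sentences(text):
    """Split text into sentences on '.', '!' or '?' (a deliberately naive splitter)."""
    return [s.strip() for s in re.split(r"[.!?]+", text) if s.strip()]


def bow(sentence):
    """Lowercased bag-of-words counts; an assumed stand-in for a learned sentence encoder."""
    return Counter(re.findall(r"[a-z]+", sentence.lower()))


def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[w] * b[w] for w in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def document_score(description, document):
    """Average, over description sentences, of the best-matching document sentence."""
    desc_vecs = [bow(s) for s in sentences(description)]
    doc_vecs = [bow(s) for s in sentences(document)]
    if not desc_vecs or not doc_vecs:
        return 0.0
    best = [max(cosine(d, v) for v in doc_vecs) for d in desc_vecs]
    return sum(best) / len(best)


def best_match(description, documents):
    """Return the key of the document that best matches the description."""
    return max(documents, key=lambda name: document_score(description, documents[name]))


if __name__ == "__main__":
    # Made-up encyclopedia snippets and a made-up layperson description.
    documents = {
        "cardinal": "The northern cardinal is a songbird. "
                    "The male has bright red plumage and a black face mask.",
        "blue_jay": "The blue jay has blue plumage with a white chest. "
                    "It has a pronounced crest on its head.",
    }
    description = "This bird has bright red plumage. It has a black mask around the face."
    print(best_match(description, documents))
```

Scoring at the sentence level means a long encyclopedia article is not penalized for containing many sentences unrelated to appearance; only its best-matching sentences contribute to the score.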

Citation

@inproceedings{choudhury2021curious,
author = {Choudhury, Subhabrata and Laina, Iro and Rupprecht, Christian and Vedaldi, Andrea},
booktitle = {British Machine Vision Conference},
title = {The Curious Layperson: Fine-Grained Image Recognition without Expert Labels},
volume = {32},
year = {2021}
}

Owner

Subhabrata Choudhury

Code for our ICASSP 2021 paper: SA-Net: Shuffle Attention for Deep Convolutional Neural Networks

Back to the Feature: Learning Robust Camera Localization from Pixels to Pose (CVPR 2021)

Professional Object Tracking System for Live MOBESE (City Surveillance Camera) Footage in Turkey

Fine-Tune EleutherAI GPT-Neo to Generate Netflix Movie Descriptions in Only 47 Lines of Code Using Hugging Face and DeepSpeed

This repository is based on Ultralytics/yolov5, with adjustments to enable rotated prediction boxes.

With this package, you can generate mixed-integer linear programming (MIP) models of trained artificial neural networks (ANNs) using the rectified linear unit (ReLU) activation function

A rule learning algorithm for the deduction of syndrome definitions from time series data.

Code needed to reproduce the examples found in "The Temporal Robustness of Stochastic Signals"

PyTorch modules for parallel models with the same architecture. Ideal for multi-agent-based systems

sequitur is a library that lets you create and train an autoencoder for sequential data in just two lines of code

Materials for upcoming beginner-friendly PyTorch course (work in progress).

PyTorch code for the paper "FIERY: Future Instance Segmentation in Bird's-Eye view from Surround Monocular Cameras"

This repository contains the code needed to train Mega-NeRF models and generate the sparse voxel octrees

Implementation of hyperparameter optimization/tuning methods for machine learning & deep learning models

This repository contains small projects related to Neural Networks and Deep Learning in general.

Resources related to our paper "CLIN-X: pre-trained language models and a study on cross-task transfer for concept extraction in the clinical domain"

Julia and MATLAB code to simulate all problems in El-Hachem, McCue and Simpson (2021)

Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.

Code for the paper "A Study of Face Obfuscation in ImageNet"

A Comprehensive Empirical Study of Vision-Language Pre-trained Model for Supervised Cross-Modal Retrieval