A general, feasible, and extensible framework for classification tasks.

Last update: Nov 22, 2022

Overview

Pytorch Classification

A general, feasible and extensible framework for 2D image classification.

Features

Easy to configure (model, hyperparameters)
Training progress monitoring and visualization
Weighted sampling / weighted loss / kappa loss / focal loss for imbalance dataset
Kappa metric for evaluating model on imbalance dataset
Different learning rate schedulers and warmup support
Data augmentation
Multiple GPUs support

Installation

Recommended environment:

python 3.8+
pytorch 1.7.1+
torchvision 0.8.2+
tqdm
munch
packaging
tensorboard

To install the dependencies, run:

$ git clone https://github.com/YijinHuang/pytorch-classification.git
$ cd pytorch-classification
$ pip install -r requirements.txt

How to use

1. Use one of the following two methods to build your dataset:

Folder-form dataset:

Organize your images as follows:

├── your_data_dir
    ├── train
        ├── class1
            ├── image1.jpg
            ├── image2.jpg
            ├── ...
        ├── class2
            ├── image3.jpg
            ├── image4.jpg
            ├── ...
        ├── class3
        ├── ...
    ├── val
    ├── test

Here, val and test directory have the same structure of train. Then replace the value of 'data_path' in BASIC_CONFIG in configs/default.yaml with path to your_data_dir and keep 'data_index' as null.

Dict-form dataset:

Define a dict as follows:

your_data_dict = {
    'train': [
        ('path/to/image1', 0), # use int. to represent the class of images (start from 0)
        ('path/to/image2', 0),
        ('path/to/image3', 1),
        ('path/to/image4', 2),
        ...
    ],
    'test': [
        ('path/to/image5', 0),
        ...
    ],
    'val': [
        ('path/to/image6', 0),
        ...
    ]
}

Then use pickle to save it:

import pickle
pickle.dump(your_data_dict, open('path/to/pickle/file', 'wb'))

Finally, replace the value of 'data_index' in BASIC_CONFIG in configs/default.yaml with 'path/to/pickle/file' and set 'data_path' as null.

2. Update your training configurations and hyperparameters in configs/default.yaml.

3. Run to train:

$ CUDA_VISIBLE_DEVICES=x python main.py

Optional arguments:

-c yaml_file      Specify the config file (default: configs/default.yaml)
-o                Overwrite save_path and log_path without warning
-p                Print configs before training

4. Monitor your training progress in website 127.0.0.1:6006 by running:

$ tensorborad --logdir=/path/to/your/log --port=6006

Tips to use tensorboard on a remote server

A general, feasible, and extensible framework for classification tasks.

Related tags

Overview

Pytorch Classification

Features

Installation

How to use

Owner

Eugene

Predicting Event Memorability from Contextual Visual Semantics

Selene is a Python library and command line interface for training deep neural networks from biological sequence data such as genomes.

Check out the StyleGAN repo and place it in the same directory hierarchy as the present repo

ObjDetApp deploys a pytorch model for object detection

Pytorch implementation of the paper Time-series Generative Adversarial Networks

pix2pix in tensorflow.js

SegNet-Basic with Keras

An unsupervised learning framework for depth and ego-motion estimation from monocular videos

Article Reranking by Memory-enhanced Key Sentence Matching for Detecting Previously Fact-checked Claims.

How the Deep Q-learning method works and discuss the new ideas that makes the algorithm work

piSTAR Lab is a modular platform built to make AI experimentation accessible and fun. (pistar.ai)

U-Net: Convolutional Networks for Biomedical Image Segmentation

CBKH: The Cornell Biomedical Knowledge Hub

Multi-robot collaborative exploration and mapping through Voronoi partition and DRL in unknown environment

Real-time face detection and emotion/gender classification using fer2013/imdb datasets with a keras CNN model and openCV.

NeuroMorph: Unsupervised Shape Interpolation and Correspondence in One Go

Pytorch-Swin-Unet-V2 - a modified version of Swin Unet based on Swin Transfomer V2

The codebase for our paper "Generative Occupancy Fields for 3D Surface-Aware Image Synthesis" (NeurIPS 2021)

Winning Solution in NTIRE19 Challenges on Video Restoration and Enhancement (CVPR19 Workshops) - Video Restoration with Enhanced Deformable Convolutional Networks. EDVR has been merged into BasicSR and this repo is a mirror of BasicSR.

code from "Tensor decomposition of higher-order correlations by nonlinear Hebbian plasticity"

A general, feasible, and extensible framework for classification tasks.

Related tags

Overview

Pytorch Classification

Features

Installation

How to use

Owner

Eugene

Predicting Event Memorability from Contextual Visual Semantics

Selene is a Python library and command line interface for training deep neural networks from biological sequence data such as genomes.

Check out the StyleGAN repo and place it in the same directory hierarchy as the present repo

*ObjDetApp* deploys a pytorch model for object detection

Pytorch implementation of the paper Time-series Generative Adversarial Networks

pix2pix in tensorflow.js

SegNet-Basic with Keras

An unsupervised learning framework for depth and ego-motion estimation from monocular videos

Article Reranking by Memory-enhanced Key Sentence Matching for Detecting Previously Fact-checked Claims.

How the Deep Q-learning method works and discuss the new ideas that makes the algorithm work

piSTAR Lab is a modular platform built to make AI experimentation accessible and fun. (pistar.ai)

U-Net: Convolutional Networks for Biomedical Image Segmentation

CBKH: The Cornell Biomedical Knowledge Hub

Multi-robot collaborative exploration and mapping through Voronoi partition and DRL in unknown environment

Real-time face detection and emotion/gender classification using fer2013/imdb datasets with a keras CNN model and openCV.

NeuroMorph: Unsupervised Shape Interpolation and Correspondence in One Go

Pytorch-Swin-Unet-V2 - a modified version of Swin Unet based on Swin Transfomer V2

The codebase for our paper "Generative Occupancy Fields for 3D Surface-Aware Image Synthesis" (NeurIPS 2021)

Winning Solution in NTIRE19 Challenges on Video Restoration and Enhancement (CVPR19 Workshops) - Video Restoration with Enhanced Deformable Convolutional Networks. EDVR has been merged into BasicSR and this repo is a mirror of BasicSR.

code from "Tensor decomposition of higher-order correlations by nonlinear Hebbian plasticity"

ObjDetApp deploys a pytorch model for object detection