Official PyTorch implementation of Online Continual Learning on Class Incremental Blurry Task Configuration with Anytime Inference (ICLR 2022)

Last update: Oct 26, 2022

Overview

The Official Implementation of CLIB (Continual Learning for i-Blurry)

Online Continual Learning on Class Incremental Blurry Task Configuration with Anytime Inference
Hyunseo Koh^*, Dahyun Kim^*, Jung-Woo Ha, Jonghyun Choi
ICLR 2022 [Paper]
(* indicates equal contribution)

Overview

Abstract

Despite rapid advances in continual learning, a large body of research is devoted to improving performance in the existing setups. While a handful of works do propose new continual learning setups, they still lack practicality in certain aspects. For better practicality, we first propose a novel continual learning setup that is online, task-free, class-incremental, with blurry task boundaries and subject to inference queries at any moment. We additionally propose a new metric to better measure the performance of continual learning methods subject to inference queries at any moment. To address the challenging setup and evaluation protocol, we propose an effective method that employs a new memory management scheme and novel learning techniques. Our empirical validation demonstrates that the proposed method outperforms prior art by large margins.

Results

Results of CL methods on various datasets for online continual learning on the i-Blurry-50-10 split, measured by the $A_\text{AUC}$ metric. For more details, please refer to our paper.

Methods	CIFAR10	CIFAR100	TinyImageNet	ImageNet
EWC++	57.34±2.10	35.35±1.96	22.26±1.15	24.81
BiC	58.38±0.54	33.51±3.04	22.80±0.94	27.41
ER-MIR	57.28±2.43	35.35±1.41	22.10±1.14	20.48
GDumb	53.20±1.93	32.84±0.45	18.17±0.19	14.41
RM	23.00±1.43	8.63±0.19	5.74±0.30	6.22
Baseline-ER	57.46±2.25	35.61±2.08	22.45±1.15	25.16
CLIB	70.26±1.28	46.67±0.79	23.87±0.68	28.16
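
The $A_\text{AUC}$ values above summarize accuracy over the whole stream rather than only at its end. As a rough illustration, assuming accuracy is measured at evenly spaced evaluation points and the area under the accuracy-versus-samples curve is normalized by the stream length, the metric reduces to the mean of the measured accuracies. The function name `a_auc` and the sample values are hypothetical and not taken from the repository; the paper gives the exact definition.

```python
from typing import Sequence


def a_auc(accuracies: Sequence[float], eval_period: int) -> float:
    """Normalized area under the accuracy-vs-samples curve (illustrative sketch)."""
    if eval_period <= 0:
        raise ValueError("eval_period must be positive")
    if not accuracies:
        raise ValueError("need at least one evaluation result")
    # Rectangle rule: each evaluation result covers eval_period samples.
    area = sum(acc * eval_period for acc in accuracies)
    # Normalize by the number of samples covered by the evaluations.
    total_samples = eval_period * len(accuracies)
    return area / total_samples


# Hypothetical accuracies (%) measured every 100 samples.
print(a_auc([20.0, 40.0, 60.0], eval_period=100))  # 40.0
```

A method that learns quickly early in the stream scores higher under this measure than one that reaches the same final accuracy late.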

Getting Started

To set up the environment for running the code, you can either use the docker container, or manually install the requirements in a virtual environment.

Using Docker Container (Recommended)

We provide the Docker image khs8157/iblurry on Docker Hub for reproducing the results. To download the docker image, run the following command:

docker pull khs8157/iblurry:latest

After pulling the image, you may run the container with the following command:

docker run --gpus all -it --shm-size=64gb -v /PATH/TO/CODE:/PATH/TO/CODE --name=CONTAINER_NAME khs8157/iblurry:latest bash

Replace the placeholder arguments (/PATH/TO/CODE and CONTAINER_NAME) with your own values.
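
If you launch containers from a script, the command above can be assembled programmatically. The following is only a sketch: the helper `docker_run_command` and the example path and container name are hypothetical, and the flags are copied from the command shown above.

```python
import shlex
from typing import List


def docker_run_command(code_path: str, container_name: str,
                       shm_size: str = "64gb") -> List[str]:
    """Build the `docker run` argument list for the khs8157/iblurry image."""
    return [
        "docker", "run", "--gpus", "all", "-it",
        f"--shm-size={shm_size}",
        # Mount the code directory at the same path inside the container.
        "-v", f"{code_path}:{code_path}",
        f"--name={container_name}",
        "khs8157/iblurry:latest", "bash",
    ]


# Hypothetical values; substitute your own path and container name.
cmd = docker_run_command("/home/user/i-Blurry", "clib_dev")
print(shlex.join(cmd))
```

The resulting list can be passed to `subprocess.run(cmd)` on a machine where Docker is installed.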

Requirements

Python3
PyTorch (>=1.9)
torchvision (>=0.10)
numpy
pillow~=6.2.1
torch_optimizer
randaugment
easydict
pandas~=1.1.3

If you are not using the Docker container, install the requirements with the following command:

pip install -r requirements.txt
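
After installation, a quick check of what is actually installed can save a failed run later. The sketch below uses only the standard library; the distribution names in `REQUIRED` are assumed from the Requirements list above and may differ from the names used in requirements.txt, and `installed_versions` is a hypothetical helper.

```python
from importlib import metadata
from typing import Dict, Iterable, Optional

# Distribution names assumed from the Requirements list (PyTorch is "torch").
REQUIRED = ("torch", "torchvision", "numpy", "pillow",
            "torch_optimizer", "randaugment", "easydict", "pandas")


def installed_versions(packages: Iterable[str] = REQUIRED) -> Dict[str, Optional[str]]:
    """Map each package name to its installed version, or None if missing."""
    versions: Dict[str, Optional[str]] = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


for name, version in installed_versions().items():
    print(f"{name}: {version or 'NOT INSTALLED'}")
```

The check only reports versions; it does not verify the version constraints listed above.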

Running Experiments

Downloading the Datasets

CIFAR10, CIFAR100, and TinyImageNet can be downloaded by running the corresponding scripts in the dataset/ directory. The ImageNet dataset can be downloaded from Kaggle.

Experiments Using Shell Script

Experiments for the implemented methods can be run by executing the shell scripts provided in the scripts/ directory. For example, you may run CL experiments with the CLIB method by running:

bash scripts/clib.sh

You may change the following arguments for different experiments:

NOTE: Short description of the experiment. Results and logs are saved to results/DATASET/NOTE.
- WARNING: Logs and results with the same dataset and note will be overwritten!
MODE: CL method to apply. Methods implemented in this version: [clib, er, ewc++, bic, mir, gdumb, rm]
DATASET: Dataset to use in the experiment. Supported datasets: [cifar10, cifar100, tinyimagenet, imagenet]
N_TASKS: Number of tasks. Note that the corresponding JSON file should exist in the collections/ directory.
N: Percentage of disjoint classes in the i-Blurry split. N=100 is fully disjoint, N=0 is fully blurry. Note that the corresponding JSON file should exist in the collections/ directory.
M: Blurry ratio of the blurry classes in the i-Blurry split. Note that the corresponding JSON file should exist in the collections/ directory.
GPU_TRANSFORM: Perform AutoAug on the GPU for faster execution.
USE_AMP: Use automatic mixed precision (AMP) for faster execution and lower memory usage.
MEM_SIZE: Maximum number of samples in the episodic memory.
ONLINE_ITER: Number of model updates per sample.
EVAL_PERIOD: Period of evaluation queries, used for calculating $A_\text{AUC}$.
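
To make the roles of N_TASKS and N concrete, here is a minimal class-level sketch of such a split. The repository reads its splits from the JSON files in collections/, so the function `partition_classes`, its rounding rule, and its seed handling are illustrative assumptions rather than the repository's procedure. The sample-level blurring controlled by M is not modeled.

```python
import random
from typing import Dict, List, Tuple


def partition_classes(n_classes: int, n_tasks: int, n_percent: int,
                      seed: int = 0) -> Tuple[Dict[int, List[int]], List[int]]:
    """Split class ids into per-task disjoint classes and a blurry group."""
    if not 0 <= n_percent <= 100:
        raise ValueError("n_percent must be in [0, 100]")
    if n_tasks <= 0:
        raise ValueError("n_tasks must be positive")
    classes = list(range(n_classes))
    random.Random(seed).shuffle(classes)
    # N percent of the classes are disjoint; the rest are blurry.
    n_disjoint = round(n_classes * n_percent / 100)
    disjoint, blurry = classes[:n_disjoint], classes[n_disjoint:]
    # Assign disjoint classes to tasks round-robin.
    per_task = {t: sorted(disjoint[t::n_tasks]) for t in range(n_tasks)}
    return per_task, sorted(blurry)


# Hypothetical example: 10 classes, 5 tasks, N=50.
per_task, blurry = partition_classes(n_classes=10, n_tasks=5, n_percent=50)
print("disjoint classes per task:", per_task)
print("blurry classes:", blurry)
```

With N=100 the blurry group is empty, and with N=0 no class is tied to a single task, matching the two extremes described for N.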

Citation

If you use our code or the i-Blurry setup, please cite our paper.

@inproceedings{koh2022online,
  title={Online Continual Learning on Class Incremental Blurry Task Configuration with Anytime Inference},
  author={Koh, Hyunseo and Kim, Dahyun and Ha, Jung-Woo and Choi, Jonghyun},
  booktitle={ICLR},
  year={2022}
}

License

Copyright (C) 2022-present NAVER Corp.

This program is free software: you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
the Free Software Foundation, either version 3 of the License, or
(at your option) any later version.

This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
GNU General Public License for more details.

You should have received a copy of the GNU General Public License
along with this program.  If not, see <https://www.gnu.org/licenses/>.

Owner

NAVER AI
