PyTorch implementation of DirectCLR from paper Understanding Dimensional Collapse in Contrastive Self-supervised Learning

Last update: Dec 21, 2022

Related tags

Deep Learning directclr

Overview

DirectCLR

DirectCLR is a simple contrastive learning model for visual representation learning. It does not require a trainable projector as SimCLR. It is able to prevent dimensional collapse and outperform SimCLR with a linear projector.

PyTorch implementation of DirectCLR from paper Understanding Dimensional Collapse in Contrastive Self-supervised Learning.

@article{Jing2021UnderstandingDC,
  title={Understanding Dimensional Collapse in Contrastive Self-supervised Learning},
  author={Li Jing and Pascal Vincent and Yann LeCun and Yuandong Tian},
  journal={arXiv preprint arXiv:2110.09348},
  year={2021}
}

DirectCLR Training

Install PyTorch and download ImageNet by following the instructions in the requirements section of the PyTorch ImageNet training example. The code has been developed for PyTorch version 1.7.1 and torchvision version 0.8.2, but it should work with other versions just as well.

Our best model is obtained by running the following command:

python main.py --data /path/to/imagenet/ --mode directclr --dim 360

Mode can be chosen as:

simclr: standard SimCLR with two layer nonlinear projector;

single: SimCLR with single layer linear projector;

baseline: SimCLR without a projector;

directclr: DirectCLR with single layer linear projector;

Training time is approximately 7 hours on 32 v100 GPUs.

Evaluation: Linear Classification

Train a linear probe on the representations. Freeze the weights of the resnet and use the entire ImageNet training set.

python linear_probe.py /path/to/imagenet/ /path/to/checkpoint/resnet50.pth

Linear probe time is approximately 20 hours on 8 v100 GPUs.

License

This project is under the CC-BY-NC 4.0 license. See LICENSE for details.

PyTorch implementation of DirectCLR from paper Understanding Dimensional Collapse in Contrastive Self-supervised Learning

Related tags

Overview

DirectCLR

DirectCLR Training

Evaluation: Linear Classification

License

Owner

Meta Research

Code for ICCV2021 paper PARE: Part Attention Regressor for 3D Human Body Estimation

Official repository for Natural Image Matting via Guided Contextual Attention

An official repository for Paper "Uformer: A General U-Shaped Transformer for Image Restoration".

Deep Federated Learning for Autonomous Driving

[CVPR 2022] Official code for the paper: "A Stitch in Time Saves Nine: A Train-Time Regularizing Loss for Improved Neural Network Calibration"

《Where am I looking at? Joint Location and Orientation Estimation by Cross-View Matching》(CVPR 2020)

Code for Ditto: Building Digital Twins of Articulated Objects from Interaction

Doosan robotic arm, simulation, control, visualization in Gazebo and ROS2 for Reinforcement Learning.

Official repository for "Deep Recurrent Neural Network with Multi-scale Bi-directional Propagation for Video Deblurring".

Cross View SLAM

Some simple programs built in Python: webcam with cv2 that detects eyes and face, with grayscale filter

A collection of 100 Deep Learning images and visualizations

Pytorch implementation for the EMNLP 2020 (Findings) paper: Connecting the Dots: A Knowledgeable Path Generator for Commonsense Question Answering

NeRF visualization library under construction

A Fast and Accurate One-Stage Approach to Visual Grounding, ICCV 2019 (Oral)

Tools for investing in Python

Voice Conversion Using Speech-to-Speech Neuro-Style Transfer

Implementation of ML models like Decision tree, Naive Bayes, Logistic Regression and many other

Fully-automated scripts for collecting AI-related papers

The Official TensorFlow Implementation for SPatchGAN (ICCV2021)