Self-Supervised Learning with Kernel Dependence Maximization

Last update: Dec 29, 2022

Related tags

Overview

Self-Supervised Learning with Kernel Dependence Maximization

This is the code for SSL-HSIC, a self-supervised learning loss proposed in the paper Self-Supervised Learning with Kernel Dependence Maximization (https://arxiv.org/abs/2106.08320).

Using this implementation should achieve a top-1 accuracy on Imagenet around 74.8% using 128 Cloud TPU v2/3.

Installation

To set up a Python3 virtual environment with the required dependencies, run:

python3 -m venv ssl_hsic_env
source ssl_hsic_env/bin/activate
pip install --upgrade pip
pip install -r ssl_hsic/requirements.txt

Usage

Pre-training

For pre-training on ImageNet with SSL-HSIC loss:

mkdir /tmp/ssl_hsic
python3 -m ssl_hsic.experiment \
--config=ssl_hsic/config.py:default \
--jaxline_mode=train

This is going to pre-train for 1000 epochs. Change config to config.py:test for testing purpose. See jaxline documentation for more information on jaxline_mode.

If save_dir is provided in config.py, the last checkpoint is saved and can be used for evaluation.

Linear Evaluation

For linear evaluation with the saved checkpoint:

mkdir /tmp/ssl_hsic
python3 -m ssl_hsic.eval_experiment \
--config=ssl_hsic/eval_config.py:default \
--jaxline_mode=train

This is going to train a linear layer for 90 epochs. Change config to eval_config.py:test for testing.

Citing this work

If you use this code in your work, please consider referencing our work:

@inproceedings{
  li2021selfsupervised,
  title={Self-Supervised Learning with Kernel Dependence Maximization},
  author={Yazhe Li and Roman Pogodin and Danica J. Sutherland and Arthur Gretton},
  booktitle={Thirty-Fifth Conference on Neural Information Processing Systems},
  year={2021},
  url={https://openreview.net/forum?id=0HW7A5YZjq7}
}

Disclaimer

This is not an official Google product.

Self-Supervised Learning with Kernel Dependence Maximization

Related tags

Overview

Self-Supervised Learning with Kernel Dependence Maximization

Installation

Usage

Pre-training

Linear Evaluation

Citing this work

Disclaimer

Owner

DeepMind

The official implementation for ACL 2021 "Challenges in Information Seeking QA: Unanswerable Questions and Paragraph Retrieval".

Official implementation for ICDAR 2021 paper "Handwritten Mathematical Expression Recognition with Bidirectionally Trained Transformer"

Cerberus Transformer: Joint Semantic, Affordance and Attribute Parsing

Official PyTorch Implementation of Mask-aware IoU and maYOLACT Detector [BMVC2021]

CSPML (crystal structure prediction with machine learning-based element substitution)

A simple python program that can be used to implement user authentication tokens into your program...

Pytorch implementation of Cut-Thumbnail in the paper Cut-Thumbnail:A Novel Data Augmentation for Convolutional Neural Network.

ALBERT-pytorch-implementation - ALBERT pytorch implementation

YOLOv4 / Scaled-YOLOv4 / YOLO - Neural Networks for Object Detection (Windows and Linux version of Darknet )

Unofficial PyTorch implementation of the Adaptive Convolution architecture for image style transfer

AugLiChem - The augmentation library for chemical systems.

OpenIPDM is a MATLAB open-source platform that stands for infrastructures probabilistic deterioration model

FrankMocap: A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator

Code for the paper "PortraitNet: Real-time portrait segmentation network for mobile device" @ CAD&Graphics2019

Code for "The Box Size Confidence Bias Harms Your Object Detector"

This is the repo for our work "Towards Persona-Based Empathetic Conversational Models" (EMNLP 2020)

Generative Adversarial Text-to-Image Synthesis

This repo is the code release of EMNLP 2021 conference paper "Connect-the-Dots: Bridging Semantics between Words and Definitions via Aligning Word Sense Inventories".

Testing and Estimation of structural breaks in Stata

Repo for "Benchmarking Robustness of 3D Point Cloud Recognition against Common Corruptions" https://arxiv.org/abs/2201.12296