Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations

Last update: Dec 29, 2022

Related tags

Overview

Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations

Code repo for paper Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations.

Dependencies

torch=1.8.1
transformers=4.9.0
sentence-transformers=2.0.0

Please view `requirements.txt' for more details.

Train

Self-distillation:

>> bash train_self_distill.sh 0

0 denotes GPU device index.

Mutual-distillation (two GPUs needed):

>> bash train_mutual_distill.sh 1,2

Train with your custom corpus:

>> CUDA_VISIBLE_DEVICES=0,1 python src/mutual_distill_parallel.py \
         --batch_size_bi_encoder 128 \
         --batch_size_cross_encoder 64 \
         --num_epochs_bi_encoder 10 \
         --num_epochs_cross_encoder 1 \
         --cycle 3 \
         --bi_encoder1_pooling_mode cls \
         --bi_encoder2_pooling_mode cls \
         --init_with_new_models \
         --task custom \
         --random_seed 2021 \
         --custom_corpus_path CORPUS_PATH

CORPUS_PATH should point to your custom corpus in which every line should be a sentence pair in the form of sent1||sent2.

Evaluate

>> python src/eval.py

Authors

Fangyu Liu: Main contributor

Security

See CONTRIBUTING for more information.

License

This project is licensed under the Apache-2.0 License.

Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations

Related tags

Overview

Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations

Dependencies

Train

Evaluate

Authors

Security

License

Owner

Amazon

The open-source and free to use Python package miseval was developed to establish a standardized medical image segmentation evaluation procedure

Official PyTorch implementation of paper: Standardized Max Logits: A Simple yet Effective Approach for Identifying Unexpected Road Obstacles in Urban-Scene Segmentation (ICCV 2021 Oral Presentation)

Implementation for the IJCAI2021 work "Beyond the Spectrum: Detecting Deepfakes via Re-synthesis"

A Python script that creates subtitles of a given length from text paragraphs that can be easily imported into any Video Editing software such as FinalCut Pro for further adjustments.

AutoPentest-DRL: Automated Penetration Testing Using Deep Reinforcement Learning

Event sourced bank - A wide-and-shallow example using the Python event sourcing library

Jupyter notebooks for using & learning Keras

Unofficial TensorFlow implementation of the Keyword Spotting Transformer model

MagFace: A Universal Representation for Face Recognition and Quality Assessment

Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!

ScaleNet: A Shallow Architecture for Scale Estimation

[ICCV 2021] Deep Hough Voting for Robust Global Registration

U-Net: Convolutional Networks for Biomedical Image Segmentation

This is the official repository for our paper: ''Pruning Self-attentions into Convolutional Layers in Single Path''.

Towards Part-Based Understanding of RGB-D Scans

Preparation material for Dropbox interviews

“袋鼯麻麻——智能购物平台”能够精准地定位识别每一个商品

A time series processing library

Pytorch Implementation of "Diagonal Attention and Style-based GAN for Content-Style disentanglement in image generation and translation" (ICCV 2021)

The source codes for ACL 2021 paper 'BoB: BERT Over BERT for Training Persona-based Dialogue Models from Limited Personalized Data'