A Structured Self-attentive Sentence Embedding

Last update: Nov 28, 2022

Overview

Structured Self-attentive sentence embeddings

An implementation of the paper A Structured Self-Attentive Sentence Embedding, published at ICLR 2017: https://arxiv.org/abs/1703.03130

USAGE:

For binary sentiment classification on the IMDB dataset, run: python classification.py "binary"

For multiclass classification on the Reuters dataset, run: python classification.py "multiclass"

You can change the model parameters in the model_params.json file. Other training parameters, such as the number of attention hops, can be configured in the config.json file.

To use pretrained GloVe embeddings, set the use_embeddings parameter to "True" (the default is False). Do not forget to download glove.6B.50d.txt and place it in the glove folder.
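As a rough illustration of what using the pretrained vectors involves, here is a minimal sketch that parses a GloVe text file (one token per line followed by its space-separated float values) and fills an embedding matrix. The function names load_glove and build_embedding_matrix and the word_index mapping are assumptions made for this example, not names taken from this repository.

```python
def load_glove(path, dim=50):
    """Parse a GloVe text file into a {word: [float, ...]} dictionary."""
    vectors = {}
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            parts = line.rstrip().split(" ")
            # Each line is a token followed by `dim` space-separated floats.
            if len(parts) != dim + 1:
                continue  # skip malformed lines rather than guess
            vectors[parts[0]] = [float(x) for x in parts[1:]]
    return vectors


def build_embedding_matrix(word_index, vectors, dim=50):
    """Row i holds the vector of the word with index i; unknown words stay zero.

    `word_index` is a hypothetical {word: integer index} vocabulary mapping.
    """
    matrix = [[0.0] * dim for _ in range(max(word_index.values()) + 1)]
    for word, idx in word_index.items():
        if word in vectors:
            matrix[idx] = list(vectors[word])
    return matrix
```

With the file in place, a call such as load_glove("glove/glove.6B.50d.txt") would return the 50-dimensional vectors keyed by token.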

Implemented:

Classification using self attention
Regularization using Frobenius norm
Gradient clipping
Visualizing the attention weights
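Two of the items above can be sketched in a few lines of plain Python. The first function computes the penalty term from the paper, ||A A^T - I||_F^2, for an attention matrix A with one row per hop. The second shows clipping by global norm, which is one common form of gradient clipping; this README does not say which variant the repository uses, so treat that choice, along with both function names, as assumptions of this example.

```python
import math


def frobenius_penalty(attention):
    """Return ||A A^T - I||_F^2 for an r x n attention matrix (list of rows)."""
    hops = len(attention)
    total = 0.0
    for i in range(hops):
        for j in range(hops):
            # Entry (i, j) of A A^T is the dot product of hop i and hop j.
            dot = sum(a * b for a, b in zip(attention[i], attention[j]))
            target = 1.0 if i == j else 0.0
            total += (dot - target) ** 2
    return total


def clip_by_global_norm(grads, max_norm):
    """Rescale a flat list of gradients so its L2 norm is at most max_norm."""
    norm = math.sqrt(sum(g * g for g in grads))
    if norm == 0.0 or norm <= max_norm:
        return list(grads)
    scale = max_norm / norm
    return [g * scale for g in grads]
```

The penalty is zero when the hops attend to disjoint single tokens and grows as different hops place their weight on the same tokens, which is why it encourages the hops to focus on different parts of the sentence.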

Instead of pruning, averaging over the sentence embeddings is used.
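One plausible reading of this averaging step is sketched below: the paper forms a sentence embedding matrix M = A H with one row per attention hop, and the rows are then averaged into a single vector instead of being passed through the paper's pruned layer. That interpretation, and the function names, are assumptions of this example; the repository code may differ in detail.

```python
def sentence_embedding_matrix(attention, hidden):
    """M = A H: one d-dimensional embedding per attention hop.

    attention: r x n list of rows (weights over the n tokens, per hop)
    hidden:    n x d list of rows (one hidden state per token)
    """
    dim = len(hidden[0])
    return [
        [sum(a_t * h_t[k] for a_t, h_t in zip(row, hidden)) for k in range(dim)]
        for row in attention
    ]


def average_hops(embeddings):
    """Average the r hop embeddings column-wise into one d-dimensional vector."""
    hops = len(embeddings)
    return [sum(column) / hops for column in zip(*embeddings)]
```

Averaging keeps the classifier input at size d regardless of the number of hops, whereas flattening M would grow it to r times d.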

Visualization:

After training, the model is tested on 100 test points. The attention weights for these 100 test examples are retrieved and rendered as heatmaps over the text. A file named visualization.html is saved in the visualization/ folder after successful training. The visualization code was provided by Zhouhan Lin (@hantek). Many thanks.
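To give a feel for how attention weights can be turned into a text heatmap, here is a minimal sketch that shades each token by its weight. It is not the visualization code credited above; the function name attention_to_html, the red colour scale, and the per-sentence normalisation are all assumptions of this example.

```python
import html


def attention_to_html(tokens, weights):
    """Render tokens as <span> elements shaded by their attention weight."""
    peak = max(weights) or 1.0  # avoid dividing by zero when all weights are 0
    spans = []
    for token, weight in zip(tokens, weights):
        alpha = weight / peak  # strongest token in the sentence gets full colour
        spans.append(
            '<span style="background-color: rgba(255, 0, 0, {:.2f})">{}</span>'.format(
                alpha, html.escape(token)
            )
        )
    return " ".join(spans)
```

Joining one such line per test sentence with line breaks and writing the result to an .html file gives a page where the most attended words stand out.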

Below is a screenshot of the visualization on a few data points.

Training accuracy: 93.4%. Tested on 1000 points with 90.2% accuracy.


Owner

Kaushal Shetty
