Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)

Last update: Oct 27, 2022

Related tags

Overview

Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)

Overview of paths used in DIG and IG. w is the word being attributed. The gray region is the neighborhood of w. Green line depicts the straight-line path from w to w' used by IG and the green squares are the corresponding interpolation points. Left: In DIG-Greedy, we first monotonize each word in the neighborhood (red arrow). Then the word closest to its corresponding monotonic point is selected as the anchor (blue line to w_5 since the red arrow of w_5 has the shortest magnitude). Right: In DIG-MaxCount we first count the number of monotonic dimensions for each word in the neighborhood (shown in [.] above). Then, the word with the highest number of monotonic dimensions is selected as the anchor word (blue line to w_4), followed by changing the non-monotonic dimensions of w_4 (red line to c). Repeating this step gives the zigzag blue path. Finally, the red stars are the interpolated points used by our method. Please refer to the paper for more details.

Dependencies

Dependencies can be installed using requirements.txt.

Evaluating DIG:

Install all the requirements from requirements.txt.
Execute ./setup.sh for setting up the folder hierarchy for experiments.

Commands for reproducing the reported results on DistilBERT fine-tuned on SST2:

# Generate the KNN graph
python knn.py -dataset sst2 -nn distilbert

# DIG (strategy: Greedy)
python main.py -dataset sst2 -nn distilbert -strategy greedy

# DIG (strategy: MaxCount)
python main.py -dataset sst2 -nn distilbert -strategy maxcount

Similarly, commands can be changed for other settings.

Please contact Soumya for any clarifications or suggestions.

Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)

Related tags

Overview

Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)

Dependencies

Evaluating DIG:

Owner

INK Lab @ USC

Laplacian Score-regularized Concrete Autoencoders

TensorLight - A high-level framework for TensorFlow

A voice recognition assistant similar to amazon alexa, siri and google assistant.

Repository of the paper Compressing Sensor Data for Remote Assistance of Autonomous Vehicles using Deep Generative Models at ML4AD @ NeurIPS 2021.

Notes, programming assignments and quizzes from all courses within the Coursera Deep Learning specialization offered by deeplearning.ai

ONNX Command-Line Toolbox

Turning SymPy expressions into JAX functions

A library for hidden semi-Markov models with explicit durations

Nonnegative spatial factorization for multivariate count data

Official implementation of the RAVE model: a Realtime Audio Variational autoEncoder

yolov5 deepsort 行人车辆跟踪检测计数

A python interface for training Reinforcement Learning bots to battle on pokemon showdown

Tzer: TVM Implementation of "Coverage-Guided Tensor Compiler Fuzzing with Joint IR-Pass Mutation (OOPSLA'22)“.

Direct design of biquad filter cascades with deep learning by sampling random polynomials.

Introduction to Statistics and Basics of Mathematics for Data Science - The Hacker's Way

A PyTorch implementation of the paper "Semantic Image Synthesis via Adversarial Learning" in ICCV 2017

Action Segmentation Evaluation

A pytorch implementation of Reading Wikipedia to Answer Open-Domain Questions.

A Pytorch Implementation of ClariNet

Official implementation of EfficientPose

Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)

Related tags

Overview

Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)

Dependencies

Evaluating DIG:

Owner

INK Lab @ USC

Laplacian Score-regularized Concrete Autoencoders

TensorLight - A high-level framework for TensorFlow

A voice recognition assistant similar to amazon alexa, siri and google assistant.

Repository of the paper Compressing Sensor Data for Remote Assistance of Autonomous Vehicles using Deep Generative Models at ML4AD @ NeurIPS 2021.

Notes, programming assignments and quizzes from all courses within the Coursera Deep Learning specialization offered by deeplearning.ai

ONNX Command-Line Toolbox

Turning SymPy expressions into JAX functions

A library for hidden semi-Markov models with explicit durations

Nonnegative spatial factorization for multivariate count data

Official implementation of the RAVE model: a Realtime Audio Variational autoEncoder

yolov5 deepsort 行人 车辆 跟踪 检测 计数

A python interface for training Reinforcement Learning bots to battle on pokemon showdown

Tzer: TVM Implementation of "Coverage-Guided Tensor Compiler Fuzzing with Joint IR-Pass Mutation (OOPSLA'22)“.

Direct design of biquad filter cascades with deep learning by sampling random polynomials.

Introduction to Statistics and Basics of Mathematics for Data Science - The Hacker's Way

A PyTorch implementation of the paper "Semantic Image Synthesis via Adversarial Learning" in ICCV 2017

Action Segmentation Evaluation

A pytorch implementation of Reading Wikipedia to Answer Open-Domain Questions.

A Pytorch Implementation of ClariNet

Official implementation of EfficientPose

yolov5 deepsort 行人车辆跟踪检测计数