Code repository for EMNLP 2021 paper 'Adversarial Attacks on Knowledge Graph Embeddings via Instance Attribution Methods'

Last update: Sep 25, 2022

Related tags

Deep Learning AttributionAttack

Overview

Adversarial Attacks on Knowledge Graph Embeddings
via Instance Attribution Methods

This is the code repository to accompany the EMNLP 2021 paper on adversarial attacks on KGE models.
For any questions or feedback, add an issue or email me at: [email protected]

Overview

The figure illustrates adversarial attacks against KGE models for fraud detection. The knowledge graph consists of two types of entities - Person and BankAccount. The missing target triple to predict is (Sam, allied_with, Joe). Original KGE model predicts this triple as True, i.e. assigns it a higher score relative to synthetic negative triples. But a malicious attacker uses the instance attribution methods to either (a) delete an adversarial triple or (b) add an adversarial triple. Now, the KGE model predicts the missing target triple as False.

The attacker uses the instance attribution methods to identify the training triples that are most influential for model's prediciton on the target triple. These influential triples are used as adversarial deletions. Using the influential triple, the attacker further selects adversarial additions by replacing one of the two entities of the influential triple with the most dissimilar entity in the embedding space. For example, if the attacker identifies that (Sam, deposits_to, Suspicious_Account) is the most influential triple for predicting (Sam, allied_with, Joe), then they can add (Sam, deposits_to, Non_Suspicious_Account) to reduce the influence of the influential triple.

Reproducing the results

Setup

python = 3.8.5
pytorch = 1.4.0
numpy = 1.19.1
jupyter = 1.0.0
pandas = 1.1.0
matplotlib = 3.2.2
scikit-learn = 0.23.2
seaborn = 0.11.0

Experiments reported in the paper were run in the conda environment attribution_attack.yml.

Steps

The codebase and the bash scripts used for experiments are in KGEAttack.
To preprocess the original dataset, use the bash script preprocess.sh.
For each model-dataset combination, there is a bash script to train the original model, generate attacks from baselines and proposed attacks; and train poisoned model. These scripts are named as model-dataset.sh.
The instructions in these scripts are grouped together under the echo statements which indicate what they do.
The commandline argument --reproduce-results uses the hyperparameters that were used for the experiments reported in the paper. These hyperparameter values can be inspected in the function set_hyperparams() in utils.py.
To reproduce the results, specific instructions from the bash scripts can be run on commandline or the full script can be run.
All experiments in the paper were run on a shared HPC cluster that had Nvidia RTX 2080ti, Tesla K40 and V100 GPUs.

References

Parts of this codebase are based on the code from following repositories

Citation

@inproceedings{bhardwaj-etal-2021-adversarial,
    title = "Adversarial Attacks on Knowledge Graph Embeddings via Instance Attribution Methods",
    author = "Bhardwaj, Peru  and
      Kelleher, John  and
      Costabello, Luca  and
      O{'}Sullivan, Declan",
    booktitle = "Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing",
    month = nov,
    year = "2021",
    address = "Online and Punta Cana, Dominican Republic",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2021.emnlp-main.648",
    pages = "8225--8239",
    }

Code repository for EMNLP 2021 paper 'Adversarial Attacks on Knowledge Graph Embeddings via Instance Attribution Methods'

Related tags

Overview

Adversarial Attacks on Knowledge Graph Embeddings
via Instance Attribution Methods

This is the code repository to accompany the EMNLP 2021 paper on adversarial attacks on KGE models.
For any questions or feedback, add an issue or email me at: [email protected]

Overview

Reproducing the results

Setup

Steps

References

Citation

Owner

Peru Bhardwaj

A Pytorch Implementation of ClariNet

RuDOLPH: One Hyper-Modal Transformer can be creative as DALL-E and smart as CLIP

Towards Boosting the Accuracy of Non-Latin Scene Text Recognition

[AAAI 2022] Separate Contrastive Learning for Organs-at-Risk and Gross-Tumor-Volume Segmentation with Limited Annotation

A machine learning library for spiking neural networks. Supports training with both torch and jax pipelines, and deployment to neuromorphic hardware.

OrienMask: Real-time Instance Segmentation with Discriminative Orientation Maps

SMPL-X: A new joint 3D model of the human body, face and hands together

An ever-growing playground of notebooks showcasing CLIP's impressive zero-shot capabilities.

RealTime Emotion Recognizer for Machine Learning Study Jam's demo

Deep Surface Reconstruction from Point Clouds with Visibility Information

Python scripts form performing stereo depth estimation using the CoEx model in ONNX.

SW components and demos for visual kinship recognition. An emphasis is put on the FIW dataset-- data loaders, benchmarks, results in summary.

Official pytorch implementation of the IrwGAN for unaligned image-to-image translation

Sequential Model-based Algorithm Configuration

Group project for MFIN7036. Our goal is to predict firm profitability with text-based competition measures.

[AAAI22] Reliable Propagation-Correction Modulation for Video Object Segmentation

Code release for the ICML 2021 paper "PixelTransformer: Sample Conditioned Signal Generation".

Learning-Augmented Dynamic Power Management

Face Recognition Attendance Project

EMNLP 2021 paper The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers.

Code repository for EMNLP 2021 paper 'Adversarial Attacks on Knowledge Graph Embeddings via Instance Attribution Methods'

Related tags

Overview

Adversarial Attacks on Knowledge Graph Embeddings via Instance Attribution Methods

This is the code repository to accompany the EMNLP 2021 paper on adversarial attacks on KGE models. For any questions or feedback, add an issue or email me at: [email protected]

Overview

Reproducing the results

Setup

Steps

References

Citation

Owner

Peru Bhardwaj

A Pytorch Implementation of ClariNet

RuDOLPH: One Hyper-Modal Transformer can be creative as DALL-E and smart as CLIP

Towards Boosting the Accuracy of Non-Latin Scene Text Recognition

[AAAI 2022] Separate Contrastive Learning for Organs-at-Risk and Gross-Tumor-Volume Segmentation with Limited Annotation

A machine learning library for spiking neural networks. Supports training with both torch and jax pipelines, and deployment to neuromorphic hardware.

OrienMask: Real-time Instance Segmentation with Discriminative Orientation Maps

SMPL-X: A new joint 3D model of the human body, face and hands together

An ever-growing playground of notebooks showcasing CLIP's impressive zero-shot capabilities.

RealTime Emotion Recognizer for Machine Learning Study Jam's demo

Deep Surface Reconstruction from Point Clouds with Visibility Information

Python scripts form performing stereo depth estimation using the CoEx model in ONNX.

SW components and demos for visual kinship recognition. An emphasis is put on the FIW dataset-- data loaders, benchmarks, results in summary.

Official pytorch implementation of the IrwGAN for unaligned image-to-image translation

Sequential Model-based Algorithm Configuration

Group project for MFIN7036. Our goal is to predict firm profitability with text-based competition measures.

[AAAI22] Reliable Propagation-Correction Modulation for Video Object Segmentation

Code release for the ICML 2021 paper "PixelTransformer: Sample Conditioned Signal Generation".

Learning-Augmented Dynamic Power Management

Face Recognition Attendance Project

EMNLP 2021 paper The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers.

Adversarial Attacks on Knowledge Graph Embeddings
via Instance Attribution Methods

This is the code repository to accompany the EMNLP 2021 paper on adversarial attacks on KGE models.
For any questions or feedback, add an issue or email me at: [email protected]