Repository for "Improving evidential deep learning via multi-task learning," published in AAAI2022

Last update: Nov 19, 2022

Overview

Improving evidential deep learning via multi-task learning

This is the repository for the AAAI 2022 paper “Improving evidential deep learning via multi-task learning” by Dongpin Oh and Bonggun Shin.

This repository contains the code to reproduce the multi-task evidential neural network (MT-ENet), which adds the Lipschitz MSE loss function to the evidential regression network (ENet) as an auxiliary loss. The Lipschitz MSE loss can improve the accuracy of the ENet while preserving its uncertainty estimation capability, because it avoids gradient conflict with the negative log-likelihood (NLL) loss, the original loss function of the ENet.

Setup

Please refer to "requirements.txt" for the packages this repo requires.

pip install -r requirements.txt

Training the ENet with the Lipschitz-MSE loss: example

from mtevi.mtevi import EvidentialMarginalLikelihood, EvidenceRegularizer, modified_mse
...
net = EvidentialNetwork()  # evidential regression network
nll_loss = EvidentialMarginalLikelihood()  # original loss (NLL)
reg = EvidenceRegularizer()  # evidential regularizer
mmse_loss = modified_mse  # Lipschitz MSE loss
...
for inputs, labels in dataloader:
    # The network outputs the four evidential parameters per target.
    gamma, nu, alpha, beta = net(inputs)
    # Total loss = NLL + evidence regularizer + Lipschitz MSE.
    loss = nll_loss(gamma, nu, alpha, beta, labels)
    loss += reg(gamma, nu, alpha, beta, labels)
    loss += mmse_loss(gamma, nu, alpha, beta, labels)
    loss.backward()
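
The four outputs (gamma, nu, alpha, beta) in the loop above can be turned into a prediction and two uncertainty estimates. The sketch below is not taken from this repository: it assumes the standard deep evidential regression formulation (Normal-Inverse-Gamma outputs), and `nig_uncertainty` is a hypothetical helper name. It uses plain Python floats so that it runs without any dependency.

```python
def nig_uncertainty(gamma, nu, alpha, beta):
    """Return (prediction, aleatoric, epistemic) for one evidential output.

    Assumes the standard Normal-Inverse-Gamma parameterization used in
    deep evidential regression; valid only for alpha > 1 and nu > 0.
    """
    if alpha <= 1.0 or nu <= 0.0:
        raise ValueError("need alpha > 1 and nu > 0")
    prediction = gamma                        # E[mu]
    aleatoric = beta / (alpha - 1.0)          # E[sigma^2], data noise
    epistemic = beta / (nu * (alpha - 1.0))   # Var[mu], model uncertainty
    return prediction, aleatoric, epistemic


pred, alea, epis = nig_uncertainty(gamma=0.5, nu=2.0, alpha=3.0, beta=4.0)
print(pred, alea, epis)
```

A larger nu (more "virtual observations" of the mean) shrinks the epistemic term while leaving the aleatoric term unchanged.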

Quick start

Synthetic data experiment.

python synthetic_exp.py

UCI regression benchmark experiments.

python uci_exp_norm.py -p energy

Drug target affinity (DTA) regression task on KIBA and Davis datasets.

python train_evinet.py -o test --type davis -f 0 --evi # ENet
python train_evinet.py -o test --type davis -f 0  # MT-ENet

Gradient conflict experiment on the DTA benchmarks

python check_conflict.py --type davis -f 0 # Conflict between the Lipschitz MSE (proposed) and NLL loss. 
python check_conflict.py --type davis -f 0 --abl # Conflict between the simple MSE loss and NLL loss.
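
A common way to quantify gradient conflict between two losses is the cosine similarity of their gradients with respect to the shared parameters: a negative value means the two losses pull the parameters in opposing directions. The sketch below illustrates that measure on flat Python lists. Whether `check_conflict.py` computes exactly this quantity is an assumption, and the function names and example gradients are hypothetical.

```python
import math


def cosine_similarity(g1, g2):
    """Cosine similarity between two flattened gradient vectors."""
    dot = sum(a * b for a, b in zip(g1, g2))
    norm1 = math.sqrt(sum(a * a for a in g1))
    norm2 = math.sqrt(sum(b * b for b in g2))
    if norm1 == 0.0 or norm2 == 0.0:
        return 0.0  # a zero gradient cannot conflict with anything
    return dot / (norm1 * norm2)


def is_conflicting(g1, g2):
    """Two gradients conflict when they point in opposing directions."""
    return cosine_similarity(g1, g2) < 0.0


# Made-up gradients of the NLL loss and an MSE-type loss.
grad_nll = [1.0, -2.0, 0.5]
grad_mse = [-1.0, 1.0, 0.0]
print(is_conflicting(grad_nll, grad_mse))
```

In a PyTorch setting, the two vectors would typically be obtained by differentiating each loss separately and flattening the per-parameter gradients.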

Characteristic of the Lipschitz MSE loss

The Lipschitz MSE loss function helps the ENet predict target values more accurately.
It regularizes its own gradient to prevent conflict with the NLL loss, the original loss function, when the NLL loss increases the predictive uncertainty of the ENet.
Please check our paper for details.
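
To give a feel for what "regularizing the gradient" can mean, here is a minimal sketch of a generic Lipschitz-bounded squared error: quadratic while the squared error is below a threshold, linear above it, so the gradient magnitude never exceeds a fixed bound. This is an illustration, not the repository's `modified_mse`. The `threshold` argument is a hypothetical free parameter; in MT-ENet the bound presumably depends on the evidential outputs passed to `modified_mse`, for which see the paper and the `mtevi.mtevi` module.

```python
import math


def lipschitz_mse(pred, target, threshold):
    """Squared error, switched to a linear penalty once it exceeds `threshold`.

    `threshold` is an illustrative bound on the squared error (must be > 0).
    """
    if threshold <= 0.0:
        raise ValueError("threshold must be positive")
    err = abs(pred - target)
    sq = err * err
    if sq < threshold:
        return sq
    # Linear branch, chosen so value and slope match at sq == threshold.
    return 2.0 * math.sqrt(threshold) * err - threshold


def lipschitz_mse_grad(pred, target, threshold):
    """Derivative w.r.t. `pred`; magnitude is at most 2 * sqrt(threshold)."""
    bound = math.sqrt(threshold)
    clipped = max(-bound, min(bound, pred - target))
    return 2.0 * clipped


print(lipschitz_mse(1.0, 0.0, 4.0), lipschitz_mse_grad(1.0, 0.0, 4.0))
print(lipschitz_mse(5.0, 0.0, 4.0), lipschitz_mse_grad(5.0, 0.0, 4.0))
```

For small errors this behaves like the ordinary MSE; for large errors the gradient saturates, which limits how strongly this term can push against another loss.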

Owner

deargen
