Unofficial Tensorflow-Keras implementation of Fastformer based on paper [Fastformer: Additive Attention Can Be All You Need](https://arxiv.org/abs/2108.09084).

Last update: Jan 30, 2022

Related tags

Deep Learning Fastformer-Keras

Overview

Fastformer-Keras

Unofficial Tensorflow-Keras implementation of Fastformer based on paper Fastformer: Additive Attention Can Be All You Need.

Tensorflow-keras port of the following repositories:

- https://github.com/wilile26811249/Fastformer-PyTorch

- https://github.com/cheesama/stock-transformer

I just cleaned up and translated their work, All credits whatsoever goes to them! :)

Usage :

from fastformer import Fastformer
from tensorflow.keras.models import Model
from tensorflow.keras.layers import Input, Concatenate, GlobalAveragePooling1D, Dropout, Dense

in_seq = Input(shape=(128, 64))
x = Fastformer(64)(in_seq)
x = GlobalAveragePooling1D(data_format='channels_first')(x)
x = Dense(64, activation = 'relu')(x)
out = Dense(1, activation = 'linear')(x)
model = Model(inputs = in_seq, outputs = out)
model.compile(loss = 'mse', optimizer = 'adam', metrics = ['mae', 'mape'])

Citation :

@misc{wu2021fastformer,
    title={Fastformer: Additive Attention Can Be All You Need},
    author={Chuhan Wu, Fangzhao Wu, Tao Qi and Yongfeng Huang},
    year={2021},
    eprint={2108.09084v2},
    archivePrefix={arXiv},
    primaryClass={cs.CV}
}

Unofficial Tensorflow-Keras implementation of Fastformer based on paper [Fastformer: Additive Attention Can Be All You Need](https://arxiv.org/abs/2108.09084).

Related tags

Overview

Fastformer-Keras

Tensorflow-keras port of the following repositories:

- https://github.com/wilile26811249/Fastformer-PyTorch

- https://github.com/cheesama/stock-transformer

Usage :

Citation :

If this implement have any problem please let me know, thank you.

Owner

Yam Peleg

"Moshpit SGD: Communication-Efficient Decentralized Training on Heterogeneous Unreliable Devices", official implementation

DualGAN-tensorflow: tensorflow implementation of DualGAN

Supervised Sliding Window Smoothing Loss Function Based on MS-TCN for Video Segmentation

Material for my PyConDE & PyData Berlin 2022 Talk "5 Steps to Speed Up Your Data-Analysis on a Single Core"

[ICCV2021] Official code for "Channel-wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition"

MazeRL is an application oriented Deep Reinforcement Learning (RL) framework

Progressive Image Deraining Networks: A Better and Simpler Baseline

Data labels and scripts for fastMRI.org

Fully Adaptive Bayesian Algorithm for Data Analysis (FABADA) is a new approach of noise reduction methods. In this repository is shown the package developed for this new method based on \citepaper.

Back to the Feature: Learning Robust Camera Localization from Pixels to Pose (CVPR 2021)

PyTorch implementation of Soft-DTW: a Differentiable Loss Function for Time-Series in CUDA

PyTorch implementation of the supervised learning experiments from the paper Model-Agnostic Meta-Learning (MAML)

[ICCV 2021] Target Adaptive Context Aggregation for Video Scene Graph Generation

Using contrastive learning and OpenAI's CLIP to find good embeddings for images with lossy transformations

Tensorflow implementation of Human-Level Control through Deep Reinforcement Learning

🐸STT integration examples

use machine learning to recognize gesture on raspberrypi

Project page of the paper 'Analyzing Perception-Distortion Tradeoff using Enhanced Perceptual Super-resolution Network' (ECCVW 2018)

An implementation of the WHATWG URL Standard in JavaScript

The implementation of DeBERTa