An easy-to-use framework for BERT models, with trainers, various NLP tasks and detailed annonations

Last update: Oct 26, 2022

Related tags

Text Data & NLP FantasyBert

Overview

FantasyBert

English | 中文

Introduction

An easy-to-use framework for BERT models, with trainers, various NLP tasks and detailed annonations.

You can implement various NLP task conveniently with many functions such as adv training, fp16, gradient clip, r-drop, early stop, etc.

Installation

pip install fantasybert

The lastest verion is 0.1.3

Tutorials

tutorial

Task in examples

semantic text similarity

Datasets and Results

The datasets are downloaded and evaluated on CLUE.

Others

Some code are edited on transformers(tokenizaiton) and fastnlp(trainer), I simplified the code and added some new functions.

The part of models directly uses the pretrain model in transformers, I tried write bert models in bert4pytorch, but due to time limit and lack of ability, it failed to achieve the quality and efficiency of transformers as did.

This project is not for commerical use and is only for private use.

Owner

Fan

an interest-movitaved NLPer in Ottawa University...

GitHub Repository

AI Assistant for Building Reliable, High-performing and Fair Multilingual NLP Systems

37 Nov 29, 2022

precise iris segmentation

PI-DECODER Introduction PI-DECODER, a decoder structure designed for Precise Iris Segmentation and Location. The decoder structure is shown below: Ple

8 Aug 08, 2022

Ελληνικά νέα (Python script) / Greek News Feed (Python script)

Ελληνικά νέα (Python script) / Greek News Feed (Python script) Ελληνικά English Το 2017 είχα υλοποιήσει ένα Python script για να εμφανίζει τα τωρινά ν

1 Jun 14, 2022

Stand-alone language identification system

langid.py readme Introduction langid.py is a standalone Language Identification (LangID) tool. The design principles are as follows: Fast Pre-trained

2k Jan 04, 2023

Black for Python docstrings and reStructuredText (rst).

Style-Doc Style-Doc is Black for Python docstrings and reStructuredText (rst). It can be used to format docstrings (Google docstring format) in Python

13 Oct 24, 2022

A curated list of efficient attention modules

awesome-fast-attention A curated list of efficient attention modules

891 Dec 22, 2022

T‘rex Park is a Youzan sponsored project. Offering Chinese NLP and image models pretrained from E-commerce datasets

T‘rex Park is a Youzan sponsored project. Offering Chinese NLP and image models pretrained from E-commerce datasets (product titles, images, comments, etc.).

55 Nov 22, 2022

code for modular summarization work published in ACL2021 by Krishna et al

This repository contains the code for running modular summarization pipelines as described in the publication Krishna K, Khosla K, Bigham J, Lipton ZC

21 Nov 24, 2022

Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.

textgenrnn Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code, or quickly tr

4.8k Dec 30, 2022

Yet another Python binding for fastText

pyfasttext Warning! pyfasttext is no longer maintained: use the official Python binding from the fastText repository: https://github.com/facebookresea

230 Nov 16, 2022

Shirt Bot is a discord bot which uses GPT-3 to generate text

SHIRT BOT · Shirt Bot is a discord bot which uses GPT-3 to generate text. Made by Cyclcrclicly#3420 (474183744685604865) on Discord. Support Server EX

31 Oct 31, 2022

An official repository for tutorials of Probabilistic Modelling and Reasoning (2021/2022) - a University of Edinburgh master's course.

PMR computer tutorials on HMMs (2021-2022) This is a repository for computer tutorials of Probabilistic Modelling and Reasoning (2021/2022) - a Univer

10 Dec 06, 2022

An easy-to-use framework for BERT models, with trainers, various NLP tasks and detailed annonations

Related tags

Overview

FantasyBert

English | 中文

Introduction

Installation

Tutorials

Task in examples

Datasets and Results

Others

Owner

Fan

AI Assistant for Building Reliable, High-performing and Fair Multilingual NLP Systems

precise iris segmentation

Ελληνικά νέα (Python script) / Greek News Feed (Python script)

Stand-alone language identification system

Black for Python docstrings and reStructuredText (rst).

A curated list of efficient attention modules

T‘rex Park is a Youzan sponsored project. Offering Chinese NLP and image models pretrained from E-commerce datasets

code for modular summarization work published in ACL2021 by Krishna et al

Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.

Yet another Python binding for fastText

Shirt Bot is a discord bot which uses GPT-3 to generate text

An official repository for tutorials of Probabilistic Modelling and Reasoning (2021/2022) - a University of Edinburgh master's course.

COVID-19 Related NLP Papers

Silero Models: pre-trained speech-to-text, text-to-speech models and benchmarks made embarrassingly simple

Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.

Exploring dimension-reduced embeddings

An easy-to-use framework for BERT models, with trainers, various NLP tasks and detailed annonations

Text preprocessing, representation and visualization from zero to hero.

NLP command-line assistant powered by OpenAI

CCQA A New Web-Scale Question Answering Dataset for Model Pre-Training