B-MRC

A machine reading comprehension (MRC) approach to Aspect-Based Sentiment Analysis (ABSA)

Paper: Bidirectional Machine Reading Comprehension for Aspect Sentiment Triplet Extraction

Dataset: https://github.com/xuuuluuu/SemEval-Triplet-data

Usage

Prepare data:

python data_process.py --data_path data/14lap --version bidirectional    # or: --version unidirectional

Arguments:
    --data_path :       Path to the dataset
    --version   :       Optional. Either unidirectional (A2O) or bidirectional (A2O + O2A)
                        (default = 'bidirectional')
                        Choices=['uni', 'bi', 'unidirectional', 'bidirectional']
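
The argument list above can be sketched with argparse as follows. This is a minimal illustration, not the script's actual code: the names build_parser, normalize_version, and VERSION_ALIASES are hypothetical, and the mapping of the short forms 'uni' and 'bi' to the long names is an assumption based on the listed choices.

```python
import argparse

# Hypothetical alias table: assumes the short forms stand for the long names.
VERSION_ALIASES = {"uni": "unidirectional", "bi": "bidirectional"}


def build_parser() -> argparse.ArgumentParser:
    """Build a parser that mirrors the documented arguments."""
    parser = argparse.ArgumentParser(description="Prepare data (illustrative sketch)")
    parser.add_argument("--data_path", type=str, required=True,
                        help="Path to the dataset")
    parser.add_argument("--version", type=str, default="bidirectional",
                        choices=["uni", "bi", "unidirectional", "bidirectional"],
                        help="unidirectional (A2O) or bidirectional (A2O + O2A)")
    return parser


def normalize_version(version: str) -> str:
    """Map a short alias to its long name; long names pass through unchanged."""
    return VERSION_ALIASES.get(version, version)


# Parse an explicit argument list so the sketch runs without a command line.
args = build_parser().parse_args(["--data_path", "data/14lap", "--version", "bi"])
print(args.data_path, normalize_version(args.version))
```

Passing a value outside the choices list makes argparse exit with a usage error, so a misspelled version is caught before any data is processed.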

python make_data_dual.py --data_path data/14lap/preprocess --version bidirectional    # or: --version unidirectional

Arguments:
    --data_path :       Path to the dataset
    --version   :       Optional. Either unidirectional (A2O) or bidirectional (A2O + O2A)
                        (default = 'bidirectional')
                        Choices=['uni', 'bi', 'unidirectional', 'bidirectional']
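
To make the difference between the two versions concrete, here is a rough sketch of how aspect-to-opinion (A2O) and opinion-to-aspect (O2A) queries might be generated from gold triplets. It is an illustration under assumptions, not the repository's implementation: build_queries and the example triplets are hypothetical, and the query wording only paraphrases the style of templates used in the paper.

```python
from typing import Dict, List, Tuple

Triplet = Tuple[str, str, str]  # (aspect, opinion, sentiment)
QueryList = List[Tuple[str, List[str]]]  # (query text, expected answers)


def build_queries(triplets: List[Triplet],
                  version: str = "bidirectional") -> Dict[str, QueryList]:
    """Build MRC-style queries; O2A is added only for the bidirectional version."""
    # dict.fromkeys removes duplicates while keeping first-seen order.
    aspects = list(dict.fromkeys(a for a, _, _ in triplets))
    opinions = list(dict.fromkeys(o for _, o, _ in triplets))

    # A2O: first ask for all aspects, then for the opinions of each aspect.
    queries: Dict[str, QueryList] = {"A2O": [("What aspects?", aspects)]}
    for aspect in aspects:
        answers = [o for a, o, _ in triplets if a == aspect]
        queries["A2O"].append((f"What opinions given the aspect {aspect}?", answers))

    # O2A: first ask for all opinions, then for the aspects each one describes.
    if version in ("bi", "bidirectional"):
        queries["O2A"] = [("What opinions?", opinions)]
        for opinion in opinions:
            answers = [a for a, o, _ in triplets if o == opinion]
            queries["O2A"].append(
                (f"What aspect does the opinion {opinion} describe?", answers))
    return queries


# Hypothetical triplets for "The screen is great but the battery is poor".
example = [("screen", "great", "POS"), ("battery", "poor", "NEG")]
for direction, items in build_queries(example).items():
    for query, answers in items:
        print(direction, "|", query, "->", answers)
```

With version set to unidirectional, only the A2O queries are produced, which matches the A2O versus A2O + O2A distinction in the argument description.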

python make_data_standard.py --data_path data/14lap/pair --output_path ./data/14lap/preprocess

Arguments:
    --data_path  :      Path to the dataset
    --output_path:      Path to the output data

Training (pass --version unidirectional to train the A2O-only variant):

python main.py \
    --version bidirectional \
    --data_path ./data/14lap/preprocess/ \
    --mode train \
    --model_type bert-base-uncased \
    --epoch_num 40 \
    --batch_size 4 \
    --learning_rate 1e-3

Owner

Phuc Phan
