BERT-SST2-Prod

Reproduction process of BERT on SST2 dataset

安装说明

下载代码库

git clone https://github.com/JunnYu/BERT-SST2-Prod

进入文件夹，安装requirements

pip install -r requirements.txt

安装PaddlePaddle与PyTorch

# CPU版本的PaddlePaddle
pip install paddlepaddle==2.2.0 -i https://mirror.baidu.com/pypi/simple
# 如果希望安装GPU版本的PaddlePaddle，可以使用下面的命令
# pip install paddlepaddle-gpu==2.2.0.post112 -f https://www.paddlepaddle.org.cn/whl/linux/mkl/avx/stable.html
# 安装PyTorch
pip install torch==1.10.0+cu113 torchvision==0.11.1+cu113 torchaudio==0.10.0+cu113 -f https://download.pytorch.org/whl/cu113/torch_stable.html

注意: 本项目依赖于paddlepaddle-2.2.0版本，安装时需要注意。

验证PaddlePaddle是否安装成功

运行python，输入下面的命令。

import paddle
paddle.utils.run_check()
print(paddle.__version__)

如果输出下面的内容，则说明PaddlePaddle安装成功。

PaddlePaddle is installed successfully! Let's start deep learning with PaddlePaddle now.
2.2.0

验证PyTorch是否安装成功

运行python，输入下面的命令，如果可以正常输出，则说明torch安装成功。

import torch
print(torch.__version__)
# 如果安装的是cpu版本，可以按照下面的命令确认torch是否安装成功
# 期望输出为 tensor([1.])
print(torch.Tensor([1.0]))
# 如果安装的是gpu版本，可以按照下面的命令确认torch是否安装成功
# 期望输出为 tensor([1.], device='cuda:0')
print(torch.Tensor([1.0]).cuda())

Reproduction process of BERT on SST2 dataset

Related tags

Overview

BERT-SST2-Prod

安装说明

Owner

yujun

Weakly-supervised Text Classification Based on Keyword Graph

This python module is an easy-to-use port of the text normalization used in the paper "Not low-resource anymore: Aligner ensembling, batch filtering, and new datasets for Bengali-English machine translation". It is intended to be used for normalizing / cleaning Bengali and English text.

Original implementation of the pooling method introduced in "Speaker embeddings by modeling channel-wise correlations"

This simple Python program calculates a love score based on your and your crush's full names in English

Code-autocomplete, a code completion plugin for Python

Paddlespeech Streaming ASR GUI

Use AutoModelForSeq2SeqLM in Huggingface Transformers to train COMET

自然言語で書かれた時間情報表現を抽出/規格化するルールベースの解析器

Code for our ACL 2021 paper - ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer

A toolkit for document-level event extraction, containing some SOTA model implementations

Learning General Purpose Distributed Sentence Representations via Large Scale Multi-task Learning

Rethinking the Truly Unsupervised Image-to-Image Translation - Official PyTorch Implementation (ICCV 2021)

Multiple implementations for abstractive text summurization , using google colab

A Fast Sequence Transducer Implementation with PyTorch Bindings

运小筹公众号是致力于分享运筹优化(LP、MIP、NLP、随机规划、鲁棒优化)、凸优化、强化学习等研究领域的内容以及涉及到的算法的代码实现。

PhoNLP: A BERT-based multi-task learning toolkit for part-of-speech tagging, named entity recognition and dependency parsing

Paddle2.x version AI-Writer

(ACL 2022) The source code for the paper "Towards Abstractive Grounded Summarization of Podcast Transcripts"

Just a Basic like Language for Zeno INC

WikiPron - a command-line tool and Python API for mining multilingual pronunciation data from Wiktionary