spaCy plugin for Transformers, Udify, ELMo, etc.

Last update: Nov 21, 2022

Overview

Camphr - spaCy plugin for Transformers, Udify, ELMo, etc.

Camphr is a natural language processing library that integrates a wide variety of techniques, from state-of-the-art to conventional ones, into spaCy. You can use Transformers, Udify, ELMo, etc. with spaCy.

Check the documentation for more information.

(For Japanese: https://qiita.com/tamurahey/items/53a1902625ccaac1bb2f)

Features

A spaCy plugin - Easy integration of a wide variety of methods
Transformers with spaCy - Fine-tuning pretrained models with Hydra; embedding vectors
Udify - BERT-based multitask model for 75 languages
ELMo - Deep contextualized word representations
Rule-based matching with Aho-Corasick and regex
KNP (for Japanese)
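
To make the rule-based matching feature more concrete, here is a minimal sketch of Aho-Corasick multi-pattern search in plain Python. It is not Camphr's implementation or API: the names build_automaton and find_all are hypothetical and only illustrate the general technique of finding many dictionary entries in one pass over the text.

```python
from collections import deque
from typing import Dict, List, Tuple


def build_automaton(patterns: List[str]):
    """Build the trie, failure links, and output lists (hypothetical helper)."""
    goto: List[Dict[str, int]] = [{}]   # goto[state][char] -> next state
    fail: List[int] = [0]               # failure link of each state
    out: List[List[str]] = [[]]         # patterns that end at each state
    for pat in patterns:
        state = 0
        for ch in pat:
            if ch not in goto[state]:
                goto.append({})
                fail.append(0)
                out.append([])
                goto[state][ch] = len(goto) - 1
            state = goto[state][ch]
        out[state].append(pat)
    # Breadth-first traversal so that shallower states are finished first.
    queue = deque(goto[0].values())
    while queue:
        state = queue.popleft()
        for ch, nxt in goto[state].items():
            queue.append(nxt)
            f = fail[state]
            while f and ch not in goto[f]:
                f = fail[f]
            fail[nxt] = goto[f].get(ch, 0)
            # Inherit matches that end at the failure state (proper suffixes).
            out[nxt] = out[nxt] + out[fail[nxt]]
    return goto, fail, out


def find_all(text: str, patterns: List[str]) -> List[Tuple[int, int, str]]:
    """Return (start, end, pattern) for every occurrence, end exclusive."""
    goto, fail, out = build_automaton(patterns)
    state = 0
    matches = []
    for i, ch in enumerate(text):
        while state and ch not in goto[state]:
            state = fail[state]
        state = goto[state].get(ch, 0)
        for pat in out[state]:
            matches.append((i - len(pat) + 1, i + 1, pat))
    return matches


print(find_all("ushers", ["he", "she", "his", "hers"]))
```

The returned character offsets are the kind of information a spaCy component would need to build entity spans, but how Camphr does that is not shown here.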

License

Camphr is licensed under Apache 2.0.

Comments

NER Problem

Hello!

First of all, thank you for the great work on Camphr. It has been very useful to me! Could you help me with a question? I used the library to train a named entity recognition (NER) model, but when I load the model with nlp = spacy.load("~/outputs/2020-04-30/22-28-36/models/9") and pass it a text (doc = nlp("I live in Brazil")), no entities are recognized (doc.ents returns ()). Could you tell me why this is happening?

opened by gabrielluz07 9

Gender and number subtags generation

I was comparing the default morpho-syntactic tags generated by camphr-udify and https://github.com/Hyperparticle/udify.

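import spacy
import stanza
from spacy_conll import ConllFormatter

# Load the Udify pipeline and append the CoNLL-U formatter as the last component
nlp = spacy.load("en_udify")
conllformatter = ConllFormatter(nlp)
nlp.add_pipe(conllformatter, last=True)

# Parse one sentence and print it in CoNLL-U format
doc = nlp("Mother Teresa devoted her entire life to helping others")
print(doc._.conll_str)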

1	Mother	Mother	PROPN		_	2	compound	_	_
2	Teresa	Teresa	PROPN		_	3	nsubj	_	_
3	devoted	devote	VERB		_	0	root	_	_
4	her	her	PRON		_	6	nmod:poss	_	_
5	entire	entire	ADJ		_	6	amod	_	_
6	life	life	NOUN		_	3	obj	_	_
7	to	to	SCONJ		_	8	mark	_	_
8	helping	help	VERB		_	3	advcl	_	_
9	others	other	NOUN		_	8	obj	_	SpaceAfter=No

Tags returned by https://github.com/Hyperparticle/udify, for the same input.

prediction:
1	Mother	Mother	PROPN	_	Number=Sing	2	compound	_	_
2	Teresa	Teresa	PROPN	_	Number=Sing	3	nsubj	_	_
3	devoted	devote	VERB	_	Mood=Ind|Tense=Past|VerbForm=Fin	0	root	_	_
4	her	her	PRON	_	Gender=Fem|Number=Sing|Person=3|Poss=Yes|PronType=Prs	6	nmod:poss	_	_
5	entire	entire	ADJ	_	Degree=Pos	6	amod	_	_
6	life	life	NOUN	_	Number=Sing	3	obj	_	_
7	to	to	SCONJ	_	_	8	mark	_	_
8	helping	help	VERB	_	VerbForm=Ger	3	advcl	_	_
9	others	other	NOUN	_	Number=Plur	8	obj	_	_

Gender and number subtags are missing in camphr-udify. Could we have those included by default please?

thanks, Ranjita

enhancement

opened by ranjita-naik 6
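
The difference reported in this issue sits in the sixth CoNLL-U column (FEATS). As a rough sketch, the following plain-Python helpers split a token line into its ten columns and list the tokens whose FEATS column is empty. The function names are hypothetical, and the two sample rows are the "her" rows from the outputs above, assuming they are tab-separated.

```python
from typing import Dict, List

# The ten CoNLL-U columns, in order.
COLUMNS = ["id", "form", "lemma", "upos", "xpos",
           "feats", "head", "deprel", "deps", "misc"]


def parse_feats(feats: str) -> Dict[str, str]:
    """Turn 'Gender=Fem|Number=Sing' into a dict; '_' or '' means no features."""
    if feats in ("_", ""):
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def parse_conllu_line(line: str) -> Dict[str, str]:
    """Split one tab-separated CoNLL-U token line into named columns."""
    values = line.rstrip("\n").split("\t")
    if len(values) != len(COLUMNS):
        raise ValueError(f"expected {len(COLUMNS)} columns, got {len(values)}")
    return dict(zip(COLUMNS, values))


def tokens_without_feats(conllu: str) -> List[str]:
    """Return the word forms whose FEATS column carries no features."""
    forms = []
    for line in conllu.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        token = parse_conllu_line(line)
        if not parse_feats(token["feats"]):
            forms.append(token["form"])
    return forms


# Sample rows copied from the two outputs above (tab-separated is an assumption).
camphr_row = "4\ther\ther\tPRON\t\t_\t6\tnmod:poss\t_\t_"
udify_row = ("4\ther\ther\tPRON\t_\t"
             "Gender=Fem|Number=Sing|Person=3|Poss=Yes|PronType=Prs"
             "\t6\tnmod:poss\t_\t_")

print(tokens_without_feats(camphr_row))  # the camphr-udify row has no FEATS
print(parse_feats(parse_conllu_line(udify_row)["feats"]))
```

Running tokens_without_feats over the full doc._.conll_str string would show how many tokens lack morphological features in a given pipeline.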

Camphr+KNP returns an incorrect dependency tag when using a specific adposition.
Hello. I am reporting a problem that occurs when analyzing Universal Dependencies in Japanese text using KNP. When I use the adposition “から”, Camphr returns an incorrect result: it shows the conj dependency tag on NOUN→VERB, whereas the expected result is the obl dependency tag on VERB→NOUN.

(Note that "再結晶" and "留去" are words I added manually, but other VERB words that exist in the original dictionary, such as "除去" and "撹拌", produce similarly incorrect results.) The same problem sometimes occurs with the adposition "と".

With other adpositions, such as “より” and “にて”, Camphr returns the correct result.

Environment:

Docker(python:3.7-buster)

spacy = 2.3.2

camphr = 0.6.5

pyknp = 0.4.5

Juman++ ver.1.02

KNP ver.4.19
opened by undermakingbook 6
Python 3.8

Camphr is currently pinned to Python < 3.8. Is there a specific reason for this, and if so, what can we do to help?

Edit: sorry, I just saw #19. Still, what can we do to help?

opened by Evpok 5
Support multi labels textcat pipe for transformers
closes #9

Add TrfForMultiLabelSequenceClassification for multi-label text classification.

pipe name: transformers_multilabel_sequence_classifier

Add docs for fine-tuning the multi-label textcat pipe

https://github.com/PKSHATechnology-Research/camphr/blob/feature%2Fmulti-textcat/docs/source/notes/finetune_transformers.rst#multilabel-text-classification

enhancement
opened by tamuhey 5
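
As background for this pull request, the sketch below shows how multi-label classification typically differs from single-label classification: each label gets an independent sigmoid score and every label above a threshold is returned, rather than one softmax winner. This is a generic illustration, not the code of TrfForMultiLabelSequenceClassification; the function names, labels, and threshold are assumptions.

```python
import math
from typing import Dict, List


def sigmoid(x: float) -> float:
    """Map a raw score (logit) to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def multilabel_predict(logits: Dict[str, float], threshold: float = 0.5) -> List[str]:
    """Return every label whose independent probability reaches the threshold."""
    return [label for label, z in logits.items() if sigmoid(z) >= threshold]


def singlelabel_predict(logits: Dict[str, float]) -> str:
    """Return only the highest-scoring label, as a softmax classifier would."""
    return max(logits, key=logits.get)


# Hypothetical per-label scores for one document.
scores = {"sports": 2.0, "politics": -1.0, "tech": 0.3}
print(multilabel_predict(scores))   # several labels may be returned
print(singlelabel_predict(scores))  # exactly one label is returned
```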
unofficial-udify, allennlp, and transformers conflicting dependencies

I'm trying to install udify on WSL as shown below.

$ pip install unofficial-udify==0.3.0 en-udify@https://github.com/PKSHATechnology-Research/camphr_models/releases/download/0.7.0/en_udify-0.7.tar.gz

ERROR: Cannot install unofficial-udify and unofficial-udify==0.3.0 because these package versions have conflicting dependencies.

The conflict is caused by:
    unofficial-udify 0.3.0 depends on transformers<3.0.0 and >=2.3.0
    allennlp 1.3.0 depends on transformers<4.1 and >=4.0
    unofficial-udify 0.3.0 depends on transformers<3.0.0 and >=2.3.0
    allennlp 1.2.2 depends on transformers<3.6 and >=3.4
    unofficial-udify 0.3.0 depends on transformers<3.0.0 and >=2.3.0
    allennlp 1.2.1 depends on transformers<3.5 and >=3.1
    unofficial-udify 0.3.0 depends on transformers<3.0.0 and >=2.3.0
    allennlp 1.2.0 depends on transformers<3.5 and >=3.1
    unofficial-udify 0.3.0 depends on transformers<3.0.0 and >=2.3.0
    allennlp 1.1.0 depends on transformers<3.1 and >=3.0

Is this a known issue? Could you suggest a workaround, please?
bug

opened by ranjita-naik 3
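
The pip error in this issue comes down to version ranges that never intersect. A small sketch in plain Python makes that visible by treating each constraint as a half-open interval and checking for overlap. The helper names are hypothetical, the ranges are copied from the error message above, and real resolvers handle many more cases (pre-releases, exclusions, and so on).

```python
from typing import Dict, List, Tuple

Version = Tuple[int, ...]
Range = Tuple[Version, Version]  # (inclusive lower bound, exclusive upper bound)


def parse(version: str, width: int = 3) -> Version:
    """Parse '3.1' into (3, 1, 0); padding keeps tuple comparison consistent."""
    parts = [int(p) for p in version.split(".")]
    parts += [0] * (width - len(parts))
    return tuple(parts)


def overlaps(a: Range, b: Range) -> bool:
    """Two half-open ranges overlap if the larger lower bound is below the smaller upper bound."""
    return max(a[0], b[0]) < min(a[1], b[1])


# transformers range required by unofficial-udify 0.3.0 (from the error message)
udify: Range = (parse("2.3.0"), parse("3.0.0"))

# transformers ranges required by each allennlp release pip tried
allennlp: Dict[str, Tuple[str, str]] = {
    "1.3.0": ("4.0", "4.1"),
    "1.2.2": ("3.4", "3.6"),
    "1.2.1": ("3.1", "3.5"),
    "1.2.0": ("3.1", "3.5"),
    "1.1.0": ("3.0", "3.1"),
}

compatible: List[str] = [
    release
    for release, (low, high) in allennlp.items()
    if overlaps(udify, (parse(low), parse(high)))
]
print(compatible)  # allennlp releases whose range intersects unofficial-udify's
```

An empty list here means no listed allennlp release can share a transformers version with unofficial-udify 0.3.0, which matches what pip reports.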
Missing tag information

I noticed that the spaCy tag field is empty. Is this a known issue? It looks like Udify supports some level of ufeats tagging (https://universaldependencies.org/u/feat/index.html). I wonder whether I'm supposed to be getting any of this in spaCy and I have a bug in my setup, or whether it just isn't implemented yet. Would it be sourced in token.tag like I'm thinking (if it does exist)?

I also noticed that displacy doesn't render the POS info. I am wondering if that is related?

BTW, just have to say that this is awesome.

opened by tslater 3
ImportError: cannot import name 'load_udify' from 'camphr.pipelines' following the example
I followed the example here: https://camphr.readthedocs.io/en/latest/notes/udify.html

I only saw the 0.7.0 model, so I went with that instead. The German and English examples work great, but the Japanese one gives me this error:

>>> from camphr.pipelines import load_udify
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
ImportError: cannot import name 'load_udify' from 'camphr.pipelines' (/home/tyler/camphr/env/lib/python3.8/site-packages/camphr/pipelines/__init__.py)
opened by tslater 3
doc.ents empty, doc.is_nered == False

I followed the documentation to fine-tune the bert-base-cased (en) ner model and then made a spacy doc with text "Bob Jones and Barack Obama went up the hill in Wisconsin." but the resulting doc has doc.ents = () and doc.is_nered = False.

Am I missing something?

Thank you!

opened by jack-rory-staunton 3
Improvement for サ変 of KNP

Inside _get_child_dep(c), pos for 名詞,サ変名詞 is changed into VERB when it is followed by AUX. So now I think that _get_dep(tag[0]) should be done after _get_child_dep(c).

opened by KoichiYasuoka 3
Bump transformers from 3.0.2 to 4.1.1
Bumps transformers from 3.0.2 to 4.1.1.

Release notes

Sourced from transformers's releases.

Patch release: better error message & invalid trainer attribute

This patch release introduces:

A better error message when trying to instantiate a SentencePiece-based tokenizer without having SentencePiece installed. #8881

Fixes an incorrect attribute in the trainer. #8996

Transformers v4.0.0: Fast tokenizers, model outputs, file reorganization

Transformers v4.0.0-rc-1: Fast tokenizers, model outputs, file reorganization

Breaking changes since v3.x

Version v4.0.0 introduces several breaking changes that were necessary.

1. AutoTokenizers and pipelines now use fast (rust) tokenizers by default.

The python and rust tokenizers have roughly the same API, but the rust tokenizers have a more complete feature set. The main breaking change is the handling of overflowing tokens between the python and rust tokenizers.

How to obtain the same behavior as v3.x in v4.x

The pipelines now contain additional features out of the box. See the token-classification pipeline with the grouped_entities flag.

The auto-tokenizers now return rust tokenizers. In order to obtain the python tokenizers instead, the user may use the use_fast flag by setting it to False:

In version v3.x:

from transformers import AutoTokenizer
tokenizer = AutoTokenizer.from_pretrained("xxx")

to obtain the same in version v4.x:

from transformers import AutoTokenizer
tokenizer = AutoTokenizer.from_pretrained("xxx", use_fast=False)

2. SentencePiece is removed from the required dependencies

The requirement on the SentencePiece dependency has been lifted from the setup.py. This is done so that we may have a channel on anaconda cloud without relying on conda-forge. This means that the tokenizers that depend on the SentencePiece library will not be available with a standard transformers installation.

This includes the slow versions of:

XLNetTokenizer

AlbertTokenizer

CamembertTokenizer

MBartTokenizer

PegasusTokenizer

T5Tokenizer

ReformerTokenizer

XLMRobertaTokenizer
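
Because SentencePiece is no longer installed automatically, code that relies on one of the tokenizers listed above may want to check for it up front. The following is a minimal sketch using only the standard library; the helper name has_module is hypothetical, and it assumes the package is importable under the module name sentencepiece.

```python
import importlib.util


def has_module(name: str) -> bool:
    """Return True if a top-level module can be located without importing it."""
    return importlib.util.find_spec(name) is not None


if not has_module("sentencepiece"):
    # Assumption: the slow tokenizers listed above need this module at runtime.
    print("sentencepiece is not installed; "
          "SentencePiece-based slow tokenizers will not be available")
```

Checking early gives a clearer message than a failure deep inside tokenizer construction.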

How to obtain the same behavior as v3.x in v4.x

Commits

bfa4ccf Release: v4.1.1

e0790cc Fix TAPAS doc

6d2e864 Put all models in the constants (#9170)

f83d9c8 v4.1.0 docs

f5438ab Release: v4.1.0

ac2c7e3 Remove erroneous character

77d6941 Fix gradient clipping for Sharded DDP (#9168)

1aca3d6 Add disclaimer to TAPAS rst file (#9167)

dc9f245 Torch scatter with torch 1.7.0

9a67185 Experimental support for fairscale ShardedDDP (#9139)

Additional commits viewable in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

@dependabot rebase will rebase this PR

@dependabot recreate will recreate this PR, overwriting any edits that have been made to it

@dependabot merge will merge this PR after your CI passes on it

@dependabot squash and merge will squash and merge this PR after your CI passes on it

@dependabot cancel merge will cancel a previously requested merge and block automerging

@dependabot reopen will reopen this PR if it is closed

@dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually

@dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

@dependabot use these labels will set the current labels as the default for future PRs for this repo and language

@dependabot use these reviewers will set the current reviewers as the default for future PRs for this repo and language

@dependabot use these assignees will set the current assignees as the default for future PRs for this repo and language

@dependabot use this milestone will set the current milestone as the default for future PRs for this repo and language

@dependabot badge me will comment on this PR with code to add a "Dependabot enabled" badge to your readme

Additionally, you can set the following in your Dependabot dashboard:

Update frequency (including time of day and day of week)

Pull request limits (per update run and/or open at any time)

Out-of-range updates (receive only lockfile updates, if desired)

Security updates (receive only security updates, if desired)

dependencies
opened by dependabot-preview[bot] 2
Bump certifi from 2021.5.30 to 2022.12.7 in /packages/camphr_pattern_search
Bumps certifi from 2021.5.30 to 2022.12.7.

Commits

9e9e840 2022.12.07

b81bdb2 2022.09.24

939a28f 2022.09.14

aca828a 2022.06.15.2

de0eae1 Only use importlib.resources's new files() / Traversable API on Python ≥3.11 ...

b8eb5e9 2022.06.15.1

47fb7ab Fix deprecation warning on Python 3.11 (#199)

b0b48e0 fixes #198 -- update link in license

9d514b4 2022.06.15

4151e88 Add py.typed to MANIFEST.in to package in sdist (#196)

Additional commits viewable in compare view


dependencies
opened by dependabot[bot] 0
Bump numpy from 1.21.0 to 1.22.0 in /packages/camphr_pattern_search
Bumps numpy from 1.21.0 to 1.22.0.

Release notes

Sourced from numpy's releases.

v1.22.0

NumPy 1.22.0 Release Notes

NumPy 1.22.0 is a big release featuring the work of 153 contributors spread over 609 pull requests. There have been many improvements, highlights are:

Annotations of the main namespace are essentially complete. Upstream is a moving target, so there will likely be further improvements, but the major work is done. This is probably the most user visible enhancement in this release.

A preliminary version of the proposed Array-API is provided. This is a step in creating a standard collection of functions that can be used across application such as CuPy and JAX.

NumPy now has a DLPack backend. DLPack provides a common interchange format for array (tensor) data.

New methods for quantile, percentile, and related functions. The new methods provide a complete set of the methods commonly found in the literature.

A new configurable allocator for use by downstream projects.

These are in addition to the ongoing work to provide SIMD support for commonly used functions, improvements to F2PY, and better documentation.

The Python versions supported in this release are 3.8-3.10, Python 3.7 has been dropped. Note that 32 bit wheels are only provided for Python 3.8 and 3.9 on Windows, all other wheels are 64 bits on account of Ubuntu, Fedora, and other Linux distributions dropping 32 bit support. All 64 bit wheels are also linked with 64 bit integer OpenBLAS, which should fix the occasional problems encountered by folks using truly huge arrays.

Expired deprecations

Deprecated numeric style dtype strings have been removed

Using the strings "Bytes0", "Datetime64", "Str0", "Uint32", and "Uint64" as a dtype will now raise a TypeError.

(gh-19539)

Expired deprecations for loads, ndfromtxt, and mafromtxt in npyio

numpy.loads was deprecated in v1.15, with the recommendation that users use pickle.loads instead. ndfromtxt and mafromtxt were both deprecated in v1.17 - users should use numpy.genfromtxt instead with the appropriate value for the usemask parameter.

(gh-19615)

... (truncated)

Commits

4adc87d Merge pull request #20685 from charris/prepare-for-1.22.0-release

fd66547 REL: Prepare for the NumPy 1.22.0 release.

125304b wip

c283859 Merge pull request #20682 from charris/backport-20416

5399c03 Merge pull request #20681 from charris/backport-20954

f9c45f8 Merge pull request #20680 from charris/backport-20663

794b36f Update armccompiler.py

d93b14e Update test_public_api.py

7662c07 Update init.py

311ab52 Update armccompiler.py

Additional commits viewable in compare view


dependencies
opened by dependabot[bot] 0

Releases(0.7.0)

0.7.0(Aug 21, 2020)
[dependencies] Bump pyknp from 0.4.4 to 0.4.5 #80

[dependencies] Bump spacy from 2.2.4 to 2.3.2 #81

[dependencies] Bump torch from 1.5.1 to 1.6.0 #82

[closed] move allennlp to camphr_allennlp #79

[dependencies] Bump hypothesis from 5.23.11 to 5.23.12 #73

[dependencies] Bump pytest from 5.4.3 to 6.0.1 #66

[closed] fix get_doc_char_span and covering span #78

[closed] fix index error #77

[closed] add lemma search to PatternSearch #76

[dependencies] Bump pytextspan from 0.2.2 to 0.3.0 #74

[closed] improve beamsearch performance for k ==1 #75

[closed] use pyknp #71

[closed] add normalizer to pattern search #70

[closed] Pattern searcher becomes able to search with lemma and lower #65

[closed] 形容詞接頭辞 into PART #63

[closed] fix deps #62

0.6.0(Jul 9, 2020)
[dependencies] Bump scikit-learn from 0.22.2.post1 to 0.23.1 #61

[dependencies] Bump pytest from 5.3.2 to 5.4.3 #60

[closed] support allennlp v1 #59

[closed] Improvement for サ変 of KNP #56

[closed] refactor #55

0.5.22(Apr 24, 2020)
[bug] fix transformers eval batchsize failure #50

0.5.21(Apr 22, 2020)
[bug] Proper treatment of PUNCTs for KNP #48

0.5.20(Apr 14, 2020)
[enhancement] dependency improvement for KNP #47

Thanks for contributing, @KoichiYasuoka!
0.5.19(Apr 13, 2020)
[enhancement] update transformers dependency #46

[CI] Skip slow ci if unnecessary #45

[enhancement] Refactor/knp dependency parser #44

[enhancement] Tentative dependencies for KNP #43

Thanks for contributing, @KoichiYasuoka!
0.5.18(Apr 10, 2020)
[enhancement] juman TAG_MAP tentative support #41

[bug] Fix misuse Vocab() in Language instantiation #42

0.5.17(Apr 9, 2020)
[enhancement] Revert sentencepiece lang from v0.4 #40

0.5.16(Apr 9, 2020)
[enhancement] add functools.lru_cache to knp extensions. #39

0.5.15.dev0(Apr 8, 2020)

0.5.15(Apr 8, 2020)

No changelog for this release.
0.5.14(Apr 8, 2020)
[enhancement] tag and bunsetsu can be directly got from token #38

[enhancement] Feature/knp para noun chunks #37

[bug] fix noun chunker for para phrase #36

[enhancement][refactor] Refactor/knp noun chunker #35

0.5.13(Apr 6, 2020)
Bug fix

Separate parallel clause in noun chunks into two or more chunks #34

0.5.12(Apr 6, 2020)
New Features

Support knp noun chunker and knp dependency parser #33

0.5.11(Mar 27, 2020)
New features

It is now possible to retrieve KNP result from spacy.doc (#31)

0.5.10(Mar 18, 2020)

Removed the version restriction python<3.8. This allows users to install Camphr with Python 3.8, but installation will fail for macOS users. See #29 for details.
0.5.9(Mar 3, 2020)
Improvements

juman and knp now accepts longer text (#23)

0.5.8(Mar 3, 2020)
Bug fix

fix transformers requirements (#24)

0.5.7(Feb 21, 2020)
Bug fix

fix camphr.utils.get_requirements_line

0.5.5(Feb 21, 2020)
New features

Multi labels textcat pipe for transformers (#14)

0.5.3(Feb 17, 2020)
New Features

Computing val loss in TorchLanguage.evaluate #13


GitHub Repository https://camphr.readthedocs.io/en/latest/

