Add support for ELECTRA by stefan-it · Pull Request #349 · deepset-ai/FARM

stefan-it · 2020-05-05T22:51:36Z

Hi,

this PR adds support for the previously introduced ELECTRA pre-training approach to FARM:

The ELECTRA model was proposed in the paper: ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators. ELECTRA is a new pre-training approach which trains two transformer models: the generator and the discriminator. The generator’s role is to replace tokens in a sequence, and is therefore trained as a masked language model. The discriminator, which is the model we’re interested in, tries to identify which tokens were replaced by the generator in the sequence.
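To make the discriminator's objective concrete, here is a minimal, library-free sketch of replaced-token detection. The `corrupt` function is only a toy stand-in for the generator (it resamples tokens at random rather than running a masked language model), and all names, the vocabulary, and the `mask_prob` value are assumptions chosen for illustration, not code from this PR.

```python
import random


def corrupt(tokens, vocab, mask_prob=0.15, seed=0):
    """Toy stand-in for the generator: resample tokens at random positions.

    A real generator is a small masked language model; here we simply draw
    a replacement from `vocab` so the example stays self-contained.
    """
    rng = random.Random(seed)
    corrupted = list(tokens)
    for i in range(len(tokens)):
        if rng.random() < mask_prob:
            corrupted[i] = rng.choice(vocab)
    return corrupted


def discriminator_labels(original, corrupted):
    """Per-token targets for the discriminator.

    1 means the token differs from the original (replaced), 0 means it is
    unchanged. A position where the generator happens to reproduce the
    original token therefore counts as "original".
    """
    return [int(o != c) for o, c in zip(original, corrupted)]


if __name__ == "__main__":
    original = ["the", "chef", "cooked", "the", "meal"]
    corrupted = ["the", "chef", "ate", "the", "meal"]
    print(discriminator_labels(original, corrupted))
```

The discriminator is trained to predict these per-token labels, so it receives a learning signal from every position in the sequence rather than only from the masked ones.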

Implementation notes

The Hugging Face Transformers library was updated to the latest 2.8 version to support the ELECTRA model. Like DistilBERT, ELECTRA needs an additional pooler to be defined in order to get a one-vector-per-sequence representation.
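As a rough illustration of what such a pooler does, the sketch below takes the hidden state of the first token and passes it through a dense layer with a tanh activation, which is one common way to reduce per-token outputs to a single vector. It is written in plain Python for clarity; the function name, the choice of first-token pooling, and the tanh activation are assumptions for this example and may differ from what the PR actually implements.

```python
import math


def first_token_pooler(hidden_states, weight, bias):
    """Reduce per-token hidden states to one vector per sequence.

    hidden_states: list of seq_len vectors, each of size hidden_dim
    weight:        hidden_dim x hidden_dim matrix (list of rows)
    bias:          vector of size hidden_dim
    """
    first = hidden_states[0]  # hidden state of the first token
    return [
        math.tanh(sum(w * x for w, x in zip(row, first)) + b)
        for row, b in zip(weight, bias)
    ]


if __name__ == "__main__":
    hidden = [[1.0, -1.0], [0.5, 0.25], [0.0, 2.0]]  # 3 tokens, hidden_dim = 2
    identity = [[1.0, 0.0], [0.0, 1.0]]
    pooled = first_token_pooler(hidden, identity, [0.0, 0.0])
    print(pooled)  # one vector of size hidden_dim for the whole sequence
```

The pooled vector can then feed a sequence-level prediction head such as document classification, while token-level tasks like NER use the per-token hidden states directly.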

Experiments

I ran preliminary experiments with CoNLL-2003 for NER. The configuration can be found under experiments/electra_eval/conll2003_en_config.json.

Result for one run using the ELECTRA base model: 94.30% (dev) and 89.86% (test).

brandenchan

This looks good to me! The structure of the Electra LM is like that of XLNet and in my tests I was able to get Electra to train on doc classification.

Thanks for your effort on this PR, @stefan-it!

tholor

Looking good! Thanks for working on this!

PhilipMay · 2020-08-14T11:12:04Z

Thanks @stefan-it 🥇

stefan-it added 4 commits May 6, 2020 00:07

requirements: bump Transformers to latest 2.8 version

72f5d66

tokenization: add support for ELECTRA

2a32db2

modeling: add support for new ELECTRA model

dda68d1

experiments: add configuration for English ELECTRA evaluation

67f80d2

tholor requested a review from brandenchan May 7, 2020 08:02

tholor added the "enhancement" (New feature or request) and "part: model" labels May 7, 2020

brandenchan approved these changes May 7, 2020

tholor approved these changes May 7, 2020

tholor merged commit 2a33382 into deepset-ai:master May 7, 2020

tholor merged 4 commits into deepset-ai:master from stefan-it:master
