Skip to content
This repository was archived by the owner on Apr 8, 2025. It is now read-only.

Refactoring Processor for LM Finetuning (FastTokenizers)#659

Merged
tholor merged 12 commits intorefactor_processor_qafrom
refactor_processor_lm_fine
Dec 21, 2020
Merged

Refactoring Processor for LM Finetuning (FastTokenizers)#659
tholor merged 12 commits intorefactor_processor_qafrom
refactor_processor_lm_fine

Conversation

@tholor
Copy link
Copy Markdown
Member

@tholor tholor commented Dec 18, 2020

Moving to fast tokenizers and new preprocessing stages ...

Breaking change: Slow tokenizers won't be supported any more for this type of task

Copy link
Copy Markdown
Contributor

@Timoeller Timoeller left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good.

Please also fix the lm finetuning tests (if they should be failing).

We need to make get start of word better in a coming release (branden also has a version as function of a single processor).

@tholor tholor merged commit 803b41b into refactor_processor_qa Dec 21, 2020
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants