Introduce Numba-based FSM utilities by brandonwillard · Pull Request #272 · dottxt-ai/outlines

brandonwillard · 2023-09-06T21:59:58Z

This PR introduces Numba JITed FSM utilities with 20x speed-ups over the current pure Python implementations.

It also introduces a more memory efficient "end-to-end" means of producing FSM indices. I avoided implementing it this way originally because it involves multiple iterations through a vocabulary, but, in order to address some memory-related shortcomings of the CFG indexing approaches tested in #178, this might be the better approach for now. It's a clear trade-off between processing and memory—now leaning toward processing—but, with the JIT speed-ups, it's reasonable.

Closes #226 (for now), closes #239, and should help with #192.

Tie this into the Regex implementation.
Map end states to tokens in the index, so that there's no need to re-walk the FSM after sampling.
This was always how it was supposed to work, but our previous prototype didn't implement it. Since we're updating/replacing that prototype, it might be best to add it now.
Cache computed masks.
Investigate Numba compilation and caching.
We need to make sure that caching works exactly as expected (i.e. only once for all the index-building code).
Test out some multi-threading approaches to the end-to-end indexing.
Consider using uints for the states, instead of int64.
Consider Python packaging/Numba AOT options.

AL-377 · 2023-11-02T01:54:31Z

will it help to speed up the "self.regex_fsm = regex_pattern.to_fsm().reduce()" in outlines 0.0.8，i found when the set the constrain field long like maxLength=1000, it takes very long in regex_fsm construction

rlouf · 2023-11-02T03:20:35Z

Did you try with 0.0.9?

brandonwillard self-assigned this Sep 6, 2023

brandonwillard added enhancement optimization Related to performance optimizations structured generation Linked to structured generation labels Sep 6, 2023

brandonwillard force-pushed the numba-fsa-implementation branch from 23fe929 to 80a2fb1 Compare September 6, 2023 22:05

brandonwillard marked this pull request as draft September 6, 2023 22:06

brandonwillard force-pushed the numba-fsa-implementation branch 5 times, most recently from 27feb9d to 7ae64b3 Compare September 9, 2023 23:56

brandonwillard requested a review from rlouf September 9, 2023 23:57

brandonwillard marked this pull request as ready for review September 10, 2023 00:00

brandonwillard mentioned this pull request Sep 10, 2023

Reproducible bug #263

Closed

brandonwillard force-pushed the numba-fsa-implementation branch 5 times, most recently from 9f416a5 to d79967e Compare September 16, 2023 03:40

brandonwillard mentioned this pull request Sep 16, 2023

Fix missing spaces in Tokenizer.convert_token_to_string #280

Merged

brandonwillard force-pushed the numba-fsa-implementation branch 4 times, most recently from 9a186fe to c7b3cc8 Compare September 16, 2023 20:26

brandonwillard mentioned this pull request Sep 16, 2023

Fix whitespace and control character handling in JSON guidance #283

Merged

brandonwillard force-pushed the numba-fsa-implementation branch from c7b3cc8 to 6739b30 Compare September 17, 2023 21:43

rlouf mentioned this pull request Sep 19, 2023

Example to generate dating app profiles- combines prompt templating with JSON generation #287

Merged

veezbo mentioned this pull request Sep 21, 2023

Usage of constr in particular during JSON generation seems to dramatically increase runtime #292

Closed

brandonwillard mentioned this pull request Sep 21, 2023

Use FSMs for scanning during grammar-guided generation #178

Merged

4 tasks

brandonwillard force-pushed the numba-fsa-implementation branch from 6739b30 to bd032c4 Compare September 23, 2023 00:34

brandonwillard force-pushed the numba-fsa-implementation branch from bd032c4 to 61cf813 Compare September 23, 2023 17:24

Add special_tokens to Tokenizer interface

e4c2cb7

brandonwillard force-pushed the numba-fsa-implementation branch 2 times, most recently from ab84dc5 to b4d4b2b Compare September 26, 2023 17:10

Make tokenizers hashable

4e058c8

brandonwillard force-pushed the numba-fsa-implementation branch 2 times, most recently from 87c97cd to fb37a1c Compare September 27, 2023 18:57

Refactor Regex and introduce Numba-based FSM utilities

9004440

brandonwillard force-pushed the numba-fsa-implementation branch from fb37a1c to 9004440 Compare September 27, 2023 19:06

brandonwillard merged commit 38b0b10 into dottxt-ai:main Sep 29, 2023

brandonwillard deleted the numba-fsa-implementation branch September 29, 2023 00:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Introduce Numba-based FSM utilities#272

Introduce Numba-based FSM utilities#272
brandonwillard merged 3 commits intodottxt-ai:mainfrom
brandonwillard:numba-fsa-implementation

brandonwillard commented Sep 6, 2023 •

edited

Loading

Uh oh!

AL-377 commented Nov 2, 2023

Uh oh!

rlouf commented Nov 2, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

brandonwillard commented Sep 6, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

AL-377 commented Nov 2, 2023

Uh oh!

rlouf commented Nov 2, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

brandonwillard commented Sep 6, 2023 •

edited

Loading