Skip to content

Conversation

@stas00
Copy link
Collaborator

@stas00 stas00 commented Oct 28, 2025

As I was integrating ALST/Ulysses SP into HF Accelerate/Trainer I noticed that the initial UlyssesSPAttentionHF.register_with_transformers API was a bit inflexible/confusing wrt variable seqlen.

This PR deprecates the misleading max_length arg name, replaces it with seq_length and makes the latter optional if seq_length_is_variable is True.

Updated tests and docs.

@stas00 stas00 merged commit 02da373 into master Oct 29, 2025
14 of 16 checks passed
@stas00 stas00 deleted the stas/ulysses-register branch October 29, 2025 00:10
@stas00
Copy link
Collaborator Author

stas00 commented Oct 29, 2025

Thank you, Tunji!

rraminen pushed a commit to rraminen/DeepSpeed that referenced this pull request Dec 1, 2025
)

As I was integrating ALST/Ulysses SP into HF Accelerate/Trainer I
noticed that the initial
`UlyssesSPAttentionHF.register_with_transformers` API was a bit
inflexible/confusing wrt variable seqlen.

This PR deprecates the misleading `max_length` arg name, replaces it
with `seq_length` and makes the latter optional if
`seq_length_is_variable` is True.

Updated tests and docs.

Signed-off-by: Stas Bekman <[email protected]>
Signed-off-by: rraminen <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants