Conversation

@sergiopaniego (Member) commented Aug 28, 2025

What does this PR do?

  • Update all example scripts to accept kernels from the Hub (see the sketch after this list).
  • Only supported for the SFT Trainer?
  • Benchmarks
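
For context, a minimal sketch of what the first item enables, assuming transformers' kernels integration lets `attn_implementation` name a kernel repo on the Hub (the model and kernel repo ids below are illustrative, not taken from this PR):

```python
# Hedged sketch: load a model whose attention kernel is fetched from the Hub
# instead of being compiled locally. Assumes `pip install transformers kernels`.
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "Qwen/Qwen2.5-0.5B",  # illustrative model id
    attn_implementation="kernels-community/flash-attn",  # kernel repo on the Hub
)
```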

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline, Pull Request section?
  • Was this discussed/approved via a GitHub issue? Please add a link to it if that's the case.
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

  title: Command Line Interface (CLI)
- local: jobs_training
  title: Training using Jobs
- local: kernels_hub
Member

I'd move this section under "Integration"

  - local: deepspeed_integration
    title: DeepSpeed
  - local: kernel_hub_integration
    title: Kernel Hub
  - local: liger_kernel_integration
    title: Liger Kernel

Member Author

Updated!
I considered Integrations to be the section for external toolkits, so I was unsure about adding it there.


[PLOT]

## Combining FlashAttention Kernels with Liger Kernels
Contributor

Do we pull the kernel from the Hub here as well? If yes, then let's mention it.

Member Author

Not pulled from the Hub at the moment.
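
For reference, a minimal sketch of the combination that section documents, assuming TRL's `SFTConfig` inherits `use_liger_kernel` from `TrainingArguments` and that `model_init_kwargs` forwards `attn_implementation` to the model loader (the ids below are illustrative):

```python
# Hedged sketch: Liger kernels for the Transformer blocks plus a Hub-provided
# attention kernel. Assumes `pip install trl kernels liger-kernel`.
from datasets import load_dataset
from trl import SFTConfig, SFTTrainer

dataset = load_dataset("trl-lib/Capybara", split="train")

training_args = SFTConfig(
    output_dir="sft-liger-kernels",
    use_liger_kernel=True,  # patch the model with Liger kernels
    model_init_kwargs={
        # A Hub repo id here selects a kernels-based attention implementation
        # instead of compiling flash-attn locally (assumption).
        "attn_implementation": "kernels-community/flash-attn",
    },
)

trainer = SFTTrainer(
    model="Qwen/Qwen2.5-0.5B",  # illustrative model id
    args=training_args,
    train_dataset=dataset,
)
trainer.train()
```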



## Comparing Attention Implementations
Contributor

Not sure if this is really needed tbh (in the spirit of shipping fast).

@sergiopaniego (Member Author)

Should I add the script used for benchmarking somewhere? It's a modification of sft.py with a callback (gist)
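
A minimal sketch of that kind of callback, assuming a plain `transformers.TrainerCallback` that times optimizer steps (names are illustrative; the actual gist may differ):

```python
import time

from transformers import TrainerCallback


class StepTimerCallback(TrainerCallback):
    """Records per-step wall-clock time so attention implementations can be compared."""

    def __init__(self):
        self.step_times = []
        self._t0 = None

    def on_step_begin(self, args, state, control, **kwargs):
        self._t0 = time.perf_counter()

    def on_step_end(self, args, state, control, **kwargs):
        self.step_times.append(time.perf_counter() - self._t0)

    def on_train_end(self, args, state, control, **kwargs):
        avg = sum(self.step_times) / max(len(self.step_times), 1)
        print(f"avg step time: {avg:.4f}s over {len(self.step_times)} steps")
```

Passing an instance via `SFTTrainer(..., callbacks=[StepTimerCallback()])` would collect the timings during a normal run.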

@lewtun (Member) left a comment

LGTM with a turbo nit :) Great stuff @sergiopaniego !


Building Flash Attention from source can be time-consuming, often taking anywhere from several minutes to hours, depending on your hardware, CUDA/PyTorch configuration, and whether precompiled wheels are available.

In contrast, **Hugging Face Kernels** provide a much faster and more reliable workflow. Developers don’t need to worry about complex setups—everything is handled automatically. In our benchmarks, kernels were ready to use in about **2.5 seconds**, with no compilation required. This allows you to start training almost instantly, significantly accelerating development. Simply specify the desired version, and `kernels` takes care of the rest.
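
As an illustration of that workflow, a minimal sketch using the `kernels` library's `get_kernel` entry point (the repo id is one published example, not pinned by this PR):

```python
# Hedged sketch: fetch a precompiled kernel from the Hub instead of building
# flash-attn from source. Assumes `pip install kernels` and a CUDA-capable setup.
from kernels import get_kernel

flash_attn = get_kernel("kernels-community/flash-attn")

# The returned module exposes the kernel's functions; the exact attribute
# names depend on the kernel repo (assumption: a flash-attn-style API).
print([name for name in dir(flash_attn) if not name.startswith("_")])
```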
Member

This is such a cool feature of kernels! I've lost a cumulative few days of my life waiting for FA2 to compile :D

Member Author

I know the feeling 😭

@sergiopaniego merged commit 0c69fd2 into main Sep 4, 2025
10 of 11 checks passed
@sergiopaniego deleted the kernels_hub_docs branch September 4, 2025 13:37
SamY724 pushed a commit to SamY724/trl that referenced this pull request Sep 6, 2025