Skip to content

[None][chore] Update flashinfer-python from 0.6.10 to 0.6.11#13992

Merged
yihwang-nv merged 1 commit into
NVIDIA:mainfrom
yihwang-nv:yihwang-nv/update_flashinfer_0.6.11
May 13, 2026
Merged

[None][chore] Update flashinfer-python from 0.6.10 to 0.6.11#13992
yihwang-nv merged 1 commit into
NVIDIA:mainfrom
yihwang-nv:yihwang-nv/update_flashinfer_0.6.11

Conversation

@yihwang-nv
Copy link
Copy Markdown
Collaborator

@yihwang-nv yihwang-nv commented May 11, 2026

Summary

  • Bump flashinfer-python from 0.6.10 to 0.6.11 (latest stable)
  • Bump nvidia-cutlass-dsl from 4.4.2 to 4.5.0 (required by flashinfer-python 0.6.11)
  • Updated version pins in requirements.txt, security_scanning/pyproject.toml, and ATTRIBUTIONS-Python.md

Test plan

  • pip install -r requirements.txt installs successfully
  • pytest tests/unittest/_torch/flashinfer/ -v
  • pytest tests/unittest/_torch/attention/test_flashinfer_attention.py -v
  • CI pre-merge passes

Summary by CodeRabbit

  • Chores
    • Updated flashinfer-python and nvidia-cutlass-dsl package versions across project dependencies for improved compatibility and stability.

Review Change Stack

@yihwang-nv yihwang-nv requested a review from a team as a code owner May 11, 2026 09:29
@coderabbitai
Copy link
Copy Markdown
Contributor

coderabbitai Bot commented May 11, 2026

📝 Walkthrough

Walkthrough

This PR bumps two pinned dependencies: flashinfer-python from 0.6.10 to 0.6.11 and nvidia-cutlass-dsl from 4.4.2 to 4.5.0. Updates are synchronized across the main requirements file, security scanning configuration, and attribution documentation.

Changes

Dependency Version Updates

Layer / File(s) Summary
Primary Dependencies
requirements.txt
flashinfer-python==0.6.100.6.11 and nvidia-cutlass-dsl==4.4.24.5.0 for Python >= 3.10.
Security Scanning Configuration
security_scanning/pyproject.toml
Matching version pins updated to flashinfer-python==0.6.11 and nvidia-cutlass-dsl==4.5.0 in project dependencies.
Attribution Documentation
ATTRIBUTIONS-Python.md
Version references in attribution headers updated for both dependencies.

Estimated code review effort

🎯 1 (Trivial) | ⏱️ ~3 minutes

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 inconclusive)

Check name Status Explanation Resolution
Description check ❓ Inconclusive PR description is clear and well-structured, covering the changes and test coverage, but lacks alignment with the repository's description template. Follow the template structure: add title with [ticket][type] format, include proper sections (Description, Test Coverage, PR Checklist), and confirm all checklist items are addressed.
✅ Passed checks (4 passed)
Check name Status Explanation
Title check ✅ Passed The title accurately describes the main change: updating flashinfer-python dependency version, which is the primary focus of the PR despite also bumping nvidia-cutlass-dsl.
Docstring Coverage ✅ Passed No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.
Linked Issues check ✅ Passed Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check ✅ Passed Check skipped because no linked issues were found for this pull request.

✏️ Tip: You can configure your own custom pre-merge checks in the settings.

✨ Finishing Touches
🧪 Generate unit tests (beta)
  • Create PR with unit tests

Tip

💬 Introducing Slack Agent: The best way for teams to turn conversations into code.

Slack Agent is built on CodeRabbit's deep understanding of your code, so your team can collaborate across the entire SDLC without losing context.

  • Generate code and open pull requests
  • Plan features and break down work
  • Investigate incidents and troubleshoot customer tickets together
  • Automate recurring tasks and respond to alerts with triggers
  • Summarize progress and report instantly

Built for teams:

  • Shared memory across your entire org—no repeating context
  • Per-thread sandboxes to safely plan and execute work
  • Governance built-in—scoped access, auditability, and budget controls

One agent for your entire SDLC. Right inside Slack.

👉 Get started


Comment @coderabbitai help to get the list of available commands and usage tips.

Copy link
Copy Markdown
Contributor

@coderabbitai coderabbitai Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 4

🤖 Prompt for all review comments with AI agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@requirements.txt`:
- Line 57: Update the pip requirement for the flashinfer-python dependency to a
published version: replace the nonexistent version token
"flashinfer-python==0.6.11" with the available release
"flashinfer-python==0.6.10" in requirements.txt so installation succeeds.
- Line 74: Replace the invalid hard-pin of nvidia-cutlass-dsl==4.5.0 in
requirements.txt: update the line referencing "nvidia-cutlass-dsl==4.5.0;
python_version >= \"3.10\"" to either a valid release (e.g.,
"nvidia-cutlass-dsl==4.4.2; python_version >= \"3.10\"") or a minimal constraint
that satisfies consumers (e.g., "nvidia-cutlass-dsl>=4.4.2; python_version >=
\"3.10\""), ensuring compatibility with flashinfer-python's requirement and that
pip can install the package.

In `@security_scanning/pyproject.toml`:
- Line 58: The pinned dependency "flashinfer-python (==0.6.11)" is referencing a
non-existent stable PyPI release; update the constraint to a valid package
version such as "flashinfer-python (==0.6.10.post1)" or use a permissive
constraint like "flashinfer-python (>=0.6.10.post1,<0.7)" if you want to allow
newer/stable builds; edit the pyproject.toml dependency entry to replace
"==0.6.11" with the chosen valid version or range so installations from PyPI
succeed.
- Line 75: The pyproject dependency "nvidia-cutlass-dsl (==4.5.0)" is pinned to
a non-existent stable 4.5.0 on PyPI; update that requirement to a valid release
such as "nvidia-cutlass-dsl (==4.5.0.dev0)" or relax to a compatible constraint
like "nvidia-cutlass-dsl (>=4.4.2)" to match flashinfer-python's requirement and
ensure dependency resolution succeeds.
🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

  • Push a commit to this branch (recommended)
  • Create a new PR with the fixes

ℹ️ Review info
⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Enterprise

Run ID: 74a86829-0c43-42a0-b5bd-65509aadaa2c

📥 Commits

Reviewing files that changed from the base of the PR and between 9547230 and bf4c658.

⛔ Files ignored due to path filters (1)
  • security_scanning/poetry.lock is excluded by !**/*.lock
📒 Files selected for processing (3)
  • ATTRIBUTIONS-Python.md
  • requirements.txt
  • security_scanning/pyproject.toml

Comment thread requirements.txt
Comment thread requirements.txt
Comment thread security_scanning/pyproject.toml
Comment thread security_scanning/pyproject.toml
@yihwang-nv
Copy link
Copy Markdown
Collaborator Author

/bot run --disable-fail-fast

@tensorrt-cicd
Copy link
Copy Markdown
Collaborator

PR_Github #47722 [ run ] triggered by Bot. Commit: bf4c658 Link to invocation

@tensorrt-cicd
Copy link
Copy Markdown
Collaborator

PR_Github #47722 [ run ] completed with state SUCCESS. Commit: bf4c658
/LLM/main/L0_MergeRequest_PR pipeline #37618 completed with status: 'FAILURE'

CI Report

⚠️ Action Required:

  • Please check the failed tests and fix your PR
  • If you cannot view the failures, ask the CI triggerer to share details
  • Once fixed, request an NVIDIA team member to trigger CI again

CI Agent Failure Analysis

Link to invocation

@yihwang-nv
Copy link
Copy Markdown
Collaborator Author

/bot run --disable-fail-fast

@tensorrt-cicd
Copy link
Copy Markdown
Collaborator

PR_Github #47765 [ run ] triggered by Bot. Commit: bf4c658 Link to invocation

@tensorrt-cicd
Copy link
Copy Markdown
Collaborator

PR_Github #47765 [ run ] completed with state SUCCESS. Commit: bf4c658
/LLM/main/L0_MergeRequest_PR pipeline #37657 completed with status: 'SUCCESS'

CI Report

Link to invocation

Bump flashinfer-python dependency to the latest stable release (0.6.11).

Also bump nvidia-cutlass-dsl from 4.4.2 to 4.5.0 as required by flashinfer-python 0.6.11.

Updated version pins in requirements.txt, security_scanning/pyproject.toml, and ATTRIBUTIONS-Python.md.

Signed-off-by: Yihan Wang <[email protected]>
@yihwang-nv yihwang-nv force-pushed the yihwang-nv/update_flashinfer_0.6.11 branch from bf4c658 to cf8d9af Compare May 12, 2026 11:27
@yihwang-nv
Copy link
Copy Markdown
Collaborator Author

/bot run --disable-fail-fast --add-multi-gpu-test

@tensorrt-cicd
Copy link
Copy Markdown
Collaborator

PR_Github #47958 [ run ] triggered by Bot. Commit: cf8d9af Link to invocation

@tensorrt-cicd
Copy link
Copy Markdown
Collaborator

PR_Github #47958 [ run ] completed with state SUCCESS. Commit: cf8d9af
/LLM/main/L0_MergeRequest_PR pipeline #37799 completed with status: 'FAILURE'

CI Report

⚠️ Action Required:

  • Please check the failed tests and fix your PR
  • If you cannot view the failures, ask the CI triggerer to share details
  • Once fixed, request an NVIDIA team member to trigger CI again

CI Agent Failure Analysis

Link to invocation

@yihwang-nv
Copy link
Copy Markdown
Collaborator Author

/bot run --disable-fail-fast --add-multi-gpu-test

@tensorrt-cicd
Copy link
Copy Markdown
Collaborator

PR_Github #48071 [ run ] triggered by Bot. Commit: cf8d9af Link to invocation

@tensorrt-cicd
Copy link
Copy Markdown
Collaborator

PR_Github #48071 [ run ] completed with state SUCCESS. Commit: cf8d9af
/LLM/main/L0_MergeRequest_PR pipeline #37903 completed with status: 'SUCCESS'

CI Report

Link to invocation

@yihwang-nv yihwang-nv enabled auto-merge (squash) May 13, 2026 04:46
Copy link
Copy Markdown
Collaborator

@juney-nvidia juney-nvidia left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Approved from OSS compliance perspective.

@yihwang-nv yihwang-nv merged commit 1a8f5ab into NVIDIA:main May 13, 2026
12 of 15 checks passed
yufeiwu-nv pushed a commit to yufeiwu-nv/TensorRT-LLM that referenced this pull request May 19, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants